Writing

What breaks when you put agents in production.

Essays on memory, MCP, trust scoring, decision provenance, and why single-shot benchmarks hide what matters. On-site first, then the OrgX essays in order of strength.

MCP Protocol

What I Learned Running MCP in Production

What broke, what patterns held up, and what I would do differently after shipping the OrgX MCP server — 61 tools across 16 categories — plus integrations with ~20 external MCP services.

March 5, 2026 · 10 min read
OrgX essays · useorgx.com/blog

Writing from the platform itself

The OrgX blog is where the substrate work gets argued out: memory, benchmarks, MCP, trust, and where autonomy stops being a demo and starts being real infrastructure.

Evals · methodology

How we prove OrgX works

Weekly benchmarking: 12 tasks, 7 domains, 3 execution modes against a single-agent baseline and a human baseline — with full provenance.

Published on useorgx.com ↗
Systems thinking

The OrgX way

Distributed AI tool usage is a coordination failure. The founder is the integration layer that should be code, not willpower.

Published on useorgx.com ↗

Read the full OrgX blog ↗