Writing | Hope Atina

MCP Protocol

Why MCP Servers Need OAuth 2.1 + DCR, Not API Keys

Every production MCP server is going to need dynamic client registration within six months. Here's why we built it into orgx-mcp from day one and how we did it.

April 20, 20269 min readMCPOAuthSecurityInfrastructureAuthentication

Read featured post →

Agent Infrastructure

The Claim Graph: Why Typed Provenance Beats RAG for Agent Systems

RAG retrieves text. A claim graph retrieves decisions, their justifications, and who approved them. For agent systems, the second one is what actually matters.

April 17, 2026 · 8 min read

Infrastructure

Durable Objects for Agent Memory: A Cloudflare Pattern

If your MCP server's session state disappears on deploy, you haven't finished building it. Here's the pattern we use in OrgX to keep it alive.

April 14, 2026 · 7 min read

Evals

We Ran 136 Agent Tasks. Here's What Single-Shot Benchmarks Hide.

Most agent benchmarks are one prompt, one model, one score. That structurally conceals the thing that breaks in production: cascading context across sessions.

April 10, 2026 · 10 min read

Governance

Trust Models for Agent Autonomy: When to Let Agents Act Alone

A practical framework for strict, balanced, and open autonomy, with trust earned through evidence instead of granted by prompt.

March 18, 2026 · 11 min read

Essay

Why Most Agent Frameworks Solve the Wrong Problem

Most agent frameworks optimize the prompt loop. Production agent infrastructure has to optimize governance, durability, memory, and review.

March 12, 2026 · 12 min read

MCP Protocol

What I Learned Running MCP in Production

What broke, what patterns held up, and what I would do differently after shipping the OrgX MCP server — 61 tools across 16 categories — plus integrations with ~20 external MCP services.

March 5, 2026 · 10 min read

Developer Tooling

Building a Homebrew-Installable Dev Tool in Rust: The PerfPulse Story

Why Rust for CLI tools, cross-platform M1/Intel builds, Homebrew tap distribution, and integrating Claude API for AI-powered recommendations.

January 10, 2025 · 10 min read

Benchmarks · substrate

Memory is the structural lift — Phase 2 substrate benchmark

136 tasks across multiple models and orchestration cells. Single-shot benchmarks structurally hide what agents cannot fake: cascading context.

Published on useorgx.com ↗

MCP · memory

You are the API between your AI tools. OrgX MCP fixes that.

Manual context-carrying between ChatGPT, Claude, and Cursor is the leak. MCP is the continuity layer for unified organizational context.

Published on useorgx.com ↗

Evals · methodology

How we prove OrgX works

Weekly benchmarking: 12 tasks, 7 domains, 3 execution modes against a single-agent baseline and a human baseline — with full provenance.

Published on useorgx.com ↗

Trust · credibility

Our autonomous benchmark has independent judges now

Published artifacts, independent judgments, token-level costs, and failure cases requiring human review.

Published on useorgx.com ↗

Infrastructure · onboarding

The most underrated product surface in AI is the setup script

Initial configuration determines whether AI tools share organizational context or operate in isolation. OrgX Wizard as infrastructure.

Published on useorgx.com ↗

Systems thinking

The OrgX way

Distributed AI tool usage is a coordination failure. The founder is the integration layer that should be code, not willpower.

Published on useorgx.com ↗

Systems thinking · trust

Why AI-generated brand content is mostly slop

It is not prompting — it is systemic design. A model asked to carry taste, memory, and QA by itself will fail predictably.

Published on useorgx.com ↗

Evals · craft

We generated 75 ad concepts. The useful part was killing 60.

Filtering and curation are higher-leverage than volume in AI generation. Rigor in selection matters more than throughput.

Published on useorgx.com ↗

Read the full OrgX blog ↗

What breaks when you put agents in production.

Why MCP Servers Need OAuth 2.1 + DCR, Not API Keys

The Claim Graph: Why Typed Provenance Beats RAG for Agent Systems

Durable Objects for Agent Memory: A Cloudflare Pattern

We Ran 136 Agent Tasks. Here's What Single-Shot Benchmarks Hide.

Trust Models for Agent Autonomy: When to Let Agents Act Alone

Why Most Agent Frameworks Solve the Wrong Problem

What I Learned Running MCP in Production

Building a Homebrew-Installable Dev Tool in Rust: The PerfPulse Story

Writing from the platform itself

Memory is the structural lift — Phase 2 substrate benchmark

You are the API between your AI tools. OrgX MCP fixes that.

How we prove OrgX works

Our autonomous benchmark has independent judges now

The most underrated product surface in AI is the setup script

The OrgX way

Why AI-generated brand content is mostly slop

We generated 75 ad concepts. The useful part was killing 60.