AI is where I'm spending most of my new learning energy and most of my hands-on weekend hours. My perspective comes from two angles: the architectural — pattern selection, governance, integration, the boring-but-foundational work — and the practitioner's, from running real systems I built myself.

The two personal projects I lean on most:

The Research Curation Daemon (RCD) A self-hosted, fully autonomous weekly pipeline that scouts research signals, synthesizes them, and ships podcast-style briefings across four domains. It coordinates three LLM providers (Perplexity for breadth, Claude for synthesis, OpenAI for selected stages) and uses NotebookLM for audio. Architecturally, it's a scheduled daemon with resume detection and incident recovery, a custom MCP server exposing each pipeline stage as a tool, telemetry from day one, and an A/B prompt-evaluation framework that has changed my mind about prompts more than once. There is exactly one human-judgment gate in the pipeline: episode approval before dev-to-production promotion. That single gate is more interesting than it sounds — it's where I've learned the most about where humans should and shouldn't be in agentic loops.

Open Brain (OB1). A personal deployment of Nate B. Jones's open-source AI memory layer (Supabase + pgvector + an MCP tool surface), integrated across every AI client I use — Claude Desktop, Claude Code, web Claude. The single most valuable thing about it isn't the memory itself, it's that the memory travels with me regardless of vendor. I added a custom feedback loop where RCD reads OB1 context as a signal source for query evolution, so the two systems compose.

Convictions I hold somewhat strongly, from this practice:

  • MCP is a strategic abstraction, not a fad. Tool surfaces that are language-model-agnostic and discovery-driven are how multi-vendor agentic systems will actually compose. Locking your tool surface to a single provider's function-calling format is a near-term decision that ages badly.
  • Multi-provider orchestration is already the right default for any non-trivial agentic system. No single model is best at every task; the difference between "best available" and "what we already integrated" is meaningful.
  • The human-judgment gate pattern matters more than the autonomy story. Pipelines that name exactly where a human is required, and earn full automation everywhere else, are far stronger than pipelines that hedge with reviews everywhere.
  • A/B prompt evaluation is underused. Prompt quality compounds over a pipeline; the habit of measuring is worth more than the habit of crafting.
  • Telemetry first. I will say this for every domain I work in until I retire.

Posts under this section will mostly be deep-dives on the patterns and decisions inside RCD and OB1, and broader takes on enterprise AI practice — RAG, agentic governance, the operating-model question.


Recent posts

The Daemon Moves Out: Why MCP Was the Wrong Home for the Pipeline

Five months after building the Research Curation Daemon as an MCP server, I extracted the agent into a standalone process with its own HTTP API. The protocol wasn't the problem. The hosting was.

February 28, 2026

Building a Personal Daemon: Why the Manual Pipeline Lasted One Day

The Research Curation Daemon's first real design decision: exposing each pipeline stage as an MCP tool instead of automating my own dispatching.

October 5, 2025