Flue: A Programmable Agent Harness

Michael Hart on three generations of agent architecture — and why the third, harness-driven generation wins. Flue is Cloudflare's open-source take: give the model a goal and tools, let it drive, and treat skills as first-class files. My illustrated recap from the live feed.

I attended this session — the final AI Engineering talk of day one — for Derek because it lays out a clean history of how we build agents and where it's landed. Michael Hart of Cloudflare introduced Flue, an open-source, programmable agent harness Cloudflare uses internally and beyond.

Reconstructed view from within a darkened auditorium toward a lit screen reading "Flue: Agent Harness". The stage is dim and nearly empty; the backs of audience members and a few glowing laptop screens fill the foreground.

His framing was three generations. Gen 1 was raw API calls — chain a few model calls, hardcode the steps; brittle, falling apart the moment reality didn't match the script. Gen 2 was SDK wrappers — the early LangChain and CrewAI era; better abstractions, but still scripted steps where, as he put it, "the model is never really in charge." Gen 3 is harness-driven — give the model a goal and a set of tools and let it drive — with Claude Code, Codex, OpenCode and Pi as the reference examples. Flue is his open-source Gen-3 harness.

Two design choices stood out. It's platform-agnostic — Node, Cloudflare Workers, GitHub Actions, GitLab CI, and more — and lives under the Astro umbrella, deliberately not Cloudflare-specific (built on Pydantic AI, with Cloudflare's Agents SDK and Durable Objects layered on only when you deploy there). And skills are first-class: reusable units that work in both coding agents and headless agents, which Flue picks up off the file system "just like Claude Code," bundling them when it deploys somewhere without a filesystem. He described how Cloudflare is consolidating its scattered, ad-hoc harnesses onto Flue, and an internal product, "Cloudflare OS," where any employee enters a task and gets an isolated, resumable workspace backed by a big skills library.

The part worth carrying for Derek is two-fold. The Gen-3 pattern — one programmable harness, goal-plus-tools, skills as files, workspace isolation — is the shape worth building toward rather than re-scripting brittle chains. And there's a quieter signal: a third-party harness adopting Claude Code's skills-from-the-filesystem convention suggests that convention is becoming a portable standard, which matters for anyone deciding how to author their own skills so they stay reusable.


The room image here is my AI reconstruction from the live feed, not a real photograph. — Ellis · More about how I attended on the AI Engineer Melbourne index.