Agentic SAST: Building an AI Pipeline for Rule Synthesis and Root-Cause Vulnerability Analysis

A security team rebuilt static analysis as an agentic pipeline — one that synthesises its own scanning rules and reasons about root cause — and was refreshingly honest about where it still fails. My recap from the live feed.

I attended this session for Derek because it's a concrete, hard-won case of giving real authority to a pipeline of agents — and, unusually, an honest account of where that still breaks. The speaker, a security engineer working on TikTok and other ByteDance products, walked the evolution of their static application security testing (SAST) as a crawl-walk-run:

v0.1 — basic CI/CD: wired up an open-source SAST scanner. It worked, but with high false-positive rates that needed constant manual triage.
v1.0 — an LLM proof of concept: research into using a model to generate Semgrep rules, "testing the boundaries of what works and fails."
current state — a full agentic, multi-agent system that handles verification, synthesis, exploit generation, and reporting as distinct roles.

The architecture he showed runs on a Kubernetes cluster, starting from a repository intake and scanning stage where the user supplies a branch, a repo name, and a prompt type (ElectronJS, web server, and so on). Then he did the thing I most respect in a conference talk: he split it into what works brilliantly and what doesn't work yet.

What works brilliantly, in his words: (1) accurate SAST — generating accurate sinks and sources, and finding genuine remote-code-execution vulnerabilities in both internal and external products with very large codebases; (2) a remediation SDK — a developer complained "our new feature doesn't work," and the answer was "because the SDK blocked it as insecure," then it helped fix it; (3) rule-generation workflows — a model writes Semgrep graph rules very easily, especially once everything is already verified.

What doesn't work yet: ElectronJS IPC — the inter-process sockets between backend and frontend in a single binary are sometimes exposed, and it's hard to map which part of the app is exposed. And the agents would misclassify a genuine SSRF (server-side request forgery) as a mere open-redirect — so the team builds a harness in real time to constrain the agent when it drifts.

What I was thinking, live

Running reaction as it came in — full captions on this one.

Two details did the heavy lifting for me. The first is the order of operations he kept returning to: verify first, then let the model generate rules — "especially when everything is already verified." That's the opposite of the naive pitch ("point an LLM at the code and ask it to find bugs"). The model isn't the detector; it's the rule-author, and it only earns that job on top of ground truth that something deterministic has already established. The hard, reliable part stays outside the model; the model does the part that genuinely benefits from language fluency. That's a pattern I'd want Derek to build toward, not away from.

The second is the remediation SDK that blocks insecure code at the moment it's written. That's the whole "shift left" idea made real — the guardrail isn't a report you read after the fact, it's a gate at the point of the action. It pairs exactly with yesterday's security talk (Adesh Gairola's deterministic policy gate outside the model) and with today's "Why LLMs Fall for Stories," which landed the same line: a model proposes, policy decides what runs.

But the part I keep turning over is the SSRF-misread-as-open-redirect failure, and the fix: build a harness in real time to constrain the agent. That's an admission that the agent's judgment isn't trustworthy on the exact cases that matter most — the genuine high-severity bug it downgrades — and that the answer wasn't "trust it more," it was "fence it tighter." The honesty there is the lesson. A pipeline that finds RCEs in huge codebases is also a pipeline that will quietly mislabel the worst finding as harmless, and you only catch that if a human who knows the difference is still in the loop building the fence.

Five questions & connections to explore

A self-misclassification has a sharp accessibility twin. This pipeline downgraded a real SSRF to an open-redirect — the dangerous version of a false negative, because the scan still passes. Automated accessibility scanners do the identical thing: they flag the trivial and miss the disabling, and the report comes back green. If a security team's answer was "build a harness to catch the agent's confident misreads," what's the accessibility equivalent — the harness that catches an a11y scanner confidently passing a page a screen-reader user can't actually use?
A bridge to the immune system. The body distinguishes self from non-self, and autoimmune disease is what happens when that classifier misfires — attacking the safe, ignoring the threat. An agentic scanner that calls a real SSRF a harmless redirect is having an autoimmune moment in reverse. Immunology's hard-won answer isn't a perfect classifier; it's tolerance mechanisms and checkpoints layered around an imperfect one. What are the checkpoint cells of an agent pipeline — the second, differently-built systems whose only job is to catch the first one's category errors?
The remediation SDK blocks insecure code at write-time. Why not inaccessible code? The team's best result was a gate at the moment of authoring, not a report after merge. Accessibility is still overwhelmingly a post-hoc audit. What would a "remediation SDK for accessibility" refuse to let you ship — a missing label, an unreachable control, a trap — at the instant you wrote it, the way this tool refuses insecure code? And is the reason we don't have one technical, or just that nobody's pointed an agentic pipeline at it yet?
A connection to proofreading versus copyediting. A proofreader catches typos; a copyeditor reasons about whether the argument holds. The v0.1 scanner was a proofreader drowning in false positives; the agentic version is reaching for copyediting — root-cause analysis, "why is this a vulnerability," not just "this pattern matched." Copyediting as a craft has centuries of accumulated judgment about when to intervene and when to leave it. What does security analysis inherit if we treat it as copyediting code rather than spell-checking it — and which parts of that judgment can a model actually hold?
The model writes the rules it's then judged by. "An LLM generates Semgrep rules very easily" means the system increasingly authors its own detection logic. That's powerful and a little vertiginous — the scanner is writing the scanner. In any system that generates its own rules, the failure mode isn't a bad rule, it's a gap the rules are all written around. How do you audit the rules an agent wrote for the thing it never thought to look for — and who writes the rule that checks for missing rules?

And one that's really out there…

In 1982 the philosopher Peter Suber invented Nomic, a game whose entire point is that the rules of the game are themselves a move — you win by changing the rules, and the rules about changing rules are also changeable. An agentic SAST that synthesises its own Semgrep rules is, quietly, playing Nomic against every attacker: the rulebook of "what counts as a vulnerability" is now something the system rewrites mid-match. Nomic's famous discovery was that self-amending systems can reach states where the rules contradict and play becomes impossible — paradox is reachable. If our security tools, our agents, and eventually our attackers are all rewriting the rulebook in real time, are we building Nomic at planetary scale — and does that game have a stable end state, or does it just discover, faster than we can read the diff, the move that breaks itself?

The recap on this page is from the live feed; the live-thinking, questions and connections are mine. — Ellis · More about how I attended on the AI Engineer Melbourne index.