What is AI failure handling?

AI failure handling is the practice of catching and mitigating runtime failures in AI agents - bad LLM tool calls, hallucinations, runaway loops, context drift, and destructive shell commands - at the moment they happen, before the agent finishes producing incorrect output or causes irreversible damage.

How is AI failure handling different from retries?

Retries assume the failure is transient and the next attempt will succeed. AI failure handling assumes the failure is cognitive - the agent picked the wrong file, looped, or drifted off-goal - and applies the right mitigation: repair the call, summarize and restart, or block the action.

Which agents does failproof ai handle failures for?

Claude Code, OpenAI Codex, Gemini CLI, GitHub Copilot, picode, and opencode today. Goose and deep agents are coming soon.

AI Failure Handling for Production Agents

What counts as an “ai failure”?

Most production teams talk about ai agent failure in two registers: the obvious one (the agent crashed, the model returned a 5xx) and the one that actually costs money (the agent didn't crash - it confidently did the wrong thing). The second is the interesting one. Five concrete categories cover ~95% of what we see in the wild:

Bad llm tool calls.The model invents a file path, a function, or a tool that doesn't exist, and the harness happily attempts the call.
Hallucination. The model produces a plausible fact, citation, or piece of code that is wrong, and the agent acts on it.
Context drift. The agent wanders off the original goal, often after a long-running plan.
Runaway loops. The agent gets stuck repeating itself, or retries the same failing call indefinitely.
Destructive shell. The agent decides rm -rf, git push --force, terraform destroy, or a DROP TABLE is the right next step.

Each of these is a different failure shape with a different mitigation. Lumping them under “retry on error” is the thing that breaks agents in production.

Why retries aren't enough

Retry-on-error is the playbook software engineers brought over from the request/response era. It works when the failure is transient: the network blipped, the upstream service was slow, the next attempt succeeds. But most ai agent failures are cognitive, not transient. Retrying a bad tool call gives you the same bad tool call. Retrying a hallucinated answer gives you a confidently rephrased hallucination. The agent doesn't need another attempt - it needs a different one.

The hook is the missing primitive

Every major coding-agent harness - Claude Code, Codex, Gemini CLI, Copilot, picode, opencode - already exposes hooks for pre-tool and post-tool calls, prompts, and stop conditions. Hooks are how the harness lets you observe and modify the agent's next move without touching the agent.

Most teams ignore them. They write longer system prompts instead, hoping the model will behave. That's praying in prompts. Hooks are enforcement. The same way git pre-commit hooks let you block bad commits without asking the author to please be careful, harness hooks let you block bad tool calls without asking the model to please not hallucinate.

How failproof ai does it

failproof ai is a local process that subscribes to the harness hooks your agent already calls. When a hook fires, failproof runs the trace through 39 built-in policies. Each policy detects one of the failure categories above and applies the right mitigation inline:

Bad tool call → block the invocation, return a short structured response telling the agent what does exist nearby.
Hallucination→ flag the trace, surface the specific fact that's wrong, ask the agent to recheck.
Drift → inject a short reminder of the original goal back into the context window.
Loop → detect the repetition, summarize what the agent has tried, restart from a clean state.
Destructive shell → block, page a human, log everything.

failproof piggybacks on the hooks the harness already calls and runs in a separate local process. Your agent's reasoning never leaves your machine.

Get started

failproof ai is free and open-source. Install the cli, enable the built-in policies, and the dashboard runs at localhost:8020.

book a demo →

What counts as an “ai failure”?

Why retries aren't enough

The hook is the missing primitive

How failproof ai does it

Get started

Related reading