You write a spec, run agent-fox code, and walk away. The fox reads your
specs, plans the work, spins up isolated git worktrees, runs each coding
session with the right context, handles merge conflicts, retries failures,
extracts learnings into structured memory, and merges clean commits to
develop. You come back to a finished feature branch and a standup report.
The typical workflow has four stages:
-
Write specs. Describe your feature as a structured specification package under
.agent-fox/specs/— a PRD, acceptance criteria (EARS syntax), design document, test contracts, and a task list. Each spec maps to one coherent feature or change. Use the/af-specskill in Claude Code to generate the full five-file package from a PRD, a GitHub issue URL, or a plain-English description. Runagent-fox lint-specsto validate specs before planning; use--fixto auto-repair common issues and--aifor semantic analysis of acceptance criteria. -
Plan. Run
agent-fox planto compile your specs into a dependency graph of tasks. The planner is deterministic — same specs, same graph, every time. It parses task groups from each spec, builds intra-spec chains (groups execute sequentially), wires cross-spec dependencies declared in PRDs, and injects review agents at the right positions. Use--analyzeto see a parallelism analysis, or--fastto exclude optional tasks. -
Execute. Run
agent-fox code --parallel 4to start autonomous execution. The orchestrator dispatches agents to each ready task in dependency order. Each agent works in an isolated git worktree on its own feature branch, so multiple agents work simultaneously without conflicts. Reviewer agents (pre-review, drift-review modes) check specs before coding starts; audit-review and Verifier agents check the result after. Failed tasks are retried with escalation to stronger models. Completed work is merged intodevelopunder a serializing lock via squash merge (with AI-assisted conflict resolution when needed). -
Monitor. Run
agent-fox statusfor a progress dashboard — task counts by state, token usage, estimated cost, cost breakdown by archetype and spec, and details on any blocked or failed tasks. Runagent-fox standupfor a daily activity report covering agent sessions, human commits, and file overlaps. Both commands support--jsonfor machine consumption.
agent-fox uses a four-entry archetype registry with a mode system to divide labor:
- Coder — the primary implementation agent. Receives the full spec context and implements one task group per session. Follows a test-first workflow: group 1 writes failing tests, subsequent groups implement code.
- Reviewer — a single archetype with four modes that cover all review
roles:
- pre-review — reviews spec quality before implementation. Checks completeness, consistency, feasibility, and security. Can block coding if critical findings exceed a threshold.
- drift-review — validates spec assumptions against the actual codebase. Detects drift between what specs expect and what actually exists. Automatically skipped when the spec references no existing code.
- audit-review — validates test quality against test spec contracts after tests are written. Triggers coder retries when tests are missing, weak, or misaligned with their specifications.
- fix-review — reviews fix-mode patches (quality fixes, night-shift repairs) with full tool access and extended turn budget.
- Verifier — performs post-implementation verification. Runs the test suite, checks each requirement against acceptance criteria, and triggers coder retries when verification fails.
- Maintainer — drives night-shift operations with three modes (hunt, fix-triage, extraction). Not assignable to spec tasks.
Review and verification archetypes can run multiple instances in parallel on the same task, with outputs merged using mode-specific convergence strategies. For full archetype details, see the Archetypes section in the Architecture Guide.
When quality checks are failing, run agent-fox fix to auto-detect available
tools (pytest, ruff, mypy, etc.), cluster failures by root cause using AI,
generate fix specs, and dispatch coding agents to resolve each cluster. Use
--auto for iterative improvement passes after the initial repair.
For ongoing codebase health, agent-fox night-shift runs as a continuously
running maintenance daemon. It hunts for technical debt across eight
categories — linter debt, dead code, test coverage gaps, dependency freshness,
deprecated API usage, documentation drift, TODO/FIXME resolution, and quality
gate failures — then groups findings by root cause and files GitHub issues.
Issues labelled af:fix are automatically picked up and repaired through a
two-agent pipeline (Coder, Reviewer in fix-review mode). Use --auto to label every
discovered issue for hands-off repair.
agent-fox maintains a persistent knowledge store (DuckDB) that captures what
agents learn during sessions — patterns, gotchas, architectural decisions,
conventions, anti-patterns, and fragile areas. Each new session starts with
a fresh context window but receives curated, relevant facts from prior
sessions so the same mistakes are never repeated. The knowledge system handles
deduplication, contradiction detection, and age-based confidence decay
automatically. Run agent-fox onboard to bootstrap the knowledge store from
an existing codebase by ingesting ADRs, git history, and source analysis.
When tasks fail or become blocked, run agent-fox reset to clear failed
tasks and retry them. For targeted recovery, pass a specific task ID. For a
full restart, use --hard to reset all tasks, clean up worktrees and
branches, compact the knowledge store, and roll back develop.
For a detailed understanding of how agent-fox works internally — how specs become task graphs, how the orchestrator dispatches and serializes sessions, the archetype mode system, the knowledge lifecycle, and the night-shift hunt-triage-fix pipeline — see the Architecture Guide. The architecture docs are written for senior engineers joining the project and stay at the conceptual level without code snippets or class hierarchies.
| Document | Description |
|---|---|
| CLI Reference | All commands, flags, and exit codes |
| Configuration Reference | Every config.toml section and option |
| Archetypes | Archetype registry, modes, and convergence |
| Profiles | Agent profiles, resolution, and customization |
| Skills | Claude Code skill reference |
| Architecture Guide | System internals and design rationale |