AGENTS.md

Agent Reference

Complete reference for all agents in AgentSys.

TL;DR: 49 agents across 20 plugins (18 have agents). opus for reasoning, sonnet for patterns, haiku for execution. Each agent does one thing well.

Quick Navigation

Plugin	Agents	Jump to

Design principle: Each agent has a single responsibility. Complex work is decomposed into specialized agents that do one thing extremely well, then orchestrated together.

Related docs:

/next-task Workflow - How agents work together

Overview

AgentSys uses 49 specialized agents across 20 plugins (18 have agents; gate-and-ship is commands-only; zig-lsp is a config-only LSP plugin with no commands or agents). Each agent is optimized for a specific task and assigned a model based on complexity:

Model	Use Case	Cost
opus	Complex reasoning, quality-critical work	High
sonnet	Moderate reasoning, pattern matching	Medium
haiku	Mechanical execution, no judgment	Low

Agent types:

File-based agents (39) - Defined in plugins/*/agents/*.md with frontmatter
Role-based agents (10) - Defined inline via Task tool with specialized prompts

next-task Plugin Agents

task-discoverer

Model: sonnet Purpose: Find and prioritize tasks from configured sources.

What it does:

Loads claimed tasks from tasks.json (excludes them)
Fetches from GitHub Issues, GitHub Projects (v2 boards), GitLab, local files, or custom CLI
Excludes issues that already have an open PR (GitHub source only)
Applies priority scoring (labels, blockers, age, reactions)
Presents top 5 via AskUserQuestion checkboxes
Posts "Workflow Started" comment to GitHub issue

Tools available:

Bash (gh, glab, git)
Grep, Read
AskUserQuestion

worktree-manager

Model: haiku Purpose: Create git worktrees for isolated development.

What it does:

Creates ../worktrees/{task-slug}/ directory
Creates feature/{task-slug} branch
Claims task in tasks.json
Creates flow.json in worktree

Tools available:

Bash (git only)
Read, Write

exploration-agent

Model: opus Purpose: Deep codebase analysis before planning.

What it does:

Extracts keywords from task description
Searches for related files
Traces dependency graphs
Analyzes existing patterns
Outputs exploration report

Tools available:

Read, Glob, Grep
Bash (git only)
LSP
Task (for sub-exploration)

Why opus: Exploration quality directly impacts planning quality. Poor exploration = poor plan = poor implementation. The compound effect justifies the cost.

planning-agent

Model: opus Purpose: Design step-by-step implementation plans.

What it does:

Synthesizes exploration findings
Creates implementation steps
Identifies risks and critical paths
Outputs structured JSON
Posts summary to GitHub issue

Tools available:

Read, Glob, Grep
Bash (git only)
Task (for research)

Output format:

{
  "steps": [
    { "action": "modify", "file": "src/auth.ts", "description": "..." }
  ],
  "risks": ["..."],
  "complexity": "medium"
}

Why opus: Planning is the leverage point. A good plan makes implementation straightforward. A bad plan causes rework cycles.

implementation-agent

Model: opus Purpose: Execute approved plans with production-quality code.

What it does:

Executes plan step-by-step
Creates atomic commits per step
Runs type checks, linting, tests after each step
Updates flow.json for resume capability

Tools available:

Read, Write, Edit
Glob, Grep
Bash (git, npm, node)
Task (for sub-tasks)
LSP

Restrictions:

MUST NOT create PR
MUST NOT push to remote
MUST NOT invoke review agents

Why opus: Implementation quality matters. Bad code gets caught in review but wastes cycles. Good code flows through.

Note: delivery-validator and test-coverage-checker moved to prepare-delivery plugin.

sync-docs-agent

Model: sonnet Purpose: Update documentation for recent changes.

What it does:

Finds docs referencing changed files
Updates CHANGELOG entry
Fixes outdated imports/versions
Delegates simple edits to simple-fixer
Invokes /ship when complete

Tools available:

Bash (git)
Read, Grep, Glob
Task (for simple-fixer)

simple-fixer

Model: haiku Purpose: Execute mechanical edits without judgment.

What it does:

Receives structured fix list from parent
Executes each fix: remove-line, replace, insert
No decision-making, just execution

Tools available:

Read, Edit
Bash (git)

Why haiku: Pure execution. No reasoning needed. Haiku is fast and cheap.

ci-monitor

Model: haiku Purpose: Poll CI status with sleep/check loops.

What it does:

Polls gh pr checks every 15 seconds
Reports status changes
On failure: delegates to ci-fixer
On success: continues workflow

Tools available:

Bash (gh, git)
Read
Task (for ci-fixer)

Why haiku: Polling is mechanical. No judgment needed.

ci-fixer

Model: sonnet Purpose: Fix CI failures and review comments.

What it does:

Analyzes CI logs to diagnose failure
Applies fixes:
- Lint auto-fix
- Type error resolution
- Test failure fixes
Addresses PR review comments
Commits and pushes fixes

Tools available:

Bash (git, npm)
Read, Edit
Grep, Glob

deslop Plugin Agents

deslop-agent

Model: sonnet Purpose: Clean AI slop from code with certainty-based findings.

What it does:

Parses arguments (mode, scope, thoroughness)
Invokes deslop skill to run detection
Returns structured findings with certainty levels
HIGH certainty items marked for auto-fix by orchestrator

Tools available:

Bash (git, node)
Skill (for deslop)
Read, Glob, Grep

Why sonnet: Slop detection is pattern-based. Sonnet handles patterns well and is faster/cheaper than opus.

Cross-plugin usage: Also used by next-task Phase 8 with scope=diff to clean new code before review.

enhance Plugin Agents

plugin-enhancer

Model: sonnet Purpose: Analyze plugin structures.

Checks:

plugin.json manifest validity
MCP tool definitions (additionalProperties, required array)
Security patterns (unrestricted Bash, command injection)
Component organization

Tools available:

Read, Glob, Grep
Bash (git)

agent-enhancer

Model: opus Purpose: Analyze agent prompts.

Checks (14 patterns):

Frontmatter validity
Tool restrictions
XML structure
Chain-of-thought appropriateness
Example quality
Anti-patterns (vague language, prompt bloat)

Tools available:

Read, Glob, Grep
Bash (git)

Why opus: Agent quality compounds. Bad agent prompts = bad agent outputs across all uses.

claudemd-enhancer

Model: opus Purpose: Analyze CLAUDE.md/AGENTS.md files.

Checks:

Structure (critical rules, architecture, commands)
References to actual files
Token efficiency
README duplication
Cross-platform compatibility

Tools available:

Read, Glob, Grep
Bash (git)

docs-enhancer

Model: opus Purpose: Analyze documentation quality.

Modes:

AI-only: Aggressive token reduction
Both: Balance readability with AI-friendliness

Checks:

Link validity
Structure and chunking
Semantic boundaries
Heading hierarchy

Tools available:

Read, Glob, Grep
Bash (git)

prompt-enhancer

Model: opus Purpose: Analyze prompt engineering patterns.

Checks (16 patterns):

Clarity (vague instructions)
Structure (XML, headings)
Examples (few-shot patterns)
Context/WHY presence
Output format specification
Anti-patterns (redundant CoT)

Tools available:

Read, Glob, Grep
Bash (git)

hooks-enhancer

Model: opus Purpose: Analyze hook definitions.

Checks:

Frontmatter presence and structure
Required name/description fields
Basic formatting expectations

Tools available:

Read, Glob, Grep

skills-enhancer

Model: opus Purpose: Analyze SKILL.md quality.

Checks:

Frontmatter presence and structure
Required name/description fields
Trigger phrase clarity ("Use when user asks")

Tools available:

Read, Glob, Grep

cross-file-enhancer

Model: sonnet Purpose: Analyze cross-file semantic consistency.

Checks:

Tools used vs declared in frontmatter
Agent references exist
Duplicate instructions across files
Contradictory rules (ALWAYS vs NEVER)
Orphaned agents
Skill tool mismatches

Tools available:

Read, Glob, Grep, Bash(git:*)

drift-detect Plugin Agent

plan-synthesizer

Model: opus Purpose: Deep semantic analysis for drift detection.

What it does:

Receives data from JavaScript collectors
Performs semantic matching (not string matching)
Identifies:
- Issues that should be closed (already done)
- "Done" phases that aren't done
- Release blockers
Outputs prioritized reconstruction plan

Tools available:

Read, Write

Why opus: Semantic matching requires deep understanding. "user authentication" must match auth/, login.js, session.ts. Opus handles this.

repo-intel Plugin Agent

map-validator

Model: haiku Purpose: Validate repo-intel output for obvious errors.

What it does:

Verifies map isn't empty
Flags suspiciously small symbol counts
Checks for missing language detection
Returns single-line status

Tools available:

Read

Why haiku: Validation is deterministic and lightweight.

perf Plugin Agents

perf-orchestrator

Model: opus Purpose: Coordinate /perf investigations across all phases.

What it does:

Enforces perf rules and phase order
Spawns theory, profiling, and logging helpers
Ensures checkpoints + evidence after each phase

Tools available:

Read, Write, Edit, Task, Bash(git:), Bash(npm:), Bash(cargo:), Bash(go:), Bash(pytest:), Bash(mvn:), Bash(gradle:*)

perf-theory-gatherer

Model: opus Purpose: Generate hypotheses based on git history and evidence.

Tools available:

Read, Bash(git:), Bash(npm:), Bash(pnpm:), Bash(yarn:), Bash(cargo:), Bash(go:), Bash(pytest:), Bash(python:), Bash(mvn:), Bash(gradle:)

perf-theory-tester

Model: opus Purpose: Validate hypotheses with controlled experiments.

Tools available:

Read, Write, Edit, Bash(git:), Bash(npm:), Bash(pnpm:), Bash(yarn:), Bash(cargo:), Bash(go:), Bash(pytest:), Bash(python:), Bash(mvn:), Bash(gradle:)

perf-code-paths

Model: sonnet Purpose: Map entrypoints and likely hot files before profiling.

Tools available:

Read, Grep, Glob

perf-investigation-logger

Model: sonnet Purpose: Append structured investigation logs with evidence.

Tools available:

Read, Write

perf-analyzer

Model: opus Purpose: Synthesize findings into evidence-backed recommendations.

Tools available:

Read, Write

audit-project Plugin Agents

These are role-based agents invoked via Task tool with specialized prompts. They use the built-in review subagent type with domain-specific instructions.

code-quality-reviewer

Activation: Always active Purpose: Review code quality and error handling.

Focuses on:

Code style and consistency
Best practices violations
Error handling and failure paths
Maintainability issues
Code duplication

security-expert

Activation: Always active Purpose: Find security vulnerabilities.

Focuses on:

SQL injection, XSS, CSRF vulnerabilities
Authentication and authorization flaws
Secrets exposure, insecure configurations
Input validation, output encoding

performance-engineer

Activation: Always active Purpose: Find performance bottlenecks.

Focuses on:

N+1 queries, inefficient algorithms
Memory leaks, unnecessary allocations
Blocking operations, missing async
Bundle size, lazy loading

test-quality-guardian

Activation: Always active (reports missing tests) Purpose: Validate test coverage and quality.

Focuses on:

Test coverage for new code
Edge case coverage
Test design and maintainability
Mocking appropriateness

architecture-reviewer

Activation: Conditional (if FILE_COUNT > 50) Purpose: Review code organization.

Focuses on:

Code organization and modularity
Design pattern violations
Dependency management
SOLID principles

database-specialist

Activation: Conditional (if database detected) Purpose: Review database operations.

Focuses on:

Query optimization, N+1 queries
Missing indexes
Transaction handling
Connection pooling

api-designer

Activation: Conditional (if API detected) Purpose: Review API design.

Focuses on:

REST best practices
Error handling and status codes
Rate limiting, pagination
API versioning

frontend-specialist

Activation: Conditional (if frontend detected) Purpose: Review frontend code.

Focuses on:

Component design and composition
State management patterns
Performance (memoization, virtualization)
Accessibility

backend-specialist

Activation: Conditional (if backend detected) Purpose: Review backend service and domain logic.

Focuses on:

Service boundaries and layering
Domain logic correctness
Concurrency and idempotency
Background job safety

devops-reviewer

Activation: Conditional (if CI/CD detected) Purpose: Review infrastructure and CI/CD.

Focuses on:

Pipeline configuration
Secret management
Docker best practices
Deployment strategies

learn Plugin Agent

learn-agent

Model: opus Purpose: Research any topic online and create comprehensive learning guides with RAG-optimized indexes.

What it does:

Uses progressive query architecture (funnel approach: broad → specific → deep)
Gathers 10-40 online sources based on depth level
Scores sources by authority, recency, depth, examples, uniqueness
Uses just-in-time retrieval to save tokens (only fetches high-scoring sources)
Creates structured learning guides with examples and best practices
Updates CLAUDE.md/AGENTS.md master indexes for future RAG lookups
Runs enhance:enhance-docs and enhance:enhance-prompts for quality

Tools available:

WebSearch, WebFetch, Read, Write, Glob, Grep, Skill

Output:

Topic-specific guide in agent-knowledge/
Updated master index in agent-knowledge/CLAUDE.md
Source metadata with quality scores in agent-knowledge/resources/

agnix Plugin Agent

agnix-agent

Model: sonnet Purpose: Lint agent configuration files using agnix CLI.

What it does:

Parses arguments (path, --fix, --strict, --target)
Invokes the agnix skill with the appropriate flags
Returns structured validation results

Tools available:

Bash(agnix:), Bash(cargo:), Skill, Read, Glob, Grep

Output:

Structured JSON with error/warning counts
List of diagnostics with file, line, rule, message
Fix status if --fix was used

prepare-delivery Plugin Agents

prepare-delivery-agent

Model: sonnet Purpose: Orchestrate pre-ship quality gate pipeline via skill.

What it does:

Runs prepare-delivery:test-coverage-checker and prepare-delivery:delivery-validator in sequence
Aggregates pass/fail results into a single quality gate verdict
Blocks shipping if any mandatory check fails

Tools available:

Bash (git, npm)
Skill, Task, Read, Grep, Glob

test-coverage-checker

Model: sonnet Purpose: Validate test quality for new code.

What it does:

Identifies new/modified functions
Checks if tests exist
Checks if tests are meaningful (not just path matching)
Reports coverage status

Tools available:

Bash (git, npm)
Read, Grep, Glob

Advisory only: Does not block workflow. Reports findings but continues.

delivery-validator

Model: sonnet Purpose: Final validation before shipping.

Checks:

Review status - no open issues (or explicit override)
Tests pass
Build passes
Task requirements met (extracts from task, maps to changes)
No regressions

Tools available:

Bash (git, npm)
Read, Grep, Glob

On failure: Returns to implementation with fix instructions.

Restrictions:

MUST NOT create PR
MUST NOT push
MUST NOT skip sync-docs:sync-docs-agent

gate-and-ship Plugin

No agents - command-only orchestrator that delegates to prepare-delivery and ship plugins.

consult Plugin Agent

consult-agent

Model: sonnet Purpose: Cross-tool AI consultation - get a second opinion from another AI tool.

What it does:

Formats context and question for the target tool (Gemini, Codex, Claude, OpenCode, Copilot)
Invokes the target tool non-interactively
Returns the structured response for comparison

Tools available:

Bash, Read, Glob, Grep, Skill

debate Plugin Agent

debate-orchestrator

Model: sonnet Purpose: Structured multi-round debate between AI tools.

What it does:

Frames the debate topic and assigns positions to AI tools
Manages rounds - each tool argues, then rebuts
Synthesizes final summary with key agreements and disagreements

Tools available:

Bash, Read, Glob, Grep, Skill

web-ctl Plugin Agent

web-session

Model: sonnet Purpose: Browser automation with persistent state.

What it does:

Manages headless browser sessions with auth handoff
Navigates pages, extracts content, fills forms
Maintains session state across multiple interactions

Tools available:

Bash, Read, Write, Skill

ship Plugin Agent

release-agent

Model: sonnet Purpose: Versioned release with automatic ecosystem detection.

What it does:

Detects ecosystem (npm, cargo, go, etc.) and version strategy
Bumps version, creates changelog entry, tags release
Delegates publish to CI via tag push or gh release create

Tools available:

Bash (git, gh, npm, cargo)
Read, Write, Glob, Grep

skillers Plugin Agents

skillers-recommender

Model: opus Purpose: Suggest skills, hooks, and agents from observed workflow patterns.

What it does:

Reads compacted knowledge themes from transcript analysis
Identifies repetitive manual patterns that could be automated
Recommends new skills, hooks, or agents with draft implementations

Tools available:

Read, Glob, Grep, Write

Why opus: Pattern synthesis across diverse workflows requires deep reasoning to distinguish signal from noise.

skillers-compactor

Model: sonnet Purpose: Compact transcripts into knowledge files.

What it does:

Reads raw transcripts from Claude Code, Codex, or OpenCode
Extracts observations and clusters them into knowledge themes
Writes compacted knowledge files for skillers-recommender

Tools available:

Read, Write, Glob, Grep, Bash

onboard Plugin Agent

onboard-agent

Model: sonnet Purpose: Codebase onboarding - project orientation for newcomers.

What it does:

Scans project structure, README, and config files
Identifies architecture patterns, key entry points, and conventions
Generates a concise orientation guide tailored to the contributor's role

Tools available:

Read, Glob, Grep, Bash (git)

can-i-help Plugin Agent

can-i-help-agent

Model: sonnet Purpose: Match contributor skills to project needs.

What it does:

Scans open issues, good-first-issue labels, and help-wanted tags
Profiles contributor strengths from their history or stated skills
Returns ranked list of issues the contributor is best suited to tackle

Tools available:

Bash (gh, git)
Read, Glob, Grep

Model Selection Rationale

Agent Type	Model	Reasoning
Analysis/reasoning	opus	Quality compounds - errors propagate
Pattern matching	sonnet	Good at structured tasks, fast
Mechanical execution	haiku	No judgment needed, cheapest

Key insight: For enhancers and analyzers, quality loss is exponential. Each imperfection in analysis creates downstream problems.

Tool Restrictions

Agents have restricted tool access for safety:

Agent	Restricted From	Why
implementation-agent	PR creation, git push	Workflow enforces order
prepare-delivery:delivery-validator	PR creation, git push	Must pass validation first
worktree-manager	Most tools	Only needs git
simple-fixer	Most tools	Only needs edit

Navigation

← Back to Documentation Index | Main README

Related:

/next-task Workflow - How agents orchestrate together
/ship Workflow - Shipping agents in action

FilesExpand file tree

AGENTS.md

Latest commit

History

AGENTS.md

File metadata and controls

Agent Reference

Quick Navigation

Overview

next-task Plugin Agents

task-discoverer

worktree-manager

exploration-agent

planning-agent

implementation-agent

sync-docs-agent

simple-fixer

ci-monitor

ci-fixer

deslop Plugin Agents

deslop-agent

enhance Plugin Agents

plugin-enhancer

agent-enhancer

claudemd-enhancer

docs-enhancer

prompt-enhancer

hooks-enhancer

skills-enhancer

cross-file-enhancer

drift-detect Plugin Agent

plan-synthesizer

repo-intel Plugin Agent

map-validator

perf Plugin Agents

perf-orchestrator

perf-theory-gatherer

perf-theory-tester

perf-code-paths

perf-investigation-logger

perf-analyzer

audit-project Plugin Agents

code-quality-reviewer

security-expert

performance-engineer

test-quality-guardian

architecture-reviewer

database-specialist

api-designer

frontend-specialist

backend-specialist

devops-reviewer

learn Plugin Agent

learn-agent

agnix Plugin Agent

agnix-agent

prepare-delivery Plugin Agents

prepare-delivery-agent

test-coverage-checker

delivery-validator

gate-and-ship Plugin

consult Plugin Agent

consult-agent

debate Plugin Agent

debate-orchestrator

web-ctl Plugin Agent

web-session

ship Plugin Agent

release-agent

skillers Plugin Agents

skillers-recommender

skillers-compactor

onboard Plugin Agent

onboard-agent

can-i-help Plugin Agent

can-i-help-agent

Model Selection Rationale

Tool Restrictions

Navigation