AtlasColony

Multi-agent AI coding with zero infrastructure.
No databases. No message queues. No APIs between agents. Just git.

Configure a colony of Claude Code agents. Write your goals. Walk away.
They decompose, implement with TDD, review every line, and merge to main.

5 agents autonomously building a REST API with auth. Real terminal. Real time. No edits.

What is this?

You write a goals file describing what you want built. AtlasColony spins up a team of AI agents that autonomously decompose your goals into tasks, implement them with TDD, review every line of code, and merge to main.

No human intervention. No babysitting. No copy-pasting between chat windows.

You write GOALS.md
  → CEO pivots strategy if needed
    → ATLAS breaks goals into tasks
      → 5 coders race to implement (TDD, git worktrees, isolated branches)
        → AUDIT reviews every diff, runs tests, merges or rejects
          → BETA-TESTER validates everything against goals, files bugs
            → Repeat until done

Everything is coordinated through git. No message queues. No databases. No APIs. No infrastructure. Just git commits and file moves.

Why?

Problem	AtlasColony
AI coding assistants need constant hand-holding	Agents work autonomously in loops — no prompting
One AI instance = one thread of work	N parallel coders racing through your task queue
No quality control on AI-generated code	Dedicated AUDIT agent reviews every line before merge
Context lost between sessions	Persistent memory via git history, task files, and reports
Complex infrastructure to coordinate agents	Zero infra — git is the only message bus
One-size-fits-all agent setups	Fully configurable — define any agents, any roles, any team structure

How It Compares

	AtlasColony	Claude Agent Teams	Roo Code	OpenHands
Orchestration	Git only — zero infra	Built into Claude Code	VS Code extension	Runtime + sandbox
State management	Git commits	Session state	Editor state	Runtime containers
Agent isolation	Git worktrees	Subprocess sandboxes	Single context	Docker sandboxes
Setup time	2 min, zero deps	Built into Claude Code	VS Code install	Docker + config
Audit trail	`git log` — every decision, every line	Session logs	Editor history	Sandbox logs
Self-healing	Watchdog with lease-based recovery	Manual restart	—	Health checks
Fully open source	MIT	Anthropic feature	MIT	MIT
Custom agent roles	Any role via templates	Fixed patterns	Single agent	Predefined agents

AtlasColony's bet: You don't need databases, message queues, or custom runtimes to coordinate AI agents. Git already solves distributed coordination — we just use it.

Quick Start

# 1. Clone AtlasColony
git clone https://github.com/TomOst-Sec/atlas-colony.git
cd atlas-colony

# 2. Initialize your project
./bin/colony init https://github.com/you/your-project.git

# 3. Write your goals
vim the-project/_colony/GOALS.md

# 4. Launch the colony
./bin/colony start all

# 5. Watch it build
./bin/colony dashboard

That's it. Your agents are now autonomously building your project.

What People Build With This

SaaS MVPs — Write goals for a full-stack app, come back to a working prototype
Open source maintenance — Point it at your issue backlog, let it grind through bugs
Code migrations — "Migrate this Express app to Fastify" as a single goal
Research projects — Non-coding colonies that research, synthesize, and write reports
Refactoring sprints — "Add TypeScript types to all 200 files" with 10 parallel coders

How It Works

The Agent Hierarchy

        ┌──────────┐
        │   CEO    │  Strategic oversight, goal pivots
        └────┬─────┘
             │
        ┌────┴─────┐
        │  ATLAS   │  Decomposes goals → tasks
        └────┬─────┘
             │
    ┌────────┼────────┐
    │        │        │
┌───┴──┐ ┌──┴───┐ ┌──┴───┐
│ALPHA │ │ALPHA │ │BRAVO │  5 parallel coders (TDD)
│  1   │ │ 2,3  │ │ 1,2  │
└───┬──┘ └──┬───┘ └──┬───┘
    │        │        │
    └────────┼────────┘
             │
    ┌────────┼────────┐
    │        │        │
┌───┴──┐ ┌──┴───┐ ┌──┴───┐
│AUDIT │ │ BETA │ │ DOCS │  Review + test + document
│      │ │TESTER│ │      │
└───┬──┘ └──┬───┘ └──┬───┘
    │        │        │
    └────────┼────────┘
             │
          [ main ]

The Task Lifecycle

queue/  ──→  active/  ──→  review/  ──→  done/
  ↑                           │
  └────── rejected ───────────┘  (+ bug report)
  ↑                                      │
  └──── beta-tester finds regression ────┘

ATLAS reads your goals and generates detailed task files in queue/
Coders race to claim tasks (atomic, first-come-first-served)
Each coder creates an isolated git worktree, implements with TDD, pushes a branch
AUDIT checks out the branch, runs the full test suite, reviews every line of the diff
If clean → merge to main. If problems → reject with bug report back to queue
BETA-TESTER validates all merged commits against GOALS.md — files bug tasks if something breaks or drifts from the vision
ATLAS reads bug reports and generates fix tasks. Repeat.

Coordination = Git

All agent communication happens through files in _colony/:

_colony/
├── GOALS.md           ← You write this
├── ROADMAP.md         ← ATLAS maintains this
├── CEO-DIRECTIVE.md   ← CEO writes strategic pivots
├── queue/             ← Pending tasks
├── active/            ← Tasks being worked on
├── review/            ← Awaiting AUDIT review
├── done/              ← Completed and merged
├── bugs/              ← Bug reports (AUDIT/BETA-TESTER → ATLAS)
├── reports/           ← Hourly status + daily + beta-test reports
└── logs/              ← Per-agent activity logs

No Redis. No Kafka. No PostgreSQL. Just git add, git commit, git push.

Configure Your Colony

AtlasColony is fully configurable via colony.yml. Run 3 agents or 30. Define custom roles. Mix models. Create any team structure.

Default: 10-Agent Colony

# colony.yml
teams:
  alpha:
    task_ratio: 60
  bravo:
    task_ratio: 40

agents:
  ceo:
    type: overseer
    model: opus
    cycle: 60
    skills: [strategic-oversight]

  atlas:
    type: planner
    model: opus
    cycle: 30
    skills: [task-generation]

  audit:
    type: reviewer
    model: opus
    cycle: 15
    skills: [code-review]

  beta-tester:
    type: tester
    model: opus
    cycle: 45
    skills: [beta-testing]

  docs:
    type: documenter
    model: sonnet
    cycle: 120
    skills: [documentation]

  alpha-1:
    type: coder
    team: alpha
    model: opus
    skills: [tdd-coding]

  alpha-2:
    type: coder
    team: alpha
    model: opus
    skills: [tdd-coding]

  alpha-3:
    type: coder
    team: alpha
    model: opus
    skills: [tdd-coding]

  bravo-1:
    type: coder
    team: bravo
    model: opus
    skills: [tdd-coding]

  bravo-2:
    type: coder
    team: bravo
    model: opus
    skills: [tdd-coding]

Minimal: 3-Agent Colony

teams:
  dev:
    task_ratio: 100

agents:
  planner:
    type: planner
    model: opus
    cycle: 30
    skills: [task-generation]

  coder:
    type: coder
    team: dev
    model: opus
    skills: [tdd-coding]

  reviewer:
    type: reviewer
    model: opus
    cycle: 15
    skills: [code-review]

Research Colony (non-coding)

teams:
  research:
    task_ratio: 60
  writing:
    task_ratio: 40

agents:
  director:
    type: overseer
    model: opus
    cycle: 60
    skills: [strategic-oversight]

  researcher-1:
    type: coder
    team: research
    model: opus
    skills: [research]

  researcher-2:
    type: coder
    team: research
    model: opus
    skills: [research]

  writer:
    type: coder
    team: writing
    model: opus
    skills: [technical-writing]

  editor:
    type: reviewer
    model: opus
    cycle: 20
    skills: [code-review]

Scale to 20 Coders

teams:
  frontend:
    task_ratio: 30
  backend:
    task_ratio: 40
  infra:
    task_ratio: 30

agents:
  ceo:
    type: overseer
    model: opus
    cycle: 60
  atlas:
    type: planner
    model: opus
    cycle: 20          # faster cycles for more coders
  audit:
    type: reviewer
    model: opus
    cycle: 10

  # 6 frontend coders
  fe-1: { type: coder, team: frontend, model: sonnet, skills: [tdd-coding] }
  fe-2: { type: coder, team: frontend, model: sonnet, skills: [tdd-coding] }
  fe-3: { type: coder, team: frontend, model: sonnet, skills: [tdd-coding] }
  fe-4: { type: coder, team: frontend, model: sonnet, skills: [tdd-coding] }
  fe-5: { type: coder, team: frontend, model: sonnet, skills: [tdd-coding] }
  fe-6: { type: coder, team: frontend, model: sonnet, skills: [tdd-coding] }

  # 8 backend coders
  be-1: { type: coder, team: backend, model: opus, skills: [tdd-coding] }
  be-2: { type: coder, team: backend, model: opus, skills: [tdd-coding] }
  be-3: { type: coder, team: backend, model: opus, skills: [tdd-coding] }
  be-4: { type: coder, team: backend, model: opus, skills: [tdd-coding] }
  be-5: { type: coder, team: backend, model: opus, skills: [tdd-coding] }
  be-6: { type: coder, team: backend, model: opus, skills: [tdd-coding] }
  be-7: { type: coder, team: backend, model: opus, skills: [tdd-coding] }
  be-8: { type: coder, team: backend, model: opus, skills: [tdd-coding] }

  # 6 infra coders
  ops-1: { type: coder, team: infra, model: sonnet, skills: [tdd-coding] }
  ops-2: { type: coder, team: infra, model: sonnet, skills: [tdd-coding] }
  ops-3: { type: coder, team: infra, model: sonnet, skills: [tdd-coding] }
  ops-4: { type: coder, team: infra, model: sonnet, skills: [tdd-coding] }
  ops-5: { type: coder, team: infra, model: sonnet, skills: [tdd-coding] }
  ops-6: { type: coder, team: infra, model: sonnet, skills: [tdd-coding] }

Create Custom Agents

Custom Agent Types

Create a new template in templates/agents/:

<!-- templates/agents/security-auditor.md.tmpl -->
---
name: ${AGENT_NAME}
model: ${MODEL}
allowedTools: [Read, Write, Edit, Bash, Grep, Glob, Agent, WebSearch]
---

# ${AGENT_NAME} — Security Auditor

You are a security-focused reviewer. You scan every diff for vulnerabilities
before code reaches main.

## Your Cycle
Every ${CYCLE_MINUTES} minutes:
1. Pull latest
2. Scan review/ for tasks
3. Run SAST tools on each branch
4. Check OWASP Top 10
5. Flag or approve

Then reference it in colony.yml:

agents:
  sec-check:
    type: security-auditor    # matches your template filename
    model: opus
    cycle: 20
    skills: [security-analysis]

Custom Skills

Create a new skill in templates/skills/:

<!-- templates/skills/security-analysis.md -->
---
description: "Security analysis skills — OWASP, SAST, dependency scanning"
---

# Security Analysis Skills

## OWASP Top 10 Checklist
- [ ] Injection (SQL, command, LDAP)
- [ ] Broken authentication
- [ ] Sensitive data exposure
...

Commands

colony init <remote>          # Initialize a new colony project
colony start [agent|all]      # Start agent(s)
colony stop [agent|all]       # Stop agent(s)
colony status                 # Show colony status
colony dashboard              # Live auto-refreshing dashboard
colony pause                  # Pause all agents
colony resume                 # Resume all agents
colony logs [agent]           # Tail agent logs
colony add-agent <name> <type> [team] [model]  # Add agent to config
colony nuke --confirm         # Remove ALL colony files, leave clean source code

`colony nuke` — Clean Exit

When your project is done and ready to ship, run colony nuke --confirm to strip all colony infrastructure from your project repo. It removes _colony/, .claude/, CLAUDE.md, worktrees, and all remote task branches — leaving only your clean source code. Reports and completed task history are archived to a timestamped backup folder before deletion.

Your project goes from "managed by a robot army" to "clean professional repo" in one command.

Dashboard

The live dashboard shows everything at a glance:

╔════════════════════════════════════════════════════════════════╗
║  ATLAS COLONY DASHBOARD                   2026-03-15 14:30   ║
╠════════════════════════════════════════════════════════════════╣
║                                                               ║
║  AGENTS                                                       ║
║  Strategy:                                                    ║
║    ● CEO        — Strategic oversight (60min)                 ║
║  Management:                                                  ║
║    ● ATLAS      — Task generation (30min)                     ║
║    ● AUDIT      — Code review (15min)                         ║
║  Testing:                                                     ║
║    ● BETA-TESTER — Goal validation (45min)                    ║
║  Documentation:                                               ║
║    ● DOCS        — README & docs sync (120min)                ║
║  Alpha Team (3 coders):                                       ║
║    ● ALPHA-1    — Coder (continuous)                          ║
║    ● ALPHA-2    — Coder (continuous)                          ║
║    ● ALPHA-3    — Coder (continuous)                          ║
║  Bravo Team (2 coders):                                       ║
║    ● BRAVO-1    — Coder (continuous)                          ║
║    ● BRAVO-2    — Coder (continuous)                          ║
║                                                               ║
║    10/10 agents running                                         ║
║                                                               ║
╠════════════════════════════════════════════════════════════════╣
║                                                               ║
║  PIPELINE                                                     ║
║    queue/  ──▶  active/  ──▶  review/  ──▶  done/            ║
║      5           3             1            12                ║
║                                                               ║
╚════════════════════════════════════════════════════════════════╝

Reports

Hourly status — _colony/reports/hourly-STATUS.md Quick snapshot: pipeline depth, tasks merged, active work, health check.

Daily report — _colony/reports/YYYY-MM-DD.md Full analysis: velocity, quality metrics, team performance, roadmap progress, recommendations.

Beta test report — _colony/reports/beta-test-YYYYMMDD-HHMM.md Commit-level validation against goals, regression detection, and auto-filed bug tasks.

Both generated automatically by AUDIT and BETA-TESTER agents. Zero human effort.

Architecture

┌─────────────────────────────────────────────────────────┐
│                    Your Machine                          │
│                                                          │
│  ┌──────┐ ┌──────┐ ┌──────┐ ┌──────┐ ┌──────┐         │
│  │ tmux │ │ tmux │ │ tmux │ │ tmux │ │ tmux │  ...     │
│  │ CEO  │ │ATLAS │ │AUDIT │ │ α-1  │ │ β-1  │         │
│  └──┬───┘ └──┬───┘ └──┬───┘ └──┬───┘ └──┬───┘         │
│     │        │        │        │        │               │
│     └────────┴────────┴────────┴────────┘               │
│                       │                                  │
│              ┌────────┴────────┐                        │
│              │   git repo      │                        │
│              │  _colony/       │                        │
│              │  ├── queue/     │  ← tasks flow through  │
│              │  ├── active/    │    these directories    │
│              │  ├── review/    │    via git commits      │
│              │  ├── done/      │                        │
│              │  └── ...        │                        │
│              └────────┬────────┘                        │
│                       │                                  │
│              ┌────────┴────────┐                        │
│              │  GitHub Remote  │                        │
│              └─────────────────┘                        │
└─────────────────────────────────────────────────────────┘

Each agent runs in its own tmux session as a Claude Code instance with a specialized agent definition. They communicate exclusively through git commits — moving task files between directories, pushing branches, and writing reports.

Zero infrastructure. If your machine can run git and tmux, it can run AtlasColony.

Requirements

Claude Code (npm install -g @anthropic-ai/claude-code)
Anthropic API key (set ANTHROPIC_API_KEY)
git and tmux
yq (YAML processor — installed automatically during init)
Linux, macOS, or WSL on Windows

FAQ

How much does it cost to run?

Depends on the number of agents and their models. A full 9-agent Opus colony uses significant API credits. Tips:

Use sonnet for coders (fast, cheap, excellent at coding)
Use opus for CEO, ATLAS, and AUDIT (need deep reasoning)
Start with the minimal 3-agent config to test

Can I run this across multiple machines?

Yes! Since coordination is through git, agents can run on any machine that can push/pull from the same repo. Just run colony start <agent> on each machine with the appropriate agent name.

What happens if an agent crashes or stalls?

The watchdog runs every 5 minutes and detects two failure modes:

Crashed (tmux session dead) — restarts the agent with exponential backoff
Stalled (tmux alive but no git activity for 30-45 min) — kills the session, moves stuck tasks back to queue, restarts with fresh context

The watchdog uses lease-based tracking: tasks carry Claimed-At timestamps so stuck tasks are recovered precisely instead of relying on file age. Restart backoff escalates (5m → 10m → 20m → 40m → 60m cap) to avoid hammering a persistently failing agent. Orphaned worktrees are cleaned up automatically.

Can I use this with other AI models?

AtlasColony is built on Claude Code, which currently supports Claude models. The architecture is model-agnostic in principle — the coordination protocol is just git files. Future versions may support other agent runtimes.

How do I stop everything?

colony pause          # Graceful — agents finish current work and wait
colony stop all       # Kill all tmux sessions immediately
tmux kill-server      # Nuclear option

Can I add or remove agents while the colony is running?

Yes. Edit colony.yml, run colony init to regenerate configs, then colony start <new-agent>. Existing agents don't need to restart — they'll pick up changes on their next git pull.

How is this different from just running Claude Code?

Claude Code is a single agent working in a single thread. AtlasColony is N agents working in parallel with separation of concerns — planners don't write code, coders don't review their own work, and every merge goes through an independent audit. It's the difference between one developer and a development team.

How is this different from Cursor's Background Agents or Codex?

Those are cloud-hosted, closed-source, and vendor-locked. AtlasColony runs on your machine, coordinates through git (which you already understand and can inspect), and is fully open source. You own the entire workflow.

Built-In Agent Types

Type	Role	Cycle	Skills
`overseer`	Strategic direction, goal pivots, directives	Periodic	Strategic oversight, risk detection
`planner`	Task decomposition, roadmap, bug processing	Periodic	Architecture, dependency analysis
`reviewer`	Code review, testing, merge/reject, reports	Periodic	Security analysis, QA, debugging
`tester`	Validates commits against goals, regression detection, bug filing	Periodic	Functional testing, goal alignment
`documenter`	Keeps README, changelog, and docs in sync with codebase	Periodic (2hr)	Technical writing, code reading, accuracy verification
`coder`	TDD implementation, branch per task	Continuous	TDD, debugging, git worktrees

Create custom types by adding .md.tmpl files to templates/agents/.

Contributing

Contributions welcome! See CONTRIBUTING.md.

Areas we'd love help with:

New agent type templates (DevOps, security auditor, documentation writer)
New skill definitions
Example colony.yml configs for different use cases
Dashboard improvements
Multi-model support (beyond Claude)
Performance optimization for large colonies (20+ agents)

Acknowledgments

Built with:

Claude Code by Anthropic
BMAD-METHOD for agile AI workflows
Superpowers for TDD and debugging skills
Agency Agents for agent personas

Star this repo if your AI agents should work as hard as you do.

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
assets		assets
benchmarks		benchmarks
bin		bin
cmd/atlas		cmd/atlas
dashboard		dashboard
docs		docs
internal		internal
lib		lib
scripts		scripts
templates		templates
.gitignore		.gitignore
.gitmodules		.gitmodules
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile.tui		Makefile.tui
README.md		README.md
colony.yml		colony.yml
demo.tape		demo.tape
go.mod		go.mod
go.sum		go.sum
verify-fix.sh		verify-fix.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AtlasColony

What is this?

Why?

How It Compares

Quick Start

What People Build With This

How It Works

The Agent Hierarchy

The Task Lifecycle

Coordination = Git

Configure Your Colony

Default: 10-Agent Colony

Minimal: 3-Agent Colony

Research Colony (non-coding)

Scale to 20 Coders

Create Custom Agents

Custom Agent Types

Custom Skills

Commands

`colony nuke` — Clean Exit

Dashboard

Reports

Architecture

Requirements

FAQ

Built-In Agent Types

Contributing

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AtlasColony

What is this?

Why?

How It Compares

Quick Start

What People Build With This

How It Works

The Agent Hierarchy

The Task Lifecycle

Coordination = Git

Configure Your Colony

Default: 10-Agent Colony

Minimal: 3-Agent Colony

Research Colony (non-coding)

Scale to 20 Coders

Create Custom Agents

Custom Agent Types

Custom Skills

Commands

colony nuke — Clean Exit

Dashboard

Reports

Architecture

Requirements

FAQ

Built-In Agent Types

Contributing

Acknowledgments

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`colony nuke` — Clean Exit

Packages