Multi-Agent Ralph Loop

Autonomous orchestration framework for Claude Code with MemPalace-inspired memory, parallel-first Agent Teams, Aristotle First Principles methodology, and quality gates.

What It Does

Ralph extends Claude Code into a multi-agent development framework with a structured memory system inspired by the Memory Palace technique. Every task is analyzed from first principles, decomposed into parallel subtasks, assigned to specialized teammates, and validated through quality gates before completion.

Capability	Description
MemPalace Memory	4-layer memory stack (L0-L3) with Obsidian vault knowledge graph and learned rules taxonomy
Parallel-First	All independent tasks execute in parallel via Agent Teams (mandatory for complexity >= 3)
6 Teammates	ralph-coder, ralph-reviewer, ralph-tester, ralph-researcher, ralph-frontend, ralph-security
Hook System	Lifecycle hooks for validation, quality gates, security guards, and automatic learning
Aristotle Analysis	5-phase first principles deconstruction before every non-trivial task
Quality Gates	4-stage blocking validation: correctness, quality, security, consistency
Comprehensive Tests	Full test suite covering layers, hooks, security, skills, and pipeline

MemPalace Memory System

Inspired by the MemPalace repository (Memory Palace technique for LLM agents), Ralph implements a layered memory architecture with key differences based on our implementation findings.

Layer Stack (Session Wake-up)

Layer	File	Purpose
L0	`~/.ralph/layers/L0_identity.md`	Agent identity + principles
L1	`~/.ralph/layers/L1_essential.md`	Actionable rules (filtered from corpus)
L2	`.claude/rules/learned/{halls,rooms,wings}/`	Project-specific taxonomy (on-demand)
L3	Obsidian vault grep	Full knowledge base queries (on-demand)

Learned Rules Taxonomy

Rules organized in 3 dimensions for flexible retrieval:

Dimension	Directory	Organization
Halls (by type)	`.claude/rules/learned/halls/`	decisions, patterns, anti-patterns, fixes
Rooms (by topic)	`.claude/rules/learned/rooms/`	hooks, memory, agents, security, testing
Wings (by scope)	`.claude/rules/learned/wings/`	`_global/`, `multi-agent-ralph-loop/`

Key Implementation Findings

These findings emerged during our MemPalace implementation and may be relevant to others building LLM memory systems:

Finding	Detail
Encoding doesn't reduce tokens	Unicode PUA encoding increased BPE tokens. Word count metrics falsely reported reduction.
Selection beats encoding	Choosing fewer rules achieved the target; compressing the same rules did not.
Taxonomy needs noise filtering	46% of auto-learned rules were noise (cross-domain repeats, vague bundles). Filtering is essential.

Full analysis: AAAK_LIMITATIONS_ADR

Learning Pipeline (Automatic)

SESSION (any repo)
  |
  +-- Stop      --> continuous-learning.sh --> vault + procedural memory
  +-- PostToolUse --> semantic extractors  --> vault facts & decisions
  +-- SessionStart --> vault-graduation.sh --> promote to local rules
  +-- SessionEnd   --> vault-index-updater  --> update indices

All learning flows project -> global -> vault. Only universal patterns graduate to global scope.

Quick Start

git clone https://github.com/alfredolopez80/multi-agent-ralph-loop.git
cd multi-agent-ralph-loop

# Validate global infrastructure
bash scripts/validate-global-infrastructure.sh

# Run tests
python3 -m pytest tests/ -q

# Use
/orchestrator "Create a REST API endpoint"
/iterate "Fix all lint errors"
/security src/

Agent Teams

6 specialized teammates for parallel execution:

Teammate	Role	Tools
`ralph-coder`	Implementation	Read, Edit, Write, Bash
`ralph-reviewer`	Code review (OWASP)	Read, Grep, Glob
`ralph-tester`	Testing	Read, Edit, Write, Bash(test)
`ralph-researcher`	Research (Zai MCP)	Read, Grep, Glob, WebSearch
`ralph-frontend`	Frontend (WCAG 2.1 AA)	LSP, Read, Edit, Write, Bash
`ralph-security`	Security (6 pillars)	LSP, Read, Grep, Glob, Bash

Agent Teams is enabled via CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1 in settings.json. Teammates are spawned in parallel by the orchestrator, iterate, parallel, security, and task-batch skills.

Core Skills

Skill	Purpose
`/orchestrator`	Full 10-step workflow: evaluate, clarify, classify, plan, execute, validate, retrospect
`/iterate`	Iterative execution until VERIFIED_DONE
`/parallel`	Run multiple independent tasks concurrently
`/task-batch`	Autonomous batch execution from PRD files
`/gates`	Multi-language quality gate validation
`/security`	Multi-agent security audit (OWASP, semgrep, gitleaks)
`/autoresearch`	Autonomous experimentation loop with Smart Setup
`/adversarial`	Spec refinement with multi-model cross-validation
`/bugs`	Systematic bug hunting
`/ship`	Pre-launch checklist (gates + security + review)
`/spec`	Verifiable technical specification before coding

Quality Gates

4-stage validation, all blocking except consistency:

CORRECTNESS -- Syntax valid, logic sound
QUALITY -- Types, no debug artifacts
SECURITY -- semgrep + gitleaks + OWASP validation
CONSISTENCY -- Linting and style (advisory)

Hook enforcement via TeammateIdle and TaskCompleted events ensures no agent completes without passing gates.

Parallel-First Rule

All independent tasks MUST execute in parallel. Sequential execution requires documented dependency.

Complexity 1-2: Direct execution (no team required)
Complexity 3+:  Agent Teams with parallel teammates (MANDATORY)

See: .claude/rules/parallel-first.md

Architecture

User Request --> Claude Code
                    |
            Aristotle Analysis (5 phases)
                    |
            Task Classification (1-10)
                    |
        +-----------+-----------+
        |                       |
    Agent Teams            Quality Gates
    (parallel)             (blocking)
        |                       |
    ralph-coder ---+     CORRECTNESS ok
    ralph-tester --+     QUALITY ok
    ralph-reviewer-+     SECURITY ok
    ralph-security-+     CONSISTENCY ok
        |                       |
        +-----------+-----------+
                    |
              VERIFIED_DONE
                    |
            MemPalace Learning
            (session -> vault -> global)

Security

The framework includes multiple layers of security enforcement:

Layer	Trigger	Purpose
`git-safety-guard.py`	PreToolUse (Bash)	Blocks destructive git operations and command chaining
`repo-boundary-guard.sh`	PreToolUse (Bash)	Prevents operations outside current repo
`audit-secrets.js`	PostToolUse	Audit logging for 20+ secret patterns
`teammate-idle-quality-gate.sh`	TeammateIdle	Quality gates before teammate goes idle
`task-completed-quality-gate.sh`	TaskCompleted	Multi-gate validation before task completion
`task-plan-sync.sh`	TaskCreated	Syncs task creation to plan-state.json

Context Optimization

Ralph uses symlinks (not copies) for all global rules, skills, and agents. This eliminates content duplication and reduces context overhead by ~29% (~10K tokens saved per session).

# Sync rules from repo to global (creates symlinks)
bash scripts/sync-rules.sh

# Preview changes without executing
bash scripts/sync-rules.sh --dry-run

Distribution policy: See docs/architecture/DISTRIBUTION_POLICY.md for the symlink vs copy strategy per component type (Rules=COPY, Hooks=COPY, Agents=SYMLINK, Skills=MIXED).

Requirements

Tool	Version	Required
Claude Code	v2.1.42+	Yes
Bash	4.0+	Yes
jq	1.6+	Yes
git	2.0+	Yes
python3	3.8+	Yes (for tests)
Obsidian	Any	Optional (for vault KG)
GitHub CLI	Any	Optional
semgrep	Any	Optional (security)
gitleaks	Any	Optional (secrets)

Testing

python3 -m pytest tests/ -q                    # Full test suite
bash scripts/validate-global-infrastructure.sh  # Infrastructure checks

Configuration

The system is model-agnostic -- all skills and agents inherit the configured model from settings, no per-command flags required.

{
  "env": {
    "ANTHROPIC_DEFAULT_SONNET_MODEL": "your-model",
    "CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS": "1"
  }
}

Skills are symlinked to multiple platform directories. Source of truth: .claude/skills/ in this repo.

Global Infrastructure

All Ralph advantages (Plan Mode, Aristotle, Parallel-First, Agent Teams) work in any project -- rules, skills, and agents are symlinked globally.

# Validate global infrastructure
bash scripts/validate-global-infrastructure.sh

# Auto-fix broken symlinks
bash scripts/validate-global-infrastructure.sh --fix

Autoresearch

Autonomous experimentation loop inspired by karpathy/autoresearch. Continuously modifies code, measures metrics, and keeps only improvements.

Smart Setup reduces configuration from 14+ manual parameters to 2-3 guided questions:

Phase	Name	What it does
0	SCOUT	Silent auto-detection of project type, scripts, metrics
1	WIZARD	2-3 AskUserQuestion with pre-filled options and previews
2	VALIDATE	Dry-run verification (eval works, metric extracts, git clean)

9 domain templates: ML Training, Node.js Tests, Bundle Size, Python Tests, Prompt Engineering, SQL, Rust, Lighthouse, Custom.

/autoresearch "optimize my tests"     # Smart mode (auto-detect)
/autoresearch --manual                # Classic setup

Documentation

Topic	Location
Architecture	`docs/architecture/`
AAAK Limitations ADR	`docs/architecture/AAAK_LIMITATIONS_ADR_2026-04-07.md`
Anti-Rationalization	`docs/reference/anti-rationalization.md`
Aristotle Methodology	`docs/reference/aristotle-first-principles.md`
Security	`docs/security/`
Hooks Reference	`docs/hooks/`
Benchmarks	`docs/benchmark/`
Batch Execution	`docs/batch-execution/`

Acknowledgments

MemPalace -- Original Memory Palace technique research for LLM agents that inspired our layered memory architecture. Our implementation diverges in key areas documented in AAAK_LIMITATIONS_ADR.
Claude Code -- Base orchestration platform with hooks, skills, and Agent Teams APIs.
karpathy/autoresearch -- Inspiration for the autonomous experimentation loop.

License

MIT License - see LICENSE file.

Name		Name	Last commit message	Last commit date
Latest commit History 508 Commits
.claude		.claude
.github/workflows		.github/workflows
.ralph/plans		.ralph/plans
config		config
docs		docs
scripts		scripts
tests		tests
.filter-repo-rules.txt		.filter-repo-rules.txt
.gitignore		.gitignore
.gitmodules		.gitmodules
.shellcheckrc		.shellcheckrc
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE-COMPACT.md		CLAUDE-COMPACT.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.es.md		README.es.md
README.md		README.md
SIMULATION_README.md		SIMULATION_README.md
TESTING.md		TESTING.md
agent-browser-policy.json		agent-browser-policy.json
agent-browser.json		agent-browser.json
install.sh		install.sh
multi-agent-ralph-loop-threat-model.md		multi-agent-ralph-loop-threat-model.md
simulate-context-auto.sh		simulate-context-auto.sh
simulate-context.sh		simulate-context.sh
test-context-thresholds.sh		test-context-thresholds.sh
test-git-root.txt		test-git-root.txt
test-quality-validation.js		test-quality-validation.js
test-security-check.ts		test-security-check.ts
uninstall.sh		uninstall.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-Agent Ralph Loop

What It Does

MemPalace Memory System

Layer Stack (Session Wake-up)

Learned Rules Taxonomy

Key Implementation Findings

Learning Pipeline (Automatic)

Quick Start

Agent Teams

Core Skills

Quality Gates

Parallel-First Rule

Architecture

Security

Context Optimization

Requirements

Testing

Configuration

Global Infrastructure

Autoresearch

Documentation

Acknowledgments

License

References

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Multi-Agent Ralph Loop

What It Does

MemPalace Memory System

Layer Stack (Session Wake-up)

Learned Rules Taxonomy

Key Implementation Findings

Learning Pipeline (Automatic)

Quick Start

Agent Teams

Core Skills

Quality Gates

Parallel-First Rule

Architecture

Security

Context Optimization

Requirements

Testing

Configuration

Global Infrastructure

Autoresearch

Documentation

Acknowledgments

License

References

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages