Optifiner is a self-evolving code framework that automatically improves codebases through multi-agent AI-driven optimization. It spawns parallel AI agents that propose and test code improvements, keeping only changes that measurably improve performance against benchmark metrics.
- Multi-Agent Evolution: Deploy 10+ parallel AI agents that autonomously improve your code
- Benchmark-Driven: All improvements are validated against your custom evaluation metrics
- Git-Integrated: Every improvement is tracked, version-controlled, and reversible
- Real-Time Visualization: Monitor evolution progress through an interactive web dashboard
- Multi-Model Support: Works with Claude, GPT-4, Gemini, and other LLMs
- Generational Optimization: Runs multiple generations with automatic convergence detection
- Production-Ready: Docker support, scalable architecture, comprehensive observability
- Performance Optimization: Automatically refactor slow code for better throughput/latency
- Algorithm Improvement: Evolve sorting, pathfinding, and scheduling algorithms
- Game AI Enhancement: Improve NPC behavior and game mechanics
- Competitive Programming: Auto-optimize solutions for algorithmic contests
- ML Model Tuning: Refine hyperparameters and training code
- Python 3.10+
- Node.js 18+
- Docker & Docker Compose (for full stack)
- API keys for at least one LLM provider:
  - Anthropic (Claude)
  - Google (Gemini)
  - OpenAI (GPT)
```bash
# Clone the repository
git clone https://github.com/yourusername/optifiner.git
cd optifiner

# Install worker dependencies
cd services/worker
pip install -r requirements.txt

# Install web UI dependencies
cd ../../apps/web
npm install
```

Your codebase needs a benchmark script that outputs JSON with your metric:
```python
# optifiner_benchmark.py
import json
import subprocess
import time

def evaluate():
    """Run the benchmark and return a score."""
    start = time.time()
    result = subprocess.run(['python', 'main.py'], capture_output=True)
    elapsed = time.time() - start
    if result.returncode == 0:
        return 100.0 / (elapsed + 1)  # Faster = higher score
    return 0.0

if __name__ == '__main__':
    score = evaluate()
    # Output JSON (supports both higher-is-better and lower-is-better metrics)
    print(json.dumps({
        "score": score,
        "metric_name": "throughput",  # e.g. "FPS", "throughput", "cycles", "latency_ms"
        "test_gate": True,            # Set False if tests fail
        "higher_is_better": True,     # True for FPS/throughput, False for cycles/latency
    }))
```

Metric direction: The system auto-detects whether higher or lower is better based on metric names like "cycles", "latency", "ms" (lower is better) vs "FPS", "throughput" (higher is better). You can also explicitly set `"higher_is_better": false` for lower-is-better metrics.
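For a lower-is-better metric, the same JSON contract applies with the direction flag flipped. A hypothetical latency evaluator might look like this (`run_benchmark` is a stand-in for your real workload, not part of Optifiner):

```python
import json
import time

def run_benchmark():
    """Placeholder workload; replace with your real benchmark command."""
    start = time.time()
    sum(i * i for i in range(100_000))  # stand-in computation
    return (time.time() - start) * 1000  # elapsed milliseconds

if __name__ == '__main__':
    latency_ms = run_benchmark()
    print(json.dumps({
        "score": latency_ms,
        "metric_name": "latency_ms",
        "test_gate": True,
        "higher_is_better": False,  # explicit: lower latency wins
    }))
```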
```bash
cd services/worker

# Single generation with 5 agents
python cli.py /path/to/your/repo \
  --evaluator /path/to/evaluate.py \
  --agents 5 \
  --generations 1 \
  --model-provider google \
  --model-name gemini-2.5-flash

# Multiple generations with parallel execution
python cli.py /path/to/your/repo \
  --evaluator /path/to/evaluate.py \
  --agents 10 \
  --parallel 4 \
  --generations 5 \
  --output results.json
```
```bash
# Results are committed to git
git log --oneline

# View detailed evolution metrics
cat results.json | jq '.'
```
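For scripted checks, the results file can also be summarized from Python. A minimal sketch; the `"baseline_score"`/`"generations"`/`"best_score"` schema here is an assumption for illustration — inspect your own `results.json` for the actual layout:

```python
import json

def summarize(results):
    """Return a per-generation summary string from a results dict.

    NOTE: the field names used here are hypothetical, not Optifiner's
    documented output format.
    """
    lines = []
    baseline = results.get("baseline_score")
    if baseline is not None:
        lines.append(f"baseline: {baseline:.3f}")
    for i, gen in enumerate(results.get("generations", []), start=1):
        lines.append(f"gen {i}: best {gen['best_score']:.3f}")
    return "\n".join(lines)

# usage: print(summarize(json.load(open("results.json"))))
```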
```bash
cd apps/web

# Development
npm run dev       # Runs on http://localhost:5173

# Production build
npm run build
npm run preview
```

Optifiner consists of three main components:
```
┌──────────────────────────────────────────────────────────────┐
│                     Web Dashboard (React)                    │
│         Real-time visualization of evolution progress        │
└──────────────────────────┬───────────────────────────────────┘
                           │ WebSocket
                           ▼
┌──────────────────────────────────────────────────────────────┐
│                    API Backend (FastAPI)                     │
│               Project management & coordination              │
└──────────────────────────┬───────────────────────────────────┘
                           │
        ┌──────────────────┴──────────────────┐
        ▼                                     ▼
 ┌─────────────┐                     ┌─────────────────┐
 │    Redis    │                     │   PostgreSQL    │
 │  Task Queue │                     │   History DB    │
 └─────────────┘                     └─────────────────┘
        ▲
        │ Celery Tasks
        │
┌───────┴──────────────────────────────────────────────────────┐
│                 LangGraph Evolution Worker                   │
│                                                              │
│  ┌────────────────────────────────────────────────────────┐  │
│  │   Agent Pool (Analyzer, Refactorer, Optimizer, etc.)   │  │
│  │                                                        │  │
│  │   Each Agent:                                          │  │
│  │    • Analyzes code with LLM                            │  │
│  │    • Proposes improvements                             │  │
│  │    • Edits files in sandbox workspace                  │  │
│  │    • Runs evaluator benchmarks                         │  │
│  │    • Commits improvements if score improves            │  │
│  └────────────────────────────────────────────────────────┘  │
│                                                              │
│  Tools: read_file, write_file, edit_file, grep, eval...      │
└──────────────────────────────────────────────────────────────┘
```
- Getting Started - Detailed setup and configuration guide
- Architecture - System design, component details, and workflows
- Agent Types - Description of each agent type and its capabilities
- API Reference - CLI commands, endpoints, and configuration options
- Examples - Real-world example projects and use cases
- Deployment - Production deployment with Docker Compose
```bash
# LLM Provider (google, anthropic, or openai)
MODEL_PROVIDER=google
MODEL_NAME=gemini-2.5-flash
GOOGLE_API_KEY=your-key-here

# Evolution parameters
AGENTS=10              # Number of parallel agents
GENERATIONS=5          # Number of evolution generations
MAX_ITERATIONS=15      # Max tool calls per agent
PARALLEL=4             # Parallel execution workers

# Workspace
WORKSPACE_ROOT=/tmp/optifiner-workspace

# Database (for full stack)
DATABASE_URL=postgresql://user:pass@localhost/optifiner
REDIS_URL=redis://localhost:6379
```

| Provider | Models |
|---|---|
| Anthropic | claude-opus-4-20250514, claude-sonnet-4-5-20250514, claude-haiku-4-20250514 |
| Google | gemini-2.5-flash, gemini-3-flash-preview |
| OpenAI | gpt-4o, gpt-4-turbo |
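Worker code might read these environment variables along the following lines. This is a hedged sketch: the variable names match the `.env` example, but the defaults and the `load_config` helper itself are illustrative, not Optifiner's actual implementation:

```python
import os

def load_config(env=os.environ):
    """Read evolution settings from the environment with illustrative defaults."""
    return {
        "model_provider": env.get("MODEL_PROVIDER", "google"),
        "model_name": env.get("MODEL_NAME", "gemini-2.5-flash"),
        "agents": int(env.get("AGENTS", "10")),
        "generations": int(env.get("GENERATIONS", "5")),
        "max_iterations": int(env.get("MAX_ITERATIONS", "15")),
        "parallel": int(env.get("PARALLEL", "4")),
        "workspace_root": env.get("WORKSPACE_ROOT", "/tmp/optifiner-workspace"),
    }
```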
```
1. Initialize Evolution
   ├─ Create workspace (isolated copy of repo)
   └─ Get baseline score from evaluator

2. Generation 1 (10 agents in parallel)
   ├─ Agent 1 (analyzer): Identifies bottlenecks
   │    └─ Proposes refactoring → Tests → Score improves! ✓
   ├─ Agent 2 (optimizer): Tweaks parameters
   │    └─ Proposes changes → Tests → No improvement ✗
   ├─ Agent 3 (feature): Adds caching
   │    └─ Proposes changes → Tests → Score improves! ✓
   └─ ...more agents...

3. Generation 2
   ├─ Builds on successful changes from Gen 1
   └─ Proposes additional improvements

4. Results
   ├─ All improvements committed to git
   ├─ Fitness curve plotted
   └─ Summary report generated
```
```
optifiner/
├── apps/
│   ├── web/          # React frontend UI
│   └── api/          # FastAPI backend
├── services/
│   └── worker/       # LangGraph evolution agent
├── packages/
│   └── shared/       # Shared utilities
├── examples/         # Example projects
├── infra/            # Docker & deployment
├── docs/             # Documentation
└── scripts/          # Utility scripts
```
```bash
# Install all dependencies
npm install -ws

# Run linter
npm run lint -ws

# Run tests
npm run test -ws

# Build everything
npm run build -ws
```

```bash
# Build all images
docker-compose build

# Start all services
docker-compose up

# View logs
docker-compose logs -f worker
```

We welcome contributions! Please:
- Fork the repository
- Create a feature branch (`git checkout -b feature/your-feature`)
- Commit your changes (`git commit -am 'Add your feature'`)
- Push to the branch (`git push origin feature/your-feature`)
- Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- Built with LangGraph for agent orchestration
- Powered by leading LLM providers: Anthropic, Google, and OpenAI
- UI inspired by modern DevOps dashboards
- Issues: Report bugs on GitHub Issues
- Discussions: Join our GitHub Discussions
- Documentation: See the docs folder for detailed guides
- ✅ Core evolution engine working
- ✅ Multi-agent orchestration with LangGraph
- ✅ React web dashboard
- 🚧 Full API backend (in progress)
- 🚧 Distributed task queue (in progress)
- 🚧 Production deployment guide

Start evolving your code today! 🧬