Opticon

Multi-agent orchestration platform where AI agents each control their own cloud Linux desktop to execute tasks in parallel. Submit a prompt, review the task breakdown, then watch agents work in real time.

How It Works

Submit a prompt — e.g. "Research the top 5 AI frameworks and create a comparison spreadsheet"
Review tasks — the orchestrator decomposes the prompt into independent subtasks via a kanban board
Watch agents work — each agent boots a cloud Linux desktop and executes its task with full visibility
See results — live desktop streams, reasoning sidebar, shared whiteboard, and session replays

Each agent runs a vision-based observe-think-act loop: screenshot the desktop, send it to an LLM, receive a mouse/keyboard action, execute it, repeat.

Architecture

Browser (Next.js)  ←Socket.io→  Backend (Node.js)  ←Socket.io→  Python Workers
                                      │                              │
                                      │                         Dedalus Labs SDK
                                      │                         (agent brain)
                                      │                              │
                                 Orchestrator                   E2B Desktop SDK
                                 (Dedalus K2 Think)             (computer control)
                                                                     │
                                                              E2B Cloud Sandboxes
                                                              (isolated Linux VMs)

Frontend: Next.js 16, React 19, Tailwind CSS 4, shadcn/ui
Real-time: Socket.io (browser ↔ backend ↔ workers)
Orchestrator: Dedalus Labs TypeScript SDK (K2 Think for task decomposition)
Agent brain: Dedalus Labs Python SDK (vision loop with tool calling)
Computer use: E2B Desktop SDK (cloud Linux sandboxes with noVNC streaming)
Auth: NextAuth with Google OAuth
Database: Neon PostgreSQL via Drizzle ORM
Billing: Flowglad

Project Structure

/frontend
  /app                  Next.js App Router pages and API routes
  /components           React components (agent grid, thinking sidebar, kanban board)
  /lib                  Shared utilities, types, socket setup, session store
  server.ts             Custom HTTP server with Socket.io
/workers
  worker.py             Python agent worker (vision loop)
  e2b_tools.py          E2B Desktop SDK tool wrappers
  replay.py             Session replay/timelapse recording
  /tools                Additional tool modules

Setup

Prerequisites

Node.js 18+
Python 3.10+
API keys for Dedalus Labs and E2B

Install Dependencies

# Frontend
cd frontend
npm install

# Python workers
pip install -r requirements.txt

Environment Variables

Create frontend/.env.local:

DEDALUS_API_KEY=         # Dedalus Labs SDK (orchestrator + agent workers)
E2B_API_KEY=             # E2B sandbox provisioning
PYTHON_PATH=             # Full path to python3 binary

Run

cd frontend
npm run dev

This starts the Next.js dev server with Socket.io. The backend spawns Python worker processes automatically when a session starts.

Key Design Decisions

Agents run outside sandboxes — Python workers send commands to E2B VMs remotely. API keys and agent code are never exposed to content inside the VM.
Push-based task assignment — The backend assigns tasks to agents (agents don't pull). Avoids race conditions without distributed locking.
Vision-based computer use — Screenshots are injected as actual images into the LLM conversation, not as base64 text in tool results. This is critical for the model to actually "see" the desktop.
Session persistence — Sessions survive browser tab close. Agents keep running and you can reconnect via session ID.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 100 Commits
.next		.next
frontend		frontend
symphony		symphony
workers		workers
.DS_Store		.DS_Store
.dockerignore		.dockerignore
.gitignore		.gitignore
.mcp.json		.mcp.json
CLAUDE.md		CLAUDE.md
DEVPOST.md		DEVPOST.md
Dockerfile		Dockerfile
PLAN.md		PLAN.md
PYTHON_WORKERS_PLAN.md		PYTHON_WORKERS_PLAN.md
README.md		README.md
READNME.txt		READNME.txt
SPEC.md		SPEC.md
TAB-10-spec.md		TAB-10-spec.md
backend(node)plan.md		backend(node)plan.md
package-lock.json		package-lock.json
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Opticon

How It Works

Architecture

Project Structure

Setup

Prerequisites

Install Dependencies

Environment Variables

Run

Key Design Decisions

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 4

Languages

Folders and files

Latest commit

History

Repository files navigation

Opticon

How It Works

Architecture

Project Structure

Setup

Prerequisites

Install Dependencies

Environment Variables

Run

Key Design Decisions

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 4

Languages

Packages