AI sessions break. Corrupted tool chains return 400s. Content filters block legitimate output. Rate limits lock you out mid-task. Runaway agents burn your budget in minutes. And vendors close the bug reports as stale.
NeuroRouter sits between your tools and the API. It blocks secrets, strips wasted tokens, repairs broken requests, and keeps your work alive when everything else fails.
Runs locally. Your API key never leaves your machine except to the provider.
The community edition is published at obstalabs/neurorouter. Install it with Homebrew, Scoop, or download the release binaries directly.
macOS or Linux via the public Obsta Labs tap.
brew tap obstalabs/tap
brew install obstalabs/tap/neurorouter
Windows via the public Obsta Labs bucket.
scoop bucket add obstalabs https://github.com/obstalabs/scoop-bucket
scoop install obstalabs/neurorouter
Download tarballs and zip archives from the release page.
https://github.com/obstalabs/neurorouter/releases/latest
Everything happens before the request leaves your machine. No latency. No infra. No key exposure. No lock-in.
Block secrets
Secrets never reach the API. Credentials, tokens, and connection strings are intercepted inline. Detected leaks are flagged for rotation — not just blocked, but reported with specific remediation steps so you know exactly what to fix.
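To make the interception concrete, here is a minimal sketch of the kind of inline pattern matching a secret scanner performs. The regex and the AWS-style key value are illustrative assumptions, not NeuroRouter's actual detection rules:

```shell
# Illustrative only: one pattern a secret scanner might match inline.
# The regex below is an assumption, not NeuroRouter's internal rule set.
payload='config: AKIAIOSFODNN7EXAMPLE'   # AWS-style access key ID (dummy value)
if echo "$payload" | grep -qE 'AKIA[0-9A-Z]{16}'; then
  echo 'blocked: AWS access key detected; rotate the credential'
fi
```

In the real proxy this check happens on the request body before it leaves your machine, and a match both blocks the request and emits the remediation report described above.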
Strip waste
Six filters strip structural noise from every request: stale file reads, thinking blocks, orphaned tool results, failed retries, duplicate system reminders, and oversized content blocks. The API only sees clean context. Real sessions show ~40% noise reduction (see example below).
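As a one-line sketch of what a filter like "duplicate system reminders" does (the actual implementation is internal; this is just the shape of the idea), deduplication keeps the first occurrence of a repeated block and drops later verbatim copies:

```shell
# Sketch of duplicate-reminder stripping: keep the first occurrence of each
# line, drop later verbatim repeats. awk '!seen[$0]++' prints a line only
# the first time it is seen.
printf 'reminder: follow the style guide\nnew content\nreminder: follow the style guide\n' \
  | awk '!seen[$0]++'
```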
Reduce cost
Tracks how much noise was removed before billing: a session at OPS 75% means 25% of what would have been sent was waste, caught and stripped before it cost you anything. You pay for clean context, not accumulated junk.
Drop-in replacement for your API endpoint.
# Claude Code
ANTHROPIC_BASE_URL=http://localhost:9120 claude

# Codex / OpenAI tools
codex -c 'openai_base_url="http://127.0.0.1:9120"'

# Any tool that supports base URL override
OPENAI_API_BASE=http://localhost:9120 aider
One environment variable. Works with Claude Code, Codex, OpenClaw, Cursor, Aider, Continue.dev, and any tool that lets you set an API base URL. Your API key is forwarded as-is at the HTTP layer — never parsed, never stored, never logged.
# start the proxy
neurorouter --port 9120

# see what would be filtered without sending
neurorouter --port 9120 --dry-run
A session sends the same file content 3 times, includes a 120KB thinking block, and repeats system instructions in every request. NeuroRouter strips it before the API call. Less wasted context. Lower cost. Same workflow.
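Rough token math for that scenario, with all numbers as illustrative assumptions (an 8k-token file read and the common ~4-characters-per-token heuristic are not measurements from the product):

```shell
# Back-of-envelope math for the scenario above. All numbers are illustrative.
file_read=8000                 # assumed size of one file read, in tokens
dupes=$(( 2 * file_read ))     # two redundant copies stripped, one kept
thinking=$(( 120000 / 4 ))     # 120KB thinking block at ~4 chars/token
echo "tokens stripped per request: $(( dupes + thinking ))"
```

Multiply that by every turn in a long session and the repeated system instructions on top, and the savings compound quickly.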
OPS shows how much unnecessary traffic was removed before billing. A session at OPS 75% means 25% of what would have been sent was noise — caught and removed. Track it in your status line or your logs.
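The OPS number reduces to simple arithmetic: tokens actually sent divided by tokens the session would have sent unfiltered. A sketch with illustrative values:

```shell
# OPS as arithmetic (values illustrative): OPS = sent / would_have_sent.
would_have_sent=100000   # tokens the session would have transmitted unfiltered
sent=75000               # tokens actually transmitted after stripping
ops=$(( sent * 100 / would_have_sent ))
noise=$(( 100 - ops ))
echo "OPS ${ops}% (${noise}% was noise)"
```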
ContextSpectre measures IPS — how clean your context stays over time. Without filtering, raw tool output (git diffs, test logs, build output) enters context and gets re-read on every turn. Typical unfiltered sessions score IPS 50-60.
With NeuroRouter Pro active, IPS stays at 90+. The proxy strips noise before it enters context, so the model only sees clean tool results. Less noise in, less re-read tax out.
OPS and IPS are complementary. OPS measures what NeuroRouter removes from each request (output purity). IPS measures what remains in context across turns (input purity). Together they show the full picture — waste prevented at the request layer and waste prevented from accumulating over the session.
IPS metric definition: contextspectre docs/concepts.md
Your API keys never leave your machine. NeuroRouter runs locally, uses environment variables or client passthrough auth, and never stores, transmits, or logs credentials. Keys exist only in process memory for the duration of the upstream request.
This is a structural difference from cloud LLM proxies. A 2026 study found 26 LLM proxy services collecting user credentials. The LiteLLM supply-chain breach (March 2026) compromised thousands of organizations. NeuroRouter eliminates this class of risk entirely — there is no server to breach and no database to leak.
lsof -i -P | grep neurorouter — only your upstream connection
go build ./cmd/neurorouter — build from source

Free
$0 — AGPL v3, self-hosted
Cleans AI requests before they hit the model. Removes wasted context, blocks secrets, and strips structural noise — locally, with zero setup. One protocol per instance, one session at a time.
Pro
$29 / month
Keep going no matter what. Session corrupted? Repaired before the 400. Credits burning? Mechanical work routed to cheaper models automatically. Cooldown incoming? Work rescued. Content filter kills your output? Retried and recovered. One daemon, all sessions, no lost work.
Team
$49 / seat / month
Enforce clean, safe AI usage across your team. Shared rules, consistent routing, and guardrails that prevent costly mistakes before they happen.
Enterprise
Custom pricing
Control AI usage at scale without losing speed. Org-wide policies, secure routing, and protection against data leaks, runaway cost, and workflow breakdowns.