codebase-memory-mcp

The fastest and most efficient code intelligence engine for AI coding agents. Full-indexes an average repository in milliseconds, the Linux kernel (28M LOC, 75K files) in 3 minutes. Answers structural queries in under 1ms. Ships as a single static binary for macOS, Linux, and Windows — download, run install, done.

High-quality parsing through tree-sitter AST analysis across all 64 languages, enhanced with LSP-style hybrid type resolution for Go, C, and C++ (more languages coming soon) — producing a persistent knowledge graph of functions, classes, call chains, HTTP routes, and cross-service links. 14 MCP tools. Zero dependencies. Plug and play across 10 coding agents.

Built-in 3D graph visualization (UI variant) — explore your knowledge graph at localhost:9749

Why codebase-memory-mcp

Extreme indexing speed — Linux kernel (28M LOC, 75K files) in 3 minutes. RAM-first pipeline: LZ4 compression, in-memory SQLite, fused Aho-Corasick pattern matching. Memory released after indexing.
Plug and play — single static binary for macOS (arm64/amd64), Linux (arm64/amd64), and Windows (amd64). No Docker, no runtime dependencies, no API keys. Download → install → restart agent → done.
64 languages — vendored tree-sitter grammars compiled into the binary. Nothing to install, nothing that breaks.
120x fewer tokens — 5 structural queries: ~3,400 tokens vs ~412,000 via file-by-file search. One graph query replaces dozens of grep/read cycles.
10 agents, one command — install auto-detects Claude Code, Codex CLI, Gemini CLI, Zed, OpenCode, Antigravity, Aider, KiloCode, VS Code, and OpenClaw — configures MCP entries, instruction files, and pre-tool hooks for each.
Built-in graph visualization — 3D interactive UI at localhost:9749 (optional UI binary variant).
14 MCP tools — search, trace, architecture, impact analysis, Cypher queries, dead code detection, cross-service HTTP linking, ADR management, and more.

Quick Start

Download the binary for your platform from the latest release:
- codebase-memory-mcp-<os>-<arch>.tar.gz — standard (MCP server only)
- codebase-memory-mcp-ui-<os>-<arch>.tar.gz — with embedded graph visualization

Extract and install:

tar xzf codebase-memory-mcp-*.tar.gz
mv codebase-memory-mcp ~/.local/bin/
codebase-memory-mcp install

Restart your coding agent. Say "Index this project" — done.

The install command auto-detects all installed coding agents and configures MCP server entries, instruction files, skills, and pre-tool hooks for each.

Graph Visualization UI

If you downloaded the ui variant:

codebase-memory-mcp --ui=true --port=9749

Open http://localhost:9749 in your browser. The UI runs as a background thread alongside the MCP server — it's available whenever your agent is connected.

Auto-Index

Enable automatic indexing on MCP session start:

codebase-memory-mcp config set auto_index true

When enabled, new projects are indexed automatically on first connection. Previously-indexed projects are registered with the background watcher for ongoing git-based change detection. Configurable file limit: config set auto_index_limit 50000.

Keeping Up to Date

codebase-memory-mcp update

The MCP server also checks for updates on startup and notifies on the first tool call if a newer release is available.

Uninstall

codebase-memory-mcp uninstall

Removes all agent configs, skills, hooks, and instructions. Does not remove the binary or SQLite databases.

Features

Architecture overview: get_architecture returns languages, packages, entry points, routes, hotspots, boundaries, layers, and clusters in a single call
Architecture Decision Records: manage_adr persists architectural decisions across sessions
Louvain community detection: Discovers functional modules by clustering call edges
Git diff impact mapping: detect_changes maps uncommitted changes to affected symbols with risk classification
Call graph: Resolves function calls across files and packages (import-aware, type-inferred)
Cross-service HTTP linking: Discovers REST routes and matches them to HTTP call sites with confidence scoring
Auto-sync: Background watcher detects file changes and re-indexes automatically
Cypher-like queries: MATCH (f:Function)-[:CALLS]->(g) WHERE f.name = 'main' RETURN g.name
Dead code detection: Finds functions with zero callers, excluding entry points
Route nodes: REST endpoints are first-class graph entities
CLI mode: codebase-memory-mcp cli search_graph '{"name_pattern": ".*Handler.*"}'
Single binary, zero infrastructure: SQLite-backed, persists to ~/.cache/codebase-memory-mcp/

How It Works

codebase-memory-mcp is a structural analysis backend — it builds and queries the knowledge graph. It does not include an LLM. Instead, it relies on your MCP client (Claude Code, or any MCP-compatible agent) to be the intelligence layer.

You: "what calls ProcessOrder?"

Agent calls: trace_call_path(function_name="ProcessOrder", direction="inbound")

codebase-memory-mcp: executes graph query, returns structured results

Agent: presents the call chain in plain English

Why no built-in LLM? Other code graph tools embed an LLM for natural language → graph query translation. This means extra API keys, extra cost, and another model to configure. With MCP, the agent you're already talking to is the query translator.

Performance

Benchmarked on Apple M3 Pro:

Operation	Time	Notes
Linux kernel full index	3 min	28M LOC, 75K files → 2.1M nodes, 4.9M edges
Linux kernel fast index	1m 12s	1.88M nodes
Django full index	~6s	49K nodes, 196K edges
Cypher query	<1ms	Relationship traversal
Name search (regex)	<10ms	SQL LIKE pre-filtering
Dead code detection	~150ms	Full graph scan with degree filtering
Trace call path (depth=5)	<10ms	BFS traversal

RAM-first pipeline: All indexing runs in memory (LZ4 HC compressed read, in-memory SQLite, single dump at end). Memory is released back to the OS after indexing completes.

Token efficiency: Five structural queries consumed ~3,400 tokens via codebase-memory-mcp versus ~412,000 tokens via file-by-file grep exploration — a 99.2% reduction.

Installation

Pre-built Binaries

Platform	Standard	With Graph UI
macOS (Apple Silicon)	`codebase-memory-mcp-darwin-arm64.tar.gz`	`codebase-memory-mcp-ui-darwin-arm64.tar.gz`
macOS (Intel)	`codebase-memory-mcp-darwin-amd64.tar.gz`	`codebase-memory-mcp-ui-darwin-amd64.tar.gz`
Linux (x86_64)	`codebase-memory-mcp-linux-amd64.tar.gz`	`codebase-memory-mcp-ui-linux-amd64.tar.gz`
Linux (ARM64)	`codebase-memory-mcp-linux-arm64.tar.gz`	`codebase-memory-mcp-ui-linux-arm64.tar.gz`
Windows (x86_64)	`codebase-memory-mcp-windows-amd64.zip`	`codebase-memory-mcp-ui-windows-amd64.zip`

Every release includes checksums.txt with SHA-256 hashes. All binaries are statically linked — no shared library dependencies.

Windows note: SmartScreen may show a warning for unsigned software. Click "More info" → "Run anyway". Verify integrity with checksums.txt.

Setup Scripts

Automated download + install

macOS / Linux:

curl -fsSL https://raw.githubusercontent.com/DeusData/codebase-memory-mcp/main/scripts/setup.sh | bash

Windows (PowerShell):

irm https://raw.githubusercontent.com/DeusData/codebase-memory-mcp/main/scripts/setup-windows.ps1 | iex

Install via Claude Code

You: "Install this MCP server: https://github.com/DeusData/codebase-memory-mcp"

Build from Source

Prerequisites: C compiler + zlib

Requirement	Check	Install
C compiler (gcc or clang)	`gcc --version` or `clang --version`	macOS: `xcode-select --install`, Linux: `apt install build-essential`
C++ compiler	`g++ --version` or `clang++ --version`	Same as above
zlib	—	macOS: included, Linux: `apt install zlib1g-dev`
Git	`git --version`	Pre-installed on most systems

git clone https://github.com/DeusData/codebase-memory-mcp.git
cd codebase-memory-mcp
scripts/build.sh                    # standard binary
scripts/build.sh --with-ui          # with graph visualization
# Binary at: build/c/codebase-memory-mcp

Manual MCP Configuration

If you prefer not to use the install command

Add to ~/.claude/.mcp.json (global) or project .mcp.json:

{
  "mcpServers": {
    "codebase-memory-mcp": {
      "command": "/path/to/codebase-memory-mcp",
      "args": []
    }
  }
}

Restart your agent. Verify with /mcp — you should see codebase-memory-mcp with 14 tools.

Multi-Agent Support

install auto-detects and configures all installed agents:

Agent	MCP Config	Instructions	Hooks
Claude Code	`.claude/.mcp.json`	4 Skills	PreToolUse (Grep/Glob/Read reminder)
Codex CLI	`.codex/config.toml`	`.codex/AGENTS.md`	—
Gemini CLI	`.gemini/settings.json`	`.gemini/GEMINI.md`	BeforeTool (grep/read reminder)
Zed	`settings.json` (JSONC)	—	—
OpenCode	`opencode.json`	`AGENTS.md`	—
Antigravity	`mcp_config.json`	`AGENTS.md`	—
Aider	—	`CONVENTIONS.md`	—
KiloCode	`mcp_settings.json`	`~/.kilocode/rules/`	—
VS Code	`Code/User/mcp.json`	—	—
OpenClaw	`openclaw.json`	—	—

Hooks are advisory (exit code 0) — they remind agents to prefer MCP graph tools when they reach for grep/glob/read, without blocking the tool call.

CLI Mode

Every MCP tool can be invoked from the command line:

codebase-memory-mcp cli index_repository '{"repo_path": "/path/to/repo"}'
codebase-memory-mcp cli search_graph '{"name_pattern": ".*Handler.*", "label": "Function"}'
codebase-memory-mcp cli trace_call_path '{"function_name": "Search", "direction": "both"}'
codebase-memory-mcp cli query_graph '{"query": "MATCH (f:Function) RETURN f.name LIMIT 5"}'
codebase-memory-mcp cli list_projects
codebase-memory-mcp cli --raw search_graph '{"label": "Function"}' | jq '.results[].name'

MCP Tools

Indexing

Tool	Description
`index_repository`	Index a repository into the graph. Auto-sync keeps it fresh after that.
`list_projects`	List all indexed projects with node/edge counts.
`delete_project`	Remove a project and all its graph data.
`index_status`	Check indexing status of a project.

Querying

Tool	Description
`search_graph`	Structured search by label, name pattern, file pattern, degree filters. Pagination via limit/offset.
`trace_call_path`	BFS traversal — who calls a function and what it calls. Depth 1-5.
`detect_changes`	Map git diff to affected symbols + blast radius with risk classification.
`query_graph`	Execute Cypher-like graph queries (read-only).
`get_graph_schema`	Node/edge counts, relationship patterns. Run this first.
`get_code_snippet`	Read source code for a function by qualified name.
`get_architecture`	Codebase overview: languages, packages, routes, hotspots, clusters, ADR.
`search_code`	Grep-like text search within indexed project files.
`manage_adr`	CRUD for Architecture Decision Records.
`ingest_traces`	Ingest runtime traces to validate HTTP_CALLS edges.

Graph Data Model

Node Labels

Project, Package, Folder, File, Module, Class, Function, Method, Interface, Enum, Type, Route

Edge Types

CONTAINS_PACKAGE, CONTAINS_FOLDER, CONTAINS_FILE, DEFINES, DEFINES_METHOD, IMPORTS, CALLS, HTTP_CALLS, ASYNC_CALLS, IMPLEMENTS, HANDLES, USAGE, CONFIGURES, WRITES, MEMBER_OF, TESTS, USES_TYPE, FILE_CHANGES_WITH

Qualified Names

get_code_snippet uses qualified names: <project>.<path_parts>.<name>. Use search_graph to discover them first.

Supported Cypher Subset

query_graph supports: MATCH with labels and relationship types, variable-length paths, WHERE with comparisons/regex/CONTAINS, RETURN with property access and COUNT/DISTINCT, ORDER BY, LIMIT. Not supported: WITH, COLLECT, OPTIONAL MATCH, mutations.

Ignoring Files

Layered: hardcoded patterns (.git, node_modules, etc.) → .gitignore hierarchy → .cbmignore (project-specific, gitignore syntax). Symlinks are always skipped.

Configuration

codebase-memory-mcp config list                          # show all settings
codebase-memory-mcp config set auto_index true           # auto-index on session start
codebase-memory-mcp config set auto_index_limit 50000    # max files for auto-index
codebase-memory-mcp config reset auto_index              # reset to default

Persistence

SQLite databases stored at ~/.cache/codebase-memory-mcp/. Persists across restarts (WAL mode, ACID-safe). To reset: rm -rf ~/.cache/codebase-memory-mcp/.

Troubleshooting

Problem	Fix
`/mcp` doesn't show the server	Check `.mcp.json` path is absolute. Restart agent. Test: `echo '{}' \| /path/to/binary` should output JSON.
`index_repository` fails	Pass absolute path: `index_repository(repo_path="/absolute/path")`
`trace_call_path` returns 0 results	Use `search_graph(name_pattern=".PartialName.")` first to find the exact name.
Queries return wrong project results	Add `project="name"` parameter. Use `list_projects` to see names.
Binary not found after install	Add to PATH: `export PATH="$HOME/.local/bin:$PATH"`
UI not loading	Ensure you downloaded the `ui` variant and ran `--ui=true`. Check `http://localhost:9749`.

Language Support

64 languages. Benchmarked against 64 real open-source repositories (78 to 49K nodes):

Tier	Score	Languages
Excellent (>= 90%)		Lua, Kotlin, C++, Perl, Objective-C, Groovy, C, Bash, Zig, Swift, CSS, YAML, TOML, HTML, SCSS, HCL, Dockerfile
Good (75-89%)		Python, TypeScript, TSX, Go, Rust, Java, R, Dart, JavaScript, Erlang, Elixir, Scala, Ruby, PHP, C#, SQL
Functional (< 75%)		OCaml, Haskell

Plus: Clojure, F#, Julia, Vim Script, Nix, Common Lisp, Elm, Fortran, CUDA, COBOL, Verilog, Emacs Lisp, MATLAB, Lean 4, FORM, Magma, Wolfram, JSON, XML, Markdown, Makefile, CMake, Protobuf, GraphQL, Vue, Svelte, Meson, GLSL, INI.

Architecture

src/
  main.c              Entry point (MCP stdio server + CLI + install/update/config)
  mcp/                MCP server (14 tools, JSON-RPC 2.0, session detection, auto-index)
  cli/                Install/uninstall/update/config (10 agents, hooks, instructions)
  store/              SQLite graph storage (nodes, edges, traversal, search, Louvain)
  pipeline/           Multi-pass indexing (structure → definitions → calls → HTTP links → config → tests)
  cypher/             Cypher query lexer, parser, planner, executor
  discover/           File discovery (.gitignore, .cbmignore, symlink handling)
  watcher/            Background auto-sync (git polling, adaptive intervals)
  traces/             Runtime trace ingestion
  ui/                 Embedded HTTP server + 3D graph visualization
  foundation/         Platform abstractions (threads, filesystem, logging, memory)
internal/cbm/         Vendored tree-sitter grammars (64 languages) + AST extraction engine

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 200 Commits
.github/workflows		.github/workflows
cmd/codebase-memory-mcp/assets		cmd/codebase-memory-mcp/assets
docs		docs
graph-ui		graph-ui
internal/cbm		internal/cbm
scripts		scripts
src		src
test-infrastructure		test-infrastructure
tests		tests
tools		tools
vendored		vendored
.clang-format		.clang-format
.clang-tidy		.clang-tidy
.cppcheck		.cppcheck
.gitattributes		.gitattributes
.gitignore		.gitignore
BENCHMARK.md		BENCHMARK.md
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
Makefile.cbm		Makefile.cbm
README.md		README.md
SECURITY.md		SECURITY.md
THIRD_PARTY.md		THIRD_PARTY.md
server.json		server.json

Folders and files

Latest commit

History

Repository files navigation

codebase-memory-mcp

Why codebase-memory-mcp

Quick Start

Graph Visualization UI

Auto-Index

Keeping Up to Date

Uninstall

Features

How It Works

Performance

Installation

Pre-built Binaries

Setup Scripts

Install via Claude Code

Build from Source

Manual MCP Configuration

Multi-Agent Support

CLI Mode

MCP Tools

Indexing

Querying

Graph Data Model

Node Labels

Edge Types

Qualified Names

Supported Cypher Subset

Ignoring Files

Configuration

Persistence

Troubleshooting

Language Support

Architecture

License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 27

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages