architecture

Architecture

High-performance code intelligence system in Rust. Indexes code, tracks relationships, serves via MCP.

How It Works

Parse fast - Tree-sitter AST parsing (same as GitHub code navigator) for Rust, Python, TypeScript, JavaScript, Java, Kotlin, Go, PHP, C, C++, C#, Swift, and GDScript
Extract real stuff - functions, traits, type relationships, call graphs
Embed - semantic vectors built from your doc comments
Index - Tantivy + memory-mapped symbol cache for <10ms lookups
Serve - MCP protocol for AI assistants, ~300ms response time (HTTP/HTTPS) and stdio built-in (0.16s)

In This Section

How It Works - Detailed system architecture
Memory Mapping - Cache and storage design
Embedding Model - Semantic search implementation
Language Support - Parser system and adding languages

Architecture Highlights

Parallel indexing pipeline: 5-stage architecture (DISCOVER → READ → PARSE → COLLECT → INDEX) with work-stealing queues. Phase 2 runs EmbeddingPool for parallel embedding generation.

Memory-mapped storage: Vector cache for semantic search:

segment_0.vec - 384-dimensional vectors, <1μs access after OS page cache warm-up

Embedding lifecycle management: Old embeddings deleted when files are re-indexed to prevent accumulation.

Lock-free concurrency: DashMap for concurrent reads, RwLock for Tantivy writes.

IndexFacade: Unified interface wrapping DocumentIndex, Pipeline, and SemanticSearch.

Language-aware semantic search: Embeddings track source language, enabling filtering before similarity computation. No score redistribution - identical docs produce identical scores regardless of filtering.

Hot reload: File watcher with 500ms debounce triggers re-indexing of changed files only.

Performance

Parser benchmarks on a 750-symbol test file:

Language	Parsing Speed	vs. Target (10k/s)	Status
Rust	91,318 symbols/sec	9.1x faster ✓	Production
Python	75,047 symbols/sec	7.5x faster ✓	Production
TypeScript	82,156 symbols/sec	8.2x faster ✓	Production
PHP	68,432 symbols/sec	6.8x faster ✓	Production
Go	74,655 symbols/second	7.5x faster ✓	Production

Run performance benchmarks:

codanna benchmark all          # Test all parsers
codanna benchmark python       # Test specific language

Next Steps

Learn about User Guide for usage
Explore Advanced features
Read Contributing to add features

Back to Documentation

Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
embedding-model.md		embedding-model.md
how-it-works.md		how-it-works.md
language-support.md		language-support.md
memory-mapping.md		memory-mapping.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Architecture

How It Works

In This Section

Architecture Highlights

Performance

Next Steps

FilesExpand file tree

architecture

Directory actions

More options

Directory actions

More options

Latest commit

History

architecture

Folders and files

parent directory

README.md

Architecture

How It Works

In This Section

Architecture Highlights

Performance

Next Steps