rag-engine

Lightweight hybrid RAG engine combining vector search, BM25, and knowledge graphs. Built for multilingual document retrieval with GDPR compliance.

Architecture

graph TB
    Client[Client / API Consumer]
    API[FastAPI REST API]
    Ingest[Document Ingestion Pipeline]
    Chunk[Smart Chunker]
    Embed[Multilingual Embedder]
    Qdrant[(Qdrant Vector DB)]
    BM25[(BM25 Index)]
    KG[(Knowledge Graph)]
    Hybrid[Hybrid Retriever]
    Rerank[Re-Ranker]
    GDPR[GDPR Compliance Layer]

    Client --> API
    API --> Ingest
    Ingest --> Chunk
    Chunk --> Embed
    Embed --> Qdrant
    Chunk --> BM25
    Chunk --> KG
    API --> Hybrid
    Hybrid --> Qdrant
    Hybrid --> BM25
    Hybrid --> KG
    Hybrid --> Rerank
    Rerank --> API
    GDPR --> Qdrant
    GDPR --> BM25
    GDPR --> KG
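
The query path in the diagram (the hybrid retriever merging Qdrant, BM25, and knowledge graph hits before re-ranking) can be sketched as a weighted score fusion. This is a minimal illustration, not the repository's implementation: the function names, the min-max normalization step, and the example weights are all assumptions.

```python
from collections import defaultdict


def min_max_normalize(scores: dict[str, float]) -> dict[str, float]:
    """Scale one retriever's scores into [0, 1] so sources are comparable."""
    if not scores:
        return {}
    low, high = min(scores.values()), max(scores.values())
    if high == low:
        # All candidates tie; treat each one as a full-strength hit.
        return {chunk_id: 1.0 for chunk_id in scores}
    return {cid: (score - low) / (high - low) for cid, score in scores.items()}


def fuse(
    results_by_source: dict[str, dict[str, float]],
    weights: dict[str, float],
) -> list[tuple[str, float]]:
    """Combine per-source scores into one ranking via a weighted sum."""
    fused: dict[str, float] = defaultdict(float)
    for source, scores in results_by_source.items():
        weight = weights.get(source, 0.0)  # unknown sources contribute nothing
        for chunk_id, score in min_max_normalize(scores).items():
            fused[chunk_id] += weight * score
    return sorted(fused.items(), key=lambda item: item[1], reverse=True)


# Hypothetical raw scores from the three retrievers (chunk id -> score).
ranked = fuse(
    {
        "vector": {"c1": 0.9, "c2": 0.5},
        "bm25": {"c2": 12.0, "c3": 4.0},
        "graph": {"c1": 2.0},
    },
    weights={"vector": 0.5, "bm25": 0.3, "graph": 0.2},
)
```

Normalizing per source before summing keeps unbounded BM25 scores from dominating cosine similarities; other fusion schemes, such as reciprocal rank fusion, would serve the same purpose.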

Key Features

Hybrid Retrieval — Vector (Qdrant) + BM25 + Knowledge Graph with weighted re-ranking
Multilingual — Italian, English, Russian out of the box via intfloat/multilingual-e5-large
Smart Chunking — Adaptive strategies: fixed, semantic (paragraph-aware), document-aware (headings/articles)
GDPR Compliant — Per-tenant data isolation, right to erasure with cascading deletes, audit logging
REST API — FastAPI with OpenAPI documentation at /docs
Quality Metrics — RAGAS-style evaluation: precision, recall, MRR, nDCG, F1
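
For readers unfamiliar with the quality metrics listed above, the following sketch shows the standard binary-relevance definitions. The function names and signatures are illustrative assumptions and may not match the evaluation module in this repository.

```python
import math


def precision_at_k(retrieved: list[str], relevant: set[str], k: int) -> float:
    """Fraction of the top-k results that are relevant."""
    if k <= 0:
        return 0.0
    return sum(1 for doc in retrieved[:k] if doc in relevant) / k


def recall_at_k(retrieved: list[str], relevant: set[str], k: int) -> float:
    """Fraction of all relevant items that appear in the top-k results."""
    if not relevant:
        return 0.0
    return sum(1 for doc in retrieved[:k] if doc in relevant) / len(relevant)


def f1(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


def reciprocal_rank(retrieved: list[str], relevant: set[str]) -> float:
    """1 / rank of the first relevant result, or 0 if none is found."""
    for rank, doc in enumerate(retrieved, start=1):
        if doc in relevant:
            return 1.0 / rank
    return 0.0


def mean_reciprocal_rank(queries: list[tuple[list[str], set[str]]]) -> float:
    """MRR: reciprocal rank averaged over (retrieved, relevant) pairs."""
    if not queries:
        return 0.0
    return sum(reciprocal_rank(r, rel) for r, rel in queries) / len(queries)


def ndcg_at_k(retrieved: list[str], relevant: set[str], k: int) -> float:
    """nDCG with binary gains: DCG of the ranking over DCG of an ideal one."""
    dcg = sum(
        1.0 / math.log2(rank + 1)
        for rank, doc in enumerate(retrieved[:k], start=1)
        if doc in relevant
    )
    ideal_hits = min(len(relevant), k)
    idcg = sum(1.0 / math.log2(rank + 1) for rank in range(1, ideal_hits + 1))
    return dcg / idcg if idcg else 0.0


retrieved = ["a", "b", "c", "d"]
relevant = {"b", "d"}
p = precision_at_k(retrieved, relevant, 4)   # 0.5
r = recall_at_k(retrieved, relevant, 4)      # 1.0
rr = reciprocal_rank(retrieved, relevant)    # 0.5
```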

Quick Start

git clone https://github.com/ForwardCodeSolutions/rag-engine.git
cd rag-engine
cp .env.example .env
# Edit .env with your API keys
docker compose up -d

The API listens on http://localhost:8000; interactive OpenAPI documentation is available at http://localhost:8000/docs.

Local Development

uv sync       # Install dependencies
make check    # Lint + tests
make dev      # Start with hot reload

Test Suite

244 tests across unit, property-based, and integration suites:

tests/
  unit/           — 198 unit + 9 property-based tests (models, ingestion,
                     BM25, knowledge graph, hybrid retriever, Qdrant,
                     embedding, GDPR, auth, document endpoints, evaluation,
                     parsers, hypothesis-driven invariants)
  integration/    — 37 tests (full retrieval pipeline, GDPR cascade,
                     API endpoints, quality metrics)

Authentication

All endpoints (except /health) require an X-API-Key header. Set the API_KEY environment variable in .env:

API_KEY=your-secret-key-here
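
The check described above can be sketched in a framework-independent way. This is an assumption about how such a check might look, not the project's actual middleware: the `is_authorized` helper and the `PUBLIC_PATHS` set are hypothetical names.

```python
import hmac

# Assumption: only the health endpoint is exempt from authentication.
PUBLIC_PATHS = {"/api/v1/health"}


def is_authorized(path: str, headers: dict[str, str], expected_key: str) -> bool:
    """Return True if the request may proceed.

    Header names are matched case-insensitively, and the key comparison
    uses hmac.compare_digest so it runs in constant time.
    """
    if path in PUBLIC_PATHS:
        return True
    if not expected_key:
        # Refuse everything if the server has no API_KEY configured.
        return False
    normalized = {name.lower(): value for name, value in headers.items()}
    supplied = normalized.get("x-api-key", "")
    return hmac.compare_digest(supplied.encode(), expected_key.encode())
```

Rejecting all requests when no key is configured is a deliberate choice in this sketch: an empty `API_KEY` should not silently disable authentication.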

API Endpoints

Health (no auth required)

curl http://localhost:8000/api/v1/health

{"status": "healthy", "qdrant_connected": true, "version": "0.1.0"}

Upload Document

curl -X POST http://localhost:8000/api/v1/documents/upload \
  -H "X-API-Key: your-secret-key-here" \
  -F "tenant_id=tenant-1" \
  -F "document_type=general" \
  -F "file=@document.pdf"

{
  "id": "a1b2c3d4-...",
  "filename": "document.pdf",
  "tenant_id": "tenant-1",
  "language": "en",
  "chunk_count": 12
}

Search Documents

curl -X POST http://localhost:8000/api/v1/documents/search \
  -H "X-API-Key: your-secret-key-here" \
  -H "Content-Type: application/json" \
  -d '{"query": "machine learning", "tenant_id": "tenant-1", "top_k": 5}'

{
  "query": "machine learning",
  "results": [
    {"document_id": "a1b2c3d4-...", "chunk_index": 3, "text": "...", "score": 0.85}
  ],
  "total_results": 1
}
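
The same search call can be made from Python using only the standard library. The sketch below mirrors the curl example; the helper names `build_search_request` and `search` are illustrative and not part of the project.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8000/api/v1"


def build_search_request(
    query: str, tenant_id: str, api_key: str, top_k: int = 5
) -> urllib.request.Request:
    """Build the POST request for /documents/search without sending it."""
    body = json.dumps(
        {"query": query, "tenant_id": tenant_id, "top_k": top_k}
    ).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/documents/search",
        data=body,
        headers={"X-API-Key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


def search(query: str, tenant_id: str, api_key: str, top_k: int = 5) -> dict:
    """Send the search request and return the decoded JSON response."""
    request = build_search_request(query, tenant_id, api_key, top_k)
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.load(response)
```

With the service running, `search("machine learning", "tenant-1", "your-secret-key-here")` should return a dictionary shaped like the JSON response shown above.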

Delete Document (GDPR)

curl -X DELETE "http://localhost:8000/api/v1/documents/doc-123?tenant_id=tenant-1&reason=user+request" \
  -H "X-API-Key: your-secret-key-here"

{
  "document_id": "doc-123",
  "tenant_id": "tenant-1",
  "bm25_chunks_removed": 3,
  "graph_chunks_removed": 3,
  "message": "Document doc-123 deleted successfully"
}

Delete Tenant Data (GDPR Right to Erasure)

curl -X DELETE "http://localhost:8000/api/v1/tenants/tenant-1/data?reason=GDPR+erasure+request" \
  -H "X-API-Key: your-secret-key-here"

{
  "tenant_id": "tenant-1",
  "documents_removed": 0,
  "message": "All data for tenant tenant-1 deleted successfully"
}
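
Conceptually, tenant erasure cascades across every store and leaves an audit record. The following in-memory sketch illustrates that idea only; `InMemoryStore`, `erase_tenant`, and the audit entry fields are hypothetical and do not reflect the actual GDPR compliance service.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass
class InMemoryStore:
    """Stand-in for one backend (vector DB, BM25 index, or graph)."""

    name: str
    # Maps (tenant_id, document_id) to that document's chunks.
    chunks: dict[tuple[str, str], list[str]] = field(default_factory=dict)

    def delete_tenant(self, tenant_id: str) -> int:
        """Remove every chunk owned by the tenant; return how many were removed."""
        keys = [key for key in self.chunks if key[0] == tenant_id]
        return sum(len(self.chunks.pop(key)) for key in keys)


def erase_tenant(
    stores: list[InMemoryStore], tenant_id: str, reason: str, audit_log: list[dict]
) -> dict[str, int]:
    """Delete the tenant's data from every store and record an audit entry."""
    removed = {store.name: store.delete_tenant(tenant_id) for store in stores}
    audit_log.append(
        {
            "event": "tenant_erasure",
            "tenant_id": tenant_id,
            "reason": reason,
            "chunks_removed": removed,
            "timestamp": datetime.now(timezone.utc).isoformat(),
        }
    )
    return removed


stores = [InMemoryStore("vector"), InMemoryStore("bm25"), InMemoryStore("graph")]
for store in stores:
    store.chunks[("tenant-1", "doc-123")] = ["chunk a", "chunk b", "chunk c"]
    store.chunks[("tenant-2", "doc-456")] = ["chunk x"]

audit_log: list[dict] = []
removed = erase_tenant(stores, "tenant-1", "GDPR erasure request", audit_log)
```

Keying every chunk by tenant is what makes the cascade a simple filter; data belonging to `tenant-2` is untouched by the call.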

Project Structure

src/rag_engine/
  api/              FastAPI routes (upload, search, health, GDPR)
  core/             Hybrid retriever, re-ranker, evaluation metrics
  ingestion/        Document parsers (PDF, DOCX, TXT), chunkers, language detection
  models/           Pydantic models (document, search, config, health, GDPR)
  services/         Embedding service, GDPR compliance service
  storage/          BM25 index, Knowledge Graph (NetworkX), Qdrant vector store
  utils/            Structured logging, audit trail

Design Decisions

See docs/decisions/ for Architecture Decision Records:

ADR	Decision
ADR-001	Hybrid retrieval (Vector + BM25 + Graph)
ADR-002	Qdrant as vector database
ADR-003	Adaptive chunking strategy
ADR-004	Multilingual support approach
ADR-005	GDPR compliance by design

How to Extend

Add document parsers — implement BaseParser in src/rag_engine/ingestion/parsers/
Add chunking strategies — extend BaseChunker in src/rag_engine/ingestion/chunker.py
Add languages — BM25 indexes are automatically created per language; embedding model handles 100+ languages
Add retrieval methods — pass results to HybridRetriever.search() via vector_results parameter
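
As an illustration of the first extension point, a new parser might look like the sketch below. The `BaseParser` defined here is a hypothetical stand-in written for this example: the real base class in src/rag_engine/ingestion/parsers/ may expose different method names and signatures, so check it before adapting this code.

```python
import re
from abc import ABC, abstractmethod
from pathlib import Path


class BaseParser(ABC):
    """Hypothetical stand-in for the project's parser base class."""

    extensions: tuple[str, ...] = ()

    @abstractmethod
    def parse(self, path: Path) -> str:
        """Return the plain text extracted from the file at `path`."""


class MarkdownParser(BaseParser):
    """Example parser that reduces Markdown to plain text for chunking."""

    extensions = (".md", ".markdown")

    def parse(self, path: Path) -> str:
        return self.strip_markup(path.read_text(encoding="utf-8"))

    @staticmethod
    def strip_markup(text: str) -> str:
        # Drop leading heading markers such as "## ".
        text = re.sub(r"^#{1,6}[ \t]*", "", text, flags=re.MULTILINE)
        # Replace inline links "[label](url)" with just the label.
        text = re.sub(r"\[([^\]]+)\]\([^)]+\)", r"\1", text)
        # Remove asterisk emphasis and backtick code markers.
        text = re.sub(r"[*`]+", "", text)
        return text.strip()
```

Underscores are left alone on purpose so identifiers like `tenant_id` survive parsing.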

Tech Stack

Python 3.11+, FastAPI, Pydantic v2
Qdrant (vector search), rank-bm25 (BM25Plus), NetworkX (knowledge graph)
sentence-transformers (multilingual embeddings)
structlog (structured logging), langdetect (language detection)
uv (package manager), ruff (linter/formatter), pytest

License

MIT
