Kagura Memory Cloud

Adaptive AI Memory Platform — Self-hosted, open source.
Give your AI assistants persistent memory that gets smarter as you use it.


Works with Claude, ChatGPT, Gemini, and any MCP-compatible client.
Python SDK (KaguraClient / KaguraAgent)

Why Kagura Memory Cloud?

Your AI forgets everything after each conversation. Kagura fixes that — and gets smarter every time you search.

Most AI memory tools are just vector databases with a chat wrapper. Kagura is different:

| Feature | Description |
| --- | --- |
| Adaptive Memory | Every search automatically strengthens connections between related memories. The more you use it, the better explore() discovers hidden relationships. |
| Hybrid Search | Semantic (OpenAI/Ollama) + BM25 keyword — 96% top-1 accuracy |
| AI Reranking | Ollama (local, free), Voyage AI, or Cohere — cross-encoder reranking for precision |
| Neural Memory Graph | Hebbian learning builds a knowledge graph in the background. explore() traverses it for serendipitous discovery. |
| 17 MCP Tools | remember, recall, forget, reference, explore, merge_contexts, and 11 more |
| Multi-Provider | OpenAI or Ollama (local, private, zero cost) for embeddings |
| Team Ready | Workspaces, RBAC, context isolation, shared memory |
| Web UI | Next.js dashboard — contexts, search settings, member management |
| 5-Minute Setup | ./setup.sh and you're done |

Architecture

Workspace (team/org)
├── Context A ("my-project")     ← like a folder
│   ├── Memory 1                 ← 3-layer: summary / context / content
│   ├── Memory 2
│   └── Neural edges (Hebbian)   ← automatic connections
├── Context B ("learning-notes")
│   └── ...
└── Members (Owner/Admin/Member/Viewer)

Adaptive Memory: Two Search Paths

Kagura separates precision search and discovery into two independent paths, each optimized for its purpose:

recall()  ──→ Hybrid Search (semantic + BM25) ──→ [Reranker] ──→ Precise results
                      │
                      └──→ Hebbian Learning (background) ──→ Graph edges grow
                                                                │
explore() ──→ Graph Traversal (Neural Memory) ←─────────────────┘  Related discoveries
  • recall() — Precision search. Hybrid (semantic 60% + BM25 40%) with optional AI reranking. Returns the most relevant memories.
  • explore() — Discovery. Traverses the Neural Memory graph to find related memories that keyword search would miss.
  • Hebbian learning — Every recall() silently strengthens edges between co-retrieved memories. No explicit training needed — the graph grows organically as you use the system.

This separation is intentional: mixing graph signals into recall degrades precision (validated via benchmarks). Instead, each path does what it's best at.
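The two paths can be sketched as a toy model. Only the 60/40 split comes from this README; the normalization, the additive update rule, and the learning rate are illustrative assumptions, not the backend's actual implementation.

```python
# Toy sketch of the two paths. Only the 0.6/0.4 weights are documented;
# the Hebbian rule and learning rate here are assumptions.

def hybrid_score(semantic: float, bm25: float,
                 w_semantic: float = 0.6, w_bm25: float = 0.4) -> float:
    """Fuse normalized semantic and BM25 scores (the recall() path)."""
    return w_semantic * semantic + w_bm25 * bm25

def hebbian_update(edges: dict[tuple[str, str], float],
                   co_retrieved: list[str], lr: float = 0.1) -> None:
    """Strengthen edges between memories retrieved together (background path)."""
    for i, a in enumerate(co_retrieved):
        for b in co_retrieved[i + 1:]:
            key = tuple(sorted((a, b)))
            edges[key] = edges.get(key, 0.0) + lr

edges: dict[tuple[str, str], float] = {}
hebbian_update(edges, ["mem-1", "mem-2", "mem-3"])  # one recall() call
hebbian_update(edges, ["mem-1", "mem-2"])           # a later recall() call
print(round(hybrid_score(0.9, 0.5), 2))  # 0.74
print(edges[("mem-1", "mem-2")])         # 0.2 (strengthened twice)
```

After a few recalls, explore() would traverse the heaviest edges first, surfacing mem-2 from mem-1 even if their text shares no keywords.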

Data isolation: All data is filtered by workspace_id → context_id → user_id. Memories never leak across boundaries. Single Qdrant collection with payload filtering.

Tech stack: FastAPI (async) · PostgreSQL · Qdrant · Redis · Next.js 16 · OAuth2 · MCP over Streamable HTTP

Quick Start

System Requirements

| | Minimum | Recommended |
| --- | --- | --- |
| CPU | 2 cores | 4+ cores |
| RAM | 4 GB | 8+ GB |
| Disk | 10 GB free | 20+ GB free |

Prerequisites

  • Docker & Docker Compose
  • Python 3.11+
  • Node.js 20+
  • OpenAI API key (for embeddings) — or Ollama for local embeddings
  • OAuth2 credentials (optional — password + MFA login available without OAuth)

Setup

One-line setup:

git clone https://github.com/kagura-ai/memory-cloud.git
cd memory-cloud
./setup.sh

With Claude Code:

git clone https://github.com/kagura-ai/memory-cloud.git
cd memory-cloud
claude   # then run /setup

Step-by-step setup:

# 1. Clone
git clone https://github.com/kagura-ai/memory-cloud.git
cd memory-cloud

# 2. Configure environment (generates secrets, prompts for API keys)
(cd backend && python3 -m src.cli.setup_env)

# 3. Start all services
docker compose up -d

# 4. Run migrations
(cd backend && alembic upgrade head)

# 5. Create admin account (interactive — sets password, MFA, API key, embedding provider)
(cd backend && python3 -m src.cli.create_admin)

# Backend API:  http://localhost:8080
# Frontend UI:  http://localhost:3000
# API docs:     http://localhost:8080/redoc

.env.local settings (auto-configured by setup_env):

| Setting | Required | Description |
| --- | --- | --- |
| API_KEY_SECRET | Yes | Secret for API key encryption (auto-generated) |
| JWT_SECRET | Yes | Secret for JWT tokens (auto-generated) |
| OPENAI_API_KEY | Yes* | OpenAI API key for embeddings |
| OLLAMA_BASE_URL | No | Ollama URL (default: http://localhost:11434) |
| EMBEDDING_PROVIDER | No | openai (default) or ollama |
| GOOGLE_CLIENT_ID/SECRET | No | Google OAuth2 login (optional — password login available) |
| GITHUB_CLIENT_ID/SECRET | No | GitHub OAuth2 login (optional) |

* Either OPENAI_API_KEY or a running Ollama instance is required for memory features.
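That fallback rule can be expressed in a few lines. This is a sketch of the documented behavior (openai is the default and needs a key; ollama does not); the function name is illustrative, not the backend's actual API.

```python
# Sketch of the documented provider rule; resolve_embedding_provider is a
# hypothetical name, not part of the Kagura codebase.

def resolve_embedding_provider(env: dict[str, str]) -> str:
    provider = env.get("EMBEDDING_PROVIDER", "openai")
    if provider == "openai" and not env.get("OPENAI_API_KEY"):
        raise RuntimeError(
            "Set OPENAI_API_KEY or use EMBEDDING_PROVIDER=ollama for local embeddings"
        )
    return provider

print(resolve_embedding_provider({"OPENAI_API_KEY": "sk-..."}))      # openai
print(resolve_embedding_provider({"EMBEDDING_PROVIDER": "ollama"}))  # ollama
```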

Admin CLI

| Command | Purpose |
| --- | --- |
| python3 -m src.cli.setup_env | Generate secrets + configure .env.local (run before Docker) |
| python3 -m src.cli.create_admin | Create admin + workspace + API key + .mcp.json + embedding setup |
| python3 -m src.cli.reset_password | Reset password and/or MFA |
| python3 -m src.cli.delete_admin | Delete admin (for re-creation) |

Run these from the backend/ directory; the Docker API container must be running.

Platform-specific notes
  • WSL (Windows): Install Docker Desktop for Windows and enable WSL integration
  • macOS: Install Docker Desktop for Mac. brew install python@3.11 node
  • Linux (Ubuntu/Debian): sudo apt install docker.io docker-compose-v2 python3.11 nodejs npm
  • GCP (Production): Set production values in .env.local (DATABASE_URL, QDRANT_URL, ENVIRONMENT=production, CORS_ORIGINS)
  • Frontend env vars: Copy frontend/.env.example to frontend/.env.local and set:
    • NEXT_PUBLIC_API_URL — backend URL (default: http://localhost:8080)
    • NEXT_PUBLIC_APP_URL — frontend URL for metadata
    • NEXT_PUBLIC_PLAN_FREE_DISPLAY_NAME / BASIC / PRO — plan display name customization (default: S/M/L)

MCP Client Setup

Claude Code (Recommended)

Claude Code + Kagura Memory Cloud gives your AI assistant persistent, searchable, team-shareable memory that works across sessions, machines, and projects.

Why not just use Claude Code's built-in memory?

| | Claude Code Memory | Kagura Memory Cloud |
| --- | --- | --- |
| Storage | Local files (~/.claude/) | Cloud (PostgreSQL + Qdrant) |
| Search | File name only | Hybrid Search (semantic + full-text) |
| Sharing | Single machine | Team workspace with RBAC |
| Structure | Flat markdown | 3-layer architecture + Neural Memory graph |
| Cross-project | Per-project only | Any project via MCP |

Setup (3 steps):

  1. Start services and open http://localhost:3000/workspace/integrations/api-keys to create an API key
  2. Copy .mcp.json.example to .mcp.json and fill in your workspace ID and API key:
cp .mcp.json.example .mcp.json
# Edit .mcp.json — set workspace_id (from URL bar) and API key
  3. Restart Claude Code and verify:
You: "List my memory contexts"
→ AI calls list_contexts()

You: "Remember: our API uses JWT with 1h expiry and refresh token rotation"
→ AI calls remember() — stored permanently

You: "What do we know about auth?"
→ AI calls recall() — finds it instantly, even months later

.mcp.json is in .gitignore — never commit it (contains API keys). Place in project root for per-project config, or ~/.claude/.mcp.json for global access.
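For reference, a minimal .mcp.json looks roughly like the sketch below. The URL and Authorization formats are the ones used elsewhere in this README; the exact key names (including "type": "http") are assumptions, so defer to .mcp.json.example in the repo.

```json
{
  "mcpServers": {
    "kagura-memory": {
      "type": "http",
      "url": "http://localhost:8080/mcp/w/{workspace_id}",
      "headers": {
        "Authorization": "Bearer kagura_{your_api_key}"
      }
    }
  }
}
```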

Auto-sync Claude Code memory to Kagura (optional)

Kagura Memory Cloud includes a hook that automatically syncs Claude Code's local memory files to the cloud whenever they're updated. This means your local ~/.claude/ memories are also searchable via Hybrid Search and shared with your team.

Add these environment variables to .env.local:

KAGURA_MCP_URL=http://localhost:8080/mcp/w/{workspace_id}
KAGURA_MCP_TOKEN=kagura_{your_api_key}
KAGURA_CONTEXT_ID={context_id}  # optional — auto-detected from project name if omitted

The sync hook is pre-configured in .claude/settings.json and runs automatically on every memory file write.

Claude Code integration templates (copy to your project)

This repo's .claude/ directory is a ready-to-use template for integrating Kagura Memory Cloud into any project. Copy what you need:

Slash commands — drop into your project's .claude/commands/:

/recall <query>   → Search past knowledge via MCP
/remember <text>  → Save decisions, patterns, learnings via MCP
/guide            → Show setup and usage guide

Hooks (.claude/settings.json) — auto-configured safety guards:

Auto-format    → ruff (Python) / prettier (TypeScript) on every save
Secret detect  → Blocks hardcoded API keys or passwords
SQL injection  → Blocks f-string SQL (enforces parameterized queries)
Memory sync    → Auto-syncs Claude Code memory files to Kagura Cloud

Agents (.claude/agents/) — specialized sub-agents:

code-reviewer  → Read-only code review against project standards
test-runner    → Run tests, diagnose failures, auto-fix

Rules (.claude/rules/) — auto-loaded project conventions:

backend.md     → FastAPI/Python patterns, async, testing
frontend.md    → Next.js/TypeScript, Tailwind, SWR
security.md    → Auth, RBAC, SQL safety, CORS

All of these are designed to work together with Kagura Memory Cloud MCP tools. Adapt them for your own project by editing the prompts and context IDs.

Claude Code Plugin (use Kagura in any project)

The kagura-memory plugin adds session management and memory workflow skills to Claude Code. Install it once and use it across all your projects.

Install:

# From marketplace
/plugin marketplace add kagura-ai/memory-cloud
/plugin install kagura-memory@kagura-memory-cloud

Available skills:

| Skill | Description |
| --- | --- |
| /kagura-memory:session-start | Restore previous session context on start |
| /kagura-memory:session-summary | Save session knowledge before ending |
| /kagura-memory:recall | Search past knowledge |
| /kagura-memory:remember | Save new knowledge |
| /kagura-memory:guide | Usage guide, connection status, and setup help |
| /kagura-memory:smoke-test | Verify all MCP tools work |

Recommended workflow:

/kagura-memory:session-start       # ← Start here: restore context from last session
  ... work normally ...
/kagura-memory:recall              # Search past decisions, patterns, fixes
/kagura-memory:remember            # Save important learnings as you go
  ... finish work ...
/kagura-memory:session-summary     # ← End here: save session knowledge for next time

Skills wrap the raw MCP tools (recall, remember, etc.) with workflow logic — context selection, git state analysis, and structured prompts. Use skills for session management and guided workflows; use MCP tools directly for fine-grained operations.

Run /kagura-memory:guide for setup help and an optional SessionStart hook you can add to your project.

Prerequisite: MCP connection must be configured (.mcp.json with API key). Run /kagura-memory:guide in your project to set it up.

Claude Desktop / Claude Chat (Web)

Claude Desktop: Same .mcp.json format as Claude Code — place in your project root or ~/.claude/.mcp.json.

Claude Chat (claude.ai): Add as a remote MCP server in Settings > Integrations:

  1. Click "Add Integration" → "Custom MCP Server"
  2. Enter the MCP endpoint URL: https://your-domain.com/mcp/w/{workspace_id}
  3. Add the Authorization: Bearer kagura_{your_api_key} header

Claude Chat requires a publicly accessible URL (not localhost). Use a production deployment or tunnel (e.g., ngrok, Cloudflare Tunnel).

ChatGPT Desktop

ChatGPT desktop app supports MCP servers. Add via Settings > MCP Servers:

  1. Server URL: https://your-domain.com/mcp/w/{workspace_id}
  2. Authentication: Bearer token kagura_{your_api_key}

Like Claude Chat, ChatGPT requires a public URL. For local development, use a tunnel or the REST API directly.

Gemini CLI

Add to .gemini/settings.json (project root or ~/.gemini/settings.json):

{
  "mcpServers": {
    "kagura-memory": {
      "url": "http://localhost:8080/mcp/w/{workspace_id}",
      "headers": {
        "Authorization": "Bearer kagura_{your_api_key}"
      }
    }
  }
}

Other MCP Clients / REST API

Any MCP-compatible client can connect via Streamable HTTP. For clients without MCP support, use the REST API directly:

# Search memories
curl -X POST -H "Authorization: Bearer kagura_{your_key}" \
  -H "Content-Type: application/json" \
  -d '{"query": "your search", "context_id": "..."}' \
  http://localhost:8080/api/v1/memories/search
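The same call can be made from Python with nothing but the standard library. The endpoint and header formats come from the curl example above; base_url and context_id are yours to fill in, and build_search_request is an illustrative helper, not part of the SDK.

```python
import json
import urllib.request

# Standard-library equivalent of the curl example; build_search_request is
# a hypothetical helper name.

def build_search_request(base_url: str, api_key: str, query: str,
                         context_id: str) -> urllib.request.Request:
    return urllib.request.Request(
        f"{base_url}/api/v1/memories/search",
        data=json.dumps({"query": query, "context_id": context_id}).encode(),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

# With the backend running:
#   with urllib.request.urlopen(build_search_request(
#           "http://localhost:8080", "kagura_...", "your search", "ctx-id")) as resp:
#       results = json.load(resp)
```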

MCP Tools

| Tool | Description | Required Role |
| --- | --- | --- |
| remember | Store a new memory (summary + content + type) | Member+ |
| recall | Search memories with Hybrid Search | Viewer+ |
| reference | Get full 3-layer details of a memory | Viewer+ |
| update_memory | Update an existing memory in-place or upsert by external ID | Member+ |
| forget | Soft-delete a memory (30-day retention) | Member+ |
| explore | Discover related memories via Neural Memory graph | Viewer+ |
| list_edges | List edges connected to a memory | Viewer+ |
| create_edge | Create an edge between two memories | Member+ |
| update_edge | Update edge weight or type | Member+ |
| delete_edge | Delete an edge between two memories | Member+ |
| get_context_info | Get context metadata and guidelines | Viewer+ |
| list_contexts | List available contexts in workspace | Viewer+ |
| create_context | Create a new context | Owner/Admin |
| update_context | Update context settings (summary, usage guide, resource_id, is_public) | Editor+ |
| delete_context | Delete a context and all its memories | Owner/Admin |
| merge_contexts | Merge memories from source context into target context | Owner/Admin |
| update_search_config | Tune hybrid search weights and reranker settings per context | Editor+ |

Workspace roles: Owner > Admin > Member > Viewer (read-only). Context roles: Owner > Editor > Viewer. Private contexts are visible only to the creator. Members may be restricted to specific contexts via allowlist.
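A "Role+" requirement is just a threshold on an ordered ladder. The role names below come from this README; the comparison logic is an illustrative assumption, not the backend's RBAC code.

```python
# Sketch of the two role ladders; ordering logic is an assumption.
WORKSPACE_ROLES = ["Viewer", "Member", "Admin", "Owner"]  # low -> high
CONTEXT_ROLES = ["Viewer", "Editor", "Owner"]

def has_at_least(role: str, required: str,
                 ladder: list[str] = WORKSPACE_ROLES) -> bool:
    """True if `role` meets a 'required+' threshold, e.g. Member+."""
    return ladder.index(role) >= ladder.index(required)

print(has_at_least("Admin", "Member"))                  # True  (may call remember)
print(has_at_least("Viewer", "Member"))                 # False (read-only)
print(has_at_least("Editor", "Editor", CONTEXT_ROLES))  # True  (may call update_context)
```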

REST API

In addition to MCP tools, a full REST API is available:

  • Memory: remember, recall, reference, forget, explore (/api/v1/memory/*)
  • Contexts: CRUD, search settings (/api/v1/contexts/*)
  • Attachments: File upload/download for memories (/api/v1/attachments/*, 5MB limit)
  • Workspaces: Management, members, invitations (/api/v1/workspaces/*)
  • Admin: Users, plan management, neural config (/api/v1/admin/*)

Full API documentation: http://localhost:8080/redoc

Authentication

Two OAuth2 providers are supported (both optional; password + MFA login works without OAuth):

  • Google OAuth2 — Set GOOGLE_CLIENT_ID and GOOGLE_CLIENT_SECRET
  • GitHub OAuth2 — Set GITHUB_CLIENT_ID and GITHUB_CLIENT_SECRET

Users with the same email address across providers share a single account.

Plan Tier Customization

Plans control resource limits per workspace. Defaults:

| Plan | Contexts | Memories | MCP calls/day |
| --- | --- | --- | --- |
| S (Free) | 1 | 1,000 | 1,000 |
| M (Basic) | 3 | 10,000 | 10,000 |
| L (Pro) | 20 | 100,000 | 50,000 |

Override via environment variables:

PLAN_FREE_MAX_CONTEXTS=5
PLAN_FREE_MEMORY_LIMIT=5000
PLAN_BASIC_MAX_CONTEXTS=10
PLAN_PRO_MAX_CONTEXTS=50

For self-hosted single-user setups, assign the L (Pro) plan to your workspace. Plan changes are admin-only by default. For SaaS deployments with self-service billing, enable Stripe:

BILLING_ENABLED=true
STRIPE_SECRET_KEY=sk_...
STRIPE_WEBHOOK_SECRET=whsec_...
STRIPE_PRICE_BASIC=price_xxx
STRIPE_PRICE_PRO=price_yyy

Development with Claude Code

This project is designed to be developed with Claude Code and Kagura Memory Cloud itself.

Getting Started with Claude Code

  1. Install Claude Code

  2. Clone the repo and start services (see Quick Start)

  3. Create an API key in the Web UI (http://localhost:3000)

  4. Create .mcp.json in the project root (see Claude Code setup)

    .mcp.json is in .gitignore — never commit it (contains API keys)

  5. Verify in Claude Code:

    > list_contexts()
    > create_context(name="my-project")
    
  6. Pre-configured tooling loads automatically from .claude/:

Slash Commands

| Command | Description |
| --- | --- |
| /setup | Set up dev environment from scratch (detects WSL/macOS/Linux) |
| /issue-start <number> | Start work on a GitHub issue (branch + past knowledge recall) |
| /remember <text> | Save patterns, decisions, or learnings to memory |
| /recall <query> | Search past knowledge for relevant patterns |
| /guide | Show Kagura Memory Cloud usage guide |
| /test | Run full test suite (backend pytest + frontend build) |
| /quality | Run all quality checks (ruff, pyright, frontend build) |
| /self-review | Pre-PR self-review against SOLID/DRY/KISS/security checklist |
| /self-maint | Audit .claude/ config against current codebase state |
| /api-docs-audit | Audit OpenAPI tags and endpoint documentation |

Hooks (Automatic Safety Guards)

  • Auto-format: Python (ruff) / TypeScript (prettier) on every file save
  • Secret detection: Blocks commits containing hardcoded API keys or passwords
  • SQL injection prevention: Blocks f-string SQL queries (enforces SQLAlchemy)
  • .env protection: Prevents accidental modification of .env files
  • Migration protection: Prevents modification of lock files
  • Memory sync: Auto-syncs Claude Code memory files to Kagura Memory Cloud (details)

Agents

  • code-reviewer: Read-only code review against project standards (runs on Sonnet)
  • test-runner: Test execution, failure diagnosis, and auto-fix (runs on Sonnet)

Rules (Auto-loaded Context)

  • rules/backend.md: FastAPI/Python patterns, async requirements, testing conventions
  • rules/frontend.md: Next.js/TypeScript patterns, Tailwind, SWR conventions
  • rules/security.md: Auth requirements, RBAC, SQL safety, CORS policy

Documentation

Japanese Guide

Using the Claude Code Plugin

Install

# Add from the marketplace
/plugin marketplace add kagura-ai/memory-cloud
/plugin install kagura-memory@kagura-memory-cloud

Available skills

| Skill | Description |
| --- | --- |
| /kagura-memory:session-start | Restore the previous session's context |
| /kagura-memory:session-summary | Save session knowledge before ending |
| /kagura-memory:recall | Search past knowledge |
| /kagura-memory:remember | Save new knowledge |
| /kagura-memory:guide | Usage guide, connection check, and setup help |
| /kagura-memory:smoke-test | Verify that all MCP tools work |

Recommended workflow

/kagura-memory:session-start       # ← Start here: restore the previous session's context
  ... work normally ...
/kagura-memory:recall              # Search past design decisions, patterns, and fixes
/kagura-memory:remember            # Save important learnings as you go
  ... finish work ...
/kagura-memory:session-summary     # ← End here: save session knowledge for next time

Skills wrap the MCP tools (recall, remember, etc.) with workflow logic: context selection, git state analysis, and structured prompts are built in. Use skills for session management and guided workflows; use the MCP tools directly for fine-grained operations.

Prerequisite: an MCP connection must be configured (.mcp.json with an API key). Run /kagura-memory:guide in your project to set it up.

Contributing

See CONTRIBUTING.md for development setup, code style, and PR workflow.

License

Apache License 2.0
