cache-llm 🧠

A blazing fast local proxy server that caches LLM API calls to save you money during agent development.

cache-llm is a zero-config, ultra-fast proxy server that intercepts your outgoing LLM requests (e.g., to OpenAI), forwards them to the real API, and stores the response in a local SQLite database.

The next time your code sends an identical request, cache-llm returns the cached response in under 2 ms.

Why?

When building autonomous AI agents or complex AI workflows, you end up running the same test suites and prompts thousands of times. This burns through your OpenAI/Anthropic API credits quickly and slows your local development cycle, because every run waits on live API calls.

With cache-llm, repeated requests are served from the local cache, so your API bill shrinks to almost zero during iterative local testing and cached tests return almost immediately.

Installation & Usage

You can run it instantly without installing:

npx @dinakars777/cache-llm

This will start the proxy server on port 8080, targeting https://api.openai.com, and caching responses in ./.llm-cache.db.

Options

-p, --port: Which port to run the proxy on (Default: 8080).
-t, --target: The base URL of the LLM API (Default: https://api.openai.com).
-d, --db: The location to store the SQLite database (Default: ./.llm-cache.db).

Configuring your App

Just point your project's BASE_URL to the proxy!

OpenAI Node.js SDK

import OpenAI from 'openai';

const openai = new OpenAI({
  apiKey: process.env.OPENAI_API_KEY,
  baseURL: 'http://localhost:8080/v1' // Point this to cache-llm
});

LangChain, AutoGen, etc.

Just set the environment variable:

export OPENAI_BASE_URL="http://localhost:8080/v1"
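
If you would rather keep the switch in one place in your own code, a small helper can pick the base URL from the environment and fall back to the real API when the variable is unset. This is only a sketch: `resolveBaseUrl` and `DEFAULT_BASE_URL` are hypothetical names, not part of cache-llm, and the fallback assumes OpenAI's public `/v1` endpoint.

```typescript
// Hypothetical helper (not part of cache-llm): choose the API base URL
// from an environment-like object, falling back to the real OpenAI endpoint.
const DEFAULT_BASE_URL = "https://api.openai.com/v1";

function resolveBaseUrl(env: Record<string, string | undefined>): string {
  const fromEnv = env["OPENAI_BASE_URL"]?.trim();
  // Treat a missing, empty, or whitespace-only value as "not set".
  return fromEnv ? fromEnv : DEFAULT_BASE_URL;
}
```

Call it as `resolveBaseUrl(process.env)` and pass the result as `baseURL` when constructing the client. With the export above in place, requests go through the proxy; without it, they go straight to the API.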

How It Works

1. cache-llm computes a deterministic SHA-256 hash of the HTTP method, URL path, Authorization header, and raw request body.
2. If the hash exists in the local SQLite DB (a HIT), it immediately returns the stored JSON response.
3. If it's a MISS, it forwards the request to the target API, stores the response in the DB, and then returns it to your app.
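
The steps above can be sketched roughly as follows. This is an illustration under stated assumptions, not cache-llm's actual source: the names `CacheableRequest`, `cacheKey`, `handle`, and `forward` are hypothetical, the way the four fields are combined before hashing is a guess, and an in-memory `Map` stands in for the SQLite database.

```typescript
import { createHash } from "node:crypto";

// The four request fields that go into the hash.
interface CacheableRequest {
  method: string;
  path: string;
  authorization: string;
  body: string; // raw request body, unparsed
}

// Assumption: each field is length-prefixed before hashing so that field
// boundaries are unambiguous. The real tool may combine them differently.
function cacheKey(req: CacheableRequest): string {
  const hash = createHash("sha256");
  for (const field of [req.method, req.path, req.authorization, req.body]) {
    hash.update(`${Buffer.byteLength(field, "utf8")}:`);
    hash.update(field, "utf8");
  }
  return hash.digest("hex");
}

// Stand-in for the SQLite table: cache key -> stored JSON response text.
const store = new Map<string, string>();

// `forward` is a hypothetical callback that performs the real upstream call.
async function handle(
  req: CacheableRequest,
  forward: (req: CacheableRequest) => Promise<string>,
): Promise<{ hit: boolean; response: string }> {
  const key = cacheKey(req);
  const cached = store.get(key);
  if (cached !== undefined) {
    return { hit: true, response: cached }; // HIT: no upstream call
  }
  const response = await forward(req); // MISS: call the target API
  store.set(key, response); // remember it for the next identical request
  return { hit: false, response };
}
```

Because the Authorization header is part of the hash, the same prompt sent with a different API key produces a different key and is treated as a MISS. Hashing the raw body also means any byte-level difference, such as a reordered JSON field or a changed parameter, results in a separate cache entry.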

License

MIT
