LightShield

LightShield is a lightweight Python middleware that reduces prompt injection risk by tagging system and user content and enforcing a strict instruction hierarchy before prompts are sent to an LLM. It does not modify model weights or call a separate classifier—only prompt construction and response sanitization.

Install

pip install ollama   # required for the Ollama shield

Clone or install the LightShield package so you can import it.

Use (Ollama)

Import and create a shield

Choose the engine (e.g. "ollama"). The shield wraps that backend so every chat call is tagged and responses are sanitized.
```
from lightshieldai import Shield

shield = Shield()
```

Call chat

Pass the same model and messages you would use with Ollama. Use standard role and content keys. LightShield uses system and user messages only (RAG/retrieved layers are for a later release).

response = shield.chat(
    model="qwen2.5",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "What is 2+2?"},
    ],
)

Read the response

The returned object has the same shape as Ollama’s response. The message content is sanitized so internal LightShield tags are not exposed.
```
print(response["message"]["content"])
```
Other Ollama features

Other calls are passed through to the underlying backend (e.g. shield.list(), shield.pull("qwen2.5")). Only chat() is wrapped and sanitized. Streaming is disabled so that responses can be sanitized reliably.

What LightShield does

Before the call: Builds a system message that includes an authority/hierarchy paragraph and wraps your system and user text in unique tags so the model sees clear boundaries and priorities (system over user).
After the call: Strips those tags from the model’s reply so they never reach your application.

Building blocks (advanced)

If you want to plug LightShield into another API or build your own flow, you can use the lower-level pieces:

LayerPrompt — Creates one tag per layer (system, user, retrieved) and returns authority_text() plus a tags dict. Use tags["system"].wrap(...) and tags["user"].wrap(...) to build the strings you send.
Tag — Single tag with short_id and wrap(content) for <LS_id>content</LS_id>.

You would then call your own LLM with the wrapped prompts and implement sanitization (strip <LS_*> and </LS_*>) using the same tag ids from that LayerPrompt instance.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
dist		dist
lightshieldai.egg-info		lightshieldai.egg-info
lightshieldai		lightshieldai
tests		tests
website		website
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
baseline_results.csv		baseline_results.csv
pyproject.toml		pyproject.toml
rag.py		rag.py
rag_baseline_results.csv		rag_baseline_results.csv
rag_shield_results.csv		rag_shield_results.csv
requirements.txt		requirements.txt
shield_results.csv		shield_results.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LightShield

Install

Use (Ollama)

What LightShield does

Building blocks (advanced)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LightShield

Install

Use (Ollama)

What LightShield does

Building blocks (advanced)

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages