aiPolaris

Federal AI agent orchestration stalls on compliance gaps, audit failures, and capability boundaries that aren't enforced until production. aiPolaris enforces all three from the first commit.

AI systems fail from architectural ambiguity, not model weakness.

What aiPolaris Is

aiPolaris is the agent orchestration layer in the VPL Solutions AI platform.

Product	Role
Meridian	Governed RAG control plane — retrieval + governance
aiNexus	Enterprise data pipeline — Graph API, ADLS, AI Search
aiPolaris	Agent orchestration — LangGraph DAG, sandboxing, audit trail

aiPolaris reads the aiNexus index. It does not own or populate it.

Architecture

                    ┌─────────────────────────────────────────┐
                    │         Entra ID Auth (MSAL)            │
                    │       RBAC per endpoint capability       │
                    └──────────────────┬──────────────────────┘
                                       │
                         POST /query (StreamingResponse)
                                       │
                    ┌──────────────────▼──────────────────────┐
                    │           LangGraph DAG                  │
                    │                                          │
                    │  ┌──────────┐      ┌──────────────┐      │
                    │  │ Planner  │─────▶│  Retriever   │      │
                    │  │tools:none│      │tools:        │      │
                    │  └──────────┘      │ai_search_read│      │
                    │                    │only          │      │
                    │                    └──────┬───────┘      │
                    │                           │              │
                    │                ┌──────────▼──────────┐   │
                    │                │    Synthesizer      │   │
                    │                │    tools: none      │   │
                    │                │  answer + citations │   │
                    │                └─────────────────────┘   │
                    │                                          │
                    │  TraceContext: trace_id + StepLog        │
                    │  CapabilityViolationError enforced       │
                    │  Session memory: TTL=1800s               │
                    └──────────────────────────────────────────┘
                                       │
                    ┌──────────────────▼──────────────────────┐
                    │  Azure AI Search (READ ONLY)            │
                    │  Index owned by aiNexus                 │
                    │  Confidence threshold: 0.75             │
                    └──────────────────────────────────────────┘
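
The flow in the diagram can be sketched without LangGraph as three plain functions run in sequence. This is only an illustration: the `State` fields, the `run` helper, the refusal text, and the way the 0.75 confidence threshold is applied (dropping low-scoring hits before synthesis) are assumptions, not the project's actual implementation.

```python
from dataclasses import dataclass, field

CONFIDENCE_THRESHOLD = 0.75  # value taken from the diagram


@dataclass
class State:
    # Hypothetical state shape passed from node to node.
    query: str
    plan: list = field(default_factory=list)
    hits: list = field(default_factory=list)
    answer: str = ""
    citations: list = field(default_factory=list)


def planner(state: State) -> State:
    # Assumption: a real planner would call the model; here the plan is the query.
    state.plan = [state.query]
    return state


def make_retriever(search):
    # `search` stands in for the read-only Azure AI Search call.
    def retriever(state: State) -> State:
        for sub_query in state.plan:
            state.hits.extend(search(sub_query))
        return state
    return retriever


def synthesizer(state: State) -> State:
    # Assumption: hits below the threshold are discarded; no usable hits means refusal.
    usable = [h for h in state.hits if h["score"] >= CONFIDENCE_THRESHOLD]
    if not usable:
        state.answer = "Insufficient evidence to answer."
        return state
    state.answer = " ".join(h["text"] for h in usable)
    state.citations = [h["source"] for h in usable]
    return state


def run(query: str, search) -> State:
    # Planner -> Retriever -> Synthesizer, in the order shown in the diagram.
    state = State(query=query)
    for node in (planner, make_retriever(search), synthesizer):
        state = node(state)
    return state
```

Passing `search` in as a parameter keeps the sketch runnable without Azure credentials and mirrors the constraint that only the Retriever touches the index.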

GCCH: set environment=gcch in the Terraform workspace.
All endpoints switch automatically, with no code changes.

Key Properties

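Deterministic: temperature=0, pinned model, pinned prompts (prompts.lock). The same input is expected to produce the same output, and a run can be verified from its trace_id alone. The eval harness gates this with a replay match rate of at least 95%.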
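
Prompt pinning can be illustrated with a hash check. The sketch below assumes prompts.lock is a JSON object mapping prompt names to SHA-256 hex digests; the real file format is not shown in this README, and `prompt_hash` and `verify_prompts` are hypothetical names.

```python
import hashlib
import json


def prompt_hash(text: str) -> str:
    # SHA-256 of the UTF-8 encoded prompt text.
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def verify_prompts(prompts: dict, lock_json: str) -> list:
    """Return the names of prompts whose hash is missing from, or differs from, the lock."""
    locked = json.loads(lock_json)  # assumed format: {"prompt_name": "<sha256 hex>"}
    return [
        name
        for name, text in sorted(prompts.items())
        if locked.get(name) != prompt_hash(text)
    ]
```

A check like this would typically run at startup or in CI, so an edited prompt fails fast instead of silently changing outputs.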

Sandboxed: every agent node has a declared tool manifest. Any tool call outside that manifest raises CapabilityViolationError immediately. Sandbox tests must pass at 100% (see Eval Acceptance Criteria).
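
A minimal sketch of manifest enforcement follows. The manifest contents come from the architecture diagram (Planner and Synthesizer have no tools, Retriever has ai_search_read only); the `TOOL_MANIFEST` structure and the `call_tool` helper are assumptions made for illustration.

```python
class CapabilityViolationError(Exception):
    """Raised when a node tries to use a tool outside its declared manifest."""


# Per-node tool manifest, as declared in the architecture diagram.
TOOL_MANIFEST = {
    "planner": frozenset(),
    "retriever": frozenset({"ai_search_read"}),
    "synthesizer": frozenset(),
}


def call_tool(node: str, tool: str, registry: dict, *args, **kwargs):
    # Hypothetical gate: check the manifest before dispatching to the tool.
    allowed = TOOL_MANIFEST.get(node, frozenset())  # unknown nodes get no tools
    if tool not in allowed:
        raise CapabilityViolationError(f"{node!r} may not call {tool!r}")
    return registry[tool](*args, **kwargs)
```

Checking before dispatch means a violation is raised without the tool ever running, which is what makes the boundary testable.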

Auditable: every invocation has a trace_id, and every node appends a StepRecord. The full execution can be reconstructed from the audit log. Satisfies NIST SP 800-53 controls AU-2 and IR-4.
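
The trace structure might look like the sketch below. Only the names TraceContext, StepRecord, and trace_id come from this README; the record fields and the `record` and `to_audit_log` methods are hypothetical.

```python
import time
import uuid
from dataclasses import asdict, dataclass, field


@dataclass
class StepRecord:
    # Hypothetical fields; the README only states that each node appends a StepRecord.
    node: str
    input_summary: str
    output_summary: str
    started_at: float
    duration_ms: float


@dataclass
class TraceContext:
    trace_id: str = field(default_factory=lambda: str(uuid.uuid4()))
    steps: list = field(default_factory=list)

    def record(self, node: str, fn, payload):
        # Run one node and append its StepRecord, preserving execution order.
        started = time.time()
        t0 = time.perf_counter()
        result = fn(payload)
        elapsed_ms = (time.perf_counter() - t0) * 1000
        self.steps.append(
            StepRecord(node, repr(payload), repr(result), started, elapsed_ms)
        )
        return result

    def to_audit_log(self) -> dict:
        # Serializable view: everything needed to replay the run, keyed by trace_id.
        return {"trace_id": self.trace_id, "steps": [asdict(s) for s in self.steps]}
```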

GCCH-ready: all endpoints are parameterized by a Terraform workspace variable. Moving from commercial to GCCH is a single command.
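
On the application side, the same switch could be consumed as a single environment value. The sketch below is an assumption about how that might look: the `ENVIRONMENT` variable name and `resolve_endpoints` helper are hypothetical, while the host names are the public Azure commercial and Azure Government endpoints for Entra ID and Azure AI Search.

```python
import os

# Cloud-specific hosts for Entra ID sign-in and Azure AI Search.
ENDPOINTS = {
    "commercial": {
        "authority_host": "https://login.microsoftonline.com",
        "search_suffix": "search.windows.net",
    },
    "gcch": {
        "authority_host": "https://login.microsoftonline.us",
        "search_suffix": "search.azure.us",
    },
}


def resolve_endpoints(service_name: str, environment=None) -> dict:
    # Hypothetical: read the environment from an ENVIRONMENT variable if not passed in.
    env = (environment or os.environ.get("ENVIRONMENT", "commercial")).lower()
    if env not in ENDPOINTS:
        raise ValueError(f"unknown environment: {env!r}")
    cfg = ENDPOINTS[env]
    return {
        "environment": env,
        "authority_host": cfg["authority_host"],
        "search_endpoint": f"https://{service_name}.{cfg['search_suffix']}",
    }
```

Keeping every cloud-specific host in one table is what makes "no code changes" plausible: call sites only ever see resolved endpoints.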

Eval Acceptance Criteria

Metric	Threshold
p95 latency	< 4,000 ms
p50 latency	< 1,500 ms
Avg confidence	> 0.75
Correct refusal rate	100%
Incorrect refusal rate	< 10%
Follow-up pass rate	4/4 (100%)
Sandbox tests	100% pass
Replay match rate	≥ 95%
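
Some of these thresholds can be expressed as a simple gate over eval results. The sketch below covers the latency, confidence, and replay rows only; the function names, the input shapes, and the choice of the nearest-rank percentile method are assumptions rather than the harness's actual logic.

```python
import math


def percentile(values, pct: int):
    # Nearest-rank percentile: smallest value with at least pct% of samples at or below it.
    ordered = sorted(values)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def check_thresholds(latencies_ms, confidences, replay_matches) -> dict:
    """Return a pass/fail flag per metric, using the thresholds from the table."""
    return {
        "p95_latency": percentile(latencies_ms, 95) < 4000,
        "p50_latency": percentile(latencies_ms, 50) < 1500,
        "avg_confidence": sum(confidences) / len(confidences) > 0.75,
        "replay_match_rate": sum(replay_matches) / len(replay_matches) >= 0.95,
    }
```

A CI job could fail the build whenever any value in the returned dictionary is false.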

Getting Started

make install       # pip install -e .[dev] + pre-commit hooks
make dev           # uvicorn on port 8000
make graph-viz     # export LangGraph DAG as Mermaid
make eval          # run full eval harness (20 questions)
make tf-plan env=commercial
make tf-plan env=gcch

ADRs

ADR	Decision
ADR-001	LangGraph over LangChain / SK / AutoGen
ADR-002	Read-only agents, CapabilityViolationError
ADR-003	GCCH Terraform workspaces
ADR-004	TraceContext on every invocation
ADR-005	Prompt hash-pinning via prompts.lock
ADR-006	Streaming responses
ADR-007	Session memory (TTL=1800s)
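
As an illustration of ADR-007, session memory with a 1800-second TTL could be sketched as below. The `SessionMemory` class, its in-process dictionary storage, and the sliding expiry (each new turn resets the TTL) are assumptions; the README does not say where sessions are stored or how expiry is measured.

```python
import time

SESSION_TTL_SECONDS = 1800  # TTL from ADR-007


class SessionMemory:
    def __init__(self, ttl: float = SESSION_TTL_SECONDS, clock=time.monotonic):
        self._ttl = ttl
        self._clock = clock  # injectable so expiry can be tested without sleeping
        self._sessions = {}  # session_id -> (expires_at, turns)

    def append(self, session_id: str, turn: str) -> None:
        now = self._clock()
        entry = self._sessions.get(session_id)
        # Start fresh if the session is new or has already expired.
        turns = entry[1] if entry is not None and entry[0] > now else []
        turns.append(turn)
        # Assumption: sliding TTL, refreshed on every appended turn.
        self._sessions[session_id] = (now + self._ttl, turns)

    def history(self, session_id: str) -> list:
        entry = self._sessions.get(session_id)
        if entry is None or entry[0] <= self._clock():
            self._sessions.pop(session_id, None)  # drop expired state eagerly
            return []
        return list(entry[1])
```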

Philosophy

Control precedes generation. Observability precedes scale. Governance precedes automation.

I design systems where failure modes are explicit, not discovered in production.

