John London B-A-M-N

John London · AI Systems Architect

I design the workflows — AI implements them.

Available for contracts, consulting, and full-time roles.
Async-first · Local-first · Deterministic-by-design

🌐 Landing Page · 𝕏 Twitter · 🦙 Ollama · 📧 Email

Who I Am

I'm a 36-year-old single father and primary caregiver for my young son with autism. I walked away from my career 4 years ago because he needed someone present — and spent those 4 years teaching myself AI from the ground up, starting with hardware, because my long-term goal is total resource self-sufficiency with AI.

Background: Precision machinist (±0.00015" tolerances) → Contract IT → Independent AI Systems Architect

Education: B.A. Psychology — George Mason University

Philosophy: I treat AI model failures as diagnostic problems, not tuning problems. My psychology training taught me behavioral diagnostics — isolating variables, identifying failure patterns, tracing root causes. I apply that same methodology to agent systems.

What I Build

I build governed orchestration systems that turn probabilistic AI models into reliable, observable, operator-controlled workforces. My work focuses on:

AI Agent Systems Architecture — coordination layers, tool governance, observability pipelines
Local-First AI Infrastructure — Ollama clusters, model routing, fallback chains, distributed inference
Agent Safety & Observability — zero-silent-action policies, audit trails, real-time monitoring

I specialize in systems that other engineers find too complex to own.

Featured Projects

⭐ Sheppard: AI agent for Ollama handling memory, automation, and knowledge distillation using Redis, PostgreSQL, and ChromaDB.
SOLLOL: Performance-aware load balancing for distributed Ollama clusters with Ray and Dask.
BrokeLLM: Local control plane for model-slot routing, fallbacks, and provider switching across CLI coding tools.
AgentFabric: Distributed CLI orchestration framework for multi-agent systems with governed MCP architecture.
Amnesic: Distributed memory architecture for agent coordination with externalized long-term reasoning state.
Vigilance: Real-time observability and audit system enforcing zero-silent-action policy for AI agents.
LlamaForge: LoRA fine-tuning pipeline with CPU/GPU DDP, GGUF conversion, and Ollama integration.
Conflux: AI development accelerator for structured multi-agent coding workflows with deterministic task graphs.

Technical Stack

Category	Technologies
Languages	Python, JavaScript/TypeScript
AI/ML	Ollama, PyTorch, DDP, LoRA, GGUF, RAG
Infrastructure	Ray, Dask, Docker, Redis, PostgreSQL, ChromaDB
Architecture	MCP, Agent Orchestration, Distributed Systems

Let's Work Together

I'm open to contract work, consulting engagements, and full-time roles in:

AI infrastructure and orchestration systems
Agent workflow design and governance
Local-first AI deployment and optimization
Developer tooling and observability

📧 [email protected]
🌐 bamnlanding.lovable.app

"I spent four years machining parts at ±0.00015" tolerances. I bring that same precision, constraint awareness, and failure-resilient design to every system I architect."
