New research shows that splitting coding work across multiple LLM agents causes a 25–39 percentage point accuracy drop, and that only better specifications, not fancier coordination tooling, can close the gap.
NVIDIA, Sysdig, and a wave of indie tools are shipping OS-level monitoring for coding agents. The industry just admitted that sandboxing alone isn't enough.
Cursor's Kimi revelation shows why practitioners need to trace the hidden dependencies in their AI development tools.
Google's new AI tool catches 53% of Linux kernel bugs by pattern-matching rather than trying to understand code, suggesting narrow AI applications beat ambitious ones.
While others build complex infrastructure for AI agents to navigate websites, Rover inverts the model by making the website itself the execution environment.
The real risk of OpenAI acquiring Astral isn't that uv goes proprietary. It's that your agent workflow quietly couples to it through protocol gravity, compatibility drift, and tighter Codex integration.
Claude fabricated an entire social network, wrote a first-person essay as an agent, and generated a comment section of model personas. The lesson isn't that LLMs hallucinate. It's that hallucination becomes a coordination mechanism.
New research shows AI coding agents exhibit consistent biases in problem-solving approaches that persist within model families but change across versions, creating novel challenges for production systems.
SWE-Skills-Bench finds that most agent skills don't improve real repo outcomes, and some make things worse. Independent research on 673 skills reveals why: the failure modes go well beyond simple version mismatch, and are more varied and surprising than expected.
JSSE passes 99.81% of test262 with zero human-written code. That's the easy part. Maintainability, harness trust, and the missing layers above conformance are where agent-generated code gets hard.