The Problem: 96% of AI Web Compute is Wasted
Billions of times per day, across every AI product on earth, burning compute, money, and carbon is waster due to an inefficient web, in adapted to AI Agents. Every time an AI reads or generates an user interface, it burns through thousands of tokens of structural boilerplate. A simple card component costs 400+ tokens in React or HTML. The actual semantic information in that card? Around Fifteen tokens. That is a 96% overhead ratio!
In this hackathon alone, with 1,000 participants making 100 AI interface calls, an estimated 196 million tokens were wasted on structural boilerplate, roughly $590,000 in compute and 413kg of CO₂: the equivalent of driving London to Edinburgh and back, four times over, during a single weekend. Let’s fix this!
The Solution: Building the Infrastructure for the Web 4.0, Agentic
AIP (Agent Interface Protocol) is the answer: a complete runtime for AI agents to generate, modify, read, and interact with interfaces at near-instantaneous speed. At its core is AIB (Agent Interface Bytecode), a novel mathematically-optimal machine-optimised intermediate representation achieving 5:1 compression over equivalent HTML/CSS/JS. This is the infrastructure for the agentic web: the layer that makes AI-generated interfaces as natural, efficient and instantaneous as AI-generated text.
The mathematically-optimal process: We generate a tree of webpage elements by frequency, which constitutes an huffman tree (e.g. like the Morse Code), from a public scan of 8 million websites. We then matched the webpage elements by frequency to the BPE Tokeniser (Qwen) by encoding, thus ensuring that the more frequent an element is, the faster it is to compute, and that each element is token-optimal in generation. Based on this, we generated a synthetic dataset of 5,000 Interface-AIP pairs, which we then used to finetune Qwen-2.5-Coder 2.5B to generate AIP natively. Achieving sub 1 second page generation locally and a compression of around 80% compared to regular HTML. And this, without compromising on quality nor customisability.
Features:
- Full sub 2 second page generation
- The page is NOT static! You can navigate, expand, regenerate and edit elements live
- Page is personalised based on user preferences
- MCP Integration for direct actions on the webpage
- Full voice agent with Eleven Labs
- Fine-Tuned Qwen-2.5 1.5B for faster generation
- Claude for Content Generation
More information attached on the maths whitepaper here: xxx
Inspiration
We kept running into the same wall. Every AI product, ChatGPT, Claude, Lovable, v0, Bolt, generates interfaces the same way: by writing thousands of tokens of HTML and CSS from scratch, every single time, for every single user. It felt wrong. Fundamentally wrong in the way that sending a fax to schedule a meeting feels. You can see the better system waiting to exist.
The insight came from information theory. Shannon's source coding theorem tells us that any compression below the entropy floor is impossible, and any compression above it is waste. We researched what the actual entropy of a user interface is, and when we ran the numbers across one million websites, the answer was so stark it felt like a design flaw in the entire industry, not ready for the agents revolution, and with billions left on the table.
The Tech Powering The Agent Interface Protocol
AIP is a vertically integrated, three-subsystem architecture built from first principles around a purpose-designed bytecode language (AIB), a fine-tuned generative model, and a deterministic compiler-renderer. AIB encodes UI components as single uppercase letters with two-character style modifiers and positional content slots, achieving 75-90% compression over HTML by targeting semantic intent rather than presentational syntax -- co-designed with Qwen 2.5's BPE tokeniser so every primitive costs exactly 1-2 tokens with zero encoding waste. A LoRA-fine-tuned Qwen 2.5 Coder 1.5B learns to emit this bytecode natively at 97.5% token accuracy (final loss 0.089) after training on validated synthetic pairs (600 for demo, 5,000 generated), while a three-stage runtime pipeline separates structural generation (~35-50 tokens, sub-second) from parallel content streaming via SSE, and a Kotlin Lexer-Parser-AST compiler deterministically renders the bytecode into production-quality HTML with Tailwind, dark mode, accessibility, and scroll animations, making AIP the first system to treat the AI-to-interface boundary as a formally grounded compression problem.
The system then extends beyond text input with a real-time LiveKit voice agent, OpenAI GPT-4o for transcription (STS), GPT-4.1 for reasoning (LLM), ElevenLabs Flash v2.5 for speech synthesis (TSS), and Silero VAD for voice activity detection, where users speak a description and receive a live, interactive webpage within seconds, with the React frontend rendering components in real-time via an animated aura visualisation tracking agent state (listening → thinking → speaking). This makes downstream capability, speed, cost, on-device inference, real-time voice, follow as a natural consequence.
AIP As A Startup
AIP can monetise through three tiers, Starter at €29/month (50K generations), Pro at €99/month (500K generations with full analytics), and Enterprise at €299+/month (unlimited self-hosted generations, custom component libraries) - all metered per-generation using so customers pay only for what they use. Stripe powers the subscription lifecycle: checkout, card-on-file billing, upgrades, and enterprise invoicing are live Stripe checkout sessions wired directly into our infrastructure. Paid.ai sits on top as the usage intelligence layer - every AIB generation is tracked as a metered signal through Paid.ai's API, feeding real-time dashboards that show token consumption, compression ratio vs raw HTML (averaging >5×), and exact cost savings per account, while auto-generating invoices and handling overage billing. Stripe processes the money; Paid.ai makes the value visible and the model frictionless from first sign-up to renewal.
The path to billions follows the Stripe play-book: advertise the benefits of the protocol to drive adoption, monetise the optimised runtime and fine-tuned models, and let network effects compound - every interface generated improves frequency analysis, which improves compression, which improves the product for everyone, creating a data moat that cannot be replicated overnight. At scale, AIP saves 90kg of CO₂ daily and makes every AI interface builder on earth 5-50× faster and 80-98% cheaper. That is not a feature - that is a new layer of the web, and the ROI is visible on the first invoice.
A picture speaks a 1,000 words,
A website takes 3,000 tokens, and it shouldn’t.
What’s Next
Our Plan is Simple: Distribute, Scale, get PAID(.ai), Profit!
Scale the Bootstrap Standard Library to 10,000+ components from a 10-million-site scan, ship the embedded <script> runtime so any AI product can integrate AIP in one line, and launch the Paid.ai-metered cloud API publicly - turning every AI-generated interface on the internet into a customer. Then: enterprise contracts, vertical fine-tuned models (e-commerce, dashboards, learning platforms), and a native mobile renderer - compounding the data moat with every generation until AIP is the invisible infrastructure layer beneath every agentic interface on the web.
Built With
- claude
- css
- elevenlabs
- javascript
- kotlin
- lmstudio
- stripe

Log in or sign up for Devpost to join the conversation.