AIP - AI Interface Producer

A real-time voice AI agent that helps people solve daily problems by generating UIs through an agentic voice interface, backed by our custom LLM and our proprietary Agent-to-Interface Protocol (AIP), which is designed to reduce token bloat.

Live demo link: https://aip-neon.vercel.app

🎯 Project Overview

Core Feature: Talk to an AI agent that generates interfaces in real time as you describe them. The generated UI appears instantly on screen while voice interaction persists through an elegant Aura overlay.

Demo Flow:

  1. Connect to voice agent
  2. Say "Create a user profile card with avatar and bio"
  3. Agent generates and displays the interface instantly
  4. Continue voice interaction with mini Aura overlay
  5. Agent can end session gracefully via voice command

๐Ÿ—๏ธ Architecture

┌─────────────────┐    ┌──────────────────┐    ┌─────────────────┐
│  React Frontend │    │  LiveKit Cloud   │    │   Voice Agent   │
│                 │◄──►│                  │◄──►│                 │
│ • Aura Component│    │ • WebRTC         │    │ • GPT-4 + Tools │
│ • UI Rendering  │    │ • Text Streams   │    │ • ElevenLabs TTS│
│ • Session Mgmt  │    │ • Authentication │    │ • OpenAI STT    │
└─────────────────┘    └──────────────────┘    └─────────────────┘

Tech Stack

Backend Agent (Deployed to LiveKit Cloud):

  • LiveKit Agents SDK v1.4.2 - Voice agent framework
  • OpenAI GPT-4.1 - LLM + gpt-4o-transcribe STT
  • ElevenLabs eleven_flash_v2_5 - TTS
  • Silero VAD + MultilingualModel - Turn detection
  • Python 3.12 via Anaconda

Frontend:

  • React 19.2 + Vite 7.3.1 - UI framework
  • LiveKit Components - WebRTC client + official AgentAudioVisualizerAura
  • Tailwind CSS v4 + shadcn - Styling
  • Shadow DOM - CSS scoping for generated UI

Infrastructure:

  • LiveKit Cloud - Agent hosting + WebRTC infrastructure
  • Project: aip (wss://aip-go1n19vl.livekit.cloud)
  • Agent ID: CA_dJ9gqgtu9hJB (EU West B region)

🚀 Quick Start

Prerequisites

# Backend
python 3.12+ (Anaconda recommended)
lk CLI (npm install -g @livekit/cli)

# Frontend  
node 18+

1. Clone & Setup

git clone <repo>
cd aip

# Backend setup
cd backend
pip install -r requirements.txt
cp .env.example .env  # Add your API keys

# Frontend setup
cd ../frontend
npm install
cp .env.example .env  # Add VITE_SANDBOX_ID

2. API Keys Required

Backend (.env):

OPENAI_API_KEY=sk-...
ELEVEN_API_KEY=sk_...
LIVEKIT_URL=wss://aip-go1n19vl.livekit.cloud
LIVEKIT_API_KEY=APIgiR8q69idSmU
LIVEKIT_API_SECRET=...

Frontend (.env):

VITE_SANDBOX_ID=aiphack-10p40t

3. Development

Option A: Use Deployed Agent (Recommended)

# Frontend only
cd frontend
npm run dev  # → http://localhost:5173

Option B: Local Development

# Terminal 1: Local agent
cd backend
python agent.py console  # or `python agent.py dev` for cloud mode

# Terminal 2: Frontend
cd frontend  
npm run dev

4. Production Deployment

Agent (Already deployed):

cd backend
lk agent create --secrets-file .env
# ✅ Deployed to LiveKit Cloud automatically

Frontend (Deploy to Vercel):

cd frontend
npm run build
# Deploy dist/ to Vercel (no env vars needed - sandbox ID hardcoded)

🎮 Usage

Voice Commands

  • "Create a [description]" → Generates UI component
  • "Make it [modification]" → Modifies existing UI (planned)
  • "Show me the components" → Lists available components (planned)
  • "Goodbye" / "End session" → Agent gracefully disconnects

UI States

  1. Pre-connect: Large Aura with "Tap to connect"
  2. Connected: Generated UI with small Aura overlay (bottom-left)
  3. Disconnected: Clickable reconnect pill with pulse animation
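
The three UI states above can be sketched as a tiny state machine. This is a language-agnostic illustration in Python (the real frontend tracks this in React state; the `AuraState` and `next_state` names are invented here for illustration):

```python
from enum import Enum, auto

class AuraState(Enum):
    PRE_CONNECT = auto()   # large Aura with "Tap to connect"
    CONNECTED = auto()     # generated UI with small Aura overlay
    DISCONNECTED = auto()  # reconnect pill with pulse animation

# Legal transitions between the three UI states described above.
TRANSITIONS = {
    AuraState.PRE_CONNECT: {AuraState.CONNECTED},
    AuraState.CONNECTED: {AuraState.DISCONNECTED},
    AuraState.DISCONNECTED: {AuraState.CONNECTED},  # clickable reconnect
}

def next_state(current: AuraState, target: AuraState) -> AuraState:
    """Move to `target` if the transition is legal, else stay put."""
    return target if target in TRANSITIONS[current] else current
```

Modeling the transitions explicitly is what makes the reconnect pill safe: a stray event can never jump the UI from pre-connect straight to disconnected.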

Current Agent Tools

  • ✅ generate_ui() - Creates HTML/CSS interface (currently hardcoded)
  • ✅ end_session() - Graceful disconnect with drain → shutdown → sleep → disconnect
  • 🔄 modify_ui() - Edit existing interface (placeholder)
  • 🔄 list_components() - Show available components (placeholder)
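
Conceptually, these tools form a name-to-function registry that the LLM dispatches into by emitting a tool name plus arguments. A minimal stdlib sketch of that idea (the `tool` decorator and `dispatch` helper are hypothetical; the real agent registers its tools through the LiveKit Agents SDK's tool-calling API):

```python
# Hypothetical tool registry illustrating LLM tool dispatch.
TOOLS: dict[str, callable] = {}

def tool(fn):
    """Register a function as a voice-callable tool under its own name."""
    TOOLS[fn.__name__] = fn
    return fn

@tool
def generate_ui(description: str) -> str:
    # The real implementation returns generated HTML/CSS (currently hardcoded).
    return f"<div class='card'>{description}</div>"

@tool
def end_session() -> str:
    # The real implementation drains and shuts down the LiveKit session.
    return "session ended"

def dispatch(name: str, **kwargs) -> str:
    """Route a tool call from the LLM to the registered handler."""
    if name not in TOOLS:
        raise KeyError(f"unknown tool: {name}")
    return TOOLS[name](**kwargs)
```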

🔧 Development Notes

Agent Logs

lk agent logs                      # Runtime logs
lk agent logs --log-type=build     # Build logs

Key Implementation Details

  • Shadow DOM: Generated UI uses Shadow DOM for CSS scoping
  • Text Streams: Agent sends UI via room.local_participant.send_text()
  • Event Handling: Frontend uses participantDisconnected + disconnected for reliable disconnect detection
  • Session Management: session.end() called on disconnect for clean reconnection
  • Graceful Shutdown: Agent uses await session.drain() → session.shutdown() → room.disconnect()
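
The shutdown ordering matters: drain first so in-flight speech and tool calls finish, shut the session down second, and only then leave the room. A sketch with stub objects (the `StubSession`/`StubRoom` classes are stand-ins written for this example, not the LiveKit API):

```python
import asyncio

calls: list[str] = []  # records the order of teardown steps

class StubSession:
    async def drain(self):      # wait for in-flight speech/tool calls
        calls.append("drain")
    async def shutdown(self):   # tear down the agent session
        calls.append("shutdown")

class StubRoom:
    async def disconnect(self): # leave the WebRTC room last
        calls.append("disconnect")

async def end_session(session: StubSession, room: StubRoom) -> None:
    # Disconnecting before draining would cut the agent off mid-sentence.
    await session.drain()
    await session.shutdown()
    await room.disconnect()

asyncio.run(end_session(StubSession(), StubRoom()))
```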

Turn Detection Tuning

session = AgentSession(
    turn_detection=MultilingualModel(
        unlikely_threshold=0.3   # Lower = more sensitive
    ),
    # Endpointing delays are AgentSession parameters, not MultilingualModel's
    min_endpointing_delay=0.35,  # Faster response
    max_endpointing_delay=2.0,   # Max wait time
)

📋 Roadmap

Phase 1: Hackathon MVP ✅

  • Voice agent with tool calling
  • Official Aura component integration
  • Real-time UI generation display
  • Agent-controlled session ending
  • Reconnectable sessions
  • Deploy to LiveKit Cloud

Phase 2: Enhanced Generation 🔄

  • LLM-powered UI generation (replace hardcoded HTML)
  • Component library integration
  • Interactive modifications via voice
  • Component export/save functionality

Phase 3: Production 🚀

  • Custom token endpoint (replace sandbox)
  • User authentication & sessions
  • Component persistence
  • Multi-user collaboration

๐Ÿ› ๏ธ Troubleshooting

Common Issues

  1. "Connecting" stuck: Check agent deployment status with lk agent logs
  2. No audio: Verify microphone permissions in browser
  3. Build fails: Ensure all API keys are set in .env files
  4. Agent not responding: Check OpenAI + ElevenLabs API key limits

Authentication

  • Development: Uses sandbox token server (current setup)
  • Production: Requires custom JWT token endpoint
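
A custom token endpoint essentially mints an HS256 JWT in LiveKit's access-token shape (claims signed with the API secret, with a "video" grant naming the room). A stdlib sketch under that assumption; in production you would use the official LiveKit server SDK rather than hand-rolling the JWT:

```python
import base64
import hashlib
import hmac
import json
import time

def b64url(data: bytes) -> str:
    """Base64url-encode without padding, as JWTs require."""
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode()

def make_access_token(api_key: str, api_secret: str,
                      identity: str, room: str) -> str:
    """Mint an HS256 JWT shaped like a LiveKit access token (sketch only)."""
    header = {"alg": "HS256", "typ": "JWT"}
    now = int(time.time())
    claims = {
        "iss": api_key,    # API key identifies the project
        "sub": identity,   # participant identity
        "nbf": now,
        "exp": now + 3600, # 1-hour validity
        "video": {"roomJoin": True, "room": room},
    }
    signing_input = (
        f"{b64url(json.dumps(header).encode())}"
        f".{b64url(json.dumps(claims).encode())}"
    )
    sig = hmac.new(api_secret.encode(), signing_input.encode(),
                   hashlib.sha256).digest()
    return f"{signing_input}.{b64url(sig)}"
```

The frontend would then fetch this token from the endpoint and pass it to the LiveKit client in place of the sandbox token.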

๐Ÿ† ParisHack 2026

Built for ParisHack hackathon with focus on rapid prototyping and user experience. The project demonstrates real-time voice-to-UI generation with persistent voice interaction capabilities.

Team: Solo project by @diniskakov
Demo: Live voice agent at wss://aip-go1n19vl.livekit.cloud
