The autonomous AI marketing agent that researches your competitors, builds your brand, generates landing pages, and optimizes them continuouslyβwhile you sleep.
3-minute walkthrough showing the full autonomous branding experience
Try Beevo Live β https://beevo-430715776322.us-central1.run.app
Note: Requires microphone access for voice interaction with the AI
Important
For Judges: The logo inspiration feature uses Puppeteer (headless Chrome) to scrape real brand logos, which does not work reliably on Cloud Run due to Google Images blocking requests from datacenter IPs. For the full experience, please run the app locally:
npm install
# Terminal 1:
npm run dev:server
# Terminal 2:
npm run dev:clientThis starts the server (http://localhost:3000) and client (http://localhost:5173) in separate terminals. All other features (voice, palettes, fonts, research, landing pages, watchers) work fine on the deployed version.
Gemini API Developer Competition β Build with Gemini
"Build new, creative apps and use cases powered by the Gemini API."
Beevo is an autonomous AI branding agent that turns a 6-week branding process into 6 minutes. It's not a chatbotβit's a full marketing department powered by Gemini.
For small businesses, branding is broken:
- Expensive β Hiring a marketing agency costs $15,000+ minimum
- Slow β Landing pages take days of back-and-forth with designers
- Manual β A/B testing, competitor research, and optimization require dedicated staff
- Fragmented β You need 5+ tools (Canva, SEMrush, Webflow, Hotjar, etc.)
Most founders spend 40% of their time on marketing they hate, instead of building what they love.
Beevo combines multiple Gemini capabilities into a single autonomous agent:
- Gemini Live API β Real-time voice conversations to understand your brand vision
- Google Search Grounding β Live competitor research with real market data
- Gemini Vision β Pixel-precise audits of brand assets and landing page analysis
- Gemini Image Generation β Logo kits, campaign images, and social media assets
- Veo Video Generation β Cinematic brand videos from text prompts
The result? An AI that listens to your brand vision, researches your market, creates your entire brand identity, builds your landing page, and optimizes it autonomously.
Beevo uses a unique Bridge architecture with two Gemini models working in tandem:
- Gemini Live (
gemini-2.5-flash-native-audio) β Handles real-time voice conversation with the user, providing natural audio responses - The Brain (
gemini-3-flash-preview) β A separate model that receives batched transcripts and decides when to call tools (create palettes, research competitors, generate assets)
When you speak to Beevo, your voice goes to Gemini Live for conversation. Simultaneously, the Brain analyzes the transcript and autonomously calls toolsβcreating color palettes, researching competitors, and generating assets without you needing to click anything.
Talk to Beevo like you'd talk to a real CMO:
- Gemini Live API powers real-time bidirectional voice conversations
- Audio Streaming via WebSocket with VAD (Voice Activity Detection)
- Voice Orb visualizes the conversation with reactive animations
- The AI confirms actions before executing: "Would you like me to save 'Nike' as your brand name?"
Not generic templatesβreal-time market intelligence using Google Search Grounding:
- Brand Input β Tell Beevo about your brand via voice
- Research Agent β Uses Google Search to find real competitors in your industry
- Aesthetic Analysis β Analyzes competitor color palettes, domains, and visual identity
- Strategic Gap β Identifies your unique market positioning via SWOT analysis
- Differentiated Assets β Generates palettes and typography positioned against competitors
After generating your landing page, Beevo deploys six specialized agents that continuously monitor and optimize each section:
| Watcher | Responsibility |
|---|---|
| Hero Watcher | Headline copy, hero visuals, above-the-fold impact |
| Proof Watcher | Statistics, trust badges, executive dashboards |
| Offer Watcher | CTA copy, pricing presentation, conversion triggers |
| PAS Watcher | Problem-Agitate-Solve messaging framework |
| Spec Watcher | Feature lists, technical specs, product details |
| Social Watcher | Testimonials, social proof, sharing elements |
Each watcher uses the Nano Banana Design System (NanoBananaService) to generate high-fidelity visual components (layout strategies like SPLIT, CLOUDS, TRIPTYCH, FORENSIC_GRID) and Gemini Vision to analyze real page screenshots for continuous refinement.
Upload any brand asset, and the Guardian uses Gemini Vision (gemini-3-pro-preview) to:
- Check if colors match your Brand DNA hex codes
- Detect logo distortion or Safe Zone violations
- Return bounding-box overlays pinpointing exact issues on the image
- Flag content that fails brand guidelines
Generate campaign assets directly from your Brand DNA:
- Images β Uses
gemini-2.5-flash-imageto generate campaign images, social media assets, and advertisements - Videos β Uses
veo-3.1-generate-previewto create cinematic brand videos with configurable lighting, camera movement, and color grading - Logo Kit β Generates a complete logo kit (primary logo, favicon, wordmark, social profile image) via parallel Gemini calls
A dedicated frame on the canvas for logo creation:
- Logo Structures β AI recommends logo types (Wordmark, Emblem, Monogram, Pictorial, etc.) based on your brand
- Logo Inspirations β Finds real brand logos via Google Search Grounding + Logo.dev API
- Logo Kit Generation β Creates primary, inverted, favicon, wordmark, and social media variants
The watchers are autonomous, but you stay in control:
- HITL Control Center β Approve, reject, or modify any watcher's proposed changes
- Command Center β Lock specific sections, adjust performance metrics, and give directives
- Configurable Intervention β PRE_GENERATION (ask before creating) or POST_GENERATION (ask after)
- Auto-Proceed β Set a timeout; if you don't respond, changes auto-approve
- Telegram Notifications β Get notified on your phone when a watcher requests approval
- Web Push Notifications β Browser push for desktop alerts
Everything lives on a ReactFlow-powered infinite canvas with 24 node types:
- Palette Nodes β Interactive color palettes with copy-to-clipboard
- Typography Nodes β Live font previews with Google Fonts integration
- Thought Signature Nodes β Transparent AI reasoning chains
- Landing Page Frame β Live landing page preview with watcher status
- Logo Studio Frame β Logo generation workspace
- Imagery Nodes β AI-generated brand imagery concepts
- Vault Node β Brand DNA summary and document storage
- Voice Orb Node β Embedded voice interaction point
| Feature | Description |
|---|---|
| Voice-First Branding | Talk to your AI CMO via Gemini Live β it listens, confirms, then acts |
| Live Competitor Research | Google Search Grounding finds real competitors and analyzes their branding |
| 5 Research Phases | Vision Analysis β Competitor Research β Aesthetics β Palettes β Typography |
| 6 Watcher Agents | Autonomous optimization of every landing page section |
| Logo Studio | Logo structures, inspirations, and full kit generation |
| Forge (Image + Video) | Gemini Image + Veo video generation from brand context |
| Guardian Audits | Pixel-precise vision-based brand compliance checking |
| Nano Banana Design System | High-fidelity visual component generation (Proof, PAS, Spec, Offer, Social) |
| HITL Control Center | Approve/reject watcher changes via web, Telegram, or push notifications |
| Infinite Canvas | ReactFlow-based canvas with 24 node types for brand visualization |
| Workspace Management | Multiple brand projects with independent state and history |
| Thought Signatures | Every AI decision shows transparent reasoning |
| Model | Identifier | Purpose |
|---|---|---|
| Gemini 3 Pro | gemini-3-pro-preview |
Strategic reasoning (Strategist SWOT), Guardian Vision audits |
| Gemini 3 Flash | gemini-3-flash-preview |
Brain tool orchestration, asset generation, watcher refinement |
| Gemini Live | gemini-2.5-flash-native-audio-preview |
Real-time bidirectional voice conversation |
| Gemini Image | gemini-2.5-flash-image |
Logo kits, campaign images, social media assets |
| Veo 3.1 | veo-3.1-generate-preview |
Cinematic video generation |
Input: Real-time PCM audio stream via WebSocket
Process: Bidirectional voice conversation + VAD turn management
Output: AI voice responses + transcript buffered for Brain analysis
Input: Batched conversation transcripts from Live
Process: Gemini 3 Flash with 20+ function declarations
Output: Tool calls (create_color_palette, start_brand_research, etc.)
Tools: update_brand_name, update_mission, create_color_palette,
select/unselect/delete palettes, create_fonts, create_logo_structures,
create_logo_inspirations, create_imagery, general_research, etc.
Input: Brand name + industry
Process: Google Search Grounding β competitor discovery β SWOT analysis
Output: Competitor list, color analysis, strategic gap, differentiation strategy
Images: gemini-2.5-flash-image β campaign images, logo kits (4 parallel calls)
Videos: veo-3.1-generate-preview β cinematic videos with polling
Audits: gemini-3-pro-preview β vision-based pixel-precise asset auditing
Input: Brand DNA + selected assets
Process: Generate structured JSON configs for each section type
Sections: Hero, Proof (stats/dashboard), PAS (Problem-Agitate-Solve),
Spec (features), Social (testimonials), Offer (pricing/CTA)
Layouts: SPLIT, CLOUDS, TRIPTYCH, FORENSIC_GRID
Process: Capture screenshot β Gemini Vision analysis β Gemini Flash refinement
Result: Updated section content pushed to client via WebSocket
HITL: Watchers request approval before/after changes via NotificationService
- React + TypeScript (Vite) β Fast development with HMR
- ReactFlow β Infinite canvas with 24 custom node types
- Zustand β State management (
useBrandStore,useActivityStore) - WebSocket β Real-time bidirectional communication with server
- Lucide React β Icon library
- Node.js + Express + WebSocket β Unified HTTP + WS server
- TypeScript (tsx) β Runtime TypeScript compilation (no build step)
- Google GenAI SDK (
@google/genai) β All Gemini API calls - Puppeteer β Logo research via headless Chrome (stealth mode)
- SQLite (
better-sqlite3) β Analytics and metrics storage - uuid β Unique ID generation for interventions
StateManagerβ Persists workspace state with versioned history, emits eventsSessionManagerβ WebSocket hub, broadcastsFULL_STATE_UPDATEto clientsExecutionEngineβ Orchestrates 20+ tools (palette creation, font generation, etc.)NanoBananaServiceβ High-fidelity visual component generationSearchGroundingServiceβ Gemini Search Grounding for logo discoveryNotificationServiceβ HITL intervention queue with debounced broadcastingTelegramServiceβ Telegram bot integration for mobile notificationsPushServiceβ Web Push (VAPID) notificationsMetricsServiceβ Analytics tracking and metrics storageMediaServiceβ File uploads and media handlingVideoGeneratorβ Veo video generation with operation pollingDatabaseServiceβ SQLite-based persistent storageSystemConfigServiceβ Per-workspace configuration management
- Google Cloud Run β Containerized deployment
- Google Cloud Build β Docker build pipeline
- Docker β Multi-stage build with Google Chrome for Puppeteer
flowchart TB
subgraph Client["π₯οΈ React Client (Vite + TypeScript)"]
UI["Agent Canvas (ReactFlow)"]
VoiceOrb["Voice Orb (Audio Streaming)"]
ThinkingPanel["Thinking Panel"]
HITL["HITL Control Center"]
Forge_UI["Forge (Image/Video Gen)"]
Guardian_UI["Guardian (Asset Audits)"]
end
subgraph Server["βοΈ Node.js Server"]
SM["Session Manager (WebSocket Hub)"]
Live["LiveConnection (Gemini Live)"]
Brain["BrainConnection (Tool Orchestrator)"]
TH["ToolHandler (20+ Tools)"]
EE["Execution Engine"]
State["State Manager"]
Notify["Notification Service"]
end
subgraph Agents["π€ Autonomous Agents"]
RA["Research Agent"]
LS["Logo Strategist"]
HW["Hero Watcher"]
PW["Proof Watcher"]
OW["Offer Watcher"]
PASW["PAS Watcher"]
SPW["Spec Watcher"]
SW["Social Watcher"]
end
subgraph Gemini["π§ Gemini API"]
GLive["Gemini Live (Voice)"]
GFlash["Gemini 3 Flash (Brain + Generation)"]
GPro["Gemini 3 Pro (Strategy + Vision)"]
GImage["Gemini Image (Forge)"]
GVeo["Veo 3.1 (Video)"]
GSearch["Google Search Grounding"]
end
subgraph Notifications["π± Notifications"]
Telegram["Telegram Bot"]
Push["Web Push (VAPID)"]
end
VoiceOrb --> SM
UI --> SM
HITL --> SM
SM --> Live --> GLive
Live --> Brain
Brain --> TH
TH --> EE --> State
RA --> GSearch
RA --> GPro
LS --> GSearch
TH --> GFlash
Forge_UI --> GImage
Forge_UI --> GVeo
Guardian_UI --> GPro
HW & PW & OW & PASW & SPW & SW --> GPro
HW & PW & OW & PASW & SPW & SW --> GFlash
HW & PW & OW & PASW & SPW & SW --> Notify
Notify --> Telegram
Notify --> Push
sequenceDiagram
participant U as User
participant F as Frontend
participant Live as Gemini Live
participant Brain as Brain (Gemini Flash)
participant G as Gemini APIs
participant S as State Manager
U->>F: Speak into Voice Orb
F->>Live: Audio Stream (WebSocket)
Live-->>F: AI Voice Response
Live->>Brain: Buffered Transcript
rect rgb(40, 40, 60)
Note over Brain,G: Brain Tool Decisions
Brain->>Brain: Analyze transcript, decide tools
Brain->>G: start_brand_research
G-->>Brain: Competitor data (Search Grounding)
Brain->>G: create_color_palette
G-->>Brain: Generated palettes (JSON)
Brain->>G: create_fonts
G-->>Brain: Typography pairings
end
Brain->>S: Save state (versioned)
S-->>F: FULL_STATE_UPDATE (WebSocket broadcast)
F->>F: Canvas renders new nodes
rect rgb(60, 40, 40)
Note over F,G: Watcher Loop (Autonomous)
loop Every cycle
F->>G: Screenshot β Vision analysis
G-->>F: Improvement suggestions
F->>G: Flash β Rewrite section
G-->>F: Optimized content
F->>U: HITL: Request approval?
end
end
- Node.js v18+
- npm
- A Gemini API Key (Get one here)
git clone https://github.com/excel-asaph/beevo.git
cd beevonpm installCreate a .env.local file in the project root:
GEMINI_API_KEY=your_gemini_api_key
# Optional: Google Custom Search (for competitor research)
GOOGLE_SEARCH_API_KEY=your_google_search_api_key
GOOGLE_SEARCH_CX=your_custom_search_engine_id
# Optional: Telegram Bot (for HITL mobile notifications)
TELEGRAM_BOT_TOKEN=your_telegram_bot_token
TELEGRAM_BOT_USERNAME=your_bot_username
# Optional: Web Push (VAPID keys for browser notifications)
VAPID_PUBLIC_KEY=your_vapid_public_key
VAPID_PRIVATE_KEY=your_vapid_private_key
VAPID_EMAIL=mailto:[email protected]npm run devThis starts both the client and server concurrently:
- Client (Vite): http://localhost:5173
- Server (tsx): http://localhost:3000
npm run init:system # Run system initializers (initial generators)
npm run start:watchers # Start the 6 watcher agents
npm run db:check # Check database state# Build Docker image (includes Google Chrome for Puppeteer)
gcloud builds submit --tag gcr.io/YOUR_PROJECT_ID/beevo
# Deploy to Cloud Run
gcloud run deploy beevo \
--image gcr.io/YOUR_PROJECT_ID/beevo \
--platform managed \
--region us-central1 \
--allow-unauthenticated \
--memory 2Gi \
--cpu 2 \
--set-env-vars "GEMINI_API_KEY=your_key"| Service | URL |
|---|---|
| Beevo | https://beevo-430715776322.us-central1.run.app |
beevo/
βββ client/ # React Frontend (Vite + TypeScript)
β βββ src/
β β βββ components/
β β β βββ Agent/ # Canvas, 24 node types, WorkspaceLanding
β β β βββ Architect/ # Chat interface, voice controls
β β β βββ LogoStudio/ # Logo generation workspace
β β β βββ Blocks/ # Reusable UI blocks
β β β βββ Analytics/ # Analytics dashboard
β β β βββ Forge.tsx # Image & video generation
β β β βββ Guardian.tsx # Pixel-precise asset auditing
β β β βββ Strategist.tsx # SWOT analysis interface
β β β βββ HITLControlCenter.tsx # Human-in-the-Loop controls
β β β βββ DynamicLandingPage.tsx # Live landing page preview
β β βββ hooks/ # useWebSocket, useAgent, useAudioStream, etc.
β β βββ stores/ # useBrandStore, useActivityStore (Zustand)
β β βββ context/ # BrandContext, WorkspaceContext
β β βββ services/gemini.ts # Client-side Gemini API calls
β βββ package.json
βββ server/ # Node.js Backend (Express + WebSocket)
β βββ src/
β β βββ agents/ # 15 agents total:
β β β βββ ResearchAgent.ts # Google Search Grounding
β β β βββ LogoStrategist.ts # Puppeteer-based logo research
β β β βββ HeroWatcher.ts # Hero section optimizer
β β β βββ ProofWatcher.ts # Social proof optimizer
β β β βββ OfferWatcher.ts # CTA/pricing optimizer
β β β βββ PASWatcher.ts # Problem-Agitate-Solve optimizer
β β β βββ SpecWatcher.ts # Features/specs optimizer
β β β βββ SocialWatcher.ts # Testimonials optimizer
β β β βββ Initial*Generator.ts (Γ6) # First-pass content generators
β β βββ gemini/
β β β βββ LiveConnection.ts # Gemini Live (voice)
β β β βββ BrainConnection.ts # Tool orchestrator (20+ tools)
β β β βββ ToolHandler.ts # Tool execution engine
β β β βββ SearchGroundingService.ts # Logo search via Grounding
β β βββ services/
β β β βββ ExecutionEngine.ts # Core asset generation
β β β βββ NanoBananaService.ts # Visual component system
β β β βββ StateManager.ts # Versioned state persistence
β β β βββ NotificationService.ts # HITL intervention queue
β β β βββ TelegramService.ts # Telegram bot integration
β β β βββ PushService.ts # Web Push (VAPID)
β β β βββ VideoGenerator.ts # Veo video generation
β β β βββ MetricsService.ts # Analytics/metrics
β β β βββ DatabaseService.ts # SQLite storage
β β βββ sessions/SessionManager.ts # WebSocket hub + broadcasting
β β βββ index.ts # Server entry point (1132 lines)
β βββ package.json
βββ shared/ # Shared types, constants, messages
β βββ types.ts # BrandDNA, ColorPalette, LogoInspiration, etc.
β βββ constants.ts # Model IDs, system instructions, audio config
β βββ messages.ts # WebSocket message type definitions
βββ Dockerfile # Multi-stage build (Chrome + Node 18)
βββ package.json # Root workspace config (client + server + shared)
-
The Bridge Architecture is powerful β Separating voice (Gemini Live) from tool orchestration (Brain) allows natural conversation while autonomously executing complex multi-step workflows.
-
Google Search Grounding enables real competitive intelligence β Unlike static databases, live search gives Beevo up-to-the-minute market awareness for brand positioning.
-
Autonomous agents need guardrails β The HITL system (with Telegram + Push + auto-proceed) is essential. Without it, the watchers can optimize in unexpected directions.
-
WebSocket state synchronization β Managing real-time state across multiple clients, 6 watcher agents, and the server required a custom
StateManagerwith EventEmitter-based broadcasting and versioned history. -
Puppeteer in Cloud Run β Running headless Chrome in containers required installing Google Chrome Stable, setting
PUPPETEER_EXECUTABLE_PATH, and using--single-processand--no-zygoteflags. Google Images also blocks scraping from datacenter IPs, requiring fallback to Search Grounding. -
Voice + Function Calling coordination β Gemini Live handles voice but doesn't call tools directly. We buffer transcripts and flush them to the Brain model, which then decides tool callsβrequiring careful pause/resume management to avoid audio interrupting tool execution.
-
Nano Banana visual fidelity β Generating structured JSON for complex layouts (SPLIT, CLOUDS, TRIPTYCH, FORENSIC_GRID) required extensive prompt engineering with layout-specific constraints passed to Gemini.
| Name | Role |
|---|---|
| Excel Asaph | Full-Stack Developer & Product Designer |
This project is licensed under the Apache License 2.0 β see the LICENSE file for details.
Beevo turns a 6-week branding process into 6 minutes.
Your Sovereign Virtual CMO β Available 24/7.
Built with Gemini π§