🎙️ AI Voice Agent — Real-Time Gemini + Murf Falcon 🚀

A simple real-time AI voice agent built entirely using React, Google Gemini, and Murf Falcon TTS — with no backend server required. Everything runs directly in the browser.

Users speak to the agent through the microphone. The browser's speech recognition converts the speech to text, Gemini generates a response, and Murf Falcon speaks it back in a natural voice.

🚀 Features

🎤 Real-time microphone capture
🔊 Large audio visualizer using Web Audio API
🧠 AI responses powered by Google Gemini
🗣️ Ultra-fast TTS using Murf Falcon
💬 Conversation history UI
🌙 Black-themed, minimal, modern interface
🎛️ Fully client-side — no backend, no server, no deployments needed

🛠️ Technologies Used

Frontend

React
JavaScript + JSX
Web Audio API (Visualizer)
Web Speech API (speech recognition; support varies by browser)
CSS for styling

AI Services

Google Gemini API → LLM responses
Murf Falcon API → Text-to-Speech voice output

No backend required

All API calls are made directly from the browser.

📁 Project Structure

/src
   App.jsx
   components/
      AudioVisualizer.jsx
      MicButton.jsx
      ChatBubble.jsx
   utils/
      gemini.js
      murf.js
      audio.js
   styles.css
index.html
package.json
README.md

Note: Your actual file names may differ — this is a general overview.

🔧 Setup & Installation

Clone the project

git clone https://github.com/your-username/your-repo.git
cd your-repo

Install dependencies

npm install

Add your API keys

Create a .env file in the project root, or paste the values directly into your config file:

REACT_APP_GEMINI_API_KEY=YOUR_GEMINI_KEY
REACT_APP_MURF_API_KEY=YOUR_MURF_KEY
REACT_APP_MURF_API_URL=YOUR_MURF_ENDPOINT
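
A minimal sketch of how these variables might be read in the app, assuming a Create React App style build, which exposes variables prefixed with REACT_APP_ on process.env at build time. The helper name loadConfig and the returned field names are hypothetical and not taken from the project.

```javascript
// Hypothetical helper: collects the three settings from an env-like object
// and fails early if any of them is missing.
function loadConfig(env) {
  const required = [
    "REACT_APP_GEMINI_API_KEY",
    "REACT_APP_MURF_API_KEY",
    "REACT_APP_MURF_API_URL",
  ];
  const missing = required.filter((name) => !env[name]);
  if (missing.length > 0) {
    throw new Error(`Missing environment variables: ${missing.join(", ")}`);
  }
  return {
    geminiApiKey: env.REACT_APP_GEMINI_API_KEY,
    murfApiKey: env.REACT_APP_MURF_API_KEY,
    murfApiUrl: env.REACT_APP_MURF_API_URL,
  };
}

// In the app this would be called as: const config = loadConfig(process.env);
```

Failing early with the list of missing names makes a forgotten key visible at startup rather than on the first API call.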

Run the project

npm start

The app will start at:

👉 http://localhost:3000

▶️ How It Works (Behind the Scenes)

1️⃣ Speech Input

User presses Mic button
Browser captures mic audio
Web Speech API converts speech → text
Audio visualizer animates in real-time
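
The speech-to-text step above can be sketched roughly as follows. This is an illustration under assumptions, not the project's actual code: the function name startListening, its callbacks, and the "en-US" language setting are hypothetical. It uses the standard SpeechRecognition interface and falls back to the webkit-prefixed constructor that Chrome exposes.

```javascript
// Sketch: start one-shot speech recognition and pass the transcript to a callback.
// `win` is the browser's window object; it is a parameter so the function
// can also be exercised outside a browser.
function startListening(win, onTranscript, onError) {
  const Recognition = win.SpeechRecognition || win.webkitSpeechRecognition;
  if (!Recognition) {
    onError(new Error("Speech recognition is not supported in this browser"));
    return null;
  }
  const recognition = new Recognition();
  recognition.lang = "en-US";         // assumed language
  recognition.interimResults = false; // deliver final results only
  recognition.continuous = false;     // stop after one utterance

  recognition.onresult = (event) => {
    // The first alternative of the first result holds the transcript.
    const transcript = event.results[0][0].transcript;
    onTranscript(transcript.trim());
  };
  recognition.onerror = (event) => onError(new Error(event.error));

  recognition.start();
  return recognition;
}
```

In a component this might be called as startListening(window, handleText, handleError) from the Mic button's click handler. When it returns null, the UI can fall back to typed input.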

2️⃣ AI Response (Gemini)

Text is sent to Gemini’s API
Gemini returns a natural language reply
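
As a rough illustration of this step, the sketch below posts the recognized text to the Gemini REST generateContent endpoint and extracts the reply text. The function name askGemini, the options object, and the default model name are assumptions made for the example; the project may use a different model or the official SDK instead.

```javascript
// Sketch: send the recognized text to Gemini and return the reply text.
// `options.fetchImpl` defaults to the browser's fetch; the model is an assumption.
async function askGemini(text, apiKey, options = {}) {
  const model = options.model || "gemini-1.5-flash"; // assumed model name
  const fetchImpl = options.fetchImpl || fetch;
  const url =
    "https://generativelanguage.googleapis.com/v1beta/models/" +
    `${model}:generateContent`;

  const response = await fetchImpl(url, {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      "x-goog-api-key": apiKey,
    },
    body: JSON.stringify({
      contents: [{ role: "user", parts: [{ text }] }],
    }),
  });
  if (!response.ok) {
    throw new Error(`Gemini request failed with status ${response.status}`);
  }
  const data = await response.json();
  // A reply can be split across several parts; join them into one string.
  const parts = data.candidates?.[0]?.content?.parts ?? [];
  return parts.map((part) => part.text ?? "").join("");
}
```

Passing fetch in as an option keeps the function easy to test with a stub and leaves room to add retries or timeouts later.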

3️⃣ Voice Output (Murf Falcon TTS)

Reply is sent to Murf Falcon
Murf generates high-quality voice audio
Audio plays in the browser using AudioContext
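
The playback half of this step can be sketched as below. It assumes the Murf response has already been fetched and read as an ArrayBuffer of encoded audio (for example with response.arrayBuffer()); the request itself is not shown because its shape depends on the endpoint configured in REACT_APP_MURF_API_URL. The function name playAudio is hypothetical.

```javascript
// Sketch: play encoded audio bytes (e.g. MP3 or WAV) through an AudioContext.
// Resolves with the clip duration in seconds once playback has finished.
async function playAudio(audioContext, arrayBuffer) {
  // Browsers keep an AudioContext suspended until a user gesture occurs.
  if (audioContext.state === "suspended") {
    await audioContext.resume();
  }
  const audioBuffer = await audioContext.decodeAudioData(arrayBuffer);
  const source = audioContext.createBufferSource();
  source.buffer = audioBuffer;
  source.connect(audioContext.destination);

  return new Promise((resolve) => {
    source.onended = () => resolve(audioBuffer.duration);
    source.start();
  });
}
```

Because the returned promise settles when playback ends, the caller can use it to switch the speaking indicator on before the call and off after it.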

4️⃣ UI Updates

User + AI messages appear in chat bubbles
Playback indicator shows speaking state
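
One possible way to model these UI updates is a small reducer, for example with React's useReducer hook. The state shape, action names, and the names initialState and chatReducer are assumptions for illustration only.

```javascript
// Sketch: conversation state as a plain reducer, usable with React's useReducer.
const initialState = { messages: [], speaking: false };

function chatReducer(state, action) {
  switch (action.type) {
    case "message":
      // Append a chat bubble; role is "user" or "ai".
      return {
        ...state,
        messages: [...state.messages, { role: action.role, text: action.text }],
      };
    case "speaking":
      // Switch the playback indicator on or off.
      return { ...state, speaking: action.value };
    default:
      return state;
  }
}
```

A component would dispatch { type: "message", role: "user", text } after recognition, the same action with role "ai" after Gemini replies, and { type: "speaking", value: true } or false around audio playback.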

🧪 Testing the Agent

Open the app
Allow microphone permission
Press 🎤 Start Mic
Say: “Hello, how are you?”
Gemini generates a reply
Murf speaks it back
Repeat!

💡 Tips

Chrome gives the best performance for the Web Speech API
Use short inputs for the fastest Murf response
If speech recognition fails, type into the text box and hit Send

🙌 Credits

Google Gemini for LLM intelligence
Murf Falcon for ultra-fast TTS voices
React for the UI
You — for building an awesome voice agent 🎉

📢 License

MIT — free to use and modify.

🚀 Keep Building!

This is your Day-1 foundation for a complete multi-day voice agent series.
You can now extend it with:

custom personas
emotional prosody
tools integration
memory
multi-modal responses
translations

The future of voice AI is in your hands 💛
