Method-AI Voice Actor

A Vertex AI-powered rehearsal engine that transforms dry text into vivid character performances using Google Cloud and ElevenLabs.

🎭 The Method AI Experience

Method AI is a "Rehearsal Engine" that takes dry text (documentation, scripts, manuals) and performs it in a specific character persona. It uses:

Google Vertex AI (Gemini 3.0 Flash) for fast, intelligent script rewriting with method acting techniques (with optional Gemini 3.0 Pro for "Deep Rehearsal" mode)
ElevenLabs for premium voice synthesis and vocal performance
Google Cloud Run for scalable deployment

Features

🎭 Method Actor Rewriting: Transforms text into character-specific performances while maintaining factual accuracy
⚡ Gemini 3.0 Flash: The default model, chosen for speed so rewrites feel close to real-time conversation
🧠 Deep Rehearsal Mode: Optional Gemini 3.0 Pro mode for highly analytical script breakdowns
🎬 Three Personas: Noir Detective, SoCal Surfer, and 1920s News Anchor (easily extensible)
🗣️ Voice Synthesis: High-quality text-to-speech with ElevenLabs premium voices
🎨 Three-Column Studio: Intuitive interface - Script, Director's Chair, and Performance
🌐 Modern Web Stack: React frontend with Node.js/Express backend
☁️ Cloud-Ready: Designed for Google Cloud Run deployment

Tech Stack

Backend

Node.js with Express
Google Vertex AI with Gemini 3.0 Flash (default) and Gemini 3.0 Pro (Deep Rehearsal mode)
ElevenLabs Text-to-Speech API
CORS and environment variable support

Frontend

React with Vite
React Router for navigation
Axios for API calls
ElevenLabs React SDK
Modern CSS with responsive three-column layout

Project Structure

method-ai-voice-actor/
├── backend/              # Node.js Express backend
│   ├── src/
│   │   ├── config/      # API configuration files
│   │   ├── routes/      # Express route handlers
│   │   ├── services/    # Business logic services
│   │   └── index.js     # Main server file
│   ├── .env.example     # Environment variables template
│   └── package.json
│
├── frontend/            # React frontend
│   ├── src/
│   │   ├── components/  # Reusable React components
│   │   ├── pages/       # Page components
│   │   ├── services/    # API service functions
│   │   └── App.jsx      # Main app component
│   ├── .env.example     # Environment variables template
│   └── package.json
│
└── README.md

Getting Started

Prerequisites

Node.js (v18 or higher)
npm or yarn
Google Cloud account with Vertex AI API enabled
ElevenLabs API account

Installation

Clone the repository

git clone https://github.com/wildhash/method-ai-voice-actor.git
cd method-ai-voice-actor

Set up the backend

cd backend
npm install
cp .env.example .env

Edit .env and add your configuration:

PORT=3001

# For Vertex AI (Method AI)
GCP_PROJECT_ID=your_gcp_project_id
GCP_REGION=us-central1

# For ElevenLabs
ELEVENLABS_API_KEY=your_elevenlabs_api_key_here

# Legacy Gemini API (for Classic Studio)
GEMINI_API_KEY=your_gemini_api_key_here
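
A missing value in this file typically only surfaces on the first API call. The following is a minimal sketch of a fail-fast check, assuming the variable names above. `loadConfig` is a hypothetical helper that is not part of the repository, and treating `GEMINI_API_KEY` as optional is an assumption based on its "Legacy" comment.

```javascript
// Hypothetical helper: validates the variables from .env before the server starts.
// Which keys are required and which are optional is an assumption, not project behavior.
const REQUIRED_KEYS = ["GCP_PROJECT_ID", "GCP_REGION", "ELEVENLABS_API_KEY"];

function loadConfig(env) {
  // Collect every required key that is absent or blank so all problems are reported at once.
  const missing = REQUIRED_KEYS.filter((key) => !env[key] || env[key].trim() === "");
  if (missing.length > 0) {
    throw new Error(`Missing required environment variables: ${missing.join(", ")}`);
  }
  return {
    port: Number(env.PORT) || 3001, // falls back to the port used in this README
    gcpProjectId: env.GCP_PROJECT_ID,
    gcpRegion: env.GCP_REGION,
    elevenLabsApiKey: env.ELEVENLABS_API_KEY,
    geminiApiKey: env.GEMINI_API_KEY || null, // only needed for Classic Studio
  };
}
```

Calling `loadConfig(process.env)` once the .env file has been loaded would stop the server early with a message that names every missing variable.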

Authenticate with Google Cloud (for Vertex AI)

# Install gcloud CLI if not already installed
# Then authenticate:
gcloud auth application-default login

# Set your project
gcloud config set project YOUR_PROJECT_ID

Set up the frontend

cd ../frontend
npm install
cp .env.example .env

Edit .env if needed:

VITE_API_BASE_URL=http://localhost:3001/api

Running the Application

Start the backend server
```
cd backend
npm run dev
```
The server will run on http://localhost:3001.

Start the frontend development server (in a new terminal)
```
cd frontend
npm run dev
```
The frontend will run on http://localhost:5173.

Open your browser and navigate to http://localhost:5173 to use the application.

API Endpoints

Backend API

Health Check

GET /api/health - Check if the server is running

Method AI (The Director)

POST /api/gemini/perform - Transform text using Method Actor technique

{
  "text": "Your raw text here",
  "personaKey": "Full persona description including character traits, speech patterns, etc."
}

Returns:

{
  "script": "The rewritten text in character"
}
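
As an illustration of the request and response shapes above, here is a small client sketch. `buildPerformRequest` and `perform` are hypothetical names, the default base URL is the local development value from this README, and error handling beyond the HTTP status is left out.

```javascript
// Builds the request for POST /api/gemini/perform using the documented JSON fields.
// Hypothetical helper; the default base URL matches the local setup in this README.
function buildPerformRequest(text, personaDescription, apiBase = "http://localhost:3001/api") {
  if (typeof text !== "string" || text.trim() === "") {
    throw new Error("text must be a non-empty string");
  }
  return {
    url: `${apiBase}/gemini/perform`,
    options: {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({ text, personaKey: personaDescription }),
    },
  };
}

// Sends the request with the global fetch (Node 18+ or a browser) and returns the script.
async function perform(text, personaDescription) {
  const { url, options } = buildPerformRequest(text, personaDescription);
  const response = await fetch(url, options);
  if (!response.ok) {
    throw new Error(`perform failed with HTTP ${response.status}`);
  }
  const { script } = await response.json();
  return script;
}
```

Keeping the request builder separate from the network call makes the payload easy to inspect without a running backend.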

Gemini AI (Classic Studio)

POST /api/gemini/rewrite - Rewrite text in character voice

{
  "text": "Your text here",
  "characterPrompt": "Character description"
}

POST /api/gemini/generate-dialogue - Generate character dialogue

{
  "scenario": "Scene description",
  "characterName": "Character name",
  "characterTraits": "Character traits"
}
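
The two Classic Studio payloads can be checked on the client before they are sent. This is a sketch only: `missingFields` is a hypothetical helper, and whether the backend treats every listed field as mandatory is an assumption, since the endpoint descriptions do not say.

```javascript
// JSON fields per Classic Studio endpoint, as listed above.
// Treating all of them as required is an assumption.
const CLASSIC_ENDPOINTS = {
  "/gemini/rewrite": ["text", "characterPrompt"],
  "/gemini/generate-dialogue": ["scenario", "characterName", "characterTraits"],
};

// Hypothetical helper: returns the names of fields that are missing or blank.
function missingFields(path, payload) {
  const required = CLASSIC_ENDPOINTS[path];
  if (!required) {
    throw new Error(`Unknown endpoint: ${path}`);
  }
  return required.filter(
    (field) => typeof payload[field] !== "string" || payload[field].trim() === ""
  );
}
```

For example, `missingFields("/gemini/generate-dialogue", { scenario: "Dock at night" })` reports `characterName` and `characterTraits`, so the form can prompt for them before a request is made.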

Voice Synthesis

POST /api/voice/synthesize - Convert text to speech

{
  "text": "Text to synthesize",
  "voiceId": "voice_id_from_elevenlabs"
}

GET /api/voice/voices - Get available voices

Usage

Method AI Studio (The Main Feature)

Navigate to Method Studio at http://localhost:5173/method
The Three-Column Interface:
- Left: The Script - Paste your raw text (documentation, scripts, manuals)
- Center: The Director's Chair - Select your persona and hit REHEARSE
- Right: The Performance - View the rewritten text and play the audio
Choose a Persona:
- Gritty Noir Detective - Cynical 1940s private eye
- SoCal Surfer - Laid-back, enthusiastic beach dude
- 1920s Transatlantic News Anchor - Fast-talking, high-energy reporter
Toggle Deep Rehearsal (optional): Enable for more analytical script breakdown using Gemini 3.0 Pro (slower but more sophisticated)
Click REHEARSE: The system will rewrite your text in character using Gemini 3.0 Flash (or Pro if Deep Rehearsal is enabled) and generate audio
Listen and Download: Play the generated audio performance
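
The REHEARSE step chains two of the documented endpoints, which can be sketched as follows. This assumes that the persona's `systemPrompt` is what gets sent as `personaKey`. The `rehearse` function and the injected `postJson` callback are hypothetical, the Deep Rehearsal toggle is omitted because its request field is not documented here, and the synthesis response is passed through untouched because its format is not documented either.

```javascript
// Sketch of the REHEARSE flow: rewrite the text in character, then synthesize audio.
// `postJson(path, body)` is injected so the flow can run against the backend or a stub.
async function rehearse(rawText, persona, postJson) {
  // Step 1: the Director rewrites the raw text in the selected persona.
  const { script } = await postJson("/gemini/perform", {
    text: rawText,
    personaKey: persona.systemPrompt, // assumption: the full description is sent as personaKey
  });

  // Step 2: the rewritten script goes to voice synthesis with the persona's voice.
  const audio = await postJson("/voice/synthesize", {
    text: script,
    voiceId: persona.elevenLabsVoiceId,
  });

  // The Performance column needs both the text and the audio.
  return { script, audio };
}
```

Injecting the transport keeps the ordering logic testable: a stub can return a canned script and confirm that the same script is what reaches the synthesis call.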

Classic Studio

Navigate to Classic Studio at http://localhost:5173/studio
Choose a Mode:
- Rewrite Text: Transform existing text into a character voice
- Generate Dialogue: Create new character dialogue from scratch
Generate Content: Use AI to create character-specific text
Synthesize Speech: Convert the text to audio with a selected voice
Listen and Download: Play the generated audio

Development

Adding New Personas

Personas are defined in frontend/src/personas.js. To add a new persona:

export const PERSONAS = {
  // ... existing personas
  
  your_persona_key: {
    label: "Display Name for Your Persona",
    systemPrompt: "Detailed character description including speech patterns, vocabulary, tone, and any character-specific traits...",
    elevenLabsVoiceId: "voice_id_from_elevenlabs"
  }
};
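
A new entry is easy to get subtly wrong, for instance by leaving out the voice ID. The sketch below shows one way to catch that. `validatePersonas` is a hypothetical helper, the field names follow the example above, and the sample persona and its voice ID are placeholders rather than real data.

```javascript
// Hypothetical check for entries in PERSONAS; field names follow the example above.
const PERSONA_FIELDS = ["label", "systemPrompt", "elevenLabsVoiceId"];

function validatePersonas(personas) {
  const problems = [];
  for (const [key, persona] of Object.entries(personas)) {
    for (const field of PERSONA_FIELDS) {
      // Every field must be a non-blank string.
      if (typeof persona[field] !== "string" || persona[field].trim() === "") {
        problems.push(`${key}: missing or empty "${field}"`);
      }
    }
  }
  return problems;
}

// Illustrative personas only; the voice ID is a placeholder, not a real ElevenLabs ID.
const examplePersonas = {
  pirate_captain: {
    label: "Pirate Captain",
    systemPrompt: "A boisterous 18th-century sea captain who speaks in nautical slang.",
    elevenLabsVoiceId: "voice_id_from_elevenlabs",
  },
  broken: { label: "No prompt yet" },
};

console.log(validatePersonas(examplePersonas));
```

Here the complete `pirate_captain` entry passes, while `broken` is reported twice, once for `systemPrompt` and once for `elevenLabsVoiceId`.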

Finding ElevenLabs Voice IDs:

Visit ElevenLabs Voice Library
Choose a voice that matches your persona
Copy the voice ID from the voice details

Backend Development

cd backend
npm run dev  # Runs with nodemon for auto-reload

Frontend Development

cd frontend
npm run dev  # Runs Vite dev server with HMR

Building for Production

Backend

cd backend
npm start

Frontend

cd frontend
npm run build
npm run preview  # Preview production build

Deployment to Google Cloud

Deploying to Cloud Run

Enable Required APIs

gcloud services enable run.googleapis.com
gcloud services enable aiplatform.googleapis.com

Deploy Backend

cd backend
gcloud run deploy method-ai-backend \
  --source . \
  --region us-central1 \
  --allow-unauthenticated \
  --set-env-vars GCP_PROJECT_ID=YOUR_PROJECT_ID,GCP_REGION=us-central1,ELEVENLABS_API_KEY=YOUR_KEY

Deploy Frontend

cd frontend
npm run build
gcloud run deploy method-ai-frontend \
  --source . \
  --region us-central1 \
  --allow-unauthenticated \
  --set-env-vars VITE_API_BASE_URL=https://your-backend-url/api

Note: Vite inlines VITE_* variables into the bundle at build time, so VITE_API_BASE_URL must be available when the frontend is built (for example in frontend/.env). Setting it only as a Cloud Run runtime variable does not change an already-built bundle.

For detailed Cloud Run deployment instructions, see Google Cloud Run Documentation.

API Keys

Getting Google Gemini API Key

Visit Google AI Studio
Create a new API key
Add it to your backend .env file as GEMINI_API_KEY (used by Classic Studio)

Getting ElevenLabs API Key

Sign up at ElevenLabs
Navigate to your profile settings
Copy your API key
Add it to your backend .env file as ELEVENLABS_API_KEY

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

ISC

Support

For issues and questions, please open an issue on GitHub.
