MemoMuse — Full Product Overview

Voice memo → Produced music demo → Vinyl on Shopify.

Record a rough idea, pick a genre, tweak with studio controls, get a mixed track with vocals and instrumentation, sell it as a vinyl record, and manage customer support — all in one app.

What is MemoMuse?

MemoMuse is an end-to-end music creation and commerce platform with three core features:

Music Studio — Turn voice memos into fully produced songs.
Vinyl Storefront — Publish and sell generated tracks as vinyl records on Shopify.
Voicemail Copilot (Customer Agent) — AI-powered customer support inbox, purpose-built for Shopify merchants.

Feature 1: Music Studio

The core pipeline transforms a rough voice idea into a fully mixed, produced track in six steps, two of which (session storage and lyric refinement) are optional.

How It Works

Step	Service	What happens
1. Transcribe	OpenAI Whisper (local, base)	Raw audio → text transcript
2. Analyze	Google Gemini 2.5 Flash	Transcript → song title, lyrics (Verse/Chorus, 16–24 lines), style prompt, mood, BPM, key, and `contains_lyrics` detection (real words vs humming)
3. Store session	Backboard.io REST API	Persists transcript, lyrics, genre, mood for memory across runs. Optional — pipeline continues if unavailable
4. Refine lyrics	Featherless AI (Qwen2.5-7B)	LLM polish pass on lyrics. Optional — original lyrics used if unavailable
5. Generate audio	Google Lyria Realtime + ElevenLabs	Instrumental: Lyria generates 60s at 48 kHz from the style prompt. Vocals: ElevenLabs TTS for lyrics, STS for humming (preserves melody). Both run in parallel. Falls back to instrumental-only if vocals fail
6. Mix + Post-process	pydub + FFmpeg	Both tracks normalized to −20 dBFS. Bass/treble EQ, pitch shift, vocal balance applied. Exported as final MP3
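The fallback behaviour in the table (optional steps that are skipped on failure, plus parallel instrumental and vocal generation) can be sketched as follows. This is a minimal illustration, not the project's actual code: `optional_step`, `generate_audio`, and the stub coroutines are hypothetical names, and the real service calls are replaced with stand-ins.

```python
import asyncio
from typing import Awaitable, Callable, Optional


async def optional_step(step: Callable[[], Awaitable[str]], fallback: str) -> str:
    """Run an optional step (e.g. lyric refinement); use the fallback if it raises."""
    try:
        return await step()
    except Exception:
        return fallback


async def generate_audio(
    instrumental: Callable[[], Awaitable[bytes]],
    vocals: Callable[[], Awaitable[bytes]],
) -> tuple[bytes, Optional[bytes]]:
    """Run both generators concurrently; only the vocals are allowed to fail."""
    inst, vox = await asyncio.gather(
        instrumental(), vocals(), return_exceptions=True
    )
    if isinstance(inst, BaseException):
        raise inst  # without an instrumental there is no track to return
    return inst, (None if isinstance(vox, BaseException) else vox)


async def _demo() -> tuple[str, bytes, Optional[bytes]]:
    # Hypothetical stubs standing in for the real services.
    async def refine_fails() -> str:
        raise RuntimeError("refinement service unavailable")

    async def instrumental_ok() -> bytes:
        return b"instrumental"

    async def vocals_fail() -> bytes:
        raise RuntimeError("vocal synthesis failed")

    lyrics = await optional_step(refine_fails, fallback="original lyrics")
    inst, vox = await generate_audio(instrumental_ok, vocals_fail)
    return lyrics, inst, vox


result = asyncio.run(_demo())
```

Here `result` holds the original lyrics, the instrumental bytes, and `None` for the vocals, which corresponds to the instrumental-only fallback described in step 5.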

Studio Controls

Control	Range	What it does
Voice	20+ ElevenLabs voices	Selects vocal character (male, female, various accents) with preview
Bass	-10 to +10	Low-frequency EQ — boosts/cuts under 250 Hz
Treble	-10 to +10	High-frequency EQ — boosts/cuts above 4 kHz
Pitch	-6 to +6 semitones	Shifts the final mix up/down via sample rate manipulation
Vocal Mix	-6 to +6 dB	Adjusts vocal/instrumental balance
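The arithmetic behind the Pitch and Vocal Mix controls, and the normalization target from the mixing step, can be sketched in a few lines. The helper names below are hypothetical and the code only shows the standard conversions; how the app applies them through pydub is not specified here.

```python
def semitones_to_rate_ratio(semitones: float) -> float:
    """Equal-temperament ratio: +12 semitones doubles the frequency."""
    return 2.0 ** (semitones / 12.0)


def shifted_frame_rate(frame_rate: int, semitones: float) -> int:
    """Frame rate to play the samples at in order to shift pitch by `semitones`."""
    return round(frame_rate * semitones_to_rate_ratio(semitones))


def db_to_linear_gain(db: float) -> float:
    """Convert a decibel change into an amplitude multiplier."""
    return 10.0 ** (db / 20.0)


def normalization_gain_db(current_dbfs: float, target_dbfs: float = -20.0) -> float:
    """Gain in dB needed to bring a track from its current level to the target."""
    return target_dbfs - current_dbfs


octave_up_rate = shifted_frame_rate(48000, 12)   # 96000
tritone_up_rate = shifted_frame_rate(48000, 6)   # 67882
vocal_trim = normalization_gain_db(-14.5)        # -5.5 dB to reach -20 dBFS
```

One consequence of shifting pitch purely through the sample rate is that duration changes too: at +6 semitones the ratio is about 1.414, so a 60 s track would play in roughly 42 s unless a separate time-stretch is applied.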

Frontend Features

Audio recording — MediaRecorder API with live waveform visualizer (40 animated bars).
Genre picker — 8 chips: Pop, Lo-Fi, Hip Hop, Cinematic, R&B, Indie Folk, Electronic, Jazz.
Pipeline progress — 4-step animated indicator (Transcribe → Analyze → Generate → Mix).
Live karaoke display — Lyrics highlighted line-by-line in sync with playback, auto-scrolling.
Vinyl record visual — CSS-animated spinning disc with song title and genre label.
Download MP3 — One-click download named after the song title.
Copy Lyrics — Copies generated lyrics to clipboard.

Feature 2: Vinyl Storefront (Shopify Integration)

Every generated track can be published and sold as a vinyl record product directly on your Shopify store — closing the loop from creative idea to real commerce.

One-click publish — The "Buy as Vinyl" button calls POST /api/publish, which uses the Shopify Admin API (write_products scope) to create a product instantly.
Auto-populated listing — Product includes song title, $24.99 price, full generated lyrics in description, and genre/mood/BPM/key metadata tagged under the "MemoMuse" vendor.
CDN-aware UX — Frontend waits 8 seconds after product creation before opening the Shopify store URL, giving the new product time to propagate so that it is live and purchasable when the page loads.
Storefront config — GET /api/config returns the Shopify storefront domain and token so the frontend can link directly to the product page.
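As an illustration of what the publish step might send, the sketch below builds a product payload in the shape accepted by the REST flavour of the Shopify Admin API (`POST /admin/api/{version}/products.json`, authenticated with the `X-Shopify-Access-Token` header). It assumes the REST endpoint is used; the helper names, the `product_type` value, and the tag format are illustrative assumptions, and the API version is left to the caller.

```python
import html

VINYL_PRICE = "24.99"   # price stated in the listing description
VENDOR = "MemoMuse"     # vendor stated in the listing description


def build_vinyl_product(
    title: str, lyrics: str, genre: str, mood: str, bpm: int, key: str
) -> dict:
    """Build the JSON body for creating one vinyl product."""
    # Escape the lyrics so they are safe to embed in the HTML description.
    lyrics_html = "<br>".join(html.escape(line) for line in lyrics.splitlines())
    return {
        "product": {
            "title": title,
            "body_html": f"<p>{lyrics_html}</p>",
            "vendor": VENDOR,
            "product_type": "Vinyl Record",  # assumption, not from the source
            "tags": ", ".join([genre, mood, f"{bpm} BPM", key]),
            "variants": [{"price": VINYL_PRICE}],
        }
    }


def admin_products_url(store_domain: str, api_version: str) -> str:
    """URL of the product-creation endpoint for a given store and API version."""
    return f"https://{store_domain}/admin/api/{api_version}/products.json"


payload = build_vinyl_product(
    "Night Drive", "line one\nline <two>", "Lo-Fi", "mellow", 84, "A minor"
)
url = admin_products_url("yourstore.myshopify.com", "2024-10")  # example version
```

The payload would then be sent as JSON with the Admin token in the request header; that network call is omitted here so the sketch stays self-contained.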

Result: Artists go from humming an idea to having a purchasable product on their Shopify store in under two minutes.

Feature 3: Voicemail Copilot (Customer Agent)

Voicemail Copilot is MemoMuse's AI-powered customer service feature — a support inbox that lets Shopify merchants receive, analyze, and respond to customer voicemails, then push resolved tickets back to Shopify without leaving the app.

User Flow

Open Voicemail Copilot — Visit /voicemail or click it in the nav.
Add a voicemail — Upload an audio file or use a built-in demo (e.g., "Angry Customer — Late Order").
Analyze — Click Analyze Voicemail; Gemini returns structured JSON and a new ticket appears in the inbox.
Review ticket — See summary, classification chips (intent, sentiment, urgency), original audio playback, full transcript, and a suggested reply.
Generate voice reply & push to Shopify — Edit the reply, click Generate Voice Reply (ElevenLabs TTS), then push the resolved ticket back to the Shopify store's customer service record.

Analysis Output

Field	Type	Description
transcript	string	Full text of the voicemail
intent	enum	ORDER_STATUS, RETURN, GENERAL_QUESTION, COMPLAINT, OTHER
sentiment	enum	POSITIVE, NEUTRAL, NEGATIVE
urgency	enum	LOW, MEDIUM, HIGH
summary	string	Short summary of the message
suggestedReply	string	AI-suggested reply text for the support agent
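Because the analysis arrives as model-generated JSON, it is worth validating before a ticket is created. The sketch below mirrors the fields and enum values in the table above; the class and function names are hypothetical, and the sample JSON is invented for illustration.

```python
import json
from dataclasses import dataclass
from enum import Enum


class Intent(str, Enum):
    ORDER_STATUS = "ORDER_STATUS"
    RETURN = "RETURN"
    GENERAL_QUESTION = "GENERAL_QUESTION"
    COMPLAINT = "COMPLAINT"
    OTHER = "OTHER"


class Sentiment(str, Enum):
    POSITIVE = "POSITIVE"
    NEUTRAL = "NEUTRAL"
    NEGATIVE = "NEGATIVE"


class Urgency(str, Enum):
    LOW = "LOW"
    MEDIUM = "MEDIUM"
    HIGH = "HIGH"


@dataclass(frozen=True)
class VoicemailAnalysis:
    transcript: str
    intent: Intent
    sentiment: Sentiment
    urgency: Urgency
    summary: str
    suggested_reply: str


def parse_analysis(raw: str) -> VoicemailAnalysis:
    """Parse the JSON text; raises KeyError on a missing field and
    ValueError on malformed JSON or an unknown enum value."""
    data = json.loads(raw)
    return VoicemailAnalysis(
        transcript=str(data["transcript"]),
        intent=Intent(data["intent"]),
        sentiment=Sentiment(data["sentiment"]),
        urgency=Urgency(data["urgency"]),
        summary=str(data["summary"]),
        suggested_reply=str(data["suggestedReply"]),
    )


sample = json.dumps({
    "transcript": "My order is two weeks late.",
    "intent": "COMPLAINT",
    "sentiment": "NEGATIVE",
    "urgency": "HIGH",
    "summary": "Customer reports a late order.",
    "suggestedReply": "We're sorry for the delay and are looking into it.",
})
ticket = parse_analysis(sample)
```

Rejecting unknown enum values early keeps the classification chips in the ticket view limited to the documented set.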

Full API Reference

Method	Path	Description
GET	`/`	Single-page frontend (Music Studio)
POST	`/generate`	Accepts audio + genre + studio JSON, runs pipeline, returns track data
GET	`/audio/{filename}`	Serves generated MP3 files from temp/
GET	`/api/voices`	Returns available ElevenLabs voices (id, name, gender, accent, preview URL)
POST	`/api/publish`	Creates a vinyl product on Shopify; returns product_url
GET	`/api/config`	Returns Shopify storefront domain + token
GET	`/voicemail`	Serves Voicemail Copilot SPA
POST	`/api/voicemail/analyze`	Analyzes voicemail audio via Gemini
POST	`/api/voicemail/tts`	Generates ElevenLabs voice reply
POST	`/api/voicemail/shopify-push`	Pushes resolved ticket to Shopify

Sponsor Integrations

Google — Gemini + Lyria

Gemini 2.5 Flash: Powers both Music Studio analysis (lyrics, mood, BPM, key) and Voicemail Copilot classification (intent, sentiment, urgency, suggested reply).
Lyria Realtime: Generates 60-second instrumentals via async WebSocket from a style prompt + BPM.

ElevenLabs — Voices + TTS + STS

TTS (eleven_multilingual_v2): Synthesizes lyrics into vocals and generates voice replies for Copilot.
STS (eleven_multilingual_sts_v2): When the user hums, preserves melody while applying the selected voice.
Voice Library: 20+ voices with metadata and preview URLs, cached after first call.
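The choice between TTS and STS, and the cache-after-first-call behaviour of the voice list, can be sketched as below. The model IDs come from the list above; everything else (function names, the stub fetch, the sample voice record) is a hypothetical stand-in for the real ElevenLabs requests.

```python
from functools import lru_cache

TTS_MODEL = "eleven_multilingual_v2"
STS_MODEL = "eleven_multilingual_sts_v2"


def pick_vocal_mode(contains_lyrics: bool) -> tuple[str, str]:
    """Real words are synthesized from text; humming is converted voice-to-voice."""
    return ("tts", TTS_MODEL) if contains_lyrics else ("sts", STS_MODEL)


fetch_calls = {"count": 0}  # counts how often the (stubbed) API is hit


def _fetch_voices_from_api() -> tuple[dict, ...]:
    """Hypothetical stand-in for the real voice-library request."""
    fetch_calls["count"] += 1
    return (
        {"id": "voice-1", "name": "Example", "gender": "female",
         "accent": "british", "preview_url": "https://example.com/preview.mp3"},
    )


@lru_cache(maxsize=1)
def get_voices() -> tuple[dict, ...]:
    """Return the voice list, fetching it only on the first call."""
    return _fetch_voices_from_api()


first = get_voices()
second = get_voices()  # served from the cache; no second fetch
```

A process-lifetime cache like this never refreshes, which is fine for a short-lived demo server; a longer-running deployment would probably want an expiry.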

Shopify — Admin API

Music Studio: Publishes generated tracks as $24.99 vinyl record products.
Voicemail Copilot: Pushes resolved replies back to the store's customer service record.
Scope: Uses an OAuth-acquired Admin API token with the write_products scope.

Tech Stack

Component	Technology
Runtime	Python 3, FastAPI, uvicorn
Transcription	OpenAI Whisper (local, base model)
LLM Analysis	Google Gemini 2.5 Flash
Instrumental	Google Lyria Realtime (experimental)
Vocals	ElevenLabs TTS / STS + Voice Library
Lyric Refinement	Featherless AI (Qwen2.5-7B-Instruct)
Session Memory	Backboard.io REST API
E-Commerce	Shopify Admin API
Audio Processing	pydub + FFmpeg
Frontend	Vanilla HTML/CSS/JS, MediaRecorder API

Setup & Environment

Environment Variables

GEMINI_API_KEY                       # Google AI Studio — Gemini + Lyria
ELEVENLABS_API_KEY                   # ElevenLabs TTS/STS + voice library
ELEVENLABS_VOICE_ID                  # Optional — default voice ID
FEATHERLESS_API_KEY                  # Optional — lyric refinement
BACKBOARD_API_KEY                    # Optional — session memory
SHOPIFY_ADMIN_TOKEN                  # Shopify Admin API (write_products scope)
NEXT_PUBLIC_SHOPIFY_STORE_DOMAIN     # e.g. yourstore.myshopify.com
SHOPIFY_STOREFRONT_TOKEN             # Shopify Storefront API token
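A startup check along these lines can catch missing configuration before the first request. It is a sketch only: `load_config` is a hypothetical helper, and it assumes that every variable not marked Optional above is required, which may be stricter than the app actually is.

```python
import os
from typing import Mapping, Optional

REQUIRED = (
    "GEMINI_API_KEY",
    "ELEVENLABS_API_KEY",
    "SHOPIFY_ADMIN_TOKEN",
    "NEXT_PUBLIC_SHOPIFY_STORE_DOMAIN",
    "SHOPIFY_STOREFRONT_TOKEN",
)
OPTIONAL = ("ELEVENLABS_VOICE_ID", "FEATHERLESS_API_KEY", "BACKBOARD_API_KEY")


def load_config(env: Mapping[str, str] = os.environ) -> dict[str, Optional[str]]:
    """Collect settings, failing fast if a required variable is unset or empty."""
    missing = [name for name in REQUIRED if not env.get(name)]
    if missing:
        raise RuntimeError(
            "Missing required environment variables: " + ", ".join(missing)
        )
    config: dict[str, Optional[str]] = {name: env[name] for name in REQUIRED}
    # Optional variables map to None when absent, so callers can skip that step.
    config.update({name: env.get(name) or None for name in OPTIONAL})
    return config


example_env = {name: "placeholder" for name in REQUIRED}
config = load_config(example_env)
```

With only the required variables set, the optional entries come back as `None`, matching the pipeline's behaviour of skipping lyric refinement and session memory when those services are not configured.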

Running the App

# Install dependencies
pip install -r requirements.txt

# Ensure ffmpeg is installed
brew install ffmpeg        # macOS
sudo apt install ffmpeg    # Ubuntu/Debian

# Start the server
python main.py             # serves on http://localhost:8000