Home Screen
Sample chatting with a persona
Gemini vision capabilities
Gemini image generation
Voice call feature

AlterEcho

AlterEcho is a digital persona system that allows you to "resurrect" or clone digital personalities from chat history and voice samples. By analyzing linguistic patterns from Chat data logs and cloning voice profiles using WaveSpeed, AlterEcho creates a hyper-realistic AI companion that speaks and texts exactly like the original person.

Inspiration

Time passes and never comes back. This is what makes "memories" truly priceless. There are countless people in the world that would do so much, just to be able to relive a past experience. That is what inspired us to create something capable of bringing back something lost in time, something people thought can only continue to live in their memories.

Features

Linguistic Style Hyper-Profiling: Mathematically deconstructs chat logs to replicate vocabulary, sarcasm patterns, and emotional tone using Gemini's massive context window.
Voice Cloning: Integrates WaveSpeed MiniMax 2.6 Turbo for high-fidelity voice synthesis.
Multi-Source Import: Supports WhatsApp(.txt), Line(.txt), Instagram (.json) and Discord (.zip) chat exports.
Hybrid Memory: Combines static style guides with RAG (Retrieval Augmented Generation) for accurate long-term memory.
Local-First Architecture: Your data stays on your machine (Stored in cache).

Powered by Gemini

1. Style Hyper-Profiling (Long Context)

We leverage Gemini's massive context window to feed thousands of chat messages into a single prompt.

Embeddings: using gemini's Embedding model, AlterEcho is able to create a vector database to efficiently store and read needed information.

2. Native Function Calling

The chatbot isn't just text. It uses Gemini Function Calling to autonomously interact with the world.

Tools: We define tools like generate_or_edit_image. Users don't need to type special commands. If you say "Send me a selfie" or "Make that picture darker", Gemini understands the intent ("Is this a tool call?") and executes the function naturally.

3. Multimodal Vision

AlterEcho can see.

Visual Understanding: You can upload images to the chat, and Gemini will analyze them and react in character (e.g., "Omg that cat is so cute!!" or "Where did you buy those shoes??").
Image Editing: It maintains an "Image History", allowing it to reference, understand, and even edit previous images in the conversation multimodally.

4. Low-Latency Streaming

For the voice feature (StreamChat), speed is critical.

Pipeline: As soon as Gemini generates the first few words, they are cleaned and piped directly into the WaveSpeed TTS engine. This results in a near-instant conversational flow that feels like a real phone call.

Tech Stack

Frontend: React 19, TailwindCSS v4, Radix UI, Framer Motion
Backend: Python 3.10+, Flask
AI: Google Gemini (Logic/Text/Vision/Image/Embeddings), WaveSpeed MiniMax (Voice)

All AI Models used in AlterEcho

Chatbot models : gemini-flash-latest, gemini-3-flash-preview, gemini-flash-lite-latest
Preprocessing models : gemini-3-pro-preview, gemini-2.5-pro, gemini-flash-latest, gemini-3-flash-preview, gemini-embedding-001
Imagen : gemini-2.5-flash-image, gemini-3-pro-image-preview
Voice cloning : WaveSpeed MiniMax 2.6 Turbo

Built With

Updates

Jorvan Low started this project — Feb 05, 2026 06:44 AM EST

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.