VoxSherpa TTS

Studio-quality offline neural text-to-speech for Android.
Hindi · English · British · Japanese · Chinese · and more — No cloud. No limits. No compromise.

🏆 Featured In

VoxSherpa TTS is listed in the official README of k2-fsa/sherpa-onnx — the core inference library powering this app.

Why VoxSherpa?

Most TTS apps make you choose between quality and privacy. Cloud-based tools like ElevenLabs sound incredible — but they require internet, send your text to remote servers, and charge per character.

VoxSherpa breaks that tradeoff.

It runs two professional-grade neural engines entirely on your device:

Engine	Quality	Speed	Best For
🧠 Kokoro-82M	Studio-grade · rivals ElevenLabs	Slower on budget hardware	Audiobooks, voiceovers, professional content
⚡ Piper / VITS	Natural · clear	Fast on any device	Daily use, quick synthesis

Screenshots

Generate	Models	Library	Settings

Features

🎙️ Dual Neural Engine

Kokoro-82M — 82 million parameter neural model. Multilingual support including Hindi, English, British English, French, Spanish, Chinese, Japanese and 50+ more languages. Same architecture used by top-tier commercial TTS services.
Piper / VITS — Fast, lightweight, natural. Generates speech in seconds on any Android device.

🔒 100% Offline & Private

All processing happens on your device
No internet required after model download
No account, no telemetry, no data collection
Your text never leaves your phone

📦 Model Management

Download models directly from the app
Import your own .onnx models from local storage
Multiple models installed simultaneously
Smart storage tracking

🎧 Audio Controls

Real-time waveform visualization
Adjustable speed and pitch
Play, pause, and replay generated audio
Export as WAV with correct sample rate per model

📚 Speech Library

Save all generated audio locally
Favorites system for quick access
View generation history with timestamps
Voice model attribution per recording

⚙️ Smart Settings

Smart Punctuation — natural pauses after sentence breaks
Emotion Tags — [whisper], [angry], [happy] support
Per-model voice selection (Kokoro supports 100+ speakers)
Theme-aware UI

Technical Architecture

User Text
    │
    ├─── Kokoro Engine (KokoroEngine.java)
    │         └── Sherpa-ONNX JNI → ONNX Runtime → CPU/NNAPI
    │                   └── kokoro-multi-lang-v1_0 (82M params, FP32)
    │
    └─── Piper / VITS Engine (VoiceEngine.java)
              └── Sherpa-ONNX JNI → ONNX Runtime → CPU
                        └── VITS model (language-specific)

Built with:

Sherpa-ONNX — on-device neural inference
Kokoro-82M — multilingual neural TTS model
Piper — fast local TTS
Android AudioTrack API — low-latency PCM playback

Performance

Generation speed depends entirely on your device's processor:

Device Tier	Kokoro	Piper
🟢 Flagship (Snapdragon 8 Gen 3)	~20–40 sec/min audio	~5 sec/min audio
🟡 Mid-range (8-core)	~60–90 sec/min audio	~10 sec/min audio
🔴 Budget (6-core)	~2–3 min/min audio	~20 sec/min audio

Kokoro prioritizes quality over speed by design. It uses the same 82M parameter architecture that powers premium commercial TTS — running it entirely offline on a mobile CPU is genuinely pushing the hardware limits.

Installation

🚀 Early Access (Production Review Pending)

Update: Thanks to the amazing support from this community, the 14-day closed testing is complete, and VoxSherpa TTS is currently under Production Review by Google Play! 🎉

While we wait for the app to go publicly live, you can still get Early Access to the stable V2.5 directly from the Play Store.

What's new in V2.5 (Stable):

🔊 System-wide TTS engine — use VoxSherpa in any app (Chrome, WhatsApp, etc.)
📄 PDF to Audio
📑 TXT to Audio
✨ Interactive mini-player, smoother UI, and improved audio generation

How to join Early Access:

Fill out the form below with your Google Play email.
I will manually add you to the early access list.
You will receive a direct Play Store link to install the app.

Source code for V2.5 will be pushed to the GitHub Main branch once the production version is officially live on the Play Store.

Model Import (Technical Users)

VoxSherpa supports importing custom .onnx models without any server:

Place your .onnx model + tokens.txt on device storage
Open Models tab → tap + → Import Local Model
Select your files

Compatible with any Sherpa-ONNX compatible TTS model.

Contributing

VoxSherpa is open source. Contributions welcome:

🐛 Bug reports via Issues
💡 Feature requests via Discussions
🔧 Pull requests for fixes and improvements

License

Copyright (C) 2025 CodeBySonu95

This program is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.

https://www.gnu.org/licenses/gpl-3.0.html

Acknowledgements

k2-fsa/sherpa-onnx — the inference engine that makes this possible
hexgrad/Kokoro-82M — the neural model behind studio-quality synthesis
rhasspy/piper — fast local TTS engine

Built with obsession. Runs without internet.

VoxSherpa — Because your voice deserves to stay yours.

Name		Name	Last commit message	Last commit date
Latest commit History 99 Commits
.github/workflows		.github/workflows
app		app
assets		assets
fastlane/metadata/android/en-US		fastlane/metadata/android/en-US
gradle/wrapper		gradle/wrapper
LICENSE		LICENSE
README.md		README.md
build.gradle		build.gradle
gradle.properties		gradle.properties
gradlew		gradlew
gradlew.bat		gradlew.bat
index.html		index.html
settings.gradle		settings.gradle

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VoxSherpa TTS

Studio-quality offline neural text-to-speech for Android.
Hindi · English · British · Japanese · Chinese · and more — No cloud. No limits. No compromise.

🏆 Featured In

Why VoxSherpa?

Screenshots

Features

🎙️ Dual Neural Engine

🔒 100% Offline & Private

📦 Model Management

🎧 Audio Controls

📚 Speech Library

⚙️ Smart Settings

Technical Architecture

Performance

Installation

🚀 Early Access (Production Review Pending)

Model Import (Technical Users)

Contributing

License

Acknowledgements

About

Uh oh!

Releases

Contributors 2

Languages

Folders and files

Latest commit

History

Repository files navigation

VoxSherpa TTS

Studio-quality offline neural text-to-speech for Android.Hindi · English · British · Japanese · Chinese · and more — No cloud. No limits. No compromise.

🏆 Featured In

Why VoxSherpa?

Screenshots

Features

🎙️ Dual Neural Engine

🔒 100% Offline & Private

📦 Model Management

🎧 Audio Controls

📚 Speech Library

⚙️ Smart Settings

Technical Architecture

Performance

Installation

🚀 Early Access (Production Review Pending)

Model Import (Technical Users)

Contributing

License

Acknowledgements

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Contributors 2

Languages

Studio-quality offline neural text-to-speech for Android.
Hindi · English · British · Japanese · Chinese · and more — No cloud. No limits. No compromise.