🎙️ VoiceGrab

🎥 Watch 15-sec Demo

Free Voice-to-Text for Windows: No Subscription Required

Record voice → Get text instantly → Paste anywhere

🚀 Quick Start • ✨ Features • 📥 Download


⚡ Why VoiceGrab?

VoiceGrab is a free, open-source alternative to Dragon NaturallySpeaking ($300) and Windows Voice Typing. It uses the Groq Whisper API for professional-grade transcription in 57 languages.

Not a browser extension. It works system-wide in any Windows app: VS Code, Cursor, Slack, Word, ChatGPT, Telegram.

🔥 Key Benefits

| You get | Why it matters |
| --- | --- |
| 100% Free Tier | Groq's free API = ~8 hours/day of dictation |
| Speak 3x faster than typing | Voice prompts for ChatGPT, Claude, Copilot |
| Auto-Translation | Speak Russian → Type English (DeepL/Groq) |
| Smart Cleaning | Removes "um", "uh" and Whisper hallucinations |
| 5 Dev Modes | Prompts for coding, docs, notes, chat |

Note: Gemini 2.0 integration is experimental (in development). Primary stable engines: Groq (transcription) + DeepL (translation).


🎥 Demo

Voice-to-Text with Real-time Translation:

demo.mp4

(15 sec: Dictating in Russian → Auto-translates to English → Pastes into MS Word)


🚀 Quick Start

Option A: Download Release

  1. Download Latest Release (ZIP)
  2. Get a free API key at console.groq.com/keys
  3. Run VoiceGrab.bat
  4. Enter key → Install Deps → Run
  5. Press Right Ctrl to dictate! 🎉

Option B: Clone

git clone https://github.com/consulfedor/VoiceGrab.git
cd VoiceGrab

Usage

| Action | How |
| --- | --- |
| Start/Stop Recording | Right Ctrl |
| Switch Mode | Click ◀ Mode ▶ on indicator |
| Settings | Right-click tray → Settings |

💡 Tip: The text is always copied to the clipboard. Press Ctrl+V to paste it anywhere.


✨ Features

Core

  • 🎤 One-Click Recording: Press Right Ctrl (configurable)
  • ♾️ Unlimited Recording: Auto-segments every 3 min
  • 📋 Auto-Paste: Text types directly into active window
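
As a rough illustration of the "auto-segments every 3 min" idea, here is a minimal sketch that cuts a long recording into 3-minute chunks so each one can be sent for transcription separately. The `Segment` type and `split_into_segments` function are hypothetical names chosen for this example, not VoiceGrab's actual code.

```python
from dataclasses import dataclass

SEGMENT_SECONDS = 180  # "every 3 min" from the feature list


@dataclass(frozen=True)
class Segment:
    index: int
    start: float  # seconds from the start of the recording
    end: float    # seconds from the start of the recording


def split_into_segments(total_seconds: float,
                        segment_seconds: float = SEGMENT_SECONDS) -> list[Segment]:
    """Return consecutive segments that together cover the whole recording."""
    if segment_seconds <= 0:
        raise ValueError("segment_seconds must be positive")
    segments: list[Segment] = []
    start = 0.0
    while start < total_seconds:
        # The last segment is shorter when the recording is not a multiple of 3 min.
        end = min(start + segment_seconds, total_seconds)
        segments.append(Segment(len(segments), start, end))
        start = end
    return segments
```

For example, `split_into_segments(400)` yields three segments: two full 3-minute chunks and a final 40-second one.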

Transcription

  • ⚡ Groq Whisper API: Fast, accurate, FREE tier
  • 🧹 Filler Cleanup: Removes "um", "uh", "like"
  • 👻 Hallucination Filter: Removes AI "ghost" phrases
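
The two cleanup features above could be sketched roughly as follows. This is an assumption-laden example, not the project's implementation: the phrase list is illustrative, and because "like" is also an ordinary word, the sketch only drops it when it is set off by commas.

```python
import re

# Illustrative phrases only. Whisper is known to emit lines like these on
# silence, but VoiceGrab's real list is not given in this README.
HALLUCINATED_PHRASES = ("thanks for watching", "subscribe to my channel")

# "um"/"uh" (with optional surrounding commas) are always removed.
_UM_UH = re.compile(r",?\s*\b(?:um+|uh+)\b,?", re.IGNORECASE)
# "like" is removed only as a comma-delimited filler: "it was, like, huge".
_LIKE = re.compile(r",\s*like\s*,", re.IGNORECASE)


def _tidy(text: str) -> str:
    """Collapse runs of whitespace left behind by removals."""
    return re.sub(r"\s{2,}", " ", text).strip()


def remove_hallucinations(text: str, phrases=HALLUCINATED_PHRASES) -> str:
    for phrase in phrases:
        text = re.sub(re.escape(phrase) + r"[.!]*", "", text, flags=re.IGNORECASE)
    return _tidy(text)


def remove_fillers(text: str) -> str:
    text = _UM_UH.sub("", text)
    text = _LIKE.sub("", text)
    return _tidy(text)


def clean_transcript(text: str) -> str:
    return remove_fillers(remove_hallucinations(text))
```

With these rules, `clean_transcript("Um, open the, uh, settings file. Thanks for watching!")` returns `"open the settings file."`, while a sentence such as "I like umbrellas" is left untouched.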

Translation

  • 🌐 Auto-Translate: Off / Replace / Append modes
  • 🔤 DeepL Integration: Quality translation
  • 🌍 Groq AI Translate: Fast AI translation

UI

  • 🖥️ System Tray: Runs in background
  • 💡 Floating Indicator: Shows mode + recording status
  • ⚙️ Dark Settings Panel: All options per mode

⚙️ Configuration

5 Modes (fully customizable)

| Mode | Use Case |
| --- | --- |
| 🤖 AI Chat | ChatGPT prompts, AI conversations |
| 💻 Code | Programming, technical terms |
| 📋 Docs | Documentation, formal writing |
| 📝 Notes | Quick notes, ideas |
| 💬 Chat | Casual conversation |

Per-Mode Settings

  • Language (57 options)
  • Prompt (context hint for Whisper)
  • Filler/Hallucination filters
  • Auto-Translate (Off/Replace/Append)
  • Translation engine (Groq/DeepL)
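
To make the per-mode settings more concrete, here is a hedged sketch of how one mode's options might be modeled and serialized. The field names and the `ModeSettings` class are assumptions made for illustration; the actual schema used by VoiceGrab's config file is not documented here.

```python
import json
from dataclasses import asdict, dataclass

VALID_AUTO_TRANSLATE = {"off", "replace", "append"}
VALID_ENGINES = {"groq", "deepl"}


@dataclass
class ModeSettings:
    """Hypothetical container for the per-mode options listed above."""
    name: str
    language: str = "en"                 # one of the 57 language options
    prompt: str = ""                     # context hint passed to Whisper
    remove_fillers: bool = True
    filter_hallucinations: bool = True
    auto_translate: str = "off"          # off / replace / append
    translation_engine: str = "groq"     # groq / deepl

    def __post_init__(self) -> None:
        if self.auto_translate not in VALID_AUTO_TRANSLATE:
            raise ValueError(f"unknown auto_translate mode: {self.auto_translate!r}")
        if self.translation_engine not in VALID_ENGINES:
            raise ValueError(f"unknown translation engine: {self.translation_engine!r}")


# Example: a "Code" mode that biases Whisper toward technical vocabulary.
code_mode = ModeSettings(name="Code", prompt="Python, Git, API, refactoring")
code_mode_json = json.dumps(asdict(code_mode), indent=2)
```

Validating in `__post_init__` means a typo such as `auto_translate="apend"` fails immediately rather than at dictation time.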

📦 Requirements

  • Windows 10/11
  • Python 3.10+ (Download)

    ⚠️ Check "Add Python to PATH" during installation!

  • Microphone

📋 Changelog

v2.3.2 (2026-01-06)

  • 🛡️ Config Protection: Prevents saving empty/NULL config
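
A guard of this kind could look roughly like the sketch below. It is an assumption about the general technique (validate first, then write atomically), not the code shipped in v2.3.2, and `save_config` is a hypothetical name.

```python
import json
import os
import tempfile
from pathlib import Path


def save_config(config, path) -> None:
    """Write config as JSON, refusing empty or null values."""
    if not isinstance(config, dict) or not config:
        raise ValueError("refusing to save an empty or null config")
    path = Path(path)
    # Write to a temp file in the same folder, then swap it in, so a crash
    # mid-write cannot leave a truncated config behind.
    fd, tmp_name = tempfile.mkstemp(dir=path.parent, suffix=".tmp")
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as handle:
            json.dump(config, handle, ensure_ascii=False, indent=2)
        os.replace(tmp_name, path)
    except BaseException:
        if os.path.exists(tmp_name):
            os.remove(tmp_name)
        raise
```

Because validation happens before any file is touched, a rejected save leaves the existing config exactly as it was.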

v2.3.1 (2025-12-31)

  • 🛠️ NULL Eval Fix: Fixed startup error

v2.3.0 (2025-12-31)

  • 🔧 Settings Fix: All fields save correctly
  • 📁 Single Default Config: config_default.json

Older versions

v2.0.0 - v2.2.0

  • 🏷️ Custom Mode Names
  • 💡 Tab Tooltips
  • ⚙️ Mode Priority System

v1.5.0

  • 🌐 Auto-Translate (Groq/DeepL)
  • 🔤 Translator Tool

v1.4.0

  • 📁 Batch Transcription
  • 📄 Document Converter

v1.3.0

  • 🌍 57 Languages support

❓ FAQ

Is it really free?

Yes! Groq offers a generous FREE tier:

  • ~10 requests per minute
  • ~25,000 audio seconds per day (~8 hours)
  • No credit card required
Which languages are supported?

57 languages including: Russian, English, Ukrainian, Turkish, Spanish, French, German, Chinese, Japanese.

Does it work offline?

No, requires internet. Audio is processed in cloud and deleted immediately.


🐛 Troubleshooting

| Issue | Solution |
| --- | --- |
| Python not found | Reinstall, check "Add to PATH" |
| No microphone | Check Windows sound settings |
| Rate limit | Wait 1 min, or use Turbo model |
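
The "wait 1 min" advice for rate limits can also be automated. The following is a minimal sketch under stated assumptions: `RateLimitError` is a hypothetical exception that your own API wrapper would raise on an HTTP 429 response, and `call_with_retry` is an illustrative helper, not part of VoiceGrab.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class RateLimitError(Exception):
    """Hypothetical: raised by the caller's API wrapper on HTTP 429."""


def call_with_retry(request: Callable[[], T],
                    wait_seconds: float = 60.0,
                    max_attempts: int = 3,
                    sleep: Callable[[float], None] = time.sleep) -> T:
    """Run request(); on a rate limit, wait and try again up to max_attempts."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return request()
        except RateLimitError:
            if attempt == max_attempts:
                raise  # out of attempts, surface the error to the caller
            sleep(wait_seconds)
    raise AssertionError("unreachable")
```

The `sleep` parameter exists so the waiting can be replaced in tests; in normal use the default `time.sleep` pauses for one minute between attempts.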

🛠️ Tech Stack

  • Core: Python 3.10+
  • UI: PowerShell + WebView2 (HTML/CSS)
  • APIs: Groq (Whisper), DeepL, Gemini (dev)

📄 License

MIT License: free for personal and commercial use.


⭐ Star this repo if you find it useful!

Report Bug β€’ Join Telegram
