Inspiration
Working with a computer is a serious of tedious, manual clicks and keystrokes. We wanted to make the digital experience feel more like a superpower. We set out to build Pinscher: a system that turns your natural gestures and voice into a direct bridge to your operating system.
What it does
Pinscher is an AI-powered OS controller that lets you run your Mac hands-free.
- Vision Control: Use index-thumb pinches to click and drag, or ring-finger swipes to switch browser tabs.
- Voice Intent: Natural language processing that doesn't just transcribe, it understands. Say "Summarize this meeting," and Pinscher records, transcribes, and generates a PDF.
- Gemini Intelligence: A "Presenter Assistant" that generates slide decks (Marp) and speaker notes automatically from your recorded video sessions.
- Accessibility: Includes dwell-click and sticky-key modes for users with limited mobility.
How we built it
We built a multi-process architecture to handle real-time data streams:
- Frontend: A slick Electron dashboard for device configuration and gesture tuning.
- Backend: A Python core managed by Poetry, utilizing OpenCV for hand tracking and Vosk for offline-capable speech-to-text.
- Intelligence: Integrated the Gemini 2.0 Flash API to parse complex user intents and perform multi-modal video summarization.
- OS Link: A custom bridge that translates Python logic into low-level Mac system events for mouse and keyboard control.
Challenges we ran into
- Learning how to set up Electron for the first time
- Dependency issues regarding Vosk, which can work on certain versions of Python and processors
Accomplishments that we're proud of
- We successfully scrolled through a document and switched apps using nothing but a hand gesture
- We were able to play Wordle without touching our keyboard
- Creating a command that records a demo, muxes it, and generates a formatted PDF slide deck in under 60 seconds
What we learned
- Multimodal is difficult to manage
- Cursor anchors is more feasible since holding up your hand for long can be tiring in real life ## What's next for Pinscher
- Make it compatible across all other OS systems

Log in or sign up for Devpost to join the conversation.