Inspiration
We wanted to build something that benefits people, specifically those who are visually impaired.
"What if we could give vision to those who cannot see?"
This question led us to build GeminEye, an AI-powered real-time navigation tool that enhances safety, inclusion, and independence for people with vision impairment.
What It Does
GeminEye is a real-time object detection application that:
- Identifies objects and obstacles in the environment.
- Provides navigation guidance with clear voice instructions.
- Enables hands-free use through voice-controlled interaction.
- Locates specific objects based on user commands (e.g., "Where is my backpack?").
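
As a rough illustration of the object-locating feature, a transcribed voice command could be turned into a model prompt along these lines. This is only a sketch: the command phrasings, the function names `extract_target` and `build_prompt`, and the prompt wording are all hypothetical and are not taken from the GeminEye code.

```python
import re
from typing import Optional

# Hypothetical command phrasings; the project's real command grammar is not documented.
_FIND_PATTERN = re.compile(
    r"^\s*(?:where\s+is|where's|find|locate)\s+"
    r"(?:(?:my|the|a|an)\s+)?(?P<target>.+?)\s*\??\s*$",
    re.IGNORECASE,
)


def extract_target(command: str) -> Optional[str]:
    """Return the object named in a 'where is ...' style command, else None."""
    match = _FIND_PATTERN.match(command)
    return match.group("target").lower() if match else None


def build_prompt(command: str) -> str:
    """Turn a transcribed voice command into a text prompt for the vision model."""
    target = extract_target(command)
    if target is None:
        # No specific object requested: fall back to general guidance.
        return "Describe obstacles ahead and give one short walking instruction."
    return (
        f"Look for this object in the image: {target}. If it is visible, say "
        "where it is relative to the camera (left, right, or ahead) in one "
        "short sentence. If it is not visible, say so."
    )
```

Keeping the reply to one short sentence matters here because the text is read aloud, and long answers would lag behind the next captured frame.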
How We Built It
We combined the following tools to make GeminEye fast, responsive, and accurate:
- AI Model: Gemini-1.5-Flash for scene analysis and guidance.
- OpenCV: Captures a frame from the live video feed every 2 seconds and passes it to the model.
- gTTS: Converts the AI-generated text responses into speech output for accessibility.
- Python: The core programming language that ties it all together.
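
The way these pieces fit together can be sketched as a simple loop: grab a frame, ask the model to describe it, speak the answer, wait 2 seconds. The code below is a minimal sketch under stated assumptions, not the project's actual implementation. `run_guidance_loop` and `text_to_mp3_bytes` are hypothetical names, the camera and model are passed in as plain callables, and audio playback is left out.

```python
import io
import time
from typing import Callable, Optional


def run_guidance_loop(
    grab_frame: Callable[[], Optional[bytes]],
    describe: Callable[[bytes], str],
    speak: Callable[[str], None],
    interval_s: float = 2.0,
    max_frames: Optional[int] = None,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Grab a frame, describe it, speak the result, then wait interval_s.

    Returns the number of frames that were described.
    """
    described = 0
    while max_frames is None or described < max_frames:
        frame = grab_frame()
        if frame is None:  # camera returned nothing; stop instead of spinning
            break
        text = describe(frame)
        if text:  # skip speaking when the model returned an empty answer
            speak(text)
        described += 1
        sleep(interval_s)
    return described


def text_to_mp3_bytes(text: str, lang: str = "en") -> bytes:
    """Synthesize speech with gTTS and return MP3 bytes (needs network access)."""
    from gtts import gTTS  # imported lazily so the loop has no hard dependency

    buffer = io.BytesIO()
    gTTS(text=text, lang=lang).write_to_fp(buffer)
    return buffer.getvalue()
```

Passing the camera, model, and speech steps in as callables keeps the loop easy to test with stand-ins, and the fixed `interval_s` of 2 seconds mirrors the capture rate described above.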
Challenges We Ran Into
1. Camera Access Issues in WSL
- Initially, we couldn't open the camera from the WSL terminal.
- We tested the camera with debug scripts and then switched to Windows CMD to enable the live camera feature.
2. Live API Limitations
- We couldn't integrate Gemini's Live API for real-time streaming.
- Instead, we captured a frame every 2 seconds to approximate live video streaming, without saving images to disk.
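
One way to send frames without writing them to disk is to JPEG-encode each frame in memory and attach the bytes to the request. The sketch below assumes OpenCV's `cv2.imencode` for the encoding and a model object exposing `generate_content()`, as the `google-generativeai` SDK's `GenerativeModel` does. The helper names are hypothetical, and the inline-image dict shape should be checked against the SDK version in use.

```python
from typing import Any, Dict


def frame_to_jpeg_bytes(frame: Any, quality: int = 80) -> bytes:
    """Encode an OpenCV BGR frame as JPEG in memory, without touching the disk."""
    import cv2  # lazy import: only needed when a real frame is encoded

    ok, buffer = cv2.imencode(".jpg", frame, [cv2.IMWRITE_JPEG_QUALITY, quality])
    if not ok:
        raise ValueError("OpenCV could not encode the frame as JPEG")
    return buffer.tobytes()


def as_image_part(jpeg_bytes: bytes) -> Dict[str, Any]:
    """Wrap raw JPEG bytes in an inline-image dict (assumed request shape)."""
    if not jpeg_bytes:
        raise ValueError("empty image payload")
    return {"mime_type": "image/jpeg", "data": jpeg_bytes}


def describe_frame(model: Any, prompt: str, jpeg_bytes: bytes) -> str:
    """Send one prompt plus one inline image and return the model's text reply.

    `model` is assumed to expose generate_content(), returning an object
    with a .text attribute.
    """
    response = model.generate_content([prompt, as_image_part(jpeg_bytes)])
    return response.text
```

With the real SDK, `model` would be something like `genai.GenerativeModel("gemini-1.5-flash")` after `genai.configure(api_key=...)`. A lower JPEG `quality` reduces upload size per frame at the cost of image detail.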
Accomplishments We're Proud Of
- Completed our first hackathon project!
- Overcame technical challenges such as API integration and real-time video analysis.
- Built a working prototype within a short timeframe!
What We Learned
- How to integrate APIs more effectively and process AI responses in near real time.
- How to optimize AI prompts to get the most useful responses.
- Persistence! We didn't give up despite multiple technical challenges.
What's Next for GeminEye?
Scaling Up & Expanding Features
- Smartphone & Wearable Integration: Making GeminEye accessible via smart glasses and mobile apps.
- Multilingual Support: Using the Google Translate API to make navigation more inclusive globally.
- GPS & Mapping: Integrating maps to enable self-navigation and location tracking.
- Live API Support: Using Gemini's Live API for smoother, real-time analysis.
- Color Recognition for Colorblind Users: Helping users distinguish colors in their surroundings.
Why GeminEye Matters
AI should empower everyone, not just those who can see.
GeminEye aims to turn a smartphone into an AI-powered guide, making cities more inclusive, smarter, and safer for all.