๐Ÿš€ Inspiration

Our inspiration was the need to benefit humanity, specifically targeting visually impaired individuals.
"What if we could give vision to those who cannot see?"
This question led us to build GeminEyeโ€”an AI-powered real-time navigation tool that enhances safety, inclusion, and independence for people with vision impairment.


๐ŸŽฏ What It Does

GeminEye is a real-time object detection application that:
โœ”๏ธ Identifies objects & obstacles in the environment.
โœ”๏ธ Provides navigation guidance with clear voice instructions.
โœ”๏ธ Enables hands-free use with voice-controlled interaction.
โœ”๏ธ Locates specific objects based on user commands (e.g., "Where is my backpack?").


๐Ÿ› ๏ธ How We Built It

We used cutting-edge AI technology to make GeminEye fast, responsive, and accurate:

  • ๐Ÿค– AI Model: Gemini-1.5-Flash for scene analysis & guidance.
  • ๐ŸŽฅ OpenCV: Captures live video frames every 2 seconds for smooth AI feedback.
  • ๐Ÿ—ฃ๏ธ gTTS: Converts AI-generated text responses into speech output for accessibility.
  • ๐Ÿ”ค Python: The core programming language that ties it all together.

๐Ÿ›‘ Challenges We Ran Into

1๏ธโƒฃ Camera Access Issues in WSL

๐Ÿ”น Initially, we couldn't open the camera using the WSL terminal.
๐Ÿ”น We tested the camera with debug scripts and switched to Windows CMD to enable the live camera feature.

2๏ธโƒฃ Live API Limitations

๐Ÿ”น We couldn't integrate Geminiโ€™s Live API for real-time streaming.
๐Ÿ”น Instead, we captured frames every 2 seconds to simulate live video streaming without saving images to disk.


๐Ÿ† Accomplishments We're Proud Of

โœ… Completed our first hackathon project!
โœ… Overcame technical challenges like API integration & real-time video analysis.
โœ… Built a working prototype within a short timeframe!


๐Ÿ“š What We Learned

๐Ÿ”น Better API integration and real-time AI processing.
๐Ÿ”น Optimizing AI prompts to get the most useful responses.
๐Ÿ”น Persistence! We didnโ€™t give up despite multiple technical challenges.


๐Ÿ”ฎ What's Next for GeminEye?

๐Ÿš€ Scaling Up & Expanding Features

  • ๐Ÿ“ฑ Smartphone & Wearable Integration: Making GeminEye accessible via smart glasses & mobile apps.
  • ๐ŸŒ Multilingual Support: Using Google Translate API to make navigation more inclusive globally.
  • ๐Ÿ—บ๏ธ GPS & Mapping: Integrating maps to enable self-navigation & location tracking.
  • ๐ŸŸข Live API Support: Using Geminiโ€™s Live API for real-time, smoother analysis.
  • ๐ŸŽจ Color Recognition for Colorblind Users: Helping users distinguish colors in their surroundings.

๐Ÿ”ฅ Why GeminEye Matters?

๐Ÿ’ก AI should empower everyoneโ€”not just those who can see.
๐Ÿ’ก GeminEye turns a smartphone into an AI-powered guide, making cities more inclusive, smarter, and safer for all.

Built With

Share this project:

Updates