See++

See++ is a web app that lets users analyze their surroundings using a webcam, voice input, and AI — powered by Google's Gemini multimodal model. You can ask questions by clicking a button or speaking, and See++ will respond with both on-screen and spoken answers.

🚀 Features

📷 Uses your webcam to capture live images
🔊 Converts AI responses to speech
🗣️ Accepts spoken input through voice recognition
🔄 Flip between front and rear cameras
🧠 Sends queries and images to Gemini for intelligent answers

⚙️ How to Set It Up (Step-by-Step)

1. Clone this repository

git clone https://github.com/ParthParikh04/See.git

2. Create a virtual environment

python -m venv venv
source venv/bin/activate

Note: Make sure this virtual environment (venv) is created within the project directory.

3. Install dependencies

pip install -r requirements.txt

4. Get a Gemini API key from Google

Visit https://makersuite.google.com/app
Sign in with your Google account
Click your profile icon (top right) → "API Key"
Copy the key

5. Store your API key securely

Create a .env file in the root of the project directory with the following contents:

GEMINI_API_KEY=your_api_key_here

🔐 Important: Keep your .env file secret — never share it or commit it to version control.

6. Run the Flask app

python app.py

7. Open it in your browser

Go to: http://localhost:5000

📱 Access the App on Your Phone (Optional with Ngrok)

Want to test See++ on your phone? You can use ngrok to tunnel your localhost and expose it to the internet:

🔧 Steps:

Install ngrok

If you haven't already, install it from https://ngrok.com/download, or use Homebrew:
```
brew install ngrok
```
Sign in and authenticate

Create an account on ngrok and get your auth token from your dashboard. Then run:
```
ngrok config add-authtoken your_token_here
```
Start your Flask app

In one terminal tab, run:
```
python app.py
```
Expose your local server with ngrok

In a new terminal tab, run:
```
ngrok http 5000
```

Get the public URL

ngrok will display a forwarding URL like:

Forwarding                    https://abc123.ngrok.io -> http://localhost:5000

Visit on your phone

Open the https://abc123.ngrok.io link on your phone (make sure your phone and computer are both online).

📸 Your phone will ask for access to the camera and mic — allow these for full functionality.

🧪 How to Use

Allow access to your webcam and microphone when prompted
Click a button to:
- 📄 Extract text from the camera view
- 🖼️ Describe the current scene
- 🎤 Use your voice to ask a question
The app will:
1. Capture a webcam frame
2. Send it (along with your question) to the Gemini API
3. Display and speak the response

🔐 Requirements

Python 3.7+
A modern browser (Chrome recommended)
Internet access
A Gemini API key from Google

👥 Contributors

Parth Parikh
Andria Wang
Colby Brown
James Martin

📄 License

MIT — use freely, modify creatively, and share responsibly!

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
backend		backend
static		static
templates		templates
.gitignore		.gitignore
Procfile		Procfile
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

See++

🚀 Features

⚙️ How to Set It Up (Step-by-Step)

1. Clone this repository

2. Create a virtual environment

3. Install dependencies

4. Get a Gemini API key from Google

5. Store your API key securely

6. Run the Flask app

7. Open it in your browser

📱 Access the App on Your Phone (Optional with Ngrok)

🔧 Steps:

🧪 How to Use

🔐 Requirements

👥 Contributors

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

See++

🚀 Features

⚙️ How to Set It Up (Step-by-Step)

1. Clone this repository

2. Create a virtual environment

3. Install dependencies

4. Get a Gemini API key from Google

5. Store your API key securely

6. Run the Flask app

7. Open it in your browser

📱 Access the App on Your Phone (Optional with Ngrok)

🔧 Steps:

🧪 How to Use

🔐 Requirements

👥 Contributors

📄 License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages