ADAM — Autonomous Desktop AI Module

ADAM is an AI-powered desktop assistant built by DGEN Technologies Pvt. Ltd. Powered by the Google Gemini Live API, it features real-time voice conversation, emotion-driven face animations, live camera vision, and a persistent memory system.

Project Structure

ADAM/
├── adam_live_v19_attention.py   # ★ Latest — v19.1: Gemini Live + camera + smart attention
├── adam_live_v18_camera.py      # v18: Gemini Live + OpenCV camera + face recognition
├── adam_live_v17.py             # v17: Gemini Live + Flask face UI + WebSocket
├── adam_live_v9.py              # v9:  Gemini Live API (standalone, no camera)
├── adam_live_v9_legacy.py       # v9:  Legacy variant with Google Search tool
├── adam_voice_elevenlabs.py     # Classic: Google STT + ElevenLabs TTS
├── adam_native_audio.py         # Native audio: speech-segmented Gemini input
├── wake_word_vosk.py            # Wake word detector (Vosk offline model)
├── wake_word_google.py          # Wake word detector (Google Speech Recognition)
├── adam_face.html               # Face animation UI (served via Flask)
├── system_prompt.txt            # ADAM's full personality & behaviour prompt
├── adam_memory.json             # Persistent conversation memory (auto-generated)
├── requirements_native_audio.txt
├── README_native_audio.md
└── design/
    └── media/
        ├── body.jpeg
        └── generated_design.jpeg

Quick Start

Latest version (recommended)

pip install --upgrade google-genai pyaudio python-dotenv websockets flask opencv-python Pillow

Set your API key:

# Linux / macOS
export GOOGLE_API_KEY="your_key_here"

# Windows PowerShell
$env:GOOGLE_API_KEY = "your_key_here"
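
A missing or blank key is an easy thing to overlook before launching. The sketch below shows one way to check for it up front. It is not taken from the ADAM scripts: the helper name `resolve_api_key` is hypothetical, and the only detail borrowed from this README is the `GOOGLE_API_KEY` variable name.

```python
import os


def resolve_api_key(env=os.environ, name="GOOGLE_API_KEY"):
    """Return the key stored under `name`, or raise if it is missing or blank.

    Hypothetical helper for illustration; the ADAM scripts may read the
    key differently.
    """
    key = env.get(name, "").strip()
    if not key:
        raise RuntimeError(f"{name} is not set; export it before starting ADAM")
    return key
```

Calling `resolve_api_key()` with no arguments reads the real process environment. Passing a plain dict instead makes the check easy to exercise without touching your shell.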

Run:

python adam_live_v19_attention.py

Classic voice-only version

pip install speechrecognition elevenlabs playsound python-dotenv google-generativeai
python adam_voice_elevenlabs.py

Version History

| File | Version | Key Features |
|------|---------|--------------|
| `adam_live_v19_attention.py` | v19.1 | Camera + smart attention (face gaze, wake word, timeout) |
| `adam_live_v18_camera.py` | v18 | Camera + face recognition + persistent visual memory |
| `adam_live_v17.py` | v17 | Gemini Live + face UI (WebSocket + Flask) |
| `adam_live_v9.py` | v9 | Gemini Live API + session resumption + voice picker |
| `adam_native_audio.py` | — | Native audio with speech-segmentation |
| `adam_voice_elevenlabs.py` | — | Classic STT + ElevenLabs TTS pipeline |
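
Because the version number is embedded in each `adam_live_v<N>` filename, the newest script can be picked programmatically. The following is a minimal sketch under that naming assumption. The helper `latest_script` is hypothetical and not part of the repository, and the filenames are copied from the table above.

```python
import re

# Filenames copied from the Version History table.
SCRIPTS = [
    "adam_live_v19_attention.py",
    "adam_live_v18_camera.py",
    "adam_live_v17.py",
    "adam_live_v9.py",
    "adam_native_audio.py",
    "adam_voice_elevenlabs.py",
]

# Captures the major version number right after the "adam_live_v" prefix.
_VERSION = re.compile(r"^adam_live_v(\d+)")


def latest_script(names):
    """Return the adam_live_v<N> filename with the highest N, or None."""
    versioned = []
    for name in names:
        match = _VERSION.match(name)
        if match:
            versioned.append((int(match.group(1)), name))
    # Tuples compare by version number first, so max() picks the newest.
    return max(versioned)[1] if versioned else None
```

The version is compared as an integer because a plain string sort would rank `v9` above `v19`. Files without a version in their name, such as `adam_native_audio.py`, are ignored.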

Built by

DGEN Technologies Pvt. Ltd. — Kolkata, India
"Innovate. Integrate. Inspire." | Made in India.

Website: dgentechnologies.com
Twitter/X: @dgen_tec
Instagram: @dgen_technologies
LinkedIn: dgentechnologies
