christian-ochei/VisionUsingSpatialAudio
🦇 Real-time Spatial Pulse Engine

Sensory Augmentation for the Visually Impaired

The "Pulse" is an advanced assistive augmented reality system designed to bridge the gap between traditional white canes and full environmental awareness. By converting a live camera feed into high-fidelity 3D spatial audio, it allows visually impaired users to "hear" the geometry of a room, detect obstacles at head height, and navigate dynamic environments using echolocation principles.


🌟 Key Features

  • 🔊 Hyper-Realistic Spatial Audio: Uses a custom CUDA-accelerated physics engine to simulate Doppler shift, wall reflections, and room reverb in real-time.
  • 👁️ Visual-to-Audio Mapping:
    • Pitch: Maps vertical elevation (High Pitch = Floor, Low Pitch = Ceiling/Obstacles).
    • Volume: Maps horizontal (XY) proximity. Objects at feet sound as loud as objects at head level.
  • 🚀 Hybrid Compute Architecture:
    • Vision (iGPU): Runs Optical Flow (RAFT) and Depth Estimation on integrated graphics to prevent stalls.
    • Audio (RTX GPU): Dedicated high-priority CUDA streams for sub-15ms audio synthesis.
  • 📍 Precision Tracking: Fuses Visual Odometry (RAFT) with IMU data for drift-free 6DoF head tracking.
  • 🛡️ Safety-First Design: "Fail-Loud" architecture instantly cuts audio if tracking is lost, preventing false confidence.
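The pitch/volume/pan mapping above can be sketched in a few lines. This is an illustrative sketch, not the repo's actual code: the constants (`BASE_HZ`, ranges) and function names are assumptions chosen to match the rules stated in the feature list.

```python
import math

# Illustrative constants -- tune to taste; not taken from the repo.
BASE_HZ = 440.0      # pitch for an object exactly at head height
PITCH_SPAN = 1.0     # octaves swept across the full vertical range
MAX_RANGE_M = 2.0    # vertical offsets beyond this are clamped
MAX_DIST_M = 5.0     # horizontal distance at which a source falls silent

def elevation_to_pitch(dy):
    """dy = landmark height minus head height, in metres.
    Below the head (dy < 0) -> HIGH pitch; above (dy > 0) -> LOW pitch."""
    t = max(-1.0, min(1.0, dy / MAX_RANGE_M))
    return BASE_HZ * 2.0 ** (-t * PITCH_SPAN)

def proximity_to_gain(dx, dz):
    """Volume depends only on horizontal distance (the README's "XY"
    proximity), so an object at your feet is as loud as one at head level."""
    d = math.hypot(dx, dz)
    return max(0.0, 1.0 - d / MAX_DIST_M)

def azimuth_to_pan(dx, dz):
    """Stereo pan in [-1, 1]: negative = left, positive = right,
    with dz the forward axis and dx the lateral axis."""
    return max(-1.0, min(1.0, math.atan2(dx, dz) / (math.pi / 2)))
```

For example, a landmark 2 m below the head maps to 880 Hz (one octave up), while one 2 m above maps to 220 Hz (one octave down).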

🏗️ System Architecture

The system follows a strict Sense-Process-Sonify pipeline, distributed across hardware to maximize throughput.

graph TD
    A[📱 Camera & IMU] -->|720p @ 30FPS| B(Visual Pipeline - iGPU)
    B -->|Optical Flow & Depth| C{Fusion Engine}
    D[🧠 Head Tracking] -->|Rotation Matrices| C
    C -->|3D Landmarks| E(Physics Engine - RTX GPU)
    E -->|Wave Propagation| F[🎧 Binaural Audio Output]
    
    style B fill:#f9f,stroke:#333,stroke-width:2px
    style E fill:#bbf,stroke:#333,stroke-width:2px
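The Sense-Process-Sonify split in the diagram is a classic staged producer/consumer design: each stage runs on its own thread (or GPU stream) and hands results downstream through a queue, so a slow vision frame never stalls audio synthesis. A minimal CPU-only sketch of that pattern (stage names and functions are illustrative, not from the repo):

```python
import queue
import threading

_STOP = object()  # sentinel that flushes the pipeline on shutdown

def stage(fn, inbox, outbox):
    """Pull items, transform them, push downstream; forward the stop sentinel."""
    while True:
        item = inbox.get()
        if item is _STOP:
            outbox.put(_STOP)
            return
        outbox.put(fn(item))

def run_pipeline(frames, vision_fn, physics_fn):
    """Sense -> Process -> Sonify as two decoupled worker threads."""
    q_in, q_mid, q_out = queue.Queue(), queue.Queue(), queue.Queue()
    workers = [
        threading.Thread(target=stage, args=(vision_fn, q_in, q_mid)),
        threading.Thread(target=stage, args=(physics_fn, q_mid, q_out)),
    ]
    for w in workers:
        w.start()
    for f in frames:          # the camera thread would feed q_in live
        q_in.put(f)
    q_in.put(_STOP)
    results = []
    while True:
        item = q_out.get()
        if item is _STOP:
            break
        results.append(item)  # the real system plays these, not collects them
    for w in workers:
        w.join()
    return results
```

In the real system `vision_fn` would be the iGPU optical-flow/depth step and `physics_fn` the RTX wave-propagation step; bounded queues (`queue.Queue(maxsize=...)`) would add back-pressure so stale frames get dropped instead of piling up.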

🛠️ Installation & Setup

Prerequisites

  • OS: Windows 10/11 or Linux (Ubuntu 20.04+)
  • GPU: NVIDIA RTX 2060 or higher (6GB+ VRAM)
  • Python: Version 3.9 (Strict requirement for CuPy/PyTorch compatibility)
  • Drivers: CUDA Toolkit 11.8+

Step-by-Step Guide

  1. Clone the Repository

    git clone https://github.com/YourUsername/SpatialPulse.git
    cd SpatialPulse
  2. Install Core Dependencies (PyTorch with CUDA)

    pip install torch torchvision --index-url https://download.pytorch.org/whl/cu118
  3. Install Acceleration Libraries

    pip install cupy-cuda11x opencv-python sounddevice numpy scipy
  4. Download Model Weights

     Ensure raft-small.pth and your depth estimation weights are placed in the models/ directory.

  5. Configure Camera

    • Install IPWebcam (or compatible app) on your Android device.
    • Set resolution to Low/Medium (the visual pipeline is capped at ~11 FPS, so higher resolutions are wasted bandwidth).
    • Start server and note the IP (e.g., http://192.168.1.5:8080).
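Reading that phone stream from OpenCV looks roughly like this. The /video path is the MJPEG endpoint the IP Webcam app exposes; the helper names and the example IP are illustrative, not part of the repo.

```python
def stream_url(host, port=8080):
    """Build the MJPEG endpoint URL exposed by the IP Webcam app."""
    return f"http://{host}:{port}/video"

def open_capture(host, port=8080):
    # OpenCV is imported lazily so the URL helper stays usable without it.
    import cv2
    cap = cv2.VideoCapture(stream_url(host, port))
    if not cap.isOpened():
        raise RuntimeError("Could not reach the phone camera; check the IP and Wi-Fi.")
    return cap

if __name__ == "__main__":
    cap = open_capture("192.168.1.5")      # the IP shown by the app
    ok, frame = cap.read()                 # frame is a BGR numpy array
    print("got frame:" , ok, frame.shape if ok else None)
    cap.release()
```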

🚀 Usage

1. Calibration (Critical Step)

Before running the main engine, you must calibrate the camera to ensure the floor/ceiling pitch mapping is accurate.

    python calibrate.py
  • Follow the on-screen prompts to capture the chessboard pattern.
  • Press S to save the calibration matrix.
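For reference, a standard OpenCV chessboard calibration of the kind calibrate.py performs looks like the sketch below. The 9×6 pattern size, 25 mm square size, and output filename are assumptions, not values read from the repo.

```python
import numpy as np

def chessboard_object_points(cols, rows, square_mm):
    """3D grid of inner-corner positions for one chessboard view (Z = 0 plane)."""
    objp = np.zeros((rows * cols, 3), np.float32)
    objp[:, :2] = np.mgrid[0:cols, 0:rows].T.reshape(-1, 2) * square_mm
    return objp

def calibrate(images, cols=9, rows=6, square_mm=25.0):
    """Estimate the camera matrix K and distortion coefficients from
    a list of BGR chessboard captures."""
    import cv2  # imported lazily so the grid helper works without OpenCV
    objp = chessboard_object_points(cols, rows, square_mm)
    obj_pts, img_pts, size = [], [], None
    for img in images:
        gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
        found, corners = cv2.findChessboardCorners(gray, (cols, rows))
        if found:
            obj_pts.append(objp)
            img_pts.append(corners)
            size = gray.shape[::-1]
    rms, K, dist, _, _ = cv2.calibrateCamera(obj_pts, img_pts, size, None, None)
    np.savez("camera_calib.npz", K=K, dist=dist)  # hypothetical output name
    return rms, K, dist
```

A low reprojection RMS (well under 1 px) is what makes the floor/ceiling pitch mapping trustworthy.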

2. Start the Engine

Run the main pipeline script. Ensure your headphones are connected before starting.

    python vsa_pipeline.py

3. Understanding the Audio Cues (User Guide)

| Sound Characteristic | Environmental Meaning | Action Required |
| --- | --- | --- |
| Steady Pulse | System is active and tracking. | Safe to proceed slowly. |
| High Pitch 🐦 | Object is BELOW you (Floor). | Normal walking surface. |
| Low Pitch 🐻 | Object is ABOVE you (Ceiling/Overhang). | DUCK! Danger at head height. |
| Loud Volume 🔊 | Object is horizontally CLOSE (XY distance). | Stop or change direction. |
| Reverb/Echo 🏰 | You are near a wall or corner. | Use as a navigational landmark. |
| Left/Right Pan ↔️ | Obstacle direction. | Steer away from the sound. |

⚠️ SAFETY WARNING: This software is a prototype. Do not use in traffic, construction zones, or high-risk environments. Always carry a white cane as a backup.


📂 File Structure Breakdown

  • vsa_pipeline.py
    • The Brain. Orchestrates the threads, manages GPU streams, and handles the main event loop.
  • highp_spatial_landmarks.py
    • The Physics Engine. Contains the CUDA kernels for Doppler shift, HRIR convolution, and room acoustics.
  • tracker.py
    • The Eyes. Handles RAFT optical flow and feature tracking. Runs primarily on the iGPU/CPU to save RTX resources.
  • depth_client.py & depth_check.py
    • Handles monocular depth estimation and Z-sampling for 3D coordinate generation.
  • calibrate.py
    • Utility for camera intrinsics calibration.

🔧 Troubleshooting & Known Issues

📉 Low Framerate (FPS)

  • Issue: Visual system running at ~11 FPS.
  • Reason: This is intentional! We cap the visual tracking to spare resources for the audio engine (which needs <15ms latency).
  • Fix: Scan your head slowly to "paint" the room with sound.

🔇 Audio Stuttering

  • Fix 1: Ensure sounddevice block size is set to 512 or 1024 in vsa_pipeline.py.
  • Fix 2: Close other GPU-heavy apps (Chrome, Games).
  • Fix 3: Check that the dedicated NVIDIA GPU is not being used for the visual tracker (check Task Manager).
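Fix 1 looks roughly like this in code. The sketch assumes a 48 kHz stereo stream and a placeholder callback that writes silence; the real callback in vsa_pipeline.py would write the synthesized binaural block instead.

```python
import numpy as np

SAMPLE_RATE = 48_000
BLOCKSIZE = 512  # try 1024 if dropouts persist; larger = fewer underruns, more latency

def block_latency_ms(blocksize, sample_rate=SAMPLE_RATE):
    """Latency contributed by one audio block, in milliseconds."""
    return 1000.0 * blocksize / sample_rate

if __name__ == "__main__":
    import sounddevice as sd

    def callback(outdata, frames, time, status):
        if status:
            print(status)  # underflow/overflow warnings surface here
        outdata[:] = np.zeros((frames, 2), dtype="float32")  # silence placeholder

    with sd.OutputStream(samplerate=SAMPLE_RATE, blocksize=BLOCKSIZE,
                         channels=2, dtype="float32", callback=callback):
        sd.sleep(2000)  # stream for two seconds
```

At 48 kHz, a 512-sample block adds about 10.7 ms per block and a 1024-sample block about 21.3 ms, which is why 512 is the better starting point for the sub-15 ms audio budget.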

🌡️ Thermal Throttling

  • Issue: After ~45 mins, performance drops.
  • Fix: Restart the script to clear the GPU memory cache, and give the GPU a few minutes to cool before resuming.

📜 License

This project uses components from Open Source libraries (OpenCV, PyTorch).

  • Code: MIT License
  • HRIR Database: CIPIC HRTF Database (Research Use Only)

Bridging the Dark with Sound.
