Tags from VisionDepth3D

VisionDepth3Dv3.2.8 Release

2026-02-11T22:18:49Z

V3.8.2 Update (#87)

* Update Model List

v3.8.2 - Readded ONNX model from Depth inference list that was missing

* Add files via upload

Adapters for DAV3 and Video Depth Anything Depth models for integration

* Add files via upload

Video Depth Anything Backend

* Add files via upload

Config files for DAV3

* Add files via upload

model backend for integration

* Add files via upload

DAV3 Model back end

* Add files via upload

Windows Updater for latest release

* v3.8.2 - Main GUI & Workflow Improvements

### Main GUI & Workflow Improvements

- Renamed **Depth Estimation** tab to **Depth Engine** to reflect multi-backend depth processing.
- Added native DA3 and Video Depth Anything engines directly into the unified depth selector.
- Improved model list consistency so UI options always match available backends.
- Added clearer ONNX model identification in the console during load.
- Fixed mismatched slider labels and tooltips in the 3D Generator tab.
- Reworked **Encoding Settings** dialog layout for cleaner spacing and readability.
- Moved **Clip Range** controls into Processing Options with translated labels and tooltips.
- Added optional **Convergence Crosshairs** overlay in Preview GUI for faster tuning.
- Fixed File menu actions failing to trigger dialogs (Load Preset and Output Path).
- Simplified File menu by removing redundant Save/Load Settings in favor of presets.
- Integrated built-in **VisionDepth3D Updater** accessible from Help → Check Updates.
- Added confirmation prompt before launching updater for safe auto-closing behavior.
- Reduced console warning spam for cleaner runtime output.

* v.3.8.2 - Depth Estimation Improvements & Fixes

### Depth Estimation Improvements & Fixes

- Introduced native **Depth Anything 3 (DA3)** backend with full integration into image and video workflows.
- Added native **Video Depth Anything (VDA)** backend with sequence-aware temporal inference.
- Unified DA3, VDA, ONNX, and Hugging Face models under a single depth engine pipeline.
- Normalized all depth outputs into a consistent 0–1 range for reliable blending and 3D rendering.
- Added warm-up passes for DA3 and VDA to eliminate first-frame hitching.
- Improved batching support and fallback handling for multi-frame depth inference.
- Added configurable target FPS control for VDA to reduce inference load on high-FPS sources.

#### ONNX Stability & Model Fixes
- Fixed Distill-Any-Depth ONNX models failing due to tensor shape mismatches.
- Enforced correct 518×518 inference resolution for Distill-Any-Depth models.
- Added automatic ONNX model detection and resolution enforcement.
- Switched ONNX preprocessing to aspect-ratio-preserving padding instead of stretching.
- Enabled safe ONNX Runtime graph optimizations for improved stability and performance.
- Fixed ONNX warm-up errors and broadcast failures.

#### Video Depth Handling Improvements
- Fixed letterbox (black bar) regions incorrectly affecting depth inference.
- Improved multi-frame letterbox detection to prevent flicker.
- Filled letterbox areas with neutral depth to prevent pop-out artifacts and white banding.

#### Performance Optimizations
- Removed redundant image resizing during video inference.
- Consolidated resizing into a single pass per frame.
- Enabled CUDA `channels_last` memory layout for supported Hugging Face models.
- Improved FP16 inference handling for faster CUDA performance.
- Optimized ONNX session configuration to reduce memory overhead.
- Improved batch handling to reduce per-frame processing cost.
- Reduced console warning spam for cleaner runtime output.

* v3.8.2 Preview GUI & Live View Improvements

- Added optional **Convergence Crosshairs overlay** to the Preview GUI for faster and more precise convergence tuning.
- Significantly improved real-time Preview GUI smoothness by resetting render state between sessions to prevent drift and jitter.
- Eliminated “settling” artifacts at the start of previews by reinitializing depth normalization and convergence trackers per render.
- Improved floating window behavior during the first frames of preview playback for more stable stereo alignment.
- Increased live preview FPS by reducing GPU memory churn and persistent buffer reuse.
- Reduced preview stutter caused by warm-up spikes and redundant tensor allocations.
- Improved frame pacing for smoother SBS output during live preview.
- Enhanced stability when mixing screen capture with GPU depth inference.

* v3.8.2 - VD3D Live 3D Performance & Stability Upgrades

- Major real-time performance boost across GPUs, with live 3D preview running approximately **40 to 70 percent faster** depending on resolution and hardware.
- Eliminated frequent GPU memory reallocations by introducing persistent CUDA buffers for depth inference and stereo rendering.
- Smoother live depth updates through optimized GPU tensor reuse and reduced CPU to GPU transfer overhead.
- Added independent **Depth FPS control**, allowing depth inference to run at a lower rate than preview rendering for better responsiveness and stability.
- Reduced temporal jitter in live depth maps using improved EMA smoothing while preserving depth responsiveness.
- Minimized preview hitching caused by first-frame warm-up and inference spikes.
- Improved frame pacing for more consistent SBS output in live mode.
- Increased stability when combining screen capture with GPU depth inference.

* v3.8.2 - 3D Generator Performance, Stability & Quality Improvements

- Significantly smoother offline and real-time 3D rendering by fully resetting internal render state at the start of each render session.
- Eliminated temporal drift, convergence carry-over, and accumulated smoothing artifacts between consecutive renders.
- Improved depth range calibration per clip with fresh percentile normalization for more consistent parallax response.
- Stabilized floating window behavior and convergence transitions during the first frames of each render.
- Increased real-time preview FPS and reduced jitter across long renders.

- Fixed output sizing across all 3D modes including:
- VR formats
- Passive Interlaced displays
- Single-eye exports

- Corrected floating window calculations to operate per-eye instead of full SBS width.
- Added safety resizing to guarantee final encoded frames always match target output resolution.

- Added optional **Convergence Crosshairs overlay** in the Preview GUI for faster and more precise tuning.

- Cleaned up UI inconsistencies:
- Foreground and Background shift labels now match their actual sliders
- Tooltips correctly reflect each control’s function

- Reworked Encoding Settings layout for better readability and workflow.
- Moved Clip Range controls into Processing Options for a cleaner main interface.

- Fixed File menu actions:
- Preset loading now works correctly from the dropdown
- Output path dialog now opens properly from both menu and hotkey
- Removed redundant Save/Load Settings in favor of streamlined Preset workflow

* Revise changelog for VisionDepth3D v3.8.2 release

Updated changelog for VisionDepth3D v3.8.2 with performance improvements, new depth engines, and various fixes.

* Update requirements.txt with new packages

Added new dependencies to requirements.txt for additional functionality.

* Update copyright year in LICENSE.txt

* Delete presets/Best3DSettings.json

old

* Delete presets/balanced_depth.json

old

* Add files via upload

VisionDepth3Dv3.8.1 - Release Bug patch

2025-12-28T17:42:36Z

VisionDepth3Dv3.8 - Release

2025-12-18T02:28:52Z

Include licensing information in README

Added licensing notice for VisionDepth3D.

VisionDepth3Dv3.7 - Release

2025-11-26T17:27:10Z

VisionDepth3D v3.7 Update (#78)

* Delete languages directory

Delete for 3.7 update

* Add Language Directory

Adding language directory for 3.7

* Update DB.py

v3.7 Update

* Update render_depth.py

v3.7 Update

* Update render_3d.py

v3.7 Update

* Update merged_pipeline.py

v3.7 update

* Update vd3d_live.py

v3.7 update

* Update preview_gui.py

v3.7 Update

* Reformat preview_utils.py for consistency

v3.7 update

* Refactor imports in core/__init__.py for clarity

v3.7 update

* Update VisionDepth3D.py

v3.7 Update

* Revise Changelog for VisionDepth3D v3.7 release

Updated changelog for version 3.7 with new features, fixes, and performance improvements.

VisionDepth3Dv3.6.2 - Release

2025-10-08T18:14:52Z

Update README.md

Updated Banner Photo

VisionDepth3Dv3.6 - Release

2025-10-06T14:02:33Z

Update index.html

updated download tally badge

VisionDepth3D Setup Downloader

2025-12-14T18:30:17Z

Update index.html

links

VisionDepth3Dv3.5 - Release

2025-08-24T06:06:30Z

V3.5 (#56)

* Delete languages directory

Removing to add updated ones

* v3.5 - Language JSON Files

Updated json files to all match en and cleaned up dups

* v3.5

# VisionDepth3D v3.5 – Changelog

## 1) Depth of Field (DOF) – Rewritten & Stabilized
- Fully rewritten as a **GPU-accelerated, multi-level Gaussian pipeline**.
- Uses **per-pixel interpolation** between blur levels for smooth transitions.
- Added **motion-adaptive focal tracking**:
- Exponential moving average (EMA) for stable focus.
- Deadband to ignore micro noise.
- Max-step limiter to prevent “focus pops.”
- DOF now applies **after stereo warp** using warped per-eye depth.
- DOF slider maps directly to `max blur`; setting it to `0` cleanly disables DOF.
- **Result:** smoother bokeh, no ring artifacts, and much more natural focal transitions.

## 2) Audio Tool – Revamp & Codec Control
- Added **progress bar** for encoding/attaching audio.
- Users can now select **codec and bitrate** before muxing:
- `aac` (default) and `libmp3lame` supported.
- Configurable bitrate (e.g. 128k, 192k, 320k).
- **Offset slider** added for real-time sync adjustment when attaching audio.
- Audio attachment now clearly distinguishes between **copy vs. re-encode**:
- If codec/bitrate unchanged → fast copy (`-c copy`).
- If codec/bitrate changed → re-encode.
- UI fields now properly populate when files are chosen.
- Safe handling of long videos (2+ hours) with progress feedback.

## 3) Color Grading – GPU Accelerated & Fully Integrated
- Introduced **GPU-accelerated color grading pipeline** (`apply_color_grade`) with:
- **Saturation**
- **Contrast**
- **Brightness**
- Color grading now applies **after stereo warp & DOF, before packing/formatting**.
- Added **Preview GUI sync**:
- Sliders update live in the preview with **debounced re-rendering**.
- Two-way binding with main UI — values set in Preview transfer to main UI controls and vice versa.
- Preset/save/load support extended to include color grading.
- Tooltips and i18n refreshed for new controls.
- **Result:** creators can now fine-tune the image directly inside VD3D without round-tripping into external grading tools.

## 4) Stereo Separation (IPD Adjustment) – New 3D Control
- Added **Interpupillary Distance (IPD) adjustment slider** for fine-tuning stereo separation.
- Works as a **global scale factor** on pixel shifts (foreground, midground, background).
- Allows creators to:
- Increase IPD for stronger 3D “pop” on large screens / VR headsets.
- Reduce IPD for comfortable viewing on smaller displays.
- Fully integrated into:
- **Preset system** (save/load).
- **Preview GUI** with real-time feedback.
- **Tooltip and i18n system** for clarity.
- **Result:** users can now match stereo depth strength to their **display environment and audience comfort**.

## 5) General Fixes & Stability
- Fixed tensor size mismatch crash in DOF when depth/resolution didn’t match warp output.
- Preview GUI sliders now wire correctly to main GUI sliders for seamless testing.
- Minor UI consistency fixes across tools.
- **Language Files Clean-Up:**
- Removed duplicate keys and aligned all translations with `en.json`.
- Verified **FR/DE/ES/JA** language packs — all tooltips and UI labels now update correctly when switching languages.
- Added missing entries (`Apply Entries`, `Start Batch Render`, scene detection, etc.) to ensure full coverage.

## 6) New Session Additions – ONNX Pipeline & UI Enhancements
- **ONNX Integration**
- Converted **Video Depth Anything (pth → onnx)** for faster inference.
- Optimized ONNX pipeline path to run converted models efficiently.
- **UI Enhancements**
- New **start time / end time controls** inside Encoding Settings:
- Users can render **short clips or preview segments** without full video runs.
- Inputs section refactored into its own **dedicated frame** for clarity.
- **Result:** streamlined workflow for experimenting with models, and flexible render ranges for testing.

---

**Summary:**
v3.4 gave creators fine-grained depth & subject control.
v3.5 brings **cinematic polish** with stabilized DOF, a **true audio tool** with sync + codec options, a **GPU color grading suite**, a **stereo separation (IPD) adjustment** for display comfort, and now **ONNX pipeline + clip rendering support** for faster experimentation.

* v3.5

# VisionDepth3D v3.5 – Changelog

---

* v3.5

# VisionDepth3D v3.5 – Changelog

---

* v3.5

# VisionDepth3D v3.5 – Changelog

---

* v3.5

# VisionDepth3D v3.5 – Changelog

---

* v3.5

# VisionDepth3D v3.5 – Changelog

---

* v3.5

# VisionDepth3D v3.5 – Changelog

---

VisionDepth3D v3.3 - Release

2025-08-12T13:42:14Z

Update README.md

Updated install guide

VisionDepth3Dv3.2.4 - Release

2025-06-04T02:07:05Z

Update de.json

removed trailing comma