tag:github.com,2008:https://github.com/VisionDepth/VisionDepth3D/releases Tags from VisionDepth3D 2026-02-11T20:31:11Z tag:github.com,2008:Repository/918467518/Release-v3.8.2 2026-02-11T22:18:49Z VisionDepth3Dv3.8.2 Release <p>V3.8.2 Update (<a class="issue-link js-issue-link" href="https://github.com/VisionDepth/VisionDepth3D/pull/87">#87</a>)</p> <p>* Update Model List</p> <p>v3.8.2 - Re-added the ONNX model that was missing from the depth inference list</p> <p>* Add files via upload</p> <p>Adapters for DAV3 and Video Depth Anything Depth models for integration</p> <p>* Add files via upload</p> <p>Video Depth Anything Backend</p> <p>* Add files via upload</p> <p>* Add files via upload</p> <p>Config files for DAV3</p> <p>* Add files via upload</p> <p>model backend for integration</p> <p>* Add files via upload</p> <p>DAV3 Model back end</p> <p>* Add files via upload</p> <p>Windows Updater for latest release</p> <p>* v3.8.2 - Main GUI &amp; Workflow Improvements</p> <p>### Main GUI &amp; Workflow Improvements</p> <p>- Renamed **Depth Estimation** tab to **Depth Engine** to reflect multi-backend depth processing. <br />- Added native DA3 and Video Depth Anything engines directly into the unified depth selector. <br />- Improved model list consistency so UI options always match available backends. <br />- Added clearer ONNX model identification in the console during load. <br />- Fixed mismatched slider labels and tooltips in the 3D Generator tab. <br />- Reworked **Encoding Settings** dialog layout for cleaner spacing and readability. <br />- Moved **Clip Range** controls into Processing Options with translated labels and tooltips. <br />- Added optional **Convergence Crosshairs** overlay in Preview GUI for faster tuning. <br />- Fixed File menu actions failing to trigger dialogs (Load Preset and Output Path). <br />- Simplified File menu by removing redundant Save/Load Settings in favor of presets. 
<br />- Integrated built-in **VisionDepth3D Updater** accessible from Help → Check Updates. <br />- Added confirmation prompt before launching updater for safe auto-closing behavior. <br />- Reduced console warning spam for cleaner runtime output.</p> <p>* v3.8.2 - Depth Estimation Improvements &amp; Fixes</p> <p>### Depth Estimation Improvements &amp; Fixes</p> <p>- Introduced native **Depth Anything 3 (DA3)** backend with full integration into image and video workflows. <br />- Added native **Video Depth Anything (VDA)** backend with sequence-aware temporal inference. <br />- Unified DA3, VDA, ONNX, and Hugging Face models under a single depth engine pipeline. <br />- Normalized all depth outputs into a consistent 0–1 range for reliable blending and 3D rendering. <br />- Added warm-up passes for DA3 and VDA to eliminate first-frame hitching. <br />- Improved batching support and fallback handling for multi-frame depth inference. <br />- Added configurable target FPS control for VDA to reduce inference load on high-FPS sources.</p> <p>#### ONNX Stability &amp; Model Fixes <br />- Fixed Distill-Any-Depth ONNX models failing due to tensor shape mismatches. <br />- Enforced correct 518×518 inference resolution for Distill-Any-Depth models. <br />- Added automatic ONNX model detection and resolution enforcement. <br />- Switched ONNX preprocessing to aspect-ratio-preserving padding instead of stretching. <br />- Enabled safe ONNX Runtime graph optimizations for improved stability and performance. <br />- Fixed ONNX warm-up errors and broadcast failures.</p> <p>#### Video Depth Handling Improvements <br />- Fixed letterbox (black bar) regions incorrectly affecting depth inference. <br />- Improved multi-frame letterbox detection to prevent flicker. <br />- Filled letterbox areas with neutral depth to prevent pop-out artifacts and white banding.</p> <p>#### Performance Optimizations <br />- Removed redundant image resizing during video inference. 
<br />- Consolidated resizing into a single pass per frame. <br />- Enabled CUDA `channels_last` memory layout for supported Hugging Face models. <br />- Improved FP16 inference handling for faster CUDA performance. <br />- Optimized ONNX session configuration to reduce memory overhead. <br />- Improved batch handling to reduce per-frame processing cost. <br />- Reduced console warning spam for cleaner runtime output.</p> <p>* v3.8.2 - Preview GUI &amp; Live View Improvements</p> <p>- Added optional **Convergence Crosshairs overlay** to the Preview GUI for faster and more precise convergence tuning. <br />- Significantly improved real-time Preview GUI smoothness by resetting render state between sessions to prevent drift and jitter. <br />- Eliminated “settling” artifacts at the start of previews by reinitializing depth normalization and convergence trackers per render. <br />- Improved floating window behavior during the first frames of preview playback for more stable stereo alignment. <br />- Increased live preview FPS by reducing GPU memory churn and reusing persistent buffers. <br />- Reduced preview stutter caused by warm-up spikes and redundant tensor allocations. <br />- Improved frame pacing for smoother SBS output during live preview. <br />- Enhanced stability when mixing screen capture with GPU depth inference.</p> <p>* v3.8.2 - VD3D Live 3D Performance &amp; Stability Upgrades</p> <p>- Major real-time performance boost across GPUs, with live 3D preview running approximately **40 to 70 percent faster** depending on resolution and hardware. <br />- Eliminated frequent GPU memory reallocations by introducing persistent CUDA buffers for depth inference and stereo rendering. <br />- Smoother live depth updates through optimized GPU tensor reuse and reduced CPU to GPU transfer overhead. <br />- Added independent **Depth FPS control**, allowing depth inference to run at a lower rate than preview rendering for better responsiveness and stability. 
<br />- Reduced temporal jitter in live depth maps using improved EMA smoothing while preserving depth responsiveness. <br />- Minimized preview hitching caused by first-frame warm-up and inference spikes. <br />- Improved frame pacing for more consistent SBS output in live mode. <br />- Increased stability when combining screen capture with GPU depth inference.</p> <p>* v3.8.2 - 3D Generator Performance, Stability &amp; Quality Improvements</p> <p>- Significantly smoother offline and real-time 3D rendering by fully resetting internal render state at the start of each render session. <br />- Eliminated temporal drift, convergence carry-over, and accumulated smoothing artifacts between consecutive renders. <br />- Improved depth range calibration per clip with fresh percentile normalization for more consistent parallax response. <br />- Stabilized floating window behavior and convergence transitions during the first frames of each render. <br />- Increased real-time preview FPS and reduced jitter across long renders.</p> <p>- Fixed output sizing across all 3D modes including: <br /> - VR formats <br /> - Passive Interlaced displays <br /> - Single-eye exports </p> <p>- Corrected floating window calculations to operate per-eye instead of full SBS width. <br />- Added safety resizing to guarantee final encoded frames always match target output resolution.</p> <p>- Added optional **Convergence Crosshairs overlay** in the Preview GUI for faster and more precise tuning.</p> <p>- Cleaned up UI inconsistencies: <br /> - Foreground and Background shift labels now match their actual sliders <br /> - Tooltips correctly reflect each control’s function </p> <p>- Reworked Encoding Settings layout for better readability and workflow. 
<br />- Moved Clip Range controls into Processing Options for a cleaner main interface.</p> <p>- Fixed File menu actions: <br /> - Preset loading now works correctly from the dropdown <br /> - Output path dialog now opens properly from both menu and hotkey <br /> - Removed redundant Save/Load Settings in favor of streamlined Preset workflow</p> <p>* Revise changelog for VisionDepth3D v3.8.2 release</p> <p>Updated changelog for VisionDepth3D v3.8.2 with performance improvements, new depth engines, and various fixes.</p> <p>* Update requirements.txt with new packages</p> <p>Added new dependencies to requirements.txt for additional functionality.</p> <p>* Update copyright year in LICENSE.txt</p> <p>* Delete presets/Best3DSettings.json</p> <p>old</p> <p>* Delete presets/balanced_depth.json</p> <p>old</p> <p>* Add files via upload</p> VisionDepth tag:github.com,2008:Repository/918467518/Release-v3.8.1 2025-12-28T17:42:36Z VisionDepth3Dv3.8.1 - Release Bug patch VisionDepth tag:github.com,2008:Repository/918467518/Release-v3.8 2025-12-18T02:28:52Z VisionDepth3Dv3.8 - Release <p>Include licensing information in README</p> <p>Added licensing notice for VisionDepth3D.</p> VisionDepth tag:github.com,2008:Repository/918467518/Release-v3.7 2025-11-26T17:27:10Z VisionDepth3Dv3.7 - Release <p>VisionDepth3D v3.7 Update (<a class="issue-link js-issue-link" href="https://github.com/VisionDepth/VisionDepth3D/pull/78">#78</a>)</p> <p>* Delete languages directory</p> <p>Delete for 3.7 update</p> <p>* Add Language Directory </p> <p>Adding language directory for 3.7</p> <p>* Update DB.py</p> <p>v3.7 Update</p> <p>* Update render_depth.py</p> <p>v3.7 Update</p> <p>* Update render_3d.py</p> <p>v3.7 Update</p> <p>* Update merged_pipeline.py</p> <p>v3.7 update</p> <p>* Update vd3d_live.py</p> <p>v3.7 update</p> <p>* Update preview_gui.py</p> <p>v3.7 Update</p> <p>* Reformat preview_utils.py for consistency</p> <p>v3.7 update</p> <p>* Refactor imports in core/__init__.py for clarity</p> 
<p>v3.7 update</p> <p>* Update VisionDepth3D.py</p> <p>v3.7 Update</p> <p>* Revise Changelog for VisionDepth3D v3.7 release</p> <p>Updated changelog for version 3.7 with new features, fixes, and performance improvements.</p> VisionDepth tag:github.com,2008:Repository/918467518/Release-v3.6.2 2025-10-08T18:14:52Z VisionDepth3Dv3.6.2 - Release <p>Update README.md</p> <p>Updated Banner Photo</p> VisionDepth tag:github.com,2008:Repository/918467518/Release-v3.6 2025-10-06T14:02:33Z VisionDepth3Dv3.6 - Release <p>Update index.html</p> <p>updated download tally badge</p> VisionDepth tag:github.com,2008:Repository/918467518/Installer 2025-12-14T18:30:17Z VisionDepth3D Setup Downloader <p>Update index.html</p> <p>links</p> VisionDepth tag:github.com,2008:Repository/918467518/Release-v3.5 2025-08-24T06:06:30Z VisionDepth3Dv3.5 - Release <p>V3.5 (<a class="issue-link js-issue-link" href="https://github.com/VisionDepth/VisionDepth3D/pull/56">#56</a>)</p> <p>* Delete languages directory</p> <p>Removing to add updated ones</p> <p>* v3.5 - Language JSON Files</p> <p>Updated json files to all match en and cleaned up dups</p> <p>* v3.5 </p> <p># VisionDepth3D v3.5 – Changelog</p> <p>## 1) Depth of Field (DOF) – Rewritten &amp; Stabilized <br />- Fully rewritten as a **GPU-accelerated, multi-level Gaussian pipeline**. <br />- Uses **per-pixel interpolation** between blur levels for smooth transitions. <br />- Added **motion-adaptive focal tracking**: <br /> - Exponential moving average (EMA) for stable focus. <br /> - Deadband to ignore micro noise. <br /> - Max-step limiter to prevent “focus pops.” <br />- DOF now applies **after stereo warp** using warped per-eye depth. <br />- DOF slider maps directly to `max blur`; setting it to `0` cleanly disables DOF. <br />- **Result:** smoother bokeh, no ring artifacts, and much more natural focal transitions.</p> <p>## 2) Audio Tool – Revamp &amp; Codec Control <br />- Added **progress bar** for encoding/attaching audio. 
<br />- Users can now select **codec and bitrate** before muxing: <br /> - `aac` (default) and `libmp3lame` supported. <br /> - Configurable bitrate (e.g. 128k, 192k, 320k). <br />- **Offset slider** added for real-time sync adjustment when attaching audio. <br />- Audio attachment now clearly distinguishes between **copy vs. re-encode**: <br /> - If codec/bitrate unchanged → fast copy (`-c copy`). <br /> - If codec/bitrate changed → re-encode. <br />- UI fields now properly populate when files are chosen. <br />- Safe handling of long videos (2+ hours) with progress feedback.</p> <p>## 3) Color Grading – GPU Accelerated &amp; Fully Integrated <br />- Introduced **GPU-accelerated color grading pipeline** (`apply_color_grade`) with: <br /> - **Saturation** <br /> - **Contrast** <br /> - **Brightness** <br />- Color grading now applies **after stereo warp &amp; DOF, before packing/formatting**. <br />- Added **Preview GUI sync**: <br /> - Sliders update live in the preview with **debounced re-rendering**. <br /> - Two-way binding with main UI — values set in Preview transfer to main UI controls and vice versa. <br />- Preset/save/load support extended to include color grading. <br />- Tooltips and i18n refreshed for new controls. <br />- **Result:** creators can now fine-tune the image directly inside VD3D without round-tripping into external grading tools.</p> <p>## 4) Stereo Separation (IPD Adjustment) – New 3D Control <br />- Added **Interpupillary Distance (IPD) adjustment slider** for fine-tuning stereo separation. <br />- Works as a **global scale factor** on pixel shifts (foreground, midground, background). <br />- Allows creators to: <br /> - Increase IPD for stronger 3D “pop” on large screens / VR headsets. <br /> - Reduce IPD for comfortable viewing on smaller displays. <br />- Fully integrated into: <br /> - **Preset system** (save/load). <br /> - **Preview GUI** with real-time feedback. <br /> - **Tooltip and i18n system** for clarity. 
<br />- **Result:** users can now match stereo depth strength to their **display environment and audience comfort**.</p> <p>## 5) General Fixes &amp; Stability <br />- Fixed tensor size mismatch crash in DOF when depth/resolution didn’t match warp output. <br />- Preview GUI sliders now wire correctly to main GUI sliders for seamless testing. <br />- Minor UI consistency fixes across tools. <br />- **Language Files Clean-Up:** <br /> - Removed duplicate keys and aligned all translations with `en.json`. <br /> - Verified **FR/DE/ES/JA** language packs — all tooltips and UI labels now update correctly when switching languages. <br /> - Added missing entries (`Apply Entries`, `Start Batch Render`, scene detection, etc.) to ensure full coverage.</p> <p>## 6) New Session Additions – ONNX Pipeline &amp; UI Enhancements <br />- **ONNX Integration** <br /> - Converted **Video Depth Anything (pth → onnx)** for faster inference. <br /> - Optimized ONNX pipeline path to run converted models efficiently. <br />- **UI Enhancements** <br /> - New **start time / end time controls** inside Encoding Settings: <br /> - Users can render **short clips or preview segments** without full video runs. <br /> - Inputs section refactored into its own **dedicated frame** for clarity. <br />- **Result:** streamlined workflow for experimenting with models, and flexible render ranges for testing.</p> <p>---</p> <p>**Summary:** <br />v3.4 gave creators fine-grained depth &amp; subject control. <br />v3.5 brings **cinematic polish** with stabilized DOF, a **true audio tool** with sync + codec options, a **GPU color grading suite**, a **stereo separation (IPD) adjustment** for display comfort, and now **ONNX pipeline + clip rendering support** for faster experimentation.</p> <p>* v3.5</p> <p># VisionDepth3D v3.5 – Changelog</p> <p>## 1) Depth of Field (DOF) – Rewritten &amp; Stabilized <br />- Fully rewritten as a **GPU-accelerated, multi-level Gaussian pipeline**. 
<br />- Uses **per-pixel interpolation** between blur levels for smooth transitions. <br />- Added **motion-adaptive focal tracking**: <br /> - Exponential moving average (EMA) for stable focus. <br /> - Deadband to ignore micro noise. <br /> - Max-step limiter to prevent “focus pops.” <br />- DOF now applies **after stereo warp** using warped per-eye depth. <br />- DOF slider maps directly to `max blur`; setting it to `0` cleanly disables DOF. <br />- **Result:** smoother bokeh, no ring artifacts, and much more natural focal transitions.</p> <p>## 2) Audio Tool – Revamp &amp; Codec Control <br />- Added **progress bar** for encoding/attaching audio. <br />- Users can now select **codec and bitrate** before muxing: <br /> - `aac` (default) and `libmp3lame` supported. <br /> - Configurable bitrate (e.g. 128k, 192k, 320k). <br />- **Offset slider** added for real-time sync adjustment when attaching audio. <br />- Audio attachment now clearly distinguishes between **copy vs. re-encode**: <br /> - If codec/bitrate unchanged → fast copy (`-c copy`). <br /> - If codec/bitrate changed → re-encode. <br />- UI fields now properly populate when files are chosen. <br />- Safe handling of long videos (2+ hours) with progress feedback.</p> <p>## 3) Color Grading – GPU Accelerated &amp; Fully Integrated <br />- Introduced **GPU-accelerated color grading pipeline** (`apply_color_grade`) with: <br /> - **Saturation** <br /> - **Contrast** <br /> - **Brightness** <br />- Color grading now applies **after stereo warp &amp; DOF, before packing/formatting**. <br />- Added **Preview GUI sync**: <br /> - Sliders update live in the preview with **debounced re-rendering**. <br /> - Two-way binding with main UI — values set in Preview transfer to main UI controls and vice versa. <br />- Preset/save/load support extended to include color grading. <br />- Tooltips and i18n refreshed for new controls. 
<br />- **Result:** creators can now fine-tune the image directly inside VD3D without round-tripping into external grading tools.</p> <p>## 4) Stereo Separation (IPD Adjustment) – New 3D Control <br />- Added **Interpupillary Distance (IPD) adjustment slider** for fine-tuning stereo separation. <br />- Works as a **global scale factor** on pixel shifts (foreground, midground, background). <br />- Allows creators to: <br /> - Increase IPD for stronger 3D “pop” on large screens / VR headsets. <br /> - Reduce IPD for comfortable viewing on smaller displays. <br />- Fully integrated into: <br /> - **Preset system** (save/load). <br /> - **Preview GUI** with real-time feedback. <br /> - **Tooltip and i18n system** for clarity. <br />- **Result:** users can now match stereo depth strength to their **display environment and audience comfort**.</p> <p>## 5) General Fixes &amp; Stability <br />- Fixed tensor size mismatch crash in DOF when depth/resolution didn’t match warp output. <br />- Preview GUI sliders now wire correctly to main GUI sliders for seamless testing. <br />- Minor UI consistency fixes across tools. <br />- **Language Files Clean-Up:** <br /> - Removed duplicate keys and aligned all translations with `en.json`. <br /> - Verified **FR/DE/ES/JA** language packs — all tooltips and UI labels now update correctly when switching languages. <br /> - Added missing entries (`Apply Entries`, `Start Batch Render`, scene detection, etc.) to ensure full coverage.</p> <p>## 6) New Session Additions – ONNX Pipeline &amp; UI Enhancements <br />- **ONNX Integration** <br /> - Converted **Video Depth Anything (pth → onnx)** for faster inference. <br /> - Optimized ONNX pipeline path to run converted models efficiently. <br />- **UI Enhancements** <br /> - New **start time / end time controls** inside Encoding Settings: <br /> - Users can render **short clips or preview segments** without full video runs. 
<br /> - Inputs section refactored into its own **dedicated frame** for clarity. <br />- **Result:** streamlined workflow for experimenting with models, and flexible render ranges for testing.</p> <p>---</p> <p>**Summary:** <br />v3.4 gave creators fine-grained depth &amp; subject control. <br />v3.5 brings **cinematic polish** with stabilized DOF, a **true audio tool** with sync + codec options, a **GPU color grading suite**, a **stereo separation (IPD) adjustment** for display comfort, and now **ONNX pipeline + clip rendering support** for faster experimentation.</p> <p>* v3.5 </p> <p># VisionDepth3D v3.5 – Changelog</p> <p>## 1) Depth of Field (DOF) – Rewritten &amp; Stabilized <br />- Fully rewritten as a **GPU-accelerated, multi-level Gaussian pipeline**. <br />- Uses **per-pixel interpolation** between blur levels for smooth transitions. <br />- Added **motion-adaptive focal tracking**: <br /> - Exponential moving average (EMA) for stable focus. <br /> - Deadband to ignore micro noise. <br /> - Max-step limiter to prevent “focus pops.” <br />- DOF now applies **after stereo warp** using warped per-eye depth. <br />- DOF slider maps directly to `max blur`; setting it to `0` cleanly disables DOF. <br />- **Result:** smoother bokeh, no ring artifacts, and much more natural focal transitions.</p> <p>## 2) Audio Tool – Revamp &amp; Codec Control <br />- Added **progress bar** for encoding/attaching audio. <br />- Users can now select **codec and bitrate** before muxing: <br /> - `aac` (default) and `libmp3lame` supported. <br /> - Configurable bitrate (e.g. 128k, 192k, 320k). <br />- **Offset slider** added for real-time sync adjustment when attaching audio. <br />- Audio attachment now clearly distinguishes between **copy vs. re-encode**: <br /> - If codec/bitrate unchanged → fast copy (`-c copy`). <br /> - If codec/bitrate changed → re-encode. <br />- UI fields now properly populate when files are chosen. 
<br />- Safe handling of long videos (2+ hours) with progress feedback.</p> <p>## 3) Color Grading – GPU Accelerated &amp; Fully Integrated <br />- Introduced **GPU-accelerated color grading pipeline** (`apply_color_grade`) with: <br /> - **Saturation** <br /> - **Contrast** <br /> - **Brightness** <br />- Color grading now applies **after stereo warp &amp; DOF, before packing/formatting**. <br />- Added **Preview GUI sync**: <br /> - Sliders update live in the preview with **debounced re-rendering**. <br /> - Two-way binding with main UI — values set in Preview transfer to main UI controls and vice versa. <br />- Preset/save/load support extended to include color grading. <br />- Tooltips and i18n refreshed for new controls. <br />- **Result:** creators can now fine-tune the image directly inside VD3D without round-tripping into external grading tools.</p> <p>## 4) Stereo Separation (IPD Adjustment) – New 3D Control <br />- Added **Interpupillary Distance (IPD) adjustment slider** for fine-tuning stereo separation. <br />- Works as a **global scale factor** on pixel shifts (foreground, midground, background). <br />- Allows creators to: <br /> - Increase IPD for stronger 3D “pop” on large screens / VR headsets. <br /> - Reduce IPD for comfortable viewing on smaller displays. <br />- Fully integrated into: <br /> - **Preset system** (save/load). <br /> - **Preview GUI** with real-time feedback. <br /> - **Tooltip and i18n system** for clarity. <br />- **Result:** users can now match stereo depth strength to their **display environment and audience comfort**.</p> <p>## 5) General Fixes &amp; Stability <br />- Fixed tensor size mismatch crash in DOF when depth/resolution didn’t match warp output. <br />- Preview GUI sliders now wire correctly to main GUI sliders for seamless testing. <br />- Minor UI consistency fixes across tools. <br />- **Language Files Clean-Up:** <br /> - Removed duplicate keys and aligned all translations with `en.json`. 
<br /> - Verified **FR/DE/ES/JA** language packs — all tooltips and UI labels now update correctly when switching languages. <br /> - Added missing entries (`Apply Entries`, `Start Batch Render`, scene detection, etc.) to ensure full coverage.</p> <p>## 6) New Session Additions – ONNX Pipeline &amp; UI Enhancements <br />- **ONNX Integration** <br /> - Converted **Video Depth Anything (pth → onnx)** for faster inference. <br /> - Optimized ONNX pipeline path to run converted models efficiently. <br />- **UI Enhancements** <br /> - New **start time / end time controls** inside Encoding Settings: <br /> - Users can render **short clips or preview segments** without full video runs. <br /> - Inputs section refactored into its own **dedicated frame** for clarity. <br />- **Result:** streamlined workflow for experimenting with models, and flexible render ranges for testing.</p> <p>---</p> <p>**Summary:** <br />v3.4 gave creators fine-grained depth &amp; subject control. <br />v3.5 brings **cinematic polish** with stabilized DOF, a **true audio tool** with sync + codec options, a **GPU color grading suite**, a **stereo separation (IPD) adjustment** for display comfort, and now **ONNX pipeline + clip rendering support** for faster experimentation.</p> <p>* v3.5 </p> <p># VisionDepth3D v3.5 – Changelog</p> <p>## 1) Depth of Field (DOF) – Rewritten &amp; Stabilized <br />- Fully rewritten as a **GPU-accelerated, multi-level Gaussian pipeline**. <br />- Uses **per-pixel interpolation** between blur levels for smooth transitions. <br />- Added **motion-adaptive focal tracking**: <br /> - Exponential moving average (EMA) for stable focus. <br /> - Deadband to ignore micro noise. <br /> - Max-step limiter to prevent “focus pops.” <br />- DOF now applies **after stereo warp** using warped per-eye depth. <br />- DOF slider maps directly to `max blur`; setting it to `0` cleanly disables DOF. 
<br />- **Result:** smoother bokeh, no ring artifacts, and much more natural focal transitions.</p> <p>## 2) Audio Tool – Revamp &amp; Codec Control <br />- Added **progress bar** for encoding/attaching audio. <br />- Users can now select **codec and bitrate** before muxing: <br /> - `aac` (default) and `libmp3lame` supported. <br /> - Configurable bitrate (e.g. 128k, 192k, 320k). <br />- **Offset slider** added for real-time sync adjustment when attaching audio. <br />- Audio attachment now clearly distinguishes between **copy vs. re-encode**: <br /> - If codec/bitrate unchanged → fast copy (`-c copy`). <br /> - If codec/bitrate changed → re-encode. <br />- UI fields now properly populate when files are chosen. <br />- Safe handling of long videos (2+ hours) with progress feedback.</p> <p>## 3) Color Grading – GPU Accelerated &amp; Fully Integrated <br />- Introduced **GPU-accelerated color grading pipeline** (`apply_color_grade`) with: <br /> - **Saturation** <br /> - **Contrast** <br /> - **Brightness** <br />- Color grading now applies **after stereo warp &amp; DOF, before packing/formatting**. <br />- Added **Preview GUI sync**: <br /> - Sliders update live in the preview with **debounced re-rendering**. <br /> - Two-way binding with main UI — values set in Preview transfer to main UI controls and vice versa. <br />- Preset/save/load support extended to include color grading. <br />- Tooltips and i18n refreshed for new controls. <br />- **Result:** creators can now fine-tune the image directly inside VD3D without round-tripping into external grading tools.</p> <p>## 4) Stereo Separation (IPD Adjustment) – New 3D Control <br />- Added **Interpupillary Distance (IPD) adjustment slider** for fine-tuning stereo separation. <br />- Works as a **global scale factor** on pixel shifts (foreground, midground, background). <br />- Allows creators to: <br /> - Increase IPD for stronger 3D “pop” on large screens / VR headsets. 
<br /> - Reduce IPD for comfortable viewing on smaller displays. <br />- Fully integrated into: <br /> - **Preset system** (save/load). <br /> - **Preview GUI** with real-time feedback. <br /> - **Tooltip and i18n system** for clarity. <br />- **Result:** users can now match stereo depth strength to their **display environment and audience comfort**.</p> <p>## 5) General Fixes &amp; Stability <br />- Fixed tensor size mismatch crash in DOF when depth/resolution didn’t match warp output. <br />- Preview GUI sliders now wire correctly to main GUI sliders for seamless testing. <br />- Minor UI consistency fixes across tools. <br />- **Language Files Clean-Up:** <br /> - Removed duplicate keys and aligned all translations with `en.json`. <br /> - Verified **FR/DE/ES/JA** language packs — all tooltips and UI labels now update correctly when switching languages. <br /> - Added missing entries (`Apply Entries`, `Start Batch Render`, scene detection, etc.) to ensure full coverage.</p> <p>## 6) New Session Additions – ONNX Pipeline &amp; UI Enhancements <br />- **ONNX Integration** <br /> - Converted **Video Depth Anything (pth → onnx)** for faster inference. <br /> - Optimized ONNX pipeline path to run converted models efficiently. <br />- **UI Enhancements** <br /> - New **start time / end time controls** inside Encoding Settings: <br /> - Users can render **short clips or preview segments** without full video runs. <br /> - Inputs section refactored into its own **dedicated frame** for clarity. <br />- **Result:** streamlined workflow for experimenting with models, and flexible render ranges for testing.</p> <p>---</p> <p>**Summary:** <br />v3.4 gave creators fine-grained depth &amp; subject control. 
<br />v3.5 brings **cinematic polish** with stabilized DOF, a **true audio tool** with sync + codec options, a **GPU color grading suite**, a **stereo separation (IPD) adjustment** for display comfort, and now **ONNX pipeline + clip rendering support** for faster experimentation.</p> <p>* v3.5</p> <p># VisionDepth3D v3.5 – Changelog</p> <p>## 1) Depth of Field (DOF) – Rewritten &amp; Stabilized <br />- Fully rewritten as a **GPU-accelerated, multi-level Gaussian pipeline**. <br />- Uses **per-pixel interpolation** between blur levels for smooth transitions. <br />- Added **motion-adaptive focal tracking**: <br /> - Exponential moving average (EMA) for stable focus. <br /> - Deadband to ignore micro noise. <br /> - Max-step limiter to prevent “focus pops.” <br />- DOF now applies **after stereo warp** using warped per-eye depth. <br />- DOF slider maps directly to `max blur`; setting it to `0` cleanly disables DOF. <br />- **Result:** smoother bokeh, no ring artifacts, and much more natural focal transitions.</p> <p>## 2) Audio Tool – Revamp &amp; Codec Control <br />- Added **progress bar** for encoding/attaching audio. <br />- Users can now select **codec and bitrate** before muxing: <br /> - `aac` (default) and `libmp3lame` supported. <br /> - Configurable bitrate (e.g. 128k, 192k, 320k). <br />- **Offset slider** added for real-time sync adjustment when attaching audio. <br />- Audio attachment now clearly distinguishes between **copy vs. re-encode**: <br /> - If codec/bitrate unchanged → fast copy (`-c copy`). <br /> - If codec/bitrate changed → re-encode. <br />- UI fields now properly populate when files are chosen. 
<br />- Safe handling of long videos (2+ hours) with progress feedback.</p> <p>## 3) Color Grading – GPU Accelerated &amp; Fully Integrated <br />- Introduced **GPU-accelerated color grading pipeline** (`apply_color_grade`) with: <br /> - **Saturation** <br /> - **Contrast** <br /> - **Brightness** <br />- Color grading now applies **after stereo warp &amp; DOF, before packing/formatting**. <br />- Added **Preview GUI sync**: <br /> - Sliders update live in the preview with **debounced re-rendering**. <br /> - Two-way binding with main UI — values set in Preview transfer to main UI controls and vice versa. <br />- Preset/save/load support extended to include color grading. <br />- Tooltips and i18n refreshed for new controls. <br />- **Result:** creators can now fine-tune the image directly inside VD3D without round-tripping into external grading tools.</p> <p>## 4) Stereo Separation (IPD Adjustment) – New 3D Control <br />- Added **Interpupillary Distance (IPD) adjustment slider** for fine-tuning stereo separation. <br />- Works as a **global scale factor** on pixel shifts (foreground, midground, background). <br />- Allows creators to: <br /> - Increase IPD for stronger 3D “pop” on large screens / VR headsets. <br /> - Reduce IPD for comfortable viewing on smaller displays. <br />- Fully integrated into: <br /> - **Preset system** (save/load). <br /> - **Preview GUI** with real-time feedback. <br /> - **Tooltip and i18n system** for clarity. <br />- **Result:** users can now match stereo depth strength to their **display environment and audience comfort**.</p> <p>## 5) General Fixes &amp; Stability <br />- Fixed tensor size mismatch crash in DOF when depth/resolution didn’t match warp output. <br />- Preview GUI sliders now wire correctly to main GUI sliders for seamless testing. <br />- Minor UI consistency fixes across tools. <br />- **Language Files Clean-Up:** <br /> - Removed duplicate keys and aligned all translations with `en.json`. 
<br /> - Verified **FR/DE/ES/JA** language packs — all tooltips and UI labels now update correctly when switching languages. <br /> - Added missing entries (`Apply Entries`, `Start Batch Render`, scene detection, etc.) to ensure full coverage.</p> <p>## 6) New Session Additions – ONNX Pipeline &amp; UI Enhancements <br />- **ONNX Integration** <br /> - Converted **Video Depth Anything (pth → onnx)** for faster inference. <br /> - Optimized ONNX pipeline path to run converted models efficiently. <br />- **UI Enhancements** <br /> - New **start time / end time controls** inside Encoding Settings: <br /> - Users can render **short clips or preview segments** without full video runs. <br /> - Inputs section refactored into its own **dedicated frame** for clarity. <br />- **Result:** streamlined workflow for experimenting with models, and flexible render ranges for testing.</p> <p>---</p> <p>**Summary:** <br />v3.4 gave creators fine-grained depth &amp; subject control. <br />v3.5 brings **cinematic polish** with stabilized DOF, a **true audio tool** with sync + codec options, a **GPU color grading suite**, a **stereo separation (IPD) adjustment** for display comfort, and now **ONNX pipeline + clip rendering support** for faster experimentation.</p> VisionDepth tag:github.com,2008:Repository/918467518/Release-v3.3 2025-08-12T13:42:14Z VisionDepth3D v3.3 - Release <p>Update README.md</p> <p>Updated install guide</p> VisionDepth tag:github.com,2008:Repository/918467518/Release-v3.2.4 2025-06-04T02:07:05Z VisionDepth3Dv3.2.4 - Release <p>Update de.json</p> <p>removed trailing comma</p> VisionDepth