tag:github.com,2008:https://github.com/VisionDepth/VisionDepth3D/releasesRelease notes from VisionDepth3D2026-02-11T20:31:11Ztag:github.com,2008:Repository/918467518/Release-v3.8.22026-02-11T22:18:49ZVisionDepth3Dv3.2.8 Release<h1>VisionDepth3D v3.8.2</h1>
<p>This release brings major depth engine upgrades, large real-time performance gains, and important stability fixes across both offline rendering and live 3D preview.</p>
<p>Expect faster playback, cleaner depth output, improved codec reliability, and a smoother overall workflow.</p>
<hr>
<h2>New Depth Engines</h2>
<h3>Depth Anything 3 (DA3) Integration</h3>
<ul>
<li>Native DA3 backend (not Hugging Face pipeline based)</li>
<li>Supports DA3 Small, Base, Large, Giant, and Metric variants</li>
<li>Proper resolution handling and depth normalization</li>
<li>Faster warm-up and improved batching support</li>
</ul>
<h3>Video Depth Anything (VDA)</h3>
<ul>
<li>Sequence-aware video depth inference</li>
<li>Temporal processing for smoother depth output</li>
<li>Target FPS control for heavy footage</li>
<li>Unified post-processing with other depth engines</li>
</ul>
<hr>
<h2>Performance Improvements</h2>
<h3>Live 3D</h3>
<ul>
<li>40 to 70 percent FPS increase on most GPUs</li>
<li>Persistent GPU buffers eliminating per-frame allocations</li>
<li>Smoother depth refresh scheduling</li>
<li>Reduced jitter and stutter</li>
</ul>
<h3>Offline Depth and Rendering</h3>
<ul>
<li>Single-pass resizing reducing CPU overhead</li>
<li>Faster FP16 GPU inference</li>
<li>Optimized ONNX runtime sessions</li>
<li>FFmpeg piping enabled by default for faster encoding</li>
</ul>
<hr>
<h2>Stability and Quality Fixes</h2>
<h3>ONNX Models</h3>
<ul>
<li>Fixed Distill-Any-Depth shape mismatch crashes</li>
<li>Enforced correct inference resolution (518×518)</li>
<li>Aspect-ratio safe preprocessing without stretching</li>
<li>Cleaner backend detection and logging</li>
</ul>
<h3>Letterbox Handling</h3>
<ul>
<li>Black bars no longer corrupt depth maps</li>
<li>Neutral depth fill prevents white banding artifacts</li>
<li>Stable detection across frames</li>
</ul>
<hr>
<h2>3D Generator Improvements</h2>
<ul>
<li>Full render state reset per clip to eliminate drift artifacts</li>
<li>Smoother convergence and floating window behavior</li>
<li>Corrected output sizing for VR, SBS, and interlaced modes</li>
<li>Optional convergence crosshair overlay in Preview GUI</li>
<li>Cleaner encoding settings layout</li>
</ul>
<hr>
<h2>GUI and Workflow Enhancements</h2>
<ul>
<li>Depth Estimation tab renamed to <strong>Depth Engine</strong></li>
<li>Fixed preset loading from menu dropdown</li>
<li>Fixed Output Path menu action</li>
<li>Simplified File menu using preset system only</li>
<li>Built-in VisionDepth3D Updater in Help menu</li>
<li>Confirmation prompt before updating</li>
</ul>
<hr>
<h2>Upgrade Note</h2>
<p>Back up your <code>weights/</code> and <code>presets/</code> folders before uninstalling older versions.</p>
<p>Use <strong>VisionDepth3D_Setup_Downloader</strong> to install v3.8.2 and required <code>.bin</code> files.</p>
<hr>
<p>Thanks to everyone supporting VisionDepth3D and helping shape each release.</p>VisionDepthtag:github.com,2008:Repository/918467518/Release-v3.8.12025-12-28T17:42:36ZVisionDepth3Dv3.8.1 - Release Bug patch<h1>VisionDepth3D v3.8 – Bug Patch</h1>
<hr>
<h2>1) Depth Estimation Inference Error</h2>
<ul>
<li>Fixed progress bar error for depth estimation causing error on inference</li>
</ul>
<hr>
<blockquote>
<p>Back up your <code>weights/</code> and <code>presets/</code> folders before uninstalling v3.8<br>
Then run VisionDepth3D_Setup_Downloader to download the official<br>
VisionDepth3D v3.8.1 Windows installer and required <code>.bin</code> files.</p>
</blockquote>VisionDepthtag:github.com,2008:Repository/918467518/Release-v3.82025-12-18T02:28:52ZVisionDepth3Dv3.8 - Release <h1>VisionDepth3D v3.8 – Changelog</h1>
<hr>
<h2>1) Depth Estimation Tab</h2>
<h3>Depth Models</h3>
<ul>
<li>Fixed ONNX model loading:
<ul>
<li>Distill-Any-Depth (inference resolution 518×518, batch size 8)</li>
<li>Video Depth Anything (inference resolution 512×288, batch size 8)</li>
</ul>
</li>
<li>Implemented LBM depth model (dev version). Thanks to Aether for the implementation fix.</li>
<li>Removed depth models from the dropdown that returned no <code>d_type</code>.</li>
<li>Fixed Hugging Face model downloads and caching so zoo models consistently save inside the app <code>weights/</code> directory (no more extra <code>.cache</code> downloads).</li>
<li>Updated Transformers image processor loading to prefer <code>use_fast=True</code> when available (with automatic fallback when unsupported).</li>
</ul>
<h3>Depth Backend</h3>
<ul>
<li>Implemented temporal smoothing in the depth pipeline to reduce flicker and improve temporal stability of depth map output.</li>
<li>Packaged VisionDepth3D.exe with Distill-Any-Depth (ONNX), Video Depth Anything (ONNX), and Depth Anything v2 Giant weights.</li>
</ul>
<hr>
<h2>2) 3D Render Tab</h2>
<h3>UI Fixes</h3>
<ul>
<li>Added buttons for encoder settings and processing options.</li>
<li>Implemented multi-language support and tooltips for new dialog boxes.</li>
<li>Adjusted preview image window size and video info layout to prevent window overflow.</li>
<li>3D tab columns now stack correctly when resizing the window on smaller screens.</li>
</ul>
<h3>3D Backend</h3>
<ul>
<li>Reworked Auto Crop Black Bars to use first-frame detection with cached crop reuse.</li>
<li>Prevents per-frame crop jitter and depth/frame misalignment.</li>
<li>Improves stability for cinema content with subtle letterboxing.</li>
<li>Keep Audio checkbox now respects the user-selected output container instead of forcing MP4.</li>
</ul>
<hr>
<h2>3) Frametool Backend</h2>
<ul>
<li>Reworked Frametool backend to support SSResNet models for feature model integration.</li>
</ul>
<hr>
<h2>4) Console Improvements</h2>
<ul>
<li>Standardized startup console messages to clearly reflect which subsystems are initializing (Torch, depth estimation, upscaler, external 3D pipeline, language, settings).</li>
<li>Unified compute device reporting across pipelines for consistent and clearer console output.</li>
<li>Suppressed optional xFormers dependency warning on startup.</li>
<li>Prevented duplicate language loading during settings restore.</li>
</ul>
<hr>
<h2>Summary</h2>
<p>v3.8 focuses on stabilizing depth estimation, improving model compatibility,<br>
and refining the 3D Render tab UI with better layout behavior, clearer diagnostics, and improved localization support.</p>
<blockquote>
<p>Back up your <code>weights/</code> and <code>presets/</code> folders before uninstalling v3.7.<br>
Then run VisionDepth3D_Setup_Downloader to download the official<br>
VisionDepth3D v3.8 Windows installer and required <code>.bin</code> files.</p>
</blockquote>
<blockquote>
<p>(Optional but recommended) Clear the Hugging Face cache to free space and<br>
avoid duplicate model downloads:<br>
<code>C:\Users\YOUR_USERNAME\.cache\huggingface</code></p>
</blockquote>VisionDepthtag:github.com,2008:Repository/918467518/Release-v3.72025-11-26T17:27:10ZVisionDepth3Dv3.7 - Release<h1>VisionDepth3D v3.7 –Release Changelog</h1>
<hr>
<h2>1) Live 3D Capture Overhaul</h2>
<p>Live 3D Capture received a full stability and quality pass.</p>
<p><strong>What is new:</strong></p>
<ul>
<li>Optional live audio passthrough for external capture devices, with device selection and audio delay control.</li>
<li>Audio routed through DirectShow and WASAPI, with an FFplay based monitor for low latency listening.</li>
<li>Color channel controls to fix purple and red tint issues on some capture cards.</li>
<li>Tuning for real time depth inference so Live Capture can run at practical frame rates on 1080p HDMI sources.</li>
<li>A headless mode (<code>--no-preview</code>) so you can run capture without a local preview window.</li>
<li>Early groundwork for browser based SBS VR streaming with synchronized audio and video.</li>
</ul>
<p><strong>What is fixed:</strong></p>
<ul>
<li>GUI settings (resolution, backend, FPS and more) are now correctly applied when starting Live Capture from the UI.</li>
<li>Capture failures like “no frames arriving” are resolved by enforcing the correct fourcc and backend.</li>
<li>Audio is now present in Live Capture sessions instead of silent output.</li>
<li>Frame pacing is smoother and depth plus stereo warp no longer hit the same FPS bottlenecks as before.</li>
</ul>
<hr>
<h2>2) Floating Window, Depth Stability and Black Bar Handling</h2>
<p>The stereo presentation pipeline has been tightened up for more comfortable 3D.</p>
<p><strong>Dynamic Floating Window (DFW):</strong></p>
<ul>
<li>Rebuilt the floating window logic so it masks only one edge at a time, based on the dominant parallax direction.</li>
<li>Adds a minimum parallax threshold so the window stays off when depth is near the screen plane.</li>
<li>Uses temporal smoothing and easing so the window glides in and out instead of popping or flickering.</li>
<li>Supports both soft faded edges and solid black cinema bars through a single toggle.</li>
</ul>
<p>Result: fewer edge violations, a cleaner frame in VR and on monitors, and a more cinema friendly presentation.</p>
<p><strong>Frame jitter and temporal stability:</strong></p>
<ul>
<li>Fixed depth “breathing” where scenes would appear to move in and out over time.</li>
<li>Introduced several smoothing passes over subject depth, depth percentiles and convergence.</li>
<li>Added a global parallax smoother for foreground, midground and background layers.</li>
</ul>
<p>Result: more stable parallax over time, less shimmer and a more comfortable stereo experience.</p>
<p><strong>Auto crop for black bars:</strong></p>
<ul>
<li>Improved black bar detection during fades and dark transitions.</li>
<li>Added guards so detection does not update on very dark frames.</li>
<li>Handles changes in letterbox height without vertical drift.</li>
</ul>
<p>Result: 2.35:1 and similar letterboxed content now auto crops in a reliable and repeatable way.</p>
<hr>
<h2>3) Unified Depth Pipeline and Platform Support</h2>
<p>The Depth tab has been upgraded into a unified, cross platform pipeline.</p>
<p><strong>Multi backend support (CUDA, ROCm, MPS, CPU):</strong></p>
<ul>
<li>Device detection has been rewritten so CUDA is no longer assumed by default.</li>
<li>The app now picks the best available backend automatically.</li>
</ul>
<p>Supported depth backends:</p>
<ul>
<li>CUDA on NVIDIA GPUs</li>
<li>ROCm on AMD GPUs</li>
<li>MPS on Apple Silicon</li>
<li>CPU fallback when no GPU is present</li>
</ul>
<p>This prevents crashes on AMD and macOS, avoids accidental CPU only runs on capable GPUs, and lays the foundations for Linux builds.</p>
<p><strong>Codec selection for depth exports:</strong></p>
<ul>
<li>The Depth tab now has a Video Codec dropdown.</li>
<li>You can select from hardware encoders (NVENC, AMF, QSV) as well as CPU encoders (libx264, libx265, AV1, and legacy MPEG-4 variants).</li>
<li>XVID and other problematic codecs now have safer defaults and better behavior on non NVIDIA systems.</li>
<li>AV1 has guard rails where OpenCV decoding is limited, with warnings where needed.</li>
<li>Codec support is now aligned with the 3D Converter and FrameTools.</li>
</ul>
<p><strong>Depth pipeline control:</strong></p>
<ul>
<li>Depth renders now support Pause, Resume and Cancel.</li>
<li>Pauses release resources more safely and cancels avoid corrupt output files.</li>
<li>Clear status states show when a job is running, paused, canceling or completed.</li>
</ul>
<hr>
<h2>4) 3D Pipeline and UX Polish</h2>
<p>The main 3D converter pipeline has been cleaned up and extended.</p>
<ul>
<li>New <strong>Keep Original Audio</strong> option to pass through source audio into the final 3D export without re encoding.</li>
<li>New <strong>image based 3D pipeline</strong> that runs through the same renderer, ideal for single frame 3D stills.</li>
<li>Mode selector is now wired to switch cleanly between Single, Batch and Image workflows inside the same UI.</li>
<li>A <strong>3D filename suffix system</strong> automatically labels exports by format and eye mode<br>
(examples: <code>_LRF_Full_SBS</code>, <code>_LRF_Half_SBS</code>, <code>_VR</code>, <code>_Anaglyph</code>, <code>_Interlaced</code>, <code>_LRF_Left</code>, <code>_LRF_Right</code>).</li>
<li>Multi language labels and tooltips across the app have been reviewed and cleaned up.</li>
</ul>
<hr>
<h2>5) Depth Blender Preview</h2>
<p>The Depth Blender tab has been upgraded into a more visual tool.</p>
<ul>
<li>Live preview now shows the base V2 depth map and the blended result side by side.</li>
<li>All blend parameters (white strength, feather blur, CLAHE, bilateral filters) update the preview in real time.</li>
<li>A frame scrubber lets you move through frames in a sequence and see how the blend behaves across time, before running a full batch on folders or videos.</li>
</ul>
<hr>
<h2>Summary</h2>
<p>VisionDepth3D v3.7 focuses on stability, cross platform support and workflow quality.</p>
<ul>
<li>Live 3D Capture is more stable, more accurate and closer to being stream ready.</li>
<li>The stereo pipeline has better temporal behavior and cleaner edges.</li>
<li>The Depth tab now runs on NVIDIA, AMD ROCm, Apple Silicon and CPU only setups with flexible codecs and playback options.</li>
<li>The 3D converter and Depth Blender both gained quality of life improvements that make it easier to preview, tune and export 3D content.</li>
</ul>
<p>These changes set the stage for future Linux builds, more advanced streaming paths and additional 3D presets in upcoming releases.</p>
<h2>How to Install</h2>
<ol>
<li>Go to the <a href="https://github.com/VisionDepth/VisionDepth3D/releases">VisionDepth3D Releases</a> page</li>
<li>Download the latest installer <code>.exe</code> and <code>.bin</code> parts</li>
<li>Place all files in the same folder</li>
<li>Run the <code>.exe</code> installer and follow the prompts</li>
<li>Launch VisionDepth3D from the Start Menu or Desktop shortcut</li>
</ol>
<p>Download VisionDepth3D Release Installer to simplify fetching most recent releases</p>
<p>For source installation and advanced setup see the <a href="https://github.com/VisionDepth/VisionDepth3D#installation">Installation Guide</a>.</p>VisionDepthtag:github.com,2008:Repository/918467518/Release-v3.6.22025-10-08T18:14:52ZVisionDepth3Dv3.6.2 - Release <p>VisionDepth3D v3.6.2 – Bug Patches & Cleanup</p>
<p>Adapters<br>
• Fixed Hugging Face call in depthanything_adapter.py.<br>
• Fixed Depth Anything V2 Giant download in VisionDepth3D.py.</p>
<p>UI & Codec<br>
• Fixed threaded render button.<br>
• Fixed codec bug where output wasn’t respecting selected codec.</p>
<p>Codebase Cleanup<br>
• Removed broken/unused DepthCrafter files (depth_crafter_ppl.py, depthcrafter_adapter.py, weights dir).<br>
• Cleaned up render_depth.py, dropped legacy/unused code.</p>
<p>Assets<br>
• Deleted old previews and logo icon.<br>
• Added updated UI photos.</p>VisionDepthtag:github.com,2008:Repository/918467518/Release-v3.62025-10-06T14:02:33ZVisionDepth3Dv3.6 - Release <h1>VisionDepth3D v3.6 Release</h1>
<p><strong>This update is all about <em>quality and speed</em>. A brand-new Depth Blender tab lets you mix models with precision for cleaner separation and smoother parallax, while HDR10 handling has been rebuilt to preserve true 10-bit color and metadata. The experimental Live 3D pipeline makes its debut, turning capture cards, consoles, and webcams into real-time 3D feeds. Upscaling and interpolation have been overhauled with threaded workers, dropping render times from 10 hours to ~1 hour on long projects. Add in clip-range rendering, direct Left/Right output, smarter padding, codec fixes, and a full UI overhaul — v3.6 is the most refined and flexible VisionDepth3D yet.</strong></p>
<hr>
<h2>1) Upscaling & Interpolation – Massive Speed Boost</h2>
<ul>
<li>Rewritten <strong>Frames tab pipeline</strong> with threaded workers + queues</li>
<li>RIFE, ESRGAN, and FFmpeg writing now run <strong>concurrently</strong> instead of sequentially</li>
<li>Intelligent frame indexing and buffering preserve order while maximizing throughput</li>
<li>Render time reduced from <strong>10 hours → ~1 hour</strong> on long clips</li>
<li><strong>Result:</strong> creators can upscale and interpolate full-length videos in a fraction of the time without crashes or dropped frames</li>
</ul>
<hr>
<h2>2) Depth Pipeline – Refinements & Blending</h2>
<ul>
<li>New <strong>Depth Blender tab</strong> with sliders for model blend weights</li>
<li>Improved <strong>16-bit depth output handling</strong> for smoother disparity</li>
<li>Early <strong>percentile clipping</strong> reduces outliers without flattening depth</li>
<li>Added <strong>Depth Anything V2 Giant</strong> model support</li>
<li>Added <strong>FP16 precision toggle</strong> for faster inference and reduced VRAM use</li>
<li><strong>Result:</strong> cleaner separation between foreground and background, less fuzz, and more consistent 3D parallax</li>
</ul>
<hr>
<h2>3) HDR10 Support – Preservation & Metadata</h2>
<ul>
<li>Fixed washed-out HDR outputs when re-encoding</li>
<li>Preserves:
<ul>
<li><strong>10-bit pixel format</strong> (<code>yuv420p10le</code>)</li>
<li><strong>BT.2020 color space</strong></li>
<li><strong>PQ curve</strong> (<code>smpte2084</code>)</li>
<li><strong>HDR metadata (Master Display / MaxCLL)</strong></li>
</ul>
</li>
<li>UI toggle: <strong>Preserve HDR10 Metadata</strong></li>
<li><strong>Result:</strong> HDR content now keeps its original punch and dynamic range</li>
</ul>
<hr>
<h2>4) Experimental Live 3D (WIP)</h2>
<ul>
<li>Added <strong>real-time 3D pipeline</strong> for external inputs (consoles, capture cards, webcams)</li>
<li>Uses <strong>Depth Anything v2 Small</strong> by default (swap models if GPU allows)</li>
<li>Stereo conversion powered by the <strong>VisionDepth3D method</strong></li>
<li>End-to-end capture → depth → stereo loop is working</li>
<li>Early tests show playable 3D console and video feeds</li>
<li><strong>Performance optimizations ongoing</strong> for fps, latency, and GPU acceleration</li>
</ul>
<hr>
<h2>5) General Fixes & Stability</h2>
<h3>Rendering</h3>
<ul>
<li>Restored <strong>Clip-range UI</strong> — set start/end times for partial renders</li>
<li>Added <strong>Left-only / Right-only</strong> output modes (no post-split required)</li>
<li>Extra <strong>padding + edge reflection</strong> reduce stereo bleed-through</li>
<li>Optimized per-eye resize, aspect ratio, and DOF/color grading checks</li>
<li>Fixed floating-window scaling in single-eye renders</li>
</ul>
<h3>UI & Error Handling</h3>
<ul>
<li>Patched <strong>white-edge artifact</strong> from 16-bit normalization</li>
<li>Better error handling when models fail to load</li>
<li>Synced <strong>language packs</strong> with new controls (HDR toggle, depth blender, etc.)</li>
<li>All buttons and inputs styled with a new <strong>dark theme</strong></li>
</ul>
<h3>Codec & Output</h3>
<ul>
<li>Fixed FFmpeg forcing slow presets on GPU codecs</li>
<li>NVENC now uses correct encoder flags (<code>preset p5</code>, <code>rc vbr</code>, <code>cq</code>)</li>
<li>CPU codecs retain CRF + preset for consistent quality</li>
</ul>
<hr>
<h2>6) UI & Workflow Enhancements</h2>
<ul>
<li>Full <strong>3D Generator tab UI overhaul</strong> for a cleaner look</li>
<li>Hotkeys to import video & depth maps directly into workflow</li>
<li>Save/load presets with one click</li>
<li>Reset button and quick navigation to docs, bug reports, and GitHub</li>
<li><strong>Result:</strong> smoother daily workflow and better testing inside VD3D</li>
</ul>
<hr>
<h2>Summary</h2>
<p>v3.6 delivers <strong>depth blending refinements</strong>, <strong>true HDR10 preservation</strong>, and <strong>massive speed boosts</strong> through concurrent processing.<br>
It restores <strong>clip-range flexibility</strong>, adds <strong>direct eye outputs</strong>, and debuts the first <strong>Live 3D pipeline</strong>, moving VisionDepth3D toward real-time stereo rendering.</p>
<hr>
<h2>How to Install</h2>
<ol>
<li>Go to the <a href="https://github.com/VisionDepth/VisionDepth3D/releases">VisionDepth3D Releases</a> page</li>
<li>Download the latest installer <code>.exe</code> and <code>.bin</code> parts</li>
<li>Place all files in the same folder</li>
<li>Run the <code>.exe</code> installer and follow the prompts</li>
<li>Launch VisionDepth3D from the Start Menu or Desktop shortcut</li>
</ol>
<p>Download VisionDepth3D Release Installer to simplify fetching most recent releases</p>
<p>For source installation and advanced setup see the <a href="https://github.com/VisionDepth/VisionDepth3D#installation">Installation Guide</a>.</p>VisionDepthtag:github.com,2008:Repository/918467518/Installer2025-12-14T18:30:17ZVisionDepth3D Setup Downloader<h1>VisionDepth3D Setup Instructions</h1>
<p>This tool is a <strong>Setup Downloader</strong>.<br>
It downloads the official VisionDepth3D installer files and then launches the setup wizard.</p>
<hr>
<h2>How to Install VisionDepth3D</h2>
<h3>1. Open the VisionDepth3D Setup Downloader</h3>
<ul>
<li>Select the <strong>latest release</strong> from the list.</li>
<li>Click <strong>Install</strong>.</li>
</ul>
<hr>
<h3>2. Download Setup Files</h3>
<ul>
<li>The setup files will download automatically.</li>
<li>When finished, the VisionDepth3D setup window (<strong>Inno Setup</strong>) will open.</li>
</ul>
<p><g-emoji class="g-emoji" alias="warning">⚠️</g-emoji> <strong>Do not re-run the Setup Downloader after this point.</strong></p>
<hr>
<h3>3. Complete the Setup Wizard</h3>
<ul>
<li>Choose where you want VisionDepth3D installed.</li>
<li>Follow the on-screen steps until installation is complete.</li>
</ul>
<hr>
<h3>4. Launch VisionDepth3D</h3>
<ul>
<li>If prompted, you may launch VisionDepth3D immediately after setup.</li>
<li>Otherwise, open it from the <strong>Start Menu</strong> or <strong>Desktop shortcut</strong> created during setup.</li>
</ul>
<hr>
<h3>5. (Optional) Clean Up Installer Files</h3>
<ul>
<li>After installation, you may use the <strong>“Remove installer files”</strong> button in the Setup Downloader.</li>
<li>This deletes the downloaded setup files only.</li>
<li><strong>This does NOT remove VisionDepth3D itself.</strong></li>
</ul>
<hr>
<h2>Important Notes</h2>
<ul>
<li>The <strong>Setup Downloader is NOT the VisionDepth3D application</strong>.</li>
<li>After installation, always launch VisionDepth3D from the <strong>Start Menu or Desktop</strong>, not by re-running the downloader.</li>
<li>If VisionDepth3D does not open when installed in <code>Program Files</code>, install it to <strong>Documents</strong> or another user folder to avoid Windows permission issues.</li>
<li>If needed, try <strong>right-click → Run as administrator</strong> when launching VisionDepth3D.</li>
</ul>
<hr>
<p><g-emoji class="g-emoji" alias="warning">⚠️</g-emoji> <strong>Do NOT select the Setup Downloader inside the Setup application drop down where you select latest version.</strong></p>VisionDepthtag:github.com,2008:Repository/918467518/Release-v3.52025-08-24T06:06:30ZVisionDepth3Dv3.5 - Release <h1>VisionDepth3D v3.5 Release</h1>
<p><strong>This update transforms VD3D into a cinematic 2D-to-3D studio. Depth of Field has been rebuilt for buttery smooth bokeh with motion-adaptive focus, the Audio Tool is now pro-level with codec, bitrate, and sync offset control, and GPU-accelerated color grading puts saturation, contrast, and brightness right inside the workflow. A new IPD stereo slider lets you dial in the perfect 3D strength for any screen or headset. Add in a streamlined ONNX pipeline, clip rendering tools, and polished multi-language support, and v3.5 delivers the most powerful, creator-focused VisionDepth3D yet.</strong></p>
<h2>1) Depth of Field (DOF) – Rewritten and Stabilized</h2>
<ul>
<li>Fully rewritten as a GPU-accelerated, multi-level Gaussian pipeline</li>
<li>Uses per-pixel interpolation between blur levels for smooth transitions</li>
<li>Added motion-adaptive focal tracking:
<ul>
<li>Exponential moving average (EMA) for stable focus</li>
<li>Deadband to ignore micro noise</li>
<li>Max-step limiter to prevent sudden focus jumps</li>
</ul>
</li>
<li>DOF now applies after stereo warp using warped per-eye depth</li>
<li>DOF slider maps directly to <code>max blur</code>; setting it to <code>0</code> disables DOF</li>
<li>Result: smoother bokeh, no ring artifacts, more natural focal transitions</li>
</ul>
<h2>2) Audio Tool – Revamp and Codec Control</h2>
<ul>
<li>Added progress bar for encoding and attaching audio</li>
<li>Codec and bitrate can now be selected before muxing
<ul>
<li><code>aac</code> (default) and <code>libmp3lame</code> supported</li>
<li>Configurable bitrate (128k, 192k, 320k)</li>
</ul>
</li>
<li>Added offset slider for real-time sync adjustment</li>
<li>Clear distinction between copy vs re-encode:
<ul>
<li>If codec/bitrate unchanged → fast copy (<code>-c copy</code>)</li>
<li>If changed → re-encode</li>
</ul>
</li>
<li>Safe handling of long videos (2+ hours) with progress feedback</li>
</ul>
<h2>3) Color Grading – GPU Accelerated and Integrated</h2>
<ul>
<li>Introduced GPU-accelerated color grading (<code>apply_color_grade</code>) with:
<ul>
<li>Saturation</li>
<li>Contrast</li>
<li>Brightness</li>
</ul>
</li>
<li>Color grading now applies after stereo warp and DOF, before packing</li>
<li>Added Preview GUI sync:
<ul>
<li>Sliders update live in preview with debounced re-rendering</li>
<li>Two-way binding with main UI controls</li>
</ul>
</li>
<li>Preset/save/load support extended to include color grading</li>
<li>Result: fine-tune the image directly inside VD3D without external tools</li>
</ul>
<h2>4) Stereo Separation (IPD Adjustment) – New 3D Control</h2>
<ul>
<li>Added Interpupillary Distance (IPD) adjustment slider</li>
<li>Works as a global scale factor on pixel shifts (foreground, midground, background)</li>
<li>Allows creators to:
<ul>
<li>Increase IPD for stronger 3D on large screens or VR</li>
<li>Reduce IPD for comfortable viewing on smaller displays</li>
</ul>
</li>
<li>Fully integrated into:
<ul>
<li>Preset system (save/load)</li>
<li>Preview GUI with real-time feedback</li>
<li>Tooltip and i18n system</li>
</ul>
</li>
<li>Result: match stereo depth strength to your display environment</li>
</ul>
<h2>5) General Fixes and Stability</h2>
<ul>
<li>Fixed tensor size mismatch crash in DOF when depth/resolution mismatched</li>
<li>Preview GUI sliders now wire correctly to main GUI sliders</li>
<li>Minor UI consistency fixes across tools</li>
<li>Language files clean-up:
<ul>
<li>Removed duplicate keys and aligned all translations with <code>en.json</code></li>
<li>Verified FR/DE/ES/JA language packs</li>
<li>Added missing entries (<code>Apply Entries</code>, <code>Start Batch Render</code>, scene detection, etc.)</li>
</ul>
</li>
</ul>
<h2>6) New Session Additions – ONNX Pipeline and UI Enhancements</h2>
<ul>
<li>ONNX Integration
<ul>
<li>Converted Video Depth Anything (pth → onnx) for faster inference</li>
<li>Optimized ONNX pipeline path for efficient runtime</li>
</ul>
</li>
<li>UI Enhancements
<ul>
<li>New start and end time controls inside Encoding Settings</li>
<li>Render short clips or preview segments without running full videos</li>
<li>Inputs section refactored into its own dedicated frame for clarity</li>
</ul>
</li>
<li>Result: streamlined workflow for experimenting with models, and flexible render ranges for testing</li>
</ul>
<hr>
<h2>Summary</h2>
<p>v3.4 gave creators fine-grained depth and subject control.<br>
v3.5 brings cinematic polish with stabilized DOF, a true audio tool with sync and codec options, a GPU color grading suite, stereo separation (IPD adjustment) for display comfort.</p>
<hr>
<h2>How to Install</h2>
<ol>
<li>Go to the <a href="https://github.com/VisionDepth/VisionDepth3D/releases">VisionDepth3D Releases</a> page</li>
<li>Download the latest installer <code>.exe</code> and <code>.bin</code> parts</li>
<li>Place all files in the same folder</li>
<li>Run the <code>.exe</code> installer and follow the prompts</li>
<li>Launch VisionDepth3D from the Start Menu or Desktop shortcut</li>
</ol>
<p>For source installation and advanced setup see the <a href="https://github.com/VisionDepth/VisionDepth3D#installation">Installation Guide</a> in the repository.</p>VisionDepthtag:github.com,2008:Repository/918467518/Release-v3.32025-08-12T13:42:14ZVisionDepth3D v3.3 - Release<h1>VisionDepth3D v3.3 — Stability, Accuracy & Artifact Reduction</h1>
<p>This update is a <strong>major overhaul</strong> to both the <strong>Depth Estimation Pipeline</strong> and the <strong>3D Rendering Pipeline</strong>, with a focus on <strong>stability</strong>, <strong>accuracy</strong>, and <strong>artifact reduction</strong>.</p>
<hr>
<h2>How to Install</h2>
<p>Because this package is larger than 2 GB, the installer is split into multiple files.</p>
<ol>
<li>Download <strong><code>VisionDepth3D_v3.3_WIN_x64_SETUP.exe</code></strong><br>
<strong>and</strong> all accompanying <code>.bin</code> files from the release page</li>
<li>Place them together in the same folder</li>
<li>Run the <code>.exe</code> and follow the on-screen instructions</li>
</ol>
<hr>
<h2>Depth Pipeline Updates</h2>
<h3><strong>Black Bar Cropping for Depth Estimation</strong></h3>
<ul>
<li>New <code>ignore_letterbox_bars</code> detects bars in the first non-empty frame</li>
<li>Crops top/bottom bars before sending frames to the depth model</li>
<li>Re-applies bars after processing with neutral depth values, preventing black regions from appearing closer or farther than the main scene</li>
</ul>
<h3><strong>Output Resolution Preservation</strong></h3>
<ul>
<li>Depth maps resized back to the original cropped resolution before re-adding bars</li>
<li>Ensures final depth video matches original width/height</li>
</ul>
<h3><strong>Safety Checks</strong></h3>
<ul>
<li>If bars exceed frame height or the frame is empty, bars reset to zero to prevent OpenCV errors</li>
</ul>
<h3><strong>Unified Depth-to-Grayscale Conversion</strong></h3>
<ul>
<li><code>convert_depth_to_grayscale()</code> now handles:
<ul>
<li><code>PIL.Image</code></li>
<li><code>torch.Tensor</code></li>
<li><code>numpy.ndarray</code></li>
</ul>
</li>
<li>Cleans NaN values and fixes shape inconsistencies</li>
<li>Centralized for consistent grayscale output</li>
</ul>
<h3><strong>Sidecar Metadata for Bars</strong></h3>
<ul>
<li>Saves <code>.letterbox.json</code> with <code>top</code>, <code>bottom</code>, and <code>original_resolution</code> next to the depth video</li>
</ul>
<hr>
<h2>3D Pipeline Updates</h2>
<h3><strong>Single or Batch Processing</strong></h3>
<ul>
<li>Process one video or queue multiple for 3D rendering</li>
</ul>
<h3><strong>Stability & Robustness</strong></h3>
<ul>
<li>Render loop wrapped in <code>try/except/finally</code> for guaranteed cleanup</li>
<li>Defensive init for <code>ffmpeg_proc</code> and <code>out</code></li>
<li>Early exit if <code>VideoWriter</code> fails</li>
<li>Pause handling keeps frame index + ETA/FPS updated</li>
<li>Cancel paths work during processing and pause</li>
<li>Automatic codec fallback if FFmpeg encoder is invalid</li>
</ul>
<h3><strong>Depth Map Processing</strong></h3>
<ul>
<li><code>TemporalDepthFilter</code> (EMA smoothing) reduces depth flicker</li>
<li>Percentile-based normalization for consistent depth range</li>
<li>Midtone shaping (gamma) improves depth layering</li>
<li>Optional curvature enhancement for roundness</li>
</ul>
<h3><strong>Stereo / Parallax Control</strong></h3>
<ul>
<li><code>ShiftSmoother</code> damps rapid disparity changes</li>
<li>Edge-aware masking + feathering reduces tearing</li>
<li>Dynamic IPD scaling adapts stereo strength</li>
<li>Subject-tracked zero parallax with window easing</li>
<li>Optional dynamic convergence bias</li>
<li>IPD factor knob for global stereo strength</li>
</ul>
<h3><strong>Image Quality</strong></h3>
<ul>
<li>GPU depth-of-field blur with Gaussian blending</li>
<li>Brightness-preserving sharpening with highlight protection</li>
</ul>
<h3><strong>Framing, Aspect & Output Formats</strong></h3>
<ul>
<li>Aspect-ratio safe resizing with <code>pad_to_aspect_ratio</code></li>
<li>Two modes:
<ul>
<li>Preserve Original Aspect Ratio</li>
<li>Target Output Aspect</li>
</ul>
</li>
<li>Formats:
<ul>
<li>Full-SBS</li>
<li>Half-SBS</li>
<li>VR 1440×1600</li>
<li>Dubois anaglyph</li>
<li>Passive interlaced</li>
</ul>
</li>
</ul>
<h3><strong>Encoding & I/O</strong></h3>
<ul>
<li>FFmpeg over stdin:
<ul>
<li>CRF for <code>libx*</code></li>
<li>CQ for NVENC with <code>-b:v 0</code></li>
</ul>
</li>
<li>CPU/GPU encoder mapping, OpenCV fallback</li>
</ul>
<h3><strong>UX / Telemetry</strong></h3>
<ul>
<li>Smooth, real-time progress/FPS/ETA — also while paused</li>
<li>More descriptive logging</li>
</ul>
<hr>
<h2>Download</h2>
<p><a href="https://github.com/VisionDepth/VisionDepth3D/releases">📥 Download VisionDepth3D v3.3</a></p>
<hr>
<p><strong>VisionDepth3D is free for personal and non-commercial use.</strong><br>
Commercial use or redistribution without consent is prohibited.</p>
<h2>What's Changed</h2>
<ul>
<li>V3.2.5 by <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/users/VisionDepth/hovercard" data-octo-click="hovercard-link-click" data-octo-dimensions="link_type:self" href="https://github.com/VisionDepth">@VisionDepth</a> in <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="3255234087" data-permission-text="Title is private" data-url="https://github.com/VisionDepth/VisionDepth3D/issues/48" data-hovercard-type="pull_request" data-hovercard-url="/VisionDepth/VisionDepth3D/pull/48/hovercard" href="https://github.com/VisionDepth/VisionDepth3D/pull/48">#48</a></li>
<li>V3.2.5 by <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/users/VisionDepth/hovercard" data-octo-click="hovercard-link-click" data-octo-dimensions="link_type:self" href="https://github.com/VisionDepth">@VisionDepth</a> in <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="3256387329" data-permission-text="Title is private" data-url="https://github.com/VisionDepth/VisionDepth3D/issues/49" data-hovercard-type="pull_request" data-hovercard-url="/VisionDepth/VisionDepth3D/pull/49/hovercard" href="https://github.com/VisionDepth/VisionDepth3D/pull/49">#49</a></li>
<li>V3.2.6 by <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/users/VisionDepth/hovercard" data-octo-click="hovercard-link-click" data-octo-dimensions="link_type:self" href="https://github.com/VisionDepth">@VisionDepth</a> in <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="3307253530" data-permission-text="Title is private" data-url="https://github.com/VisionDepth/VisionDepth3D/issues/52" data-hovercard-type="pull_request" data-hovercard-url="/VisionDepth/VisionDepth3D/pull/52/hovercard" href="https://github.com/VisionDepth/VisionDepth3D/pull/52">#52</a></li>
<li>V3.3 by <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/users/VisionDepth/hovercard" data-octo-click="hovercard-link-click" data-octo-dimensions="link_type:self" href="https://github.com/VisionDepth">@VisionDepth</a> in <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="3307259718" data-permission-text="Title is private" data-url="https://github.com/VisionDepth/VisionDepth3D/issues/53" data-hovercard-type="pull_request" data-hovercard-url="/VisionDepth/VisionDepth3D/pull/53/hovercard" href="https://github.com/VisionDepth/VisionDepth3D/pull/53">#53</a></li>
</ul>
<p><strong>Full Changelog</strong>: <a class="commit-link" href="https://github.com/VisionDepth/VisionDepth3D/compare/Release-v3.2.4...Release-v3.3"><tt>Release-v3.2.4...Release-v3.3</tt></a></p>VisionDepthtag:github.com,2008:Repository/918467518/Release-v3.2.42025-06-04T02:07:05ZVisionDepth3Dv3.2.4 - Release <h1>VisionDepth3D – Hybrid 2D-to-3D Converter</h1>
<p>Convert any 2D video into immersive stereoscopic 3D using AI-powered depth estimation, real-time preview, and fully customizable stereo controls — all built for creators, VR tinkerers, and 3D enthusiasts.</p>
<p>Whether you're producing content for VR headsets, YouTube 3D, or your own Blu-ray collection, <strong>VisionDepth3D</strong> delivers sharp, artifact-free 3D with GPU-accelerated tools and formats for every workflow.</p>
<p><em>Download .exe and both .bin files and make sure they are in the same folder before installing</em></p>
<hr>
<h2>Powered by the VisionDepth3D Method</h2>
<p>At the core of this app is the <strong>VisionDepth3D Method</strong> — a custom real-time rendering technique designed to produce smooth, eye-comfortable stereo from AI depth maps.</p>
<h3>Core Features of the Method:</h3>
<ul>
<li>Dynamic parallax scaling</li>
<li>Zero-parallax tracking</li>
<li>Edge-aware masking</li>
<li>Scene-aware stereo dampening</li>
</ul>
<p>Want to dive deeper into how it works?</p>
<p>📄 <strong><a href="/VisionDepth/VisionDepth3D/blob/Release-v3.2.4/VisionDepth3D_Method.md">Read the Method → VisionDepth3D_Method.md</a></strong></p>
<hr>
<h2>Features</h2>
<ul>
<li><strong>AI Depth Estimation</strong> — Supports 20+ models (DPT, MiDaS, Depth Anything v2, etc.) with CUDA acceleration</li>
<li><strong>Batch Processing</strong> — Smart VRAM-aware queueing for images & video</li>
<li><strong>3D Output Modes</strong> — Half-SBS, Full-SBS, Interlaced, Anaglyph</li>
<li><strong>Frame Interpolation</strong> — Smooth motion via ONNX RIFE (2×–8×)</li>
<li><strong>Super Resolution</strong> — Real-ESRGAN upscaling (e.g., 1080p → 4K)</li>
<li><strong>Parallax Tuning</strong> — Independent controls for foreground, midground, and background</li>
<li><strong>Smart Mask Effects</strong> — Built-in feathering and ghost suppression</li>
<li><strong>Audio Tools</strong> — Attach AAC, MP3, or WAV using the FFmpeg GUI</li>
<li><strong>Live Feedback</strong> — Real-time FPS, ETA, pause/resume/cancel support</li>
<li><strong>Preview Modes</strong> — Heatmap, SBS, Anaglyph, Interlaced</li>
<li><strong>Export-Ready</strong> — Output for YouTube 3D, Oculus Quest, and MP4/MKV/AVI with GPU encoding</li>
</ul>
<hr>
<h2>Free to Use. Built for Creators.</h2>
<p>If you find it helpful, consider donating — every bit goes toward:</p>
<ul>
<li>New hardware for testing</li>
<li>Supporting more depth models</li>
<li>Continued updates and features</li>
</ul>
<p>Thanks for supporting open 3D tools!</p>
<hr>
<h1>VisionDepth3D v3.2.4 – Changelog</h1>
<p><em>Note: Although the last official version was listed as 3.1.9, several intermediate patches were applied via GitHub and consolidated under version 3.2.4.</em></p>
<hr>
<h2>GUI Enhancements</h2>
<h3>Inference Steps Control</h3>
<ul>
<li>Introduced <code>inference_steps_entry</code> field to support user-defined inference steps for diffusion models.</li>
<li>Includes input validation and fallback handling.</li>
<li>Dynamically updates on <code><Return></code> and <code><FocusOut></code> events.</li>
</ul>
<h3>Resolution Dropdown Improvements</h3>
<ul>
<li>Expanded resolution options to include model-native sizes:
<ul>
<li><code>512x256</code>, <code>704x384</code>, <code>960x540</code>, <code>1024x576</code>, and others for improved performance and visual quality.</li>
</ul>
</li>
<li>Automatically strips display hints like <code>" (DC-Fastest)"</code> for cleaner parsing of dimensions.</li>
</ul>
<h3>CPU Offload Mode Selection</h3>
<ul>
<li>Added support for multiple modes:
<ul>
<li><code>"model"</code>, <code>"vae"</code>, <code>"unet"</code>, <code>"sequential"</code>, <code>"none"</code></li>
</ul>
</li>
<li>The selected value is passed directly to the pipeline logic via <code>offload_mode_dropdown.get()</code>.</li>
</ul>
<h3>Sidebar Layout</h3>
<ul>
<li>Sidebar width increased from <code>22</code> to <code>30</code> for improved component spacing and usability.</li>
</ul>
<hr>
<h2>DepthCrafter Integration (Work-in-Progress)</h2>
<h3>Pipeline integration</h3>
<ul>
<li><code>load_depthcrafter_pipeline()</code> now supports the following arguments:
<ul>
<li><code>inference_steps</code></li>
<li><code>offload_mode</code></li>
</ul>
</li>
<li>Additional parameters are currently hardcoded and will be configurable in future updates.</li>
<li>Device mapping is handled dynamically based on <code>offload_mode</code>:
<ul>
<li><code>"sequential"</code> runs all operations on the GPU.</li>
<li>Other modes selectively offload components to CPU to manage VRAM usage.</li>
</ul>
</li>
</ul>
<hr>
<h2>Stability Fixes and Improvements</h2>
<ul>
<li>Warm-up logic now includes spinner feedback to prevent GUI freeze during model loading.</li>
<li>All models, including local ones, now run reliably. However, local models still require manual configuration of inference resolution and batch size due to unresolved dynamic resolution handling.</li>
<li><code>invert_var</code> toggle is now functioning correctly for .</li>
<li>Subject depth smoothing introduced in the 3D pipeline to reduce temporal jitter in estimated depth maps.</li>
<li>Focal depth consistency added for stereo rendering: subject depth is now shared across both eye views.</li>
</ul>
<hr>
<h2>License Notice</h2>
<p><strong>VisionDepth3D is free for personal and non-commercial use only.</strong><br>
Commercial use, modification, or redistribution is not permitted without prior written consent.</p>
<p>Full license terms available in the <a href="https://github.com/VisionDepth/VisionDepth3D">GitHub repository</a>.</p>VisionDepth