tag:github.com,2008:https://github.com/VisionDepth/VisionDepth3D/releases Release notes from VisionDepth3D 2026-02-11T20:31:11Z tag:github.com,2008:Repository/918467518/Release-v3.8.2 2026-02-11T22:18:49Z VisionDepth3Dv3.8.2 Release <h1>VisionDepth3D v3.8.2</h1> <p>This release brings major depth engine upgrades, large real-time performance gains, and important stability fixes across both offline rendering and live 3D preview.</p> <p>Expect faster playback, cleaner depth output, improved codec reliability, and a smoother overall workflow.</p> <hr> <h2>New Depth Engines</h2> <h3>Depth Anything 3 (DA3) Integration</h3> <ul> <li>Native DA3 backend (not Hugging Face pipeline based)</li> <li>Supports DA3 Small, Base, Large, Giant, and Metric variants</li> <li>Proper resolution handling and depth normalization</li> <li>Faster warm-up and improved batching support</li> </ul> <h3>Video Depth Anything (VDA)</h3> <ul> <li>Sequence-aware video depth inference</li> <li>Temporal processing for smoother depth output</li> <li>Target FPS control for heavy footage</li> <li>Unified post-processing with other depth engines</li> </ul> <hr> <h2>Performance Improvements</h2> <h3>Live 3D</h3> <ul> <li>40 to 70 percent FPS increase on most GPUs</li> <li>Persistent GPU buffers eliminating per-frame allocations</li> <li>Smoother depth refresh scheduling</li> <li>Reduced jitter and stutter</li> </ul> <h3>Offline Depth and Rendering</h3> <ul> <li>Single-pass resizing reducing CPU overhead</li> <li>Faster FP16 GPU inference</li> <li>Optimized ONNX runtime sessions</li> <li>FFmpeg piping enabled by default for faster encoding</li> </ul> <hr> <h2>Stability and Quality Fixes</h2> <h3>ONNX Models</h3> <ul> <li>Fixed Distill-Any-Depth shape mismatch crashes</li> <li>Enforced correct inference resolution (518×518)</li> <li>Aspect-ratio safe preprocessing without stretching</li> <li>Cleaner backend detection and logging</li> </ul> <h3>Letterbox Handling</h3> <ul> <li>Black bars no longer corrupt 
depth maps</li> <li>Neutral depth fill prevents white banding artifacts</li> <li>Stable detection across frames</li> </ul> <hr> <h2>3D Generator Improvements</h2> <ul> <li>Full render state reset per clip to eliminate drift artifacts</li> <li>Smoother convergence and floating window behavior</li> <li>Corrected output sizing for VR, SBS, and interlaced modes</li> <li>Optional convergence crosshair overlay in Preview GUI</li> <li>Cleaner encoding settings layout</li> </ul> <hr> <h2>GUI and Workflow Enhancements</h2> <ul> <li>Depth Estimation tab renamed to <strong>Depth Engine</strong></li> <li>Fixed preset loading from menu dropdown</li> <li>Fixed Output Path menu action</li> <li>Simplified File menu using preset system only</li> <li>Built-in VisionDepth3D Updater in Help menu</li> <li>Confirmation prompt before updating</li> </ul> <hr> <h2>Upgrade Note</h2> <p>Back up your <code>weights/</code> and <code>presets/</code> folders before uninstalling older versions.</p> <p>Use <strong>VisionDepth3D_Setup_Downloader</strong> to install v3.8.2 and required <code>.bin</code> files.</p> <hr> <p>Thanks to everyone supporting VisionDepth3D and helping shape each release.</p> VisionDepth tag:github.com,2008:Repository/918467518/Release-v3.8.1 2025-12-28T17:42:36Z VisionDepth3Dv3.8.1 - Release Bug patch <h1>VisionDepth3D v3.8 – Bug Patch</h1> <hr> <h2>1) Depth Estimation Inference Error</h2> <ul> <li>Fixed a progress bar bug in depth estimation that caused an error during inference</li> </ul> <hr> <blockquote> <p>Back up your <code>weights/</code> and <code>presets/</code> folders before uninstalling v3.8<br> Then run VisionDepth3D_Setup_Downloader to download the official<br> VisionDepth3D v3.8.1 Windows installer and required <code>.bin</code> files.</p> </blockquote> VisionDepth tag:github.com,2008:Repository/918467518/Release-v3.8 2025-12-18T02:28:52Z VisionDepth3Dv3.8 - Release <h1>VisionDepth3D v3.8 – Changelog</h1> <hr> <h2>1) Depth Estimation Tab</h2> <h3>Depth Models</h3> <ul> 
<li>Fixed ONNX model loading: <ul> <li>Distill-Any-Depth (inference resolution 518×518, batch size 8)</li> <li>Video Depth Anything (inference resolution 512×288, batch size 8)</li> </ul> </li> <li>Implemented LBM depth model (dev version). Thanks to Aether for the implementation fix.</li> <li>Removed depth models from the dropdown that returned no <code>d_type</code>.</li> <li>Fixed Hugging Face model downloads and caching so zoo models consistently save inside the app <code>weights/</code> directory (no more extra <code>.cache</code> downloads).</li> <li>Updated Transformers image processor loading to prefer <code>use_fast=True</code> when available (with automatic fallback when unsupported).</li> </ul> <h3>Depth Backend</h3> <ul> <li>Implemented temporal smoothing in the depth pipeline to reduce flicker and improve temporal stability of depth map output.</li> <li>Packaged VisionDepth3D.exe with Distill-Any-Depth (ONNX), Video Depth Anything (ONNX), and Depth Anything v2 Giant weights.</li> </ul> <hr> <h2>2) 3D Render Tab</h2> <h3>UI Fixes</h3> <ul> <li>Added buttons for encoder settings and processing options.</li> <li>Implemented multi-language support and tooltips for new dialog boxes.</li> <li>Adjusted preview image window size and video info layout to prevent window overflow.</li> <li>3D tab columns now stack correctly when resizing the window on smaller screens.</li> </ul> <h3>3D Backend</h3> <ul> <li>Reworked Auto Crop Black Bars to use first-frame detection with cached crop reuse.</li> <li>Prevents per-frame crop jitter and depth/frame misalignment.</li> <li>Improves stability for cinema content with subtle letterboxing.</li> <li>Keep Audio checkbox now respects the user-selected output container instead of forcing MP4.</li> </ul> <hr> <h2>3) Frametool Backend</h2> <ul> <li>Reworked Frametool backend to support SSResNet models for feature model integration.</li> </ul> <hr> <h2>4) Console Improvements</h2> <ul> <li>Standardized startup console messages to 
clearly reflect which subsystems are initializing (Torch, depth estimation, upscaler, external 3D pipeline, language, settings).</li> <li>Unified compute device reporting across pipelines for consistent and clearer console output.</li> <li>Suppressed optional xFormers dependency warning on startup.</li> <li>Prevented duplicate language loading during settings restore.</li> </ul> <hr> <h2>Summary</h2> <p>v3.8 focuses on stabilizing depth estimation, improving model compatibility,<br> and refining the 3D Render tab UI with better layout behavior, clearer diagnostics, and improved localization support.</p> <blockquote> <p>Back up your <code>weights/</code> and <code>presets/</code> folders before uninstalling v3.7.<br> Then run VisionDepth3D_Setup_Downloader to download the official<br> VisionDepth3D v3.8 Windows installer and required <code>.bin</code> files.</p> </blockquote> <blockquote> <p>(Optional but recommended) Clear the Hugging Face cache to free space and<br> avoid duplicate model downloads:<br> <code>C:\Users\YOUR_USERNAME\.cache\huggingface</code></p> </blockquote> VisionDepth tag:github.com,2008:Repository/918467518/Release-v3.7 2025-11-26T17:27:10Z VisionDepth3Dv3.7 - Release <h1>VisionDepth3D v3.7 – Release Changelog</h1> <hr> <h2>1) Live 3D Capture Overhaul</h2> <p>Live 3D Capture received a full stability and quality pass.</p> <p><strong>What is new:</strong></p> <ul> <li>Optional live audio passthrough for external capture devices, with device selection and audio delay control.</li> <li>Audio routed through DirectShow and WASAPI, with an FFplay based monitor for low latency listening.</li> <li>Color channel controls to fix purple and red tint issues on some capture cards.</li> <li>Tuning for real time depth inference so Live Capture can run at practical frame rates on 1080p HDMI sources.</li> <li>A headless mode (<code>--no-preview</code>) so you can run capture without a local preview window.</li> <li>Early groundwork for browser based SBS VR 
streaming with synchronized audio and video.</li> </ul> <p><strong>What is fixed:</strong></p> <ul> <li>GUI settings (resolution, backend, FPS and more) are now correctly applied when starting Live Capture from the UI.</li> <li>Capture failures like “no frames arriving” are resolved by enforcing the correct fourcc and backend.</li> <li>Audio is now present in Live Capture sessions instead of silent output.</li> <li>Frame pacing is smoother and depth plus stereo warp no longer hit the same FPS bottlenecks as before.</li> </ul> <hr> <h2>2) Floating Window, Depth Stability and Black Bar Handling</h2> <p>The stereo presentation pipeline has been tightened up for more comfortable 3D.</p> <p><strong>Dynamic Floating Window (DFW):</strong></p> <ul> <li>Rebuilt the floating window logic so it masks only one edge at a time, based on the dominant parallax direction.</li> <li>Adds a minimum parallax threshold so the window stays off when depth is near the screen plane.</li> <li>Uses temporal smoothing and easing so the window glides in and out instead of popping or flickering.</li> <li>Supports both soft faded edges and solid black cinema bars through a single toggle.</li> </ul> <p>Result: fewer edge violations, a cleaner frame in VR and on monitors, and a more cinema friendly presentation.</p> <p><strong>Frame jitter and temporal stability:</strong></p> <ul> <li>Fixed depth “breathing” where scenes would appear to move in and out over time.</li> <li>Introduced several smoothing passes over subject depth, depth percentiles and convergence.</li> <li>Added a global parallax smoother for foreground, midground and background layers.</li> </ul> <p>Result: more stable parallax over time, less shimmer and a more comfortable stereo experience.</p> <p><strong>Auto crop for black bars:</strong></p> <ul> <li>Improved black bar detection during fades and dark transitions.</li> <li>Added guards so detection does not update on very dark frames.</li> <li>Handles changes in letterbox height 
without vertical drift.</li> </ul> <p>Result: 2.35:1 and similar letterboxed content now auto crops in a reliable and repeatable way.</p> <hr> <h2>3) Unified Depth Pipeline and Platform Support</h2> <p>The Depth tab has been upgraded into a unified, cross platform pipeline.</p> <p><strong>Multi backend support (CUDA, ROCm, MPS, CPU):</strong></p> <ul> <li>Device detection has been rewritten so CUDA is no longer assumed by default.</li> <li>The app now picks the best available backend automatically.</li> </ul> <p>Supported depth backends:</p> <ul> <li>CUDA on NVIDIA GPUs</li> <li>ROCm on AMD GPUs</li> <li>MPS on Apple Silicon</li> <li>CPU fallback when no GPU is present</li> </ul> <p>This prevents crashes on AMD and macOS, avoids accidental CPU only runs on capable GPUs, and lays the foundations for Linux builds.</p> <p><strong>Codec selection for depth exports:</strong></p> <ul> <li>The Depth tab now has a Video Codec dropdown.</li> <li>You can select from hardware encoders (NVENC, AMF, QSV) as well as CPU encoders (libx264, libx265, AV1, and legacy MPEG-4 variants).</li> <li>XVID and other problematic codecs now have safer defaults and better behavior on non NVIDIA systems.</li> <li>AV1 has guard rails where OpenCV decoding is limited, with warnings where needed.</li> <li>Codec support is now aligned with the 3D Converter and FrameTools.</li> </ul> <p><strong>Depth pipeline control:</strong></p> <ul> <li>Depth renders now support Pause, Resume and Cancel.</li> <li>Pauses release resources more safely and cancels avoid corrupt output files.</li> <li>Clear status states show when a job is running, paused, canceling or completed.</li> </ul> <hr> <h2>4) 3D Pipeline and UX Polish</h2> <p>The main 3D converter pipeline has been cleaned up and extended.</p> <ul> <li>New <strong>Keep Original Audio</strong> option to pass through source audio into the final 3D export without re encoding.</li> <li>New <strong>image based 3D pipeline</strong> that runs through the same 
renderer, ideal for single frame 3D stills.</li> <li>Mode selector is now wired to switch cleanly between Single, Batch and Image workflows inside the same UI.</li> <li>A <strong>3D filename suffix system</strong> automatically labels exports by format and eye mode<br> (examples: <code>_LRF_Full_SBS</code>, <code>_LRF_Half_SBS</code>, <code>_VR</code>, <code>_Anaglyph</code>, <code>_Interlaced</code>, <code>_LRF_Left</code>, <code>_LRF_Right</code>).</li> <li>Multi language labels and tooltips across the app have been reviewed and cleaned up.</li> </ul> <hr> <h2>5) Depth Blender Preview</h2> <p>The Depth Blender tab has been upgraded into a more visual tool.</p> <ul> <li>Live preview now shows the base V2 depth map and the blended result side by side.</li> <li>All blend parameters (white strength, feather blur, CLAHE, bilateral filters) update the preview in real time.</li> <li>A frame scrubber lets you move through frames in a sequence and see how the blend behaves across time, before running a full batch on folders or videos.</li> </ul> <hr> <h2>Summary</h2> <p>VisionDepth3D v3.7 focuses on stability, cross platform support and workflow quality.</p> <ul> <li>Live 3D Capture is more stable, more accurate and closer to being stream ready.</li> <li>The stereo pipeline has better temporal behavior and cleaner edges.</li> <li>The Depth tab now runs on NVIDIA, AMD ROCm, Apple Silicon and CPU only setups with flexible codecs and playback options.</li> <li>The 3D converter and Depth Blender both gained quality of life improvements that make it easier to preview, tune and export 3D content.</li> </ul> <p>These changes set the stage for future Linux builds, more advanced streaming paths and additional 3D presets in upcoming releases.</p> <h2>How to Install</h2> <ol> <li>Go to the <a href="https://github.com/VisionDepth/VisionDepth3D/releases">VisionDepth3D Releases</a> page</li> <li>Download the latest installer <code>.exe</code> and <code>.bin</code> parts</li> <li>Place 
all files in the same folder</li> <li>Run the <code>.exe</code> installer and follow the prompts</li> <li>Launch VisionDepth3D from the Start Menu or Desktop shortcut</li> </ol> <p>Download the VisionDepth3D Release Installer to simplify fetching the most recent releases.</p> <p>For source installation and advanced setup see the <a href="https://github.com/VisionDepth/VisionDepth3D#installation">Installation Guide</a>.</p> VisionDepth tag:github.com,2008:Repository/918467518/Release-v3.6.2 2025-10-08T18:14:52Z VisionDepth3Dv3.6.2 - Release <p>VisionDepth3D v3.6.2 – Bug Patches &amp; Cleanup</p> <p>Adapters<br> • Fixed Hugging Face call in depthanything_adapter.py.<br> • Fixed Depth Anything V2 Giant download in VisionDepth3D.py.</p> <p>UI &amp; Codec<br> • Fixed threaded render button.<br> • Fixed codec bug where output wasn’t respecting selected codec.</p> <p>Codebase Cleanup<br> • Removed broken/unused DepthCrafter files (depth_crafter_ppl.py, depthcrafter_adapter.py, weights dir).<br> • Cleaned up render_depth.py, dropped legacy/unused code.</p> <p>Assets<br> • Deleted old previews and logo icon.<br> • Added updated UI photos.</p> VisionDepth tag:github.com,2008:Repository/918467518/Release-v3.6 2025-10-06T14:02:33Z VisionDepth3Dv3.6 - Release <h1>VisionDepth3D v3.6 Release</h1> <p><strong>This update is all about <em>quality and speed</em>. A brand-new Depth Blender tab lets you mix models with precision for cleaner separation and smoother parallax, while HDR10 handling has been rebuilt to preserve true 10-bit color and metadata. The experimental Live 3D pipeline makes its debut, turning capture cards, consoles, and webcams into real-time 3D feeds. Upscaling and interpolation have been overhauled with threaded workers, dropping render times from 10 hours to ~1 hour on long projects. 
Add in clip-range rendering, direct Left/Right output, smarter padding, codec fixes, and a full UI overhaul — v3.6 is the most refined and flexible VisionDepth3D yet.</strong></p> <hr> <h2>1) Upscaling &amp; Interpolation – Massive Speed Boost</h2> <ul> <li>Rewritten <strong>Frames tab pipeline</strong> with threaded workers + queues</li> <li>RIFE, ESRGAN, and FFmpeg writing now run <strong>concurrently</strong> instead of sequentially</li> <li>Intelligent frame indexing and buffering preserve order while maximizing throughput</li> <li>Render time reduced from <strong>10 hours → ~1 hour</strong> on long clips</li> <li><strong>Result:</strong> creators can upscale and interpolate full-length videos in a fraction of the time without crashes or dropped frames</li> </ul> <hr> <h2>2) Depth Pipeline – Refinements &amp; Blending</h2> <ul> <li>New <strong>Depth Blender tab</strong> with sliders for model blend weights</li> <li>Improved <strong>16-bit depth output handling</strong> for smoother disparity</li> <li>Early <strong>percentile clipping</strong> reduces outliers without flattening depth</li> <li>Added <strong>Depth Anything V2 Giant</strong> model support</li> <li>Added <strong>FP16 precision toggle</strong> for faster inference and reduced VRAM use</li> <li><strong>Result:</strong> cleaner separation between foreground and background, less fuzz, and more consistent 3D parallax</li> </ul> <hr> <h2>3) HDR10 Support – Preservation &amp; Metadata</h2> <ul> <li>Fixed washed-out HDR outputs when re-encoding</li> <li>Preserves: <ul> <li><strong>10-bit pixel format</strong> (<code>yuv420p10le</code>)</li> <li><strong>BT.2020 color space</strong></li> <li><strong>PQ curve</strong> (<code>smpte2084</code>)</li> <li><strong>HDR metadata (Master Display / MaxCLL)</strong></li> </ul> </li> <li>UI toggle: <strong>Preserve HDR10 Metadata</strong></li> <li><strong>Result:</strong> HDR content now keeps its original punch and dynamic range</li> </ul> <hr> <h2>4) Experimental Live 
3D (WIP)</h2> <ul> <li>Added <strong>real-time 3D pipeline</strong> for external inputs (consoles, capture cards, webcams)</li> <li>Uses <strong>Depth Anything v2 Small</strong> by default (swap models if GPU allows)</li> <li>Stereo conversion powered by the <strong>VisionDepth3D method</strong></li> <li>End-to-end capture → depth → stereo loop is working</li> <li>Early tests show playable 3D console and video feeds</li> <li><strong>Performance optimizations ongoing</strong> for fps, latency, and GPU acceleration</li> </ul> <hr> <h2>5) General Fixes &amp; Stability</h2> <h3>Rendering</h3> <ul> <li>Restored <strong>Clip-range UI</strong> — set start/end times for partial renders</li> <li>Added <strong>Left-only / Right-only</strong> output modes (no post-split required)</li> <li>Extra <strong>padding + edge reflection</strong> reduce stereo bleed-through</li> <li>Optimized per-eye resize, aspect ratio, and DOF/color grading checks</li> <li>Fixed floating-window scaling in single-eye renders</li> </ul> <h3>UI &amp; Error Handling</h3> <ul> <li>Patched <strong>white-edge artifact</strong> from 16-bit normalization</li> <li>Better error handling when models fail to load</li> <li>Synced <strong>language packs</strong> with new controls (HDR toggle, depth blender, etc.)</li> <li>All buttons and inputs styled with a new <strong>dark theme</strong></li> </ul> <h3>Codec &amp; Output</h3> <ul> <li>Fixed FFmpeg forcing slow presets on GPU codecs</li> <li>NVENC now uses correct encoder flags (<code>preset p5</code>, <code>rc vbr</code>, <code>cq</code>)</li> <li>CPU codecs retain CRF + preset for consistent quality</li> </ul> <hr> <h2>6) UI &amp; Workflow Enhancements</h2> <ul> <li>Full <strong>3D Generator tab UI overhaul</strong> for a cleaner look</li> <li>Hotkeys to import video &amp; depth maps directly into workflow</li> <li>Save/load presets with one click</li> <li>Reset button and quick navigation to docs, bug reports, and GitHub</li> <li><strong>Result:</strong> 
smoother daily workflow and better testing inside VD3D</li> </ul> <hr> <h2>Summary</h2> <p>v3.6 delivers <strong>depth blending refinements</strong>, <strong>true HDR10 preservation</strong>, and <strong>massive speed boosts</strong> through concurrent processing.<br> It restores <strong>clip-range flexibility</strong>, adds <strong>direct eye outputs</strong>, and debuts the first <strong>Live 3D pipeline</strong>, moving VisionDepth3D toward real-time stereo rendering.</p> <hr> <h2>How to Install</h2> <ol> <li>Go to the <a href="https://github.com/VisionDepth/VisionDepth3D/releases">VisionDepth3D Releases</a> page</li> <li>Download the latest installer <code>.exe</code> and <code>.bin</code> parts</li> <li>Place all files in the same folder</li> <li>Run the <code>.exe</code> installer and follow the prompts</li> <li>Launch VisionDepth3D from the Start Menu or Desktop shortcut</li> </ol> <p>Download the VisionDepth3D Release Installer to simplify fetching the most recent releases.</p> <p>For source installation and advanced setup see the <a href="https://github.com/VisionDepth/VisionDepth3D#installation">Installation Guide</a>.</p> VisionDepth tag:github.com,2008:Repository/918467518/Installer 2025-12-14T18:30:17Z VisionDepth3D Setup Downloader <h1>VisionDepth3D Setup Instructions</h1> <p>This tool is a <strong>Setup Downloader</strong>.<br> It downloads the official VisionDepth3D installer files and then launches the setup wizard.</p> <hr> <h2>How to Install VisionDepth3D</h2> <h3>1. Open the VisionDepth3D Setup Downloader</h3> <ul> <li>Select the <strong>latest release</strong> from the list.</li> <li>Click <strong>Install</strong>.</li> </ul> <hr> <h3>2. 
Download Setup Files</h3> <ul> <li>The setup files will download automatically.</li> <li>When finished, the VisionDepth3D setup window (<strong>Inno Setup</strong>) will open.</li> </ul> <p><g-emoji class="g-emoji" alias="warning">⚠️</g-emoji> <strong>Do not re-run the Setup Downloader after this point.</strong></p> <hr> <h3>3. Complete the Setup Wizard</h3> <ul> <li>Choose where you want VisionDepth3D installed.</li> <li>Follow the on-screen steps until installation is complete.</li> </ul> <hr> <h3>4. Launch VisionDepth3D</h3> <ul> <li>If prompted, you may launch VisionDepth3D immediately after setup.</li> <li>Otherwise, open it from the <strong>Start Menu</strong> or <strong>Desktop shortcut</strong> created during setup.</li> </ul> <hr> <h3>5. (Optional) Clean Up Installer Files</h3> <ul> <li>After installation, you may use the <strong>“Remove installer files”</strong> button in the Setup Downloader.</li> <li>This deletes the downloaded setup files only.</li> <li><strong>This does NOT remove VisionDepth3D itself.</strong></li> </ul> <hr> <h2>Important Notes</h2> <ul> <li>The <strong>Setup Downloader is NOT the VisionDepth3D application</strong>.</li> <li>After installation, always launch VisionDepth3D from the <strong>Start Menu or Desktop</strong>, not by re-running the downloader.</li> <li>If VisionDepth3D does not open when installed in <code>Program Files</code>, install it to <strong>Documents</strong> or another user folder to avoid Windows permission issues.</li> <li>If needed, try <strong>right-click → Run as administrator</strong> when launching VisionDepth3D.</li> </ul> <hr> <p><g-emoji class="g-emoji" alias="warning">⚠️</g-emoji> <strong>Do NOT select the Setup Downloader entry in the Setup application's drop-down list where you choose the latest version.</strong></p> VisionDepth tag:github.com,2008:Repository/918467518/Release-v3.5 2025-08-24T06:06:30Z VisionDepth3Dv3.5 - Release <h1>VisionDepth3D v3.5 Release</h1> <p><strong>This update transforms VD3D into a 
cinematic 2D-to-3D studio. Depth of Field has been rebuilt for buttery smooth bokeh with motion-adaptive focus, the Audio Tool is now pro-level with codec, bitrate, and sync offset control, and GPU-accelerated color grading puts saturation, contrast, and brightness right inside the workflow. A new IPD stereo slider lets you dial in the perfect 3D strength for any screen or headset. Add in a streamlined ONNX pipeline, clip rendering tools, and polished multi-language support, and v3.5 delivers the most powerful, creator-focused VisionDepth3D yet.</strong></p> <h2>1) Depth of Field (DOF) – Rewritten and Stabilized</h2> <ul> <li>Fully rewritten as a GPU-accelerated, multi-level Gaussian pipeline</li> <li>Uses per-pixel interpolation between blur levels for smooth transitions</li> <li>Added motion-adaptive focal tracking: <ul> <li>Exponential moving average (EMA) for stable focus</li> <li>Deadband to ignore micro noise</li> <li>Max-step limiter to prevent sudden focus jumps</li> </ul> </li> <li>DOF now applies after stereo warp using warped per-eye depth</li> <li>DOF slider maps directly to <code>max blur</code>; setting it to <code>0</code> disables DOF</li> <li>Result: smoother bokeh, no ring artifacts, more natural focal transitions</li> </ul> <h2>2) Audio Tool – Revamp and Codec Control</h2> <ul> <li>Added progress bar for encoding and attaching audio</li> <li>Codec and bitrate can now be selected before muxing <ul> <li><code>aac</code> (default) and <code>libmp3lame</code> supported</li> <li>Configurable bitrate (128k, 192k, 320k)</li> </ul> </li> <li>Added offset slider for real-time sync adjustment</li> <li>Clear distinction between copy vs re-encode: <ul> <li>If codec/bitrate unchanged → fast copy (<code>-c copy</code>)</li> <li>If changed → re-encode</li> </ul> </li> <li>Safe handling of long videos (2+ hours) with progress feedback</li> </ul> <h2>3) Color Grading – GPU Accelerated and Integrated</h2> <ul> <li>Introduced GPU-accelerated color grading 
(<code>apply_color_grade</code>) with: <ul> <li>Saturation</li> <li>Contrast</li> <li>Brightness</li> </ul> </li> <li>Color grading now applies after stereo warp and DOF, before packing</li> <li>Added Preview GUI sync: <ul> <li>Sliders update live in preview with debounced re-rendering</li> <li>Two-way binding with main UI controls</li> </ul> </li> <li>Preset/save/load support extended to include color grading</li> <li>Result: fine-tune the image directly inside VD3D without external tools</li> </ul> <h2>4) Stereo Separation (IPD Adjustment) – New 3D Control</h2> <ul> <li>Added Interpupillary Distance (IPD) adjustment slider</li> <li>Works as a global scale factor on pixel shifts (foreground, midground, background)</li> <li>Allows creators to: <ul> <li>Increase IPD for stronger 3D on large screens or VR</li> <li>Reduce IPD for comfortable viewing on smaller displays</li> </ul> </li> <li>Fully integrated into: <ul> <li>Preset system (save/load)</li> <li>Preview GUI with real-time feedback</li> <li>Tooltip and i18n system</li> </ul> </li> <li>Result: match stereo depth strength to your display environment</li> </ul> <h2>5) General Fixes and Stability</h2> <ul> <li>Fixed tensor size mismatch crash in DOF when depth/resolution mismatched</li> <li>Preview GUI sliders now wire correctly to main GUI sliders</li> <li>Minor UI consistency fixes across tools</li> <li>Language files clean-up: <ul> <li>Removed duplicate keys and aligned all translations with <code>en.json</code></li> <li>Verified FR/DE/ES/JA language packs</li> <li>Added missing entries (<code>Apply Entries</code>, <code>Start Batch Render</code>, scene detection, etc.)</li> </ul> </li> </ul> <h2>6) New Session Additions – ONNX Pipeline and UI Enhancements</h2> <ul> <li>ONNX Integration <ul> <li>Converted Video Depth Anything (pth → onnx) for faster inference</li> <li>Optimized ONNX pipeline path for efficient runtime</li> </ul> </li> <li>UI Enhancements <ul> <li>New start and end time controls inside Encoding 
Settings</li> <li>Render short clips or preview segments without running full videos</li> <li>Inputs section refactored into its own dedicated frame for clarity</li> </ul> </li> <li>Result: streamlined workflow for experimenting with models, and flexible render ranges for testing</li> </ul> <hr> <h2>Summary</h2> <p>v3.4 gave creators fine-grained depth and subject control.<br> v3.5 brings cinematic polish with stabilized DOF, a true audio tool with sync and codec options, a GPU color grading suite, and stereo separation (IPD adjustment) for display comfort.</p> <hr> <h2>How to Install</h2> <ol> <li>Go to the <a href="https://github.com/VisionDepth/VisionDepth3D/releases">VisionDepth3D Releases</a> page</li> <li>Download the latest installer <code>.exe</code> and <code>.bin</code> parts</li> <li>Place all files in the same folder</li> <li>Run the <code>.exe</code> installer and follow the prompts</li> <li>Launch VisionDepth3D from the Start Menu or Desktop shortcut</li> </ol> <p>For source installation and advanced setup see the <a href="https://github.com/VisionDepth/VisionDepth3D#installation">Installation Guide</a> in the repository.</p> VisionDepth tag:github.com,2008:Repository/918467518/Release-v3.3 2025-08-12T13:42:14Z VisionDepth3D v3.3 - Release <h1>VisionDepth3D v3.3 — Stability, Accuracy &amp; Artifact Reduction</h1> <p>This update is a <strong>major overhaul</strong> to both the <strong>Depth Estimation Pipeline</strong> and the <strong>3D Rendering Pipeline</strong>, with a focus on <strong>stability</strong>, <strong>accuracy</strong>, and <strong>artifact reduction</strong>.</p> <hr> <h2>How to Install</h2> <p>Because this package is larger than 2 GB, the installer is split into multiple files.</p> <ol> <li>Download <strong><code>VisionDepth3D_v3.3_WIN_x64_SETUP.exe</code></strong><br> <strong>and</strong> all accompanying <code>.bin</code> files from the release page</li> <li>Place them together in the same folder</li> <li>Run the <code>.exe</code> and 
follow the on-screen instructions</li> </ol> <hr> <h2>Depth Pipeline Updates</h2> <h3><strong>Black Bar Cropping for Depth Estimation</strong></h3> <ul> <li>New <code>ignore_letterbox_bars</code> detects bars in the first non-empty frame</li> <li>Crops top/bottom bars before sending frames to the depth model</li> <li>Re-applies bars after processing with neutral depth values, preventing black regions from appearing closer or farther than the main scene</li> </ul> <h3><strong>Output Resolution Preservation</strong></h3> <ul> <li>Depth maps resized back to the original cropped resolution before re-adding bars</li> <li>Ensures final depth video matches original width/height</li> </ul> <h3><strong>Safety Checks</strong></h3> <ul> <li>If bars exceed frame height or the frame is empty, bars reset to zero to prevent OpenCV errors</li> </ul> <h3><strong>Unified Depth-to-Grayscale Conversion</strong></h3> <ul> <li><code>convert_depth_to_grayscale()</code> now handles: <ul> <li><code>PIL.Image</code></li> <li><code>torch.Tensor</code></li> <li><code>numpy.ndarray</code></li> </ul> </li> <li>Cleans NaN values and fixes shape inconsistencies</li> <li>Centralized for consistent grayscale output</li> </ul> <h3><strong>Sidecar Metadata for Bars</strong></h3> <ul> <li>Saves <code>.letterbox.json</code> with <code>top</code>, <code>bottom</code>, and <code>original_resolution</code> next to the depth video</li> </ul> <hr> <h2>3D Pipeline Updates</h2> <h3><strong>Single or Batch Processing</strong></h3> <ul> <li>Process one video or queue multiple for 3D rendering</li> </ul> <h3><strong>Stability &amp; Robustness</strong></h3> <ul> <li>Render loop wrapped in <code>try/except/finally</code> for guaranteed cleanup</li> <li>Defensive init for <code>ffmpeg_proc</code> and <code>out</code></li> <li>Early exit if <code>VideoWriter</code> fails</li> <li>Pause handling keeps frame index + ETA/FPS updated</li> <li>Cancel paths work during processing and pause</li> <li>Automatic codec 
fallback if FFmpeg encoder is invalid</li> </ul> <h3><strong>Depth Map Processing</strong></h3> <ul> <li><code>TemporalDepthFilter</code> (EMA smoothing) reduces depth flicker</li> <li>Percentile-based normalization for consistent depth range</li> <li>Midtone shaping (gamma) improves depth layering</li> <li>Optional curvature enhancement for roundness</li> </ul> <h3><strong>Stereo / Parallax Control</strong></h3> <ul> <li><code>ShiftSmoother</code> damps rapid disparity changes</li> <li>Edge-aware masking + feathering reduces tearing</li> <li>Dynamic IPD scaling adapts stereo strength</li> <li>Subject-tracked zero parallax with window easing</li> <li>Optional dynamic convergence bias</li> <li>IPD factor knob for global stereo strength</li> </ul> <h3><strong>Image Quality</strong></h3> <ul> <li>GPU depth-of-field blur with Gaussian blending</li> <li>Brightness-preserving sharpening with highlight protection</li> </ul> <h3><strong>Framing, Aspect &amp; Output Formats</strong></h3> <ul> <li>Aspect-ratio safe resizing with <code>pad_to_aspect_ratio</code></li> <li>Two modes: <ul> <li>Preserve Original Aspect Ratio</li> <li>Target Output Aspect</li> </ul> </li> <li>Formats: <ul> <li>Full-SBS</li> <li>Half-SBS</li> <li>VR 1440×1600</li> <li>Dubois anaglyph</li> <li>Passive interlaced</li> </ul> </li> </ul> <h3><strong>Encoding &amp; I/O</strong></h3> <ul> <li>FFmpeg over stdin: <ul> <li>CRF for <code>libx*</code></li> <li>CQ for NVENC with <code>-b:v 0</code></li> </ul> </li> <li>CPU/GPU encoder mapping, OpenCV fallback</li> </ul> <h3><strong>UX / Telemetry</strong></h3> <ul> <li>Smooth, real-time progress/FPS/ETA — also while paused</li> <li>More descriptive logging</li> </ul> <hr> <h2>Download</h2> <p><a href="https://github.com/VisionDepth/VisionDepth3D/releases">📥 Download VisionDepth3D v3.3</a></p> <hr> <p><strong>VisionDepth3D is free for personal and non-commercial use.</strong><br> Commercial use or redistribution without consent is prohibited.</p> <h2>What's 
Changed</h2> <ul> <li>V3.2.5 by <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/users/VisionDepth/hovercard" data-octo-click="hovercard-link-click" data-octo-dimensions="link_type:self" href="https://github.com/VisionDepth">@VisionDepth</a> in <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="3255234087" data-permission-text="Title is private" data-url="https://github.com/VisionDepth/VisionDepth3D/issues/48" data-hovercard-type="pull_request" data-hovercard-url="/VisionDepth/VisionDepth3D/pull/48/hovercard" href="https://github.com/VisionDepth/VisionDepth3D/pull/48">#48</a></li> <li>V3.2.5 by <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/users/VisionDepth/hovercard" data-octo-click="hovercard-link-click" data-octo-dimensions="link_type:self" href="https://github.com/VisionDepth">@VisionDepth</a> in <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="3256387329" data-permission-text="Title is private" data-url="https://github.com/VisionDepth/VisionDepth3D/issues/49" data-hovercard-type="pull_request" data-hovercard-url="/VisionDepth/VisionDepth3D/pull/49/hovercard" href="https://github.com/VisionDepth/VisionDepth3D/pull/49">#49</a></li> <li>V3.2.6 by <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/users/VisionDepth/hovercard" data-octo-click="hovercard-link-click" data-octo-dimensions="link_type:self" href="https://github.com/VisionDepth">@VisionDepth</a> in <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="3307253530" data-permission-text="Title is private" data-url="https://github.com/VisionDepth/VisionDepth3D/issues/52" data-hovercard-type="pull_request" data-hovercard-url="/VisionDepth/VisionDepth3D/pull/52/hovercard" href="https://github.com/VisionDepth/VisionDepth3D/pull/52">#52</a></li> <li>V3.3 by <a class="user-mention notranslate" 
data-hovercard-type="user" data-hovercard-url="/users/VisionDepth/hovercard" data-octo-click="hovercard-link-click" data-octo-dimensions="link_type:self" href="https://github.com/VisionDepth">@VisionDepth</a> in <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="3307259718" data-permission-text="Title is private" data-url="https://github.com/VisionDepth/VisionDepth3D/issues/53" data-hovercard-type="pull_request" data-hovercard-url="/VisionDepth/VisionDepth3D/pull/53/hovercard" href="https://github.com/VisionDepth/VisionDepth3D/pull/53">#53</a></li> </ul> <p><strong>Full Changelog</strong>: <a class="commit-link" href="https://github.com/VisionDepth/VisionDepth3D/compare/Release-v3.2.4...Release-v3.3"><tt>Release-v3.2.4...Release-v3.3</tt></a></p> VisionDepth tag:github.com,2008:Repository/918467518/Release-v3.2.4 2025-06-04T02:07:05Z VisionDepth3Dv3.2.4 - Release <h1>VisionDepth3D – Hybrid 2D-to-3D Converter</h1> <p>Convert any 2D video into immersive stereoscopic 3D using AI-powered depth estimation, real-time preview, and fully customizable stereo controls — all built for creators, VR tinkerers, and 3D enthusiasts.</p> <p>Whether you're producing content for VR headsets, YouTube 3D, or your own Blu-ray collection, <strong>VisionDepth3D</strong> delivers sharp, artifact-free 3D with GPU-accelerated tools and formats for every workflow.</p> <p><em>Download .exe and both .bin files and make sure they are in the same folder before installing</em></p> <hr> <h2>Powered by the VisionDepth3D Method</h2> <p>At the core of this app is the <strong>VisionDepth3D Method</strong> — a custom real-time rendering technique designed to produce smooth, eye-comfortable stereo from AI depth maps.</p> <h3>Core Features of the Method:</h3> <ul> <li>Dynamic parallax scaling</li> <li>Zero-parallax tracking</li> <li>Edge-aware masking</li> <li>Scene-aware stereo dampening</li> </ul> <p>Want to dive deeper into how it works?</p> <p>📄 <strong><a 
href="/VisionDepth/VisionDepth3D/blob/Release-v3.2.4/VisionDepth3D_Method.md">Read the Method → VisionDepth3D_Method.md</a></strong></p> <hr> <h2>Features</h2> <ul> <li><strong>AI Depth Estimation</strong> — Supports 20+ models (DPT, MiDaS, Depth Anything v2, etc.) with CUDA acceleration</li> <li><strong>Batch Processing</strong> — Smart VRAM-aware queueing for images &amp; video</li> <li><strong>3D Output Modes</strong> — Half-SBS, Full-SBS, Interlaced, Anaglyph</li> <li><strong>Frame Interpolation</strong> — Smooth motion via ONNX RIFE (2×–8×)</li> <li><strong>Super Resolution</strong> — Real-ESRGAN upscaling (e.g., 1080p → 4K)</li> <li><strong>Parallax Tuning</strong> — Independent controls for foreground, midground, and background</li> <li><strong>Smart Mask Effects</strong> — Built-in feathering and ghost suppression</li> <li><strong>Audio Tools</strong> — Attach AAC, MP3, or WAV using the FFmpeg GUI</li> <li><strong>Live Feedback</strong> — Real-time FPS, ETA, pause/resume/cancel support</li> <li><strong>Preview Modes</strong> — Heatmap, SBS, Anaglyph, Interlaced</li> <li><strong>Export-Ready</strong> — Output for YouTube 3D, Oculus Quest, and MP4/MKV/AVI with GPU encoding</li> </ul> <hr> <h2>Free to Use. 
Built for Creators.</h2> <p>If you find it helpful, consider donating. Every bit goes toward:</p> <ul> <li>New hardware for testing</li> <li>Supporting more depth models</li> <li>Continued updates and features</li> </ul> <p>Thanks for supporting open 3D tools!</p> <hr> <h1>VisionDepth3D v3.2.4 – Changelog</h1> <p><em>Note: Although the last official version was listed as 3.1.9, several intermediate patches were applied via GitHub and consolidated under version 3.2.4.</em></p> <hr> <h2>GUI Enhancements</h2> <h3>Inference Steps Control</h3> <ul> <li>Introduced <code>inference_steps_entry</code> field to support user-defined inference steps for diffusion models.</li> <li>Includes input validation and fallback handling.</li> <li>Dynamically updates on <code>&lt;Return&gt;</code> and <code>&lt;FocusOut&gt;</code> events.</li> </ul> <h3>Resolution Dropdown Improvements</h3> <ul> <li>Expanded resolution options to include model-native sizes: <ul> <li><code>512x256</code>, <code>704x384</code>, <code>960x540</code>, <code>1024x576</code>, and others for improved performance and visual quality.</li> </ul> </li> <li>Automatically strips display hints like <code>" (DC-Fastest)"</code> for cleaner parsing of dimensions.</li> </ul> <h3>CPU Offload Mode Selection</h3> <ul> <li>Added support for multiple modes: <ul> <li><code>"model"</code>, <code>"vae"</code>, <code>"unet"</code>, <code>"sequential"</code>, <code>"none"</code></li> </ul> </li> <li>The selected value is passed directly to the pipeline logic via <code>offload_mode_dropdown.get()</code>.</li> </ul> <h3>Sidebar Layout</h3> <ul> <li>Sidebar width increased from <code>22</code> to <code>30</code> for improved component spacing and usability.</li> </ul> <hr> <h2>DepthCrafter Integration (Work-in-Progress)</h2> <h3>Pipeline Integration</h3> <ul> <li><code>load_depthcrafter_pipeline()</code> now supports the following arguments: <ul> <li><code>inference_steps</code></li> <li><code>offload_mode</code></li> </ul> </li> 
<li>Additional parameters are currently hardcoded and will be configurable in future updates.</li> <li>Device mapping is handled dynamically based on <code>offload_mode</code>: <ul> <li><code>"sequential"</code> runs all operations on the GPU.</li> <li>Other modes selectively offload components to CPU to manage VRAM usage.</li> </ul> </li> </ul> <hr> <h2>Stability Fixes and Improvements</h2> <ul> <li>Warm-up logic now includes spinner feedback to prevent GUI freeze during model loading.</li> <li>All models, including local ones, now run reliably. However, local models still require manual configuration of inference resolution and batch size due to unresolved dynamic resolution handling.</li> <li>The <code>invert_var</code> toggle now functions correctly.</li> <li>Subject depth smoothing introduced in the 3D pipeline to reduce temporal jitter in estimated depth maps.</li> <li>Focal depth consistency added for stereo rendering: subject depth is now shared across both eye views.</li> </ul> <hr> <h2>License Notice</h2> <p><strong>VisionDepth3D is free for personal and non-commercial use only.</strong><br> Commercial use, modification, or redistribution is not permitted without prior written consent.</p> <p>Full license terms are available in the <a href="https://github.com/VisionDepth/VisionDepth3D">GitHub repository</a>.</p> VisionDepth