Video Distortion Removal for VapourSynth

Also known as atmospheric turbulance mitigation, warp stabilization, film shrink or VHS distortion fix, dewobble, dewiggle, detilt, rectification, heat haze removal. Can help with distortions from low bitrate compression or old codecs like MPEG2.

This is a partial implementation of the Turbulence Mitigation Transformer. (only distortion removal, no deblurring)

This does not do general video stabilization for shaky footage, only removes distortions within the frames. It is recommented to stabilize first if needed.

Check out stinkybread's comparisons here and here. (chrome recommended for smooth playback)

Requirements

pip install numpy (optional, only for pytorch backend)
pytorch with cuda (optional, only for pytorch backend)
vs-mlrt with tensorrt (optional, only for tensorrt backend)

Setup

Put the entire vs_undistort folder into your vapoursynth scripts folder.
Or install via pip: pip install -U git+https://github.com/pifroggi/vs_undistort.git

PyTorch Backend

The PyTorch backend offers some extra control knobs and supports any CPU and Nvidia GPU, but is slower and requires more VRAM.

import vs_undistort
clip = vs_undistort.pytorch(clip, temp_window=10, tile_width=None, tile_height=None, overlap=None, scales=[True, True, True], interpolation="bilinear", device="cuda")

clip
Distorted clip. Must be in RGB format.

temp_window
Temporal window length. How many frames are grouped together and processed as a single chunk. Larger means higher VRAM requirements, but better temporal averaging and slower distortions can be removed. If this is too small, some distortions may not get removed, small jumps/hitches may be visible between windows and seams from tile size may become more obvious.

tile_width, tile_height (optional)
Size of tiles to split the frames into. Must be a multiple of 16. By default the full frame dimensions are used.
Larger means higher VRAM requirements, but better spatial averaging and larger distortions can be removed.

overlap (optional)
Overlap from one tile to the next. Use if seams between tiles are visible.

interpolation (optional)
Interpolation mode used for warping the frames.
Mode "bilinear" is a bit faster, but slightly blurry.
Mode "bicubic" is a bit slower and may oversharpen slightly, but no blur.

scales (optional)
Sets which distortion scales get fixed via scales=[True, True, True], which stands for [coarse, middle, fine]. Set one or more to False to disable them. This is an experimental feature and may get removed if it turns out to be useless.

device (optional)
Can be "cuda" to use with an Nvidia GPU, or "cpu". This will be extremely slow on CPU.

TensorRT Backend

The TensorRT backend is much faster and requires less VRAM, but only supports bilinear interpolation and requires an Nvidia RTX GPU. On the first run, it will automatically build an engine, which takes a few minutes. Changing tile size or temporal window length will trigger rebuilding, but previously build engines are saved for later.

import vs_undistort
clip = vs_undistort.tensorrt(clip, temp_window=10, tile_width=None, tile_height=None, overlap=None, num_streams=1)

clip
Distorted clip. Must be in RGBH format.

temp_window
Temporal window length. How many frames are grouped together and processed as a single chunk. Larger means higher VRAM requirements, but better temporal averaging and slower distortions can be removed. If this is too small, some distortions may not get removed, small jumps/hitches may be visible between windows and seams from tile size may become more obvious.

tile_width, tile_height (optional)
Size of tiles to split the frames into. Must be a multiple of 16. By default the full frame dimensions are used.
Larger means higher VRAM requirements, but better spatial averaging and larger distortions can be removed.

overlap (optional)
Overlap from one tile to the next. Use if seams between tiles are visible.

num_streams (optional)
Number of parallel TensorRT streams. For high end GPUs higher can be faster, but requires more VRAM.

Tip

If you see jumps/hitches between temporal windows, you can crossfade them with vs_tiletools.
If you have an undistorted reference clip, you can also try to align to it with vs_align.

Benchmarks

Hardware	Resolution	PyTorch FPS	TensorRT FPS
RTX 4090	720x480	~6.5 fps	~32 fps
RTX 4090	1440x1080	~1.5 fps	~7 fps
RTX 4090	2880x2160	~0.5 fps	~2 fps

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
vs_undistort		vs_undistort
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Video Distortion Removal for VapourSynth

Requirements

Setup

PyTorch Backend

TensorRT Backend

Benchmarks

About

Uh oh!

Releases 2

Uh oh!

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

Video Distortion Removal for VapourSynth

Requirements

Setup

PyTorch Backend

TensorRT Backend

Benchmarks

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 2

Uh oh!

Contributors 1

Languages