CoWTracker: Tracking by Warping instead of Correlation

Zihang Lai^1,2, Eldar Insafutdinov¹, Edgar Sucar¹, Andrea Vedaldi^1,2

¹Visual Geometry Group (VGG), University of Oxford ²Meta AI

CoWTracker is a state-of-the-art dense point tracker that eschews traditional cost volumes in favor of an iterative warping mechanism. By warping target features to the query frame and refining tracks with a joint spatio-temporal transformer, CoWTracker achieves state-of-the-art performance on TAP-Vid (DAVIS, Kinetics), RoboTAP, and demonstrates strong zero-shot transfer to optical flow benchmarks like Sintel and KITTI.

🚀 Key Features

No Cost Volumes: Replaces memory-heavy correlation volumes with a lightweight warping operation, scaling linearly with spatial resolution.
High-Resolution Tracking: Processes features at high resolution (stride 2) to capture fine details and thin structures, unlike the stride 8 used by most prior methods.
Unified Architecture: A single model that excels at both long-range point tracking and optical flow estimation.
Robustness: Handles occlusions and rapid motion effectively using a video transformer with interleaved spatial and temporal attention.

⚡ Quick Start

1. Clone the Repository

Clone with submodules to get all required dependencies:

git clone --recurse-submodules https://github.com/facebookresearch/cowtracker.git
cd cowtracker

If you already cloned without submodules, initialize them with:

git submodule update --init --recursive

2. Install Dependencies

Create the conda environment:

conda env create -f environments.yml
conda activate cowtracker

3. Run Inference

Model weights are automatically downloaded from HuggingFace Hub on first run:

python demo.py --video videos/bmx-bumps.mp4 --output output.mp4

To use a local checkpoint instead:

python demo.py --video videos/bmx-bumps.mp4 --output output.mp4 --checkpoint ./cow_tracker_model.pth

⚙️ Command Line Options

python demo.py --help

Options:
  --video             Path to input video (required)
  --output            Path to output video (default: {input_name}_tracked.mp4)
  --checkpoint        Path to model checkpoint (optional, auto-downloads from HuggingFace if not provided)
  --rate              Subsampling rate for visualization (default: 8)
  --max_frames        Maximum number of frames to process (default: 200)
  --no_bkg            Hide video and show only tracks on black background

🐍 Python API

import torch
from cowtracker import CoWTracker

# Load model (auto-downloads from HuggingFace Hub)
model = CoWTracker.from_checkpoint(
    device="cuda",
    dtype=torch.float16,
)

# Or load from a local checkpoint
model = CoWTracker.from_checkpoint(
    "./cow_tracker_model.pth",
    device="cuda",
    dtype=torch.float16,
)

# Prepare video tensor [T, 3, H, W] in float32, range [0, 255]
video = ...

# Run inference
with torch.no_grad():
    predictions = model(video)

# Get outputs
tracks = predictions["track"]  # [B, T, H, W, 2] - tracked point coordinates
vis = predictions["vis"]       # [B, T, H, W] - visibility scores
conf = predictions["conf"]     # [B, T, H, W] - confidence scores

For long videos, use CoWTrackerWindowed:

from cowtracker import CoWTrackerWindowed

# Auto-downloads from HuggingFace Hub
model = CoWTrackerWindowed.from_checkpoint(
    device="cuda",
    dtype=torch.float16,
)

📖 Citation

@article{lai2026a,
  title   = {CoWTracker: Tracking by Warping instead of Correlation},
  author  = {Lai, Zihang and Insafutdinov, Eldar and Sucar, Edgar and Vedaldi, Andrea},
  journal = {arXiv preprint arXiv:2602.04877},
  year    = {2026},
}}

🙏 Acknowledgements

Thanks to these great repositories: VGGT, WAFT, CoTracker, AllTracker, DUSt3R, Depth Anything V2, and many other inspiring works in the community.

📄 License

See the LICENSE file for details about the license under which this code is made available.

This release is intended to support the open-source research community and fundamental research. Users are expected to leverage the artifacts for research purposes and make research findings arising from the artifacts publicly available for the benefit of the research community.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
cowtracker		cowtracker
docs		docs
videos		videos
.gitignore		.gitignore
.gitmodules		.gitmodules
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
demo.py		demo.py
environments.yml		environments.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CoWTracker: Tracking by Warping instead of Correlation

🚀 Key Features

⚡ Quick Start

1. Clone the Repository

2. Install Dependencies

3. Run Inference

⚙️ Command Line Options

🐍 Python API

📖 Citation

🙏 Acknowledgements

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CoWTracker: Tracking by Warping instead of Correlation

🚀 Key Features

⚡ Quick Start

1. Clone the Repository

2. Install Dependencies

3. Run Inference

⚙️ Command Line Options

🐍 Python API

📖 Citation

🙏 Acknowledgements

📄 License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages