Few-Shot Referring Relationships in Videos

Overview

Given a query visual relationship <subject, predicate, object> and a test video, this framework spatiotemporally localizes the subject and object using only a few support videos sharing the same predicate (which is unseen during training).

Directory Structure

few_shot_refrel/
├── configs/
│   └── default.yaml            # All hyperparameters
├── datasets/
│   ├── base_dataset.py         # Abstract dataset class
│   ├── vidvrd_dataset.py        # ImageNet-VidVRD dataset loader
│   └── vidor_dataset.py        # VidOR dataset loader
├── models/
│   ├── feature_extractor.py    # FasterRCNN + I3D feature extraction
│   ├── relationship_embedding.py # Query-conditioned relationship embedding
│   ├── aggregation.py          # GSA and LLA modules
│   ├── relation_network.py     # Metric-based meta-learner
│   └── random_field.py         # T-partite random field + belief propagation
├── utils/
│   ├── metrics.py              # Asub, Aobj, Ar, mIoU computations
│   ├── episode_sampler.py      # Episodic training sampler
│   └── visualization.py        # Trajectory visualization
├── scripts/
│   ├── extract_features.py     # Pre-extract FasterRCNN/I3D features
│   ├── train.py                # Training script
│   ├── test.py                 # Evaluation script
├── train.py                    # Main training entry point
├── test.py                     # Main evaluation entry point
└── requirements.txt

Setup

pip install -r requirements.txt

Data Preparation

Download ImageNet-VidVRD or VidOR, then:

python scripts/extract_features.py --dataset vidvrd --data_root /path/to/data

Training

python train.py --config configs/default.yaml --dataset vidvrd

Evaluation

python test.py --config configs/default.yaml --dataset vidvrd --checkpoint checkpoints/best.pth

Citation

@inproceedings{kumar2023fewshot,
  title={Few-Shot Referring Relationships in Videos},
  author={Kumar, Yogesh and Mishra, Anand},
  booktitle={CVPR},
  year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
configs		configs
data_preparation		data_preparation
datasets		datasets
eval		eval
inference		inference
model		model
scripts		scripts
utils		utils
vidvrd_helper		vidvrd_helper
README.md		README.md
video_to_frame.py		video_to_frame.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Few-Shot Referring Relationships in Videos

Overview

Directory Structure

Setup

Data Preparation

Training

Evaluation

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Few-Shot Referring Relationships in Videos

Overview

Directory Structure

Setup

Data Preparation

Training

Evaluation

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages