
Shaokai/faster inference#3012

Merged
MMathisLab merged 6 commits intoDeepLabCut:mainfrom
yeshaokai:shaokai/faster_inference
Jun 23, 2025

Conversation

Collaborator

@yeshaokai yeshaokai commented Jun 20, 2025

  1. Added pre-fetching to the inference runner.
  2. Replaced torch.no_grad() with torch.inference_mode(), which is more efficient for pure inference.
  3. Added automatic mixed precision (AMP) inference.

Async mode defaults to True and num_prefetch_batches defaults to 4.
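The pre-fetching idea can be sketched in a few lines: a background producer thread keeps a bounded queue of ready batches while the consumer (the inference loop) drains it, so data loading overlaps with model execution. This is an illustrative sketch, not DeepLabCut's implementation; the `prefetch` name and batch source are assumptions, and `num_prefetch_batches` mirrors the new default of 4.

```python
# Minimal sketch of batch pre-fetching for an inference loop.
# Not the DeepLabCut runner; names here are illustrative.
import queue
import threading

def prefetch(batches, num_prefetch_batches=4):
    """Yield items from `batches` while a producer thread stays ahead."""
    q = queue.Queue(maxsize=num_prefetch_batches)
    done = object()  # sentinel marking the end of the stream

    def producer():
        for batch in batches:
            q.put(batch)  # blocks once the queue holds 4 batches
        q.put(done)

    threading.Thread(target=producer, daemon=True).start()
    while (item := q.get()) is not done:
        yield item

# Order is preserved; only the loading overlaps with consumption.
print(list(prefetch(range(6))))  # [0, 1, 2, 3, 4, 5]
```

The bounded queue is the key design choice: it caps memory at `num_prefetch_batches` batches while still hiding the loading latency behind compute.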

Empirically, batch size 16 for the detector and batch size 32 for the pose model work well. I therefore ran speed tests with the changes introduced in this PR, using superanimal_video_inference on an 800x600 video:

resnet50_fasterrcnn + hrnet32
12.7 FPS -> 18 FPS

mobilenet_fasterrcnn + resnet50
25.4 FPS -> 31 FPS

ssdlite + rtmpose
25.7 FPS -> 33 FPS

GPU memory usage is also reduced with AMP inference.
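How points 2 and 3 combine can be sketched as below: `torch.inference_mode()` disables autograd tracking entirely (cheaper than `torch.no_grad()`), and `torch.autocast` runs eligible ops in a lower-precision dtype, which is what cuts activation memory on GPU. The model and input here are placeholders, not the DeepLabCut runner.

```python
# Hedged sketch of inference_mode + automatic mixed precision.
# `model` and `inputs` are placeholder assumptions.
import torch

model = torch.nn.Linear(8, 4).eval()
inputs = torch.randn(2, 8)
device_type = "cuda" if torch.cuda.is_available() else "cpu"

# inference_mode() skips autograd bookkeeping; autocast dispatches
# eligible ops to float16/bfloat16, reducing memory and often time.
with torch.inference_mode(), torch.autocast(device_type=device_type):
    outputs = model(inputs)

print(outputs.shape)  # torch.Size([2, 4])
```

Note that both context managers apply per-call, so nothing about the model itself needs to change.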

@MMathisLab MMathisLab merged commit e72b559 into DeepLabCut:main Jun 23, 2025
4 checks passed
Comment on lines -427 to +585
outputs = self.model(inputs.to(self.device), **kwargs)




Contributor


This introduces a bug: `outputs` becomes undefined on the next line.

Member


Okay, let's write a test then @maximpavliv, as the tests should not pass otherwise.

Collaborator Author


I believe the variables are still alive after leaving the `with` scope. Is that what you have in mind?
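The scoping point can be checked with a tiny pure-Python example (using `contextlib.nullcontext` as a stand-in for `torch.inference_mode()`): Python scoping is function-level, so a name bound inside a `with` block stays defined after the block exits.

```python
# Demonstration that a variable assigned inside a `with` block is
# still in scope afterwards; nullcontext stands in for
# torch.inference_mode() here.
from contextlib import nullcontext

def run(x):
    with nullcontext():
        outputs = x * 2
    return outputs  # still defined after the `with` block exits

print(run(21))  # 42
```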

