PyTorch: Speed up DEKR predictor by arashsm79 · Pull Request #3121 · DeepLabCut/DeepLabCut

arashsm79 · 2025-10-10T12:06:00Z

Summary

This PR improves the performance of DEKR predictor.

Use advanced indexing instead of for loops.
Remove optimization TODOs.
Optimize both with and without the use of heatmap

Details

Profiling the DEKR predictor's forward call, shows a significant increase in speed. But the overall inference time of the architecture remains mostly unchanged (very minor improvements) since it is largely dominated by HRNet and torch native modules. (thanks to @maximpavliv for running the benchmark)

Below shows (the rectangle in blue) that most of the inference time is spent in the HRNet and PyTorch native inference procedures.
I have optimized the rest of it (the rectangle in red) as much as I could.

Improve the performance of DEKR predictor. Use advanced indexing instead of for loops and remove TODO.

maximpavliv

The refactoring looks great — the vectorized version is much cleaner.
I’ve run the full integration testing suite on my side, and everything passes without issues. Nice work on this improvement!

deruyter92

I ran the code step-by step and confirmed that evaluation results are close to previous implementation. The vectorization looks great, also happy that you added the tensor shapes in the comments. Looks all good to me!

This commit updates the DEKRPredictor to follow the DeepLabCut implementation in version 3.0.0rc13. see DeepLabCut/DeepLabCut#3121

* DEKRPredictor: add non-maximum suppression (NMS) This commit Updates the DEKR predictor to follow the DeepLabCut implementation in version 3.0.0rc7, see DeepLabCut/DeepLabCut#2907 * DEKRPredictor: speed up with vectorized operations This commit updates the DEKRPredictor to follow the DeepLabCut implementation in version 3.0.0rc13. see DeepLabCut/DeepLabCut#3121 * PartAffinityFieldPredictor (PAF): Speed up cost computation This commit updates the PAF predictor to follow the DeepLabCut implementation in version 3.0.0.rc13. See DeepLabCut/DeepLabCut#3117 * HeatmapPredictor (single animal): speed up with vecorized operations This commit updates the `HeatmapPredictor` in single_predictor.py to follow the implementation in DeepLabCut 3.0.0rc13. See DeepLabCut/DeepLabCut#3110

Optimize DEKR predictor

6ab5cca

Improve the performance of DEKR predictor. Use advanced indexing instead of for loops and remove TODO.

arashsm79 changed the title ~~PyTorch: Speedup DEKR predictor~~ PyTorch: Speed up DEKR predictor Oct 10, 2025

arashsm79 added 2 commits October 14, 2025 13:42

Use advance indexing for heatmap in DEKR predictor

c0fe331

black formatting

9b8a839

arashsm79 marked this pull request as ready for review October 14, 2025 12:05

maximpavliv self-requested a review October 14, 2025 12:09

maximpavliv approved these changes Oct 14, 2025

View reviewed changes

arashsm79 and others added 3 commits October 18, 2025 18:37

ci: trigger checks

377d4cd

Merge branch 'main' into arash/speedup_dekr

c916277

Merge branch 'main' into arash/speedup_dekr

1fa60c2

MMathisLab merged commit 54881d1 into DeepLabCut:main Nov 4, 2025
7 of 10 checks passed

deruyter92 reviewed Nov 4, 2025

View reviewed changes

deruyter92 added a commit to deruyter92/DeepLabCut-live that referenced this pull request Jan 21, 2026

DEKRPredictor: speed up with vectorized operations

3014710

This commit updates the DEKRPredictor to follow the DeepLabCut implementation in version 3.0.0rc13. see DeepLabCut/DeepLabCut#3121

deruyter92 mentioned this pull request Jan 21, 2026

update pytorch models following DeepLabCut 3.0.0rc13 DeepLabCut/DeepLabCut-live#151

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

PyTorch: Speed up DEKR predictor#3121

PyTorch: Speed up DEKR predictor#3121
MMathisLab merged 6 commits intoDeepLabCut:mainfrom
arashsm79:arash/speedup_dekr

arashsm79 commented Oct 10, 2025 •

edited

Loading

Uh oh!

maximpavliv left a comment

Uh oh!

Uh oh!

deruyter92 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

arashsm79 commented Oct 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Details

Uh oh!

maximpavliv left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

deruyter92 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

arashsm79 commented Oct 10, 2025 •

edited

Loading