Stan Birchfield

Stan Birchfield

Principal Research Scientist and Senior Research Manager at NVIDIA

Exploring the intersection of computer vision and robotics

I lead a small group of amazing researchers in computer vision and robotics at NVIDIA, and I teach a class at the University of Washington. Previously, I was at Microsoft and Clemson University. I received my Ph.D. from Stanford University.

Selected Publications   (All Publications )

SpaceTools thumbnail

SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL

S. Chen, M. A. Uy, C. H. Song, F. Ladhak, A. Murali, Q. Qu, S. Birchfield, V. Blukis, J. Tremblay

CVPR '26

CARI4D thumbnail

CARI4D: Category Agnostic 4D Reconstruction of Human-Object Interaction

X. Xie, B. Wen, Y. Chang, H. Rabeti, J. Li, Y. Yuan, G. Pons-Moll, S. Birchfield

CVPR '26

Fast-FoundationStereo thumbnail

Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching

B. Wen, S. R. Dewan, S. Birchfield

CVPR '26

BOP-ASK thumbnail

BOP-ASK: Object-Interaction Reasoning for Vision-Language Models

V. Bhat, S. Kim, V. Blukis, G. Heinrich, P. Krishnamurthy, R. Karri, S. Birchfield, F. Khorrami, J. Tremblay

CVPR '26

RaySt3R thumbnail

RaySt3R: Predicting Novel Depth Maps for Zero-Shot Object Completion

B. P. Duisterhof, J. Oberst, B. Wen, S. Birchfield, D. Ramanan, J. Ichnowski

NeurIPS '25

FoundationStereo thumbnail

FoundationStereo: Zero-Shot Stereo Matching

B. Wen, M. Trepte, J. Aribido, J. Kautz, O. Gallo, S. Birchfield

CVPR '25

Award: CVPR Best Paper Award Candidate

News: #1 on Middlebury Stereo Evaluation - Version 3 (2025/02/03 – 2025/03/04)

News: #1 on ETH3D Low-Res Two-View Benchmark (2024/11/15 – 2025/05/14)

RoboSpatial thumbnail

RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics

C. H. Song, V. Blukis, J. Tremblay, S. Tyree, Y. Su, S. Birchfield

CVPR '25

SPOT thumbnail

SPOT: SE(3) Pose Trajectory Diffusion for Object-Centric Manipulation

C.-C. Hsu, B. Wen, J. Xu, Y. Narang, X. Wang, Y. Zhu, J. Biswas, S. Birchfield

ICRA '25

FoundationPose thumbnail

FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

B. Wen, W. Yang, J. Kautz, S. Birchfield

CVPR '24

Award: BOP Challenge 2024 "Early Bird" Award

News: #1 on BOP leaderboard for unseen 6D pose estimation (2023/11/19 – 2024/09/03)

Press: NVIDIA AI post, NVIDIA blog on TAO Foundation Models

Neural Implicit thumbnail

Neural Implicit Representation for Building Digital Twins of Unknown Articulated Objects

Y. Weng, B. Wen, J. Tremblay, V. Blukis, D. Fox, L. Guibas, S. Birchfield

CVPR '24

NeRFDeformer thumbnail

NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows

Z. Tang, Z. Ren, X. Zhao, B. Wen, J. Tremblay, S. Birchfield, A. Schwing

CVPR '24

View more publications

Book

IPAA thumbnail

I am the author of Image Processing and Analysis, an introductory textbook about the mathematical and algorithmic foundations of image processing and computer vision.

Teaching