Principal Research Scientist and Senior Research Manager at NVIDIA
Exploring the intersection of computer vision and robotics
I lead a small group of amazing researchers in computer vision and robotics at NVIDIA, and I teach a class at the University of Washington. Previously, I was at Microsoft and Clemson University. I received my Ph.D. from Stanford University.
FoundationStereo: Zero-Shot Stereo Matching (2025)
Computer Vision and Pattern Recognition (CVPR), Nashville, Tennessee 2025
News: #1 on Middlebury Stereo Evaluation - Version 3 (2025/02/03 – )
News: #1 on ETH3D Low-Res Two-View Benchmark (2024/11/15 – )
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics (2025)
Computer Vision and Pattern Recognition (CVPR), Nashville, Tennessee 2025
SPOT: SE(3) Pose Trajectory Diffusion for Object-Centric Manipulation (2025)
International Conference on Robotics and Automation (ICRA), Atlanta, May 2025
FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects (2024)
Computer Vision and Pattern Recognition (CVPR), Seattle, June 2024
Press: NVIDIA AI post
News: #1 on BOP leaderboard for unseen 6D pose estimation (2023/11/19 – 2024/09/03)
Award: BOP Challenge 2024 "Early Bird" Award
Neural Implicit Representation for Building Digital Twins of Unknown Articulated Objects (2024)
Computer Vision and Pattern Recognition (CVPR), Seattle, June 2024
NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows (2024)
Computer Vision and Pattern Recognition (CVPR), Seattle, June 2024
I am the author of Image Processing and Analysis, an introductory textbook about the mathematical and algorithmic foundations of image processing and computer vision.