Landmark paper from NAVER Labs Europe reformulating 3D reconstruction as pointmap regression. Recovers 3D models, depth, correspondences, and camera parameters from uncalibrated image pairs using a Transformer. No prior camera calibration needed. SOTA on monocular depth, multi-view reconstruction, and pose estimation. 7K+ GitHub stars. Published at CVPR 2024.

Paper

arXiv: 2312.14132

Venue: CVPR 2024

Library

GitHub Repository

vision3dresearch

Related