arXiv Analytics

arXiv:2205.04502 [cs.CV]

Multiview Stereo with Cascaded Epipolar RAFT

Zeyu Ma, Zachary Teed, Jia Deng

Published 2022-05-09, Version 1

We address multiview stereo (MVS), an important 3D vision task that reconstructs a 3D model, such as a dense point cloud, from multiple calibrated images. We propose CER-MVS (Cascaded Epipolar RAFT Multiview Stereo), a new approach based on the RAFT (Recurrent All-Pairs Field Transforms) architecture developed for optical flow. CER-MVS introduces five changes to RAFT: epipolar cost volumes, cost volume cascading, multiview fusion of cost volumes, dynamic supervision, and multiresolution fusion of depth maps. CER-MVS differs significantly from prior work in multiview stereo: whereas prior work operates by updating a 3D cost volume, CER-MVS operates by updating a disparity field. Furthermore, we propose an adaptive thresholding method to balance the completeness and accuracy of the reconstructed point clouds. Experiments show that our approach achieves competitive performance on DTU (the second best among known results) and state-of-the-art performance on the Tanks-and-Temples benchmark (both the intermediate and advanced sets). Code is available at https://github.com/princeton-vl/CER-MVS
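
The abstract names epipolar cost volumes without describing how they are built. As a rough illustration of the underlying geometry only, and not the authors' implementation, the sketch below projects one reference pixel into a source view at several inverse-depth hypotheses. The resulting locations lie on the pixel's epipolar line, which is where a matching cost would be evaluated. The function names (matvec, epipolar_samples), the pinhole camera model, and the pose convention are all assumptions made for this example.

```python
# Illustrative sketch only: hypothetical helper names, pinhole camera assumed.

def matvec(M, v):
    """Multiply a 3x3 matrix (list of rows) by a length-3 vector."""
    return [sum(M[i][j] * v[j] for j in range(3)) for i in range(3)]


def epipolar_samples(pixel, K_ref_inv, K_src, R, t, inv_depths):
    """Project one reference pixel into a source view at several inverse depths.

    pixel      : (u, v) location in the reference image
    K_ref_inv  : inverse intrinsics of the reference camera (3x3)
    K_src      : intrinsics of the source camera (3x3)
    R, t       : rotation (3x3) and translation (length 3) mapping
                 reference-camera coordinates to source-camera coordinates
    inv_depths : iterable of positive inverse-depth hypotheses
    Returns a list of (u, v) source-image locations, one per hypothesis.
    """
    # Back-project the pixel to a viewing ray in the reference camera frame.
    ray = matvec(K_ref_inv, [pixel[0], pixel[1], 1.0])
    samples = []
    for d_inv in inv_depths:
        depth = 1.0 / d_inv
        point_ref = [depth * c for c in ray]                 # 3D point, reference frame
        rotated = matvec(R, point_ref)
        point_src = [rotated[i] + t[i] for i in range(3)]    # 3D point, source frame
        proj = matvec(K_src, point_src)                      # homogeneous image point
        samples.append((proj[0] / proj[2], proj[1] / proj[2]))
    return samples
```

For two cameras that share intrinsics and differ by a pure sideways translation, the returned locations all fall on one image row, shifted in proportion to the inverse depth. A cost volume in this spirit would compare the reference feature at the pixel with source features sampled at these locations.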

Related articles:
arXiv:2003.12039 [cs.CV] (Published 2020-03-26)
RAFT: Recurrent All-Pairs Field Transforms for Optical Flow
arXiv:1709.02371 [cs.CV] (Published 2017-09-07)
PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume
arXiv:2204.07636 [cs.CV] (Published 2022-04-15)
Lagrangian Motion Magnification with Double Sparse Optical Flow Decomposition