arXiv Analytics

Sign in

arXiv:2310.09739 [cs.CV]AbstractReferencesReviewsResources

AugUndo: Scaling Up Augmentations for Unsupervised Depth Completion

Yangchao Wu, Tian Yu Liu, Hyoungseob Park, Stefano Soatto, Dong Lao, Alex Wong

Published 2023-10-15Version 1

Unsupervised depth completion methods are trained by minimizing sparse depth and image reconstruction error. Block artifacts from resampling, intensity saturation, and occlusions are amongst the many undesirable by-products of common data augmentation schemes that affect image reconstruction quality, and thus the training signal. Hence, typical augmentations on images that are viewed as essential to training pipelines in other vision tasks have seen limited use beyond small image intensity changes and flipping. The sparse depth modality have seen even less as intensity transformations alter the scale of the 3D scene, and geometric transformations may decimate the sparse points during resampling. We propose a method that unlocks a wide range of previously-infeasible geometric augmentations for unsupervised depth completion. This is achieved by reversing, or "undo"-ing, geometric transformations to the coordinates of the output depth, warping the depth map back to the original reference frame. This enables computing the reconstruction losses using the original images and sparse depth maps, eliminating the pitfalls of naive loss computation on the augmented inputs. This simple yet effective strategy allows us to scale up augmentations to boost performance. We demonstrate our method on indoor (VOID) and outdoor (KITTI) datasets where we improve upon three existing methods by an average of 10.4\% across both datasets.

Related articles: Most relevant | Search more
arXiv:1812.02486 [cs.CV] (Published 2018-12-06)
Learning to Infer the Depth Map of a Hand from its Color Image
arXiv:1903.00231 [cs.CV] (Published 2019-03-01)
Single Image Deblurring and Camera Motion Estimation with Depth Map
arXiv:1812.09874 [cs.CV] (Published 2018-12-24)
Perceptually-based single-image depth super-resolution