arXiv Analytics

arXiv:1510.05970 [cs.CV]

Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches

Jure Žbontar, Yann LeCun

Published 2015-10-20, Version 1

We present a method for extracting depth information from a rectified image pair. Our approach focuses on the first stage of many stereo algorithms: the matching cost computation. We approach the problem by learning a similarity measure on small image patches using a convolutional neural network. Training is carried out in a supervised manner by constructing a binary classification data set with examples of similar and dissimilar pairs of patches. We examine two network architectures for this task: one tuned for speed, the other for accuracy. The output of the convolutional neural network is used to initialize the stereo matching cost. A series of post-processing steps follow: cross-based cost aggregation, semiglobal matching, a left-right consistency check, subpixel enhancement, a median filter, and a bilateral filter. We evaluate our method on the KITTI 2012, KITTI 2015, and Middlebury stereo data sets and show that it outperforms other approaches on all three data sets.
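The pipeline in the abstract can be illustrated with a minimal NumPy sketch, under several assumptions. The learned CNN similarity is replaced here by a plain mean absolute difference over small patches, which is a stand-in and not the authors' network. Only the matching cost initialization, a winner-take-all disparity choice, and a left-right consistency check are shown; cross-based cost aggregation, semiglobal matching, subpixel enhancement, and the filters are omitted. The function names, the `half` patch radius, and the `tol=1` tolerance are hypothetical choices made for this example.

```python
import numpy as np


def cost_volume(left, right, max_disp, half=1):
    """Matching cost C[d, y, x] between left pixel (y, x) and right pixel (y, x - d).

    Stand-in cost: mean absolute difference over a (2*half+1)^2 patch.
    The paper instead scores patch pairs with a trained CNN.
    """
    left = np.asarray(left, dtype=float)
    right = np.asarray(right, dtype=float)
    h, w = left.shape
    size = 2 * half + 1
    # Positions where x - d falls outside the right image keep an infinite cost.
    cost = np.full((max_disp + 1, h, w), np.inf)
    for d in range(max_disp + 1):
        # Per-pixel difference for this disparity, shape (h, w - d).
        diff = np.abs(left[:, d:] - right[:, :w - d])
        # Box filter: sum shifted copies of the edge-padded difference image.
        padded = np.pad(diff, half, mode="edge")
        agg = np.zeros_like(diff)
        for dy in range(size):
            for dx in range(size):
                agg += padded[dy:dy + h, dx:dx + w - d]
        cost[d, :, d:] = agg / size**2
    return cost


def disparity_maps(left, right, max_disp, half=1):
    """Winner-take-all disparity maps for the left and the right image."""
    cost = cost_volume(left, right, max_disp, half)
    w = cost.shape[2]
    # Re-index the same costs from the right image's point of view:
    # right pixel x at disparity d corresponds to left pixel x + d.
    cost_right = np.full_like(cost, np.inf)
    for d in range(max_disp + 1):
        cost_right[d, :, :w - d] = cost[d, :, d:]
    return cost.argmin(axis=0), cost_right.argmin(axis=0)


def left_right_check(disp_left, disp_right, tol=1):
    """Boolean mask of left pixels whose disparity agrees with the right map.

    The tolerance `tol` is an assumption of this sketch.
    """
    h, w = disp_left.shape
    ys, xs = np.indices((h, w))
    # Column in the right image that each left pixel maps to.
    xr = np.clip(xs - disp_left, 0, w - 1)
    return np.abs(disp_left - disp_right[ys, xr]) <= tol
```

Calling `disparity_maps(left, right, max_disp)` on two rectified grayscale arrays of equal shape returns integer disparity maps, and `left_right_check` marks the pixels where the two maps disagree by more than `tol`, which a fuller pipeline would treat as occlusions or mismatches and fill in afterwards.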

Comments: Submitted to the Journal of Machine Learning Research
Categories: cs.CV, cs.LG, cs.NE
Related articles:
arXiv:1605.02289 [cs.CV] (Published 2016-05-08)
Detecting Ground Control Points via Convolutional Neural Network for Stereo Matching
arXiv:1409.4326 [cs.CV] (Published 2014-09-15)
Computing the Stereo Matching Cost with a Convolutional Neural Network
arXiv:1504.02351 [cs.CV] (Published 2015-04-09)
When Face Recognition Meets with Deep Learning: an Evaluation of Convolutional Neural Networks for Face Recognition