arXiv:1510.05970 Abstract | arXiv Analytics

arXiv:1510.05970 [cs.CV]Abstract References Reviews Resources

Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches

Published 2015-10-20Version 1

We present a method for extracting depth information from a rectified image pair. Our approach focuses on the first stage of many stereo algorithms: the matching cost computation. We approach the problem by learning a similarity measure on small image patches using a convolutional neural network. Training is carried out in a supervised manner by constructing a binary classification data set with examples of similar and dissimilar pairs of patches. We examine two network architectures for this task: one tuned for speed, the other for accuracy. The output of the convolutional neural network is used to initialize the stereo matching cost. A series of post-processing steps follow: cross-based cost aggregation, semiglobal matching, a left-right consistency check, subpixel enhancement, a median filter, and a bilateral filter. We evaluate our method on the KITTI 2012, KITTI 2015, and Middlebury stereo data sets and show that it outperforms other approaches on all three data sets.

Comments: Submitted to the Journal of Machine Learning Research

Categories: cs.CV, cs.LG, cs.NE

Keywords: convolutional neural network, image patches, stereo matching, middlebury stereo data sets, binary classification data set

Related articles: Most relevant | Search more

arXiv:1605.02289 [cs.CV] (Published 2016-05-08)

Detecting Ground Control Points via Convolutional Neural Network for Stereo Matching

Zhun Zhong, Songzhi Su, Donglin Cao, Shaozi Li

arXiv:1409.4326 [cs.CV] (Published 2014-09-15)

Computing the Stereo Matching Cost with a Convolutional Neural Network

Jure Žbontar, Yann LeCun

arXiv:1504.02351 [cs.CV] (Published 2015-04-09)

When Face Recognition Meets with Deep Learning: an Evaluation of Convolutional Neural Networks for Face Recognition