
arXiv:1907.11111 [cs.CV]

MultiDepth: Single-Image Depth Estimation via Multi-Task Regression and Classification

Published 2019-07-25, Version 1

We introduce MultiDepth, a novel training strategy and convolutional neural network (CNN) architecture that allows single-image depth estimation (SIDE) to be approached as a multi-task problem. SIDE is an important part of road scene understanding and thus plays a vital role in advanced driver assistance systems and autonomous vehicles. The best results for the SIDE task so far have been achieved using deep CNNs. However, optimizing regression problems such as depth estimation remains challenging. For the related tasks of image classification and semantic segmentation, numerous CNN-based methods with robust training behavior have been proposed. Hence, to overcome the notorious instability and slow convergence of depth value regression during training, MultiDepth uses depth interval classification as an auxiliary task. The auxiliary task can be disabled at test time, so that continuous depth values are predicted more efficiently using only the main regression branch. We applied MultiDepth to road scenes and present results on the KITTI depth prediction dataset. Our experiments show that end-to-end multi-task learning with both regression and classification considerably improves training and yields more accurate results.
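
The combination of a main regression objective with an auxiliary depth interval classification objective can be illustrated with a small, framework-free sketch. This is not the authors' implementation: the log-spaced intervals, the L1 regression term, the fixed `aux_weight`, the depth range of 1 to 80 m, and all function names are assumptions chosen only for illustration.

```python
import math


def depth_to_interval(depth, d_min=1.0, d_max=80.0, num_bins=8):
    """Map a metric depth to an interval index.

    Assumption: log-spaced intervals over [d_min, d_max]; the paper's
    actual discretization may differ.
    """
    depth = min(max(depth, d_min), d_max)  # clamp into the valid range
    frac = math.log(depth / d_min) / math.log(d_max / d_min)
    return min(int(frac * num_bins), num_bins - 1)


def cross_entropy(logits, target):
    """Numerically stable softmax cross-entropy for one pixel."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[target]


def multi_task_loss(pred_depths, class_logits, gt_depths,
                    aux_weight=0.5, use_aux=True):
    """Mean L1 regression loss plus an optional weighted classification loss.

    pred_depths:  continuous depths from the (hypothetical) regression branch
    class_logits: per-pixel interval scores from the (hypothetical) auxiliary branch
    use_aux:      set to False to ignore the auxiliary branch, as at test time
    """
    n = len(gt_depths)
    reg = sum(abs(p - g) for p, g in zip(pred_depths, gt_depths)) / n
    if not use_aux:
        return reg
    cls = sum(
        cross_entropy(logits, depth_to_interval(g))
        for logits, g in zip(class_logits, gt_depths)
    ) / n
    return reg + aux_weight * cls
```

Calling `multi_task_loss(..., use_aux=False)` returns only the regression term, which mirrors the idea that the auxiliary branch is needed during training but not for predicting continuous depth values.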

Comments: Accepted for presentation at the IEEE Intelligent Transportation Systems Conference (ITSC) 2019

Categories: cs.CV

Keywords: multi-task regression, MultiDepth, KITTI depth prediction dataset, road scene, auxiliary task

Tags: conference paper

Related articles:

arXiv:2002.02857 [cs.CV] (Published 2020-02-07)

An Auxiliary Task for Learning Nuclei Segmentation in 3D Microscopy Images

Peter Hirsch, Dagmar Kainmueller

arXiv:2202.12687 [cs.CV] (Published 2022-02-25)

Improving Amharic Handwritten Word Recognition Using Auxiliary Task

Mesay Samuel Gondere, Lars Schmidt-Thieme, Durga Prasad Sharma, Abiot Sinamo Boltena

arXiv:2301.10922 [cs.CV] (Published 2023-01-26)