arXiv Analytics

Sign in

arXiv:1801.08110 [cs.CV]AbstractReferencesReviewsResources

The challenge of simultaneous object detection and pose estimation: a comparative study

Daniel Oñoro-Rubio, Roberto J. López-Sastre, Carolina Redondo-Cabrera, Pedro Gil-Jiménez

Published 2018-01-24Version 1

Detecting objects and estimating their pose remains as one of the major challenges of the computer vision research community. There exists a compromise between localizing the objects and estimating their viewpoints. The detector ideally needs to be view-invariant, while the pose estimation process should be able to generalize towards the category-level. This work is an exploration of using deep learning models for solving both problems simultaneously. For doing so, we propose three novel deep learning architectures, which are able to perform a joint detection and pose estimation, where we gradually decouple the two tasks. We also investigate whether the pose estimation problem should be solved as a classification or regression problem, being this still an open question in the computer vision community. We detail a comparative analysis of all our solutions and the methods that currently define the state of the art for this problem. We use PASCAL3D+ and ObjectNet3D datasets to present the thorough experimental evaluation and main results. With the proposed models we achieve the state-of-the-art performance in both datasets.

Related articles: Most relevant | Search more
arXiv:1002.1148 [cs.CV] (Published 2010-02-05)
A Comparative Study of Removal Noise from Remote Sensing Image
arXiv:2302.04584 [cs.CV] (Published 2023-02-09)
Complex Network for Complex Problems: A comparative study of CNN and Complex-valued CNN
arXiv:2001.03831 [cs.CV] (Published 2020-01-12)
A Comparative Study for Non-rigid Image Registration and Rigid Image Registration