arXiv:1706.06782 [cs.CV]

Object Detection Using Deep CNNs Trained on Synthetic Images

Param S. Rajpura, Ravi S. Hegde, Hristo Bojinov

Published 2017-06-21, Version 1

The need for large annotated image datasets for training Convolutional Neural Networks (CNNs) has been a significant impediment for their adoption in computer vision applications. We show that with transfer learning an effective object detector can be trained almost entirely on synthetically rendered datasets. We apply this strategy for detecting packaged food products clustered in refrigerator scenes. Our CNN trained only with 4000 synthetic images achieves mean average precision (mAP) of 24 on a test set with 55 distinct products as objects of interest and 17 distractor objects. A further increase of 12% in the mAP is obtained by adding only 400 real images to these 4000 synthetic images in the training set. A high degree of photorealism in the synthetic images was not essential in achieving this performance. We analyze factors like training data set size and 3D model dictionary size for their influence on detection performance. Additionally, training strategies like fine-tuning with selected layers and early stopping which affect transfer learning from synthetic scenes to real scenes are explored. Training CNNs with synthetic datasets is a novel application of high-performance computing and a promising approach for object detection applications in domains where there is a dearth of large annotated image data.

Categories: cs.CV

Keywords: object detection, deep CNNs, synthetic images, mean average precision, large annotated image datasets

Related articles:

arXiv:1711.01043 [cs.CV] (Published 2017-11-03)

A Taught-Observe-Ask (TOA) Method for Object Detection with Critical Supervision

Chi-Hao Wu, Qin Huang, Siyang Li, C.-C. Jay Kuo

arXiv:1805.08798 [cs.CV] (Published 2018-05-22)

A scene perception system for visually impaired based on object detection and classification using multi-modal DCNN

Baljit Kaur, Jhilik Bhattacharya

arXiv:1804.06215 [cs.CV] (Published 2018-04-17)

DetNet: A Backbone network for Object Detection

Zeming Li, Chao Peng, Gang Yu, Xiangyu Zhang, Yangdong Deng, Jian Sun