arXiv Analytics

arXiv:1709.05675 [cs.CV]

Organizing Multimedia Data in Video Surveillance Systems Based on Face Verification with Convolutional Neural Networks

Anastasiia D. Sokolova, Angelina S. Kharchevnikova, Andrey V. Savchenko

Published 2017-09-17, Version 1

In this paper we propose a two-stage approach to organizing information in video surveillance systems. First, faces are detected in each frame, and the video stream is split into sequences of frames containing the face region of a single person. Second, the sequences (tracks) that contain identical faces are grouped using face verification algorithms and hierarchical agglomerative clustering. Gender and age are estimated for each cluster (person) to make the organized video collection easier to use. Particular attention is paid to aggregating the features extracted from each frame with deep convolutional neural networks. Experimental results on the YTF and IJB-A datasets show that the most accurate and fastest solution is obtained by matching the normalized average of the feature vectors of all frames in a track.
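
The track aggregation and grouping described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not state the norm, the distance, the linkage rule, or the stopping criterion, so L2 normalization, Euclidean distance, average linkage, and a distance threshold are all assumptions here. The function names (track_descriptor, cluster_tracks) are hypothetical, and the per-frame feature vectors are taken as given rather than extracted by a CNN.

```python
import math
from itertools import combinations


def l2_normalize(vec):
    """Scale a vector to unit Euclidean length (assumed norm; zero vectors pass through)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def track_descriptor(frame_features):
    """Aggregate the per-frame feature vectors of one track into a normalized average."""
    if not frame_features:
        raise ValueError("a track needs at least one frame")
    n = len(frame_features)
    mean = [sum(column) / n for column in zip(*frame_features)]
    return l2_normalize(mean)


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cluster_tracks(descriptors, threshold):
    """Average-linkage agglomerative clustering, stopped at a distance threshold.

    Returns a list of clusters, each a sorted list of track indices.
    The linkage rule and the threshold are illustrative assumptions.
    """
    clusters = [[i] for i in range(len(descriptors))]

    def linkage(c1, c2):
        dists = [euclidean(descriptors[i], descriptors[j]) for i in c1 for j in c2]
        return sum(dists) / len(dists)

    while len(clusters) > 1:
        # Find the closest pair of clusters; a < b always holds here.
        (a, b), best = min(
            (((i, j), linkage(clusters[i], clusters[j]))
             for i, j in combinations(range(len(clusters)), 2)),
            key=lambda item: item[1],
        )
        if best > threshold:
            break  # remaining clusters are treated as different persons
        clusters[a] = sorted(clusters[a] + clusters[b])
        del clusters[b]
    return clusters
```

In this sketch each track is reduced to a single vector before any comparison, so matching two tracks costs one distance computation regardless of how many frames they contain. Gender and age estimation per cluster is outside the scope of the sketch.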

Comments: 8 pages; 1 figure, accepted for publication at AIST17
Categories: cs.CV
Subjects: 68T10, 68T45, I.4.8, I.5.4
Related articles:
arXiv:1411.3159 [cs.CV] (Published 2014-11-12)
Part Detector Discovery in Deep Convolutional Neural Networks
arXiv:1604.02245 [cs.CV] (Published 2016-04-08)
Infrared Colorization Using Deep Convolutional Neural Networks
arXiv:1604.06832 [cs.CV] (Published 2016-04-22)
Refining Architectures of Deep Convolutional Neural Networks