arXiv:1707.06923 Abstract | arXiv Analytics

arXiv:1707.06923 [cs.CV]Abstract References Reviews Resources

Pillar Networks for action recognition

Published 2017-07-21Version 1

Image understanding using deep convolutional network has reached human-level performance, yet a closely related problem of video understanding especially, action recognition has not reached the requisite level of maturity. We combine multi-kernels based support-vector-machines (SVM) with a multi-stream deep convolutional neural network to achieve close to state-of-the-art performance on a 51-class activity recognition problem (HMDB-51 dataset); this specific dataset has proved to be particularly challenging for deep neural networks due to the heterogeneity in camera viewpoints, video quality, etc. The resulting architecture is named pillar networks as each (very) deep neural network acts as a pillar for the hierarchical classifiers.

Categories: cs.CV, stat.ML

Keywords: action recognition, pillar networks, multi-stream deep convolutional neural network, deep neural network acts, deep convolutional network

Related articles: Most relevant | Search more

arXiv:1607.02556 [cs.CV] (Published 2016-07-09)

Action Recognition with Joint Attention on Multi-Level Deep Features

Jialin Wu, Gu Wang, Wukui Yang, Xiangyang Ji

arXiv:1801.01415 [cs.CV] (Published 2018-01-04)

What have we learned from deep representations for action recognition?

Christoph Feichtenhofer, Axel Pinz, Richard P. Wildes, Andrew Zisserman

arXiv:1906.06822 [cs.CV] (Published 2019-06-17)

Spatio-Temporal Fusion Networks for Action Recognition

Sangwoo Cho, Hassan Foroosh