arXiv:1607.01979 [cs.CV]

Untrimmed Video Classification for Activity Detection: submission to ActivityNet Challenge

Gurkirt Singh, Fabio Cuzzolin

Published 2016-07-07, Version 1

Current state-of-the-art human activity recognition is focused on the classification of temporally trimmed videos in which only one action occurs per frame. We propose a simple yet effective method for the temporal detection of activities in temporally untrimmed videos with the help of untrimmed classification. Firstly, our model predicts the top k labels for each untrimmed video by analysing global video-level features. Secondly, frame-level binary classification is combined with dynamic programming to generate temporally trimmed activity proposals. Finally, each proposal is assigned a label based on the global label, and is scored using both the score of the temporal activity proposal and the global score. Ultimately, we show that untrimmed video classification models can be used as a stepping stone for temporal detection.
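
The three steps in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and the abstract does not give these details, so everything specific here is an assumption made for illustration: the function names (`smooth_binary_labels`, `extract_proposals`, `detect`), the Viterbi-style cost with a fixed `switch_penalty`, the use of the mean frame score as the proposal score, and the product of proposal score and global score as the final score. Frame-level scores and the top k video-level labels are assumed to be given as inputs.

```python
from typing import List, Tuple


def smooth_binary_labels(frame_scores: List[float],
                         switch_penalty: float = 1.0) -> List[int]:
    """Pick a 0/1 label per frame with Viterbi-style dynamic programming.

    Assumption: frame_scores[t] in [0, 1] is the activity score of frame t.
    Label 0 (background) costs s, label 1 (activity) costs 1 - s, and every
    change of label between consecutive frames costs switch_penalty.
    """
    n = len(frame_scores)
    if n == 0:
        return []
    unary = [(s, 1.0 - s) for s in frame_scores]  # (cost of 0, cost of 1)
    cost = [list(unary[0])]
    back = []  # back[t - 1][cur] = best previous label for frame t
    for t in range(1, n):
        row, ptr = [], []
        for cur in (0, 1):
            candidates = [
                cost[-1][prev] + (switch_penalty if prev != cur else 0.0)
                for prev in (0, 1)
            ]
            best_prev = 0 if candidates[0] <= candidates[1] else 1
            row.append(candidates[best_prev] + unary[t][cur])
            ptr.append(best_prev)
        cost.append(row)
        back.append(ptr)
    # Backtrack from the cheaper final label.
    label = 0 if cost[-1][0] <= cost[-1][1] else 1
    labels = [label]
    for ptr in reversed(back):
        label = ptr[label]
        labels.append(label)
    labels.reverse()
    return labels


def extract_proposals(labels: List[int],
                      frame_scores: List[float]) -> List[Tuple[int, int, float]]:
    """Turn runs of label 1 into (start, end, score); end is exclusive.

    Assumption: the proposal score is the mean frame score over the run.
    """
    proposals = []
    start = None
    for t, lab in enumerate(labels + [0]):  # sentinel closes a trailing run
        if lab == 1 and start is None:
            start = t
        elif lab == 0 and start is not None:
            segment = frame_scores[start:t]
            proposals.append((start, t, sum(segment) / len(segment)))
            start = None
    return proposals


def detect(frame_scores: List[float],
           top_k_labels: List[Tuple[str, float]],
           switch_penalty: float = 1.0) -> List[Tuple[str, int, int, float]]:
    """Label each proposal with each global label and combine the scores.

    Assumption: the final score is proposal score times global score.
    """
    labels = smooth_binary_labels(frame_scores, switch_penalty)
    detections = []
    for start, end, proposal_score in extract_proposals(labels, frame_scores):
        for name, global_score in top_k_labels:
            detections.append((name, start, end, proposal_score * global_score))
    return detections
```

For example, calling `detect` with frame scores `[0.1, 0.1, 0.9, 0.9, 0.9, 0.9, 0.1, 0.1]`, a hypothetical global prediction `[("surfing", 0.8)]` and `switch_penalty=0.5` yields a single detection covering frames 2 to 5 (end index 6, exclusive). A larger `switch_penalty` favours fewer, longer proposals, while a smaller one lets the labels follow the raw frame scores more closely.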

Comments: 3 pages, Presented at ActivityNet Large Scale Activity Recognition Challenge workshop at CVPR 2016
Categories: cs.CV
Related articles:
arXiv:1906.08547 [cs.CV] (Published 2019-06-20)
vireoJD-MM at Activity Detection in Extended Videos
arXiv:2206.10861 [cs.CV] (Published 2022-06-22)
UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2022
arXiv:2006.11693 [cs.CV] (Published 2020-06-21)
Dense-Captioning Events in Videos: SYSU Submission to ActivityNet Challenge 2020