arXiv Analytics

Sign in

arXiv:1803.06316 [cs.CV]AbstractReferencesReviewsResources

Activity Detection with Latent Sub-event Hierarchy Learning

AJ Piergiovanni, Michael S. Ryoo

Published 2018-03-16Version 1

In this paper, we introduce a new convolutional layer named the Temporal Gaussian Mixture (TGM) layer and present how it can be used to efficiently capture temporal structure in continuous activity videos. Our layer is designed to allow the model to learn a latent hierarchy of sub-event intervals. Our approach is fully differentiable while relying on a significantly less number of parameters, enabling its end-to-end training with standard backpropagation. We present our convolutional video models with multiple TGM layers for activity detection. Our experiments on multiple datasets including Charades and MultiTHUMOS confirm the benefit of our TGM layers, illustrating that it outperforms other models and temporal convolutions.

Related articles: Most relevant | Search more
arXiv:1906.08547 [cs.CV] (Published 2019-06-20)
vireoJD-MM at Activity Detection in Extended Videos
arXiv:1604.00427 [cs.CV] (Published 2016-04-01)
Leaving Some Stones Unturned: Dynamic Feature Prioritization for Activity Detection in Streaming Video
arXiv:1607.01979 [cs.CV] (Published 2016-07-07)
Untrimmed Video Classification for Activity Detection: submission to ActivityNet Challenge