arXiv:1511.05298 Abstract | arXiv Analytics

arXiv:1511.05298 [cs.LG]Abstract References Reviews Resources

Structural-RNN: Deep Learning on Spatio-Temporal Graphs

Ashesh Jain, Amir R. Zamir, Silvio Savarese, Ashutosh Saxena

Published 2015-11-17Version 1

Deep Recurrent Neural Network architectures, though remarkably capable at modeling sequences, lack an intuitive high-level spatio-temporal structure. That is while many problems in computer vision inherently have an underlying high-level structure and can benefit from it. Spatio-temporal graphs are a popular flexible tool for imposing such high-level intuitions in the formulation of real world problems. In this paper, we propose an approach for combining the power of high-level spatio-temporal graphs and sequence learning success of Recurrent Neural Networks~(RNNs). We develop a scalable method for casting an arbitrary spatio-temporal graph as a rich RNN mixture that is feedforward, fully differentiable, and jointly trainable. The proposed method is generic and principled as it can be used for transforming any spatio-temporal graph through employing a certain set of well defined steps. The evaluations of the proposed approach on a diverse set of problems, ranging from modeling human motion to object interactions, shows improvement over the state-of-the-art with a large margin. We expect this method to empower a new convenient approach to problem formulation through high-level spatio-temporal graphs and Recurrent Neural Networks, and be of broad interest to the community.

Comments: Video https://cs.stanford.edu/people/ashesh/srnn

Categories: cs.LG, cs.NE

Keywords: deep learning, high-level spatio-temporal graphs, deep recurrent neural network architectures, structural-rnn, real world problems

Related articles: Most relevant | Search more

arXiv:1404.1559 [cs.LG] (Published 2014-04-06)

Sparse Coding: A Deep Learning using Unlabeled Data for High - Level Representation

R. Vidya, Dr. G. M. Nasira, R. P. Jaia Priyankka

arXiv:1506.00619 [cs.LG] (Published 2015-06-01)

Blocks and Fuel: Frameworks for deep learning

Bart van Merriënboer, Dzmitry Bahdanau, Vincent Dumoulin, Dmitriy Serdyuk, David Warde-Farley, Jan Chorowski, Yoshua Bengio

arXiv:1611.04231 [cs.LG] (Published 2016-11-14)

Identity Matters in Deep Learning