arXiv:1712.06317 [cs.CV]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords video object detection, spatial-temporal memory networks, imagenet pre-trained backbone cnn weights, novel spatial-temporal memory module, model long-term temporal appearance Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset