arXiv:2401.02052 [cs.CV]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords long short-term memory, video captioning, encoder-decoder, input temporal sequence, demonstrate model generality Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset