arXiv:2304.06708 [cs.CV]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords improving verb understanding, method achieves state-of-the-art results, verb phrase alignment loss, real-world video applications, leveraging pretrained large language models Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset