arXiv:2401.09985 [cs.CV]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords video generation, general world models, predicting masked tokens, diverse general world environments, general world dynamic environments Tags github project Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset