arXiv:2302.01329 Abstract | arXiv Analytics

arXiv:2302.01329 [cs.CV]

Dreamix: Video Diffusion Models are General Video Editors

Eyal Molad, Eliahu Horwitz, Dani Valevski, Alex Rav Acha, Yossi Matias, Yael Pritch, Yaniv Leviathan, Yedid Hoshen

Published 2023-02-02, Version 1

Text-driven image and video diffusion models have recently achieved unprecedented generation realism. While diffusion models have been successfully applied to image editing, very few works have done so for video editing. We present the first diffusion-based method that is able to perform text-based motion and appearance editing of general videos. Our approach uses a video diffusion model to combine, at inference time, the low-resolution spatio-temporal information from the original video with new, high-resolution information that it synthesizes to align with the guiding text prompt. Because obtaining high fidelity to the original video requires retaining some of its high-resolution information, we add a preliminary stage of finetuning the model on the original video, which significantly boosts fidelity. We propose to improve motion editability through a new mixed objective that jointly finetunes with full temporal attention and with temporal attention masking. We further introduce a new framework for image animation: we first transform the image into a coarse video by simple image processing operations such as replication and perspective geometric projections, and then use our general video editor to animate it. As a further application, our method can be used for subject-driven video generation. Extensive qualitative and numerical experiments showcase the remarkable editing ability of our method and establish its superior performance compared to baseline methods.
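The image animation step described in the abstract (turning a single image into a coarse video before editing) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `coarse_video_from_image`, its parameters, and the use of a gradual centre zoom with nearest-neighbour resampling as a simple stand-in for the perspective geometric projections are all assumptions made for illustration.

```python
import numpy as np


def coarse_video_from_image(image, num_frames=8, max_zoom=1.3):
    """Replicate `image` into a short clip with a gradual centre zoom.

    Hypothetical helper for illustration only. The zoom is a simplified
    stand-in for the perspective projections mentioned in the abstract.

    image: array of shape (H, W, C).
    Returns an array of shape (num_frames, H, W, C).
    """
    if num_frames < 1:
        raise ValueError("num_frames must be at least 1")
    if max_zoom < 1.0:
        raise ValueError("max_zoom must be at least 1.0")

    h, w = image.shape[:2]
    frames = []
    for zoom in np.linspace(1.0, max_zoom, num_frames):
        # Size of the centred crop that fills the frame at this zoom level.
        crop_h = max(1, int(round(h / zoom)))
        crop_w = max(1, int(round(w / zoom)))
        top = (h - crop_h) // 2
        left = (w - crop_w) // 2
        # Nearest-neighbour resampling of the crop back to (H, W).
        rows = top + (np.arange(h) * crop_h) // h
        cols = left + (np.arange(w) * crop_w) // w
        frames.append(image[rows[:, None], cols[None, :]])
    return np.stack(frames)


# Usage: a small synthetic image becomes a 5-frame coarse clip.
image = np.arange(4 * 6 * 3, dtype=np.uint8).reshape(4, 6, 3)
video = coarse_video_from_image(image, num_frames=5, max_zoom=2.0)
```

The first frame is an exact copy of the input (zoom 1.0), and later frames zoom in progressively. In the pipeline the abstract describes, a coarse clip of this kind would then be passed to the video editor, which synthesizes the high-resolution detail and plausible motion.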

Categories: cs.CV

Keywords: video diffusion model, general video editor, original video, subject-driven video generation, low-resolution spatio-temporal information

Related articles:

arXiv:2401.06578 [cs.CV] (Published 2024-01-12)

360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model

Qian Wang, Weiqi Li, Chong Mou, Xinhua Cheng, Jian Zhang

arXiv:2506.17705 [cs.CV] (Published 2025-06-21)

DreamJourney: Perpetual View Generation with Video Diffusion Models

Bo Pan, Yang Chen, Yingwei Pan, Ting Yao, Wei Chen, Tao Mei

arXiv:2501.05763 [cs.CV] (Published 2025-01-10)

StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation

Shangjin Zhai et al.
