Comparison to Baselines
A dog is jumping into a river. → A horse is jumping into a river.
Input video
|
DreamMotion w/ Zeroscope
|
Tune-A-Video
|
ControlVideo
|
|
Control-A-Video
|
Gen-1
|
TokenFlow
|
A seagull is walking. → A duck is walking on the mud.
Input video
|
DreamMotion w/ Zeroscope
|
Tune-A-Video
|
ControlVideo
|
|
Control-A-Video
|
Gen-1
|
TokenFlow
|
A car is driving on the road. → A lamborghini is walking is driving on the road, on sunset.
Input video
|
DreamMotion w/ Zeroscope
|
Tune-A-Video
|
ControlVideo
|
|
Control-A-Video
|
Gen-1
|
TokenFlow
|
A man is skateboarding. → A firefighter is skateboarding.
Input video
|
|
DreamMotion w/ Show-1
|
DDIM inversion + Word swap
|
VMC
|
Cars are running on the bridge. → Buses are running on the bridge.
Input video
|
|
DreamMotion w/ Show-1
|
DDIM inversion + Word swap
|
VMC
|
• Sterling, Spencer. Zeroscope. https://huggingface.co/cerspense/zeroscope_v2_576w (2023).
• Zhang, David Junhao, et al. "Show-1: Marrying pixel and latent diffusion models for text-to-video generation." arXiv preprint arXiv:2309.15818 (2023).
• Wu, Jay Zhangjie, et al. "Tune-a-video: One-shot tuning of image diffusion models for text-to-video generation." Proceedings of the IEEE/CVF International Conference on Computer Vision (2023).
• Zhang, Yabo, et al. "Controlvideo: Training-free controllable text-to-video generation." arXiv preprint arXiv:2305.13077 (2023).
• Chen, Weifeng, et al. "Control-a-video: Controllable text-to-video generation with diffusion models." arXiv preprint arXiv:2305.13840 (2023).
• Esser, Patrick, et al. "Structure and content-guided video synthesis with diffusion models." Proceedings of the IEEE/CVF International Conference on Computer Vision (2023).
• Geyer, Michal, et al. "Tokenflow: Consistent diffusion features for consistent video editing." arXiv preprint arXiv:2307.10373 (2023).
• Jeong, Hyeonho, et al. "VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models." arXiv preprint arXiv:2312.00845 (2023).