![]() |
![]() |
![]() |
![]() |
![]() |
A person riding a horse | Big beautiful mountain with waterfall, a long view | White clouds floating in the sky over the valley river | A boy playing guitar | A white swan swimming in the water |
![]() |
![]() |
![]() |
![]() |
![]() |
An apple is falling from a tree | Red car running, a close-up video | Sailboat sailing on the sea at dusk | A football player shooting | A person walking front with his friends on the grass |
Video generation using diffusion-based models is constrained by high computational costs due to the frame-wise iterative diffusion process. This work presents a Diffusion Reuse MOtion (Dr. Mo) network to accelerate latent video generation. Our key discovery is that coarse-grained noises in earlier denoising steps have demonstrated high motion consistency across consecutive video frames. Following this observation, Dr. Mo propagates those coarse-grained noises onto the next frame by incorporating carefully designed, lightweight inter-frame motions, eliminating massive computational redundancy in frame-wise diffusion models. The more sensitive and fine-grained noises are still acquired via later denoising steps, which can be essential to retain visual qualities. As such, deciding which intermediate steps should switch from motion-based propagations to denoising can be a crucial problem and a key tradeoff between efficiency and quality. Dr. Mo employs a meta-network named Denoising Step Selector (DSS) to dynamically determine desirable intermediate steps across video frames. Extensive evaluations on video generation and editing tasks have shown that Dr. Mo can substantially accelerate diffusion models in video tasks with improved visual qualities.
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
CogVideo[1] | ![]() |
![]() |
![]() |
![]() |
Latent-Shift[2] | ![]() |
![]() |
![]() |
![]() |
Dr. Mo (Ours) | ![]() |
![]() |
![]() |
![]() |
A person playing piano | A person doing handstand pushups | A person performing a bench press | A person knitting |
VDM[3] | ![]() |
![]() |
![]() |
![]() |
SimDA[4] | ![]() |
![]() |
![]() |
![]() |
Dr. Mo (Ours) | ![]() |
![]() |
![]() |
![]() |
Mountain river | Path in a tropical forest | Forest in Autumn | Dramatic ocean sunset |