Video Generation with Embedded Tracks by Track4Gen
Track4Gen generates videos with accurate point tracks predicted using features extracted during the denoising process.
Input Image
Generated Video
Tracking results using Features
Failure Cases: Video Generation
Track4Gen may produce physically unrealistic motion and exhibit artifacts on human faces,
particularly when the size of the human subject in the video is small.
Input Image
Input Image
Generated Video
Failure Cases: Video Tracking
Track4Gen lacks robustness on videos featuring fast-moving objects or multiple semantically similar objects.
Input Video
Fast-moving Object
Fast-moving Object
Tracking Failure
Tracking Failure
Input Video
Semantically Similar Objects
Semantically Similar Objects
Tracking Failure
Tracking Failure