Diffusion-powered video tokenizer that unifies video comprehension and generation. Published at CVPR 2025.

Paper

Venue: CVPR 2025
video, tokenizer, generation, research