Unified spacetime autoregressive modeling framework for visual generation. Extends autoregressive approaches to jointly model spatial and temporal dimensions for coherent video and image generation.
generationvideovisionautoregressiveresearch