Latent motion tokens as a bridging language for learning robot manipulation from videos. Accepted at ICCV 2025 as an oral presentation.

Venue: ICCV 2025
embodied, video, research