InternVL 3.5 | Lab Index

Advances open-source multimodal models in versatility, reasoning, and efficiency. Introduces Cascade Reinforcement Learning (offline + online RL), Visual Resolution Router for dynamic token adjustment, and Decoupled Vision-Language Deployment. Up to 16% performance gain and 4x inference speedup over InternVL3. Includes native vision-language-action capabilities.

No results found