First-generation vision-language model in the Qwen family.

Paper

Citations 139
multimodalopen-weightvision

Related