First-generation vision-language model in the Qwen family.

Paper

arXiv: 2308.12966

multimodalopen-weightvision

Related