Contrastive vision-language pretraining specifically for Chinese. Predates the Qwen branding.

Paper

arXiv: 2211.01335

multimodal, nlp