Native bilingual Chinese-English image generation model with integrated LLM text encoder and character-level text rendering.

Paper

generationvision

Related