Text-to-speech model from the GLM family.

Paper

arXiv: 2512.14291

audio generation