At launch, the world's largest single-language (Chinese) model at 245B parameters (dense). Trained on 5TB of high-quality Chinese text. Passed a Turing Test variant where humans could not distinguish its news articles and poems from human-written content.

Model Details

Architecture DENSE
Parameters 245B

Paper

arXiv: 2110.04725

open-weightfrontierscaling