Second generation of open-source large language models, released in 7B and 13B parameter sizes and trained from scratch on 2.6 trillion tokens. Matches or outperforms other open-source models of similar size on benchmarks including MMLU, CMMLU, GSM8K, and HumanEval, with particular strength in vertical domains such as medicine and law.

Outputs (3)

Baichuan 2: Open Large-scale Language Models

paper

Technical report describing the training of the Baichuan 2 model series on 2.6 trillion tokens, with evaluations on public benchmarks.

arXiv: 2309.10305

Baichuan2-7B

model

7-billion-parameter variant of the Baichuan 2 series, released in both base and chat-aligned versions.

Architecture: dense
Parameters: 7B

Baichuan2-13B

model

13-billion-parameter variant of the Baichuan 2 series, released in both base and chat-aligned versions (see the loading sketch below).

Architecture: dense
Parameters: 13B
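
Both sizes are dense causal language models, so the usual transformers loading path should apply. A minimal sketch, assuming the chat weights are published on the Hugging Face Hub under baichuan-inc/Baichuan2-7B-Chat and that the repo ships custom modeling code requiring trust_remote_code (both are assumptions, not confirmed by this entry):

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed Hub repo ID; the 13B chat variant would follow the same
# naming pattern (e.g. baichuan-inc/Baichuan2-13B-Chat).
repo_id = "baichuan-inc/Baichuan2-7B-Chat"

tokenizer = AutoTokenizer.from_pretrained(repo_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.float16,  # half precision so a 7B model fits on one GPU
    device_map="auto",
    trust_remote_code=True,     # assumption: repo provides custom modeling code
)

prompt = "Summarize the strengths of Baichuan 2 in one sentence."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

Swapping repo_id for the base or 13B variants would leave the rest of the snippet unchanged.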
Tags: open-weight, nlp