Smaller, cost-efficient LongCat variant based on the "Scaling Embeddings Outperforms Scaling Experts" research. Designed for high-throughput production use cases.

Outputs 2

LongCat-Flash-Lite

model
Architecture MOE

Scaling Embeddings Outperforms Scaling Experts in Language Models

paper

arXiv: 2601.21204

moeefficiencyopen-weight

Related