Smaller, cost-efficient LongCat variant based on the "Scaling Embeddings Outperforms Scaling Experts" research. Designed for high-throughput production use cases.

Outputs 2

LongCat-Flash-Lite

model
Architecture MOE
AA Intelligence 24

Scaling Embeddings Outperforms Scaling Experts in Language Models

paper
moeefficiencyopen-weight

Related