LongCat-Flash-Lite
model paperSmaller, cost-efficient LongCat variant based on the "Scaling Embeddings Outperforms Scaling Experts" research. Designed for high-throughput production use cases.
Outputs 2
Scaling Embeddings Outperforms Scaling Experts in Language Models
paperarXiv: 2601.21204