Timeline
726 outputs ·
| Date ▾ | Name | Lab | Type | Stars | Downloads | Citations |
|---|---|---|---|---|---|---|
| 2026-04-01 | ★ GLM-5V-Turbo | Z.ai | model | — | — | — |
| 2026-03-31 | Think-Anywhere | Alibaba | paper | — | — | — |
| 2026-03-26 | Cohere Transcribe | Cohere | model | — | — | — |
| 2026-03-26 | Intern-S1-Pro | PJLab | model | — | — | — |
| 2026-03-25 | ★ LongCat-Next | Meituan | model | — | — | — |
| 2026-03-23 | Felis | ByteDance | paper | — | — | — |
| 2026-03-20 | LongCat-Flash-Prover | Meituan | model | — | 13 | — |
| 2026-03-18 | Qianfan-OCR | Baidu | paper | — | 5.5k | — |
| 2026-03-18 | ★ MiniMax-M2.7 | MiniMax | model | — | — | — |
| 2026-03-18 | MiMo-V2-Omni | Xiaomi | model | — | — | — |
| 2026-03-18 | ★ MiMo-V2-Pro | Xiaomi | model | — | — | — |
| 2026-03-18 | MiMo-V2-TTS | Xiaomi | model | — | — | — |
| 2026-03-16 | ★ Mistral Small 4 | Mistral | model | — | — | — |
| 2026-03-16 | Attention Residuals 2 | Moonshot AI | paper library | — | — | — |
| 2026-03-15 | Scientific Judge 2 | Baidu | paper dataset | — | — | — |
| 2026-03-12 | RoboBrain-Dex | BAAI | model | — | — | — |
| 2026-03-12 | Spatial-TTT | Tencent | paper | — | — | — |
| 2026-03-11 | ★ Nemotron 3 Super | NVIDIA | model | — | — | — |
| 2026-03-06 | ★ Sarvam-105B | Sarvam | model | — | — | — |
| 2026-03-06 | ★ Sarvam-30B | Sarvam | model | — | — | — |
| 2026-03-04 | RIVER | PJLab | dataset | — | — | — |
| 2026-03-01 | ★ OLMo Hybrid | Ai2 | model | — | — | — |
| 2026-02-28 | AnyTouch2 / ToucHD 2 | BAAI | dataset paper | — | — | — |
| 2026-02-25 | MaxClaw | MiniMax | library | — | — | — |
| 2026-02-17 | OLMix | Ai2 | paper | — | — | — |
| 2026-02-17 | Tiny Aya | Cohere | model | — | — | — |
| 2026-02-16 | ★ Qwen3.5 5 | Alibaba | model | 2.2k | — | — |
| 2026-02-16 | ★ Ling 2.5 | Ant Group | model | — | 339 | — |
| 2026-02-16 | ZoomBench | Ant Group | dataset | 106 | — | — |
| 2026-02-15 | Optimal Batch Size Scheduling via Functional Scaling Laws | Meituan | paper | — | — | — |
| 2026-02-14 | ★ Doubao-Seed-2.0 | ByteDance | model | — | — | — |
| 2026-02-12 | ★ MiniMax-M2.5 | MiniMax | model | — | 490.6k | — |
| 2026-02-12 | GEBench | StepFun | dataset | — | — | — |
| 2026-02-12 | Xiaomi-Robotics-0 2 | Xiaomi | model paper | — | — | — |
| 2026-02-11 | Ming-Flash-Omni-2.0 | Ant Group | model | — | — | — |
| 2026-02-11 | MiniCPM-SALA | OpenBMB | model | — | 1.9k | — |
| 2026-02-11 | ★ Step-3.5-Flash 3 | StepFun | model paper dataset | — | 85.1k | — |
| 2026-02-11 | ★ GLM-5 3 | Z.ai | model paper | — | 125k | — |
| 2026-02-11 | Slime: Asynchronous RL for Agentic Tasks | Z.ai | library | — | — | — |
| 2026-02-09 | Protenix | ByteDance | model | — | — | — |
| 2026-02-07 | Seedance 2.0 | ByteDance | model | — | — | — |
| 2026-02-06 | Baichuan-M3 2 | Baichuan | paper model | — | 1k | — |
| 2026-02-05 | ★ Kling 3.0 2 | Kuaishou | model paper | — | — | — |
| 2026-02-05 | CL-bench | Tencent | dataset | — | — | — |
| 2026-02-04 | RationaleRM | Alibaba | dataset | — | — | — |
| 2026-02-03 | FactNet & NOSA | OpenBMB | dataset | — | — | — |
| 2026-02-03 | ★ MiniCPM-o 4.5 | OpenBMB | model | — | 35.5k | — |
| 2026-02-02 | ★ Kimi K2.5 2 | Moonshot AI | model paper | — | — | — |
| 2026-02-01 | KuaiSearch | Kuaishou | dataset | — | — | — |
| 2026-01-29 | MemOCR | Meituan | library | — | — | — |
| 2026-01-29 | SenseNova-MARS 3 | SenseTime | model paper dataset | — | 423 | — |
| 2026-01-28 | ★ Trinity Large | Arcee | model | — | — | — |
| 2026-01-28 | Trinity Mini / Nano | Arcee | model | — | — | — |
| 2026-01-28 | ACE-Step-1.5 | StepFun | model | — | — | — |
| 2026-01-27 | LongCat-Flash-Lite 2 | Meituan | model paper | — | 1.1k | — |
| 2026-01-26 | DeepPlanning | Alibaba | dataset | — | — | — |
| 2026-01-26 | ★ Solar Pro 3 | Upstage | model | — | — | — |
| 2026-01-23 | ★ LongCat-Flash-Thinking-2601 2 | Meituan | model paper | — | 102 | — |
| 2026-01-22 | ★ ERNIE 5.0 2 | Baidu | model paper | — | — | — |
| 2026-01-22 | EvoCUA | Meituan | library | — | 10.3k | — |
| 2026-01-20 | ★ Yuan 3.0 Ultra | Inspur | model | — | — | — |
| 2026-01-20 | Step-3-VL-10B 2 | StepFun | model paper | — | 180.9k | — |
| 2026-01-14 | GLM-Image | Z.ai | model | — | — | — |
| 2026-01-12 | Engram: Conditional Memory via Scalable Lookup | DeepSeek | paper | — | — | — |
| 2026-01-11 | ★ Solar Open 100B | Upstage | model | — | — | — |
| 2026-01-09 | PaCoRe: Learning to Scale Test-Time Compute | StepFun | paper | — | 530 | — |
| 2026-01-05 | Yuan 3.0 Flash | Inspur | model | — | — | — |
| 2026-01-05 | ★ K-EXAONE | LG | model | — | — | — |
| 2026-01-05 | ★ HyperCLOVA X SEED Omni | Naver | model | — | — | — |
| 2026-01-05 | ★ Falcon-H1R | TII | model | — | — | — |
| 2026-01-03 | ★ HyperCLOVA X SEED Think | Naver | model | — | — | — |
| 2026-01-01 | FlashInfer-python-paddle | Baidu | library | — | — | — |
| 2026-01-01 | PinchBench & ClawEval | Xiaomi | dataset | — | — | — |
| 2026-01-01 | Agentar-Z-100K | Z.ai | dataset | — | — | — |
| 2025-12-31 | DATAMASK | ByteDance | paper | — | — | — |
| 2025-12-31 | FineWeb-Mask | ByteDance | dataset | — | — | — |
| 2025-12-31 | mHC: Manifold-Constrained Hyper-Connections | DeepSeek | paper | — | — | — |
| 2025-12-31 | OpenOneRec | Kuaishou | library | — | 147 | — |
| 2025-12-30 | SeedFold | ByteDance | paper | — | — | — |
| 2025-12-30 | LongCat ZigZag Attention | Meituan | paper | — | 7 | — |
| 2025-12-27 | ★ A.X K1 | SK Telecom | model | — | — | — |
| 2025-12-23 | ★ MiniMax-M2.1 | MiniMax | model | — | 46.8k | — |
| 2025-12-23 | VIBE & OctoCodingBench | MiniMax | dataset | — | — | — |
| 2025-12-23 | Step-DeepResearch | StepFun | library | — | — | — |
| 2025-12-22 | SekoTalk / Seko 2.0 | SenseTime | model | — | — | — |
| 2025-12-22 | ★ GLM-4.7 | Z.ai | model | — | — | — |
| 2025-12-18 | ★ Seed1.8 | ByteDance | model | — | — | — |
| 2025-12-18 | EXAONE Path 2.5 | LG | paper | — | — | — |
| 2025-12-18 | Towards Scalable Pre-training of Visual Tokenizers | MiniMax | paper | — | — | — |
| 2025-12-18 | HY-Motion 1.0 | Tencent | paper | — | — | — |
| 2025-12-16 | ★ Molmo 2 | Ai2 | model | — | — | — |
| 2025-12-16 | ★ MiMo-V2-Flash 2 | Xiaomi | model paper | — | 211.4k | — |
| 2025-12-16 | MOPD (Multi-Teacher On-Policy Distillation) | Xiaomi | library | — | — | — |
| 2025-12-16 | GLM-TTS | Z.ai | model | — | — | — |
| 2025-12-15 | ★ Nemotron 3 Nano | NVIDIA | model | — | — | — |
| 2025-12-12 | CI-VID | BAAI | dataset | 29 | — | — |
| 2025-12-10 | LLaDA 2 2 | Ant Group | model paper | — | — | — |
| 2025-12-09 | ★ JAIS 2 | MBZUAI | model | — | — | — |
| 2025-12-08 | LongCat-Image 3 | Meituan | model paper | — | 80.3k | — |
| 2025-12-06 | ★ K2-V2 (LLM360) | MBZUAI | model | — | — | — |
| 2025-12-05 | NEO (Native VLM Architecture) 2 | SenseTime | model paper | — | — | 1 |
| 2025-12-05 | ★ Hunyuan 2.0 | Tencent | model | — | — | — |
| 2025-12-02 | ★ Mistral Large 3 | Mistral | model | — | — | — |
| 2025-12-01 | Ministral 3 | Mistral | model | — | — | — |
| 2025-11-30 | gelab-zero (STEP-GUI) | StepFun | library | — | 436 | — |
| 2025-11-27 | DeepSeek-Math-V2 2 | DeepSeek | model dataset | — | 4.9k | — |
| 2025-11-24 | HunyuanOCR | Tencent | model | — | 401.3k | — |
| 2025-11-20 | ★ OLMo 3 | Ai2 | model | — | — | — |
| 2025-11-20 | HunyuanVideo-1.5 | Tencent | model | — | — | — |
| 2025-11-20 | MiMo-Embodied: X-Embodied Foundation Model | Xiaomi | paper | — | 166 | — |
| 2025-11-19 | LPLB (Linear-Programming Load Balancer) | DeepSeek | library | — | — | — |
| 2025-11-19 | Step-Audio-R1 | StepFun | model | — | 33 | — |
| 2025-11-17 | SenseNova-SI (Spatial Intelligence) 3 | SenseTime | model paper dataset | — | — | — |
| 2025-11-14 | Miloco (Xiaomi Local Copilot) | Xiaomi | library | — | — | — |
| 2025-11-13 | M100 Chip | Baidu | announcement | — | — | — |
| 2025-11-12 | T-Rex-Omni 2 | IDEA Lab | model paper | — | — | — |
| 2025-11-10 | kosong | Moonshot AI | library | — | — | — |
| 2025-11-06 | InfinityStar | ByteDance | model | — | — | — |
| 2025-11-06 | Step-Audio-EditX | StepFun | model | — | 16.9k | — |
| 2025-11-03 | ★ LongCat-Flash-Omni 2 | Meituan | model paper | — | 89 | — |
| 2025-11-01 | LightX2V | SenseTime | library | — | — | — |
| 2025-10-31 | GATE | LG | paper | — | — | — |
| 2025-10-30 | ★ Emu3.5 | BAAI | model | 1.5k | 1.6k | — |
| 2025-10-30 | Kimi Linear 2 | Moonshot AI | model paper | — | — | — |
| 2025-10-28 | MeasureBench | BAAI | dataset | 6 | — | — |
| 2025-10-28 | ODesign | BAAI | model | — | — | — |
| 2025-10-28 | URSA | BAAI | model | — | 4 | — |
| 2025-10-27 | ★ MiniMax-M2 | MiniMax | model | — | 123k | — |
| 2025-10-27 | CoKE: Context as the Key to Biomolecular Understanding | PJLab | paper | — | — | — |
| 2025-10-27 | JanusCoder | PJLab | model | — | 33 | — |
| 2025-10-27 | Hunyuan Mirror | Tencent | paper | — | 7.1k | — |
| 2025-10-25 | LongCat-Video 3 | Meituan | model paper | — | 1.2k | — |
| 2025-10-24 | KAT-Coder 2 | Kuaishou | model paper | — | — | — |
| 2025-10-22 | Seed3D 1.0 | ByteDance | model | — | — | — |
| 2025-10-20 | DeepSeek-OCR / OCR-2 | DeepSeek | model | — | 3M | — |
| 2025-10-17 | LongCat-Audio-Codec | Meituan | paper | — | — | — |
| 2025-10-17 | WOWService: LLM-Powered Intelligent Interaction System | Meituan | paper | — | — | — |
| 2025-10-15 | ERQA+ | BAAI | dataset | — | — | — |
| 2025-10-15 | Multi-Resolution Quantum Embedding for Surface Chemistry | ByteDance | paper | — | — | — |
| 2025-10-15 | InteractiveOmni 2 | SenseTime | model paper | — | — | — |
| 2025-10-14 | Rex-Omni 2 | IDEA Lab | model paper | — | 27.6k | — |
| 2025-10-09 | ★ Ling 2.0 / Ling-1T 2 | Ant Group | model paper | — | 2.4k | — |
| 2025-10-01 | R-HORIZON-Websearch | Meituan | dataset | — | — | — |
| 2025-09-30 | ★ GLM-4.6 | Z.ai | model | — | — | — |
| 2025-09-29 | Ring 3 | Ant Group | model paper | 242 | 18.8k | — |
| 2025-09-29 | ★ DeepSeek-V3.2 2 | DeepSeek | model paper | — | 291.1k | 1 |
| 2025-09-28 | HunyuanImage-3.0 2 | Tencent | model paper | — | 675 | — |
| 2025-09-26 | Qwen3Guard | Alibaba | model | 439 | — | — |
| 2025-09-25 | Expanding Reasoning Potential (CoTP) | Meituan | paper | — | — | — |
| 2025-09-24 | LRM-Eval / ROME | BAAI | dataset | 5 | — | — |
| 2025-09-23 | ByteWrist | ByteDance | model | — | — | — |
| 2025-09-23 | DORA System | Meituan | library | — | — | — |
| 2025-09-23 | ★ LongCat-Flash-Thinking 2 | Meituan | model paper | — | 83 | — |
| 2025-09-23 | Symphony-MoE | PCL | paper | — | — | — |
| 2025-09-22 | BGE-Reasoner | BAAI | model | 24 | 710 | — |
| 2025-09-22 | ScaleCUA | PJLab | model | — | 89 | — |
| 2025-09-18 | Seedream 4.0 | ByteDance | model | — | — | — |
| 2025-09-15 | checkpoint-engine | Moonshot AI | library | — | — | — |
| 2025-09-05 | Klear 3 | Kuaishou | model paper | — | 2.3k | — |
| 2025-09-05 | ★ MiniCPM4.1 2 | OpenBMB | model paper | — | 39.5k | — |
| 2025-09-02 | Baichuan-M2 2 | Baichuan | paper model | — | 234.7k | 1 |
| 2025-09-02 | ★ Apertus | Swiss AI | model | — | — | — |
| 2025-09-01 | VeOmni | ByteDance | library | — | — | — |
| 2025-09-01 | ★ LongCat-Flash-Chat 2 | Meituan | model paper | — | 40.7k | — |
| 2025-09-01 | Hunyuan-MT | Tencent | model | — | 26.3k | — |
| 2025-08-28 | HyperOS 3 | Xiaomi | announcement | — | — | — |
| 2025-08-26 | ★ MiniCPM-V 4.5 2 | OpenBMB | model paper | — | 93.4k | — |
| 2025-08-25 | GEPO | PCL | paper | — | — | — |
| 2025-08-25 | InternVL 3.5 | PJLab | model | — | — | — |
| 2025-08-23 | HunyuanVideo-Foley | Tencent | paper | — | — | — |
| 2025-08-21 | Waver | ByteDance | model | — | — | — |
| 2025-08-21 | ★ DeepSeek-V3.1 | DeepSeek | model | — | — | — |
| 2025-08-21 | Intern-S1 | PJLab | model | — | — | — |
| 2025-08-20 | ★ Seed-OSS-36B | ByteDance | model | — | 26.8k | — |
| 2025-08-20 | Nemotron Nano V2 | NVIDIA | model | — | — | — |
| 2025-08-15 | AI4Research: A Survey of Artificial Intelligence for Scientific Research | ByteDance | paper | — | — | — |
| 2025-08-15 | PXDesign | ByteDance | model | — | — | — |
| 2025-08-15 | Physical Autoregressive Model (PAR) | PCL | paper | — | — | — |
| 2025-08-14 | NextStep-1 2 | StepFun | model paper | — | 37 | — |
| 2025-08-14 | Hunyuan-GameCraft 1.0 | Tencent | model | — | 42 | — |
| 2025-08-13 | AutoCodeBench | Tencent | dataset | — | — | — |
| 2025-08-12 | InternBootcamp | PJLab | library | — | — | — |
| 2025-08-11 | GLM-4.5V | Z.ai | model | — | 46.6k | — |
| 2025-08-07 | CANN | Huawei | library | — | — | — |
| 2025-08-07 | TMA-Adaptive FP8 Grouped GEMM | PJLab | paper | — | — | — |
| 2025-08-06 | ACAVCaps | Xiaomi | dataset | — | — | — |
| 2025-08-05 | OmniScale | ByteDance | paper | — | — | — |
| 2025-08-05 | Seed Diffusion | ByteDance | model | — | — | — |
| 2025-08-01 | Qwen-Image 2 | Alibaba | model | 7.6k | 223.7k | — |
| 2025-07-31 | Seed-Prover | ByteDance | model | — | — | — |
| 2025-07-29 | Libra-Bench & PIE_bench | Meituan | dataset | — | — | — |
| 2025-07-28 | MixGRPO | Tencent | paper | — | — | — |
| 2025-07-28 | ★ GLM-4.5 2 | Z.ai | model paper | — | — | 1 |
| 2025-07-27 | SenseNova V6.5 | SenseTime | model | — | — | — |
| 2025-07-27 | StepFun-Prover-Preview | StepFun | model | — | 47 | — |
| 2025-07-27 | HunyuanWorld 2 | Tencent | model | — | 1.3k | — |
| 2025-07-25 | ★ Step-3 2 | StepFun | model paper | — | 73.4k | — |
| 2025-07-24 | A.X 3.1 | SK Telecom | model | — | — | — |
| 2025-07-23 | Towards Greater Leverage: Scaling Laws for Efficient MoE | Ant Group | paper | — | — | — |
| 2025-07-22 | Qwen-Code | Alibaba | library | 20.9k | — | — |
| 2025-07-22 | Qwen3-Coder 2 | Alibaba | model | 16.1k | 1.3M | — |
| 2025-07-22 | Seed-X Series | ByteDance | model | — | 1k | — |
| 2025-07-17 | Agentar-DeepFinance-100K | Ant Group | dataset | 34 | — | — |
| 2025-07-14 | ★ EXAONE 4.0 | LG | model | — | — | — |
| 2025-07-11 | ★ Kimi K2 4 | Moonshot AI | model paper | — | 3.8M | 3 |
| 2025-07-10 | FlexOlmo | Ai2 | model | — | — | — |
| 2025-07-10 | KAT (Kwai-AutoThink) 2 | Kuaishou | paper model | — | 138 | — |
| 2025-07-09 | EXAONE Path 2.0 | LG | paper | — | — | — |
| 2025-07-08 | ArtifactsBenchmark | Tencent | dataset | — | — | — |
| 2025-07-07 | POLAR | PJLab | paper | — | — | — |
| 2025-07-03 | ★ A.X 4.0 | SK Telecom | model | — | — | — |
| 2025-07-01 | Voxtral | Mistral | model | — | — | — |
| 2025-07-01 | ★ Solar Pro 2 | Upstage | model | — | — | — |
| 2025-06-30 | ★ openPangu | Huawei | announcement | — | — | — |
| 2025-06-27 | ★ HyperCLOVA X THINK | Naver | model | — | — | — |
| 2025-06-27 | Hunyuan-A13B 2 | Tencent | model paper | — | 21.4k | — |
| 2025-06-26 | Kwai Keye-VL 2 | Kuaishou | model paper | — | 79.9k | — |
| 2025-06-24 | Video-XL-2 | BAAI | model | — | 223 | — |
| 2025-06-16 | SciSage / SurveyScope | BAAI | library | — | — | — |
| 2025-06-16 | ★ MiniMax-M1 2 | MiniMax | model paper | — | 12.1k | — |
| 2025-06-12 | ★ Magistral | Mistral | model | — | — | — |
| 2025-06-12 | Predictable Scale Part II: Farseer | StepFun | paper | — | — | — |
| 2025-06-11 | FlagEvalMM | BAAI | library | 101 | — | — |
| 2025-06-10 | Seedance 1.0 | ByteDance | model | — | — | — |
| 2025-06-09 | Infinity-Instruct | BAAI | dataset | — | — | — |
| 2025-06-06 | RoboBrain 2.0 2 | BAAI | model | — | — | — |
| 2025-06-06 | ★ MiniCPM4 2 | OpenBMB | model paper | — | 729 | — |
| 2025-06-06 | Ultra-FineWeb | OpenBMB | dataset | — | 25 | — |
| 2025-06-05 | RoboRefer / RefSpatial | BAAI | model | — | — | — |
| 2025-06-04 | MiMo-VL 2 | Xiaomi | model paper | — | 1.6k | — |
| 2025-06-01 | HumanSense Benchmark | Ant Group | dataset | — | — | — |
| 2025-06-01 | BrowseComp & WideSearch | Moonshot AI | dataset | — | — | — |
| 2025-06-01 | kimi-agent-sdk | Moonshot AI | library | — | — | — |
| 2025-06-01 | kimi-cli | Moonshot AI | library | — | — | — |
| 2025-06-01 | Kimi-Dev 2 | Moonshot AI | model paper | — | 2.7k | — |
| 2025-06-01 | Kimi-Researcher | Moonshot AI | model | — | — | — |
| 2025-06-01 | walle | Moonshot AI | library | — | — | — |
| 2025-06-01 | AgentCPM Infrastructure | OpenBMB | library | — | — | — |
| 2025-06-01 | AgentCPM Series 3 | OpenBMB | paper | — | — | — |
| 2025-06-01 | A.X Encoder | SK Telecom | model | — | — | — |
| 2025-06-01 | CF-Div2-Stepfun | StepFun | dataset | — | — | — |
| 2025-06-01 | SteptronOss | StepFun | library | — | — | — |
| 2025-06-01 | WeKnora | Tencent | library | — | — | — |
| 2025-06-01 | MiMo-Audio 2 | Xiaomi | model paper | — | — | — |
| 2025-06-01 | BrowseComp | Z.ai | dataset | — | — | — |
| 2025-06-01 | KTransformers | Z.ai | library | — | — | — |
| 2025-06-01 | TQA (Temporal Question Answering) | Z.ai | dataset | — | — | — |
| 2025-05-30 | AReaL | Ant Group | library | 4.9k | — | — |
| 2025-05-28 | Ming-Omni | Ant Group | model | 645 | 10.2k | — |
| 2025-05-28 | ★ DeepSeek-R1-0528 | DeepSeek | model | — | — | — |
| 2025-05-28 | Pangu Embedded | Huawei | paper | — | 138 | — |
| 2025-05-28 | ★ Skywork Open Reasoner 1 | Skywork | model | — | — | — |
| 2025-05-27 | Pangu Pro MoE | Huawei | paper | — | 47 | — |
| 2025-05-27 | HunyuanVideo-Avatar | Tencent | paper | — | — | — |
| 2025-05-26 | SynLogic 2 | MiniMax | paper dataset | — | 496 | — |
| 2025-05-25 | Multimodal Generative Retrieval for Food Delivery | Meituan | paper | — | — | — |
| 2025-05-23 | One RL to See Them All: Visual Triple Unified RL | MiniMax | paper | — | — | — |
| 2025-05-22 | XRing O1 | Xiaomi | announcement | — | — | — |
| 2025-05-21 | Devstral 2 | Mistral | model | — | — | — |
| 2025-05-21 | ★ Falcon-H1 | TII | model | — | — | — |
| 2025-05-20 | BAGEL | ByteDance | model | — | 6.6k | — |
| 2025-05-19 | Video-SafetyBench | BAAI | dataset | — | — | — |
| 2025-05-17 | Model Merging in Pre-training of LLMs | ByteDance | paper | — | — | — |
| 2025-05-15 | BGE-Code-v1 | BAAI | model | 11.4k | 13.4k | — |
| 2025-05-12 | Seed1.5-VL | ByteDance | model | — | — | — |
| 2025-05-12 | MiniMax-Speech: Intrinsic Zero-Shot TTS | MiniMax | paper | — | — | — |
| 2025-05-12 | Step1X-3D: High-Fidelity Textured 3D Assets | StepFun | model | — | — | — |
| 2025-05-10 | Gated Attention for Large Language Models | Alibaba | paper | — | — | — |
| 2025-05-07 | DeerFlow | ByteDance | library | — | — | — |
| 2025-05-07 | ★ Pangu Ultra MoE 2 | Huawei | model paper | — | 9 | — |
| 2025-05-07 | HunyuanCustom | Tencent | paper | — | — | — |
| 2025-05-06 | CCI 4.0 | BAAI | dataset | — | — | — |
| 2025-05-06 | OpenSeek | BAAI | model | — | 1 | — |
| 2025-05-06 | RoboOS 2 | BAAI | library | — | — | — |
| 2025-05-02 | ★ MiMo (Reasoning) 2 | Xiaomi | model paper | — | 41.1k | — |
| 2025-05-01 | AWorld | Ant Group | library | 1.2k | — | — |
| 2025-04-30 | DeepSeek-Prover-V2 | DeepSeek | model | — | — | — |
| 2025-04-29 | ★ Qwen3 9 | Alibaba | model paper | 27k | — | 40 |
| 2025-04-25 | PolyMath | Alibaba | dataset | 43 | — | — |
| 2025-04-25 | Kimi-Audio 2 | Moonshot AI | model paper | — | 20.2k | — |
| 2025-04-24 | Step1X-Edit | StepFun | model | — | 62 | — |
| 2025-04-21 | Chinese-LiPS | BAAI | dataset | 9 | — | — |
| 2025-04-19 | SRPO: Staged History-Resampling Policy Optimization | Kuaishou | paper | — | — | — |
| 2025-04-15 | DataDecide | Ai2 | paper | — | — | — |
| 2025-04-15 | ★ Kling 2.0 1 | Kuaishou | model | — | — | — |
| 2025-04-15 | Kimina-Prover 2 | Moonshot AI | model paper | — | 809 | — |
| 2025-04-15 | miniF2F-test (Rectified) | Moonshot AI | dataset | — | — | — |
| 2025-04-15 | Step-R1-V-Mini | StepFun | model | — | — | — |
| 2025-04-14 | InternVL3 | PJLab | model | — | — | — |
| 2025-04-12 | SenseNova V6 | SenseTime | model | — | — | — |
| 2025-04-10 | ★ Seed1.5-Thinking: Advancing Superb Reasoning Models with RL | ByteDance | paper | — | — | 1 |
| 2025-04-10 | ★ Pangu Ultra 2 | Huawei | model paper | — | — | — |
| 2025-04-10 | Kimi-VL 2 | Moonshot AI | model paper | — | 104.2k | 1 |
| 2025-04-08 | Dream 7B | Huawei | model | — | — | — |
| 2025-04-08 | ★ Skywork R1V Series | Skywork | model | — | — | — |
| 2025-04-07 | BaichuanMed-OCR | Baichuan | model | — | 37 | — |
| 2025-04-04 | ★ Nemotron-H | NVIDIA | model | — | — | — |
| 2025-04-03 | DeepSeek-GRM: Inference-Time Scaling for Generalist Reward Modeling | DeepSeek | paper | — | — | 1 |
| 2025-04-01 | MiniMax Speech Series | MiniMax | model | — | — | — |
| 2025-03-28 | Doubao-Deep-Thinking | ByteDance | model | — | — | — |
| 2025-03-27 | OpenComplex 2 | BAAI | model | — | — | — |
| 2025-03-26 | Qwen2.5-Omni-7B | Alibaba | model | 4k | 466.1k | — |
| 2025-03-21 | Hunyuan-T1 | Tencent | model | — | — | — |
| 2025-03-20 | SeniorTalk | BAAI | dataset | — | — | — |
| 2025-03-18 | Sable | BAAI | model | — | — | — |
| 2025-03-18 | ★ Llama-Nemotron (Nano/Super/Ultra) | NVIDIA | model | — | — | — |
| 2025-03-18 | HaploVL | Tencent | model | — | — | — |
| 2025-03-17 | ★ EXAONE Deep | LG | model | — | — | — |
| 2025-03-16 | ★ ERNIE 4.5 | Baidu | model | — | — | — |
| 2025-03-16 | ERNIE X1 | Baidu | model | — | — | — |
| 2025-03-10 | Seedream 2.0 | ByteDance | paper | — | — | — |
| 2025-03-07 | ★ Ling 2 | Ant Group | paper model | 242 | 11.6k | 1 |
| 2025-03-06 | QwQ-32B | Alibaba | model | — | 54.9k | — |
| 2025-03-06 | BGE-VL 2 | BAAI | model dataset | 11.4k | 3.3k | — |
| 2025-03-06 | Predictable Scale Part I: Step Law | StepFun | paper | — | — | — |
| 2025-03-04 | CogView-4 | Z.ai | model | — | — | — |
| 2025-03-03 | ★ Aya Vision | Cohere | model | — | — | — |
| 2025-03-01 | ★ Command A | Cohere | model | — | — | — |
| 2025-03-01 | CPM.cu | OpenBMB | library | — | — | — |
| 2025-02-28 | 3FS (Fire-Flyer File System) | DeepSeek | library | — | — | — |
| 2025-02-28 | Smallpond | DeepSeek | library | — | — | — |
| 2025-02-28 | Image-01 | MiniMax | model | — | — | — |
| 2025-02-27 | RoboBrain | BAAI | model | — | 99.7k | — |
| 2025-02-27 | UniTok | ByteDance | paper | — | — | — |
| 2025-02-27 | DualPipe | DeepSeek | library | — | — | — |
| 2025-02-27 | EPLB (Expert Parallelism Load Balancer) | DeepSeek | library | — | — | — |
| 2025-02-27 | Hunyuan Turbo S 2 | Tencent | model paper | — | — | — |
| 2025-02-26 | DeepGEMM | DeepSeek | library | — | — | — |
| 2025-02-25 | DeepEP | DeepSeek | library | — | — | — |
| 2025-02-24 | Baichuan-Audio 2 | Baichuan | paper model | — | 37 | — |
| 2025-02-24 | FlashMLA | DeepSeek | library | — | — | — |
| 2025-02-24 | Muon Optimizer 2 | Moonshot AI | paper library | — | — | 1 |
| 2025-02-22 | Moonlight-3B/16B | Moonshot AI | model | — | 78.4k | 1 |
| 2025-02-19 | Qwen2.5-VL | Alibaba | model | — | — | — |
| 2025-02-18 | MoBA: Mixture of Block Attention for Long-Context LLMs | Moonshot AI | paper | — | — | 1 |
| 2025-02-18 | Hunyuan-Large-Vision | Tencent | model | — | — | — |
| 2025-02-17 | Mistral Saba | Mistral | model | — | — | — |
| 2025-02-17 | OpenDWM / MaskGWM 2 | SenseTime | library paper | — | — | — |
| 2025-02-17 | Step-Audio / Step-Audio2 | StepFun | model | — | — | — |
| 2025-02-16 | NSA: Native Sparse Attention | DeepSeek | paper | — | — | 2 |
| 2025-02-15 | 1bit-Merging 1 | Huawei | paper | — | — | — |
| 2025-02-14 | WebOrganizer | Ai2 | paper | — | — | — |
| 2025-02-14 | LLaDA 2 | Ant Group | model | 3.7k | 1.8k | 4 |
| 2025-02-14 | Step-Video-T2V 2 | StepFun | model paper | — | — | — |
| 2025-01-26 | Baichuan-Omni-1.5 2 | Baichuan | paper model | — | 316 | — |
| 2025-01-24 | Baichuan-M1 3 | Baichuan | paper model | — | 828 | 5 |
| 2025-01-23 | UltraRAG | OpenBMB | library | — | — | — |
| 2025-01-22 | ★ Doubao-1.5-Pro | ByteDance | model | — | — | — |
| 2025-01-22 | UI-TARS | ByteDance | library | — | 140.8k | 4 |
| 2025-01-22 | ★ DeepSeek-R1 | DeepSeek | model | — | 1.6M | — |
| 2025-01-21 | Hunyuan3D 2.0 3 | Tencent | model paper | — | — | — |
| 2025-01-20 | ★ Kimi k1.5 2 | Moonshot AI | model paper | — | — | 10 |
| 2025-01-17 | ComplexFuncBench | Z.ai | dataset | — | — | — |
| 2025-01-15 | ★ InternLM3 | PJLab | model | — | — | — |
| 2025-01-14 | ★ MiniMax-01 3 | MiniMax | model paper | — | 101.3k | 1 |
| 2025-01-14 | ★ MiniCPM-o 2.6 | OpenBMB | model | — | 114.8k | — |
| 2025-01-10 | GThinker | PCL | model | — | — | — |
| 2025-01-09 | WanJuan 3.0 (WanJuan-SiLu) | PJLab | dataset | — | — | — |
| 2025-01-07 | ★ Cosmos | NVIDIA | model | — | — | — |
| 2025-01-03 | AgentRefine | Meituan | paper | — | — | — |
| 2025-01-01 | Document Parse | Upstage | library | — | — | — |
| 2024-12-31 | ★ OLMo 2 | Ai2 | model | — | — | — |
| 2024-12-26 | ★ DeepSeek-V3 3 | DeepSeek | model paper | — | — | 206 |
| 2024-12-25 | QVQ | Alibaba | model | — | — | — |
| 2024-12-23 | Baichuan4-Finance 2 | Baichuan | paper model | — | — | 1 |
| 2024-12-18 | NOVA | BAAI | model | — | — | — |
| 2024-12-13 | DeepSeek-VL2 | DeepSeek | model | — | — | — |
| 2024-12-10 | See3D | BAAI | model | — | — | — |
| 2024-12-09 | ProcessBench | Alibaba | dataset | — | — | 1 |
| 2024-12-06 | ★ Aya Expanse | Cohere | model | — | — | — |
| 2024-12-06 | ★ EXAONE 3.5 | LG | model | — | — | — |
| 2024-12-06 | Densing Law of LLMs | OpenBMB | paper | — | — | — |
| 2024-12-06 | InternVL 2.5 | PJLab | model | — | — | — |
| 2024-12-05 | Language Model Ladders | Ai2 | paper | — | — | — |
| 2024-12-05 | Infinity & InfinityStar | ByteDance | model | — | — | 1 |
| 2024-12-05 | Liquid: Scalable Multi-modal Generation | ByteDance | model | — | — | — |
| 2024-12-05 | Divot | Tencent | model | — | — | — |
| 2024-12-05 | Moto | Tencent | paper | — | — | — |
| 2024-12-03 | HunyuanVideo | Tencent | model | — | 666 | 6 |
| 2024-12-03 | SEED-Voken | Tencent | paper | — | — | — |
| 2024-12-03 | GLM-4-Voice: End-to-End Spoken Chatbot | Z.ai | model | — | — | 1 |
| 2024-12-01 | ★ Falcon 3 | TII | model | — | — | — |
| 2024-11-22 | ★ Tülu 3 | Ai2 | model | — | — | — |
| 2024-11-21 | DINO-X 2 | IDEA Lab | model paper | — | — | 4 |
| 2024-11-20 | Hymba | NVIDIA | paper | — | — | — |
| 2024-11-19 | Aquila-VL-2B | BAAI | model | — | 256 | — |
| 2024-11-15 | KuaiFormer | Kuaishou | library | — | — | — |
| 2024-11-04 | ★ Hunyuan-Large 2 | Tencent | model paper | — | 977 | 5 |
| 2024-11-04 | Hunyuan3D 1.0 | Tencent | model | — | 92.4k | 6 |
| 2024-11-01 | InternThinker | PJLab | model | — | — | — |
| 2024-10-28 | AutoGLM 2 | Z.ai | model paper | — | — | — |
| 2024-10-24 | Infinity-MM | BAAI | dataset | — | — | — |
| 2024-10-24 | MotionCLR 2 | IDEA Lab | library paper | — | — | — |
| 2024-10-24 | ★ Skywork-Reward 2 | Skywork | model | — | — | — |
| 2024-10-22 | OmniGen | BAAI | model | — | 10 | 1 |
| 2024-10-17 | Janus 4 | DeepSeek | model paper dataset | — | 53.4k | 11 |
| 2024-10-11 | Baichuan-Omni 2 | Baichuan | paper model | — | — | — |
| 2024-10-07 | ★ Falcon Mamba | TII | model | — | — | — |
| 2024-10-02 | ★ Llama-3.1-Nemotron-70B | NVIDIA | model | — | — | — |
| 2024-09-27 | ★ Emu3 2 | BAAI | paper | 2.4k | — | 4 |
| 2024-09-26 | DreamWaltz-G 2 | IDEA Lab | library paper | — | — | — |
| 2024-09-25 | ★ Molmo | Ai2 | model | — | — | — |
| 2024-09-23 | MobileUI Dataset | Xiaomi | dataset | — | — | — |
| 2024-09-23 | MobileVLM | Xiaomi | model | — | — | — |
| 2024-09-19 | ★ Qwen2.5 3 | Alibaba | model paper | 27k | — | 57 |
| 2024-09-18 | Qwen2-VL | Alibaba | model | — | 48.7k | 246 |
| 2024-09-18 | Qwen2.5-Coder 2 | Alibaba | model paper | 16.1k | — | 30 |
| 2024-09-18 | Qwen2.5-Math 2 | Alibaba | model paper | 1.1k | — | 10 |
| 2024-09-11 | ★ Pixtral 12B | Mistral | model | — | — | — |
| 2024-09-05 | ★ DeepSeek-V2.5 | DeepSeek | model | — | 7.2k | — |
| 2024-09-05 | ★ MiniCPM3-4B | OpenBMB | model | — | 14.1k | — |
| 2024-09-05 | Open-MAGVIT2 | Tencent | library | — | — | — |
| 2024-09-03 | ★ OLMoE | Ai2 | model | — | — | — |
| 2024-08-31 | Hailuo AI (Video-01 / 2.3) | MiniMax | model | — | — | — |
| 2024-08-29 | CogVLM2 | Z.ai | model | — | — | — |
| 2024-08-28 | Auxiliary-Loss-Free Load Balancing Strategy | DeepSeek | paper | — | — | — |
| 2024-08-26 | Fire-Flyer AI-HPC: Cost-Effective Software-Hardware Co-Design | DeepSeek | paper | — | — | — |
| 2024-08-21 | Minitron | NVIDIA | paper | — | — | — |
| 2024-08-12 | CogVideoX: Text-to-Video Diffusion Models | Z.ai | model | — | — | 15 |
| 2024-08-07 | ★ EXAONE 3.0 | LG | model | — | — | — |
| 2024-08-05 | MiniCPM-V 2.6 | OpenBMB | model | — | — | — |
| 2024-08-01 | EXAONEPath 1.0 | LG | paper | — | — | — |
| 2024-08-01 | MiniMax Music Series | MiniMax | model | — | — | — |
| 2024-07-29 | MindSearch | PJLab | library | — | — | 2 |
| 2024-07-26 | Ying (影) | Z.ai | model | — | — | — |
| 2024-07-24 | ★ Mistral Large 2 | Mistral | model | — | — | — |
| 2024-07-20 | Consent in Crisis: The Rapid Decline of the AI Data Commons | Cohere | paper | — | — | — |
| 2024-07-20 | ★ Falcon 2 | TII | model | — | — | — |
| 2024-07-18 | Mistral NeMo | Mistral | model | — | — | — |
| 2024-07-16 | Codestral Mamba | Mistral | model | — | — | — |
| 2024-07-11 | EchoMimicV2 & V3 | Ant Group | paper | 4.2k | — | 2 |
| 2024-07-11 | Skywork-Math | Skywork | paper | — | — | — |
| 2024-07-06 | Kolors 2 | Kuaishou | model paper | — | 594 | — |
| 2024-07-05 | ★ SenseNova 5.5 | SenseTime | model | — | — | — |
| 2024-07-05 | Vimi | SenseTime | model | — | — | — |
| 2024-07-04 | ★ Step-2 | StepFun | model | — | — | — |
| 2024-07-03 | LivePortrait 2 | Kuaishou | library paper | — | 5.1k | 9 |
| 2024-07-03 | ★ InternLM2.5 | PJLab | model | — | 41.6k | — |
| 2024-07-02 | InternVL 2.0 | PJLab | model | — | — | — |
| 2024-07-01 | QPlanner | LG | paper | — | — | — |
| 2024-07-01 | Mathstral 7B | Mistral | model | — | — | — |
| 2024-07-01 | MMLongBench-Doc | PJLab | dataset | — | — | 1 |
| 2024-06-28 | ERNIE 4.0 Turbo | Baidu | model | — | — | — |
| 2024-06-24 | Mooncake 2 | Moonshot AI | paper dataset | — | — | 12 |
| 2024-06-17 | AquilaMed-RL | BAAI | model | — | 28 | 1 |
| 2024-06-17 | DeepSeek-Coder-V2 2 | DeepSeek | model paper | — | 7.4k | 48 |
| 2024-06-17 | ★ Nemotron-4 340B | NVIDIA | model | — | — | — |
| 2024-06-14 | MASt3R | Naver | paper | — | — | — |
| 2024-06-12 | SciRIFF | Ai2 | dataset | — | — | — |
| 2024-06-11 | Dasheng 3 | Xiaomi | paper model | — | — | — |
| 2024-06-10 | LlamaGen | ByteDance | model | — | — | 5 |
| 2024-06-06 | ★ Qwen2 2 | Alibaba | model paper | 27k | — | 43 |
| 2024-06-06 | ★ Kling 2 | Kuaishou | model paper | — | — | — |
| 2024-06-05 | GLM-4V | Z.ai | model | — | — | — |
| 2024-06-04 | Seed-TTS 2 | ByteDance | model paper | — | — | 4 |
| 2024-06-03 | ★ Skywork-MoE | Skywork | model | — | — | — |
| 2024-06-01 | agentUniverse | Ant Group | library | 2.2k | — | 1 |
| 2024-06-01 | FlagScale | BAAI | library | 495 | — | — |
| 2024-06-01 | Yuan Embedding | Inspur | model | — | — | — |
| 2024-05-30 | MotionLLM 2 | IDEA Lab | model paper | — | — | 5 |
| 2024-05-29 | ★ Codestral | Mistral | model | — | — | — |
| 2024-05-28 | Yuan 2.0-M32 | Inspur | model | — | — | — |
| 2024-05-27 | RLAIF-V | OpenBMB | paper | — | 74 | 3 |
| 2024-05-23 | DeepSeek-Prover | DeepSeek | model | — | 906 | 5 |
| 2024-05-22 | ★ Baichuan 4 | Baichuan | model | — | — | — |
| 2024-05-16 | ★ Grounding DINO 1.5 2 | IDEA Lab | model paper | — | — | 10 |
| 2024-05-15 | ByteFF | ByteDance | model | — | — | — |
| 2024-05-14 | Piccolo2 Embedding Model 2 | SenseTime | model paper | — | 25 | 1 |
| 2024-05-14 | Hunyuan-DiT 2 | Tencent | model paper | — | — | 1 |
| 2024-05-13 | Plot2Code | Tencent | dataset | — | — | — |
| 2024-05-07 | ★ DeepSeek-V2 2 | DeepSeek | model paper | — | 12.8k | 97 |
| 2024-04-25 | InternVL 1.5 | PJLab | model | — | — | — |
| 2024-04-25 | ShareGPT-4o | PJLab | dataset | — | — | 16 |
| 2024-04-24 | ★ SenseNova 5.0 | SenseTime | model | — | — | — |
| 2024-04-22 | SEED-X | Tencent | model | — | — | — |
| 2024-04-19 | Groma | ByteDance | model | — | — | 2 |
| 2024-04-17 | ★ ABAB 6 / 6.5 | MiniMax | model | — | — | — |
| 2024-04-17 | ★ Mixtral 8x22B | Mistral | model | — | — | — |
| 2024-04-15 | HQ-Edit | ByteDance | dataset | — | — | 1 |
| 2024-04-12 | ★ MiniCPM-V 4 | OpenBMB | model paper | — | 190.1k | 21 |
| 2024-04-11 | MiniCPM-V 2.0 | OpenBMB | model | — | — | — |
| 2024-04-03 | VAR (Visual Autoregressive Modeling) | ByteDance | model | — | — | 7 |
| 2024-04-02 | ★ HyperCLOVA X | Naver | model | — | — | — |
| 2024-03-30 | ST-LLM | Tencent | paper | — | — | — |
| 2024-03-28 | Dataverse | Upstage | library | — | — | — |
| 2024-03-28 | sDPO | Upstage | paper | — | — | — |
| 2024-03-23 | ★ Step-1 | StepFun | model | — | — | — |
| 2024-03-23 | Step-1V / 1.5V / 2V | StepFun | model | — | — | — |
| 2024-03-23 | Understanding Emergent Abilities from the Loss Perspective | Z.ai | paper | — | — | — |
| 2024-03-21 | Meituan-INFORMS-TSL | Meituan | dataset | — | — | — |
| 2024-03-19 | MergeKit | Arcee | library | — | — | — |
| 2024-03-19 | TAPTR 4 | IDEA Lab | library paper | — | — | — |
| 2024-03-19 | CoFARS | Meituan | paper | — | — | — |
| 2024-03-12 | ★ Command R / R+ | Cohere | model | — | — | — |
| 2024-03-11 | Unraveling the Mystery of Scaling Laws: Part I | Meituan | paper | — | — | — |
| 2024-03-08 | DeepSeek-VL | DeepSeek | model | — | 12.7k | 43 |
| 2024-03-08 | CogView3 | Z.ai | model | — | — | — |
| 2024-03-01 | Kimi 2M | Moonshot AI | model | — | — | — |
| 2024-02-28 | WanJuan 2.0 (WanJuan-CC) | PJLab | dataset | — | — | — |
| 2024-02-23 | MegaScale | ByteDance | library | — | — | 24 |
| 2024-02-21 | SDXL-Lightning | ByteDance | model | — | 53.4k | 6 |
| 2024-02-15 | SAMformer 2 | Huawei | model paper | — | — | 1 |
| 2024-02-06 | ★ SenseNova 4.0 | SenseTime | model | — | — | — |
| 2024-02-05 | ★ BGE-M3 2 | BAAI | model paper | 11.4k | 16.4M | 45 |
| 2024-02-05 | DeepSeek-Math 2 | DeepSeek | model paper | — | — | 66 |
| 2024-02-05 | Direct-a-Video: User-Directed Camera Movement and Object Motion | Kuaishou | paper | — | — | — |
| 2024-02-04 | ★ Qwen1.5 | Alibaba | model | — | — | — |
| 2024-02-01 | ★ OLMo | Ai2 | model | — | — | — |
| 2024-02-01 | ★ Aya 101 | Cohere | model | — | — | — |
| 2024-02-01 | ★ MiniCPM 3 | OpenBMB | model paper | — | 3.8k | 19 |
| 2024-01-31 | ★ Dolma | Ai2 | dataset | — | — | — |
| 2024-01-30 | YOLO-World | Tencent | paper | — | — | 26 |
| 2024-01-29 | ★ Baichuan 3 | Baichuan | model | — | — | — |
| 2024-01-23 | ★ InternLM2 2 | PJLab | model paper | — | 22.6k | 27 |
| 2024-01-20 | TFLOP | Upstage | paper | — | — | — |
| 2024-01-19 | Depth Anything | ByteDance | model | — | — | 21 |
| 2024-01-17 | ★ GLM-4 | Z.ai | model | — | — | — |
| 2024-01-15 | SciGLM / SciInstruct | Z.ai | paper | — | — | — |
| 2024-01-11 | ★ DeepSeek-MoE 2 | DeepSeek | model paper | — | 23.3k | 16 |
| 2024-01-09 | Lightning Linear Attention | Ant Group | paper | — | — | 2 |
| 2024-01-09 | Baichuan-NPC | Baichuan | model | — | — | — |
| 2024-01-04 | LLaMA Pro | Tencent | model | — | — | — |
| 2024-01-01 | VSAG | Ant Group | library | 459 | — | — |
| 2024-01-01 | FlagAI | BAAI | library | 3.9k | — | — |
| 2023-12-28 | PanGu-pi 3 | Huawei | model paper | — | — | 2 |
| 2023-12-23 | ★ SOLAR 10.7B | Upstage | model | — | — | — |
| 2023-12-21 | DUSt3R | Naver | paper | — | — | — |
| 2023-12-21 | ★ InternVL: Scaling up Vision Foundation Models | PJLab | model | — | — | 16 |
| 2023-12-20 | ★ Emu2 | BAAI | model | 1.8k | 21 | 7 |
| 2023-12-16 | Paloma | Ai2 | dataset | — | — | — |
| 2023-12-16 | Shot2Story20K | ByteDance | dataset | — | — | 3 |
| 2023-12-11 | ★ Mixtral 8x7B | Mistral | model | — | — | — |
| 2023-12-11 | SmartEdit | Tencent | paper | — | — | — |
| 2023-12-05 | Lenna 2 | Meituan | model paper | — | — | 1 |
| 2023-12-05 | ReasonDet | Meituan | dataset | — | — | 1 |
| 2023-11-29 | ★ DeepSeek-LLM 2 | DeepSeek | model paper | — | 2k | 82 |
| 2023-11-28 | ★ Falcon (7B / 40B / 180B) | TII | model | — | — | — |
| 2023-11-27 | MagicAnimate & Make Pixels Dance | ByteDance | model | — | — | 7 |
| 2023-11-27 | Yuan 2.0 | Inspur | model | — | — | — |
| 2023-11-27 | UniRepLKNet | Tencent | model | — | — | 34 |
| 2023-11-22 | T-Rex 3 | IDEA Lab | model paper | — | — | 8 |
| 2023-11-14 | Qwen2-Audio 2 | Alibaba | model paper | 2.1k | 2.3k | 25 |
| 2023-11-06 | CogVLM | Z.ai | model | — | — | 81 |
| 2023-11-02 | DeepSeek-Coder 2 | DeepSeek | model paper | — | 14.2k | 104 |
| 2023-10-30 | ★ Skywork-13B | Skywork | model | — | — | — |
| 2023-10-25 | DiQAD | Baidu | dataset | — | — | — |
| 2023-10-19 | KwaiYiiMath 2 | Kuaishou | model paper | — | — | — |
| 2023-10-17 | ★ ERNIE 4.0 | Baidu | model | — | — | — |
| 2023-10-13 | VideoCrafter | Tencent | library | — | — | — |
| 2023-10-12 | ★ Aquila2 | BAAI | model | 444 | 56 | — |
| 2023-10-09 | ★ Kimi-v1 | Moonshot AI | model | — | — | — |
| 2023-10-04 | SEED / SEED-LLaMA | Tencent | model | — | — | 10 |
| 2023-09-27 | ★ Mistral 7B | Mistral | model | — | — | — |
| 2023-09-26 | InternLM-XComposer | PJLab | model | — | 7.5k | 31 |
| 2023-09-25 | Qwen-Agent | Alibaba | library | 15.7k | — | — |
| 2023-09-25 | qwen.cpp | Alibaba | library | 621 | — | — |
| 2023-09-21 | ★ PengCheng-Mind | PCL | model | — | — | — |
| 2023-09-08 | Ant Financial LLM | Ant Group | model | — | — | — |
| 2023-09-08 | CodeFuse | Ant Group | model | — | — | — |
| 2023-09-08 | Fin-Eval | Ant Group | dataset | — | — | — |
| 2023-09-07 | ★ Hunyuan-LLM | Tencent | model | — | — | — |
| 2023-09-06 | ★ Baichuan 2 3 | Baichuan | paper model | — | 230.5k | 125 |
| 2023-09-01 | OpenSPG & OpenAGL | Ant Group | library | 2k | — | — |
| 2023-09-01 | XTuner | PJLab | library | — | — | — |
| 2023-08-31 | ★ ERNIE 3.5 | Baidu | model | — | — | — |
| 2023-08-30 | ★ JAIS | MBZUAI | model | — | — | — |
| 2023-08-29 | LongBench | Z.ai | dataset | — | — | 8 |
| 2023-08-24 | Qwen-VL | Alibaba | model | — | — | — |
| 2023-08-22 | Lagent & AgentLego | PJLab | library | — | — | — |
| 2023-08-21 | WanJuan 1.0 Corpus | PJLab | dataset | — | — | 8 |
| 2023-08-20 | ViT-Lens | Tencent | paper | — | — | — |
| 2023-08-18 | KwaiYii | Kuaishou | model | — | — | — |
| 2023-08-15 | ★ Aquila 2 | BAAI | model paper | 444 | 162 | — |
| 2023-08-11 | MiLM-6B | Xiaomi | model | — | — | — |
| 2023-08-08 | Baichuan-53B | Baichuan | model | — | — | — |
| 2023-08-03 | ★ Qwen 3 | Alibaba | model paper | 20.8k | 113.2k | 80 |
| 2023-08-02 | ★ BGE Text Embeddings | BAAI | model | 11.4k | 6M | — |
| 2023-08-01 | FlagEmbedding & C-MTEB 2 | BAAI | library dataset | 11.4k | — | 70 |
| 2023-08-01 | ★ ABAB 5 / 5.5 | MiniMax | model | — | — | — |
| 2023-07-31 | ToolLLM: Facilitating LLMs to Master 16000+ APIs | OpenBMB | paper | — | — | 63 |
| 2023-07-30 | SEED-Bench | Tencent | dataset | — | — | — |
| 2023-07-19 | ★ EXAONE 2.0 | LG | model | — | — | — |
| 2023-07-16 | ChatDev 2 | OpenBMB | paper library | — | — | 69 |
| 2023-07-13 | InternVid | PJLab | dataset | — | — | 32 |
| 2023-07-11 | ★ Emu | BAAI | model | 1.8k | — | 29 |
| 2023-07-11 | Baichuan-13B | Baichuan | model | — | 9.6k | — |
| 2023-07-07 | ★ SenseNova 2.0 Upgrade | SenseTime | model | — | — | — |
| 2023-07-06 | ★ InternLM-1.0 | PJLab | model | — | 679 | — |
| 2023-07-05 | PanGu-Weather 2 | Huawei, PCL | model paper | — | — | 122 |
| 2023-07-01 | InternEvo | PJLab | library | — | — | — |
| 2023-07-01 | OpenCompass | PJLab | library | — | — | — |
| 2023-06-25 | ChatGLM2 / ChatGLM3 | Z.ai | model | — | — | — |
| 2023-06-23 | MME (Multimodal Evaluation) | BAAI | dataset | 17.5k | — | — |
| 2023-06-20 | UniAD | SenseTime, PJLab | paper | — | — | 1 |
| 2023-06-15 | ★ Baichuan-7B | Baichuan | model | — | 56.8k | — |
| 2023-06-14 | WebGLM | Z.ai | paper | — | — | — |
| 2023-06-13 | KuaiSAR | Kuaishou | dataset | — | — | — |
| 2023-06-12 | detrex 2 | IDEA Lab | library paper | — | — | 15 |
| 2023-06-01 | DB-GPT | Ant Group | library | 18.3k | — | — |
| 2023-06-01 | DLRover | Ant Group | library | 1.6k | — | — |
| 2023-06-01 | LMDeploy | PJLab | library | — | — | — |
| 2023-06-01 | ★ RefinedWeb | TII | dataset | — | — | — |
| 2023-06-01 | zhipuai-sdk | Z.ai | library | — | — | — |
| 2023-05-30 | GPT4Tools | Tencent | paper | — | — | — |
| 2023-05-29 | Mix-of-Show | Tencent | paper | — | — | — |
| 2023-05-27 | CPM-Bee | OpenBMB | model | — | — | — |
| 2023-05-21 | DreamWaltz 2 | IDEA Lab | library paper | — | — | 16 |
| 2023-05-20 | PengCheng-Nebula | PCL | announcement | — | — | — |
| 2023-05-17 | ★ Ziya LLM 4 | IDEA Lab | model | — | 1.2k | — |
| 2023-05-15 | C-Eval 2 | BAAI | dataset paper | — | — | 90 |
| 2023-04-20 | UltraChat & UltraFeedback | OpenBMB | dataset | — | — | — |
| 2023-04-11 | SenseChat / SenseNova Launch | SenseTime | model | — | — | — |
| 2023-04-11 | SenseMirage | SenseTime | model | — | — | — |
| 2023-04-10 | Stable-DINO 2 | IDEA Lab | library paper | — | — | 5 |
| 2023-04-06 | Grounded SAM 3 | IDEA Lab | library paper | — | — | 88 |
| 2023-04-01 | BMTools | OpenBMB | library | — | — | — |
| 2023-03-27 | EVA-CLIP | BAAI | model | 2.7k | — | 78 |
| 2023-03-27 | Qianfan Platform | Baidu | announcement | — | — | — |
| 2023-03-20 | ★ PanGu-Sigma 2 | Huawei, PCL | model paper | — | — | 7 |
| 2023-03-14 | OpenSeeD 2 | IDEA Lab | library paper | — | — | 2 |
| 2023-03-14 | ★ ChatGLM-6B | Z.ai | model | — | 83.7k | 175 |
| 2023-03-09 | ★ Grounding DINO 2 | IDEA Lab | model paper | — | 1.4M | 240 |
| 2023-01-01 | FlagEvaluation | BAAI | library | 12 | — | — |
| 2022-12-22 | Tune-A-Video | Tencent | paper | — | — | — |
| 2022-12-06 | InternVideo / InternVideo2 | PJLab | model | — | — | 91 |
| 2022-12-05 | Painter | BAAI | model | — | — | 10 |
| 2022-11-12 | AltCLIP & AltDiffusion | BAAI | model | 3.9k | 149.6k | 9 |
| 2022-11-10 | InternImage 2 | PJLab | model paper | — | — | 38 |
| 2022-11-02 | Chinese CLIP | Alibaba | model | 5.8k | — | 51 |
| 2022-11-02 | Taiyi 3 | IDEA Lab | model paper | — | 510 | 2 |
| 2022-10-06 | ByteTransformer | ByteDance | library | — | — | 1 |
| 2022-09-30 | CodeGeeX 2 | Z.ai | model paper | — | — | 47 |
| 2022-09-16 | CPM-Ant | OpenBMB | model | — | — | — |
| 2022-09-03 | TuGraph | Ant Group | library | 1.7k | — | — |
| 2022-08-24 | ★ GLM-130B 2 | Z.ai | model paper | — | — | 294 |
| 2022-08-18 | KuaiRand | Kuaishou | dataset | — | — | — |
| 2022-07-22 | PanGu-Coder 3 | Huawei | model paper | — | 5 | 36 |
| 2022-07-04 | SecretFlow | Ant Group | library | 2.6k | — | — |
| 2022-06-24 | YOLOv6 2 | Meituan | library paper | — | — | 1.7k |
| 2022-06-06 | Mask DINO 2 | IDEA Lab | library paper | — | — | 19 |
| 2022-06-01 | Vision GNN (ViG) 2 | Huawei | model paper | — | — | 194 |
| 2022-04-26 | CogView2 | Z.ai, BAAI | model | — | — | — |
| 2022-03-20 | Delta Tuning 2 | OpenBMB | paper library | — | — | — |
| 2022-03-07 | DINO (DETR) 2 | IDEA Lab | library paper | — | — | 747 |
| 2022-03-02 | DN-DETR 2 | IDEA Lab | library paper | — | — | 54 |
| 2022-02-22 | KuaiRec | Kuaishou | dataset | — | — | 8 |
| 2022-02-11 | BMTrain | BAAI | library | 625 | — | — |
| 2022-02-07 | OFA: One For All | Alibaba | model | 2.6k | — | 258 |
| 2022-01-30 | FEDformer 2 | Huawei | model paper | — | — | 534 |
| 2022-01-28 | DAB-DETR 2 | IDEA Lab | library paper | — | — | 391 |
| 2022-01-25 | SPIRAL 2 | Huawei | model paper | — | — | 7 |
| 2022-01-24 | SenseCore AI Infrastructure | SenseTime | announcement | — | — | — |
| 2021-12-31 | ERNIE-ViLG | Baidu | model | — | — | 30 |
| 2021-12-01 | ★ EXAONE 1.0 | LG | model | — | — | — |
| 2021-12-01 | tFold | Tencent | model | — | — | — |
| 2021-11-30 | Donut | Naver | paper | — | — | — |
| 2021-11-22 | Fengshenbang 3 | IDEA Lab | library model | — | 339 | — |
| 2021-10-13 | ByteTrack | ByteDance | library | — | — | 105 |
| 2021-10-10 | Yuan 1.0 | Inspur | model | — | — | — |
| 2021-09-28 | DiffVC 2 | Huawei | model paper | — | — | 25 |
| 2021-09-10 | ★ HyperCLOVA | Naver | model | — | — | — |
| 2021-07-12 | SPLADE | Naver | paper | — | — | — |
| 2021-07-08 | OpenDILab / DI-engine 2 | SenseTime, PJLab | library | — | — | — |
| 2021-07-08 | OpenPPL (PPLNN) | SenseTime | library | — | — | — |
| 2021-07-05 | ★ ERNIE 3.0 & 3.0 Titan | Baidu | model | — | — | 193 |
| 2021-07-01 | Meituan Sky Project | Meituan | announcement | — | — | — |
| 2021-06-24 | CPM-2 | OpenBMB, BAAI | model | — | — | — |
| 2021-06-01 | OceanBase | Ant Group | library | 10k | — | — |
| 2021-06-01 | ★ Wu Dao 2.0 2 | BAAI | model paper | — | — | — |
| 2021-06-01 | Wu Dao Corpora | BAAI | dataset | — | — | — |
| 2021-05-26 | CogView | BAAI | model | — | — | — |
| 2021-05-13 | Grad-TTS 2 | Huawei | model paper | — | — | 43 |
| 2021-05-01 | Trustworthy AI White Paper | Xiaomi | paper | — | — | — |
| 2021-04-26 | ★ PanGu-alpha 2 | Huawei, PCL | model paper | — | — | 94 |
| 2021-03-24 | FastMoE | BAAI | library | 1.8k | — | 39 |
| 2021-03-20 | ★ Wu Dao 1.0 | BAAI | model | — | — | — |
| 2021-03-18 | ★ GLM (Original) 2 | Z.ai | model paper | — | 256 | 21 |
| 2021-03-18 | P-Tuning | Z.ai | paper | — | — | — |
| 2021-03-01 | M6 Series | Alibaba | model | — | — | 48 |
| 2021-02-27 | Transformer in Transformer (TNT) 2 | Huawei | model paper | — | — | 1k |
| 2021-01-01 | MS-MARCO-CN | Baidu | dataset | — | — | 18 |
| 2021-01-01 | PaddleNLP | Baidu | library | — | — | — |
| 2021-01-01 | PaddleSpeech | Baidu | library | — | — | — |
| 2021-01-01 | KoBART | SK Telecom | model | — | — | — |
| 2020-12-07 | HEBO 2 | Huawei | library paper | — | — | 19 |
| 2020-12-01 | CPM-1 | OpenBMB | model | — | — | — |
| 2020-10-08 | Deformable DETR 2 | SenseTime | model paper | — | 19.7k | 1.9k |
| 2020-09-12 | FuxiCTR / BARS 3 | Huawei | library paper | — | — | 12 |
| 2020-08-01 | Vega | Huawei | library | — | — | — |
| 2020-07-15 | PaddleOCR | Baidu | library | — | — | — |
| 2020-04-08 | DynaBERT 2 | Huawei | model paper | — | 4 | 119 |
| 2020-03-28 | MindSpore | Huawei | library | — | — | — |
| 2020-03-10 | Bolt | Huawei | library | — | — | — |
| 2020-01-01 | Kunlun XPU | Baidu | announcement | — | — | — |
| 2020-01-01 | PaddleDetection | Baidu | library | — | — | — |
| 2020-01-01 | PaddleSeg | Baidu | library | — | — | — |
| 2020-01-01 | KoGPT2 | SK Telecom | model | — | — | — |
| 2019-11-27 | GhostNet 4 | Huawei | model paper | — | — | 404 |
| 2019-09-23 | TinyBERT 2 | Huawei | model paper | — | 91.8k | 136 |
| 2019-09-17 | ★ Megatron-LM | NVIDIA | library | — | — | — |
| 2019-09-01 | NEZHA 2 | Huawei | model paper | — | — | 86 |
| 2019-08-23 | Ascend 910 Series | Huawei | announcement | — | — | — |
| 2019-07-29 | ★ ERNIE 2.0 | Baidu | model | — | — | 74 |
| 2019-07-19 | SUMBT | SK Telecom | paper | — | — | — |
| 2019-06-20 | Alchemy | Tencent | dataset | — | — | 64 |
| 2019-06-01 | KoBERT | SK Telecom | model | — | — | — |
| 2019-04-19 | ★ ERNIE 1.0 2 | Baidu | model paper | — | — | 767 |
| 2019-04-03 | CRAFT | Naver | paper | — | — | — |
| 2018-11-01 | Logan | Meituan | library | — | — | — |
| 2018-10-24 | Tencent AI Lab Embedding Corpus | Tencent | dataset | — | — | — |
| 2018-10-15 | ML-Images | Tencent | dataset | — | — | — |
| 2018-10-01 | OpenMMLab / MMDetection 2 | SenseTime, PJLab | library paper | — | — | 794 |
| 2018-08-23 | WMRouter | Meituan | library | — | — | — |
| 2018-06-01 | MACE (Mobile AI Compute Engine) | Xiaomi | library | — | — | — |
| 2018-03-16 | ApolloScape | Baidu | dataset | — | — | — |
| 2018-03-01 | Kata Containers | Ant Group | library | 7.6k | — | — |
| 2018-01-01 | Super Brain & Intelligent Dispatch | Meituan | announcement | — | — | — |
| 2017-11-24 | StarGAN | Naver | paper | — | — | — |
| 2017-11-14 | DuReader | Baidu | dataset | — | — | 51 |
| 2017-07-26 | Xiao AI | Xiaomi | announcement | — | — | — |
| 2017-07-05 | Apollo | Baidu | library | — | — | — |
| 2017-06-01 | Angel ML | Tencent | library | — | — | — |
| 2017-03-15 | DiscoGAN | SK Telecom | paper | — | — | — |
| 2016-09-30 | PaddlePaddle | Baidu | library | — | — | — |
| 2014-12-17 | Deep Speech 1 & 2 | Baidu | paper | — | — | — |
| 2006-01-01 | AMiner | Z.ai | library | — | — | — |