Date Name Lab Type Stars Downloads Citations
2026-05-14 TWN: Think When Needed Alibaba paper
2026-05-14 Realtime Voice API GA + gpt-realtime-2 Family (3 New Audio Models) OpenAI OpenAI news
2026-05-14 HCLTech to Anchor $300M Sarvam Round at $1.5B; Bessemer +$50M; NVIDIA, Prosperity7 Participating Outlook Business Sarvam news
2026-05-14 SKT × Korean Defense Ministry Sign MOU on Applying Sovereign AI Foundation Model to Defense SK Telecom SK Telecom news
2026-05-14 SpaceXAI Division Bleeding Researchers Since Merger; 11+ to Meta, 7+ to Thinking Machines TechCrunch xAI news
2026-05-13 Granite Embedding Multilingual R2 IBM paper
2026-05-12 Kuaishou Plans to Spin Off Kling AI Video Unit at \$20B Valuation; Tencent in Talks for \$2B Pre-IPO Round The Information Kuaishou news
2026-05-11 MiniCPM-V 4.6 OpenBMB model
2026-05-11 DeepSeek First External Funding Round Reportedly Near Close at $45–50B Valuation, Led by China's 'Big Fund III' SCMP DeepSeek news
2026-05-09 ERNIE 5.1 Baidu model
2026-05-09 Step-Audio-R1.1 (Realtime) Tops Big Bench Audio at 96.4%, Surpassing Grok Voice Agent Artificial Analysis StepFun news
2026-05-07 Cola DLM ByteDance paper
2026-05-07 AI Co-Mathematician Google paper
2026-05-07 OMAI Compute Cluster Goes Live — $152M NSF + Blackwell-Ultra Infrastructure for Open Science AI Ai2 Ai2 news
2026-05-07 Kakao Announces Kanana 2.5 — 150B Agent-Focused LLM at Q1 Earnings Call Korea Herald Kakao news
2026-05-07 Kimi Chatbot Maker Moonshot AI Valued at $20 Billion in Meituan-Led Round Bloomberg Meituan news
2026-05-07 Kimi Chatbot Maker Moonshot AI Valued at $20 Billion in Meituan-Led Round Bloomberg Moonshot AI news
2026-05-06 ZAYA1-8B Zyphra model
2026-05-06 DeepSeek in Talks for First-Ever Outside Round at $45B; Tencent + Big Fund III in Lead Group TechCrunch DeepSeek news
2026-05-05 TRIBE v2 (Brain Activity Foundation Model) Meta paper
2026-05-05 iOS 27 to Let Users Swap in Claude, Gemini, and Others as Default Apple Intelligence Model Bloomberg Apple news
2026-05-05 GPT-5.5 Instant Becomes Default ChatGPT Model; 52.5% Fewer Hallucinated Claims vs 5.3 Instant OpenAI OpenAI news
2026-05-04 Horizon Length in LLM Agent Training Microsoft paper
2026-05-03 Korea's National Growth Fund and SIF Approve KRW 560B (~$400M) Direct Equity in Upstage — First Software Co. Recipient Seoul Economic Daily Upstage news
2026-05-01 Huawei's AI Chip Gains Ground as DeepSeek and Others Shift Away from Nvidia Financial Times Huawei news
2026-04-30 OlmPool: Cracks in the Foundation Ai2 paper
2026-04-30 SenseTime Is Running Its New Model on Chinese Chips WIRED SenseTime news
2026-04-29 Granite 4.1 IBM model
2026-04-29 Mistral Medium 3.5 Mistral model
2026-04-29 Granite 4.1 Released: 3B/8B/30B Dense Models, 512K Context, 8B Matches Prior 32B MoE IBM Research IBM news
2026-04-28 MiMo-V2.5-Pro Xiaomi model
2026-04-24 DeepSeek-V4 3 DeepSeek model paper
2026-04-24 Cohere Completes Merger with Germany's Aleph Alpha, Creating Transatlantic AI Champion Financial Times Cohere news
2026-04-24 DeepSeek-V4 Released: 1.6T/49B MoE, First Frontier Model Trained Entirely on Huawei Ascend 950PR, MIT License DeepSeek DeepSeek news
2026-04-23 Sapiens2 Meta model
2026-04-23 GPT-5.5 OpenAI model
2026-04-23 Hy3 Preview Tencent model
2026-04-22 Qwen3.6 Open-Weight Models 2 Alibaba model
2026-04-22 LLaDA 2.0-Uni Ant Group model
2026-04-22 Tencent and Alibaba in Talks to Invest in DeepSeek at $20B+ Valuation — First External Funding Bloomberg DeepSeek news
2026-04-22 Cloud Next '26: TPU 8t/8i Announced, Deep Research Max Agents, Chrome Auto Browse, Thinking Machines Lab Multi-Billion Deal 9to5Google Google news
2026-04-21 SpaceX Strikes Deal for Right to Acquire Cursor for $60B Bloomberg xAI news
2026-04-20 BAR: Branch-Adapt-Route Ai2 paper
2026-04-20 Kimi K2.6 Moonshot AI model
2026-04-20 Qwen3.6-Max-Preview: Alibaba's Most Powerful Model, #1 on Six Coding Benchmarks (AA Intelligence Index 52) Qwen Alibaba news
2026-04-20 Amazon Invests $5B More (Total $13B); Anthropic Commits $100B+ AWS Spend Over 10 Years, Secures Up to 5GW Compute Anthropic Anthropic news
2026-04-17 Grok 4.3 xAI model
2026-04-16 Claude Opus 4.7 Anthropic model
2026-04-16 LeapAlign: Post-Training Flow Matching Models at Any Generation Step ByteDance paper
2026-04-16 Prefill-as-a-Service: Cross-Datacenter KVCache for Next-Generation Models Moonshot AI paper
2026-04-16 Claude Opus 4.7 Released Anthropic Anthropic news
2026-04-16 ByteDance Recruits DeepSeek R1 Lead Author Daya Guo for Seed Agent Team SCMP ByteDance news
2026-04-16 DeepSeek R1 Lead Author Daya Guo Joins ByteDance Seed Amid Intensifying AI Talent War SCMP DeepSeek news
2026-04-16 DeepSeek V4 Imminent — 1T-Parameter MoE to Run Solely on Huawei Ascend 950PR Chips Dataconomy DeepSeek news
2026-04-16 How France's Mistral Built a $14 Billion AI Empire by Not Being American Forbes Mistral news
2026-04-16 GPT-Rosalind Launched for Life Sciences Drug Discovery OpenAI OpenAI news
2026-04-15 Revenue Run Rate Hits $30B; VCs Offer Up to $800B Valuation Axios Anthropic news
2026-04-15 Upstage Becomes Korea's First Generative AI Unicorn with $126M Series C Seoul Economic Daily Upstage news
2026-04-14 Lightning OPD: Efficient Post-Training for Large Reasoning Models NVIDIA paper
2026-04-14 Gemini Robotics-ER 1.6 Launched; Boston Dynamics Partnership for Industrial AI Boston Dynamics Google news
2026-04-14 NAACP Sues xAI Over Memphis Colossus Data Center Pollution CNBC xAI news
2026-04-13 OpenAI Touts Amazon Alliance, Says Microsoft Has 'Limited Our Ability' to Reach Enterprise CNBC OpenAI news
2026-04-13 StepFun Unwinding Offshore Structure to Pave Way for HK IPO at Up to $10B Reuters StepFun news
2026-04-12 SoftBank/NEC/Honda/Sony Form JV for Trillion-Parameter Physical AI Model; $6.3B Government Backing Nikkei Asia SB Intuitions news
2026-04-10 Nexus: Common Minima for Better Generalization ByteDance paper
2026-04-10 Alibaba Token Hub Created: 5 AI Units Consolidated Under CEO Eddie Wu; RMB 380B ($53B) 3-Year Commitment SCMP Alibaba news
2026-04-10 Cohere in Advanced Merger Talks with Germany's Aleph Alpha Reuters Cohere news
2026-04-10 SK Telecom Partners with Rebellions and Arm for Sovereign AI Inference Infrastructure Rebellions SK Telecom news
2026-04-10 xAI Spending Pushed SpaceX to Nearly $5B Loss; CFO Anthony Armstrong Departs The Information xAI news
2026-04-09 Metis: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models Alibaba paper
2026-04-09 HiFloat4 Format for LLM Pre-training on Ascend NPUs Huawei paper
2026-04-09 EXAONE 4.5 LG model
2026-04-09 Efficient RL Training for LLMs with Experience Replay Meta paper
2026-04-09 EXAONE 4.5 Released — LG's First Open-Weight Vision-Language Model Korea Herald LG news
2026-04-09 Naver Shuts Down Clova X Chatbot; Pivots to Vertical AI Integrated into Search, Shopping, Finance Seoul Economic Daily Naver news
2026-04-08 Muse Spark Meta model
2026-04-08 Muse Spark Unveiled — First Model from Superintelligence Labs (Proprietary) Bloomberg Meta news
2026-04-08 Zhipu Hikes Prices Again as China AI Monetization Wave Quickens Bloomberg Z.ai news
2026-04-07 Harrier Microsoft model
2026-04-07 GLM-5.1 Z.ai model
2026-04-07 Claude Mythos Withheld from Public Release; Project Glasswing Cybersecurity Consortium Launched with Apple and Google Fortune Anthropic news
2026-04-07 Ascend 950PR AI Chip in Production; 750K Units Planned for 2026; Alibaba, ByteDance, Tencent Place Massive Orders TrendForce Huawei news
2026-04-07 GLM-5.1 Open-Source Release Scores #3 on Code Arena (1530 Elo); Stock Surges 19% BuildFastWithAI Z.ai news
2026-04-06 AI Agent Traps Google paper
2026-04-06 MedGemma 1.5 Google model
2026-04-06 Multi-GW Compute Partnership Expansion with Google Cloud and Broadcom TechCrunch Anthropic news
2026-04-06 NVIDIA Acquires SchedMD (Slurm Workload Manager); Draws Regulatory Scrutiny Reuters NVIDIA news
2026-04-03 Microsoft Announces $10B Japan AI Infrastructure Investment (2026-2029) WSJ Microsoft news
2026-04-03 TII Launches Falcon Perception — 600M-Parameter Open Multimodal Model for Grounding and Segmentation TII TII news
2026-04-03 Xiaomi Reveals MiMo-V2-Pro (1T Parameters), Approaching GPT-5.2 / Opus 4.6 Performance VentureBeat Xiaomi news
2026-04-02 Qwen 3.6-Plus Alibaba model
2026-04-02 Trinity Large Thinking Arcee model
2026-04-02 Gemma 4 Google model
2026-04-02 MAI Foundation Models Microsoft model
2026-04-02 SWE-HERO NVIDIA paper
2026-04-02 Alibaba Unveils Third Closed-Source AI Model in Focus on Profit Bloomberg Alibaba news
2026-04-02 Arcee's New Open-Source Trinity Large Thinking Is the Rare Powerful U.S.-Made Model VentureBeat Arcee news
2026-04-02 Gemma 4 Open Models Released Google Developers Blog Google news
2026-04-02 Sarvam AI Nearing $300-350M Raise at $1.5B Valuation Led by Bessemer with Nvidia and Amazon Bloomberg Sarvam news
2026-04-01 Simple Self-Distillation for Code Generation Apple paper
2026-04-01 Scaling Reasoning Tokens via RL and Parallel Thinking ByteDance paper
2026-04-01 Procedural Knowledge at Scale Improves Reasoning Meta paper
2026-04-01 Speech LLMs as Contextual Reasoning Transcribers Microsoft paper
2026-04-01 GLM-5V-Turbo Z.ai model
2026-04-01 Moonshot AI Raising $1B at $18B Valuation; Working with CICC and Goldman Sachs on HK IPO Bloomberg Moonshot AI news
2026-03-31 Think-Anywhere Alibaba paper
2026-03-31 ASI-Evolve SII paper
2026-03-31 OpenAI Closes $122B Round at $852B Valuation OpenAI OpenAI news
2026-03-31 Zhipu's Losses Climb 60% After Chinese AI Rivalry Worsens Bloomberg Z.ai news
2026-03-31 Zhipu's Losses Climb 60% After Chinese AI Rivalry Worsens Bloomberg Z.ai news
2026-03-30 Mistral AI Raises $830M in Debt to Set Up a Data Center Near Paris TechCrunch Mistral news
2026-03-28 daVinci-LLM SII model
2026-03-28 DeepSeek Before V4: Culture, Organization, and Liang Wenfeng's Unique Goals (English summary) LatePost (晚点) DeepSeek news
2026-03-26 Cohere Transcribe Cohere model
2026-03-26 Intern-S1-Pro PJLab model
2026-03-26 China's Moonshot AI Seeks Listing in Hong Kong Under Heightened Scrutiny WSJ Moonshot AI news
2026-03-25 LongCat-Next Meituan model
2026-03-25 Alibaba Launches AI Model Task Force; Top Researcher Resigns The Information Alibaba news
2026-03-25 MiniMax-M2.7, GLM-5 at 1/3 Cost Latent Space MiniMax news
2026-03-24 DeepSeek's Latest Job Postings Highlight Pivot to Agentic AI Bloomberg DeepSeek news
2026-03-23 SkillRouter Alibaba paper
2026-03-23 Felis ByteDance paper
2026-03-20 LongCat-Flash-Prover Meituan model 13
2026-03-18 Path-Constrained Mixture-of-Experts Apple paper
2026-03-18 Qianfan-OCR Baidu paper 5.5k
2026-03-18 MiniMax-M2.7 MiniMax model
2026-03-18 MiMo-V2-Omni Xiaomi model
2026-03-18 MiMo-V2-Pro Xiaomi model
2026-03-18 MiMo-V2-TTS Xiaomi model
2026-03-18 Chinese AI Developer Zhipu to Create New Unit for Product Development The Information Z.ai news
2026-03-17 Pre-training LLM without Learning Rate Decay Enhances Supervised Fine-Tuning SB Intuitions paper
2026-03-16 Mixture-of-Depths Attention (MoDA) ByteDance paper
2026-03-16 Mistral Small 4 Mistral model
2026-03-16 Attention Residuals 2 Moonshot AI paper library
2026-03-15 Scientific Judge 2 Baidu paper dataset
2026-03-13 OpenSWE / daVinci-Env SII dataset
2026-03-12 RoboBrain-Dex BAAI model
2026-03-11 Nemotron 3 Super NVIDIA model
2026-03-10 Exclusive Self Attention Apple paper
2026-03-10 Ai2 CEO Ali Farhadi Steps Down; Microsoft Hires Key Researchers GeekWire Ai2 news
2026-03-09 Anthropic Sues Trump Admin Over Pentagon AI Blacklist CNBC Anthropic news
2026-03-09 OpenAI Acquires Promptfoo for AI Agent Security OpenAI OpenAI news
2026-03-08 Scalable Training of MoE Models with Megatron Core NVIDIA paper
2026-03-06 Sarvam-105B Sarvam model
2026-03-06 Sarvam-30B Sarvam model
2026-03-05 GPT-5.4 OpenAI model
2026-03-05 GPT-5.4 Released with 1M Token Context OpenAI OpenAI news
2026-03-04 RIVER PJLab dataset
2026-03-01 OLMo Hybrid Ai2 model
2026-03-01 LLM-jp-4 NII model
2026-02-28 AnyTouch2 / ToucHD 2 BAAI dataset paper
2026-02-25 MaxClaw MiniMax library
2026-02-25 ZUNA (EEG Foundation Model) Zyphra model
2026-02-25 Tencent-Backed AI Startup StepFun Is Said to Plan Hong Kong IPO Bloomberg StepFun news
2026-02-19 Gemini 3.1 Pro Google model
2026-02-19 Gemini 3.1 Pro Released, Ties #1 on AA Intelligence Index Google DeepMind Google news
2026-02-17 OLMix Ai2 paper
2026-02-17 Tiny Aya Cohere model
2026-02-17 Grok-4.20 xAI model
2026-02-17 Mercury 2 Released: Diffusion LLM with AA Index 33 at 1000 tok/s Inception Labs Inception Labs news
2026-02-16 Qwen3.5 5 Alibaba model 2.2k
2026-02-16 Ling 2.5 Ant Group model 339
2026-02-16 ZoomBench Ant Group dataset 106
2026-02-15 Optimal Batch Size Scheduling via Functional Scaling Laws Meituan paper
2026-02-14 Doubao-Seed-2.0 ByteDance model
2026-02-14 Doubao-Seed-2.0 Family Launched (Pro / Lite / Mini / Code) TechNode ByteDance news
2026-02-13 Cohere's $240M Year Sets Stage for IPO TechCrunch Cohere news
2026-02-12 MiniMax-M2.5 MiniMax model 490.6k
2026-02-12 GEBench StepFun dataset
2026-02-12 Xiaomi-Robotics-0 2 Xiaomi model paper
2026-02-12 Anthropic Raises $30B Series G at $380B Valuation Anthropic Anthropic news
2026-02-11 Ming-Flash-Omni-2.0 Ant Group model
2026-02-11 MiniCPM-SALA OpenBMB model 1.9k
2026-02-11 Step-3.5-Flash 3 StepFun model paper dataset 85.1k
2026-02-11 GLM-5 3 Z.ai model paper 125k
2026-02-11 Slime: Asynchronous RL for Agentic Tasks Z.ai library
2026-02-09 Protenix ByteDance model
2026-02-08 Data Darwinism / Darwin Corpora SII dataset
2026-02-07 Seedance 2.0 ByteDance model
2026-02-06 Baichuan-M3 2 Baichuan paper model 1k
2026-02-05 Claude Opus 4.6 Anthropic model
2026-02-05 Kling 3.0 2 Kuaishou model paper
2026-02-05 Claude Opus 4.6 Released with 1M Context Anthropic Anthropic news
2026-02-04 RationaleRM Alibaba dataset
2026-02-03 MiniCPM-o 4.5 OpenBMB model 35.5k
2026-02-02 Kimi K2.5 2 Moonshot AI model paper
2026-02-02 daVinci-Agency SII model
2026-02-02 SpaceX Acquires xAI at $1.25T Combined Valuation Fortune xAI news
2026-01-30 Keel: Post-LayerNorm Is Back ByteDance paper
2026-01-29 SenseNova-MARS 3 SenseTime model paper dataset 423
2026-01-28 Trinity Large Arcee model
2026-01-28 Trinity Mini / Nano Arcee model
2026-01-28 ACE-Step-1.5 StepFun model
2026-01-27 K2 Think V2 MBZUAI model
2026-01-27 LongCat-Flash-Lite 2 Meituan model paper 1.1k
2026-01-27 Mistral AI Surges Revenue 20-Fold to Over $400 Million ARR MLQ Mistral news
2026-01-27 Tencent Bets Its AI Future on 28-Year-Old From OpenAI Caixin Tencent news
2026-01-26 DeepPlanning Alibaba dataset
2026-01-26 daVinci-Dev SII model
2026-01-26 Solar Pro 3 Upstage model
2026-01-23 LongCat-Flash-Thinking-2601 2 Meituan model paper 102
2026-01-22 ERNIE 5.0 2 Baidu model paper
2026-01-22 EvoCUA Meituan library 10.3k
2026-01-21 CorpusQA: A 10 Million Token Benchmark for Corpus-Level Analysis and Reasoning Alibaba eval
2026-01-20 Yuan 3.0 Ultra Inspur model
2026-01-20 Step-3-VL-10B 2 StepFun model paper 180.9k
2026-01-15 Tao Qin Elected 2025 ACM Fellow ACM ZGCA news
2026-01-12 Engram: Conditional Memory via Scalable Lookup DeepSeek paper
2026-01-12 Alphabet Hits $4T Market Cap CNBC Google news
2026-01-11 Solar Open 100B Upstage model
2026-01-09 PaCoRe: Learning to Scale Test-Time Compute StepFun paper 530
2026-01-09 Zhipu and MiniMax IPO ChinaTalk MiniMax news
2026-01-09 Zhipu and MiniMax IPO ChinaTalk Z.ai news
2026-01-06 xAI Raises $20B Series E at $230B Valuation CNBC xAI news
2026-01-05 Yuan 3.0 Flash Inspur model
2026-01-05 K-EXAONE LG model
2026-01-05 HyperCLOVA X SEED Omni Naver model
2026-01-05 Falcon-H1R TII model
2026-01-03 HyperCLOVA X SEED Think Naver model
2026-01-01 FlashInfer-python-paddle Baidu library
2026-01-01 Agentar-Z-100K Z.ai dataset
2025-12-31 FineWeb-Mask ByteDance dataset
2025-12-31 mHC: Manifold-Constrained Hyper-Connections DeepSeek paper
2025-12-31 OpenOneRec Kuaishou library 147
2025-12-30 SeedFold ByteDance paper
2025-12-30 LongCat ZigZag Attention Meituan paper 7
2025-12-27 A.X K1 SK Telecom model
2025-12-23 MiniMax-M2.1 MiniMax model 46.8k
2025-12-23 VIBE & OctoCodingBench MiniMax dataset
2025-12-23 Step-DeepResearch StepFun library
2025-12-23 Zhipu AI's Rise from Tsinghua Lab Pandaily Z.ai news
2025-12-22 SekoTalk / Seko 2.0 SenseTime model
2025-12-22 GLM-4.7 Z.ai model
2025-12-19 Kanana-2 Kakao model
2025-12-19 Kakao Open-Sources Kanana-2 Model Optimized for Agentic AI Korea Times Kakao news
2025-12-18 Seed1.8 ByteDance model
2025-12-18 EXAONE Path 2.5 LG paper
2025-12-18 Towards Scalable Pre-training of Visual Tokenizers MiniMax paper
2025-12-18 HY-Motion 1.0 Tencent paper
2025-12-18 Seed1.8 Released as a Generalized Agentic Model ByteDance Seed ByteDance news
2025-12-17 Peter DeSantis to Lead Unified AGI Org; Rohit Prasad Departing CNBC Amazon news
2025-12-17 Tencent restructures AI operations, promotes high-profile recruit to chief AI scientist SCMP Tencent news
2025-12-16 Molmo 2 Ai2 model
2025-12-16 MiMo-V2-Flash 2 Xiaomi model paper 211.4k
2025-12-16 MOPD (Multi-Teacher On-Policy Distillation) Xiaomi library
2025-12-15 Nemotron 3 Nano NVIDIA model
2025-12-15 NVIDIA in Advanced Talks to Acquire AI21 Labs for $2-3B SiliconANGLE AI21 Labs news
2025-12-10 LLaDA 2 2 Ant Group model paper
2025-12-09 JAIS 2 MBZUAI model
2025-12-08 LongCat-Image 3 Meituan model paper 80.3k
2025-12-06 K2-V2 (LLM360) MBZUAI model
2025-12-05 NEO (Native VLM Architecture) 2 SenseTime model paper 1
2025-12-05 Hunyuan 2.0 Tencent model
2025-12-02 Amazon Nova 2 Amazon model
2025-12-02 Mistral Large 3 Mistral model
2025-12-02 Nova 2 Model Family and Nova Act GA at re:Invent 2025 TechCrunch Amazon news
2025-12-02 Anthropic Acquires Bun, Claude Code Hits $1B ARR Anthropic Anthropic news
2025-12-01 Ministral 3 Mistral model
2025-12-01 John Giannandrea to Retire; Amar Subramanya Named VP of AI Apple Apple news
2025-11-30 gelab-zero (STEP-GUI) StepFun library 436
2025-11-28 LFM2 (Liquid Foundation Models 2) Liquid AI model
2025-11-27 DeepSeek-Math-V2 2 DeepSeek model dataset 4.9k
2025-11-24 Claude 4.5 Opus Anthropic model
2025-11-24 HunyuanOCR Tencent model 401.3k
2025-11-20 OLMo 3 Ai2 model
2025-11-20 HunyuanVideo-1.5 Tencent model
2025-11-20 MiMo-Embodied: X-Embodied Foundation Model Xiaomi paper 166
2025-11-19 LPLB (Linear-Programming Load Balancer) DeepSeek library
2025-11-19 Step-Audio-R1 StepFun model 33
2025-11-19 Yann LeCun Departs Meta to Found AMI Labs CNBC Meta news
2025-11-17 SenseNova-SI (Spatial Intelligence) 3 SenseTime model paper dataset
2025-11-15 Doubao Seed Code ByteDance model
2025-11-15 Doubao Seed Code (Reasoning Coder) Hits AA Intelligence Index 34 Artificial Analysis ByteDance news
2025-11-14 Miloco (Xiaomi Local Copilot) Xiaomi library
2025-11-13 M100 Chip Baidu announcement
2025-11-12 AlphaProof Google paper
2025-11-12 Interview: Ant Group's Open Model Ambitions Interconnects Ant Group news
2025-11-10 kosong Moonshot AI library
2025-11-06 InfinityStar ByteDance model
2025-11-06 Step-Audio-EditX StepFun model 16.9k
2025-11-05 SoftBank and SB Intuitions launch Sarashina API for enterprise access to Japanese LLM SoftBank SB Intuitions news
2025-11-03 LongCat-Flash-Omni 2 Meituan model paper 89
2025-11-01 LightX2V SenseTime library
2025-11-01 Inception Labs Raises $56M Seed from Menlo, Andrew Ng, Karpathy Inception Labs Inception Labs news
2025-10-31 GATE LG paper
2025-10-30 Emu3.5 BAAI model 1.5k 1.6k
2025-10-30 Kimi Linear 2 Moonshot AI model paper
2025-10-29 Ouro ByteDance model
2025-10-28 ODesign BAAI model
2025-10-28 URSA (Uniform Discrete Diffusion) BAAI model 4
2025-10-28 Parallel Loop Transformer ByteDance paper
2025-10-28 OpenAI Completes For-Profit PBC Restructuring OpenAI OpenAI news
2025-10-27 MiniMax-M2 MiniMax model 123k
2025-10-27 CoKE: Context as the Key to Biomolecular Understanding PJLab paper
2025-10-27 JanusCoder PJLab model 33
2025-10-27 Hunyuan Mirror Tencent paper 7.1k
2025-10-25 LongCat-Video 3 Meituan model paper 1.2k
2025-10-24 KAT-Coder 2 Kuaishou model paper
2025-10-23 Anthropic to Expand Google Cloud TPU Use to 1M+ TPUs Anthropic Anthropic news
2025-10-22 Seed3D 1.0 ByteDance model
2025-10-20 DeepSeek-OCR / OCR-2 DeepSeek model 3M
2025-10-17 LongCat-Audio-Codec Meituan paper
2025-10-16 MorphoBench ZGCA paper
2025-10-15 Granite 4.0 IBM model
2025-10-15 InteractiveOmni 2 SenseTime model paper
2025-10-15 Granite 4.0: Hybrid Mamba Architecture, First ISO 42001 Certified Open Models IBM IBM news
2025-10-14 Rex-Omni 2 IDEA Lab model paper 27.6k
2025-10-14 Zhipu AI Breaks US Chip Reliance With First Major Model Trained on Huawei Stack SCMP Z.ai news
2025-10-13 RITE: Reinforcement Learning for Tool-Integrated Interleaved Thinking Meituan paper
2025-10-09 Ling 2.0 / Ling-1T 2 Ant Group model paper 2.4k
2025-10-01 R-HORIZON-Websearch Meituan dataset
2025-10-01 GDPval OpenAI eval
2025-10-01 IBM Research Names Jay Gambetta as Director; Dario Gil to DOE IBM IBM news
2025-09-30 GLM-4.6 Z.ai model
2025-09-29 Ring 4 Ant Group model paper 242 18.8k
2025-09-29 DeepSeek-V3.2 2 DeepSeek model paper 291.1k 1
2025-09-28 HunyuanImage-3.0 2 Tencent model paper 675
2025-09-26 Qwen3Guard Alibaba model 439
2025-09-25 Expanding Reasoning Potential (CoTP) Meituan paper
2025-09-24 LRM-Eval / ROME BAAI dataset 5
2025-09-23 ByteWrist ByteDance model
2025-09-23 LongCat-Flash-Thinking 2 Meituan model paper 83
2025-09-23 Symphony-MoE PCL paper
2025-09-22 BGE-Reasoner BAAI model 24 710
2025-09-22 ScaleCUA PJLab model 89
2025-09-18 Seedream 4.0 ByteDance model
2025-09-17 AToken Apple paper
2025-09-16 Shanghai launches innovation institute to bridge AI research and industry Shanghai Municipal Government SII news
2025-09-15 checkpoint-engine Moonshot AI library
2025-09-08 PLaMo 2 PFN model
2025-09-05 Klear 3 Kuaishou model paper 2.3k
2025-09-05 MiniCPM4.1 2 OpenBMB model paper 39.5k
2025-09-02 Baichuan-M2 2 Baichuan paper model 234.7k 1
2025-09-02 Apertus Swiss AI model
2025-09-01 VeOmni ByteDance library
2025-09-01 LongCat-Flash-Chat 2 Meituan model paper 40.7k
2025-09-01 Hunyuan-MT Tencent model 26.3k
2025-09-01 RLinf ZGCA library
2025-09-01 TwinBrainVLA ZGCA paper
2025-09-01 Mistral AI Raises EUR 2B at EUR 12B Valuation Mistral AI Mistral news
2025-08-28 HyperOS 3 Xiaomi announcement
2025-08-26 MiniCPM-V 4.5 2 OpenBMB model paper 93.4k
2025-08-25 GEPO PCL paper
2025-08-25 InternVL 3.5 PJLab model
2025-08-23 HunyuanVideo-Foley Tencent paper
2025-08-21 Fin-PRM: Process Reward Model for Financial Reasoning Alibaba paper
2025-08-21 Waver ByteDance model
2025-08-21 DeepSeek-V3.1 DeepSeek model
2025-08-21 Intern-S1 PJLab model
2025-08-20 Seed-OSS-36B ByteDance model 26.8k
2025-08-20 Nemotron Nano V2 NVIDIA model
2025-08-20 Seed-OSS-36B Released as Apache-2.0 Open-Weight Model VentureBeat ByteDance news
2025-08-15 PXDesign ByteDance model
2025-08-15 Physical Autoregressive Model (PAR) PCL paper
2025-08-14 NextStep-1 2 StepFun model paper 37
2025-08-14 Hunyuan-GameCraft 1.0 Tencent model 42
2025-08-14 Cohere Raises $500M at $6.8B Valuation Cohere Cohere news
2025-08-14 Cohere Hires Long-Time Meta Research Head Joelle Pineau as Chief AI Officer TechCrunch Cohere news
2025-08-12 Mistral Medium 3.1 Mistral model
2025-08-12 InternBootcamp PJLab library
2025-08-11 GLM-4.5V Z.ai model 46.6k
2025-08-07 CANN Huawei library
2025-08-07 GPT-5 OpenAI model
2025-08-07 TMA-Adaptive FP8 Grouped GEMM PJLab paper
2025-08-06 ACAVCaps Xiaomi dataset
2025-08-05 OmniScale ByteDance paper
2025-08-05 Seed Diffusion ByteDance model
2025-08-01 Qwen-Image 2 Alibaba model 7.6k 223.7k
2025-08-01 MegaDFT ZGCA paper
2025-08-01 Ai2 and UW Awarded $152M from NSF and NVIDIA for Open Scientific AI GeekWire Ai2 news
2025-07-31 Seed-Prover ByteDance model
2025-07-29 Libra-Bench & PIE_bench Meituan dataset
2025-07-28 MixGRPO Tencent paper
2025-07-28 GLM-4.5 2 Z.ai model paper 1
2025-07-27 SenseNova V6.5 SenseTime model
2025-07-27 StepFun-Prover-Preview StepFun model 47
2025-07-27 HunyuanWorld 3 Tencent model 1.3k
2025-07-25 Step-3 2 StepFun model paper 73.4k
2025-07-24 A.X 3.1 SK Telecom model
2025-07-24 SoftBank Corp. to Build the World's Largest AI Computing Infrastructure Using NVIDIA DGX SuperPOD with NVIDIA Blackwell GPUs SB Intuitions SB Intuitions news
2025-07-23 Towards Greater Leverage: Scaling Laws for Efficient MoE Ant Group paper
2025-07-23 ASI-Arch SII paper
2025-07-22 Qwen-Code Alibaba library 20.9k
2025-07-22 Qwen3-Coder 2 Alibaba model 16.1k 1.3M
2025-07-22 Seed-X Series ByteDance model 1k
2025-07-22 Reka Raises $110M Series B at $1B Valuation Reka Reka news
2025-07-17 Agentar-DeepFinance-100K Ant Group dataset 34
2025-07-17 Apple Intelligence Foundation Models Tech Report 2025 Apple Apple news
2025-07-14 EXAONE 4.0 LG model
2025-07-12 Scaling Laws for Optimal Data Mixtures Apple paper
2025-07-11 Kimi K2 4 Moonshot AI model paper 3.8M 3
2025-07-10 FlexOlmo Ai2 model
2025-07-10 KAT (Kwai-AutoThink) 2 Kuaishou paper model 138
2025-07-09 EXAONE Path 2.0 LG paper
2025-07-09 Grok-4 xAI model
2025-07-09 Grok-4 Released with Native Tool Use and Reasoning xAI xAI news
2025-07-07 POLAR PJLab paper
2025-07-05 How to Train Your LLM Web Agent ServiceNow paper
2025-07-03 IFBench Ai2 eval
2025-07-03 A.X 4.0 SK Telecom model
2025-07-01 CodePRM: Execution Feedback-enhanced Process Reward Model for Code Generation Huawei paper
2025-07-01 Voxtral Mistral model
2025-07-01 Solar Pro 2 Upstage model
2025-06-30 openPangu Huawei announcement
2025-06-30 Meta Superintelligence Labs Created; Wang Named Chief AI Officer CNBC Meta news
2025-06-27 HyperCLOVA X THINK Naver model
2025-06-27 Hunyuan-A13B 2 Tencent model paper 21.4k
2025-06-26 Kwai Keye-VL 2 Kuaishou model paper 79.9k
2025-06-25 OctoThinker SII paper
2025-06-24 Video-XL-2 BAAI model 223
2025-06-17 Mercury (Diffusion LLM) Inception Labs model
2025-06-16 SciSage / SurveyScope BAAI library
2025-06-16 MiniMax-M1 2 MiniMax model paper 12.1k
2025-06-15 AI-Driven Agentic Design Platform for Tumor Immunotherapy Drugs ZGCA announcement
2025-06-15 ZGCA & ZGCI Unveil AI-Driven Tumor Immunotherapy Drug Design Platform Zhongguancun Academy ZGCA news
2025-06-13 Scientists' First Exam PJLab eval
2025-06-12 Seed-1.6 (AdaCoT) ByteDance model
2025-06-12 Magistral Mistral model
2025-06-12 Predictable Scale Part II: Farseer StepFun paper
2025-06-12 Seed-1.6 Introduces Adaptive Chain-of-Thought (AdaCoT) ByteDance Seed ByteDance news
2025-06-11 FlagEvalMM BAAI library 101
2025-06-10 Seedance 1.0 ByteDance model
2025-06-07 The Illusion of Thinking Apple paper
2025-06-06 RoboBrain 2.0 2 BAAI model
2025-06-06 MiniCPM4 2 OpenBMB model paper 729
2025-06-06 Ultra-FineWeb OpenBMB dataset 25
2025-06-05 RoboRefer / RefSpatial BAAI model
2025-06-04 MiMo-VL 2 Xiaomi model paper 1.6k
2025-06-01 HumanSense Benchmark Ant Group dataset
2025-06-01 BrowseComp & WideSearch Moonshot AI dataset
2025-06-01 kimi-agent-sdk Moonshot AI library
2025-06-01 kimi-cli Moonshot AI library
2025-06-01 Kimi-Dev 2 Moonshot AI model paper 2.7k
2025-06-01 Kimi-Researcher Moonshot AI model
2025-06-01 walle Moonshot AI library
2025-06-01 AgentCPM Series 3 OpenBMB paper
2025-06-01 A.X Encoder SK Telecom model
2025-06-01 CF-Div2-Stepfun StepFun dataset
2025-06-01 SteptronOss StepFun library
2025-06-01 MiMo-Audio 2 Xiaomi model paper
2025-06-01 BrowseComp Z.ai dataset
2025-06-01 KTransformers Z.ai library
2025-05-30 AReaL Ant Group library 4.9k
2025-05-28 Ming-Omni Ant Group model 645 10.2k
2025-05-28 DeepSeek-R1-0528 DeepSeek model
2025-05-28 Pangu Embedded Huawei paper 138
2025-05-28 Skywork Open Reasoner 1 Skywork model
2025-05-27 Pangu Pro MoE Huawei paper 47
2025-05-27 HunyuanVideo-Avatar Tencent paper
2025-05-26 SynLogic 2 MiniMax paper dataset 496
2025-05-23 One RL to See Them All: Visual Triple Unified RL MiniMax paper
2025-05-22 Claude 4 Anthropic model
2025-05-22 XRing O1 Xiaomi announcement
2025-05-21 Devstral 2 Mistral model
2025-05-21 Falcon-H1 TII model
2025-05-20 BAGEL ByteDance model 6.6k
2025-05-17 Video-SafetyBench BAAI eval
2025-05-17 Model Merging in Pre-training of LLMs ByteDance paper
2025-05-15 BGE-Code-v1 BAAI model 11.4k 13.4k
2025-05-15 Apriel-Nemotron-15B Reasoning Model with NVIDIA ServiceNow ServiceNow news
2025-05-14 AlphaEvolve Google paper
2025-05-12 Seed1.5-VL ByteDance model
2025-05-12 MiniMax-Speech: Intrinsic Zero-Shot TTS MiniMax paper
2025-05-12 Step1X-3D: High-Fidelity Textured 3D Assets StepFun model
2025-05-10 Gated Attention for Large Language Models Alibaba paper
2025-05-08 Seed-Coder-8B ByteDance model
2025-05-07 DeerFlow ByteDance library
2025-05-07 Pangu Ultra MoE 2 Huawei model paper 9
2025-05-07 HunyuanCustom Tencent paper
2025-05-06 CCI 4.0 BAAI dataset
2025-05-06 OpenSeek BAAI model 1
2025-05-06 RoboOS 2 BAAI library
2025-05-02 MiMo (Reasoning) 2 Xiaomi model paper 41.1k
2025-05-01 AWorld Ant Group library 1.2k
2025-05-01 Kanana 1.5 Kakao model
2025-04-30 Amazon Nova Premier Amazon model
2025-04-30 DeepSeek-Prover-V2 DeepSeek model
2025-04-30 Nova Premier Launched as Amazon's Most Capable AI Model TechCrunch Amazon news
2025-04-30 Phi-4 Reasoning Models Released with Chain-of-Thought Microsoft Microsoft news
2025-04-29 Qwen3 9 Alibaba model paper 27k 40
2025-04-29 First LlamaCon Developer Conference Meta AI Meta news
2025-04-25 PolyMath Alibaba dataset 43
2025-04-25 Kimi-Audio 2 Moonshot AI model paper 20.2k
2025-04-24 Step1X-Edit StepFun model 62
2025-04-19 SRPO: Staged History-Resampling Policy Optimization Kuaishou paper
2025-04-17 Nemotron-CLIMB: Clustering-based Iterative Data Mixture Bootstrapping 3 NVIDIA paper dataset
2025-04-16 o3 OpenAI model
2025-04-15 DataDecide Ai2 paper
2025-04-15 ReTool: Reinforcement Learning for Strategic Tool Use in LLMs ByteDance paper
2025-04-15 Kling 2.0 1 Kuaishou model
2025-04-15 Kimina-Prover 2 Moonshot AI model paper 809
2025-04-15 miniF2F-test (Rectified) Moonshot AI dataset
2025-04-15 Apriel ServiceNow model
2025-04-15 Step-R1-V-Mini StepFun model
2025-04-15 ZR1-1.5B Zyphra model
2025-04-15 Apriel-5B: ServiceNow's First Open SLM ServiceNow ServiceNow news
2025-04-14 InternVL3 PJLab model
2025-04-12 SenseNova V6 SenseTime model
2025-04-10 Scaling Laws for Native Multimodal Models Apple paper
2025-04-10 Seed1.5-Thinking: Advancing Superb Reasoning Models with RL ByteDance paper 1
2025-04-10 Pangu Ultra 2 Huawei model paper
2025-04-10 Kimi-VL 2 Moonshot AI model paper 104.2k 1
2025-04-08 Amazon Nova Sonic Amazon model
2025-04-08 Dream 7B Huawei model
2025-04-08 Skywork R1V Series Skywork model
2025-04-08 Nova Sonic Speech-to-Speech Model Launched on Bedrock AWS Amazon news
2025-04-07 BaichuanMed-OCR Baichuan model 37
2025-04-05 Llama 4 Meta model
2025-04-05 Llama 4 Scout and Maverick Released (First MoE, Multimodal) Meta AI Meta news
2025-04-04 Nemotron-H NVIDIA model
2025-04-03 DeepSeek-GRM: Inference-Time Scaling for Generalist Reward Modeling DeepSeek paper 1
2025-04-01 MiniMax Speech Series MiniMax model
2025-03-31 Amazon Nova Act Amazon model
2025-03-31 Amazon Unveils Nova Act, an AI Agent That Controls a Web Browser TechCrunch Amazon news
2025-03-30 ToRL: Scaling Tool-Integrated RL SII paper
2025-03-28 Doubao-Deep-Thinking ByteDance model
2025-03-27 OpenComplex 2 BAAI model
2025-03-26 Qwen2.5-Omni-7B Alibaba model 4k 466.1k
2025-03-25 Gemini 2.5 Pro Google model
2025-03-21 Hunyuan-T1 Tencent model
2025-03-18 Sable BAAI model
2025-03-18 Llama-Nemotron (Nano/Super/Ultra) NVIDIA model
2025-03-18 HaploVL Tencent model
2025-03-17 EXAONE Deep LG model
2025-03-16 ERNIE 4.5 Baidu model
2025-03-16 ERNIE X1 Baidu model
2025-03-12 Gemma 3 Google model
2025-03-10 Seedream 2.0 ByteDance paper
2025-03-10 Reka Flash 3 Released (Open-Weight, 21B) Reka Reka news
2025-03-07 Ling 2 Ant Group paper model 242 11.6k 1
2025-03-06 QwQ-32B Alibaba model 54.9k
2025-03-06 BGE-VL 2 BAAI model dataset 11.4k 3.3k
2025-03-06 Predictable Scale Part I: Step Law StepFun paper
2025-03-04 CogView-4 Z.ai model
2025-03-03 Aya Vision Cohere model
2025-03-01 Command A Cohere model
2025-03-01 Sarashina2.2 3 SB Intuitions model
2025-02-28 3FS (Fire-Flyer File System) DeepSeek library
2025-02-28 Smallpond DeepSeek library
2025-02-28 Image-01 MiniMax model
2025-02-27 RoboBrain BAAI model 99.7k
2025-02-27 UniTok ByteDance paper
2025-02-27 DualPipe DeepSeek library
2025-02-27 EPLB (Expert Parallelism Load Balancer) DeepSeek library
2025-02-27 Hunyuan Turbo S 2 Tencent model paper
2025-02-26 DeepGEMM DeepSeek library
2025-02-26 BIG-Bench Extra Hard (BBEH) Google eval
2025-02-26 Kanana Kakao model
2025-02-26 Granite 3.2: Multimodal Vision and Chain-of-Thought Reasoning IBM IBM news
2025-02-25 DeepEP DeepSeek library
2025-02-24 Claude Code Anthropic library
2025-02-24 Baichuan-Audio 2 Baichuan paper model 37
2025-02-24 FlashMLA DeepSeek library
2025-02-24 Reasoning with Latent Thoughts: On the Power of Looped Transformers Google paper
2025-02-24 Muon Optimizer 2 Moonshot AI paper library 1
2025-02-24 Topic Over Source: The Key to Effective Data Mixing for LLM Pre-training PJLab paper
2025-02-22 Moonlight-3B/16B Moonshot AI model 78.4k 1
2025-02-20 SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines ByteDance eval
2025-02-19 Qwen2.5-VL Alibaba model
2025-02-19 FlexTok Apple paper
2025-02-18 MoBA: Mixture of Block Attention for Long-Context LLMs Moonshot AI paper 1
2025-02-18 Hunyuan-Large-Vision Tencent model
2025-02-17 Mistral Saba Mistral model
2025-02-17 OpenDWM / MaskGWM 2 SenseTime library paper
2025-02-17 Step-Audio / Step-Audio2 StepFun model
2025-02-17 Grok-3 xAI model
2025-02-17 Grok-3 Launched, Trained on 200K GPU Colossus Cluster xAI xAI news
2025-02-16 AdaGC: Improving Training Stability for Large Language Model Pretraining Baidu paper
2025-02-16 NSA: Native Sparse Attention DeepSeek paper 2
2025-02-15 1bit-Merging 1 Huawei paper
2025-02-14 WebOrganizer Ai2 paper
2025-02-14 LLaDA 2 Ant Group model 3.7k 1.8k 4
2025-02-14 Step-Video-T2V 2 StepFun model paper
2025-02-12 Wu Yonghui Joins ByteDance as Head of Seed Basic Research SCMP ByteDance news
2025-02-11 Nature Language Model (NatureLM) Microsoft model
2025-02-05 Scaling Laws for Upcycling Mixture-of-Experts Language Models SB Intuitions paper
2025-02-05 LIMO SII paper
2025-02-04 OpenAI and Kakao to Jointly Develop AI Products for South Korea CNBC Kakao news
2025-02-01 ModernBERT-Ja SB Intuitions model
2025-01-26 Baichuan-Omni-1.5 2 Baichuan paper model 316
2025-01-24 Baichuan-M1 3 Baichuan paper model 828 5
2025-01-23 UltraRAG OpenBMB library
2025-01-22 Doubao-1.5-Pro ByteDance model
2025-01-22 UI-TARS ByteDance library 140.8k 4
2025-01-22 DeepSeek-R1 DeepSeek model 1.6M
2025-01-22 Revisit Self-Debugging with Self-Generated Tests for Code Generation Meituan paper
2025-01-21 Hunyuan3D 2.0 3 Tencent model paper
2025-01-21 Stargate Project: $500B AI Infrastructure Initiative OpenAI OpenAI news
2025-01-20 Kimi k1.5 2 Moonshot AI model paper 10
2025-01-17 ComplexFuncBench Z.ai dataset
2025-01-15 InternLM3 PJLab model
2025-01-14 MiniMax-01 3 MiniMax model paper 101.3k 1
2025-01-14 MiniCPM-o 2.6 OpenBMB model 114.8k
2025-01-10 GThinker PCL model
2025-01-09 WanJuan 3.0 (WanJuan-SiLu) PJLab dataset
2025-01-07 Cosmos NVIDIA model
2025-01-03 AgentRefine Meituan paper
2025-01-01 Document Parse Upstage library
2024-12-31 OLMo 2 Ai2 model
2024-12-26 DeepSeek-V3 3 DeepSeek model paper 206
2024-12-25 QVQ Alibaba model
2024-12-24 LLM-jp-3 (172B) NII model
2024-12-23 Baichuan4-Finance 2 Baichuan paper model 1
2024-12-18 NOVA (Non-quantized Video Autoregressive) BAAI model
2024-12-13 DeepSeek-VL2 DeepSeek model
2024-12-13 Liquid AI Raises $250M Series A Led by AMD Liquid AI Liquid AI news
2024-12-13 Profile: Shanghai AI Lab: Driving both AI safety and development MERICS PJLab news
2024-12-12 Phi-4 Microsoft model
2024-12-12 Phi-4 Released: 14B SLM Specializing in Complex Reasoning Microsoft Research Microsoft news
2024-12-09 ProcessBench Alibaba dataset 1
2024-12-06 Aya Expanse Cohere model
2024-12-06 EXAONE 3.5 LG model
2024-12-06 Densing Law of LLMs OpenBMB paper
2024-12-06 InternVL 2.5 PJLab model
2024-12-05 Language Model Ladders Ai2 paper
2024-12-05 Infinity & InfinityStar ByteDance model 1
2024-12-05 Liquid: Scalable Multi-modal Generation ByteDance model
2024-12-05 Divot Tencent model
2024-12-05 Moto Tencent paper
2024-12-04 GenCast Google paper
2024-12-04 RedStone Microsoft dataset
2024-12-03 Amazon Nova Amazon model
2024-12-03 AWS Trainium2 (Trn2 / Trn2 UltraServer) Amazon announcement
2024-12-03 HunyuanVideo Tencent model 666 6
2024-12-03 SEED-Voken Tencent paper
2024-12-03 GLM-4-Voice: End-to-End Spoken Chatbot Z.ai model 1
2024-12-03 AWS Trainium2 Chips Generally Available; Trainium3 Previewed TechCrunch Amazon news
2024-12-01 Falcon 3 TII model
2024-11-25 Model Context Protocol (MCP) Anthropic library
2024-11-22 Tülu 3 Ai2 model
2024-11-22 Zamba2 (Hybrid SSM/Transformer Suite) Zyphra model
2024-11-22 Amazon Doubles Anthropic Investment to $8 Billion CNBC Amazon news
2024-11-21 AIMv2 Apple paper
2024-11-21 DINO-X 2 IDEA Lab model paper 4
2024-11-20 Hymba NVIDIA paper
2024-11-19 Aquila-VL-2B BAAI model 256
2024-11-08 Sarashina2-8x70B SB Intuitions model
2024-11-08 SB Intuitions releases 460B-parameter Japanese LLM Sarashina2-8x70B for academia and industry SB Intuitions SB Intuitions news
2024-11-04 Hunyuan-Large 2 Tencent model paper 977 5
2024-11-04 Hunyuan3D 1.0 Tencent model 92.4k 6
2024-11-01 SimpleQA OpenAI eval
2024-11-01 InternThinker PJLab model
2024-10-29 Agentforce Platform Launched for Enterprise AI Agents Salesforce Salesforce news
2024-10-28 AutoGLM 2 Z.ai model paper
2024-10-28 Zhongguancun Institute of Artificial Intelligence Established ZGCI ZGCA news
2024-10-24 Infinity-MM BAAI dataset
2024-10-24 MotionCLR 2 IDEA Lab library paper
2024-10-24 Skywork-Reward 2 Skywork model
2024-10-22 OmniGen BAAI model 10 1
2024-10-17 Janus 4 DeepSeek model paper dataset 53.4k 11
2024-10-15 Zyda-2 Zyphra dataset
2024-10-11 Baichuan-Omni 2 Baichuan paper model
2024-10-09 MLE-bench OpenAI eval
2024-10-09 PLaMo-100B PFN model
2024-10-09 Demis Hassabis & John Jumper Awarded Nobel Prize in Chemistry Google DeepMind Google news
2024-10-07 Falcon Mamba TII model
2024-10-02 Llama-3.1-Nemotron-70B NVIDIA model
2024-10-01 TxT360 MBZUAI dataset
2024-09-27 Emu3 2 BAAI paper 2.4k 4
2024-09-25 Molmo Ai2 model
2024-09-23 MobileUI Dataset Xiaomi dataset
2024-09-23 MobileVLM Xiaomi model
2024-09-19 Qwen2.5 3 Alibaba model paper 27k 57
2024-09-18 Qwen2-VL Alibaba model 48.7k 246
2024-09-18 Qwen2.5-Coder 2 Alibaba model paper 16.1k 30
2024-09-18 Qwen2.5-Math 2 Alibaba model paper 1.1k 10
2024-09-12 o1 OpenAI model
2024-09-11 Pixtral 12B Mistral model
2024-09-05 AdEMAMix Optimizer Apple paper
2024-09-05 DeepSeek-V2.5 DeepSeek model 7.2k
2024-09-05 MiniCPM3-4B OpenBMB model 14.1k
2024-09-05 Open-MAGVIT2 Tencent library
2024-09-05 Silvio Savarese Named to TIME 100 Most Influential in AI Salesforce Salesforce news
2024-09-03 OLMoE Ai2 model
2024-08-31 Hailuo AI (Video-01 / 2.3) MiniMax model
2024-08-29 CogVLM2 Z.ai model
2024-08-28 Auxiliary-Loss-Free Load Balancing Strategy DeepSeek paper
2024-08-26 Fire-Flyer AI-HPC: Cost-Effective Software-Hardware Co-Design DeepSeek paper
2024-08-21 Minitron NVIDIA paper
2024-08-21 Sarashina2 SB Intuitions model
2024-08-12 CogVideoX: Text-to-Video Diffusion Models Z.ai model 15
2024-08-07 EXAONE 3.0 LG model
2024-08-05 MiniCPM-V 2.6 OpenBMB model
2024-08-01 EXAONEPath 1.0 LG paper
2024-08-01 MiniMax Music Series MiniMax model
2024-07-29 Apple Foundation Models (AFM) Apple paper
2024-07-29 MindSearch PJLab library 2
2024-07-24 Mistral Large 2 Mistral model
2024-07-23 Llama 3.1 Meta model
2024-07-20 Consent in Crisis: The Rapid Decline of the AI Data Commons Cohere paper
2024-07-20 Falcon 2 TII model
2024-07-18 Mistral NeMo Mistral model
2024-07-16 Codestral Mamba Mistral model
2024-07-11 EchoMimicV2 & V3 Ant Group paper 4.2k 2
2024-07-11 Skywork-Math Skywork paper
2024-07-06 Kolors 2 Kuaishou model paper 594
2024-07-05 SenseNova 5.5 SenseTime model
2024-07-05 Vimi SenseTime model
2024-07-04 LLM-jp (v1/v2) NII model
2024-07-04 Step-2 StepFun model
2024-07-03 LivePortrait 2 Kuaishou library paper 5.1k 9
2024-07-03 InternLM2.5 PJLab model 41.6k
2024-07-02 InternVL 2.0 PJLab model
2024-07-01 QPlanner LG paper
2024-07-01 Mathstral 7B Mistral model
2024-07-01 MMLongBench-Doc PJLab dataset 1
2024-06-28 ERNIE 4.0 Turbo Baidu model
2024-06-26 Zhang Hongjiang, founder of BAAI: 'AI systems should never be able to deceive humans' Financial Times BAAI news
2024-06-24 Mooncake 2 Moonshot AI paper dataset 12
2024-06-24 Large Vocabulary Size Improves Large Language Models SB Intuitions paper
2024-06-20 Claude 3.5 Sonnet Anthropic model
2024-06-17 AquilaMed-RL BAAI model 28 1
2024-06-17 DeepSeek-Coder-V2 2 DeepSeek model paper 7.4k 48
2024-06-17 Nemotron-4 340B NVIDIA model
2024-06-14 MASt3R Naver paper
2024-06-12 SciRIFF Ai2 dataset
2024-06-11 Dasheng 3 Xiaomi paper model
2024-06-10 LlamaGen ByteDance model 5
2024-06-10 Apple Intelligence Introduced at WWDC 2024 Apple Apple news
2024-06-06 Qwen2 2 Alibaba model paper 27k 43
2024-06-06 Kling 2 Kuaishou model paper
2024-06-05 GLM-4V Z.ai model
2024-06-04 Seed-TTS 2 ByteDance model paper 4
2024-06-03 Skywork-MoE Skywork model
2024-06-01 agentUniverse Ant Group library 2.2k 1
2024-06-01 FlagScale BAAI library 495
2024-06-01 Yuan Embedding Inspur model
2024-05-30 MotionLLM 2 IDEA Lab model paper 5
2024-05-29 Codestral Mistral model
2024-05-28 Yuan 2.0-M32 Inspur model
2024-05-27 RLAIF-V OpenBMB paper 74 3
2024-05-23 DeepSeek-Prover DeepSeek model 906 5
2024-05-22 Baichuan 4 Baichuan model
2024-05-21 Scaling Monosemanticity Anthropic paper
2024-05-16 Grounding DINO 1.5 2 IDEA Lab model paper 10
2024-05-15 ByteFF ByteDance model
2024-05-14 Piccolo2 Embedding Model 2 SenseTime model paper 25 1
2024-05-14 Hunyuan-DiT 2 Tencent model paper 1
2024-05-13 GPT-4o OpenAI model
2024-05-13 Plot2Code Tencent dataset
2024-05-08 AlphaFold 3 Google paper
2024-05-07 DeepSeek-V2 2 DeepSeek model paper 12.8k 97
2024-05-07 Granite Code IBM model
2024-04-26 llm-jp-corpus NII dataset
2024-04-25 InternVL 1.5 PJLab model
2024-04-25 ShareGPT-4o PJLab dataset 16
2024-04-24 SenseNova 5.0 SenseTime model
2024-04-22 OpenELM Apple model
2024-04-22 SEED-X Tencent model
2024-04-18 Reka Core, Flash, and Edge Reka model
2024-04-17 ABAB 6 / 6.5 MiniMax model
2024-04-17 Mixtral 8x22B Mistral model
2024-04-12 MiniCPM-V 4 OpenBMB model paper 190.1k 21
2024-04-11 MiniCPM-V 2.0 OpenBMB model
2024-04-03 VAR (Visual Autoregressive Modeling) ByteDance model 7
2024-04-02 HyperCLOVA X Naver model
2024-04-01 RULER: What's the Real Context Size of Your LLM? NVIDIA eval
2024-03-28 Jamba AI21 Labs model
2024-03-28 Dataverse Upstage library
2024-03-28 sDPO Upstage paper
2024-03-28 Jamba: First Production-Grade SSM-Transformer Hybrid Released AI21 Labs AI21 Labs news
2024-03-23 Step-1 StepFun model
2024-03-23 Step-1V / 1.5V / 2V StepFun model
2024-03-23 Understanding Emergent Abilities from the Loss Perspective Z.ai paper
2024-03-19 MergeKit Arcee library
2024-03-17 Grok-1 xAI model
2024-03-12 Command R / R+ Cohere model
2024-03-11 Unraveling the Mystery of Scaling Laws: Part I Meituan paper
2024-03-08 DeepSeek-VL DeepSeek model 12.7k 43
2024-03-08 CogView3 Z.ai model
2024-03-04 Claude 3 Anthropic model
2024-03-01 Kimi 2M Moonshot AI model
2024-02-28 WanJuan 2.0 (WanJuan-CC) PJLab dataset
2024-02-27 BioT5+ Microsoft paper
2024-02-23 MegaScale ByteDance library 24
2024-02-21 SDXL-Lightning ByteDance model 53.4k 6
2024-02-21 Gemma Google model
2024-02-15 Gemini 1.5 Pro Google model
2024-02-15 SAMformer 2 Huawei model paper 1
2024-02-15 Sora OpenAI model
2024-02-07 Moirai Salesforce model
2024-02-06 SenseNova 4.0 SenseTime model
2024-02-05 BGE-M3 2 BAAI model paper 11.4k 16.4M 45
2024-02-05 DeepSeek-Math 2 DeepSeek model paper 66
2024-02-04 Qwen1.5 Alibaba model
2024-02-01 OLMo Ai2 model
2024-02-01 Aya 101 Cohere model
2024-02-01 MiniCPM 3 OpenBMB model paper 3.8k 19
2024-01-31 Dolma Ai2 dataset
2024-01-30 YOLO-World Tencent paper 26
2024-01-29 Baichuan 3 Baichuan model
2024-01-23 InternLM2 2 PJLab model paper 22.6k 27
2024-01-20 TFLOP Upstage paper
2024-01-19 Depth Anything ByteDance model 21
2024-01-17 AlphaGeometry Google paper
2024-01-17 GLM-4 Z.ai model
2024-01-15 SciGLM / SciInstruct Z.ai paper
2024-01-11 DeepSeek-MoE 2 DeepSeek model paper 23.3k 16
2024-01-09 Lightning Linear Attention Ant Group paper 2
2024-01-09 Baichuan-NPC Baichuan model
2024-01-04 LLaMA Pro Tencent model
2024-01-01 VSAG Ant Group library 459
2024-01-01 FlagAI BAAI library 3.9k
2023-12-28 PanGu-pi 3 Huawei model paper 2
2023-12-28 Spike No More: Stabilizing the Pre-training of Large Language Models SB Intuitions paper
2023-12-23 SOLAR 10.7B Upstage model
2023-12-22 GraphCast Google paper
2023-12-21 DUSt3R Naver paper
2023-12-21 InternVL: Scaling up Vision Foundation Models PJLab model 16
2023-12-20 Emu2 BAAI model 1.8k 21 7
2023-12-16 Paloma Ai2 dataset
2023-12-11 Mixtral 8x7B Mistral model
2023-12-06 Gemini 1.0 Google model
2023-12-05 MLX Apple library
2023-12-05 Lenna 2 Meituan model paper 1
2023-12-05 ReasonDet Meituan dataset 1
2023-11-29 DeepSeek-LLM 2 DeepSeek model paper 2k 82
2023-11-29 GNoME (Materials Discovery) Google paper
2023-11-28 Falcon (7B / 40B / 180B) TII model
2023-11-27 MagicAnimate & Make Pixels Dance ByteDance model 7
2023-11-27 Yuan 2.0 Inspur model
2023-11-27 UniRepLKNet Tencent model 34
2023-11-22 T-Rex 3 IDEA Lab model paper 8
2023-11-20 GPQA: Graduate-Level Google-Proof Q&A Anthropic, Ai2 eval
2023-11-14 Qwen2-Audio 2 Alibaba model paper 2.1k 2.3k 25
2023-11-06 CogVLM Z.ai model 81
2023-11-02 DeepSeek-Coder 2 DeepSeek model paper 14.2k 104
2023-10-30 Skywork-13B Skywork model
2023-10-25 DiQAD Baidu dataset
2023-10-19 KwaiYiiMath 2 Kuaishou model paper
2023-10-17 ERNIE 4.0 Baidu model
2023-10-17 BitNet Microsoft paper
2023-10-13 VideoCrafter Tencent library
2023-10-12 Aquila2 BAAI model 444 56
2023-10-09 Kimi-v1 Moonshot AI model
2023-10-05 MathCoder PJLab paper
2023-10-04 SEED / SEED-LLaMA Tencent model 10
2023-10-01 DALL-E 3 OpenAI model
2023-09-29 ToRA: Tool-Integrated Reasoning Agent Microsoft paper
2023-09-28 PLaMo-13B PFN model
2023-09-27 Mistral 7B Mistral model
2023-09-26 InternLM-XComposer PJLab model 7.5k 31
2023-09-25 Qwen-Agent Alibaba library 15.7k
2023-09-25 qwen.cpp Alibaba library 621
2023-09-21 PengCheng-Mind PCL model
2023-09-08 Ant Financial LLM Ant Group model
2023-09-08 CodeFuse Ant Group model
2023-09-08 Fin-Eval Ant Group dataset
2023-09-07 Hunyuan-LLM Tencent model
2023-09-06 Baichuan 2 3 Baichuan paper model 230.5k 125
2023-09-01 OpenSPG & OpenAGL Ant Group library 2k
2023-09-01 XTuner PJLab library
2023-08-31 ERNIE 3.5 Baidu model
2023-08-31 Belebele Meta eval
2023-08-30 JAIS MBZUAI model
2023-08-29 LongBench Z.ai eval 8
2023-08-24 Qwen-VL Alibaba model
2023-08-24 Code Llama Meta model
2023-08-22 Lagent & AgentLego PJLab library
2023-08-21 WanJuan 1.0 Corpus PJLab dataset 8
2023-08-20 ViT-Lens Tencent paper
2023-08-18 KwaiYii Kuaishou model
2023-08-15 Aquila 2 BAAI model paper 444 162
2023-08-11 MiLM-6B Xiaomi model
2023-08-08 Baichuan-53B Baichuan model
2023-08-04 SoftBank launches an OpenAI for Japan: SB Intuitions, building LLMs and generative AI in Japanese TechCrunch SB Intuitions news
2023-08-03 Qwen 3 Alibaba model paper 20.8k 113.2k 80
2023-08-02 BGE Text Embeddings BAAI model 11.4k 6M
2023-08-01 FlagEmbedding & C-MTEB 2 BAAI library dataset 11.4k 70
2023-08-01 ABAB 5 / 5.5 MiniMax model
2023-07-31 ToolLLM: Facilitating LLMs to Master 16000+ APIs OpenBMB paper 63
2023-07-30 SEED-Bench Tencent eval
2023-07-19 EXAONE 2.0 LG model
2023-07-18 Llama 2 Meta model
2023-07-16 ChatDev 2 OpenBMB paper library 69
2023-07-13 InternVid PJLab dataset 32
2023-07-11 Emu BAAI model 1.8k 29
2023-07-11 Baichuan-13B Baichuan model 9.6k
2023-07-07 SenseNova 2.0 Upgrade SenseTime model
2023-07-06 InternLM-1.0 PJLab model 679
2023-07-05 PanGu-Weather 2 Huawei, PCL model paper 122
2023-07-01 InternEvo PJLab library
2023-07-01 OpenCompass PJLab library
2023-06-25 ChatGLM2 / ChatGLM3 Z.ai model
2023-06-23 MME (Multimodal Evaluation) BAAI eval 17.5k
2023-06-20 Phi-1 ("Textbooks Are All You Need") Microsoft model
2023-06-20 UniAD SenseTime, PJLab paper 1
2023-06-15 Baichuan-7B Baichuan model 56.8k
2023-06-14 WebGLM Z.ai paper
2023-06-12 detrex 2 IDEA Lab library paper 15
2023-06-07 AlphaDev Google paper
2023-06-01 DB-GPT Ant Group library 18.3k
2023-06-01 DLRover Ant Group library 1.6k
2023-06-01 LMDeploy PJLab library
2023-06-01 RefinedWeb TII dataset
2023-05-31 Let's Verify Step by Step OpenAI paper
2023-05-30 GPT4Tools Tencent paper
2023-05-29 Mix-of-Show Tencent paper
2023-05-27 CPM-Bee OpenBMB model
2023-05-22 Grouped Query Attention (GQA) Google paper
2023-05-20 PengCheng-Nebula PCL announcement
2023-05-17 Ziya LLM 4 IDEA Lab model 1.2k
2023-05-04 StarCoder ServiceNow model
2023-04-20 UltraChat & UltraFeedback OpenBMB dataset
2023-04-11 SenseChat / SenseNova Launch SenseTime model
2023-04-11 SenseMirage SenseTime model
2023-04-10 Stable-DINO 2 IDEA Lab library paper 5
2023-04-06 Grounded SAM 3 IDEA Lab library paper 88
2023-04-05 Segment Anything (SAM) Meta model
2023-04-01 BMTools OpenBMB library
2023-03-27 EVA-CLIP BAAI model 2.7k 78
2023-03-27 Qianfan Platform Baidu announcement
2023-03-20 PanGu-Sigma 2 Huawei, PCL model paper 7
2023-03-14 OpenSeeD 2 IDEA Lab library paper 2
2023-03-14 GPT-4 OpenAI model
2023-03-14 ChatGLM-6B Z.ai model 83.7k 175
2023-03-09 Grounding DINO 2 IDEA Lab model paper 1.4M 240
2023-02-27 LLaMA Meta model
2023-01-30 BLIP-2 Salesforce model
2023-01-23 Microsoft Extends Multibillion-Dollar OpenAI Partnership Microsoft Microsoft news
2023-01-01 FlagEvaluation BAAI library 12
2022-12-22 Tune-A-Video Tencent paper
2022-12-15 Constitutional AI Anthropic paper
2022-12-06 InternVideo / InternVideo2 PJLab model 91
2022-12-05 Painter BAAI model 10
2022-11-30 Speculative Decoding Google paper
2022-11-12 AltCLIP & AltDiffusion BAAI model 3.9k 149.6k 9
2022-11-10 InternImage 2 PJLab model paper 38
2022-11-02 Chinese CLIP Alibaba model 5.8k 51
2022-11-02 Taiyi 3 IDEA Lab model paper 510 2
2022-10-06 ByteTransformer ByteDance library 1
2022-09-30 CodeGeeX 2 Z.ai model paper 47
2022-09-21 Whisper OpenAI model
2022-09-16 CPM-Ant OpenBMB model
2022-09-03 TuGraph Ant Group library 1.7k
2022-08-24 GLM-130B 2 Z.ai model paper 294
2022-08-01 COYO-700M Kakao dataset
2022-07-22 PanGu-Coder 3 Huawei model paper 5 36
2022-07-04 SecretFlow Ant Group library 2.6k
2022-06-24 YOLOv6 2 Meituan library paper 1.7k
2022-06-06 Mask DINO 2 IDEA Lab library paper 19
2022-06-01 Vision GNN (ViG) 2 Huawei model paper 194
2022-05-02 OPT (Open Pre-trained Transformer) Meta model
2022-04-26 CogView2 Z.ai, BAAI model
2022-04-12 Training a Helpful and Harmless Assistant (HH-RLHF) Anthropic paper
2022-04-05 PaLM Google model
2022-03-29 Chinchilla (Compute-Optimal Training) Google paper
2022-03-25 CodeGen Salesforce model
2022-03-20 Delta Tuning 2 OpenBMB paper library
2022-03-07 DINO (DETR) 2 IDEA Lab library paper 747
2022-03-04 InstructGPT (RLHF) OpenAI paper
2022-03-02 DN-DETR 2 IDEA Lab library paper 54
2022-02-11 BMTrain OpenBMB library
2022-02-08 AlphaCode Google paper
2022-02-07 OFA: One For All Alibaba model 2.6k 258
2022-01-30 FEDformer 2 Huawei model paper 534
2022-01-28 Chain-of-Thought Prompting Google paper
2022-01-28 DAB-DETR 2 IDEA Lab library paper 391
2022-01-25 SPIRAL 2 Huawei model paper 7
2022-01-24 SenseCore AI Infrastructure SenseTime announcement
2022-01-14 DeepSpeed-MoE Microsoft paper
2021-12-31 ERNIE-ViLG Baidu model 30
2021-12-01 EXAONE 1.0 LG model
2021-12-01 tFold Tencent model
2021-11-30 Donut Naver paper
2021-11-22 Fengshenbang 3 IDEA Lab library model 339
2021-11-01 KoGPT Kakao model
2021-10-13 ByteTrack ByteDance library 105
2021-10-10 Yuan 1.0 Inspur model
2021-09-28 DiffVC 2 Huawei model paper 25
2021-09-10 HyperCLOVA Naver model
2021-09-03 FLAN (Instruction Tuning) Google paper
2021-08-10 Codex OpenAI model
2021-07-28 Triton OpenAI library
2021-07-15 AlphaFold 2 Google paper
2021-07-12 SPLADE Naver paper
2021-07-08 OpenDILab / DI-engine 2 SenseTime, PJLab library
2021-07-08 OpenPPL (PPLNN) SenseTime library
2021-07-07 HumanEval OpenAI eval
2021-07-05 ERNIE 3.0 & 3.0 Titan Baidu model 193
2021-07-01 Meituan Sky Project Meituan announcement
2021-06-28 PFP / Matlantis PFN model
2021-06-24 CPM-2 OpenBMB, BAAI model
2021-06-17 LoRA (Low-Rank Adaptation) Microsoft paper
2021-06-01 OceanBase Ant Group library 10k
2021-06-01 Wu Dao 2.0 2 BAAI model paper
2021-06-01 Wu Dao Corpora BAAI dataset
2021-05-26 CogView Z.ai, BAAI model
2021-05-13 Grad-TTS 2 Huawei model paper 43
2021-05-01 Trustworthy AI White Paper Xiaomi paper
2021-04-29 DINO Meta paper
2021-04-26 PanGu-alpha 2 Huawei, PCL model paper 94
2021-03-20 Wu Dao 1.0 BAAI model
2021-03-18 GLM (Original) 2 Z.ai model paper 256 21
2021-03-18 P-Tuning Z.ai paper
2021-03-01 M6 Series Alibaba model 48
2021-02-27 Transformer in Transformer (TNT) 2 Huawei model paper 1k
2021-02-26 CLIP OpenAI paper
2021-01-11 Switch Transformer Google paper
2021-01-05 DALL-E OpenAI model
2021-01-01 MS-MARCO-CN Baidu dataset 18
2021-01-01 PaddleNLP Baidu library
2021-01-01 PaddleSpeech Baidu library
2021-01-01 KoBART SK Telecom model
2020-12-07 HEBO 2 Huawei library paper 19
2020-12-01 CPM-1 OpenBMB model
2020-10-22 Vision Transformer (ViT) Google paper
2020-10-08 Deformable DETR 2 SenseTime model paper 19.7k 1.9k
2020-09-12 FuxiCTR / BARS 3 Huawei library paper 12
2020-08-01 Vega Huawei library
2020-07-15 PaddleOCR Baidu library
2020-06-11 GPT-3 OpenAI model
2020-06-08 Liquid Time-constant Networks Liquid AI paper
2020-04-08 DynaBERT 2 Huawei model paper 4 119
2020-03-28 MindSpore Huawei library
2020-03-13 ProGen Salesforce model
2020-03-10 Bolt Huawei library
2020-02-13 DeepSpeed Microsoft library
2020-01-23 Scaling Laws for Neural Language Models OpenAI paper
2020-01-01 Kunlun XPU Baidu announcement
2020-01-01 PaddleDetection Baidu library
2020-01-01 PaddleSeg Baidu library
2020-01-01 KoGPT2 SK Telecom model
2019-11-27 GhostNet 4 Huawei model paper 404
2019-09-23 TinyBERT 2 Huawei model paper 91.8k 136
2019-09-17 Megatron-LM NVIDIA library
2019-09-01 NEZHA 2 Huawei model paper 86
2019-08-23 Ascend 910 Series Huawei announcement
2019-08-01 Chainer PFN library
2019-07-29 ERNIE 2.0 Baidu model 74
2019-07-25 Optuna PFN library
2019-07-19 SUMBT SK Telecom paper
2019-06-20 Alchemy Tencent dataset 64
2019-06-01 KoBERT SK Telecom model
2019-04-19 ERNIE 1.0 2 Baidu model paper 767
2019-04-03 CRAFT Naver paper
2019-02-14 GPT-2 OpenAI model
2018-12-01 JAX Google library
2018-10-11 BERT Google paper
2018-10-01 OpenMMLab / MMDetection 2 SenseTime, PJLab library paper 794
2018-06-11 GPT-1 OpenAI model
2018-06-01 MACE (Mobile AI Compute Engine) Xiaomi library
2018-03-16 ApolloScape Baidu dataset
2018-03-01 Kata Containers Ant Group library 7.6k
2017-11-24 StarGAN Naver paper
2017-11-14 DuReader Baidu dataset 51
2017-11-02 VQ-VAE Google paper
2017-07-26 Xiao AI Xiaomi announcement
2017-07-20 Proximal Policy Optimization (PPO) OpenAI paper
2017-07-05 Apollo Baidu library
2017-06-12 Attention Is All You Need (Transformer) Google paper
2017-06-01 Angel ML Tencent library
2017-03-15 DiscoGAN SK Telecom paper
2017-01-18 PyTorch Meta library
2016-09-30 PaddlePaddle Baidu library
2016-04-27 OpenAI Gym OpenAI library
2016-01-27 AlphaGo Google paper
2015-11-09 TensorFlow Google library
2015-03-09 Distilling the Knowledge in a Neural Network Google paper
2015-02-26 DQN (Deep Q-Network) Google paper
2015-02-11 Batch Normalization Google paper
2014-12-17 Deep Speech 1 & 2 Baidu paper