AI Lab Tracker
Labs
Timeline
What's New
Mooncake
paper
dataset
2024-06-24
Moonshot AI
A KVCache-centric disaggregated architecture for LLM serving. Won Best Paper at FAST 2025.
Paper (arXiv)
GitHub
Outputs
2
Mooncake: KVCache-centric Disaggregated Architecture
paper
Venue
FAST 2025
Citations
12
arXiv
HTML
Mooncake Dataset & Code
dataset
GitHub
infrastructure
efficiency