AI Lab Tracker
Labs
Timeline
CPM.cu
library
2025-03-01
OpenBMB
Lightweight CUDA implementation for maximum LLM inference performance on edge GPUs (RTX series and Jetson).
GitHub
Library
GitHub Repository
efficiency
on-device
infrastructure