AI Lab Tracker
ACAVCaps
dataset
2025-08-06
Xiaomi
38,000-hour collection of general audio captions for training holistic audio-reasoning models.
Paper (arXiv)
GitHub
audio
training-data