AI Lab Tracker
Labs
Timeline
Plot2Code
dataset
2024-05-13
Tencent
Comprehensive benchmark for evaluating multimodal LLMs on code generation from scientific plots. Published at NAACL Findings 2025.
Paper (arXiv)
GitHub
Dataset
Dataset
GitHub Repository
benchmark
code
multimodal