AI Lab Tracker
Labs
Timeline
Model Merging in Pre-training of LLMs
paper
2025-05-17
ByteDance
Research on model merging techniques during pre-training of large language models, exploring how independently trained model branches can be merged to improve training efficiency and final model quality.
Paper (arXiv)
nlp
training
research