Model Merging
Model merging is a popular technique for large language models (LLMs). Here is a chronological list of papers in this space to help you get started.
- Qualitatively characterizing neural network optimization problems
  Paper • 1412.6544
- Convergent Learning: Do different neural networks learn the same representations?
  Paper • 1511.07543
- Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models
  Paper • 1909.11299
- Model Fusion via Optimal Transport
  Paper • 1910.05653