Edan Kinderman
YOU?
Author Swipe
View article: Foldable SuperNets: Scalable Merging of Transformers with Different Initializations and Tasks
Foldable SuperNets: Scalable Merging of Transformers with Different Initializations and Tasks Open
Recent methods aim to merge neural networks (NNs) with identical architectures trained on different tasks into a single multi-task model. While most works focus on the simpler setup of merging NNs initialized from a common pre-trained netw…