Explanipedia

Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent Open

Yongxian Wei, Anke Tang, Shen Li, Feng Xiong, Chun Yuan , et al. · 2025

Merging multiple expert models offers a promising approach for performing multi-task learning without accessing their original data. Existing methods attempt to alleviate task conflicts by sparsifying task vectors or promoting orthogonalit…

SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models Open

Anke Tang, Sheng Li, Yong Luo, Shuai Xie, Han Hu , et al. · 2024

Computer science Mathematics Materials science

Deep model training on extensive datasets is increasingly becoming cost-prohibitive, prompting the widespread adoption of deep model fusion techniques to leverage knowledge from pre-existing models. From simple weight averaging to more sop…

Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion Open

Anke Tang, Li Shen, Yong Luo, Liang Ding, Han Hu , et al. · 2023

Computer science Mathematics Engineering

Merging models fine-tuned from a common, extensively pre-trained large model but specialized for different tasks has been demonstrated as a cheap and scalable strategy to construct a multi-task model that performs well across diverse tasks…

Learning from models beyond fine-tuning Open

Hongling Zheng, Li Shen, Anke Tang, Yong Luo, Hu Han , et al. · 2023

Computer science Engineering

Foundation models (FM) have demonstrated remarkable performance across a wide range of tasks (especially in the fields of natural language processing and computer vision), primarily attributed to their ability to comprehend instructions an…

Parameter Efficient Multi-task Model Fusion with Partial Linearization Open

Anke Tang, Li Shen, Yong Luo, Yibing Zhan, Han Hu , et al. · 2023

Computer science Mathematics Physics

Large pre-trained models have enabled significant advances in machine learning and served as foundation components. Model fusion methods, such as task arithmetic, have been proven to be powerful and scalable to incorporate fine-tuned weigh…

Improving Heterogeneous Model Reuse by Density Estimation Open

Anke Tang, Yong Luo, Han Hu, Fengxiang He, Kehua Su , et al. · 2023

Computer science Engineering Geography

This paper studies multiparty learning, aiming to learn a model using the private data of different participants. Model reuse is a promising solution for multiparty learning, assuming that a local model has been trained for each party. Con…

Improving Heterogeneous Model Reuse by Density Estimation Open

Anke Tang, Yong Luo, Hu Han, Fengxiang He, Kehua Su , et al. · 2023

Computer science Engineering Chemistry

This paper studies multiparty learning, aiming to learn a model using the private data of different participants. Model reuse is a promising solution for multiparty learning, assuming that a local model has been trained for each party. Con…

Anke Tang YOU? Author Swipe