Explanipedia

Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining

Daouda Sow, Herbert Woisetschläger, Saikiran Bulusu, Shiqiang Wang, Hans‐Arno Jacobsen, et al. · 2025

Pretraining large language models (LLMs) on vast and heterogeneous datasets is crucial for achieving state-of-the-art performance across diverse downstream tasks. However, current training paradigms treat all samples equally, overlooking t…

Take the Bull by the Horns: Hard Sample-Reweighted Continual Training Improves LLM Generalization

Xuxi Chen, Zhendong Wang, Daouda Sow, Junjie Yang, Tianlong Chen, et al. · 2024

Computer science, Psychology, Mathematics

In the rapidly advancing arena of large language models (LLMs), a key challenge is to enhance their capabilities amid a looming shortage of high-quality training data. Our study starts from an empirical strategy for the light continual tra…

Non-Convex Bilevel Optimization with Time-Varying Objective Functions

Sen Lin, Daouda Sow, Kaiyi Ji, Yingbin Liang, Ness B. Shroff · 2023

Computer science, Mathematics, Biology

Bilevel optimization has become a powerful tool in a wide variety of machine learning problems. However, the current nonconvex bilevel optimization considers an offline dataset and static functions, which may not work well in emerging onli…

Doubly Robust Instance-Reweighted Adversarial Training

Daouda Sow, Sen Lin, Zhangyang Wang, Yingbin Liang · 2023

Computer science, Mathematics, Economics

Assigning importance weights to adversarial data has achieved great success in training adversarially robust networks under limited model capacity. However, existing instance-reweighted adversarial training (AT) methods heavily depend on h…

Algorithm Design for Online Meta-Learning with Task Boundary Detection

Daouda Sow, Sen Lin, Yingbin Liang, Junshan Zhang · 2023

Computer science, Philosophy, Geology

Online meta-learning has recently emerged as a marriage between batch meta-learning and online learning, for achieving the capability of quick adaptation on new tasks in a lifelong manner. However, most existing approaches focus on the res…

A Primal-Dual Approach to Bilevel Optimization with Multiple Inner Minima

Daouda Sow, Kaiyi Ji, Ziwei Guan, Yingbin Liang · 2022

Computer science, Mathematics, Economics

Bilevel optimization has found extensive applications in modern machine learning problems such as hyperparameter optimization, neural architecture search, meta-learning, etc. While bilevel problems with a unique inner minimal point (e.g., …

On the Convergence Theory for Hessian-Free Bilevel Algorithms

Daouda Sow, Kaiyi Ji, Yingbin Liang · 2021

Computer science, Mathematics, Economics

Bilevel optimization has arisen as a powerful tool in modern machine learning. However, due to the nested structure of bilevel optimization, even gradient-based methods require second-order derivative approximations via Jacobian- or/and He…

ES-Based Jacobian Enables Faster Bilevel Optimization

Daouda Sow, Kaiyi Ji, Yingbin Liang · 2021

Computer science, Mathematics, Economics

Bilevel optimization (BO) has arisen as a powerful tool for solving many modern machine learning problems. However, due to the nested structure of BO, existing gradient-based methods require second-order derivative approximations via Jacob…

A sequential guiding network with attention for image captioning

Daouda Sow, Zengchang Qin, Mouhamed Niasse, Tao Wan · 2018

Computer science, Philosophy, Geography

The recent advances of deep learning in both computer vision (CV) and natural language processing (NLP) provide us a new way of understanding semantics, by which we can deal with more challenging tasks such as automatic description generat…

Development of a Solar Controller with MLI Control

Mamadou Wade, M. Gueye, Ousmane Sow, Daouda Sow, Babou Dione, et al. · 2018

Engineering, Computer science, Physics

This work presents the development of a solar regulator which manages the charge and discharge of a (lead) battery installed in a photovoltaic system in order to extend its lifetime. The regulator is controlled by a microcontroller (PIC16F…

Daouda Sow