Explanipedia

ACCO: Accumulate While You Communicate for Communication-Overlapped Sharded LLM Training Open

Adel Nabli, Louis Fournier, Pierre Erbacher, Louis Serrano, Eugene Belilovsky , et al. · 2024

Computer science Geography

Training LLMs relies on distributed implementations using multiple GPUs to compute gradients in parallel with sharded optimizers. However, synchronizing gradients in data parallel setups introduces communication overhead that grows with th…

WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average Open

Louis Fournier, Adel Nabli, Masih Aminbeidokhti, Marco Pedersoli, Eugene Belilovsky , et al. · 2024

Computer science Mathematics

The performance of deep neural networks is enhanced by ensemble methods, which average the output of several models. However, this comes at an increased cost at inference. Weight averaging methods aim at balancing the generalization of ens…

Speech Sequence Embeddings using Nearest Neighbors Contrastive Learning Open

Robin Algayres, Adel Nabli, Benoît Sagot, Emmanuel Dupoux · 2022

Computer science Mathematics Economics

We introduce a simple neural encoder architecture that can be trained using\nan unsupervised contrastive learning objective which gets its positive samples\nfrom data-augmented k-Nearest Neighbors search. We show that when built on top\nof…

DADAO: Decoupled Accelerated Decentralized Asynchronous Optimization Open

Adel Nabli, Edouard Oyallon · 2022

Mathematics Computer science Physics

This work introduces DADAO: the first decentralized, accelerated, asynchronous, primal, first-order algorithm to minimize a sum of $L$-smooth and $μ$-strongly convex functions distributed over a given network of size $n$. Our key insight i…

Speech Sequence Embeddings using Nearest Neighbors Contrastive Learning Open

Robin Algayres, Adel Nabli, Benoît Sagot, Emmanuel Dupoux · 2022

Computer science Mathematics Geography

We introduce a simple neural encoder architecture that can be trained using an unsupervised contrastive learning objective which gets its positive samples from data-augmented k-Nearest Neighbors search. We show that when built on top of re…

Complexity of the Multilvel Critical Node Problem Open

Adel Nabli, Margarida Carvalho, Pierre Hosteins · 2022

Computer science Mathematics Political science

In this work, we analyze a sequential game played in a graph called the Multilevel Critical Node problem (MCN). A defender and an attacker are the players of this game. The defender starts by preventively interdicting vertices (vaccination…

The multilevel critical node problem : theoretical intractability and a curriculum learning approach Open

Adel Nabli · 2020

Computer science Sociology Mathematics

Évaluer la vulnérabilité des réseaux est un enjeu de plus en plus critique. Dans ce mémoire, nous nous penchons sur une approche étudiant la défense d’infrastructures stratégiques contre des attaques malveillantes au travers de problèmes d…

Curriculum learning for multilevel budgeted combinatorial problems Open

Adel Nabli, Margarida Carvalho · 2020

Computer science Mathematics

Learning heuristics for combinatorial optimization problems through graph neural networks have recently shown promising results on some classic NP-hard problems. These are single-level optimization problems with only one player. Multilevel…

Adel Nabli YOU? Author Swipe