Kshitiz Malik
Effective Long-Context Scaling of Foundation Models
We present a series of long-context LLMs that support effective context windows of up to 32,768 tokens. Our model series are built through continual pretraining from Llama 2 with longer training sequences and on a dataset where long texts …
Where to Begin? On the Impact of Pre-Training and Initialization in Federated Learning
An oft-cited challenge of federated learning is the presence of heterogeneity. Data heterogeneity refers to the fact that data from different clients may follow very different distributions. System heterogeneity refers to the…
Federated Learning with Partial Model Personalization
We consider two federated learning algorithms for training partially personalized models, where the shared and personal parameters are updated either simultaneously or alternately on the devices. Both algorithms have been proposed in the l…
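A minimal sketch of the two update schemes mentioned above, assuming a PyTorch model whose parameters have already been split into a shared group and a personal group; the helper name local_round, the SGD optimizers, and the learning rate are illustrative assumptions, not the paper's implementation.

import torch

def local_round(model, shared_params, personal_params, loader, loss_fn,
                lr=0.01, alternating=True):
    # Separate optimizers for the shared and personal parameter groups.
    opt_shared = torch.optim.SGD(shared_params, lr=lr)
    opt_personal = torch.optim.SGD(personal_params, lr=lr)
    for x, y in loader:
        if alternating:
            # Alternating scheme: update the personal parameters first,
            # then the shared ones, each with its own backward pass.
            for opt in (opt_personal, opt_shared):
                opt_personal.zero_grad()
                opt_shared.zero_grad()
                loss_fn(model(x), y).backward()
                opt.step()
        else:
            # Simultaneous scheme: one backward pass updates both groups.
            opt_personal.zero_grad()
            opt_shared.zero_grad()
            loss_fn(model(x), y).backward()
            opt_personal.step()
            opt_shared.step()
    # Only the shared parameters would be sent back to the server;
    # the personal parameters stay on the device.
    return [p.detach().clone() for p in shared_params]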
FedSynth: Gradient Compression via Synthetic Data in Federated Learning
Model compression is important in federated learning (FL) with large models to reduce communication cost. Prior works have focused on sparsification-based compression, which can drastically affect the global model accuracy. In this w…
Papaya: Practical, Private, and Scalable Federated Learning
Cross-device Federated Learning (FL) is a distributed learning paradigm with several challenges that differentiate it from traditional distributed learning: variability in the system characteristics on each device, and millions of clients …
Federated Learning with Buffered Asynchronous Aggregation
Scalability and privacy are two critical concerns for cross-device federated learning (FL) systems. In this work, we identify that synchronous FL (synchronized aggregation of client updates) cannot scale efficiently beyond a few hu…
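A minimal sketch of the buffered-aggregation idea referenced in the title, assuming the global model is a plain NumPy parameter vector; the class name BufferedAsyncServer, the buffer size, and the simple averaging rule are illustrative choices rather than the paper's algorithm.

import numpy as np

class BufferedAsyncServer:
    def __init__(self, global_model, buffer_size=10, server_lr=1.0):
        self.global_model = np.asarray(global_model, dtype=float)
        self.buffer_size = buffer_size
        self.server_lr = server_lr
        self._buffer = []

    def receive_update(self, client_delta):
        """Called whenever any client finishes local training, in any order."""
        self._buffer.append(np.asarray(client_delta, dtype=float))
        if len(self._buffer) >= self.buffer_size:
            self._apply_buffer()

    def _apply_buffer(self):
        # Average the buffered client deltas and take one server step,
        # then clear the buffer for the next group of asynchronous arrivals.
        mean_delta = np.mean(self._buffer, axis=0)
        self.global_model = self.global_model + self.server_lr * mean_delta
        self._buffer.clear()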
Active Federated Learning
Federated Learning allows population-level models to be trained without centralizing client data by transmitting the global model to clients, calculating gradients locally, then averaging the gradients. Downloading models and uploading…
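The basic loop described in this snippet (send the global model to clients, compute gradients locally, average them) can be sketched as below; the helper local_gradient, the uniform client sampling, and the learning rate are assumptions for illustration, and this is not Active Federated Learning's client-selection strategy itself.

import random
import numpy as np

def federated_round(global_weights, clients, local_gradient, num_sampled=10, lr=0.1):
    # Sample a subset of clients for this round (uniformly, for illustration).
    sampled = random.sample(clients, k=min(num_sampled, len(clients)))
    # Each sampled client receives the current global model and returns a
    # gradient computed on its own local data, which never leaves the device.
    grads = [local_gradient(global_weights, client) for client in sampled]
    # The server averages the local gradients and takes one update step.
    avg_grad = np.mean(grads, axis=0)
    return global_weights - lr * avg_grad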
Federated User Representation Learning
Collaborative personalization, such as through learned user representations (embeddings), can improve the prediction accuracy of neural-network-based models significantly. We propose Federated User Representation Learning (FURL), a simple,…
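A rough sketch of personalization through a learned per-user embedding, assuming a small PyTorch classifier; the architecture below and the note about keeping the embedding table on-device are illustrative assumptions, not necessarily FURL's exact formulation.

import torch
import torch.nn as nn

class PersonalizedClassifier(nn.Module):
    def __init__(self, num_users, feature_dim, user_dim=16, num_classes=2):
        super().__init__()
        # Per-user representation; in a federated setting this table could be
        # kept on-device while the rest of the model is shared and aggregated.
        self.user_embedding = nn.Embedding(num_users, user_dim)
        self.head = nn.Linear(feature_dim + user_dim, num_classes)

    def forward(self, features, user_ids):
        # Concatenate the user's learned representation with the input features.
        u = self.user_embedding(user_ids)
        return self.head(torch.cat([features, u], dim=-1))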