Yangjun Ruan
LM Agents May Fail to Act on Their Own Risk Knowledge
Language model (LM) agents have demonstrated significant potential for automating real-world tasks, yet they pose a diverse array of potential, severe risks in safety-critical scenarios. In this work, we identify a significant gap between …
Reasoning to Learn from Latent Thoughts
Compute scaling for language model (LM) pretraining has outpaced the growth of human-written texts, leading to concerns that data will become the bottleneck to LM scaling. To continue scaling pretraining in this data-constrained regime, we…
MixMin: Finding Data Mixtures via Convex Minimization
Modern machine learning pipelines are increasingly combining and mixing data from diverse and disparate sources, e.g., pre-training large language models. Yet, finding the optimal data mixture is a challenging and open problem. We formaliz…
Graph-based Uncertainty Metrics for Long-form Language Model Outputs
Recent advancements in Large Language Models (LLMs) have significantly improved text generation capabilities, but these systems are still known to hallucinate, and granular uncertainty estimation for long-form LLM generations remains chall…
APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts
Large Language Models (LLMs) have become increasingly capable of handling diverse tasks with the aid of well-crafted prompts and integration of external tools, but as task complexity rises, the workflow involving LLMs can be complicated an…
Observational Scaling Laws and the Predictability of Language Model Performance
Understanding how language model performance varies with scale is critical to benchmark and algorithm development. Scaling laws are one approach to building this understanding, but the requirement of training models across many different s…
FastSpeech: Fast, Robust and Controllable Text to Speech
Neural network based end-to-end text to speech (TTS) has significantly improved the quality of synthesized speech. Prominent methods (e.g., Tacotron 2) usually first generate mel-spectrogram from text, and then synthesize speech from the m…
Identifying the Risks of LM Agents with an LM-Emulated Sandbox
Recent advances in Language Model (LM) agents and tool use, exemplified by applications like ChatGPT Plugins, enable a rich set of capabilities but also amplify potential risks, such as leaking private data or causing financial losses. Id…
Weighted Ensemble Self-Supervised Learning
Ensembling has proven to be a powerful technique for boosting model performance, uncertainty estimation, and robustness in supervised learning. Advances in self-supervised learning (SSL) enable leveraging large unlabeled corpora for state-…
Augment with Care: Contrastive Learning for Combinatorial Problems
Supervised learning can improve the design of state-of-the-art solvers for combinatorial problems, but labelling large numbers of combinatorial instances is often impractical due to exponential worst-case complexity. Inspired by the recent…
Optimal Representations for Covariate Shift
Machine learning systems often experience a distribution shift between training and testing. In this paper, we introduce a simple variational objective whose optima are exactly the set of all representations on which risk minimizers are gu…
Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding
Latent variable models have been successfully applied in lossless compression with the bits-back coding algorithm. However, bits-back suffers from an increase in the bitrate equal to the KL divergence between the approximate posterior and …
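For reference, the bitrate overhead this abstract describes is the standard bits-back identity; writing q(z|x) for the approximate posterior, p(z|x) for the true posterior, and p(x) for the marginal likelihood (notation assumed here, not taken from the truncated abstract):

\[
\mathbb{E}_{q(z \mid x)}\!\left[\log \frac{q(z \mid x)}{p(x, z)}\right] = -\log p(x) + D_{\mathrm{KL}}\!\left(q(z \mid x) \,\|\, p(z \mid x)\right),
\]

so bits-back coding attains the negative ELBO as a codelength, exceeding the ideal -log p(x) by exactly the KL term.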
Learning to Learn by Zeroth-Order Oracle
In the learning to learn (L2L) framework, we cast the design of optimization algorithms as a machine learning problem and use deep neural networks to learn the update rules. In this paper, we extend the L2L framework to zeroth-order (ZO) o…
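The truncated abstract refers to zeroth-order (ZO) optimization, where only function values, not gradients, are available. Below is a minimal sketch of a standard two-point ZO gradient estimator, the kind of oracle such a framework builds on; it is illustrative, not the paper's learned optimizer, and the names zo_gradient, mu, and n_samples are chosen here for exposition.

import numpy as np

def zo_gradient(f, x, mu=1e-3, n_samples=20, rng=None):
    """Two-point zeroth-order estimate of the gradient of f at x.

    Uses only function evaluations (a ZO oracle): for each random
    direction u, the finite difference (f(x + mu*u) - f(x)) / mu
    approximates the directional derivative along u.
    """
    rng = np.random.default_rng(rng)
    f_x = f(x)
    grad = np.zeros_like(x)
    for _ in range(n_samples):
        u = rng.standard_normal(x.shape[0])
        grad += (f(x + mu * u) - f_x) / mu * u
    return grad / n_samples

# Example: gradient-free descent on a simple quadratic
f = lambda x: float(np.sum(x ** 2))
x = np.ones(5)
for _ in range(200):
    x -= 0.05 * zo_gradient(f, x)  # x converges toward the minimizer 0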