Explanipedia

Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting Open

Howard Chen, Noam Razin, Karthik Narasimhan, Danqi Chen · 2025

Adapting language models (LMs) to new tasks via post-training carries the risk of degrading existing capabilities -- a phenomenon classically known as catastrophic forgetting. In this paper, toward identifying guidelines for mitigating thi…

Extracting Rule-based Descriptions of Attention Features in Transformers Open

Dan Friedman, Adithya Bhaskar, Alexander Wettig, Danqi Chen · 2025

Mechanistic interpretability strives to explain model behavior in terms of bottom-up primitives. The leading paradigm is to express hidden states as a sparse linear combination of basis vectors, called features. However, this only identifi…

The Model Hears You: Audio Language Model Deployments Should Consider the Principle of Least Privilege Open

Luxi He, Xiangyu Qi, Michel Liao, Ian R Cheong, Prateek Mittal, et al. · 2025

The latest Audio Language Models (Audio LMs) process speech directly instead of relying on a separate transcription step. This shift preserves detailed information, such as intonation or the presence of multiple speakers, that would othe…

Sustainable valorization of tea waste by enhanced caffeine extraction via microbial-driven solid-state fermentation Open

Bixia Qiu, Jiaying Yu, Danqi Chen, Yuying Zeng, Chunyu Li, et al. · 2025

Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking Open

W. J. Zhang, Fangcong Yin, H. W. Yen, Danqi Chen, Xi Ye · 2025

Recent work has identified retrieval heads, a subset of attention heads responsible for retrieving salient information in long-context language models (LMs), as measured by their copy-paste behavior in Needle-in-a-Haystack tasks. In this pa…

The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning Open

Xinyu Zhu, Mengfan Xia, Zhepei Wei, Weilin Chen, Danqi Chen, et al. · 2025

Reinforcement learning with verifiable rewards (RLVR) is a promising approach for training language models (LMs) on reasoning tasks that elicit emergent long chains of thought (CoTs). Unlike supervised learning, it updates the model using …

The Model Hears You: Audio Language Model Deployments Should Consider the Principle of Least Privilege Open

Luxi He, Xiangyu Qi, Michel Liao, Inyoung Cheong, Prateek Mittal, et al. · 2025

The latest Audio Language Models (Audio LMs) process speech directly instead of relying on a separate transcription step. This shift preserves detailed information, such as intonation or the presence of multiple speakers, that would otherw…

Formaldehyde Exposure Induces Systemic Epigenetic Alterations in Histone Methylation and Acetylation Open

Jiahao Feng, Chih‐Wei Liu, Jingya Peng, Yun-Chung Hsiao, Danqi Chen, et al. · 2025

Formaldehyde (FA) is a pervasive environmental organic pollutant and a Group 1 human carcinogen. While FA has been implicated in various cancers, its genotoxic effects, including DNA damage and DNA-protein crosslinking, have proven insuffi…

Organize the Web: Constructing Domains Enhances Pre-Training Data Curation Open

Alexander Wettig, Kyle Lo, Sewon Min, Hannaneh Hajishirzi, Danqi Chen, et al. · 2025

Modern language models are trained on large, unstructured datasets consisting of trillions of tokens and obtained by crawling the web. The unstructured nature makes it difficult to reason about their contents and develop systematic approac…

Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving Open

Yong Lin, Sui Tang, Bohan Lyu, Jiayun Wu, Hui Lin, et al. · 2025

We introduce Goedel-Prover, an open-source language model that achieves state-of-the-art (as of April 5, 2025) performance in automated formal proof generation for mathematical problems. A key challenge in this field is the scarcity of form…

Optimizing the thermostability of triketone dioxygenase for engineering tolerance to mesotrione herbicide in soybean and cotton Open

Stephen M. G. Duff, Lei Shi, Danqi Chen, Xiaoran Fu, Mingsheng Peng, et al. · 2025

Optimized triketone dioxygenase (TDO) variants with enhanced temperature stability parameters were engineered to enable robust triketone tolerance in transgenic cotton and soybean crops. This herbicide tolerance trait, which can metabolize…

Metadata Conditioning Accelerates Language Model Pre-training Open

Tianyu Gao, Alexander Wettig, Luxi He, Yihe Dong, Sadhika Malladi, et al. · 2025

The vast diversity of styles, domains, and quality levels present in language model pre-training corpora is essential in developing general model capabilities, but efficiently learning and deploying the correct behaviors exemplified in eac…

LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation Open

Xi Ye, Fangcong Yin, Ying‐Hui He, J. Zhang, H. W. Yen, et al. · 2025

Existing benchmarks for evaluating long-context language models (LCLMs) primarily focus on long-context recall, requiring models to produce short responses based on a few critical snippets while processing thousands of irrelevant tokens. W…

Representing Rule-based Chatbots with Transformers Open

Dan Friedman, Abhishek Panigrahi, Danqi Chen · 2025

How to Train Long-Context Language Models (Effectively) Open

Tianyu Gao, Alexander Wettig, H. W. Yen, Danqi Chen · 2025

Industrial Robots and Migrants’ Settlement Intention in Cities: A Study on China Open

Liqun Pan, Danqi Chen, Jing Li, Pundarik Mukhopadhaya · 2025

Continual Memorization of Factoids in Language Models Open

Howard Chen, Jiayi Geng, Adithya Bhaskar, Dan Friedman, Danqi Chen · 2024

As new knowledge rapidly accumulates, language models (LMs) with pretrained knowledge quickly become obsolete. A common approach to updating LMs is fine-tuning them directly on new knowledge. However, recent studies have shown that fine-tu…

Efficacy and Safety of DPP-4 Inhibitors and Metformin Combinations in Type 2 Diabetes: A Systematic Literature Review and Network Meta-Analysis [Corrigendum] Open

Rongping Chen, Jing Li, Danqi Chen, Weiheng Wen, Susu Zhang, et al. · 2024

[This corrects the article DOI: 10.2147/DMSO.S450994.]

Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization Open

Noam Razin, Sadhika Malladi, A. Uday Bhaskar, Danqi Chen, Sanjeev Arora, et al. · 2024

Direct Preference Optimization (DPO) and its variants are increasingly used for aligning language models with human preferences. Although these methods are designed to teach a model to generate preferred responses more frequently relative …

HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly Open

H. W. Yen, Tianyu Gao, Minghui Hou, Ke Ding, Daniel Fleischer, et al. · 2024

Many benchmarks exist for evaluating long-context language models (LCLMs), yet developers often rely on synthetic tasks such as needle-in-a-haystack (NIAH) or an arbitrary subset of tasks. However, it remains unclear whether these benchmar…

How to Train Long-Context Language Models (Effectively) Open

Tianyu Gao, Alexander Wettig, H. W. Yen, Danqi Chen · 2024

We study continued training and supervised fine-tuning (SFT) of a language model (LM) to make effective use of long-context information. We first establish a reliable evaluation protocol to guide model development -- instead of perplexity …

Diagnosis of ecological security and the spatial heterogeneity of its driving factors in the mining-impacted watershed, based on ecosystem health-risk-services framework Open

Wenjuan Jin, Zhenxing Bian, Zhichao Dong, Danqi Chen, Xufeng Zhang, et al. · 2024

A comprehensive diagnosis of ecological security (ES) and its driving mechanisms in the watershed under mining influence is essential for the conservation and restoration of watershed ecosystems. Few studies have comprehensively evaluated …

Parental perceptions and experiences of kangaroo care for preterm infants in neonatal intensive care units in China: a qualitative study Open

Qian Cai, Yunxian Zhou, Danqi Chen, Fang Wang, Xinfen Xu · 2024

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval Open

Hongjin Su, H. W. Yen, Mengzhou Xia, Weijia Shi, Niklas Muennighoff, et al. · 2024

Existing retrieval benchmarks primarily consist of information-seeking queries (e.g., aggregated questions from search engines) where keyword or semantic-based retrieval is usually sufficient. However, many complex real-world queries requi…

Representing Rule-based Chatbots with Transformers Open

Dan Friedman, Abhishek Panigrahi, Danqi Chen · 2024

What kind of internal mechanisms might Transformers use to conduct fluid, natural-sounding conversations? Prior work has illustrated by construction how Transformers can solve various synthetic tasks, such as sorting a list or recognizing …

LitSearch: A Retrieval Benchmark for Scientific Literature Search Open

Anirudh Ajith, Mengzhou Xia, Alexis Chevalier, Tanya Goyal, Danqi Chen, et al. · 2024

Literature search questions, such as "Where can I find research on the evaluation of consistency in generated summaries?" pose significant challenges for modern search engines and retrieval systems. These questions often require a deep und…

Healthcare providers' perceptions and experiences of kangaroo mother care for preterm infants in four neonatal intensive care units in China: a qualitative descriptive study Open

Qian Cai, Yunxian Zhou, M H Hong, Danqi Chen, Xinfen Xu · 2024

Background Kangaroo mother care (KMC) is an evidence-based intervention that can effectively reduce morbidity and mortality in preterm infants, but it has yet to be widely implemented in health systems in China. Most qualitative studies on…

Remodeling tumor‐associated macrophage for anti‐cancer effects by rational design of irreversible inhibition of mitogen‐activated protein kinase‐activated protein kinase 2 Open

Danyi Wang, Deqiao Sun, Xiaoyan Wang, Peng Xia, Yinchun Ji, et al. · 2024

Mitogen‐activated protein kinase‐activated protein kinase 2 (MK2) emerges as a pivotal target in developing anti‐cancer therapies. The limitations of ATP‐competitive inhibitors, due to insufficient potency and selectivity, underscore the u…

Danqi Chen