Daking Rai
Failure by Interference: Language Models Make Balanced Parentheses Errors When Faulty Mechanisms Overshadow Sound Ones
Despite remarkable advances in coding capabilities, language models (LMs) still struggle with simple syntactic tasks such as generating balanced parentheses. In this study, we investigate the underlying mechanisms behind the persistence of…
Mechanistic Understanding of Language Models in Syntactic Code Completion
Recently, language models (LMs) have shown impressive proficiency in code generation tasks, especially when fine-tuned on code-specific datasets, commonly known as Code LMs. However, our understanding of the internal decision-making proces…
All for One: LLMs Solve Mental Math at the Last Token With Information Transferred From Other Tokens
A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models
Understanding the Effect of Algorithm Transparency of Model Explanations in Text-to-SQL Semantic Parsing
Explaining the decisions of AI has become vital for fostering appropriate user trust in these systems. This paper investigates explanations for a structured prediction task called "text-to-SQL semantic parsing", which translates a natura…
A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models
Mechanistic interpretability (MI) is an emerging sub-field of interpretability that seeks to understand a neural network model by reverse-engineering its internal computations. Recently, MI has garnered significant attention for interpreti…
Explaining Large Language Model-Based Neural Semantic Parsers (Student Abstract)
While large language models (LLMs) have demonstrated strong capability in structured prediction tasks such as semantic parsing, little research has explored the underlying mechanisms of their success. Our work studies different me…
Improving Generalization in Language Model-Based Text-to-SQL Semantic Parsing: Two Simple Semantic Boundary-Based Techniques
Compositional and domain generalization present significant challenges in semantic parsing, even for state-of-the-art semantic parsers based on pre-trained language models (LMs). In this study, we empirically investigate improving an LM's …