Kentaro Inui
Correction: Toward mapping pragmatic impairment of autism spectrum disorder individuals through the development of a corpus of spoken Japanese
[This corrects the article DOI: 10.1371/journal.pone.0264204.]
Uncovering the Spectral Bias in Diagonal State Space Models
Current methods for initializing state space model (SSM) parameters mainly rely on the HiPPO framework, which is based on an online approximation of orthogonal polynomials. Recently, diagonal alternatives have been shown to reach a s…
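As background for this abstract: a diagonal SSM replaces the dense state-transition matrix with a diagonal one, so the recurrence x_{t+1} = A x_t + B u_t runs elementwise in O(N) per step, and the entries of A directly determine which frequencies the state retains. A minimal NumPy sketch, assuming a toy real-valued random initialization (not the HiPPO scheme or the paper's proposal):

```python
# Minimal sketch of a diagonal state space model (SSM) recurrence in NumPy.
# Illustrative only: parameter names and the simple real-valued initialization
# below are assumptions, not the paper's HiPPO-based or proposed scheme.
import numpy as np

def diagonal_ssm(u, a_diag, b, c):
    """Run x_{t+1} = A x_t + B u_t, y_t = C x_t with diagonal A.

    u:      (T,) input sequence
    a_diag: (N,) diagonal of the (discretized) state matrix A
    b, c:   (N,) input and output projections
    """
    T, N = len(u), len(a_diag)
    x = np.zeros(N)
    y = np.empty(T)
    for t in range(T):
        x = a_diag * x + b * u[t]   # elementwise update: O(N) per step
        y[t] = c @ x
    return y

rng = np.random.default_rng(0)
N = 16
# Stable toy initialization: eigenvalues inside the unit circle.
a_diag = np.exp(-rng.uniform(0.01, 1.0, N))
b = rng.normal(size=N) / np.sqrt(N)
c = rng.normal(size=N) / np.sqrt(N)
u = rng.normal(size=100)
print(diagonal_ssm(u, a_diag, b, c)[:5])
```

The spectral bias in the title presumably concerns how such eigenvalue choices shape the frequencies the model can represent; the toy initialization here is only for illustration.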
Cross-prompt Pre-finetuning of Language Models for Short Answer Scoring
Automated short answer scoring (SAS) is the task of automatically scoring a given input to a prompt based on rubrics and reference answers. SAS is promising for real-world applications. However, because rubrics and reference answers differ…
TopK Language Models
Sparse autoencoders (SAEs) have become an important tool for analyzing and interpreting the activation space of transformer-based language models (LMs). However, SAEs suffer several shortcomings that diminish their utility and internal val…
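The TopK mechanism referenced in the title is commonly used in the SAE setting the abstract discusses: keep only the k largest latent activations per example and zero the rest, enforcing exact sparsity without an L1 penalty. A minimal PyTorch sketch of a TopK SAE layer (dimensions and class names are illustrative, not the paper's architecture):

```python
# Minimal sketch of a TopK sparse autoencoder (SAE) layer in PyTorch.
# The TopK activation keeps only the k largest latent activations per example,
# zeroing the rest; module and dimension names here are illustrative assumptions.
import torch
import torch.nn as nn

class TopKSAE(nn.Module):
    def __init__(self, d_model: int, d_latent: int, k: int):
        super().__init__()
        self.k = k
        self.encoder = nn.Linear(d_model, d_latent)
        self.decoder = nn.Linear(d_latent, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        z = self.encoder(x)
        # Keep the k largest activations per row, zero out the rest.
        topk = torch.topk(z, self.k, dim=-1)
        z_sparse = torch.zeros_like(z).scatter(-1, topk.indices, topk.values)
        return self.decoder(z_sparse)

sae = TopKSAE(d_model=64, d_latent=512, k=8)
x = torch.randn(4, 64)          # e.g., residual-stream activations
print(sae(x).shape)             # torch.Size([4, 64])
```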
Emergence of Primacy and Recency Effect in Mamba: A Mechanistic Point of View
We study memory in state-space language models using primacy and recency effects as behavioral tools to uncover how information is retained and forgotten over time. Applying structured recall tasks to the Mamba architecture, we observe a c…
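One way to operationalize such structured recall tasks: present a list of key-value pairs, query a key at each serial position, and measure accuracy as a function of position; primacy and recency appear as elevated accuracy at the list's ends. A sketch of such a probe harness (the prompt format and the `answer` stub are assumptions; in practice `answer` would query the Mamba model under study):

```python
# Illustrative sketch of a structured key-value recall probe of the kind the
# abstract describes. The prompt format and the `answer` stub are assumptions.
import random

def make_prompt(n_pairs: int, rng: random.Random):
    pairs = [(f"key{i}", f"val{rng.randrange(100)}") for i in range(n_pairs)]
    rng.shuffle(pairs)
    context = " ".join(f"{k} -> {v}." for k, v in pairs)
    return pairs, context

def answer(context: str, key: str) -> str:
    return "val0"  # stub: replace with a call to the model under study

rng = random.Random(0)
n_pairs, n_trials = 10, 200
correct = [0] * n_pairs
for _ in range(n_trials):
    pairs, context = make_prompt(n_pairs, rng)
    for pos, (k, v) in enumerate(pairs):
        correct[pos] += answer(context, k) == v

# Primacy/recency would show up as elevated accuracy at the first/last positions.
print([c / n_trials for c in correct])
```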
Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance
Large Language Models (LLMs) are known to consistently process information in a proficient internal language, referred to as the latent language, which may differ from the input or output languages. However, how the discrepancy between the…
Mechanistic Insights into Grokking from the Embedding Layer
Grokking, a delayed generalization in neural networks after perfect training performance, has been observed in Transformers and MLPs, but the components driving it remain underexplored. We show that embeddings are central to grokking: intr…
SPIRIT: Patching Speech Language Models against Jailbreak Attacks
Speech Language Models (SLMs) enable natural interactions via spoken instructions, which more effectively capture user intent by detecting nuances in speech. The richer speech signal introduces new security risks compared to text-based mod…
SmallPlan: Leverage Small Language Models for Sequential Path Planning with Simulation-Powered, LLM-Guided Distillation
Efficient path planning in robotics, particularly within large-scale, complex environments, remains a significant hurdle. While Large Language Models (LLMs) offer strong reasoning capabilities, their high computational cost and limited ada…
Beyond Click to Cognition
How Individual Traits and Language Styles Shape Preferences In Open-ended User-LLM Interaction: A Preliminary Study
What makes an interaction with the LLM more preferable for the user? While it is intuitive to assume that information accuracy in the LLM's responses would be one of the influential variables, recent studies have found that inaccurate LLM'…
Syntactic Learnability of Echo State Neural Language Models at Scale
What is a neural model with minimum architectural complexity that exhibits reasonable language learning capability? To explore such a simple but sufficient neural language model, we revisit a basic reservoir computing (RC) model, Echo Stat…
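An Echo State Network is about as architecturally minimal as recurrent models get: a fixed random reservoir provides the dynamics, and only a linear readout is trained. A minimal NumPy sketch on a toy next-step prediction task (hyperparameters and task are illustrative, not the paper's language-modeling setup):

```python
# Minimal Echo State Network sketch in NumPy: a fixed random reservoir with
# only the linear readout trained (ridge regression). Hyperparameters and the
# toy task are illustrative assumptions, not the paper's setup.
import numpy as np

rng = np.random.default_rng(0)
n_in, n_res = 1, 200
spectral_radius, leak = 0.9, 0.3

W_in = rng.uniform(-0.5, 0.5, (n_res, n_in))
W = rng.normal(size=(n_res, n_res))
W *= spectral_radius / np.max(np.abs(np.linalg.eigvals(W)))  # echo state property

def run_reservoir(u):
    """Collect leaky-integrator reservoir states for input sequence u (T, n_in)."""
    x = np.zeros(n_res)
    states = []
    for u_t in u:
        x = (1 - leak) * x + leak * np.tanh(W_in @ u_t + W @ x)
        states.append(x.copy())
    return np.array(states)

# Toy task: next-step prediction on a sine wave.
t = np.linspace(0, 20 * np.pi, 2000)
u = np.sin(t)[:, None]
X = run_reservoir(u[:-1])
y = u[1:, 0]
# Train only the readout with ridge regression; the reservoir stays fixed.
ridge = 1e-6
W_out = np.linalg.solve(X.T @ X + ridge * np.eye(n_res), X.T @ y)
print("train MSE:", np.mean((X @ W_out - y) ** 2))
```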
Number Representations in LLMs: A Computational Parallel to Human Perception
Humans are believed to perceive numbers on a logarithmic mental number line, where smaller values are represented with greater resolution than larger ones. This cognitive bias, supported by neuroscience and behavioral studies, suggests tha…
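The logarithmic-number-line hypothesis can be stated concretely: if numbers are encoded by their logarithms, representational distance depends on the ratio of two values rather than their difference, so resolution shrinks with magnitude. A tiny worked example:

```python
# A small numeric illustration of the logarithmic number-line hypothesis the
# abstract builds on: under a log encoding, representational distance depends
# on the ratio of two numbers, not their absolute difference.
import math

def log_distance(a: float, b: float) -> float:
    return abs(math.log(a) - math.log(b))

# Equal ratios map to equal distances...
print(log_distance(2, 3), log_distance(20, 30))    # identical
# ...so resolution shrinks as magnitudes grow: 101 vs 102 are nearly fused.
print(log_distance(1, 2), log_distance(101, 102))  # large vs tiny
```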
Large Language Models Are Human-Like Internally
Recent cognitive modeling studies have reported that larger language models (LMs) exhibit a poorer fit to human reading behavior (Oh and Schuler, 2023b; Shain et al., 2024; Kuribayashi et al., 2024), leading to claims of their cognitive im…
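The standard pipeline behind such fit comparisons computes per-token surprisal from an LM and regresses it against human reading times. A sketch of the surprisal step (the choice of GPT-2 and the example sentence are illustrative; the paper's exact setup may differ):

```python
# Per-token surprisal from a causal LM, the quantity typically regressed
# against human reading times in this literature. Model choice is an assumption.
import math
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tok = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

text = "The horse raced past the barn fell."
ids = tok(text, return_tensors="pt").input_ids           # shape (1, T)
with torch.no_grad():
    logits = model(ids).logits                           # shape (1, T, vocab)

# Surprisal of token t given tokens < t, converted to bits.
logprobs = torch.log_softmax(logits[0, :-1], dim=-1)
targets = ids[0, 1:]
surprisal = -logprobs[torch.arange(targets.size(0)), targets] / math.log(2)
for token, s in zip(tok.convert_ids_to_tokens(targets.tolist()), surprisal.tolist()):
    print(f"{token:>12} {s:6.2f} bits")
```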
FinchGPT: a Transformer based language model for birdsong analysis
Long-range dependencies among tokens, which originate from hierarchical structures, are a defining hallmark of human language. However, whether similar dependencies exist within the sequential vocalization of non-human animals rema…
Weight-based Analysis of Detokenization in Language Models: Understanding the First Stage of Inference Without Inference
According to the stages-of-inference hypothesis, early layers of language models map their subword-tokenized input, which does not necessarily correspond to a linguistically meaningful segmentation, to more meaningful representations that…
RECALL: Library-Like Behavior In Language Models is Enhanced by Self-Referencing Causal Cycles
We introduce the concept of the self-referencing causal cycle (abbreviated RECALL) - a mechanism that enables large language models (LLMs) to bypass the limitations of unidirectional causality, which underlies a phenomenon known as the rev…
Identification of Multiple Logical Interpretations in Counter-Arguments
Repetition Neurons: How Do Language Models Produce Repetitions?
Understanding the Side Effects of Rank-One Knowledge Editing
Annotating Errors in English Learners’ Written Language Production: Advancing Automated Written Feedback Systems
LLMs Can Compensate for Deficiencies in Visual Representations
The Geometry of Numerical Reasoning: Language Models Compare Numeric Properties in Linear Subspaces
Spelling-out is not Straightforward: LLMs’ Capability of Tokenization from Token to Characters
Rectifying Belief Space via Unlearning to Harness LLMs’ Reasoning
Deterministic Compression of Word Embeddings
Word embeddings are an indispensable technology in the field of artificial intelligence, particularly when working with natural language processing models. To further enhance their usability, several studies have tackled the compression of…