Insights into LLM Long-Context Failures: When Transformers Know but Don't Tell

Exploring foci of: arXiv (Cornell University) Insights into LLM Long-Context Failures: When Transformers Know but Don't Tell June 2024 • Taiming Lu, Muhan Gao, Kuai Yu, Adam Byerly, Daniel Khashabi Large Language Models (LLMs) exhibit positional bias, struggling to utilize information from the middle or end of long contexts. Our study explores LLMs' long-context reasoning by probing their hidden representations. We find that while LLMs encode the position of target information, they often fail to leverage this in generating accurate responses. This reveals a disconnect between information retrieval and utilization, a "know but don't tell" phenomenon. We further analyze the relationship between extraction tim… Open Article Page

Computer Science Open Article