Exploring foci of:
arXiv (Cornell University)
Insights into LLM Long-Context Failures: When Transformers Know but Don't Tell
June 2024 • Taiming Lu, Muhan Gao, Kuai Yu, Adam Byerly, Daniel Khashabi
Large Language Models (LLMs) exhibit positional bias, struggling to utilize information from the middle or end of long contexts. Our study explores LLMs' long-context reasoning by probing their hidden representations. We find that while LLMs encode the position of target information, they often fail to leverage this in generating accurate responses. This reveals a disconnect between information retrieval and utilization, a "know but don't tell" phenomenon. We further analyze the relationship between extraction tim…
Computer Science