Explanipedia

STRUX: An LLM for Decision-Making with Structured Explanations Open

Yiming Lu, Yebowen Hu, Hassan Foroosh, Wei Jin, Fei Liu · 2024

Psychology Computer science

Countless decisions shape our daily lives, and it is paramount to understand the how and why behind these choices. In this paper, we introduce a new LLM decision-making framework called STRUX, which enhances LLM decision-making by providin…

When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives Open

Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Wenlin Yao , et al. · 2024

Computer science Psychology Art

Reasoning is most powerful when an LLM accurately aggregates relevant information. We examine the critical role of information aggregation in reasoning by requiring the LLM to analyze sports narratives. To succeed at this task, an LLM must…

BadRAG: Identifying Vulnerabilities in Retrieval Augmented Generation of Large Language Models Open

Jiaqi Xue, Mengxin Zheng, Yebowen Hu, Fei Liu, Xun Chen , et al. · 2024

Computer science

Large Language Models (LLMs) are constrained by outdated information and a tendency to generate incorrect data, commonly referred to as "hallucinations." Retrieval-Augmented Generation (RAG) addresses these limitations by combining the str…

Can Large Language Models do Analytical Reasoning? Open

Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Hassan Foroosh , et al. · 2024

Computer science Psychology

This paper explores the cutting-edge Large Language Model with analytical reasoning on sports. Our analytical reasoning embodies the tasks of letting large language models count how many points each team scores in a quarter in the NBA and …

SportsMetrics: Blending Text and Numerical Data to Understand Information Fusion in LLMs Open

Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Hassan Foroosh , et al. · 2024

Computer science History Political science

Large language models hold significant potential for integrating various data types, such as text documents and database records, for advanced analytics. However, blending text and numerical data presents substantial challenges. LLMs need …

DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4 Open

Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Hassan Foroosh , et al. · 2023

Psychology Computer science Political science

Human preference judgments are pivotal in guiding large language models (LLMs) to produce outputs that align with human values. Human evaluations are also used in summarization tasks to compare outputs from various systems, complementing e…

DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4 Open

Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Hassan Foroosh , et al. · 2023

Computer science Psychology Mathematics

Human preference judgments are pivotal in guiding large language models (LLMs) to produce outputs that align with human values. Human evaluations are also used in summarization tasks to compare outputs from various systems, complementing e…

Yebowen Hu YOU? Author Swipe