Bairu Hou
WebDART: Dynamic Decomposition and Re-planning for Complex Web Tasks
Large language model (LLM) agents are becoming competent at straightforward web tasks, such as opening an item page or submitting a form, but still struggle with objectives that require long-horizon navigation, large-scale information extr…
Research on Accident Type Prediction for New Energy Vehicles Based on the AS-Naive Bayes Algorithm
Developing new energy vehicles (NEVs) is a key strategy for achieving low-carbon and sustainable transportation. However, as the number of NEVs increases, traffic accidents involving these vehicles have risen sharply. To explore the charac…
Research on Target Detection Method for Intelligent Mobile Robots based on Machine Vision
Intelligent mobile robots use machine vision and deep learning to achieve environmental perception, path planning, and obstacle avoidance, effectively improving autonomous decision-making capabilities. In this paper, you only look once (YO…
KVLink: Accelerating Large Language Models via Efficient KV Cache Reuse
We describe KVLink, an approach for efficient key-value (KV) cache reuse in large language models (LLMs). In many LLM applications, different inputs can share overlapping context, such as the same retrieved document appearing in multiple q…
Instruction-Following Pruning for Large Language Models
With the rapid scaling of large language models (LLMs), structured pruning has become a widely used technique to learn efficient, smaller models from larger ones, delivering superior performance compared to training similarly sized models …
A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation
This paper focuses on the task of hallucination detection, which aims to determine the truthfulness of LLM-generated statements. To address this problem, a popular class of methods utilizes the LLM's self-consistencies in its beliefs in a s…
Advancing the Robustness of Large Language Models through Self-Denoised Smoothing
Although large language models (LLMs) have achieved significant success, their vulnerability to adversarial perturbations, including recent jailbreak attacks, has raised considerable concerns. However, the increasing size of these models a…
A Survey on Data Selection for Language Models
A major factor in the recent success of large language models is the use of enormous and ever-growing text datasets for unsupervised pre-training. However, naively training a model on all available data may not be optimal (or feasible), as…
Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing
Aligned large language models (LLMs) are vulnerable to jailbreaking attacks, which bypass the safeguards of targeted LLMs and fool them into generating objectionable content. While initial defenses show promise against token-based threat m…
Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling
Uncertainty decomposition refers to the task of decomposing the total uncertainty of a predictive model into aleatoric (data) uncertainty, resulting from inherent randomness in the data-generating process, and epistemic (model) uncertainty…
Certified Robustness for Large Language Models with Self-Denoising
Although large language models (LLMs) have achieved great success in vast real-world applications, their vulnerability to noisy inputs has significantly limited their use, especially in high-stakes environments. In these contexts, …
Improving Diffusion Models for Scene Text Editing with Dual Encoders
Scene text editing is a challenging task that involves modifying or inserting specified texts in an image while maintaining its natural and realistic appearance. Most previous approaches to this task rely on style-transfer models that crop…
PromptBoosting: Black-Box Text Classification with Ten Forward Passes
We describe PromptBoosting, a query-efficient procedure for building a text classifier from a neural language model (LM) without access to the LM's parameters, gradients, or hidden representations. This form of "black-box" classifier train…
TextGrad: Advancing Robustness Evaluation in NLP by Gradient-Driven Optimization
Robustness evaluation against adversarial examples has become increasingly important to unveil the trustworthiness of the prevailing deep models in natural language processing (NLP). However, in contrast to the computer vision domain where…
OpenAttack: An Open-source Textual Adversarial Attack Toolkit
Textual adversarial attacking has received wide and increasing attention in recent years. Various attack models have been proposed, which differ greatly and are implemented with different programming frameworks and settings. These fac…
Learning to Attack: Towards Textual Adversarial Attacking in Real-world Situations
Adversarial attacking aims to fool deep neural networks with adversarial examples. In the field of natural language processing, various textual adversarial attack models have been proposed, varying in the accessibility to the victim model.…
Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet
Word sense disambiguation (WSD) is a fundamental natural language processing task. Unsupervised knowledge-based WSD only relies on a lexical knowledge base as the sense inventory and has wider practical use than supervised WSD that require…