Bryan Hooi
Echoless Label-Based Pre-computation for Memory-Efficient Heterogeneous Graph Learning
Heterogeneous Graph Neural Networks (HGNNs) are widely used for deep learning on heterogeneous graphs. Typical end-to-end HGNNs require repetitive message passing during training, limiting efficiency for large-scale real-world graphs. Pre-…
Backdoor-Powered Prompt Injection Attacks Nullify Defense Methods
Large language models (LLMs) now dominate downstream natural language processing (NLP) tasks. However, because of LLMs' instruction-following abilities and inability to distinguish the instruct…
How to Make Large Language Models Generate 100% Valid Molecules?
Molecule generation is key to drug discovery and materials science, enabling the design of novel compounds with specific properties. Large language models (LLMs) can learn to perform a wide range of tasks from just a few examples. However,…
RNA-FrameFlow: Flow Matching for de novo 3D RNA Backbone Design
We introduce RNA-FrameFlow, the first generative model for 3D RNA backbone design. We build upon flow matching for protein backbone generation and establish protocols for data preparation and evaluation to address unique challenges p…
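The RNA-FrameFlow abstract builds on flow matching. As a general illustration only (not the paper's actual model or code, which operates on 3D backbone frames), the core training targets in rectified-flow-style flow matching can be sketched as follows; all names here are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)

def flow_matching_targets(x0, x1, t):
    """Linear (rectified-flow) interpolation used in many flow-matching
    setups: x_t = (1 - t) * x0 + t * x1, with target velocity x1 - x0."""
    t = np.asarray(t).reshape(-1, 1)   # broadcast time over feature dims
    x_t = (1.0 - t) * x0 + t * x1      # point on the noise-to-data path
    v_target = x1 - x0                 # constant velocity along the path
    return x_t, v_target

# Toy example: 4 samples with 6 features each.
x0 = rng.normal(size=(4, 6))           # noise sample
x1 = rng.normal(size=(4, 6))           # data sample
t = rng.uniform(size=4)                # per-sample time in [0, 1]
x_t, v = flow_matching_targets(x0, x1, t)
# A model v_theta(x_t, t) would then be regressed onto v with an MSE loss.
```

At generation time, one integrates the learned velocity field from t = 0 to t = 1 starting from noise.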
Enabling Self-Improving Agents to Learn at Test Time With Human-In-The-Loop Guidance
Large language model (LLM) agents often struggle in environments where rules and required domain knowledge frequently change, such as regulatory compliance and user risk screening. Current approaches, like offline fine-tuning and standard …
Campolina: A Deep Neural Framework for Accurate Segmentation of Nanopore Signals
Nanopore sequencing enables real-time, long-read analysis by processing raw signals as they are produced. A key step, segmentation of signals into events, is typically handled by algorithmic methods that struggle in noisy regions. We present Campol…
NTSFormer: A Self-Teaching Graph Transformer for Multimodal Isolated Cold-Start Node Classification
Isolated cold-start node classification on multimodal graphs is challenging because such nodes have no edges and often have missing modalities (e.g., absent text or image features). Existing methods address structural isolation by degradin…
VPI-Bench: Visual Prompt Injection Attacks for Computer-Use Agents
Computer-Use Agents (CUAs) with full system access enable powerful task automation but pose significant security and privacy risks due to their ability to manipulate files, access user data, and execute arbitrary commands. While prior work…
MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research
Recent advancements in AI agents have demonstrated their growing potential to drive and support scientific discovery. In this work, we introduce MLR-Bench, a comprehensive benchmark for evaluating AI agents on open-ended machine learning r…
Efficient Reasoning via Chain of Unconscious Thought
Large Reasoning Models (LRMs) achieve promising performance but compromise token efficiency due to verbose reasoning processes. Unconscious Thought Theory (UTT) posits that complex problems can be solved more efficiently through internaliz…
Seeing Through Deception: Uncovering Misleading Creator Intent in Multimodal News with Vision-Language Models
The impact of misinformation arises not only from factual inaccuracies but also from the misleading narratives that creators deliberately embed. Interpreting such creator intent is therefore essential for multimodal misinformation detectio…
PhishIntel: Toward Practical Deployment of Reference-Based Phishing Detection
Safety in Large Reasoning Models: A Survey
Large Reasoning Models (LRMs) have exhibited extraordinary prowess in tasks like mathematics and coding, leveraging their advanced reasoning capabilities. Nevertheless, as these capabilities progress, significant concerns regarding their v…
Guiding VLM Agents with Process Rewards at Inference Time for GUI Navigation
Recent advancements in visual language models (VLMs) have notably enhanced their capabilities in handling complex Graphical User Interface (GUI) interaction tasks. Despite these improvements, current frameworks often struggle to generate c…
PhishAgent: A Robust Multimodal Agent for Phishing Webpage Detection
Phishing attacks are a major threat to online security, exploiting user vulnerabilities to steal sensitive information. Various methods have been developed to counteract phishing, each with varying levels of accuracy, but they also face no…
Modality-Independent Graph Neural Networks with Global Transformers for Multimodal Recommendation
Multimodal recommendation systems can learn users' preferences from existing user-item interactions as well as the semantics of multimodal data associated with items. Many existing methods model this through a multimodal user-item graph, a…
Geneshift: Impact of different scenario shift on Jailbreaking LLM
Jailbreak attacks, which aim to cause LLMs to perform unrestricted behaviors, have become a critical and challenging direction in AI safety. Despite achieving promising attack success rates under dictionary-based evaluation, existing ja…
UniGraph: Learning a Unified Cross-Domain Foundation Model for Text-Attributed Graphs
NodeImport: Imbalanced Node Classification with Node Importance Assessment
Words or Vision: Do Vision-Language Models Have Blind Faith in Text?
Vision-Language Models (VLMs) excel in integrating visual and textual information for vision-centric tasks, but their handling of inconsistencies between modalities is underexplored. We investigate VLMs' modality preferences when faced wit…
Fact or Guesswork? Evaluating Large Language Model's Medical Knowledge with Structured One-Hop Judgment
Large language models (LLMs) have been widely adopted in various downstream task domains. However, their ability to directly recall and apply factual medical knowledge remains under-explored. Most existing medical QA benchmarks assess comp…
Balancing Truthfulness and Informativeness with Uncertainty-Aware Instruction Fine-Tuning
Instruction fine-tuning (IFT) can increase the informativeness of large language models (LLMs), but may reduce their truthfulness. This trade-off arises because IFT steers LLMs to generate responses containing long-tail knowledge that was …
ReLearn: Unlearning via Learning for Large Language Models
Current unlearning methods for large language models usually rely on reverse optimization to reduce target token probabilities. However, this paradigm disrupts subsequent token prediction, degrading model performance and linguistic co…
UniGraph2: Learning a Unified Embedding Space to Bind Multimodal Graphs
Existing foundation models, such as CLIP, aim to learn a unified embedding space for multimodal data, enabling a wide range of downstream web-based applications like search, recommendation, and content classification. However, these models…
GuardReasoner: Towards Reasoning-based LLM Safeguards
As LLMs increasingly impact safety-critical applications, ensuring their safety using guardrails remains a key challenge. This paper proposes GuardReasoner, a new safeguard for LLMs, by guiding the guard model to learn to reason. Concretel…
CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs
Multimodal Large Language Models (MLLMs) still struggle with hallucinations despite their impressive capabilities. Recent studies have attempted to mitigate this by applying Direct Preference Optimization (DPO) to multimodal scenarios usin…
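The CHiP abstract builds on Direct Preference Optimization (DPO). As a general illustration only (not CHiP's cross-modal hierarchical objective), the standard per-pair DPO loss compares policy and reference log-probabilities of a chosen versus a rejected response; the function below is an illustrative sketch:

```python
import math

def dpo_loss(logp_w, logp_l, ref_logp_w, ref_logp_l, beta=0.1):
    """Standard DPO loss for one preference pair:
    -log sigmoid(beta * margin), where the margin compares policy vs.
    reference log-probs of the chosen (w) and rejected (l) responses."""
    margin = (logp_w - ref_logp_w) - (logp_l - ref_logp_l)
    return -math.log(1.0 / (1.0 + math.exp(-beta * margin)))

# If the policy already prefers the chosen response more strongly than
# the reference does, the margin is positive and the loss drops below
# log(2), its value at a zero margin.
loss = dpo_loss(logp_w=-5.0, logp_l=-9.0, ref_logp_w=-6.0, ref_logp_l=-8.0)
```

In practice the log-probabilities come from summing token log-probs of each full response under the policy and a frozen reference model.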
Spatio-Temporal Foundation Models: Vision, Challenges, and Opportunities
Foundation models have revolutionized artificial intelligence, setting new benchmarks in performance and enabling transformative capabilities across a wide range of vision and language tasks. However, despite the prevalence of spatio-tempo…
Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design
Handcrafting heuristics for solving complex optimization tasks (e.g., route planning and task allocation) is a common practice but requires extensive domain knowledge. Recently, Large Language Model (LLM)-based automatic heuristic design (…
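This entry applies Monte Carlo Tree Search (MCTS). As a general illustration only (not the paper's LLM-based heuristic-design procedure), the classic UCT rule that MCTS uses to balance exploiting high-value children against exploring rarely visited ones can be sketched as follows; the function name and tuple layout are illustrative:

```python
import math

def uct_select(children, c=1.4142):
    """Pick the index of the child maximizing the UCT score
    mean_value + c * sqrt(ln(parent_visits) / child_visits).
    `children` is a list of (total_value, visits) tuples; unvisited
    children are selected first."""
    parent_visits = sum(v for _, v in children)
    best_i, best_score = None, -math.inf
    for i, (total, visits) in enumerate(children):
        if visits == 0:
            return i                    # explore unvisited children first
        score = total / visits + c * math.sqrt(
            math.log(parent_visits) / visits)
        if score > best_score:
            best_i, best_score = i, score
    return best_i

# Child 1 has a lower mean value (0.8 vs 0.9) but far fewer visits,
# so the exploration bonus makes it the selected child here.
idx = uct_select([(9.0, 10), (0.8, 1)])
```

The exploration constant `c` trades off between the two terms; sqrt(2) is a common default.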
Can Knowledge Graphs Make Large Language Models More Trustworthy? An Empirical Study Over Open-ended Question Answering