Explanipedia

A Comprehensive Survey on Multimodal RAG: All Combinations of Modalities as Input and Output

Rui Zhang, Chen Liu, Ruixuan Li, Philip S. Yu · 2025

ScaleFormer: Span Representation Cumulation for Long-Context Transformer

Du Jiang, Philip S. Yu · 2025

The quadratic complexity of standard self-attention severely limits the application of Transformer-based models to long-context tasks. While efficient Transformer variants exist, they often require architectural changes and costly pre-trai…

Jailbreaking LLMs Through Alignment Vulnerabilities in Out-of-Distribution Settings

Yue Huang, Jingyu Tang, Dongping Chen, Bingda Tang, Yao Wan, et al. · 2025

Frontiers in Graph Machine Learning for the Large Model Era

Qingyun Sun, Ziwei Zhang, Xingcheng Fu, Yangqiu Song, Jianxin Li, et al. · 2025

InterFormer: Effective Heterogeneous Interaction Learning for Click-Through Rate Prediction

Zhichen Zeng, Xiaolong Liu, Mengyue Hang, Xiaoyi Liu, Qinghai Zhou, et al. · 2025

Dialogues Aspect-based Sentiment Quadruple Extraction via Structural Entropy Minimization Partitioning

Kun Peng, Cong Cao, Hao Peng, Zhifeng Hao, Lei Jiang, et al. · 2025

LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey

Henry Peng Zou, Wei Huang, Yaozu Wu, Yankai Chen, Chunyan Miao, et al. · 2025

The Emerged Security and Privacy of LLM Agent: A Survey with Case Studies

He Feng, Tianqing Zhu, Dayong Ye, Bo Liu, Wanlei Zhou, et al. · 2025

Inspired by the rapid development of Large Language Models (LLMs), LLM agents have evolved to perform complex tasks. LLM agents are now extensively applied across various domains, handling vast amounts of data to interact with humans and e…

The Impact of Digital Transformation on Financing Constraints

Philip S. Yu · 2025

In the era of the digital economy, the digital transformation of enterprises has a profound impact on their financing capabilities. By comprehensively reviewing relevant studies, this paper clarifies the concepts of digital transformation …

Deeper with Riemannian Geometry: Overcoming Oversmoothing and Oversquashing for Graph Foundation Models

Li Sun, Zhenhao Huang, Ming Zhang, Philip S. Yu · 2025

Message Passing Neural Networks (MPNNs) are the building blocks of graph foundation models, but fundamentally suffer from oversmoothing and oversquashing. There has recently been a surge of interest in fixing both issues. Existing efforts pr…

Paper2Web: Let's Make Your Paper Alive!

Yuhang Chen, Tingting Lv, Siyi Zhang, Yifang Yin, Yao Wan, et al. · 2025

Academic project websites can more effectively disseminate research when they clearly present core content and enable intuitive navigation and interaction. However, current approaches such as direct Large Language Model (LLM) generation, t…

Global-focal Adaptation with Information Separation for Noise-robust Transfer Fault Diagnosis

Junyu Ren, Wensheng Gan, Guangyu Zhang, Wei Zhong, Philip S. Yu · 2025

Existing transfer fault diagnosis methods typically assume either clean data or sufficient domain similarity, which limits their effectiveness in industrial environments where severe noise interference and domain shifts coexist. To address…

RoBCtrl: Attacking GNN-Based Social Bot Detectors via Reinforced Manipulation of Bots Control Interaction

Yingguang Yang, Xianghua Zeng, Q. M. Jonathan Wu, Hao Peng, Yutong Xia, et al. · 2025

Social networks have become a crucial source of real-time information for individuals. The influence of social bots within these platforms has garnered considerable attention from researchers, leading to the development of numerous detecti…

DeepResearchGuard: Deep Research with Open-Domain Evaluation and Multi-Stage Guardrails for Safety

Henry Peng Zou, Dongyuan Li, Andrea Zangari, Jing Guo, Chunyan Miao, et al. · 2025

Deep research frameworks have shown promising capabilities in synthesizing comprehensive reports from web sources. While deep research possesses significant potential to address complex issues through planning and research cycles, existing…

Reinforcement Learning from Probabilistic Forecasts for Safe Decision-Making via Conditional Value-at-Risk Planning

Michal Koren, Philip S. Yu · 2025

Sequential decisions in volatile, high-stakes settings require more than maximizing expected return; they require principled uncertainty management. This paper presents the Uncertainty-Aware Markov Decision Process (UAMDP), a unified frame…

AgentDR: Dynamic Recommendation with Implicit Item-Item Relations via LLM-based Agents

Mingdai Yang, Nurendra Choudhary, Jiangshu Du, Edward Huang, Philip S. Yu, et al. · 2025

Recent agent-based recommendation frameworks aim to simulate user behaviors by incorporating memory mechanisms and prompting strategies, but they struggle with hallucinating non-existent items and full-catalog ranking. Besides, a largely u…

RECODE-H: A Benchmark for Research Code Development with Interactive Human Feedback

Chunyan Miao, Henry Peng Zou, Yangning Li, Yankai Chen, Yibo Wang, et al. · 2025

Large language models (LLMs) show promise in supporting scientific research implementation, yet their ability to generate correct and executable code remains limited. Existing works largely adopt one-shot settings, ignoring the iterati…

Glocal Information Bottleneck for Time Series Imputation

Jie Yang, Kexin Zhang, Guoqiang Zhang, Philip S. Yu, Kan Ding · 2025

Time Series Imputation (TSI), which aims to recover missing values in temporal data, remains a fundamental challenge due to the complex and often high-rate missingness in real-world scenarios. Existing models typically optimize the point-w…

AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning

Zhenyu Pan, Yiting Zhang, Zhuo Liu, Y. Tang, Zeliang Zhang, et al. · 2025

LLM-based multi-agent systems excel at planning, tool use, and role coordination, but their openness and interaction complexity also expose them to jailbreak, prompt-injection, and adversarial collaboration. Existing defenses fall into two…

PSG-Agent: Personality-Aware Safety Guardrail for LLM-based Agents

Yaozu Wu, Juan Guo, Dongyuan Li, Henry Peng Zou, Wei‐Chieh Huang, et al. · 2025

Effective guardrails are essential for safely deploying LLM-based agents in critical applications. Despite recent advances, existing guardrails suffer from two fundamental limitations: (i) they apply uniform guardrail policies to all users…

GraphIFE: Rethinking Graph Imbalance Node Classification via Invariant Learning

F. Zeng, Wensheng Gan, Philip S. Yu · 2025

The class imbalance problem refers to the disproportionate distribution of samples across different classes within a dataset, where the minority classes are significantly underrepresented. This issue is also prevalent in graph-structured d…

Revisiting Multivariate Time Series Forecasting with Missing Values

Jie Yang, Y. Hu, Kexin Zhang, L. L. Niu, Yushun Dong, et al. · 2025

Missing values are common in real-world time series, and multivariate time series forecasting with missing values (MTSF-M) has become a crucial area of research for ensuring reliable predictions. To address the challenge of missing data, c…

Advances in Large Language Models for Medicine

Wensheng Gan, Philip S. Yu · 2025

Artificial intelligence (AI) technology has advanced rapidly in recent years, with large language models (LLMs) emerging as a significant breakthrough. LLMs are increasingly making an impact across various industries, with the medical fiel…

Utility-based Privacy Preserving Data Mining

Qingfeng Zhou, Wensheng Gan, Zhenlian Qi, Philip S. Yu · 2025

With the advent of big data, periodic pattern mining has demonstrated significant value in real-world applications, including smart home systems, healthcare systems, and the medical field. However, advances in network technology have enabl…

Teaching According to Talents! Instruction Tuning LLMs with Competence-Aware Curriculum Learning

Tingwei Lu, Wei‐Chieh Huang, Wenhao Jiang, Hui Wang, Hai-Tao Zheng, et al. · 2025

Efficient instruction tuning aims to enhance the ultimate performance of large language models (LLMs) trained on a given instruction dataset. Curriculum learning as a typical data organization strategy has shown preliminary effectiveness i…

Unique Security and Privacy Threats of Large Language Models: A Comprehensive Survey

Shang Wang, Tianqing Zhu, Bo Liu, Ming Ding, Dayong Ye, et al. · 2025

With the rapid development of artificial intelligence, large language models (LLMs) have made remarkable advancements in natural language processing. These models are trained on vast datasets to exhibit powerful language understanding and …

MarkDiffusion: An Open-Source Toolkit for Generative Watermarking of Latent Diffusion Models

Leyi Pan, Sheng Guan, Zheyu Fu, Lipeng Si, Zian Wang, et al. · 2025

We introduce MarkDiffusion, an open-source Python toolkit for generative watermarking of latent diffusion models. It comprises three key components: a unified implementation framework for streamlined watermarking algorithm integrations and…

SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation

Weizhi Zhang, Liangwei Yang, Zihe Song, Henry Peng Zou, Ke Xu, et al. · 2025

ADKGD: Anomaly Detection in Knowledge Graphs with Dual-Channel Training

Jiayang Wu, Wensheng Gan, Jiahao Zhang, Philip S. Yu · 2025

In the current development of large language models (LLMs), it is important to ensure the accuracy and reliability of the underlying data sources. LLMs are critical for various applications, but they often suffer from hallucinations and in…

Can Large Language Models Serve as Evaluators for Code Summarization?

Yang Wu, Yao Wan, Zhaoyang Chu, Wenting Zhao, Ye Liu, et al. · 2025
