Daochen Zha
Self-Monitoring Large Language Models for Click-Through Rate Prediction
Click-through rate prediction tasks estimate interaction probabilities using user–item features (i.e., the combined set of user and item features). LLMs have emerged as a promising approach by organizing these features into prompts and fin…
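As context for the abstract above, a minimal sketch of serializing user-item features into a textual prompt for an LLM-based CTR predictor. This is a generic illustration, not the paper's method; the field names and question wording are hypothetical.

```python
def build_ctr_prompt(user_feats, item_feats):
    """Serialize user and item features into a textual prompt for an
    LLM-based CTR predictor (generic sketch; field names are hypothetical)."""
    fields = [f"{k}: {v}" for k, v in list(user_feats.items()) + list(item_feats.items())]
    return ("Given the following user and item features, will the user click "
            "the item? Answer Yes or No.\n" + "\n".join(fields))
```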
Graph-MLLM: Harnessing Multimodal Large Language Models for Multimodal Graph Learning
Multimodal Large Language Models (MLLMs) have demonstrated remarkable capabilities in representing and understanding diverse modalities. However, they typically focus on modality alignment in a pairwise manner while overlooking structural …
Large Language Models for Disease Diagnosis: A Scoping Review
Automatic disease diagnosis has become increasingly valuable in clinical practice. The advent of large language models (LLMs) has catalyzed a paradigm shift in artificial intelligence, with growing evidence supporting the efficacy of LLMs …
FinLoRA: Benchmarking LoRA Methods for Fine-Tuning LLMs on Financial Datasets
Low-rank adaptation (LoRA) methods show great potential for scaling pre-trained general-purpose Large Language Models (LLMs) to hundreds or thousands of use scenarios. However, their efficacy in high-stakes domains like finance is rarely e…
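For readers unfamiliar with the LoRA family benchmarked above, a minimal sketch of the core idea in plain Python: the frozen weight matrix W is adapted as W + (alpha/r) * B @ A, where A (r x d_in) and B (d_out x r) are small rank-r matrices and only they are trained. Dimensions and scaling here are illustrative, not any specific library's defaults.

```python
def matmul(X, Y):
    # Multiply two matrices given as lists of lists.
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def lora_forward(x, W, A, B, alpha, r):
    """LoRA forward pass: y = x @ (W + (alpha/r) * B @ A).T.
    W is frozen; only the low-rank adapters A and B are trainable."""
    delta = matmul(B, A)                      # low-rank update, rank <= r
    scale = alpha / r
    W_eff = [[w + scale * d for w, d in zip(wr, dr)] for wr, dr in zip(W, delta)]
    return matmul(x, [list(col) for col in zip(*W_eff)])  # x @ W_eff.T
```

When B is initialized to zeros, the adapted layer reproduces the frozen base model exactly, which is why LoRA fine-tuning starts from the pre-trained behavior.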
Beyond Pairwise Learning-To-Rank At Airbnb
There are three fundamental asks from a ranking algorithm: it should scale to handle a large number of items, sort items accurately by their utility, and impose a total order on the items for logical consistency. But here's the catch: no al…
Customized FinGPT Search Agents Using Foundation Models
Current large language models (LLMs) have proven useful for analyzing financial data, but most existing models, such as BloombergGPT and FinGPT, lack customization for specific user needs. In this paper, we address this gap by developin…
Differentially Private Low-Rank Adaptation of Large Language Model Using Federated Learning
The surge in interest and application of large language models (LLMs) has sparked a drive to fine-tune these models to suit specific applications, such as finance and medical science. However, concerns regarding data privacy have emerged, …
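The privacy mechanism commonly paired with federated fine-tuning is clip-then-noise on client updates, as in DP-SGD-style training. A minimal sketch of that Gaussian mechanism follows; the clip norm and noise multiplier are illustrative placeholders, not the paper's calibrated values.

```python
import math, random

def privatize_update(update, clip_norm=1.0, noise_multiplier=1.0, seed=None):
    """Clip a client update to L2 norm <= clip_norm, then add Gaussian noise
    with std = noise_multiplier * clip_norm (the standard Gaussian mechanism
    used in DP-SGD-style training; a sketch, not the paper's exact scheme)."""
    rng = random.Random(seed)
    norm = math.sqrt(sum(u * u for u in update))
    scale = min(1.0, clip_norm / norm) if norm > 0 else 1.0
    clipped = [u * scale for u in update]
    sigma = noise_multiplier * clip_norm
    return [c + rng.gauss(0.0, sigma) for c in clipped]
```

The server then averages the privatized updates, so no single client's raw gradient is ever revealed.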
A-I-RAVEN and I-RAVEN-Mesh: Two New Benchmarks for Abstract Visual Reasoning
We study generalization and knowledge reuse capabilities of deep neural networks in the domain of abstract visual reasoning (AVR), employing Raven's Progressive Matrices (RPMs), a recognized benchmark task for assessing AVR abilities. Two …
LTSM-Bundle: A Toolbox and Benchmark on Large Language Models for Time Series Forecasting
Time Series Forecasting (TSF) has long been a challenge in time series analysis. Inspired by the success of Large Language Models (LLMs), researchers are now developing Large Time Series Models (LTSMs): universal transformer-based models th…
GAugLLM: Improving Graph Contrastive Learning for Text-Attributed Graphs with Large Language Models
This work studies self-supervised graph learning for text-attributed graphs (TAGs) where nodes are represented by textual attributes. Unlike traditional graph contrastive methods that perturb the numerical feature space and alter the graph…
GraphFM: A Comprehensive Benchmark for Graph Foundation Model
Foundation Models (FMs) serve as a general class for the development of artificial intelligence systems, offering broad potential for generalization across a spectrum of downstream tasks. Despite extensive research into self-supervised lea…
Denoising-Aware Contrastive Learning for Noisy Time Series
Time series self-supervised learning (SSL) aims to exploit unlabeled data for pre-training to mitigate the reliance on labels. Despite the great success in recent years, there is limited discussion on the potential noise in the time series…
Cost-efficient Knowledge-based Question Answering with Large Language Models
Knowledge-based question answering (KBQA) is widely used in many scenarios that necessitate domain knowledge. Large language models (LLMs) bring opportunities to KBQA, but their costs are significantly higher and the absence of domain-specif…
DCAI: Data-centric Artificial Intelligence
The emergence of Data-centric AI (DCAI) represents a pivotal shift in AI development, redirecting focus from model refinement to prioritizing data quality. This paradigmatic transition emphasizes the critical role of data in AI. While past…
E2GNN: Efficient Graph Neural Network Ensembles for Semi-Supervised Classification
This work studies ensemble learning for graph neural networks (GNNs) under the popular semi-supervised setting. Ensemble learning has shown superiority in improving the accuracy and robustness of traditional machine learning by combining t…
Modality-Aware Integration with Large Language Models for Knowledge-based Visual Question Answering
Knowledge-based visual question answering (KVQA) has been extensively studied to answer visual questions with external knowledge, e.g., knowledge graphs (KGs). While several attempts have been proposed to leverage large language models (LL…
KnowGPT: Knowledge Graph based Prompting for Large Language Models
Large Language Models (LLMs) have demonstrated remarkable capabilities in many real-world applications. Nonetheless, LLMs are often criticized for their tendency to produce hallucinations, wherein the models fabricate incorrect statements …
Enhanced Generalization through Prioritization and Diversity in Self-Imitation Reinforcement Learning over Procedural Environments with Sparse Rewards
Exploration poses a fundamental challenge in Reinforcement Learning (RL) with sparse rewards, limiting an agent's ability to learn optimal decision-making due to a lack of informative feedback signals. Self-Imitation Learning (self-IL) has…
Tackling Diverse Minorities in Imbalanced Classification
Imbalanced datasets are commonly observed in various real-world applications, presenting significant challenges in training classifiers. When working with large datasets, the imbalanced issue can be further exacerbated, making it exception…
DiscoverPath: A Knowledge Refinement and Retrieval System for Interdisciplinarity on Biomedical Research
The exponential growth in scholarly publications necessitates advanced tools for efficient article retrieval, especially in interdisciplinary fields where diverse terminologies are used to describe similar research. Traditional keyword-bas…
FinGPT: Democratizing Internet-scale Data for Financial Large Language Models
Large language models (LLMs) have demonstrated remarkable proficiency in understanding and generating human-like texts, which may potentially revolutionize the finance industry. However, existing LLMs often fall short in the financial fiel…
Adaptive Popularity Debiasing Aggregator for Graph Collaborative Filtering
Graph neural network-based collaborative filtering (CF) models user-item interactions as a bipartite graph and performs iterative aggregation to enhance performance. Unfortunately, the aggregation process may amplify the popularity bia…
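The aggregation step being debiased can be illustrated with the standard symmetrically normalized neighbor aggregation used in graph CF, where sqrt-degree factors already shrink the contribution of popular (high-degree) nodes. This sketch shows that fixed normalization, not the paper's adaptive aggregator, whose details are truncated above.

```python
import math

def aggregate(adj, emb):
    """One round of symmetrically normalized aggregation on a user-item
    interaction graph: h_n = sum over neighbors m of emb[m] / sqrt(deg(n)*deg(m)).
    The sqrt-degree factors down-weight popular (high-degree) nodes; an
    adaptive debiasing aggregator would learn these weights instead."""
    deg = {n: len(nbrs) for n, nbrs in adj.items()}
    dim = len(next(iter(emb.values())))
    out = {}
    for n, nbrs in adj.items():
        h = [0.0] * dim
        for m in nbrs:
            w = 1.0 / math.sqrt(deg[n] * deg[m])
            h = [a + w * b for a, b in zip(h, emb[m])]
        out[n] = h
    return out
```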
OpenGSL: A Comprehensive Benchmark for Graph Structure Learning
Graph Neural Networks (GNNs) have emerged as the de facto standard for representation learning on graphs, owing to their ability to effectively integrate graph topology and node attributes. However, the inherent suboptimal nature of node c…
Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model
With the rapid growth in model size, fine-tuning the large pre-trained language model has become increasingly difficult due to its extensive memory usage. Previous works usually focus on reducing the number of trainable parameters in the n…
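The column-row sampling named in the title builds on classic randomized matrix multiplication: approximate A @ B by sampling a few column(A)/row(B) pairs and rescaling so the estimate is unbiased, trading accuracy for memory. The sketch below shows that generic estimator, not the paper's winner-take-all variant.

```python
import math, random

def sampled_matmul(A, B, k, seed=0):
    """Approximate A @ B by sampling k column(A)/row(B) index pairs with
    probability proportional to ||A[:,t]|| * ||B[t,:]||, rescaled by 1/(k*p_t)
    so the estimate is unbiased (classic randomized matrix multiplication;
    illustrative of column-row sampling, not the paper's exact scheme)."""
    rng = random.Random(seed)
    n = len(B)  # inner dimension
    norms = []
    for t in range(n):
        col_a = math.sqrt(sum(row[t] ** 2 for row in A))
        row_b = math.sqrt(sum(v ** 2 for v in B[t]))
        norms.append(col_a * row_b)
    total = sum(norms)
    probs = [w / total for w in norms]
    rows, cols = len(A), len(B[0])
    C = [[0.0] * cols for _ in range(rows)]
    for _ in range(k):
        t = rng.choices(range(n), weights=probs)[0]
        w = 1.0 / (k * probs[t])
        for i in range(rows):
            for j in range(cols):
                C[i][j] += w * A[i][t] * B[t][j]
    return C
```

In backpropagation, replacing the exact gradient matmul with such a sampled estimate means only the sampled columns and rows need to be kept in memory.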
Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models
Sharding a large machine learning model across multiple devices to balance the costs is important in distributed training. This is challenging because partitioning is NP-hard, and estimating the costs accurately and efficiently is difficul…
Dynamic Datasets and Market Environments for Financial Reinforcement Learning
The financial market is a particularly challenging playground for deep reinforcement learning due to its unique feature of dynamic datasets. Building high-quality market environments for training financial reinforcement learning (FinRL) ag…