Explanipedia

Unlocking the Power of Function Vectors for Characterizing and Mitigating Catastrophic Forgetting in Continual Instruction Tuning Open

Gangwei Jiang, Caigao Jiang, Zhaoyi Li, Siqiao Xue, Jun Zhou , et al. · 2025

Catastrophic forgetting (CF) poses a significant challenge in machine learning, where a model forgets previously learned information upon learning new tasks. Despite the advanced capabilities of Large Language Models (LLMs), they continue …

ROMAS: A Role-Based Multi-Agent System for Database monitoring and Planning Open

Yi Huang, Fangyin Cheng, Fan Zhou, Jiahui Li, Gong Jian , et al. · 2024

Computer science Business

In recent years, Large Language Models (LLMs) have demonstrated remarkable capabilities in data analytics when integrated with Multi-Agent Systems (MAS). However, these systems often struggle with complex tasks that involve diverse functio…

Refine Large Language Model Fine-tuning via Instruction Vector Open

Gangwei Jiang, Zhaoyi Li, Caigao Jiang, Siqiao Xue, Jun Zhou , et al. · 2024

Computer science Psychology Physics

Fine-tuning large language models (LLMs) can cause them to lose their general capabilities. However, the intrinsic mechanisms behind such forgetting remain unexplored. In this paper, we begin by examining this phenomenon by focusing on kno…

Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language Models Open

Siqiao Xue, Danrui Qi, Caigao Jiang, Wenhui Shi, Fangyin Cheng , et al. · 2024

Computer science

The recent breakthroughs in large language models (LLMs) are positioned to transition many areas of software. The technologies of interacting with data particularly have an important entanglement with LLMs as efficient and intuitive data i…

DB-GPT: Empowering Database Interactions with Private Large Language Models Open

Siqiao Xue, Caigao Jiang, Wenhui Shi, Fangyin Chen, Keting Chen , et al. · 2023

Computer science Biology

The recent breakthroughs in large language models (LLMs) are positioned to transition many areas of software. Database technologies particularly have an important entanglement with LLMs as efficient and intuitive database interactions are …

Towards Anytime Fine-tuning: Continually Pre-trained Language Models with Hypernetwork Prompt Open

Gangwei Jiang, Caigao Jiang, Siqiao Xue, James Y. Zhang, Jun Zhou , et al. · 2023

Computer science Mathematics Philosophy

Continual pre-training has been urgent for adapting a pre-trained model to a multitude of domains and tasks in the fast-evolving world. In practice, a continually pre-trained model is expected to demonstrate not only greater capacity when …

Prompt-augmented Temporal Point Process for Streaming Event Sequence Open

Siqiao Xue, Yan Wang, Zhixuan Chu, Xiaoming Shi, Caigao Jiang , et al. · 2023

Computer science Biology Philosophy

Neural Temporal Point Processes (TPPs) are the prevalent paradigm for modeling continuous-time event sequences, such as user activities on the web and financial transactions. In real-world applications, event data is typically received in …

Enhancing Asynchronous Time Series Forecasting with Contrastive Relational Inference Open

Yan Wang, Zhixuan Chu, Tao Zhou, Caigao Jiang, Hongyan Hao , et al. · 2023

Computer science Mathematics Physics

Asynchronous time series, also known as temporal event sequences, are the basis of many applications throughout different industries. Temporal point processes(TPPs) are the standard method for modeling such data. Existing TPP models have f…

WeaverBird: Empowering Financial Decision-Making with Large Language Model, Knowledge Base, and Search Engine Open

Siqiao Xue, Fan Zhou, Yi Xu, Hongyu Zhao, Shuo Xie , et al. · 2023

Computer science Business Mathematics

We present WeaverBird, an intelligent dialogue system designed specifically for the finance domain. Our system harnesses a large language model of GPT architecture that has been tuned using extensive corpora of finance-related text. As a r…

Continual Learning in Predictive Autoscaling Open

Hongyan Hao, Zhixuan Chu, Shiyi Zhu, Gangwei Jiang, Yan Wang , et al. · 2023

Computer science Engineering Mathematics

Predictive Autoscaling is used to forecast the workloads of servers and prepare the resources in advance to ensure service level objectives (SLOs) in dynamic cloud environments. However, in practice, its prediction task often suffers from …

EasyTPP: Towards Open Benchmarking Temporal Point Processes Open

Siqiao Xue, Xiaoming Shi, Zhixuan Chu, Yan Wang, Fan Zhou , et al. · 2023

Computer science Business Mathematics

Continuous-time event sequences play a vital role in real-world domains such as healthcare, finance, online shopping, social networks, and so on. To model such data, temporal point processes (TPPs) have emerged as the most natural and comp…

Towards Anytime Fine-tuning: Continually Pre-trained Language Models with Hypernetwork Prompts Open

Gangwei Jiang, Caigao Jiang, Siqiao Xue, James Zhang, Jun Zhou , et al. · 2023

Computer science Mathematics Philosophy

Continual pre-training has been urgent for adapting a pre-trained model to a multitude of domains and tasks in the fast-evolving world. In practice, a continually pre-trained model is expected to demonstrate not only greater capacity when …

Learning Large-scale Universal User Representation with Sparse Mixture of Experts Open

Caigao Jiang, Siqiao Xue, James Zhang, Lingyue Liu, Zhibo Zhu , et al. · 2022

Computer science Engineering Political science

Learning user sequence behaviour embedding is very sophisticated and challenging due to the complicated feature interactions over time and high dimensions of user features. Recent emerging foundation models, e.g., BERT and its variants, en…

Unit Ball Model for Embedding Hierarchical Structures in the Complex Hyperbolic Space Open

Huiru Xiao, Caigao Jiang, Yangqiu Song, James Zhang, Junwu Xiong · 2021

Mathematics Computer science

Learning the representation of data with hierarchical structures in the hyperbolic space attracts increasing attention in recent years. Due to the constant negative curvature, the hyperbolic space resembles tree metrics and captures the tr…

Caigao Jiang YOU? Author Swipe