Explanipedia

DPF-CM: A Data Processing Framework with Privacy-Preserving Vector Databases for Chinese Medical LLMs Training and Deployment Open

Wei Huang, Anda Cheng · 2025

Current open-source training pipelines for Chinese medical language models predominantly emphasize optimizing training methodologies to enhance the performance of large language models (LLMs), yet lack comprehensive exploration into traini…

CPA-RAG:Covert Poisoning Attacks on Retrieval-Augmented Generation in Large Language Models Open

Chunyang Li, Junwei Zhang, Anda Cheng, Zhuo Ma, Xinghua Li , et al. · 2025

Retrieval-Augmented Generation (RAG) enhances large language models (LLMs) by incorporating external knowledge, but its openness introduces vulnerabilities that can be exploited by poisoning attacks. Existing poisoning methods for RAG syst…

Information Leakage from Embedding in Large Language Models Open

Zhipeng Wang, Anda Cheng, Yinggui Wang, Lei Wang · 2024

Computer science Economics

The widespread adoption of large language models (LLMs) has raised concerns regarding data privacy. This study aims to investigate the potential for privacy invasion through input reconstruction attacks, in which a malicious model provider…

A Fast, Performant, Secure Distributed Training Framework For Large Language Model Open

Wei Huang, Yinggui Wang, Anda Cheng, Aihui Zhou, Chaofan Yu , et al. · 2024

Computer science Mathematics Economics

The distributed (federated) LLM is an important method for co-training the domain-specific LLM using siloed data. However, maliciously stealing model parameters and data from the server or client side has become an urgent problem to be sol…

HPN: Personalized Federated Hyperparameter Optimization Open

Anda Cheng, Zhen Wang, Yaliang Li, Jian Cheng · 2023

Computer science Biology

Numerous research studies in the field of federated learning (FL) have attempted to use personalization to address the heterogeneity among clients, one of FL's most crucial and challenging problems. However, existing works predominantly fo…

PKD: General Distillation Framework for Object Detectors via Pearson Correlation Coefficient Open

Weihan Cao, Yifan Zhang, Jianfei Gao, Anda Cheng, Ke Cheng , et al. · 2022

Computer science Mathematics Physics

Knowledge distillation(KD) is a widely-used technique to train compact models in object detection. However, there is still a lack of study on how to distill between heterogeneous detectors. In this paper, we empirically find that better FP…

DPNAS: Neural Architecture Search for Deep Learning with Differential Privacy Open

Anda Cheng, Jiaxing Wang, Xi Sheryl Zhang, Qiang Chen, Peisong Wang , et al. · 2022

Computer science Engineering Art

Training deep neural networks (DNNs) for meaningful differential privacy (DP) guarantees severely degrades model utility. In this paper, we demonstrate that the architecture of DNNs has a significant impact on model utility in the context …

Differentially Private Federated Learning with Local Regularization and Sparsification Open

Anda Cheng, Peisong Wang, Xi Sheryl Zhang, Jian Cheng · 2022

Computer science Mathematics Political science

User-level differential privacy (DP) provides certifiable privacy guarantees to the information that is specific to any user's data in federated learning. Existing methods that ensure user-level DP come at the cost of severe accuracy decre…

DPNAS: Neural Architecture Search for Deep Learning with Differential Privacy Open

Anda Cheng, Jiaxing Wang, Xi Sheryl Zhang, Qiang Chen, Peisong Wang , et al. · 2021

Computer science Engineering Art

Training deep neural networks (DNNs) for meaningful differential privacy (DP) guarantees severely degrades model utility. In this paper, we demonstrate that the architecture of DNNs has a significant impact on model utility in the context …

Location-aware Upsampling for Semantic Segmentation Open

Xiangyu He, Zitao Mo, Qiang Chen, Anda Cheng, Peisong Wang , et al. · 2019

Computer science

Many successful learning targets such as minimizing dice loss and cross-entropy loss have enabled unprecedented breakthroughs in segmentation tasks. Beyond these semantic metrics, this paper aims to introduce location supervision into sema…

SpatialFlow: Bridging All Tasks for Panoptic Segmentation Open

Qiang Chen, Anda Cheng, Xiangyu He, Peisong Wang, Jian Cheng · 2019

Computer science Geography Economics

Object location is fundamental to panoptic segmentation as it is related to all things and stuff in the image scene. Knowing the locations of objects in the image provides clues for segmenting and helps the network better understand the sc…

Anda Cheng YOU? Author Swipe