Explanipedia

Differentially Private Relational Learning with Entity-level Privacy Guarantees Open

Yinan Huang, Haoteng Yin, Eli Chien, Rongzhe Wei, Pan Li · 2025

Learning with relational and network-structured data is increasingly vital in sensitive domains where protecting the privacy of individual entities is paramount. Differential Privacy (DP) offers a principled approach for quantifying privac…

Do LLMs Really Forget? Evaluating Unlearning with Knowledge Correlation and Confidence Awareness Open

Rongzhe Wei, Peizhi Niu, Hui‐Chin Hsu, Ruihan Wu, Haoteng Yin, et al. · 2025

Machine unlearning techniques aim to mitigate unintended memorization in large language models (LLMs). However, existing approaches predominantly focus on the explicit removal of isolated facts, often overlooking latent inferential depende…

Underestimated Privacy Risks for Minority Populations in Large Language Model Unlearning Open

Rongzhe Wei, Michelle Li, Mohsen Ghassemi, Eleonora Kreačić, Yifan Li, et al. · 2024

Large Language Models (LLMs) embed sensitive, human-generated data, prompting the need for unlearning methods. Although certified unlearning offers strong privacy guarantees, its restrictive assumptions make it unsuitable for LLMs, giving …

LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation Open

Mufei Li, Viraj Shitole, Eli Chien, Changhai Man, Zhaodong Wang, et al. · 2024

Computer science Mathematics Physics

Directed acyclic graphs (DAGs) serve as crucial data representations in domains such as hardware synthesis and compiler/program optimization for computing systems. DAG generative models facilitate the creation of synthetic DAGs, which can …

Privately Learning from Graphs with Applications in Fine-tuning Large Language Models Open

Haoteng Yin, Rongzhe Wei, Eli Chien, Pan Li · 2024

Computer science Psychology Philosophy

Graphs offer unique insights into relationships between entities, complementing data modalities like text and images and enabling AI models to extend their capabilities beyond traditional tasks. However, learning from graphs often involves…

Differentially Private Graph Diffusion with Applications in Personalized PageRanks Open

Rongzhe Wei, Eli Chien, Pan Li · 2024

Computer science Business Physics

Graph diffusion, which iteratively propagates real-valued substances among the graph, is used in numerous graph/network-involved applications. However, releasing diffusion vectors may reveal sensitive linking information in the data such a…

Multifaceted roles of cohesin in regulating transcriptional loops Open

Minji Kim, Ping Wang, Patricia A. Clow, Eli Chien, Xiaotao Wang, et al. · 2024

Biology Philosophy

Cohesin is required for chromatin loop formation. However, its precise role in regulating gene transcription remains largely unknown. We investigated the relationship between cohesin and RNA Polymerase II (RNAPII) using single-molecule map…

Certified Machine Unlearning via Noisy Stochastic Gradient Descent Open

Eli Chien, Haoyu Wang, Ziang Chen, Pan Li · 2024

Computer science Physics Mathematics

"The right to be forgotten" ensured by laws for user data privacy becomes increasingly important. Machine unlearning aims to efficiently remove the effect of certain data points on the trained model parameters so that it can be approxima…

Machine Unlearning of Pre-trained Large Language Models Open

Yao Jin, Eli Chien, Minxin Du, Xinyao Niu, Tianhao Wang, et al. · 2024

Computer science Philosophy

This study investigates the concept of the 'right to be forgotten' within the context of large language models (LLMs). We explore machine unlearning as a pivotal solution, with a focus on pre-trained models, a notably under-researched area…

Langevin Unlearning: A New Perspective of Noisy Gradient Descent for Machine Unlearning Open

Eli Chien, Haoyu Wang, Ziang Chen, Pan Li · 2024

Computer science Biology Political science

Machine unlearning has raised significant interest with the adoption of laws ensuring the "right to be forgotten". Researchers have provided a probabilistic notion of approximate unlearning under a similar definition of Differential Priv…

Machine Unlearning of Pre-trained Large Language Models Open

Yao Jin, Eli Chien, Minxin Du, Xinyao Niu, Tianhao Wang, et al. · 2024

Computer science

The 62nd Annual Meeting of the Association for Computational Linguistics, Bangkok, Thailand, August 11-16, 2024

Breaking the Trilemma of Privacy, Utility, Efficiency via Controllable Machine Unlearning Open

Zheyuan Liu, Guangyao Dou, Yijun Tian, Chunhui Zhang, Eli Chien, et al. · 2023

Computer science Mathematics Biology

Machine Unlearning (MU) algorithms have become increasingly critical due to the imperative adherence to data privacy regulations. The primary objective of MU is to erase the influence of specific data samples on a given model without the n…

On the Inherent Privacy Properties of Discrete Denoising Diffusion Models Open

Rongzhe Wei, Eleonora Kreačić, Haoyu Wang, Haoteng Yin, Eli Chien, et al. · 2023

Computer science Mathematics Physics

Privacy concerns have led to a surge in the creation of synthetic datasets, with diffusion models emerging as a promising avenue. Although prior studies have performed empirical evaluations on these models, there has been a gap in providin…

Federated Classification in Hyperbolic Spaces via Secure Aggregation of Convex Hulls Open

Saurav Prakash, Jin Sima, Chao Pan, Eli Chien, Olgica Milenković · 2023

Mathematics Computer science

Hierarchical and tree-like data sets arise in many applications, including language processing, graph data mining, phylogeny and genomics. It is known that tree-like data cannot be embedded into Euclidean spaces of finite dimension with sm…

Differentially Private Decoupled Graph Convolutions for Multigranular Topology Protection Open

Eli Chien, Weining Chen, Chao Pan, Pan Li, Ayfer Özgür, et al. · 2023

Computer science Mathematics Engineering

GNNs can inadvertently expose sensitive user information and interactions through their model predictions. To address these privacy concerns, Differential Privacy (DP) protocols are employed to control the trade-off between provable privac…

Representer Point Selection for Explaining Regularized High-dimensional Models Open

Che-Ping Tsai, Jiong Zhang, Eli Chien, Hsiang‐Fu Yu, Cho‐Jui Hsieh, et al. · 2023

Computer science Mathematics Biology

We introduce a novel class of sample-based explanations we term high-dimensional representers, that can be used to explain the predictions of a regularized high-dimensional model in terms of importance weights for each of the training samp…

PINA: Leveraging Side Information in eXtreme Multi-label Classification via Predicted Instance Neighborhood Aggregation Open

Eli Chien, Jiong Zhang, Cho‐Jui Hsieh, Jyun‐Yu Jiang, Wei-Cheng Chang, et al. · 2023

Computer science Geography Physics

The eXtreme Multi-label Classification (XMC) problem seeks to find relevant labels from an exceptionally large label space. Most of the existing XMC learners focus on the extraction of semantic features from input query text. However, conv…

Unlearning Graph Classifiers with Limited Data Resources Open

Chao Pan, Eli Chien, Olgica Milenković · 2022

Computer science Mathematics Business

As the demand for user privacy grows, controlled data removal (machine unlearning) is becoming an important feature of machine learning models for data-sensitive Web applications such as social networks and recommender systems. Nevertheles…

Certified Graph Unlearning Open

Eli Chien, Chao Pan, Olgica Milenković · 2022

Computer science Business

Graph-structured data is ubiquitous in practice and often processed using graph neural networks (GNNs). With the adoption of recent laws ensuring the "right to be forgotten", the problem of graph data removal has become of significant im…

HyperAid: Denoising in hyperbolic spaces for tree-fitting and hierarchical clustering Open

Eli Chien, Puoya Tabaghi, Olgica Milenković · 2022

Computer science Mathematics Economics

The problem of fitting distances by tree-metrics has received significant attention in the theoretical computer science and machine learning communities alike, due to many applications in natural language processing, phylogeny, cancer geno…

Small-Sample Estimation of the Mutational Support and Distribution of SARS-CoV-2 Open

Vishal Rana, Eli Chien, Jianhao Peng, Olgica Milenković · 2022

Biology Medicine Mathematics

We consider the problem of determining the mutational support and distribution of the SARS-CoV-2 viral genome in the small-sample regime. The mutational support refers to the unknown number of sites that may eventually mutate in the SARS-C…

Provably Accurate and Scalable Linear Classifiers in Hyperbolic Spaces Open

Chao Pan, Eli Chien, Puoya Tabaghi, Jianhao Peng, Olgica Milenković · 2022

Computer science Mathematics

Many high-dimensional practical data sets have hierarchical structures induced by graphs or time series. Such data sets are hard to process in Euclidean spaces and one often seeks low-dimensional embeddings in other space forms to perform …

Node Feature Extraction by Self-Supervised Multi-scale Neighborhood Prediction Open

Eli Chien, Wei-Cheng Chang, Cho‐Jui Hsieh, Hsiang‐Fu Yu, Jiong Zhang, et al. · 2021

Computer science

Learning on graphs has attracted significant attention in the learning community due to numerous real-world applications. In particular, graph neural networks (GNNs), which take numerical node features and graph structure as inputs, have b…

Landing Probabilities of Random Walks for Seed-Set Expansion in Hypergraphs Open

Eli Chien, Pan Li, Olgica Milenković · 2021

Mathematics Computer science

We describe the first known mean-field study of landing probabilities for random walks on hypergraphs. In particular, we examine clique-expansion and tensor methods and evaluate their mean-field characteristics over a class of random hyper…

Highly Scalable and Provably Accurate Classification in Poincaré Balls Open

Eli Chien, Chao Pan, Puoya Tabaghi, Olgica Milenković · 2021

Computer science Mathematics

Many high-dimensional and large-volume data sets of practical relevance have hierarchical structures induced by trees, graphs or time series. Such data sets are hard to process in Euclidean spaces and one often seeks low-dimensional embedd…

Support Estimation with Sampling Artifacts and Errors Open

Eli Chien, Olgica Milenković, Angelia Nedich · 2021

Computer science Mathematics Economics

The problem of estimating the support of a distribution is of great importance in many areas of machine learning, computer science, physics and biology. Most of the existing work in this domain has focused on settings that assume perfectly…

You are AllSet: A Multiset Function Framework for Hypergraph Neural Networks Open

Eli Chien, Chao Pan, Jianhao Peng, Olgica Milenković · 2021

Computer science Mathematics Engineering

Hypergraphs are used to model higher-order interactions amongst agents and there exist many practically relevant instances of hypergraph datasets. To enable efficient processing of hypergraph-structured data, several hypergraph neural netw…

Linear Classifiers in Mixed Constant Curvature Spaces Open

Puoya Tabaghi, Eli Chien, Chao Pan, Olgica Milenković · 2021

Mathematics Computer science

Embedding methods for mixed-curvature spaces are powerful techniques for low-distortion and low-dimensional representation of complex data structures. Nevertheless, little is known regarding downstream learning and optimization in the embe…
