Xuemin Lin
YOU?
Author Swipe
View article: Approximate Nearest Neighbor Search of Large Scale Vectors on Distributed Storage
Approximate Nearest Neighbor Search of Large Scale Vectors on Distributed Storage Open
Approximate Nearest Neighbor Search (ANNS) in high-dimensional space is an essential operator in many online services, such as information retrieval and recommendation. Indices constructed by the state-of-the-art ANNS algorithms must be st…
View article: Accelerating K-Core Computation in Temporal Graphs
Accelerating K-Core Computation in Temporal Graphs Open
We address the problem of enumerating all temporal k-cores given a query time range and a temporal graph, which suffers from poor efficiency and scalability in the state-of-the-art solution. Motivated by an existing concept called core tim…
View article: DHG-Bench: A Comprehensive Benchmark for Deep Hypergraph Learning
DHG-Bench: A Comprehensive Benchmark for Deep Hypergraph Learning Open
Deep graph models have achieved great success in network representation learning. However, their focus on pairwise relationships restricts their ability to learn pervasive higher-order interactions in real-world systems, which can be natur…
View article: GTRSS: Graph-based Top-$k$ Representative Similar Subtrajectory Query
GTRSS: Graph-based Top-$k$ Representative Similar Subtrajectory Query Open
Trajectory mining has attracted significant attention. This paper addresses the Top-k Representative Similar Subtrajectory Query (TRSSQ) problem, which aims to find the k most representative subtrajectories similar to a query. Existing met…
View article: Editorial for Special Issue: VLDB 2022
Editorial for Special Issue: VLDB 2022 Open
View article: Picking point localization method based on semantic reasoning for complex picking scenarios in vineyards
Picking point localization method based on semantic reasoning for complex picking scenarios in vineyards Open
View article: Ranking on Dynamic Graphs: An Effective and Robust Band-Pass Disentangled Approach
Ranking on Dynamic Graphs: An Effective and Robust Band-Pass Disentangled Approach Open
View article: Covering K-Cliques in Billion-Scale Graphs
Covering K-Cliques in Billion-Scale Graphs Open
View article: UniDyG: A Unified and Effective Representation Learning Approach for Large Dynamic Graphs
UniDyG: A Unified and Effective Representation Learning Approach for Large Dynamic Graphs Open
Dynamic graphs are formulated in continuous-time or discrete-time dynamic graphs. They differ in temporal granularity: Continuous-Time Dynamic Graphs (CTDGs) exhibit rapid, localized changes, while Discrete-Time Dynamic Graphs (DTDGs) show…
View article: Infinite Stream Estimation under Personalized <i>w</i> -Event Privacy
Infinite Stream Estimation under Personalized <i>w</i> -Event Privacy Open
Streaming data collection is indispensable for stream data analysis, such as event monitoring. However, publishing these data directly leads to privacy leaks. w -event privacy is a valuable tool to protect individual privacy within a given…
View article: Ensemble-based Deep Multilayer Community Search
Ensemble-based Deep Multilayer Community Search Open
Multilayer graphs, consisting of multiple interconnected layers, are widely used to model diverse relationships in the real world. A community is a cohesive subgraph that offers valuable insights for analyzing (multilayer) graphs. Recently…
View article: On LLM-Enhanced Mixed-Type Data Imputation with High-Order Message Passing
On LLM-Enhanced Mixed-Type Data Imputation with High-Order Message Passing Open
Missing data imputation, which aims to impute the missing values in the raw datasets to achieve the completeness of datasets, is crucial for modern data-driven models like large language models (LLMs) and has attracted increasing interest …
View article: Common Neighborhood Estimation over Bipartite Graphs under Local Differential Privacy
Common Neighborhood Estimation over Bipartite Graphs under Local Differential Privacy Open
Bipartite graphs, formed by two vertex layers, arise as a natural fit for modeling the relationships between two groups of entities. In bipartite graphs, common neighborhood computation between two vertices on the same vertex layer is a ba…
View article: Counting Butterflies over Streaming Bipartite Graphs with Duplicate Edges
Counting Butterflies over Streaming Bipartite Graphs with Duplicate Edges Open
Bipartite graphs are commonly used to model relationships between two distinct entities in real-world applications, such as user-product interactions, user-movie ratings and collaborations between authors and publications. A butterfly (a 2…
View article: Efficient Dynamic Attributed Graph Generation
Efficient Dynamic Attributed Graph Generation Open
Data generation is a fundamental research problem in data management due to its diverse use cases, ranging from testing database engines to data-specific applications. However, real-world entities often involve complex interactions that ca…
View article: StructRide: A Framework to Exploit the Structure Information of Shareability Graph in Ridesharing
StructRide: A Framework to Exploit the Structure Information of Shareability Graph in Ridesharing Open
Ridesharing services play an essential role in modern transportation, which significantly reduces traffic congestion and exhaust pollution. In the ridesharing problem, improving the sharing rate between riders can not only save the travel …
View article: Contextual Representation Anchor Network to Alleviate Selection Bias in Few-Shot Drug Discovery
Contextual Representation Anchor Network to Alleviate Selection Bias in Few-Shot Drug Discovery Open
In the drug discovery process, the low success rate of drug candidate screening often leads to insufficient labeled data, causing the few-shot learning problem in molecular property prediction. Existing methods for few-shot molecular prope…
View article: TCGU: Data-centric Graph Unlearning based on Transferable Condensation
TCGU: Data-centric Graph Unlearning based on Transferable Condensation Open
With growing demands for data privacy and model robustness, graph unlearning (GU), which erases the influence of specific data on trained GNN models, has gained significant attention. However, existing exact unlearning methods suffer from …
View article: EntropyStop: Unsupervised Deep Outlier Detection with Loss Entropy
EntropyStop: Unsupervised Deep Outlier Detection with Loss Entropy Open
Unsupervised Outlier Detection (UOD) is an important data mining task. With the advance of deep learning, deep Outlier Detection (OD) has received broad interest. Most deep UOD models are trained exclusively on clean datasets to learn the …
View article: Simpler is More: Efficient Top-K Nearest Neighbors Search on Large Road Networks
Simpler is More: Efficient Top-K Nearest Neighbors Search on Large Road Networks Open
Top-k Nearest Neighbors (kNN) problem on road network has numerous applications on location-based services. As direct search using the Dijkstra's algorithm results in a large search space, a plethora of complex-index-based approaches have …
View article: Efficient Maximal Frequent Group Enumeration in Temporal Bipartite Graphs
Efficient Maximal Frequent Group Enumeration in Temporal Bipartite Graphs Open
Cohesive subgraph mining is a fundamental problem in bipartite graph analysis. In reality, relationships between two types of entities often occur at some specific timestamps, which can be modeled as a temporal bipartite graph. However, th…
View article: Efficient Influence Minimization via Node Blocking
Efficient Influence Minimization via Node Blocking Open
Given a graph G, a budget k and a misinformation seed set S, Influence Minimization (IMIN) via node blocking aims to find a set of k nodes to be blocked such that the expected spread of S is minimized. This problem finds important applicat…
View article: Deep Overlapping Community Search via Subspace Embedding
Deep Overlapping Community Search via Subspace Embedding Open
Overlapping Community Search (OCS) identifies nodes that interact with multiple communities based on a specified query. Existing community search approaches fall into two categories: algorithm-based models and Machine Learning-based (ML) m…
View article: Hypergraph Self-supervised Learning with Sampling-efficient Signals
Hypergraph Self-supervised Learning with Sampling-efficient Signals Open
Self-supervised learning (SSL) provides a promising alternative for representation learning on hypergraphs without costly labels. However, existing hypergraph SSL models are mostly based on contrastive methods with the instance-level discr…
View article: A Survey of Distributed Graph Algorithms on Massive Graphs
A Survey of Distributed Graph Algorithms on Massive Graphs Open
Distributed processing of large-scale graph data has many practical applications and has been widely studied. In recent years, a lot of distributed graph processing frameworks and algorithms have been proposed. While many efforts have been…
View article: Efficient Unsupervised Community Search with Pre-trained Graph Transformer
Efficient Unsupervised Community Search with Pre-trained Graph Transformer Open
Community search has aroused widespread interest in the past decades. Among existing solutions, the learning-based models exhibit outstanding performance in terms of accuracy by leveraging labels to 1) train the model for community score l…
View article: Neural Attributed Community Search at Billion Scale
Neural Attributed Community Search at Billion Scale Open
Community search has been extensively studied in the past decades. In recent years, there is a growing interest in attributed community search that aims to identify a community based on both the query nodes and query attributes. A set of t…
View article: Simple and deep graph attention networks
Simple and deep graph attention networks Open
Graph Attention Networks (GATs) and Graph Convolutional Neural Networks (GCNs) are two state-of-the-art architectures in Graph Neural Networks (GNNs). It is well known that both models suffer from performance degradation when more GNN laye…
View article: Wait to be Faster: a Smart Pooling Framework for Dynamic Ridesharing
Wait to be Faster: a Smart Pooling Framework for Dynamic Ridesharing Open
Ridesharing services, such as Uber or Didi, have attracted considerable attention in recent years due to their positive impact on environmental protection and the economy. Existing studies require quick responses to orders, which lack the …
View article: Diffusion-based Graph Generative Methods
Diffusion-based Graph Generative Methods Open
Being the most cutting-edge generative methods, diffusion methods have shown great advances in wide generation tasks. Among them, graph generation attracts significant research attention for its broad application in real life. In our surve…