Shaofeng H.-C. Jiang
YOU?
Author Swipe
View article: Coresets for Robust Clustering via Black-Box Reductions to Vanilla Case
Coresets for Robust Clustering via Black-Box Reductions to Vanilla Case Open
We devise ε-coresets for robust (k,z)-Clustering with m outliers through black-box reductions to vanilla clustering. Given an ε-coreset construction for vanilla clustering with size N, we construct coresets of size N⋅ polylog(kmε^{-1}) + O…
View article: Streaming Algorithms for Geometric Steiner Forest
Streaming Algorithms for Geometric Steiner Forest Open
We consider a generalization of the Steiner tree problem, the Steiner forest problem , in the Euclidean plane: the input is a multiset \(X\subseteq{\mathbb{R}}^{2}\) , partitioned into \(k\) color classes \(C_{1},\ldots,C_{k}\subseteq X\) …
View article: Online Facility Location with Predictions
Online Facility Location with Predictions Open
We provide nearly optimal algorithms for online facility location (OFL) with predictions. In OFL, $n$ demand points arrive in order and the algorithm must irrevocably assign each demand point to an open facility upon its arrival. The objec…
View article: Coresets for Kernel Clustering
Coresets for Kernel Clustering Open
We devise coresets for kernel $k$-Means with a general kernel, and use them to obtain new, more efficient, algorithms. Kernel $k$-Means has superior clustering capability compared to classical $k$-Means, particularly when clusters are non-…
View article: Coresets for Clustering with Missing Values
Coresets for Clustering with Missing Values Open
We provide the first coreset for clustering points in $\mathbb{R}^d$ that have multiple missing values (coordinates). Previous coreset constructions only allow one missing coordinate. The challenge in this setting is that objective functio…
View article: SPIN: BSP Job Scheduling With Placement-Sensitive Execution
SPIN: BSP Job Scheduling With Placement-Sensitive Execution Open
The Bulk Synchronous Parallel (BSP) paradigm is gaining tremendous importance recently due to the popularity of computations as distributed machine learning and graph computation. In a typical BSP job, multiple workers concurrently conduct…
View article: Coresets for Clustering in Excluded-minor Graphs and Beyond
Coresets for Clustering in Excluded-minor Graphs and Beyond Open
Coresets are modern data-reduction tools that are widely used in data analysis to improve efficiency in terms of running time, space and communication complexity. Our main result is a fast algorithm to construct a small coreset for k-Media…
View article: Scheduling Placement-Sensitive BSP Jobs with Inaccurate Execution Time Estimation
Scheduling Placement-Sensitive BSP Jobs with Inaccurate Execution Time Estimation Open
The Bulk Synchronous Parallel (BSP) paradigm is gaining tremendous importance recently because of the pop-ularity of computations such as distributed machine learning and graph computation. In a typical BSP job, multiple workers concurrent…
View article: Coresets for Clustering in Excluded-minor Graphs and Beyond
Coresets for Clustering in Excluded-minor Graphs and Beyond Open
Coresets are modern data-reduction tools that are widely used in data analysis to improve efficiency in terms of running time, space and communication complexity. Our main result is a fast algorithm to construct a small coreset for k-Media…
View article: A Unified PTAS for Prize Collecting TSP and Steiner Tree Problem in Doubling Metrics
A Unified PTAS for Prize Collecting TSP and Steiner Tree Problem in Doubling Metrics Open
We present a unified (randomized) polynomial-time approximation scheme (PTAS) for the prize collecting traveling salesman problem (PCTSP) and the prize collecting Steiner tree problem (PCSTP) in doubling metrics. Given a metric space and a…
View article: Coresets for Clustering in Graphs of Bounded Treewidth
Coresets for Clustering in Graphs of Bounded Treewidth Open
We initiate the study of coresets for clustering in graph metrics, i.e., the shortest-path metric of edge-weighted graphs. Such clustering problems are essential to data analysis and used for example in road networks and data visualization…
View article: Coresets for Clustering with Fairness Constraints
Coresets for Clustering with Fairness Constraints Open
In a recent work, [19] studied the following "fair" variants of classical clustering problems such as $k$-means and $k$-median: given a set of $n$ data points in $\mathbb{R}^d$ and a binary type associated to each data point, the goal is t…
View article: Coresets for Ordered Weighted Clustering
Coresets for Ordered Weighted Clustering Open
We design coresets for Ordered k-Median, a generalization of classical clustering problems such as k-Median and k-Center, that offers a more flexible data analysis, like easily combining multiple objectives (e.g., to increase fairness or f…
View article: Epsilon-Coresets for Clustering (with Outliers) in Doubling Metrics
Epsilon-Coresets for Clustering (with Outliers) in Doubling Metrics Open
We study the problem of constructing epsilon-coresets for the (k, z)-clustering problem in a doubling metric M(X, d). An epsilon-coreset is a weighted subset S subset of X with weight function w : S -> R->= 0, such that for any k-subset C …
View article: $\varepsilon$-Coresets for Clustering (with Outliers) in Doubling Metrics
$\varepsilon$-Coresets for Clustering (with Outliers) in Doubling Metrics Open
We study the problem of constructing $\varepsilon$-coresets for the $(k, z)$-clustering problem in a doubling metric $M(X, d)$. An $\varepsilon$-coreset is a weighted subset $S\subseteq X$ with weight function $w : S \rightarrow \mathbb{R}…
View article: A Unified PTAS for Prize Collecting TSP and Steiner Tree Problem in\n Doubling Metrics
A Unified PTAS for Prize Collecting TSP and Steiner Tree Problem in\n Doubling Metrics Open
We present a unified polynomial-time approximation scheme (PTAS) for the\nprize collecting traveling salesman problem (PCTSP) and the prize collecting\nSteiner tree problem (PCSTP) in doubling metrics. Given a metric space and a\npenalty f…
View article: Online Submodular Maximization Problem with Vector Packing Constraint
Online Submodular Maximization Problem with Vector Packing Constraint Open
We consider the online vector packing problem in which we have a $d$ dimensional knapsack and items $u$ with weight vectors $\mathbf{w}_u \in \mathbb{R}_+^d$ arrive online in an arbitrary order. Upon the arrival of an item, the algorithm m…
View article: Optimal Decomposition Strategy For Tree Edit Distance
Optimal Decomposition Strategy For Tree Edit Distance Open
An ordered labeled tree is a tree where the left-to-right order among siblings is significant. Given two ordered labeled trees, the edit distance between them is the minimum cost edit operations that convert one tree to the other.\nIn this…
View article: Online Submodular Maximization Problem with Vector Packing Constraint
Online Submodular Maximization Problem with Vector Packing Constraint Open
We consider the online vector packing problem in which we have a d dimensional knapsack and items u with weight vectors w_u in R_+^d arrive online in an arbitrary order. Upon the arrival of an item, the algorithm must decide immediately wh…
View article: Approximation schemes for network design problems in doubling metrics
Approximation schemes for network design problems in doubling metrics Open
View article: Online Submodular Maximization with Free Disposal: Randomization Beats ¼ for Partition Matroids
Online Submodular Maximization with Free Disposal: Randomization Beats ¼ for Partition Matroids Open
We study the online submodular maximization problem with free disposal under a matroid constraint. Elements from some ground set arrive one by one in rounds, and the algorithm maintains a feasible set that is independent in the underlying …
View article: A PTAS for the Steiner Forest Problem in Doubling Metrics
A PTAS for the Steiner Forest Problem in Doubling Metrics Open
We achieve a (randomized) polynomial-time approximation scheme (PTAS) for the Steiner Forest Problem in doubling metrics. Before our work, a PTAS is given only for the Euclidean plane in [FOCS 2008: Borradaile, Klein and Mathieu]. Our PTAS…