Praneeth Kacham
PolarQuant: Quantizing KV Caches with Polar Transformation
Large language models (LLMs) require significant memory to store Key-Value (KV) embeddings in their KV cache, especially when handling long-range contexts. Quantization of these KV embeddings is a common technique to reduce memory consumpt…
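The abstract above is truncated, so the exact scheme is not shown here. As an illustration of the general idea of quantizing in polar rather than Cartesian coordinates, the sketch below pairs up consecutive coordinates of an embedding and quantizes each pair's radius and angle; the pairing scheme, bit widths, and uniform quantizers are assumptions for illustration, not the paper's method.

```python
import numpy as np

def polar_quantize(kv, angle_bits=4, radius_bits=4):
    """Illustrative sketch: quantize consecutive coordinate pairs of an
    embedding in polar form (radius, angle). Assumes even dimension.
    Pairing and bit allocation are hypothetical, not PolarQuant's exact scheme."""
    pairs = kv.reshape(-1, 2)
    r = np.linalg.norm(pairs, axis=1)
    theta = np.arctan2(pairs[:, 1], pairs[:, 0])          # angles in (-pi, pi]
    # Uniformly quantize the angle over (-pi, pi].
    levels = 2 ** angle_bits
    q = np.round((theta + np.pi) / (2 * np.pi) * (levels - 1))
    theta_hat = q / (levels - 1) * 2 * np.pi - np.pi
    # Uniformly quantize the radius over [0, max radius].
    r_levels = 2 ** radius_bits
    scale = r.max() / (r_levels - 1) if r.max() > 0 else 1.0
    r_hat = np.round(r / scale) * scale
    out = np.stack([r_hat * np.cos(theta_hat), r_hat * np.sin(theta_hat)], axis=1)
    return out.reshape(kv.shape)
```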
Approximating the Top Eigenvector in Random Order Streams
When rows of an $n \times d$ matrix $A$ are given in a stream, we study algorithms for approximating the top eigenvector of the matrix ${A}^TA$ (equivalently, the top right singular vector of $A$). We consider worst case inputs $A$ but ass…
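As a point of reference for the problem setup, the classical one-pass baseline for this task is Oja's update, which processes streamed rows in any order; this is a generic streaming heuristic, not the paper's algorithm.

```python
import numpy as np

def oja_top_eigvec(rows, eta=0.1):
    """Oja's rule: one pass over the rows of A, approximating the top
    eigenvector of A^T A. A standard baseline, not the paper's method."""
    rng = np.random.default_rng(0)
    v = None
    for a in rows:
        if v is None:
            v = rng.standard_normal(a.shape[0])
            v /= np.linalg.norm(v)
        v = v + eta * np.dot(a, v) * a   # step toward the dominant direction
        v /= np.linalg.norm(v)
    return v
```

On a stream whose rows concentrate along one direction, the iterate aligns with that direction regardless of the order in which the rows arrive.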
LevAttention: Time, Space, and Streaming Efficient Algorithm for Heavy Attentions
A central problem related to transformers can be stated as follows: given two $n \times d$ matrices $Q$ and $K$, and a non-negative function $f$, define the matrix $A$ as follows: (1) apply the function $f$ to each entry of the $n \times n…
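A quantity closely tied to identifying keys that can receive large attention weight is the leverage score of the rows of $K$; the helper below computes these scores, while the way they would be thresholded or used downstream is left out.

```python
import numpy as np

def leverage_scores(K):
    """Row leverage scores of K: tau_i = k_i^T (K^T K)^+ k_i.
    Each tau_i lies in [0, 1] and the scores sum to rank(K)."""
    G = np.linalg.pinv(K.T @ K)
    return np.einsum('ij,jk,ik->i', K, G, K)
```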
Optimal Communication Bounds for Classic Functions in the Coordinator Model and Beyond
In the coordinator model of communication with $s$ servers, given an arbitrary non-negative function $f$, we study the problem of approximating the sum $\sum_{i \in [n]} f(x_i)$ up to a $1 \pm \varepsilon$ factor. Here the vector $x \in \mathbb{R}^n$ is defined to be $x = x^{(1)} + \cdots +$ …
High-Dimensional Geometric Streaming for Nearly Low Rank Data
We study streaming algorithms for the $\ell_p$ subspace approximation problem. Given points $a_1, \ldots, a_n$ as an insertion-only stream and a rank parameter $k$, the $\ell_p$ subspace approximation problem is to find a $k$-dimensional s…
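For orientation, the $p = 2$ special case of $\ell_p$ subspace approximation is solved exactly offline by the top-$k$ right singular vectors (PCA); the snippet below shows that baseline, with the streaming aspect and general $p$ left to the paper.

```python
import numpy as np

def l2_subspace_approx(A, k):
    """p = 2 special case: the best k-dimensional subspace is spanned by
    the top-k right singular vectors. An offline baseline only."""
    _, _, Vt = np.linalg.svd(A, full_matrices=False)
    V = Vt[:k].T                                    # orthonormal basis, shape (d, k)
    cost = np.linalg.norm(A - A @ V @ V.T, 'fro') ** 2   # sum of squared distances
    return V, cost
```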
Optimal Communication for Classic Functions in the Coordinator Model and Beyond
In the coordinator model of communication with $s$ servers, given an arbitrary non-negative function $f$, we study the problem of approximating the sum $\sum_{i \in [n]}f(x_i)$ up to a $1 \pm \varepsilon$ factor. Here the vector $x \in R^n…
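To make the model concrete, the snippet below works out the special case $f(x) = x^2$ (the second moment) with an AMS-style random-sign sketch: each server compresses its local vector with shared randomness, and because the sketch is linear, the coordinator can sum the sketches to estimate $\|x\|_2^2$ for the aggregate $x$. This illustrates the communication model, not the paper's protocol for general $f$.

```python
import numpy as np

def f2_coordinator_estimate(servers, reps=2000, seed=1):
    """Estimate sum_i x_i^2 where x = x^(1) + ... + x^(s) is split across
    servers. Each server sends `reps` numbers (its sketched vector);
    linearity means the summed sketches equal the sketch of x."""
    n = servers[0].shape[0]
    rng = np.random.default_rng(seed)
    signs = rng.choice([-1.0, 1.0], size=(reps, n))   # shared randomness
    coord = sum(signs @ v for v in servers)           # = signs @ x, by linearity
    return np.mean(coord ** 2)                        # unbiased for ||x||_2^2
```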
Faster Algorithms for Schatten-p Low Rank Approximation
We study algorithms for the Schatten-p Low Rank Approximation (LRA) problem. First, we show that by using fast rectangular matrix multiplication algorithms and different block sizes, we can improve the running time of the algorithms in the…
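For reference, the Schatten-$p$ norm that defines the approximation error here is simply the $\ell_p$ norm of the singular values; evaluating it directly costs a full SVD, which is what fast LRA algorithms avoid.

```python
import numpy as np

def schatten_norm(A, p):
    """Schatten-p norm: l_p norm of the singular values of A.
    p = 2 recovers the Frobenius norm."""
    s = np.linalg.svd(A, compute_uv=False)
    return np.sum(s ** p) ** (1.0 / p)
```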
Lower Bounds on Adaptive Sensing for Matrix Recovery
We study lower bounds on adaptive sensing algorithms for recovering low rank matrices using linear measurements. Given an $n \times n$ matrix $A$, a general linear measurement $S(A)$, for an $n \times n$ matrix $S$, is just the inner produ…
Pseudorandom Hashing for Space-bounded Computation with Applications in Streaming
We revisit Nisan's classical pseudorandom generator (PRG) for space-bounded computation (STOC 1990) and its applications in streaming algorithms. We describe a new generator, HashPRG, that can be thought of as a symmetric version of Nisan'…
PolySketchFormer: Fast Transformers via Sketching Polynomial Kernels
The quadratic time and memory complexity inherent to self-attention mechanisms, with respect to sequence length, presents a critical computational bottleneck in the training and deployment of large-scale Transformer-based language models. …
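The reason polynomial attention escapes the quadratic bottleneck is that $(q \cdot k)^p$ is an inner product of explicit tensored features, so the $n \times n$ attention matrix never needs to be formed. The sketch below shows the exact degree-2, non-causal version of this linearization; PolySketchFormer's contribution is replacing the $d^2$-dimensional features with small sketches (and handling causal masking), which is not shown here.

```python
import numpy as np

def poly2_attention(Q, K, V):
    """Degree-2 polynomial attention, sim(q, k) = (q.k)^2, computed in
    time linear in sequence length n via features phi(v) = v (x) v,
    using <phi(q), phi(k)> = (q.k)^2. Exact and non-causal."""
    n, d = Q.shape
    phi_q = np.einsum('ni,nj->nij', Q, Q).reshape(n, d * d)
    phi_k = np.einsum('ni,nj->nij', K, K).reshape(n, d * d)
    S = phi_k.T @ V            # (d^2, d_v); no n x n matrix is ever formed
    z = phi_k.sum(axis=0)      # normalizer features
    return (phi_q @ S) / (phi_q @ z)[:, None]
```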
Sub-quadratic Algorithms for Kernel Matrices via Kernel Density Estimation
Kernel matrices, as well as weighted graphs represented by them, are ubiquitous objects in machine learning, statistics and other related fields. The main drawback of using kernel methods (learning and inference using kernel matrices) is e…
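To make the cost concrete, the naive kernel density estimate below takes $O(n)$ time per query, so answering $n$ queries against $n$ points costs $\Theta(n^2)$, which is exactly the quadratic barrier that sub-quadratic KDE-based methods target.

```python
import numpy as np

def gaussian_kde_naive(X, q, bandwidth=1.0):
    """Naive Gaussian KDE at query q over points X (rows): O(n) per query."""
    d2 = np.sum((X - q) ** 2, axis=1)
    return float(np.mean(np.exp(-d2 / (2.0 * bandwidth ** 2))))
```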
Sketching Algorithms and Lower Bounds for Ridge Regression
We give a sketching-based iterative algorithm that computes a $1+\varepsilon$ approximate solution for the ridge regression problem $\min_x \|Ax-b\|_2^2 + \lambda\|x\|_2^2$ where $A \in R^{n \times d}$ with $d \ge n$. Our algorithm, for a constan…
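In the $d \ge n$ regime studied here, the exact solution has a cheap dual (kernel) closed form, $x^* = A^T (A A^T + \lambda I)^{-1} b$, which solves an $n \times n$ system instead of a $d \times d$ one; this is the direct baseline, not the paper's sketching-based iterative algorithm.

```python
import numpy as np

def ridge_dual(A, b, lam):
    """Exact ridge solution via the dual identity
    (A^T A + lam I)^{-1} A^T = A^T (A A^T + lam I)^{-1},
    cheaper than the primal normal equations when d >= n."""
    n = A.shape[0]
    return A.T @ np.linalg.solve(A @ A.T + lam * np.eye(n), b)
```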
Near-Optimal Algorithms for Linear Algebra in the Current Matrix Multiplication Time
In the numerical linear algebra community, it was suggested that to obtain nearly optimal bounds for various problems such as rank computation, finding a maximal linearly independent subset of columns (a basis), regression, or low-rank app…
Reduced-Rank Regression with Operator Norm Error
A common data analysis task is the reduced-rank regression problem: $$\min_{\textrm{rank-}k \ X} \|AX-B\|,$$ where $A \in \mathbb{R}^{n \times c}$ and $B \in \mathbb{R}^{n \times d}$ are given large matrices and $\|\cdot\|$ is some norm. H…
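For contrast with the operator-norm version studied in the paper, the Frobenius-norm variant of this problem has a classical closed form: project $B$ onto the column space of $A$, truncate to rank $k$, and solve. The snippet below implements that textbook baseline.

```python
import numpy as np

def rrr_frobenius(A, B, k):
    """Reduced-rank regression, Frobenius norm: argmin over rank-k X of
    ||AX - B||_F. Classical solution via projection + SVD truncation."""
    pinvA = np.linalg.pinv(A)
    P = A @ pinvA                                   # projector onto col(A)
    U, s, Vt = np.linalg.svd(P @ B, full_matrices=False)
    Bk = (U[:, :k] * s[:k]) @ Vt[:k]                # best rank-k part of P @ B
    return pinvA @ Bk
```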
Dimensionality Reduction for Sum-of-Distances Metric
We give a dimensionality reduction procedure to approximate the sum of distances of a given set of $n$ points in $R^d$ to any "shape" that lies in a $k$-dimensional subspace. Here, by "shape" we mean any set of points in $R^d$. Our algorit…