Explanipedia

Training Deep Nets with Sublinear Memory Cost Open

Tianqi Chen, Bing Xu, Chiyuan Zhang, Carlos Guestrin · 2016

We propose a systematic approach to reduce the memory consumption of deep neural network training. Specifically, we design an algorithm that costs O(sqrt(n)) memory to train an n-layer network, with only the computational cost of an extra f…

Hamiltonian Simulation with Nearly Optimal Dependence on all Parameters Open

Dominic W. Berry, Andrew M. Childs, Robin Kothari · 2015

Mathematics Computer science Physics

We present an algorithm for sparse Hamiltonian simulation whose complexity is optimal (up to log factors) as a function of all parameters of interest. Previous algorithms had optimal or near-optimal scaling in some parameters at the cos…

Personalized Federated Learning with Moreau Envelopes Open

Canh T. Dinh, Nguyen H. Tran, Tuan Dung Nguyen · 2020

Computer science Mathematics Economics

Federated learning (FL) is a decentralized and privacy-preserving machine learning technique in which a group of clients collaborate with a server to learn a global model without sharing clients' data. One challenge associated with FL is s…

Adafactor: Adaptive Learning Rates with Sublinear Memory Cost Open

Noam Shazeer, Mitchell Stern · 2018

Computer science Mathematics Physics

In several recently proposed stochastic optimization methods (e.g. RMSProp, Adam, Adadelta), parameter updates are scaled by the inverse square roots of exponential moving averages of squared past gradients. Maintaining these per-parameter…

Contribution of sublinear and supralinear dendritic integration to neuronal computations Open

Alexandra Tran-Van-Minh, Romain D. Cazé, Therése Abrahamsson, Laurence Cathala, Boris Gutkin, et al. · 2015

Computer science Biology Mathematics

Nonlinear dendritic integration is thought to increase the computational ability of neurons. Most studies focus on how supralinear summation of excitatory synaptic responses arising from clustered inputs within single dendrites result in t…

On the Global Linear Convergence of Frank-Wolfe Optimization Variants Open

Simon Lacoste-Julien, Martin Jaggi · 2015

Mathematics Computer science Economics

The Frank-Wolfe (FW) optimization algorithm has lately re-gained popularity thanks in particular to its ability to nicely handle the structured constraints appearing in machine learning applications. However, its convergence rate is known …

Entanglement growth and correlation spreading with variable-range interactions in spin and fermionic tunneling models Open

Anton S. Buyskikh, Maurizio Fagotti, Johannes Schachenmayer, Fabian H. L. Eßler, Andrew J. Daley · 2016

Physics Mathematics Materials science

We investigate the dynamics following a global parameter quench for two one-dimensional models with variable-range power-law interactions: a long-range transverse Ising model, which has recently been realized in chains of trapped ions, and…

SpotLight Open

Dhivya Eswaran, Christos Faloutsos, Sudipto Guha, Nina Mishra · 2018

Computer science Mathematics

How do we spot interesting events from e-mail or transportation logs? How can we detect port scan or denial of service attacks from IP-IP communication data? In general, given a sequence of weighted, directed or bipartite graphs, each summ…

I-LAMM for sparse learning: Simultaneous control of algorithmic complexity and statistical error Open

Jianqing Fan, Han Liu, Qiang Sun, Tong Zhang · 2018

Mathematics Computer science Biology

We propose a computational framework named iterative local adaptive majorize-minimization (I-LAMM) to simultaneously control algorithmic complexity and statistical error when fitting high dimensional models. I-LAMM is a two-stage algorithm…

Approximate K-Means++ in Sublinear Time Open

Olivier Bachem, Mario Lučić, S. Hamed Hassani, Andreas Krause · 2016

Computer science Mathematics

The quality of K-Means clustering is extremely sensitive to proper initialization. The classic remedy is to apply k-means++ to obtain an initial set of centers that is provably competitive with the optimal solution. Unfortunately, k-means+…

Infinitely many solutions for the stationary Kirchhoff problems involving the fractional p-Laplacian Open

Mingqi Xiang, Giovanni Molica Bisci, Guohua Tian, Binlin Zhang · 2016

Mathematics Physics Chemistry

The aim of this paper is to establish the multiplicity of weak solutions for a Kirchhoff-type problem driven by a fractional p-Laplacian operator with homogeneous Dirichlet boundary conditions.

A note on the boundedness of sublinear operators on grand variable Herz spaces Open

Hammad Nafis, Humberto Rafeiro, Muhammad Asad Zaighum · 2020

Mathematics Biology

In this paper, we introduce grand variable Herz type spaces using discrete grand spaces and prove the boundedness of sublinear operators on these spaces.

Factorization Bandits for Interactive Recommendation Open

Huazheng Wang, Qingyun Wu, Hongning Wang · 2017

Computer science Mathematics Physics

We perform online interactive recommendation via a factorization-based bandit algorithm. Low-rank matrix completion is performed over an incrementally constructed user-item preference matrix, where an upper confidence bound based item sele…

Neural Policy Gradient Methods: Global Optimality and Rates of Convergence Open

Lingxiao Wang, Qi Cai, Zhuoran Yang, Zhaoran Wang · 2019

Mathematics Computer science Economics

Policy gradient methods with actor-critic schemes demonstrate tremendous empirical successes, especially when the actors and critics are parameterized by neural networks. However, it remains less clear whether such "neural" policy gradient…

An Improved Convergence Analysis for Decentralized Online Stochastic Non-Convex Optimization Open

Ran Xin, Usman A. Khan, Soummya Kar · 2021

Mathematics Computer science Biology

In this paper, we study decentralized online stochastic non-convex optimization over a network of nodes. Integrating a technique called gradient tracking in decentralized stochastic gradient descent, we show that the resulting algorithm…

Solving the Diamond-Mortensen-Pissarides model accurately Open

Nicolas Petrosky-Nadeau, Lu Zhang · 2017

Mathematics Economics Physics

An accurate global projection algorithm is critical for quantifying the basic moments of the Diamond-Mortensen-Pissarides model. Log linearization understates the mean and volatility of unemployment, but overstates the volatility of labo…

Convex Optimization: Algorithms and Complexity Open

Sébastien Bubeck · 2015

Computer science Mathematics Psychology

This monograph presents the main complexity theorems in convex optimization and their corresponding algorithms. Starting from the fundamental theory of black-box optimization, the material progresses towards recent advances in structural o…

QSGD: Randomized Quantization for Communication-Optimal Stochastic Gradient Descent Open

Dan Alistarh, Jerry Li, Ryota Tomioka, Milan Vojnović · 2016

Computer science Mathematics Biology

Parallel implementations of stochastic gradient descent (SGD) have received significant research attention, thanks to excellent scalability properties of this algorithm, and to its efficiency in the context of training deep neural networks…

Stochastic Recursive Gradient Algorithm for Nonconvex Optimization Open

Lam M. Nguyen, Jie Liu, Katya Scheinberg, Martin Takáč · 2017

Mathematics Computer science Economics

In this paper, we study and analyze the mini-batch version of StochAstic Recursive grAdient algoritHm (SARAH), a method employing the stochastic recursive gradient, for solving empirical loss minimization for the case of nonconvex losses. …

Law of large numbers and central limit theorem under nonlinear expectations Open

Shigē Péng · 2019

Mathematics Physics Computer science

The main achievement of this paper is the finding and proof of Central Limit Theorem (CLT, see Theorem 12) under the framework of sublinear expectation. Roughly speaking under some reasonable assumption, the random sequence {1/n(X1+⋯+Xn)}i…

Quantum chaos dynamics in long-range power law interaction systems Open

Xiaohong Chen, Tianci Zhou · 2019

Physics Mathematics Philosophy

We use out-of-time-order commutator (OTOC) to diagnose the propagation of chaos in one dimensional long-range power law interaction system. We map the evolution of OTOC to a classical stochastic dynamics problem and use a Brownian quantum …

Ligero++: A New Optimized Sublinear IOP Open

Rishabh Bhadauria, Zhiyong Fang, Carmit Hazay, Muthuramakrishnan Venkitasubramaniam, Tiancheng Xie, et al. · 2020

Computer science Mathematics

This paper follows the line of works that design concretely efficient transparent sublinear zero-knowledge Interactive Oracle Proofs (IOP). Arguments obtained via this paradigm have the advantages of not relying on public-key cryptography,…

Optimal hashing-based time-space trade-offs for approximate near neighbors Open

Alexandr Andoni, Thijs Laarhoven, Ilya Razenshteyn, Erik Waingarten · 2017

Mathematics Computer science

We show tight upper and lower bounds for time-space trade-offs for the c-approximate Near Neighbor Search problem. For the d-dimensional Euclidean space and n-point datasets, we develop a data structure with space n^{1+ρ_u+o(1)} + O(dn) and qu…

On Thompson Sampling and Asymptotic Optimality Open

Jan Leike, Tor Lattimore, Laurent Orseau, Marcus Hutter · 2017

Mathematics Computer science

We discuss some recent results on Thompson sampling for nonparametric reinforcement learning in countable classes of general stochastic environments. These environments can be non-Markovian, non-ergodic, and partially observable. We show t…

Nonlocal Schrödinger-Kirchhoff equations with external magnetic field Open

Mingqi Xiang, Patrizia Pucci, Marco Squassina, Binlin Zhang · 2016

Physics Mathematics Chemistry

The paper deals with the existence and multiplicity of solutions of the fractional Schrödinger-Kirchhoff equation involving an external magnetic potential. As a consequence, the results can be applied to the special case $\begin{equation*}…

Disorder, entropy and harmonic functions Open

Itaï Benjamini, Hugo Duminil‐Copin, Gady Kozma, Ariel Yadin · 2015

Mathematics Physics

We study harmonic functions on random environments with particular emphasis on the case of the infinite cluster of supercritical percolation on $\mathbb{Z}^{d}$. We prove that the vector space of harmonic functions growing at most linearl…

Estimating the Unseen Open

Gregory Valiant, Paul Valiant · 2017

Computer science Mathematics Chemistry

We show that a class of statistical properties of distributions, which includes such practically relevant properties as entropy, the number of distinct elements, and distance metrics between pairs of distributions, can be estimated given a…

A Coordinate-Descent Primal-Dual Algorithm with Large Step Size and Possibly Nonseparable Functions Open

Olivier Fercoq, Pascal Bianchi · 2019

Mathematics Computer science Economics

This paper introduces a coordinate descent version of the Vũ-Condat algorithm. By coordinate descent, we mean that only a subset of the coordinates of the primal and dual iterates is updated at each iteration, the other coordinates b…

Sketching and Sublinear Data Structures in Genomics Open

Guillaume Marçais, Brad Solomon, Rob Patro, Carl Kingsford · 2019

Computer science Mathematics Geography

Large-scale genomics demands computational methods that scale sublinearly with the growth of data. We review several data structures and sketching techniques that have been used in genomic analysis methods. Specifically, we focus on four k…

Model-Free Linear Quadratic Control via Reduction to Expert Prediction Open

Yasin Abbasi-Yadkori, Nevena Lazic, Csaba Szepesvári · 2018

Computer science Mathematics

Model-free approaches for reinforcement learning (RL) and continuous control find policies based only on past states and rewards, without fitting a model of the system dynamics. They are appealing as they are general purpose and easy to im…

Sublinear function