Alexander Gasnikov
Training-Free Out-Of-Distribution Segmentation With Foundation Models
Detecting unknown objects in semantic segmentation is crucial for safety-critical applications such as autonomous driving. Large vision foundation models, including DINOv2, InternImage, and CLIP, have advanced visual representation learnin…
Modeling skiers flows via Wardrope equilibrium in closed capacitated networks
We propose an equilibrium model of ski resorts where users are assigned to cycles in a closed network. As queues form on lifts with limited capacity, we derive an efficient way to find waiting times via convex optimization. The equilibrium…
Sign Operator for Coping with Heavy-Tailed Noise in Non-Convex Optimization: High Probability Bounds Under $(L_0, L_1)$-Smoothness
In recent years, non-convex optimization problems have increasingly been described by the generalized $(L_0, L_1)$-smoothness assumption rather than the standard one. Meanwhile, severely corrupted data used in these problems has increased the demand for m…
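The core idea behind the sign operator can be illustrated with a minimal sketch: the update uses only the sign of each stochastic gradient coordinate, so a single heavy-tailed outlier cannot inflate the step. The toy objective, noise distribution, and step size below are illustrative assumptions, not the paper's algorithm or its $(L_0, L_1)$-smoothness setting.

```python
import numpy as np

def sign_sgd(grad_fn, x0, step=0.01, iters=2000, rng=None):
    """Minimal signSGD sketch: descend along the sign of the noisy
    gradient, so the step magnitude is fixed regardless of how large
    a heavy-tailed gradient sample happens to be."""
    rng = rng or np.random.default_rng(0)
    x = np.asarray(x0, dtype=float)
    for _ in range(iters):
        x = x - step * np.sign(grad_fn(x, rng))
    return x

# Toy quadratic f(x) = ||x||^2 / 2 with Student-t gradient noise;
# df < 2 means the noise has infinite variance (heavy tails).
def noisy_grad(x, rng):
    return x + rng.standard_t(df=1.5, size=x.shape)

x = sign_sgd(noisy_grad, x0=np.array([5.0, -3.0]))
```

Despite the infinite-variance noise, the iterate settles into a small neighborhood of the minimizer at the origin, which is the qualitative behavior the abstract's high-probability bounds formalize.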
Power of Generalized Smoothness in Stochastic Convex Optimization: First- and Zero-Order Algorithms
This paper is devoted to the study of stochastic optimization problems under the generalized smoothness assumption. By considering the unbiased gradient oracle in Stochastic Gradient Descent, we provide strategies to achieve in bounds the …
Decentralised convex optimisation with probability-proportional-to-size quantization
Communication is one of the bottlenecks of distributed optimisation and learning. To overcome this bottleneck, we propose a novel quantization method that transforms a vector into a sample of components' indices drawn from a categorical di…
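The quantization idea the abstract describes, transmitting sampled coordinate indices instead of the full vector, can be sketched as follows. The exact encoding in the paper may differ; this illustrates sampling indices from a categorical distribution with probabilities proportional to coordinate magnitude, and the importance-weighted reconstruction that makes the estimate unbiased.

```python
import numpy as np

def pps_quantize(v, k, rng):
    """Sketch of probability-proportional-to-size quantization: the
    'message' is k sampled coordinate indices; reconstruction divides
    by the sampling probability so the estimator is unbiased."""
    p = np.abs(v) / np.abs(v).sum()          # categorical distribution over indices
    idx = rng.choice(len(v), size=k, p=p)    # transmitted indices
    v_hat = np.zeros_like(v, dtype=float)
    np.add.at(v_hat, idx, v[idx] / (k * p[idx]))  # importance-weighted estimate
    return v_hat

rng = np.random.default_rng(0)
v = np.array([4.0, -2.0, 1.0, 0.5])
# Unbiasedness check: average many independent reconstructions.
est = np.mean([pps_quantize(v, k=2, rng=rng) for _ in range(10000)], axis=0)
```

Each individual `v_hat` is very sparse (at most `k` nonzeros), which is what saves communication, while the average over many reconstructions recovers `v`.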
Ruppert-Polyak averaging for Stochastic Order Oracle
Black-box optimization, a rapidly growing field, faces challenges due to limited knowledge of the objective function's internal mechanisms. One promising approach to address this is the Stochastic Order Oracle Concept. This concept, simila…
Accelerated Bregman gradient methods for relatively smooth and relatively Lipschitz continuous minimization problems
In this paper, we propose some accelerated methods for solving optimization problems under the condition of relatively smooth and relatively Lipschitz continuous functions with an inexact oracle. We consider the problem of minimizing the c…
Accelerated zero-order SGD under high-order smoothness and overparameterized regime
We present a novel gradient-free algorithm to solve a convex stochastic optimization problem, such as those encountered in medicine, physics, and machine learning (e.g., adversarial multi-armed bandit problem), where the objective function…
OPTAMI: Global Superlinear Convergence of High-order Methods
Second-order methods for convex optimization outperform first-order methods in terms of theoretical iteration convergence, achieving rates up to $O(k^{-5})$ for highly-smooth functions. However, their practical performance and applications…
Nesterov's method of dichotomy via Order Oracle: The problem of optimizing a two-variable function on a square
The challenges of black box optimization arise due to imprecise responses and limited output information. This article describes new results on optimizing multivariable functions using an Order Oracle, which provides access only to the ord…
Local SGD for Near-Quadratic Problems: Improving Convergence under Unconstrained Noise Conditions
Distributed optimization plays an important role in modern large-scale machine learning and data processing systems by optimizing the utilization of computational resources. One of the classical and popular approaches is Local Stochastic G…
Average-case optimization analysis for distributed consensus algorithms on regular graphs
The consensus problem in distributed computing involves a network of agents aiming to compute the average of their initial vectors through local communication, represented by an undirected graph. This paper focuses on the study of this …
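The consensus iteration itself is simple to sketch: each node repeatedly replaces its value with a weighted average of its own and its neighbors' values (the update $x \leftarrow Wx$ for a mixing matrix $W$). The 4-cycle graph and mixing weight below are illustrative choices, not the paper's setting.

```python
import numpy as np

def consensus_round(x, neighbors, w):
    """One synchronous gossip step x <- W x: every node mixes its value
    with its neighbors' values. On a connected regular graph with a
    suitable weight w, iterating this converges to the global average."""
    return np.array([(1 - w * len(nb)) * x[i] + w * sum(x[j] for j in nb)
                     for i, nb in enumerate(neighbors)])

# 4-cycle, a 2-regular graph; each node starts with a private value.
neighbors = [[1, 3], [0, 2], [1, 3], [0, 2]]
x = np.array([1.0, 5.0, 3.0, 7.0])
for _ in range(100):
    x = consensus_round(x, neighbors, w=0.3)
```

After enough rounds every entry of `x` equals the average of the initial values (here 4.0); the convergence rate is governed by the spectral gap of $W$, which is exactly the quantity an average-case analysis on regular graphs studies.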
An Equilibrium Dynamic Traffic Assignment Model with Linear Programming Formulation
In this paper, we consider a dynamic equilibrium transportation problem. There is a fixed number of cars moving from origin to destination areas. Preferences for arrival times are expressed as a cost of arriving before or after the preferr…
Exploring Applications of State Space Models and Advanced Training Techniques in Sequential Recommendations: A Comparative Study on Efficiency and Performance
Recommender systems aim to estimate the dynamically changing user preferences and sequential dependencies between historical user behaviour and metadata. Although transformer-based models have proven to be effective in sequential recommend…
Decentralized Optimization with Coupled Constraints
We consider the decentralized minimization of a separable objective $\sum_{i=1}^{n} f_i(x_i)$, where the variables are coupled through an affine constraint $\sum_{i=1}^n\left(\mathbf{A}_i x_i - b_i\right) = 0$. We assume that the functions…
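A classical way to handle this problem structure, sketched here purely for illustration (the paper's actual algorithm may differ), is dual decomposition: each node minimizes its own Lagrangian term locally, and only the residual of the coupled constraint is used to update a shared multiplier $y$. The quadratic objectives and identity matrices $\mathbf{A}_i = \mathbf{I}$ below are toy assumptions chosen so the local step has a closed form.

```python
import numpy as np

# Toy instance of  min sum_i f_i(x_i)  s.t.  sum_i (A_i x_i - b_i) = 0
# with f_i(x) = ||x - c_i||^2 / 2 and A_i = I, solved by dual ascent.
c = np.array([[1.0, 0.0], [3.0, 2.0], [0.0, 4.0]])   # local targets c_i
b = np.array([[1.0, 1.0], [1.0, 1.0], [1.0, 1.0]])   # local offsets b_i
y = np.zeros(2)                                       # dual variable for the coupling
for _ in range(200):
    x = c - y                        # closed-form local argmin of f_i(x_i) + <y, x_i>
    residual = (x - b).sum(axis=0)   # violation of the coupled constraint
    y = y + 0.2 * residual           # multiplier (dual ascent) step
```

The dual step needs only the summed residual, which is exactly the kind of quantity a decentralized method can accumulate by communication; at convergence the residual vanishes and the local solutions are primal-feasible.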
Stochastic Frank-Wolfe: Unified Analysis and Zoo of Special Cases
The Conditional Gradient (or Frank-Wolfe) method is one of the most well-known methods for solving constrained optimization problems appearing in various machine learning tasks. The simplicity of iteration and applicability to many practic…
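The simplicity the abstract mentions is visible in a textbook (deterministic) sketch of the method; the stochastic variants unified in the paper build on this same iteration. Each step needs only a linear minimization over the feasible set, which over the probability simplex is solved by a one-hot vertex, so iterates stay feasible with no projection. The objective and step-size schedule below are standard illustrative choices.

```python
import numpy as np

def frank_wolfe_simplex(grad, x0, iters=2000):
    """Frank-Wolfe (conditional gradient) over the probability simplex:
    the linear subproblem min_s <grad, s> is solved by putting all mass
    on the coordinate with the smallest gradient entry."""
    x = np.asarray(x0, dtype=float)
    for t in range(iters):
        g = grad(x)
        s = np.zeros_like(x)
        s[np.argmin(g)] = 1.0            # vertex solving the linear subproblem
        gamma = 2.0 / (t + 2.0)          # classic O(1/t) step-size schedule
        x = (1 - gamma) * x + gamma * s  # convex combination stays feasible
    return x

# Minimize ||x - p||^2 over the simplex, with p itself in the simplex.
p = np.array([0.2, 0.5, 0.3])
x = frank_wolfe_simplex(lambda x: 2.0 * (x - p), np.array([1.0, 0.0, 0.0]))
```

Every iterate is a convex combination of simplex vertices, so feasibility is automatic, which is the property that makes the method attractive for constrained machine learning problems.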
Accelerated Stochastic Gradient Method with Applications to Consensus Problem in Markov-Varying Networks
Stochastic optimization is a vital field in the realm of mathematical optimization, finding applications in diverse areas ranging from operations research to machine learning. In this paper, we introduce a novel first-order optimization al…
Clipping Improves Adam-Norm and AdaGrad-Norm when the Noise Is Heavy-Tailed
Methods with adaptive stepsizes, such as AdaGrad and Adam, are essential for training modern Deep Learning models, especially Large Language Models. For the latter, the noise in the stochastic gradients is typically heavy-tailed. Gradi…
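The mechanism can be sketched with a clipped AdaGrad-Norm-style loop; this is an illustrative assumption of how clipping composes with a norm-adaptive stepsize, not the paper's exact method or guarantees. Clipping rescales any gradient whose norm exceeds a threshold, so rare huge heavy-tailed samples cannot dominate either the step or the accumulated normalizer.

```python
import numpy as np

def clip(g, lam):
    """Rescale g so its Euclidean norm never exceeds lam."""
    norm = np.linalg.norm(g)
    return g if norm <= lam else g * (lam / norm)

def clipped_adagrad_norm(grad_fn, x0, lam=1.0, eta=1.0, iters=3000, rng=None):
    """AdaGrad-Norm-style sketch: one global accumulator of squared
    (clipped) gradient norms sets a shrinking stepsize eta / sqrt(acc)."""
    rng = rng or np.random.default_rng(0)
    x = np.asarray(x0, dtype=float)
    acc = 1e-8
    for _ in range(iters):
        g = clip(grad_fn(x, rng), lam)
        acc += np.linalg.norm(g) ** 2
        x = x - (eta / np.sqrt(acc)) * g
    return x

# Toy quadratic with infinite-variance Student-t gradient noise.
def noisy_grad(x, rng):
    return x + rng.standard_t(df=1.5, size=x.shape)

x = clipped_adagrad_norm(noisy_grad, np.array([3.0, -2.0]))
```

Without the `clip` call, a single enormous noise sample could both take a huge step and permanently inflate `acc`, stalling all later progress; with it, the iterate approaches the minimizer at the origin.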
Local Methods with Adaptivity via Scaling
The rapid development of machine learning and deep learning has introduced increasingly complex optimization challenges that must be addressed. Indeed, training modern, advanced models has become difficult to implement without leveraging m…
Lower Bounds and Optimal Algorithms for Non-Smooth Convex Decentralized Optimization over Time-Varying Networks
We consider the task of minimizing the sum of convex functions stored in a decentralized manner across the nodes of a communication network. This problem is relatively well-studied in the scenario when the objective functions are smooth, o…
Exploring Jacobian Inexactness in Second-Order Methods for Variational Inequalities: Lower Bounds, Optimal Algorithms and Quasi-Newton Approximations
Variational inequalities represent a broad class of problems, including minimization and min-max problems, commonly found in machine learning. Existing second-order and high-order methods for variational inequalities require precise comput…
Higher Degree Inexact Model for Optimization problems
In this paper, we propose a new concept: the inexact higher-degree $(δ, L, q)$-model of a function, which generalizes the inexact $(δ, L)$-model, the $(δ, L)$-oracle, and the $(δ, L)$-oracle of degree $q \in [0,2)$. Some examples we…
Accelerated Methods with Compression for Horizontal and Vertical Federated Learning
Distributed optimization algorithms have emerged as a superior approach to solving machine learning problems. To accommodate the diverse ways in which data can be stored across devices, these methods must be adaptable to a wide range of…
Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning
We propose a novel architecture and method of explainable classification with Concept Bottleneck Models (CBMs). While SOTA approaches to the image classification task work as black boxes, there is a growing demand for models that would provide…
Extragradient Sliding for Composite Non-Monotone Variational Inequalities
Variational inequalities offer a versatile and straightforward approach to analyzing a broad range of equilibrium problems in both theoretical and practical fields. In this paper, we consider a composite generally non-monotone variational …
Optimal Flow Matching: Learning Straight Trajectories in Just One Step
Over the last several years, there has been a boom in the development of Flow Matching (FM) methods for generative modeling. One intriguing property pursued by the community is the ability to learn flows with straight trajectories which real…