Value bounds and Convergence Analysis for Averages of LRP attributions Open

Alexander Binder, Ürün Doǧan · 2025

We analyze numerical properties of Layer-wise relevance propagation (LRP)-type attribution methods by representing them as a product of modified gradient matrices. This representation creates an analogy to matrix multiplications of Jacobi-…

Pearl: A Production-ready Reinforcement Learning Agent Open

Zheqing Zhu, Rodrigo de Salvo Braz, Jalaj Bhandari, Daniel Jiang, Yi Qin Wan, et al. · 2023

Reinforcement learning (RL) is a versatile framework for optimizing long-term goals. Although many real-world problems can be formalized with RL, learning and deploying a performant RL policy requires a system designed to address several i…

IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control Open

Rohan Chitnis, Yingchen Xu, B. Hashemi, Lucas Lehnert, Ürün Doǧan, et al. · 2023

Computer science Mathematics Geography

Model-based reinforcement learning (RL) has shown great promise due to its sample efficiency, but still struggles with long-horizon sparse-reward tasks, especially in offline settings where the agent learns from a fixed dataset. We hypothe…

Representation learning for clustering via building consensus Open

Aniket Anand Deshmukh, Jayanth Reddy Regatti, Eren Manavoglu, Ürün Doǧan · 2022

Computer science Political science Sociology

In this paper, we focus on unsupervised representation learning for clustering of images. Recent advances in deep clustering and unsupervised representation learning are based on the idea that different views of an input image (generated t…

Offline RL With Resource Constrained Online Deployment Open

Jayanth Reddy Regatti, Aniket Anand Deshmukh, Y. Frank Cheng, Young Hun Jung, Abhishek Gupta, et al. · 2021

Computer science Geography

Offline reinforcement learning is used to train policies in scenarios where real-time access to the environment is expensive or impossible. As a natural consequence of these harsh conditions, an agent may lack the resources to fully observ…

Consensus Clustering With Unsupervised Representation Learning Open

Jayanth Reddy Regatti, Aniket Anand Deshmukh, Eren Manavoglu, Ürün Doǧan · 2021

Computer science Political science

Recent advances in deep clustering and unsupervised representation learning are based on the idea that different views of an input image (generated through data augmentation techniques) must either be closer in the representation space, or…

Representation Learning for Clustering via Building Consensus Open

Aniket Anand Deshmukh, Jayanth Reddy Regatti, Eren Manavoglu, Ürün Doǧan · 2021

Computer science Sociology Political science

In this paper, we focus on unsupervised representation learning for clustering of images. Recent advances in deep clustering and unsupervised representation learning are based on the idea that different views of an input image (generated t…

Zero Shot Domain Generalization Open

Udit Maniyar, K J Joseph, Aniket Anand Deshmukh, Ürün Doǧan, Vineeth N Balasubramanian · 2020

Computer science Mathematics Chemistry

Standard supervised learning setting assumes that training data and test data come from the same distribution (domain). Domain generalization (DG) methods try to learn a model that when trained on data from multiple domains, would generali…

Self-Supervised Contextual Bandits in Computer Vision Open

Aniket Anand Deshmukh, Abhimanu Kumar, Levi Boyles, Denis Charles, Eren Manavoglu, et al. · 2020

Computer science Mathematics Economics

Contextual bandits are a common problem faced by machine learning practitioners in domains as diverse as hypothesis testing to product recommendations. There have been a lot of approaches in exploiting rich data representations for context…

Data Transformation Insights in Self-supervision with Clustering Tasks Open

Abhimanu Kumar, Aniket Anand Deshmukh, Ürün Doǧan, Denis Charles, Eren Manavoglu · 2020

Computer science Mathematics Economics

Self-supervision is key to extending use of deep learning for label scarce domains. For most of self-supervised approaches data transformations play an important role. However, up until now the impact of transformations have not been studi…

A Generalization Error Bound for Multi-class Domain Generalization Open

Aniket Anand Deshmukh, Yunwen Lei, Srinagesh Sharma, Ürün Doǧan, James Cutler, et al. · 2019

Computer science Mathematics

Domain generalization is the problem of assigning labels to an unlabeled data set, given several similar data sets for which labels have been provided. Despite considerable interest in this problem over the last decade, there has been no t…

Domain Generalization by Marginal Transfer Learning Open

Gilles Blanchard, Aniket Anand Deshmukh, Ürün Doǧan, Gyemin Lee, Clayton Scott · 2017

Computer science Mathematics

In the problem of domain generalization (DG), there are labeled training data sets from several related prediction problems, and the goal is to make accurate predictions on future unlabeled data sets that are not known to the learner. This…

Data-dependent Generalization Bounds for Multi-class Classification Open

Yunwen Lei, Ürün Doǧan, Ding‐Xuan Zhou, Marius Kloft · 2017

Mathematics Computer science Physics

In this paper, we study data-dependent generalization error bounds exhibiting a mild dependency on the number of classes, making them suitable for multi-class learning with a large number of label classes. The bounds generally hold for emp…

Distributed optimization of multi-class SVMs Open

Maximilian Alber, Julian Zimmert, Ürün Doǧan, Marius Kloft · 2017

Computer science Mathematics Medicine

Training of one-vs.-rest SVMs can be parallelized over the number of classes in a straightforward way. Given enough computational resources, one-vs.-rest SVMs can thus be trained on data involving a large number of classes. The same canno…

Multi-Task Learning for Contextual Bandits Open

Aniket Anand Deshmukh, Ürün Doǧan, Clayton Scott · 2017

Computer science Economics

Contextual bandits are a form of multi-armed bandit in which the agent has access to predictive side information (known as the context) for each arm at each time step, and have been used to model personalized news recommendation, ad placem…

Localized Multiple Kernel Learning---A Convex Approach Open

Yunwen Lei, Alexander Binder, Ürün Doǧan, Marius Kloft · 2015

Computer science Mathematics

We propose a localized approach to multiple kernel learning that can be formulated as a convex optimization problem over a given cluster structure. For which we obtain generalization error guarantees and derive an optimization algorithm ba…

Multi-class SVMs: From Tighter Data-Dependent Generalization Bounds to Novel Algorithms Open

Yunwen Lei, Ürün Doǧan, Alexander Binder, Marius Kloft · 2015

Computer science Mathematics Political science

This paper studies the generalization performance of multi-class classification algorithms, for which we obtain, for the first time, a data-dependent generalization error bound with a logarithmic dependence on the class size, substantially…