Ehsan Abbasnejad
Mysteries of the Deep: Role of Intermediate Representations in Out of Distribution Detection
Out-of-distribution (OOD) detection is essential for reliably deploying machine learning models in the wild. Yet, most methods treat large pre-trained models as monolithic encoders and rely solely on their final-layer representations for d…
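As a hedged illustration of the theme (not the paper's method), the sketch below scores OOD-ness from an intermediate layer's features, captured for instance via a forward hook, rather than from the final-layer representation; the cosine-to-class-mean score is an assumption made for the example.

```python
import torch
import torch.nn.functional as F

def class_means(feats: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
    """Per-class means of in-distribution intermediate features, shape (C, D)."""
    return torch.stack([feats[labels == c].mean(dim=0) for c in labels.unique()])

def ood_score(feats: torch.Tensor, means: torch.Tensor) -> torch.Tensor:
    """Higher = more likely OOD: negative max cosine similarity between a
    test sample's intermediate features and any in-distribution class mean."""
    sims = F.cosine_similarity(feats.unsqueeze(1), means.unsqueeze(0), dim=-1)
    return -sims.max(dim=1).values
```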
Beyond Imitation: Recovering Dense Rewards from Demonstrations
Conventionally, supervised fine-tuning (SFT) is treated as a simple imitation learning process that only trains a policy to imitate expert behavior on demonstration datasets. In this work, we challenge this view by establishing a fundament…
Parameter-Efficient Action Planning with Large Language Models for Vision-and-Language Navigation
Bayesian Low-Rank Learning (Bella): A Practical Approach to Bayesian Neural Networks
The computational complexity of Bayesian learning is impeding its adoption in practical, large-scale tasks, despite demonstrations of significant merits such as improved robustness and resilience to unseen or out-of-distribution inputs over th…
Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild
Neural architectures tend to fit their data with relatively simple functions. This "simplicity bias" is widely regarded as key to their success. This paper explores the limits of this principle. Building on recent findings that the simplic…
RandLoRA: Full-rank parameter-efficient fine-tuning of large models
Low-Rank Adaptation (LoRA) and its variants have shown impressive results in reducing the number of trainable parameters and memory requirements of large transformer networks while maintaining fine-tuning performance. The low-rank nature o…
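For context, here is a minimal sketch of the standard LoRA update the abstract refers to; the rank `r`, scaling `alpha`, and initialization are illustrative defaults, and this is plain LoRA, not the RandLoRA method itself.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Minimal LoRA sketch: y = W0 x + (alpha / r) * B(A x), with W0 frozen.
    Only the rank-r factors A and B are trained, so the learned update
    B @ A can never exceed rank r: the limitation RandLoRA targets."""
    def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # freeze the pre-trained weights
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))  # update starts at zero
        self.scale = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.scale * (x @ self.A.T) @ self.B.T
```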
Learning to Reason and Navigate: Parameter Efficient Action Planning with Large Language Models
Modelling individual variation in human walking gait across populations and walking conditions via gait recognition
Human walking gait is a personal story written by the body, a tool for understanding biological identity in healthcare and security. Gait analysis methods traditionally diverged between these domains but are now merging their complementary…
ETAGE: Enhanced Test Time Adaptation with Integrated Entropy and Gradient Norms for Robust Model Performance
Test time adaptation (TTA) equips deep learning models to handle unseen test data that deviates from the training distribution, even when source data is inaccessible. While traditional TTA methods often rely on entropy as a confidence metr…
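To make the entropy signal concrete, here is a small sketch of the confidence measure that entropy-based TTA methods minimize or filter on; the threshold value is an assumption for illustration, not taken from the paper (which additionally integrates gradient norms).

```python
import torch
import torch.nn.functional as F

def prediction_entropy(logits: torch.Tensor) -> torch.Tensor:
    """Shannon entropy of the softmax prediction: the confidence signal
    that entropy-based TTA methods rely on."""
    log_probs = F.log_softmax(logits, dim=-1)
    return -(log_probs.exp() * log_probs).sum(dim=-1)

def select_reliable(logits: torch.Tensor, max_entropy: float = 0.4) -> torch.Tensor:
    """Illustrative filtering step (assumed threshold): adapt only on test
    samples with low entropy, since high-entropy samples yield noisy
    adaptation gradients."""
    return prediction_entropy(logits) < max_entropy
```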
InvariantStock: Learning Invariant Features for Mastering the Shifting Market
Accurately predicting stock returns is crucial for effective portfolio management. However, existing methods often overlook a fundamental issue in the market, namely, distribution shifts, making them less practical for predicting future ma…
Rethinking State Disentanglement in Causal Reinforcement Learning
One of the significant challenges in reinforcement learning (RL) when dealing with noise is estimating latent states from observations. Causality provides rigorous theoretical support for ensuring that the underlying states can be uniquely…
On the Credibility of Backdoor Attacks Against Object Detectors in the Physical World
Object detectors are vulnerable to backdoor attacks. In contrast to classifiers, detectors possess unique characteristics, both architecturally and in task execution, and often operate in challenging conditions, for instance, detecting traffic si…
Knowledge Composition using Task Vectors with Learned Anisotropic Scaling
Pre-trained models produce strong generic representations that can be adapted via fine-tuning. The learned weight difference relative to the pre-trained model, known as a task vector, characterises the direction and stride of fine-tuning. …
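Here is a short sketch of the task-vector arithmetic the abstract defines, operating on PyTorch state dicts; the uniform scalar coefficients stand in for the learned anisotropic scaling the paper proposes.

```python
import torch

def task_vector(theta_pre: dict, theta_ft: dict) -> dict:
    """tau = theta_ft - theta_pre: the direction and stride of fine-tuning."""
    return {k: theta_ft[k] - theta_pre[k] for k in theta_pre}

def compose(theta_pre: dict, taus: list, coeffs: list) -> dict:
    """theta = theta_pre + sum_i lambda_i * tau_i. Scalar (isotropic)
    coefficients are used here for simplicity; the paper instead learns
    anisotropic scaling, i.e. coefficients that vary across parameter
    directions."""
    out = {k: v.clone() for k, v in theta_pre.items()}
    for tau, lam in zip(taus, coeffs):
        for k in out:
            out[k] += lam * tau[k]
    return out
```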
Synergy and Diversity in CLIP: Enhancing Performance Through Adaptive Backbone Ensembling
Contrastive Language-Image Pretraining (CLIP) stands out as a prominent method for image representation learning. Various architectures, from vision transformers (ViTs) to convolutional networks (ResNets), have been trained with CLIP to ser…
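A hedged sketch of one plausible form of adaptive ensembling follows: softmax-normalized learned weights over per-backbone zero-shot logits. The parameterization is an assumption made for illustration, not necessarily the paper's.

```python
import torch
import torch.nn as nn

class AdaptiveEnsemble(nn.Module):
    """Combine zero-shot logits from several CLIP backbones with learned,
    softmax-normalized weights (assumed form, for illustration only)."""
    def __init__(self, num_backbones: int):
        super().__init__()
        self.w = nn.Parameter(torch.zeros(num_backbones))  # one weight per backbone

    def forward(self, per_backbone_logits: torch.Tensor) -> torch.Tensor:
        # per_backbone_logits: (num_backbones, batch, num_classes)
        weights = self.w.softmax(dim=0).view(-1, 1, 1)
        return (weights * per_backbone_logits).sum(dim=0)
```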
BruSLeAttack: A Query-Efficient Score-Based Black-Box Sparse Adversarial Attack
We study the unique, less well-understood problem of generating sparse adversarial samples simply by observing the score-based replies to model queries. Sparse attacks aim to discover a minimum number of perturbations, bounded in the l0 norm, to model …
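For orientation, here is a generic score-based sparse attack loop under the same query model (the attacker only observes scores); this is plain random search with an l0 budget, not the BruSLeAttack algorithm.

```python
import torch

def sparse_random_search(score_fn, x: torch.Tensor, k: int, queries: int) -> torch.Tensor:
    """Random-search baseline under the score-based query model (NOT
    BruSLeAttack): each candidate differs from the clean input x in at
    most k pixels, so the perturbation stays l0-bounded. score_fn is the
    attacker's only access to the model; it returns the score of the true
    class, which the attack tries to drive down."""
    best_x, best_score = x.clone(), score_fn(x)
    n = x.numel()
    for _ in range(queries):
        cand = x.clone()
        idx = torch.randperm(n)[:k]          # choose k pixels to modify
        cand.view(-1)[idx] = torch.rand(k)   # re-draw their values in [0, 1]
        s = score_fn(cand)
        if s < best_score:
            best_x, best_score = cand, s
    return best_x
```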
Bayesian Learned Models Can Detect Adversarial Malware For Free
The vulnerability of machine learning-based malware detectors to adversarial attacks has prompted the need for robust solutions. Adversarial training is an effective method but is computationally expensive to scale up to large datasets and…
Premonition: Using Generative Models to Preempt Future Data Changes in Continual Learning
Continual learning requires a model to adapt to ongoing changes in the data distribution, and often to the set of tasks to be performed. It is rare, however, that the data and task changes are completely unpredictable. Given a description …
Do Deep Neural Network Solutions Form a Star Domain?
It has recently been conjectured that neural network solution sets reachable via stochastic gradient descent (SGD) are convex, considering permutation invariances (Entezari et al., 2022). This means that a linear path can connect two indep…
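The conjecture can be probed with a simple linear-interpolation check between two trained solutions; `loss_fn` here is a user-supplied callable that evaluates a model built from the interpolated state dict.

```python
import torch

def linear_path_losses(loss_fn, theta_a: dict, theta_b: dict, steps: int = 11):
    """Evaluate the loss along theta(t) = (1 - t) * theta_a + t * theta_b.
    A flat, low-loss path means the two SGD solutions are linearly mode-
    connected (after any permutation alignment); the star-domain question
    is whether a single 'center' solution connects to all others this way."""
    losses = []
    for t in torch.linspace(0.0, 1.0, steps):
        theta_t = {k: (1 - t) * theta_a[k] + t * theta_b[k] for k in theta_a}
        losses.append(loss_fn(theta_t))
    return losses
```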
Neural Redshift: Random Networks are not Random Functions
Our understanding of the generalization capabilities of neural networks (NNs) is still incomplete. Prevailing explanations are based on implicit biases of gradient descent (GD) but they cannot account for the capabilities of models from gr…
Invariant Representation Learning for Generalizable Imitation
Unveiling Backbone Effects in CLIP: Exploring Representational Synergies and Variances
Contrastive Language-Image Pretraining (CLIP) stands out as a prominent method for image representation learning. Various neural architectures, ranging from Transformer-based models like Vision Transformers (ViTs) to Convolutional Networks (Co…
Zero-shot Retrieval: Augmenting Pre-trained Models with Search Engines
Large pre-trained models can dramatically reduce the amount of task-specific data required to solve a problem, but they often fail to capture domain-specific nuances out of the box. The Web likely contains the information necessary to exce…
SCONE-GAN: Semantic Contrastive learning-based Generative Adversarial Network for an end-to-end image translation
SCONE-GAN presents an end-to-end image translation method, shown to be effective for learning to generate realistic and diverse scenery images. Most current image-to-image translation approaches are devised as two mappings: a translation…
Progressive Feature Adjustment for Semi-supervised Learning from Pretrained Models
As an effective way to alleviate the burden of data annotation, semi-supervised learning (SSL) provides an attractive solution due to its ability to leverage both labeled and unlabeled data to build a predictive model. While significant pr…
RanPAC: Random Projections and Pre-trained Models for Continual Learning
Continual learning (CL) aims to incrementally learn different tasks (such as classification) in a non-stationary data stream without forgetting old ones. Most CL works focus on tackling catastrophic forgetting under a learning-from-scratch…
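A hedged sketch of the random-projection idea suggested by the title: frozen pre-trained features are expanded through a fixed random matrix plus a nonlinearity, and only per-class statistics are kept. The dimensions and the prototype classifier below are assumptions for illustration, not the paper's exact recipe.

```python
import torch

torch.manual_seed(0)
d_in, d_proj, num_classes = 768, 4096, 100   # illustrative sizes
W = torch.randn(d_in, d_proj)                # frozen random projection, never trained

sums = torch.zeros(num_classes, d_proj)      # running per-class statistics
counts = torch.zeros(num_classes)

def project(feats: torch.Tensor) -> torch.Tensor:
    """Expand frozen backbone features through the fixed random matrix."""
    return torch.relu(feats @ W)

def accumulate(feats: torch.Tensor, labels: torch.Tensor) -> None:
    """Update class statistics from a new task's data: no gradient updates,
    so earlier tasks are not overwritten as tasks arrive sequentially."""
    z = project(feats)
    for c in labels.unique():
        mask = labels == c
        sums[c] += z[mask].sum(dim=0)
        counts[c] += mask.sum()

def classify(feats: torch.Tensor) -> torch.Tensor:
    """Score by dot product with class-mean prototypes (a simplification)."""
    protos = sums / counts.clamp(min=1).unsqueeze(1)
    return (project(feats) @ protos.T).argmax(dim=1)
```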
Feature-Space Bayesian Adversarial Learning Improved Malware Detector Robustness
We present a new algorithm to train a robust malware detector. Malware is a prolific problem, and malware detectors are a front-line defense; modern detectors rely on machine learning algorithms. The adversarial objective, then, is to devise …
Semantic Role Labeling Guided Out-of-distribution Detection
Identifying unexpected domain-shifted instances in natural language processing is crucial in real-world applications. Previous works identify the out-of-distribution (OOD) instance by leveraging a single global feature embedding to represe…
Selective Mixup Helps with Distribution Shifts, But Not (Only) because of Mixup
Mixup is a highly successful technique to improve generalization of neural networks by augmenting the training data with combinations of random pairs. Selective mixup is a family of methods that apply mixup to specific pairs, e.g. only com…
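To fix terms, here is a small sketch of mixup plus one illustrative selection rule (same class, different domain) of the kind the paper analyzes; labels are assumed one-hot so they can be mixed.

```python
import torch

def mixup(x1, y1, x2, y2, alpha: float = 0.2):
    """Standard mixup of a chosen pair; y1, y2 are one-hot (or soft) labels."""
    lam = torch.distributions.Beta(alpha, alpha).sample()
    return lam * x1 + (1 - lam) * x2, lam * y1 + (1 - lam) * y2

def selective_pairs(labels: torch.Tensor, domains: torch.Tensor):
    """One illustrative selection rule: pair each example with a random
    partner of the SAME class from a DIFFERENT domain. The selection by
    itself resamples the training distribution, which is the effect the
    paper separates from the mixing operation."""
    pairs = []
    for i in range(len(labels)):
        ok = (labels == labels[i]) & (domains != domains[i])
        candidates = ok.nonzero(as_tuple=True)[0]
        if len(candidates) > 0:
            j = candidates[torch.randint(len(candidates), (1,))].item()
            pairs.append((i, j))
    return pairs
```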
Deep Metric Learning for Scalable Gait-Based Person Re-Identification Using Force Platform Data
Walking gait data acquired with force platforms may be used for person re-identification (re-ID) in various authentication, surveillance, and forensics applications. Current force platform-based re-ID systems classify a fixed set of identi…
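A hedged sketch of the metric-learning recipe implied by the title: embed gait samples and train with a triplet loss, so that re-ID reduces to nearest-neighbor search and new identities can be enrolled without retraining a fixed-class classifier. The encoder and feature dimensions below are placeholders, not the paper's architecture.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

embedder = nn.Sequential(            # placeholder encoder for flattened
    nn.Linear(512, 256), nn.ReLU(),  # force-platform features; the real
    nn.Linear(256, 128),             # model would be sequence-aware
)
triplet = nn.TripletMarginLoss(margin=0.3)

def step(anchor, positive, negative):
    """One training step on a (same person, different person) triplet:
    pull same-identity embeddings together, push others apart."""
    za = F.normalize(embedder(anchor), dim=-1)
    zp = F.normalize(embedder(positive), dim=-1)
    zn = F.normalize(embedder(negative), dim=-1)
    return triplet(za, zp, zn)
```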