Steven McDonagh
Rethinking Inter-LoRA Orthogonality in Adapter Merging: Insights from Orthogonal Monte Carlo Dropout
We propose Orthogonal Monte Carlo Dropout, a mechanism that enforces strict orthogonality when combining sparse semantic vectors without extra time complexity. Low-Rank Adaptation (LoRA), a popular fine-tuning method for large models, typi…
A Shift in Perspective on Causality in Domain Generalization
The promise that causal modelling can lead to robust AI generalization has been challenged in recent work on domain generalization (DG) benchmarks. We revisit the claims of the causality and DG literature, reconciling apparent contradictio…
SWiFT: Soft-Mask Weight Fine-tuning for Bias Mitigation
Recent studies have shown that Machine Learning (ML) models can exhibit bias in real-world scenarios, posing significant challenges in ethically sensitive domains such as healthcare. Such bias can negatively affect model fairness, model ge…
No time to train! Training-Free Reference-Based Instance Segmentation
The performance of image segmentation models has historically been constrained by the high cost of collecting large-scale annotated data. The Segment Anything Model (SAM) alleviates this original problem through a promptable, semantics-agn…
Concept-based Adversarial Attack: a Probabilistic Perspective
We propose a concept-based adversarial attack framework that extends beyond single-image perturbations by adopting a probabilistic perspective. Rather than modifying a single image, our method operates on an entire concept -- represented b…
CheXGenBench: A Unified Benchmark For Fidelity, Privacy and Utility of Synthetic Chest Radiographs
We introduce CheXGenBench, a rigorous and multifaceted evaluation framework for synthetic chest radiograph generation that simultaneously assesses fidelity, privacy risks, and clinical utility across state-of-the-art text-to-image generati…
Exploiting Mixture-of-Experts Redundancy Unlocks Multimodal Generative Abilities
In this work, we undertake the challenge of augmenting the existing generative capabilities of pre-trained text-only large language models (LLMs) with multi-modal generation capability while satisfying two core constraints: C1 preserving t…
Unlocking the Potential of Weakly Labeled Data: A Co-Evolutionary Learning Framework for Abnormality Detection and Report Generation
Anatomical abnormality detection and report generation of chest X-ray (CXR) are two essential tasks in clinical practice. The former aims at localizing and characterizing cardiopulmonary radiological findings in CXRs, while the latter summ…
There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks
The Segment Anything Model (SAM) was originally designed for label-agnostic mask generation. Does this model also possess inherent semantic understanding, of value to broader visual tasks? In this work we follow a multi-staged approach tow…
Improving Object Detection via Local-global Contrastive Learning
Visual domain gaps often impact object detection performance. Image-to-image translation can mitigate this effect, where contrastive approaches enable learning of the image-to-image mapping under unsupervised regimes. However, existing met…
BMFT: Achieving Fairness via Bias-based Weight Masking Fine-tuning
Developing models with robust group fairness properties is paramount, particularly in ethically sensitive domains such as medical diagnosis. Recent approaches to achieving fairness in machine learning require a substantial amount of traini…
einspace: Searching for Neural Architectures from Fundamental Operations
Neural architecture search (NAS) finds high performing networks for a given task. Yet the results of NAS are fairly prosaic; they have not, e.g., created a shift from convolutional structures to transformers. This is not least because the sear…
Erase to Enhance: Data-Efficient Machine Unlearning in MRI Reconstruction
Machine unlearning is a promising paradigm for removing unwanted data samples from a trained model, towards ensuring compliance with privacy regulations and limiting harmful biases. Although unlearning has been shown in, e.g., classificati…
MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation
Text-to-image generation has achieved astonishing results, yet precise spatial controllability and prompt fidelity remain highly challenging. This limitation is typically addressed through cumbersome prompt engineering, scene layout condit…
Label-efficient object detection via region proposal network pre-training
Self-supervised pre-training, based on the pretext task of instance discrimination, has fueled the recent advance in label-efficient object detection. However, existing studies focus on pre-training only a feature extractor network to lear…
Optimisation-Based Multi-Modal Semantic Image Editing
Image editing affords increased control over the aesthetics and content of generated images. Pre-existing works focus predominantly on text-based instructions to achieve desired image modifications, which limit edit precision and accuracy.…
Multi-task Learning with 3D-Aware Regularization
Deep neural networks have become a standard building block for designing models that can perform multiple dense computer vision tasks such as depth estimation and semantic segmentation thanks to their ability to capture complex correlation…
Learning to Name Classes for Vision and Language Models
Large scale vision and language models can achieve impressive zero-shot recognition performance by mapping class specific text queries to image content. Two distinct challenges that remain however, are high sensitivity to the choice of han…
Tunable Convolutions with Parametric Multi-Loss Optimization
Behavior of neural networks is irremediably determined by the specific loss and data used during training. However, it is often desirable to tune the model at inference time based on external factors such as preferences of the user or dynam…
Content-Diverse Comparisons improve IQA
Image quality assessment (IQA) forms a natural and often straightforward undertaking for humans, yet effective automation of the task remains highly challenging. Recent metrics from the deep learning community commonly compare image pairs …
CLAD: A realistic Continual Learning benchmark for Autonomous Driving
In this paper we describe the design and the ideas motivating a new Continual Learning benchmark for Autonomous Driving (CLAD), that focuses on the problems of object classification and object detection. The benchmark utilises SODA10M, a r…
Residual Contrastive Learning for Image Reconstruction: Learning Transferable Representations from Noisy Images
This paper is concerned with contrastive learning (CL) for low-level image restoration and enhancement tasks. We propose a new label-efficient learning paradigm based on residuals, residual contrastive learning (RCL), and derive an unsuper…
Model-Based Image Signal Processors via Learnable Dictionaries
Digital cameras transform sensor RAW readings into RGB images by means of their Image Signal Processor (ISP). Computational photography tasks such as image denoising and colour constancy are commonly performed in the RAW domain, in part du…
Out-of-Distribution Detection with Class Ratio Estimation
Density-based Out-of-distribution (OOD) detection has recently been shown unreliable for the task of detecting OOD images. Various density ratio based approaches achieve good empirical performance, however methods typically lack a principl…
Re-examining Distillation For Continual Object Detection
Training models continually to detect and classify objects, from new classes and new domains, remains an open problem. In this work, we conduct a thorough analysis of why and how object detection models forget catastrophically. We focus on…
CroMo: Cross-Modal Learning for Monocular Depth Estimation
Learning-based depth estimation has witnessed recent progress in multiple directions; from self-supervision using monocular video to supervised methods offering highest accuracy. Complementary to supervision, further boosts to performance …
Long-tail Recognition via Compositional Knowledge Transfer
In this work, we introduce a novel strategy for long-tail recognition that addresses the tail classes' few-shot problem via training-free knowledge transfer. Our objective is to transfer knowledge acquired from information-rich common clas…
Spread Flows for Manifold Modelling
Flow-based models typically define a latent space with dimensionality identical to the observational space. In many problems, however, the data does not populate the full ambient data space that they natively reside in, rather inhabiting a…
Flow Based Models For Manifold Data.
Flow-based generative models typically define a latent space with dimensionality identical to the observational space. In many problems, however, the data does not populate the full ambient data-space that they natively reside in, rather i…