Explanipedia

One Size Does Not Fit All: Architecture-Aware Adaptive Batch Scheduling with DEBA Open

François Belias, Naser Ezzati‐Jivan, Foutse Khomh · 2025

Adaptive batch size methods aim to accelerate neural network training, but existing approaches apply identical adaptation strategies across all architectures, assuming a one-size-fits-all solution. We introduce DEBA (Dynamic Efficient Batc…

RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring Open

Khouloud Oueslati, Maxime Lamothe, Foutse Khomh · 2025

Large Language Models (LLMs) have substantially influenced various software engineering tasks. Indeed, in the case of software refactoring, traditional LLMs have shown the ability to reduce development time and enhance code quality. Howeve…

Correction to: Assessing the adoption of security policies by developers in terraform across different cloud providers Open

Alexandre Verdet, Mohammad Hamdaqa, Léuson Da Silva, Foutse Khomh · 2025

FairFLRep: Fairness aware fault localization and repair of Deep Neural Networks Open

Moses Openja, Paolo Arcaini, Foutse Khomh, Fuyuki Ishikawa · 2025

Deep neural networks (DNNs) are being utilized in various aspects of our daily lives, including high-stakes decision-making applications that impact individuals. However, these systems reflect and amplify bias from the data used during tra…

Adversarial attack classification and robustness testing for large language models for code Open

Yang Liu, Armstrong Foundjem, Foutse Khomh, Heng Li · 2025

ReCatcher: Towards LLMs Regression Testing for Code Generation Open

Altaf Allah Abbassi, Léuson Da Silva, Amin Nikanjam, Foutse Khomh · 2025

Large Language Models (LLMs) for code generation evolve rapidly through fine-tuning, merging, or new model releases. However, such updates can introduce regressions, not only in correctness but also in code quality and performance. To addr…

LLMs and Stack Overflow discussions: Reliability, impact, and challenges Open

Léuson Da Silva, Jordan Samhi, Foutse Khomh · 2025

Health data issues in Africa: time for digitization, standardization and harmonization Open

Abdoelnaser Degoot, Ismaël Koné, Shakuntala Baichoo, Mercy Ngungu, Nzisa Liku , et al. · 2025

This commentary discusses health data challenges in Africa, focusing on digitization, standardization, and harmonization as key solutions. It highlights how addressing these foundational issues can enable AI and data science to transform h…

SDLog: A Deep Learning Framework for Detecting Sensitive Information in Software Logs Open

Roozbeh Aghili, Xingfang Wu, Foutse Khomh, Heng Li · 2025

Software logs are messages recorded during the execution of a software system that provide crucial run-time information about events and activities. Although software logs have a critical role in software maintenance and operation tasks, p…

Evaluating and Enhancing Segmentation Model Robustness with Metamorphic Testing Open

Seif Mzoughi, Mohamed Elshafeia, Foutse Khomh · 2025

Image segmentation is critical for applications such as medical imaging, augmented reality, and video surveillance. However, segmentation models often lack robustness, making them vulnerable to adversarial perturbations from subtle image d…

Representation Improvement in Latent Space for Search-Based Testing of Autonomous Robotic Systems Open

Dmytro Humeniuk, Foutse Khomh · 2025

Testing autonomous robotic systems, such as self-driving cars and unmanned aerial vehicles, is challenging due to their interaction with highly unpredictable environments. A common practice is to first conduct simulation-based testing, whi…

MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation Open

Weihao Xuan, Rui Yang, Heli Qi, Qingcheng Zeng, Yin Xiao , et al. · 2025

Existing large language model (LLM) evaluation benchmarks primarily focus on English, while current multilingual tasks lack parallel questions that specifically assess cross-linguistic reasoning abilities. This dual limitation makes it cha…

A Taxonomy of Inefficiencies in LLM-Generated Python Code Open

Altaf Allah Abbassi, Léuson Da Silva, Amin Nikanjam, Foutse Khomh · 2025

Large Language Models (LLMs) are widely adopted for automated code generation with promising results. Although prior research has assessed LLM-generated code and identified various quality issues -- such as redundancy, poor maintainability…

Assessing the adoption of security policies by developers in terraform across different cloud providers Open

Alexandre Verdet, Mohammad Hamdaqa, Léuson Da Silva, Foutse Khomh · 2025

Automated UML Visualization of Software Ecosystems: Tracking Versions, Dependencies, and Security Updates Open

Vladimir Kan, Mathangi LNU, Solomon Berhe, Chandrakala Kari, Marc Maynard , et al. · 2025

MAC: Multi-Agent LLM Coder is All You Need Open

Abhishek Kodati, Foutse Khomh · 2025

MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation Open

Weihao Xuan, Rui Yang, Heli Qi, Qingcheng Zeng, Yunze Xiao , et al. · 2025

Diffusion-Based Adversarial Purification for Intrusion Detection Open

Mohamed Amine Merzouk, Erwan Beurier, Reda Yaich, Nora Cuppens, Frédéric Cuppens , et al. · 2025

Continuously Learning Bug Locations Open

Paulina Stevia Nouwou Mindom, Léuson Da Silva, Amin Nikanjam, Foutse Khomh · 2024

Automatically locating buggy changesets associated with bug reports is crucial in the software development process. Deep Learning (DL)-based techniques show promising results by leveraging structural information from the code and learning …

An Efficient Model Maintenance Approach for MLOps Open

Forough Majidi, Foutse Khomh, Heng Li, Amin Nikanjam · 2024

In recent years, many industries have utilized machine learning (ML) models in their systems. Ideally, ML models should be trained on and applied to data from the same distributions. However, the data evolves over time in many application …

Tracing Optimization for Performance Modeling and Regression Detection Open

Kaveh Shahedi, Heng Li, Maxime Lamothe, Foutse Khomh · 2024

Software performance modeling plays a crucial role in developing and maintaining software systems. A performance model analytically describes the relationship between the performance of a system and its runtime activities. This process typ…

Towards Understanding the Impact of Data Bugs on Deep Learning Models in Software Engineering Open

Mehil B Shah, Mohammad Masudur Rahman, Foutse Khomh · 2024

Deep learning (DL) techniques have achieved significant success in various software engineering tasks (e.g., code completion by Copilot). However, DL systems are prone to bugs from many sources, including training data. Existing literature…

Fault Localization in Deep Learning-based Software: A System-level Approach Open

Mohammad Mehdi Morovati, Amin Nikanjam, Foutse Khomh · 2024

Over the past decade, Deep Learning (DL) has become an integral part of our daily lives. This surge in DL usage has heightened the need for developing reliable DL software systems. Given that fault localization is a critical task in reliab…

Impact of LLM-based Review Comment Generation in Practice: A Mixed Open-/Closed-source User Study Open

Doriane Olewicki, Léuson Da Silva, Suhaib Mujahid, Ali Amini, Benjamin Mah , et al. · 2024

We conduct a large-scale empirical user study in a live setup to evaluate the acceptance of LLM-generated comments and their impact on the review process. This user study was performed in two organizations, Mozilla (which has its codebase …

Towards Optimizing SQL Generation via LLM Routing Open

Mohammadhossein Malekpour, Nicholas J. Shaheen, Foutse Khomh, Amine Mhedhbi · 2024

Text-to-SQL enables users to interact with databases through natural language, simplifying access to structured data. Although highly capable large language models (LLMs) achieve strong accuracy for complex queries, they incur unnecessary …

Trained without My Consent: Detecting Code Inclusion in Language Models Trained on Code Open

Vahid Majdinasab, Amin Nikanjam, Foutse Khomh · 2024

Code auditing ensures that the developed code adheres to standards, regulations, and copyright protection by verifying that it does not contain code from protected sources. The recent advent of Large Language Models (LLMs) as coding assist…

In-Simulation Testing of Deep Learning Vision Models in Autonomous Robotic Manipulators Open

Dmytro Humeniuk, Houssem Ben Braiek, Thomas Reid, Foutse Khomh · 2024

Testing autonomous robotic manipulators is challenging due to the complex software interactions between vision and control components. A crucial element of modern robotic manipulators is the deep learning based object detection model. The …

What Information Contributes to Log-based Anomaly Detection? Insights from a Configurable Transformer-Based Approach Open

Xingfang Wu, Heng Li, Foutse Khomh · 2024

Log data are generated from logging statements in the source code, providing insights into the execution processes of software applications and systems. State-of-the-art log-based anomaly detection approaches typically leverage deep learni…

Understanding Web Application Workloads and Their Applications: Systematic Literature Review and Characterization Open

Roozbeh Aghili, Qiaolin Qin, Heng Li, Foutse Khomh · 2024

Web applications, accessible via web browsers over the Internet, facilitate complex functionalities without local software installation. In the context of web applications, a workload refers to the number of user requests sent by users or …

Protecting Privacy in Software Logs: What Should Be Anonymized? Open

Roozbeh Aghili, Heng Li, Foutse Khomh · 2024

Software logs, generated during the runtime of software systems, are essential for various development and analysis activities, such as anomaly detection and failure diagnosis. However, the presence of sensitive information in these logs p…

Foutse Khomh YOU? Author Swipe