Explanipedia

Privacy Preserving In-Context-Learning Framework for Large Language Models Open

Bishnu Bhusal, Manoj Acharya, Colin Samplawski, Adam D. Cobb, Susmit Jha · 2025

Large language models (LLMs) have significantly transformed natural language understanding and generation, but they raise privacy concerns due to potential exposure of sensitive information. Studies have highlighted the risk of information…

Spatio-Temporal Pruning for Compressed Spiking Large Language Models Open

Yi Jiang, Malyaban Bal, Brian Matejek, Susmit Jha, Adam D. Cobb, et al. · 2025

Large Language Models (LLMs) present significant challenges for deployment in energy-constrained environments due to their large model sizes and high inference latency. Spiking Neural Networks (SNNs), inspired by the sparse event-driven ne…

Predicting Secure Messaging Traffic in Clinical Settings Open

Laura Rosa Baratta, Sunny S. Lou, Thomas Kannampallil, Susmit Jha, Anirban Roy, et al. · 2025

Asynchronous text-based communication, secure messaging, has become one of the preferred modes of communication despite its potential to disrupt workflow and increase burden. Identifying peak communication can guide interventions to reduce…

Scalable Bayesian Low-Rank Adaptation of Large Language Models via Stochastic Variational Subspace Inference Open

Colin Samplawski, Adam D. Cobb, Manoj Acharya, Ramneet Kaur, Susmit Jha · 2025

Despite their widespread use, large language models (LLMs) are known to hallucinate incorrect information and be poorly calibrated. This makes the uncertainty quantification of these models of critical importance, especially in high-stakes…

On the Evaluation of Engineering Artificial General Intelligence Open

Sandeep Neema, Susmit Jha, Adam Nagel, Chandrasekar Sureshkumar, Aleksa Gordic, et al. · 2025

We discuss the challenges and propose a framework for evaluating engineering artificial general intelligence (eAGI) agents. We consider eAGI as a specialization of artificial general intelligence (AGI), deemed capable of addressing a broad…

Safety Monitoring for Learning-Enabled Cyber-Physical Systems in Out-of-Distribution Scenarios Open

Vivian Lin, Ramneet Kaur, Yahan Yang, Souradeep Dutta, Yiannis Kantaros, et al. · 2025

Calibrating Uncertainty Quantification of Multi-Modal LLMs using Grounding Open

Trilok Padhi, Ramneet Kaur, Adam D. Cobb, Manoj Acharya, Anirban Roy, et al. · 2025

We introduce a novel approach for calibrating uncertainty quantification (UQ) tailored for multi-modal large language models (LLMs). Existing state-of-the-art UQ methods rely on consistency among multiple responses generated by the LLM on …

AGENT: An Aerial Vehicle Generation and Design Tool Using Large Language Models Open

Colin Samplawski, Adam D. Cobb, Susmit Jha · 2025

Computer-aided design (CAD) is a promising application area for emerging artificial intelligence methods. Traditional workflows for cyberphysical systems create detailed digital models which can be evaluated by physics simulators in order …

TeleLoRA: Teleporting Model-Specific Alignment Across LLMs Open

Xiao Lin, Manoj Acharya, Anirban Roy, Susmit Jha · 2025

Mitigating Trojans in Large Language Models (LLMs) is one of many tasks where alignment data is LLM specific, as different LLMs have different Trojan triggers and trigger behaviors to be removed. In this paper, we introduce TeleLoRA (Telep…

Debugging and Runtime Analysis of Neural Networks with VLMs (A Case Study) Open

Boyue Caroline Hu, Divya Gopinath, Corina S. Pasareanu, Nina Narodytska, Ravi Mangal, et al. · 2025

Debugging of Deep Neural Networks (DNNs), particularly vision models, is very challenging due to the complex and opaque decision-making processes in these networks. In this paper, we explore multi-modal Vision-Language Models (VLMs), such …

Calibration and Correctness of Language Models for Code ICSE Artifact Open

C. Katharina Spieß, David Gros, Kunal Suresh Pai, Michael Pradel, Md Rafiqul Islam Rabin, et al. · 2025

Machine learning models are widely used, but can also often be wrong. Users would benefit from a reliable indication of whether a given output from a given model should be trusted, so a rational decision can be made whether to use the outp…

Polysemantic Dropout: Conformal OOD Detection for Specialized LLMs Open

Ayush Gupta, Ramneet Kaur, Anirban Roy, Adam D. Cobb, Rama Chellappa, et al. · 2025

Addressing Uncertainty in LLMs to Enhance Reliability in Generative AI Open

Ramneet Kaur, Colin Samplawski, Adam D. Cobb, Anirban Roy, Brian Matejek, et al. · 2024

In this paper, we present a dynamic semantic clustering approach inspired by the Chinese Restaurant Process, aimed at addressing uncertainty in the inference of Large Language Models (LLMs). We quantify uncertainty of an LLM on a given que…

Second-Order Forward-Mode Automatic Differentiation for Optimization Open

Adam D. Cobb, Atılım Güneş Baydin, Barak A. Pearlmutter, Susmit Jha · 2024

This paper introduces a second-order hyperplane search, a novel optimization step that generalizes a second-order line search from a line to a $k$-dimensional hyperplane. This, combined with the forward-mode stochastic gradient method, yie…

Concept-based Analysis of Neural Networks via Vision-Language Models Open

Ravi Mangal, Nina Narodytska, Divya Gopinath, Boyue Caroline Hu, Anirban Roy, et al. · 2024

The analysis of vision-based deep neural networks (DNNs) is highly desirable but it is very challenging due to the difficulty of expressing formal specifications for vision tasks and the lack of efficient verification procedures. In this p…

Task-Agnostic Detector for Insertion-Based Backdoor Attacks Open

Weimin Lyu, Xiao Lin, Songzhu Zheng, Lu Pang, Haibin Ling, et al. · 2024

Textual backdoor attacks pose significant security threats. Current detection approaches, typically relying on intermediate feature representation or reconstructing potential triggers, are task-specific and less effective beyond sentence c…

Non-Markovian Quantum Control via Model Maximum Likelihood Estimation and Reinforcement Learning Open

Tanmay Neema, Susmit Jha, Tuhin Sahai · 2024

Reinforcement Learning (RL) techniques have been increasingly applied in optimizing control systems. However, their application in quantum systems is hampered by the challenge of performing closed-loop control due to the difficulty in meas…

Direct Amortized Likelihood Ratio Estimation Open

Adam D. Cobb, Brian Matejek, Daniel Elenius, Anirban Roy, Susmit Jha · 2023

We introduce a new amortized likelihood ratio estimator for likelihood-free simulation-based inference (SBI). Our estimator is simple to train and estimates the likelihood ratio using a single forward pass of the neural estimator. Our appr…

math-PVS: A Large Language Model Framework to Map Scientific Publications to PVS Theories Open

Hassen Saïdi, Susmit Jha, Tuhin Sahai · 2023

As artificial intelligence (AI) gains greater adoption in a wide variety of applications, it has immense potential to contribute to mathematical discovery, by guiding conjecture generation, constructing counterexamples, assisting in formal…

Neuro Symbolic Reasoning for Planning: Counterexample Guided Inductive Synthesis using Large Language Models and Satisfiability Solving Open

Sumit Kumar Jha, Susmit Jha, Patrick Lincoln, Nathaniel D. Bastian, Alvaro Velasquez, et al. · 2023

Generative large language models (LLMs) with instruct training such as GPT-4 can follow human-provided instruction prompts and generate human-like responses to these prompts. Apart from natural language responses, they have also been found…

Neural Stochastic Differential Equations for Robust and Explainable Analysis of Electromagnetic Unintended Radiated Emissions Open

Sumit Kumar Jha, Susmit Jha, Rickard Ewetz, Alvaro Velasquez · 2023

We present a comprehensive evaluation of the robustness and explainability of ResNet-like models in the context of Unintended Radiated Emission (URE) classification and suggest a new approach leveraging Neural Stochastic Differential Equat…

TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models Open

Indranil Sur, Karan Sikka, Matthew Walmer, Kaushik Koneripalli, Anirban Roy, et al. · 2023

We present a Multimodal Backdoor Defense technique TIJO (Trigger Inversion using Joint Optimization). Recent work arXiv:2112.07668 has demonstrated successful backdoor attacks on multimodal models for the Visual Question Answering task. Th…

AircraftVerse: A Large-Scale Multimodal Dataset of Aerial Vehicle Designs Open

Adam D. Cobb, Anirban Roy, Daniel Elenius, Frederick M. Heim, Brian Swenson, et al. · 2023

Dataset accompanying the code and paper "AircraftVerse: A Large-Scale Multimodal Dataset of Aerial Vehicle Designs". We present AircraftVerse, a publicly available aerial vehicle design dataset…

AircraftVerse: A Large-Scale Multimodal Dataset of Aerial Vehicle Designs Open

Adam D. Cobb, Anirban Roy, Daniel Elenius, Frederick M. Heim, Brian Swenson, et al. · 2023

We present AircraftVerse, a publicly available aerial vehicle design dataset. Aircraft design encompasses different physics domains and, hence, multiple modalities of representation. The evaluation of these cyber-physical system (CPS) desi…

Measuring Classification Decision Certainty and Doubt Open

Alexander Berenbeim, Iain J. Cruickshank, Susmit Jha, Robert Thomson, Nathaniel D. Bastian · 2023

Quantitative characterizations and estimations of uncertainty are of fundamental importance in optimization and decision-making processes. Herein, we propose intuitive scores, which we call certainty and doubt, that can be used in both a B…

On the Robustness of AlphaFold: A COVID-19 Case Study Open

Ismail Alkhouri, Sumit Kumar Jha, Andre Beckus, George Atia, Alvaro Velasquez, et al. · 2023

Protein folding neural networks (PFNNs) such as AlphaFold predict remarkably accurate structures of proteins compared to other approaches. However, the robustness of such networks has heretofore not been explored. This is particularly rele…

Principles of Robust Learning and Inference for IoBTs Open

Nathaniel D. Bastian, Susmit Jha, Paulo Tabuada, Venugopal V. Veeravalli, Gunjan Verma · 2022

The Internet of Battlefield Things (IoBTs) operate in an adversarial rapidly-evolving environment, necessitating fast, robust and resilient decision-making. The success of machine learning, in particular deep learning methods, can improve …

Design of Unmanned Air Vehicles Using Transformer Surrogate Models Open

Adam D. Cobb, Anirban Roy, Daniel Elenius, Susmit Jha · 2022

Computer-aided design (CAD) is a promising new area for the application of artificial intelligence (AI) and machine learning (ML). The current practice of design of cyber-physical systems uses the digital twin methodology, wherein the actu…

CODiT: Conformal Out-of-Distribution Detection in Time-Series Data Open

Ramneet Kaur, Kaustubh Sridhar, Sangdon Park, Susmit Jha, Anirban Roy, et al. · 2022

Machine learning models are prone to making incorrect predictions on inputs that are far from the training distribution. This hinders their deployment in safety-critical applications such as autonomous vehicles and healthcare. The detectio…

Inferring and Conveying Intentionality: Beyond Numerical Rewards to Logical Intentions Open

Susmit Jha, John Rushby · 2022

Shared intentionality is a critical component in developing conscious AI agents capable of collaboration, self-reflection, deliberation, and reasoning. We formulate inference of shared intentionality as an inverse reinforcement learning pr…
