Aditya V. Nori
YOU?
Author Swipe
View article: RE-IMAGINE: Symbolic Benchmark Synthesis for Reasoning Evaluation
RE-IMAGINE: Symbolic Benchmark Synthesis for Reasoning Evaluation Open
Recent Large Language Models (LLMs) have reported high accuracy on reasoning benchmarks. However, it is still unclear whether the observed results arise from true reasoning or from statistical recall of the training set. Inspired by the la…
View article: Reasoning Elicitation in Language Models via Counterfactual Feedback
Reasoning Elicitation in Language Models via Counterfactual Feedback Open
Despite the increasing effectiveness of language models, their reasoning capabilities remain underdeveloped. In particular, causal reasoning through counterfactual question answering is lacking. This work aims to bridge this gap. We first …
View article: Does Reasoning Emerge? Examining the Probabilities of Causation in Large Language Models
Does Reasoning Emerge? Examining the Probabilities of Causation in Large Language Models Open
Recent advances in AI have been significantly driven by the capabilities of large language models (LLMs) to solve complex problems in ways that resemble human thinking. However, there is an ongoing debate about the extent to which LLMs are…
View article: Cautionary Tales on Synthetic Controls in Survival Analyses
Cautionary Tales on Synthetic Controls in Survival Analyses Open
Synthetic control (SC) methods have gained rapid popularity in economics recently, where they have been applied in the context of inferring the effects of treatments on standard continuous outcomes assuming linear input-output relations. I…
View article: Beyond Words: A Mathematical Framework for Interpreting Large Language Models
Beyond Words: A Mathematical Framework for Interpreting Large Language Models Open
Large language models (LLMs) are powerful AI tools that can generate and comprehend natural language text and other complex information. However, the field lacks a mathematical framework to systematically describe, compare and improve LLMs…
View article: Exploring the Boundaries of GPT-4 in Radiology
Exploring the Boundaries of GPT-4 in Radiology Open
The recent success of general-domain large language models (LLMs) has significantly changed the natural language processing paradigm towards a unified foundation model across domains and applications. In this paper, we focus on assessing t…
View article: Active label cleaning: Improving dataset quality under resource constraints.
Active label cleaning: Improving dataset quality under resource constraints. Open
Imperfections in data annotation, known as label noise, are detrimental to the training of machine learning models and have an often-overlooked confounding effect on the assessment of model performance. Nevertheless, employing experts to r…
View article: Hierarchical Analysis of Visual COVID-19 Features from Chest Radiographs
Hierarchical Analysis of Visual COVID-19 Features from Chest Radiographs Open
Chest radiography has been a recommended procedure for patient triaging and resource management in intensive care units (ICUs) throughout the COVID-19 pandemic. The machine learning efforts to augment this workflow have been long challenge…
View article: Secure Medical Image Analysis with CrypTFlow
Secure Medical Image Analysis with CrypTFlow Open
We present CRYPTFLOW, a system that converts TensorFlow inference code into Secure Multi-party Computation (MPC) protocols at the push of a button. To do this, we build two components. Our first component is an end-to-end compiler from Ten…
View article: Overfitting in Synthesis: Theory and Practice (Extended Version)
Overfitting in Synthesis: Theory and Practice (Extended Version) Open
In syntax-guided synthesis (SyGuS), a synthesizer's goal is to automatically generate a program belonging to a grammar of possible implementations that meets a logical specification. We investigate a common limitation across state-of-the-a…
View article: Robustness of Neural Networks: A Probabilistic and Practical Approach
Robustness of Neural Networks: A Probabilistic and Practical Approach Open
Neural networks are becoming increasingly prevalent in software, and it is therefore important to be able to verify their behavior. Because verifying the correctness of neural networks is extremely challenging, it is common to focus on the…
View article: Adaptive Neural Trees
Adaptive Neural Trees Open
Deep neural networks and decision trees operate on largely separate paradigms; typically, the former performs representation learning with pre-specified architectures, while the latter is characterised by learning hierarchies over pre-spec…
View article: Specification Inference and Invariant Generation: A Machine Learning Perspective
Specification Inference and Invariant Generation: A Machine Learning Perspective Open
Computing good specification and invariants is key to effective and efficient program verification. In this talk, I will describe our experiences in using machine learning techniques (Bayesian inference, SVMs) for computing specifications …
View article: FairSquare: probabilistic verification of program fairness
FairSquare: probabilistic verification of program fairness Open
With the range and sensitivity of algorithmic decisions expanding at a break-neck speed, it is imperative that we aggressively investigate fairness and bias in decision-making programs. First, we show that a number of recently proposed for…
View article: Quantifying Program Bias
Quantifying Program Bias Open
With the range and sensitivity of algorithmic decisions expanding at a break-neck speed, it is imperative that we aggressively investigate whether programs are biased. We propose a novel probabilistic program analysis technique and apply i…
View article: Fairness as a Program Property
Fairness as a Program Property Open
We explore the following question: Is a decision-making program fair, for some useful definition of fairness? First, we describe how several algorithmic fairness questions can be phrased as program verification problems. Second, we discuss…
View article: Debugging Machine Learning Tasks
Debugging Machine Learning Tasks Open
Unlike traditional programs (such as operating systems or word processors) which have large amounts of code, machine learning tasks use programs with relatively small amounts of code (written in machine learning libraries), but voluminous …
View article: Query-guided maximum satisfiability
Query-guided maximum satisfiability Open
We propose a new optimization problem "Q-MaxSAT", an extension of the well-known Maximum Satisfiability or MaxSAT problem. In contrast to MaxSAT, which aims to find an assignment to all variables in the formula, Q-MaxSAT computes an assign…
View article: A Provably Correct Sampler for Probabilistic Programs
A Provably Correct Sampler for Probabilistic Programs Open
We consider the problem of inferring the implicit distribution specified by a probabilistic program. A popular inference technique for probabilistic programs called Markov Chain Monte Carlo or MCMC sampling involves running the program rep…