Explanipedia

Automated Creation and Enrichment Framework for Improved Invocation of Enterprise APIs as Tools Open

Prerna Agarwal, Himanshu Gupta, Soujanya Soni, Rohith D Vallam, Renuka Sindhgatta , et al. · 2025

Recent advancements in Large Language Models (LLMs) has lead to the development of agents capable of complex reasoning and interaction with external tools. In enterprise contexts, the effective use of such tools that are often enabled by a…

Quality Assessment of Tabular Data using Large Language Models and Code Generation Open

Ashlesha Akella, Akshar Kaul, Krishnasuri Narayanam, Sameep Mehta · 2025

Reliable data quality is crucial for downstream analysis of tabular datasets, yet rule-based validation often struggles with inefficiency, human intervention, and high computational costs. We present a three-stage framework that combines s…

A Framework for Testing and Adapting REST APIs as LLM Tools Open

Jayachandu Bandlamudi, Ritwik Chaudhuri, Neelamadhav Gantayat, Sambit Ghosh, Kushal Mukherjee , et al. · 2025

Large Language Models (LLMs) are increasingly used to build autonomous agents that perform complex tasks with external tools, often exposed through APIs in enterprise systems. Direct use of these APIs is difficult due to the complex input …

Question-guided Insights Generation for Automated Exploratory Data Analysis Open

Abhijit Manatkar, Ashlesha Akella, Krishnasuri Narayanam, Sameep Mehta · 2025

Computer science

Exploratory Data Analysis (EDA) derives meaningful insights from extensive and complex datasets. This process typically involves a series of analytical operations to identify the patterns within the data. However, the effectiveness of EDA …

LLMGuard: Guarding against Unsafe LLM Behavior Open

Shubh Goyal, Medha Hira, Shubham Kumar Mishra, Sukriti Goyal, Arnav Goel , et al. · 2024

Computer science

Although the rise of Large Language Models (LLMs) in enterprise settings brings new opportunities and capabilities, it also brings challenges, such as the risk of generating inappropriate, biased, or misleading content that violates regula…

xLP: Explainable Link Prediction for Master Data Management Open

Balaji Ganesan, Matheen Ahmed Pasha, Srinivasa Parkala, Neeraj R Singh, Gayatri Mishra , et al. · 2024

Computer science

Explaining neural model predictions to users requires creativity. Especially in enterprise applications, where there are costs associated with users' time, and their trust in the model predictions is critical for adoption. For link predict…

LLMGuard: Guarding Against Unsafe LLM Behavior Open

Shubh Goyal, Medha Hira, Shubham Kumar Mishra, Sukriti Goyal, Arnav Goel , et al. · 2024

Computer science Psychology

Although the rise of Large Language Models (LLMs) in enterprise settings brings new opportunities and capabilities, it also brings challenges, such as the risk of generating inappropriate, biased, or misleading content that violates regula…

"Beware of deception": Detecting Half-Truth and Debunking it through Controlled Claim Editing Open

Sandeep Singamsetty, Nishtha Madaan, Sameep Mehta, Varad Bhatnagar, Pushpak Bhattacharyya · 2023

Computer science Psychology Mathematics

The prevalence of half-truths, which are statements containing some truth but that are ultimately deceptive, has risen with the increasing use of the internet. To help combat this problem, we have created a comprehensive pipeline consistin…

CFL: Causally Fair Language Models Through Token-level Attribute Controlled Generation Open

Rahul Madhavan, Rishabh Garg, Kahini Wadhawan, Sameep Mehta · 2023

Computer science Psychology Biology

We propose a method to control the attributes of Language Models (LMs) for the text generation task using Causal Average Treatment Effect (ATE) scores and counterfactual augmentation. We explore this method, in the context of LM detoxifica…

CFL: Causally Fair Language Models Through Token-level Attribute Controlled Generation Open

Rahul Madhavan, Rishabh Garg, Kahini Wadhawan, Sameep Mehta · 2023

Computer science Biology Philosophy

We propose a method to control the attributes of Language Models (LMs) for the text generation task using Causal Average Treatment Effect (ATE) scores and counterfactual augmentation. We explore this method, in the context of LM detoxifica…

Workshop on Data Fabric for Hybrid Clouds (WDFHC) Open

Yogesh Simmhan, Sameep Mehta · 2022

Computer science Engineering Chemistry

A number of organizations have adopted the hybrid-cloud paradigm to optimize business processes. Hybrid clouds span public and private clouds, different public cloud providers as well as edge and cloud resources. A hybrid cloud architectur…

Toward Scientific Workflows in a Serverless World Open

Aakash Khochare, Yogesh Simmhan, Sameep Mehta, Arvind Agarwal · 2022

Computer science Engineering Business

Serverless computing and FaaS have gained popularity due to their ease of design, deployment, scaling and billing on clouds. However, when used to compose and orchestrate scientific workflows, they pose limitations due to cold starts, mess…

Data Readiness Report Open

Shazia Afzal, C Rajmohan, Manish Kesarwani, Sameep Mehta, Hima Patel · 2021

Computer science Engineering Economics

Data exploration and quality analysis is an important yet tedious process in the AI pipeline. Current practices of data cleaning and data readiness assessment for machine learning tasks are mostly conducted in an arbitrary manner which lim…

Data Quality Toolkit: Automatic assessment of data quality and remediation for machine learning datasets Open

Nitin Gupta, Hima Patel, Shazia Afzal, Naveen Panwar, Ruhi Sharma Mittal , et al. · 2021

Computer science Engineering Biology

The quality of training data has a huge impact on the efficiency, accuracy and complexity of machine learning tasks. Various tools and techniques are available that assess data quality with respect to general cleaning and profiling checks.…

Explainable Link Prediction for Privacy-Preserving Contact Tracing Open

Balaji Ganesan, Hima Patel, Sameep Mehta · 2020

Computer science Medicine Chemistry

Contact Tracing has been used to identify people who were in close proximity to those infected with SARS-Cov2 coronavirus. A number of digital contract tracing applications have been introduced to facilitate or complement physical contact …

Multidimensional Analysis of Trust in News Articles (Student Abstract) Open

Avneet Kaur, Maitree Leekha, Utkarsh Chawla, Ayush Agarwal, Mudit Saxena , et al. · 2020

Computer science Political science Sociology

The advancements in the field of Information Communication Technology have engendered revolutionary changes in the journalism industry, not only on the part of the journalists and the media personnel, but also on the people consuming these…

Fair Transfer of Multiple Style Attributes in Text Open

Karan Dabas, Nishtha Madan, Vijay Arya, Sameep Mehta, Gautam B. Singh , et al. · 2019

Computer science Art Philosophy

To preserve anonymity and obfuscate their identity on online platforms users may morph their text and portray themselves as a different gender or demographic. Similarly, a chatbot may need to customize its communication style to improve en…

Hardening Deep Neural Networks via Adversarial Model Cascades Open

Deepak Vijaykeerthy, Anshuman Suri, Sameep Mehta, Ponnurangam Kumaraguru · 2019

Computer science Engineering Chemistry

Deep neural networks (DNNs) are vulnerable to malicious inputs crafted by an adversary to produce erroneous outputs. Works on securing neural networks against adversarial examples achieve high empirical robustness on simple datasets such a…

FactSheets: Increasing trust in AI services through supplier's declarations of conformity Open

Matthew Arnold, Rachel Bellamy, Michael Hind, Stephanie Houde, Sameep Mehta , et al. · 2019

Business Computer science Political science

Accuracy is an important concern for suppliers of artificial intelligence (AI) services, but considerations beyond accuracy, such as safety (which includes fairness and explainability), security, and provenance, are also critical elements …

Model Extraction Warning in MLaaS Paradigm Open

Manish Kesarwani, Bhaskar Mukhoty, Vijay Arya, Sameep Mehta · 2018

Computer science Economics

Cloud vendors are increasingly offering machine learning services as part of their platform and services portfolios. These services enable the deployment of machine learning models on the cloud that are offered on a pay-per-query basis to …

What is my data worth? From data properties to data value Open

Kalapriya Kannan, Rema Ananthanarayanan, Sameep Mehta · 2018

Computer science Psychology Business

Data today fuels both the economy and advances in machine learning and AI. All aspects of decision making, at the personal and enterprise level and in governments are increasingly data-driven. In this context, however, there are still some…

AI Fairness 360: An Extensible Toolkit for Detecting, Understanding, and Mitigating Unwanted Algorithmic Bias Open

Rachel Bellamy, Kuntal Dey, Michael Hind, Samuel C. Hoffman, Stephanie Houde , et al. · 2018

Computer science Business

Fairness is an increasingly important concern as machine learning models are used to support decision making in high-stakes applications such as mortgage lending, hiring, and prison sentencing. This paper introduces a new open source Pytho…

AI Fairness 360: An Extensible Toolkit for Detecting, Understanding, and\n Mitigating Unwanted Algorithmic Bias Open

Rachel Bellamy, Kuntal Dey, Michael Hind, Samuel C. Hoffman, Stephanie Houde , et al. · 2018

Computer science Business Materials science

Fairness is an increasingly important concern as machine learning models are\nused to support decision making in high-stakes applications such as mortgage\nlending, hiring, and prison sentencing. This paper introduces a new open source\nPy…

Extracting Fairness Policies from Legal Documents Open

Rashmi Nagpal, Chetna Wadhwa, Mallika Gupta, Samiulla Shaikh, Sameep Mehta , et al. · 2018

Computer science Biology Chemistry

Machine Learning community is recently exploring the implications of bias and fairness with respect to the AI applications. The definition of fairness for such applications varies based on their domain of application. The policies governin…

Efficiently Processing Workflow Provenance Queries on SPARK Open

C Rajmohan, Pranay Lohia, Himanshu Gupta, Siddhartha Brahma, Mauricio A. Hernández , et al. · 2018

Computer science Geology Physics

In this paper, we investigate how we can leverage Spark platform for efficiently processing provenance queries on large volumes of workflow provenance data. We focus on processing provenance queries at attribute-value level which is the fi…

Sameep Mehta YOU? Author Swipe