Explanipedia

MM-Telco: Benchmarks and Multimodal Large Language Models for Telecom Applications Open

Gagan Gupta, Manish Rai, Atreyi Chakraborty, Ashutosh Modi, Soumajit Pramanik , et al. · 2025

Large Language Models (LLMs) have emerged as powerful tools for automating complex reasoning and decision-making tasks. In telecommunications, they hold the potential to transform network optimization, automate troubleshooting, enhance cus…

IL-PCSR: Legal Corpus for Prior Case and Statute Retrieval Open

Shounak Paul, Dhananjay Ghumare, Pawan Goyal, Saptarshi Ghosh, Ashutosh Modi · 2025

Identifying/retrieving relevant statutes and prior cases/precedents for a given legal situation are common tasks exercised by law practitioners. Researchers to date have addressed the two tasks independently, thus developing completely dif…

POSESTITCH-SLT: Linguistically Inspired Pose-Stitching for End-to-End Sign Language Translation Open

Abhinav Joshi, Vaibhav Sharma, Sukwinder Singh, Ashutosh Modi · 2025

Sign language translation remains a challenging task due to the scarcity of large-scale, sentence-aligned datasets. Prior arts have focused on various feature extraction and architectural changes to support neural machine translation for s…

Calibration Across Layers: Understanding Calibration Evolution in LLMs Open

Abhinav Joshi, Areeb Ahmad, Ashutosh Modi · 2025

Large Language Models (LLMs) have demonstrated inherent calibration capabilities, where predicted probabilities align well with correctness, despite prior findings that deep neural networks are often overconfident. Recent studies have link…

CoMuMDR: Code-mixed Multi-modal Multi-domain corpus for Discourse paRsing in conversations Open

Divyaksh Shukla, Ritesh Baviskar, Dwijesh Gohil, Aniket Tiwari, Atul Shree , et al. · 2025

Discourse parsing is an important task useful for NLU applications such as summarization, machine comprehension, and emotion recognition. The current discourse parsing datasets based on conversations consists of written English dialogues r…

Towards Quantifying Commonsense Reasoning with Mechanistic Insights Open

Abhinav Joshi, Areeb Ahmad, D. K. Shukla, Ashutosh Modi · 2025

Commonsense reasoning deals with the implicit knowledge that is well understood by humans and typically acquired via interactions with the world. In recent times, commonsense reasoning and understanding of various LLMs have been evaluated …

IL-PCSR: Legal Corpus for Prior Case and Statute Retrieval Open

Shounak Paul, Dhananjay Ghumare, Pawan Goyal, Saptarshi Ghosh, Ashutosh Modi · 2025

LoRMA: Low-Rank Multiplicative Adaptation for LLMs Open

Harsh Bihany, Sanjay Patel, Ashutosh Modi · 2025

Towards Quantifying Commonsense Reasoning with Mechanistic Insights Open

Abhinav Joshi, Areeb Ahmad, D. K. Shukla, Ashutosh Modi · 2025

CoMuMDR: Code-mixed Multi-modal Multi-domain corpus for Discourse paRsing in conversations Open

Divyaksh Shukla, Ritesh Baviskar, Dwijesh Gohil, Aniket Tiwari, Atul Shree , et al. · 2025

Calibration Across Layers: Understanding Calibration Evolution in LLMs Open

Abhinav Joshi, Areeb Ahmad, Ashutosh Modi · 2025

PoseStitch-SLT: Linguistically Inspired Pose-Stitching for End-to-End Sign Language Translation Open

Abhinav Joshi, Vaibhav Sharma, Sukwinder Singh, Ashutosh Modi · 2025

COLD: Causal reasOning in cLosed Daily activities Open

Abhinav Joshi, Areeb Ahmad, Ashutosh Modi · 2024

Large Language Models (LLMs) have shown state-of-the-art performance in a variety of tasks, including arithmetic and reasoning; however, to gauge the intellectual capabilities of LLMs, causal reasoning has become a reliable proxy for valid…

Towards Robust Evaluation of Unlearning in LLMs via Data Transformations Open

Abhinav Joshi, Sujit Saha, D. K. Shukla, Sriram Vema, Harsh Jhamtani , et al. · 2024

Large Language Models (LLMs) have shown to be a great success in a wide range of applications ranging from regular NLP-based use cases to AI agents. LLMs have been trained on a vast corpus of texts from various sources; despite the best ef…

Generation and De-Identification of Indian Clinical Discharge Summaries using LLMs Open

Sanjeet Singh, Shreya Gupta, Niralee Gupta, Naimish Sharma, Lokesh Srivastava , et al. · 2024

The consequences of a healthcare data breach can be devastating for the patients, providers, and payers. The average financial impact of a data breach in recent months has been estimated to be close to USD 10 million. This is especially si…

IL-TUR: Benchmark for Indian Legal Text Understanding and Reasoning Open

Abhinav Joshi, Shounak Paul, Akshat Sharma, Pawan Goyal, Saptarshi Ghosh , et al. · 2024

Legal systems worldwide are inundated with exponential growth in cases and documents. There is an imminent need to develop NLP and ML techniques for automatically processing and understanding legal documents to streamline the legal system.…

iSign: A Benchmark for Indian Sign Language Processing Open

Abhinav Joshi, Romit Mohanty, Mounika Kanakanti, A. Mangla, Sudeep Choudhary , et al. · 2024

Indian Sign Language has limited resources for developing machine learning and data-driven approaches for automated language processing. Though text/audio-based language processing techniques have shown colossal research interest and treme…

BookSQL: A Large Scale Text-to-SQL Dataset for Accounting Domain Open

Rahul Kumar, Amar Raja Dibbu, Shrutendra Harsola, Vignesh Subrahmaniam, Ashutosh Modi · 2024

Several large-scale datasets (e.g., WikiSQL, Spider) for developing natural language interfaces to databases have recently been proposed. These datasets cover a wide breadth of domains but fall short on some essential domains, such as fina…

IITK at SemEval-2024 Task 4: Hierarchical Embeddings for Detection of Persuasion Techniques in Memes Open

Shreenaga Chikoti, Shrey Mehta, Ashutosh Modi · 2024

Memes are one of the most popular types of content used in an online disinformation campaign. They are primarily effective on social media platforms since they can easily reach many users. Memes in a disinformation campaign achieve their g…

IITK at SemEval-2024 Task 2: Exploring the Capabilities of LLMs for Safe Biomedical Natural Language Inference for Clinical Trials Open

Shreyasi Mandal, Ashutosh Modi · 2024

Large Language models (LLMs) have demonstrated state-of-the-art performance in various natural language processing (NLP) tasks across multiple domains, yet they are prone to shortcut learning and factual inconsistencies. This research inve…

IITK at SemEval-2024 Task 10: Who is the speaker? Improving Emotion Recognition and Flip Reasoning in Conversations via Speaker Embeddings Open

Shubham Patel, Divyaksh Shukla, Ashutosh Modi · 2024

This paper presents our approach for the SemEval-2024 Task 10: Emotion Discovery and Reasoning its Flip in Conversations. For the Emotion Recognition in Conversations (ERC) task, we utilize a masked-memory network along with speaker partic…

IITK at SemEval-2024 Task 1: Contrastive Learning and Autoencoders for Semantic Textual Relatedness in Multilingual Texts Open

Udvas Basak, Rajarshi Dutta, Shivam Pandey, Ashutosh Modi · 2024

This paper describes our system developed for the SemEval-2024 Task 1: Semantic Textual Relatedness. The challenge is focused on automatically detecting the degree of relatedness between pairs of sentences for 14 languages including both h…

Towards Measuring and Modeling "Culture" in LLMs: A Survey Open

Muhammad Farid Adilazuarda, Sagnik Mukherjee, Pradhyumna Lavania, Siddhant Singh, Ashutosh Dwivedi , et al. · 2024

We present a survey of more than 90 recent papers that aim to study cultural representation and inclusion in large language models (LLMs). We observe that none of the studies explicitly define "culture, which is a complex, multifaceted con…

ScriptWorld: Text Based Environment for Learning Procedural Knowledge Open

Abhinav Joshi, Areeb Ahmad, Umang Pandey, Ashutosh Modi · 2023

Text-based games provide a framework for developing natural language understanding and commonsense knowledge about the world in reinforcement learning based agents. Existing text-based environments often rely on fictional situations and ch…

ISLTranslate: Dataset for Translating Indian Sign Language Open

Abhinav Joshi, Susmit Agrawal, Ashutosh Modi · 2023

Sign languages are the primary means of communication for many hard-of-hearing people worldwide. Recently, to bridge the communication gap between the hard-of-hearing community and the rest of the population, several sign language translat…

U-CREAT: Unsupervised Case Retrieval using Events extrAcTion Open

Abhinav Joshi, Akshat Sharma, Sai Kiran Tanikella, Ashutosh Modi · 2023

The task of Prior Case Retrieval (PCR) in the legal domain is about automatically citing relevant (based on facts and precedence) prior legal cases in a given query case. To further promote research in PCR, in this paper, we propose a new …

ScriptWorld: Text Based Environment For Learning Procedural Knowledge Open

Abhinav Joshi, Areeb Ahmad, Umang Pandey, Ashutosh Modi · 2023

Text-based games provide a framework for developing natural language understanding and commonsense knowledge about the world in reinforcement learning based agents. Existing text-based environments often rely on fictional situations and ch…

SemEval 2023 Task 6: LegalEval - Understanding Legal Texts Open

Ashutosh Modi, Prathamesh Kalamkar, Saurabh Karn, Aman Tiwari, Abhinav Joshi , et al. · 2023

In populous countries, pending legal cases have been growing exponentially. There is a need for developing NLP-based techniques for processing and automatically understanding legal documents. To promote research in the area of Legal NLP we…

ISLTranslate: Dataset for Translating Indian Sign Language Open

Abhinav Joshi, Susmit Agrawal, Ashutosh Modi · 2023

Sign languages are the primary means of communication for many hard-of-hearing people worldwide. Recently, to bridge the communication gap between the hard-of-hearing community and the rest of the population, several sign language translat…

SemEval-2023 Task 6: LegalEval - Understanding Legal Texts Open

Ashutosh Modi, Prathamesh Kalamkar, Saurabh Karn, Aman Tiwari, Abhinav Joshi , et al. · 2023

Ashutosh Modi, Prathamesh Kalamkar, Saurabh Karn, Aman Tiwari, Abhinav Joshi, Sai Kiran Tanikella, Shouvik Kumar Guha, Sachin Malhan, Vivek Raghavan. Proceedings of the The 17th International Workshop on Semantic Evaluation (SemEval-2023).…

Ashutosh Modi YOU? Author Swipe