Aswin Rrv

Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter?

Nemika Tyagi, Mihir Parmar, Mohith Kulkarni, Aswin Rrv, Nisarg Patel, et al. · 2024

Computer science Geography

Solving grid puzzles involves a significant amount of logical reasoning. Hence, it is a good domain to evaluate the reasoning capability of a model which can then guide us to improve the reasoning ability of models. However, most existing …

Chaos with Keywords: Exposing Large Language Models Sycophantic Hallucination to Misleading Keywords and Evaluating Defense Strategies

Aswin Rrv, Nemika Tyagi, Md Nayem Uddin, Neeraj Varshney, Chitta Baral · 2024

Computer science Psychology

This study explores the sycophantic tendencies of Large Language Models (LLMs), where these models tend to provide answers that match what users want to hear, even if they are not entirely correct. The motivation behind this exploration st…

Triple Preference Optimization: Achieving Better Alignment using a Single Step Optimization

Amir Saeidi, Shivanshu Verma, Aswin Rrv, Chitta Baral · 2024

Computer science Mathematics

Reinforcement Learning with Human Feedback (RLHF) enhances the alignment of Large Language Models (LLMs). However, its limitations have led to the development of Direct Preference Optimization (DPO), an RL-free approach designed to overcom…
