arXiv (Cornell University)
Towards Robust Evaluation of Unlearning in LLMs via Data Transformations
November 2024 • Abhinav Joshi, Sujit Saha, D. K. Shukla, Sriram Vema, Harsh Jhamtani, Manas Gaur, Ashutosh Modi
Large Language Models (LLMs) have shown to be a great success in a wide range of applications ranging from regular NLP-based use cases to AI agents. LLMs have been trained on a vast corpus of texts from various sources; despite the best efforts during the data pre-processing stage while training the LLMs, they may pick some undesirable information such as personally identifiable information (PII). Consequently, in recent times research in the area of Machine Unlearning (MUL) has become active, the main idea is to …