Exploring Robust Overfitting for Pre-trained Language Models

Exploring foci of: doi.org Exploring Robust Overfitting for Pre-trained Language Models January 2023 • Bin Zhu, Yanghui Rao We identify the robust overfitting issue for pre-trained language models by showing that the robust test loss increases as the epoch grows. Through comprehensive exploration of the robust loss on the training set, we attribute robust overfitting to the model’s memorization of the adversarial training data. We attempt to mitigate robust overfitting by combining regularization methods with adversarial training. Following the philosophy that prevents the model from memorizing the adversarial data, we find that floodi… Open Article Page

Overfitting Computer Science Artificial Intelligence Machine Learning Training, Validation, And Test Data Sets Chemistry Biochemistry Open Article