Justin Olive
YOU?
Author Swipe
View article: Developing and Maintaining an Open-Source Repository of AI Evaluations: Challenges and Insights
Developing and Maintaining an Open-Source Repository of AI Evaluations: Challenges and Insights Open
AI evaluations have become critical tools for assessing large language model capabilities and safety. This paper presents practical insights from eight months of maintaining $inspect\_evals$, an open-source repository of 70+ community-cont…