Keith Tyser YOU? Author Swipe

Last 10y

Open Invitation to Help Curate This Field & Enhance Impact .ORG

AI-Driven Review Systems: Evaluating LLMs in Scalable and Bias-Aware Academic Reviews Open

Keith Tyser, Ben Segev, Gaston Longhitano, Xinyu Zhang, Zachary Meeks , et al. · 2024

Computer science Political science

Automatic reviewing helps handle a large volume of papers, provides early feedback and quality control, reduces bias, and allows the analysis of trends. We evaluate the alignment of automatic paper reviews with human reviews using an arena…

From Human Days to Machine Seconds: Automatically Answering and Generating Machine Learning Final Exams Open

Iddo Drori, Sarah J. Zhang, Reece Shuttleworth, Sarah Zhang, Keith Tyser , et al. · 2023

Computer science Economics Geography

A final exam in machine learning at a top institution such as MIT, Harvard, or Cornell typically takes faculty days to write, and students hours to solve. We demonstrate that large language models pass machine learning finals at a human le…

Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models Open

Sarah Zhang, Samuel Florin, Ariel N. Lee, Eamon Niknafs, Andrei Marginean , et al. · 2023

Computer science Mathematics Engineering

We curate a comprehensive dataset of 4,550 questions and solutions from problem sets, midterm exams, and final exams across all MIT Mathematics and Electrical Engineering and Computer Science (EECS) courses required for obtaining a degree.…

Human Evaluation of Text-to-Image Models on a Multi-Task Benchmark Open

Vitali Petsiuk, Alexander E. Siemenn, Saisamrit Surbehera, Zad Chin, Keith Tyser , et al. · 2022

Computer science Geography Mathematics

We provide a new multi-task benchmark for evaluating text-to-image models. We perform a human evaluation comparing the most common open-source (Stable Diffusion) and commercial (DALL-E 2) models. Twenty computer science AI graduate student…

Creating related items for first view…