Explanipedia

OLMoASR: Open Models and Data for Training Robust Speech Recognition Models Open

Huong Ngo, Matt Deitke, Martijn Bartelds, Sarah I. Pratt, Josh Gardner , et al. · 2025

Improvements in training data scale and quality have led to significant advances, yet its influence in speech recognition remains underexplored. In this paper, we present a large-scale dataset, OLMoASR-Pool, and series of models, OLMoASR, …

Language Models Improve When Pretraining Data Matches Target Tasks Open

David Mizrahi, Anders Larsen, J T Allardice, Suzie Petryk, Yuri Gorokhov , et al. · 2025

Every data selection method inherently has a target. In practice, these targets often emerge implicitly through benchmark-driven iteration: researchers develop selection strategies, train models, measure benchmark performance, then refine …

Large Scale Transfer Learning for Tabular Data via Language Modeling Open

Josh Gardner, Juan C. Perdomo, Ludwig Schmidt · 2024

Computer science Geography

Tabular data -- structured, heterogeneous, spreadsheet-style data with rows and columns -- is widely used in practice across many domains. However, while recent foundation models have reduced the need for developing task-specific datasets …

DataComp-LM: In search of the next generation of training sets for language models Open

Jeffrey Li, Alex Chengyu Fang, Georgios Smyrnis, Maor Ivgi, Matt Jordan , et al. · 2024

Computer science Geography

We introduce DataComp for Language Models (DCLM), a testbed for controlled dataset experiments with the goal of improving language models. As part of DCLM, we provide a standardized corpus of 240T tokens extracted from Common Crawl, effect…

Benchmarking Distribution Shift in Tabular Data with TableShift Open

Josh Gardner, Zoran Popović, Ludwig Schmidt · 2023

Computer science Business Geography

Robustness to distribution shift has become a growing concern for text and image models as they transition from research subjects to deployment in the real world. However, high-quality benchmarks for distribution shift in tabular machine l…

LLark: A Multimodal Instruction-Following Language Model for Music Open

Josh Gardner, Simon Durand, Daniel Stoller, Rachel Bittner · 2023

Computer science History

Music has a unique and complex structure which is challenging for both expert humans and existing AI systems to understand, and presents unique challenges relative to other forms of audio. We present LLark, an instruction-tuned multimodal …

VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use Open

Yonatan Bitton, Hritik Bansal, Jack Hessel, Rulin Shao, Wanrong Zhu , et al. · 2023

Computer science Engineering Biology

We introduce VisIT-Bench (Visual InsTruction Benchmark), a benchmark for evaluation of instruction-following vision-language models for real-world use. Our starting point is curating 70 'instruction families' that we envision instruction t…

OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models Open

Anas Awadalla, Irena Gao, Josh Gardner, Jack Hessel, Yusuf Hanafy , et al. · 2023

Computer science Geography Mathematics

We introduce OpenFlamingo, a family of autoregressive vision-language models ranging from 3B to 9B parameters. OpenFlamingo is an ongoing effort to produce an open-source replication of DeepMind's Flamingo models. On seven vision-language …

Subgroup Robustness Grows On Trees: An Empirical Baseline Investigation Open

Josh Gardner, Zoran Popović, Ludwig Schmidt · 2022

Computer science Mathematics Chemistry

Researchers have proposed many methods for fair and robust machine learning, but comprehensive empirical evaluation of their subgroup robustness is lacking. In this work, we address this gap in the context of tabular data, where sensitive …

The Chamber Ensemble Generator: Limitless High-Quality MIR Data via Generative Modeling Open

Yusong Wu, Josh Gardner, Ethan Manilow, Ian Simon, Curtis Hawthorne , et al. · 2022

Computer science Physics

Data is the lifeblood of modern machine learning systems, including for those in Music Information Retrieval (MIR). However, MIR has long been mired by small datasets and unreliable labels. In this work, we propose to break this bottleneck…

Multi-instrument Music Synthesis with Spectrogram Diffusion Open

Curtis Hawthorne, Ian Simon, Adam Roberts, Neil Zeghidour, Josh Gardner , et al. · 2022

Computer science Physics

An ideal music synthesizer should be both interactive and expressive, generating high-fidelity audio in realtime for arbitrary combinations of instruments and notes. Recent neural synthesizers have exhibited a tradeoff between domain-speci…

MT3: Multi-Task Multitrack Music Transcription Open

Josh Gardner, Ian Simon, Ethan Manilow, Curtis Hawthorne, Jesse Engel · 2021

Computer science Economics Philosophy

Automatic Music Transcription (AMT), inferring musical notes from raw audio, is a challenging task at the core of music understanding. Unlike Automatic Speech Recognition (ASR), which typically focuses on the words of a single speaker, AMT…

Engineering Design Of Musical Instruments As A Context For Math Physics And Technical Writing In A Freshman Learning Community Course Open

Robert Culbertson, Michael Oehrtman, Janice Thompson, Josh Gardner, Christopher Mehrens , et al. · 2020

Computer science Engineering Mathematics

NOTE: The first page of text has been automatically extracted and included below in lieu of an abstract Engineering Design of Musical Instruments as a Context for Math, Physics and Technical Writing in a Freshman Learning Community Course …

Driving with Data in the Motor City: Mining and Modeling Vehicle Fleet Maintenance Data Open

Josh Gardner, Jawad Mroueh, Natalia Jenuwine, N. Weaverdyck, Samuel Krassenstein , et al. · 2020

Computer science Engineering Geography

The City of Detroit maintains an active fleet of over 2500 vehicles, spending an annual average of over \$5 million on purchases and over \$7.7 million on maintenance. Modeling patterns and trends in this data is of particular importance t…

MORF: A Framework for Predictive Modeling and Replication At Scale With Privacy-Restricted MOOC Data Open

Josh Gardner, Christopher Brooks, Juan Miguel L. Andres, Ryan S. Baker · 2018

Computer science Engineering Physics

Big data repositories from online learning platforms such as Massive Open Online Courses (MOOCs) represent an unprecedented opportunity to advance research on education at scale and impact a global population of learners. To date, such res…

Beyond A/B Testing: Sequential Randomization for Developing Interventions in Scaled Digital Learning Environments Open

Timothy NeCamp, Josh Gardner, Christopher Brooks · 2018

Computer science Psychology Medicine

Randomized experiments ensure robust causal inference that are critical to effective learning analytics research and practice. However, traditional randomized experiments, like A/B tests, are limiting in large scale digital learning enviro…

Evaluating Predictive Models of Student Success: Closing the Methodological Gap Open

Josh Gardner, Christopher Brooks · 2018

Computer science Mathematics Political science

Model evaluation – the process of making inferences about the performance of predictive models – is a critical component of predictive model-ing research in learning analytics. In this work, we present an overview of the state-of-the-pract…

Enabling End-To-End Machine Learning Replicability: A Case Study in Educational Data Mining Open

Josh Gardner, Yuming Yang, Ryan S. Baker, Christopher Brooks · 2018

Computer science Biology Mathematics

The use of machine learning techniques has expanded in education research, driven by the rich data from digital learning environments and institutional data warehouses. However, replication of machine learned models in the domain of the le…

Dropout Model Evaluation in MOOCs Open

Josh Gardner, Christopher Brooks · 2018

Computer science Mathematics Philosophy

The field of learning analytics needs to adopt a more rigorous approach for predictive model evaluation that matches the complex practice of model-building. In this work, we present a procedure to statistically test hypotheses about model …

MORF: A Framework for MOOC Predictive Modeling and Replication At Scale. Open

Josh Gardner, Christopher Brooks, Juan Miguel L. Andres, Ryan S. Baker · 2018

Computer science

The MOOC Replication Framework (MORF) is a novel software system for feature extraction, model training/testing, and evaluation of predictive dropout models in Massive Open Online Courses (MOOCs). MORF makes large-scale replication of comp…

Driving with Data: Modeling and Forecasting Vehicle Fleet Maintenance in Detroit Open

Josh Gardner, Danai Koutra, Jawad Mroueh, V. Pang, Arya Farahi , et al. · 2017

Computer science Engineering Business

The City of Detroit maintains an active fleet of over 2500 vehicles, spending an annual average of over \$5 million on new vehicle purchases and over \$7.7 million on maintaining this fleet. Understanding the existence of patterns and tren…

Josh Gardner YOU? Author Swipe