Sam Havens
FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents
We introduce FreshStack, a holistic framework for automatically building information retrieval (IR) evaluation benchmarks by incorporating challenging questions and answers. FreshStack conducts the following steps: (1) automatic corpus col…
Long Context RAG Performance of Large Language Models
Retrieval Augmented Generation (RAG) has emerged as a crucial technique for enhancing the accuracy of Large Language Models (LLMs) by incorporating external information. With the advent of LLMs that support increasingly longer context leng…
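As a rough sketch of the RAG pattern the abstract refers to (retrieve external passages, then condition the LLM on them), the snippet below assembles a prompt from retrieved documents. The Document structure, function name, and prompt wording are illustrative assumptions, not the article's setup; the max_docs knob is where longer context windows let more passages be included.

```python
from dataclasses import dataclass

@dataclass
class Document:
    doc_id: str
    text: str
    score: float  # retrieval relevance score

def build_rag_prompt(question: str, retrieved: list[Document], max_docs: int = 5) -> str:
    """Assemble a prompt that grounds the model in retrieved passages.

    With long-context models, max_docs can be raised so more passages fit,
    which is the trade-off the article examines.
    """
    top = sorted(retrieved, key=lambda d: d.score, reverse=True)[:max_docs]
    context = "\n\n".join(f"[{i + 1}] {doc.text}" for i, doc in enumerate(top))
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )
```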
LoRA Learns Less and Forgets Less
Low-Rank Adaptation (LoRA) is a widely-used parameter-efficient finetuning method for large language models. LoRA saves memory by training only low rank perturbations to selected weight matrices. In this work, we compare the performance of…
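As a minimal illustration of the parameterization the abstract describes (a frozen weight matrix plus a trainable low-rank perturbation), here is a PyTorch sketch; the class name, rank, and scaling defaults are assumptions for illustration, not the paper's implementation.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """A frozen linear layer augmented with a trainable low-rank update."""

    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # only the low-rank factors are trained

        in_f, out_f = base.in_features, base.out_features
        # A is small random, B is zero, so the perturbation starts at zero
        self.lora_a = nn.Parameter(torch.randn(rank, in_f) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(out_f, rank))
        self.scaling = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # W x + (alpha / r) * B A x
        return self.base(x) + self.scaling * (x @ self.lora_a.T @ self.lora_b.T)
```

Because only lora_a and lora_b receive gradients, optimizer state is kept for far fewer parameters than full finetuning, which is the memory saving the abstract mentions.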
MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining
Although BERT-style encoder models are heavily used in NLP research, many researchers do not pretrain their own BERTs from scratch due to the high cost of training. In the past half-decade since BERT first rose to prominence, many advances…
LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms
Large Language Models are traditionally finetuned on large instruction datasets. However, recent studies suggest that small, high-quality datasets can suffice for general-purpose instruction following. This lack of consensus surrounding fin…