Benjamin Spector
Trivalence and Transparency: a non-dynamic approach to anaphora
ThunderKittens: Simple, Fast, and Adorable AI Kernels
The challenge of mapping AI architectures to GPU hardware is creating a critical bottleneck in AI progress. Despite substantial efforts, hand-written custom kernels fail to meet their theoretical performance thresholds, even on well-establ…
LoLCATs: On Low-Rank Linearizing of Large Language Models
Recent works show we can linearize large language models (LLMs) -- swapping the quadratic attentions of popular Transformer-based LLMs with subquadratic analogs, such as linear attention -- avoiding the expensive pretraining costs. However…
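The core substitution described in the snippet can be illustrated with a minimal sketch. This is not the paper's method (LoLCATs additionally trains feature maps to mimic softmax attention and adapts the model with low-rank updates); here phi is just a placeholder positive feature map used to show why the swap removes the quadratic cost.

import numpy as np

def softmax_attention(Q, K, V):
    # Standard attention: the n x n score matrix makes this O(n^2) in length n.
    scores = Q @ K.T / np.sqrt(Q.shape[-1])
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)
    return w @ V

def linear_attention(Q, K, V, phi=lambda x: np.maximum(x, 0.0) + 1e-6):
    # Replace exp(q . k) with phi(q) . phi(k): the d x d key-value summary is
    # built once, so the cost is O(n * d^2) rather than O(n^2 * d).
    Qf, Kf = phi(Q), phi(K)
    kv = Kf.T @ V                 # (d, d) summary of keys and values
    z = Kf.sum(axis=0)            # (d,) normalizer
    return (Qf @ kv) / (Qf @ z)[:, None]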
Just read twice: closing the recall gap for recurrent language models
Recurrent large language models that compete with Transformers in language modeling perplexity are emerging at a rapid rate (e.g., Mamba, RWKV). Excitingly, these architectures use a constant amount of memory during inference. However, due…
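Read literally, the title points to a prompting-level fix for fixed-memory models: show the context a second time so the recurrent state gets another chance to store what turns out to matter. A toy sketch of that idea follows; the function name and prompt layout are illustrative assumptions, not the paper's exact recipe.

def read_twice_prompt(context: str, question: str) -> str:
    # Repeat the context before the question so a constant-memory recurrent
    # model revisits the relevant facts instead of having to guess, on a
    # single pass, which of them to keep in its state.
    return f"{context}\n\n{context}\n\n{question}"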
Explaining vague language
Why is language vague? Vagueness may be explained and rationalized if it can be shown that vague language is more useful to speaker and hearer than precise language. In a well-known paper, Lipman proposes a game-theoretic account of vaguen…
Experimentally assessing the symmetry of presupposition filtering across disjunction
Existential and universal readings of pronouns across binary connectives: an experimental investigation
It’s not about 'about' – comparatives, negation and intervals
Solt (2014, 2018) discovered an intriguing pattern regarding the distribution of the approximator 'about'. While 'about n' is typically infelicitous under negation, this pattern is reversed with 'more than about n', which is fine under neg…
Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture
Machine learning models are increasingly being scaled in both sequence length and model dimension to reach longer contexts and better performance. However, existing architectures such as Transformers scale quadratically along both these ax…
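The structured, GEMM-friendly building block the architecture's name refers to replaces a dense n x n matrix with block-diagonal factors interleaved with a permutation, dropping the cost of a matrix-vector product below quadratic. Below is a simplified NumPy sketch of that pattern, assuming n = b * b and using a reshape-transpose as the permutation; it illustrates the flavour of the factorization, not the exact operator used in the paper.

import numpy as np

def monarch_like_matvec(x, L, R):
    # x: (b*b,); L, R: (b, b, b) stacks of b blocks, each b x b.
    # Two block-diagonal multiplies plus transposes: O(n^1.5) work instead of O(n^2).
    b = L.shape[0]
    y = x.reshape(b, b)
    y = np.einsum('bij,bj->bi', R, y)   # block-diagonal factor R
    y = y.T                             # permutation (reshape-transpose)
    y = np.einsum('bij,bj->bi', L, y)   # block-diagonal factor L
    y = y.T                             # undo the permutation
    return y.reshape(-1)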
Accelerating LLM Inference with Staged Speculative Decoding
Recent advances with large language models (LLM) illustrate their diverse capabilities. We propose a novel algorithm, staged speculative decoding, to accelerate LLM inference in small-batch, on-device scenarios. We address the low arithmet…
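The snippet is cut off before the algorithm, but speculative decoding in its basic, unstaged form is standard: a cheap draft model proposes several tokens, and the large target model checks them, keeping the longest agreeing prefix. A greedy-verification sketch of that standard scheme follows; the staged variant's specifics, and the draft/target callables, are assumptions here.

def speculative_decode(target, draft, prompt, k=4, max_new=64):
    # Greedy speculative decoding sketch.
    # draft(tokens) and target(tokens) each return the next-token id their
    # model predicts after `tokens` (assumed interfaces).
    tokens = list(prompt)
    while len(tokens) < len(prompt) + max_new:
        # 1) Draft model cheaply proposes k tokens.
        proposal = []
        for _ in range(k):
            proposal.append(draft(tokens + proposal))
        # 2) Target model verifies; in a real system this is one batched
        #    forward pass over all k positions rather than a Python loop.
        accepted = []
        for i in range(k):
            t = target(tokens + accepted)
            accepted.append(t)
            if proposal[i] != t:
                break               # first mismatch: keep target's token, stop
        tokens.extend(accepted)
    return tokens[:len(prompt) + max_new]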
Exhaustivity and anti-exhaustivity in the RSA framework: Testing the effect of prior beliefs
During communication, the interpretation of utterances is sensitive to a listener's probabilistic prior beliefs, something which is captured by one currently influential model of pragmatics, the Rational Speech Act (RSA) framework. In this…
Bounding the Last Mile: Efficient Learned String Indexing
We introduce the RadixStringSpline (RSS) learned index structure for efficiently indexing strings. RSS is a tree of radix splines each indexing a fixed number of bytes. RSS approaches or exceeds the performance of traditional string indexe…
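A heavily simplified sketch of the general idea stated in the snippet: route a query by a fixed-length byte prefix (the radix step), then within each prefix group use a tiny linear estimate over the next byte to guess the key's position and only search a bounded neighbourhood around that guess. This illustrates the radix-plus-spline flavour only; it is not the RSS data structure itself, and the class name, one-byte prefix, and window handling are assumptions. Keys are assumed to be sorted bytes objects.

import bisect

class ToyPrefixIndex:
    # Toy learned string index: group sorted byte-string keys by a 1-byte
    # prefix and, inside each group, interpolate a position from the second
    # byte (a degenerate one-segment "spline"), then binary-search a small
    # window around the estimate.

    def __init__(self, sorted_keys, window=8):
        self.keys = sorted_keys
        self.window = window
        self.groups = {}                      # prefix byte -> (start, end)
        for i, k in enumerate(sorted_keys):
            p = k[:1]
            s, _ = self.groups.get(p, (i, i))
            self.groups[p] = (s, i + 1)

    def lookup(self, key):
        grp = self.groups.get(key[:1])
        if grp is None:
            return None
        start, end = grp
        # Linear position estimate from the second byte of the key.
        frac = (key[1] if len(key) > 1 else 0) / 256
        guess = start + int(frac * (end - start))
        lo = max(start, guess - self.window)
        hi = min(end, guess + self.window)
        i = bisect.bisect_left(self.keys, key, lo, hi)
        if i < hi and self.keys[i] == key:
            return i
        # Estimate missed the bounded window: fall back to the whole group.
        i = bisect.bisect_left(self.keys, key, start, end)
        return i if i < end and self.keys[i] == key else None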
Explaining gaps in the logical lexicon of natural languages: A decision-theoretic perspective on the square of Aristotle
Modified Numerals
Modified numerals are expressions such as more than three, fewer than three, at least three, at most three, up to ten, between three and ten, approximately ten, about ten, exactly ten, and so forth. At first sight, their semantic …
Interpreting plural predication: homogeneity and non-maximality
On the Optimality of Vagueness: "Around", "Between", and the Gricean Maxims
Why is ordinary language vague? We argue that in contexts in which a cooperative speaker is not perfectly informed about the world, the use of vague expressions can offer an optimal tradeoff between truthfulness (Gricean Quality) and infor…
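A toy numerical illustration of the claimed tradeoff; this is not the paper's model, and the candidate utterances, the speaker's uniform belief, and the scoring are assumptions made up for the example. An imperfectly informed speaker who thinks the true value is 19, 20, or 21 risks falsity with a precise report, conveys almost nothing with a very wide interval, and does best with "around 20".

from math import log2

belief = {19: 1/3, 20: 1/3, 21: 1/3}          # speaker's credence over the true value
utterances = {
    "exactly 20":        {20},
    "around 20":         {18, 19, 20, 21, 22},
    "between 10 and 30": set(range(10, 31)),
}

for u, extension in utterances.items():
    # Gricean Quality: probability the utterance is true given the speaker's belief.
    p_true = sum(p for v, p in belief.items() if v in extension)
    # Informativeness: bits gained over an assumed uniform hearer prior on 10..30.
    informativity = log2(21) - log2(len(extension))
    print(f"{u:20s}  P(true) = {p_true:.2f}   informativity = {informativity:.2f} bits")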
An argument for the trivalent approach to presupposition projection
Preventing Adversarial Use of Datasets through Fair Core-Set Construction
We propose improving the privacy properties of a dataset by publishing only a strategically chosen "core-set" of the data containing a subset of the instances. The core-set allows strong performance on primary tasks, but forces poor perfor…
Distinctions between primary and secondary scalar implicatures
The Role of Prior Beliefs in The Rational Speech Act Model of Pragmatics: Exhaustivity as a Case Study
This paper examines the interaction between prior beliefs and pragmatic inferences, focusing on exhaustivity effects. We present three experiments that test how prior beliefs influence both interpretation and production of language, and comp…
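Since the abstract is cut off, here is a minimal sketch of the standard RSA recursion the paper builds on (vanilla RSA, not the paper's experiments or extensions): a literal listener conditions the prior on an utterance's literal truth, a pragmatic speaker soft-maximizes informativity, and a pragmatic listener inverts the speaker. With a flat prior this derives the exhaustive "some but not all" reading of "some"; skewing the prior toward the "all" world weakens it in the model.

def normalize(d):
    z = sum(d.values())
    return {k: v / z for k, v in d.items()}

# Worlds: how many of the relevant objects have the property.
worlds = ["none", "some-not-all", "all"]
# Utterances and their literal truth conditions.
literal = {
    "some": {"none": 0, "some-not-all": 1, "all": 1},
    "all":  {"none": 0, "some-not-all": 0, "all": 1},
    "none": {"none": 1, "some-not-all": 0, "all": 0},
}

def rsa_listener(prior, alpha=4.0):
    # Literal listener: L0(w | u) proportional to prior(w) * [[u]](w)
    L0 = {u: normalize({w: prior[w] * literal[u][w] for w in worlds}) for u in literal}
    # Pragmatic speaker: S1(u | w) proportional to L0(w | u) ** alpha
    S1 = {w: normalize({u: L0[u][w] ** alpha for u in literal}) for w in worlds}
    # Pragmatic listener: L1(w | u) proportional to prior(w) * S1(u | w)
    return {u: normalize({w: prior[w] * S1[w][u] for w in worlds}) for u in literal}

flat   = {"none": 1/3, "some-not-all": 1/3, "all": 1/3}
skewed = {"none": 0.05, "some-not-all": 0.05, "all": 0.9}    # prior favouring "all"

print(rsa_listener(flat)["some"])     # mass shifts to "some-not-all": exhaustive reading
print(rsa_listener(skewed)["some"])   # a strong prior pulls interpretation back toward "all"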
Revealing abstract semantic mechanisms through priming: The distributive/collective contrast
Economy and embedded exhaustification
Sample-Efficient Reinforcement Learning through Transfer and Architectural Priors
Recent work in deep reinforcement learning has allowed algorithms to learn complex tasks such as Atari 2600 games just from the reward provided by the game, but these algorithms presently require millions of training steps in order to lear…
Unexpected Wide‐Scope Phenomena
It has long been known that quantificational expressions in natural language do not all have the same scope properties. While the scope of some expressions is closely related to their observable, “surface” position in syntactic structure, …
The Design and Implementation of Modern Online Programming Competitions
This paper presents a framework for the implementation of online programming competitions, including a set of principles for the design of the multiplayer game and a practical framework for the construction of the competition environment. …
Asymmetric inference towards the antonym: Experiments into the polarity and morphology of negated adjectives
In this paper, we investigate the interpretation of negated antonyms. A sentence such as Peter is not tall can be understood as meaning either that Peter is not tall tout court or that Peter is rather short (inference towards the antonym; …