Benjamin Spector

Explaining vague language

Why is language vague? Vagueness may be explained and rationalized if it can be shown that vague language is more useful to speaker and hearer than precise language. In a well-known paper, Lipman proposes a game-theoretic account of…

Computation and Language · Computer Science 2025-08-01 Paul Égré , Benjamin Spector

LoLCATs: On Low-Rank Linearizing of Large Language Models

Recent works show we can linearize large language models (LLMs) -- swapping the quadratic attentions of popular Transformer-based LLMs with subquadratic analogs, such as linear attention -- avoiding the expensive pretraining costs. However,…

Machine Learning · Computer Science 2025-03-07 Michael Zhang , Simran Arora , Rahul Chalamala , Alan Wu , Benjamin Spector , Aaryan Singhal , Krithik Ramesh , Christopher Ré

Just read twice: closing the recall gap for recurrent language models

Recurrent large language models that compete with Transformers in language modeling perplexity are emerging at a rapid rate (e.g., Mamba, RWKV). Excitingly, these architectures use a constant amount of memory during inference. However, due…

Computation and Language · Computer Science 2024-07-09 Simran Arora , Aman Timalsina , Aaryan Singhal , Benjamin Spector , Sabri Eyuboglu , Xinyi Zhao , Ashish Rao , Atri Rudra , Christopher Ré

Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture

Machine learning models are increasingly being scaled in both sequence length and model dimension to reach longer contexts and better performance. However, existing architectures such as Transformers scale quadratically along both these…

Machine Learning · Computer Science 2023-10-19 Daniel Y. Fu , Simran Arora , Jessica Grogan , Isys Johnson , Sabri Eyuboglu , Armin W. Thomas , Benjamin Spector , Michael Poli , Atri Rudra , Christopher Ré

Accelerating LLM Inference with Staged Speculative Decoding

Recent advances with large language models (LLM) illustrate their diverse capabilities. We propose a novel algorithm, staged speculative decoding, to accelerate LLM inference in small-batch, on-device scenarios. We address the low…

Artificial Intelligence · Computer Science 2023-08-10 Benjamin Spector , Chris Re

On the Optimality of Vagueness: "Around", "Between", and the Gricean Maxims

Why is ordinary language vague? We argue that in contexts in which a cooperative speaker is not perfectly informed about the world, the use of vague expressions can offer an optimal tradeoff between truthfulness (Gricean Quality) and…

Computation and Language · Computer Science 2022-09-02 Paul Egré , Benjamin Spector , Adèle Mortier , Steven Verheyen

Exhaustivity and anti-exhaustivity in the RSA framework: Testing the effect of prior beliefs

During communication, the interpretation of utterances is sensitive to a listener's probabilistic prior beliefs, something which is captured by one currently influential model of pragmatics, the Rational Speech Act (RSA) framework. In this…

Computation and Language · Computer Science 2022-02-16 Alexandre Cremers , Ethan G. Wilcox , Benjamin Spector

Bounding the Last Mile: Efficient Learned String Indexing

We introduce the RadixStringSpline (RSS) learned index structure for efficiently indexing strings. RSS is a tree of radix splines each indexing a fixed number of bytes. RSS approaches or exceeds the performance of traditional string indexes…

Databases · Computer Science 2021-12-01 Benjamin Spector , Andreas Kipf , Kapil Vaidya , Chi Wang , Umar Farooq Minhas , Tim Kraska

Preventing Adversarial Use of Datasets through Fair Core-Set Construction

We propose improving the privacy properties of a dataset by publishing only a strategically chosen "core-set" of the data containing a subset of the instances. The core-set allows strong performance on primary tasks, but forces poor…

Machine Learning · Computer Science 2019-10-25 Benjamin Spector , Ravi Kumar , Andrew Tomkins

Sample-Efficient Reinforcement Learning through Transfer and Architectural Priors

Recent work in deep reinforcement learning has allowed algorithms to learn complex tasks such as Atari 2600 games just from the reward provided by the game, but these algorithms presently require millions of training steps in order to…

Machine Learning · Computer Science 2018-01-09 Benjamin Spector , Serge Belongie

The Design and Implementation of Modern Online Programming Competitions

This paper presents a framework for the implementation of online programming competitions, including a set of principles for the design of the multiplayer game and a practical framework for the construction of the competition environment.…

Computers and Society · Computer Science 2017-10-24 Benjamin Spector , Michael Truell