Saibo Geng
TRPrompt: Bootstrapping Query-Aware Prompt Optimization from Textual Rewards
Prompt optimization improves the reasoning abilities of large language models (LLMs) without requiring parameter updates to the target model. Following heuristic-based "Think step by step" approaches, the field has evolved in two main dire…
zip2zip: Inference-Time Adaptive Tokenization via Online Compression
Tokenization efficiency plays a critical role in the performance and cost of large language models (LLMs), yet most models rely on static tokenizers optimized on general-purpose corpora. These tokenizers' fixed vocabularies often fail to a…
JSONSchemaBench: A Rigorous Benchmark of Structured Outputs for Language Models
Reliably generating structured outputs has become a critical capability for modern language model (LM) applications. Constrained decoding has emerged as the dominant technology across sectors for enforcing structured outputs during generat…
Byte BPE Tokenization as an Inverse string Homomorphism
Tokenization is an important preprocessing step in the training and inference of large language models (LLMs). While there has been extensive research on the expressive power of the neural architectures used in LLMs, the impact of tokenizat…
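The abstract's framing can be illustrated with a toy sketch (the vocabulary below is invented for illustration, not taken from the paper): detokenization maps each token to a fixed string and token concatenation to string concatenation, i.e. it is a string homomorphism, while tokenization picks one inverse image of a string under that map.

```python
# Toy BPE-style vocabulary (hypothetical): token id -> string.
VOCAB = {0: "a", 1: "b", 2: "ab"}

def decode(tokens):
    """Detokenization: a homomorphism from token sequences to strings."""
    return "".join(VOCAB[t] for t in tokens)

def encode(text):
    """Greedy longest-match tokenizer: one inverse image under decode."""
    by_length = sorted(VOCAB.items(), key=lambda kv: -len(kv[1]))
    tokens, i = [], 0
    while i < len(text):
        for tid, s in by_length:
            if text.startswith(s, i):
                tokens.append(tid)
                i += len(s)
                break
        else:
            raise ValueError(f"no token matches at position {i}")
    return tokens

# decode is a homomorphism: decode(u + v) == decode(u) + decode(v)
assert decode([2, 1] + [0]) == decode([2, 1]) + decode([0])
# encode is a right inverse of decode: round-tripping recovers the string
assert decode(encode("abab")) == "abab"
```

Note that `encode` is only one of several token sequences mapping to "abab" under `decode` (e.g. `[0, 1, 0, 1]` also works), which is exactly why tokenization is an *inverse* of a homomorphism rather than a homomorphism itself.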
Sketch-Guided Constrained Decoding for Boosting Blackbox Large Language Models without Logit Access
Constrained decoding, a technique for enforcing constraints on language model outputs, offers a way to control text generation without retraining or architectural modifications. Its application is, however, typically restricted to models t…
Flows: Building Blocks of Reasoning and Collaborating AI
Recent advances in artificial intelligence (AI) have produced highly capable and controllable systems. This creates unprecedented opportunities for structured reasoning as well as collaboration among multiple AI systems and humans. To full…
Grammar-Constrained Decoding for Structured NLP Tasks without Finetuning
Despite their impressive performance, large language models (LMs) still struggle with reliably generating complex output structures when not finetuned to follow the required output format exactly. To address this issue, grammar-constrained…
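The core mechanism can be sketched with a toy example (the DFA, vocabulary, and scores below are invented for illustration; this is not the paper's implementation): at each decoding step, tokens the grammar forbids are masked out before taking the argmax, so the output is well-formed by construction.

```python
# Toy grammar-constrained decoding over a tiny vocabulary (hypothetical).
VOCAB = ["a", "b", "=", "1", "2", ";", "<eos>"]

# DFA for strings of key=value pairs like "a=2;": (state, token) -> state.
TRANSITIONS = {
    (0, "a"): 1, (0, "b"): 1,   # expect a key
    (1, "="): 2,                # expect '='
    (2, "1"): 3, (2, "2"): 3,   # expect a value
    (3, ";"): 0,                # expect ';', then another pair or <eos>
}

def legal_tokens(state):
    """Tokens the grammar allows from `state` (the decoding-time mask)."""
    toks = [t for t in VOCAB if (state, t) in TRANSITIONS]
    if state == 0:
        toks.append("<eos>")  # a complete string may end here
    return toks

def constrained_decode(logits_per_step):
    """Greedy decoding restricted to grammar-legal tokens."""
    state, out = 0, ""
    for logits in logits_per_step:
        tok = max(legal_tokens(state),
                  key=lambda t: logits.get(t, float("-inf")))
        if tok == "<eos>":
            break
        out += tok
        state = TRANSITIONS[(state, tok)]
    return out

# The "model" prefers the ill-formed "a1"; the mask forces "a=2;" instead.
steps = [{"a": 3.0}, {"1": 3.0, "=": 1.0},
         {"2": 3.0}, {";": 3.0}, {"<eos>": 3.0}]
print(constrained_decode(steps))  # a=2;
```

Practical implementations apply the same idea at the logit level, e.g. by adding negative infinity to the scores of grammar-illegal tokens before sampling.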
Legal Transformer Models May Not Always Help
Deep learning-based Natural Language Processing methods, especially transformers, have achieved impressive performance in the last few years. Applying those state-of-the-art NLP methods to legal activities to automate or simplify some simp…
An Enhanced MeanSum Method For Generating Hotel Multi-Review Summarizations
Multi-document summarization is the process of taking multiple texts as input and producing a short summary based on their content. Until recently, multi-document summarizers were mostly supervised and extractive. However, …