Exploring foci of:
arXiv (Cornell University)
Efficient and Asymptotically Unbiased Constrained Decoding for Large Language Models
April 2025 • Haotian Ye, Himanshu Jain, Chong You, Ananda Theertha Suresh, Haowei Lin, James Zou, Felix Yu
In real-world applications of large language models, outputs are often required to be confined: selecting items from predefined product or document sets, generating phrases that comply with safety standards, or conforming to specialized formatting styles. To control the generation, constrained decoding has been widely adopted. However, existing prefix-tree-based constrained decoding is inefficient under GPU-based model inference paradigms, and it introduces unintended biases into the output distribution. This pape…
Main Page
Weapons (2025 Film)
Zohran Mamdani
Squid Game Season 3
Victoria Mboko
Lauren Sánchez
Jeff Bezos
Shefali Jariwala
F1 (Film)
Takahiro Shiraishi
The 1975
Matty Healy
Mira Nair
Kannappa (Film)
Squid Game
Truth And Reconciliation Commission Of Canada
Alanis Morissette
2025 Nba Draft
28 Years Later
Mahmood Mamdani
Reich Ministry Of Public Enlightenment And Propaganda
Rick Hurst
Fuck
Mutual Aid
Degenerate Art Exhibition
Kpop Demon Hunters
Anna Wintour