Ram Pasunuru
Byte Latent Transformer: Patches Scale Better Than Tokens
We introduce the Byte Latent Transformer (BLT), a new byte-level LLM architecture that, for the first time, matches tokenization-based LLM performance at scale with significant improvements in inference efficiency and robustness. BLT encod…
Characterizing and Efficiently Accelerating Multimodal Generation Model Inference
Generative artificial intelligence (AI) technology is revolutionizing the computing industry. Not only have its applications broadened to various sectors, but it also poses new system design and optimization opportunities. The technology is ca…
The ART of LLM Refinement: Ask, Refine, and Trust
In recent years, Large Language Models (LLMs) have demonstrated remarkable generative abilities, but can they judge the quality of their own generations? A popular concept, referred to as self-refinement, postulates that LLMs can detect an…
Augmented Language Models: a Survey
This survey reviews works in which language models (LMs) are augmented with reasoning skills and the ability to use tools. The former is defined as decomposing a potentially complex task into simpler subtasks, while the latter consists in c…