Giovanni Monea
Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers
A central question in multilingual language modeling is whether large language models (LLMs) develop a universal concept representation, disentangled from specific languages. In this paper, we address this question by analyzing latent representations…
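To make the method named in the title concrete, here is a minimal sketch of activation patching using PyTorch forward hooks. The toy model, the choice of layer, and the random inputs are illustrative assumptions, not the paper's actual multilingual setup.

```python
# A minimal sketch of activation patching with PyTorch forward hooks.
# The toy model, layer choice, and random inputs are illustrative
# assumptions, not the paper's experimental setup.
import torch
import torch.nn as nn

torch.manual_seed(0)

class ToyBlock(nn.Module):
    """Stand-in for one transformer layer."""
    def __init__(self, d):
        super().__init__()
        self.linear = nn.Linear(d, d)

    def forward(self, x):
        return torch.relu(self.linear(x))

class ToyModel(nn.Module):
    def __init__(self, d=8, n_layers=4):
        super().__init__()
        self.blocks = nn.ModuleList([ToyBlock(d) for _ in range(n_layers)])
        self.head = nn.Linear(d, d)

    def forward(self, x):
        for block in self.blocks:
            x = block(x)
        return self.head(x)

model = ToyModel()
source_input = torch.randn(1, 8)  # e.g. a prompt in language A
target_input = torch.randn(1, 8)  # e.g. the same concept in language B

cached = {}
layer = model.blocks[2]  # hypothetical layer of interest

def cache_hook(module, inputs, output):
    # Record the layer's activation on the source run
    # (returning None leaves the forward pass unchanged).
    cached["act"] = output.detach()

handle = layer.register_forward_hook(cache_hook)
model(source_input)
handle.remove()

def patch_hook(module, inputs, output):
    # Returning a tensor from a forward hook replaces the layer's output,
    # so the target run continues from the source activation.
    return cached["act"]

handle = layer.register_forward_hook(patch_hook)
patched_out = model(target_input)
handle.remove()

clean_out = model(target_input)
print("Effect of the patch:", (patched_out - clean_out).norm().item())
```

Comparing the patched and clean outputs on the target run is the basic readout: if swapping in the source-language activation steers the target-language run, the patched layer carries information shared across the two inputs.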
Do Llamas Work in English? On the Latent Language of Multilingual Transformers
We ask whether multilingual language models trained on unbalanced, English-dominated corpora use English as an internal pivot language -- a question of key importance for understanding how language models function and the origins of linguistic…
PaSS: Parallel Speculative Sampling
Scaling the size of language models to tens of billions of parameters has led to impressive performance on a wide range of tasks. At generation, these models are used auto-regressively, requiring a forward pass for each generated token, and…
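The per-token forward pass described here is the cost that speculative methods attack. Below is a generic sketch of the draft-then-verify idea with a toy scoring function; note that PaSS itself drafts with the same model in parallel rather than a separate drafter, so this only illustrates the general scheme the abstract motivates. The `toy_logits` stand-in and the greedy acceptance rule are assumptions for the demo.

```python
# A generic sketch of draft-then-verify speculative decoding with a toy
# scoring function. This is NOT the PaSS algorithm itself; it illustrates
# the baseline autoregressive loop and the generic speculative step.
import torch

torch.manual_seed(0)
VOCAB = 100

def toy_logits(tokens: list[int]) -> torch.Tensor:
    """Stand-in for a full-model forward pass: logits at every position."""
    return torch.randn(len(tokens), VOCAB)

def autoregressive(prompt: list[int], n_new: int) -> list[int]:
    # Baseline: one forward pass of the large model per generated token.
    tokens = list(prompt)
    for _ in range(n_new):
        logits = toy_logits(tokens)
        tokens.append(int(logits[-1].argmax()))
    return tokens

def speculative_step(tokens: list[int], draft_len: int = 4) -> list[int]:
    # 1) Cheaply draft draft_len candidate tokens (here: random).
    draft = torch.randint(0, VOCAB, (draft_len,)).tolist()
    # 2) A single forward pass scores all drafted positions at once.
    logits = toy_logits(tokens + draft)
    # 3) Accept the longest draft prefix the full model agrees with;
    #    on the first mismatch, keep the model's own prediction instead.
    accepted = []
    for i, d in enumerate(draft):
        predicted = int(logits[len(tokens) - 1 + i].argmax())
        if predicted != d:
            accepted.append(predicted)
            break
        accepted.append(d)
    return tokens + accepted

print(autoregressive([1, 2, 3], n_new=5))
print(speculative_step([1, 2, 3]))
```

The payoff is that one verification pass can accept several tokens at once, so the number of full-model forward passes per generated token drops below one whenever drafts are frequently accepted.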