Ann Yuan
YOU?
Author Swipe
View article: Beyond the Rosetta Stone: Unification Forces in Generalization Dynamics
Beyond the Rosetta Stone: Unification Forces in Generalization Dynamics Open
Large language models (LLMs) struggle with cross-lingual knowledge transfer: they hallucinate when asked in one language about facts expressed in a different language during training. This work introduces a controlled setting to study the …
View article: Just Say No to Single Embeddings: Why Your AI Needs Multiple Perspectives
Just Say No to Single Embeddings: Why Your AI Needs Multiple Perspectives Open
Note: This is a work in progress document This exploratory work analyzes 229 multi-agent AI dialogues byprojecting them into five different embedding spaces (transformer-based and classical) and measuring geometric properties. We find astr…
View article: LaMPost: AI Writing Assistance for Adults with Dyslexia Using Large Language Models
LaMPost: AI Writing Assistance for Adults with Dyslexia Using Large Language Models Open
The natural language capabilities demonstrated by large language models (LLMs) highlight an opportunity for new writing support tools that address the varied needs of people with dyslexia. We present LaMPost, a prototype email editor that …
View article: Who's asking? User personas and the mechanics of latent misalignment
Who's asking? User personas and the mechanics of latent misalignment Open
Despite investments in improving model safety, studies show that misaligned capabilities remain latent in safety-tuned models. In this work, we shed light on the mechanics of this phenomenon. First, we show that even when model generations…
View article: ConstitutionalExperts: Training a Mixture of Principle-based Prompts
ConstitutionalExperts: Training a Mixture of Principle-based Prompts Open
Large language models (LLMs) are highly capable at a variety of tasks given the right prompt, but writing one is still a difficult and tedious process. In this work, we introduce ConstitutionalExperts, a method for learning a prompt consis…
View article: Towards Agile Text Classifiers for Everyone
Towards Agile Text Classifiers for Everyone Open
Text-based safety classifiers are widely used for content moderation and increasingly to tune generative language model behavior - a topic of growing concern for the safety of digital assistants and chatbots. However, different policies re…
View article: Gradient-Based Automated Iterative Recovery for Parameter-Efficient Tuning
Gradient-Based Automated Iterative Recovery for Parameter-Efficient Tuning Open
Pretrained large language models (LLMs) are able to solve a wide variety of tasks through transfer learning. Various explainability methods have been developed to investigate their decision making process. TracIn (Pruthi et al., 2020) is o…
View article: Towards Agile Text Classifiers for Everyone
Towards Agile Text Classifiers for Everyone Open
Text-based safety classifiers are widely used for content moderation and increasingly to tune generative language model behavior - a topic of growing concern for the safety of digital assistants and chatbots. However, different policies re…
View article: Creative Writing with an AI-Powered Writing Assistant: Perspectives from Professional Writers
Creative Writing with an AI-Powered Writing Assistant: Perspectives from Professional Writers Open
Recent developments in natural language generation (NLG) using neural language models have brought us closer than ever to the goal of building AI-powered creative writing tools. However, most prior work on human-AI collaboration in the cre…
View article: LaMPost: Design and Evaluation of an AI-assisted Email Writing Prototype for Adults with Dyslexia
LaMPost: Design and Evaluation of an AI-assisted Email Writing Prototype for Adults with Dyslexia Open
Prior work has explored the writing challenges experienced by people with\ndyslexia, and the potential for new spelling, grammar, and word retrieval\ntechnologies to address these challenges. However, the capabilities for natural\nlanguage…
View article: The Case for a Single Model that can Both Generate Continuations and Fill in the Blank
The Case for a Single Model that can Both Generate Continuations and Fill in the Blank Open
The task of inserting text into a specified position in a passage, known as fill in the blank (FitB), is useful for a variety of applications where writers interact with a natural language generation (NLG) system to craft text. While previ…
View article: Perspective-Taking to Reduce Affective Polarization on Social Media
Perspective-Taking to Reduce Affective Polarization on Social Media Open
The intensification of affective polarization worldwide has raised new questions about how social media platforms might be further fracturing an already-divided public sphere. As opposed to ideological polarization, affective polarization …
View article: Wordcraft: Story Writing With Large Language Models
Wordcraft: Story Writing With Large Language Models Open
The latest generation of large neural language models such as GPT-3 have achieved new levels of performance on benchmarks for language understanding and generation. These models have even demonstrated an ability to perform arbitrary tasks …
View article: A Recipe for Arbitrary Text Style Transfer with Large Language Models
A Recipe for Arbitrary Text Style Transfer with Large Language Models Open
Emily Reif, Daphne Ippolito, Ann Yuan, Andy Coenen, Chris Callison-Burch, Jason Wei. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 2022.
View article: The Case for a Single Model that can Both Generate Continuations and Fill-in-the-Blank
The Case for a Single Model that can Both Generate Continuations and Fill-in-the-Blank Open
The task of inserting text into a specified position in a passage, known as fill in the blank (FitB), is useful for a variety of applications where writers interact with a natural language generation (NLG) system to craft text. While previ…
View article: SynthBio: A Case Study in Human-AI Collaborative Curation of Text Datasets
SynthBio: A Case Study in Human-AI Collaborative Curation of Text Datasets Open
NLP researchers need more, higher-quality text datasets. Human-labeled datasets are expensive to collect, while datasets collected via automatic retrieval from the web such as WikiBio are noisy and can include undesired biases. Moreover, d…
View article: Perspective-taking to Reduce Affective Polarization on Social Media
Perspective-taking to Reduce Affective Polarization on Social Media Open
The intensification of affective polarization worldwide has raised new questions about how social media platforms might be further fracturing an already-divided public sphere. As opposed to ideological polarization, affective polarization …
View article: A Recipe For Arbitrary Text Style Transfer with Large Language Models
A Recipe For Arbitrary Text Style Transfer with Large Language Models Open
In this paper, we leverage large language models (LMs) to perform zero-shot text style transfer. We present a prompting method that we call augmented zero-shot learning, which frames style transfer as a sentence rewriting task and requires…
View article: Wordcraft: a Human-AI Collaborative Editor for Story Writing
Wordcraft: a Human-AI Collaborative Editor for Story Writing Open
As neural language models grow in effectiveness, they are increasingly being applied in real-world settings. However these applications tend to be limited in the modes of interaction they support. In this extended abstract, we propose Word…
View article: An Interpretability Illusion for BERT
An Interpretability Illusion for BERT Open
We describe an "interpretability illusion" that arises when analyzing the BERT model. Activations of individual neurons in the network may spuriously appear to encode a single, simple concept, when in fact they are encoding something far m…
View article: The Language Interpretability Tool: Extensible, Interactive Visualizations and Analysis for NLP Models
The Language Interpretability Tool: Extensible, Interactive Visualizations and Analysis for NLP Models Open
We present the Language Interpretability Tool (LIT), an open-source platform for visualization and understanding of NLP models. We focus on core questions about model behavior: Why did my model make this prediction? When does it perform po…
View article: The Language Interpretability Tool: Extensible, Interactive Visualizations and Analysis for NLP Models
The Language Interpretability Tool: Extensible, Interactive Visualizations and Analysis for NLP Models Open
Ian Tenney, James Wexler, Jasmijn Bastings, Tolga Bolukbasi, Andy Coenen, Sebastian Gehrmann, Ellen Jiang, Mahima Pushkarna, Carey Radebaugh, Emily Reif, Ann Yuan. Proceedings of the 2020 Conference on Empirical Methods in Natural Language…
View article: TensorFlow.js: Machine Learning for the Web and Beyond
TensorFlow.js: Machine Learning for the Web and Beyond Open
TensorFlow.js is a library for building and executing machine learning algorithms in JavaScript. TensorFlow.js models run in a web browser and in the Node.js environment. The library is part of the TensorFlow ecosystem, providing a set of …
View article: TensorFlow.js: Machine Learning for the Web and Beyond
TensorFlow.js: Machine Learning for the Web and Beyond Open
TensorFlow.js is a library for building and executing machine learning algorithms in JavaScript. TensorFlow.js models run in a web browser and in the Node.js environment. The library is part of the TensorFlow ecosystem, providing a set of …
View article: Me, My Echo Chamber, and I: Introspection on Social Media Polarization
Me, My Echo Chamber, and I: Introspection on Social Media Polarization Open
Homophily -- our tendency to surround ourselves with others who share our perspectives and opinions about the world -- is both a part of human nature and an organizing principle underpinning many of our digital social networks. However, wh…
View article: Me, My Echo Chamber, and I
Me, My Echo Chamber, and I Open
Homophily - our tendency to surround ourselves with others who share our perspectives and opinions about the world - is both a part of human nature and an organizing principle underpinning many of our digital social networks. However, when…
View article: Mapping Twitter Conversation Landscapes
Mapping Twitter Conversation Landscapes Open
While the most ambitious polls are based on standardized interviews with a few thousand people, millions are tweeting freely and publicly in their own voices about issues they care about. This data offers a vibrant 24/7 snapshot of people'…
View article: TweetVista: An AI-Powered Interactive Tool for Exploring Conversations on Twitter
TweetVista: An AI-Powered Interactive Tool for Exploring Conversations on Twitter Open
We present TweetVista, an interactive web-based tool for mapping the conversation landscapes on Twitter. TweetVista is an intelligent and interactive desktop web application for exploring the conversation landscapes on Twitter. Given a dat…