Liam Dugan
YOU?
Author Swipe
View article: Machine Text Detectors are Membership Inference Attacks
Machine Text Detectors are Membership Inference Attacks Open
Although membership inference attacks (MIAs) and machine-generated text detection target different goals, identifying training samples and synthetic texts, their methods often exploit similar signals based on a language model's probability…
View article: Controlling Difficulty of Generated Text for AI-Assisted Language Learning
Controlling Difficulty of Generated Text for AI-Assisted Language Learning Open
Practicing conversations with large language models (LLMs) presents a promising alternative to traditional in-person language learning. However, most LLMs generate text at a near-native level of complexity, making them ill-suited for begin…
View article: Domain Gating Ensemble Networks for AI-Generated Text Detection
Domain Gating Ensemble Networks for AI-Generated Text Detection Open
As state-of-the-art language models continue to improve, the need for robust detection of machine-generated text becomes increasingly critical. However, current state-of-the-art machine text detectors struggle to adapt to new unseen domain…
View article: Group-Adaptive Threshold Optimization for Robust AI-Generated Text Detection
Group-Adaptive Threshold Optimization for Robust AI-Generated Text Detection Open
The advancement of large language models (LLMs) has made it difficult to differentiate human-written text from AI-generated text. Several AI-text detectors have been developed in response, which typically utilize a fixed global threshold (…
View article: GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge
GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge Open
Recently there have been many shared tasks targeting the detection of generated text from Large Language Models (LLMs). However, these shared tasks tend to focus either on cases where text is limited to one particular domain or cases where…
View article: MiRAGeNews: Multimodal Realistic AI-Generated News Detection
MiRAGeNews: Multimodal Realistic AI-Generated News Detection Open
The proliferation of inflammatory or misleading "fake" news content has become increasingly common in recent years. Simultaneously, it has become easier than ever to use AI tools to generate photorealistic images depicting any scene imagin…
View article: ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems
ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems Open
Recently, there has been increasing interest in using Large Language Models (LLMs) to construct complex multi-agent systems to perform tasks such as compiling literature reviews, drafting consumer reports, and planning vacations. Many tool…
View article: RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors Open
Many commercial and open-source models claim to detect machine-generated text with extremely high accuracy (99% or more). However, very few of these detectors are evaluated on shared benchmark datasets and even when they are, the datasets …
View article: FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models
FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models Open
One type of question that is commonly found in day-to-day scenarios is ``fan-out'' questions, complex multi-hop, multi-document reasoning questions that require finding information about a large number of entities. However, there exist few…
View article: Interpretable-by-Design Text Understanding with Iteratively Generated Concept Bottleneck
Interpretable-by-Design Text Understanding with Iteratively Generated Concept Bottleneck Open
Black-box deep neural networks excel in text classification, yet their application in high-stakes domains is hindered by their lack of interpretability. To address this, we propose Text Bottleneck Models (TBM), an intrinsically interpretab…
View article: Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications
Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications Open
Language model applications are becoming increasingly popular and complex, often including features like tool usage and retrieval augmentation. However, existing frameworks for such applications are often opinionated, deciding for develope…
View article: Real or Fake Text?: Investigating Human Ability to Detect Boundaries between Human-Written and Machine-Generated Text
Real or Fake Text?: Investigating Human Ability to Detect Boundaries between Human-Written and Machine-Generated Text Open
As text generated by large language models proliferates, it becomes vital to understand how humans engage with such text, and whether or not they are able to detect when the text they are reading did not originate with a human writer. Prio…
View article: Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models
Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models Open
Recent work in speech-to-speech translation (S2ST) has focused primarily on offline settings, where the full input utterance is available before any output is given. This, however, is not reasonable in many real-world scenarios. In latency…
View article: Exploring the Curious Case of Code Prompts
Exploring the Curious Case of Code Prompts Open
Recent work has shown that prompting language models with code-like representations of natural language leads to performance improvements on structured reasoning tasks. However, such tasks comprise only a small subset of all natural langua…
View article: Exploring the Curious Case of Code Prompts
Exploring the Curious Case of Code Prompts Open
Recent work has shown that prompting language models with code-like representations of natural language leads to performance improvements on structured reasoning tasks. However, such tasks comprise only a small subset of all natural langua…
View article: Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications
Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications Open
Language model applications are becoming increasingly popular and complex, often including features like tool usage and retrieval augmentation. However, existing frameworks for such applications are often opinionated, deciding for develope…
View article: Enhancing Human Summaries for Question-Answer Generation in Education
Enhancing Human Summaries for Question-Answer Generation in Education Open
Hannah Gonzalez, Liam Dugan, Eleni Miltsakaki, Zhiqi Cui, Jiaxuan Ren, Bryan Li, Shriyash Upadhyay, Etan Ginsberg, Chris Callison-Burch. Proceedings of the 18th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2…
View article: Real or Fake Text?: Investigating Human Ability to Detect Boundaries Between Human-Written and Machine-Generated Text
Real or Fake Text?: Investigating Human Ability to Detect Boundaries Between Human-Written and Machine-Generated Text Open
As text generated by large language models proliferates, it becomes vital to understand how humans engage with such text, and whether or not they are able to detect when the text they are reading did not originate with a human writer. Prio…
View article: The Case for a Single Model that can Both Generate Continuations and Fill in the Blank
The Case for a Single Model that can Both Generate Continuations and Fill in the Blank Open
The task of inserting text into a specified position in a passage, known as fill in the blank (FitB), is useful for a variety of applications where writers interact with a natural language generation (NLG) system to craft text. While previ…
View article: A Feasibility Study of Answer-Agnostic Question Generation for Education
A Feasibility Study of Answer-Agnostic Question Generation for Education Open
We conduct a feasibility study into the applicability of answer-agnostic question generation models to textbook passages. We show that a significant portion of errors in such systems arise from asking irrelevant or uninterpretable question…
View article: The Case for a Single Model that can Both Generate Continuations and Fill-in-the-Blank
The Case for a Single Model that can Both Generate Continuations and Fill-in-the-Blank Open
The task of inserting text into a specified position in a passage, known as fill in the blank (FitB), is useful for a variety of applications where writers interact with a natural language generation (NLG) system to craft text. While previ…
View article: A Feasibility Study of Answer-Unaware Question Generation for Education
A Feasibility Study of Answer-Unaware Question Generation for Education Open
Liam Dugan, Eleni Miltsakaki, Shriyash Upadhyay, Etan Ginsberg, Hannah Gonzalez, DaHyeon Choi, Chuning Yuan, Chris Callison-Burch. Findings of the Association for Computational Linguistics: ACL 2022. 2022.
View article: Watching Paint Dry: I/VOC Emissions from Architectural Coatings and their Impact on SOA Formation
Watching Paint Dry: I/VOC Emissions from Architectural Coatings and their Impact on SOA Formation Open
Emissions from volatile chemical products (VCPs) are emerging as a major source of anthropogenic secondary organic aerosol (SOA) precursors. Paints and coatings are an important class of VCPs that emit both volatile and intermediate volati…
View article: Watching Paint Dry: I/VOC Emissions from Architectural Coatings and their Impact on SOA Formation
Watching Paint Dry: I/VOC Emissions from Architectural Coatings and their Impact on SOA Formation Open
Emissions from volatile chemical products (VCPs) are emerging as a major source of anthropogenic secondary organic aerosol (SOA) precursors. Paints and coatings are an important class of VCPs that emit both volatile and intermediate volati…
View article: RoFT: A Tool for Evaluating Human Detection of Machine-Generated Text
RoFT: A Tool for Evaluating Human Detection of Machine-Generated Text Open
In recent years, large neural networks for natural language generation (NLG) have made leaps and bounds in their ability to generate fluent text. However, the tasks of evaluating quality differences between NLG systems and understanding ho…