Dan Iter
Generate rather than Retrieve: Large Language Models are Strong Context Generators
Knowledge-intensive tasks, such as open-domain question answering (QA), require access to a large amount of world or domain knowledge. A common approach for knowledge-intensive tasks is to employ a retrieve-then-read pipeline that first re…
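The snippet above describes the standard retrieve-then-read pipeline; the paper's proposal is to replace the retriever with an LLM that generates context documents. A minimal sketch of the contrast, assuming placeholder callables retriever, llm, and reader (none of these is a real API):

    def retrieve_then_read(question, retriever, reader, k=5):
        # Standard pipeline: fetch k passages from a corpus, then answer from them.
        passages = retriever(question, top_k=k)
        return reader(question, passages)

    def generate_then_read(question, llm, reader, k=5):
        # The paper's alternative: have a large language model write k context
        # documents about the question, then answer from the generated text.
        passages = [llm(f"Generate a background document to answer: {question}")
                    for _ in range(k)]
        return reader(question, passages)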
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5…
The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions
Recent progress in Large Language Models (LLMs) has produced models that exhibit remarkable performance across a variety of NLP tasks. However, it remains unclear whether the existing focus of NLP research accurately captures the genuine r…
Siru Ouyang, Shuohang Wang, Yang Liu, Ming Zhong, Yizhu Jiao, Dan Iter, Reid Pryzant, Chenguang Zhu, Heng Ji, Jiawei Han. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. 2023.
Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models
Large language models (LLMs) can perform a wide range of tasks by following natural language instructions, without the necessity of task-specific fine-tuning. Unfortunately, the performance of LLMs is greatly influenced by the quality of t…
In-Context Demonstration Selection with Cross Entropy Difference
Large language models (LLMs) can use in-context demonstrations to improve performance on zero-shot tasks. However, selecting the best in-context examples is challenging because model performance can vary widely depending on the selected ex…
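One plausible reading of the selection rule, as a hedged sketch: adapt the base model to each candidate demonstration and keep the demonstration whose adapted model most lowers the loss on the test input. The helpers finetune and loss are illustrative placeholders, not the paper's API:

    def select_demonstration(test_input, candidates, base_model):
        # Cross-entropy difference (CED): loss under the demo-adapted model
        # minus loss under the unadapted base model; lower is better.
        base_loss = loss(base_model, test_input)
        best_demo, best_ced = None, float("inf")
        for demo in candidates:
            tuned = finetune(base_model, demo)   # small per-demonstration adaptation
            ced = loss(tuned, test_input) - base_loss
            if ced < best_ced:
                best_demo, best_ced = demo, ced
        return best_demo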
LMGQS: A Large-scale Dataset for Query-focused Summarization
Query-focused summarization (QFS) aims to extract or generate a summary of an input document that directly answers or is relevant to a given query. The lack of large-scale datasets in the form of documents, queries, and summaries has hinde…
InheritSumm: A General, Versatile and Compact Summarizer by Distilling from GPT
While large models such as GPT-3 demonstrate exceptional performance in zero-shot and few-shot summarization tasks, their extensive serving and fine-tuning costs hinder their utilization in various applications. Conversely, previous studies …
Automatic Prompt Optimization with "Gradient Descent" and Beam Search
Large Language Models (LLMs) have shown impressive performance as general-purpose agents, but their abilities remain highly dependent on prompts that are hand-written with onerous trial-and-error effort. We propose a simple and nonparamet…
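A hedged sketch of the loop the title suggests: LLM-written critiques act as "textual gradients", edited prompts as gradient steps, and a beam keeps the best candidates. llm and evaluate are placeholder callables, not a real API:

    def optimize_prompt(seed_prompt, train_set, llm, evaluate,
                        beam_width=4, steps=5, expansions=4):
        beam = [seed_prompt]
        for _ in range(steps):
            candidates = list(beam)
            for prompt in beam:
                # "Gradient": a natural-language critique of the prompt's failures.
                errors = [ex for ex in train_set if not evaluate(prompt, ex)]
                critique = llm(f"Prompt: {prompt}\nFailed examples: {errors}\n"
                               "Explain why the prompt failed.")
                for _ in range(expansions):
                    # Gradient step: edit the prompt to address the critique.
                    candidates.append(llm(f"Improve the prompt given the critique.\n"
                                          f"Prompt: {prompt}\nCritique: {critique}"))
            # Beam search: keep the top-scoring prompts on the training data.
            beam = sorted(candidates,
                          key=lambda p: sum(evaluate(p, ex) for ex in train_set),
                          reverse=True)[:beam_width]
        return beam[0]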
G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment
The quality of texts generated by natural language generation (NLG) systems is hard to measure automatically. Conventional reference-based metrics, such as BLEU and ROUGE, have been shown to have relatively low correlation with human judgm…
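A minimal sketch of the scoring idea G-Eval is known for: rather than taking a single sampled rating, weight each possible rating by its token probability and return the expectation. rating_token_probs is a hypothetical helper exposing the model's distribution over rating tokens:

    def g_eval_score(source, candidate, criterion, rating_token_probs):
        prompt = (f"Evaluate the {criterion} of the summary.\n"
                  f"Source: {source}\nSummary: {candidate}\n"
                  "Rate from 1 to 5:")
        probs = rating_token_probs(prompt, ratings=[1, 2, 3, 4, 5])
        # Expected rating under the token distribution gives a finer-grained,
        # more human-aligned score than a single integer.
        return sum(r * p for r, p in probs.items())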
How Does In-Context Learning Help Prompt Tuning?
Fine-tuning large language models is becoming ever more impractical due to their rapidly-growing scale. This motivates the use of parameter-efficient adaptation methods such as prompt tuning (PT), which adds a small number of tunable embed…
The Trade-offs of Domain Adaptation for Neural Language Models
This work connects language model adaptation with concepts of machine learning theory. We consider a training setup with a large out-of-domain set and a small in-domain set. We derive how the benefit of training a model on either set depen…
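For flavor, the classic domain-adaptation bound of Ben-David et al. captures the kind of trade-off the abstract describes (this is the textbook result, not necessarily the paper's exact theorem): $\epsilon_T(h) \le \epsilon_S(h) + \frac{1}{2} d_{\mathcal{H}\Delta\mathcal{H}}(\mathcal{D}_S, \mathcal{D}_T) + \lambda$, where $\epsilon_S$ and $\epsilon_T$ are the out-of-domain and in-domain errors, $d_{\mathcal{H}\Delta\mathcal{H}}$ measures the distance between the two distributions, and $\lambda$ is the error of the best joint hypothesis. The larger the distribution distance, the less a big out-of-domain set can help.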
Focus on what matters: Applying Discourse Coherence Theory to Cross Document Coreference
Performing event and entity coreference resolution across documents vastly increases the number of candidate mentions, making it intractable to do the full $n^2$ pairwise comparisons. Existing approaches simplify by considering coreference…
On the Complementarity of Data Selection and Fine Tuning for Domain Adaptation
Domain adaptation of neural networks commonly relies on three training phases: pretraining, selected data training and then fine tuning. Data selection improves target domain generalization by training further on pretraining data identifie…
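A hedged sketch of the three phases named above, with Moore-Lewis-style cross-entropy-difference scoring standing in for "data selection"; train, lm_loss, and the two scoring LMs are illustrative placeholders, not the paper's API:

    def adapt(base_model, pretrain_data, in_domain_data,
              in_domain_lm, general_lm, budget):
        # Phase 2: rank pretraining examples by cross-entropy difference
        # (lower = looks more in-domain) and continue training on the top ones.
        ranked = sorted(pretrain_data,
                        key=lambda x: lm_loss(in_domain_lm, x) - lm_loss(general_lm, x))
        model = train(base_model, ranked[:budget])   # selected-data training
        # Phase 3: fine-tune on the small in-domain set.
        return train(model, in_domain_data)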
Pretraining with Contrastive Sentence Objectives Improves Discourse Performance of Language Models
Recent models for unsupervised representation learning of text have employed a number of techniques to improve contextual word representations but have put little focus on discourse-level representations. We propose CONPONO, an inter-sente…
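A hedged sketch of an inter-sentence contrastive objective in this spirit: for each discourse distance k, the encoder must rank the true k-away sentence above random negatives. Shapes and helpers are illustrative assumptions, not the paper's exact formulation:

    import torch
    import torch.nn.functional as F

    def inter_sentence_contrastive_loss(anchor_vec, true_neighbors, negatives):
        # anchor_vec: (d,) encoding of the anchor sentence
        # true_neighbors: {k: (d,)} encoding of the sentence k positions away
        # negatives: (n, d) encodings of randomly sampled sentences
        loss = 0.0
        for k, pos in true_neighbors.items():
            candidates = torch.cat([pos.unsqueeze(0), negatives])  # (n+1, d)
            logits = candidates @ anchor_vec                       # similarity scores
            # The true neighbor sits at index 0; cross-entropy pushes the
            # model to rank it above the negatives.
            loss = loss + F.cross_entropy(logits.unsqueeze(0),
                                          torch.zeros(1, dtype=torch.long))
        return loss / len(true_neighbors)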
Entity Attribute Relation Extraction with Attribute-Aware Embeddings
Entity-attribute relations are a fundamental component for building large-scale knowledge bases, which are widely employed in modern search engines. However, most such knowledge bases are manually curated, covering only a small fraction of…
Automatic Detection of Incoherent Speech for Diagnosing Schizophrenia
Schizophrenia is a mental disorder that afflicts an estimated 0.7% of adults worldwide. It affects many areas of mental function, often evident from incoherent speech. Diagnosing schizophrenia relies on subjective judgments resulting in …
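One simple incoherence signal in the direction the abstract suggests, as a hedged sketch: average semantic similarity between consecutive sentences of a transcript, with embed standing in for an unspecified sentence encoder:

    import numpy as np

    def coherence_score(sentences, embed):
        vecs = [embed(s) for s in sentences]
        sims = [np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))
                for a, b in zip(vecs, vecs[1:])]
        # Low average adjacent-sentence similarity flags incoherent speech.
        return float(np.mean(sims))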
FrameIt: Ontology Discovery for Noisy User-Generated Text
A common need of NLP applications is to extract structured data from text corpora in order to perform analytics or trigger an appropriate action. The ontology defining the structure is typically application dependent and in many cases it i…
Socratic Learning: Augmenting Generative Models to Incorporate Latent Subsets in Training Data
A challenge in training discriminative models like neural networks is obtaining enough labeled training data. Recent approaches use generative models to combine weak supervision sources, like user-defined heuristics or knowledge bases, to …
Socratic Learning: Correcting Misspecified Generative Models using Discriminative Models
A challenge in training discriminative models like neural networks is obtaining enough labeled training data. Recent approaches use generative models to combine weak supervision sources, like user-defined heuristics or knowledge bases, to …