Vincent Perot
Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting
Retrieval augmented generation (RAG) combines the generative abilities of large language models (LLMs) with external knowledge sources to provide more accurate and up-to-date responses. Recent RAG advancements focus on improving retrieval …
CodecLM: Aligning Language Models with Tailored Synthetic Data
Instruction tuning has emerged as the key in aligning large language models (LLMs) with specific task instructions, thereby mitigating the discrepancy between the next-token prediction objective and users' actual goals. To reduce the labor…
Noise-Aware Training of Layout-Aware Language Models
A visually rich document (VRD) utilizes visual features along with linguistic cues to disseminate information. Training a custom extractor that identifies named entities from a document requires a large number of instances of the target do…
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding
Table-based reasoning with large language models (LLMs) is a promising direction to tackle many table understanding tasks, such as table-based question answering and fact verification. Compared with generic reasoning, table-based reasoning…
LMDX: Language Model-based Document Information Extraction and Localization
Large Language Models (LLM) have revolutionized Natural Language Processing (NLP), improving state-of-the-art and exhibiting emergent capabilities across various tasks. However, their application in extracting information from visually ric…
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
The recent advent of self-supervised pre-training techniques has led to a surge in the use of multimodal learning in form document understanding. However, existing approaches that extend the mask language modeling to other modalities requi…
Chen-Yu Lee, Chun-Liang Li, Hao Zhang, Timothy Dozat, Vincent Perot, Guolong Su, Xiang Zhang, Kihyuk Sohn, Nikolay Glushnev, Renshen Wang, Joshua Ainslie, Shangbang Long, Siyang Qin, Yasuhisa Fujii, Nan Hua, Tomas Pfister. Proceedings of t…
QueryForm: A Simple Zero-shot Form Entity Query Framework
Zero-shot transfer learning for document understanding is a crucial yet under-investigated scenario to help reduce the high cost involved in annotating document entities. We present a novel query-based framework, QueryForm, that extracts e…
DualPrompt: Complementary Prompting for Rehearsal-free Continual Learning
Continual learning aims to enable a single model to learn a sequence of tasks without catastrophic forgetting. Top-performing methods usually require a rehearsal buffer to store past pristine examples for experience replay, which, however,…
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
Sequence modeling has demonstrated state-of-the-art performance on natural language and document understanding tasks. However, it is challenging to correctly serialize tokens in form-like documents in practice due to their variety of layou…
Learning to Prompt for Continual Learning
The mainstream paradigm behind continual learning has been to adapt the model parameters to non-stationary data distributions, where catastrophic forgetting is the central challenge. Typical methods rely on a rehearsal buffer or known task…