Quanting Xie YOU? Author Swipe

Last 10y

Open Invitation to Help Curate This Field & Enhance Impact .ORG

MM-SeR: Multimodal Self-Refinement for Lightweight Image Captioning Open

Junha Song, Yongsik Jo, So-Yeon Min, Quanting Xie, Tae-Hwan Kim , et al. · 2025

Systems such as video chatbots and navigation robots often depend on streaming image captioning to interpret visual inputs. Existing approaches typically employ large multimodal language models (MLLMs) for this purpose, but their substanti…

Embodied-RAG: General Non-parametric Embodied Memory for Retrieval and Generation Open

Quanting Xie, So-Yeon Min, Tianyi Zhang, Kaimeng Xu, Anil K. Bajaj , et al. · 2024

Computer science Mathematics

There is no limit to how much a robot might explore and learn, but all of that knowledge needs to be searchable and actionable. Within language research, retrieval augmented generation (RAG) has become the workhorse of large-scale non-para…

Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis Open

Yafei Hu, Quanting Xie, Vidhi Jain, Jonathan Francis, Jay Patrikar , et al. · 2023

Computer science Mathematics History

Building general-purpose robots that operate seamlessly in any environment, with any object, and utilizing various skills to complete diverse tasks has been a long-standing goal in Artificial Intelligence. However, as a community, we have …

Reasoning about the Unseen for Efficient Outdoor Object Navigation Open

Quanting Xie, Tianyi Zhang, Kedi Xu, Matthew Johnson‐Roberson, Yonatan Bisk · 2023

Computer science Geography Physics

Robots should exist anywhere humans do: indoors, outdoors, and even unmapped environments. In contrast, the focus of recent advancements in Object Goal Navigation(OGN) has targeted navigating in indoor environments by leveraging spatial an…

Creating related items for first view…