Explanipedia

Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey

Yunkai Dang, Kaichen Huang, Jiahao Huo, Yibo Yan, Sirui Huang, et al. · 2024

The rapid development of Artificial Intelligence (AI) has revolutionized numerous fields, with large language models (LLMs) and computer vision (CV) systems driving advancements in natural language understanding and visual processing, resp…

RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness

Tianyu Yu, Haoye Zhang, Yuan Yao, Yunkai Dang, Da Chen, et al. · 2024

Traditional feedback learning for hallucination reduction relies on labor-intensive manual labeling or expensive proprietary models. This leaves the community without foundational knowledge about how to build high-quality feedback with ope…

FILM: How can Few-Shot Image Classification Benefit from Pre-Trained Language Models?

Zihao Jiang, Yunkai Dang, Dong Pang, Huishuai Zhang, Weiran Huang · 2023

Few-shot learning aims to train models that can generalize to novel classes with only a few samples. Recently, a line of work has been proposed to enhance few-shot learning with accessible semantic information from class names. However, th…

Multi-Level Correlation Network For Few-Shot Image Classification

Yunkai Dang, Meijun Sun, Min Zhang, Zhengyu Chen, Xinliang Zhang, et al. · 2023

Few-shot image classification (FSIC) aims to recognize novel classes given few labeled images from base classes. Recent works have achieved promising classification performance, especially for metric-learning methods, where a measure at …
