Explanipedia

Text2Grad: Reinforcement Learning from Natural Language Feedback

Hanyang Wang, Lu Wang, Chaoyun Zhang, Tianjun Mao, Si Qin, et al. · 2025

Traditional RLHF optimizes language models with coarse, scalar rewards that mask the fine-grained reasons behind success or failure, leading to slow and opaque learning. Recent work augments RL with textual critiques through prompting or r…

Zoomer: Adaptive Image Focus Optimization for Black-box MLLM

Jiaxu Qian, Chendong Wang, Yifan Yang, Chaoyun Zhang, Huiqiang Jiang, et al. · 2025

Recent advancements in multimodal large language models (MLLMs) have broadened the scope of vision-language tasks, excelling in applications like image captioning and interactive question-answering. However, these models struggle with accu…

I$^2$KD-SLU: An Intra-Inter Knowledge Distillation Framework for Zero-Shot Cross-Lingual Spoken Language Understanding

Tianjun Mao, Chenghong Zhang · 2023

Spoken language understanding (SLU) typically includes two subtasks: intent detection and slot filling. Currently, it has achieved great success in high-resource languages, but it still remains challenging in low-resource languages due to …

Unified Pretraining Target Based Video-music Retrieval With Music Rhythm And Video Optical Flow Information

Tianjun Mao, Shansong Liu, Yunxuan Zhang, Dian Li, Ying Shan · 2023

Background music (BGM) can enhance the video's emotion. However, selecting an appropriate BGM often requires domain knowledge. This has led to the development of video-music retrieval techniques. Most existing approaches utilize pretrained…

Tianjun Mao