Explanipedia

Benchmarking Vision-Language Models on Chinese Ancient Documents: From OCR to Knowledge Reasoning Open

Haiyang Yu, Yuchuan Wu, Fan Shi, Lei Liao, Jinghui Lu , et al. · 2025

Chinese ancient documents, invaluable carriers of millennia of Chinese history and culture, hold rich knowledge across diverse fields but face challenges in digitization and understanding, i.e., traditional methods only scan images, while …

Attention and Risk-Aware Decision Framework for Safe Autonomous Driving Open

Zhen Tian, Fan Yuan, Yangfan He, Qinghao Li, C. Chen , et al. · 2025

Autonomous driving has attracted great interest due to its potential capability in full-unsupervised driving. Model-based and learning-based methods are widely used in autonomous driving. Model-based methods rely on pre-defined models of t…

Adaptive Evolution Factor Risk Ellipse Framework for Reliable and Safe Autonomous Driving Open

Fan Yuan, Zhen Tian, Yangfan He, Guojian Zou, Chunhong Yuan , et al. · 2025

In recent years, ensuring safety, efficiency, and comfort in interactive autonomous driving has become a critical challenge. Traditional model-based techniques, such as game-theoretic methods and robust control, are often overly conservati…

Enhanced Mean Field Game for Interactive Decision-Making with Varied Stylish Multi-Vehicles Open

Lin Zheng, Zhen Tian, Yangfan He, Shuo Liu, Huilin Chen , et al. · 2025

This paper presents an MFG-based decision-making framework for autonomous driving in heterogeneous traffic. To capture diverse human behaviors, we propose a quantitative driving style representation that maps abstract traits to parameters …

Enhancing Commentary Strategies for Guandan: A Study of LLMs in Game Commentary Generation Open

Juan Su, Meiling Tao, Xuechen Liang, Yangfan He, Yongding Tao , et al. · 2025

Recent advancements in large language models (LLMs) have unlocked the potential for generating high-quality game commentary. However, producing insightful and engaging commentary for complex games with incomplete information remains a sign…

See the Forest and the Trees: A Synergistic Reasoning Framework for Knowledge-Based Visual Question Answering Open

Junjie Wang, Yunhan Tang, Yijie Wang, Zhihao Yuan, Huan Wang , et al. · 2025

Multimodal Large Language Models (MLLMs) have pushed the frontiers of Knowledge-Based Visual Question Answering (KBVQA), yet their reasoning is fundamentally bottlenecked by a reliance on uni-dimensional evidence. This "seeing only the tre…

MountainLion: A Multi-Modal LLM-Based Agent System for Interpretable and Adaptive Financial Trading Open

Siyi Wu, Junqiao Wang, Zhongwei Guan, Long Zhao, Xinyuan Song , et al. · 2025

Cryptocurrency trading is a challenging task requiring the integration of heterogeneous data from multiple modalities. Traditional deep learning and reinforcement learning approaches typically demand large training datasets and encode dive…

GLIMPSE: Do Large Vision-Language Models Truly Think With Videos or Just Glimpse at Them? Open

Yiyang Zhou, Linjie Li, Shi Qiu, Zhengyuan Yang, Yuyang Zhao , et al. · 2025

Existing video benchmarks often resemble image-based benchmarks, with question types like "What actions does the person perform throughout the video?" or "What color is the woman's dress in the video?" For these, models can often answer by…

FASIONAD++ : Integrating High-Level Instruction and Information Bottleneck in FAt-Slow fusION Systems for Enhanced Safety in Autonomous Driving with Adaptive Feedback Open

Kangan Qian, Ziang Luo, Shan Jiang, Zhi-Qiu Huang, Jinyu Miao , et al. · 2025

Ensuring safe, comfortable, and efficient planning is crucial for autonomous driving systems. While end-to-end models trained on large datasets perform well in standard driving scenarios, they struggle with complex low-frequency events. Re…

MaRI: Material Retrieval Integration across Domains Open

Jianhui Wang, Yangfan He, Huixiong Zhang, Yuxuan Chen, Jingwei Huang · 2025

Accurate material retrieval is critical for creating realistic 3D assets. Existing methods rely on datasets that capture shape-invariant and lighting-varied representations of materials, which are scarce and face challenges due to limited …

PromptLNet: Region-Adaptive Aesthetic Enhancement via Prompt Guidance in Low-Light Enhancement Net Open

Jun Yin, Yangfan He, Mengmeng Zhang, Pengyu Zeng, Tianyi Wang , et al. · 2025

Learning and improving large language models through human preference feedback has become a mainstream approach, but it has rarely been applied to the field of low-light image enhancement. Existing low-light enhancement evaluations typical…

Enhancing Intent Understanding for Ambiguous prompt: A Human-Machine Co-Adaption Strategy Open

Yangfan He, Jianhui Wang, Kun Li, Yijin Wang, Li Sun , et al. · 2025

Computer science Psychology

Current image generation systems produce high-quality images but struggle with ambiguous user prompts, making interpretation of actual user intentions difficult. Many users must modify their prompts several times to ensure the generated im…

Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware Inversion Open

Yangfan He, Sida Li, Kun Li, Jianhui Wang, Binxu Li , et al. · 2025

Computer science Geology

Recent advancements in text-to-image (T2I) generation using diffusion models have enabled cost-effective video-editing applications by leveraging pre-trained models, eliminating the need for resource-intensive training. However, the frame-…

ArtFormer: Controllable Generation of Diverse 3D Articulated Objects Open

Jiayi Su, Yuanming Feng, Zheng Li, Jinhua Song, Yangfan He , et al. · 2024

Computer science

This paper presents a novel framework for modeling and conditional generation of 3D articulated objects. Troubled by flexibility-quality tradeoffs, existing methods are often limited to using predefined structures or retrieving shapes from…

FASIONAD : FAst and Slow FusION Thinking Systems for Human-Like Autonomous Driving with Adaptive Feedback Open

Kangan Qian, Z. Ma, Yangfan He, Ziang Luo, Tianyu Shi , et al. · 2024

Computer science Psychology Philosophy

Ensuring safe, comfortable, and efficient navigation is a critical goal for autonomous driving systems. While end-to-end models trained on large-scale datasets excel in common driving scenarios, they often struggle with rare, long-tail eve…

FALCON: Feedback-driven Adaptive Long/short-term memory reinforced Coding Optimization system Open

Zeyuan Li, Yangfan He, Lewei He, Jianhui Wang, Tianyu Shi , et al. · 2024

Computer science Psychology Mathematics

Recently, large language models (LLMs) have achieved significant progress in automated code generation. Despite their strong instruction-following capabilities, these models frequently struggled to align with user intent in coding scenario…

Systematic review and meta-analysis of breathing exercises effects on lung function and quality of life in postoperative lung cancer patients Open

Jiayi Ren, Zongyue Li, Yangfan He, Hang Gao, Li Jin , et al. · 2024

Medicine

This study indicates that breathing exercises significantly improve postoperative pulmonary function and QoL in lung cancer patients. Future research should delve into the mechanisms behind these exercises and evaluate their long-term reha…

Azaphosphinate Dyes: A Low Molecular Weight Near‐Infrared Scaffold for Development of Photoacoustic or Fluorescence Imaging Probes Open

Ruwen Yin, Frederik Brøndsted, Lin Li, Julia L. McAfee, Fang Yuan , et al. · 2024

Chemistry Materials science Physics

Near‐infrared (NIR) dyes are desirable for biological imaging applications including photoacoustic (PA) and fluorescence imaging. Nonetheless, current NIR dyes are often plagued by relatively large molecular weights, poor water solubility,…

Azaphosphinate Dyes: A Low Molecular Weight Near-Infrared Scaffold for Development of Photoacoustic and Fluorescence Imaging Probes Open

Ruwen Yin, Frederik Brøndsted, Yangfan He, Fang Yuan, Cliff I. Stains · 2023

Chemistry Materials science Physics

Near-infrared (NIR) dyes are desirable for biological imaging applications including photoacoustic and fluorescence imaging. Nonetheless, current NIR dyes are often plagued by relatively large molecular weights, poor water solubility, and …

Classification and Generation of Light Sources Using Gamma Fitting Open

Shuanghao Zhang, Huaibin Zheng, Wang Gao, Hui Chen, Yangfan He , et al. · 2020

Environmental science Computer science Mathematics

In general, the typical approach to discriminate antibunching, bunching or superbunching categories make use of calculating the second-order coherence function ${g^{(2)}}(τ)$ of light. Although the classical light sources correspond to the…

Frequency-Diverse Bunching Metamaterial Antenna for Coincidence Imaging Open

Mengran Zhao, Shitao Zhu, Jianxing Li, Hongyu Shi, Juan Chen , et al. · 2019

Physics Computer science Medicine

A frequency-diverse bunching metamaterial antenna for coincidence imaging in the Ka band is proposed in this paper. The bunching metamaterial antenna includes a broadband circular array and a frequency-diverse bunching metalens. Firstly, i…

Wideband polarization-independent anomalous reflection metasurface with multiple resonance modes Open

Guoxiang Dong, Song Xia, Yongyong Zhuang, Hongyu Shi, Zhan Zhang , et al. · 2017

Materials science Physics Computer science

An ultra-thin metasurface is proposed to realize wideband polarization-independent anomalous reflection. The sub-wavelength resonator can produce different resonance modes, which are the result of the combined effect of dielectric and the …

Yangfan He YOU? Author Swipe