Explanipedia

OnlineHOI: Towards Online Human-Object Interaction Generation and Perception Open

Yihong Ji, Yunze Liu, Yiyao Zhuo, Weijiang Yu, Fei Ma , et al. · 2025

The perception and generation of Human-Object Interaction (HOI) are crucial for fields such as robotics, AR/VR, and human behavior understanding. However, current approaches model this task in an offline setting, where information at each …

CellFM: a large-scale foundation model pre-trained on transcriptomics of 100 million human cells Open

Yuansong Zeng, Jiancong Xie, Ningyuan Shangguan, Zhuoyi Wei, Wenbing Li , et al. · 2025

PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis Open

Yifan Xie, Tao Feng, Xin Zhang, Xiangyang Luo, Zixuan Guo , et al. · 2025

Talking head synthesis with arbitrary speech audio is a crucial challenge in the field of digital humans. Recently, methods based on radiance fields have received increasing attention due to their ability to synthesize high-fidelity and id…

ReMask-Animate: Refined Character Image Animation Using Mask-Guided Adapters Open

Xunzhi Xiang, Haiwei Xue, Zonghong Dai, Di Wang, Minglei Li , et al. · 2025

Pose-controlled human video generation is of significant interest and finds extensive applications in areas such as automated advertising and content creation on social media platforms. While existing methods employing pose sequences and r…

PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis Open

Yifan Xie, Tao Feng, Xin Zhang, Xiangyang Luo, Zixuan Guo , et al. · 2024

Talking head synthesis with arbitrary speech audio is a crucial challenge in the field of digital humans. Recently, methods based on radiance fields have received increasing attention due to their ability to synthesize high-fidelity and id…

A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions Open

Lei Huang, Weijiang Yu, Weitao Ma, Weihong Zhong, Zhangyin Feng , et al. · 2024

The emergence of large language models (LLMs) has marked a significant breakthrough in natural language processing (NLP), fueling a paradigm shift in information acquisition. Nevertheless, LLMs are prone to hallucination, generating plausi…

BeamAggR: Beam Aggregation Reasoning over Multi-source Knowledge for Multi-hop Question Answering Open

Zheng Chu, Jingchang Chen, Qianglong Chen, Haotian Wang, Kun Yan Zhu , et al. · 2024

Large language models (LLMs) have demonstrated strong reasoning capabilities. Nevertheless, they still suffer from factual errors when tackling knowledge-intensive tasks. Retrieval-augmented reasoning represents a promising approach. Howev…

Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse Process Open

Tianyu Lin, Zhiguang Chen, Zhonghao Yan, Weijiang Yu, Fudan Zheng · 2024

Diffusion models have demonstrated their effectiveness across various generative tasks. However, when applied to medical image segmentation, these models encounter several challenges, including significant resource and time requirements. T…

Exploring Low-Resource Medical Image Classification with Weakly Supervised Prompt Learning Open

Fudan Zheng, Jindong Cao, Weijiang Yu, Zhiguang Chen, Nong Xiao , et al. · 2024

Most advances in medical image recognition supporting clinical auxiliary diagnosis meet challenges due to the low-resource situation in the medical field, where annotations are highly expensive and professional. This low-resource problem c…

Intensive Vision-guided Network for Radiology Report Generation Open

Fudan Zheng, Mengfei Li, Ying Wang, Weijiang Yu, Ruixuan Wang , et al. · 2024

Automatic radiology report generation is booming due to its huge application potential for the healthcare industry. However, existing computer vision and natural language processing approaches to tackle this problem are limited in two aspe…

Intensive vision-guided network for radiology report generation Open

Fudan Zheng, Mengfei Li, Ying Wang, Weijiang Yu, Ruixuan Wang , et al. · 2023

Objective. Automatic radiology report generation is booming due to its huge application potential for the healthcare industry. However, existing computer vision and natural language processing approaches to tackle this problem are limited …

AdaNAS: Adaptively Post-processing with Self-supervised Neural Architecture Search for Ensemble Rainfall Forecasts Open

Yingpeng Wen, Weijiang Yu, Fudan Zheng, Dan Huang, Nong Xiao · 2023

Previous post-processing studies on rainfall forecasts using numerical weather prediction (NWP) mainly focus on statistics-based aspects, while learning-based aspects are rarely investigated. Although some manually-designed models are prop…

Learning to Break: Knowledge-Enhanced Reasoning in Multi-Agent Debate System Open

Haotian Wang, Xiyuan Du, Weijiang Yu, Qianglong Chen, Kun Zhu , et al. · 2023

Multi-agent debate system (MAD) imitating the process of human discussion in pursuit of truth, aims to align the correct cognition of different agents for the optimal solution. It is challenging to make various agents perform right and hig…

TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models Open

Zheng Chu, Jingchang Chen, Qianglong Chen, Weijiang Yu, Haotian Wang , et al. · 2023

Grasping the concept of time is a fundamental facet of human cognition, indispensable for truly comprehending the intricacies of the world. Previous studies typically focus on specific aspects of time, lacking a comprehensive temporal reas…

Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model Open

Yuansong Zeng, Zhuoyi Wei, Qianmu Yuan, Sheng Chen, Weijiang Yu , et al. · 2023

Motivation Identifying the B-cell epitopes is an essential step for guiding rational vaccine development and immunotherapies. Since experimental approaches are expensive and time-consuming, many computational methods have been designed to …

Identifying spatial domain by adapting transcriptomics with histology through contrastive learning Open

Yuansong Zeng, Rui Yin, Mai Luo, Jianing Chen, Zixiang Pan , et al. · 2023

Recent advances in spatial transcriptomics have enabled measurements of gene expression at cell/spot resolution meanwhile retaining both the spatial information and the histology images of the tissues. Accurately identifying the spatial do…

Exploring Low-Resource Medical Image Classification with Weakly Supervised Prompt Learning Open

Fudan Zheng, Jindong Cao, Weijiang Yu, Zhiguang Chen, Nong Xiao , et al. · 2023

Identifying B-cell epitopes using AlphaFold2 predicted structures and pretrained language model Open

Yuansong Zeng, Zhuoyi Wei, Qianmu Yuan, Sheng Chen, Weijiang Yu , et al. · 2022

Motivation Identifying the B-cell epitopes is an essential step for guiding rational vaccine development and immunotherapies. Due to experimental approaches being expensive and time-consuming, many computational methods have been designed …

A Meta-learning based Graph-Hierarchical Clustering Method for Single Cell RNA-Seq Data Open

Zixiang Pan, Yuefan Lin, Haokun Zhang, Yuansong Zeng, Weijiang Yu , et al. · 2022

Single cell sequencing techniques enable researchers view complex bio-tissues from a more precise perspective to identify cell types. However, more and more recent works have been done to find more detailed subtypes within already known ce…

Deciphering Spatial Domains by Integrating Histopathological Image and Transcriptomics via Contrastive Learning Open

Yuansong Zeng, Rui Yin, Mai Luo, Jianing Chen, Zixiang Pan , et al. · 2022

Recent advances in spatial transcriptomics have enabled measurements of gene expression at cell/spot resolution meanwhile retaining both the spatial information and the histopathological images of the tissues. Deciphering the spatial domai…

A Meta-learning based Graph-Hierarchical Clustering Method for Single Cell RNA-Seq Data Open

Zixiang Pan, Yuefan Lin, Haokun Zhang, Yuansong Zeng, Weijiang Yu , et al. · 2022

Single cell sequencing techniques enable researchers view complex bio-tissues from a more precise perspective to identify cell types. However, more and more recent works have been done to find more detailed subtypes within already known ce…

Spatial transcriptomics prediction from histology jointly through Transformer and graph neural networks Open

Yuansong Zeng, Zhuoyi Wei, Weijiang Yu, Rui Yin, Yuchen Yuan , et al. · 2022

The rapid development of spatial transcriptomics allows the measurement of RNA abundance at a high spatial resolution, making it possible to simultaneously profile gene expression, spatial locations of cells or spots, and the corresponding…

Hybrid Reasoning Network for Video-based Commonsense Captioning Open

Weijiang Yu, Jian Liang, Lei Ji, Lu Li, Yuejian Fang , et al. · 2021

The task of video-based commonsense captioning aims to generate event-wise captions and meanwhile provide multiple commonsense descriptions (e.g., attribute, effect and intention) about the underlying event in the video. Prior works explor…

Deep Animation Video Interpolation in the Wild Open

Siyao Li, Shiyu Zhao, Weijiang Yu, Wenxiu Sun, Dimitris Metaxas , et al. · 2021

In the animation industry, cartoon videos are usually produced at low frame rate since hand drawing of such frames is costly and time-consuming. Therefore, it is desirable to develop computational models that can automatically interpolate …

Improving Math Word Problems with Pre-trained Knowledge and Hierarchical Reasoning Open

Weijiang Yu, Yingpeng Wen, Fudan Zheng, Nong Xiao · 2021

The recent algorithms for math word problems (MWP) neglect to use outside knowledge not present in the problems. Most of them only capture the word-level relationship and ignore to build hierarchical reasoning like the human being for mini…

Heterogeneous Graph Learning for Visual Commonsense Reasoning Open

Weijiang Yu, Jingwen Zhou, Weihao Yu, Xiaodan Liang, Nong Xiao · 2019

Visual commonsense reasoning task aims at leading the research field into solving cognition-level reasoning with the ability of predicting correct answers and meanwhile providing convincing reasoning paths, resulting in three sub-tasks i.e…

Layout-Graph Reasoning for Fashion Landmark Detection Open

Weijiang Yu, Xiaodan Liang, Ke Gong, Chenhan Jiang, Nong Xiao , et al. · 2019

Detecting dense landmarks for diverse clothes, as a fundamental technique for clothes analysis, has attracted increasing research attention due to its huge application potential. However, due to the lack of modeling underlying semantic lay…

Weijiang Yu YOU? Author Swipe