Explanipedia

A Comprehensive Survey on Cross-Domain Recommendation: Taxonomy, Progress, and Prospects

Hao Zhang, Mingyue Cheng, Qi Liu, Junzhe Jiang, Xianquan Wang, et al. · 2025

Reverse Modeling in Large Language Models

Sicheng Yu, Xu Yuanchen, Cunxiao Du, Yanying Zhou, Minghui Qiu, et al. · 2025

EFE-YOLO: A YOLO11n-based Efficient Feature Enhanced Framework for Steel Surface Defect Detection

Yongjian Chen, Hao Zhang · 2025

AUCAD: Automated Construction of Alignment Dataset from Log-Related Issues for Enhancing LLM-based Log Generation

Hao Zhang, Dong‐Jun Yu, Lei Zhang, Guoping Rong, Yongda Yu, et al. · 2024

Log statements have become an integral part of modern software systems. Prior research efforts have focused on supporting the decisions of placing log statements, such as where/what to log. With the increasing adoption of Large Language Mo…

Adaptive Hypergraph-Augmented Graph Convolution Network for Skeleton-based Action Recognition

Qi Qin, Yanan Liu, Qianhan Tang, Junhui He, Hao Zhang, et al. · 2024

Multi‐stage image inpainting using improved partial convolutions

Cheng Li, Dan Xu, Hao Zhang · 2024

In recent years, deep learning models have dramatically influenced image inpainting. However, many existing studies still suffer from over‐smoothed or blurred textures when missing regions are large or contain rich visual details. To resto…

MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs

Zhongshen Zeng, Yinhong Liu, Yingjia Wan, Jingyao Li, Pengguang Chen, et al. · 2024

Large language models (LLMs) have shown increasing capability in problem-solving and decision-making, largely based on the step-by-step chain-of-thought reasoning processes. However, evaluating these reasoning abilities has become increasi…

Blind Image Deblurring: When Patch-wise Minimal Pixels Prior Meets Fractional-Order Method

Tingting Wu, Shaojie Wan, Chenchen Feng, Hao Zhang, Tieyong Zeng · 2024

Blind image deblurring is a challenging issue in image processing. In blind image deblurring, the typical approach involves iteratively estimating both the blur kernel and latent image until convergence to the blur kernel of the observed i…

Evaluating the External and Parametric Knowledge Fusion of Large Language Models

Hao Zhang, Yuyang Zhang, Xiaoguang Li, Wenxuan Shi, Haonan Xu, et al. · 2024

Integrating external knowledge into large language models (LLMs) presents a promising solution to overcome the limitations imposed by their antiquated and static parametric memory. Prior studies, however, have tended to over-reliance on ex…

MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI

Kaining Ying, Fanqing Meng, Jin Wang, Zhiqian Li, Lin Han, et al. · 2024

Large Vision-Language Models (LVLMs) show significant strides in general-purpose multimodal applications such as visual dialogue and embodied navigation. However, existing multimodal evaluation benchmarks cover a limited number of multimod…

A Non-Destructive Detection and Grading Method of the Internal Quality of Preserved Eggs Based on an Improved ConvNext

Wenquan Tang, Hao Zhang, Hao Chen, Wei Fan, Qiaohua Wang · 2024

As a traditional delicacy in China, preserved eggs inevitably experience instances of substandard quality during the production process. Chinese preserved egg production facilities can only rely on experienced workers to select the preserv…

Empowering Sequential Recommendation from Collaborative Signals and Semantic Relatedness

Mingyue Cheng, Hao Zhang, Qi Liu, Fajie Yuan, Zhi Li, et al. · 2024

Sequential recommender systems (SRS) could capture dynamic user preferences by modeling historical behaviors ordered in time. Despite effectiveness, focusing only on the collaborative signals from behaviors does not fully grasp us…

Deep Unfolding Network with Spatial Alignment for multi-modal MRI reconstruction

Hao Zhang, Qi Wang, Jun Shi, Shihui Ying, Zhijie Wen · 2023

Multi-modal Magnetic Resonance Imaging (MRI) offers complementary diagnostic information, but some modalities are limited by the long scanning time. To accelerate the whole acquisition process, MRI reconstruction of one modality from highl…

Visual sentiment analysis with semantic correlation enhancement

Hao Zhang, Yanan Liu, Zhaoyu Xiong, Zhichao Wu, Dan Xu · 2023

Visual sentiment analysis is in great demand as it provides a computational method to recognize sentiment information in abundant visual contents from social media sites. Most of existing methods use CNNs to extract varying visual attribut…

Interpretable Geoscience Artificial Intelligence (XGeoS-AI): Application to Demystify Image Recognition

Jin‐Jian Xu, Hao Zhang, Chaosheng Tang, Lin Li, Bin Shi · 2023

As Earth science enters the era of big data, artificial intelligence (AI) not only offers great potential for solving geoscience problems, but also plays a critical role in accelerating the understanding of the complex, interactive, and mu…

Text-Guided Generation and Editing of Compositional 3D Avatars

Hao Zhang, Feng Yao, Peter Kulits, Yandong Wen, Justus Thies, et al. · 2023

Our goal is to create a realistic 3D facial avatar with hair and accessories using only a text description. While this challenge has attracted significant recent interest, existing methods either lack realism, produce unrealistic shapes, o…

Visual Sentiment Analysis with Semantic Correlation Enhancement

Hao Zhang, Yanan Liu, Zhaoyu Xiong, Zhichao Wu, Dan Xu · 2023

The moving target tracking and segmentation method based on space-time fusion

Jie Wang, Shibin Xuan, Hao Zhang, Xuyang Qin · 2022

At present, the target tracking method based on the correlation operation mainly uses deep learning to extract spatial information from video frames and then performs correlations on this basis. However, it does not extract the motion feat…

Learning multi-level representations for affective image recognition

Hao Zhang, Dan Xu, Gaifang Luo, Kangjian He · 2022

Images can convey intense affective experiences and affect people on an affective level. With the prevalence of online pictures and videos, evaluating emotions from visual content has attracted considerable attention. Affective image recog…

Fine-grained Sentiment Classification of Chinese Microblogs Combining Dual Weight Mechanism and Graph Convolutional Neural Network

Hao Zhang · 2022

Using deep learning models and attention mechanisms to classify fine-grained emotions of Chinese microblogs has become a research hotspot. However, the existing attention mechanisms consider the impact of words on words, and lack effective in…

Graph transformer network with temporal kernel attention for skeleton-based action recognition

Yanan Liu, Hao Zhang, Dan Xu, Kangjian He · 2022

Skeleton-based human action recognition has caused wide concern, as skeleton data can robustly adapt to dynamic circumstances such as camera view changes and background interference thus allowing recognition methods to focus on robust feat…

MAM: A multipath attention mechanism for image recognition

Hao Zhang, Guoqin Peng, Zhichao Wu, Jian Gong, Dan Xu, et al. · 2021

Attention mechanism has shown excellent performance in many computer vision tasks, while the previous literature may not adequately consider different types of attention mechanisms or is individual elaborate designed for a certain network.…

Contrastive learning for a single historical painting’s blind super-resolution

Hongzhen Shi, Dan Xu, Kangjian He, Hao Zhang, Yingying Yue · 2021

OsaMOT: Occlusion and scale‐aware multi‐object tracking algorithm for low viewpoint

Yingying Yue, Dan Xu, Kangjian He, Hongzhen Shi, Hao Zhang · 2021

Multi‐object tracking (MOT), which uses the context information of image sequences to locate, maintain identities and generate trajectories of multiple targets in each frame, is key technology in the field of computer vision. To address th…

Ant_ViBe: Improved ViBe Algorithm Based on Ant Colony Clustering under Dynamic Background

Yingying Yue, Dan Xu, Zhiming Qian, Hongzhen Shi, Hao Zhang · 2020

Foreground target detection algorithm (FTDA) is a fundamental preprocessing step in computer vision and video processing. A universal background subtraction algorithm for video sequences (ViBe) is a fast, simple, efficient and with optimal…

Target Tracking Method Based on Adaptive Structured Sparse Representation With Attention

Jie Wang, Shibin Xuan, Hao Zhang, Xuyang Qin · 2020

Considering the problems of motion blur, partial occlusion and fast motion in target tracking, a target tracking method based on adaptive structured sparse representation with attention is proposed. Under the framework of particle filterin…
