Hao Sheng
YOU?
Author Swipe
View article: Efficient Data-Driven Modeling of Core Loss in Magnetic Materials for Power Electronics Systems
Efficient Data-Driven Modeling of Core Loss in Magnetic Materials for Power Electronics Systems Open
The accurate modeling and prediction of magnetic core losses are of great significance in the design of power electronic systems. Although traditional empirical equations (such as Steinmetz equation and its improved forms) are easy to calc…
View article: Double Check My Desired Return: Transformer with Target Alignment for Offline Reinforcement Learning
Double Check My Desired Return: Transformer with Target Alignment for Offline Reinforcement Learning Open
Offline reinforcement learning (RL) has achieved significant advances in domains such as robotic control, autonomous driving, and medical decision-making. Most existing methods primarily focus on training policies that maximize cumulative …
View article: Multi-Agent-Supported Tracking Based on Hidden Markov Model with Weighted Entropy
Multi-Agent-Supported Tracking Based on Hidden Markov Model with Weighted Entropy Open
Multi-object tracking (MOT) has witnessed significant advancements in recent years, yet it remains challenged by complex uncertainties arising from pedestrian movement patterns. To address this, we present a unified framework that explicit…
View article: YED-Net: Yoga Exercise Dynamics Monitoring with YOLOv11-ECA-Enhanced Detection and DeepSORT Tracking
YED-Net: Yoga Exercise Dynamics Monitoring with YOLOv11-ECA-Enhanced Detection and DeepSORT Tracking Open
Against the backdrop of the deep integration of national fitness and sports science, this study addresses the lack of standardized movement assessment in yoga training by proposing an intelligent analysis system that integrates an improved…
View article: Occlusion-Robust Multi-Target Tracking and Segmentation Framework with Mask Enhancement
Occlusion-Robust Multi-Target Tracking and Segmentation Framework with Mask Enhancement Open
Multi-object tracking stands as one of the most prominent domains in Computer Vision and has significant research value and practical importance. However, due to the complexity of scenarios in the real world, especially in crowded environm…
View article: Wavelet-Based Multi-Level Information Compensation Learning for Visible-Infrared Person Re-Identification
Wavelet-Based Multi-Level Information Compensation Learning for Visible-Infrared Person Re-Identification Open
View article: Advances in Light Field Spatial Super-Resolution: A Comprehensive Literature Survey
Advances in Light Field Spatial Super-Resolution: A Comprehensive Literature Survey Open
Super-resolution reconstruction of light field images has recently become a central focus in the fields of computational photography and computer vision. We present a systematic review of 17 mainstream light field spatial super-resolution …
View article: Improved Localization and Recognition of Handwritten Digits on MNIST Dataset with ConvGRU
Improved Localization and Recognition of Handwritten Digits on MNIST Dataset with ConvGRU Open
Video location prediction for handwritten digits presents unique challenges in computer vision due to the complex spatiotemporal dependencies and the need to maintain digit legibility across predicted frames, while existing deep learning-b…
View article: EVA-S3PC: Efficient, Verifiable, Accurate Secure Matrix Multiplication Protocol Assembly and Its Application in Regression
EVA-S3PC: Efficient, Verifiable, Accurate Secure Matrix Multiplication Protocol Assembly and Its Application in Regression Open
Efficient multi-party secure matrix multiplication is crucial for privacy-preserving machine learning, but existing mixed-protocol frameworks often face challenges in balancing security, efficiency, and accuracy. This paper presents an eff…
View article: Two-Stage Locating and Capacity Optimization Model for the Ultra-High-Voltage DC Receiving End Considering Carbon Emission Trading and Renewable Energy Time-Series Output Reconstruction
Two-Stage Locating and Capacity Optimization Model for the Ultra-High-Voltage DC Receiving End Considering Carbon Emission Trading and Renewable Energy Time-Series Output Reconstruction Open
With the load center’s continuous expansion and development of the AC power grid’s scale and construction, the recipient grid under the multi-feed DC environment is facing severe challenges of DC commutation failure and bipolar blocking du…
View article: End-to-end multimodal deep learning for real-time decoding of months-long neural activity from the same cells
End-to-end multimodal deep learning for real-time decoding of months-long neural activity from the same cells Open
Long-term, stable, and real-time decoding of behavior-dependent neural dynamics from the same cells is critical for brain-computer interfaces (BCIs) and for understanding neural evolution during learning, and memory, and disease progressio…
View article: RANDnet: Vehicle Re-Identification with Relation Attention and Nuance–Disparity Masks
RANDnet: Vehicle Re-Identification with Relation Attention and Nuance–Disparity Masks Open
Vehicle re-identification (vehicle ReID) is designed to recognize all instances of a specific vehicle across various camera viewpoints, facing significant challenges such as high similarity among different vehicles from the same viewpoint …
View article: Dynamic Downsampling Algorithm for 3D Point Cloud Map Based on Voxel Filtering
Dynamic Downsampling Algorithm for 3D Point Cloud Map Based on Voxel Filtering Open
In response to the challenge of handling large-scale 3D point cloud data, downsampling is a common approach, yet it often leads to the problem of feature loss. We present a dynamic downsampling algorithm for 3D point cloud maps based on an…
View article: Transformer-Based Semantic Entity Recognition for Chinese Medical Inspection Reports
Transformer-Based Semantic Entity Recognition for Chinese Medical Inspection Reports Open
With the rapid development of the Internet, the application of artificial intelligence in the medical field has become a focus of attention. At present, understanding the Chinses medical inspection report (CMIR) is of great significance fo…
View article: Camouflaged Object Detection Based on Deep Learning with Attention-Guided Edge Detection and Multi-Scale Context Fusion
Camouflaged Object Detection Based on Deep Learning with Attention-Guided Edge Detection and Multi-Scale Context Fusion Open
In nature, objects that use camouflage have features like colors and textures that closely resemble their background. This creates visual illusions that help them hide and protect themselves from predators. This similarity also makes the t…
View article: MSIE-Net: Associative Entity-Based Multi-Stage Network for Structured Information Extraction from Reports
MSIE-Net: Associative Entity-Based Multi-Stage Network for Structured Information Extraction from Reports Open
Efficient document recognition and sharing remain challenges in the healthcare, insurance, and finance sectors. One solution to this problem has been the use of deep learning techniques to automatically extract structured information from …
View article: A survey for light field super-resolution
A survey for light field super-resolution Open
Compared to 2D imaging data, the 4D light field (LF) data retains richer scene’s structure information, which can significantly improve the computer’s perception capability, including depth estimation, semantic segmentation, and LF renderi…
View article: Light field depth estimation: A comprehensive survey from principles to future
Light field depth estimation: A comprehensive survey from principles to future Open
Light Field (LF) depth estimation is an important research direction in the area of computer vision and computational photography, which aims to infer the depth information of different objects in three-dimensional scenes by capturing LF d…
View article: Parallel Dense Video Caption Generation with Multi-Modal Features
Parallel Dense Video Caption Generation with Multi-Modal Features Open
The task of dense video captioning is to generate detailed natural-language descriptions for an original video, which requires deep analysis and mining of semantic captions to identify events in the video. Existing methods typically follow…
View article: Light field super-resolution using complementary-view feature attention
Light field super-resolution using complementary-view feature attention Open
Light field (LF) cameras record multiple perspectives by a sparse sampling of real scenes, and these perspectives provide complementary information. This information is beneficial to LF super-resolution (LFSR). Compared with traditional si…
View article: Take Your Model Further: A General Post-refinement Network for Light Field Disparity Estimation via BadPix Correction
Take Your Model Further: A General Post-refinement Network for Light Field Disparity Estimation via BadPix Correction Open
Most existing light field (LF) disparity estimation algorithms focus on handling occlusion, texture-less or other areas that harm LF structure to improve accuracy, while ignoring other potential modeling ideas. In this paper, we propose a …
View article: CoGAN: Cooperatively trained conditional and unconditional GAN for person image generation
CoGAN: Cooperatively trained conditional and unconditional GAN for person image generation Open
Person image generation aims to synthesize realistic person images that follow the same distribution as the given dataset. Previous attempts can be generally categorized into two classes: conditional GAN and unconditional GAN. The former u…
View article: Fusion of Multi-Modal Features to Enhance Dense Video Caption
Fusion of Multi-Modal Features to Enhance Dense Video Caption Open
Dense video caption is a task that aims to help computers analyze the content of a video by generating abstract captions for a sequence of video frames. However, most of the existing methods only use visual features in the video and ignore…
View article: Distributed Truss Computation in Dynamic Graphs
Distributed Truss Computation in Dynamic Graphs Open
Large-scale graphs usually exhibit global sparsity with local cohesiveness, and mining the representative cohesive subgraphs is a fundamental problem in graph analysis. The $k$ -truss is one of the most commonly studied cohesive subgraphs,…
View article: Editorial: Digital resource vitalization and urban intelligent computing
Editorial: Digital resource vitalization and urban intelligent computing Open
EDITORIAL article Front. Environ. Sci., 15 March 2023Sec. Environmental Informatics and Remote Sensing Volume 11 - 2023 | https://doi.org/10.3389/fenvs.2023.1180305
View article: Fusion and Allocation Network for Light Field Image Super-Resolution
Fusion and Allocation Network for Light Field Image Super-Resolution Open
Light field (LF) images taken by plenoptic cameras can record spatial and angular information from real-world scenes, and it is beneficial to fully integrate these two pieces of information to improve image super-resolution (SR). However, …
View article: MPEFT: A novel task scheduling method for workflows
MPEFT: A novel task scheduling method for workflows Open
Optimizing the scheduling algorithm is a key problem to improving the service efficiency of urban heterogeneous computing platforms. In this paper, we propose a novel list-based scheduling algorithm called Modified Predict Earliest Finish …
View article: Detecting Neighborhood Gentrification at Scale via Street-level Visual Data
Detecting Neighborhood Gentrification at Scale via Street-level Visual Data Open
Neighborhood gentrification plays a significant role in shaping the social and economic well-being of both individuals and communities at large. While some efforts have been made to detect gentrification in cities, existing approaches rely…
View article: Learning for Data Synthesis: Joint Local Salient Projection and Adversarial Network Optimization for Vehicle Re-Identification
Learning for Data Synthesis: Joint Local Salient Projection and Adversarial Network Optimization for Vehicle Re-Identification Open
The problem of vehicle re-identification in surveillance scenarios has grown in popularity as a research topic. Deep learning has been successfully applied in re-identification tasks in the last few years due to its superior performance. H…
View article: 3D zebrafish tracking with topology association
3D zebrafish tracking with topology association Open
Recently, zebrafish has received more and more attention due to its wide range of applications such as regeneration promoting therapeutics and drug discovery. Therefore, vision‐based trackers are utilized to record the swimming trajectory …