Explanipedia

Single-Image 3D Human Reconstruction with 3D-Aware Diffusion Priors and Facial Enhancement Open

Jie Yang, B. Zhang, Hongbo Fu, Yu‐Kun Lai, Lin Gao · 2025

DDBot: Differentiable Physics-based Digging Robot for Unknown Granular Materials Open

Xintong Yang, Max Wei, Yu‐Kun Lai, Ze Ji · 2025

Automating the manipulation of granular materials poses significant challenges due to complex contact dynamics, unpredictable material properties, and intricate system states. Existing approaches often fail to achieve efficiency and accura…

Differentiable Skill Optimisation for Powder Manipulation in Laboratory Automation Open

Max Wei, Xintong Yang, Yu‐Kun Lai, Ze Ji · 2025

Robotic automation is accelerating scientific discovery by reducing manual effort in laboratory workflows. However, precise manipulation of powders remains challenging, particularly in tasks such as transport that demand accuracy and stabi…

MirrorSAM2: Segment Mirror in Videos with Depth Perception Open

Mingchen Xu, Yu‐Kun Lai, Ze Ji, Jing Wu · 2025

This paper presents MirrorSAM2, the first framework that adapts Segment Anything Model 2 (SAM2) to the task of RGB-D video mirror segmentation. MirrorSAM2 addresses key challenges in mirror detection, such as reflection ambiguity and textu…

DyCrowd: Towards Dynamic Crowd Reconstruction from a Large-scene Video Open

Hao Wen, Hongbo Kang, Jian Ma, Jing Huang, Yuanwang Yang, et al. · 2025

3D reconstruction of dynamic crowds in large scenes has become increasingly important for applications such as city surveillance and crowd analysis. However, current works attempt to reconstruct 3D crowds from a static image, causing a lac…

FRNeRF: Fusion and Regularization Fields for Dynamic View Synthesis Open

Xinyi Jing, Tao Yu, Ran He, Yu‐Kun Lai, Ke Li · 2025

Novel space-time view synthesis for monocular video is a highly challenging task: both static and dynamic objects usually appear in the video, but only a single view of the current scene is available, resulting in inaccurate synthesis resu…

Navigating the tightrope: The art of balancing for better performance Open

Sihua Chen, Yu‐Kun Lai, Xiang Wen, Wei He, Jian Mou · 2025

Effective supervisory mechanisms enable both monitoring and regulation of employee behavior, ensuring behavioral compliance and performance stability. However, continuous supervision can increase employees' psychological arousal, especiall…

Differentiable physics-based system identification for robotic manipulation of elastoplastic materials Open

Xintong Yang, Ze Ji, Yu‐Kun Lai · 2025

Robotic manipulation of volumetric elastoplastic deformable materials, from foods such as dough to construction materials like clay, is in its infancy, largely due to the difficulty of modelling and perception in a high-dimensional space. …

NeRFFaceShop: Learning a Photo-Realistic 3D-Aware Generative Model of Animatable and Relightable Heads From Large-Scale in-the-Wild Videos Open

Kaiwen Jiang, Fenglin Liu, Shuyu Chen, Pengfei Wan, Yuan Zhang, et al. · 2025

Animatable and relightable 3D facial generation has fundamental applications in computer vision and graphics. Although animation and relighting are highly correlated, previous methods usually address them separately. Effectively combining …

Skeletonization Quality Evaluation: Geometric Metrics for Point Cloud Analysis in Robotics Open

Qingmeng Wen, Yu‐Kun Lai, Ze Ji, Seyed Amir Tafrishi · 2025

Skeletonization is a powerful tool for shape analysis, rooted in the inherent instinct to understand an object's morphology. It has found applications across various domains, including robotics. Although skeletonization algorithms have bee…

The Double-Edged Effects of Work Task Stress on Safety Performance: A Cognitive Appraisal Perspective Open

Yu‐Kun Lai, Sihua Chen, Guoxiang Li, Zhiqiang Wang, Bolin Wang · 2025

Temporal Inconsistency Guidance for Super-resolution Video Quality Assessment Open

Yixiao Li, Xiaoyuan Yang, Weide Liu, Xin Jin, Xu Jia, et al. · 2024

As super-resolution (SR) techniques introduce unique distortions that fundamentally differ from those caused by traditional degradation processes (e.g., compression), there is an increasing demand for specialized video quality assessment (…

NeRF-Texture: Synthesizing Neural Radiance Field Textures Open

Yi-Hua Huang, Yan-Pei Cao, Yu‐Kun Lai, Ying Shan, Lin Gao · 2024

Texture synthesis is a fundamental problem in computer graphics that would benefit various applications. Existing methods are effective in handling 2D image textures. In contrast, many real-world textures contain meso-structure in the 3…

DualAvatar: Robust Gaussian Splatting Avatar with Dual Representation Open

Jinsong Zhang, I‐Chao Shen, Jotaro Sakamiya, Yu‐Kun Lai, Takeo Igarashi, et al. · 2024

Real-time Large-scale Deformation of Gaussian Splatting Open

Lin Gao, Jie Yang, B. Zhang, Jia-Mu Sun, Yu-Jie Yuan, et al. · 2024

Neural implicit representations, including Neural Distance Fields and Neural Radiance Fields, have demonstrated significant capabilities for reconstructing surfaces with complicated geometry and topology, and generating novel views of a sc…

Real-time 3D Human Reconstruction and Rendering System from a Single RGB Camera Open

Yuanwang Yang, Qiao Feng, Yu‐Kun Lai, Kun Li · 2024

Transforming 2D human images into 3D appearance is essential for immersive communication. In this paper, we introduce a low-cost real-time 3D human reconstruction and rendering system with a single RGB camera at 28+ FPS, which guarantees b…

Differentiable Physics-based System Identification for Robotic Manipulation of Elastoplastic Materials Open

Xintong Yang, Ze Ji, Yu‐Kun Lai · 2024

Robotic manipulation of volumetric elastoplastic deformable materials, from foods such as dough to construction materials like clay, is in its infancy, largely due to the difficulty of modelling and perception in a high-dimensional space. …

SceneExpander: Real-Time Scene Synthesis for Interactive Floor Plan Editing Open

Shao-Kui Zhang, Junkai Huang, Yue Liang, J. W. Zhang, Jiahong Liu, et al. · 2024

Scene synthesis has gained significant attention recently, and interactive scene synthesis focuses on yielding scenes according to user preferences. Existing literature either generates floor plans or scenes according to the floor plans. T…

Real-time 3D-aware Portrait Video Relighting Open

Ziqi Cai, Kaiwen Jiang, Shuyu Chen, Yu‐Kun Lai, Hongbo Fu, et al. · 2024

Synthesizing realistic videos of talking faces under custom lighting conditions and viewing angles benefits various downstream applications like video conferencing. However, most existing relighting methods are either time-consuming or una…

AttentionPainter: An Efficient and Adaptive Stroke Predictor for Scene Painting Open

Yizhe Tang, Yue Wang, Teng Hu, Ran Yi, Xin Tan, et al. · 2024

Stroke-based Rendering (SBR) aims to decompose an input image into a sequence of parameterized strokes, which can be rendered into a painting that resembles the input image. Recently, Neural Painting methods that utilize deep learning and …

FilterGNN: Image feature matching with cascaded outlier filters and linear attention Open

Junxiong Cai, Tai‐Jiang Mu, Yu‐Kun Lai · 2024

The cross-view matching of local image features is a fundamental task in visual localization and 3D reconstruction. This study proposes FilterGNN, a transformer-based graph neural network (GNN), aiming to improve the matching efficiency an…

HumanCoser: Layered 3D Human Generation via Semantic-Aware Diffusion Model Open

Yi Wang, Jian Ma, Ruizhi Shao, Feng Qiao, Yu‐Kun Lai, et al. · 2024

This paper aims to generate physically-layered 3D humans from text prompts. Existing methods either generate 3D clothed humans as a whole or support only tight and simple clothing generation, which limits their applications to virtual try-…

Two-stage deep neural network for diagnosing fungal keratitis via in vivo confocal microscopy images Open

Chunpeng Li, Weiwei Dai, Yunpeng Xiao, Mengying Qi, Lingxiao Zhang, et al. · 2024

Timely and effective diagnosis of fungal keratitis (FK) is necessary for suitable treatment and avoiding irreversible vision loss for patients. In vivo confocal microscopy (IVCM) has been widely adopted to guide the FK diagnosis. We presen…

Generating animatable 3D cartoon faces from single portraits Open

Chuanyu Pan, Guowei Yang, Tai‐Jiang Mu, Yu‐Kun Lai · 2024

RecStitchNet: Learning to stitch images with rectangular boundaries Open

Yun Zhang, Yu‐Kun Lai, Lang Nie, Fang‐Lue Zhang, Lin Xu · 2024

Irregular boundaries in image stitching naturally occur due to freely moving cameras. To deal with this problem, existing methods focus on optimizing mesh warping to make boundaries regular using the traditional explicit solution. However,…

SketchDream: Sketch-based Text-To-3D Generation and Editing Open

Fenglin Liu, Hongbo Fu, Yu‐Kun Lai, Lin Gao · 2024

Existing text-based 3D generation methods generate attractive results but lack detailed geometry control. Sketches, known for their conciseness and expressiveness, have contributed to intuitive 3D modeling but are confined to producing tex…

4Dynamic: Text-to-4D Generation with Hybrid Priors Open

Yu-Jie Yuan, Leif Kobbelt, Jiwen Liu, Yuan Zhang, Pengfei Wan, et al. · 2024

Due to the fascinating generative performance of text-to-image diffusion models, growing text-to-3D generation works explore distilling the 2D generative priors into 3D, using the score distillation sampling (SDS) loss, to bypass the data …

VRMM: A Volumetric Relightable Morphable Head Model Open

Haotian Yang, Mingwu Zheng, Chongyang Ma, Yu‐Kun Lai, Pengfei Wan, et al. · 2024

In this paper, we introduce the Volumetric Relightable Morphable Model (VRMM), a novel volumetric and parametric facial prior for 3D face modeling. While recent volumetric prior models offer improvements over traditional methods like 3D Mo…

Fusion of Short-term and Long-term Attention for Video Mirror Detection Open

Mingchen Xu, Jing Wu, Yu‐Kun Lai, Ze Ji · 2024

Techniques for detecting mirrors from static images have witnessed rapid growth in recent years. However, these methods detect mirrors from single input images. Detecting mirrors from video requires further consideration of temporal consis…

SuperSVG: Superpixel-based Scalable Vector Graphics Synthesis Open

Teng Hu, Ran Yi, Baihong Qian, Jiangning Zhang, Paul L. Rosin, et al. · 2024

SVG (Scalable Vector Graphics) is a widely used graphics format that possesses excellent scalability and editability. Image vectorization, which aims to convert raster images to SVGs, is an important yet challenging problem in computer vis…

Yu‐Kun Lai