Explanipedia

Hand-Shadow Poser Open

Hao Xu, Yinqiao Wang, Niloy J. Mitra, Pheng‐Ann Heng, Chi‐Wing Fu · 2025

Computer science

Hand shadow art is a captivating art form, creatively using hand shadows to reproduce expressive shapes on the wall. In this work, we study an inverse problem: given a target shape, find the poses of left and right hands that together best…

Visualization-Driven Illumination for Density Plots Open

Xin Chen, Yunhai Wang, Han Bao, Kecheng Lu, Jaemin Jo , et al. · 2025

We present a novel visualization-driven illumination model for density plots, a new technique to enhance density plots by effectively revealing the detailed structures in high- and medium-density regions and outliers in low-density regions…

Incentivizing Multimodal Reasoning in Large Models for Direct Robot Manipulation Open

Weiliang Tang, Dong Jing, Jiahui Pan, Zhiwu Lu, Yun‐Hui Liu , et al. · 2025

Recent Large Multimodal Models have demonstrated remarkable reasoning capabilities, especially in solving complex mathematical problems and realizing accurate spatial perception. Our key insight is that these emerging abilities can natural…

EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning Open

Zhenghao Xing, Xiaowei Hu, Chi‐Wing Fu, Wenhai Wang, Jifeng Dai , et al. · 2025

Multimodal large language models (MLLMs) have advanced perception across text, vision, and audio, yet they often struggle with structured cross-modal reasoning, particularly when integrating audio and visual signals. We introduce EchoInk-R…

ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation Open

Mengyang Wu, Yuzhi Zhao, Jialun Cao, Mingjie Xu, Zhongming Jiang , et al. · 2025

Computer science Mathematics Philosophy

Controversial contents largely inundate the Internet, infringing various cultural norms and child protection standards. Traditional Image Content Moderation (ICM) models fall short in producing precise moderation decisions for diverse stan…

HybridReg: Robust 3D Point Cloud Registration with Hybrid Motions Open

Kui Du, Hao Xu, Haipeng Li, Hong Qu, Chi‐Wing Fu , et al. · 2025

Computer science Mathematics

Scene-level point cloud registration is very challenging when considering dynamic foregrounds. Existing indoor datasets mostly assume rigid motions, so the trained models cannot robustly handle scenes with non-rigid motions. On the other h…

Not-So-Optimal Transport Flows for 3D Point Cloud Generation Open

Ka-Hei Hui, Chao Liu, Xiaohui Zeng, Chi‐Wing Fu, Arash Vahdat · 2025

Computer science Geology Mathematics

Learning generative models of 3D point clouds is one of the fundamental problems in 3D generative learning. One of the key properties of point clouds is their permutation invariance, i.e., changing the order of points in a point cloud does…

MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation Open

Jinbo Xing, Luo Mai, Cusuh Ham, Jiahui Huang, Aniruddha Mahapatra , et al. · 2025

Computer science Materials science

This paper presents a method that allows users to design cinematic video shots in the context of image-to-video generation. Shot design, a critical aspect of filmmaking, involves meticulously planning both camera movements and object motio…

Overcoming Support Dilution for Robust Few-shot Semantic Segmentation Open

Weiliang Tang, Biqi Yang, Pheng‐Ann Heng, Yun-Hui Liu, Chi‐Wing Fu · 2025

Computer science Materials science Physics

Few-shot Semantic Segmentation (FSS) is a challenging task that utilizes limited support images to segment associated unseen objects in query images. However, recent FSS methods are observed to perform worse, when enlarging the number of s…

GeoManip: Geometric Constraints as General Interfaces for Robot Manipulation Open

Weiliang Tang, Jiahui Pan, Yunhui Liu, Masayoshi Tomizuka, Li Erran Li , et al. · 2025

Computer science

We present GeoManip, a framework to enable generalist robots to leverage essential conditions derived from object and part relationships, as geometric constraints, for robot manipulation. For example, cutting the carrot requires adhering t…

ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation Open

Mengyang Wu, Yuzhi Zhao, Jialun Cao, Mingjie Xu, Zhongming Jiang , et al. · 2024

Computer science Psychology Mathematics

Controversial contents largely inundate the Internet, infringing various cultural norms and child protection standards. Traditional Image Content Moderation (ICM) models fall short in producing precise moderation decisions for diverse stan…

MetaShadow: Object-Centered Shadow Detection, Removal, and Synthesis Open

Tianyu Wang, Jianming Zhang, Haitian Zheng, Zhihong Ding, Scott Cohen , et al. · 2024

Computer science Psychology

Shadows are often under-considered or even ignored in image editing applications, limiting the realism of the edited results. In this paper, we introduce MetaShadow, a three-in-one versatile framework that enables detection, removal, and c…

CRAYM: Neural Field Optimization via Camera RAY Matching Open

Liqiang Lin, Wenpeng Wu, Chi‐Wing Fu, Hao Zhang, Hui Huang · 2024

Computer science Mathematics

We introduce camera ray matching (CRAYM) into the joint optimization of camera poses and neural fields from multi-view images. The optimized field, referred to as a feature volume, can be "probed" by the camera rays for novel view synthesi…

Learn to Create Simple LEGO Micro Buildings Open

Jiahao Ge, Mingjun Zhou, Chi‐Wing Fu · 2024

Computer science Philosophy

This paper presents the first learning-based generative pipeline for effectively creating 3D LEGO® 1 models. This task is very challenging due to the lack of dedicated representations and datasets for learning coherently-connected bricks a…

Visualization-Driven Illumination for Density Plots Open

Xin Chen, Yunhai Wang, Han Bao, Kecheng Lu, Jaemin Jo , et al. · 2024

Computer science

We present a novel visualization-driven illumination model for density plots, a new technique to enhance density plots by effectively revealing the detailed structures in high- and medium-density regions and outliers in low-density regions…

PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion Open

Runsong Zhu, Shi Qiu, Qianyi Wu, Ka-Hei Hui, Pheng‐Ann Heng , et al. · 2024

Computer science Sociology Philosophy

Panoptic lifting is an effective technique to address the 3D panoptic segmentation task by unprojecting 2D panoptic segmentations from multi-views to 3D scene. However, the quality of its results largely depends on the 2D segmentations, wh…

Embodiment-Agnostic Action Planning via Object-Part Scene Flow Open

Weiliang Tang, Jiahui Pan, Wei Zhan, Jianshu Zhou, Huaxiu Yao , et al. · 2024

Computer science Mathematics Physics

Observing that the key for robotic action planning is to understand the target-object motion when its associated part is manipulated by the end effector, we propose to generate the 3D object-part scene flow and extract its transformations …

Unveiling Deep Shadows: A Survey and Benchmark on Image and Video Shadow Detection, Removal, and Generation in the Deep Learning Era Open

Xiaowei Hu, Zhenghao Xing, Tianyu Wang, Chi‐Wing Fu, Pheng‐Ann Heng · 2024

Computer science Psychology

Shadows are created when light encounters obstacles, resulting in regions of reduced illumination. In computer vision, detecting, removing, and generating shadows are critical tasks for improving scene understanding, enhancing image qualit…

Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models Open

Jiaqi Xu, Mengyang Wu, Xiaowei Hu, Chi‐Wing Fu, Qi Dou , et al. · 2024

Computer science Geography

This paper addresses the limitations of adverse weather image restoration approaches trained on synthetic data when applied to real-world scenarios. We formulate a semi-supervised learning framework employing vision-language models to enha…

PointRegGPT: Boosting 3D Point Cloud Registration using Generative Point-Cloud Pairs for Training Open

Suyi Chen, Hao Xu, Haipeng Li, Kunming Luo, Guanghui Liu , et al. · 2024

Computer science Geography Mathematics

Data plays a crucial role in training learning-based methods for 3D point cloud registration. However, the real-world dataset is expensive to build, while rendering-based synthetic data suffers from domain gaps. In this work, we present Po…

CNS-Edit: 3D Shape Editing via Coupled Neural Shape Optimization Open

Jingyu Hu, Ka-Hei Hui, Zhengzhe Liu, Hao Zhang, Chi‐Wing Fu · 2024

Computer science Physics

This paper introduces a new approach based on a coupled representation and a neural volume optimization to implicitly perform 3D shape editing in latent space. This work has three innovations. First, we design the coupled neural shape (CNS…

Object-level Scene Deocclusion Open

Zhengzhe Liu, Q. Liu, Chirui Chang, Jianming Zhang, Daniil Pakhomov , et al. · 2024

Computer science

Deoccluding the hidden portions of objects in a scene is a formidable task, particularly when addressing real-world scenes. In this paper, we present a new self-supervised PArallel visible-to-COmplete diffusion framework, named PACO, a fou…

HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions Open

Hao Xu, Haipeng Li, Yinqiao Wang, Shuaicheng Liu, Chi‐Wing Fu · 2024

Computer science

Reconstructing 3D hand mesh robustly from a single image is very challenging, due to the lack of diversity in existing real-world datasets. While data synthesis helps relieve the issue, the syn-to-real gap still hinders its usage. In this …

SiMA-Hand: Boosting 3D Hand-Mesh Reconstruction by Single-to-Multi-View Adaptation Open

Yinqiao Wang, Hao Xu, Pheng‐Ann Heng, Chi‐Wing Fu · 2024

Computer science Biology

Estimating 3D hand mesh from RGB images is a longstanding track, in which occlusion is one of the most challenging problems. Existing attempts towards this task often fail when the occlusion dominates the image space. In this paper, we pro…

The Test of Time (ToT) Awards Open

Chi‐Wing Fu · 2024

Computer science Biology

Computer Graphics and Applications (IEEE CG&A) Test of Time (ToT) Award was introduced in 2021, aiming to recognize regular or special issue articles published by the magazine that have made profound and

CNS-Edit: 3D Shape Editing via Coupled Neural Shape Optimization Open

Jingyu Hu, Ka-Hei Hui, Zhengzhe Liu, Hao Zhang, Chi‐Wing Fu · 2024

Computer science Engineering

This paper introduces a new approach based on a coupled representation and a neural volume optimization to implicitly perform 3D shape editing in latent space. This work has three innovations. First, we design the coupled neural shape (CNS…

SiMA-Hand: Boosting 3D Hand-Mesh Reconstruction by Single-to-Multi-View Adaptation Open

Yinqiao Wang, Hao Xu, Pheng‐Ann Heng, Chi‐Wing Fu · 2024

Computer science Psychology Geology

Estimating 3D hand mesh from RGB images is a longstanding track, in which occlusion is one of the most challenging problems. Existing attempts towards this task often fail when the occlusion dominates the image space. In this paper, we pro…

Make-A-Shape: a Ten-Million-scale 3D Shape Model Open

Ka-Hei Hui, Aditya Sanghi, Arianna Rampini, Kamal Rahimi Malekshan, Zhengzhe Liu , et al. · 2024

Computer science Mathematics Political science

Significant progress has been made in training large generative models for natural language and images. Yet, the advancement of 3D generative models is hindered by their substantial resource demands for training, along with inefficient, no…

Chi‐Wing Fu YOU? Author Swipe