Lan Xu
Sci-Tech finance, digital economy and high-quality development of regional economy: empirical evidence from 273 cities in China
This study uses panel data from 273 prefecture-level cities in China between 2011 and 2022 to demonstrate that science and technology (Sci-Tech) finance significantly contributes to high-quality development in regional economies. This conc…
Facial Appearance Capture at Home with Patch-Level Reflectance Prior
Existing facial appearance capture methods can reconstruct plausible facial reflectance from smartphone-recorded videos. However, the reconstruction quality still falls far behind that of studio-based recordings. This paper fills the gap …
BANG: Dividing 3D Assets via Generative Exploded Dynamics
3D creation has always been a unique human strength, driven by our ability to deconstruct and reassemble objects with our eyes, mind, and hands. However, current 3D design tools struggle to replicate this natural process, requiring consider…
CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image
Recovering high-quality 3D scenes from a single RGB image is a challenging task in computer graphics. Current methods often struggle with domain-specific limitations or low-quality object generation. To address these, we propose CAST (Comp…
Improving Open-Set Semantic Segmentation in 3D Point Clouds by Conditional Channel Capacity Maximization: Preliminary Results
Point-cloud semantic segmentation underpins a wide range of critical applications. Although recent deep architectures and large-scale datasets have driven impressive closed-set performance, these models struggle to recognize or properly se…
Capturing the Unseen: Vision-Free Facial Motion Capture Using Inertial Measurement Units
We present Capturing the Unseen (CAPUS), a novel facial motion capture (MoCap) technique that operates without visual signals. CAPUS leverages miniaturized Inertial Measurement Units (IMUs) as a new sensing modality for facial motion captu…
EEG-based real-time BCI system using drones for attention visualization
Attention management is crucial for cognitive development, especially in children. This study presents a novel brain-computer interface (BCI) system that uses EEG signals to classify attention states. It analyzes these signals using a wave…
BEAM: Bridging Physically-based Rendering and Gaussian Modeling for Relightable Volumetric Video
Volumetric video enables immersive experiences by capturing dynamic 3D scenes, supporting diverse applications in virtual reality, education, and telepresence. However, traditional methods struggle with fixed lighting conditions, while neur…
TANGLED: Generating 3D Hair Strands from Images with Arbitrary Styles and Viewpoints
Hairstyles are intricate and culturally significant, with varied geometries, textures, and structures. Existing text- or image-guided generation methods fail to handle the richness and complexity of diverse styles. We present TANGLED, a nov…
CADSpotting: Robust Panoptic Symbol Spotting on Large-Scale CAD Drawings
We introduce CADSpotting, an effective method for panoptic symbol spotting in large-scale architectural CAD drawings. Existing approaches often struggle with symbol diversity, scale variations, and overlapping elements in CAD designs, and …
Robust Dual Gaussian Splatting for Immersive Human-centric Volumetric Videos
Volumetric video represents a transformative advancement in visual media, enabling users to freely navigate immersive virtual experiences and narrowing the gap between digital and real worlds. However, the need for extensive manual interve…
LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives
Large garages are ubiquitous yet intricate scenes that present unique challenges due to their monotonous colors, repetitive patterns, reflective surfaces, and transparent vehicle glass. Conventional Structure from Motion (SfM) methods for …
V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians
Experiencing high-fidelity volumetric video as seamlessly as 2D videos is a long-held dream. However, current dynamic 3DGS methods, despite their high rendering quality, face challenges in streaming on mobile devices due to computational a…
HiSC4D: Human-Centered Interaction and 4D Scene Capture in Large-Scale Space Using Wearable IMUs and LiDAR
We introduce HiSC4D, a novel Human-centered interaction and 4D Scene Capture method, aimed at accurately and efficiently creating a dynamic digital world, containing large-scale indoor-outdoor scenes, diverse human motions, rich human-huma…
DressCode: Autoregressively Sewing and Generating Garments from Text Guidance
Apparel plays a significant role in human appearance, underscoring the importance of garment digitalization for digital human creation, and recent advances in 3D content creation are pivotal to this effort. Nonetheless, garment generation …
Implicit Swept Volume SDF: Enabling Continuous Collision-Free Trajectory Generation for Arbitrary Shapes
In the field of trajectory generation for objects, ensuring continuous collision-free motion remains a major challenge, especially for non-convex geometries and complex environments. Previous methods either oversimplify object shapes, which…
Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance
The synthesis of 3D facial animations from speech has garnered considerable attention. Due to the scarcity of high-quality 4D facial data and of abundant, well-annotated multi-modality labels, previous methods often suffer from limited realism…
A Unified Diffusion Framework for Scene-aware Human Motion Estimation from Sparse Signals
Estimating full-body human motion via sparse tracking signals from head-mounted displays and hand controllers in 3D scenes is crucial to applications in AR/VR. One of the biggest challenges to this task is the one-to-many mapping from spar…
RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method
Comprehensively capturing human motion requires both accurate capture of complex poses and precise localization of the human within scenes. Most HPE datasets and methods rely primarily on RGB, LiDAR, or IMU data. However, solely …
HybridGait: A Benchmark for Spatial-Temporal Cloth-Changing Gait Recognition with Hybrid Explorations
Existing gait recognition benchmarks mostly include minor clothing variations in laboratory environments but lack persistent changes in appearance over time and space. In this paper, we propose the first in-the-wild benchmark CCGait f…
Gaze-guided Hand-Object Interaction Synthesis: Dataset and Method
Gaze plays a crucial role in revealing human attention and intention, particularly in hand-object interaction scenarios, where it guides and synchronizes complex tasks that require precise coordination between the brain, hand, and object. …
LiveHPS: LiDAR-based Scene-level Human Pose and Shape Estimation in Free Environment
For human-centric large-scale scenes, fine-grained modeling of 3D human global pose and shape is significant for scene understanding and can benefit many real-world applications. In this paper, we present LiveHPS, a novel single-LiDAR-bas…
OMG: Towards Open-vocabulary Motion Generation via Mixture of Controllers
We have recently seen tremendous progress in realistic text-to-motion generation. Yet, existing methods often fail or produce implausible motions for unseen text inputs, which limits their applications. In this paper, we present OMG, a …
BOTH2Hands: Inferring 3D Hands from Both Text Prompts and Body Dynamics
The recently emerging text-to-motion advances have spurred numerous attempts at convenient and interactive human motion generation. Yet, existing methods are largely limited to generating body motions only, without considering the rich two-…