Explanipedia

Expanding the phenotypic spectrum of Rauch-Steindl syndrome: A novel NSD2 variant with atrial septal defect in a Chinese patient Open

Hui Zhu, Min Du, Lan Zeng, Jinglin Liu, Jin Wang , et al. · 2025

Background Rauch-Steindl syndrome (RSS) is a very rare autosomal dominant disorder caused by pathogenic variants in the NSD2 gene, characterized by dysmorphic facial features, prenatal and postnatal growth retardation, and variable develop…

Magnetic Circuit Analysis and Design Optimized for Cost-Effectiveness of Surface-Inserted Rare Earth Consequent-Pole Permanent Magnet Machines Open

Li Wang, Mohamed Saeed, Zhaoyang Fu, Jinglin Liu, Xinzhen Wu , et al. · 2025

In consequent-pole permanent magnet (CPPM) machines, the configuration where PM poles and iron poles are alternately arranged causes distortion in the air-gap magnetic field. This results in significant differences in magnetic circuit char…

Analytical Modeling and Analysis of Halbach Array Permanent Magnet Synchronous Motor Open

Jinglin Liu, Maixia Shang, Chao Gong · 2025

The Halbach array permanent magnet can improve the power density of motors. This paper uses analytical modeling to analyze and optimize the Halbach array permanent magnet synchronous motor (PMSM). Firstly, a general motor model is establis…

DualHet-YOLO: A Dual-Backbone Heterogeneous YOLO Network for Inspection Robots to Recognize Yellow-Feathered Chicken Behavior in Floor-Raised House Open

Y. S. ZHANG, Linwei Chen, Hongfei Chen, Tao Liu, Jinglin Liu , et al. · 2025

The behavior of floor-raised chickens is closely linked to their health status and environmental comfort. As a type of broiler chicken with special behaviors, understanding the daily actions of yellow-feathered chickens is crucial for accu…

Deep Learning-Based Detection and Digital Twin Implementation of Beak Deformities in Caged Layer Chickens Open

Hengtai Li, Hongfei Chen, Jinglin Liu, Qiuhong Zhang, Tao Liu , et al. · 2025

With the increasing urgency for digital transformation in large-scale caged layer farms, traditional methods for monitoring the environment and chicken health, which often rely on human experience, face challenges related to low efficiency…

Reference Prototype of High Lift Motor for Distributed Electric Propulsion All‐Electric Aircraft Open

Maixia Shang, Jinglin Liu, Chao Gong · 2025

The all‐electric aircraft with a distributed electric propulsion system being studied in this article uses 11 motors, of which 10 are specifically used for lift enhancement during takeoff and landing. This paper describes the iterative des…

MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes Open

Zhenhui Ye, Tianyun Zhong, Yi Ren, Ziyue Karen Jiang, Jiawei Huang , et al. · 2024

Talking face generation (TFG) aims to animate a target identity's face to create realistic talking videos. Personalized TFG is a variant that emphasizes the perceptual identity similarity of the synthesized result (from the perspective of …

MulliVC: Multi-lingual Voice Conversion With Cycle Consistency Open

Jiawei Huang, Chen Zhang, Yi Ren, Ziyue Karen Jiang, Zhenhui Ye , et al. · 2024

Voice conversion aims to modify the source speaker's voice to resemble the target speaker while preserving the original speech content. Despite notable advancements in voice conversion these days, multi-lingual voice conversion (including …

A compensation method for PMSM sensorless control with parameter identification considering SMO observation error Open

Ruizhi Guan, Jinglin Liu, Mengqi Li, Minglang Xiao, Xinran Shi · 2024

Sensorless control of permanent magnet synchronous motor (PMSM) can increase the reliability of electric actuators of more electrical aircraft. Numerical online parameter estimation method will enhance the performance for sensorless contro…

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head Open

Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang , et al. · 2024

Large language models (LLMs) have exhibited remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. Despite the recent success, current LLMs are not capable of processing comp…

Layered co-continuous structure in bone scaffold fabricated by laser additive manufacturing for enhancing electro-responsive shape memory properties Open

Cijun Shuai, Wentao Xu, Haofan He, Feng Yang, Jinglin Liu , et al. · 2024

Porous scaffold based on electro-responsive shape memory polymers (ESMPs) possesses great potential applications in minimally invasive surgery for bone defect repair because it provides the ability for remote control and internal heating. …

Genotype characterization of tetrahydrobiopterin deficiency in two Tibetan children Open

Shuyao Zhu, Qi Hu, Yunxia Yang, Hui Zhu, Jin Wang , et al. · 2024

We identified and treated two cases of BH4D in Tibetan populations in China, marking the first confirmed instances. Our report emphasizes the significance of conducting differential diagnosis tests for BH4D.

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis Open

Zhenhui Ye, Tianyun Zhong, Yi Ren, Jiaqi Yang, Weichuang Li , et al. · 2024

One-shot 3D talking portrait generation aims to reconstruct a 3D avatar from an unseen image, and then animate it with a reference video or audio to generate a talking portrait video. The existing methods fail to simultaneously achieve the…

C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model Open

Longbin Ji, Pengfei Wei, Yi Ren, Jinglin Liu, Chen Zhang , et al. · 2023

Co-speech gesture generation is crucial for automatic digital avatar animation. However, existing methods suffer from issues such as unstable training and temporal inconsistency, particularly in generating high-fidelity and comprehensive g…

Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis Open

Ziyue Karen Jiang, Jinglin Liu, Yi Ren, Jinzheng He, Chen Zhang , et al. · 2023

Zero-shot text-to-speech (TTS) aims to synthesize voices with unseen speech prompts, which significantly reduces the data and computation requirements for voice cloning by skipping the fine-tuning process. However, the prompting mechanisms…

Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis Open

Zhenhui Ye, Ziyue Karen Jiang, Yi Ren, Jinglin Liu, Chen Zhang , et al. · 2023

We are interested in a novel task, namely low-resource text-to-talking avatar. Given only a few-minute-long talking person video with the audio track as the training data and arbitrary texts as the driving input, we aim to synthesize high-…

Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias Open

Ziyue Karen Jiang, Yi Ren, Zhenhui Ye, Jinglin Liu, Chen Zhang , et al. · 2023

Scaling text-to-speech to a large and wild dataset has been proven to be highly effective in achieving timbre and speech style generalization, particularly in zero-shot TTS. However, previous works usually encode speech into latent using a…

Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation Open

Jiawei Huang, Yi Ren, Rongjie Huang, Dongchao Yang, Zhenhui Ye , et al. · 2023

Large diffusion models have been successful in text-to-audio (T2A) synthesis tasks, but they often suffer from common issues such as semantic misalignment and poor temporal consistency due to limited natural language understanding and data…

AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation Open

Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li , et al. · 2023

Direct speech-to-speech translation (S2ST) aims to convert speech from one language into another, and has demonstrated significant progress to date. Despite the recent success, current S2ST models still suffer from distinct degradation in …

CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training Open

Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Karen Jiang, Jinglin Liu , et al. · 2023

Improving text representation has attracted much attention to achieve expressive text-to-speech (TTS). However, existing works only implicitly learn the prosody with masked token reconstruction tasks, which leads to low training efficiency…

RMSSinger: Realistic-Music-Score based Singing Voice Synthesis Open

Jinzheng He, Jinglin Liu, Zhenhui Ye, Rongjie Huang, Chenye Cui , et al. · 2023

We are interested in a challenging task, Realistic-Music-Score based Singing Voice Synthesis (RMS-SVS). RMS-SVS aims to generate high-quality singing voices given realistic music scores with different note types (grace, slur, rest, etc.). …

AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment Open

Ruiqi Li, Rongjie Huang, Lichao Zhang, Jinglin Liu, Zhou Zhao · 2023

The speech-to-singing (STS) voice conversion task aims to generate singing samples corresponding to speech recordings while facing a major challenge: the alignment between the target (singing) pitch contour and the source (speech) content …

GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation Open

Zhenhui Ye, Jinzheng He, Ziyue Karen Jiang, Rongjie Huang, Jiawei Huang , et al. · 2023

Generating talking person portraits with arbitrary speech audio is a crucial problem in the field of digital human and metaverse. A modern talking face generation method is expected to achieve the goals of generalized audio-lip synchroniza…

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head Open

Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang , et al. · 2023

Large language models (LLMs) have exhibited remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. Despite the recent success, current LLMs are not capable of processing comp…

Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG) Open

Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen , et al. · 2023

ICASSP2023 General Meeting Understanding and Generation Challenge (MUG) focuses on prompting a wide range of spoken language processing (SLP) research on meeting transcripts, as SLP applications are critical to improve users' efficiency in…

MUG: A General Meeting Understanding and Generation Benchmark Open

Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen , et al. · 2023

Listening to long video/audio recordings from video conferencing and online courses for acquiring information is extremely inefficient. Even after ASR systems transcribe recordings into long-form spoken language documents, reading ASR tran…

GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis Open

Zhenhui Ye, Ziyue Karen Jiang, Yi Ren, Jinglin Liu, JinZheng He , et al. · 2023

Generating photo-realistic video portrait with arbitrary speech audio is a crucial problem in film-making and virtual reality. Recently, several works explore the usage of neural radiance field in this task to improve 3D realness and image…

Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models Open

Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu , et al. · 2023

Large-scale multimodal generative modeling has created milestones in text-to-image and text-to-video generation. Its application to audio still lags behind for two main reasons: the lack of large-scale datasets with high-quality text-audio…

CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training Open

Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Karen Jiang, Jinglin Liu , et al. · 2023

Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Jiang, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2023.

RMSSinger: Realistic-Music-Score based Singing Voice Synthesis Open

Jinzheng He, Jinglin Liu, Zhenhui Ye, Rongjie Huang, Chenye Cui , et al. · 2023

We are interested in a challenging task, Realistic-Music-Score based Singing Voice Synthesis (RMS-SVS). RMS-SVS aims to generate high-quality singing voices given realistic music scores with different note types (grace, slur, rest, etc.). …

Jinglin Liu YOU? Author Swipe