Explanipedia

Beyond Isolated Dots: Benchmarking Structured Table Construction as Deep Knowledge Extraction Open

Tianyun Zhong, Guozhao Mo, Yanjiang Liu, Yihan Chen, Lingdi Kong , et al. · 2025

With the emergence of large language models (LLMs), there is an expectation that LLMs can effectively extract explicit information from complex real-world documents (e.g., papers, reports). However, most LLMs generate paragraph-style answe…

Computational Machine Ethics: A Survey Open

Tianyun Zhong, Song Yang, Raynaldio Limarga, Maurice Pagnucco · 2025

Computational Machine Ethics (CME) is an interdisciplinary field that integrates moral philosophy into an agent’s decision-making process, contributing to the broader domain of Artificial Intelligence Ethics. Technological advancements hav…

FADA: Fast Diffusion Avatar Synthesis with Mixed-Supervised Multi-CFG Distillation Open

Tianyun Zhong, Chao Liang, Jianwen Jiang, Gaojie Lin, Jiaqi Yang , et al. · 2024

Diffusion-based audio-driven talking avatar methods have recently gained attention for their high-fidelity, vivid, and expressive results. However, their slow inference speed limits practical applications. Despite the development of variou…

MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes Open

Zhenhui Ye, Tianyun Zhong, Yi Ren, Ziyue Karen Jiang, Jiawei Huang , et al. · 2024

Talking face generation (TFG) aims to animate a target identity's face to create realistic talking videos. Personalized TFG is a variant that emphasizes the perceptual identity similarity of the synthesized result (from the perspective of …

Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency Open

Jianwen Jiang, Chao Liang, Jiaqi Yang, Gaojie Lin, Tianyun Zhong , et al. · 2024

With the introduction of diffusion-based video generation techniques, audio-conditioned human video generation has recently achieved significant breakthroughs in both the naturalness of motion and the synthesis of portrait details. Due to …

CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention Open

Gaojie Lin, Jianwen Jiang, Chao Liang, Tianyun Zhong, Jiaqi Yang , et al. · 2024

Diffusion-based video generation technology has advanced significantly, catalyzing a proliferation of research in human animation. However, the majority of these studies are confined to same-modality driving settings, with cross-modality h…

MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices Open

Jianwen Jiang, Gaojie Lin, Zhengkun Rong, Chao Liang, Yongming Zhu , et al. · 2024

Existing neural head avatars methods have achieved significant progress in the image quality and motion range of portrait animation. However, these methods neglect the computational overhead, and to the best of our knowledge, none is desig…

Superior and Pragmatic Talking Face Generation with Teacher-Student Framework Open

Chao Liang, Jianwen Jiang, Tianyun Zhong, Gaojie Lin, Zhengkun Rong , et al. · 2024

Talking face generation technology creates talking videos from arbitrary appearance and motion signal, with the "arbitrary" offering ease of use but also introducing challenges in practical applications. Existing methods work well with sta…

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis Open

Zhenhui Ye, Tianyun Zhong, Yi Ren, Jiaqi Yang, Weichuang Li , et al. · 2024

One-shot 3D talking portrait generation aims to reconstruct a 3D avatar from an unseen image, and then animate it with a reference video or audio to generate a talking portrait video. The existing methods fail to simultaneously achieve the…

Language Model is a Branch Predictor for Simultaneous Machine Translation Open

Aoxiong Yin, Tianyun Zhong, Haoyuan Li, Siliang Tang, Zhou Zhao · 2023

The primary objective of simultaneous machine translation (SiMT) is to minimize latency while preserving the quality of the final translation. Drawing inspiration from CPU branch prediction techniques, we propose incorporating branch predi…

Gloss Attention for Gloss-free Sign Language Translation Open

Aoxiong Yin, Tianyun Zhong, Li Tang, Weike Jin, Tao Jin , et al. · 2023

Most sign language translation (SLT) methods to date require the use of gloss annotations to provide additional supervision information, however, the acquisition of gloss is not easy. To solve this problem, we first perform an analysis of …

UIRISC at SemEval-2023 Task 10: Explainable Detection of Online Sexism by Ensembling Fine-tuning Language Models Open

Tianyun Zhong, Runhui Song, Xunyuan Liu, Juelin Wang, Boya Wang , et al. · 2023

Under the umbrella of anonymous social networks, many women have suffered from abuse, discrimination, and other sexist expressions online. However, exsiting methods based on keyword filtering and matching performed poorly on online sexism …

Tianyun Zhong YOU? Author Swipe