Tiejun Zhao
Beyond Global Emotion: Fine-Grained Emotional Speech Synthesis with Dynamic Word-Level Modulation
Emotional text-to-speech (E-TTS) is central to creating natural and trustworthy human-computer interaction. Existing systems typically rely on sentence-level control through predefined labels, reference audio, or natural language prompts. …
Thinking in Character: Advancing Role-Playing Agents with Role-Aware Reasoning
The advancement of Large Language Models (LLMs) has spurred significant interest in Role-Playing Agents (RPAs) for applications such as emotional companionship and virtual interaction. However, recent RPAs are often built on explicit dialo…
Speculative Decoding Meets Quantization: Compatibility Evaluation and Hierarchical Framework Design
Speculative decoding and quantization effectively accelerate memory-bound inference of large language models. Speculative decoding mitigates the memory bandwidth bottleneck by verifying multiple tokens within a single forward pass, which i…
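The abstract above describes the core verification step of speculative decoding: a small draft model proposes several tokens, and the target model checks them all in a single forward pass. The toy sketch below illustrates that greedy-acceptance idea only; it is not the paper's framework, and the function name and array shapes are hypothetical.

```python
import numpy as np

def verify_draft(target_logits, draft_tokens):
    """Greedy verification step of speculative decoding (toy sketch).

    target_logits: (k, vocab) array of logits the target model computed
    for the k draft positions in one forward pass (hypothetical shapes).
    draft_tokens: the k tokens proposed by the small draft model.
    Returns the accepted prefix; on the first disagreement, the target
    model's own token is substituted and verification stops.
    """
    accepted = []
    for i, tok in enumerate(draft_tokens):
        best = int(np.argmax(target_logits[i]))
        if best == tok:          # draft agrees with target: accept and continue
            accepted.append(tok)
        else:                    # first mismatch: take the target's token, stop
            accepted.append(best)
            break
    return accepted
```

Because all k positions are scored in one target-model pass, every accepted draft token saves a full autoregressive step, which is where the memory-bandwidth savings come from.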
Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory
The evaluation of large language models (LLMs) via benchmarks is widespread, yet inconsistencies between different leaderboards and poor separability among top models raise concerns about their ability to accurately reflect authentic model…
Empowering LLMs in Task-Oriented Dialogues: A Domain-Independent Multi-Agent Framework and Fine-Tuning Strategy
Task-oriented dialogue systems based on Large Language Models (LLMs) have gained increasing attention across various industries and achieved significant results. Current approaches condense complex procedural workflows into a single agent …
MuSC: Improving Complex Instruction Following with Multi-granularity Self-Contrastive Training
Complex instruction-following with elaborate constraints is imperative for Large Language Models (LLMs). While existing methods have constructed data for complex instruction alignment, they all rely on a more advanced model, especially GPT…
Evaluating o1-Like LLMs: Unlocking Reasoning for Translation through Comprehensive Analysis
The o1-Like LLMs are transforming AI by simulating human cognitive processes, but their performance in multilingual machine translation (MMT) remains underexplored. This study examines: (1) how o1-Like LLMs perform in MMT tasks and (2) wha…
Benchmarking LLMs for Translating Classical Chinese Poetry: Evaluating Adequacy, Fluency, and Elegance
A Knowledge-Fused Maximum Mean Discrepancy for Cross-Lingual Named Entity Recognition
Memory-augmented Query Reconstruction for LLM-based Knowledge Graph Reasoning
Empowering Beyond English-Centric Machine Translation on LLMs by Multilingual Fusion Instruction Tuning
ASMem: Anchor Sparse Memory for Multi-Domain Knowledge Editing of Large Language Models
LLM-based Discriminative Reasoning for Knowledge Graph Question Answering
Large language models (LLMs) based on the generative pre-trained Transformer architecture have achieved remarkable performance on knowledge graph question-answering (KGQA) tasks. However, LLMs often produce ungrounded subgraph planning or reasoning results…
Make Imagination Clearer! Stable Diffusion-based Visual Imagination for Multimodal Machine Translation
Visual information has been introduced for enhancing machine translation (MT), and its effectiveness heavily relies on the availability of large amounts of bilingual parallel sentence pairs with manual image annotations. In this paper, we …
UCFA‐Net: A U‐shaped cross‐fusion network with attention mechanism for enhanced polyp segmentation
Enhancing the precision of computer‐assisted polyp segmentation and delineation during colonoscopies assists in the removal of potentially precancerous tissue, thus reducing the risk of malignant transformation. Most of the current medical…
LLM-based Translation Inference with Iterative Bilingual Understanding
The remarkable understanding and generation capabilities of large language models (LLMs) have greatly improved translation performance. However, incorrect understanding of the sentence to be translated can degrade translation quality. To a…
Mitigating the Bias of Large Language Model Evaluation
Recently, there has been a trend of evaluating Large Language Model (LLM) quality in the style of LLM-as-a-Judge, namely leveraging another LLM to evaluate the current output quality. However, existing judges are proven to be biased, …
Large Language Models for Classical Chinese Poetry Translation: Benchmarking, Evaluating, and Improving
Unlike traditional translation tasks, classical Chinese poetry translation requires both adequacy and fluency in translating culturally and historically significant content, as well as linguistic poetic elegance. Large language models …
STAR: Scale-wise Text-conditioned AutoRegressive image generation
We introduce STAR, a text-to-image model that employs a scale-wise auto-regressive paradigm. Unlike VAR, which is constrained to class-conditioned synthesis for images up to 256×256, STAR enables text-driven image generation up to 1…
DUAL-REFLECT: Enhancing Large Language Models for Reflective Translation through Dual Learning Feedback Mechanisms
Recently, large language models (LLMs) enhanced by self-reflection have achieved promising performance on machine translation. The key idea is guiding LLMs to generate translation with human-like feedback. However, existing self-reflection…
Morphological Identification Characteristics of Basil (Ocimum spp.) in Tabanan Regency, Bali, Indonesia
Basil (Ocimum spp.) is an aromatic plant and one of the richest essential oil-producing genera in the Lamiaceae family. Owing to its varied phytochemical compounds and secondary metabolites, basil has the potential of medicinal plant germpl…
DesignProbe: A Graphic Design Benchmark for Multimodal Large Language Models
A well-executed graphic design typically achieves harmony in two levels, from the fine-grained design elements (color, font and layout) to the overall design. This complexity makes the comprehension of graphic design challenging, for it ne…
Dual Instruction Tuning with Large Language Models for Mathematical Reasoning
Recent advancements highlight the success of instruction tuning with large language models (LLMs) utilizing Chain-of-Thought (CoT) data for mathematical reasoning tasks. Despite the fine-tuned LLMs, challenges persist, such as incorrect, m…
Enhancing Bilingual Lexicon Induction via Bi-directional Translation Pair Retrieving
Most Bilingual Lexicon Induction (BLI) methods retrieve word translation pairs by finding the closest target word for a given source word based on cross-lingual word embeddings (WEs). However, we find that solely retrieving translation fro…
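The baseline the abstract above describes, retrieving the closest target word for each source word in a shared cross-lingual embedding space, can be sketched in a few lines. This is only the standard nearest-neighbor baseline the paper improves upon, not the paper's bi-directional method; the function name, embeddings, and words below are hypothetical.

```python
import numpy as np

def retrieve_translations(src_emb, tgt_emb, tgt_words):
    """Nearest-neighbor BLI baseline (toy sketch).

    For each source word embedding, return the target word whose
    embedding is closest by cosine similarity in a shared cross-lingual
    space. Real systems typically add refinements such as CSLS or the
    bi-directional retrieval discussed in the paper.
    """
    # Normalize rows so the dot product equals cosine similarity.
    src = src_emb / np.linalg.norm(src_emb, axis=1, keepdims=True)
    tgt = tgt_emb / np.linalg.norm(tgt_emb, axis=1, keepdims=True)
    sims = src @ tgt.T  # (n_src, n_tgt) cosine similarity matrix
    return [tgt_words[i] for i in np.argmax(sims, axis=1)]
```

One known weakness of this one-directional retrieval, which motivates bi-directional approaches, is hubness: a few target words can be the nearest neighbor of many unrelated source words.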
Self-Evaluation of Large Language Model based on Glass-box Features
The proliferation of open-source Large Language Models (LLMs) underscores the pressing need for evaluation methods. Existing works primarily rely on external evaluators, focusing on training and prompting strategies. However, a crucial asp…
Hierarchical Latent Alignment for Non-Autoregressive Generation under High Compression Ratio
Non-autoregressive generation has attracted increasing attention due to its fast decoding speed. Latent alignment objectives, such as CTC, are designed to capture the monotonic alignments between the predicted and output tokens, which h…