Daimeng Wei
A method for improving multilingual quality and diversity of instruction fine-tuning datasets
Multilingual Instruction Fine-Tuning (IFT) is essential for enabling large language models (LLMs) to generalize effectively across diverse linguistic and cultural contexts. However, the scarcity of high-quality multilingual training data a…
RationAnomaly: Log Anomaly Detection with Rationality via Chain-of-Thought and Reinforcement Learning
Logs constitute a form of evidence signaling the operational status of software systems. Automated log anomaly detection is crucial for ensuring the reliability of modern software systems. However, existing approaches face significant limi…
Generative Annotation for ASR Named Entity Correction
End-to-end automatic speech recognition systems often fail to transcribe domain-specific named entities, causing catastrophic failures in downstream tasks. Numerous fast and lightweight named entity correction (NEC) models have been propos…
MIDB: Multilingual Instruction Data Booster for Enhancing Cultural Equality in Multilingual Instruction Synthesis
Despite doubts about data quality, instruction synthesis has been widely applied to instruction tuning (IT) of LLMs as an economical and rapid alternative. Recent endeavors focus on improving data quality for synthesized instruction pairs in …
Combining the Best of Both Worlds: A Method for Hybrid NMT and LLM Translation
Large language models (LLMs) show promising performance in a variety of downstream tasks, such as machine translation (MT). However, using LLMs for translation suffers from high computational costs and significant latency. Based on our eva…
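One way to read the hybrid idea in this abstract is as a quality-gated router: use the fast NMT model by default and fall back to the LLM only when the NMT hypothesis looks weak. The sketch below is a minimal illustration under that assumption; `nmt`, `llm`, and `qe_score` are hypothetical callables and the threshold is illustrative, not the paper's actual decision rule.

```python
def hybrid_translate(src, nmt, llm, qe_score, threshold=0.7):
    """Route between a fast NMT model and a slower LLM: keep the NMT
    hypothesis unless its estimated quality falls below a threshold.
    All three callables are hypothetical hooks, not the paper's API."""
    draft = nmt(src)
    if qe_score(src, draft) >= threshold:
        return draft  # cheap path: the NMT output looks good enough
    return llm(src)   # expensive path: fall back to the LLM
```

The appeal of such a gate is that latency stays near the NMT baseline for the majority of inputs, with LLM cost paid only on the hard cases the quality estimator flags.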
Two Intermediate Translations Are Better Than One: Fine-tuning LLMs for Document-level Translation Refinement
Recent research has shown that large language models (LLMs) can enhance translation quality through self-refinement. In this paper, we build on this idea by extending the refinement from sentence-level to document-level translation, specif…
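A minimal sketch of how a refinement input might be assembled from the two intermediate translations the title refers to; the template and variable names are illustrative assumptions, not the paper's fine-tuning format.

```python
def refinement_prompt(doc_src: str, draft_a: str, draft_b: str) -> str:
    """Build a refinement input from two intermediate document
    translations (e.g., drafts from two systems or two passes).
    The template is an assumption for illustration only."""
    return (
        f"Source document:\n{doc_src}\n\n"
        f"Intermediate translation 1:\n{draft_a}\n\n"
        f"Intermediate translation 2:\n{draft_b}\n\n"
        "Refined document-level translation:"
    )
```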
DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation
Document-level context is crucial for handling discourse challenges in text-to-text document-level machine translation (MT). Despite the increased discourse challenges introduced by noise from automatic speech recognition (ASR), the integr…
R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning
Despite recent breakthroughs in reasoning-enhanced large language models (LLMs) like DeepSeek-R1, incorporating inference-time reasoning into machine translation (MT), where human translators naturally employ structured, multi-layered reas…
Chain-of-Description: What I can understand, I can put into words
In this paper, we propose a novel strategy defined as Chain-of-Description (CoD) Prompting, tailored for Multi-Modal Large Language Models. This approach involves having the model first provide a detailed description of the multi-modal inp…
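The two-pass structure described here, describe first and then answer from the description, can be sketched as follows. The `ask_model` hook is a hypothetical stand-in for a multi-modal LLM call that already has the image or audio attached, and the exact prompts are illustrative.

```python
def chain_of_description(ask_model, question: str) -> str:
    """Two-pass CoD prompting: elicit a detailed description of the
    multi-modal input first, then answer conditioned on it.
    `ask_model(prompt)` is a hypothetical model hook."""
    description = ask_model(
        "First, describe the given multi-modal input in detail."
    )
    return ask_model(
        f"Detailed description of the input:\n{description}\n\n"
        f"Using this description, answer: {question}"
    )

# Toy stand-in for a real multi-modal model call.
answer = chain_of_description(lambda p: f"[model output for: {p[:40]}...]",
                              "How many people are in the image?")
```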
Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation
The field of artificial intelligence has witnessed significant advancements in natural language processing, largely attributed to the capabilities of Large Language Models (LLMs). These models form the backbone of Agents designed to addres…
M-Ped: Multi-Prompt Ensemble Decoding for Large Language Models
With the widespread application of Large Language Models (LLMs) in the field of Natural Language Processing (NLP), enhancing their performance has become a research hotspot. This paper presents a novel multi-prompt ensemble decoding approa…
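One plausible realization of multi-prompt ensemble decoding is to average the next-token distributions obtained under several prompt variants at each greedy step. The sketch below uses a toy vocabulary and a hypothetical `next_token_probs` model hook; it illustrates the ensembling mechanics, not the paper's exact aggregation rule.

```python
import numpy as np

VOCAB = ["yes", "no", "maybe", "</s>"]

def next_token_probs(prompt: str, prefix: list[str]) -> np.ndarray:
    """Hypothetical model hook: a distribution over VOCAB for the next
    token, given a prompt and the tokens generated so far."""
    rng = np.random.default_rng(abs(hash((prompt, tuple(prefix)))) % 2**32)
    logits = rng.normal(size=len(VOCAB))
    e = np.exp(logits - logits.max())
    return e / e.sum()

def ensemble_decode(prompts: list[str], max_len: int = 8) -> list[str]:
    """Greedy decoding where each step averages the next-token
    distributions produced under every prompt variant."""
    out: list[str] = []
    for _ in range(max_len):
        avg = np.mean([next_token_probs(p, out) for p in prompts], axis=0)
        tok = VOCAB[int(avg.argmax())]
        if tok == "</s>":
            break
        out.append(tok)
    return out

prompts = [
    "Translate to German: Hello there.",
    "Provide a German translation of: Hello there.",
    "English: Hello there. German:",
]
print(ensemble_decode(prompts))
```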
Context-aware and Style-related Incremental Decoding framework for Discourse-Level Literary Translation
This report outlines our approach for the WMT24 Discourse-Level Literary Translation Task, focusing on the Chinese-English language pair in the Constrained Track. Translating literary texts poses significant challenges due to the nuanced m…
Machine Translation Advancements of Low-Resource Indian Languages by Transfer Learning
This paper introduces the submission by Huawei Translation Center (HW-TSC) to the WMT24 Indian Languages Machine Translation (MT) Shared Task. To develop a reliable machine translation system for low-resource Indian languages, we employed …
Multilingual Transfer and Domain Adaptation for Low-Resource Languages of Spain
This article introduces the submission of Huawei Translation Service Center (HW-TSC) to the Translation into Low-Resource Languages of Spain task at WMT 2024. We participated in three translation tasks: Spanish to Aragonese (es-ar…
Exploring the traditional NMT model and Large Language Model for chat translation
This paper describes the submissions of Huawei Translation Services Center (HW-TSC) to the WMT24 chat translation shared task in both directions of English$\leftrightarrow$German (en-de). The experiments involved fine-tuning models using chat data an…
HW-TSC's Submission to the CCMT 2024 Machine Translation Tasks
This paper presents the submission of Huawei Translation Services Center (HW-TSC) to machine translation tasks of the 20th China Conference on Machine Translation (CCMT 2024). We participate in the bilingual machine translation task and mu…
Choose the Final Translation from NMT and LLM hypotheses Using MBR Decoding: HW-TSC's Submission to the WMT24 General MT Shared Task
This paper presents the submission of Huawei Translation Services Center (HW-TSC) to the WMT24 general machine translation (MT) shared task, where we participate in the English to Chinese (en2zh) language pair. Similar to previous years' wor…
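The selection step named in the title, minimum Bayes risk (MBR) decoding over a pooled set of NMT and LLM hypotheses, can be sketched as follows. This is a minimal illustration: the token-overlap utility and the candidate strings are stand-ins, not the metric or systems used in the submission, which would rely on a stronger utility function.

```python
from collections import Counter

def token_f1(hyp: str, ref: str) -> float:
    """Token-overlap F1 as a stand-in utility; a real MBR setup would
    use a stronger metric (e.g., a neural MT quality metric)."""
    h, r = Counter(hyp.split()), Counter(ref.split())
    overlap = sum((h & r).values())
    if overlap == 0:
        return 0.0
    p, rec = overlap / sum(h.values()), overlap / sum(r.values())
    return 2 * p * rec / (p + rec)

def mbr_select(candidates: list[str], utility=token_f1) -> str:
    """Return the candidate with the highest expected utility against
    the others, each candidate doubling as a pseudo-reference."""
    best, best_score = None, float("-inf")
    for hyp in candidates:
        score = sum(utility(hyp, ref) for ref in candidates if ref is not hyp)
        if score > best_score:
            best, best_score = hyp, score
    return best

# Hypotheses pooled from an NMT system and an LLM (illustrative strings).
pool = [
    "the cat sat on the mat",
    "the cat sat on a mat",
    "a cat is sitting on the mat",
]
print(mbr_select(pool))
```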
LA-RAG: Enhancing LLM-based ASR Accuracy with Retrieval-Augmented Generation
Recent advancements in integrating speech information into large language models (LLMs) have significantly improved automatic speech recognition (ASR) accuracy. However, existing methods are often constrained by the capabilities of the speech …
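A minimal sketch of the retrieval-augmented pattern the title suggests: retrieve stored utterances whose embeddings are close to the query, then build an LLM prompt from their transcripts to guide transcription. The embedding source and the prompt template are assumptions for illustration, not LA-RAG's actual pipeline.

```python
import numpy as np

def retrieve(query_emb, datastore_embs, transcripts, top_k=3):
    """Cosine-similarity retrieval of the most similar stored utterances."""
    q = query_emb / np.linalg.norm(query_emb)
    d = datastore_embs / np.linalg.norm(datastore_embs, axis=1, keepdims=True)
    sims = d @ q
    best = np.argsort(-sims)[:top_k]
    return [transcripts[i] for i in best]

def build_prompt(examples, asr_draft):
    """Assemble an LLM prompt from retrieved transcripts and the draft
    transcription; the template is an illustrative assumption."""
    shots = "\n".join(f"Similar utterance transcript: {t}" for t in examples)
    return (f"{shots}\nDraft transcription: {asr_draft}\n"
            "Corrected transcription:")
```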
An End-to-End Speech Summarization Using Large Language Model
Abstractive Speech Summarization (SSum) aims to generate human-like text summaries from spoken content. It encounters difficulties in handling long speech input and capturing the intricate cross-modal mapping between long speech inputs and short t…
Speaker-Smoothed kNN Speaker Adaptation for End-to-End ASR
Despite recent improvements in End-to-End Automatic Speech Recognition (E2E ASR) systems, the performance can degrade due to vocal characteristic mismatches between training and testing data, particularly with limited target speaker adapta…
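A common form of kNN adaptation, which this abstract appears to build on, interpolates the ASR model's output distribution with a distribution derived from a target-speaker datastore of (hidden state, token) pairs. A minimal numpy sketch; the smoothing temperature and interpolation weight are illustrative, not the paper's settings.

```python
import numpy as np

def knn_distribution(query, keys, values, vocab_size, k=4, temperature=1.0):
    """Token distribution from the k nearest datastore entries: keys are
    hidden states, values the tokens emitted at those states."""
    d = np.linalg.norm(keys - query, axis=1)   # L2 distance to each entry
    idx = np.argsort(d)[:k]                    # k nearest neighbors
    w = np.exp(-d[idx] / temperature)
    w /= w.sum()
    p = np.zeros(vocab_size)
    for weight, tok in zip(w, values[idx]):
        p[tok] += weight
    return p

def adapted_probs(model_probs, query, keys, values, lam=0.25):
    """Interpolate the ASR model's distribution with the kNN one."""
    p_knn = knn_distribution(query, keys, values, vocab_size=len(model_probs))
    return (1 - lam) * model_probs + lam * p_knn

rng = np.random.default_rng(0)
keys = rng.normal(size=(100, 16))          # target-speaker hidden states
values = rng.integers(0, 50, size=100)     # tokens emitted at those states
query = rng.normal(size=16)                # current decoding state
model_p = np.full(50, 1 / 50)              # flat model distribution (toy)
print(adapted_probs(model_p, query, keys, values).sum())  # ~1.0
```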
Cross-Domain Audio Deepfake Detection: Dataset and Analysis
Audio deepfake detection (ADD) is essential for preventing the misuse of synthetic voices that may infringe on personal rights and privacy. Recent zero-shot text-to-speech (TTS) models pose higher risks as they can clone voices with a sing…
A Novel Paradigm Boosting Translation Capabilities of Large Language Models
This paper presents a study on strategies to enhance the translation capabilities of large language models (LLMs) in the context of machine translation (MT) tasks. The paper proposes a novel paradigm consisting of three stages: Secondary P…
DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be Better Context-aware Translators
Generally, decoder-only large language models (LLMs) are adapted to context-aware neural machine translation (NMT) in a concatenating way, where LLMs take the concatenation of the source sentence (i.e., intra-sentence context) and the …
R-BI: Regularized Batched Inputs enhance Incremental Decoding Framework for Low-Latency Simultaneous Speech Translation
Incremental Decoding is an effective framework that enables the use of an offline model in a simultaneous setting without modifying the original model, making it suitable for Low-Latency Simultaneous Speech Translation. However, this frame…
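Incremental decoding as described here re-decodes the audio received so far with the unmodified offline model and commits only stable output; a common stability test is agreement across hypotheses, here across a batch of regularized input variants. A minimal sketch with a hypothetical `decode` hook; the agreement rule is one standard choice, not necessarily the paper's exact criterion.

```python
def longest_common_prefix(hyps: list[list[str]]) -> list[str]:
    """Tokens shared by every hypothesis from the start."""
    prefix = []
    for toks in zip(*hyps):
        if all(t == toks[0] for t in toks):
            prefix.append(toks[0])
        else:
            break
    return prefix

def incremental_step(decode, committed, chunk_variants):
    """One step of incremental decoding: decode every regularized
    variant of the audio received so far, then extend the committed
    output by the tokens all variants agree on.
    `decode(chunk)` is a hypothetical hook to the offline ST model."""
    hyps = [decode(v) for v in chunk_variants]
    agreed = longest_common_prefix(hyps)
    return agreed if len(agreed) > len(committed) else committed
```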