Kaifu Zhang
YOU?
Author Swipe
View article: Beyond Single-Reward: Multi-Pair, Multi-Perspective Preference Optimization for Machine Translation
Beyond Single-Reward: Multi-Pair, Multi-Perspective Preference Optimization for Machine Translation Open
Direct Preference Optimization (DPO) is a powerful paradigm for aligning Large Language Models (LLMs) to human preferences in Machine Translation (MT), but current methods are hindered by two fundamental challenges: (1) flawed reward signa…
View article: Generative AI and Firm Productivity: Field Experiments in Online Retail
Generative AI and Firm Productivity: Field Experiments in Online Retail Open
We quantify the impact of Generative Artificial Intelligence (GenAI) on firm productivity through a series of large-scale randomized field experiments involving millions of users and products at a leading cross-border online retail platfor…
View article: Enhanced morphology and conductivity in aerosol jet printing via optimization of print speed range under various deposition rate
Enhanced morphology and conductivity in aerosol jet printing via optimization of print speed range under various deposition rate Open
View article: Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language Models
Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language Models Open
Instruction-following capability has become a major ability to be evaluated for Large Language Models (LLMs). However, existing datasets, such as IFEval, are either predominantly monolingual and centered on English or simply machine transl…
View article: Strain rate dependent material removal mechanism and surface morphology formation mechanism for high strength CFRTP
Strain rate dependent material removal mechanism and surface morphology formation mechanism for high strength CFRTP Open
View article: Rethinking Multilingual Vision-Language Translation: Dataset, Evaluation, and Adaptation
Rethinking Multilingual Vision-Language Translation: Dataset, Evaluation, and Adaptation Open
Vision-Language Translation (VLT) is a challenging task that requires accurately recognizing multilingual text embedded in images and translating it into the target language with the support of visual context. While recent Large Vision-Lan…
View article: Multimodal Tabular Reasoning with Privileged Structured Information
Multimodal Tabular Reasoning with Privileged Structured Information Open
Tabular reasoning involves multi-step information extraction and logical inference over tabular data. While recent advances have leveraged large language models (LLMs) for reasoning over structured tables, such high-quality textual represe…
View article: Investigation on Hole Quality and Tool Wear of <scp>CFRP</scp> Drilling With Cryogenic‐Minimal Quantity Lubrication
Investigation on Hole Quality and Tool Wear of <span>CFRP</span> Drilling With Cryogenic‐Minimal Quantity Lubrication Open
Carbon fiber reinforced polymer (CFRP) is extensively utilized in various fields, including aerospace and automotive industries, where drilling serves as a common method for mechanical assembly. However, the inherent anisotropy and inhomog…
View article: USB: A Comprehensive and Unified Safety Evaluation Benchmark for Multimodal Large Language Models
USB: A Comprehensive and Unified Safety Evaluation Benchmark for Multimodal Large Language Models Open
Despite their remarkable achievements and widespread adoption, Multimodal Large Language Models (MLLMs) have revealed significant security vulnerabilities, highlighting the urgent need for robust safety evaluation benchmarks. Existing MLLM…
View article: Beyond Safe Answers: A Benchmark for Evaluating True Risk Awareness in Large Reasoning Models
Beyond Safe Answers: A Benchmark for Evaluating True Risk Awareness in Large Reasoning Models Open
Despite the remarkable proficiency of \textit{Large Reasoning Models} (LRMs) in handling complex reasoning tasks, their reliability in safety-critical scenarios remains uncertain. Existing evaluations primarily assess response-level safety…
View article: TransBench: Benchmarking Machine Translation for Industrial-Scale Applications
TransBench: Benchmarking Machine Translation for Industrial-Scale Applications Open
Machine translation (MT) has become indispensable for cross-border communication in globalized industries like e-commerce, finance, and legal services, with recent advancements in large language models (LLMs) significantly enhancing transl…
View article: Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Open
Recent years have seen remarkable progress in both multimodal understanding models and image generation models. Despite their respective successes, these two domains have evolved independently, leading to distinct architectural paradigms: …
View article: Effect of cooling temperature on machinability and hole quality in cryogenic drilling of CFRP/Ti stacks
Effect of cooling temperature on machinability and hole quality in cryogenic drilling of CFRP/Ti stacks Open
Carbon fiber reinforced polymer/titanium alloy (CFRP/Ti) stacks are extensively employed in modern aircraft owing to their outstanding mechanical properties. At present, cryogenic machining is considered an effective method for reducing dr…
View article: The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks
The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks Open
As large language models (LLMs) continue to advance in linguistic capabilities, robust multilingual evaluation has become essential for promoting equitable technological progress. This position paper examines over 2,000 multilingual (non-E…
View article: New Trends for Modern Machine Translation with Large Reasoning Models
New Trends for Modern Machine Translation with Large Reasoning Models Open
Recent advances in Large Reasoning Models (LRMs), particularly those leveraging Chain-of-Thought reasoning (CoT), have opened brand new possibility for Machine Translation (MT). This position paper argues that LRMs substantially transforme…
View article: Marco-o1 v2: Towards Widening The Distillation Bottleneck for Reasoning Models
Marco-o1 v2: Towards Widening The Distillation Bottleneck for Reasoning Models Open
Large Reasoning Models(LRMs) such as OpenAI o1 and DeepSeek-R1 have shown remarkable reasoning capabilities by scaling test-time compute and generating long Chain-of-Thought(CoT). Distillation--post-training on LRMs-generated data--is a st…
View article: Towards Lightweight, Adaptive and Attribute-Aware Multi-Aspect Controllable Text Generation with Large Language Models
Towards Lightweight, Adaptive and Attribute-Aware Multi-Aspect Controllable Text Generation with Large Language Models Open
Multi-aspect controllable text generation aims to control text generation in attributes from multiple aspects, making it a complex but powerful task in natural language processing. Supervised fine-tuning methods are often employed for this…
View article: Thermally activated fluorescence in 9,10-DPA single crystals enabling high-performance fast neutron detection
Thermally activated fluorescence in 9,10-DPA single crystals enabling high-performance fast neutron detection Open
Organic scintillators occupy a significant niche in the realm of fast neutron detection. Nevertheless, the cultivation of large-sized and high-quality organic single crystals has persistently posed a formidable challenge. 9,10-diphenylanth…
View article: Multi-objective optimal design for flexible bio-inspired meta-structure with ultra-broadband microwave absorption and thin thickness
Multi-objective optimal design for flexible bio-inspired meta-structure with ultra-broadband microwave absorption and thin thickness Open
View article: Chinese SafetyQA: A Safety Short-form Factuality Benchmark for Large Language Models
Chinese SafetyQA: A Safety Short-form Factuality Benchmark for Large Language Models Open
View article: Enhanced Morphology and Conductivity in Aerosol Jet Printing Via Optimization of Print Speed Range Under Various Deposition Rate
Enhanced Morphology and Conductivity in Aerosol Jet Printing Via Optimization of Print Speed Range Under Various Deposition Rate Open
View article: Investigation on Static and Fatigue Performance of Cfrp/Al-Alloy Interference Bolted Joint Considering the Influence of Hole-Axis Error
Investigation on Static and Fatigue Performance of Cfrp/Al-Alloy Interference Bolted Joint Considering the Influence of Hole-Axis Error Open
View article: (Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts Open
Literary translations remains one of the most challenging frontiers in machine translation due to the complexity of capturing figurative language, cultural nuances, and unique stylistic elements. In this work, we introduce TransAgents, a n…
View article: Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language
Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language Open
View article: Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement
Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement Open
Large Language Models (LLMs) have achieved remarkable progress in recent years; however, their excellent performance is still largely limited to major world languages, primarily English. Many LLMs continue to face challenges with multiling…
View article: PMMT: Preference Alignment in Multilingual Machine Translation via LLM Distillation
PMMT: Preference Alignment in Multilingual Machine Translation via LLM Distillation Open
Translation is important for cross-language communication, and many efforts have been made to improve its accuracy. However, less investment is conducted in aligning translations with human preferences, such as translation tones or styles.…
View article: Fatigue Damage Monitoring of Composite Structures Based on Lamb Wave Propagation and Multi-Feature Fusion
Fatigue Damage Monitoring of Composite Structures Based on Lamb Wave Propagation and Multi-Feature Fusion Open
To address the challenges associated with fatigue damage monitoring in load-bearing composite structures, we developed a method that utilizes Lamb wave propagation and partial least squares regression (PLSR) for effective monitoring. Initi…
View article: Ultrasonic Guided Wave Health Monitoring of High-Temperature Aircraft Structures Based on Variational Mode Decomposition and Fuzzy Entropy
Ultrasonic Guided Wave Health Monitoring of High-Temperature Aircraft Structures Based on Variational Mode Decomposition and Fuzzy Entropy Open
This paper presents an innovative approach to high-temperature health monitoring of aircraft structures utilizing an ultrasonic guided wave transmission and reception system integrated with a zirconia heat buffer layer. Aiming to address t…
View article: Enhanced aerosol-jet printing using annular acoustic field for high resolution and minimal overspray
Enhanced aerosol-jet printing using annular acoustic field for high resolution and minimal overspray Open
View article: Ovis: Structural Embedding Alignment for Multimodal Large Language Model
Ovis: Structural Embedding Alignment for Multimodal Large Language Model Open
Current Multimodal Large Language Models (MLLMs) typically integrate a pre-trained LLM with another pre-trained vision transformer through a connector, such as an MLP, endowing the LLM with visual capabilities. However, the misalignment be…