Explanipedia

Printed sensing human-machine interface with individualized adaptive machine learning Open

Guohui Wang, Yao Tang, Xinran Luo, Shengdi Lu, Yiru Zhou , et al. · 2025

Developing intelligent robots with integrated sensing capabilities is critical for advanced manufacturing, medical robots, and embodied intelligence. Existing robotic sensing technologies are limited to recording of acceleration, driving t…

Word Level Timestamp Generation for Automatic Speech Recognition and Translation Open

Ke Hu, Krishna C. Puvvada, Elena Rastorgueva, Zhehuai Chen, He Huang , et al. · 2025

We introduce a data-driven approach for enabling word-level timestamp prediction in the Canary model. Accurate timestamp information is crucial for a variety of downstream tasks such as speech content retrieval and timed subtitles. While t…

SALM-Duplex: Efficient and Direct Duplex Modeling for Speech-to-Speech Language Model Open

Ke Hu, Ehsan Hosseini-Asl, Chen Chen, Edresson Casanova, Subhankar Ghosh , et al. · 2025

Spoken dialogue is an intuitive form of human-computer interaction, yet current speech language models often remain constrained to turn-based exchanges, lacking real-time adaptability such as user barge-in. We propose a novel duplex speech…

<i>TESS</i> photometry, radial velocity, and orbital period investigations of four eclipsing contact binaries Open

Zi-Bin Meng, Pei-Ru Wu, Shuguang Zeng, Yun-Xia Yu, Ke Hu , et al. · 2025

We collected photometric data from the Transiting Exoplanet Survey Satellite and spectroscopic observations from the Large Sky Area Multi-Object Fiber Spectroscopic Telescope. Using this data, we simultaneously analyzed the radial velocity…

Enhanced Landslide Risk Evaluation in Hydroelectric Reservoir Zones Utilizing an Improved Random Forest Approach Open

Aimin Wei, Ke Hu, Shuni He, Mingliang JIANG, Zeying Yao , et al. · 2025

Landslides on reservoir slopes are one of the key geologic hazards that threaten the safe operation of hydropower plants. The aim of our study was to reduce the limitations of the existing methods of landslide risk assessment when dealing …

Research on the Cross-Industry Application of Autonomous Driving Technology in the Field of FinTech Open

Yong Wang, Zepeng Shen, Jia Wei Chew, Zhiyuan Wang, Ke Hu · 2025

This thesis focuses on the interdisciplinary integration of autonomous driving technology and financial technology (FinTech), exploring the synergistic effects and application prospects of these two cutting-edge fields under the impetus of…

Artificial Intelligence Empowering Robo-Advisors: A Data-Driven Wealth Management Model Analysis Open

Zepeng Shen, Zhiyuan Wang, Jonathan Chew, Ke Hu, Yong Wang · 2025

In the digital age, the rapid development of financial technology has brought new opportunities to wealth management, especially with the emergence of robo-advisors as an innovative wealth management model that is increasingly favored by i…

Adversarial Machine Learning in Cybersecurity: Attacks and Defenses Open

Ke Hu, Jian Xu, Yong Wang, Heyao Chen, Zepeng Shen · 2025

Adversarial Machine Learning (AML) refers to the research field that involves testing and improving machine learning models by introducing adversarial samples or attack techniques. In the cybersecurity domain, AML has significant potential…

SpeechIQ: Speech-Agentic Intelligence Quotient Across Cognitive Levels in Voice Understanding by Large Language Models Open

Zhen Wan, Chao-Han Huck Yang, Yahan Yu, Jinchuan Tian, Sheng Li , et al. · 2025

We introduce Speech-based Intelligence Quotient (SIQ) as a new form of human cognition-inspired evaluation pipeline for voice understanding large language models, LLM Voice, designed to assess their voice understanding ability. Moving beyo…

VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning Open

Yifan Peng, Krishna C. Puvvada, Zhehuai Chen, Piotr Żelasko, He Huang , et al. · 2025

Cation exchange reshapes Cu active sites to promote C−C coupling for efficiently selective production of C2H4 from CO2 photoreduction Open

Weili Dai, Yong Xu, Ke Hu, Jie Zheng, Yue Wang , et al. · 2025

NeKo: Cross-Modality Post-Recognition Error Correction with Tasks-Guided Mixture-of-Experts Language Model Open

Yen‐Ting Lin, Zhehuai Chen, Piotr Żelasko, Zhen Wan, Xuesong Yang , et al. · 2025

Large-scale acceleration algorithms for a deep convective physical parameterization scheme on GPU Open

Yongfei Wang, Jun Ping Wang, Jiarui Tian, Lin Li, F.C. Ma , et al. · 2024

Early warning of geological hazards requires monitoring extreme weather conditions, such as heavy rainfall. Atmospheric circulation models are used for weather forecasting and climate simulation. As a critical physical process in atmospher…

NeKo: Cross-Modality Post-Recognition Error Correction with Tasks-Guided Mixture-of-Experts Language Model Open

Yen‐Ting Lin, Chao-Han Huck Yang, Zhehuai Chen, Piotr Żelasko, Xuesong Yang , et al. · 2024

Construction of a general-purpose post-recognition error corrector poses a crucial question: how can we most effectively train a model on a large mixture of domain datasets? The answer would lie in learning dataset-specific features and di…

VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning Open

Yifan Peng, Krishna C. Puvvada, Zhehuai Chen, Piotr Żelasko, He Huang , et al. · 2024

Recent studies have augmented large language models (LLMs) with speech capabilities, leading to the development of speech language models (SpeechLMs). Earlier SpeechLMs focused on single-turn speech-based question answering (QA), where use…

EMMeTT: Efficient Multimodal Machine Translation Training Open

Piotr Żelasko, Zhehuai Chen, Mengru Wang, Daniel Gálvez, Oleksii Hrinchuk , et al. · 2024

A rising interest in the modality extension of foundation language models warrants discussion on the most effective, and efficient, multimodal training approach. This work focuses on neural machine translation (NMT) and proposes a joint mu…

Chain-of-Thought Prompting for Speech Translation Open

Ke Hu, Zhehuai Chen, Chao-Han Huck Yang, Piotr Żelasko, Oleksii Hrinchuk , et al. · 2024

Large language models (LLMs) have demonstrated remarkable advancements in language understanding and generation. Building on the success of text-based LLMs, recent research has adapted these models to use speech embeddings for prompting, r…

OO Leo: An Active Contact Binary with Possible Solar-like Differential Rotation Open

Zi-Bin Meng, Pei-Ru Wu, Yun-Xia Yu, Ke Hu, Fu-Yuan Xiang · 2024

With Transiting Exoplanet Survey Satellite (TESS) high-precision photometry and Large Sky Area Multi-object Fiber Spectroscopic Telescope medium-resolution spectra, we present the first light and radial velocity curve analyses for the ecli…

Research on a Multi-Dimensional Indicator Assessment Model for Evaluating Landslide Risk near Large Alpine Reservoirs Open

Hanyin Hu, Ke Hu, Xinyao Zhang, Jianbo Yi · 2024

Geological disasters in large alpine reservoirs primarily take the form of landslide occurrences and are predominantly induced by slope instability. Presently, risk monitoring and assessment strategies tend to prioritize sudden alerts over…

Enhancing Visual Continual Learning with Language-Guided Supervision Open

Bolin Ni, Zhao Hong-bo, Chenghao Zhang, Ke Hu, Gaofeng Meng , et al. · 2024

Continual learning (CL) aims to empower models to learn new tasks without forgetting previously acquired knowledge. Most prior works concentrate on the techniques of architectures, replay data, regularization, \etc. However, the category n…

Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study Open

W. Ronny Huang, Cyril Allauzen, Tongzhou Chen, Kilol Gupta, Ke Hu , et al. · 2024

In the era of large models, the autoregressive nature of decoding often results in latency serving as a significant bottleneck. We propose a non-autoregressive LM-fused ASR system that effectively leverages the parallelization capabilities…

Preparation and Performance Investigation of Silicate Non-Sintered Ceramsite Using Engineering Waste Soil Under The Action of Alkali-Thermal Activation Open

Lijuan Wang, Ke Hu, Chengzhi Xiao, Shuai Wang, Bei Li · 2024

Influence of Curve Location and Type of Adolescent Idiopathic Scoliosis on Static and Dynamic Plantar Pressure Open

Dongmei Ai, Wei Jin, Jiyuan Li, Biyun Xu, Zaixing Liu , et al. · 2024

Feature Norm Regularized Federated Learning: Transforming Skewed Distributions into Global Insights Open

Ke Hu, Weidong Qiu, Peng Tang · 2023

In the field of federated learning, addressing non-independent and identically distributed (non-i.i.d.) data remains a quintessential challenge for improving global model performance. This work introduces the Feature Norm Regularized Feder…

Improving Joint Speech-Text Representations Without Alignment Open

Cal Peyser, Zhong Meng, Ke Hu, Rohit Prabhavalkar, Andrew E. Rosenberg , et al. · 2023

The last year has seen astonishing progress in text-prompted image generation premised on the idea of a cross-modal representation space in which the text and image domains are represented jointly. In ASR, this idea has found application a…

Mixture-of-Expert Conformer for Streaming Multilingual ASR Open

Ke Hu, Bo Li, Tara N. Sainath, Yu Zhang, Françoise Beaufays · 2023

End-to-end models with large capacity have significantly improved multilingual automatic speech recognition, but their computation cost poses challenges for on-device applications. We propose a streaming truly multilingual Conformer incorp…

IP Lyn: A Totally Eclipsing Contact Binary with an Extremely Low Mass Ratio Open

Zi-Xuan Yin, Zi-Bin Meng, Pei-Ru Wu, Xu-Dong Zhang, Yunxia Yu , et al. · 2023

We present the first photometric and orbital period investigations for a neglected totally eclipsing contact binary IP Lyn. The photometric solutions derived from both ground-based and several surveys’ observations suggest that it is a sha…

Hot Subdwarf Stars Identified in LAMOST DR8 with Single-lined and Composite Spectra Open

Zhenxin Lei, Ruijie He, Péter Németh, J. Vos, Xuan Zou , et al. · 2023

A total of 222 hot subdwarf stars were identified with LAMOST DR8 spectra, among which 131 stars show composite spectra and have been decomposed, while 91 stars present single-lined spectra. Atmospheric parameters of all sample stars were …

Hybrid CNN Based Attention with Category Prior for User Image Behavior Modeling Open

Xin Chen, Qingtao Tang, Ke Hu, Xu Yue, Shihang Qiu , et al. · 2022

User historical behaviors are proved useful for Click Through Rate (CTR) prediction in online advertising system. In Meituan, one of the largest e-commerce platform in China, an item is typically displayed with its image and whether a user…

Deep Position-wise Interaction Network for CTR Prediction Open

Jianqiang Huang, Ke Hu, Qingtao Tang, Mingjian Chen, Yi Qi , et al. · 2021

Click-through rate (CTR) prediction plays an important role in online\nadvertising and recommender systems. In practice, the training of CTR models\ndepends on click data which is intrinsically biased towards higher positions\nsince higher…

Ke Hu YOU? Author Swipe