Explanipedia

MIRA: A Transformer-Based Framework for Idler Roller Anomaly Detection and Localization Open

Younho Nam, S I Shim, Kyeong Min Shin, Young-Joo Suh · 2025

Monitoring the condition of belt conveyor idlers is critical for ensuring safe and efficient operation of industrial conveying systems. However, existing methods often suffer from limited scalability and delayed fault detection, particular…

HyFLM: A Hypernetwork-Based Federated Learning with Multidimensional Trajectory Optimization on Diffusion Paths Open

Young-Joo Suh · 2025

The effective training of large-scale distributed deep learning models has become an active and emerging research area in recent years. Federated learning (FL) can address those challenges by training global models through parameter exchan…

UDirEar: Heading Direction Tracking with Commercial UWB Earbud by Interaural Distance Calibration Open

Minseok Kim, Younho Nam, Jinyou Kim, Young-Joo Suh · 2025

Accurate heading direction tracking is essential for immersive VR/AR, spatial audio rendering, and robotic navigation. Existing IMU-based methods suffer from drift and vibration artifacts, vision-based approaches require LoS and raise priv…

A Wi-Fi Fingerprinting Indoor Localization Framework Using Feature-Level Augmentation via Variational Graph Auto-Encoder Open

Dongdeok Kim, Jae-Hyeon Park, Young-Joo Suh · 2025

Computer science Philosophy

Wi-Fi fingerprinting is a widely adopted technique for indoor localization in location-based services (LBS) due to its cost-effectiveness and ease of deployment using existing infrastructure. However, the performance of these systems often…

EmoSDS: Unified Emotionally Adaptive Spoken Dialogue System Using Self-Supervised Speech Representations Open

J. S. Lee, Youngjun Sim, Jinyou Kim, Young-Joo Suh · 2025

Computer science

In recent years, advancements in artificial intelligence, speech, and natural language processing technology have enhanced spoken dialogue systems (SDSs), enabling natural, voice-based human–computer interaction. However, discrete, token-b…

MILD: Minimizing Idle Listening Energy Consumption via Down-Clocking for Energy-Efficient Wi-Fi Communications Open

Jae-Hyeon Park, Young-Joo Suh, Dongdeok Kim, Harim Lee, Hyeongtae Ahn, et al. · 2025

Computer science Engineering Physics

Mobile devices, such as smartphones and laptops, face energy consumption challenges due to battery limitations, with Wi-Fi being one of the major sources of energy consumption in these devices. The IEEE 802.11 standard addresses this issue…

Domain Generalized Open-Set Fault Detection and Diagnosis for Belt Conveyor Systems With Prototype Learning Open

Jinyou Kim, Il-Cheol Yi, Young-Joo Suh · 2025

Computer science Geology Mathematics

Belt conveyor systems are essential across various industries but are prone to faults due to their distinctive design and challenging operational environments. Various approaches have been explored for fault detection and diagnosis (FDD) i…

Improving Monocular Depth Estimation Through Knowledge Distillation: Better Visual Quality and Efficiency Open

Cheuk Hung Lee, Dong Ju Kim, Young-Joo Suh, Do Kyung Hwang · 2024

Computer science Engineering Philosophy

This paper introduces a novel knowledge distillation (KD) framework for monocular depth estimation (MDE), incorporating dynamic weight adaptation to address critical challenges. The proposed approach effectively mitigates visual limitation…

Designing a Multivariate Belt Conveyor Idler Stall Detection and Identification System with Scalability Analysis Open

Kyeong Min Shin, Younho Nam, Young-Joo Suh · 2024

Engineering Computer science

Belt conveyor idlers are freely rotating rollers supporting the belt of a conveyor, and can induce severe frictional damage to the belt when they fail. Therefore, fast and accurate detection of idler faults is crucial for the effective mainte…

QR-VC: Leveraging Quantization Residuals for Linear Disentanglement in Zero-Shot Voice Conversion Open

Youngjun Sim, Jinsung Yoon, Young-Joo Suh · 2024

Computer science Psychology Chemistry

Zero-shot voice conversion is a technique that alters the speaker identity of an input speech to match a target speaker using only a single reference utterance, without requiring additional training. Recent approaches extensively utilize s…

WKNN-Based Wi-Fi Fingerprinting with Deep Distance Metric Learning via Siamese Triplet Network for Indoor Positioning Open

Jae-Hyeon Park, Dongdeok Kim, Young-Joo Suh · 2024

Computer science Engineering Geography

Weighted k-nearest neighbor (WKNN)-based Wi-Fi fingerprinting is popular in indoor location-based services due to its ease of implementation and low computational cost. KNN-based methods rely on distance metrics to select the nearest neigh…

Exploring Public Data Vulnerabilities in Semi-Supervised Learning Models through Gray-box Adversarial Attack Open

Junhyung Jo, J.-H. Kim, Young-Joo Suh · 2024

Computer science Medicine

Semi-supervised learning (SSL) models, integrating labeled and unlabeled data, have gained prominence in vision-based tasks, yet their susceptibility to adversarial attacks remains underexplored. This paper unveils the vulnerability of SSL…

Automatic Fingerprint Data Labeling Using WiFi Signal and Smartphone Camera for Indoor Positioning Open

Dongdeok Kim, Young-Joo Suh · 2024

Computer science

WiFi fingerprinting has been one of the most practical approaches for implementing an indoor positioning system. However, the need to measure location labels for fingerprint data has hindered the deployment of WiFi fingerprint‐based positi…

AutoCycle-VC: Towards Bottleneck-Independent Zero-Shot Cross-Lingual Voice Conversion Open

Haeyun Choi, Jio Gim, Y.-J. Lee, Young‐In Kim, Young-Joo Suh · 2023

Computer science Philosophy

This paper proposes a simple and robust zero-shot voice conversion system with a cycle structure and mel-spectrogram pre-processing. Previous works suffer from information loss and poor synthesis quality due to their reliance on a carefull…

GConvLoc: WiFi Fingerprinting-Based Indoor Localization Using Graph Convolutional Networks Open

Dongdeok Kim, Young-Joo Suh · 2023

Computer science

We propose GConvLoc, a WiFi fingerprinting-based indoor localization method utilizing graph convolutional networks. Using the graph structure, we can consider the fingerprint data of the reference points and their location labels in addit…

Glocal Retriever: Glocal Image Retrieval Using the Combination of Global and Local Descriptors Open

Zeu Kim, Youngin Kim, Young-Joo Suh · 2023

Computer science Economics

Development of deep learning has led to progress in computer vision, including metric learning tasks such as image retrieval, through convolutional neural networks. In image retrieval, the metric distance (i.e., the similarity) between the…

Learning to Maximize Speech Quality Directly Using MOS Prediction for Neural Text-to-Speech Open

Yeunju Choi, Youngmoon Jung, Young-Joo Suh, Hoirin Kim · 2022

Computer science Economics Philosophy

Although recent neural text-to-speech (TTS) systems have achieved high-quality speech synthesis, there are cases where a TTS system generates low-quality speech, mainly caused by limited training data or information loss during knowledg…

Data-driven modeling reveals the Western dominance of global public interest in earthquakes Open

Jonghun Kam, Jihun Park, Wanyun Shao, Junho Song, Jinhee Kim, et al. · 2021

Political science Economics Computer science

Catastrophic earthquakes stimulate information-seeking behaviors beyond the affected geographical boundaries; however, our understanding of the dynamics of global public interest in earthquakes remains limited. Herein, we harness Big Data …

Improving Classification Accuracy of Hand Gesture Recognition Based on 60 GHz FMCW Radar with Deep Learning Domain Adaptation Open

Hyo Ryun Lee, Jihun Park, Young-Joo Suh · 2020

Computer science

With the recent development of small radars with high resolution, various human–computer interaction (HCI) applications using them have been developed. In particular, a method of applying a user’s hand gesture recognition using a short-ran…

Perceptually Guided End-to-End Text-to-Speech With MOS Prediction Open

Yeunju Choi, Youngmoon Jung, Young-Joo Suh, Hoirin Kim · 2020

Computer science Engineering Philosophy

Although recent end-to-end text-to-speech (TTS) systems have achieved high-quality speech synthesis, there are still several factors that degrade the quality of synthesized speech, including lack of training data or information loss during…

Perceptually Guided End-to-End Text-to-Speech Open

Yeunju Choi, Youngmoon Jung, Young-Joo Suh, Hoirin Kim · 2020

Computer science Engineering Psychology

Several fast text-to-speech (TTS) models have been proposed for real-time processing, but there is room for improvement in speech quality. Meanwhile, there is a mismatch between the loss function for training and the mean opinion score (MO…

Non-parallel voice conversion based on source-to-target direct mapping Open

Sung‐Hee Jung, Young-Joo Suh, Yeunju Choi, Hoirin Kim · 2020

Computer science Mathematics

Recent work utilizing phonetic posteriorgrams (PPGs) for non-parallel voice conversion has significantly increased the usability of voice conversion, since the source and target DBs are no longer required to have matching contents. In this …

Designing and Implementing an Enhanced Bluetooth Low Energy Scanner with User-Level Channel Awareness and Simultaneous Channel Scanning Open

Sangwook Bak, Young-Joo Suh · 2019

Computer science Mathematics

This paper proposes an enhanced BLE scanner with user-level channel awareness and simultaneous channel scanning to increase theoretical scanning capability by up to three times. With better scanning capability, channel analysis quality als…

An end-to-end synthesis method for Korean text-to-speech systems Open

Yeunju Choi, Youngmoon Jung, Younggwan Kim, Young-Joo Suh, Hoirin Kim · 2018

Computer science Mathematics

A typical statistical parametric speech synthesis (text-to-speech, TTS) system consists of separate modules, such as a text analysis module, an acoustic modeling module, and a speech synthesis module. This causes two problems: 1) expert kn…
