Explanipedia

Practicalizing Tree-Based Model Acceleration with CAM through Model Pruning and Data Placement Optimization Open

Yi-Chun Liao, Chieh-Lin Tsai, Yuan-Hao Chang, Camélia Slimani, Jalil Boukhobza , et al. · 2025

International audience

ReCross: Efficient Embedding Reduction Scheme for In-Memory Computing using ReRAM-Based Crossbar Open

Yu‐Hong Lai, Chieh-Lin Tsai, Wen Sheng Lim, Han-Wen Hu, Tei‐Wei Kuo , et al. · 2025

Deep learning-based recommendation models (DLRMs) are widely deployed in commercial applications to enhance user experience. However, the large and sparse embedding layers in these models impose substantial memory bandwidth bottlenecks due…

Retrieval-Augmented Generation for Natural Language Processing: A Survey Open

Shangyu Wu, Ying Xiong, Yufei Cui, Haolun Wu, Can Chen , et al. · 2025

Computer science Geography

Large language models (LLMs) have demonstrated great success in various fields, benefiting from their huge amount of parameters that store knowledge.However, LLMs still suffer from several key issues, such as hallucination problems, knowle…

RETENTION: Resource-Efficient Tree-Based Ensemble Model Acceleration with Content-Addressable Memory Open

Yi-Chun Liao, Chieh-Lin Tsai, Yuan-Hao Chang, Camélia Slimani, Jalil Boukhobza , et al. · 2025

Although deep learning has demonstrated remarkable capabilities in learning from unstructured data, modern tree-based ensemble models remain superior in extracting relevant information and learning from structured datasets. While several e…

Easz: An Agile Transformer-based Image Compression Framework for Resource-constrained IoTs Open

Yu Mao, Jingzong Li, Jun Wang, Hong Xu, Tei‐Wei Kuo , et al. · 2025

Neural image compression, necessary in various machine-to-machine communication scenarios, suffers from its heavy encode-decode structures and inflexibility in switching between different compression levels. Consequently, it raises signifi…

Search-in-Memory (SiM): Reliable, Versatile, and Efficient Data Matching in SSD's NAND Flash Memory Chip for Data Indexing Acceleration Open

Yun-Chih Chen, Yuan-Hao Chang, Tei‐Wei Kuo · 2024

Computer science Mathematics Physics

To index the increasing volume of data, modern data indexes are typically stored on SSDs and cached in DRAM. However, searching such an index has resulted in significant I/O traffic due to limited access locality and inefficient cache util…

Retrieval-Augmented Generation for Natural Language Processing: A Survey Open

Shangyu Wu, Ying Xiong, Yufei Cui, Haolun Wu, Can Chen , et al. · 2024

Computer science Geography

Large language models (LLMs) have demonstrated great success in various fields, benefiting from their huge amount of parameters that store knowledge. However, LLMs still suffer from several key issues, such as hallucination problems, knowl…

RAEE: A Robust Retrieval-Augmented Early Exiting Framework for Efficient Inference Open

Lianming Huang, Shangyu Wu, Yufei Cui, Ying Xiong, Xue Liu , et al. · 2024

Computer science Geography

Deploying large language model inference remains challenging due to their high computational overhead. Early exiting optimizes model inference by adaptively reducing the number of inference layers. Existing methods typically train internal…

ReFusion: Improving Natural Language Understanding with Computation-Efficient Retrieval Representation Fusion Open

Shangyu Wu, Ying Xiong, Yufei Cui, Xue Liu, Buzhou Tang , et al. · 2024

Computer science Political science Engineering

Retrieval-based augmentations (RA) incorporating knowledge from an external database into language models have greatly succeeded in various knowledge-intensive (KI) tasks. However, integrating retrievals in non-knowledge-intensive (NKI) ta…

Pipette: Efficient Fine-Grained Reads for SSDs Open

Shuhan Bai, Hu Wan, Yun Huang, Xuan Sun, Fei Wu , et al. · 2023

Computer science Mathematics Chemistry

Big data applications, such as recommendation system and social network, often generate a huge number of fine-grained reads to the storage. Block-oriented storage devices upon the traditional storage system rely on the paging mechanism to …

BiTrackGAN: Cascaded CycleGANs to Constraint Face Aging Open

Tsung-Han Kuo, Zhenge Jia, Tei‐Wei Kuo, Jingtong Hu · 2023

Computer science Mathematics Philosophy

With the increased accuracy of modern computer vision technology, many access control systems are equipped with face recognition functions for faster identification. In order to maintain high recognition accuracy, it is necessary to keep t…

Variational Nested Dropout Open

Yufei Cui, Yu Mao, Ziquan Liu, Qiao Li, Antoni B. Chan , et al. · 2023

Computer science

Nested dropout is a variant of dropout operation that is able to order network parameters or features based on the pre-defined importance during training. It has been explored for: I. Constructing nested nets Cui et al. 2020, Cui et al. 20…

Bits-Ensemble: Toward Light-Weight Robust Deep Ensemble by Bits-Sharing Open

Yufei Cui, Shang-Yu Wu, Qiao Li, Antoni B. Chan, Tei‐Wei Kuo , et al. · 2022

Computer science Chemistry

Robustness and uncertainty estimation is crucial to the safety of deep neural networks (DNNs) deployed on the edge. The deep ensemble model, composed of a set of individual DNNs (namely members), has strong performance in accuracy, uncerta…

Message from the General and Program Chairs Open

Tei‐Wei Kuo, Jen-Wei Hsieh · 2022

Computer science Engineering

The 11th IEEE Non-Volatile Memory Systems and Application Symposium (NVMSA) is a premier conference for new ideas and research results in the area of non-volatile memory systems and emerging memory technologies.This year, NVMSA was held hy…

Pipette Open

Shuhan Bai, Hu Wan, Yun Huang, Xuan Sun, Fei Wu , et al. · 2022

Computer science Chemistry Engineering

Big data applications, such as recommendation system and social network, often generate a huge number of fine-grained reads to the storage. Block-oriented storage devices tend to suffer from these fine-grained read operations in terms of I…

NFL: Robust Learned Index via Distribution Transformation Open

Shang-Yu Wu, Yufei Cui, Jinghuan Yu, Xuan Sun, Tei‐Wei Kuo , et al. · 2022

Computer science Mathematics Chemistry

Recent works on learned index open a new direction for the indexing field. The key insight of the learned index is to approximate the mapping between keys and positions with piece-wise linear functions. Such methods require partitioning ke…

RM-SSD: In-Storage Computing for Large-Scale Recommendation Inference Open

Xuan Sun, Hu Wan, Qiao Li, Chia-Lin Yang, Tei‐Wei Kuo , et al. · 2022

Computer science Physics

To meet the strict service level agreement requirements of recommendation systems, the entire set of embeddings in recommendation systems needs to be loaded into the memory. However, as the model and dataset for production-scale recommenda…

A Fast Transformer-based General-Purpose Lossless Compressor Open

Yu M, Yufei Cui, Tei‐Wei Kuo, Chun Jason Xue · 2022

Computer science Engineering

Deep-learning-based compressor has received interests recently due to much improved compression ratio. However, modern approaches suffer from long execution time. To ease this problem, this paper targets on cutting down the execution time …

SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points Open

Yu-Chen Lin, Cheng Yu, Yi‐Te Hsu, Szu‐Wei Fu, Yu Tsao , et al. · 2021

Computer science Mathematics

Numerous compression and acceleration strategies have achieved outstanding results on classification tasks in various fields. Nevertheless, the same strategies may yield unsatisfactory performance on regression tasks because the nature bet…

SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points Open

Yu‐Chen Lin, Cheng Yu, Yi‐Te Hsu, Szu‐Wei Fu, Yu Tsao , et al. · 2021

Computer science

Numerous compression and acceleration strategies have achieved outstanding results on classification tasks in various fields, such as computer vision and speech signal processing. Nevertheless, the same strategies have yielded ungratified …

Intermittent Speech Recovery. Open

Yu-Chen Lin, Tsun-An Hsieh, Kuo-Hsuan Hung, Cheng Yu, Harinath Garudadri , et al. · 2021

Computer science Mathematics Philosophy

A large number of Internet of Things (IoT) devices today are powered by batteries, which are often expensive to maintain and may cause serious environmental pollution. To avoid these problems, researchers have begun to consider the use of …

Speech Recovery for Real-World Self-powered Intermittent Devices Open

Yu-Chen Lin, Tsun-An Hsieh, Kuo-Hsuan Hung, Cheng Yu, Harinath Garudadri , et al. · 2021

Computer science Art Philosophy

The incompleteness of speech inputs severely degrades the performance of all the related speech signal processing applications. Although many researches have been proposed to address this issue, they controlled the data missing conditions …

PASSLEAF: A Pool-bAsed Semi-Supervised LEArning Framework for Uncertain Knowledge Graph Embedding Open

Zhu-Mu Chen, Mi-Yen Yeh, Tei‐Wei Kuo · 2021

Computer science Mathematics Psychology

In this paper, we study the problem of embedding uncertain knowledge graphs, where each relation between entities is associated with a confidence score. Observing the existing embedding methods may discard the uncertainty information, only…

Variational Nested Dropout Open

Yufei Cui, Yu M, Ziquan Liu, Qiao Li, Antoni B. Chan , et al. · 2021

Computer science Political science

Nested dropout is a variant of dropout operation that is able to order network parameters or features based on the pre-defined importance during training. It has been explored for: I. Constructing nested nets: the nested nets are neural ne…

Fully Nested Neural Network for Adaptive Compression and Quantization Open

Yufei Cui, Ziquan Liu, Wuguannan Yao, Qiao Li, Antoni B. Chan , et al. · 2020

Computer science

Neural network compression and quantization are important tasks for fitting state-of-the-art models into the computational, memory and power constraints of mobile devices and embedded hardware. Recent approaches to model compression/quanti…

Spatiotemporal Super-Resolution with Cross-Task Consistency and Its Semi-supervised Extension Open

Han-Yi Lin, Pi-Cheng Hsiu, Tei‐Wei Kuo, Yen‐Yu Lin · 2020

Computer science

Spatiotemporal super-resolution (SR) aims to upscale both the spatial and temporal dimensions of input videos, and produces videos with higher frame resolutions and rates. It involves two essential sub-tasks: spatial SR and temporal SR. We…

Tei‐Wei Kuo YOU? Author Swipe