Jenq‐Kuen Lee
YOU?
Author Swipe
View article: TreeHouse: An MLIR-based Compilation Flow for Real-Time Tree-based Inference
TreeHouse: An MLIR-based Compilation Flow for Real-Time Tree-based Inference Open
Tree-based ensembles stand as the prominent resource-efficient approaches for real-time inference. To optimize their performance, researchers have developed several solutions to accommodate their unique program structure, i.e., consecutive…
View article: Rewriting and Optimizing Vector Length Agnostic Intrinsics from Arm SVE to RVV
Rewriting and Optimizing Vector Length Agnostic Intrinsics from Arm SVE to RVV Open
Advanced processors incorporate SIMD extensions to execute data-parallel operations efficiently. As technology advances, new generations of SIMD extensions evolve with longer vector register lengths, making target-specific program non-port…
View article: Case Study: Optimization Methods With TVM Hybrid-OP on RISC-V Packed SIMD
Case Study: Optimization Methods With TVM Hybrid-OP on RISC-V Packed SIMD Open
In recent years, considerable research has focused on the use of custom hardware to accelerate deep learning on edge devices. However, the end-to-end flow of deep learning includes preprocessing and postprocessing. Deep learning hardware a…
View article: Accelerating AI performance with the incorporation of TVM and MediaTek NeuroPilot
Accelerating AI performance with the incorporation of TVM and MediaTek NeuroPilot Open
The continuing prominence of machine learning has led to an increased focus on enhancing the inference performance of edge devices to reduce latency and improve efficiency. Two widely adopted strategies for accelerating computational perfo…
View article: SIMD Everywhere Optimization from ARM NEON to RISC-V Vector Extensions
SIMD Everywhere Optimization from ARM NEON to RISC-V Vector Extensions Open
Many libraries, such as OpenCV, FFmpeg, XNNPACK, and Eigen, utilize Arm or x86 SIMD Intrinsics to optimize programs for performance. With the emergence of RISC-V Vector Extensions (RVV), there is a need to migrate these performance legacy …
View article: Support of MISRA C++ Analyzer for Reliability of Embedded Systems
Support of MISRA C++ Analyzer for Reliability of Embedded Systems Open
Cyber-Physical Systems (CPS) are increasingly used in many complex applications, such as autonomous delivery drones, the automotive CPS design, power grid control systems, and medical robotics. However, existing programming languages lack …
View article: Guest Editorial: Special Issue on Systems Optimizations for DSP and AI Applications
Guest Editorial: Special Issue on Systems Optimizations for DSP and AI Applications Open
View article: Efficient Realization of Decision Trees for Real-Time Inference
Efficient Realization of Decision Trees for Real-Time Inference Open
For timing-sensitive edge applications, the demand for efficient lightweight machine learning solutions has increased recently. Tree ensembles are among the state-of-the-art in many machine learning applications. While single decision tree…
View article: Case Study: Design Strategies for Enabling Visual Application Blocks of Bluetooth Library
Case Study: Design Strategies for Enabling Visual Application Blocks of Bluetooth Library Open
Block-based tools can make it easier for beginners to learn programming by arranging blocks. Their block concept and extensible characteristics make block-based designs suitable for introductory programming. However, block-based tools are …
View article: Guest Editorial: Special Issue on Embedded Multicore Applications and Optimization
Guest Editorial: Special Issue on Embedded Multicore Applications and Optimization Open