Explanipedia

Artificial intelligence-enabled automatic segmentation of impacted mandibular third molars: A comprehensive comparison of multiple algorithms Open

Shadow Yeung, Wai Ying Kot, Yiu Yan Leung, Ka‐Hou Chan, Pui Hang Leung, et al. · 2025

Modular Multi-Task Learning for Emotion-Aware Stance Inference in Online Discourse Open

Sio‐Kei Im, Ka‐Hou Chan · 2025

Stance detection on social media is increasingly vital for understanding public opinion, mitigating misinformation, and enhancing digital trust. This study proposes a modular Multi-Task Learning (MTL) framework that jointly models stance d…

Image harmonization and de-harmonization based on singular value decomposition (SVD) in medical domain Open

H. Q. Chen, Xinze Li, Ka‐Hou Chan, Yue Sun, Rongsheng Wang, et al. · 2025

The proposed SVD-based harmonization and de-harmonization algorithms present a robust solution to the challenges of image variability in medical imaging. By addressing inconsistencies across different datasets and imaging modalities, while…

Enhanced Localisation and Handwritten Digit Recognition Using ConvCARU Open

Sio‐Kei Im, Ka‐Hou Chan · 2025

Predicting the motion of handwritten digits in video sequences is challenging due to complex spatiotemporal dependencies, variable writing styles, and the need to preserve fine-grained visual details—all of which are essential for real-tim…

MoEdit: On Learning Quantity Perception for Multi-object Image Editing Open

Y. Li, Ka‐Hou Chan, Yue Sun, Chan‐Tong Lam, Tong Tong, et al. · 2025

Multi-object images are prevalent in various real-world scenarios, including augmented reality, advertisement design, and medical imaging. Efficient and precise editing of these images is critical for these applications. With the advent of…

Contrastive learning through randomly generated dynamic supervision signals Open

Shibo Wang, Yaofei Duan, Zili Ma, Ka‐Hou Chan, Yue Sun, et al. · 2025

Attention-CARU With Texture-Temporal Network for Video Depth Estimation Open

Sio‐Kei Im, Ka‐Hou Chan · 2025

Video depth estimation has a wide range of applications, especially in the tasks of robot navigation and autonomous driving. RNN-based encoder-decoder architectures are the most commonly used methods for depth feature prediction, but recur…

GAT-Based Bi-CARU with Adaptive Feature-Based Transformation for Video Summarisation Open

Ka‐Hou Chan, Sio‐Kei Im · 2024

Nowadays, video is a common social media in our lives. Video summarisation has become an interesting task for information extraction, where the challenge of high redundancy of key scenes leads to difficulties in retrieving important messag…

Faster Intra-Prediction of Versatile Video Coding Using a Concatenate-Designed CNN via DCT Coefficients Open

Sio‐Kei Im, Ka‐Hou Chan · 2024

As the next generation video coding standard, Versatile Video Coding (VVC) significantly improves coding efficiency over the current High-Efficiency Video Coding (HEVC) standard. In practice, this improvement comes at the cost of increased…

Local feature‐based video captioning with multiple classifier and CARU‐attention Open

Sio‐Kei Im, Ka‐Hou Chan · 2024

Video captioning aims to identify multiple objects and their behaviours in a video event and generate captions for the current scene. This task aims to generate a detailed description of the current video in real‐time using natural languag…

Neural Machine Translation with CARU-Embedding Layer and CARU-Gated Attention Layer Open

Sio‐Kei Im, Ka‐Hou Chan · 2024

The attention mechanism performs well for the Neural Machine Translation (NMT) task, but heavily depends on the context vectors generated by the attention network to predict target words. This reliance raises the issue of long-term depende…

Dynamic estimator selection for double‐bit‐range estimation in VVC CABAC entropy coding Open

Sio‐Kei Im, Ka‐Hou Chan · 2024

CABAC is the only entropy coding used in Versatile Video Coding (VVC). This is achieved through multiple estimators approach that provide more accurate predictions by considering different estimated probability results, but CABAC coding re…

Reconsidering the Meanings of "-Scape" in Soundscape Open

Ka‐Hou Chan, Murray Schafer, Garth Paine · 2024

This paper serves as an attempt to delve into and to manifest a discourse that invite readers to gaze through ideas and writings by Murray Schafer (The Soundscape: Our Sonic Environment and the Tuning of the World in 1977), Francisco López…

Light‐field image super‐resolution with depth feature by multiple‐decouple and fusion module Open

Ka‐Hou Chan, Sio‐Kei Im · 2024

Light‐field (LF) images offer the potential to improve feature capture in live scenes from multiple perspectives, and also generate additional normal vectors for performing super‐resolution (SR) image processing. With the benefit of machin…

Parallel Dense Video Caption Generation with Multi-Modal Features Open

Xuefei Huang, Ka‐Hou Chan, Wei Ke, Hao Sheng · 2023

The task of dense video captioning is to generate detailed natural-language descriptions for an original video, which requires deep analysis and mining of semantic captions to identify events in the video. Existing methods typically follow…

Fusion of Multi-Modal Features to Enhance Dense Video Caption Open

Xuefei Huang, Ka‐Hou Chan, Weifan Wu, Hao Sheng, Wei Ke · 2023

Dense video caption is a task that aims to help computers analyze the content of a video by generating abstract captions for a sequence of video frames. However, most of the existing methods only use visual features in the video and ignore…

A Study of Assessment of Casinos’ Risk of Ruin in Casino Games with Poisson Distribution Open

Ka-Meng Siu, Ka‐Hou Chan, Sio‐Kei Im · 2023

Gambling, as an uncertain business involving risks confronting casinos, is commonly analysed using the risk of ruin (ROR) formula. However, due to its brevity, the ROR does not provide any implication of nuances in terms of the distributio…

Vector quantization using k‐means clustering neural network Open

Sio‐Kei Im, Ka‐Hou Chan · 2023

Vector Quantization (VQ) is a clustering problem in the fields of signal processing, source coding, information theory etc. Taking advantage of recent advances in the field of deep neural networks, this paper investigates the performance b…

Session 2C : Artificial Intelligent 2 Open

Sio‐Kei Im, Ka‐Hou Chan, Vu Viet Thang, M. Liu, Mr Gao, et al. · 2023

Context-Adaptive-Based Image Captioning by Bi-CARU Open

Sio‐Kei Im, Ka‐Hou Chan · 2023

Image captions are abstract expressions of content representations using text sentences, helping readers to better understand and analyse information between different media. With the advantage of encoder-decoder neural networks, captions …

Neural Optimizer for Inverse Design of Complex‐Modulated Hologram Implemented by Plasmonic Metasurfaces Open

Huade Mao, Yue Yu, Yu‐Xuan Ren, Ka‐Hou Chan, Jiqiang Kang, et al. · 2022

Inverse design of a metasurface involves searching parameters in a high‐dimensional space, which needs huge computational power. To ease the computational burden, neural network, a well‐researched computer science stream, has demonstrated …

A propagation model for package loss refinement in VVC Open

Sio‐Kei Im, Ka‐Hou Chan · 2022

A propagation model for the CABAC entropy codec for VTM is proposed and analysed in terms of its effective estimation and packing loss. In order to be compatible and implement a next‐generation coding framework, the proposed model is desig…

Double bit range estimation with eight estimators for CABAC in VVC Open

Ka‐Hou Chan, Sio‐Kei Im · 2022

This work describes the modification of Context‐based Adaptive Binary Arithmetic Coding (CABAC) using the double bit range estimation in the VVC engine and the consideration of range updates by using eight hypothetical probability estimato…

Sentiment analysis by using Naïve‐Bayes classifier with stacked CARU Open

Ka‐Hou Chan, Sio‐Kei Im · 2022

A long sequence always contains long‐term dependency problems, which leads to paragraph‐based sentiment analysis being a very challenging task and difficult to evaluate by using a simple RNN network. It is proposed in this letter to use a …

Validating an inertial measurement unit for cricket fast bowling: a first step in assessing the feasibility of diagnosing back injury risk in cricket fast bowlers during a tele-sport-and-exercise medicine consultation Open

Keegan Harnett, Brenda Plint, Ka‐Hou Chan, Benjamin Y. Clark, Kevin Netto, et al. · 2022

This study aimed to validate an array-based inertial measurement unit to measure cricket fast bowling kinematics as a first step in assessing feasibility for tele-sport-and-exercise medicine. We concurrently captured shoulder girdle relati…

A Multilayer CARU Framework to Obtain Probability Distribution for Paragraph-Based Sentiment Analysis Open

Wei Ke, Ka‐Hou Chan · 2021

Paragraph-based datasets are hard to analyze by a simple RNN, because a long sequence always contains lengthy problems of long-term dependencies. In this work, we propose a Multilayer Content-Adaptive Recurrent Unit (CARU) network for para…

Pattern Matching Based on Object Graphs Open

Wei Ke, Ka‐Hou Chan · 2021

Pattern matching has been widely adopted in functional programming languages, and is gradually getting popular in OO languages, from Scala to Python. The structural pattern matching currently in use has its foundation on algebraic data typ…

Chebyshev Ambient Occlusion Open

Ka‐Hou Chan, Sio‐Kei Im · 2021

Ambient Occlusion (AO) is a widely used shadowing technique in 3D rendering. One of the main disadvantages of using it is that it requires not only the surface depth but also the normal vector, which usually causes severe aliasing. This wo…

Effect of Thickness on the Optical and Electrical Properties of ITO/Au/ITO Sandwich Structures Open

Ka Kin Lam, Sheung Mei Ng, Hon Fai Wong, Linfeng Fei, Yukuai Liu, et al. · 2020

Tin-doped indium oxide (ITO)/Au/ITO sandwich structures with varying top and bottom ITO film thicknesses were deposited by magnetron sputtering. The effects of varying thickness of the two ITO films on the structural, electrical, and optic…

Pilot study on comparisons between the effectiveness of mobile video-guided and paper-based home exercise programs on improving exercise adherence, self-efficacy for exercise and functional outcomes of patients with stroke with 3-month follow-up: A single-blind randomized controlled trial Open

Bryan Ping Ho Chung, Wendy Kam Ha Chiang, Herman Lau, Titanic Fuk On Lau, Chen-Ling Lai, et al. · 2020

Objective: To compare the effectiveness of mobile video-guided home exercise program and standard paper-based home exercise program. Methods: Eligible participants were randomly assigned to either experimental group with mobile video-guide…
