Ka‐Hou Chan
YOU?
Author Swipe
View article: Artificial intelligence-enabled automatic segmentation of impacted mandibular third molars: A comprehensive comparison of multiple algorithms
Artificial intelligence-enabled automatic segmentation of impacted mandibular third molars: A comprehensive comparison of multiple algorithms Open
View article: Modular Multi-Task Learning for Emotion-Aware Stance Inference in Online Discourse
Modular Multi-Task Learning for Emotion-Aware Stance Inference in Online Discourse Open
Stance detection on social media is increasingly vital for understanding public opinion, mitigating misinformation, and enhancing digital trust. This study proposes a modular Multi-Task Learning (MTL) framework that jointly models stance d…
View article: Image harmonization and de-harmonization based on singular value decomposition (SVD) in medical domain
Image harmonization and de-harmonization based on singular value decomposition (SVD) in medical domain Open
The proposed SVD-based harmonization and de-harmonization algorithms present a robust solution to the challenges of image variability in medical imaging. By addressing inconsistencies across different datasets and imaging modalities, while…
View article: Enhanced Localisation and Handwritten Digit Recognition Using ConvCARU
Enhanced Localisation and Handwritten Digit Recognition Using ConvCARU Open
Predicting the motion of handwritten digits in video sequences is challenging due to complex spatiotemporal dependencies, variable writing styles, and the need to preserve fine-grained visual details—all of which are essential for real-tim…
View article: MoEdit: On Learning Quantity Perception for Multi-object Image Editing
MoEdit: On Learning Quantity Perception for Multi-object Image Editing Open
Multi-object images are prevalent in various real-world scenarios, including augmented reality, advertisement design, and medical imaging. Efficient and precise editing of these images is critical for these applications. With the advent of…
View article: Contrastive learning through randomly generated dynamic supervision signals
Contrastive learning through randomly generated dynamic supervision signals Open
View article: Attention-CARU With Texture-Temporal Network for Video Depth Estimation
Attention-CARU With Texture-Temporal Network for Video Depth Estimation Open
Video depth estimation has a wide range of applications, especially in the tasks of robot navigation and autonomous driving. RNN-based encoder-decoder architectures are the most commonly used methods for depth feature prediction, but recur…
View article: GAT-Based Bi-CARU with Adaptive Feature-Based Transformation for Video Summarisation
GAT-Based Bi-CARU with Adaptive Feature-Based Transformation for Video Summarisation Open
Nowadays, video is a common social media in our lives. Video summarisation has become an interesting task for information extraction, where the challenge of high redundancy of key scenes leads to difficulties in retrieving important messag…
View article: Faster Intra-Prediction of Versatile Video Coding Using a Concatenate-Designed CNN via DCT Coefficients
Faster Intra-Prediction of Versatile Video Coding Using a Concatenate-Designed CNN via DCT Coefficients Open
As the next generation video coding standard, Versatile Video Coding (VVC) significantly improves coding efficiency over the current High-Efficiency Video Coding (HEVC) standard. In practice, this improvement comes at the cost of increased…
View article: Local feature‐based video captioning with multiple classifier and CARU‐attention
Local feature‐based video captioning with multiple classifier and CARU‐attention Open
Video captioning aims to identify multiple objects and their behaviours in a video event and generate captions for the current scene. This task aims to generate a detailed description of the current video in real‐time using natural languag…
View article: Neural Machine Translation with CARU-Embedding Layer and CARU-Gated Attention Layer
Neural Machine Translation with CARU-Embedding Layer and CARU-Gated Attention Layer Open
The attention mechanism performs well for the Neural Machine Translation (NMT) task, but heavily depends on the context vectors generated by the attention network to predict target words. This reliance raises the issue of long-term depende…
View article: Dynamic estimator selection for double‐bit‐range estimation in VVC CABAC entropy coding
Dynamic estimator selection for double‐bit‐range estimation in VVC CABAC entropy coding Open
CABAC is the only entropy coding used in Versatile Video Coding (VVC). This is achieved through multiple estimators approach that provide more accurate predictions by considering different estimated probability results, but CABAC coding re…
View article: Reconsidering the Meanings of "-Scape" in Soundscape
Reconsidering the Meanings of "-Scape" in Soundscape Open
This paper serves as an attempt to delve into and to manifest a discourse that invite readers to gaze through ideas and writings by Murray Schafer (The Soundscape: Our Sonic Environment and the Tuning of the World in 1977), Francisco López…
View article: Light‐field image super‐resolution with depth feature by multiple‐decouple and fusion module
Light‐field image super‐resolution with depth feature by multiple‐decouple and fusion module Open
Light‐field (LF) images offer the potential to improve feature capture in live scenes from multiple perspectives, and also generate additional normal vectors for performing super‐resolution (SR) image processing. With the benefit of machin…
View article: Parallel Dense Video Caption Generation with Multi-Modal Features
Parallel Dense Video Caption Generation with Multi-Modal Features Open
The task of dense video captioning is to generate detailed natural-language descriptions for an original video, which requires deep analysis and mining of semantic captions to identify events in the video. Existing methods typically follow…
View article: Fusion of Multi-Modal Features to Enhance Dense Video Caption
Fusion of Multi-Modal Features to Enhance Dense Video Caption Open
Dense video caption is a task that aims to help computers analyze the content of a video by generating abstract captions for a sequence of video frames. However, most of the existing methods only use visual features in the video and ignore…
View article: A Study of Assessment of Casinos’ Risk of Ruin in Casino Games with Poisson Distribution
A Study of Assessment of Casinos’ Risk of Ruin in Casino Games with Poisson Distribution Open
Gambling, as an uncertain business involving risks confronting casinos, is commonly analysed using the risk of ruin (ROR) formula. However, due to its brevity, the ROR does not provide any implication of nuances in terms of the distributio…
View article: Vector quantization using <i>k</i> ‐means clustering neural network
Vector quantization using <i>k</i> ‐means clustering neural network Open
Vector Quantization (VQ) is a clustering problem in the fields of signal processing, source coding, information theory etc. Taking advantage of recent advances in the field of deep neural networks, this paper investigates the performance b…
View article: Session 2C : Artificial Intelligent 2
Session 2C : Artificial Intelligent 2 Open
View article: Context-Adaptive-Based Image Captioning by Bi-CARU
Context-Adaptive-Based Image Captioning by Bi-CARU Open
Image captions are abstract expressions of content representations using text sentences, helping readers to better understand and analyse information between different media. With the advantage of encoder-decoder neural networks, captions …
View article: Neural Optimizer for Inverse Design of Complex‐Modulated Hologram Implemented by Plasmonic Metasurfaces
Neural Optimizer for Inverse Design of Complex‐Modulated Hologram Implemented by Plasmonic Metasurfaces Open
Inverse design of a metasurface involves searching parameters in a high‐dimensional space, which needs huge computational power. To ease the computational burden, neural network, a well‐researched computer science stream, has demonstrated …
View article: A propagation model for package loss refinement in VVC
A propagation model for package loss refinement in VVC Open
A propagation model for the CABAC entropy codec for VTM is proposed and analysed in terms of its effective estimation and packing loss. In order to be compatible and implement a next‐generation coding framework, the proposed model is desig…
View article: Double bit range estimation with eight estimators for CABAC in VVC
Double bit range estimation with eight estimators for CABAC in VVC Open
This work describes the modification of Context‐based Adaptive Binary Arithmetic Coding (CABAC) using the double bit range estimation in the VVC engine and the consideration of range updates by using eight hypothetical probability estimato…
View article: Sentiment analysis by using Naïve‐Bayes classifier with stacked CARU
Sentiment analysis by using Naïve‐Bayes classifier with stacked CARU Open
A long sequence always contains long‐term dependency problems, which leads to paragraph‐based sentiment analysis being a very challenging task and difficult to evaluate by using a simple RNN network. It is proposed in this letter to use a …
View article: Validating an inertial measurement unit for cricket fast bowling: a first step in assessing the feasibility of diagnosing back injury risk in cricket fast bowlers during a tele-sport-and-exercise medicine consultation
Validating an inertial measurement unit for cricket fast bowling: a first step in assessing the feasibility of diagnosing back injury risk in cricket fast bowlers during a tele-sport-and-exercise medicine consultation Open
This study aimed to validate an array-based inertial measurement unit to measure cricket fast bowling kinematics as a first step in assessing feasibility for tele-sport-and-exercise medicine. We concurrently captured shoulder girdle relati…
View article: A Multilayer CARU Framework to Obtain Probability Distribution for Paragraph-Based Sentiment Analysis
A Multilayer CARU Framework to Obtain Probability Distribution for Paragraph-Based Sentiment Analysis Open
Paragraph-based datasets are hard to analyze by a simple RNN, because a long sequence always contains lengthy problems of long-term dependencies. In this work, we propose a Multilayer Content-Adaptive Recurrent Unit (CARU) network for para…
View article: Pattern Matching Based on Object Graphs
Pattern Matching Based on Object Graphs Open
Pattern matching has been widely adopted in functional programming languages, and is gradually getting popular in OO languages, from Scala to Python. The structural pattern matching currently in use has its foundation on algebraic data typ…
View article: Chebyshev Ambient Occlusion
Chebyshev Ambient Occlusion Open
Ambient Occlusion (AO) is a widely used shadowing technique in 3D rendering. One of the main disadvantages of using it is that it requires not only the surface depth but also the normal vector, which usually causes severe aliasing. This wo…
View article: Effect of Thickness on the Optical and Electrical Properties of ITO/Au/ITO Sandwich Structures
Effect of Thickness on the Optical and Electrical Properties of ITO/Au/ITO Sandwich Structures Open
Tin-doped indium oxide (ITO)/Au/ITO sandwich structures with varying top and bottom ITO film thicknesses were deposited by magnetron sputtering. The effects of varying thickness of the two ITO films on the structural, electrical, and optic…
View article: Pilot study on comparisons between the effectiveness of mobile video-guided and paper-based home exercise programs on improving exercise adherence, self-efficacy for exercise and functional outcomes of patients with stroke with 3-month follow-up: A single-blind randomized controlled trial
Pilot study on comparisons between the effectiveness of mobile video-guided and paper-based home exercise programs on improving exercise adherence, self-efficacy for exercise and functional outcomes of patients with stroke with 3-month follow-up: A single-blind randomized controlled trial Open
Objective: To compare the effectiveness of mobile video-guided home exercise program and standard paper-based home exercise program. Methods: Eligible participants were randomly assigned to either experimental group with mobile video-guide…