Explanipedia

TOM: An Open-Source Tongue Segmentation Method with Multi-Teacher Distillation and Task-Specific Data Augmentation Open

Jiacheng Xie, Ziyang Zhang, Biplab Poudel, Caili Guo, Yu Yang , et al. · 2025

Tongue imaging serves as a valuable diagnostic tool, particularly in Traditional Chinese Medicine (TCM). The quality of tongue surface segmentation significantly affects the accuracy of tongue image classification and subsequent diagnosis …

RIDAS: A Multi-Agent Framework for AI-RAN with Representation- and Intention-Driven Agents Open

Kuiyuan Ding, Caili Guo, Yang Yang, Jun Guo · 2025

Sixth generation (6G) networks demand tight integration of artificial intelligence (AI) into radio access networks (RANs) to meet stringent quality of service (QoS) and resource efficiency requirements. Existing solutions struggle to bridg…

Remote Sensing-Based Land Use/Land Cover and Photovoltaic Panel Classification via Google Earth Engine and Deep Learning Open

Lei Ren, Fugang Li, Jianqiong Wang, Xiaojun Guan, L. Xue , et al. · 2025

This study presents an integrated approach for land use and land cover (LULC) and photovoltaic panel mapping in Qinghai Province by combining Google Earth Engine (GEE) with deep neural network (DNN) modelling. Time-series, cloud-free Lands…

Lightweight Task-Oriented Semantic Communication Empowered by Large-Scale AI Models Open

Chuanhong Liu, Caili Guo, Yang Yang, Mingzhe Chen, Tony Q. S. Quek · 2025

Recent studies have focused on leveraging large-scale artificial intelligence (LAI) models to improve semantic representation and compression capabilities. However, the substantial computational demands of LAI models pose significant chall…

The value of cytokines in evaluating the efficacy of glucocorticoids in the treatment of severe Mycoplasma pneumoniae pneumonia in children Open

Zhaoqian Shan, Wanyu Jia, Shuqin Fu, Caili Guo, Chunlan Song · 2025

Real-Time Oil Spill Concentration Assessment Through Fluorescence Imaging and Deep Learning Open

Biplab Poudel, Jiacheng Xie, Caili Guo, Olivia E Watt, Erin L. Pulster , et al. · 2025

Conformal Distributed Remote Inference in Sensor Networks Under Reliability and Communication Constraints Open

Meiyi Zhu, Matteo Zecchin, Sangwoo Park, Caili Guo, Chunyan Feng , et al. · 2024

This paper presents communication-constrained distributed conformal risk control (CD-CRC) framework, a novel decision-making framework for sensor networks under communication constraints. Targeting multi-label classification problems, such…

On the Impact of Uncertainty and Calibration on Likelihood-Ratio Membership Inference Attacks Open

Meiyi Zhu, Caili Guo, Chunyan Feng, Osvaldo Simeone · 2024

In a membership inference attack (MIA), an attacker exploits the overconfidence exhibited by typical machine learning models to determine whether a specific data point was used to train a target model. In this paper, we analyze the perform…

A Survey on Indoor Visible Light Positioning Systems: Fundamentals, Applications, and Challenges Open

Zhiyu Zhu, Yang Yang, Mingzhe Chen, Caili Guo, Julian Cheng , et al. · 2024

The growing demand for location-based services in areas like virtual reality, robot control, and navigation has intensified the focus on indoor localization. Visible light positioning (VLP), leveraging visible light communications (VLC), b…

OFDM-Based Digital Semantic Communication with Importance Awareness Open

Chuanhong Liu, Caili Guo, Yang Yang, Wanli Ni, Tony Q. S. Quek · 2024

Semantic communication (SemCom) has received considerable attention for its ability to reduce data transmission size while maintaining task performance. However, existing works mainly focus on analog SemCom with simple channel models, whic…

Multi-View Visual Semantic Embedding for Cross-Modal Image-Text Retrieval Open

Zheng Li, Caili Guo, Xin Wang, Hao Zhang, Lin Hu · 2024

Federated Inference With Reliable Uncertainty Quantification Over Wireless Channels via Conformal Prediction Open

Meiyi Zhu, Matteo Zecchin, Sangwoo Park, Caili Guo, Chunyan Feng , et al. · 2024

In this paper, we consider a wireless federated inference scenario in which devices and a server share a pre-trained machine learning model. The devices communicate statistical information about their local data to the server over a common…

Revisiting Hard Negative Mining in Contrastive Learning for Visual Understanding Open

Hao Zhang, Zheng Li, Jiahui Yang, Xin Wang, Caili Guo , et al. · 2023

Efficiently mining and distinguishing hard negatives is the key to Contrastive Learning (CL) in various visual understanding tasks. By properly emphasizing the penalty of hard negatives, Hard Negative Mining (HNM) can improve the CL perfor…

Boundary-Aware Proposal Generation Method for Temporal Action Localization Open

Hao Zhang, Feng Chun-yan, Jiahui Yang, Zheng Li, Caili Guo · 2023

The goal of Temporal Action Localization (TAL) is to find the categories and temporal boundaries of actions in an untrimmed video. Most TAL methods rely heavily on action recognition models that are sensitive to action labels rather than t…

Disentangled Information Bottleneck guided Privacy-Protective JSCC for Image Transmission Open

Lunan Sun, Yang Yang, Mingzhe Chen, Caili Guo · 2023

Joint source and channel coding (JSCC) has attracted increasing attention due to its robustness and high efficiency. However, JSCC is vulnerable to privacy leakage due to the high relevance between the source image and channel input. In th…

Privacy-Aware Joint Source-Channel Coding for image transmission based on Disentangled Information Bottleneck Open

Lunan Sun, Caili Guo, Mingzhe Chen, Yang Yang · 2023

Current privacy-aware joint source-channel coding (JSCC) works aim at avoiding private information transmission by adversarially training the JSCC encoder and decoder under specific signal-to-noise ratios (SNRs) of eavesdroppers. However, …

Federated Inference with Reliable Uncertainty Quantification over Wireless Channels via Conformal Prediction Open

Meiyi Zhu, Matteo Zecchin, Sangwoo Park, Caili Guo, Chunyan Feng , et al. · 2023

In this paper, we consider a wireless federated inference scenario in which devices and a server share a pre-trained machine learning model. The devices communicate statistical information about their local data to the server over a common…

Integrating Listwise Ranking into Pairwise-based Image-Text Retrieval Open

Zheng Li, Caili Guo, Xin Wang, Zerun Feng, Yanjun Wang · 2023

Image-Text Retrieval (ITR) is essentially a ranking problem. Given a query caption, the goal is to rank candidate images by relevance, from large to small. The current ITR datasets are constructed in a pairwise manner. Image-text pairs are…

Selectively Hard Negative Mining for Alleviating Gradient Vanishing in Image-Text Matching Open

Zheng Li, Caili Guo, Xin Wang, Zerun Feng, Zhongtian Du · 2023

Recently, a series of Image-Text Matching (ITM) methods achieve impressive performance. However, we observe that most existing ITM models suffer from gradients vanishing at the beginning of training, which makes these models prone to falli…

Deep Joint Source-Channel Coding for Wireless Image Transmission with Semantic Importance Open

Qizheng Sun, Caili Guo, Yang Yang, Jiujiu Chen, Rui Tang , et al. · 2023

The sixth-generation mobile communication system proposes the vision of smart interconnection of everything, which requires accomplishing communication tasks while ensuring the performance of intelligent tasks. A joint source-channel codin…

Physical Layer Authentication Based on Channel Polarization Response in Dual-Polarized Antenna Communication Systems Open

Yuemei Wu, Dong Wei, Caili Guo, Weiqing Huang · 2023

This study presents a novel approach for physical layer authentication based on channel polarization response (CPR). CPR is sensitive to variation in the physical properties of scatterers, and the CPR difference between various channels is…

Information Bottleneck-Inspired Type Based Multiple Access for Remote Estimation in IoT Systems Open

Meiyi Zhu, Chunyan Feng, Caili Guo, Zhe Liu, Nan Jiang , et al. · 2022

Type-based multiple access (TBMA) is a semantics-aware multiple access protocol for remote inference. In TBMA, codewords are reused across transmitting sensors, with each codeword being assigned to a different observation value. Existing T…

Joint design of ordered QR precoding and SIC detection for MIMO VLC systems Open

Congcong Wang, Chunyan Feng, Yang Yang, Caili Guo, Bowen Jia · 2022

Ordered successive interference cancellation (OSIC) detection has been investigated to mitigate the high spatial correlation for multiple-input multiple-output (MIMO) visible light communication (VLC) systems. However, existing OSIC scheme…

Image-Text Retrieval with Binary and Continuous Label Supervision Open

Zheng Li, Caili Guo, Zerun Feng, Jenq–Neng Hwang, Ying Jin , et al. · 2022

Most image-text retrieval work adopts binary labels indicating whether a pair of image and text matches or not. Such a binary indicator covers only a limited subset of image-text semantic relations, which is insufficient to represent relev…

Unified Loss of Pair Similarity Optimization for Vision-Language Retrieval Open

Zheng Li, Caili Guo, Xin Wang, Zerun Feng, Jenq–Neng Hwang , et al. · 2022

There are two popular loss functions used for vision-language retrieval, i.e., triplet loss and contrastive learning loss, both of them essentially minimize the difference between the similarities of negative pairs and positive pairs. More…

Deep Joint Source-Channel Coding Based on Semantics of Pixels Open

Qizheng Sun, Caili Guo, Yang Yang, Jiujiu Chen, Rui Tang , et al. · 2022

The semantic information of the image for intelligent tasks is hidden behind the pixels, and slight changes in the pixels will affect the performance of intelligent tasks. In order to preserve semantic information behind pixels for intelli…

Adaptable Semantic Compression and Resource Allocation for Task-Oriented Communications Open

Chuanhong Liu, Caili Guo, Yang Yang, Nan Jiang · 2022

Task-oriented communication is a new paradigm that aims at providing efficient connectivity for accomplishing intelligent tasks rather than the reception of every transmitted bit. In this paper, a deep learning-based task-oriented communic…

Positioning Using Visible Light Communications: A Perspective Arcs Approach Open

Zhiyu Zhu, Caili Guo, Rongzhen Bao, Mingzhe Chen, Walid Saad , et al. · 2022

Visible light positioning (VLP) is an accurate indoor positioning technology that uses luminaires as transmitters. In particular, circular luminaires are a common source type for VLP, that are typically treated only as point sources for po…

Adaptive Information Bottleneck Guided Joint Source and Channel Coding for Image Transmission Open

Lunan Sun, Caili Guo, Yang Yang · 2022

Joint source and channel coding (JSCC) for image transmission has attracted increasing attention due to its robustness and high efficiency. However, the existing deep JSCC research mainly focuses on minimizing the distortion between the tr…

Semantic-assisted image compression Open

Qizheng Sun, Caili Guo, Yang Yang, Jiujiu Chen, Xijun Xue · 2022

Conventional image compression methods typically aim at pixel-level consistency while ignoring the performance of downstream AI tasks.To solve this problem, this paper proposes a Semantic-Assisted Image Compression method (SAIC), which can…

Caili Guo YOU? Author Swipe