Explanipedia

ViTaL: A Multimodality Dataset and Benchmark for Multi-pathological Ovarian Tumor Recognition Open

You Zhou, Lijiang Chen, Guangxia Cui, Wenpei Bai, Yu Guo , et al. · 2025

Ovarian tumor, as a common gynecological disease, can rapidly deteriorate into serious health crises when undetected early, thus posing significant threats to the health of women. Deep neural networks have the potential to identify ovarian…

DUAL: Dynamic Uncertainty-Aware Learning Open

Jiahao Qin, Bei Peng, Feng Liu, Guangliang Cheng, Lu Zong · 2025

Deep learning models frequently encounter feature uncertainty in diverse learning scenarios, significantly impacting their performance and reliability. This challenge is particularly complex in multi-modal scenarios, where models must inte…

Preference Alignment on Diffusion Model: A Comprehensive Survey for Image Generation and Editing Open

Sihao Wu, Xiaonan Si, Chi Xing, Jianhong Wang, Gaojie Jin , et al. · 2025

The integration of preference alignment with diffusion models (DMs) has emerged as a transformative approach to enhance image generation and editing capabilities. Although integrating diffusion models with preference alignment strategies p…

Position: Towards a Responsible LLM-empowered Multi-Agent Systems Open

Jinwei Hu, Yi Dong, Shuang Ao, Zhuoyun Li, Boxuan Wang , et al. · 2025

Computer science Business

The rise of Agent AI and Large Language Model-powered Multi-Agent Systems (LLM-MAS) has underscored the need for responsible and dependable system operation. Tools like LangChain and Retrieval-Augmented Generation have expanded LLM capabil…

BEARD: Benchmarking the Adversarial Robustness for Dataset Distillation Open

Zheng Zhou, Wenquan Feng, Shuchang Lyu, Guangliang Cheng, Xiaowei Huang , et al. · 2024

Computer science Economics Chemistry

Dataset Distillation (DD) is an emerging technique that compresses large-scale datasets into significantly smaller synthesized datasets while preserving high test performance and enabling the efficient training of large models. However, cu…

Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model Open

Shuchang Lyu, Qi Zhaoa, Guangliang Cheng, Yiwei He, Zheng Zhou , et al. · 2024

Computer science Engineering Geography

Unsupervised Domain Adaptation for Remote Sensing Semantic Segmentation (UDA-RSSeg) addresses the challenge of adapting a model trained on source domain data to target domain samples, thereby minimizing the need for annotated data across d…

Shape-Dependent Dynamic Label Assignment for Oriented Remote Sensing Object Detection Open

Xue Zhang, Yanxia Wu, Guoyin Zhang, Ye Yuan, Guangliang Cheng , et al. · 2024

Computer science Geology

Oriented remote sensing object detection (ORSOD) has gained increasing significance in both military and civilian applications due to the necessity of accurately identifying objects with varying shapes and orientations in remote sensing da…

Robust RL with LLM-Driven Data Synthesis and Policy Adaptation for Autonomous Driving Open

Saisai Wu, Jiaxu Liu, Xiangyu Yin, Guangliang Cheng, Fang Meng , et al. · 2024

Computer science Psychology

The integration of Large Language Models (LLMs) into autonomous driving systems demonstrates strong common sense and reasoning abilities, effectively addressing the pitfalls of purely data-driven methods. Current LLM-based agents require l…

OVGNet: A Unified Visual-Linguistic Framework for Open-Vocabulary Robotic Grasping Open

Meng Li, Qi Zhao, Lyu Shuchang, Chunlei Wang, MA Yu-jing , et al. · 2024

Computer science Psychology Philosophy

Recognizing and grasping novel-category objects remains a crucial yet challenging problem in real-world robotic applications. Despite its significance, limited research has been conducted in this specific domain. To address this, we seamle…

MSTF: Multiscale Transformer for Incomplete Trajectory Prediction Open

Zhanwen Liu, Chao Li, Yang Nan, Yang Wang, Jiaqi Ma , et al. · 2024

Computer science Physics Engineering

Motion forecasting plays a pivotal role in autonomous driving systems, enabling vehicles to execute collision warnings and rational local-path planning based on predictions of the surrounding vehicles. However, prevalent methods often assu…

BACON: Bayesian Optimal Condensation Framework for Dataset Distillation Open

Zheng Zhou, Hongbo Zhao, Guangliang Cheng, Xiangtai Li, Shuchang Lyu , et al. · 2024

Computer science Mathematics Chemistry

Dataset Distillation (DD) aims to distill knowledge from extensive datasets into more compact ones while preserving performance on the test set, thereby reducing storage costs and training expenses. However, existing methods often suffer f…

Efficient Decoder and Intermediate Domain for Semantic Segmentation in Adverse Conditions Open

Xiaohong Chen, Nan Jiang, Yifeng Li, Guangliang Cheng, Liang Zheng , et al. · 2024

Computer science Mathematics Geography

In smart city contexts, traditional methods for semantic segmentation are affected by adverse conditions, such as rain, fog, or darkness. One challenge is the limited availability of semantic segmentation datasets, specifically for autonom…

Self-training guided disentangled adaptation for cross-domain remote sensing image semantic segmentation Open

Qi Zhao, Shuchang Lyu, Hongbo Zhao, Binghao Liu, Lijiang Chen , et al. · 2024

Computer science Physics Philosophy

Remote sensing (RS) image semantic segmentation using deep convolutional neural networks (DCNNs) has shown great success in various applications. However, the high dependence on annotated data makes it challenging for DCNNs to adapt to dif…

DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection Open

Shilin Xu, Xiangtai Li, Size Wu, Wenwei Zhang, Yining Li , et al. · 2023

Computer science Geography Philosophy

Open-vocabulary object detection (OVOD) aims to detect the objects beyond the set of classes observed during training. This work introduces a straightforward and efficient strategy that utilizes pre-trained vision-language models (VLM), li…

Sfnet: Faster and Accurate Semantic Segmentation Via Semantic Flow Open

Xiangtai Li, Jiangning Zhang, Yibo Yang, Guangliang Cheng, Kuiyuan Yang , et al. · 2023

Computer science Mathematics Philosophy

In this paper, we focus on exploring effective methods for faster and accurate semantic segmentation. A common practice to improve the performance is to attain high-resolution feature maps with strong semantic representation. Two strategie…

Learn by Oneself: Exploiting Weight-Sharing Potential in Knowledge Distillation Guided Ensemble Network Open

Qi Zhao, Shuchang Lyu, Lijiang Chen, Binghao Liu, Ting-Bing Xu , et al. · 2023

Computer science Political science Philosophy

Recent CNNs (convolutional neural networks) have become more and more compact. The elegant structure design highly improves the performance of CNNs. With the development of knowledge distillation technique, the performance of CNNs gets fur…

Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation Open

Xiangtai Li, Haobo Yuan, Wenwei Zhang, Guangliang Cheng, Jiangmiao Pang , et al. · 2023

Computer science Mathematics Engineering

Video segmentation aims to segment and track every pixel in diverse scenarios accurately. In this paper, we present Tube-Link, a versatile framework that addresses multiple core tasks of video segmentation with a unified architecture. Our …

Local-to-Global Information Communication for Real-Time Semantic Segmentation Network Search Open

Guangliang Cheng, Peng Sun, Ting-Bing Xu, Shuchang Lyu, Pei‐Wen Lin · 2023

Computer science Engineering

Neural Architecture Search (NAS) has shown great potentials in automatically designing neural network architectures for real-time semantic segmentation. Unlike previous works that utilize a simplified search space with cell-sharing way, we…

PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation Open

Xiangtai Li, Shilin Xu, Yibo Yang, Haobo Yuan, Guangliang Cheng , et al. · 2023

Computer science Engineering Biology

Panoptic Part Segmentation (PPS) unifies panoptic and part segmentation into one task. Previous works utilize separate approaches to handle things, stuff, and part predictions without shared computation and task association. We aim to unif…

TransVOD: End-to-End Video Object Detection With Spatial-Temporal Transformers Open

Qianyu Zhou, Xiangtai Li, Lu H, Yibo Yang, Guangliang Cheng , et al. · 2022

Computer science Engineering

Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the need for many hand-designed components in object detection while demonstrating good performance as previous complex hand-crafted detectors. However, their…

Reconstruct from BEV: A 3D Lane Detection Approach based on Geometry Structure Prior Open

Chenguang Li, Jia Shi, Ya Wang, Guangliang Cheng · 2022

Computer science Mathematics Materials science

In this paper, we propose an advanced approach in targeting the problem of monocular 3D lane detection by leveraging geometry structure underneath the process of 2D to 3D lane reconstruction. Inspired by previous methods, we first analyze …

Multi-level Domain Adaptation for Lane Detection Open

Chenguang Li, Boheng Zhang, Jia Shi, Guangliang Cheng · 2022

Computer science Mathematics Physics

We focus on bridging domain discrepancy in lane detection among different scenarios to greatly reduce extra annotation and re-training costs for autonomous driving. Critical factors hinder the performance improvement of cross-domain lane d…

Fashionformer: A simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition Open

Shilin Xu, Xiangtai Li, Jingbo Wang, Guangliang Cheng, Yunhai Tong , et al. · 2022

Computer science Geology Physics

Human fashion understanding is one crucial computer vision task since it has comprehensive information for real-world applications. This focus on joint human fashion segmentation and attribute recognition. Contrary to the previous works th…

Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation Open

Xiangtai Li, Wenwei Zhang, Jiangmiao Pang, Kai Chen, Guangliang Cheng , et al. · 2022

Computer science Mathematics Philosophy

This paper presents Video K-Net, a simple, strong, and unified framework for fully end-to-end video panoptic segmentation. The method is built upon K-Net, a method that unifies image segmentation via a group of learnable kernels. We observ…

PIT: Position-Invariant Transform for Cross-FoV Domain Adaptation Open

Qiqi Gu, Qianyu Zhou, Minghao Xu, Zhengyang Feng, Guangliang Cheng , et al. · 2021

Computer science Geography Mathematics

Cross-domain object detection and semantic segmentation have witnessed impressive progress recently. Existing approaches mainly consider the domain shift resulting from external environments including the changes of background, illuminatio…

Guangliang Cheng YOU? Author Swipe