Explanipedia

Chatting with Interactive Memory for Text-based Person Retrieval (ChinaMM 2024) Open

Chen He, Shenshen Li, Zheng Wang, Hua Chen, Fumin Shen, et al. · 2024

Computer science

Text-based person retrieval aims to match a specific pedestrian image with textual descriptions. Traditional approaches have largely focused on utilizing a "single-shot" query with text description. They may not align well with real-world s…

Anti-Collapse Loss for Deep Metric Learning Based on Coding Rate Metric Open

Xiruo Jiang, Yazhou Yao, Xili Dai, Fumin Shen, Xian‐Sheng Hua, et al. · 2024

Mathematics Computer science Economics

Deep metric learning (DML) aims to learn a discriminative high-dimensional embedding space for downstream tasks like classification, clustering, and retrieval. Prior literature predominantly focuses on pair-based and proxy-based methods to…

PTAN: Principal Token-aware Adjacent Network for Compositional Temporal Grounding Open

Z. X. Wei, Xun Jiang, Zheng Wang, Fumin Shen, Xing Xu · 2024

Computer science Mathematics Economics

Compositional temporal grounding (CTG) aims to localize the most relevant segment from an untrimmed video based on a given natural language sentence, and the test samples for this task contain novel components not seen in training. However…

Dual Dynamic Threshold Adjustment Strategy for Deep Metric Learning Open

Xiruo Jiang, Yazhou Yao, Sheng Liu, Fumin Shen, Liqiang Nie, et al. · 2024

Computer science Mathematics Economics

Loss functions and sample mining strategies are essential components in deep metric learning algorithms. However, the existing loss function or mining strategy often necessitates the incorporation of additional hyperparameters, notably the …

Dual Dynamic Threshold Adjustment Strategy Open

Xiruo Jiang, Yazhou Yao, Sheng Liu, Fumin Shen, Liqiang Nie, et al. · 2024

Computer science Art

Loss functions and sample mining strategies are essential components in deep metric learning algorithms. However, the existing loss function or mining strategy often necessitates the incorporation of additional hyperparameters, notably the…

Adaptive Uncertainty-Based Learning for Text-Based Person Retrieval Open

Shenshen Li, Chen He, Xing Xu, Fumin Shen, Yang Yang, et al. · 2024

Computer science Psychology

Text-based person retrieval aims at retrieving a specific pedestrian image from a gallery based on textual descriptions. The primary challenge is how to overcome the inherent heterogeneous modality gap in the situation of significant intra…

Hierarchical Graph Pattern Understanding for Zero-Shot VOS Open

Gensheng Pei, Fumin Shen, Yazhou Yao, Tao Chen, Xian‐Sheng Hua, et al. · 2023

Computer science Chemistry

The optical flow guidance strategy is ideal for obtaining motion information of objects in the video. It is widely utilized in video segmentation tasks. However, existing optical flow-based methods have a significant dependency on optical …

BatchNorm-based Weakly Supervised Video Anomaly Detection Open

Yixuan Zhou, Yi Qu, Xing Xu, Fumin Shen, Jingkuan Song, et al. · 2023

Computer science Physics Psychology

In weakly supervised video anomaly detection (WVAD), where only video-level labels indicating the presence or absence of abnormal events are available, the primary challenge arises from the inherent ambiguity in temporal annotations of abn…

MSFlow: Multi-Scale Flow-based Framework for Unsupervised Anomaly Detection Open

Yixuan Zhou, Xing Xu, Jingkuan Song, Fumin Shen, Heng Tao Shen · 2023

Computer science Mathematics Geology

Unsupervised anomaly detection (UAD) attracts a lot of research interest and drives widespread applications, where only anomaly-free samples are available for training. Some UAD applications intend to further locate the anomalous regions w…

AnoOnly: Semi-Supervised Anomaly Detection with the Only Loss on Anomalies Open

Yixuan Zhou, Peiyu Yang, Yi Qu, Xing Xu, Fumin Shen, et al. · 2023

Computer science Mathematics Sociology

Semi-supervised anomaly detection (SSAD) methods have demonstrated their effectiveness in enhancing unsupervised anomaly detection (UAD) by leveraging few-shot but instructive abnormal instances. However, the dominance of homogeneous norma…

Co-attention Propagation Network for Zero-Shot Video Object Segmentation Open

Gensheng Pei, Yazhou Yao, Fumin Shen, Dan Huang, Xingguo Huang, et al. · 2023

Computer science Philosophy Psychology

Zero-shot video object segmentation (ZS-VOS) aims to segment foreground objects in a video sequence without prior knowledge of these objects. However, existing ZS-VOS methods often struggle to distinguish between foreground and background …

Attention Map Guided Transformer Pruning for Edge Device Open

Junzhu Mao, Yazhou Yao, Zeren Sun, Xingguo Huang, Fumin Shen, et al. · 2023

Computer science Physics

Due to its significant capability of modeling long-range dependencies, vision transformer (ViT) has achieved promising success in both holistic and occluded person re-identification (Re-ID) tasks. However, the inherent problems of transfor…

Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation Open

Gensheng Pei, Fumin Shen, Yazhou Yao, Guo-Sen Xie, Zhenmin Tang, et al. · 2022

Computer science Engineering Physics

Optical flow is an easily conceived and precious cue for advancing unsupervised video object segmentation (UVOS). Most of the previous methods directly extract and fuse the motion and appearance features for segmenting target objects in th…

TVT: Three-Way Vision Transformer through Multi-Modal Hypersphere Learning for Zero-Shot Sketch-Based Image Retrieval Open

Jialin Tian, Xing Xu, Fumin Shen, Yang Yang, Heng Tao Shen · 2022

Computer science Engineering

In this paper, we study the zero-shot sketch-based image retrieval (ZS-SBIR) task, which retrieves natural images related to sketch queries from unseen categories. In the literature, convolutional neural networks (CNNs) have become the de-…

Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly Supervised Semantic Segmentation Open

Tao Chen, Yazhou Yao, Lei Zhang, Qiong Wang, Guo-Sen Xie, et al. · 2022

Computer science

Weakly supervised semantic segmentation with only image-level labels aims to reduce annotation costs for the segmentation task. Existing approaches generally leverage class activation maps (CAMs) to locate the object regions for pseudo …

Webly Supervised Fine-Grained Recognition: Benchmark Datasets and An Approach Open

Zeren Sun, Yazhou Yao, Xiu-Shen Wei, Yongshun Zhang, Fumin Shen, et al. · 2021

Computer science Business Geography

Learning from the web can ease the extreme dependence of deep learning on large-scale manually labeled datasets. Especially for fine-grained recognition, which aims to distinguish subordinate categories, it will significantly reduce …

PoseGTAC: Graph Transformer Encoder-Decoder with Atrous Convolution for 3D Human Pose Estimation Open

Yiran Zhu, Xing Xu, Fumin Shen, Yanli Ji, Lianli Gao, et al. · 2021

Computer science Engineering

Graph neural networks (GNNs) have been widely used in the 3D human pose estimation task, since the pose representation of a human body can be naturally modeled by the graph structure. Generally, most of the existing GNN-based models utiliz…

Enhancing Audio-Visual Association with Self-Supervised Curriculum Learning Open

Jingran Zhang, Xing Xu, Fumin Shen, Huimin Lu, Xin Liu, et al. · 2021

Computer science Philosophy Psychology

The recent success of audio-visual representations learning can be largely attributed to their pervasive concurrency property, which can be used as a self-supervision signal and extract correlation information. While most recent works focu…

Prototype-supervised Adversarial Network for Targeted Attack of Deep Hashing Open

Xunguang Wang, Zheng Zhang, Baoyuan Wu, Fumin Shen, Guangming Lu · 2021

Computer science Physics Political science

Due to its powerful capability of representation learning and high-efficiency computation, deep hashing has made significant progress in large-scale image retrieval. However, deep hashing networks are vulnerable to adversarial examples, wh…

Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation Open

Yazhou Yao, Tao Chen, Guo-Sen Xie, Chuanyi Zhang, Fumin Shen, et al. · 2021

Computer science Mathematics Art

Semantic segmentation aims to classify every pixel of an input image. Considering the difficulty of acquiring dense labels, researchers have recently been resorting to weak labels to alleviate the annotation burden of segmentation. However…

Jo-SRC: A Contrastive Approach for Combating Noisy Labels Open

Yazhou Yao, Zeren Sun, Chuanyi Zhang, Fumin Shen, Qi Wu, et al. · 2021

Computer science Mathematics Psychology

Due to the memorization effect in Deep Neural Networks (DNNs), training with noisy labels usually results in inferior model performance. Existing state-of-the-art methods primarily adopt a sample selection strategy, which selects small-los…

Semantically Meaningful Class Prototype Learning for One-Shot Image Semantic Segmentation Open

Tao Chen, Guo-Sen Xie, Yazhou Yao, Qiong Wang, Fumin Shen, et al. · 2021

Computer science Chemistry

One-shot semantic image segmentation aims to segment the object regions for the novel class with only one annotated image. Recent works adopt the episodic training strategy to mimic the expected situation at testing time. However, these ex…

Exploiting Web Images for Fine-Grained Visual Recognition by Eliminating Noisy Samples and Utilizing Hard Ones Open

Huafeng Liu, Chuanyi Zhang, Yazhou Yao, Xiu-Shen Wei, Fumin Shen, et al. · 2021

Computer science

Labeling objects at a subordinate level typically requires expert knowledge, which is not always available when using random annotators. As such, learning directly from web images for fine-grained recognition has attracted broad attention.…

A Survey Of zero shot detection: Methods and applications Open

Chufeng Tan, Xing Xu, Fumin Shen · 2021

Computer science Philosophy Chemistry

Zero-shot learning (ZSL) aims to identify objects whose labels are unavailable during training. This learning paradigm gives a classifier the ability to distinguish unseen classes. The traditional ZSL method only focuses on the image recog…

Dual ResGCN for Balanced Scene Graph Generation Open

Jingyi Zhang, Yong Zhang, Baoyuan Wu, Yanbo Fan, Fumin Shen, et al. · 2020

Computer science Psychology Biology

Visual scene graph generation is a challenging task. Previous works have achieved great progress, but most of them do not explicitly consider the class imbalance issue in scene graph generation. Models learned without considering the class…

Web-Supervised Network with Softly Update-Drop Training for Fine-Grained Visual Classification Open

Chuanyi Zhang, Yazhou Yao, Huafeng Liu, Guo-Sen Xie, Xiangbo Shu, et al. · 2020

Computer science Political science

Labeling objects at the subordinate level typically requires expert knowledge, which is not always available from a random annotator. Accordingly, learning directly from web images for fine-grained visual classification (FGVC) has attracte…

Auto-Encoding Twin-Bottleneck Hashing Open

Yuming Shen, Jie Qin, Jiaxin Chen, Mengyang Yu, Li Liu, et al. · 2020

Computer science Mathematics

Conventional unsupervised hashing methods usually take advantage of similarity graphs, which are either pre-computed in the high-dimensional space or obtained from random anchor points. On the one hand, existing methods uncouple the proced…

Fast Large-Scale Discrete Optimization Based on Principal Coordinate Descent Open

Huan Xiong, Mengyang Yu, Li Liu, Fan Zhu, Fumin Shen, et al. · 2019

Computer science Mathematics Physics

Binary optimization, a representative subclass of discrete optimization, plays an important role in mathematical optimization and has various applications in computer vision and machine learning. Usually, binary optimization problems are N…

MetaMixUp: Learning Adaptive Interpolation Policy of MixUp with Meta-Learning Open

Zhijun Mai, Guosheng Hu, Dexiong Chen, Fumin Shen, Heng Tao Shen · 2019

Computer science Mathematics

MixUp is an effective data augmentation method to regularize deep neural networks via random linear interpolations between pairs of samples and their labels. It plays an important role in model regularization, semi-supervised learning and …

Fumin Shen