Explanipedia

A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection Open

Shenghao Fu, Junkai Yan, Qize Yang, Xihan Wei, Xiaohua Xie , et al. · 2025

Open-vocabulary object detection (OVD) aims to detect objects beyond the training annotations, where detectors are usually aligned to a pre-trained vision-language model, eg, CLIP, to inherit its generalizable recognition ability so that d…

LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models Open

Shenghao Fu, Qize Yang, Qijie Mo, Junkai Yan, Xihan Wei , et al. · 2025

Computer science Philosophy

Recent open-vocabulary detectors achieve promising performance with abundant region-level annotated data. In this work, we show that an open-vocabulary detector co-training with a large language model by generating image-level detailed cap…

HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding Open

Jiaxing Zhao, Qize Yang, Yixing Peng, Detao Bai, Sicong Yao , et al. · 2025

Computer science Philosophy

In human-centric scenes, the ability to simultaneously understand visual and auditory information is crucial. While recent omni models can process multiple modalities, they generally lack effectiveness in human-centric scenes due to the ab…

Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models Open

Shenghao Fu, Junkai Yan, Qize Yang, Xihan Wei, Xiaohua Xie , et al. · 2024

Computer science Geography

Recent vision foundation models can extract universal representations and show impressive abilities in various tasks. However, their application on object detection is largely overlooked, especially without fine-tuning them. In this work, …

DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation Open

Junkai Yan, Yipeng Gao, Qize Yang, Xihan Wei, Xuansong Xie , et al. · 2024

Computer science

Text-to-3D generation, which synthesizes 3D assets according to an overall text description, has significantly progressed. However, a challenge arises when the specific appearances need customizing at designated viewpoints but referring so…

PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation Open

Hanbing Liu, Jun-Yan He, Zhi-Qi Cheng, Wangmeng Xiang, Qize Yang , et al. · 2023

Computer science Mathematics Engineering

Existing 3D human pose estimators face challenges in adapting to new datasets due to the lack of 2D-3D pose pairs in training sets. To overcome this issue, we propose \textit{Multi-Hypothesis \textbf{P}ose \textbf{Syn}thesis \textbf{D}omai…

Interactive Self-Training with Mean Teachers for Semi-supervised Object Detection Open

Qize Yang, Xihan Wei, Biao Wang, Xian‐Sheng Hua, Lei Zhang · 2021

Computer science Engineering

The goal of semi-supervised object detection is to learn a detection model using only a few labeled data and large amounts of unlabeled data, thereby reducing the cost of data labeling. Although a few studies have proposed various self-tra…

Caspase-11-Gasdermin D-Mediated Pyroptosis Is Involved in the Pathogenesis of Atherosclerosis Open

Mengqing Jiang, Xuejing Sun, Suzhen Liu, Yan Tang, Yunming Shi , et al. · 2021

Medicine Chemistry

Background: Pyroptosis is a form of cell death triggered by proinflammatory signals. Recent studies have reported that oxidized phospholipids function as caspase-11 agonists to induce noncanonical inflammasome activation in immune cells. A…

Triglycerides to total cholesterol ratio: an early screening tool for NAFLD in Chinese populations Open

Jingyuan Chen, Yi‐Ping Yang, Qize Yang, Jiangang Wang, Zhiheng Chen , et al. · 2020

Medicine

Background Non-alcoholic fatty liver disease(NAFLD) has a high prevalence in the general population worldwide. Both triglycerides (TG) and total cholesterol (TC) are correlated with the prevalence of NAFLD. The study purpose is to determin…

Rethinking Temporal Fusion for Video-Based Person Re-Identification on Semantic and Time Aspect Open

Xinyang Jiang, Yifei Gong, Xiaowei Guo, Qize Yang, Feiyue Huang , et al. · 2020

Computer science Philosophy Engineering

Recently, the research interest of person re-identification (ReID) has gradually turned to video-based methods, which acquire a person representation by aggregating frame features of an entire video. However, existing video-based ReID meth…

Person Re-Identification by Contour Sketch Under Moderate Clothing Change Open

Qize Yang, Ancong Wu, Wei‐Shi Zheng · 2019

Computer science Geography Biology

Person re-identification (re-id), the process of matching pedestrian images across different camera views, is an important task in visual surveillance. Substantial development of re-id has recently been observed, and the majority of existi…

Rethinking Temporal Fusion for Video-based Person Re-identification on Semantic and Time Aspect Open

Xinyang Jiang, Yifei Gong, Xiaowei Guo, Qize Yang, Feiyue Huang , et al. · 2019

Computer science Biology Materials science

Recently, the research interest of person re-identification (ReID) has gradually turned to video-based methods, which acquire a person representation by aggregating frame features of an entire video. However, existing video-based ReID meth…

Qize Yang YOU? Author Swipe