Explanipedia

Leveraging Prior Knowledge of Diffusion Model for Person Search Open

G. Jiyun Kim, S.-H. Yang, Jihyong Oh, Myungjoo Kang, Chanho Eom · 2025

Person search aims to jointly perform person detection and re-identification by localizing and identifying a query person within a gallery of uncropped scene images. Existing methods predominantly utilize ImageNet pre-trained backbones, wh…

Visual Representation Alignment for Multimodal Large Language Models Open

Heeji Yoon, Jaewoo Jung, Jin‐Hoi Kim, Hyun-Jun Choi, Heeseong Shin , et al. · 2025

Multimodal large language models (MLLMs) trained with visual instruction tuning have achieved strong performance across diverse tasks, yet they remain limited in vision-centric tasks such as object counting or spatial reasoning. We attribu…

Domain Generalization for Person Re-identification: A Survey Towards Domain-Agnostic Person Matching Open

Hyeonseo Lee, Juhyun Park, Jihyong Oh, Chanho Eom · 2025

Person Re-identification (ReID) aims to retrieve images of the same individual captured across non-overlapping camera views, making it a critical component of intelligent surveillance systems. Traditional ReID methods assume that the train…

AceVFI: A Comprehensive Survey of Advances in Video Frame Interpolation Open

Dahyeon Kye, Chang-Hyun Roh, Sung-Jea Ko, Chanho Eom, Jihyong Oh · 2025

Video Frame Interpolation (VFI) is a fundamental Low-Level Vision (LLV) task that synthesizes intermediate frames between existing ones while maintaining spatial and temporal coherence. VFI techniques have evolved from classical motion com…

Subnet-Aware Dynamic Supernet Training for Neural Architecture Search Open

Jeimin Jeon, Youngmin Oh, Junghyup Lee, Donghyeon Baek, Dohyung Kim , et al. · 2025

N-shot neural architecture search (NAS) exploits a supernet containing all candidate subnets for a given search space. The subnets are typically trained with a static training strategy (e.g., using the same learning rate (LR) scheduler and…

Bi-directional Contrastive Learning for Domain Adaptive Semantic Segmentation Open

Geon Lee, Chanho Eom, Won-Kyung Lee, Hye-Kang Park, Bumsub Ham · 2022

We present a novel unsupervised domain adaptation method for semantic segmentation that generalizes a model trained with source images and corresponding ground-truth labels to a target domain. A key to domain adaptive semantic segmentation…

Disentangled Representations for Short-Term and Long-Term Person Re-Identification Open

Chanho Eom, Won-Kyung Lee, Geon Lee, Bumsub Ham · 2021

Computer science Physics Economics

We address the problem of person re-identification (reID), that is, retrieving person images from a large dataset, given a query image of the person of interest. A key challenge is to learn person representations robust to intra-class vari…

Video-based Person Re-identification with Spatial and Temporal Memory Networks Open

Chanho Eom, Geon Lee, Junghyup Lee, Bumsub Ham · 2021

Computer science Physics

Video-based person re-identification (reID) aims to retrieve person videos with the same identity as a query person across multiple cameras. Spatial and temporal distractors in person videos, such as background clutter and partial occlusio…

Learning Disentangled Representation for Robust Person Re-identification Open

Chanho Eom, Bumsub Ham · 2019

Computer science Political science Physics

We address the problem of person re-identification (reID), that is, retrieving person images from a large dataset, given a query image of the person of interest. A key challenge is to learn person representations robust to intra-class vari…

Temporally Consistent Depth Prediction with Flow-Guided Memory Units Open

Chanho Eom, Hyunjong Park, Bumsub Ham · 2019

Computer science Mathematics Economics

Predicting depth from a monocular video sequence is an important task for autonomous driving. Although it has advanced considerably in the past few years, recent methods based on convolutional neural networks (CNNs) discard temporal cohere…

Chanho Eom YOU? Author Swipe