Bumsub Ham
Efficient Few-Shot Neural Architecture Search by Counting the Number of Nonlinear Functions
Neural architecture search (NAS) enables finding the best-performing architecture from a search space automatically. Most NAS methods exploit an over-parameterized network (i.e., a supernet) containing all possible architectures (i.e., sub…
Maximizing the Position Embedding for Vision Transformers with Global Average Pooling
In vision transformers, position embedding (PE) plays a crucial role in capturing the order of tokens. However, the expressiveness of PE in vision transformer architectures is limited by the structure where position embeddi…
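A minimal numpy sketch of the two ingredients the title refers to: adding a learnable position embedding to patch tokens, and pooling tokens with global average pooling instead of a class token. Function names are illustrative, not from the paper.

```python
import numpy as np

def add_position_embedding(tokens, pos_embed):
    """Add a learnable position embedding to each patch token.

    tokens:    (num_tokens, dim) patch embeddings
    pos_embed: (num_tokens, dim) learnable parameters, one per position
    """
    return tokens + pos_embed

def global_average_pool(tokens):
    """Pool token features into one image representation
    (used in place of a [CLS] token)."""
    return tokens.mean(axis=0)

rng = np.random.default_rng(0)
tokens = rng.normal(size=(196, 64))     # 14x14 patches, 64-dim each
pos_embed = rng.normal(size=(196, 64))  # one embedding per position
feat = global_average_pool(add_position_embedding(tokens, pos_embed))
print(feat.shape)  # (64,)
```

With average pooling, every token (and hence every position embedding) contributes to the final feature, which is what makes the interaction between PE and the pooling scheme worth studying.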
Subnet-Aware Dynamic Supernet Training for Neural Architecture Search
N-shot neural architecture search (NAS) exploits a supernet containing all candidate subnets for a given search space. The subnets are typically trained with a static training strategy (e.g., using the same learning rate (LR) scheduler and…
ELITE: Enhanced Language-Image Toxicity Evaluation for Safety
Current Vision Language Models (VLMs) remain vulnerable to malicious prompts that induce harmful outputs. Existing safety benchmarks for VLMs primarily rely on automated evaluation methods, but these methods struggle to detect implicit har…
Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients
Network quantization generally converts full-precision weights and/or activations into low-bit fixed-point values in order to accelerate an inference process. Recent approaches to network quantization further discretize the gradients into …
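A generic illustration of what "discretizing gradients into low-bit fixed-point values" means: a symmetric uniform INT4 quantizer applied to a gradient tensor. This is the textbook baseline, not the method proposed in the paper.

```python
import numpy as np

def quantize_int4(x):
    """Map a full-precision tensor to signed 4-bit fixed-point values.

    Symmetric uniform quantization: scale by the max magnitude so
    values fall in [-8, 7], round to the nearest integer code, then
    de-quantize. The gap between x and the de-quantized result is the
    quantization error the paper analyzes.
    """
    qmin, qmax = -8, 7                       # signed INT4 range
    scale = np.abs(x).max() / qmax
    q = np.clip(np.round(x / scale), qmin, qmax).astype(np.int8)
    return q * scale, q, scale

grads = np.array([0.30, -0.11, 0.02, -0.29])
dequant, q, s = quantize_int4(grads)
print(q)  # integer codes in [-8, 7]
```

Note that small-magnitude gradients (here 0.02) collapse to the zero code, one reason gradient quantization is harder than weight or activation quantization.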
FYI: Flip Your Images for Dataset Distillation
Dataset distillation synthesizes a small set of images from a large-scale real dataset such that synthetic and real images share similar behavioral properties (e.g., distributions of gradients or features) during a training process. Through…
Scheduling Weight Transitions for Quantization-Aware Training
Quantization-aware training (QAT) simulates a quantization process during training to lower the bit-precision of weights/activations. It learns quantized weights indirectly by updating latent weights, i.e., full-precision inputs to a quantizer,…
Instance-Aware Group Quantization for Vision Transformers
Post-training quantization (PTQ) is an efficient model compression technique that quantizes a pretrained full-precision model using only a small calibration set of unlabeled samples without retraining. PTQ methods for convolutional neural …
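The calibration step described here can be sketched in a few lines: derive a scale and zero-point from the statistics of a small unlabeled batch, then quantize with them. This shows only the simplest min/max variant; PTQ methods differ precisely in how these statistics are gathered.

```python
import numpy as np

def calibrate_uniform(activations, num_bits=8):
    """Derive scale and zero-point from a small calibration set
    (plain min/max calibration; no retraining involved)."""
    qmax = 2 ** num_bits - 1
    lo, hi = activations.min(), activations.max()
    scale = (hi - lo) / qmax
    zero_point = int(round(-lo / scale))
    return scale, zero_point

def quantize(x, scale, zero_point, num_bits=8):
    """Asymmetric uniform quantization to unsigned integer codes."""
    q = np.clip(np.round(x / scale) + zero_point, 0, 2 ** num_bits - 1)
    return q.astype(np.uint8)

calib = np.array([-1.0, 0.0, 2.0, 1.5])  # tiny calibration batch
scale, zp = calibrate_uniform(calib)
codes = quantize(calib, scale, zp)
```

For transformers, per-tensor statistics like these are often too coarse, which motivates grouping activations, and, as in this paper, doing so per input instance.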
RankMixup: Ranking-Based Mixup Training for Network Calibration
Network calibration aims to accurately estimate confidence levels, which is particularly important for employing deep neural networks in real-world systems. Recent approaches leverage mixup to calibrate the network's predictions dur…
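For reference, standard mixup (Zhang et al.), which the calibration approaches mentioned above build on: convex-combine two samples and their one-hot labels with a mixing coefficient, usually drawn from a Beta distribution during training.

```python
import numpy as np

def mixup(x1, y1, x2, y2, lam):
    """Convex combination of two inputs and their one-hot labels.

    lam in [0, 1]; in practice lam ~ Beta(alpha, alpha) per batch.
    The soft label is what makes mixup useful for calibration.
    """
    x = lam * x1 + (1 - lam) * x2
    y = lam * y1 + (1 - lam) * y2
    return x, y

a, ya = np.ones(4), np.array([1.0, 0.0])
b, yb = np.zeros(4), np.array([0.0, 1.0])
x, y = mixup(a, ya, b, yb, lam=0.7)
print(y)  # [0.7 0.3]
```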
Camera-Driven Representation Learning for Unsupervised Domain Adaptive Person Re-identification
We present a novel unsupervised domain adaptation method for person re-identification (reID) that generalizes a model trained on a labeled source domain to an unlabeled target domain. We introduce a camera-driven curriculum learning (CaCL) f…
ACLS: Adaptive and Conditional Label Smoothing for Network Calibration
We address the problem of network calibration, adjusting the miscalibrated confidences of deep neural networks. Many approaches to network calibration adopt a regularization-based method that exploits a regularization term to smooth the miscali…
ALIFE: Adaptive Logit Regularizer and Feature Replay for Incremental Semantic Segmentation
We address the problem of incremental semantic segmentation (ISS), recognizing novel object/stuff categories continually without forgetting previous ones that have been learned. The catastrophic forgetting problem is particularly severe in …
Decomposed Knowledge Distillation for Class-Incremental Semantic Segmentation
Class-incremental semantic segmentation (CISS) labels each pixel of an image with a corresponding object/stuff class continually. To this end, it is crucial to learn novel classes incrementally without forgetting previously learned knowled…
Bi-directional Contrastive Learning for Domain Adaptive Semantic Segmentation
We present a novel unsupervised domain adaptation method for semantic segmentation that generalizes a model trained with source images and corresponding ground-truth labels to a target domain. A key to domain adaptive semantic segmentation…
OIMNet++: Prototypical Normalization and Localization-aware Learning for Person Search
We address the task of person search, that is, localizing and re-identifying query persons from a set of raw scene images. Recent approaches are typically built upon OIMNet, a pioneering work on person search that learns joint person represe…
Disentangled Representations for Short-Term and Long-Term Person Re-Identification
We address the problem of person re-identification (reID), that is, retrieving person images from a large dataset, given a query image of the person of interest. A key challenge is to learn person representations robust to intra-class vari…
Video-based Person Re-identification with Spatial and Temporal Memory Networks
Video-based person re-identification (reID) aims to retrieve person videos with the same identity as a query person across multiple cameras. Spatial and temporal distractors in person videos, such as background clutter and partial occlusio…
Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences
We address the problem of visible-infrared person re-identification (VI-reID), that is, retrieving a set of person images, captured by visible or infrared cameras, in a cross-modal setting. Two main challenges in VI-reID are intra-class va…
Distance-aware Quantization
We address the problem of network quantization, that is, reducing bit-widths of weights and/or activations to lighten network architectures. Quantization methods use a rounding function to map full-precision values to the nearest quantized…
Exploiting a Joint Embedding Space for Generalized Zero-Shot Semantic Segmentation
We address the problem of generalized zero-shot semantic segmentation (GZS3), predicting pixel-wise semantic labels for seen and unseen classes. Most GZS3 methods adopt a generative approach that synthesizes visual features of unseen classe…
Background-Aware Pooling and Noise-Aware Loss for Weakly-Supervised Semantic Segmentation
We address the problem of weakly-supervised semantic segmentation (WSSS) using bounding box annotations. Although object bounding boxes are good indicators to segment corresponding objects, they do not specify object boundaries, making it …
HVPR: Hybrid Voxel-Point Representation for Single-stage 3D Object Detection
We address the problem of 3D object detection, that is, estimating 3D object bounding boxes from point clouds. 3D object detection methods exploit either voxel-based or point-based features to represent 3D objects in a scene. Voxel-based f…
Network Quantization with Element-wise Gradient Scaling
Network quantization aims at reducing bit-widths of weights and/or activations, particularly important for implementing deep neural networks with limited hardware resources. Most methods use the straight-through estimator (STE) to train qu…
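The straight-through estimator (STE) mentioned here is simple to state: the forward pass rounds latent weights to the quantization grid, while the backward pass pretends the rounding was an identity, so the upstream gradient reaches the latent weights unchanged. A numpy sketch of that baseline (element-wise gradient scaling replaces the identity with per-element scaling, which is not shown):

```python
import numpy as np

def quantize_forward(w, scale=1.0):
    """Forward pass: snap latent weights to the quantization grid."""
    return np.round(w / scale) * scale

def ste_backward(grad_output):
    """Backward pass under STE: round() is treated as identity, so the
    gradient w.r.t. the latent weights equals the upstream gradient."""
    return grad_output

w = np.array([0.4, 1.6, -0.7])          # latent (full-precision) weights
print(quantize_forward(w))              # [ 0.  2. -1.]
g = ste_backward(np.array([0.1, -0.2, 0.3]))
```

Because round() has zero gradient almost everywhere, some surrogate like this is unavoidable; how faithfully the surrogate reflects the true loss surface is exactly what gradient-scaling methods try to improve.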
Learning Semantic Correspondence Exploiting an Object-Level Prior
We address the problem of semantic correspondence, that is, establishing a dense flow field between images depicting different instances of the same object or scene category. We propose to use images annotated with binary foreground masks …