Jihoon Tack
Think Clearly: Improving Reasoning via Redundant Token Pruning
Recent large language models have shown promising capabilities in long-form reasoning, following structured chains of thought before arriving at a final answer. However, we observe that these reasoning paths tend to include substantial red…
Mamba Drafters for Speculative Decoding
Speculative decoding has emerged as a promising approach to accelerating large language model (LLM) generation using a fast drafter while maintaining alignment with the target model's distribution. However, existing approaches face a trade…
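For readers unfamiliar with the draft-then-verify loop this abstract alludes to, here is a minimal sketch of generic speculative sampling on toy next-token distributions; the names (draft_dist, target_dist, speculative_step) are illustrative assumptions, not the paper's Mamba-based drafter, and the bonus token drawn when every draft is accepted is omitted for brevity.

```python
# Generic speculative sampling sketch on toy distributions (not the paper's method).
import numpy as np

rng = np.random.default_rng(0)

def draft_dist(prefix, vocab=8):
    """Hypothetical cheap drafter: a smoothed distribution favoring the last token."""
    p = np.ones(vocab)
    p[prefix[-1] % vocab] += 3.0
    return p / p.sum()

def target_dist(prefix, vocab=8):
    """Hypothetical target model: a different, sharper distribution."""
    p = np.ones(vocab)
    p[(prefix[-1] + 1) % vocab] += 6.0
    return p / p.sum()

def speculative_step(prefix, k=4):
    """Draft k tokens cheaply, then accept/reject so outputs follow the target."""
    drafted, q, ctx = [], [], list(prefix)
    for _ in range(k):                          # 1) drafter proposes k tokens
        dist = draft_dist(ctx)
        tok = rng.choice(len(dist), p=dist)
        drafted.append(tok); q.append(dist); ctx.append(tok)

    out = list(prefix)
    for tok, qdist in zip(drafted, q):          # 2) target verifies each draft token
        pdist = target_dist(out)
        if rng.random() < min(1.0, pdist[tok] / qdist[tok]):
            out.append(tok)                     # accepted: matches the target distribution
        else:                                   # rejected: resample from (p - q)_+
            resid = np.maximum(pdist - qdist, 0)
            resid = resid / resid.sum() if resid.sum() > 0 else pdist
            out.append(rng.choice(len(resid), p=resid))
            break
    return out

print(speculative_step([1]))
```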
ReGUIDE: Data Efficient GUI Grounding via Spatial Reasoning and Search
Recent advances in Multimodal Large Language Models (MLLMs) have enabled autonomous agents to interact with computers via Graphical User Interfaces (GUIs), where accurately localizing the coordinates of interface elements (e.g., buttons) i…
Adversarial Self-Supervised Contrastive Learning
Existing adversarial learning approaches mostly use class labels to generate adversarial samples that lead to incorrect predictions, which are then used to augment the training of the model for improved robustness. While some recent works …
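As background for the label-based adversarial example generation the abstract contrasts with, the following is a minimal FGSM-style sketch on a toy logistic model; the model, names, and numbers are illustrative assumptions, not the paper's self-supervised approach.

```python
# FGSM-style adversarial perturbation of an input using its class label (background sketch).
import numpy as np

rng = np.random.default_rng(0)
w, b = rng.normal(size=3), 0.1                  # toy linear classifier parameters

def loss_and_grad_x(x, y):
    """Cross-entropy of a logistic model and its gradient w.r.t. the *input* x."""
    p = 1.0 / (1.0 + np.exp(-(x @ w + b)))
    loss = -(y * np.log(p + 1e-9) + (1 - y) * np.log(1 - p + 1e-9))
    grad_x = (p - y) * w                        # d(loss)/dx for the logistic model
    return loss, grad_x

def fgsm(x, y, eps=0.1):
    """Perturb x in the direction that increases the loss for its true label y."""
    _, g = loss_and_grad_x(x, y)
    return x + eps * np.sign(g)

x, y = rng.normal(size=3), 1
x_adv = fgsm(x, y)
print("clean loss:", loss_and_grad_x(x, y)[0], "adv loss:", loss_and_grad_x(x_adv, y)[0])
```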
Tabular Transfer Learning via Prompting LLMs
Learning with a limited number of labeled data is a central problem in real-world applications of machine learning, as it is often expensive to obtain annotations. To deal with the scarcity of labeled data, transfer learning is a conventio…
Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning
In tabular prediction tasks, tree-based models combined with automated feature engineering methods often outperform deep learning approaches that rely on learned representations. While these feature engineering techniques are effective, th…
Unleashing the Power of Meta-tuning for Few-shot Generalization Through Sparse Interpolated Experts
Recent successes suggest that parameter-efficient fine-tuning of foundation models is becoming the state-of-the-art method for transfer learning in vision, replacing the rich literature of alternatives such as meta-learning. In trying to harness th…
Online Adaptation of Language Models with a Memory of Amortized Contexts
Due to the rapid generation and dissemination of information, large language models (LLMs) quickly run out of date despite enormous development costs. To address the crucial need to keep models updated, online learning has emerged as a cri…
Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder
Despite its practical importance across a wide range of modalities, recent advances in self-supervised learning (SSL) have been primarily focused on a few well-curated domains, e.g., vision and language, often relying on their domain-speci…
STUNT: Few-shot Tabular Learning with Self-generated Tasks from Unlabeled Tables
Learning with few labeled tabular samples is often an essential requirement for industrial machine learning applications as varieties of tabular data suffer from high annotation costs or have difficulties in collecting new samples for nove…
Learning Large-scale Neural Fields via Context Pruned Meta-Learning
We introduce an efficient optimization-based meta-learning technique for large-scale neural field training by realizing significant memory savings through automated online context point selection. This is achieved by focusing each learning…
Modality-Agnostic Variational Compression of Implicit Neural Representations
We introduce a modality-agnostic neural compression algorithm based on a functional view of data and parameterised as an Implicit Neural Representation (INR). Bridging the gap between latent coding and sparsity, we obtain compact latent re…
Meta-Learning with Self-Improving Momentum Target
The idea of using a separately trained target model (or teacher) to improve the performance of the student model has been increasingly popular in various machine learning domains, and meta-learning is no exception; a recent discovery shows…
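To make the teacher-to-student idea mentioned here concrete, below is a minimal knowledge-distillation sketch on toy logits; it illustrates the generic background only, not the paper's self-improving momentum target, and all names and values are illustrative.

```python
# Generic teacher -> student distillation loss on toy logits (background sketch).
import numpy as np

def softmax(z, T=1.0):
    z = z / T
    e = np.exp(z - z.max())
    return e / e.sum()

def distill_loss(student_logits, teacher_logits, T=2.0):
    """KL(teacher || student) on temperature-softened class probabilities."""
    p_t = softmax(teacher_logits, T)
    p_s = softmax(student_logits, T)
    return float(np.sum(p_t * (np.log(p_t + 1e-9) - np.log(p_s + 1e-9))))

teacher = np.array([2.0, 0.5, -1.0])    # toy logits from a separately trained teacher
student = np.array([0.3, 0.2, 0.1])     # toy logits from the student being trained
print("distillation loss:", distill_loss(student, teacher))
```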
Consistency Regularization for Adversarial Robustness
Adversarial training (AT) is currently one of the most successful methods to obtain the adversarial robustness of deep neural networks. However, the phenomenon of robust overfitting, i.e., the robustness starts to decrease significantly du…
Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks
In the deep learning era, generating long, high-quality videos remains challenging due to the spatio-temporal complexity and continuity of videos. Existing prior works have attempted to model video distribution by representing video…
Meta-Learning Sparse Implicit Neural Representations
Implicit neural representations are a promising new avenue of representing general signals by learning a continuous function that, parameterized as a neural network, maps the domain of a signal to its codomain; the mapping from spatial coo…
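For readers new to implicit neural representations, the following is a minimal sketch of the coordinate-to-value idea the abstract describes: a tiny MLP fitted to one toy image; the architecture, training loop, and signal are illustrative assumptions, not the paper's sparse meta-learning method.

```python
# Fit a tiny coordinate MLP to a toy 16x16 grayscale "image" (illustrative INR sketch).
import numpy as np

rng = np.random.default_rng(0)

H = W = 16
ys, xs = np.meshgrid(np.linspace(-1, 1, H), np.linspace(-1, 1, W), indexing="ij")
coords = np.stack([ys.ravel(), xs.ravel()], axis=1)      # (256, 2) pixel coordinates
values = ((ys + xs) / 2.0).ravel()[:, None]              # (256, 1) target intensities

W1 = rng.normal(0, 0.5, (2, 64)); b1 = np.zeros(64)      # 2-layer MLP: coords -> value
W2 = rng.normal(0, 0.5, (64, 1)); b2 = np.zeros(1)

def forward(x):
    h = np.tanh(x @ W1 + b1)
    return h, h @ W2 + b2

lr = 1e-2
for step in range(2000):                                  # fit the network to this one signal
    h, pred = forward(coords)
    err = pred - values
    gW2 = h.T @ err / len(coords); gb2 = err.mean(0)      # backprop of squared error
    dh = (err @ W2.T) * (1 - h**2)
    gW1 = coords.T @ dh / len(coords); gb1 = dh.mean(0)
    W2 -= lr * gW2; b2 -= lr * gb2; W1 -= lr * gW1; b1 -= lr * gb1

_, recon = forward(coords)
print("final MSE:", float(((recon - values) ** 2).mean()))
```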
CSI: Novelty Detection via Contrastive Learning on Distributionally Shifted Instances
Novelty detection, i.e., identifying whether a given sample is drawn from outside the training distribution, is essential for reliable machine learning. To this end, there have been many attempts at learning a representation well-suited fo…