Kimin Lee
Adversarial Reinforcement Learning Framework for ESP Cheater Simulation
Extra-Sensory Perception (ESP) cheats, which reveal hidden in-game information such as enemy locations, are difficult to detect because their effects are not directly observable in player behavior. The lack of observable evidence makes it …
Learning to Generate Unit Test via Adversarial Reinforcement Learning
Unit testing is a core practice in programming, enabling systematic evaluation of programs produced by human developers or large language models (LLMs). Given the challenges in writing comprehensive unit tests, LLMs have been employed to a…
Unintended Misalignment from Agentic Fine-Tuning: Risks and Mitigation
Beyond simple text generation, Large Language Models (LLMs) have evolved into agentic systems capable of planning and interacting with external tools to solve complex tasks. This evolution involves fine-tuning LLMs on agent-specific tasks …
Pulsed electric field pretreatment to enhance sodium chloride and moisture diffusion in radish tissues while maintaining microstructural integrity
This study examined the influence of pulsed electric field (PEF) pretreatment on sodium chloride (NaCl) and moisture mass transfer in radish (Raphanus sativus L.) tissues, aiming to improve salting efficiency while preserving microstructur…
Enhancing Motion Dynamics of Image-to-Video Models via Adaptive Low-Pass Guidance
Recent text-to-video (T2V) models have demonstrated strong capabilities in producing high-quality, dynamic videos. To improve the visual controllability, recent works have considered fine-tuning pre-trained T2V models to support image-to-v…
Prime the search: Using large language models for guiding geometric task and motion planning by warm-starting tree search
The problem of relocating a set of objects to designated areas amidst movable obstacles can be framed as geometric task and motion planning (G-TAMP), a subclass of task and motion planning (TAMP) problems. Traditional approaches to g-ta…
Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness
The remarkable growth in large language model (LLM) capabilities has spurred exploration into multi-agent systems, with debate frameworks emerging as a promising avenue for enhanced problem-solving. These multi-agent debate (MAD) approache…
What Really Matters in Many-Shot Attacks? An Empirical Study of Long-Context Vulnerabilities in LLMs
We investigate long-context vulnerabilities in Large Language Models (LLMs) through Many-Shot Jailbreaking (MSJ). Our experiments use context lengths of up to 128K tokens. Through comprehensive analysis with various many-shot attack set…
DiffExp: Efficient Exploration in Reward Fine-tuning for Text-to-Image Diffusion Models
Fine-tuning text-to-image diffusion models to maximize rewards has proven effective for enhancing model performance. However, reward fine-tuning methods often suffer from slow convergence due to online sample generation. Therefore, obtaini…
Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models
Text-to-image diffusion models have achieved remarkable success in generating high-quality content from text prompts. However, their reliance on publicly available data and the growing trend of data sharing for fine-tuning make these mode…
Improbable Bigrams Expose Vulnerabilities of Incomplete Tokens in Byte-Level Tokenizers
Tokenization is a crucial step that bridges human-readable text with model-readable discrete tokens. However, recent studies have revealed that tokenizers can be exploited to elicit unwanted model behaviors. In this work, we investigate in…
MobileSafetyBench: Evaluating Safety of Autonomous Agents in Mobile Device Control
Autonomous agents powered by large language models (LLMs) show promising potential in assistive tasks across various domains, including mobile device control. As these agents interact directly with personal information and device settings,…
Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation
Human feedback plays a critical role in learning and refining reward models for text-to-image generation, but the optimal form that feedback should take for learning an accurate reward function has not been conclusively established. This pa…
Latent Action Pretraining from Videos
We introduce Latent Action Pretraining for general Action models (LAPA), an unsupervised method for pretraining Vision-Language-Action (VLA) models without ground-truth robot action labels. Existing Vision-Language-Action models require ac…
Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models
Fine-tuning text-to-image diffusion models with human feedback is an effective method for aligning model behavior with human intentions. However, this alignment process often suffers from slow convergence due to the large size and noise pr…
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing
Recent advances in diffusion models have introduced a new era of text-guided image manipulation, enabling users to create realistic edited images with simple textual prompts. However, there is significant concern about the potential misuse…
Margin Matching Preference Optimization: Enhanced Model Alignment with Granular Feedback
Large language models (LLMs) fine-tuned with alignment techniques, such as reinforcement learning from human feedback, have been instrumental in developing some of the most capable AI systems to date. Despite their success, existing method…
By My Eyes: Grounding Multimodal Large Language Models with Sensor Data via Visual Prompting
Large language models (LLMs) have demonstrated exceptional abilities across various domains. However, utilizing LLMs for ubiquitous sensing applications remains challenging as existing text-prompt methods show significant performance degra…
Aligning Large Language Models with Self-generated Preference Data
Aligning large language models (LLMs) with human preferences has become a key component of obtaining state-of-the-art performance, but constructing a large human-annotated preference dataset comes at a huge cost. To tackle this problem, we p…
Benchmarking Mobile Device Control Agents across Diverse Configurations
Mobile device control agents can largely enhance user interactions and productivity by automating daily tasks. However, despite growing interest in developing practical agents, the absence of a commonly adopted benchmark in this area makes…
Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models
Text-to-image diffusion models have shown remarkable success in generating personalized subjects based on a few reference images. However, current methods often fail when generating multiple subjects simultaneously, resulting in mixed iden…
Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models
Fine-tuning text-to-image models with reward functions trained on human feedback data has proven effective for aligning model behavior with human intent. However, excessive optimization with such reward models, which serve as mere proxy ob…
SelfReplay: Adapting Self-Supervised Sensory Models via Adaptive Meta-Task Replay
Self-supervised learning has emerged as a method for utilizing massive unlabeled data for pre-training models, providing an effective feature extractor for various mobile sensing applications. However, when deployed to end-users, these mod…