Explanipedia

Privacy-Aware Continual Self-Supervised Learning on Multi-Window Chest Computed Tomography for Domain-Shift Robustness Open

Ren Tasai, Guang Li, Ren Togo, Takahiro Ogawa, Kenji Hirata , et al. · 2025

We propose a novel continual self-supervised learning (CSSL) framework for simultaneously learning diverse features from multi-window-obtained chest computed tomography (CT) images and ensuring data privacy. Achieving a robust and highly g…

Context-aware Image-to-Music Generation via Bridging Modalities through Musical Captions Open

Shilin Liu, Kyohei Kamikawa, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama · 2025

Deep-learning-based automatic liver segmentation using computed tomography images in dogs Open

Seungyeon Lee, Genya Shimbo, Nozomu Yokoyama, Kensuke Nakamura, Ren Togo , et al. · 2025

Introduction Deep learning-based automated segmentation has significantly improved the efficiency and accuracy of human medicine applications. However, veterinary applications, particularly canine liver segmentation, remain limited. This s…

Adaptive Shared Experts with LoRA-Based Mixture of Experts for Multi-Task Learning Open

Minghao Yang, Ren Togo, Guang Li, Takahiro Ogawa, Miki Haseyama · 2025

Mixture-of-Experts (MoE) has emerged as a powerful framework for multi-task learning (MTL). However, existing MoE-MTL methods often rely on single-task pretrained backbones and suffer from redundant adaptation and inefficient knowledge sha…

GeoJapan Fusion Framework: A Large Multimodal Model for Regional Remote Sensing Recognition Open

Yaozong Gan, Guang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa , et al. · 2025

Recent advances in large multimodal models (LMMs) have opened new opportunities for multitask recognition from remote sensing images. However, existing approaches still face challenges in effectively recognizing the complex geospatial char…

Discrete Prompt Tuning via Recursive Utilization of Black-box Multimodal Large Language Model for Personalized Visual Emotion Recognition Open

Ryo Takahashi, Naoki Saito, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama · 2025

Visual Emotion Recognition (VER) is an important research topic due to its wide range of applications, including opinion mining and advertisement design. Extending this capability to recognize emotions at the individual level further broad…

Dual-Model Weight Selection and Self-Knowledge Distillation for Medical Image Classification Open

Ayaka Tsutsumi, Guang Li, Ren Togo, Takahiro Ogawa, Sosuke Kondo , et al. · 2025

We propose a novel medical image classification method that integrates dual-model weight selection with self-knowledge distillation (SKD). In real-world medical settings, deploying large-scale models is often limited by computational resou…

Information-Guided Diffusion Sampling for Dataset Distillation Open

Linfeng Ye, S. Hamidi, Guang Li, Takahiro Ogawa, Miki Haseyama , et al. · 2025

Dataset distillation aims to create a compact dataset that retains essential information while maintaining model performance. Diffusion models (DMs) have shown promise for this task but struggle in low images-per-class (IPC) settings, wher…

Task-Specific Generative Dataset Distillation with Difficulty-Guided Sampling Open

Mingzhuo Li, Guang Li, Jiafeng Mao, Linfeng Ye, Takahiro Ogawa , et al. · 2025

To alleviate the reliance of deep neural networks on large-scale datasets, dataset distillation aims to generate compact, high-quality synthetic datasets that can achieve comparable performance to the original dataset. The integration of g…

Hyperbolic Dataset Distillation Open

Wenyuan Li, Guang Li, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama · 2025

To address the computational and storage challenges posed by large-scale datasets in deep learning, dataset distillation has been proposed to synthesize a compact dataset that replaces the original while maintaining comparable model perfor…

Diversity-Driven Generative Dataset Distillation Based on Diffusion Model with Self-Adaptive Memory Open

Mingzhuo Li, Guang Li, Jiafeng Mao, Takahiro Ogawa, Miki Haseyama · 2025

Dataset distillation enables the training of deep neural networks with comparable performance in significantly reduced time by compressing large datasets into small and representative ones. Although the introduction of generative models ha…

Analysis of Model Merging Methods for Continual Updating of Foundation Models in Distributed Data Settings Open

Kenta Kubota, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama · 2025

Foundation models have achieved remarkable success across various domains, but still face critical challenges such as limited data availability, high computational requirements, and rapid knowledge obsolescence. To address these issues, we…

Enhancing Adversarial Defense via Brain Activity Integration Without Adversarial Examples Open

Takanori Nakajima, Keisuke Maeda, Ren Togo, Takahiro Ogawa, Miki Haseyama · 2025

Adversarial attacks on large-scale vision–language foundation models, such as the contrastive language–image pretraining (CLIP) model, can significantly degrade performance across various tasks by generating adversarial examples that are i…

Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence Open

Wenbo Huang, Jinghui Zhang, Guang Li, Lei Zhang, Shuoyuan Wang , et al. · 2025

In few-shot action recognition (FSAR), long sub-sequences of video naturally express entire actions more effectively. However, the high computational complexity of mainstream Transformer-based methods limits their application. Recent Mamba…

Personalized Federated Learning for Egocentric Video Gaze Estimation with Comprehensive Parameter Frezzing Open

Yuhu Feng, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama · 2025

Egocentric video gaze estimation requires models to capture individual gaze patterns while adapting to diverse user data. Our approach leverages a transformer-based architecture, integrating it into a PFL framework where only the most sign…

StarMAP: Global Neighbor Embedding for Faithful Data Visualization Open

Koshi Watanabe, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama · 2025

Neighbor embedding is widely employed to visualize high-dimensional data; however, it frequently overlooks the global structure, e.g., intercluster similarities, thereby impeding accurate visualization. To address this problem, this paper …

Answer to the Letter to the Editor from Hinpetch Daungsupawong Concerning "Development of New Surgical Training for Full Endoscopic Surgery Using 3D-Printed Models" Open

Takahiro Ogawa, Masatoshi Morimoto, S Fujimoto, Masaru Tominaga, Yasuyuki Omichi , et al. · 2025

Triplet Synthesis For Enhancing Composed Image Retrieval via Counterfactual Image Generation Open

Kenta Uesugi, Naoki Saito, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama · 2025

Composed Image Retrieval (CIR) provides an effective way to manage and access large-scale visual data. Construction of the CIR model utilizes triplets that consist of a reference image, modification text describing desired changes, and a t…

Expert Comment Generation Considering Sports Skill Level Using a Large Multimodal Model with Video and Spatial-Temporal Motion Features Open

Tatsuki Seino, Naoki Saito, Takahiro Ogawa, Satoshi Asamizu, Miki Haseyama · 2025

In sports training, personalized skill assessment and feedback are crucial for athletes to master complex movements and improve performance. However, existing research on skill transfer predominantly focuses on skill evaluation through vid…

Continual Self-supervised Learning Considering Medical Domain Knowledge in Chest CT Images Open

Ren Tasai, Guang Li, Ren Togo, Minghui Tang, Takaaki Yoshimura , et al. · 2025

We propose a novel continual self-supervised learning method (CSSL) considering medical domain knowledge in chest CT images. Our approach addresses the challenge of sequential learning by effectively capturing the relationship between prev…

Generative Dataset Distillation Based on Self-knowledge Distillation Open

Longzhen Li, Guang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa , et al. · 2025

Dataset distillation is an effective technique for reducing the cost and complexity of model training while maintaining performance by compressing large datasets into smaller, more efficient versions. In this paper, we present a novel gene…

[Paper] Few-shot Personalized Saliency Prediction Based on Interpersonal Gaze Patterns Open

Yuya Moroto, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama · 2025

Exponential Dissimilarity-Dispersion Family for Domain-Specific Representation Learning Open

Ren Togo, Nao Nakagawa, Takahiro Ogawa, Miki Haseyama · 2025

This paper presents a new domain-specific representation learning method, exponential dissimilarity-dispersion family (EDDF), a novel distribution family that includes a dissimilarity function and a global dispersion parameter. In generati…

Cross-Domain Multi-Step Thinking: Zero-Shot Fine-Grained Traffic Sign Recognition in the Wild Open

Yaozong Gan, Guang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa , et al. · 2025

Enhancing Generative Class Incremental Learning Performance With a Model Forgetting Approach Open

Taro Togo, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama · 2025

This study presents a novel approach to Generative Class Incremental Learning (GCIL) by introducing the forgetting mechanism, aimed at dynamically managing class information for better adaptation to streaming data. GCIL is one of the hot t…

Enhancing Classification Models With Sophisticated Counterfactual Images Open

Xiang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama · 2025

In deep learning, training data, which are mainly from realistic scenarios, often carry certain biases. This causes deep learning models to learn incorrect relationships between features when using these training data. However, because the…

Efficacy of Viltolarsen in Improving Motor Function in Patients with Duchenne Muscular Dystrophy Open

Hideyuki Iwayama, Shingo Numoto, Yoshiteru Azuma, Hirokazu Kurahashi, Yumiko Yasue , et al. · 2025

LLM is Knowledge Graph Reasoner: LLM's Intuition-aware Knowledge Graph Reasoning for Cold-start Sequential Recommendation Open

Keigo Sakurai, Ren Togo, Takahiro Ogawa, Miki Haseyama · 2024

Knowledge Graphs (KGs) represent relationships between entities in a graph structure and have been widely studied as promising tools for realizing recommendations that consider the accurate content information of items. However, traditiona…

Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence Open

Wenbo Huang, Jinghui Zhang, Guang Li, Lei Zhang, Shuoyuan Wang , et al. · 2024

In few-shot action recognition (FSAR), long sub-sequences of video naturally express entire actions more effectively. However, the high computational complexity of mainstream Transformer-based methods limits their application. Recent Mamba…

Generalizing Human Motion Style Transfer Method Based on Metadata-independent Learning Open

Yuki Era, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama · 2024

Takahiro Ogawa YOU? Author Swipe