Hong‐Jun Yoon
YOU?
Author Swipe
View article: ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling
ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling Open
Sparse observations and coarse-resolution climate models limit effective regional decision-making, underscoring the need for robust downscaling. However, existing AI methods struggle with generalization across variables and geographies and…
View article: A Decision Support System to Compile Environmental Mitigations from Hydropower Licensing Documents
A Decision Support System to Compile Environmental Mitigations from Hydropower Licensing Documents Open
The process of deciphering, extracting, and compiling information from texts dense with domain-specific terminology and technical jargon is a challenging endeavor. It demands considerable expertise and deep knowledge in the respective fiel…
View article: Improving Text Classification with Large Language Model-Based Data Augmentation
Improving Text Classification with Large Language Model-Based Data Augmentation Open
Large Language Models (LLMs) such as ChatGPT possess advanced capabilities in understanding and generating text. These capabilities enable ChatGPT to create text based on specific instructions, which can serve as augmented data for text cl…
View article: Enhancing Diagnosis through AI-driven Analysis of Reflectance Confocal Microscopy
Enhancing Diagnosis through AI-driven Analysis of Reflectance Confocal Microscopy Open
Reflectance Confocal Microscopy (RCM) is a non-invasive imaging technique used in biomedical research and clinical dermatology. It provides virtual high-resolution images of the skin and superficial tissues, reducing the need for physical …
View article: Enhancing diagnosis through AI-driven analysis of reflectance confocal microscopy
Enhancing diagnosis through AI-driven analysis of reflectance confocal microscopy Open
Reflectance Confocal Microscopy (RCM) is a non-invasive imaging technique used in biomedical research and clinical dermatology. It provides virtual high-resolution images of the skin and superficial tissues, reducing the need for physical …
View article: HyperKube: A Kubernetes based system for the automation of processing and analysis of hyperspectral data obtained from multiple hyperspectral imaging systems
HyperKube: A Kubernetes based system for the automation of processing and analysis of hyperspectral data obtained from multiple hyperspectral imaging systems Open
Hyperspectral imagery is an emerging field of technology that has enormous potential for remote and proximal sensing in numerous areas of research. The plant phenotyping community is applying this technology to advance the throughput and a…
View article: HyperKube: A Kubernetes Based System for the Automation of Processing and Analysis of Hyperspectral Data Obtained from Multiple Hyperspectral Imaging Systems
HyperKube: A Kubernetes Based System for the Automation of Processing and Analysis of Hyperspectral Data Obtained from Multiple Hyperspectral Imaging Systems Open
Hyperspectral imagery is an emerging field of technology that has enormous potential for remote and proximal sensing in numerous areas of research. The plant phenotyping community is applying this technology to advance the throughput and a…
View article: Leveraging hyperspectral imaging to identify drought tolerant Populus species and genotypes within species
Leveraging hyperspectral imaging to identify drought tolerant Populus species and genotypes within species Open
The aim of this study was to identity variation in drought tolerance across genotypes of Populus deltoides, Populus trichocarpa, and hybrids of the two species. A panel of 102 Populus genotypes, comprising 37 genotypes of P. trichocarpa, 3…
View article: Ultra-Long Sequence Distributed Transformer
Ultra-Long Sequence Distributed Transformer Open
Transformer models trained on long sequences often achieve higher accuracy than short sequences. Unfortunately, conventional transformers struggle with long sequence training due to the overwhelming computation and memory requirements. Exi…
View article: Enhancing Text Classification Models with Generative AI-aided Data Augmentation
Enhancing Text Classification Models with Generative AI-aided Data Augmentation Open
This study investigated the potential of enhancing the performance of text classification by augmenting the training dataset with external knowledge samples generated by a generative AI, specifically ChatGPT. The study conducted experiment…
View article: Scaling Resolution of Gigapixel Whole Slide Images Using Spatial Decomposition on Convolutional Neural Networks
Scaling Resolution of Gigapixel Whole Slide Images Using Spatial Decomposition on Convolutional Neural Networks Open
Gigapixel images are prevalent in scientific domains ranging from remote sensing, and satellite imagery to microscopy, etc. However, training a deep learning model at the natural resolution of those images has been a challenge in terms of …
View article: A comparison of histopathology imaging comprehension algorithms based on multiple instance learning
A comparison of histopathology imaging comprehension algorithms based on multiple instance learning Open
Whole slide imaging (WSI), also called digital virtual microscopy, is a new imaging modality. It allows for the application of AI and machine learning methods to cancer pathology to help establish a means for the automatic diagnosis of can…
View article: Distilling Knowledge from Ensembles of Cluster-Constrained-Attention Multiple-Instance Learners for Whole Slide Image Classification
Distilling Knowledge from Ensembles of Cluster-Constrained-Attention Multiple-Instance Learners for Whole Slide Image Classification Open
The peculiar nature of whole slide imaging (WSI), digitizing conventional glass slides to obtain multiple high resolution images which capture microscopic details of a patient’s histopathological features, has garnered increased interest f…
View article: Identification of Novel, Replicable Genetic Risk Loci for Suicidal Thoughts and Behaviors Among US Military Veterans
Identification of Novel, Replicable Genetic Risk Loci for Suicidal Thoughts and Behaviors Among US Military Veterans Open
Importance Suicide is a leading cause of death; however, the molecular genetic basis of suicidal thoughts and behaviors (SITB) remains unknown. Objective To identify novel, replicable genomic risk loci for SITB. Design, Setting, and Partic…
View article: ODP070 A deep learning-based osteopenia and osteoporosis screening system with lateral lumbar spine X-ray in women aged 50 years or older and men aged 60 years or older
ODP070 A deep learning-based osteopenia and osteoporosis screening system with lateral lumbar spine X-ray in women aged 50 years or older and men aged 60 years or older Open
Bone mineral density (BMD) measured by Dual-energy X-ray absorptiometry (DXA) is the gold standard to diagnose osteopenia or osteoporosis. However, this method is not always available and underutilized to screen for osteoporosis. An artifi…
View article: Quantitative Measurement of Pneumothorax Using Artificial Intelligence Management Model and Clinical Application
Quantitative Measurement of Pneumothorax Using Artificial Intelligence Management Model and Clinical Application Open
Artificial intelligence (AI) techniques can be a solution for delayed or misdiagnosed pneumothorax. This study developed, a deep-learning-based AI model to estimate the pneumothorax amount on a chest radiograph and applied it to a treatmen…
View article: Using ensembles and distillation to optimize the deployment of deep learning models for the classification of electronic cancer pathology reports
Using ensembles and distillation to optimize the deployment of deep learning models for the classification of electronic cancer pathology reports Open
Lay Summary One of the goals of the Surveillance, Epidemiology, and End Results (SEER) program is to estimate incidence, prevalence, and mortality of all cancers. To that end, cancer registries across the country maintain a massive databas…
View article: A Scalable Pipeline for Gigapixel Whole Slide Imaging Analysis on Leadership Class HPC Systems
A Scalable Pipeline for Gigapixel Whole Slide Imaging Analysis on Leadership Class HPC Systems Open
Whole Slide Imaging (WSI) captures microscopic details of a patient's histopathological features at multiple res-olutions organized across different levels. Images produced by WSI are gigapixel-sized, and saving a single image in memory re…
View article: Automatic information extraction from childhood cancer pathology reports
Automatic information extraction from childhood cancer pathology reports Open
Objectives The International Classification of Childhood Cancer (ICCC) facilitates the effective classification of a heterogeneous group of cancers in the important pediatric population. However, there has been no development of machine le…
View article: CTSA MHRI Datasets
CTSA MHRI Datasets Open
Oak Ridge National Laboratory (ORNL) has collaborated with MedStar Health Research Institute (MHRI) to develop, test, and validate health outcomes using electronic health records (EHRs) from hospitals associated with participating Clinical…
View article: Image transformers for classifying acute lymphoblastic leukemia
Image transformers for classifying acute lymphoblastic leukemia Open
Cancer is the leading cause of death by disease in American children. Each year, nearly 16,000 children in the United States and over 300,000 children globally are diagnosed with cancer. Leukemia is a form of blood cancer that originates i…
View article: Optimal vocabulary selection approaches for privacy-preserving deep NLP model training for information extraction and cancer epidemiology
Optimal vocabulary selection approaches for privacy-preserving deep NLP model training for information extraction and cancer epidemiology Open
BACKGROUND: With the use of artificial intelligence and machine learning techniques for biomedical informatics, security and privacy concerns over the data and subject identities have also become an important issue and essential research t…
View article: A Keyword-Enhanced Approach to Handle Class Imbalance in Clinical Text Classification
A Keyword-Enhanced Approach to Handle Class Imbalance in Clinical Text Classification Open
Recent applications ofdeep learning have shown promising results for classifying unstructured text in the healthcare domain. However, the reliability of models in production settings has been hindered by imbalanced data sets in which a sma…
View article: Optimal vocabulary selection approaches for privacy-preserving deep NLP model training for information extraction and cancer epidemiology.
Optimal vocabulary selection approaches for privacy-preserving deep NLP model training for information extraction and cancer epidemiology. Open
The comparison outcomes suggest that the proposed vocabulary selection methods resulted in lower privacy vulnerability while maintaining the same level of clinical task performance.
View article: Creating a Tools Ecosystem for Cross-Discipline Environmental Data Reuse
Creating a Tools Ecosystem for Cross-Discipline Environmental Data Reuse Open
Reusing data is difficult even within well-defined science communities and only gets worse when combining data from multiple communities and disciplines. Through the lens of current work on constructing an environmental epidemiological dat…