Hyogi Sim
YOU?
Author Swipe
View article: Exploiting user activeness for data retention in HPC systems
Exploiting user activeness for data retention in HPC systems Open
HPC systems typically rely on the fixed-lifetime (FLT) data retention strategy, which only considers temporal locality of data accesses to parallel file systems. However, our extensive analysis based on the leadership-class HPC system trac…
View article: An Analysis of System Balance and Architectural Trends Based on Top500 Supercomputers
An Analysis of System Balance and Architectural Trends Based on Top500 Supercomputers Open
Supercomputer design is a complex, multi-dimensional optimization process, wherein several subsystems need to be reconciled to meet a desired figure of merit performance for a portfolio of applications and a budget constraint. However, ove…
View article: MOSIQS: Persistent Memory Object Storage With Metadata Indexing and Querying for Scientific Computing
MOSIQS: Persistent Memory Object Storage With Metadata Indexing and Querying for Scientific Computing Open
Scientific applications often require high-bandwidth shared storage to perform joint simulations and collaborative data analytics. Shared memory pools provide a chance to satisfy such needs. Recently, a high-speed network such as Gen-Z uti…
View article: Persistent Memory Object Storage and Indexing for Scientific Computing
Persistent Memory Object Storage and Indexing for Scientific Computing Open
This paper presents Mosiqs, a persistent memory object storage framework with metadata indexing and querying for scientific computing. We design Mosiqs based on the key idea that memory objects on shared PM pool can live beyond the applica…
View article: An Analysis of System Balance and Architectural Trends Based on Top500 Supercomputers
An Analysis of System Balance and Architectural Trends Based on Top500 Supercomputers Open
Supercomputer design is a complex, multi-dimensional optimization process, wherein several subsystems need to be reconciled to meet a desired figure of merit performance for a portfolio of applications and a budget constraint. However, ove…
View article: An Integrated Indexing and Search Service for Distributed File Systems
An Integrated Indexing and Search Service for Distributed File Systems Open
Data services such as search, discovery, and management in scalable distributed environments have traditionally been decoupled from the underlying file systems, and are often deployed using external databases and indexing services. However…
View article: US Department of Energy, Office of Science High Performance Computing Facility Operational Assessment 2019 Oak Ridge Leadership Computing Facility
US Department of Energy, Office of Science High Performance Computing Facility Operational Assessment 2019 Oak Ridge Leadership Computing Facility Open
Oak Ridge National Laboratory's (ORNL's) Leadership Computing Facility (OLCF) continues to surpass its operational target goals: supporting users; delivering fast, reliable computational ecosystems; creating innovative solutions for high p…
View article: Customizable Scale-Out Key-Value Stores
Customizable Scale-Out Key-Value Stores Open
Enterprise KV stores are often not well suited for HPC applications, and thus cumbersome end-to-end KV design customization is required to meet the needs of modern HPC applications. To this end, in this article we present bespoKV, an adapt…
View article: Profiling the Usage of an Extreme-Scale Archival Storage System
Profiling the Usage of an Extreme-Scale Archival Storage System Open
Profiling the archival storage system in scientific computing environments has received much less attention compared to the parallel file system, but is equally important since it stores the final data products safely, for a long duration.…
View article: BESPOKV: Application Tailored Scale-Out Key-Value Stores
BESPOKV: Application Tailored Scale-Out Key-Value Stores Open
Enterprise KV stores are not well suited for HPC applications, and entail customization and cumbersome end-to-end KV design to extract the HPC application needs. To this end, in this paper we present BESPOKV, an adaptive, extensible, and s…
View article: An Analysis Workflow-Aware Storage System for Multi-Core Active Flash Arrays
An Analysis Workflow-Aware Storage System for Multi-Core Active Flash Arrays Open
Here, the need for novel data analysis is urgent in the face of a data deluge from modern applications. Traditional approaches to data analysis incur significant data movement costs, moving data back and forth between the storage system an…
View article: SCISPACE: A Scientific Collaboration Workspace for File Systems in Geo-Distributed HPC Data Centers
SCISPACE: A Scientific Collaboration Workspace for File Systems in Geo-Distributed HPC Data Centers Open
Future terabit networks are committed to dramatically improving big data motion between geographically dispersed HPC data centers.The scientific community takes advantage of the terabit networks such as DOE's ESnet and accelerates the tren…
View article: Tagit
Tagit Open
Data services such as search, discovery, and management in scalable distributed environments have traditionally been decoupled from the underlying file systems, and are often deployed using external databases and indexing services. However…
View article: Scientific user behavior and data-sharing trends in a petascale file system
Scientific user behavior and data-sharing trends in a petascale file system Open
The Oak Ridge Leadership Computing Facility (OLCF) runs the No. 4 supercomputer in the world, supported by a petascale file system, to facilitate scientific discovery. In this paper, using the daily file system metadata snapshots collected…
View article: Diving into petascale production file systems through large scale profiling and analysis
Diving into petascale production file systems through large scale profiling and analysis Open
As leadership computing facilities grow their storage capacity into the multi- petabyte range, the number of files and directories leap into the scale of billions. A complete profiling of such a parallel file system in a production environ…
View article: AnalyzeThat: A Programmable Shared-Memory System for an Array of Processing-In-Memory Devices
AnalyzeThat: A Programmable Shared-Memory System for an Array of Processing-In-Memory Devices Open
Processing In Memory (PIM), the concept of integrating processing directly with memory, has been attracting a lot of attention since PIM can assist in overcoming the throughput limitation caused by data movement between CPU and memory. The…