Arnab K. Paul
UnifyFL: Enabling Decentralized Cross-Silo Federated Learning
Federated Learning (FL) is a decentralized machine learning (ML) paradigm in which models are trained on private data across several devices called clients and combined at a single node called an aggregator rather than aggregating the data…
Benchmarking Mutual Information-based Loss Functions in Federated Learning
Federated Learning (FL) has attracted considerable interest due to growing privacy concerns and regulations like the General Data Protection Regulation (GDPR), which stresses the importance of privacy-preserving and fair machine learning a…
Factors Affecting the Duration of Hospital Stay in Newborn with Hyperbilirubinemia Admitted in a Tertiary Care Pediatric Hospital of Kolkata
Introduction: Despite being a temporary condition, neonatal jaundice is still the most common cause of hospitalization in the first week of life. Physiological jaundice is due to the developmental insufficiency of bilirubin uptake, transpo…
When Less is More: Achieving Faster Convergence in Distributed Edge Machine Learning
Distributed Machine Learning (DML) on resource-constrained edge devices holds immense potential for real-world applications. However, achieving fast convergence in DML in these heterogeneous environments remains a significant challenge. Tr…
Tarazu: An Adaptive End-to-end I/O Load-balancing Framework for Large-scale Parallel File Systems
The imbalanced I/O load on large parallel file systems affects the parallel I/O performance of high-performance computing (HPC) applications. One of the main reasons for I/O imbalances is the lack of a global view of system-wide resource c…
An End-to-end High-performance Deduplication Scheme for Docker Registries and Docker Container Storage Systems
The wide adoption of Docker containers for supporting agile and elastic enterprise applications has led to a broad proliferation of container images. The associated storage performance and capacity requirements place a high pressure on the…
Analyzing File Access Patterns on Large-Scale HPC Systems: Opportunities for File Prefetching
This paper explores the potential opportunities for implementing file prefetching techniques on large-scale high-performance computing (HPC) systems. Specifically, we investigate the file access patterns of various applications across mult…
Role of Sildenafil in Severe COVID-19 Pneumonia in Infancy - A Case Series
SARS-CoV-2 virus primarily affects the respiratory system, although other organ systems are also involved. Though pulmonary arterial hypertension (PAH) has been described as a sequela of COVID pneumonia in adults, only a coincidental as…
I/O performance analysis of machine learning workloads on leadership scale supercomputer
Hvac: Removing I/O Bottleneck for Large-Scale Deep Learning Applications
Scientific communities are increasingly adopting deep learning (DL) models in their applications to accelerate scientific discovery processes. However, with rapid growth in the computing capabilities of HPC supercomputers, large-scale DL a…
Machine Learning Assisted HPC Workload Trace Generation for Leadership Scale Storage Systems
Monitoring and analyzing a wide range of I/O activities in an HPC cluster is important in maintaining mission-critical performance in a large-scale, multi-user, parallel storage system. Center-wide I/O traces can provide high-level informa…
Access Patterns and Performance Behaviors of Multi-layer Supercomputer I/O Subsystems under Production Load
Scientific computing workloads at HPC facilities have been shifting from traditional numerical simulations to AI/ML applications for training and inference while processing and producing ever-increasing amounts of scientific data. To addre…
Smoky Mountain Data Challenge 2021: An Open Call to Solve Scientific Data Challenges Using Advanced Data Analytics and Edge Computing
April 2020 Darshan counters from the Summit supercomputer
This dataset contains the Darshan counters collected from the Summit supercomputer during April 2020. 1. Description of methods used for collection/generation of data: Job submitted on Summit HPC system when completed successfully and has…
Characterizing Machine Learning I/O Workloads on Leadership Scale HPC Systems
High performance computing (HPC) is no longer solely limited to traditional workloads such as simulation and modeling. With the increase in the popularity of machine learning (ML) and deep learning (DL) technologies, we are observing that …
Parallel I/O Evaluation Techniques and Emerging HPC Workloads: A Perspective
Emerging workloads such as artificial intelligence, big data analytics and complex multi-step workflows alongside future exascale applications are anticipated future HPC workloads, which will result in a more diverse I/O system workload an…
SMC 2021 Data Challenge: Analyzing Resource Utilization and User Behavior on Titan Supercomputer
Resource utilization statistics of submitted jobs on a supercomputer can help us understand how users from various scientific domains use HPC platforms and better design a job scheduler. We explore how to generate insight regarding workload di…
Understanding HPC Application I/O Behavior Using System Level Statistics
The processor performance of high performance computing (HPC) systems is increasing at a much higher rate than storage performance. This imbalance leads to I/O performance bottlenecks in massively parallel HPC applications. Therefore, ther…
An Application-Attuned Framework for Optimizing HPC Storage Systems
High performance computing (HPC) is routinely employed in diverse domains, such as life sciences and geology, to simulate and understand the behavior of complex phenomena. Big data driven scientific simulations are resource intensive and r…
Generalized Weighted Exponential Similarity Measures of Single Valued Neutrosophic Sets
In this paper, we introduce some new generalized weighted similarity measures based on the exponential functions defined on truth-membership function, indeterminacy membership function and falsity membership function of a single valued neu…
iez: Resource Contention Aware Load Balancing for Large-Scale Parallel File Systems
Parallel I/O performance is crucial to sustaining scientific applications on large-scale High-Performance Computing (HPC) systems. However, I/O load imbalance in the underlying distributed and shared storage systems can significantly reduc…
I/O load balancing for big data HPC applications
High Performance Computing (HPC) big data problems require efficient distributed storage systems. However, at scale, such storage systems often experience load imbalance and resource contention due to two factors: the bursty nature of scie…