Beth Plale
Vector embedding of multi-modal texts: a tool for discovery?
Computer science texts are particularly rich in both narrative content and illustrative charts, algorithms, images, annotated diagrams, etc. This study explores the extent to which vector-based multimodal retrieval, powered by vision-langu…
Creating intelligent cyberinfrastructure for democratizing AI
Artificial intelligence (AI) has the potential for vast societal and economic gain; yet applications are developed in a largely ad hoc manner, lacking coherent, standardized, modular, and reusable infrastructures. The NSF‐funded Intelligen…
CKN: An Edge AI Distributed Framework
The CKN framework supports AI at the Edge through a monitoring framework that builds and maintains a context for an edge application. Its objective is to maximize user experience. In this brief abstract, we introduce the CKN framework and …
Assessing the FAIR Digital Object Framework for Global Biodiversity Research
In the first decades of the 21st century, there has been a global trend towards digitisation and the mobilisation of data from natural history museums and research institutions. The development of national and international aggregator sys…
Cybershuttle: An End-to-End Cyberinfrastructure Continuum to Accelerate Discovery in Science and Engineering
We introduce Cybershuttle, a novel user-facing cyberinfrastructure that offers researchers seamless access to various resources, thereby enhancing their productivity.
Policy recommendations to ensure that research software is openly accessible and reusable
Research data is optimized when it can be freely accessed and reused. To maximize research equity, transparency, and reproducibility, policymakers should take concrete steps to ensure that research software is openly accessible and reusabl…
CKN Edge AI Dataset for Image inference at the Edge (CEAD)
This synthetic workload models camera device requests for resource constrained inference requests at the Edge for Campaign Knowledge Network evaluation. The workload is a deterministic and pre-ordered set of time windows containing close t…
Campaign Knowledge Network: Building Knowledge for Campaign Efficiency
In the landscape of exascale computing, collaborative research campaigns are conducted as co-design activities of loosely coordinated experiments. But the higher level context and the knowledge of individual experimental activity is lost ov…
Reproducibility Practice in High-Performance Computing: Community Survey Results
The integrity of science and engineering research is grounded in assumptions of rigor and transparency on the part of those engaging in such research. HPC community efforts to strengthen rigor and transparency take the form of reproducibili…
Reproducibility Practice in High Performance Computing: Community Survey Results
The integrity of science and engineering research is grounded in assumptions of rigor and transparency on the part of those engaging in such research. HPC community efforts to strengthen rigor and transparency take the form of reproducibili…
SC Transparency and Reproducibilty Community Survey
Results of a survey administered to the SC conference community in August 2020 to all those who had participated in SC17, SC18, or SC19 technical programs. The survey participants were self-selected among 9,949 unique individuals. 204 indi…
Transparency and Reproducibility Practice in Large-Scale Computational Science: A Preface to the Special Section
With this special section we bring you a practice and experience effort in transparency and reproducibility for large-scale computational science. A unique section, it consists of a research work plus six critiques, each by a student team t…
Fostering Interdisciplinary Data Cultures through Early Career Development: The RDA/US Data Share Fellowship
Openness and interdisciplinarity in research and data are among the challenges that are frequently discussed in the context of changing scientific and scholarly practices. Gradually, the visions of open and widely shared data are being rec…
Corrigendum to: Rice Galaxy: an open resource for plant science
Pilot evaluation of Collection API with PID Kernel Information
As digital data become increasingly available for research, there is a growing awareness of the value of domain agnostic Persistent Identifiers (PIDs) for data. A PID is a globally unique reference to a digital object, which in our case is…
Rice Galaxy: an open resource for plant science
Background: Rice molecular genetics, breeding, genetic diversity, and allied research (such as rice-pathogen interaction) have adopted sequencing technologies and high-density genotyping platforms for genome variation analysis and gene disc…
Reliable Access to Massive Restricted Texts: Experience-based Evaluation
Libraries are seeing growing numbers of digitized textual corpora that frequently come with restrictions on their content. Computational analysis corpora that are large, while of interest to scholars, can be cumbersome because of the combi…
Safe Open Science for Restricted Data
Open science is prompting wide efforts to make data from research available for broader use. However, sharing data is complicated by important protections on the data (e.g., protections of privacy and intellectual property). The spectrum o…
Intelligent systems for geosciences
A research agenda for intelligent systems that will result in fundamental new capabilities for understanding the Earth system.
Rice Galaxy: an open resource for plant science
Background: Rice molecular genetics, breeding, genetic diversity, and allied research (such as rice-pathogen interaction) have adopted sequencing technologies and high density genotyping platforms for genome variation analysis and gene disc…
Workset Creation for Scholarly Analysis and Data Capsules (WCSA+DC): Laying the foundations for secure computation with copyrighted data in the HathiTrust Research Center, Phase I
The primary objective of the WCSA+DC project is the seamless integration of the workset model and tools with the Data Capsule framework to provide non-consumptive research access to HathiTrust’s massive corpus of data objects, securely and at…
Pacific Rim Applications and Grid Middleware Assembly (PRAGMA): International clouds for data science
This special issue presents a selection of research emerging from the Pacific Rim Applications and Grid Middleware Assembly (PRAGMA), an assembly of center-scale organizations around the Pacific Rim with membership from Australia, China, I…
Grand Challenge of Indiana Water: Estimate of Compute and Data Storage Needs
This study is undertaken to assess the computational and storage needs for a large-scale research activity to study water in the State of Indiana. It draws its data and compute numbers from the Vortex II Forecast Data study of 2010 carried…