Software versioning ≈ Software versioning
View article
Technical Report on the CleverHans v2.1.0 Adversarial Examples Library Open
CleverHans is a software library that provides standardized reference implementations of adversarial example construction techniques and adversarial training. The library may be used to develop more robust machine learning models and to pr…
View article
cleverhans v0.1: an adversarial machine learning library. Open
CleverHans is a software library that provides standardized reference implementations of adversarial example construction techniques and adversarial training. The library may be used to develop more robust machine learning models and to pr…
View article
Managing Messes in Computational Notebooks Open
Data analysts use computational notebooks to write code for analyzing and visualizing data. Notebooks help analysts iteratively write analysis code by letting them interleave code with output, and selectively execute cells. However, as ana…
View article
Variolite Open
How do people ideate through code? Using semi-structured interviews and a survey, we studied data scientists who program, often with small scripts, to experiment with data. These studies show that data scientists frequently code new analys…
View article
ARENA: An Approach for the Automated Generation of Release Notes Open
Release notes document corrections, enhancements, and, in general, changes that were implemented in a new release of a software project. They are usually created manually and may include hundreds of different items, such as descriptions of…
View article
Innovation policies in the digital age Open
This paper looks at how digitalisation is transforming innovation, and the consequent need for innovation policies to adapt. The paper shows that the digital transformation affects the economics of information and knowledge, in particular …
View article
BIDS Apps: Improving ease of use, accessibility, and reproducibility of neuroimaging data analysis methods Open
The rate of progress in human neurosciences is limited by the inability to easily apply a wide range of analysis methods to the plethora of different datasets acquired in labs around the world. In this work, we introduce a framework for cr…
View article
Bottom-Up Technologies for Reuse: Automated Extractive Adoption of Software Product Lines Open
International audience
View article
Bringing your tools to CyVerse Discovery Environment using Docker Open
Docker has become a very popular container-based virtualization platform for software distribution that has revolutionized the way in which scientific software and software dependencies (software stacks) can be packaged, distributed, and d…
View article
Decibel Open
As scientific endeavors and data analysis become increasingly collaborative, there is a need for data management systems that natively support the versioning or branching of datasets to enable concurrent analysis, cleaning, integration, ma…
View article
Geoweaver: Advanced Cyberinfrastructure for Managing Hybrid Geoscientific AI Workflows Open
AI (artificial intelligence)-based analysis of geospatial data has gained a lot of attention. Geospatial datasets are multi-dimensional; have spatiotemporal context; exist in disparate formats; and require sophisticated AI workflows that i…
View article
DVID: Distributed Versioned Image-Oriented Dataservice Open
Open-source software development has skyrocketed in part due to community tools like github.com, which allows publication of code as well as the ability to create branches and push accepted modifications back to the original repository. As…
View article
Data Platform for Machine Learning Open
In this paper, we present a purpose-built data management system, MLdp, for all machine learning (ML) datasets. ML applications pose some unique requirements different from common conventional data processing applications, including but no…
View article
CAVE: Connectome Annotation Versioning Engine Open
Advances in Electron Microscopy, image segmentation and computational infrastructure have given rise to large-scale and richly annotated connectomic datasets which are increasingly shared across communities. To enable collaboration, users …
View article
CERN Analysis Preservation: A Novel Digital Library Service to Enable Reusable and Reproducible Research Open
The latest policy developments require immediate action for data preservation, as well as reproducible and Open Science. To address this, an unprecedented digital library service is presented to enable the High-Energy Physics community to …
View article
Operationalizing Machine Learning: An Interview Study Open
Organizations rely on machine learning engineers (MLEs) to operationalize ML, i.e., deploy and maintain ML pipelines in production. The process of operationalizing ML, or MLOps, consists of a continual loop of (i) data collection and label…
View article
Has My Release Disobeyed Semantic Versioning? Static Detection Based on Semantic Differencing Open
To enhance the compatibility in the version control of Java Third-party Libraries (TPLs), Maven adopts Semantic Versioning (SemVer) to standardize the underlying meaning of versions, but users could still confront abnormal execution and cr…
View article
Modeling History to Understand Software Evolution Open
Over the past three decades, more and more research has been spent on understanding software evolution. The development and spread of versioning systems made valuable data available for study. Indeed, versioning systems provide rich inform…
View article
A Parallel World Framework for scenario analysis in knowledge graphs Open
This paper presents Parallel World Framework as a solution for simulations of complex systems within a time-varying knowledge graph and its application to the electric grid of Jurong Island in Singapore. The underlying modeling system is b…
View article
Adapting Behavior Driven Development (BDD) for large-scale software systems Open
Context: Large-scale software projects require interaction between many stakeholders. Behavior-driven development (BDD) facilitates collaboration between stakeholders, and an adapted BDD process can help improve cooperation in a large-scal…
View article
The Lens MetaRecord and LensID: An open identifier system for aggregated metadata and versioning of knowledge artefacts Open
Ambiguity is inherent in the digital records of entities such as patents, scholarly works, human names, or institutions. While we have made some progress to preserve each entity’s one to one relationship using open persistent identifiers, …
View article
ForkBase: An Efficient Storage Engine for Blockchain and Forkable Applications Open
Existing data storage systems offer a wide range of functionalities to accommodate an equally diverse range of applications. However, new classes of applications have emerged, e.g., blockchain and collaborative analytics, featuring data ve…
View article
Open Educational Resources and Institutional Repositories Open
This installment of the Balance Point column delves into the ways in which libraries create and store open educational resources (OER) in institutional repositories (IR), addressing issues such as preservation and versioning of OER content…
View article
Quality Assurance for Open Educational Resources: The OERTrust Framework Open
Learning Objects have met some barriers to their development and effective adoption, which varied from the lack of quality assurance mechanisms to the impossibility of editing and adapting most of them to real teaching and learning context…
View article
Software Evolution in Time and Space: Unifying Version and Variability Management (Dagstuhl Seminar 19191) Open
Effectively managing versions and variants of software systems are among the main challenges of software engineering. Over the last decades, two large research fields, Software Configuration Management (SCM) and Software Product Line Engin…
View article
Versioning Data Is About More than Revisions: A Conceptual Framework and Proposed Principles Open
A dataset, small or big, is often changed to correct errors, apply new algorithms, or add new data (e.g., as part of a time series), etc. In addition, datasets might be bundled into collections, distributed in different encodings or mirror…
View article
A Guide to Using GitHub for Developing and Versioning Data Standards and Reporting Formats Open
Data standardization combined with descriptive metadata facilitate data reuse, which is the ultimate goal of the Findable, Accessible, Interoperable, and Reusable (FAIR) principles. Community data or metadata standards are increasingly cre…
View article
xSDK Foundations: Toward an Extreme-scale Scientific Software Development Kit Open
Here, extreme-scale computational science increasingly demands multiscale and multiphysics formulations. Combining software developed by independent groups is imperative: no single team has resources for all predictive science and decision…
View article
Using Annotations for Sensemaking About Code Open
Developers spend significant amounts of time finding, relating, navigating, and, more broadly, making sense of code. While sensemaking, developers must keep track of many pieces of information including the objectives of their task, the co…
View article
Fintech Achieving Sustainable Development: The Side Perspective of Crowdfunding Platform Open
Crowdfunding has gradually been an emerging financing model of raising small amounts of money from a large number of people, mostly by means of optimizing certain website. The article proposes to explore the development of crowdfunding pla…