doi.org
November 2017 • Hyogi Sim, Youngjae Kim, Sudharshan S. Vazhkudai, Geoffroy Vallée, Seung–Hwan Lim, Ali R. Butt
Data services such as search, discovery, and management in scalable distributed environments have traditionally been decoupled from the underlying file systems, and are often deployed using external databases and indexing services. However, modern data production rates, looming data movement costs, and the lack of metadata, entail revisiting the decoupled file system-data services design philosophy.In this paper, we present TagIt, a scalable data management service framework aimed at scientific datasets, which is …