Data context informed data wrangling
December 2017 • Martin Koehler, Alex Bogatu, Cristina Civili, Νικόλαος Κωνσταντίνου, E. W. Abel, Alvaro A. A. Fernandes, John Keane, Leonid Libkin, Norman W. Paton
The process of preparing potentially large and complex data sets for further analysis or manual examination is often called data wrangling. In classical warehousing environments, the steps in such a process have been carried out using Extract-Transform-Load platforms, with significant manual involvement in specifying, configuring or tuning many of them. Cost-effective data wrangling processes need to ensure that data wrangling steps benefit from automation wherever possible. In this paper, we define a methodology …