Applied Sciences • Vol 11 • No 2
Data Quality Measures and Efficient Evaluation Algorithms for Large-Scale High-Dimensional Data
January 2021 • Hyeongmin Cho, Sangkyun Lee
Machine learning has proven effective in various application areas, such as object and speech recognition on mobile systems. Since a key to machine learning success is the availability of large training data, many datasets are being disclosed and published online. From a data consumer's or manager's point of view, measuring data quality is an important first step in the learning process: we need to determine which datasets to use, update, and maintain. However, not many practical ways to measure da…