What is it about?
Potential data issues (i.e., smelly data) are typically hard to grasp and detect. We address this by first providing a sound concept of such smelly data. Secondly, we present a catalog of the most common types of such issues together with concrete detection techniques.
Featured Image
Photo by Markus Spiske on Unsplash
Why is it important?
Potential data issues can have serious consequences for almost every data-driven application. Being aware of them and detecting them is thus of utmost importance.
Read the Original
This page is a summary of: Data smells, May 2022, ACM (Association for Computing Machinery),
DOI: 10.1145/3522664.3528590.
You can read the full text:
Contributors
The following have contributed to this page