What is it about?

Data quality in companies is decisive and critical to the benefits their products and services can provide. However, in heterogeneous IT infrastructures where, e.g., different applications for Enterprise Resource Planning (ERP), Customer Relationship Management (CRM), product management, manufacturing, and marketing are used, duplicates, e.g., multiple entries for the same customer or product in a database or information system, occur. There can be several reasons for this, but the result of non-unique or duplicate records is a degraded data quality. This ultimately leads to poorer, inefficient, and inaccurate data-driven decisions. For this reason, in this paper, we develop a conceptual data governance framework for effective and efficient management of duplicate data, and improvement of data accuracy and consistency in large data ecosystems. We present methods and recommendations for companies to deal with duplicate data in a meaningful way.

Featured Image

Read the Original

This page is a summary of: Overlooked Aspects of Data Governance: Workflow Framework For Enterprise Data Deduplication, June 2023, Institute of Electrical & Electronics Engineers (IEEE),
DOI: 10.1109/iccns58795.2023.10193478.
You can read the full text:

Read

Contributors

The following have contributed to this page