Abstract
Data cleansing, also known as data cleaning, is the process of identifying and addressing problems in raw data to improve data quality (Fox 2018). Data quality is broadly defined as the precision and accuracy of data, which can significantly influence the information interpreted from the data (Broeck et al. 2005). Data quality issues usually involve inaccurate, unprecise, and/or incomplete data. Additionally, large amounts of data are being produced every day, and the intrinsic complexity and diversity of the data result in many quality issues. To extract useful information, data cleansing is an essential step in a data life cycle.;
Department
Publisher
Encyclopedia of Big Data
Relationships
Access