• Login
    View Item 
    •   DSpace@RPI Home
    • Tetherless World Constellation
    • Tetherless World Publications
    • View Item
    •   DSpace@RPI Home
    • Tetherless World Constellation
    • Tetherless World Publications
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Data Cleansing

    Author
    Huang, Fang
    Thumbnail
    Other Contributors
    Date Issued
    2019-01-01
    Degree
    Terms of Use
    Full Citation
    Fang Huang (2019) Data Cleansing. In: Schintler L., McNeely C. (eds) Encyclopedia of Big Data. Springer, Cham. https://doi.org/10.1007/978-3-319-32001-4_300-1
    Metadata
    Show full item record
    URI
    https://doi.org/10.1007/978-3-319-32001-4_300-1; https://hdl.handle.net/20.500.13015/6596
    Abstract
    Data cleansing, also known as data cleaning, is the process of identifying and addressing problems in raw data to improve data quality (Fox 2018). Data quality is broadly defined as the precision and accuracy of data, which can significantly influence the information interpreted from the data (Broeck et al. 2005). Data quality issues usually involve inaccurate, unprecise, and/or incomplete data. Additionally, large amounts of data are being produced every day, and the intrinsic complexity and diversity of the data result in many quality issues. To extract useful information, data cleansing is an essential step in a data life cycle.;
    Department
    Publisher
    Encyclopedia of Big Data
    Relationships
    Access
    Collections
    • Tetherless World Publications

    Browse

    All of DSpace@RPICommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    Login

    DSpace software copyright © 2002-2023  DuraSpace
    Contact Us | Send Feedback
    DSpace Express is a service operated by 
    Atmire NV