Improve Data Quality by Processing Null Values and Semantic Dependencies

HTML  XML Download Download as PDF (Size: 284KB)  PP. 78-85  
DOI: 10.4236/jcc.2016.45012    2,025 Downloads   2,957 Views  Citations

ABSTRACT

Today, the quantity of data continues to increase, furthermore, the data are heterogeneous, from multiple sources (structured, semi-structured and unstructured) and with different levels of quality. Therefore, it is very likely to manipulate data without knowledge about their structures and their semantics. In fact, the meta-data may be insufficient or totally absent. Data Anomalies may be due to the poverty of their semantic descriptions, or even the absence of their description. In this paper, we propose an approach to better understand the semantics and the structure of the data. Our approach helps to correct automatically the intra-column anomalies and the inter-col- umns ones. We aim to improve the quality of data by processing the null values and the semantic dependencies between columns.

Share and Cite:

Zaidi, H. , Boufarès, F. and Pollet, Y. (2016) Improve Data Quality by Processing Null Values and Semantic Dependencies. Journal of Computer and Communications, 4, 78-85. doi: 10.4236/jcc.2016.45012.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.