Improve Data Quality by Processing Null Values and Semantic Dependencies - Journal of Computer and Communications

JCC > Vol.4 No.5, May 2016

Improve Data Quality by Processing Null Values and Semantic Dependencies ()

HTML XML

Download as PDF (Size: 284KB) PP. 78-85

DOI: 10.4236/jcc.2016.45012 2,025 Downloads 2,957 Views Citations

Author(s)

Houda Zaidi^1,2, Faouzi Boufarès³, Yann Pollet¹

Affiliation(s)

¹Laboratory CEDRIC, Conservatoire National des Arts et Métiers, Paris, France.
²Laboratory RIADI, University Manouba, Tunis, Tunisia.
³Laboratory LIPN, University Paris 13, Sorbonne Paris Cité, Villetaneuse, France.

ABSTRACT

Today, the quantity of data continues to increase, furthermore, the data are heterogeneous, from multiple sources (structured, semi-structured and unstructured) and with different levels of quality. Therefore, it is very likely to manipulate data without knowledge about their structures and their semantics. In fact, the meta-data may be insufficient or totally absent. Data Anomalies may be due to the poverty of their semantic descriptions, or even the absence of their description. In this paper, we propose an approach to better understand the semantics and the structure of the data. Our approach helps to correct automatically the intra-column anomalies and the inter-col- umns ones. We aim to improve the quality of data by processing the null values and the semantic dependencies between columns.

KEYWORDS

Data Quality, Big Data, Contextual Semantics, Semantic Dependencies, Functional Dependencies, Null Values, Data Cleaning

Share and Cite:

Zaidi, H. , Boufarès, F. and Pollet, Y. (2016) Improve Data Quality by Processing Null Values and Semantic Dependencies. Journal of Computer and Communications, 4, 78-85. doi: 10.4236/jcc.2016.45012.

Journals Menu

Follow SCIRP

	+1 323-425-8868
	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals Menu

Home

About SCIRP

Service

Policies