TITLE:
Semantic Recognition of a Data Structure in Big-Data
AUTHORS:
Aïcha Ben Salem, Faouzi Boufares, Sebastiao Correia
KEYWORDS:
Data Quality, Big-Data, Semantic Data Profiling, Data Dictionary, Regular Expressions, Ontology
JOURNAL NAME:
Journal of Computer and Communications, Vol.2 No.9, July 11, 2014
ABSTRACT:
Data governance is becoming increasingly important in business and
government. Good data governance improves interactions between employees of
one or more organizations. Data quality represents a great challenge because
the cost of non-quality can be very high, so ensuring data quality becomes an
absolute necessity within an organization. To improve data quality in a
Big-Data source, our purpose in this paper is to add semantics to the data and
help users recognize the Big-Data schema. The originality of this approach
lies in the semantic aspect it offers: it detects issues in the data and
proposes a data schema by applying semantic data profiling.
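
To make the idea of semantic data profiling more concrete, the following is a minimal sketch, not the authors' implementation. It assumes that each value in a column can be assigned a category either by lookup in a small data dictionary or by matching a regular expression, that the most frequent category is proposed as the column's semantic type, and that values falling outside it are reported as potential issues. The names PATTERNS, DICTIONARY, categorize and profile_column, as well as the sample categories and values, are hypothetical.

```python
import re
from collections import Counter

# Hypothetical regular expressions for syntactic categories (assumption).
PATTERNS = {
    "email": re.compile(r"[^@\s]+@[^@\s]+\.[^@\s]+"),
    "date": re.compile(r"\d{4}-\d{2}-\d{2}"),
    "integer": re.compile(r"[+-]?\d+"),
}

# Hypothetical data dictionary: known values per semantic category (assumption).
DICTIONARY = {
    "city": {"paris", "tunis", "lyon"},
    "country": {"france", "tunisia"},
}


def categorize(value):
    """Return the first dictionary or regex category that the value fits."""
    text = value.strip()
    for name, words in DICTIONARY.items():
        if text.lower() in words:
            return name
    for name, pattern in PATTERNS.items():
        if pattern.fullmatch(text):
            return name
    return "unknown"


def profile_column(values):
    """Propose a dominant category for a column and list the outlier values."""
    if not values:
        return "unknown", []
    counts = Counter(categorize(v) for v in values)
    dominant, _ = counts.most_common(1)[0]
    issues = [v for v in values if categorize(v) != dominant]
    return dominant, issues
```

Applying profile_column to each column of a source yields one proposed category per column, which together form a candidate schema. A real profiler would likely rely on much larger dictionaries, ontologies, and thresholds on the dominant category's share, none of which are modeled here.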