Clustering Categorical Data Based on Within-Cluster Relative Mean Difference

HTML  XML Download Download as PDF (Size: 575KB)  PP. 173-181  
DOI: 10.4236/ojs.2017.72013    1,628 Downloads   3,458 Views  Citations
Author(s)

ABSTRACT

The clustering on categorical variables has received intensive attention. In dataset with categorical features, some features show the superior performance on clustering procedure. In this paper, we propose a simple method to find such distinctive features by comparing pooled within-cluster mean relative difference and then partition the data upon such features and give subspace of the subgroups. The applications on zoo data and soybean data illustrate the performance of the proposed method.

Share and Cite:

Su, J. and Su, C. (2017) Clustering Categorical Data Based on Within-Cluster Relative Mean Difference. Open Journal of Statistics, 7, 173-181. doi: 10.4236/ojs.2017.72013.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.