Improvement the Accuracy of Six Applied Classification Algorithms through Integrated Supervised and Unsupervised Learning Approach ()
Sharareh R. Niakan Kalhori,
Xiao-Jun Zeng
1Department of Public Health, School of Health, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
2Social Determinants of Health Research Center, School of Health, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran.
Department of Machine Learning and Optimization, School of Computer Science, The University of
Manchester, Manchester, UK.
DOI: 10.4236/jcc.2014.24027
PDF
HTML
3,314
Downloads
5,011
Views
Citations
Abstract
We have presented an integrated approach
based on supervised and unsupervised learning tech- nique to improve the
accuracy of six predictive models. They are developed to predict outcome of
tuberculosis treatment course and their accuracy needs to be improved as they
are not precise as much as necessary. The integrated supervised and unsupervised
learning method (ISULM) has been proposed as a new way to improve model
accuracy. The dataset of 6450 Iranian TB patients under DOTS therapy was
applied to initially select the significant predictors and then develop six predictive
models using decision tree, Bayesian network, logistic regression, multilayer
perceptron, radial basis function, and support vector machine algorithms.
Developed models have integrated with k-mean clustering analysis to calculate
more accurate predicted outcome of tuberculosis treatment course. Obtained
results, then, have been evaluated to compare prediction accuracy before and
after ISULM application. Recall, Precision, F-measure, and ROC area are other
criteria used to assess the models validity as well as change percentage to
show how different are models before and after ISULM. ISULM led to improve the
prediction accuracy for all applied classifiers ranging between 4% and 10%.
The most and least improvement for prediction accuracy were shown by logistic
regression and support vector machine respectively. Pre-learning by k- mean
clustering to relocate the objects and put similar cases in the same group can
improve the classification accuracy in the process of integrating supervised
and unsupervised learning.
Share and Cite:
Kalhori, S. and Zeng, X. (2014) Improvement the Accuracy of Six Applied Classification Algorithms through Integrated Supervised and Unsupervised Learning Approach.
Journal of Computer and Communications,
2, 201-209. doi:
10.4236/jcc.2014.24027.
Conflicts of Interest
The authors declare no conflicts of interest.
References
[1]
|
Kotsiantis, S.B. (2007) Supervised Machine Learning: A Review Of Classification Techniques. Informatica, 31, 249- 268.
|
[2]
|
Niakan Kalhori, R.S. (2013) Evaluation and Comparison of Different Machine Learning Methods to Predict Outcome of Tuberculosis Treatment Course. Journal of Intelligent Learning Systems and Applications, 5, 10 p.
|
[3]
|
Alpaydin, E. (2004) Introduction to Machine Learning. 1th Edition, The MIT Press, Cambridge.
|
[4]
|
Niakan Kalhori, R.S. (2011) Integrated Supervised and Unsupervised Learning Method to Predict the Outcome of Tuberculosis Treatment Course. Ph.D. Thesis, The University of Manchester.
|
[5]
|
Pao, Y. and Sobajic, D.J. (1992) Combined Use of Unsupervised and Supervised Learning for Dynamic Security Assessment. Transactions on Power Systems, 7, 878-884.
|
[6]
|
?muc, T., Gamberger, D. and Krstacic, G. (2001) Combining Unsupervised and Supervised Machine Learning in Analysis of the CHD Patient Database. The American Invitational Mathematics Examination, 109-112.
|
[7]
|
Boudour, M. and Hellal, A. (2005) Combined Use of Supervised and Unsupervised Learning for Power System Dynamic Security Mapping. Engineering Applications of Artificial Intelligence, 18, 673-683.
|
[8]
|
Han, J. and Kamber, M. (2006) Data Mining: Concepts and Techniques. 2nd Edition, Morgan Kaufman Publishers, USA.
|