Explanation vs Performance in Data Mining: A Case Study with Predicting Runaway Projects - Journal of Software Engineering and Applications

JSEA > Vol.2 No.4, November 2009

Explanation vs Performance in Data Mining: A Case Study with Predicting Runaway Projects ()

HTML

Download as PDF (Size: 466KB) PP. 221-236

DOI: 10.4236/jsea.2009.24030 6,116 Downloads 10,993 Views Citations

Author(s)

Tim MENZIES, Osamu MIZUNO, Yasunari TAKAGI, Tohru KIKUNO

Affiliation(s)

ABSTRACT

Often, the explanatory power of a learned model must be traded off against model performance. In the case of predict-ing runaway software projects, we show that the twin goals of high performance and good explanatory power are achievable after applying a variety of data mining techniques (discrimination, feature subset selection, rule covering algorithms). This result is a new high water mark in predicting runaway projects. Measured in terms of precision, this new model is as good as can be expected for our data. Other methods might out-perform our result (e.g. by generating a smaller, more explainable model) but no other method could out-perform the precision of our learned model.

KEYWORDS

Explanation, Data Mining, Runaway

Share and Cite:

T. MENZIES, O. MIZUNO, Y. TAKAGI and T. KIKUNO, "Explanation vs Performance in Data Mining: A Case Study with Predicting Runaway Projects," Journal of Software Engineering and Applications, Vol. 2 No. 4, 2009, pp. 221-236. doi: 10.4236/jsea.2009.24030.

Journals Menu

Follow SCIRP

	+1 323-425-8868
	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals Menu

Home

About SCIRP

Service

Policies