Comparison of Model Performance for Basic and Advanced Modeling Approaches to Crime Prediction

HTML  XML Download Download as PDF (Size: 744KB)  PP. 123-132  
DOI: 10.4236/iim.2018.106011    999 Downloads   2,091 Views  Citations
Author(s)

ABSTRACT

A good machine learning model would greatly contribute to an accurate crime prediction. Thus, researchers select advanced models more frequently than basic models. To find out whether advanced models have a prominent advantage, this study focuses shift from obtaining crime prediction to on comparing model performance between these two types of models on crime prediction. In this study, we aimed to predict burglary occurrence in Los Angeles City, and compared a basic model just using prior year burglary occurrence with advanced models including linear regressor and random forest regressor. In addition, American Community Survey data was used to provide neighborhood level socio-economic features. After finishing data preprocessing steps that regularize the dataset, recursive feature elimination was utilized to determine the final features and the parameters of the two advanced models. Finally, to find out the best fit model, three metrics were used to evaluate model performance: R squared, adjusted R squared and mean squared error. The results indicate that linear regressor is the most suitable model among three models applied in the study with a slightly smaller mean squared error than that of basic model, whereas random forest model performed worse than the basic model. With a much more complex learning steps, advanced models did not show prominent advantages, and further research to extend the current study were discussed.

Share and Cite:

Zhu, Y. (2018) Comparison of Model Performance for Basic and Advanced Modeling Approaches to Crime Prediction. Intelligent Information Management, 10, 123-132. doi: 10.4236/iim.2018.106011.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.