论文标题
城市中心凶杀案的预测:一种机器学习方法
Prediction of Homicides in Urban Centers: A Machine Learning Approach
论文作者
论文摘要
相关研究在计算社区中得到了强调,以开发能够预测犯罪发生的机器学习模型,分析犯罪的环境,提取与犯罪相关的个人的概况以及随着时间的推移分析犯罪。但是,当前文献中通常没有发现能够预测特定犯罪的模型,例如杀人罪。这项研究提出了一种机器学习模型,以预测凶杀犯罪,该数据集使用基于34种不同类型犯罪的事件报告记录以及犯罪报告中的时间和空间数据,使用通用数据(无研究位置依赖性)的数据集。在实验上,使用了来自巴西的贝莱姆市的数据。这些数据进行了转换以使问题通用,从而使该模型复制到其他位置。在研究中,在创建的数据集上使用简单且可靠的算法进行了分析。这样,用11种不同的分类方法进行了统计检验,结果与其他注册犯罪发生后的月份发生了预测和凶杀犯罪犯罪的发生,并使用随机森林使用随机森林。结果被认为是提出问题的基线。
Relevant research has been highlighted in the computing community to develop machine learning models capable of predicting the occurrence of crimes, analyzing contexts of crimes, extracting profiles of individuals linked to crime, and analyzing crimes over time. However, models capable of predicting specific crimes, such as homicide, are not commonly found in the current literature. This research presents a machine learning model to predict homicide crimes, using a dataset that uses generic data (without study location dependencies) based on incident report records for 34 different types of crimes, along with time and space data from crime reports. Experimentally, data from the city of Belém - Pará, Brazil was used. These data were transformed to make the problem generic, enabling the replication of this model to other locations. In the research, analyses were performed with simple and robust algorithms on the created dataset. With this, statistical tests were performed with 11 different classification methods and the results are related to the prediction's occurrence and non-occurrence of homicide crimes in the month subsequent to the occurrence of other registered crimes, with 76% assertiveness for both classes of the problem, using Random Forest. Results are considered as a baseline for the proposed problem.