CN111062000B - Criminal perpetrator criminal land identification method based on discrete selection model - Google Patents

Criminal perpetrator criminal land identification method based on discrete selection model Download PDF

Info

Publication number
CN111062000B
CN111062000B CN201911269327.8A CN201911269327A CN111062000B CN 111062000 B CN111062000 B CN 111062000B CN 201911269327 A CN201911269327 A CN 201911269327A CN 111062000 B CN111062000 B CN 111062000B
Authority
CN
China
Prior art keywords
community
criminal
data
perpetrator
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911269327.8A
Other languages
Chinese (zh)
Other versions
CN111062000A (en
Inventor
龙冬平
徐铭恩
徐冲
肖露子
宋广文
陈建国
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou University
Original Assignee
Guangzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou University filed Critical Guangzhou University
Priority to CN201911269327.8A priority Critical patent/CN111062000B/en
Publication of CN111062000A publication Critical patent/CN111062000A/en
Application granted granted Critical
Publication of CN111062000B publication Critical patent/CN111062000B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/18Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Abstract

The invention discloses a criminal land identification method based on a discrete selection model, which is mainly applied to the technical fields of public safety and crime geography. The method integrates multi-source space-time data such as police condition data, capturing data, POI data, mobile phone signaling data and the like, and based on the principle of discrete selection model, increases the data such as crowd flowing environment, crime prevention and control environment and the like, and optimizes the accuracy of a quantized model of criminal places. Compared with the basic model, the fitting precision of the full model to criminal selection is improved by 8.63%. The method also calculates the expected effect of the criminals on the perpetrated community and the probability of the criminals selecting the criminals through the effect function and the probability function, thereby accurately identifying the preference of the criminals for the criminals selecting. By adopting the embodiment provided by the invention, the criminal can be accurately identified in a criminal manner, the identification effectiveness and accuracy are improved, and the invention has an important reference function on police prevention and control.

Description

Criminal perpetrator criminal land identification method based on discrete selection model
Technical Field
The invention relates to the technical field of public safety and crime geography, in particular to a criminal plagued identification method based on a discrete selection model.
Background
How to create a public safe, stable, high-quality and high-efficiency social environment is always a main challenge of comprehensive treatment of public security in China. In the current urban process of China, how to absorb a great deal of past experience and training, analyze the selection behavior of criminals by using big data, find out rules or trends from the rules or trends, and perform crime prevention and control so as to solve the conventional thinking of crime problems is a focus of attention of various social circles.
Discrete spatial selection models (Discrete Spatial Choice Modling) are widely used to analyze selection behavior in micro-economics. It is based on the theory of stochastic utility, assuming that a decision maker must make a selection for a discrete set of choices. At the same time, the decision maker evaluates the relative utility of each alternative. Considering that the perpetrated selection is similar to the above selection, bernasco and Nieuwbeerta introduced the method into the crime study field and validated the perpetrated selection of burglars in the sea area of the Netherlands. Subsequently, clare, baudeins, johnson, bernasco et al, have studied a series of crime types using this method.
However, in the whole, research about criminal selection at home and abroad still has partial shortages, such as lack of consideration of the influence of crowd flowing environment and community crime prevention and control in terms of verification analysis of influence factors, resulting in poor fitting degree of the model to criminal selection. The choice of preferences in how to optimize the model, improve the fitting accuracy, and effectively identify criminals is a challenge.
Disclosure of Invention
The embodiment of the invention aims to provide a criminal land identification method based on a discrete selection model, which can realize accurate criminal land selection identification, improves the identification effectiveness and accuracy and has an important reference effect on police prevention and control.
In order to achieve the above object, the embodiment of the present invention provides a criminal plausible identification method based on a discrete selection model, including the steps of:
acquiring multi-source space-time data, and carrying out integrated processing on the multi-source space-time data to generate a community attribute set of each community;
obtaining a coding result corresponding to the address text of the area to be identified, optimizing the coding result through a multi-source geocode classification optimization model based on rules and clusters, and generating coding information corresponding to optimized address text data to construct a criminal perpetrator data set;
matching according to the perpetrator place data set and the community attribute sets of all communities to obtain a first community attribute set of the perpetrator place, and fusing the first community attribute set and the perpetrator place data set to generate a model sample;
obtaining a variable meeting the requirements by performing a co-linear diagnosis process on the model sample, fitting the criminal with a discrete selection model to perpetrate selection preferences, obtaining an expected effect selected by the perpetrator, and identifying a community in which the perpetrator is working to obtain the maximum expected effect;
fitting the result obtained by the perpetrator according to a conditional logic model and the discrete selection model, constructing a perpetrator selection probability function, and obtaining the probability of perpetrator selection according to the perpetrator selection probability function.
Further, the multi-source spatiotemporal data comprises original public security department data, mobile phone signaling data and census data.
Further, the integrated processing of the multi-source spatio-temporal data specifically includes:
according to the mobile phone signaling data, calculating to obtain the population of the base station of each community;
generating Thiessen polygons corresponding to all communities according to the population numbers of the base stations of all communities, and calculating the complete Thiessen polygon areas of all communities;
cutting the corresponding Thiessen polygons through shp files of all communities to obtain fragmented Thiessen polygons corresponding to all communities;
calculating to obtain the crowd flow density corresponding to each community according to the crowd flow density formula, the population of the base station of each community, the complete Thiessen polygon area of each community and the fragmented Thiessen polygons corresponding to each community;
and calculating the socioeconomic heterogeneity of each community according to a socioeconomic heterogeneity formula and the number of different socioeconomic groups in the census data.
Further, the crowd flow density formula is:
Figure BDA0002313730990000031
wherein D is i The crowd flow density of the ith community is that n is the number of Thiessen polygons related to the ith community, P k Is the kth Thiessen polygon population, S ji For the area of fragment j in the ith community, S ki S is the total area of the kth Thiessen polygon associated with the ith community i Is the area of the ith community.
Further, the socioeconomic heterogeneity formula is:
Figure BDA0002313730990000032
wherein n is the number of different socioeconomic groups, P ki SE for the proportion of the kth socioeconomic group living in the ith community i The larger the number of (c) indicates the more heterogeneous the community population.
Further, the method includes the steps of obtaining a variable meeting requirements by performing co-linearity diagnosis on the model sample, fitting the criminal perpetrator selection preference through a discrete selection model, obtaining an expected effect selected by the criminal perpetrator, and identifying a community in which the criminal can obtain the maximum expected effect by performing the criminal perpetrator, wherein the community comprises the following specific steps:
screening variables affecting the scheme selection from the model sample, and performing co-linearity diagnosis on the variables through a variance expansion formula to obtain variables with variance expansion less than 10; the variables comprise build environment factors, social environment factors, crowd flow environment factors and crime prevention and control factor variables;
and calculating the expected effect of the criminals on the selection of each community through an expected effect calculation formula to obtain the community with the maximum expected effect of criminals executing the criminal.
Further, the variance expansion formula is:
Figure BDA0002313730990000041
wherein R is i As variable x i The negative correlation coefficient of regression analysis was performed on the remaining variables, VIF being the coefficient of variance expansion.
Further, the expected effect calculation formula is:
U ij =βx ijij
wherein U is ij Selecting for an ith perpetrator an expected effect of crime to be performed in a jth community, x ij For the values of the explanatory variables related to the ith perpetrator and the jth community, β is the estimated coefficient of the explanatory variable, ε ij Is the random error of the model.
Further, the contemplatively selected probability function is a Prob probability function.
Further, the calculation formula of the Prob probability function is as follows:
Figure BDA0002313730990000042
wherein Y is i Is the choice for the ith perpetrator, x ij Beta is the estimated coefficient of the explanatory variable, which is the value of the explanatory variable related to the i-th perpetrator and the j-th community.
The embodiment of the invention has the following beneficial effects:
compared with the prior art, the criminal land identification method based on the discrete selection model provided by the embodiment of the invention generates a community attribute set of each community by acquiring multi-source space-time data and carrying out integrated processing on the multi-source space-time data; then obtaining a coding result corresponding to the address text of the area to be identified, optimizing the coding result through a multi-source geocode classification optimization model based on rules and clusters, and generating coding information corresponding to optimized address text data to construct a criminal perpetrator data set; matching according to the perpetrator data set and the community attribute sets of all communities to obtain a first community attribute set of the perpetrator, and fusing the first community attribute set and the perpetrator data set to generate a model sample; then, the model sample is subjected to co-linear diagnosis processing to obtain a variable which meets the requirements, the criminal is fitted to the criminal by a discrete selection model to select preference in a criminal manner, obtaining an expected effect selected by the perpetrator, and identifying a community in which the perpetrator is working to obtain the maximum expected effect; and finally, fitting the result obtained by the criminal according to a conditional logic model and the discrete selection model, constructing a criminal selection probability function, and obtaining the probability of criminal selection according to the criminal selection probability function.
According to the embodiment provided by the invention, the multi-source space-time data such as police condition data, capturing data, POI data and mobile phone signaling data are fused, and based on the principle of discrete selection model, the quantitative model of criminal perpetrators is optimized by increasing the data such as crowd flowing environment, crime prevention and control environment and the like. Compared with the basic model, the fitting precision of the full model to criminal selection is improved by 8.63%.
The embodiment provided by the invention also calculates the expected effect of the criminals on the perpetrator community and the probability of criminal perpetrator selection through the effect function and the probability function, therefore, the preference of criminals for criminal selection is accurately identified, the criminals can be accurately identified, the identification effectiveness and accuracy are improved, and the criminal identification method has an important reference function for police prevention and control.
Drawings
FIG. 1 is a flow chart of a criminal plausible identification method based on a discrete selection model provided by an embodiment of the invention;
FIG. 2 is a flow chart of a criminal identification method based on a discrete selection model according to another embodiment of the present invention;
FIG. 3 is a flowchart of a multi-source geocoding classification optimization model based on rules and clusters provided by an embodiment of the invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Referring to fig. 1, fig. 1 is a schematic flow chart of an embodiment of a criminal identification method based on a discrete selection model provided in the present invention; the embodiment of the invention provides a criminal plausible identification method based on a discrete selection model, which comprises the following steps of S1 to S5;
s1, multi-source space-time data are obtained, integrated processing is carried out on the multi-source space-time data, and a community attribute set of each community is generated.
In the embodiment of the invention, the multi-source space-time data comprises original police department data (police condition data and capturing data), built environment data, mobile phone signaling data, population census data and other data, and the arcGIS software is utilized to effectively integrate the multi-source heterogeneous big data so as to generate a community attribute set of each community.
The public security department data comprises case information data and criminal individual information data, and the case information data and the criminal individual information data respectively comprise information such as case reporting time, address text, case records, case issuing addresses, living places, case issuing time, capturing time, robbery objects, robbery places, crime tools, robbery modes, robbery articles and the like, and information such as names, nationalities, sexes, native, birth dates, occupation, drug absorption and the like.
The step S3 specifically comprises the following steps: according to the mobile phone signaling data, calculating to obtain the population of the base station of each community; generating Thiessen polygons corresponding to all communities according to the population numbers of the base stations of all communities, and calculating the complete Thiessen polygon areas of all communities; cutting the corresponding Thiessen polygons through shp files of all communities to obtain fragmented Thiessen polygons corresponding to all communities; calculating to obtain the crowd flow density corresponding to each community according to the crowd flow density formula, the population of the base station of each community, the complete Thiessen polygon area of each community and the fragmented Thiessen polygons corresponding to each community; and calculating the socioeconomic heterogeneity of each community according to a socioeconomic heterogeneity formula and the number of different socioeconomic groups in the census data.
In one embodiment of the present invention, the crowd flow density is measured using the mobile phone signaling data and used as an index of crowd flow environment. Firstly generating Thiessen polygons according to population numbers of base stations in different time periods, then calculating the whole Thiessen polygon area, then cutting the Thiessen polygons by shp files of communities to obtain fragmented Thiessen polygons and communities to which each fragment belongs, calculating the fragment area, and finally dividing the fragment area by the community area to obtain the crowd flow density. The calculation formula is as follows:
Figure BDA0002313730990000071
wherein D is i The crowd flow density of the ith community is that n is the number of Thiessen polygons related to the ith community, P k Is the kth Thiessen polygon population, S ji For the area of fragment j in the ith community, S ki S is the total area of the kth Thiessen polygon associated with the ith community i Is the area of the ith community.
And then, describing socioeconomic heterogeneity by using the housing property difference in the census data, wherein the calculation formula is as follows:
Figure BDA0002313730990000072
wherein n is the number of different socioeconomic groups, P ki For livingProportion of kth socioeconomic group in ith community, SE i The larger the number of (c) indicates the more heterogeneous the community population.
S2, obtaining a coding result corresponding to the address text of the area to be identified, optimizing the coding result through a multisource geocode classification optimization model based on rules and clusters, and generating coding information corresponding to optimized address text data to construct a criminal place data set.
And S3, matching the perpetrator place data set with the community attribute sets of all communities to obtain a first community attribute set of the perpetrator place, and fusing the first community attribute set with the perpetrator place data set to generate a model sample.
S4, performing co-linear diagnosis processing on the model sample to obtain a variable meeting the requirement, fitting the criminal perpetrator selection preference through a discrete selection model to obtain an expected effect selected by the criminal perpetrator, and identifying a community in which the criminal perpetrator can obtain the maximum expected effect.
The step S4 specifically comprises the following steps: screening variables affecting the scheme selection from the model sample, and performing co-linearity diagnosis on the variables through a variance expansion formula to obtain variables with variance expansion less than 10; the variables comprise build environment factors, social environment factors, crowd flow environment factors and crime prevention and control factor variables; and calculating the expected effect of the criminals on the selection of each community through an expected effect calculation formula to obtain the community with the maximum expected effect of criminals executing the criminal.
In another embodiment of the invention, independent variables affecting the contemplation are screened and co-linearity diagnostics performed. Independent variables include built-up environment, social environment, crowd flow environment and crime prevention and control factors. The co-linearity diagnosis uses the coefficient of expansion of variance, calculated as follows:
Figure BDA0002313730990000081
wherein R is i As variable x i The negative correlation coefficient of the regression analysis was performed on the remaining variables, VIF was the coefficient of variance expansion, and was required to be as small as possible at VIF < 10.
And then according to the independent variables obtained by screening in the steps, calculating the expected effect of the community selected by the criminal in a criminal way based on a discrete selection model principle through an effect function to obtain the community with the maximum expected effect of the criminal in the criminal way, wherein the calculation formula is as follows:
U ij =βx ijij
wherein U is ij Selecting for an ith perpetrator an expected effect of crime to be performed in a jth community, x ij For the values of the explanatory variables related to the ith perpetrator and the jth community, β is the estimated coefficient of the explanatory variable, ε ij Is the random error of the model.
And S5, fitting the result obtained by the criminal according to the conditional logic model and the discrete selection model, constructing a criminal selection probability function, and obtaining the criminal selection probability according to the criminal selection probability function.
In this embodiment, the scenario selects a probability function as a Prob probability function, and a calculation formula of the Prob probability function is:
Figure BDA0002313730990000082
wherein Y is i Is the choice for the ith perpetrator, x ij Beta is the estimated coefficient of the explanatory variable, which is the value of the explanatory variable related to the i-th perpetrator and the j-th community.
In order to better illustrate the principles of the method of the present invention, the following are specific examples of the method provided by the present invention:
referring to fig. 2, first, data information such as ZG city public security department data, built environment data, mobile phone signaling data, census data and the like is obtained, classified ZG city crime data, built environment data, social environment data, crowd flowing data, crime prevention and control data and the like are calculated through a crowd flowing density formula, a socioeconomic heterogeneity formula and the like, and multisource heterogeneous big data is effectively integrated by ArcGIS software to generate attribute sets of communities in areas needing to be identified. The data used include, but are not limited to: the data provided by the public security department of ZG city is of two types, the first type is the police condition data of 2012-2016 ZG street robbery. The data has 85,898 police records, and records the case information of street robbery, such as robbery time and robbery address, robbery objects, lost property and the like. The second category is the capture data of 2012-2016 ZG urban head robbers, which is recorded in 14,863 pieces. The data records detailed individual information and case information of the street robbery. Individual information such as name, sex, ethnicity, address of household, date of birth, cultural degree, and whether to take medicine or not of the hijack; case information such as robbery time, robbery address and residence at the time of robbery, and crime object, crime place, crime tool, crime measure, and lost property or money. The ZG city building environment data comprise 2014 POI data (point coordinates), 2014 traffic network vector map data, 2015 bus stop data (point coordinates) grabbed based on hundred-degree API and the like. The mobile phone signaling data is the mobile phone signaling data of 2G and 3G in ZG city of 2016 (day 12 5 month-day 18), and the data is derived from a certain communication company in ZG city of China. The population census data is population census data of the sixth 2010 of ZG city, and the data is based on communities, and is mainly used for counting social attributes such as community general population, community foreign population, community juvenile population, community people average housing area, community housing property and the like.
And then obtaining the coding result of the address text by using a Geocoding API, preliminarily judging the credibility of the coding result by using a simple rule, and cleaning and optimizing the coding result by using a multi-source Geocoding classification optimizing model based on the rule and the cluster to obtain coding information corresponding to the optimized address text data to form a composition data set.
According to the principle of fig. 3, by utilizing the Stata software dataset intersection function, the community attribute set to which the crime perpetrator belongs is matched, and the community attribute set and the crime perpetrator dataset are effectively fused, so that a sample is constructed. The ZG urban head robber selects 1 community of 1,971 communities to act as a table, the selected community is marked as "1", otherwise, the unselected communities are all marked as "0". Each community selects 19 attributes, 1,971 lines are 27,949 plots in one-to-one correspondence, and cross coding is adopted in Stata software, so that 55,087,479 samples are constructed.
And (3) utilizing the collinearity diagnosis to screen variables conforming to VIF < 10, fitting criminal perpetrator selection preference based on a discrete selection model, and obtaining the expected effect that the ith criminal selects the jth community as the perpetrator.
And obtaining a Prob probability function according to the conditional logic model form, and calculating the probability of selecting a criminal to act in the jth community.
The method simulates a Full Model (FM) selected by criminals in a criminal case, and performs fitting degree test comparison of a pseudo R party with a basic Model (NM) and a combined Model1 and a combined Model2 (CM 1 and CM 2) which are respectively integrated with only a variable of a people stream environment dimension and a variable of a crime prevention and control dimension on the basic Model. The fitness test requires that the pseudo-R-square be greater than 0.20, otherwise it does not have a perfect fitness. The degree of fit effect of each model based on the discrete selection model is compared with that of the basic model, as shown in the following table:
Figure BDA0002313730990000101
in summary, the criminal plagued identification method based on the discrete selection model provided by the embodiment of the invention has the following beneficial effects compared with the prior art:
(1) The quantitative model selected by criminals is optimized by fusing multi-source space-time data such as police condition data, capturing data, POI data, mobile phone signaling data and the like, increasing data such as crowd flowing environment, crime prevention and control environment and the like. Compared with the basic model, the fitting precision of the full model for criminal perpetration selection is improved by 8.63%, and the full model plays an important role in improving the fitting precision for criminal perpetration selection.
(2) Based on the discrete selection model, the expected benefit and probability of criminal selecting the criminal community are calculated by using the effect function and the probability function, so that the criminal selecting is accurately identified, the effectiveness and the accuracy are improved, and the method has an important reference effect on police control.
Those skilled in the art will appreciate that implementing all or part of the above-described methods in accordance with the embodiments may be accomplished by way of a computer program stored on a computer readable storage medium, which when executed may comprise the steps of the embodiments of the methods described above. The storage medium may be a magnetic disk, an optical disk, a Read-Only Memory (ROM), a random access Memory (Random Access Memory, RAM), or the like.
While the foregoing is directed to the preferred embodiments of the present invention, it will be appreciated by those skilled in the art that changes and modifications may be made without departing from the principles of the invention, such changes and modifications are also intended to be within the scope of the invention.

Claims (6)

1. A criminal plausible recognition method based on a discrete selection model, comprising the steps of:
acquiring multi-source space-time data and carrying out integrated processing on the multi-source space-time data;
according to the mobile phone signaling data, calculating to obtain the population of the base station of each community;
generating Thiessen polygons corresponding to all communities according to the population numbers of the base stations of all communities, and calculating the complete Thiessen polygon areas of all communities;
cutting the corresponding Thiessen polygons through shp files of all communities to obtain fragmented Thiessen polygons corresponding to all communities;
calculating to obtain the crowd flow density corresponding to each community according to the crowd flow density formula, the population of the base station of each community, the complete Thiessen polygon area of each community and the fragmented Thiessen polygons corresponding to each community;
generating a community attribute set of each community according to a social economic heterogeneity formula and the number of different social economic groups in the census data; wherein, the liquid crystal display device comprises a liquid crystal display device,
the crowd flow density formula is:
Figure FDA0004130187190000011
wherein D is i The crowd flow density of the ith community is that n is the number of Thiessen polygons related to the ith community, P k Is the kth Thiessen polygon population, S ji For the area of fragment j in the ith community, S ki S is the total area of the kth Thiessen polygon associated with the ith community i Is the area of the ith community;
the socioeconomic heterogeneity formula is:
Figure FDA0004130187190000021
wherein n is the number of different socioeconomic groups, P ki SE for the proportion of the kth socioeconomic group living in the ith community i The larger the number of (a) indicates the more heterogeneous the community population;
obtaining a coding result corresponding to the address text of the area to be identified, optimizing the coding result through a multi-source geocode classification optimization model based on rules and clusters, and generating coding information corresponding to optimized address text data to construct a criminal investigation region data set;
matching according to the perpetrator place data set and the community attribute sets of all communities to obtain a first community attribute set of the perpetrator place, and fusing the first community attribute set and the perpetrator place data set to generate a model sample;
screening variables affecting the scheme selection from the model sample, and performing co-linearity diagnosis on the variables through a variance expansion formula to obtain variables with variance expansion less than 10; the variables comprise build environment factors, social environment factors, crowd flow environment factors and crime prevention and control factor variables;
fitting the criminal place selection preference through a discrete selection model, and calculating the expected effect of the criminal on each community selection through an expected effect calculation formula to obtain a community in which the criminal performs criminal work to obtain the maximum expected effect;
fitting the result obtained by the perpetrator according to a conditional logic model and the discrete selection model, constructing a perpetrator selection probability function, and obtaining the probability of perpetrator selection according to the perpetrator selection probability function.
2. The discrete selection model based criminal plague recognition method as in claim 1, wherein the multi-source spatiotemporal data comprises raw police department data, the cell phone signaling data, the census data.
3. The method for perpetrator identification based on discrete selection model as set forth in claim 1, wherein said variance expansion formula is:
Figure FDA0004130187190000031
wherein R is i As variable x i The negative correlation coefficient of regression analysis was performed on the remaining variables, VIF being the coefficient of variance expansion.
4. The method for perpetrator identification based on discrete selection model as set forth in claim 1, wherein the expected effect calculation formula is:
U ij =βx ijij
wherein U is ij Selecting for an ith perpetrator an expected effect of crime to be performed in a jth community, x ij To and from the ith perpetratorThe value of the explanatory variable associated with the j-th community, beta being the estimated coefficient of the explanatory variable, epsilon ij Is the random error of the model.
5. The method of claim 1, wherein the perpetrator crime identification based on discrete selection model is a Prob probability function.
6. The method for perpetrator crime identification based on discrete selection model as set forth in claim 5, wherein the calculation formula of the Prob probability function is:
Figure FDA0004130187190000032
wherein Y is i Is the choice for the ith perpetrator, x ij Beta is the estimated coefficient of the explanatory variable, which is the value of the explanatory variable related to the i-th perpetrator and the j-th community.
CN201911269327.8A 2019-12-11 2019-12-11 Criminal perpetrator criminal land identification method based on discrete selection model Active CN111062000B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911269327.8A CN111062000B (en) 2019-12-11 2019-12-11 Criminal perpetrator criminal land identification method based on discrete selection model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911269327.8A CN111062000B (en) 2019-12-11 2019-12-11 Criminal perpetrator criminal land identification method based on discrete selection model

Publications (2)

Publication Number Publication Date
CN111062000A CN111062000A (en) 2020-04-24
CN111062000B true CN111062000B (en) 2023-06-27

Family

ID=70298863

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911269327.8A Active CN111062000B (en) 2019-12-11 2019-12-11 Criminal perpetrator criminal land identification method based on discrete selection model

Country Status (1)

Country Link
CN (1) CN111062000B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107180015A (en) * 2017-05-16 2017-09-19 安徽工业大学 A kind of serial crime case geography target Forecasting Methodology
CN110472775A (en) * 2019-07-26 2019-11-19 广州大学 A kind of series case suspect's foothold prediction technique

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107180015A (en) * 2017-05-16 2017-09-19 安徽工业大学 A kind of serial crime case geography target Forecasting Methodology
CN110472775A (en) * 2019-07-26 2019-11-19 广州大学 A kind of series case suspect's foothold prediction technique

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Dongping Long 等.Assessing the Influence of Prior on Subsequent Street Robbery Location Choices: A Case Study in ZG City, China.《Sustainability》.2018,第1-17页. *
肖露子 等.ZG市工作日地铁站点扒窃案件的时空分布及其影响因素.《地理科学》.2018,第38卷(第8期),第1227-1234页. *

Also Published As

Publication number Publication date
CN111062000A (en) 2020-04-24

Similar Documents

Publication Publication Date Title
Ali et al. A data-driven approach for multi-scale GIS-based building energy modeling for analysis, planning and support decision making
Huang et al. Geographically and temporally weighted regression for modeling spatio-temporal variation in house prices
Biljecki A scientometric analysis of selected GIScience journals
WO2019200752A1 (en) Semantic understanding-based point of interest query method, device and computing apparatus
Ballesteros et al. Safe cities. A participatory sensing approach
CN105404890A (en) Criminal gang discrimination method considering locus space-time meaning
Li et al. A new urban typology model adapting data mining analytics to examine dominant trajectories of neighborhood change: a case of metro detroit
US20130238643A1 (en) Life arcs as an entity resolution feature
Huang et al. Research on urban modern architectural art based on artificial intelligence and GIS image recognition system
He et al. Persistence of crime hot spots: an ordered probit analysis
Li et al. Spatio-temporal pattern analysis and prediction for urban crime
Morshed et al. Viscrimepredict: a system for crime trajectory prediction and visualisation from heterogeneous data sources
Winke Being close to ethnic peers: the formation of ethnic residential clustering from a household perspective
Benito et al. Efficiency in the Provision of Public Municipal Cultural Facilities.
Ramakers et al. Escaping the family tradition: A multi-generation study of occupational status and criminal behaviour
CN111062000B (en) Criminal perpetrator criminal land identification method based on discrete selection model
Fariñas et al. Multinational activity of European firms and heterogeneity
CN114912717B (en) Smart city guarantee housing application risk assessment method and system based on Internet of things
CN115809280A (en) Group house renting identification and iteration identification method
Ristea et al. Geosocial Media Data as Predictors in a GWR Application to Forecast Crime Hotspots (Short Paper)
CN111768031B (en) Method for predicting crowd gathering trend based on ARMA algorithm
Economou et al. Evaluation of the OECD’s prediction algorithm for the annual GDP growth rate
Lee et al. Determinants of crime incidence in Korea: a mixed GWR approach
CN110674981A (en) Alarm situation prediction method and prediction system based on layered ridge regression algorithm
CN111881303A (en) Graph network structure method for classifying urban heterogeneous nodes

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB03 Change of inventor or designer information
CB03 Change of inventor or designer information

Inventor after: Long Dongping

Inventor after: Xu Mingen

Inventor after: Xu Chong

Inventor after: Xiao Luzi

Inventor after: Song Guangwen

Inventor after: Chen Jianguo

Inventor before: Long Dongping

Inventor before: Liu Lin

Inventor before: Xu Mingen

Inventor before: Xu Chong

Inventor before: Xiao Luzi

Inventor before: Song Guangwen

Inventor before: Chen Jianguo

GR01 Patent grant
GR01 Patent grant