CN111324657A

CN111324657A - Emergency plan content optimization method and computer equipment

Info

Publication number: CN111324657A
Application number: CN202010088012.XA
Authority: CN
Inventors: 陈顺清; 彭进双; 包世泰; 林时君; 邓明亮; 江千腾; 曹兵; 魏琴; 龙杰; 柴理想
Original assignee: Augur Intelligence Technology Guangzhou Co ltd
Current assignee: Augur Intelligence Technology Guangzhou Co ltd
Priority date: 2020-02-12
Filing date: 2020-02-12
Publication date: 2020-06-23
Anticipated expiration: 2040-02-12
Also published as: CN111324657B

Abstract

The invention relates to the field of safety emergency management, in particular to an emergency plan content optimization method and computer equipment. The method comprises the following steps: collecting data and establishing a historical safety emergency plan database; processing and sorting the acquired data to form a pattern suitable for data analysis; and (3) data mining and analyzing, namely, setting a proper algorithm model, putting training set data into the algorithm model for training, and continuously optimizing the algorithm model to finally obtain a calculation result value with the minimum error with an actual result so as to realize the optimization of the emergency plan content.

Description

Emergency plan content optimization method and computer equipment

Technical Field

The invention relates to the field of safety emergency management, in particular to an emergency plan content optimization method and computer equipment based on historical case big data mining.

Background

The safety emergency plan is a scientific and effective plan and arrangement which is made in advance for specific equipment, facilities, places and environments, on the basis of safety evaluation, in order to reduce the loss of personnel, property and environment caused by accidents, on the basis of emergency rescue mechanisms and personnel after the accidents occur, emergency rescue equipment, facilities, conditions and environments, action steps and outlines, accident development control methods and programs and the like. The safety emergency plans comprise a comprehensive emergency plan for comprehensive treatment, a special emergency plan for special treatment, a field disposal plan for field treatment, a department emergency plan for disposal of a certain department, a unit emergency plan for emergency disposal of each unit and the like. The following four categories can be classified according to event types: natural disasters, accident disasters, public health events, social security events.

Along with the continuous accumulation of the historical data of the safety emergency plan, the application of the big data analysis technology to the optimization of the safety emergency plan becomes a brand-new direction, and the whole process from the prior to the subsequent of the safety emergency plan is optimized by establishing different algorithm models, wherein the process comprises the steps of making the content of the safety emergency plan, setting certain parameters in the plan, scheduling during event disposal, optimizing an evacuation disposal mode and the like. The safety emergency plan content optimization based on big data analysis provides reliable technical support for the establishment of a safety emergency plan system.

Disclosure of Invention

In order to solve the problems in the prior art, the invention provides an emergency plan content optimization method, which optimizes the content of an emergency plan on the basis of historical emergency case data by utilizing technologies such as big data analysis, data mining and the like, so that the content attribute setting of the safety emergency plan can better accord with the actual case condition, and the aim of intelligently setting the attribute parameters of the safety emergency plan is fulfilled.

The invention also proposes a computer device which, when executing an executable program, carries out the steps of the optimization method of the invention.

The optimization method is realized by adopting the following technical scheme: the emergency plan content optimization method comprises the following steps:

s1, collecting data, and establishing a historical safety emergency plan database;

s2, processing and sorting the collected data to form a pattern suitable for data analysis;

and S3, data mining and analyzing, putting training set data into the algorithm model for training by establishing a proper algorithm model, and finally obtaining a calculation result value with the minimum error with an actual result by continuously optimizing the algorithm model, so as to realize the optimization of the emergency plan content.

In a preferred embodiment, step S3 includes optimizing attribute parameters of the safety emergency plan based on a random forest algorithm:

analyzing historical emergency command case data, preprocessing the historical emergency command case data, extracting attribute parameter influence characteristics of a safety emergency plan, and dividing the attribute parameters into category attribute parameters for qualitatively analyzing the emergency plan and numerical attribute parameters for quantitatively determining the disposal mode of the emergency plan;

and selecting a random forest algorithm to establish a regression model and a classification model, predicting the attribute parameter values, continuously optimizing the regression model and the classification model according to the error between the prediction result of the attribute parameter values and the actual case parameter values, and finally enabling the category attribute parameter values and the numerical attribute parameter values predicted according to the regression model and the classification model to be closer to the actual case parameter values so as to realize the setting of the attribute parameters of the intelligent safety emergency plan.

In a preferred embodiment, step S3 is to repeatedly and randomly extract k samples from the original training sample set N to generate a new training sample set by using a random forest algorithm, and then generate k decision trees from the new training sample set to form a random forest, where the prediction result of the sample to be tested is determined by an average of the prediction values of the k decision trees.

The step S3 is based on the process of predicting the attribute parameter values by the random forest algorithm, and comprises the following steps:

s31, acquiring emergency case data in a historical safety emergency plan database, and constructing an initial sample data set;

s32, processing the data of the safety emergency plan;

s33, processing the characteristics of the emergency data, screening the characteristics of which the correlation is greater than a set threshold value, and combining the characteristics of which the correlation is strong;

s34, extracting the characteristics of the emergency data, randomly selecting j characteristics from the i characteristics for each training set, wherein j < < i, and constructing a single decision tree by using the randomly selected j characteristics;

s35, traversing each feature in each training set and the feature value corresponding to the feature value, selecting the optimal feature to segment the training sets, and segmenting each training set into a data space R1 and a data space R2;

s36, recursively calling the step S35 for the two segmented data spaces R1 and R2 until no optimal features can be used for segmenting the training set or the deepest depth of the tree is reached, and generating a single decision tree;

s37, repeating the steps S35 and S36 by using n training sets respectively to construct a single decision tree, and finally constructing a random forest consisting of n decision trees;

and S38, predicting the attribute parameter values by using the random forest.

In a preferred embodiment, step S3 includes optimizing the attribute structure of the safety emergency protocol based on a correlation algorithm:

analyzing the attribute structure of the safety emergency plan, establishing an association item set table of the contents and the attributes of the safety emergency plan, finding a frequent item set of the contents and the attributes of the safety emergency plan, and finding a strong association rule of the contents and the attributes of the safety emergency plan according to the set support degree and confidence; and adding the corresponding attributes into the safety emergency plan model by combining the found strong association rules, and continuously optimizing the attribute structure of the safety emergency plan model, so that the setting of the content attributes of the safety emergency plan can better accord with the actual case situation.

Preferably, the step S3 of optimizing the attribute structure of the safety emergency plan based on the association algorithm includes:

s31', acquiring a data set, analyzing the association model of the emergency resource scheduling class and the corresponding attribute in the safety emergency plan, and discovering the attribute of the emergency resource scheduling class; or the safety emergency plan is selected as a class, and the corresponding attribute is found.

S32', establishing a frequent item set, firstly setting a support degree and a confidence degree threshold value, wherein the support degree represents the times of the emergency resource scheduling class and a certain attribute appearing in a safety emergency plan at the same time, and the confidence degree represents the times of the emergency resource scheduling class and a certain attribute appearing in a safety emergency plan/the number of historical emergency plans;

s33', according to the confidence coefficient, extracting all strong association rules in the frequent item set established in the step S32' to find the strong association relation between the class and the attribute, thereby optimizing the attribute structure of a certain class of the safety emergency plan.

Wherein, the process of establishing the frequent item set in step S32' is as follows:

establishing an initial plan item set table, traversing all cases in the initial plan item set table, and counting each attribute to obtain a candidate attribute item set table; deleting the item set with the support degree smaller than a preset support degree threshold value in the candidate attribute item set table, reserving the item set with the support degree larger than or equal to the preset support degree threshold value to obtain a first frequent item set, and obtaining a second candidate attribute item set table by utilizing self-connection of the first frequent item set; deleting all the item sets with the support degrees smaller than a preset support degree threshold value in the second candidate attribute item set table to obtain a second frequent item set table; traversing the second frequent item set table, and self-connecting the second frequent item set table to obtain a third candidate attribute item set table; deleting the item set with the support degree smaller than a preset support degree threshold value in the third candidate attribute item set table to obtain a third complex item set table;

and repeating in a circulating mode, acquiring a new candidate attribute item set table by traversing the previous frequent item set table and connecting the previous frequent item set table, and deleting an item set with the support degree smaller than a preset support degree threshold value in the new candidate attribute item set table to obtain a new frequent item set table until the candidate attribute item set is empty, thereby obtaining all frequent item sets.

The computer equipment comprises a memory and a processor, wherein the memory is provided with an executable program, and the processor executes the executable program to realize the steps of the optimization method.

Compared with the prior art, the invention has the following beneficial effects: the method extracts general information of the emergency plan from the existing emergency plan case, wherein the general information comprises basic information of emergency plan category of emergency events, emergency plan level of emergency events, emergency plan organization mechanism of emergency events, emergency plan resource allocation scheme, emergency plan disposal process and the like, so that a historical safety emergency plan database is established; the emergency plan content optimization method based on big data analysis is provided by combining a large amount of historical emergency plan data, and comprises the safety emergency plan attribute parameter optimization based on a random forest algorithm and the safety emergency plan attribute structure optimization based on an associated algorithm, so that the safety emergency plan content attribute setting can better accord with the actual case situation, and the goal of intelligent safety emergency plan attribute parameter setting is realized.

Drawings

FIG. 1 is a flow chart of an emergency plan content optimization method of the present invention;

FIG. 2 is an illustration of a historical safety emergency plan database;

FIG. 3 is an exemplary graph of impact range prediction based on random forests.

Detailed Description

The following describes in detail an implementation of the present invention in conjunction with examples and drawings, but the present invention is not limited to these.

Examples

The embodiment provides an emergency plan content optimization method, which is used for mining and analyzing safety emergency plan data by using a big data analysis technology and a data mining technology, and mainly optimizes the content and partial attribute parameters of a safety emergency plan aiming at historical safety emergency plan data, wherein the safety emergency plan content optimization mainly comprises two parts, namely safety emergency plan attribute parameter optimization and safety emergency plan attribute structure optimization, and as shown in fig. 1, the method specifically comprises the following steps:

step 1, collecting data, and establishing a historical safety emergency plan database, namely a safety emergency plan historical database and a historical emergency command case database.

In the embodiment, a text form emergency plan is analyzed, and a content structure and attribute parameters of the text form emergency plan are extracted to form a primary historical safety emergency plan database; when the actual emergency type is the same as the safety emergency plan type, carrying out automatic emergency intelligent command according to the emergency plan designed by the safety emergency plan model, and simultaneously adding the emergency content into a historical safety emergency plan database; if the actual emergency type is different from the safety emergency plan type, adding the contents of each part of the actual emergency into a historical safety emergency plan database so as to complete the establishment of the historical safety emergency plan database in the step. The design use graph of the historical safety emergency plan database is shown in figure 2.

And 2, processing data. The data processing is to process and arrange the acquired data to form a pattern suitable for data analysis, so that the consistency and the effectiveness of the data are ensured. Such as conversion of data types in safety emergency plans, integration of different data, and processing of data loss and errors.

After data processing, data mining analysis is carried out, and a proper algorithm model is mainly established, and training set data are put into the algorithm model for training to obtain a calculation result. Aiming at different analysis purposes, such as classification, prediction and the like, different algorithm models are established, and a calculation result value with the minimum error with an actual result is finally obtained by continuously optimizing the algorithm models, so that the optimization of the emergency plan content is realized. The invention mainly optimizes the attribute parameters and the attribute results of the safety emergency plan, and the detailed description is as follows:

and 3, optimizing the attribute parameters of the safety emergency plan based on the random forest algorithm.

The method comprises the steps of analyzing historical emergency command case data, preprocessing the historical emergency command case data, extracting attribute parameter influence characteristics of a safety emergency plan, and dividing the attribute parameters into category attribute parameters (namely, classification attribute parameters) and numerical attribute parameters, wherein the classification attribute parameters are used for qualitatively analyzing the emergency plan, such as an affected object of an emergency event, an emergency scheduling resource category and the like, and the numerical attribute parameters are used for quantitatively determining a disposal mode of the emergency plan. The attribute values of the attribute parameters are usually set according to manual experience, such as the quantity setting, the type setting, the setting of an emergency influence object, the setting of an influence range and the like of certain types of emergency resources, and the setting is subjective depending on human, so that the invention analyzes the content data of the historical safety emergency plan and provides a safety emergency plan attribute parameter optimization method based on a random forest algorithm.

The method selects a random forest algorithm to establish a regression model and a classification model, predicts the attribute parameter values, continuously optimizes the regression model and the classification model according to the error between the prediction result of the attribute parameter values and the actual case parameter values, and finally enables the category attribute parameter values and the numerical attribute parameter values predicted according to the regression model and the classification model to be closer to the actual case parameter values, so that the aim of setting the attribute parameters of the intelligent safety emergency plan is fulfilled.

The principle of the random forest algorithm is that k samples are repeatedly and randomly extracted from an original training sample set N in a replacement mode to generate a new training sample set, then k decision trees are generated according to the new training sample set to form a random forest, and the prediction result of a sample to be tested is determined through the average value of the predicted values of the k decision trees. As shown in equation (1):

the left side of the equation in equation (1) represents the final predicted value, h_i(x) The predicted values for each decision tree are represented. Similarly, when a random forest is used for predicting the classification attribute parameters, the classification of the classification attribute parameters is determined based on a voting principle of a plurality of decision trees.

The invention takes a sudden flood disaster as an example, and explains the specific steps of the invention aiming at the prediction of two different attribute parameters of an affected object and an affected range. The influence range is subjected to regression prediction based on random forests, and the influenced objects are subjected to prediction based on random forest classification. The influence range prediction method based on the random forest algorithm comprises the following steps:

and 311, constructing a sample data set. Acquiring emergency case data such as flood disasters and the like in a historical safety emergency plan database, constructing an initial sample data set, setting an influence range as y, setting other description information of the emergency as X, setting X as a vector and setting dimensionality as a characteristic number, and constructing the initial sample data set in the shape shown in table 1:

TABLE 1 initial sample data set

Case m

Event rank x1

Time x2

Affected object x3

Wind direction x3

Wind speed x4

Weather x5

.....xn

Extent of influence y

Case 1

Level 1

17:20

Construction of buildings

Northern wind

4km/s

All-weather

1000m

……

Case m

Grade 3

9:20

Personnel

Northwest wind

5km/s

Yin (kidney)

……

3000m

And (3) randomly putting back samples from the initial sample data set by using a Bootstrap (random resampling) method to take m samples, carrying out sampling for n times in total to generate n training sets with the same size as the initial sample data set, and using the residual data as test data to form a test set for algorithm evaluation.

And step 312, processing emergency plan data. Normalization and standardization processing are carried out on the emergency plan data, so that the data cannot be influenced by magnitude. For example, the event occurrence time is labeled by using a number, such as the morning, afternoon and evening, and then converted into a digital label. Since the characteristics of each case are different, there may be a case where the characteristic value is missing, and therefore interpolation processing needs to be performed on the missing value.

Step 313, processing the characteristics of the emergency data, screening the characteristics with the correlation larger than a set threshold value, and combining the characteristics with strong correlation. As shown in table 1, the correlations between the n eigenvalues and the influence range of the emergency are different, so that the relationship between the independent variable and the single dependent variable needs to be analyzed, a correlation coefficient matrix is established, and features with correlations greater than 0.7 are screened. Meanwhile, there may be strong correlation between features, so it is necessary to merge features with strong correlation. The features are processed to obtain a data set as shown in table 2:

TABLE 2 data set after screening for features

Case m

Event rank x1

Time x2

Affected object x3

Wind direction x3

Wind speed x4

Weather x5

.....xi

Extent of influence y

Case 1

Level 1

17:20

Construction of buildings

Northern wind

4km/s

All-weather

1000m

……

Case m

Grade 3

9:20

Personnel

Northwest wind

5km/s

Yin (kidney)

……

3000m

I < < n in Table 2.

And step 314, extracting the characteristics of the emergency data so as to construct a single decision tree. After the features are screened, j features are randomly selected from the i features for each training set, wherein j < < i, and a single decision tree is constructed by using the randomly selected j features.

Step 315, traversing each feature in each training set and the feature value corresponding to the feature, selecting the optimal feature to segment the training sets, and segmenting each training set into a data space R1 and a data space R2; the principle of optimal feature selection is shown in equation (2):

wherein j represents the j-th feature, s represents the s-th feature value of the j-th feature, x represents the sample, y represents the actual value of the influence range, c1 represents the predicted value of the influence range under the data space R1, and similarly, c2 represents the predicted value of the influence range under the data space R2. Namely, the influence range prediction error can be minimized after the training set is divided according to the s-th eigenvalue of the j-th characteristic. The training set is divided into two parts, for example, wind speed is selected as the optimal characteristic in table 2, 5km/s is selected as the optimal characteristic value, the training set can be segmented into two training sets with wind speed >5km./s and wind speed <5km/s, and meanwhile, for each segmented training set, the predicted value can be obtained according to formula (3).

Wherein R is_mRepresenting the divided training set, y_iShows the influence range of each caseThe left side of the equation represents the predicted impact range, i.e., the predicted value of the impact range, and the right side of the equation represents the mean of the impact ranges of all cases in the training set.

Step 316, recursively invoking step 315 on the two segmented data spaces R1 and R2, stopping until no optimal features are available for segmenting the training set or the deepest depth of the tree is reached, thereby generating a single decision tree.

And 317, repeating the steps 315 and 316 by using n training sets respectively to construct a single decision tree, and finally constructing a random forest consisting of n decision trees. The random forest construction process is shown in fig. 3.

And 318, predicting attribute parameter values by using the random forest.

Randomly extracting a flood disaster case in the test set as a case to be predicted, firstly putting the flood disaster case into a decision tree, segmenting the data set according to a first optimal characteristic (the wind speed is more than 5km/s), and if the sample is classified into a leaf node data set after segmentation, taking the mean value of the influence range of the data set as the predicted value of the influence range of the case to be predicted; if the sample is not the leaf node data set, continuing segmentation until the sample is included in the leaf node data set, and taking the mean value of the influence range of the final data set as a predicted value of the influence range of the case to be predicted; and putting the case to be predicted into the other decision trees, repeating the steps, and solving the mean value of the n predicted values to obtain the predicted value of the final influence range.

The above process is only an optimization method of one numerical attribute parameter in the safety emergency plan, and other numerical attribute parameters, such as the quantity of emergency scheduling resources, the evacuation range and the like, can be predicted by using the algorithm to give a predicted value.

The method for predicting the affected objects based on the random forest algorithm comprises the following steps:

step 321, constructing a sample data set. Acquiring flood disaster case data in a historical safety emergency plan database, constructing an initial sample data set, setting an affected object as y, setting other description information of an emergency as X, setting X as a vector and dimensionality as a characteristic number, and constructing the initial sample data set in the form shown in Table 3:

TABLE 3 initial sample data set

Case m

Event rank x1

Time x2

Event location x3

Water flow rate x4

Weather x5

.....xn

Affected object y

Case 1

Level 1

17:20

Country

4km/s

All-weather

Construction of buildings

……

Case m

Grade 3

9:20

Suburb

5km/s

Yin (kidney)

……

Personnel, livestock

And 322, processing emergency plan data. Normalization and standardization processing are carried out on the emergency plan data, so that the data cannot be influenced by magnitude. For example, the event occurrence time is labeled by using a number, such as the morning, afternoon and evening, and then converted into a digital label. Since the characteristics of each case are different, there may be a case where the characteristic value is missing, and therefore interpolation processing needs to be performed on the missing value.

Step 323, processing the characteristics of the emergency data, screening the characteristics with the correlation greater than a set threshold value, and combining the characteristics with strong correlation. As shown in table 3, the correlations between the n eigenvalues and the affected objects of the emergency are different, so that the relationship between the independent variable and the single dependent variable needs to be analyzed, a correlation coefficient matrix is established, and features with correlations greater than 0.7 are screened. Meanwhile, there may be strong correlation between features, so it is necessary to merge features with strong correlation. The features are processed to obtain a data set as shown in table 4:

TABLE 4 data set after screening for features

Case m

Event rank x1

Time x2

Location x3

Rainfall x4

Weather x5

.....xi

Affected object y

Case 1

Level 1

17:20

Country

5mm

Heavy rain

Construction of buildings

……

Case m

Grade 3

9:20

Suburb

10mm

Storm rain

……

Personnel, livestock

I < < n in Table 4.

And 324, extracting the characteristics of the emergency data so as to construct a single decision tree. After the features are screened, j features are randomly selected from i features for each training set, wherein j < < i, and a single decision tree is constructed by using the j features.

Step 325, traversing each feature in each training set and the feature value corresponding to the feature value, selecting the optimal feature to segment the training sets, and segmenting each training set into a data space R1 and a data space R2; the principle of optimal feature selection is shown in equation (4):

the formula (4) is a damping coefficient of the selected characteristic, the damping coefficient represents the impure degree of the model, and the smaller the damping coefficient is, the lower the impure degree is, and the better the characteristic is. And K represents the category of the affected object, namely the prediction error of the affected range can be minimized after the training set is segmented according to the K-th optimal characteristic. The training set is divided into two parts, for example, the rainfall is selected as the optimal characteristic in table 4, and 5mm is selected as the optimal characteristic value, so that the training set can be divided into two subsets, namely, the rainfall >5mm and the rainfall <5 mm.

Step 326, recursively invoking step 325 on the two segmented data spaces R1 and R2 until no optimal features are available for segmenting the training set or the deepest depth of the tree is reached, thereby generating a single decision tree.

And 327, repeating the step 325 and the step 326 by using n training sets respectively to construct a single decision tree, and finally constructing a random forest consisting of n decision trees.

And 328, predicting attribute parameter values by using the random forest.

Randomly extracting a flood disaster case in the test set as a case to be predicted, firstly putting the flood disaster case into a decision tree, segmenting the data set according to a first optimal characteristic (rainfall is greater than 5mm), and if the sample is classified into a leaf node data set after segmentation, taking the maximum sample number probability in the data set as a predicted value of the affected range of the case to be predicted; if the sample is not a leaf node dataset, continuing to cut until the sample is included in the leaf node dataset; and putting the cases to be predicted into the other decision trees, repeating the steps to obtain the predicted values of the n affected objects, and taking the category with the most sample prediction as the final predicted value of the affected objects according to the voting principle.

Similarly, the above process is only an optimization method of one category attribute parameter in the safety emergency plan, and other category attribute parameters can be predicted by using the algorithm to give a predicted value.

And 4, optimizing the attribute structure of the safety emergency plan based on the association algorithm.

The safety emergency plan model can be established in advance. In the process of establishing a safety emergency plan model, the attributes of each part of the emergency plan are manually increased or reduced, a large amount of historical safety emergency plan data are analyzed, an emergency plan class-attribute association model is established by combining an association algorithm, and the attribute structure of the safety emergency plan is optimized, and the method specifically comprises the following steps:

and step 41, acquiring a data set. The invention analyzes the correlation model of the emergency resource scheduling class and the corresponding attribute in the safety emergency plan and finds the attribute of the emergency resource scheduling class. Similarly, the safety emergency plan can be selected as a class, and the corresponding attribute can be found.

And step 42, establishing a frequent item set. Firstly, setting a support degree and a confidence degree threshold value, wherein the support degree represents the times of the emergency resource scheduling class and a certain attribute (such as the maximum scheduling radius) appearing in an emergency plan at the same time, and the confidence degree represents the times of the emergency resource scheduling class and a certain attribute (such as the maximum scheduling radius) appearing in an emergency plan/the number of historical emergency plans. An initial plan entry set table is then established, as shown in table 5:

TABLE 5 initial plan entry set Table

In the table, L1 represents the emergency resource scheduling scheme class, L2 represents the emergency resource category, L3 represents the emergency resource name, L4 represents the number of emergency resources, and L5 represents the maximum scheduling radius. All cases in table 3 are traversed and each attribute is counted, resulting in a first candidate attribute item set table C1, as shown in table 6:

table 6 candidate table C1

Item set	Degree of support
		{L1}	10
{L2}	5
		{L3}	4
{L4}	4
		{L5}	5

The support degree in the table indicates the number of times the set of items occurs in all cases. Assuming that the preset support threshold is 2, the item set greater than or equal to 2 is reserved, the item set smaller than 2 is deleted, and all the item sets in C1 are reserved because the support of all the item sets is greater than 2. At this time, C1 was designated as the first frequent item set Lable 1. After getting Lable1, self-join using the first frequent item set Lable1 gets a second candidate property item set Table C2, as shown in Table 7:

table 7 candidate table C2

Deleting all the item sets with the support degrees smaller than the preset support degree threshold 2 in the second candidate attribute item set table to obtain a second frequent item set table Lable2, as shown in Table 8:

TABLE 8 frequent item set Table Lable2

Item set	Degree of support
		{L1,L2}	5
{L1,L3}	4
		{L1,L4}	4
{L1,L5}	5
		{L2,L4}	2
{L2,L5}	2
		{L4,L5}	2

Traversing the second frequent item set table Lable2, and simultaneously self-connecting the second frequent item set table Lable2 to obtain a third candidate attribute item set table C3, as shown in Table 9:

table 9 candidate table C3

Item set
	{L1,L2,L3}
{L1,L2,L4}
	{L1,L2,L5}
{L1,L3,L4}
	{L1,L3,L5}
{L1,L4,L5}
	{L2,L4,L5}

In table 7, since non-frequent item sets exist in the item sets { L1, L2, L3}, { L1, L3, L4}, { L1, L3, and L5}, deletion is required, and the remaining item sets are retained to obtain the final third candidate set table C3, as shown in table 10

Table 10 candidate table C3

Deleting the item set with the support degree smaller than the preset support degree threshold value 2 in the third candidate set table to obtain a third frequent item set table lab 3 shown in table 11:

TABLE 11 frequent item set Table L3

Item set	Degree of support
		{L1,L2,L4}	2
{L1,L2,L5}	2

Traversing the third frequent item set table Lable3, and self-connecting the third frequent item set table to obtain { L1, L2, L4, L5}, deleting the item set { L2, L4, L5} which does not belong to Lable3, so that the candidate item set C4 is empty, and ending the algorithm to obtain all frequent item sets.

And 43, discovering the strong association relationship between the classes and the attributes. Extracting all strong association rules in the third frequent item set Lable3 according to the confidence coefficient to obtain the strong association rule discovery result shown in Table 12:

table 12 strong association rule discovery

Association rules	Confidence level
		{L2,L5}→L1	Support of { L1, L2, L5 }/{ L2, L5}, 100%

The calculation method of confidence in the table is shown in formula (5):

as can be seen from table 12, when the attributes L2 (emergency resource category) and L5 (maximum scheduling radius) occur simultaneously, they must be in the emergency resource scheduling scheme class. With the increase of the number of cases in the safety emergency plan historical library, the method can be used for discovering the strong association relationship between the class and the attribute, so that the attribute structure of a certain class of the safety emergency plan is optimized.

Based on the same inventive concept, this embodiment further provides a computer device, which includes a memory and a processor, where the memory has an executable program, and when the processor executes the executable program, the computer device implements the above steps of the optimization method of the present invention.

The above embodiments are preferred embodiments of the present invention, but the present invention is not limited to the above embodiments, and any other changes, modifications, substitutions, combinations, and simplifications which do not depart from the spirit and principle of the present invention should be construed as equivalents thereof, and all such changes, modifications, substitutions, combinations, and simplifications are intended to be included in the scope of the present invention.

Claims

1. The emergency plan content optimization method is characterized by comprising the following steps:

2. The emergency plan content optimization method according to claim 1, wherein step S1 is to form a historical security emergency plan database by analyzing the text form emergency plan, and extracting content structure and attribute parameters of the text form emergency plan; when the actual emergency type is the same as the safety emergency plan type, carrying out automatic emergency intelligent command according to the emergency plan designed by the safety emergency plan model, and simultaneously adding the emergency content into a historical safety emergency plan database; and if the actual emergency type is different from the safety emergency plan type, adding the contents of each part of the actual emergency into a historical safety emergency plan database.

3. The emergency plan content optimization method according to claim 1, wherein the step S3 includes optimizing attribute parameters of the safety emergency plan based on a random forest algorithm:

4. The emergency plan content optimization method according to claim 3, wherein step S3 is implemented by using a random forest algorithm, repeatedly and randomly extracting k samples from the original training sample set N in a replacement manner to generate a new training sample set, and then generating k decision trees according to the new training sample set to form a random forest, wherein the prediction result of the sample to be tested is determined by an average value of predicted values of the k decision trees.

5. The emergency plan content optimization method of claim 3, wherein the step S3 is based on a process of predicting attribute parameter values by a random forest algorithm, and comprises:

s32, processing the data of the safety emergency plan;

and S38, predicting the attribute parameter values by using the random forest.

6. The method for optimizing the contents of an emergency plan according to claim 5, wherein the step S38 of predicting the numerical attribute parameter values by using a random forest comprises the following steps:

randomly extracting an emergency case in the test set as a case to be predicted, firstly putting the emergency case into a decision tree, segmenting the data set according to a first optimal characteristic, and if the sample is classified into a leaf node data set after segmentation, taking the mean value of the influence range of the data set as a numerical attribute parameter prediction value of the case to be predicted; if the sample is not the leaf node data set, continuing segmentation until the sample is included in the leaf node data set, and taking the numerical attribute parameter mean value of the final data set as a predicted value of the numerical attribute parameter of the case to be predicted; putting the case to be predicted into other decision trees, repeating the steps, and solving the mean value of the n predicted values to obtain the predicted value of the final numerical attribute parameter;

the process of predicting the category attribute parameter values by using the random forest in step S38 is as follows:

randomly extracting an emergency case in the test set as a case to be predicted, firstly putting the emergency case into a decision tree, segmenting the data set according to a first optimal characteristic, and if the sample is classified into a leaf node data set after segmentation, taking the maximum sample number probability in the data set as a category attribute parameter prediction value of the case to be predicted; if the sample is not a leaf node dataset, continuing to cut until the sample is included in the leaf node dataset; and putting the cases to be predicted into the other decision trees, repeating the steps to obtain predicted values of the n category attribute parameters, and taking the category with the most prediction of the sample as the final predicted value of the category attribute parameters according to a voting principle.

7. The emergency protocol content optimization method according to claim 1, wherein the step S3 includes optimizing an attribute structure of the safety emergency protocol based on a correlation algorithm:

8. The emergency protocol content optimization method according to claim 7, wherein the step S3 is a process of optimizing an attribute structure of the security emergency protocol based on a correlation algorithm, and includes:

9. The emergency plan content optimization method according to claim 8, wherein the step S32' creates a frequent item set as follows:

10. Computer device comprising a memory and a processor, the memory having an executable program thereon, characterized in that the processor, when executing the executable program, performs the steps of the optimization method according to any one of claims 1 to 9.