Summary of the invention
For above-mentioned problems of the prior art, the invention provides a kind of method of carrying out event based on Mining Security Quality standard, to be applied in management practice production run, improve business administration efficiency.
For realizing above-mentioned goal of the invention, the present invention adopts following technical scheme:
A method of carrying out event based on Mining Security Quality standard, comprises text pretreatment module, inscape analysis module, syntactic structure parsing module, corresponding relation execution module, Standard Decomposition refinement module, event relation MBM, data center's module; Its Chinese version pretreatment module is Words partition system interface, completes the function that extracts word; Inscape analysis module completes the function of analyzing quality control standard and event inscape; Syntactic structure parsing module difficulty action accomplishment administrative standard is converted into the function of the syntactic structure of event; The function of the corresponding relation of corresponding relation execution module difficulty action accomplishment administrative standard and event; The decomposition of Standard Decomposition refinement module difficulty action accomplishment administrative standard and the function of refinement strategy; Event relation MBM difficulty action accomplishment administrative standard develops into the function of event relation modeling; Data center's module completes the function that can generate event data center by the model of relationship modeling;
Realize a kind of method of carrying out event based on Mining Security Quality standard, comprise the steps:
1. text pretreatment module major function is that in the quality control standard document obtaining by ICTCLAS participle interface, text carries out Chinese word segmentation and part-of-speech tagging;
2. inscape analysis module major function is to analyze quality control standard and event inscape, and event information will possess such function, at least must be clear and definite who what is done, does, how to do, accomplish what degree, this four problems of cognition clearly.By above-mentioned participle interface, get word and part-of-speech tagging, just can clearly analyze the fundamental of event: behavioral agent, behavior performance, behavior condition and performance degree;
3. syntactic structure parsing module major function is the syntactic structure that analysis mass administrative standard is converted into event, and the basic syntactic structure that will analyze it at least consists of the key concept (noun) of actional verb and the sensing of these verbs; Specifically, the main syntactic structure of quality standard transformation event has four kinds: described behavior performance, behavior performance+behavior condition, behavior performance+behavior condition, show+performance of behavior degree, behavior performance+behavior condition+performance degree;
4. corresponding relation execution module major function is the corresponding relation of difficulty action accomplishment administrative standard and event, by just having a lot of words and part-of-speech tagging, inscape and constituent grammar after above-mentioned text pretreatment module, it is a complicated course that information in quality control standard document is converted into each event information, there are three kinds of relations of multiple minute disorientation (is olation, decomposition strategy and corresponding relation), one to one, one-to-many, many-to-one concept;
5. Standard Decomposition refinement module major function is decomposition and the refinement strategy of difficulty action accomplishment administrative standard, in the standard execution event based on quality management, generates in procedure, and the effect of decomposing refinement progressively forms the base attribute element of event information; According to three kinds of corresponding relations of above-mentioned quality control standard and event mission bit stream, we are defined as four kinds of corresponding strategies by the decomposition of quality control standard: substitute, disassemble, combine, focus on connection;
6. event relation MBM major function is that quality control standard develops into event relation modeling, in decomposition and strategy by above-mentioned quality control standard, can develop into the model of event, roughly need two steps, the one, the generation of event, the cluster process of quality control standard namely, the 2nd, the foundation of event dependent, by specific algorithm judge and other events between Existence dependency relationship whether;
7. data center's module major function is that model by relationship modeling can generate event data center module, with classification, feature, the management of event, mainly take the data center of place, time, the department partition table concept in dimension is converted into relational database.
Technical scheme of the present invention has following beneficial effect: technical scheme of the present invention, part of speech while distinguishing very soon by participle interface, and by its strategy of selection of its Constitution Elements of quality standard administrative analysis, grammer, corresponding relation, intelligence, quality management standardization has been formed to event model, draw to event take organizational structure as classification or each event between incidence relation, rapid interpretive quality standard is converted into the execution event guiding opinion in production run in enterprise, thereby enhances productivity.
Embodiment
Referring to Fig. 1 and Fig. 2, a kind of method of carrying out event based on Mining Security Quality standard, comprises text budget processing modules A, inscape analysis module B, syntactic structure parsing module C, corresponding relation execution module D, Standard Decomposition refinement module E, event relation MBM F, the module G of data center; The method major function of carrying out event based on Mining Security Quality standard is resolve inscape, grammer, relation and select its strategy, develops into event relation model, generates event data center.
1) text pretreatment module A
Major function is that the quality control standard text to obtaining carries out Chinese word segmentation and part-of-speech tagging, and result is outputed to MBM.
2) inscape analysis module B
Event will possess such function, at least must be clear and definite who what is done, does, how to do, accomplish what degree, clearly answering this four problems, is by above-mentioned 1) participle just can analyze the fundamental of event: behavioral agent, behavior performance, behavior condition and performance degree.
Behavioral agent---behavioral agent is executor, supvr.Quality control standard is to describe executor's behavior and gerentocratic behavior.Standard is converted into event: " XXXX sector member should ... ", " XXXX member coordinate check ventilating system ", " surveying work is carried out advice note system ", suitable event is for specific executor.
Behavior performance--behavior performance comprises wishes the task that executor completes and the result of reaching, and the behavior outcome of expection can be divided three classes: achievement result, as formulated ventilation plan, the management system of relieving; Experience property result, as the training mechanism of Erecting and improving; Novelty result, bursts as the inrush through faults that mine is constituted a threat to, roof gushing water, Water Inrush From Karstic Collapse Columns, earth's surface that the various water damages such as water detect, diagnosis and pre-control.Therefore they are comprised of two parts: actional verb and key concept (noun).Actional verb forms exercisable concrete behavior in order to describe executor, as checked, record, formulate, diagnose, distinguish, pre-control, comparison, indicate, solution, extraction, examine etc.Key concept (noun) is the object that actional verb points to, as inspection sheet, the document of relieving, plenum area, gas, plan, scheme etc.
Behavior condition--behavior condition refers to affects the result of executor's operation or the specific limited of complete operation task or scope etc., and the sight of the number quantitative limitations such as supplementary means or instrument, the information providing or prompting, time/number of times/space, consummatory behavior etc. is be provided.As " being with upper xxxx detector ", " at 8106 workplaces ", " completing after xxxx ", in " in the work hours ", " by observing and discussing " etc.
Performance degree--performance degree refers to that a certain colony or the executor of organizational structure complete the minimum performance level of event, in order to evaluate the performance of operation or the degree that execution result reaches.As: " can not transfinite ", " solving completely ", " effective measures ", " up to standard ", " normally operation " etc.
3) syntactic structure parsing module C
The result of quality standard transforms generation event, and the key concept (noun) that its basic syntactic structure is at least pointed to by actional verb and these verbs forms.Specifically, the main syntactic structure of quality standard transformation event has four kinds.
The first: described behavior performance
This syntactic structure of quality standard is the most basic, modal a kind of, and it is to consist of actional verb and two parts of key concept, description be " what is done ".In the quality standard of this syntactic structure, actional verb only has one sometimes, sometimes has several; And key concept may be one or several nouns, may be also the noun that adds determiner, or even a proposition.Such as " mine ground is surveyed anti-harness the river data, technical report etc. and is examined process " etc.
The second: behavior performance+behavior condition
The second grammer of quality control standard is on the basis of said structure, has increased the sentence element of a regulation behavior condition, for executor or proofer provide guidance, suggestion.It is to consist of behavior performance and two parts of behavior condition, description be " what is done " and " how doing ".For example " should carry out post staff training, its ability meets Piao and answers job position request; ", " emulsion pump pressure and concentration of emulsion used meet the requirements, and have Site Detection means; " etc.
The third: show+performance of behavior degree
The third syntactic structure of quality standard is on the basis of basic structure, increase adjective and adverbial word etc. and described the composition of behavior performance level, performance program for executor's behavior or learning outcome has proposed clearer and more definite requirement, for the exploitation of production performance provides guidance.It is that two parts of behavior performance and performance program form, description be " what is done " and " doing what what degree ".For example: " operating regulation and measure specific aim, strong operability, examination and approval procedures are complete, implement, examination and record of signature complete, operating regulation is at least organized 1 reexamination for every 2 months and is had reexamination suggestion; "; " before and after ventilation equipment, within the scope of 5 meters, supporting is intact, no-sundries, ponding and mud " etc.
The 4th kind: behavior performance+behavior condition+performance degree
This syntactic structure of quality control standard is rare, and it is on the basis of basic structure, has increased the composition of describing behavior condition and behavior performance level simultaneously.Such quality standard is to provide and instructs and suggestion production run, is also that executor's operating result has been proposed to general requirement.This is to consist of behavior performance, behavior condition and three parts of performance degree, description be " what is done ", " how doing ", " accomplishing what degree ".Such as: " electromechanical equipment choice model proof, integrated management program standard, all informations such as equipment account, technical drawing such as purchase, install, use, safeguard, overhaul, renovate, scrap; ", " workplace outlet is unimpeded, return airway and transportation lane section meet ventilation, transportation, pedestrian, equipment are installed, the needs of maintenance ".
4) corresponding relation execution module D
By just having a lot of words and phrase after above-mentioned text pre-service, add constituent grammar, it is a complicated course that information in quality control standard document is converted into each event information, has multiple minute disorientation, is olation, decomposition strategy and corresponding relation.Quality standard is resolved into event, and corresponding relation quantitatively roughly has following three kinds of situations (seeing the following form):
As shown above, to refer to that a quality standard is reached the correspondence of an event information obviously visible for one-one relationship.Many-one relationship refers to that a quality standard need to resolve into many event tasks and just can reach.Many-to-one relation be by many quality standards or wherein relevant event combinations of factors, focus on or be bound up and become an event mission bit stream.
5) Standard Decomposition refinement module E
In the standard based on quality management, carry out in event generation system process the design of the information attribute that acts on formation event that quality control standard is decomposed.According to three kinds of corresponding relations of above-mentioned quality control standard and event mission bit stream, we are defined as four kinds of corresponding strategies by the decomposition of quality control standard: substitute, disassemble, combine, focus on/connect, see the following form.
(1) alternative strategy
Utilize man-to-man corresponding relation, with certain word, replace the keyword in initial quality standard, form event mission bit stream.The statement concept that replacement " is provided with " as " well-shoot material depot is provided with independently ventilating system " use " inspection ", can form the event information ventilating in checking.And for example, " correct methods ventilation production run face a danger mode of operation ", wherein can " production run of ventilating " replacement " gas exceeding limit ", can form an Action Events mission bit stream in gas exceeding limit unit.
(2) disassemble strategy
Use the corresponding relation of one-to-many, claim standard to disassemble into several associated thin indexs mutually quality control standard, with this, form concrete event information.As " mine should be worked out ventilation plan " " ventilation plan " wherein disassemble into " year ventilation operation plan ", " season ventilation operation plan ", " month ventilation operation plan ", etc., can form many event tasks in a planned quality control standard; These are still planned is all inactive, and also disassembled is the event of the corresponding cooperation obtaining information of different departments, such as to obtain of that month production output, with go into the well the information of number.And for example, " the maintain measure of (inverted ventilation) facility that ventilates." " maintenance " can disassemble as " spot check of equipment ", " equipment lubricated ", " patrolling and examining of equipment " etc., can form in " to (inverted ventilation) facility that ventilates " quality control standard many event informations targetedly.
(3) combined strategy
Use many-to-one corresponding relation, merge many quality control standards, form an event task.As " formulating ventilation instrument rules for storage ", " formulate ventilation instrument maintenance ", " formulating ventilation instrument maintenance regime " is the subordinate concept of formulation system owing to taking care of and maintaining, therefore, can be combined, be formed " management system of formulating ventilation instrument " this event information.
(4) focus on/connect strategy
This is also to use many-to-one corresponding relation, chooses partial content identical or that only have relevance in many quality standards and, as the focus of event, forms an event information.As " gas pumping pump driver accomplishes to take appointment with certificate ", " reexamination on time of gas pumping pump driver certificate ", can focus on/connect, and forms " each post work requires the problem of taking appointment with certificate " this event information.
Can find out, disassembling with combined strategy is a pair of contrary process.Disassembling strategy is that a quality control standard is decomposed into less event information, and combined strategy is polymerized to a large event several little event informations.Emphasize, decompose quality management according to process that in production run, truth situation combines.In this process, basic decomposition strategy is to disassemble, even if adopt, focuses on/connect strategy, also must be on the basis of " disassembling ", focus on/connect the wherein identical or associated part of content and form event information, therefore, disassembling strategy is the elementary tactics that decomposes quality control standard.
6) event relation MBM F
Described quality control standard develops into relationship modeling, by the summary of above-mentioned event relation modeling, we can find out the model that will obtain event, roughly need two aspects, the one, the generation of event, namely quality control standard in be polymerized to more fine-grained event procedure, the 2nd, the foundation of event dependent, by certain place, space or sequential, causal dependence, judge and other events between Existence dependency relationship whether.
Want complete simulating human identification to carry out the process of event relation, need a large amount of association area knowledge, the complexity of algorithm is also huge.Under more susceptible condition, main employing relates to time factor, locality factors, organizational structure's factor, character factor, adopts traditional TF-IDF method weighting technique, and the modeling method of introducing analogical learning realizes:
1, in quality control standard, be to sort out with place, organizational structure, time or other definition mostly;
Whether 2, the similarity between event information is applied in cluster, associated between judgement event;
For example: " gas preventing and control should meet the requirements " as planting subevent, it is associated other events all if any: gas preventing and control professional contingent, gas density Comparison of standards, gas-geologic map, gas detection place, gas detection number of times, gas detection industry and traffic are succeeded etc. and to be met relevant regulations.
For the problems referred to above, propose to calculate based on semantic distance the quality that similarity and two Stage Clustering schemes between document improve Text Clustering Algorithm, first, from analytical documentation semantically, adopt nearest neighbor algorithm to carry out cluster for the first time; Secondly, according to similarity weight, category feature word is selected the superior and eliminated the inferior; Then carry out class merging, last, carry out cluster for the second time, solve nearest neighbor algorithm to input order sensitive issue.
Text Clustering Algorithm based on semantic distance is used adjacent clustering algorithm, and first using first piece of document as the first kind, then document is left in scanning successively, if document is the most similar to current class, belongs to current class, and upgrades class center; Otherwise increase by a class, Xin Lei center is current document, and judges the computing method whether document is similar to class.
The flow process of algorithm:
Algorithm flow is: initialization first kind center; Cluster for the first time; Arrange class; Cluster for the second time;
The formal specification of respective algorithms
initialization first kind center
{?obtain?first?documentin?in?ArtistList;
K=1; // K: current class sum
InitCenter(K)?;}
cluster for the first time
Cluster is based on nearest neighbor classifier algorithm for the first time.
FirstQuster (ArtistList, W, WaitQuster) // W: the classification set that cluster produces; K: current class sum.
{ WaitCluste={ }; The W={ initialization first kind }; K=1;
For?each?document?i?in?ArtistList?Do
{?Max?=?0?;?Flag?=?1;?SumSim?=?0?;
For?each?class?k?in?W?Do
{ call function ComputeSim (i, k);
Calculating makes the maximum class t of SumSim value and i related term to Count Max;
{ i adds t to If CountMax >=2 Then; UpdateCenter (t); }
Else { K=K+ 1; Init Center (10; A newly-increased class adds in W }
}?}
For each class j in W Do //m: { in If m=l then j, document adds in WaitCluste the number of files comprising in j; ?
arrange class
CleanClusteKW) // current class adds up to K, and class set is combined into W.
{ number of documents in For each class i and j in W Do //Size (i): class i
{?If?Size?(i)?>?3?and?Size?(j)?>?3?and?VVSimCenter?(i)?>0.8?and
VVSimCenter(j)?>0.8
{ merge i and j is m to Then; UpdateCenter (m); }
{ { merge i and j is m to Else for Ifi and j related term logarithm > Size/2 Then;
UpdateCenter(m)?;?}?}
}?}
cluster for the second time
Secondduster(WaitCluster,VVCenter?,?W?,?K)
{ the W={ class set that cluster produces for the first time }; K=count (W);
For?each?document?i?in?WaitCluster?Do
{?Max?=?0?;?Flag?=?0?;
For?each?class?k?in?VV?Center?Do
{?ComputeSim(i,k);
If?CountMax?>?=2
{ i adds in k Then; Up dateCenter (k); Flag=1; Exit; }
//CountMax: similarity is greater than 0. 8 similar word to quantity
If?Flag?=?0
Then { K=K+1; InitCenter (K); A newly-increased class adds in W;
exit;}
}
}
}
Propose a kind of method (Event Threading) that is modeling from event dimension, can be interpreted as sequence of events is together in series simply, namely excavate event between relation.A quality control standard modeling is not only to identify relevant event, also comprises the dependence between the event of foundation.The net result of this new modeling method is a series of inline event set, is called the event model (Events Model) of quality standard.Event model can present the overall content of quality standardization, and it is quicker to make to travel through whole quality standard.
Events Threading will be take time, place, organizational structure, character as factor and be converted into dimension and be refine in event model.
7) data center's module G
Event classified information: the roughly standard of description event classification, the data item comprising has: classification numbering, item name, event are described substantially, keyword etc.
Affair character information: describe time, place and the influence degree of event, the data item comprising has: the numbering of event, scene, working time, duration, affect situation, executive arm etc.
Incident management information: describe the data recording of incident management process, comprise that data item has: the method for Case Number, search, incident management strategy etc.