Summary of the invention
For the problems referred to above, the invention provides a kind of building method of the infosystem operation rule storehouse based on association rule mining, utilize fault-tree technology and Association Rule Mining to carry out intelligent information generated system cloud gray model rule base, and adopt machine learning techniques to be optimized rule.Further, devise three domain structures of rule, achieve auto-sequencing and the adjustment automatically of rule.
For realizing above-mentioned technical purpose, reach above-mentioned technique effect, the present invention is achieved through the following technical solutions:
Based on the building method in the infosystem operation rule storehouse of association rule mining, it is characterized in that, comprise the steps:
S01: obtain the network topology architecture of infosystem and the dynamic monitoring index of all devices and static monitor control index;
S02: by the dynamic and static monitor control index generating network fault tree of network topology architecture and equipment, and generate primitive rule storehouse by network failure tree;
S03: association rules mining algorithm is performed to the historical data of infosystem, obtains correlation rule storehouse;
S04: carry out reasoning in conjunction with primitive rule storehouse and correlation rule storehouse and generate extension rule storehouse;
Wherein, the retrieval priority of each rule base is: > extension rule storehouse, > correlation rule storehouse, primitive rule storehouse.
Preferably, each rule in primitive rule storehouse is three domain structures, namely comprises,
Sequence of rules territory: the number of times that rule runs succeeded in the operational process of reality, performs failed number of times, the final counting of rule and rule compositor;
Rule identification field: the subordinate object being used for identifying this rule;
Regulatory body territory: for the detailed description to rule.
Preferably, system real-time executing rule sort algorithm Sum fanction flow algorithms carries out the refreshing of priority determination Sum fanction to rule.
Wherein, in each rule base, determine by the final counting index of the rule in sequence of rules territory the priority that rule is retrieved, wherein, the formula of the final counting of rule is:
F=R-0.5W
In formula, F is final counting, and R is the number of times that rule runs succeeded in actual moving process, and W is that rule performs failed number of times; If carry out machine learning to the failed scene of execution, to dependency rule through optimizing and solving relevant issues, then the failed number of times W of corresponding execution subtracts one.
Preferably, the regular flow algorithm in correlation rule storehouse is: in running process, as long as rule is once proved to be correct, directly moves to primitive rule storehouse; If this rule has be proved to be mistake twice, then delete this rule.
Preferably, the regular flow algorithm in extension rule storehouse is: use historical data to verify strictly all rules,
For success ratio in the rule of 80% ~ 100%, after using historical data to carry out machine learning, directly move to primitive rule storehouse;
For success ratio 60% ~ 80% rule, after using historical data to carry out machine learning, if success ratio is greater than 80%, move to primitive rule storehouse, otherwise continue to stay extension rule storehouse, and accept the machine learning of service data, until its success ratio is greater than 80%;
For success ratio in the rule of 50% ~ 60%, use historical data and service data to carry out machine learning, until its success ratio is greater than 80%, move to primitive rule storehouse, otherwise continue to stay extension rule storehouse;
Success ratio is less than to the rule of 50%, directly deletes.
The present invention realizes infosystem operation rule storehouse dynamic construction and optimization, can be applicable to company information O&M comprehensive supervision platform, make the foundation of monitoring alarm rule easier with maintenance, rule matching efficiency is higher, thus adapt to rapidly the various changes in infosystem object, running environment, running state data source, meet extensive INFORMATION SYSTEM PRECEPTS collection matching treatment requirement of real-time simultaneously, greatly improve the practicality of algorithm, promote information system monitoring warning, safety management, behavior auditing and conjunction rule management quality.
The invention has the beneficial effects as follows:
One, the compartmentalization structure of rule base: the rule base of the inventive method design has three subregions, and store primitive rule, correlation rule and extension rule respectively, wherein the priority of primitive rule is the highest, and correlation rule takes second place, and the priority of extension rule is minimum.By the subregion of rule base, the priority orders of the priority management determination rule search of rule can be passed through, and low area rule can be upgraded by continuous real-time machine study, implementation rule flowing from low to high.
Two, three domain structures of rule: three domain structures of rule comprise sequence of rules territory, regular identification field Sum fanction main body territory: the prioritization of sequence of rules territory means implementation rule by quantifying; Rule identification field is used for identifying the subordinate object of this rule, so that rule base self-adaptative adjustment when network topology architecture changes; Regulatory body territory stores the main part of rule, and this is the detailed description to rule.
Three, real-time adaptive threshold adjustment: system utility historical data and service data, analysis meter calculates the alarm threshold of applicable service operation Alerting requirements, improve the alarm self-learning capability for infosystem, adopt threshold value planning algorithm dynamic conditioning alarm threshold, accomplish to reduce volume of event from the source of event, improve the quality of monitoring alarm.
Four, the robotization analysis on its rationality of newly-increased rule warehouse-in: newly-increased rule can be generated automatically by system, also can manually add.For newly-increased rule, adopt historical data and real-time running data to carry out rationalization to rule and analyze, determine the availability of rule.
Five, the automatic adjusting and optimizing of rule: by real-time executing rule sort algorithm Sum fanction flow algorithms, carry out priority to rule to determine and the refreshing of priority or upgrading, guarantee that rule base is in optimum state, improve the accuracy of the recall precision Sum fanction of rule, thus improve system performance.
Embodiment
Below in conjunction with accompanying drawing and specific embodiment, technical solution of the present invention is described in further detail, can better understand the present invention to make those skilled in the art and can be implemented, but illustrated embodiment is not as a limitation of the invention.
Based on the building method in the infosystem operation rule storehouse of association rule mining, as shown in Figure 1, comprise the steps:
S01: obtain the network topology architecture of infosystem and the dynamic monitoring index of all devices and static monitor control index.
First network topology architecture is obtained by Topology Discovery technology, then to each network equipment in topological structure, gather corresponding dynamic monitoring index and static monitor control index, comprise the large class of network index, safety index, main frame index, database index, middleware index and operation system index six.
Network index comprises chain-circuit time delay, network equipment health runs duration, network device state, network equipment CPU usage, network equipment memory usage, accept packet loss, send packet loss, receive Packet Error Rate, send Packet Error Rate, interface flow, interface transmitted traffic, interface total flow and interface broad band availability; Safety index comprises security incident, the state (CPU, internal memory etc.) of safety equipment and compliance; Main frame index comprises Host Status, healthy operation duration, CPU usage, memory usage, disk space utilization rate, critical processes number and host configuration information.
Database index has SqlServer index, Oracle index and DB2 index.Wherein SqlServer index comprises the hit rate of SGA, available cache memory size, the hit rate of dictionary buffer, the hit rate of shared cache area, the hit rate in Redo log buffer district, number of sessions, available sessions quantity, transaction response time, table space available rate, table space rate of growth and MTS performance; Oracle index comprises number of sessions, available sessions quantity, transaction response time, table space available rate, table space rate of growth, shared drive utilization rate, shared drive hit rate and roll-back segment utilization rate; DB2 index comprises Process available rate, Buffer Pool (Bufferpool) available rate, Buffer Pool hit rate, table space available rate, table space rate of growth, sequence index (SortsPerTransaction), number of sessions and available sessions quantity.
Middleware index has Weblogic index and Websphere index.Wherein Weblogic index comprises JVM heap free quantity, JVM heap total amount, JVM heap utilization rate, the all execution durations called of Servlet, the single the longest execution duration called of Servlet, Servlet on average performs duration, Servlet performs number of times, JDBC pool max cap., JDBC Pool is flexibly connected the high-water line of number, JDBC Pool waits for the high-water line of linking number, linking number accumulative since JDBC Pool instantiation, JDBC Pool mean activity linking number, JDBC Pool on average connects time delay, the linking number that JDBC Pool leaks, the current capacities of JDBC pool, the failed number that JDBC Pool reconnects, the maximum available linking number of JDBC Pool, the maximum unavailable linking number of JDBCPool, JDBC Pool LEAKED linking number, available linking number in JDBC Pool, unavailable linking number in JDBC POOL, JDBC Pool utilization factor, current sessions number, maximum number of sessions and session occupancy, Websphere index comprises JVM internal memory free quantity, JVM memory amount, JVM memory usage, average session life cycle, the total sessions of current accessed, the total sessions of current survival, JDBC pool max cap., JDBC Pool mean activity linking number, JDBC Pool on average connects time delay, the linking number that JDBC Pool leaks, the current capacities of JDBC pool, the failed number that JDBC Pool reconnects, the maximum available linking number of JDBC Pool, the maximum unavailable linking number of JDBC Pool, JDBC Pool LEAKED linking number, available linking number in JDBC Pool, unavailable linking number in JDBC POOL and JDBC Pool utilization factor.
Operation system index comprise online user number, day login user number, service system running state, operation system Interface status and operation system health run duration.
S02: by the dynamic and static monitor control index generating network fault tree of network topology architecture and equipment, and generate primitive rule storehouse by network failure tree.By the structure of fault tree can be short and sweet each monitor control index of expression and each network equipment between relation.Wherein, the dependent thresholds in primitive rule storehouse is by determining with execution threshold value planning algorithm the machine learning of historical data.
For primitive rule, devise three domain structures of rule, as shown in Figure 2, comprise sequence of rules territory, regular identification field Sum fanction main body territory.
Sequence of rules territory be used for number of times that storage rule runs succeeded in the operational process of reality, perform failed number of times, rule and finally count and rule compositor.The object that sequence of rules territory exists is for the ease of sorting to the priority of rule, improves the recall precision of rule.
Rule identification field is used for identifying the subordinate object of this rule, and such as rule is the exclusive rule of certain network equipment, or rule is subordinated to certain subnet or whole network.The object that rule identification field exists is to identify every rule, when network topology structure changes, the rule of deletion and amendment can be needed by the identification field identification of rule, and the additions and deletions carrying out implementation rule by regenerating corresponding primitive rule to the topological structure of variation part change, intelligence structure adapts to the rule base of the new network architecture.
Regulatory body territory stores the main part of rule, and this is the detailed description to rule.Rule is exactly production rule, refers to a kind of fixed logic structural relation in people's thinking judgement.The structure of general production can be expressed as natural language form, in fact, in natural language expressing, people's widely used various " reason--result ", " condition-conclusion ", " prerequisite-operation ", " true-progress ", structures such as " situation-behaviors ", all can be summed up as the knowledge representation form of production.The citation form of rule: A → B or IF A THENB, A are the prerequisites (former piece) of production, for point out this production whether can condition.B is one group of conclusion or operation (consequent), when the condition for pointing out indicated by the current A of carrying meets, and the operation that the conclusion that should draw maybe should perform.The inference mode of production rule reasoning has forward reasoning, backward inference and bidirection reasoning three kinds.Three kinds of inference modes have corresponding advantage under different situation, consider when rule-based reasoning way selection.
S03: perform association rules mining algorithm to the historical data of infosystem, obtain correlation rule storehouse, correlation rule is generated by association rule mining, and by rule that historical data is checked.
Preferably, the Apriori algorithm based on the improvement of branch's screening and optimizing strategy and database single sweep operation technology is adopted to carry out the excavation of historical data correlation rule.Apriori algorithm is a kind of frequent item set algorithm of Mining Association Rules, and algorithm is divided into two stages: find frequent item set and by frequent item set mining correlation rule.Algorithm principle finds from data centralization the frequent item set meeting minimum support, and then produce correlation rule according to frequent item set.Apriori algorithm is a very classical association rules mining algorithm, but there are two drawbacks, produces a lot of Candidate Set, waste a large amount of counting yield and time, and need Multiple-Scan database, have a strong impact on efficiency of algorithm in the rally of searching frequent episode.For first problem, adopt Hash table and position container to filter Candidate Set, reduce algorithm and producing the consumption on Candidate Set.Because the main consumption of classic algorithm is in the generation of C1, L1, C2, L2, in the generation of C2, filter more branch, greatly can improve efficiency of algorithm.For Second Problem, classic algorithm calculates support at every turn and all needs to scan whole database, and the frequency calculating support in algorithm is very high, and this just needs frequent scan database, causes efficiency of algorithm not high.So by safeguarding that a Boolean matrix carrys out transaction informations all in database of record, only need run-down database just can build Boolean matrix, this Boolean matrix contains all data calculating support and need, just do not need again scan database later, substantially increase efficiency of algorithm.
By the Apriori algorithm improved, can carry out association rule mining to historical data, the result obtained, under the cooperation of threshold value planning algorithm, intelligence can generate correlation rule storehouse.Correlation rule excavates out from historical data, and have passed the inspection of historical data, Reliability ratio is higher, but correlation rule still exists some uncertainties, just must can upgrade to primitive rule by the inspection of service data.
Dependent thresholds in correlation rule storehouse is by determining with execution threshold value planning algorithm the machine learning of historical data.
Primitive rule storehouse and correlation rule storehouse are in the determination of threshold value, utility historical data, analysis meter calculates the alarm threshold of applicable service operation Alerting requirements, improve the alarm self-learning capability for infosystem, optimize alarm logic, dynamic conditioning alarm threshold, accomplishes to reduce volume of event from the source of event, improves the quality of monitoring alarm.
Preferably, the threshold value planning algorithm of certain index is:
Statistical study is carried out to the historical data of index under network normal operational condition, determines its maximal value, minimum value and median, then carry out definite threshold as follows:
In formula, T
ifor threshold value, D
ifor the index maximal value under network normal operational condition, X
ifor the index minimum value under network normal operational condition, M
ifor the maximal value of index Design, Z
ifor the index median under network normal operational condition.
After rule base puts into operation, under network normal operational condition, all effective values of this index can participate in calculating in real time, determine the threshold value of this index in real time.The self-adaptation of threshold value on-the-fly modifies the ability that improve threshold value adaptive system, is conducive to the raising of system performance.
S04: carry out reasoning in conjunction with primitive rule storehouse and correlation rule storehouse and generate extension rule storehouse.
Rule is exactly production rule, refers to a kind of fixed logic structural relation in people's thinking judgement.The citation form of rule: A → B or IF A THEN B, A are the prerequisites (former piece) of production, for point out this production whether can condition.B is one group of conclusion or operation (consequent), when the condition for pointing out indicated by the current A of carrying meets, and the operation that the conclusion that should draw maybe should perform.Use primitive rule and correlation rule directly can generate extension rule by rule-based reasoning.Illustrate to there is rule " A → B ", " B → C " and " A if deposited in primitive rule with correlation rule
d ", by rule-based reasoning, three extension rules " B → C " can be obtained, " D → B " " D → C ".
Extension rule by primitive rule and correlation rule reasoning out, inherently there is uncertainty in the reasoning of rule, so it is minimum for expanding Rules control, have to pass through strict checking (comprising the checking of historical data and the checking of service data), just can upgrade to primitive rule.
On the basis of the constructing technology in research information system operation monitoring alarm rule storehouse, from the type of monitoring, data, source, alarm time, alert mode, the aspects such as performance data are set about, by the analysis to monitoring historical data and relevant daily O&M work order fault type, from infosystem peak hours/period, the different time sections such as idle period are set out, the business hours of combining information system and portfolio, the tide bulge and fall of understanding business, utility historical data, analysis meter calculates the alarm threshold of applicable service operation Alerting requirements, improve the alarm self-learning capability for infosystem, dynamic conditioning alarm threshold, accomplish to reduce volume of event from the source of event, improve the quality of monitoring alarm.
Rule base can be divided into three subregions by us, stores dissimilar rule respectively, and such as a district stores primitive rule storehouse, and 2nd district store correlation rule storehouse, three district's Memory Extension rule bases.Wherein, the retrieval priority of each rule base is: > extension rule storehouse, > correlation rule storehouse, primitive rule storehouse.In the retrieving of rule, first retrieve the primitive rule in a district, if do not find corresponding rule, just the correlation rule in Hui Dui bis-district and the extension rule in 3rd district are retrieved.To the correlation rule in 2nd district and the extension rule in 3rd district by carrying out the automatic adjusting and optimizing of rule to the machine learning of historical data, in addition, dependency rule needs could be retained by the reasonablencess check of historical data, otherwise directly removes this rule.
In addition, in each rule base subregion, the priority of rule can be determined by rule compositor algorithm, the priority that rule is retrieved is determined particular by the final counting index of the rule in sequence of rules territory, the rule precedence retrieval that priority is high, the rule that priority is low postpones retrieval, can improve rule search efficiency like this.Wherein, the formula of the final counting of rule is:
F=R-0.5W
In formula, F is final counting, and R is the number of times that rule runs succeeded in actual moving process, and W is that rule performs failed number of times; If carry out machine learning to the failed scene of execution, to dependency rule through optimizing and solving relevant issues, then the failed number of times W of corresponding execution subtracts one.
By the inspection of historical data and service data, can find out in the strictly all rules of rule base which is rational, which is irrational, and the rationality of rule can be determined by the means of quantitative test, such as can carry out the rationality of quantizing rule by the final counting index in sequence of rules territory in three domain structures of rule.After the analysis on its rationality of rule, can intelligence rule is further processed, such as, some rule through checking, meet system requirements; Some regular rationality is general, needs just can be used through machine learning; The rationality of some rule is poor, may be just directly deleted.
Same, by the machine learning of historical data and service data, can constantly promote regular performance, make it higher with system matches, and provide corresponding performance optimization to adjust suggestion.Such as threshold value is not unalterable, can be carried out the self-adaptation real-time learning of rule, improve regular rationality by system operation data.
In the design of rule base, also allow the flowing of rule in rudimentary region to premium area.Rule is from rudimentary region to the flowing of premium area, and first is need the rational checking of rule, and second is need by machine learning, improves constantly the rationality of rule.In the operational process of reality, dynamically automatic adjusting and optimizing is carried out to rule by real-time service data: by rule compositor algorithm, carry out priority to the rule in rule base one district, 2nd district, 3rd district determine and sort, undertaken upgrading or refreshing by the rule in regular flow algorithm Lai Dui bis-district and 3rd district.
Wherein, the regular flow algorithm in correlation rule storehouse is: in running process, as long as rule is once proved to be correct, directly moves to primitive rule storehouse; If this rule has be proved to be mistake twice, then delete this rule.
The regular flow algorithm in extension rule storehouse is as shown in Figure 3: use historical data to verify strictly all rules,
For success ratio in the rule of 80% ~ 100%, after using historical data to carry out machine learning, directly move to primitive rule storehouse;
For success ratio 60% ~ 80% rule, after using historical data to carry out machine learning, if success ratio is greater than 80%, move to primitive rule storehouse, otherwise continue to stay extension rule storehouse, and accept the machine learning of service data, until its success ratio is greater than 80%;
For success ratio in the rule of 50% ~ 60%, use historical data and service data to carry out machine learning, until its success ratio is greater than 80%, move to primitive rule storehouse, otherwise continue to stay extension rule storehouse;
Success ratio is less than to the rule of 50%, directly deletes.
By carrying out priority adjustment in real time to rule, rule base can be allowed to be in optimum state, improving the accuracy of the recall precision Sum fanction of rule, thus improve system performance.The priority adjustment of rule is extremely important, and conventional rule and the higher rule of rationality ought to retrieved beforehand, and the rule be of little use and the lower rule of rationality can postpone retrieval, can improve the recall precision of rule like this, thus improve system performance.
In addition, can also carry out certain operations by manual type, such as system operation maintenance personnel can directly increase and deletion rule, and modifies to well-regulated association attributes.
The invention has the beneficial effects as follows:
One, the compartmentalization structure of rule base: the rule base of the inventive method design has three subregions, and store primitive rule, correlation rule and extension rule respectively, wherein the priority of primitive rule is the highest, and correlation rule takes second place, and the priority of extension rule is minimum.By the subregion of rule base, the priority orders of the priority management determination rule search of rule can be passed through, and low area rule can be upgraded by continuous real-time machine study, implementation rule flowing from low to high.
Two, three domain structures of rule: three domain structures of rule comprise sequence of rules territory, regular identification field Sum fanction main body territory: the prioritization of sequence of rules territory means implementation rule by quantifying; Rule identification field is used for identifying the subordinate object of this rule, so that rule base self-adaptative adjustment when network topology architecture changes; Regulatory body territory stores the main part of rule, and this is the detailed description to rule.
Three, real-time adaptive threshold adjustment: system utility historical data and service data, analysis meter calculates the alarm threshold of applicable service operation Alerting requirements, improve the alarm self-learning capability for infosystem, adopt threshold value planning algorithm dynamic conditioning alarm threshold, accomplish to reduce volume of event from the source of event, improve the quality of monitoring alarm.
Four, the robotization analysis on its rationality of newly-increased rule warehouse-in: newly-increased rule can be generated automatically by system, also can manually add.For newly-increased rule, adopt historical data and real-time running data to carry out rationalization to rule and analyze, determine the availability of rule.
Five, the automatic adjusting and optimizing of rule: by real-time executing rule sort algorithm Sum fanction flow algorithms, carry out priority to rule to determine and the refreshing of priority or upgrading, guarantee that rule base is in optimum state, improve the accuracy of the recall precision Sum fanction of rule, thus improve system performance.
The present invention realizes infosystem operation rule storehouse dynamic construction and optimization, can be applicable to company information O&M comprehensive supervision platform, make the foundation of monitoring alarm rule easier with maintenance, rule matching efficiency is higher, thus adapt to rapidly the various changes in infosystem object, running environment, running state data source, meet extensive INFORMATION SYSTEM PRECEPTS collection matching treatment requirement of real-time simultaneously, greatly improve the practicality of algorithm, promote information system monitoring warning, safety management, behavior auditing and conjunction rule management quality.
These are only the preferred embodiments of the present invention; not thereby the scope of the claims of the present invention is limited; every utilize instructions of the present invention and accompanying drawing content to do equivalent structure or equivalent flow process conversion; or be directly or indirectly used in the technical field that other are relevant, be all in like manner included in scope of patent protection of the present invention.