CN109947614A - Multi-computer-room dependency monitoring method, apparatus, device, and computer-readable storage medium - Google Patents

Info

Publication number
CN109947614A
Authority
CN
China
Prior art keywords
service
room
computer room
service invocation
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811436481.5A
Other languages
Chinese (zh)
Inventor
窦文生
王月凡
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advanced New Technologies Co Ltd
Advantageous New Technologies Co Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd
Priority to CN201811436481.5A
Publication of CN109947614A
Legal status: Pending

Abstract

Embodiments of the present disclosure provide a multi-computer-room dependency monitoring method, apparatus, device, and computer-readable storage medium. The method includes: collecting multiple logs, and extracting service invocation information from the multiple logs for a specific business identifier; selecting cross-computer-room service invocation information from the service invocation information; determining the business link to which the business represented by the specific business identifier belongs; and associating, in real time, the selected cross-computer-room service invocation information with that business link to determine whether the business link is a business link with a multi-computer-room dependency. Multi-log association analysis can thus be performed in real time, which solves the problem that single-log analysis cannot support multi-computer-room dependency monitoring. In addition, online real-time association processing of multiple log data solves the problem that offline log analysis schemes cannot analyze multi-computer-room dependencies in real time, and provides a real-time data basis for computer room disaster recovery decisions.

Description

Multi-computer-room dependency monitoring method, apparatus, device, and computer-readable storage medium
Technical field
Embodiments of the present disclosure relate to the field of computer technology, and in particular to a multi-computer-room dependency monitoring method, apparatus, device, and computer-readable storage medium.
Background
In the related art, each computer room, or computer room unit, can be understood as a deployment unit that provides the application functions of an entire site. Much as a production line can provide all the processing functions for its product, a computer room unit needs to provide full-link processing capability for its services without depending on other computer room units. Because the traffic that each computer room unit can carry has an upper limit, each computer room unit of an important application needs a backup unit to cope with the enterprise-level service risk caused by a computer room failure.
With the rapid growth of demand for some large enterprise-level applications, deployment in a single computer room can no longer meet growing business traffic demand because of limits on bandwidth and electric power. Multi-computer-room deployment has become an increasingly mature and popular enterprise-level application deployment scheme.
For important applications, computer room disaster recovery has always been a critically important and highly challenging topic. Computer room disaster recovery means that when any computer room unit suffers a disaster beyond its control (a network outage, a power failure, and so on), traffic can be switched automatically to a backup computer room to cope with the corresponding failure risk.
Because multi-computer-room deployment cannot be achieved in a single step, multi-computer-room dependencies (dependencies between computer rooms) often arise during deployment. These dependencies prevent computer room switching from proceeding smoothly, so real-time computer room disaster recovery cannot be achieved. It is therefore necessary to perform early-warning analysis of computer room dependencies so that, while existing dependencies are eliminated, new dependencies introduced by application release changes are also guarded against, ensuring that all computer rooms are always in a disaster-recoverable state.
In one scheme of the related art, computer room dependencies can be determined through real-time statistical analysis of a single log. A single-log real-time statistics scheme, however, can only aggregate and compare data within the time window of a single log. For the multi-computer-room dependency problem, single-log real-time analysis cannot meet the analysis requirement.
In another scheme of the related art, computer room dependencies can be determined through offline data association analysis. Offline data association analysis can associate multiple logs, but it is not real-time: it cannot report the real-time state of computer room dependencies and therefore cannot meet the real-time monitoring requirement.
Summary of the invention
In view of this, a first aspect of the present disclosure provides a multi-computer-room dependency monitoring method, including:
collecting multiple logs, and extracting service invocation information from the multiple logs for a specific business identifier;
selecting cross-computer-room service invocation information from the service invocation information;
determining the business link to which the business represented by the specific business identifier belongs; and
associating, in real time, the selected cross-computer-room service invocation information with the business link to which the business represented by the specific business identifier belongs, to determine whether the business link is a business link with a multi-computer-room dependency.
A second aspect of the present disclosure provides a multi-computer-room dependency monitoring apparatus, including:
a collection and extraction module, configured to collect multiple logs and to extract service invocation information from the multiple logs for a specific business identifier;
a selection module, configured to select cross-computer-room service invocation information from the service invocation information;
a determining module, configured to determine the business link to which the business represented by the specific business identifier belongs; and
a real-time association module, configured to associate, in real time, the selected cross-computer-room service invocation information with the business link to which the business represented by the specific business identifier belongs, to determine whether the business link is a business link with a multi-computer-room dependency.
A third aspect of the present disclosure provides an electronic device, including a memory and a processor, where the memory is configured to store one or more computer instructions, and the one or more computer instructions are executed by the processor to implement the following steps:
collecting multiple logs, and extracting service invocation information from the multiple logs for a specific business identifier;
selecting cross-computer-room service invocation information from the service invocation information;
determining the business link to which the business represented by the specific business identifier belongs; and
associating, in real time, the selected cross-computer-room service invocation information with the business link to which the business represented by the specific business identifier belongs, to determine whether the business link is a business link with a multi-computer-room dependency.
A fourth aspect of the present disclosure provides a computer-readable storage medium storing computer instructions that, when executed by a processor, implement the method according to the first aspect.
In the embodiments of the present disclosure, multiple logs are collected, and service invocation information is extracted from the multiple logs for a specific business identifier; cross-computer-room service invocation information is selected from the service invocation information; the business link to which the business represented by the specific business identifier belongs is determined; and the selected cross-computer-room service invocation information is associated in real time with that business link to determine whether the business link is a business link with a multi-computer-room dependency. Multi-log association analysis can thus be performed in real time, which solves the problem that single-log analysis cannot support multi-computer-room dependency monitoring. In addition, online real-time association processing of multiple log data solves the problem that offline log analysis schemes cannot analyze multi-computer-room dependencies in real time, and provides a real-time data basis for computer room disaster recovery decisions.
These and other aspects of the present disclosure will be more readily apparent from the following description.
Brief description of the drawings
To describe the technical solutions in the embodiments of the present disclosure or in the related art more clearly, the drawings needed for describing the exemplary embodiments or the related art are briefly introduced below. Evidently, the drawings described below show some exemplary embodiments of the present disclosure, and a person of ordinary skill in the art may derive other drawings from them without creative effort.
Fig. 1 shows a flowchart of a multi-computer-room dependency monitoring method according to an embodiment of the present disclosure;
Fig. 2 shows an exemplary flowchart of step S101 of the multi-computer-room dependency monitoring method according to an embodiment of the present disclosure;
Fig. 3 shows an exemplary flowchart of step S104 of the multi-computer-room dependency monitoring method according to an embodiment of the present disclosure;
Fig. 4 shows a flowchart of a multi-computer-room dependency monitoring method according to another embodiment of the present disclosure;
Fig. 5 shows a schematic diagram of an example application scenario of the multi-computer-room dependency monitoring method according to an embodiment of the present disclosure;
Fig. 6 shows a schematic diagram of another example application scenario of the multi-computer-room dependency monitoring method according to an embodiment of the present disclosure;
Fig. 7 shows a structural block diagram of a multi-computer-room dependency monitoring apparatus according to another embodiment of the present disclosure;
Fig. 8 shows a structural block diagram of an electronic device according to an embodiment of the present disclosure;
Fig. 9 is a schematic structural diagram of a computer system suitable for implementing the multi-computer-room dependency monitoring method according to an embodiment of the present disclosure.
Detailed description
To enable those skilled in the art to better understand the solutions of the present disclosure, the technical solutions in the exemplary embodiments of the present disclosure are described clearly and completely below with reference to the drawings in those embodiments.
Some of the processes described in the specification, the claims, and the above drawings include multiple operations that appear in a particular order. It should be clearly understood, however, that these operations may be performed out of the order in which they appear herein, or in parallel. Operation numbers such as 101 and 102 serve only to distinguish the operations from one another and do not themselves represent any execution order. In addition, these processes may include more or fewer operations, and the operations may be performed sequentially or in parallel. It should be noted that terms such as "first" and "second" herein are used to distinguish different messages, devices, modules, and the like; they do not represent a sequence, nor do they require that the "first" and the "second" be of different types.
Evidently, the exemplary embodiments described are only some, rather than all, of the embodiments of the present disclosure. All other embodiments obtained by those skilled in the art based on the embodiments of the present disclosure without creative effort fall within the protection scope of the present disclosure.
Fig. 1 shows a flowchart of a multi-computer-room dependency monitoring method according to an embodiment of the present disclosure. The method may include steps S101, S102, S103, and S104.
In step S101, multiple logs are collected, and service invocation information is extracted from the multiple logs for a specific business identifier.
In step S102, cross-computer-room service invocation information is selected from the service invocation information.
In step S103, the business link to which the business represented by the specific business identifier belongs is determined.
In step S104, the selected cross-computer-room service invocation information is associated in real time with the business link to which the business represented by the specific business identifier belongs, to determine whether the business link is a business link with a multi-computer-room dependency.
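The four steps can be sketched end to end as follows. This is a minimal illustration only, not the implementation of the disclosure: the `Call` record layout, the function names, and the business link names `payment` and `refund` are all assumptions introduced for the example, and the logs are modeled as in-memory lists rather than live log streams.

```python
from typing import NamedTuple, Optional


class Call(NamedTuple):
    trace_id: str        # specific business identifier (e.g. a traceid)
    caller_room: str     # computer room of the calling side
    provider_room: str   # computer room of the providing side


def extract_calls(logs, trace_id):
    """S101: gather the calls recorded for one business identifier across all logs."""
    return [call for log in logs for call in log if call.trace_id == trace_id]


def select_cross_room(calls):
    """S102: keep only calls whose caller and provider rooms differ."""
    return [call for call in calls if call.caller_room != call.provider_room]


def find_business_link(trace_to_link, trace_id) -> Optional[str]:
    """S103: look up the business link the identifier belongs to."""
    return trace_to_link.get(trace_id)


def monitor(logs, trace_to_link, trace_id):
    """S104: associate the cross-room calls with the business link."""
    cross_room = select_cross_room(extract_calls(logs, trace_id))
    link = find_business_link(trace_to_link, trace_id)
    return link, bool(cross_room)


# Hypothetical sample data: two logs and a traceid-to-link mapping.
logs = [
    [Call("traceid_a", "et15", "em14"), Call("traceid_b", "et14", "et14")],
    [Call("traceid_a", "et15", "et15")],
]
links = {"traceid_a": "payment", "traceid_b": "refund"}
```

Calling `monitor(logs, links, "traceid_a")` reports the `payment` link as dependent on multiple computer rooms, because one of its calls goes from et15 to em14.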
In one embodiment of the present disclosure, the multiple logs include a remote procedure call log, a cache system access log, and a database system access log. The remote procedure call log records the remote procedure calls that a business makes to services, for example, rpc_digest_log. The cache system access log records the accesses that a business makes to the cache system, for example, tair_digest_log. The database system access log records the accesses that a business makes to the database system, for example, db_digest_log. Those skilled in the art will understand that the logs in the embodiments of the present disclosure are not limited to the remote procedure call log, the cache system access log, and the database system access log, and may include other logs; moreover, the naming of the logs is not limited to the foregoing examples.
In one embodiment of the present disclosure, the service invocation information is remote procedure call information. Remote procedure call (RPC) is the technique a business uses to invoke a service. In one embodiment of the present disclosure, access to the cache system and the database system is also implemented through a service invocation technique such as remote procedure call. Service invocation information such as remote procedure call information can therefore be extracted from the logs of these systems.
In one embodiment of the present disclosure, the specific business identifier is the unique identifier of a particular business. For example, the business identifier traceid uniquely identifies a business and is globally unique within one business process (for example, one transaction). One business identifier corresponds to multiple service invocations, for example, rpc calls.
In one embodiment of the present disclosure, the service invocation information may include service invocation information that does not cross computer rooms and service invocation information that does. In keeping with the purpose of the embodiments of the present disclosure, the cross-computer-room service invocation information can be selected from the service invocation information. For example, the selection is performed using rpc_digest_log as the data source log.
For example, for the specific business identifier traceid_a, the following remote procedure call (rpc) information is extracted from the data source log rpc_digest_log:
rpc_1: <traceid_a, et15, em14>
rpc_2: <traceid_a, et15, et14>
rpc_3: <traceid_a, et15, et15>
rpc_4: <traceid_a, et14, et14>
rpc_5: <traceid_a, et15, et15>
rpc_6: <traceid_a, em14, em14>
rpc_7: <traceid_a, et14, et15>
As the remote procedure call information above shows, a remote procedure call may occur within a single computer room. For rpc_3, for example, the call occurs in computer room et15: for the specific business identifier traceid_a, both the caller and the provider of the service in rpc_3 are in computer room et15. A remote procedure call may also occur across different computer rooms, that is, across multiple computer rooms. For rpc_1, for example, the call spans computer rooms et15 and em14: for the specific business identifier traceid_a, the caller of the service in rpc_1 is in computer room et15 and the provider of the service is in computer room em14.
In one embodiment of the present disclosure, selecting cross-computer-room service invocation information from the service invocation information means filtering the logs to pick out the rpc call information that crosses computer rooms. For example, the following cross-computer-room remote procedure call (rpc) information can be selected from the remote procedure call information above:
rpc_1: <traceid_a, et15, em14>
rpc_2: <traceid_a, et15, et14>
rpc_7: <traceid_a, et14, et15>
That is, the three remote procedure calls above are cross-computer-room remote calls, and the service invocation information corresponding to them is the cross-computer-room service invocation information.
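The selection can be reproduced on the seven sample records with a short sketch. The dictionary layout is an assumption made for illustration; only the record values come from the example above, with each tuple read as (business identifier, caller's computer room, provider's computer room).

```python
# The seven sample rpc records: (traceid, caller room, provider room).
calls = {
    "rpc_1": ("traceid_a", "et15", "em14"),
    "rpc_2": ("traceid_a", "et15", "et14"),
    "rpc_3": ("traceid_a", "et15", "et15"),
    "rpc_4": ("traceid_a", "et14", "et14"),
    "rpc_5": ("traceid_a", "et15", "et15"),
    "rpc_6": ("traceid_a", "em14", "em14"),
    "rpc_7": ("traceid_a", "et14", "et15"),
}

# A call crosses computer rooms when the caller and provider rooms differ.
cross_room = {
    name: record
    for name, record in calls.items()
    if record[1] != record[2]
}
```

Here `cross_room` retains exactly rpc_1, rpc_2, and rpc_7.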
In one embodiment of the present disclosure, step S102 includes: aggregating the selected cross-computer-room service invocation information to obtain aggregate information that contains all the computer room information in the cross-computer-room service invocation information. The aggregate information includes information on all the computer rooms involved in cross-computer-room calls during service invocation. For example, the following aggregate information can be obtained from the three pieces of cross-computer-room service invocation information above:
<traceid_a,et14,et15,em14>
That is, for the specific business identifier traceid_a, there are service invocations that span computer rooms et14, et15, and em14.
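One way to compute such aggregate information is to collect every computer room that appears in a cross-computer-room call into a set. The sketch below assumes the tuple layout (business identifier, caller room, provider room); the function name `aggregate_rooms` is hypothetical, and a set is used because the text does not specify an ordering of the computer rooms.

```python
def aggregate_rooms(trace_id, cross_room_calls):
    """Collect all computer rooms involved in cross-room calls for one identifier."""
    rooms = set()
    for call_trace_id, caller_room, provider_room in cross_room_calls:
        if call_trace_id == trace_id:
            rooms.update((caller_room, provider_room))
    return trace_id, rooms


cross_room_calls = [
    ("traceid_a", "et15", "em14"),  # rpc_1
    ("traceid_a", "et15", "et14"),  # rpc_2
    ("traceid_a", "et14", "et15"),  # rpc_7
]
aggregate = aggregate_rooms("traceid_a", cross_room_calls)
```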
In one embodiment of the present disclosure, determining the business link to which the business represented by the specific business identifier belongs lets an administrator know which business link (for example, the business link of a specific application) is involved in the cross-computer-room service invocations for the specific business identifier.
In one embodiment of the present disclosure, the selected cross-computer-room service invocation information is associated in real time with the business link to which the business represented by the specific business identifier belongs, to determine whether the business link is a business link with a multi-computer-room dependency. When the selected cross-computer-room service invocation information has been aggregated to obtain aggregate information containing all the computer room information in it, the aggregate information can be associated in real time with the business link to which the business represented by the specific business identifier belongs, to determine whether the business link is a business link with a multi-computer-room dependency. In this case, it can be determined which computer rooms (for example, computer rooms et14, et15, and em14) the service invocations on the business link span. This helps indicate that the service invocations of the business are not closed within a single computer room and need to be reworked to support computer room disaster recovery.
In the embodiments of the present disclosure, multiple logs are collected, and service invocation information is extracted from the multiple logs for a specific business identifier; cross-computer-room service invocation information is selected from the service invocation information; the business link to which the business represented by the specific business identifier belongs is determined; and the selected cross-computer-room service invocation information is associated in real time with that business link to determine whether the business link is a business link with a multi-computer-room dependency. Multi-log association analysis can thus be performed in real time, which solves the problem that single-log analysis cannot support multi-computer-room dependency monitoring. In addition, online real-time association processing of multiple log data solves the problem that offline log analysis schemes cannot analyze multi-computer-room dependencies in real time, and provides a real-time data basis for computer room disaster recovery decisions.
Step S101 of the multi-computer-room dependency monitoring method according to an embodiment of the present disclosure is further described below with reference to Fig. 2.
Fig. 2 shows an exemplary flowchart of step S101 of the multi-computer-room dependency monitoring method according to an embodiment of the present disclosure. As shown in Fig. 2, step S101 includes steps S201 and S202.
In step S201, information on the computer room where each access occurs and a service invocation process identifier, both corresponding to the specific business identifier, are extracted from the multiple logs.
In step S202, for the specific business identifier, the correspondence between each service invocation process and the computer room in which it occurs is determined according to the order of the service invocation process identifiers.
In one embodiment according to the present disclosure, for a specific business identifier there may be accesses to the cache system and accesses to the database system. These accesses can be "chained together" by service invocation process identifiers (for example, a remote procedure call identifier such as rpc_1). That is, accesses to the cache system and to the database system are also implemented through service invocations such as remote procedure calls. Therefore, for the specific business identifier, the correspondence between each service invocation process and the computer room in which it occurs can be determined according to the order of the service invocation process identifiers.
In one embodiment according to the present disclosure, step S102 includes: selecting, from the correspondence between the service invocation processes and the computer rooms in which they occur, the service invocation processes that occur across computer rooms. That is, the service invocation information contains the correspondence between each service invocation process and the computer room in which it occurs, so the service invocation processes that occur across computer rooms can be selected.
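Steps S201 and S202 can be illustrated with a sketch that chains records from several logs into caller and provider pairs. The text does not specify the format of the service invocation process identifier, so the sketch makes a strong assumption: identifiers are hierarchical dotted strings (such as "0", "0.1", "0.2") in which the prefix before the last dot names the upstream call. The record fields and the function names are likewise hypothetical.

```python
def rpc_key(rpc_id):
    """Sort key for dotted identifiers, so that "0.2" sorts before "0.10"."""
    return tuple(int(part) for part in rpc_id.split("."))


def link_calls(records, trace_id):
    """Pair each access with its upstream call to get (rpc_id, caller room, provider room)."""
    # S201: keep the room and call identifier recorded for this business identifier.
    mine = sorted(
        (r for r in records if r["trace_id"] == trace_id),
        key=lambda r: rpc_key(r["rpc_id"]),
    )
    room_of = {r["rpc_id"]: r["room"] for r in mine}

    # S202: walk the identifiers in order and map each call to the rooms it connects.
    linked = []
    for r in mine:
        parent_id = r["rpc_id"].rpartition(".")[0]  # "" for a root call
        if parent_id in room_of:
            linked.append((r["rpc_id"], room_of[parent_id], r["room"]))
    return linked


# Hypothetical records merged from the rpc, cache (tair), and database logs.
records = [
    {"source": "db", "trace_id": "traceid_a", "rpc_id": "0.2", "room": "em14"},
    {"source": "rpc", "trace_id": "traceid_a", "rpc_id": "0", "room": "et15"},
    {"source": "tair", "trace_id": "traceid_a", "rpc_id": "0.1", "room": "et15"},
    {"source": "rpc", "trace_id": "traceid_b", "rpc_id": "0", "room": "et14"},
]
linked = link_calls(records, "traceid_a")
```

With these records, the database access "0.2" is paired with its upstream call "0" and shows up as a call from et15 to em14, which a later filter would select as cross-computer-room.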
Step S104 of the multi-computer-room dependency monitoring method according to an embodiment of the present disclosure is further described below with reference to Fig. 3.
Fig. 3 shows an exemplary flowchart of step S104 of the multi-computer-room dependency monitoring method according to an embodiment of the present disclosure. As shown in Fig. 3, step S104 includes steps S301 and S302.
In step S301, the selected cross-computer-room service invocation information is associated in real time with the business link to which the business represented by the specific business identifier belongs, to obtain the cross-computer-room service invocation information of the business link.
In step S302, whether the business link is a business link with a multi-computer-room dependency is determined according to the cross-computer-room service invocation information of the business link.
In one embodiment according to the present disclosure, the selected cross-computer-room service invocation information is associated in real time with the business link by means of the specific business identifier, to obtain the cross-computer-room service invocation information of the business link. This makes it possible to know in real time whether the business link of a given application currently involves cross-computer-room calls, that is, to determine from the cross-computer-room service invocation information of the business link whether the business link is a business link with a multi-computer-room dependency. This solves the problem that offline log analysis schemes cannot analyze multi-computer-room dependencies in real time, and provides a real-time data basis for computer room disaster recovery decisions.
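The real-time association in steps S301 and S302 resembles a streaming join keyed by the business identifier: events from the two sides may arrive in either order, and a result is produced once both sides are known. The sketch below is one possible shape for such a join, under the assumption that events are delivered one at a time to handler methods; the class name and method names are hypothetical, and state eviction (which a long-running job would need) is omitted.

```python
class TraceJoiner:
    """Joins cross-room call events with business link events on the traceid."""

    def __init__(self):
        self.cross_rooms = {}  # traceid -> set of rooms seen in cross-room calls
        self.link_of = {}      # traceid -> business link name

    def on_cross_room_call(self, trace_id, caller_room, provider_room):
        """Handle one selected cross-room call event."""
        self.cross_rooms.setdefault(trace_id, set()).update((caller_room, provider_room))
        return self._emit(trace_id)

    def on_link(self, trace_id, link):
        """Handle one event that tells which business link a traceid belongs to."""
        self.link_of[trace_id] = link
        return self._emit(trace_id)

    def _emit(self, trace_id):
        # S301: a joined record exists only when both sides have arrived.
        if trace_id in self.cross_rooms and trace_id in self.link_of:
            # S302: any cross-room call marks the link as multi-room dependent.
            return self.link_of[trace_id], frozenset(self.cross_rooms[trace_id])
        return None


joiner = TraceJoiner()
first = joiner.on_cross_room_call("traceid_a", "et15", "em14")  # link not yet known
second = joiner.on_link("traceid_a", "payment")                 # join completes
```

In a production setting this role would typically be played by a stream-processing framework; the in-memory dictionaries here only illustrate the join semantics.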
A multi-computer-room dependency monitoring method according to another embodiment of the present disclosure is further described below with reference to Fig. 4.
Fig. 4 shows a flowchart of a multi-computer-room dependency monitoring method according to another embodiment of the present disclosure. The embodiment shown in Fig. 4 differs from the embodiment shown in Fig. 1 in that it further includes step S401.
In step S401, in response to a determination result that the business link is a business link with a multi-computer-room dependency, a multi-computer-room dependency early-warning signal is issued.
In one embodiment according to the present disclosure, based on the monitoring of the multi-computer-room dependencies of a business link implemented in the foregoing embodiments, a multi-computer-room dependency early warning can be issued according to the monitoring result. The administrator of the application to which the business link belongs can thus receive the early warning and rework the application accordingly to support computer room disaster recovery.
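Step S401 can be sketched as a small function that turns a determination result into a warning. The text does not say how the early-warning signal is delivered, so the use of Python's standard `logging` module, the logger name, and the message wording are all assumptions made for illustration.

```python
import logging

logger = logging.getLogger("room_dependency")


def warn_if_dependent(link, cross_rooms):
    """Issue an early warning when a business link spans multiple computer rooms."""
    if not cross_rooms:
        return None  # no cross-room calls observed, nothing to report
    rooms = ", ".join(sorted(cross_rooms))
    message = f"business link {link!r} depends on multiple computer rooms: {rooms}"
    logger.warning(message)
    return message
```

The returned message could equally be routed to a paging or ticketing system so that the application's administrator can plan the rework.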
An example application scenario of the multi-computer-room dependency monitoring method according to an embodiment of the present disclosure is described below with reference to Fig. 5.
Fig. 5 shows a schematic diagram of an example application scenario of the multi-computer-room dependency monitoring method according to an embodiment of the present disclosure.
As shown in Fig. 5, the time-window data of each single log among online log 1 through online log N can be analyzed statistically, and single-computer-room control can then be performed based on the statistical results. Single-computer-room control includes control of single-computer-room traffic, single-computer-room performance, single-computer-room errors, and so on. In addition, by implementing the association logic across online log 1 through online log N in real time, the real-time monitoring requirement of multi-computer-room association control can be met, along with the related change early-warning function. Multi-computer-room association control includes computer room dependency monitoring and change impact monitoring.
Fig. 6 shows a schematic diagram of another example application scenario of the multi-computer-room dependency monitoring method according to an embodiment of the present disclosure.
As shown in Fig. 6, the multiple logs include rpc_digest_log, tair_digest_log, and db_digest_log. For the specific business identifier traceid, information on the computer room where each access occurs and the remote procedure call identifier rpcid of the call that makes the access can be extracted from the multiple logs. Chaining the accesses together by their rpc upstream and downstream relationships yields the service invocation information for the specific business identifier traceid, from which the cross-computer-room service invocation information can be selected. In addition, the business link of a specific application (for example, Apple payment or guaranteed transaction) can be determined for the specific business identifier traceid, that is, the information on the business link to which the traceid belongs is determined.
The selected cross-computer-room service invocation information can be associated in real time with the business link to which the business represented by the specific business identifier belongs. In this case, the traceid for which the service invocation information is extracted from the logs (a.traceid) and the traceid for which the business link is determined (b.traceid) are the same traceid. After the real-time association, the application's rpc cross-computer-room information, tair cross-computer-room information, and db cross-computer-room information can be obtained, and the application can be reworked based on the cross-computer-room information so determined. It should be noted that as long as any one kind of cross-computer-room information exists, it can be determined that the business link has a multi-computer-room dependency.
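The rule that any one kind of cross-computer-room information is enough can be expressed as a short check over the per-source results of the join. The function name, the dictionary layout, and the sample values are assumptions made for illustration; the source keys `rpc`, `tair`, and `db` follow the three logs named above.

```python
def summarize_link(link, cross_room_by_source):
    """Decide whether a business link is multi-room dependent from per-source results.

    cross_room_by_source maps a log source ("rpc", "tair", "db") to the list of
    cross-room calls found for that source after joining on a.traceid = b.traceid.
    """
    dependent_sources = sorted(
        source for source, calls in cross_room_by_source.items() if calls
    )
    return {
        "link": link,
        "dependent_sources": dependent_sources,
        # Any single source with cross-room calls is enough.
        "multi_room_dependent": bool(dependent_sources),
    }


summary = summarize_link(
    "payment",
    {"rpc": [], "tair": [], "db": [("traceid_a", "et15", "em14")]},
)
```

In this sample only the database accesses cross computer rooms, yet the link is still reported as dependent, which points the rework at the database access path.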
Fig. 7 shows a structural block diagram of a multi-computer-room dependency monitoring apparatus according to another embodiment of the present disclosure. The apparatus may include a collection and extraction module 701, a selection module 702, a determining module 703, and a real-time association module 704.
The collection and extraction module 701 is configured to collect multiple logs and to extract service invocation information from the multiple logs for a specific business identifier.
The selection module 702 is configured to select cross-computer-room service invocation information from the service invocation information.
The determining module 703 is configured to determine the business link to which the business represented by the specific business identifier belongs.
The real-time association module 704 is configured to associate, in real time, the selected cross-computer-room service invocation information with the business link to which the business represented by the specific business identifier belongs, to determine whether the business link is a business link with a multi-computer-room dependency.
The internal functions and structure of the multi-computer-room dependency monitoring apparatus are described above.
In a possible design, the structure of the multi-computer-room dependency monitoring system may be implemented as a multi-computer-room dependency monitoring device. As shown in Fig. 8, the device may include a processor 801 and a memory 802.
The memory 802 is used to store a program that supports the multi-computer-room dependency monitoring system in executing the multi-computer-room dependency monitoring method of any of the above embodiments, and the processor 801 is configured to execute the program stored in the memory 802.
The memory 802 is used to store one or more computer instructions, wherein the one or more computer instructions are executed by the processor 801 to implement the following steps:
acquiring multiple logs, and extracting service invocation information from the multiple logs for a specific business identifier;
screening out cross-computer-room service invocation information from the service invocation information;
determining the service link to which the business represented by the specific business identifier belongs;
associating, in real time, the screened-out cross-computer-room service invocation information with the service link to which the business represented by the specific business identifier belongs, so as to determine whether the service link is a service link with a multi-computer-room dependency.
In one embodiment of the present disclosure, the multiple logs include a remote procedure call log, a cache system access log, and a database system access log. The remote procedure call log records the remote procedure calls that a business makes to services, for example rpc_digest_log. The cache system access log records the accesses that a business makes to the cache system, for example tair_digest_log. The database system access log records the accesses that a business makes to the database system, for example db_digest_log. Those skilled in the art will understand that the logs in the embodiments of the present disclosure are not limited to these three logs and may include other logs, and that the naming of the logs is not limited to the foregoing examples.
In one embodiment of the present disclosure, the service invocation information is remote procedure call information. Remote procedure call is the technique a business uses when it calls a service. In one embodiment of the present disclosure, accesses to the cache system and the database system are also carried out through a service call technique such as remote procedure call. Service invocation information such as remote procedure call information can therefore be extracted from the logs of these systems.
In one embodiment of the present disclosure, the specific business identifier is the unique identifier of a particular business. For example, the business identifier traceid is the unique identifier of a business and is globally unique. Within one business process (for example, one transaction), the business identifier corresponds to multiple service calls, for example rpc calls.
In one embodiment of the present disclosure, the service invocation information may include service invocation information that does not cross computer rooms and service invocation information that does. For the purposes of the embodiments of the present disclosure, the cross-computer-room service invocation information can be screened out from the service invocation information. For example, rpc_digest_log is used as the data source log for this screening.
For example, for the specific business identifier traceid_a, the remote procedure call (rpc) information extracted from the rpc_digest_log data source log is as follows:
rpc_1: <traceid_a, et15, em14>
rpc_2: <traceid_a, et15, et14>
rpc_3: <traceid_a, et15, et15>
rpc_4: <traceid_a, et14, et14>
rpc_5: <traceid_a, et15, et15>
rpc_6: <traceid_a, em14, em14>
rpc_7: <traceid_a, et14, et15>
As the above remote procedure call information shows, a remote procedure call can take place within a single computer room. For rpc_3, for example, the call takes place in computer room et15: for the specific business identifier traceid_a, both the caller and the provider of the service in rpc_3 are in computer room et15. A remote procedure call can also span different computer rooms, that is, multiple computer rooms. For rpc_1, for example, the call spans computer rooms et15 and em14: for the specific business identifier traceid_a, the caller of the service in rpc_1 is in computer room et15 and the provider of the service is in computer room em14.
In one embodiment of the present disclosure, screening out cross-computer-room service invocation information from the service invocation information means filtering the logs to select the rpc call information that crosses computer rooms. For example, the following cross-computer-room remote procedure call (rpc) information can be screened out from the above remote procedure call information:
rpc_1: <traceid_a, et15, em14>
rpc_2: <traceid_a, et15, et14>
rpc_7: <traceid_a, et14, et15>
That is, these 3 remote procedure calls are cross-computer-room remote calls, and the service invocation information corresponding to them is the cross-computer-room service invocation information.
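The screening step can be illustrated with the seven records listed above. The sketch below assumes, following the description of rpc_1 and rpc_3, that the second field is the caller's computer room and the third is the provider's; the `RpcCall` type and function names are hypothetical.

```python
from typing import NamedTuple


class RpcCall(NamedTuple):
    name: str
    traceid: str
    caller_room: str    # computer room of the service caller
    provider_room: str  # computer room of the service provider


CALLS = [
    RpcCall("rpc_1", "traceid_a", "et15", "em14"),
    RpcCall("rpc_2", "traceid_a", "et15", "et14"),
    RpcCall("rpc_3", "traceid_a", "et15", "et15"),
    RpcCall("rpc_4", "traceid_a", "et14", "et14"),
    RpcCall("rpc_5", "traceid_a", "et15", "et15"),
    RpcCall("rpc_6", "traceid_a", "em14", "em14"),
    RpcCall("rpc_7", "traceid_a", "et14", "et15"),
]


def is_cross_room(call: RpcCall) -> bool:
    """A call crosses computer rooms when caller and provider rooms differ."""
    return call.caller_room != call.provider_room


def filter_cross_room(calls):
    """Keep only the calls that cross computer rooms, preserving order."""
    return [call for call in calls if is_cross_room(call)]


cross_room = filter_cross_room(CALLS)
print([call.name for call in cross_room])  # ['rpc_1', 'rpc_2', 'rpc_7']
```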
In one embodiment of the present disclosure, screening out cross-computer-room service invocation information from the service invocation information includes: aggregating the screened-out cross-computer-room service invocation information to obtain aggregation information that includes all computer room information in the cross-computer-room service invocation information. The aggregation information contains every computer room involved in the cross-computer-room calls of the service invocation process. For example, the following aggregation information can be obtained from the 3 items of cross-computer-room service invocation information above:
<traceid_a, et14, et15, em14>
That is, for the specific business identifier traceid_a, there are service calls across computer rooms et14, et15, and em14.
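The aggregation can be sketched as a union of computer rooms per traceid. This is an illustrative assumption about the data layout (plain tuples of traceid, caller room, provider room), not the disclosed implementation.

```python
# The three cross-computer-room records from the example above.
CROSS_ROOM = [
    ("traceid_a", "et15", "em14"),
    ("traceid_a", "et15", "et14"),
    ("traceid_a", "et14", "et15"),
]


def aggregate_rooms(cross_room_records):
    """Merge cross-computer-room records into one room set per traceid."""
    rooms_by_traceid = {}
    for traceid, caller_room, provider_room in cross_room_records:
        rooms = rooms_by_traceid.setdefault(traceid, set())
        rooms.update((caller_room, provider_room))
    return rooms_by_traceid


aggregated = aggregate_rooms(CROSS_ROOM)
print(sorted(aggregated["traceid_a"]))  # ['em14', 'et14', 'et15']
```

A set is used because the aggregation information only records which computer rooms are involved, not how often each one appears.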
In one embodiment of the present disclosure, determining the service link to which the business represented by the specific business identifier belongs lets an administrator know which service link (for example, the service link of a particular application) the cross-computer-room service calls for that identifier involve.
In one embodiment of the present disclosure, the screened-out cross-computer-room service invocation information is associated in real time with the service link to which the business represented by the specific business identifier belongs, so as to determine whether the service link is a service link with a multi-computer-room dependency. When the screened-out cross-computer-room service invocation information has been aggregated into aggregation information that includes all of its computer room information, the aggregation information can be associated in real time with that service link to make the same determination. In this case, it can be determined which computer rooms (for example, et14, et15, and em14) the service calls on the service link span. This helps to flag that the service calls of the business are not closed within a single computer room and need to be reworked to support computer-room disaster recovery.
In the embodiments of the present disclosure, multiple logs are acquired and service invocation information is extracted from them for a specific business identifier; cross-computer-room service invocation information is screened out from the service invocation information; the service link to which the business represented by the specific business identifier belongs is determined; and the screened-out cross-computer-room service invocation information is associated in real time with that service link to determine whether the service link is a service link with a multi-computer-room dependency. Multiple logs can thus be correlated and analyzed in real time, which solves the problem that analysis of a single log cannot support multi-computer-room dependency monitoring. In addition, processing multiple log data streams online with real-time association solves the problem that offline log analysis schemes cannot analyze multi-computer-room dependencies in real time, and provides a real-time data basis for computer-room disaster recovery decisions.
In one embodiment according to the present disclosure, acquiring multiple logs and extracting service invocation information from the multiple logs for a specific business identifier includes:
extracting, from the multiple logs, the information of the computer room in which each access corresponding to the specific business identifier takes place, together with the service invocation process identifier;
for the specific business identifier, determining, according to the order of the service invocation process identifiers, the correspondence between service invocation processes and the computer rooms in which they take place.
In one embodiment according to the present disclosure, for a specific business identifier there may be accesses to the cache system and accesses to the database system. The service invocation process identifier (for example, a remote procedure call identifier such as rpc_1) can be used to "string together" the accesses to the cache system and the accesses to the database system, because these accesses are also carried out through service calls such as remote procedure calls. Therefore, for the specific business identifier, the correspondence between service invocation processes and the computer rooms in which they take place can be determined according to the order of the service invocation process identifiers.
In one embodiment according to the present disclosure, screening out cross-computer-room service invocation information from the service invocation information includes: screening out, from the correspondence between service invocation processes and the computer rooms in which they take place, the service invocation processes that take place across computer rooms. That is, because the service invocation information contains this correspondence, the service invocation processes that cross computer rooms can be screened out.
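One way to picture the correspondence between service invocation processes and computer rooms is sketched below. The entry layout `(log name, traceid, call identifier, room)` and the sample entries are assumptions made for illustration; the disclosure does not specify the digest log format.

```python
from collections import defaultdict

# Hypothetical digest entries gathered from several logs.
ENTRIES = [
    ("rpc_digest_log", "traceid_a", "rpc_2", "et15"),
    ("db_digest_log", "traceid_a", "rpc_2", "et14"),
    ("rpc_digest_log", "traceid_a", "rpc_1", "et15"),
    ("tair_digest_log", "traceid_a", "rpc_1", "em14"),
    ("rpc_digest_log", "traceid_a", "rpc_3", "et15"),
    ("rpc_digest_log", "traceid_b", "rpc_1", "et14"),
]


def call_sequence_number(call_id):
    """Order key: the numeric suffix of an identifier such as 'rpc_2'."""
    return int(call_id.rsplit("_", 1)[1])


def rooms_per_call(entries, traceid):
    """Map each call of one traceid to its rooms, ordered by call identifier."""
    rooms = defaultdict(list)
    for _log, entry_traceid, call_id, room in entries:
        if entry_traceid == traceid and room not in rooms[call_id]:
            rooms[call_id].append(room)
    return {cid: rooms[cid] for cid in sorted(rooms, key=call_sequence_number)}


def cross_room_calls(mapping):
    """Keep the calls whose accesses took place in more than one room."""
    return {cid: rooms for cid, rooms in mapping.items() if len(rooms) > 1}


mapping = rooms_per_call(ENTRIES, "traceid_a")
print(mapping)
print(cross_room_calls(mapping))
```

The call identifier is what ties the cache and database accesses back to the rpc call they belong to, so entries from different logs end up under the same key.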
In one embodiment according to the present disclosure, associating in real time the screened-out cross-computer-room service invocation information with the service link to which the business represented by the specific business identifier belongs, so as to determine whether the service link is a service link with a multi-computer-room dependency, includes:
associating, in real time, the screened-out cross-computer-room service invocation information with the service link to which the business represented by the specific business identifier belongs, to obtain the cross-computer-room service invocation information of the service link;
determining, according to the cross-computer-room service invocation information of the service link, whether the service link is a service link with a multi-computer-room dependency.
In one embodiment according to the present disclosure, the screened-out cross-computer-room service invocation information is associated in real time with the service link by means of the specific business identifier to obtain the cross-computer-room service invocation information of the service link. This makes it possible to know in real time whether the service link of a given application currently involves cross-computer-room calls, that is, to determine from the cross-computer-room service invocation information of the service link whether the link has a multi-computer-room dependency. This solves the problem that offline log analysis schemes cannot analyze multi-computer-room dependencies in real time, and provides a real-time data basis for computer-room disaster recovery decisions.
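The "online" character of the association can be sketched as an event-driven structure that updates its state as each record arrives, in contrast to a batch job over stored logs. The class, its method names, and the buffering of records that arrive before their service link is known are assumptions made for this sketch; the disclosure does not prescribe a particular streaming mechanism.

```python
class RealTimeAssociator:
    """Incrementally associates cross-computer-room calls with service links."""

    def __init__(self):
        self.link_of = {}        # traceid -> service link
        self.pending = {}        # traceid -> rooms seen before the link was known
        self.rooms_of_link = {}  # service link -> rooms involved in cross-room calls

    def on_link(self, traceid, link):
        """Handle a 'traceid belongs to this service link' event."""
        self.link_of[traceid] = link
        buffered = self.pending.pop(traceid, set())
        if buffered:
            self.rooms_of_link.setdefault(link, set()).update(buffered)

    def on_cross_room_call(self, traceid, caller_room, provider_room):
        """Handle one screened-out cross-computer-room call record."""
        rooms = {caller_room, provider_room}
        link = self.link_of.get(traceid)
        if link is None:
            self.pending.setdefault(traceid, set()).update(rooms)
        else:
            self.rooms_of_link.setdefault(link, set()).update(rooms)

    def is_multi_room_dependent(self, link):
        """A link is dependent once any cross-room call is associated with it."""
        return bool(self.rooms_of_link.get(link))


associator = RealTimeAssociator()
associator.on_cross_room_call("traceid_a", "et15", "em14")
before = associator.is_multi_room_dependent("app_a_link")
associator.on_link("traceid_a", "app_a_link")
after = associator.is_multi_room_dependent("app_a_link")
print(before, after)  # False True
```

Buffering in `pending` lets the two event streams arrive in either order, which matters when the cross-computer-room records and the link determination come from different logs.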
In one embodiment according to the present disclosure, the one or more computer instructions are further executed by the processor to implement the following step: in response to a determination result that the service link is a service link with a multi-computer-room dependency, issuing a multi-computer-room dependency early-warning signal.
In one embodiment according to the present disclosure, based on the monitoring of the multi-computer-room dependency of the service link realized in the foregoing embodiments, a multi-computer-room dependency early warning can be issued on the basis of the monitoring results. The administrator of the application to which the service link belongs thus receives the warning and can rework the application accordingly to support computer-room disaster recovery.
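A minimal sketch of the early-warning step follows. The disclosure does not say how the signal is delivered, so the use of Python's standard `logging` module, the logger name, and the message wording are all assumptions.

```python
import logging

# Hypothetical logger name; any alerting channel could be substituted.
logger = logging.getLogger("multi_room_dependency")


def warn_if_dependent(link, rooms):
    """Emit a warning when cross-computer-room rooms are associated with a link.

    Returns the warning text, or None when the link has no dependency.
    """
    if not rooms:
        return None
    message = (
        f"service link {link} depends on multiple computer rooms: "
        + ", ".join(sorted(rooms))
    )
    logger.warning(message)
    return message


print(warn_if_dependent("app_a_link", {"et14", "et15", "em14"}))
print(warn_if_dependent("app_b_link", set()))
```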
The processor 801 is used to execute all or some of the steps of the foregoing method.
The structure of the multi-computer-room dependency monitoring device may further include a communication interface, used for communication between the device and other devices or a communication network.
An exemplary embodiment of the present disclosure also provides a computer storage medium for storing the computer software instructions used by the multi-computer-room dependency monitoring system, which include the program for executing the multi-computer-room dependency monitoring method of any of the above embodiments.
Fig. 9 is a structural schematic diagram of a computer system suitable for implementing the multi-computer-room dependency monitoring method according to one embodiment of the present disclosure.
As shown in Fig. 9, the computer system 900 includes a central processing unit (CPU) 901, which can execute the various processes of the embodiment shown in Fig. 1 according to a program stored in a read-only memory (ROM) 902 or a program loaded from a storage section 908 into a random access memory (RAM) 903. The RAM 903 also stores the various programs and data required for the operation of the system 900. The CPU 901, ROM 902, and RAM 903 are connected to one another by a bus 904. An input/output (I/O) interface 905 is also connected to the bus 904.
The following components are connected to the I/O interface 905: an input section 906 including a keyboard, a mouse, and the like; an output section 907 including a cathode ray tube (CRT) or liquid crystal display (LCD), a speaker, and the like; a storage section 908 including a hard disk and the like; and a communication section 909 including a network interface card such as a LAN card or a modem. The communication section 909 performs communication processing via a network such as the Internet. A drive 910 is also connected to the I/O interface 905 as needed. A removable medium 911, such as a magnetic disk, an optical disc, a magneto-optical disk, or a semiconductor memory, is mounted on the drive 910 as needed, so that a computer program read from it can be installed into the storage section 908 as needed.
In particular, according to embodiments of the present disclosure, the method described above with reference to Fig. 1 may be implemented as a computer software program. For example, embodiments of the present disclosure include a computer program product comprising a computer program tangibly embodied on a machine-readable medium, the computer program containing program code for executing the data processing method of Fig. 1. In such an embodiment, the computer program can be downloaded and installed from a network through the communication section 909, and/or installed from the removable medium 911.
The flowcharts and block diagrams in the drawings illustrate the architecture, functions, and operations of possible implementations of systems, methods, and computer program products according to various embodiments of the present disclosure. In this regard, each block in a flowchart or block diagram may represent a module, a program segment, or a portion of code that contains one or more executable instructions for implementing the specified logical function. It should also be noted that, in some alternative implementations, the functions marked in the blocks may occur in an order different from that marked in the drawings. For example, two blocks shown in succession may in fact be executed substantially in parallel, or sometimes in the reverse order, depending on the functions involved. It should further be noted that each block in the block diagrams and/or flowcharts, and combinations of such blocks, can be implemented by a dedicated hardware-based system that performs the specified functions or operations, or by a combination of dedicated hardware and computer instructions.
The units or modules described in the embodiments of the present disclosure may be implemented in software or in hardware. The described units or modules may also be provided in a processor, and in some cases the names of these units or modules do not constitute a limitation on the units or modules themselves.
In another aspect, the present disclosure also provides a computer-readable storage medium, which may be the computer-readable storage medium included in the apparatus described in the above embodiments, or a computer-readable storage medium that exists separately and is not assembled into a device. The computer-readable storage medium stores one or more programs, which are used by one or more processors to execute the methods described in the present disclosure.
The above description is only a preferred embodiment of the present disclosure and an explanation of the technical principles applied. Those skilled in the art should understand that the scope of the invention involved in the present disclosure is not limited to technical solutions formed by the specific combination of the above technical features. It should also cover other technical solutions formed by any combination of the above technical features or their equivalents without departing from the inventive concept, for example technical solutions formed by replacing the above features with technical features of similar function disclosed in (but not limited to) the present disclosure.

Claims (18)

1. A multi-computer-room dependency monitoring method, characterized by comprising:
acquiring multiple logs, and extracting service invocation information from the multiple logs for a specific business identifier;
screening out cross-computer-room service invocation information from the service invocation information;
determining the service link to which the business represented by the specific business identifier belongs;
associating, in real time, the screened-out cross-computer-room service invocation information with the service link to which the business represented by the specific business identifier belongs, so as to determine whether the service link is a service link with a multi-computer-room dependency.
2. The method according to claim 1, characterized in that acquiring multiple logs and extracting service invocation information from the multiple logs for a specific business identifier comprises:
extracting, from the multiple logs, the information of the computer room in which each access corresponding to the specific business identifier takes place, together with the service invocation process identifier;
for the specific business identifier, determining, according to the order of the service invocation process identifiers, the correspondence between service invocation processes and the computer rooms in which they take place.
3. The method according to claim 2, characterized in that screening out cross-computer-room service invocation information from the service invocation information comprises:
screening out, from the correspondence between service invocation processes and the computer rooms in which they take place, the service invocation processes that take place across computer rooms.
4. The method according to claim 1, characterized in that screening out cross-computer-room service invocation information from the service invocation information comprises:
aggregating the screened-out cross-computer-room service invocation information to obtain aggregation information that includes all computer room information in the cross-computer-room service invocation information;
wherein associating in real time the screened-out cross-computer-room service invocation information with the service link to which the business represented by the specific business identifier belongs, so as to determine whether the service link is a service link with a multi-computer-room dependency, comprises:
associating, in real time, the aggregation information with the service link to which the business represented by the specific business identifier belongs, so as to determine whether the service link is a service link with a multi-computer-room dependency.
5. The method according to claim 1, characterized in that associating in real time the screened-out cross-computer-room service invocation information with the service link to which the business represented by the specific business identifier belongs, so as to determine whether the service link is a service link with a multi-computer-room dependency, comprises:
associating, in real time, the screened-out cross-computer-room service invocation information with the service link to which the business represented by the specific business identifier belongs, to obtain the cross-computer-room service invocation information of the service link;
determining, according to the cross-computer-room service invocation information of the service link, whether the service link is a service link with a multi-computer-room dependency.
6. The method according to claim 1, characterized in that the multiple logs include a remote procedure call log, a cache system access log, and a database system access log.
7. The method according to claim 1, characterized in that the service invocation information is remote procedure call information.
8. The method according to claim 1, characterized by further comprising:
in response to a determination result that the service link is a service link with a multi-computer-room dependency, issuing a multi-computer-room dependency early-warning signal.
9. A multi-computer-room dependency monitoring apparatus, characterized by comprising:
an acquisition and extraction module, configured to acquire multiple logs and to extract service invocation information from the multiple logs for a specific business identifier;
a screening module, configured to screen out cross-computer-room service invocation information from the service invocation information;
a determining module, configured to determine the service link to which the business represented by the specific business identifier belongs;
a real-time association module, configured to associate, in real time, the screened-out cross-computer-room service invocation information with the service link to which the business represented by the specific business identifier belongs, so as to determine whether the service link is a service link with a multi-computer-room dependency.
10. An electronic device, characterized by comprising a memory and a processor, wherein the memory is used to store one or more computer instructions, and the one or more computer instructions are executed by the processor to implement the following steps:
acquiring multiple logs, and extracting service invocation information from the multiple logs for a specific business identifier;
screening out cross-computer-room service invocation information from the service invocation information;
determining the service link to which the business represented by the specific business identifier belongs;
associating, in real time, the screened-out cross-computer-room service invocation information with the service link to which the business represented by the specific business identifier belongs, so as to determine whether the service link is a service link with a multi-computer-room dependency.
11. The electronic device according to claim 10, characterized in that acquiring multiple logs and extracting service invocation information from the multiple logs for a specific business identifier comprises:
extracting, from the multiple logs, the information of the computer room in which each access corresponding to the specific business identifier takes place, together with the service invocation process identifier;
for the specific business identifier, determining, according to the order of the service invocation process identifiers, the correspondence between service invocation processes and the computer rooms in which they take place.
12. The electronic device according to claim 11, characterized in that screening out cross-computer-room service invocation information from the service invocation information comprises:
screening out, from the correspondence between service invocation processes and the computer rooms in which they take place, the service invocation processes that take place across computer rooms.
13. The electronic device according to claim 10, characterized in that screening out cross-computer-room service invocation information from the service invocation information comprises:
aggregating the screened-out cross-computer-room service invocation information to obtain aggregation information that includes all computer room information in the cross-computer-room service invocation information;
wherein associating in real time the screened-out cross-computer-room service invocation information with the service link to which the business represented by the specific business identifier belongs, so as to determine whether the service link is a service link with a multi-computer-room dependency, comprises:
associating, in real time, the aggregation information with the service link to which the business represented by the specific business identifier belongs, so as to determine whether the service link is a service link with a multi-computer-room dependency.
14. The electronic device according to claim 10, characterized in that associating in real time the screened-out cross-computer-room service invocation information with the service link to which the business represented by the specific business identifier belongs, so as to determine whether the service link is a service link with a multi-computer-room dependency, comprises:
associating, in real time, the screened-out cross-computer-room service invocation information with the service link to which the business represented by the specific business identifier belongs, to obtain the cross-computer-room service invocation information of the service link;
determining, according to the cross-computer-room service invocation information of the service link, whether the service link is a service link with a multi-computer-room dependency.
15. The electronic device according to claim 10, characterized in that the multiple logs include a remote procedure call log, a cache system access log, and a database system access log.
16. The electronic device according to claim 10, characterized in that the service invocation information is remote procedure call information.
17. The electronic device according to claim 10, characterized in that the one or more computer instructions are further executed by the processor to implement the following step:
in response to a determination result that the service link is a service link with a multi-computer-room dependency, issuing a multi-computer-room dependency early-warning signal.
18. A computer-readable storage medium having computer instructions stored thereon, characterized in that the computer instructions, when executed by a processor, implement the method according to claim 1.
CN201811436481.5A 2018-11-28 2018-11-28 Multimachine room relies on monitoring method, device, equipment and computer readable storage medium Pending CN109947614A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811436481.5A CN109947614A (en) 2018-11-28 2018-11-28 Multimachine room relies on monitoring method, device, equipment and computer readable storage medium


Publications (1)

Publication Number Publication Date
CN109947614A true CN109947614A (en) 2019-06-28

Family

ID=67005915


Country Status (1)

Country Link
CN (1) CN109947614A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100020680A1 (en) * 2008-07-28 2010-01-28 Salam Samer M Multi-chassis ethernet link aggregation
CN102143008A (en) * 2010-01-29 2011-08-03 国际商业机器公司 Method and device for diagnosing fault event in data center
CN105763382A (en) * 2016-04-14 2016-07-13 北京思特奇信息技术股份有限公司 Realization method and device based on end-to-end service monitoring
CN106970843A (en) * 2016-01-14 2017-07-21 阿里巴巴集团控股有限公司 remote invocation method and device
CN107465767A (en) * 2017-09-29 2017-12-12 网宿科技股份有限公司 A kind of method and system of data syn-chronization
CN108833184A (en) * 2018-06-29 2018-11-16 腾讯科技(深圳)有限公司 Service fault localization method, device, computer equipment and storage medium

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110442641A (en) * 2019-08-06 2019-11-12 中国工商银行股份有限公司 A kind of link topology figure methods of exhibiting, device, storage medium and equipment
CN110780857A (en) * 2019-10-23 2020-02-11 杭州涂鸦信息技术有限公司 Unified log component
CN110780857B (en) * 2019-10-23 2024-01-30 杭州涂鸦信息技术有限公司 Unified log component
CN111158995A (en) * 2019-11-29 2020-05-15 武汉物易云通网络科技有限公司 Method and system for realizing cross-system log tracking query based on skywalk and ELK platform
CN111158995B (en) * 2019-11-29 2020-12-29 武汉物易云通网络科技有限公司 Method and system for realizing cross-system log tracking query based on skywalk and ELK platform
CN115037653A (en) * 2022-06-28 2022-09-09 北京奇艺世纪科技有限公司 Service flow monitoring method and device, electronic equipment and storage medium
CN115037653B (en) * 2022-06-28 2023-10-13 北京奇艺世纪科技有限公司 Service flow monitoring method, device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN109947614A (en) Multimachine room relies on monitoring method, device, equipment and computer readable storage medium
US20190378073A1 (en) Business-Aware Intelligent Incident and Change Management
US8572244B2 (en) Monitoring tool deployment module and method of operation
JP5102901B2 (en) Method and system for maintaining data integrity between multiple data servers across a data center
CN103399781B (en) Cloud Server and virtual machine management method thereof
CN109120428B (en) Method and system for wind control analysis
CN102799519A (en) Automatic test method for cluster file system
US20130036359A1 (en) Monitoring Implementation Module and Method of Operation
CN110489320A (en) Restoring method, device, terminal device and the medium of test data
CN104967532A (en) TOC technology operation and maintenance system and application method
US8984122B2 (en) Monitoring tool auditing module and method of operation
CN113553236B (en) Centralized automatic management system and method for physical machines in data center
CN112787853B (en) Automatic generation method and device of network change scheme and related equipment
CN115577160A (en) Production line data acquisition method, device, equipment and medium
CN108280012A (en) A kind of method and device of monitoring server system process
US8560375B2 (en) Monitoring object system and method of operation
CN108121730A (en) A kind of device and method by data update Fast synchronization to operation system
Lin et al. An analysis of using state of the art technologies to implement real-time continuous assurance
CN109614139A (en) A kind of system service configuration method, device, equipment and medium
Cao et al. IT Operation and Maintenance Process improvement and design under virtualization environment
CN109582666A (en) Data major key generation method, device, electronic equipment and storage medium
CN108874589A (en) A kind of electric power plant stand complex automatic system host and station data unify standby system
CN117714453B (en) Intelligent device management method and system based on Internet of things card
CN113282431B (en) Abnormal data processing method and device, storage medium and electronic equipment
CN112907009B (en) Standardized model construction method and device, storage medium and equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20200921

Address after: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman Islands

Applicant after: Innovative advanced technology Co.,Ltd.

Address before: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman Islands

Applicant before: Advanced innovation technology Co.,Ltd.

Effective date of registration: 20200921

Address after: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman Islands

Applicant after: Advanced innovation technology Co.,Ltd.

Address before: Fourth floor, Capital Building, P.O. Box 847, Grand Cayman, Cayman Islands

Applicant before: Alibaba Group Holding Ltd.

TA01 Transfer of patent application right
RJ01 Rejection of invention patent application after publication

Application publication date: 20190628
