CN117614805A - Data processing system for monitoring state of data center - Google Patents
Data processing system for monitoring state of data center Download PDFInfo
- Publication number
- CN117614805A CN117614805A CN202311568218.2A CN202311568218A CN117614805A CN 117614805 A CN117614805 A CN 117614805A CN 202311568218 A CN202311568218 A CN 202311568218A CN 117614805 A CN117614805 A CN 117614805A
- Authority
- CN
- China
- Prior art keywords
- data center
- time
- monitoring end
- request
- state monitoring
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000012544 monitoring process Methods 0.000 title claims abstract description 131
- 238000012545 processing Methods 0.000 title claims abstract description 21
- 238000004891 communication Methods 0.000 claims abstract description 86
- 238000000034 method Methods 0.000 claims abstract description 6
- 230000001360 synchronised effect Effects 0.000 claims description 22
- 238000012790 confirmation Methods 0.000 claims description 9
- 238000004590 computer program Methods 0.000 claims description 7
- 230000005540 biological transmission Effects 0.000 claims description 4
- 238000012795 verification Methods 0.000 claims description 3
- 230000002159 abnormal effect Effects 0.000 description 4
- 230000009977 dual effect Effects 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 230000005856 abnormality Effects 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 239000013307 optical fiber Substances 0.000 description 2
- 238000003672 processing method Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0631—Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0654—Management of faults, events, alarms or notifications using network fault recovery
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P90/00—Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
- Y02P90/02—Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]
Abstract
The invention provides a data processing system for monitoring the state of a data center, which relates to the field of data processing, and comprises: the method comprises the following steps of: acquiring a current communication state between the first data center and the second data center in a current time period, and keeping monitoring the current communication state when the current communication state is a non-fault state; when the current communication state is a fault state, acquiring a fault type of the current communication state, and determining state monitoring of a first data center and a second data center based on the fault type; and through the first equipment state monitoring end and the second equipment state monitoring end, the normal monitoring of the first data center and the second data center is ensured, and further the normal service provision is ensured.
Description
Technical Field
The present invention relates to the field of data processing, and in particular, to a data processing system for monitoring a status of a data center.
Background
Today, where information systems are more and more important, many business systems have built a co-city dual-activity environment, and arbitration needs to be performed on important business systems when dual-activity is abnormal. In the prior art, a distributed arbitration service is required to be constructed in a third party machine room and used as an arbitration data center; when any data center has optical fiber link fault or data center overall fault, the arbitration systems of the two data centers respectively initiate arbitration requests to the arbitration clusters of the third parties, the arbitration winning party continues to provide service, the losing party stops service, and the data center with high priority has arbitration priority. However, when the third party arbitrates to fail simultaneously with the fiber link, this results in the entire dual active cluster to fail entirely, thereby terminating the external service.
Disclosure of Invention
Aiming at the technical problems, the invention adopts the following technical scheme: a data processing system for monitoring a status of a data center, the system comprising: the system comprises a first data center, a second data center, a first equipment state monitoring end, a second equipment state monitoring end, a processor and a memory storing a computer program;
the first data center is in communication connection with the second data center; the first data center and the second data center are in communication connection with the first equipment state monitoring end in a first communication mode; the first data center and the second data center are in communication connection with the second equipment state monitoring end in a second communication mode; and the first communication mode is different from the second communication mode;
when the computer program is executed by a processor, the following steps are implemented:
s100, acquiring the current communication state between the first data center and the second data center;
s200, when the current communication state is a fault state, acquiring a fault type of the current communication state, and when the fault type of the current communication state is a disk fault, executing S300; when the fault type of the current communication state is a network fault, executing S500;
s300, simultaneously receiving a request instruction of the first data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the first data center for sending an arbitration request to the second equipment state monitoring end, and simultaneously receiving a request instruction of the second data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the second data center for sending an arbitration request to the second equipment state monitoring end;
s400, receiving an instruction of an arbitration state request sent by the first equipment state monitoring end and an instruction of an arbitration state request sent by the second equipment state monitoring end at the same time so as to realize state monitoring of the first data center and the second data center;
s500, acquiring the fault type of the last fault state between the first data center and the second data center, and executing S300 if the fault type of the last fault state is a disk fault; otherwise, executing S600;
s600, receiving a request instruction from the first data center to send an arbitration request to the second equipment state monitoring end and the second data center to send the request instruction from the arbitration request to the second equipment state monitoring end, and executing S700;
s700, receiving an instruction of an arbitration state request sent by the second equipment state monitoring end so as to realize state monitoring of the first data center and the second data center.
The invention has at least the following beneficial effects: in summary, acquiring a current communication state between the first data center and the second data center in a current time period; when the current communication state is a fault state, acquiring a fault type of the current communication state; when the fault type of the current communication state is a disk fault, simultaneously receiving a request instruction of the first data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the first data center for sending an arbitration request to the second equipment state monitoring end, and receiving a request instruction of the second data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the second data center for sending an arbitration request to the second equipment state monitoring end, and when simultaneously receiving an instruction of the arbitration state request sent by the first equipment state monitoring end and an instruction of the arbitration state request sent by the second equipment state monitoring end, realizing state monitoring of the first data center and the second data center; when the fault type of the current communication state is a network fault, acquiring the fault type of the last fault state between the first data center and the second data center, if the fault type of the last fault state is a disk fault in the fault type, executing a processing method of the disk fault, otherwise, receiving a request instruction of the first data center for sending an arbitration request to the second equipment state monitoring end and a request instruction of the second data center for sending an arbitration request to the second equipment state monitoring end, and receiving an instruction of the arbitration state request sent by the second equipment state monitoring end so as to realize state monitoring of the first data center and the second data center; through the first equipment state monitoring end and the second equipment state monitoring end, when any equipment state monitoring end is in a problem, the other equipment state monitoring end continues to work, and different processing schemes are adopted under the condition of disk faults and network faults, so that the normal monitoring of the first data center and the second data center is ensured, and further the normal service provision is ensured.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required for the description of the embodiments will be briefly described below, and it is apparent that the drawings in the following description are only some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flow chart of a data processing system for monitoring the status of a data center according to an embodiment of the present invention when executing a computer program.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to fall within the scope of the invention.
An embodiment of the present invention provides a data processing system for monitoring a status of a data center, the system including: the system comprises a first data center, a second data center, a first equipment state monitoring end, a second equipment state monitoring end, a processor and a memory storing a computer program.
The first data center is in communication connection with the second data center; the first data center and the second data center are in communication connection with the first equipment state monitoring end in a first communication mode; the first data center and the second data center are in communication connection with the second equipment state monitoring end in a second communication mode; and the first communication mode is different from the second communication mode.
Specifically, the communication connection between the first data center and the second data center is that the first data center and the second data center are in communication connection through optical fibers.
Specifically, the first communication connection mode is ethernet communication connection, and the second communication connection mode is 5G network communication connection.
Specifically, the first device state monitoring end may be a third party arbitration service unit or a cloud arbitration service unit; the second equipment state monitoring end can be a third party arbitration service unit or a cloud arbitration service unit.
When the computer program is executed by a processor, as shown in fig. 1, the following steps are implemented:
s100, acquiring the current communication state between the first data center and the second data center in the current time period.
S200, when the current communication state is a fault state, acquiring a fault type of the current communication state, and when the fault type of the current communication state is a disk fault, executing S300; and when the fault type of the current communication state is a network fault, executing S500.
Specifically, the current communication state is a fault state, and the current communication state is that a disk abnormality and a network abnormality occur.
Specifically, the disk failure is abnormal in the disk dropping time of the data packet of the first data center and the second data center; the network anomaly is a network module anomaly in the first data center and the second data center or a communication connection anomaly between the first data center and the second data center.
S300, simultaneously receiving a request instruction of the first data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the first data center for sending an arbitration request to the second equipment state monitoring end, and simultaneously receiving a request instruction of the second data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the second data center for sending an arbitration request to the second equipment state monitoring end.
It can be understood that when a disk failure occurs, the first data center sends an arbitration request to the first device state monitoring end and the second device state monitoring end at the same time, and the second data center sends an arbitration request to the first device state monitoring end and the second device state monitoring end at the same time.
S400, receiving an instruction of the arbitration state request sent by the first equipment state monitoring end and an instruction of the arbitration state request sent by the second equipment state monitoring end at the same time so as to realize state monitoring of the first data center and the second data center.
Specifically, the instruction of the arbitration state request is used for judging whether the first data center continues to provide service or the second data center continues to provide service.
Specifically, the first equipment state monitoring end and the second equipment state monitoring end determine an instruction of an arbitration state request based on priorities of a first data center and a second data center; it can be understood that, when the first equipment state monitoring end and the second equipment state monitoring end arbitrate, the data center with high priority continues to provide service; since the instructions of the arbitration state request are determined based on the priorities of the first data center and the second data center, the instructions of the arbitration state request issued by the first device state monitoring side and the second device state monitoring side are identical.
S500, acquiring the fault type of the last fault state between the first data center and the second data center, and executing S300 if the fault type of the last fault state is a disk fault; otherwise, S600 is performed.
Specifically, when the current communication state is a network failure of a failure state and the historical communication state is a disk failure in a failure type, the connection between the first data center and the first equipment state monitoring end as well as the connection between the second data center and the second equipment state monitoring end are considered to be in a normal working state, so that S400 is executed; when the current communication state is a network fault of a fault state, it is not clear that the network port of the network module of the first equipment center or the second equipment center is abnormal or the communication connection between the first data center and the second data center is abnormal, so that the 5G network is preferentially used for arbitration.
S600, receiving a request instruction from the first data center to send an arbitration request to the second equipment state monitoring end and the second data center to send the request instruction from the arbitration request to the second equipment state monitoring end, and executing S700.
S700, receiving an instruction of an arbitration state request sent by the second equipment state monitoring end so as to realize state monitoring of the first data center and the second data center.
In summary, acquiring a current communication state between the first data center and the second data center in a current time period; when the current communication state is a non-fault state, monitoring the current communication state is maintained; when the current communication state is a fault state, acquiring a fault type of the current communication state; when the fault type of the current communication state is a disk fault, simultaneously receiving a request instruction of the first data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the first data center for sending an arbitration request to the second equipment state monitoring end, and receiving a request instruction of the second data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the second data center for sending an arbitration request to the second equipment state monitoring end, and when simultaneously receiving an instruction of the arbitration state request sent by the first equipment state monitoring end and an instruction of the arbitration state request sent by the second equipment state monitoring end, realizing state monitoring of the first data center and the second data center; when the fault type of the current communication state is a network fault, acquiring the fault type of the last fault state between the first data center and the second data center, if the fault type of the last fault state is a disk fault in the fault type, executing a processing method of the disk fault, otherwise, receiving a request instruction of the first data center for sending an arbitration request to the second equipment state monitoring end and a request instruction of the second data center for sending an arbitration request to the second equipment state monitoring end, and receiving an instruction of the arbitration state request sent by the second equipment state monitoring end so as to realize state monitoring of the first data center and the second data center; through the first equipment state monitoring end and the second equipment state monitoring end, when any equipment state monitoring end is in a problem, the other equipment state monitoring end continues to work, and different processing schemes are adopted under the condition of disk faults and network faults, so that the normal monitoring of the first data center and the second data center is ensured, and further the normal service provision is ensured.
Further, after S700, the method further includes:
s710, receiving a request instruction from the first data center to send an arbitration request to the first equipment state monitoring end and sending a request instruction from the second data center to send an arbitration request to the first equipment state monitoring end.
S720, when the instruction of the arbitration state request sent by the first equipment state monitoring end is not received, receiving a request instruction of the first data center for sending the arbitration request to the first equipment state monitoring end every a preset verification time period in a future time period, and sending the request instruction of the arbitration request to the first equipment state monitoring end by the second data center.
Specifically, the preset verification period may be determined according to actual requirements. It can be understood that when a network fault occurs, an instruction for sending an arbitration state request to the second device state monitoring end is sent preferentially, and then an instruction for sending an arbitration state request to the first device state monitoring end is used for verifying whether the first device state monitoring end can work normally.
Specifically, the method further comprises the step of determining the current communication state through the following steps:
s101, clock information of a first data center and clock information of a second data center are acquired.
Specifically, those skilled in the art know that any method for obtaining clock information in the prior art belongs to the protection scope of the present invention, and will not be described herein.
S102, acquiring clock information of a first data center and time error delta T of standard time 1 And acquiring clock information of the second data center and time error DeltaT of standard time 2 Thereby obtaining a time error Δt between the first data center and the second data center.
Specifically, deltaT 1 As absolute value of time difference between clock information of first data center and standard time, deltaT 2 The delta T is the absolute value of the time difference between the clock information of the first data center and the standard time, and the delta T is the absolute value of the clock information of the first data center and the clock information of the second data center.
S103, when DeltaT is larger than a first time difference threshold, calibrating the clock of the first data center and the clock of the second data center.
S104, when the DeltaT is less than or equal to a first time difference threshold, acquiring synchronous data packet information between the first data center and the second data center, so as to determine that the current communication state is a non-fault state according to the synchronous data packet information between the first data center and the second data center.
Based on S101 to S104, when the time error of the clock information of the first data center and the clock information of the second data center is not greater than the first time difference threshold, synchronous data packet information between the first data center and the second data center is acquired, and the current communication state is determined based on the synchronous packet information between the first data center and the second data center.
Further, in S104, the following steps are further included:
s1041, obtaining a main data center; wherein, if DeltaT 1 >△T 2 Taking the second data center as a main data center; otherwise, the first data center is used as a main data center; the backup data center is the other data center of the first data center and the second data center other than the main data center.
S1042, obtaining n first presets of the standby data centerInterval T 1 Synchronous data packet transmission time list set A= { A in the inner 1 ,A 2 ,…,A j ,…,A n J-th synchronous data packet transmitting time list A j ={A j1 ,A j2 ,……,A ji ,……,A jm },A ji Is the transmission time of the ith synchronous data packet time in the jth first preset time period, the value range of j is 1 to n, the value range of i is 1 to m, and m is the first time period T 1 The number of sync packets collected internally.
Specifically T 1 The following conditions are satisfied: t (T) 1 =Floor(T 0 ×n 0 /int(△T/T 0 ) X delta T, floor () is an upward rounding function, int () is a rounding function, n 0 Is T 1 Number of transmitted data packets, T 0 Is the time at which the data packet is transmitted.
S1043, obtaining the n first preset time periods T of the main data center 1 Synchronous data packet receiving time set B= { B in the frame 1 ,B 2 ,…,B j ,…,B n },B j ={B j1 ,B j2 ,……,B ji ,……,B jm },B ji Is the reception time of the ith synchronization packet time within the jth first preset time period.
Specifically, n first preset time periods of the main data center are the same as n first preset time periods of the standby data center.
S1044, obtaining a time difference set t= { t 1 ,t 2 ,…,t j ,…,t n J-th time difference list t j ={t j1 ,t j2 ,…,t ji ,…,t jm And t is }, where ji =B ji -A ji -△T。
S1045, according to the time difference set t, obtaining a time difference mean Deltat.
Specifically, Δt=1/n×1/m Σ n j=1 ∑ m i=1 t ji 。
S1046, when Deltat > the second time difference threshold, determining that the current communication state is a fault state.
S1047, when the delta t is less than or equal to the second time difference threshold, determining that the current communication state is a non-fault state.
In summary, a main data center is acquired, and standby data centers are acquired in n first preset time periods T 1 The method comprises the steps of obtaining a synchronous data packet sending time set in a main data center in n first preset time periods T 1 And receiving the time set of the synchronous data packet, acquiring a time difference set, acquiring a time difference mean value based on the time difference set, and determining the current communication state based on the time difference mean value.
Specifically, in S200, the fault type of the current communication state is also determined by:
s210, acquiring n first preset time periods T of the main data center 1 Ca=1/n×Σof the average reply acknowledgement time of (a) n i=1 C ji ;C ji =B ji -B0 ji -G,B0 ji The response confirmation time from the receiving of the ith synchronous data packet by the main data center to the standby data center in the jth first preset time period is the response confirmation time, and G is the preset response confirmation time.
S220, when CA > the third time threshold, determining the fault type of the current communication state as disk fault.
S230, when the CA is less than or equal to a third time threshold, determining that the fault type of the current communication state is network fault.
In summary, acquiring average reply confirmation time of the main data center in n first preset time periods, and considering that the main data center is a disk fault when the average reply time is greater than a third time threshold; otherwise, the network is considered to be faulty.
Further, the invention also comprises:
s2101, obtaining the backup data center and average reply confirmation time cb=1/n×Σof the backup data center in n first preset time periods n i=1 E ji ,E ji =F ji -F0 ji -G,F ji Is the time of the ith synchronous data packet received by the data center in the jth first preset time periodBetween F0 ji The standby data center receives the time of replying and confirming the ith synchronous data packet to the main data center in the jth first preset time period.
S2102, obtaining a first weight factor W corresponding to a main data center c1 First weight factor W corresponding to standby data center c2 ,W c1 =1-(CA/(CA+CB)),W c2 =1-(CB/CA+CB)。
Further, the invention also comprises:
s001, obtaining a second weight factor W corresponding to the main data center t1 Second weight factor W corresponding to standby data center t2 ,W t1 =1-△T 1 /(△T 1 +△T 2 ),W t2 =1-△T 2 /(△T 1 +△T 2 )。
Still further, the present invention further comprises:
s002, obtain da=1/nx1/mx Σ n i=1 (∑ m j=1 H ji );H ji =B ji -B0 ji 。
S003, obtaining db=1/nx1/mx Σ n i=1 (∑ m j=1 I ji ),I ji =F ji -F0 ji 。
S004, obtaining a third weight factor W corresponding to the main data center d1 Third weight factor W corresponding to standby data center d2 ,W d1 =1-(DA/(DA+DB)),W d2 =1-(DB/DA+DB)。
Further, the priorities of the first data center and the second data center are determined by:
s005, obtaining the weight factor W of the main data center 1 And weight factor W of backup data center 2 ,W 1 =W t1 +W c1 +W d1 ,
W 2 =W t2 +W c2 +W d2 。
S006, based on W 1 And W is 2 Determining priorities of a first data center and a second data center, the first data centerThe priorities of a data center and a second data center are used for determining an instruction of an arbitration state request sent by the first equipment state monitoring end and an instruction of an arbitration state request sent by the second equipment state monitoring end.
In summary, a first weight factor corresponding to the main data center and a first weight factor corresponding to the standby data center are obtained, a second weight factor corresponding to the main data center and a second weight factor corresponding to the standby data center are obtained, a third weight factor corresponding to the main data center and a third weight factor corresponding to the standby data center are obtained, and thus the weight factors of the main data center and the standby data center are obtained, and priorities of the first data center and the second data center are obtained, so that the data center with more accurate clock information and faster landing speed continues to provide service.
While certain specific embodiments of the invention have been described in detail by way of example, it will be appreciated by those skilled in the art that the above examples are for illustration only and are not intended to limit the scope of the invention. Those skilled in the art will also appreciate that many modifications may be made to the embodiments without departing from the scope and spirit of the invention. The scope of the invention is defined by the appended claims.
Claims (10)
1. A data processing system for monitoring the status of a data center, the system comprising: the system comprises a first data center, a second data center, a first equipment state monitoring end, a second equipment state monitoring end, a processor and a memory storing a computer program;
the first data center is in communication connection with the second data center; the first data center and the second data center are in communication connection with the first equipment state monitoring end in a first communication mode; the first data center and the second data center are in communication connection with the second equipment state monitoring end in a second communication mode; and the first communication mode is different from the second communication mode;
when the computer program is executed by a processor, the following steps are implemented:
s100, acquiring the current communication state between the first data center and the second data center;
s200, when the current communication state is a fault state, acquiring a fault type of the current communication state, and when the fault type of the current communication state is a disk fault, executing S300; when the fault type of the current communication state is a network fault, executing S500;
s300, simultaneously receiving a request instruction of the first data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the first data center for sending an arbitration request to the second equipment state monitoring end, and simultaneously receiving a request instruction of the second data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the second data center for sending an arbitration request to the second equipment state monitoring end;
s400, receiving an instruction of an arbitration state request sent by the first equipment state monitoring end and an instruction of an arbitration state request sent by the second equipment state monitoring end at the same time so as to realize state monitoring of the first data center and the second data center;
s500, acquiring the fault type of the last fault state between the first data center and the second data center, and executing S300 if the fault type of the last fault state is a disk fault; otherwise, executing S600;
s600, receiving a request instruction from the first data center to send an arbitration request to the second equipment state monitoring end and the second data center to send the request instruction from the arbitration request to the second equipment state monitoring end, and executing S700;
s700, receiving an instruction of an arbitration state request sent by the second equipment state monitoring end so as to realize state monitoring of the first data center and the second data center.
2. The data processing system for monitoring a status of a data center of claim 1, further comprising, after S700:
s710, receiving a request instruction from the first data center to send an arbitration request to the first equipment state monitoring end and the second data center to send an arbitration request to the first equipment state monitoring end;
s720, when the instruction of the arbitration state request sent by the first equipment state monitoring end is not received, receiving a request instruction of the first data center for sending the arbitration request to the first equipment state monitoring end every a preset verification time period in a future time period, and sending the request instruction of the arbitration request to the first equipment state monitoring end by the second data center.
3. The data processing system for monitoring the status of a data center of claim 1, further comprising determining the current communication status by:
s101, acquiring clock information of a first data center and clock information of a second data center;
s102, acquiring clock information of a first data center and time error delta T of standard time 1 And acquiring clock information of the second data center and time error DeltaT of standard time 2 Thereby obtaining a time error Δt between the first data center and the second data center;
s103, when DeltaT is larger than a first time difference threshold, calibrating the clock of the first data center and the clock of the second data center;
s104, when the DeltaT is less than or equal to a first time difference threshold, acquiring data packet information between the first data center and the second data center, so as to determine the current communication state according to the data packet information between the first data center and the second data center.
4. A data processing system for monitoring the status of a data center as claimed in claim 3, further comprising the step of, in S104:
s1041, acquiring a main data center and a standby data center; wherein, if DeltaT 1 >△T 2 Taking the second data center as a main data center; otherwise, the first data center is used as a main data center; the standby data center is the other data center except the main data center in the first data center and the second data center;
s1042, acquiring n first preset time periods T of the standby data center 1 Synchronous data packet transmission time list set A= { A in the inner 1 ,A 2 ,…,A j ,…,A n J-th synchronous data packet transmitting time list A j ={A j1 ,A j2 ,……,A ji ,……,A jm },A ji Is the transmission time of the ith synchronous data packet time in the jth first preset time period, the value range of j is 1 to n, the value range of i is 1 to m, and m is the first time period T 1 The number of internally acquired synchronous data packets;
s1043, obtaining the n first preset time periods T of the main data center 1 Synchronous data packet receiving time set B= { B in the frame 1 ,B 2 ,…,B j ,…,B n },B j ={B j1 ,B j2 ,……,B ji ,……,B jm },B ji Is the receiving time of the ith synchronous data packet time in the jth first preset time period;
s1044, obtaining a time difference set t= { t 1 ,t 2 ,…,t j ,…,t n },t j ={t j1 ,t j2 ,…,t ji ,…,t jm And t is }, where ji =B ji -A ji -△T;
S1045, obtaining a time difference mean Deltat according to the time difference set t;
s1046, when Deltat is more than a second time difference threshold, determining that the current communication state is a fault state;
s1047, when the delta t is less than or equal to the second time difference threshold, determining that the current communication state is a non-fault state.
5. The data processing system for monitoring a status of a data center of claim 4, wherein the type of failure of the current communication status is further determined in S200 by:
s210, acquiring average reply confirmation time CA=1/n×Σof the main data center in n first preset time periods n i=1 C ji ;C ji =B ji -B0 ji -G,B0 ji The method comprises the steps that the main data center receives the reply confirmation time from the ith synchronous data packet to the standby data center in the jth first preset time period, and G is preset reply confirmation time;
s220, when CA is more than a third time threshold, determining that the fault type of the current communication state is a disk fault;
s230, when the CA is less than or equal to a third time threshold, determining that the fault type of the current communication state is network fault.
6. The data processing system for monitoring the status of a data center of claim 5, further comprising:
s2101, obtaining the backup data center and average reply confirmation time cb=1/n×Σof the backup data center in n first preset time periods n i=1 E ji ,E ji =F ji -F0 ji -G,F ji Is the time of the ith synchronous data packet received by the standby data center in the jth first preset time period, F0 ji The standby data center receives the time of replying and confirming the ith synchronous data packet to the main data center in the jth first preset time period;
s2102, obtaining a first weight factor W corresponding to a main data center c1 First weight factor W corresponding to standby data center c2 ,W c1 =1-(CA/(CA+CB)),W c2 =1-(CB/CA+CB)。
7. The data processing system for monitoring the status of a data center of claim 6, further comprising:
s001, obtaining a second weight factor W corresponding to the main data center t1 Second weight factor corresponding to standby data centerSon W t2 ,W t1 =1-△T 1 /(△T 1 +△T 2 ),W t2 =1-△T 2 /(△T 1 +△T 2 )。
8. The data processing system for monitoring the status of a data center of claim 6, further comprising:
s002, obtain da=1/nx1/mx Σ n i=1 (∑ m j=1 H ji );H ji =B ji -B0 ji ;
S003, obtaining db=1/nx1/mx Σ n i=1 (∑ m j=1 I ji ),I ji =F ji -F0 ji ;
S004, obtaining a third weight factor W corresponding to the main data center d1 Third weight factor W corresponding to standby data center d2 ,W d1 =1-(DA/(DA+DB)),W d2 =1-(DB/DA+DB)。
9. The data processing system for monitoring the status of a data center of claim 8, wherein the priorities of the first data center and the second data center are determined by:
s005, obtaining the weight factor W of the main data center 1 And weight factor W of backup data center 2 ,W 1 =W t1 +W c1 +W d1 ,
W 2 =W t2 +W c2 +W d2 ;
S006, based on W 1 And W is 2 And determining the priorities of a first data center and a second data center, wherein the priorities of the first data center and the second data center are used for determining an instruction of an arbitration state request sent by the first equipment state monitoring end and an instruction of an arbitration state request sent by the second equipment state monitoring end.
10. The data processing system for monitoring the status of a data center of claim 1, wherein the first communication connection is an ethernet communication connection and the second communication connection is a 5G network communication connection.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311568218.2A CN117614805A (en) | 2023-11-21 | 2023-11-21 | Data processing system for monitoring state of data center |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311568218.2A CN117614805A (en) | 2023-11-21 | 2023-11-21 | Data processing system for monitoring state of data center |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117614805A true CN117614805A (en) | 2024-02-27 |
Family
ID=89949054
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311568218.2A Pending CN117614805A (en) | 2023-11-21 | 2023-11-21 | Data processing system for monitoring state of data center |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117614805A (en) |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060168192A1 (en) * | 2004-11-08 | 2006-07-27 | Cisco Technology, Inc. | High availability for intelligent applications in storage networks |
CN105893176A (en) * | 2016-03-28 | 2016-08-24 | 杭州宏杉科技有限公司 | Management method and device of network storage system |
CN106170948A (en) * | 2015-07-30 | 2016-11-30 | 华为技术有限公司 | A kind of referee method for dual-active data center, Apparatus and system |
CN106909307A (en) * | 2015-12-22 | 2017-06-30 | 华为技术有限公司 | A kind of method and device for managing dual-active storage array |
CN107147528A (en) * | 2017-05-23 | 2017-09-08 | 郑州云海信息技术有限公司 | One kind stores gateway intelligently anti-fissure system and method |
US20180260125A1 (en) * | 2017-03-10 | 2018-09-13 | Pure Storage, Inc. | Synchronously replicating datasets and other managed objects to cloud-based storage systems |
US20200112499A1 (en) * | 2018-10-07 | 2020-04-09 | Hewlett Packard Enterprise Development Lp | Multiple quorum witness |
CN116668269A (en) * | 2022-02-21 | 2023-08-29 | 华为云计算技术有限公司 | Arbitration method, device and system for dual-activity data center |
-
2023
- 2023-11-21 CN CN202311568218.2A patent/CN117614805A/en active Pending
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060168192A1 (en) * | 2004-11-08 | 2006-07-27 | Cisco Technology, Inc. | High availability for intelligent applications in storage networks |
CN106170948A (en) * | 2015-07-30 | 2016-11-30 | 华为技术有限公司 | A kind of referee method for dual-active data center, Apparatus and system |
CN106909307A (en) * | 2015-12-22 | 2017-06-30 | 华为技术有限公司 | A kind of method and device for managing dual-active storage array |
CN105893176A (en) * | 2016-03-28 | 2016-08-24 | 杭州宏杉科技有限公司 | Management method and device of network storage system |
US20180260125A1 (en) * | 2017-03-10 | 2018-09-13 | Pure Storage, Inc. | Synchronously replicating datasets and other managed objects to cloud-based storage systems |
CN107147528A (en) * | 2017-05-23 | 2017-09-08 | 郑州云海信息技术有限公司 | One kind stores gateway intelligently anti-fissure system and method |
US20200112499A1 (en) * | 2018-10-07 | 2020-04-09 | Hewlett Packard Enterprise Development Lp | Multiple quorum witness |
CN116668269A (en) * | 2022-02-21 | 2023-08-29 | 华为云计算技术有限公司 | Arbitration method, device and system for dual-activity data center |
Non-Patent Citations (2)
Title |
---|
柏玉锋;: "基于阵列双活功能的本地存储双活部署方案", 信息通信, no. 01, 15 January 2017 (2017-01-15) * |
魏雪梅;: "双活数据中心解决方案", 甘肃科技纵横, no. 06, 25 June 2018 (2018-06-25) * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107295080B (en) | Data storage method applied to distributed server cluster and server | |
EP0416943B1 (en) | Method for controlling failover between redundant network interface modules | |
EP2362585B1 (en) | Information processor and network control system | |
US6385665B1 (en) | System and method for managing faults in a data transmission system | |
US20100154016A1 (en) | Multi-channel ranging for a cable modem | |
CN111355600B (en) | Main node determining method and device | |
CN112583683B (en) | Master-slave CAN FD bus application layer communication method and system and electronic equipment | |
US8521869B2 (en) | Method and system for reporting defects within a network | |
KR20100020253A (en) | Monitoring apparatus for message transmission in network for a vehicle | |
CN113261220B (en) | Method and control node for handling failure of TSN communication link | |
CN114844809A (en) | Multi-factor arbitration method and device based on network heartbeat and kernel disk heartbeat | |
CN117614805A (en) | Data processing system for monitoring state of data center | |
CN108667640B (en) | Communication method and device, and network access system | |
US11874786B2 (en) | Automatic switching system and method for front end processor | |
CN115378841B (en) | Method and device for detecting state of equipment accessing cloud platform, storage medium and terminal | |
CN113850033B (en) | Redundancy system, redundancy management method and readable storage medium | |
CN114884767B (en) | Synchronous dual-redundancy CAN bus communication system, method, equipment and medium | |
CN115842760A (en) | Fault detection method and device, electronic equipment and storage medium | |
CN114356810B (en) | Communication connection method, device, equipment and medium for host and storage system | |
JP2007166446A (en) | Pon system, its abnormality determining method, and station side device | |
US9118540B2 (en) | Method for monitoring a plurality of rack systems | |
US8566634B2 (en) | Method and system for masking defects within a network | |
CN112034774A (en) | Hot redundancy control method | |
CN116795290A (en) | Data synchronization method, baseboard management controller, electronic device and storage medium | |
JP7417773B1 (en) | Network interface card and transmission performance monitoring method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |