CN117614805A - Data processing system for monitoring state of data center - Google Patents

Data processing system for monitoring state of data center Download PDF

Info

Publication number
CN117614805A
CN117614805A CN202311568218.2A CN202311568218A CN117614805A CN 117614805 A CN117614805 A CN 117614805A CN 202311568218 A CN202311568218 A CN 202311568218A CN 117614805 A CN117614805 A CN 117614805A
Authority
CN
China
Prior art keywords
data center
time
monitoring end
request
state monitoring
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311568218.2A
Other languages
Chinese (zh)
Inventor
陈栋
魏兴华
李春
李建辉
张文件
罗春
吴炎
臧冰凌
王显伟
杨禹航
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Woqu Technology Co ltd
Original Assignee
Hangzhou Woqu Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Woqu Technology Co ltd filed Critical Hangzhou Woqu Technology Co ltd
Priority to CN202311568218.2A priority Critical patent/CN117614805A/en
Publication of CN117614805A publication Critical patent/CN117614805A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0631Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Abstract

The invention provides a data processing system for monitoring the state of a data center, which relates to the field of data processing, and comprises: the method comprises the following steps of: acquiring a current communication state between the first data center and the second data center in a current time period, and keeping monitoring the current communication state when the current communication state is a non-fault state; when the current communication state is a fault state, acquiring a fault type of the current communication state, and determining state monitoring of a first data center and a second data center based on the fault type; and through the first equipment state monitoring end and the second equipment state monitoring end, the normal monitoring of the first data center and the second data center is ensured, and further the normal service provision is ensured.

Description

Data processing system for monitoring state of data center
Technical Field
The present invention relates to the field of data processing, and in particular, to a data processing system for monitoring a status of a data center.
Background
Today, where information systems are more and more important, many business systems have built a co-city dual-activity environment, and arbitration needs to be performed on important business systems when dual-activity is abnormal. In the prior art, a distributed arbitration service is required to be constructed in a third party machine room and used as an arbitration data center; when any data center has optical fiber link fault or data center overall fault, the arbitration systems of the two data centers respectively initiate arbitration requests to the arbitration clusters of the third parties, the arbitration winning party continues to provide service, the losing party stops service, and the data center with high priority has arbitration priority. However, when the third party arbitrates to fail simultaneously with the fiber link, this results in the entire dual active cluster to fail entirely, thereby terminating the external service.
Disclosure of Invention
Aiming at the technical problems, the invention adopts the following technical scheme: a data processing system for monitoring a status of a data center, the system comprising: the system comprises a first data center, a second data center, a first equipment state monitoring end, a second equipment state monitoring end, a processor and a memory storing a computer program;
the first data center is in communication connection with the second data center; the first data center and the second data center are in communication connection with the first equipment state monitoring end in a first communication mode; the first data center and the second data center are in communication connection with the second equipment state monitoring end in a second communication mode; and the first communication mode is different from the second communication mode;
when the computer program is executed by a processor, the following steps are implemented:
s100, acquiring the current communication state between the first data center and the second data center;
s200, when the current communication state is a fault state, acquiring a fault type of the current communication state, and when the fault type of the current communication state is a disk fault, executing S300; when the fault type of the current communication state is a network fault, executing S500;
s300, simultaneously receiving a request instruction of the first data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the first data center for sending an arbitration request to the second equipment state monitoring end, and simultaneously receiving a request instruction of the second data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the second data center for sending an arbitration request to the second equipment state monitoring end;
s400, receiving an instruction of an arbitration state request sent by the first equipment state monitoring end and an instruction of an arbitration state request sent by the second equipment state monitoring end at the same time so as to realize state monitoring of the first data center and the second data center;
s500, acquiring the fault type of the last fault state between the first data center and the second data center, and executing S300 if the fault type of the last fault state is a disk fault; otherwise, executing S600;
s600, receiving a request instruction from the first data center to send an arbitration request to the second equipment state monitoring end and the second data center to send the request instruction from the arbitration request to the second equipment state monitoring end, and executing S700;
s700, receiving an instruction of an arbitration state request sent by the second equipment state monitoring end so as to realize state monitoring of the first data center and the second data center.
The invention has at least the following beneficial effects: in summary, acquiring a current communication state between the first data center and the second data center in a current time period; when the current communication state is a fault state, acquiring a fault type of the current communication state; when the fault type of the current communication state is a disk fault, simultaneously receiving a request instruction of the first data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the first data center for sending an arbitration request to the second equipment state monitoring end, and receiving a request instruction of the second data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the second data center for sending an arbitration request to the second equipment state monitoring end, and when simultaneously receiving an instruction of the arbitration state request sent by the first equipment state monitoring end and an instruction of the arbitration state request sent by the second equipment state monitoring end, realizing state monitoring of the first data center and the second data center; when the fault type of the current communication state is a network fault, acquiring the fault type of the last fault state between the first data center and the second data center, if the fault type of the last fault state is a disk fault in the fault type, executing a processing method of the disk fault, otherwise, receiving a request instruction of the first data center for sending an arbitration request to the second equipment state monitoring end and a request instruction of the second data center for sending an arbitration request to the second equipment state monitoring end, and receiving an instruction of the arbitration state request sent by the second equipment state monitoring end so as to realize state monitoring of the first data center and the second data center; through the first equipment state monitoring end and the second equipment state monitoring end, when any equipment state monitoring end is in a problem, the other equipment state monitoring end continues to work, and different processing schemes are adopted under the condition of disk faults and network faults, so that the normal monitoring of the first data center and the second data center is ensured, and further the normal service provision is ensured.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required for the description of the embodiments will be briefly described below, and it is apparent that the drawings in the following description are only some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flow chart of a data processing system for monitoring the status of a data center according to an embodiment of the present invention when executing a computer program.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to fall within the scope of the invention.
An embodiment of the present invention provides a data processing system for monitoring a status of a data center, the system including: the system comprises a first data center, a second data center, a first equipment state monitoring end, a second equipment state monitoring end, a processor and a memory storing a computer program.
The first data center is in communication connection with the second data center; the first data center and the second data center are in communication connection with the first equipment state monitoring end in a first communication mode; the first data center and the second data center are in communication connection with the second equipment state monitoring end in a second communication mode; and the first communication mode is different from the second communication mode.
Specifically, the communication connection between the first data center and the second data center is that the first data center and the second data center are in communication connection through optical fibers.
Specifically, the first communication connection mode is ethernet communication connection, and the second communication connection mode is 5G network communication connection.
Specifically, the first device state monitoring end may be a third party arbitration service unit or a cloud arbitration service unit; the second equipment state monitoring end can be a third party arbitration service unit or a cloud arbitration service unit.
When the computer program is executed by a processor, as shown in fig. 1, the following steps are implemented:
s100, acquiring the current communication state between the first data center and the second data center in the current time period.
S200, when the current communication state is a fault state, acquiring a fault type of the current communication state, and when the fault type of the current communication state is a disk fault, executing S300; and when the fault type of the current communication state is a network fault, executing S500.
Specifically, the current communication state is a fault state, and the current communication state is that a disk abnormality and a network abnormality occur.
Specifically, the disk failure is abnormal in the disk dropping time of the data packet of the first data center and the second data center; the network anomaly is a network module anomaly in the first data center and the second data center or a communication connection anomaly between the first data center and the second data center.
S300, simultaneously receiving a request instruction of the first data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the first data center for sending an arbitration request to the second equipment state monitoring end, and simultaneously receiving a request instruction of the second data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the second data center for sending an arbitration request to the second equipment state monitoring end.
It can be understood that when a disk failure occurs, the first data center sends an arbitration request to the first device state monitoring end and the second device state monitoring end at the same time, and the second data center sends an arbitration request to the first device state monitoring end and the second device state monitoring end at the same time.
S400, receiving an instruction of the arbitration state request sent by the first equipment state monitoring end and an instruction of the arbitration state request sent by the second equipment state monitoring end at the same time so as to realize state monitoring of the first data center and the second data center.
Specifically, the instruction of the arbitration state request is used for judging whether the first data center continues to provide service or the second data center continues to provide service.
Specifically, the first equipment state monitoring end and the second equipment state monitoring end determine an instruction of an arbitration state request based on priorities of a first data center and a second data center; it can be understood that, when the first equipment state monitoring end and the second equipment state monitoring end arbitrate, the data center with high priority continues to provide service; since the instructions of the arbitration state request are determined based on the priorities of the first data center and the second data center, the instructions of the arbitration state request issued by the first device state monitoring side and the second device state monitoring side are identical.
S500, acquiring the fault type of the last fault state between the first data center and the second data center, and executing S300 if the fault type of the last fault state is a disk fault; otherwise, S600 is performed.
Specifically, when the current communication state is a network failure of a failure state and the historical communication state is a disk failure in a failure type, the connection between the first data center and the first equipment state monitoring end as well as the connection between the second data center and the second equipment state monitoring end are considered to be in a normal working state, so that S400 is executed; when the current communication state is a network fault of a fault state, it is not clear that the network port of the network module of the first equipment center or the second equipment center is abnormal or the communication connection between the first data center and the second data center is abnormal, so that the 5G network is preferentially used for arbitration.
S600, receiving a request instruction from the first data center to send an arbitration request to the second equipment state monitoring end and the second data center to send the request instruction from the arbitration request to the second equipment state monitoring end, and executing S700.
S700, receiving an instruction of an arbitration state request sent by the second equipment state monitoring end so as to realize state monitoring of the first data center and the second data center.
In summary, acquiring a current communication state between the first data center and the second data center in a current time period; when the current communication state is a non-fault state, monitoring the current communication state is maintained; when the current communication state is a fault state, acquiring a fault type of the current communication state; when the fault type of the current communication state is a disk fault, simultaneously receiving a request instruction of the first data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the first data center for sending an arbitration request to the second equipment state monitoring end, and receiving a request instruction of the second data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the second data center for sending an arbitration request to the second equipment state monitoring end, and when simultaneously receiving an instruction of the arbitration state request sent by the first equipment state monitoring end and an instruction of the arbitration state request sent by the second equipment state monitoring end, realizing state monitoring of the first data center and the second data center; when the fault type of the current communication state is a network fault, acquiring the fault type of the last fault state between the first data center and the second data center, if the fault type of the last fault state is a disk fault in the fault type, executing a processing method of the disk fault, otherwise, receiving a request instruction of the first data center for sending an arbitration request to the second equipment state monitoring end and a request instruction of the second data center for sending an arbitration request to the second equipment state monitoring end, and receiving an instruction of the arbitration state request sent by the second equipment state monitoring end so as to realize state monitoring of the first data center and the second data center; through the first equipment state monitoring end and the second equipment state monitoring end, when any equipment state monitoring end is in a problem, the other equipment state monitoring end continues to work, and different processing schemes are adopted under the condition of disk faults and network faults, so that the normal monitoring of the first data center and the second data center is ensured, and further the normal service provision is ensured.
Further, after S700, the method further includes:
s710, receiving a request instruction from the first data center to send an arbitration request to the first equipment state monitoring end and sending a request instruction from the second data center to send an arbitration request to the first equipment state monitoring end.
S720, when the instruction of the arbitration state request sent by the first equipment state monitoring end is not received, receiving a request instruction of the first data center for sending the arbitration request to the first equipment state monitoring end every a preset verification time period in a future time period, and sending the request instruction of the arbitration request to the first equipment state monitoring end by the second data center.
Specifically, the preset verification period may be determined according to actual requirements. It can be understood that when a network fault occurs, an instruction for sending an arbitration state request to the second device state monitoring end is sent preferentially, and then an instruction for sending an arbitration state request to the first device state monitoring end is used for verifying whether the first device state monitoring end can work normally.
Specifically, the method further comprises the step of determining the current communication state through the following steps:
s101, clock information of a first data center and clock information of a second data center are acquired.
Specifically, those skilled in the art know that any method for obtaining clock information in the prior art belongs to the protection scope of the present invention, and will not be described herein.
S102, acquiring clock information of a first data center and time error delta T of standard time 1 And acquiring clock information of the second data center and time error DeltaT of standard time 2 Thereby obtaining a time error Δt between the first data center and the second data center.
Specifically, deltaT 1 As absolute value of time difference between clock information of first data center and standard time, deltaT 2 The delta T is the absolute value of the time difference between the clock information of the first data center and the standard time, and the delta T is the absolute value of the clock information of the first data center and the clock information of the second data center.
S103, when DeltaT is larger than a first time difference threshold, calibrating the clock of the first data center and the clock of the second data center.
S104, when the DeltaT is less than or equal to a first time difference threshold, acquiring synchronous data packet information between the first data center and the second data center, so as to determine that the current communication state is a non-fault state according to the synchronous data packet information between the first data center and the second data center.
Based on S101 to S104, when the time error of the clock information of the first data center and the clock information of the second data center is not greater than the first time difference threshold, synchronous data packet information between the first data center and the second data center is acquired, and the current communication state is determined based on the synchronous packet information between the first data center and the second data center.
Further, in S104, the following steps are further included:
s1041, obtaining a main data center; wherein, if DeltaT 1 >△T 2 Taking the second data center as a main data center; otherwise, the first data center is used as a main data center; the backup data center is the other data center of the first data center and the second data center other than the main data center.
S1042, obtaining n first presets of the standby data centerInterval T 1 Synchronous data packet transmission time list set A= { A in the inner 1 ,A 2 ,…,A j ,…,A n J-th synchronous data packet transmitting time list A j ={A j1 ,A j2 ,……,A ji ,……,A jm },A ji Is the transmission time of the ith synchronous data packet time in the jth first preset time period, the value range of j is 1 to n, the value range of i is 1 to m, and m is the first time period T 1 The number of sync packets collected internally.
Specifically T 1 The following conditions are satisfied: t (T) 1 =Floor(T 0 ×n 0 /int(△T/T 0 ) X delta T, floor () is an upward rounding function, int () is a rounding function, n 0 Is T 1 Number of transmitted data packets, T 0 Is the time at which the data packet is transmitted.
S1043, obtaining the n first preset time periods T of the main data center 1 Synchronous data packet receiving time set B= { B in the frame 1 ,B 2 ,…,B j ,…,B n },B j ={B j1 ,B j2 ,……,B ji ,……,B jm },B ji Is the reception time of the ith synchronization packet time within the jth first preset time period.
Specifically, n first preset time periods of the main data center are the same as n first preset time periods of the standby data center.
S1044, obtaining a time difference set t= { t 1 ,t 2 ,…,t j ,…,t n J-th time difference list t j ={t j1 ,t j2 ,…,t ji ,…,t jm And t is }, where ji =B ji -A ji -△T。
S1045, according to the time difference set t, obtaining a time difference mean Deltat.
Specifically, Δt=1/n×1/m Σ n j=1m i=1 t ji
S1046, when Deltat > the second time difference threshold, determining that the current communication state is a fault state.
S1047, when the delta t is less than or equal to the second time difference threshold, determining that the current communication state is a non-fault state.
In summary, a main data center is acquired, and standby data centers are acquired in n first preset time periods T 1 The method comprises the steps of obtaining a synchronous data packet sending time set in a main data center in n first preset time periods T 1 And receiving the time set of the synchronous data packet, acquiring a time difference set, acquiring a time difference mean value based on the time difference set, and determining the current communication state based on the time difference mean value.
Specifically, in S200, the fault type of the current communication state is also determined by:
s210, acquiring n first preset time periods T of the main data center 1 Ca=1/n×Σof the average reply acknowledgement time of (a) n i=1 C ji ;C ji =B ji -B0 ji -G,B0 ji The response confirmation time from the receiving of the ith synchronous data packet by the main data center to the standby data center in the jth first preset time period is the response confirmation time, and G is the preset response confirmation time.
S220, when CA > the third time threshold, determining the fault type of the current communication state as disk fault.
S230, when the CA is less than or equal to a third time threshold, determining that the fault type of the current communication state is network fault.
In summary, acquiring average reply confirmation time of the main data center in n first preset time periods, and considering that the main data center is a disk fault when the average reply time is greater than a third time threshold; otherwise, the network is considered to be faulty.
Further, the invention also comprises:
s2101, obtaining the backup data center and average reply confirmation time cb=1/n×Σof the backup data center in n first preset time periods n i=1 E ji ,E ji =F ji -F0 ji -G,F ji Is the time of the ith synchronous data packet received by the data center in the jth first preset time periodBetween F0 ji The standby data center receives the time of replying and confirming the ith synchronous data packet to the main data center in the jth first preset time period.
S2102, obtaining a first weight factor W corresponding to a main data center c1 First weight factor W corresponding to standby data center c2 ,W c1 =1-(CA/(CA+CB)),W c2 =1-(CB/CA+CB)。
Further, the invention also comprises:
s001, obtaining a second weight factor W corresponding to the main data center t1 Second weight factor W corresponding to standby data center t2 ,W t1 =1-△T 1 /(△T 1 +△T 2 ),W t2 =1-△T 2 /(△T 1 +△T 2 )。
Still further, the present invention further comprises:
s002, obtain da=1/nx1/mx Σ n i=1 (∑ m j=1 H ji );H ji =B ji -B0 ji
S003, obtaining db=1/nx1/mx Σ n i=1 (∑ m j=1 I ji ),I ji =F ji -F0 ji
S004, obtaining a third weight factor W corresponding to the main data center d1 Third weight factor W corresponding to standby data center d2 ,W d1 =1-(DA/(DA+DB)),W d2 =1-(DB/DA+DB)。
Further, the priorities of the first data center and the second data center are determined by:
s005, obtaining the weight factor W of the main data center 1 And weight factor W of backup data center 2 ,W 1 =W t1 +W c1 +W d1
W 2 =W t2 +W c2 +W d2
S006, based on W 1 And W is 2 Determining priorities of a first data center and a second data center, the first data centerThe priorities of a data center and a second data center are used for determining an instruction of an arbitration state request sent by the first equipment state monitoring end and an instruction of an arbitration state request sent by the second equipment state monitoring end.
In summary, a first weight factor corresponding to the main data center and a first weight factor corresponding to the standby data center are obtained, a second weight factor corresponding to the main data center and a second weight factor corresponding to the standby data center are obtained, a third weight factor corresponding to the main data center and a third weight factor corresponding to the standby data center are obtained, and thus the weight factors of the main data center and the standby data center are obtained, and priorities of the first data center and the second data center are obtained, so that the data center with more accurate clock information and faster landing speed continues to provide service.
While certain specific embodiments of the invention have been described in detail by way of example, it will be appreciated by those skilled in the art that the above examples are for illustration only and are not intended to limit the scope of the invention. Those skilled in the art will also appreciate that many modifications may be made to the embodiments without departing from the scope and spirit of the invention. The scope of the invention is defined by the appended claims.

Claims (10)

1. A data processing system for monitoring the status of a data center, the system comprising: the system comprises a first data center, a second data center, a first equipment state monitoring end, a second equipment state monitoring end, a processor and a memory storing a computer program;
the first data center is in communication connection with the second data center; the first data center and the second data center are in communication connection with the first equipment state monitoring end in a first communication mode; the first data center and the second data center are in communication connection with the second equipment state monitoring end in a second communication mode; and the first communication mode is different from the second communication mode;
when the computer program is executed by a processor, the following steps are implemented:
s100, acquiring the current communication state between the first data center and the second data center;
s200, when the current communication state is a fault state, acquiring a fault type of the current communication state, and when the fault type of the current communication state is a disk fault, executing S300; when the fault type of the current communication state is a network fault, executing S500;
s300, simultaneously receiving a request instruction of the first data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the first data center for sending an arbitration request to the second equipment state monitoring end, and simultaneously receiving a request instruction of the second data center for sending an arbitration request to the first equipment state monitoring end and a request instruction of the second data center for sending an arbitration request to the second equipment state monitoring end;
s400, receiving an instruction of an arbitration state request sent by the first equipment state monitoring end and an instruction of an arbitration state request sent by the second equipment state monitoring end at the same time so as to realize state monitoring of the first data center and the second data center;
s500, acquiring the fault type of the last fault state between the first data center and the second data center, and executing S300 if the fault type of the last fault state is a disk fault; otherwise, executing S600;
s600, receiving a request instruction from the first data center to send an arbitration request to the second equipment state monitoring end and the second data center to send the request instruction from the arbitration request to the second equipment state monitoring end, and executing S700;
s700, receiving an instruction of an arbitration state request sent by the second equipment state monitoring end so as to realize state monitoring of the first data center and the second data center.
2. The data processing system for monitoring a status of a data center of claim 1, further comprising, after S700:
s710, receiving a request instruction from the first data center to send an arbitration request to the first equipment state monitoring end and the second data center to send an arbitration request to the first equipment state monitoring end;
s720, when the instruction of the arbitration state request sent by the first equipment state monitoring end is not received, receiving a request instruction of the first data center for sending the arbitration request to the first equipment state monitoring end every a preset verification time period in a future time period, and sending the request instruction of the arbitration request to the first equipment state monitoring end by the second data center.
3. The data processing system for monitoring the status of a data center of claim 1, further comprising determining the current communication status by:
s101, acquiring clock information of a first data center and clock information of a second data center;
s102, acquiring clock information of a first data center and time error delta T of standard time 1 And acquiring clock information of the second data center and time error DeltaT of standard time 2 Thereby obtaining a time error Δt between the first data center and the second data center;
s103, when DeltaT is larger than a first time difference threshold, calibrating the clock of the first data center and the clock of the second data center;
s104, when the DeltaT is less than or equal to a first time difference threshold, acquiring data packet information between the first data center and the second data center, so as to determine the current communication state according to the data packet information between the first data center and the second data center.
4. A data processing system for monitoring the status of a data center as claimed in claim 3, further comprising the step of, in S104:
s1041, acquiring a main data center and a standby data center; wherein, if DeltaT 1 >△T 2 Taking the second data center as a main data center; otherwise, the first data center is used as a main data center; the standby data center is the other data center except the main data center in the first data center and the second data center;
s1042, acquiring n first preset time periods T of the standby data center 1 Synchronous data packet transmission time list set A= { A in the inner 1 ,A 2 ,…,A j ,…,A n J-th synchronous data packet transmitting time list A j ={A j1 ,A j2 ,……,A ji ,……,A jm },A ji Is the transmission time of the ith synchronous data packet time in the jth first preset time period, the value range of j is 1 to n, the value range of i is 1 to m, and m is the first time period T 1 The number of internally acquired synchronous data packets;
s1043, obtaining the n first preset time periods T of the main data center 1 Synchronous data packet receiving time set B= { B in the frame 1 ,B 2 ,…,B j ,…,B n },B j ={B j1 ,B j2 ,……,B ji ,……,B jm },B ji Is the receiving time of the ith synchronous data packet time in the jth first preset time period;
s1044, obtaining a time difference set t= { t 1 ,t 2 ,…,t j ,…,t n },t j ={t j1 ,t j2 ,…,t ji ,…,t jm And t is }, where ji =B ji -A ji -△T;
S1045, obtaining a time difference mean Deltat according to the time difference set t;
s1046, when Deltat is more than a second time difference threshold, determining that the current communication state is a fault state;
s1047, when the delta t is less than or equal to the second time difference threshold, determining that the current communication state is a non-fault state.
5. The data processing system for monitoring a status of a data center of claim 4, wherein the type of failure of the current communication status is further determined in S200 by:
s210, acquiring average reply confirmation time CA=1/n×Σof the main data center in n first preset time periods n i=1 C ji ;C ji =B ji -B0 ji -G,B0 ji The method comprises the steps that the main data center receives the reply confirmation time from the ith synchronous data packet to the standby data center in the jth first preset time period, and G is preset reply confirmation time;
s220, when CA is more than a third time threshold, determining that the fault type of the current communication state is a disk fault;
s230, when the CA is less than or equal to a third time threshold, determining that the fault type of the current communication state is network fault.
6. The data processing system for monitoring the status of a data center of claim 5, further comprising:
s2101, obtaining the backup data center and average reply confirmation time cb=1/n×Σof the backup data center in n first preset time periods n i=1 E ji ,E ji =F ji -F0 ji -G,F ji Is the time of the ith synchronous data packet received by the standby data center in the jth first preset time period, F0 ji The standby data center receives the time of replying and confirming the ith synchronous data packet to the main data center in the jth first preset time period;
s2102, obtaining a first weight factor W corresponding to a main data center c1 First weight factor W corresponding to standby data center c2 ,W c1 =1-(CA/(CA+CB)),W c2 =1-(CB/CA+CB)。
7. The data processing system for monitoring the status of a data center of claim 6, further comprising:
s001, obtaining a second weight factor W corresponding to the main data center t1 Second weight factor corresponding to standby data centerSon W t2 ,W t1 =1-△T 1 /(△T 1 +△T 2 ),W t2 =1-△T 2 /(△T 1 +△T 2 )。
8. The data processing system for monitoring the status of a data center of claim 6, further comprising:
s002, obtain da=1/nx1/mx Σ n i=1 (∑ m j=1 H ji );H ji =B ji -B0 ji
S003, obtaining db=1/nx1/mx Σ n i=1 (∑ m j=1 I ji ),I ji =F ji -F0 ji
S004, obtaining a third weight factor W corresponding to the main data center d1 Third weight factor W corresponding to standby data center d2 ,W d1 =1-(DA/(DA+DB)),W d2 =1-(DB/DA+DB)。
9. The data processing system for monitoring the status of a data center of claim 8, wherein the priorities of the first data center and the second data center are determined by:
s005, obtaining the weight factor W of the main data center 1 And weight factor W of backup data center 2 ,W 1 =W t1 +W c1 +W d1
W 2 =W t2 +W c2 +W d2
S006, based on W 1 And W is 2 And determining the priorities of a first data center and a second data center, wherein the priorities of the first data center and the second data center are used for determining an instruction of an arbitration state request sent by the first equipment state monitoring end and an instruction of an arbitration state request sent by the second equipment state monitoring end.
10. The data processing system for monitoring the status of a data center of claim 1, wherein the first communication connection is an ethernet communication connection and the second communication connection is a 5G network communication connection.
CN202311568218.2A 2023-11-21 2023-11-21 Data processing system for monitoring state of data center Pending CN117614805A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311568218.2A CN117614805A (en) 2023-11-21 2023-11-21 Data processing system for monitoring state of data center

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311568218.2A CN117614805A (en) 2023-11-21 2023-11-21 Data processing system for monitoring state of data center

Publications (1)

Publication Number Publication Date
CN117614805A true CN117614805A (en) 2024-02-27

Family

ID=89949054

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311568218.2A Pending CN117614805A (en) 2023-11-21 2023-11-21 Data processing system for monitoring state of data center

Country Status (1)

Country Link
CN (1) CN117614805A (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060168192A1 (en) * 2004-11-08 2006-07-27 Cisco Technology, Inc. High availability for intelligent applications in storage networks
CN105893176A (en) * 2016-03-28 2016-08-24 杭州宏杉科技有限公司 Management method and device of network storage system
CN106170948A (en) * 2015-07-30 2016-11-30 华为技术有限公司 A kind of referee method for dual-active data center, Apparatus and system
CN106909307A (en) * 2015-12-22 2017-06-30 华为技术有限公司 A kind of method and device for managing dual-active storage array
CN107147528A (en) * 2017-05-23 2017-09-08 郑州云海信息技术有限公司 One kind stores gateway intelligently anti-fissure system and method
US20180260125A1 (en) * 2017-03-10 2018-09-13 Pure Storage, Inc. Synchronously replicating datasets and other managed objects to cloud-based storage systems
US20200112499A1 (en) * 2018-10-07 2020-04-09 Hewlett Packard Enterprise Development Lp Multiple quorum witness
CN116668269A (en) * 2022-02-21 2023-08-29 华为云计算技术有限公司 Arbitration method, device and system for dual-activity data center

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060168192A1 (en) * 2004-11-08 2006-07-27 Cisco Technology, Inc. High availability for intelligent applications in storage networks
CN106170948A (en) * 2015-07-30 2016-11-30 华为技术有限公司 A kind of referee method for dual-active data center, Apparatus and system
CN106909307A (en) * 2015-12-22 2017-06-30 华为技术有限公司 A kind of method and device for managing dual-active storage array
CN105893176A (en) * 2016-03-28 2016-08-24 杭州宏杉科技有限公司 Management method and device of network storage system
US20180260125A1 (en) * 2017-03-10 2018-09-13 Pure Storage, Inc. Synchronously replicating datasets and other managed objects to cloud-based storage systems
CN107147528A (en) * 2017-05-23 2017-09-08 郑州云海信息技术有限公司 One kind stores gateway intelligently anti-fissure system and method
US20200112499A1 (en) * 2018-10-07 2020-04-09 Hewlett Packard Enterprise Development Lp Multiple quorum witness
CN116668269A (en) * 2022-02-21 2023-08-29 华为云计算技术有限公司 Arbitration method, device and system for dual-activity data center

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
柏玉锋;: "基于阵列双活功能的本地存储双活部署方案", 信息通信, no. 01, 15 January 2017 (2017-01-15) *
魏雪梅;: "双活数据中心解决方案", 甘肃科技纵横, no. 06, 25 June 2018 (2018-06-25) *

Similar Documents

Publication Publication Date Title
CN107295080B (en) Data storage method applied to distributed server cluster and server
EP0416943B1 (en) Method for controlling failover between redundant network interface modules
EP2362585B1 (en) Information processor and network control system
US6385665B1 (en) System and method for managing faults in a data transmission system
US20100154016A1 (en) Multi-channel ranging for a cable modem
CN111355600B (en) Main node determining method and device
CN112583683B (en) Master-slave CAN FD bus application layer communication method and system and electronic equipment
US8521869B2 (en) Method and system for reporting defects within a network
KR20100020253A (en) Monitoring apparatus for message transmission in network for a vehicle
CN113261220B (en) Method and control node for handling failure of TSN communication link
CN114844809A (en) Multi-factor arbitration method and device based on network heartbeat and kernel disk heartbeat
CN117614805A (en) Data processing system for monitoring state of data center
CN108667640B (en) Communication method and device, and network access system
US11874786B2 (en) Automatic switching system and method for front end processor
CN115378841B (en) Method and device for detecting state of equipment accessing cloud platform, storage medium and terminal
CN113850033B (en) Redundancy system, redundancy management method and readable storage medium
CN114884767B (en) Synchronous dual-redundancy CAN bus communication system, method, equipment and medium
CN115842760A (en) Fault detection method and device, electronic equipment and storage medium
CN114356810B (en) Communication connection method, device, equipment and medium for host and storage system
JP2007166446A (en) Pon system, its abnormality determining method, and station side device
US9118540B2 (en) Method for monitoring a plurality of rack systems
US8566634B2 (en) Method and system for masking defects within a network
CN112034774A (en) Hot redundancy control method
CN116795290A (en) Data synchronization method, baseboard management controller, electronic device and storage medium
JP7417773B1 (en) Network interface card and transmission performance monitoring method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination