CN102056207B - Method and system for realizing disaster recovery and switching - Google Patents

Method and system for realizing disaster recovery and switching Download PDF

Info

Publication number
CN102056207B
CN102056207B CN200910236738.7A CN200910236738A CN102056207B CN 102056207 B CN102056207 B CN 102056207B CN 200910236738 A CN200910236738 A CN 200910236738A CN 102056207 B CN102056207 B CN 102056207B
Authority
CN
China
Prior art keywords
data access
access link
switching
response time
service quality
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN200910236738.7A
Other languages
Chinese (zh)
Other versions
CN102056207A (en
Inventor
章恩华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp filed Critical ZTE Corp
Priority to CN200910236738.7A priority Critical patent/CN102056207B/en
Priority to PCT/CN2010/073261 priority patent/WO2010145428A1/en
Publication of CN102056207A publication Critical patent/CN102056207A/en
Application granted granted Critical
Publication of CN102056207B publication Critical patent/CN102056207B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W24/00Supervisory, monitoring or testing arrangements
    • H04W24/04Arrangements for maintaining operational condition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W8/00Network data management
    • H04W8/02Processing of mobility data, e.g. registration information at HLR [Home Location Register] or VLR [Visitor Location Register]; Transfer of mobility data, e.g. between HLR, VLR or external networks
    • H04W8/04Registration at HLR or HSS [Home Subscriber Server]

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention discloses a method and a system for realizing disaster recovery and switching, wherein the method comprises the following steps: judging whether synchronous switching conditions are met or not according to service quality test result of a data access link between the front end or the back end after the disaster recovery and the switching and the back end or the front end without the disaster recovery and the switching of a network device for the network device performing the disaster recovery and the switching at one of the front end and the back end; and performing synchronous switching on the back end or the front end without the disaster recovery and the switching of the network device according to synchronous switching rules when the judgment result is that the synchronous switching conditions are met. With the help of the invention, not only the unnecessary switching between the FE (front end) and the BE (back end) can be avoided, but also the service performance of the system can be ensured through automatic synchronous switching between the FE and the BE, and the service interruption time caused by manual switching can be further avoided.

Description

The implementation method that disaster tolerance is switched and system
Technical field
The present invention relates to the communications field, particularly relate to implementation method and system that a kind of disaster tolerance switches.
Background technology
Attaching position register (home location register, referred to as HLR) for the data of storing mobile communication the whole network contracted user, high speed development along with mobile communication business and the increase increasingly to fail safe and integrated services demand, the active and standby and active and standby Disaster Tolerant Scheme of N+1 is evolved to distributed HLR from simple 1+1 to impel HLR.
At present, distributed HLR has following disaster tolerance: (1) is responsible for front end (the Front End of business logic processing, referred to as FE) and rear end (the Back End of responsible storage of subscriber data, referred to as BE) separated from one another, FE and BE both can be deployed in same geographic sites, also can be deployed in different geographic sites; (2) FE and BE can configure respective Disaster Tolerant Scheme respectively, realize symmetrical or asymmetric deployment, and when asymmetric deployment, the quantity of FE and BE is unequal, can realize the concentrated storage of data.
Usually, in traditional active and standby HLR Disaster Tolerant Scheme, no matter be primary website, or disaster tolerance website, all need to dispose FE and BE, and FE can only access the BE of this website.And in distributed HLR, FE can access all BE, thus effectively can avoid switching rear FE and to need and BE binds the restriction brought.Will make like this distributed HLR and traditional HLR Disaster Tolerant Scheme exist on system is switched below difference: traditional HLR Disaster Tolerant Scheme provide a website switching to another one website, particularly, in the process that primary website is switched to disaster tolerance website, include Business Processing and be switched to disaster tolerance FE from primary FE, and data access is switched to disaster tolerance BE from primary BE; And for distributed HLR, FE and BE can switch respectively, that is, primary FE breaks down, and only needs Business Processing to be switched to FE for subsequent use, does not need BE to carry out synchronous switching.Same, when BE needs to switch, do not require that FE carries out synchronous switching.
Fig. 1 shows the deployment architecture of the active and standby disaster tolerance system of traditional HLR 1+1.As shown in Figure 1, the data of primary HLR 401 by based on storing, the copy mode of database or application is synchronized to redundancy HLR 402, simultaneously, at Signalling Transfer Point (signaling transfer point, referred to as STP) 101,102 alternate routings 601 that primary HLR route 801 is set, and point to redundancy HLR 402.Before being switched to redundancy HLR 402, all service request are sent to primary HLR 401 by STP 101,102, and in primary HLR 401, primary FE 201 accesses the process of primary BE 301 finishing service request.After primary FE 201 or primary BE 301 fault, when being switched to redundancy HLR 402, stop the data Replica of primary BE 301 and disaster tolerance BE302, and block the signaling link of STP 101,102 and primary FE 201, all service request are sent to HLR 402 for subsequent use automatically.
Fig. 2 shows the deployment architecture of distributed HLR.As shown in Figure 2, primary FE 201, FE202 for subsequent use, primary BE 301, BE for subsequent use 302 all can be deployed in different geographical position.When primary FE 201 fault, business will be switched to FE 202 for subsequent use, and FE 202 for subsequent use can access the process of primary BE 301 finishing service request.
Because FE and BE allows to be deployed in different geographic sites, therefore, the service quality between FE and BE, just becomes the factor that restriction FE accesses BE data performance.Owing to there is the factor that geographical far and near difference etc. affects data performance between different FE and BE, FE can be caused to access some BE and can obtain optimal data performance.Such as, in the system shown in Fig. 2 is disposed, primary FE may data performance when accessing primary BE best, and data performance when accessing BE for subsequent use is poor.If only FE switches, BE does not carry out synchronous switching, and the service quality of switching between rear BE and FE can be made very poor, and data performance reduces, and when data performance can not meet business demand, can carry out by traffic affecting; In like manner, when BE switches separately, there will be this problem equally.
In like manner, except the HLR of distributed deployment, other can carry out distributed deployment, and business logic processing and data store the network equipment that carries out independently of one another (such as, comprise authentication, authorize, charging (Authentication, Authorization, Accounting is referred to as AAA) server, home subscriber server (Home Subscriber Server, referred to as HSS), equipment identity register (EquipmentIdentity Register, referred to as EIR)), all exist because FE and BE does not carry out synchronous switching and cause the affected problem of quality of service.
For causing data performance between BE and the FE after switching not to be guaranteed because BE and FE carries out asynchronous switching in the network equipment of distributed deployment in correlation technique, even can reduce data performance and then problem that the business that affects normally is carried out, not yet propose effective solution at present.
Summary of the invention
For the problem causing data access performance between FE and the BE after switching not to be guaranteed because FE and BE carries out asynchronous switching in the network equipment of distributed deployment in correlation technique, the business that even can affect because of the reduction of data access performance is normally carried out, the present invention proposes the implementation that a kind of disaster tolerance is switched, and can ensure to switch the data performance between rear FE and BE.
Technical scheme of the present invention is achieved in that
According to an aspect of the present invention, provide the implementation method that a kind of disaster tolerance is switched, for distributed deployment and the network equipment that business logic processing is separated with data storage realize switching.
Method according to the present invention comprises: for the side in front-end and back-end, the network equipment switched of disaster tolerance occurs, according to carry out disaster tolerance switch after front end or the service quality testing result of data access link that do not occur between rear end that disaster tolerance switches or front end of rear end and the network equipment judge whether to meet synchronous switching condition; When judged result is for meeting synchronous switching condition, according to synchronous switching rule rear end that disaster tolerance switches is not occurred to the network equipment or synchronous switching is carried out in front end.
Occur in the front end of the network equipment or rear end after disaster tolerance switches, the method can comprise further: the service quality occurring front end that disaster tolerance switches or rear end to detect data access link obtains the service quality that rear end that disaster tolerance switches or front-end detection data access link do not occur service quality testing result and/or the network equipment and obtains service quality testing result.
Preferably, the service quality testing result of data access link comprise following one of at least: in predetermined amount of time data access link two ends between request sum, in predetermined amount of time data access link two ends between request response time be greater than data access link in the number of requests of response time threshold value, predetermined amount of time two ends between average request response time.
And, when service quality testing result meet following one of at least, judge that data access link meets synchronous switching condition: in predetermined amount of time data access link two ends between the request response time number of requests that is greater than response time threshold value exceed default amount threshold, in predetermined amount of time data access link two ends between average request response time be greater than average response time threshold value, in predetermined amount of time data access link two ends between the ratio shared in total request amount of the request response time number of requests that is greater than the first default response time threshold value exceed default proportion threshold value.
Preferably, synchronous switching rule can comprise the new rear end of data access link of the foundation of the front end after switching with disaster tolerance or rear end or the priority of front end.
Preferably, the network equipment can be one of following: authentication, authorizes, accounting server; Attaching position register; Equipment identity register; Home subscriber server.
According to a further aspect in the invention, what provide that a kind of disaster tolerance switches realizes system, for to distributed deployment and the network equipment that business logic processing is separated with data storage realize switching, this system can comprise performance monitoring module, switch decision-making module and switching control module.
Particularly, performance monitoring module is arranged at the front end of each network equipment and/or is arranged at the rear end of each network equipment, for detecting the front end at this performance monitoring module place or rear end and with the service quality between the front end at this place or the rear end of back end communications or front end, obtaining service quality testing result; Switch decision-making module for receiving the service quality testing result from performance monitoring module, and according to service quality testing result judge network equipment generation disaster tolerance switch after front end or rear end and service quality that the data access link that disaster tolerance is switched between rear end or front end do not occur whether meet synchronous switching condition; Switching control module is used for when the judged result of switching decision-making module is for meeting synchronous switching condition, to the network equipment, rear end that disaster tolerance switches does not occur or synchronous switching is carried out in front end according to synchronous switching rule.
Wherein, the service quality testing result of data access link can comprise following one of at least: in predetermined amount of time data access link two ends between request sum, in predetermined amount of time data access link two ends between request response time be greater than data access link in the number of requests of response time threshold value, predetermined amount of time two ends between average request response time.
Preferably, when service quality testing result meet following one of at least, switch decision-making module and can judge that data access link meets synchronous switching condition: in predetermined amount of time data access link two ends between the request response time number of requests that is greater than response time threshold value exceed default amount threshold, in predetermined amount of time data access link two ends between average request response time be greater than average response time threshold value, in predetermined amount of time data access link two ends between the ratio shared in total request amount of the request response time number of requests that is greater than the first default response time threshold value exceed default proportion threshold value.
Preferably, the network equipment can be one of following: authentication, authorizes, accounting server; Attaching position register; Equipment identity register; Home subscriber server.
By means of technique scheme of the present invention, by monitoring the service quality (data performance) between FE and BE, coordinate the synchronous switching of FE and BE, unnecessary FE and BE can not only be avoided to switch, can also be switched by the automatic synchronization of FE and BE and ensure system business performance, and auto switching it also avoid the service outage duration that Manual Switch causes.
Accompanying drawing explanation
Fig. 1 is the deployment architecture of the active and standby disaster tolerance system of HLR 1+1 according to correlation technique;
Fig. 2 is the deployment architecture of the distributed HLR according to correlation technique;
Fig. 3 is the flow chart of the implementation method of switching according to the disaster tolerance of the inventive method embodiment;
Fig. 4 is the flow chart of the system that realizes that the disaster tolerance of system according to the invention embodiment is switched.
Embodiment
For distributed deployment in correlation technique, and the network equipment that business logic processing is separated with data storage (such as, HLR, aaa server, HSS, EIR etc.) side in upper FE and BE switches rear the opposing party and do not carry out synchronous switching and cause the data access performance between FE and the BE after switching cannot guaranteed problem, although the present invention considers that FE and BE in distributed HLR system can independently switch, but in some cases, after a side in FE and BE switches, system can be made to obtain better data access performance if the opposing party carries out synchronous switching, therefore, whether the present invention's service quality of giving chapter and verse between each FE and each BE carries out synchronous switching to the opposing party after determining switching in FE or BE, if after having carried out switching, data access performance between FE and BE is poor, then synchronous switching is carried out to the side do not switched in FE and BE.
Below in conjunction with accompanying drawing, describe embodiments of the invention in detail.
Embodiment of the method
In the present embodiment, provide the implementation method that a kind of disaster tolerance is switched, for to distributed deployment and the network equipment (such as, the equipment such as HLR, HSS, aaa server, EIR can be comprised) that business logic processing is separated with data storage realize switching.
Realizing in process of the present invention, first needing according to the communication conditions configuration in network by switching rule.Particularly, for FE, can determine this FE can preferentially and which BE set up data access link so that when after disaster tolerance is switched, this FE is called primary FE, be that this FE selects optimum BE; Similarly, for BE, can determine this BE can preferentially and which FE set up data access link so that when after disaster tolerance is switched, this BE is called primary BE, be that this BE selects optimum FE; Afterwards, alternatively, this rule can be preserved by form, such as, as shown in table 1.
Table 1
List item is numbered Access main body Performance ranking when access main body conducts interviews
1001 FE201 BE301 BE302
1002 FE202 BE302 BE301
1003 BE301 FE201 FE202
1004 BE302 FE202 FE201
This switch rule can the network planning time pre-configured, the factor such as geographical position relation of its configuration result and each FE, BE is relevant, is not described in detail herein.
In addition, also needing each front end of all-network equipment and/or backend arrangement is the service quality can monitoring the data access link relevant to this front end and/or rear end.
Such as, in the network deployment structure shown in Fig. 2, primary FE 201 can detect the service quality between itself and primary BE301, can detect the service quality between itself and primary BE 301; Same, primary BE 301 also can detect the service quality between itself and primary FE 201, and BE 302 for subsequent use also can detect the service quality between itself and FE for subsequent use 202, and, can two-way detection be carried out between FE and BE when being necessary.
Actual carry out service quality detect time, can adopt a variety of method, fairly simple and effective mode is exactly the mode of request/response, namely, send request to the other side in FE and/or BE and wait until the response that the other side returns, according to the time detecting service quality of response.
When statistics network result, for each FE and BE, the information such as request sum, average response time, the number of responses exceeding default response time threshold value in certain hour section therebetween can be recorded.
Below in conjunction with accompanying drawing, describe processing procedure of the present invention in detail.
As shown in Figure 3, comprise according to the implementation method that the disaster tolerance of the present embodiment is switched: step S302, for the network equipment that the side's generation disaster tolerance in FE and BE is switched, according to carry out disaster tolerance switch after FE or BE and the network equipment there is not data access link between BE or FE that disaster tolerance switches (namely, after disaster tolerance is switched, newly-established data access link) service quality testing result judge whether to meet synchronous switching condition, synchronous switching condition can have multiple situation, such as, the service quality can thinking on data access link meets synchronous switching condition when can not meet business demand, , when judged result is for meeting synchronous switching condition, there is not to the network equipment BE or FE that disaster tolerance switches according to synchronous switching rule (this rule can be obtained by pre-configured) and carrying out synchronous switching in step S304.
In step s 302, occur after disaster tolerance switches at FE or BE of the network equipment, FE or BE that disaster tolerance switches occurs to be needed to detect the service quality of data access link and obtains service quality testing result and/or the network equipment and the service quality that BE or FE that disaster tolerance switches detect data access link does not occur obtain service quality testing result.
Preferably, the service quality testing result of data access link can comprise following one of at least: in predetermined amount of time data access link two ends between request sum, in predetermined amount of time data access link two ends between request response time be greater than data access link in the number of requests of response time threshold value, predetermined amount of time two ends between average request response time.
When whether meeting the judgement of business demand, if at least one in meeting the following conditions, then think and can not meet business demand: the ratio that the quantity exceeding the response of predetermined response time threshold exceedes certain numerical value, the quantity that exceedes the response of predetermined response time threshold is occupied in overall response quantity exceedes certain proportion threshold value, average response time exceedes default average response time threshold value.
Because different business is different to the requirement of service quality, so above-mentioned condition can be used singly or in combination.Such as, for the undemanding business of quality of service requirement, can only judge with the condition of in above-mentioned condition.
Such as, suppose in the network deployment structure shown in Fig. 2, if primary FE 201 is switched to FE 202 for subsequent use, whether the service quality now with regard to the data access link between the primary BE 301 that needs to judge that the FE for subsequent use 202 and not carrying out after switching switches can meet business demand; If (namely judged result for being, do not meet synchronous switching condition), then synchronous switching is not carried out to primary BE 301, otherwise need primary BE 301 to switch to BE 302 for subsequent use, to be cooperated with BE 302 for subsequent use by FE 202 for subsequent use and carry out the business of deal with data.
First BE is switched, judges whether FE carries out the situation of synchronous switching, and processing mode is similar, no longer describes in detail afterwards here.
When selecting the object of synchronous switching, as shown in table 1, each list item specifies the auto switching strategy of FE and BE, such as, in list item 1001, FE 201 accesses the priority of BE 301 higher than BE302, therefore, if the performance that FE 201 accesses BE 302 (can not meet business demand lower than setting threshold values, namely, meet synchronous switching condition) time, BE 301 can switch as primary by switching control module automatically, does not then allow on the contrary; The performance that configuration item 1003, FE 201 accesses BE 301 accesses BE301 higher than FE 202, and therefore, if FE 202 accesses the performance of BE 301 lower than when setting threshold values, FE 201 can switch as primary by switching control module automatically, does not then allow on the contrary.
By above-mentioned process, automatic decision can switch with the synchronous disaster tolerance of control FE and BE, under making system run on optimum performance all the time, particularly for the asymmetric deployment of FE and BE, the service outage duration because Manual Switch produces can be avoided.
System embodiment
In the present embodiment, what provide that a kind of disaster tolerance switches realizes system, for distributed deployment and the network equipment (such as, comprising HLR, HSS, EIR, aaa server etc.) that business logic processing is separated with data storage realize switching.
The system that realizes of switching according to the disaster tolerance of the present embodiment comprises performance monitoring module, switches decision-making module and switching control module, particularly:
Performance monitoring module is arranged at the FE of each network equipment and/or is arranged at the BE of each network equipment, for the service quality between BE or FE that FE or BE detecting this performance monitoring module place communicates with FE or BE with this place, obtain service quality testing result; Switch decision-making module for receiving the service quality testing result from performance monitoring module, and according to service quality testing result judge network equipment generation disaster tolerance switch after FE or BE and service quality that the data access link that disaster tolerance is switched between BE or FE do not occur whether meet pre-configured synchronous switching condition; Switching control module is used for when the judged result of switching decision-making module is for meeting synchronous switching condition, to the network equipment, BE or FE that disaster tolerance switches does not occur carry out synchronous switching according to pre-configured synchronous switching rule.
Wherein, the service quality testing result of data access link can comprise following one of at least: in predetermined amount of time data access link two ends between request sum, in predetermined amount of time data access link two ends between request response time be greater than data access link in the number of requests of response time threshold value, predetermined amount of time two ends between average request response time.
Preferably, when service quality testing result meet following one of at least, switch decision-making module and can judge that data access link meets synchronous switching condition: in predetermined amount of time data access link two ends between the request response time number of requests that is greater than response time threshold value exceed default amount threshold, in predetermined amount of time data access link two ends between average request response time be greater than average response time threshold value, in predetermined amount of time data access link two ends between the ratio shared in total request amount of the request response time number of requests that is greater than the first default response time threshold value exceed default proportion threshold value.
Preferably, the network equipment can be one of following: authentication, authorizes, accounting server; Attaching position register; Equipment identity register; Home subscriber server.
Below HLR is in a distributed manner deployed as example describe in detail switch according to the disaster tolerance of the embodiment of the present invention realize system.
Fig. 4 is the structured flowchart of the system that realizes of switching according to the disaster tolerance of the present embodiment.
As shown in Figure 4, the service processing equipment of distributed HLR is made up of primary FE 201 and FE for subsequent use 202, is deployed in different geographical position; The data storage device of distributed HLR is made up of primary BE 301 and BE for subsequent use 302, is deployed in different geographical position, and primary BE 301 and BE for subsequent use 302 can with primary FE 201, FE for subsequent use 202 in same geographical position.
In the present embodiment, suppose at FE deploy performance monitoring module, namely, as shown in Figure 4, performance monitoring module 101 is deployed on primary FE 201, performance monitoring module 102 is deployed on FE 202 for subsequent use, and performance monitoring module 101 accesses the response time of BE 301 for monitoring primary FE 201, and performance monitoring module 102 accesses the response time of BE 302 for monitoring FE 202 for subsequent use.
Switch decision-making module 401 and switching control module 501 can be deployed on identical node, also can be deployed on different nodes respectively, preferably, can will switch decision-making module 401 and switching control module 501 is deployed in same geographical position.
Below for primary FE 201 fault, illustrate disaster tolerance according to the present invention and switch in the system of realization after FE breaks down and switches, the performance of accessing BE according to FE coordinates the disaster tolerance reversed process of FE and BE.
After primary FE 201 fault, the signaling link of STP and primary FE 201 interrupts, and arrange according to route stand-by, all service request are sent to FE 202 for subsequent use by STP.Because primary BE 301 does not break down, therefore, FE 202 accesses the process of the user data finishing service request of primary BE 301.
Performance monitoring module 102 can monitor the response time that FE 202 accesses primary BE 301 in real time, and sends in the mode regularly reported and switch decision-making module 401.Performance monitoring module 102 can pre-configured exception response time threshold values T1, makes to report the information of switching decision-making module 401 and can comprise the number of requests that total number of requests in this time period, average response time and response time are greater than exception response time threshold values T1.
Switch decision-making module 401 and collect the performance data that performance monitoring module 102 reports, and gather and analyze the performance data of all properties monitoring module of FE 202 in pre-configured time period T2.Switching decision-making module 401 uses pre-configured performance index threshold values to determine whether to need to trigger BE auto switching.Performance index threshold values can comprise average response time threshold values T3 and exception response request ratio threshold values T4, if gathering in time period T2, average response time exceedes threshold values T3, or the ratio that the request that the response time is greater than exception response time threshold values T1 accounts for total request is greater than exception response request ratio threshold values T4, switch decision-making module 401 to determine to need to trigger BE auto switching, and notify that switching control module 501 starts to switch.
After switching control module 501 receives the instruction of switching decision-making module 401, need to judge whether to allow BE auto switching, and select a target BE as new primary BE.If distributed HLR allows auto switching, and BE 302 running status for subsequent use is normal, then first switching control module 501 controls primary FE 301 and switches as subsequent use, then controlling FE 302 for subsequent use switches as primary, data copy to from BE 302 process that BE 301, FE 202 accesses the request of BE 302 finishing service.
Below will for primary BE 301 fault, illustrate according in disaster tolerance exchanging system of the present invention after BE breaks down and switches, the disaster tolerance that the performance of accessing BE according to FE coordinates FE and BE switches step.
After primary BE 301 fault, need BE 302 for subsequent use to switch as primary, and all service request are still sent to primary FE 201 by STP, primary FE 201 can only access the process of the user data finishing service request of BE 302.
Performance monitoring module 101 monitors the response time that primary FE 201 accesses BE 302, in the mode regularly reported, sends to and switches decision-making module 401.Performance monitoring module 101 can pre-configured exception response time threshold values T1, makes to report the information of switching decision-making module 401 and can comprise the number of requests that total number of requests in this time period, average response time and response time are greater than exception response time threshold values T1.
Switch decision-making module 401 and collect the performance data that performance monitoring module 101 reports, and gather and analyze the performance data of all properties monitoring module of primary FE 201 in pre-configured time period T2.Switching decision-making module 401 uses pre-configured performance index threshold values to determine whether to need to trigger FE auto switching.Performance index threshold values can comprise average response time threshold values T3 and exception response request ratio threshold values T4, if gathering in time period T2, average response time exceedes threshold values T3, or the ratio that the request that the response time is greater than exception response time threshold values T1 accounts for total request is greater than exception response request ratio threshold values T4, switch decision-making module 401 to determine to need to trigger FE auto switching, and notify that switching control module 501 starts to switch.
After switching control module 501 receives the instruction of switching decision-making module 401, need to judge whether to be configured to allow FE auto switching, and select a target FE as new primary FE.In Fig. 2, if distributed HLR is configured to allow auto switching, and FE 202 running status for subsequent use normal (and the service quality between FE 202 and BE 302 can meet business demand), then switching control module 501 first control FE 202 go block signaling link, then control FE 201 blocks signaling link, all service request are sent to the process that FE 202, FE 202 accesses the request of BE 302 finishing service by STP.
Preferably, when switching control module 501 selects auto switching target FE or target BE, the priority form shown in table 1 can be used.
Each priority list item in table 1 specifies the auto switching strategy of FE and BE, such as, in list item 1001, FE 201 accesses the priority of BE 301 higher than BE 302, therefore, if FE 201 accesses the performance of BE 302 lower than when setting threshold values, BE 301 can switch as primary by switching control module automatically, does not then allow on the contrary; The performance that configuration item 1003, FE 201 accesses BE 301 accesses BE 301 higher than FE 202, and therefore, if FE 202 accesses the performance of BE 301 lower than when setting threshold values, FE 201 can switch as primary by switching control module automatically, does not then allow on the contrary.
For the situation that there is plural FE and BE in system, can switch according to the method described above equally, detailed process is similar with process described before, no longer repeats here.
In addition, except the situation that performance monitoring module is set on FE shown in Fig. 4, also performance monitoring module can be set at BE, thus the response of monitoring FE, and, in the case of necessary, on FE and BE, all performance monitoring module can be set, thus reach the object mutually detected.
In addition, in the present invention, switching control module can block or go to block switching of means control FE and BE such as Signaling System Number 7 link, foundation or release SCTP coupling, and concrete implementation is known in those skilled in the art, will not enumerate herein.
By said system, automatic decision can switch with the synchronous disaster tolerance of control FE and BE, under making system run on optimum performance all the time, particularly for the asymmetric deployment of FE and BE, the service outage duration because Manual Switch produces can be avoided.
In sum, by means of technique scheme of the present invention, on the fault auto switching basis of FE and BE, by monitoring the data performance between FE and BE, coordinate the synchronous switching of FE and BE, unnecessary FE and BE can be avoided to switch, can also be switched by the automatic synchronization of FE and BE and ensure system business performance, and the service outage duration that Manual Switch causes can be avoided.
The foregoing is only preferred embodiment of the present invention, not in order to limit the present invention, within the spirit and principles in the present invention all, any amendment done, equivalent replacement, improvement etc., all should be included within protection scope of the present invention.

Claims (10)

1. a disaster tolerance implementation method of switching, for distributed deployment and the network equipment that business logic processing is separated with data storage realize switching, it is characterized in that, described method comprises:
The network equipment switched of disaster tolerance is occurred for the side in front-end and back-end, according to carry out disaster tolerance switch after front end or the service quality testing result of data access link that do not occur between rear end that disaster tolerance switches or front end of rear end and the described network equipment judge whether to meet synchronous switching condition;
When judged result is for meeting described synchronous switching condition, according to synchronous switching rule rear end that disaster tolerance switches is not occurred to the described network equipment or synchronous switching is carried out in front end.
2. method according to claim 1, is characterized in that, after the front end of the described network equipment or rear end generation disaster tolerance are switched, described method comprises further:
The service quality occurring the disaster tolerance described front end of switching or rear end to detect described data access link obtains the service quality that data access link described in rear end that disaster tolerance switches or front-end detection do not occur described service quality testing result and/or the described network equipment and obtains described service quality testing result.
3. method according to claim 1, it is characterized in that, the service quality testing result of described data access link comprise following one of at least: in predetermined amount of time described data access link two ends between request sum, in predetermined amount of time described data access link two ends between request response time be greater than described data access link in the number of requests of response time threshold value, predetermined amount of time two ends between average request response time.
4. method according to claim 3, is characterized in that, when described service quality testing result meet following one of at least, judge that described data access link meets described synchronous switching condition:
In predetermined amount of time described data access link two ends between the request response time number of requests that is greater than described response time threshold value exceed default amount threshold, in predetermined amount of time described data access link two ends between average request response time be greater than average response time threshold value, in predetermined amount of time described data access link two ends between the ratio shared in total request amount of the request response time number of requests that is greater than the first default response time threshold value exceed default proportion threshold value.
5. method according to claim 1, is characterized in that, the rear end of data access link that the foundation that described synchronous switching rule comprises the described front end after switching with disaster tolerance or rear end is new or the priority of front end.
6. method according to any one of claim 1 to 5, is characterized in that, the described network equipment is one of following: authentication, authorizes, accounting server; Attaching position register; Equipment identity register; Home subscriber server.
7. what a disaster tolerance was switched realizes system, for to distributed deployment and the network equipment that business logic processing is separated with data storage realize switching, it is characterized in that, described system comprises performance monitoring module, switches decision-making module and switching control module, wherein
Described performance monitoring module is arranged at the front end of each network equipment and/or is arranged at the rear end of described each network equipment, for detecting the front end at this performance monitoring module place or rear end and with the service quality between the front end at this place or the rear end of back end communications or front end, obtaining service quality testing result;
Described decision-making module of switching for receiving the service quality testing result from described performance monitoring module, and according to described service quality testing result judge network equipment generation disaster tolerance switch after front end or rear end and service quality that the data access link that disaster tolerance is switched between rear end or front end do not occur whether meet synchronous switching condition;
Described switching control module is used for when described judged result of switching decision-making module is for meeting described synchronous switching condition, to the described network equipment, rear end that disaster tolerance switches does not occur or synchronous switching is carried out in front end according to synchronous switching rule.
8. system according to claim 7, it is characterized in that, the service quality testing result of described data access link comprise following one of at least: in predetermined amount of time described data access link two ends between request sum, in predetermined amount of time described data access link two ends between request response time be greater than described data access link in the number of requests of response time threshold value, predetermined amount of time two ends between average request response time.
9. system according to claim 8, is characterized in that, when described service quality testing result meet following one of at least, described in switch decision-making module and judge that described data access link meets described synchronous switching condition:
In predetermined amount of time described data access link two ends between the request response time number of requests that is greater than described response time threshold value exceed default amount threshold, in predetermined amount of time described data access link two ends between average request response time be greater than average response time threshold value, in predetermined amount of time described data access link two ends between the ratio shared in total request amount of the request response time number of requests that is greater than the first default response time threshold value exceed default proportion threshold value.
10. the system according to any one of claim 7 to 9, is characterized in that, the described network equipment is one of following: authentication, authorizes, accounting server; Attaching position register; Equipment identity register; Home subscriber server.
CN200910236738.7A 2009-10-29 2009-10-29 Method and system for realizing disaster recovery and switching Active CN102056207B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN200910236738.7A CN102056207B (en) 2009-10-29 2009-10-29 Method and system for realizing disaster recovery and switching
PCT/CN2010/073261 WO2010145428A1 (en) 2009-10-29 2010-05-26 Method and system for implementing disaster recovery switch

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN200910236738.7A CN102056207B (en) 2009-10-29 2009-10-29 Method and system for realizing disaster recovery and switching

Publications (2)

Publication Number Publication Date
CN102056207A CN102056207A (en) 2011-05-11
CN102056207B true CN102056207B (en) 2015-04-01

Family

ID=43355813

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200910236738.7A Active CN102056207B (en) 2009-10-29 2009-10-29 Method and system for realizing disaster recovery and switching

Country Status (2)

Country Link
CN (1) CN102056207B (en)
WO (1) WO2010145428A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109242414A (en) * 2018-08-16 2019-01-18 深圳前海启航供应链管理有限公司 A kind of contract management system
CN116193384A (en) * 2021-11-26 2023-05-30 中兴通讯股份有限公司 Disaster recovery switching method, system, electronic equipment and storage medium
CN115225467A (en) * 2022-07-05 2022-10-21 中国铁道科学研究院集团有限公司 5G-EIR disaster recovery backup, fault detection and service recovery method for railway 5G private network system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6434713B1 (en) * 1998-09-03 2002-08-13 Lg Information & Communications, Ltd. Processor management method of mobile communication home location register (HLR) system
CN1753373A (en) * 2004-09-23 2006-03-29 华为技术有限公司 Remote disaster allowable system and method
CN101365182A (en) * 2008-09-23 2009-02-11 中兴通讯股份有限公司 Method, system and media gateway implementing relay gateway disaster tolerated uninterrupted call

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1204762C (en) * 2002-03-22 2005-06-01 华为技术有限公司 Redundant disaster method for realizing adscription of location registers throuth mobile communication operating system
CN1946058A (en) * 2006-10-28 2007-04-11 武汉市中光通信公司 Soft exchange device allopatric disaster recovery solution system and its method for software exchange network
CN101247568A (en) * 2008-03-12 2008-08-20 中兴通讯股份有限公司 Data fault-tolerance system and method and intermediate equipment
CN101426306B (en) * 2008-10-24 2011-06-01 中国移动通信集团山东有限公司 A disaster tolerance switching method, system and apparatus

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6434713B1 (en) * 1998-09-03 2002-08-13 Lg Information & Communications, Ltd. Processor management method of mobile communication home location register (HLR) system
CN1753373A (en) * 2004-09-23 2006-03-29 华为技术有限公司 Remote disaster allowable system and method
CN101365182A (en) * 2008-09-23 2009-02-11 中兴通讯股份有限公司 Method, system and media gateway implementing relay gateway disaster tolerated uninterrupted call

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
分布式HLR技术及引入策略研究;王钢;《现代电信科技》;20080815(第8期);全文 *
基于分布式统一数据中心的G/C双网运营方案;中兴通讯股份有限公司;《通信世界》;20090608(第21期);第2节分布式,第3节数据融合与业务融合 *
大容量分布式HLR的引入策略研究;邱钧;《移动通信》;20090823(第16期);全文 *

Also Published As

Publication number Publication date
CN102056207A (en) 2011-05-11
WO2010145428A1 (en) 2010-12-23

Similar Documents

Publication Publication Date Title
CN102870464B (en) For the storage of the adjacent cell information of fast quick-recovery in the situation that of handoff failure
US20140254347A1 (en) Ethernet Ring Protection Switching Method, Node, and System
RU2423008C2 (en) METHOD AND SYSTEM FOR AUTOMATIC PROTECTION OF Ethernet NETWORK
CN103001799B (en) Method and node for achieving redundancy on basis of chain network
CN102257848B (en) Main and secondary apparatuses conversion method betwenn communication equipment, communication equipment and system, and request equipment of system and service
CN102761479B (en) Link selecting method and device
CN100563392C (en) Call recovery method when network failure and system
US8213319B2 (en) Fault location
US8078201B2 (en) Method and system for shunting short messages
CN101860492A (en) Quick switching method, device and system
JPH04504347A (en) Automatic failure recovery in packet networks
CN102265555A (en) Conversion method between an main communicational equipment and a spare communicational equipment, the communicational equipments and a service request equipment
US8331950B2 (en) Mobile communication system, radio network controller and method
CN101729305A (en) Method and system for automatically restoring fault, and control network element
CN101827032A (en) Method and device for converging two-layer multi-cast network
CN103873336A (en) Distributed resilient network interconnection-based service bearing method and device
CN102056207B (en) Method and system for realizing disaster recovery and switching
CN101232402A (en) Wireless network system and communication method thereof
CN102265556B (en) Conversion method between an main communicational equipment and a spare communicational equipment, the communicational equipments and a service request equipment
CN103119893B (en) A kind of APS service channel changing method, system and APS equipment
US7035939B2 (en) Method for balancing load on a plurality of switching apparatus
US20010034790A1 (en) Communication apparatus and congestion regulation control method
CN103190121B (en) Ethernet ring protection switching reverse method, node and system
CN102045185A (en) User information backup method and device
CN101330400A (en) Share backup method for baseband collocation resource

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant