CN1996969A - Integrated access media gateway device based on the high-available system - Google Patents

Publication number
CN1996969A
Authority
CN
China
Prior art keywords
standby
module
master
active
state
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2006101252058A
Other languages
Chinese (zh)
Other versions
CN100496030C (en)
Inventor
王波
徐立
沈振刚
赵幸
黄勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZHONGGUANG COMMUNICATION CO WUHAN
Original Assignee
ZHONGGUANG COMMUNICATION CO WUHAN
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZHONGGUANG COMMUNICATION CO WUHAN
Priority to CNB2006101252058A (CN100496030C)
Publication of CN1996969A
Application granted
Publication of CN100496030C
Active (current legal status)

Landscapes

  • Hardware Redundancy (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

This invention discloses an integrated access media gateway device based on a high-availability system. Its key points are: the master control unit adopts an active/standby redundant fault-tolerant architecture synchronized in warm standby mode; the state and critical data of the active and standby upper-layer task modules are refreshed in real time to keep active and standby synchronized; and the state of the upper-layer task module is monitored in real time, with an active/standby switchover forced when a fault occurs, so that the device remains available.

Description

Integrated access media gateway device based on a high-availability system
Technical field
The present invention relates to integrated access media gateway devices in the field of network communications, and in particular to an integrated access media gateway device based on a high-availability system.
Background art
The softswitch is the core of the next generation network (NGN). It is a new communication-network technology that is based on packet switching and implements switching and call control management in software. The softswitch network is the current direction of telecommunication network development: it carries access data of various protocols in packet form over an IP backbone, thereby converging different networks.
The integrated access media gateway (AG) sits at the access layer of the softswitch network. Under the control of the softswitch, it provides access for various kinds of terminal equipment (such as conventional telephone access, various kinds of broadband access, and wireless access) and converts media streams. Making the AG highly available therefore greatly improves the reliability of the access network and, with it, the quality of service (QoS) of the whole softswitch network.
High availability (HA) means using hardware or software redundancy to reduce the downtime caused by routine maintenance and system failures, so that the system runs continuously. The availability target for telecommunication equipment is 99.999%, which means that downtime from all causes must not exceed 5.256 minutes per year.
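The 5.256-minute figure follows directly from the availability target. The sketch below shows the arithmetic, assuming a 365-day year; the function name `max_downtime_minutes` is illustrative and does not come from the patent.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600 minutes in a 365-day year


def max_downtime_minutes(availability: float) -> float:
    """Return the yearly downtime budget, in minutes, for a given availability."""
    return (1.0 - availability) * MINUTES_PER_YEAR


# "Five nines" availability, rounded to three decimal places.
print(round(max_downtime_minutes(0.99999), 3))
```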
1. Basic principles of building high-availability equipment
For a communication device, high availability is generally built on two levels:
(1) hardware-based high availability;
(2) software-based high availability.
This requires both the hardware and the software of the system to have fault-tolerance mechanisms: when a hardware circuit or a software module behaves abnormally or fails, the error must be detected, located, isolated, and recovered from.
(1) Error detection. The whole system is checked for errors in two ways. In the active way, a dedicated detection module actively probes the system. In the passive way, each component reports to the error detection module on its own initiative when it fails. In practice the two are used together.
(2) Error location. The received error reports are analyzed together and the error type is determined, so that the information in the reports can be acted on accordingly.
(3) Error isolation. The faulty module is made unavailable so that the error does not spread to the modules connected to it.
(4) Error recovery. Measures are taken so that the whole system continues to work normally. The most common recovery method is a restart.
2. High availability (HA) based on redundant backup
Redundant backup means giving the key hardware and software modules of the device 1+1 backup. While the system runs, the standby acts as a backup of the active; when the active becomes unavailable for any reason, the standby takes over so that the system keeps running without interruption.
The key to HA is how to control the takeover of the active by the standby, and how to keep critical data and state synchronized between active and standby. Three backup modes are introduced below:
(1) Hot standby
Active and standby are both in the working state and process the input at the same time, so the states and data parameters in the active and standby modules are always consistent. On the output side, only the output of the current active is used as the system output. When the active fails, the system selects the standby output as its output, and the standby becomes the active.
(2) Warm standby
The active receives the input and provides service to the outside. The standby, once its processes have finished initializing, stays in the standby state, ready to take over at any time. During normal operation the active sends changes to critical data parameters to the standby in real time; the standby only receives these state refresh messages from the active and updates its own copy of the critical data parameters. When the active fails, the HA system management module notifies the standby to switch to the active state and take over.
(3) Cold standby
The active receives the input and produces the output. The standby receives neither the input nor state refresh messages from the active. During normal operation the active writes system state refresh information to a stable storage device. When the active fails, the HA manager notifies the standby to read the current system state from the stable storage device, and the standby switches to active.
From this analysis, the advantage of hot standby is a short switchover time; but because active and standby have identical hardware and software and run in lockstep, they are prone to failing at the same time, which makes the whole system unavailable. Cold standby is simple and reliable in principle, but because active and standby are not synchronized while the system runs, its switchover time is longer. Warm standby keeps active and standby synchronized at run time, has a short switchover time and high reliability, and is therefore widely used to implement redundancy-based high availability.
Summary of the invention
The purpose of the present invention is to provide an integrated access media gateway device based on a high-availability system. The system improves the availability of the access media gateway device and its competitiveness in the telecommunications market. (High availability is currently a focus of telecommunication equipment research and a core technology of such equipment, and is therefore also a trade secret of the major telecom equipment manufacturers.)
The purpose of the present invention is achieved as follows:
The key hardware and software, namely the master control unit of the integrated access media gateway device (high-availability system 10), adopt an active/standby redundant fault-tolerant architecture, and active and standby are synchronized in warm standby mode.
(1) The state and critical data of the active and standby upper-layer task modules 120 and 220 are refreshed in real time to keep active and standby synchronized.
(2) The state of the active upper-layer task module 120 is monitored in real time, and an active/standby switchover is forced when a fault occurs, so that the device remains available.
Specifically, as shown in Fig. 1, the integrated access media gateway device (the device, for short) comprises an input/output circuit unit 300 and a service interface unit 400. It is also provided with a high-availability system 10, which consists of an active master control unit 100 and a standby master control unit 200 that are identical in structure and function.
The active and standby master control units 100 and 200 are each connected to the input/output circuit unit 300 and to the service interface unit 400.
The active master control unit 100 comprises an active control module 110, an active upper-layer task module 120, and an active low-level operation support interface 130.
The active control module 110 comprises an active message distribution module 111, an active task management module 112, an active heartbeat detection module 113, an active load management module 114, an active switchover/state management module 115, and an active synchronous refresh module 116.
Between the active and standby master control units 100 and 200, the synchronous refresh modules 116 and 216 exchange the state refresh messages of the upper-layer task modules 120 and 220, and the message distribution modules 111 and 211 exchange the control messages of the control modules 110 and 210.
Inside the active and standby control modules 110 and 210:
The message distribution modules 111 and 211 distribute and forward messages between the submodules of the high-availability system 10.
The task management modules 112 and 212 manage and monitor the state of the upper-layer task modules 120 and 220.
The heartbeat detection modules 113 and 213 use heartbeat messages to manage and monitor the state of the link between the master control units 100 and 200.
The load management modules 114 and 214 start and load the software modules in the master control units 100 and 200.
The switchover/state management modules 115 and 215 handle the state changes of the master control units 100 and 200 and the active/standby switchover on a fault.
The synchronous refresh modules 116 and 216 synchronize critical data and state between the upper-layer task modules 120 and 220.
The working principle of the present invention is as follows:
The high-availability system 10 uses active/standby redundancy. The two boards are identical in hardware and software; they differ only in that each is loaded in active or standby mode at initialization. During normal operation, the hardware and software on the active master control board are in the working state, and the high-availability system 10 mainly monitors the heartbeat messages between active and standby and keeps their state synchronized.
The upper-layer task module is the core software of the master control unit and implements the main functions of the integrated access gateway. If any protocol layer in the upper-layer task module fails, the whole device becomes unavailable. The upper-layer task module in the standby master control unit 200 therefore serves as a backup of the one in the active master control unit. State refresh messages synchronize the critical data and state of the upper-layer task modules between active and standby.
The high-availability system 10 implements: the exchange of control and data between the master control units 100 and 200; the synchronization of data and state between the upper-layer task modules 120 and 220; and the state management and switchover of the master control units 100 and 200.
The present invention has the following advantages and positive effects:
1. Because the high-availability system 10 can detect errors that may occur in each hardware and software component, the device is highly reliable.
2. The switchover time is very short. When an error occurs, the standby takes over from the active within a very short time, and services in progress are not interrupted.
3. Online upgrade is supported: while the device is running, the software of the standby and then of the active is upgraded in turn, without affecting normal use of the device or interrupting services in progress.
The procedure is as follows. The standby is shut down and upgraded; once the upgrade succeeds, it is loaded in standby mode and all state is refreshed to synchronize active and standby. After synchronization, a switchover command is sent to the active and the standby to perform an active/standby switchover, so that the device now runs the upgraded software as the active. The standby that results from the switchover is then upgraded and, once the upgrade succeeds, loaded in standby mode. This completes the software upgrade of the whole device, during which the device never becomes unavailable.
4. The high-availability system 10 is not only suitable for integrated access media gateway devices but can also be ported easily to other telecommunication equipment.
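The online upgrade procedure described under advantage 3 can be sketched as four ordered steps. This is only an illustration of the ordering: the `Unit` class, the unit names, and the version strings are assumptions, and the actual shutdown, loading, and state refresh are represented by log entries rather than modeled.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Unit:
    name: str
    role: str      # "active" or "standby"
    version: str


def online_upgrade(active: Unit, standby: Unit, new_version: str) -> List[str]:
    """Upgrade both units in turn while one of them always stays active."""
    log = []
    # 1. Shut down the standby, upgrade it, and reload it in standby mode.
    standby.version = new_version
    log.append(f"upgraded {standby.name} while {active.name} stayed active")
    # 2. Refresh all state so that active and standby are synchronized.
    log.append(f"refreshed state {active.name} -> {standby.name}")
    # 3. Switch over: the upgraded unit becomes the active.
    active.role, standby.role = "standby", "active"
    active, standby = standby, active
    log.append(f"switched over, {active.name} is now active")
    # 4. Upgrade the new standby (the old active) and reload it in standby mode.
    standby.version = new_version
    log.append(f"upgraded {standby.name} while {active.name} stayed active")
    return log
```

At every step exactly one unit holds the active role, which is why the device stays available throughout.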
Description of drawings
Fig. 1 is a block diagram of the basic architecture of the present invention;
Fig. 2 is the state transition diagram of the high-availability system 10;
Fig. 3 is the state refresh flow chart;
Fig. 4 is the workflow diagram of the active and standby heartbeat detection modules 113 and 213;
Fig. 5 is the flow chart of the switchover caused by a heartbeat failure detected by the standby;
Fig. 6 is the flow chart of the switchover initiated by the active master control unit 100;
Fig. 7 is the flow chart of the manual active/standby switchover.
Wherein:
10 - high-availability system, comprising the active master control unit 100 and the standby master control unit 200;
100 - active master control unit,
110 - active control module (core module),
111 - active message distribution module,
112 - active task management module,
113 - active heartbeat detection module,
114 - active load management module,
115 - active switchover/state management module,
116 - active synchronous refresh module;
120 - active upper-layer task module;
130 - active low-level operation support interface;
200 - standby master control unit,
210 - standby control module (core module),
211 - standby message distribution module,
212 - standby task management module,
213 - standby heartbeat detection module,
214 - standby load management module,
215 - standby switchover/state management module,
216 - standby synchronous refresh module;
220 - standby upper-layer task module;
230 - standby low-level operation support interface;
300 - input/output circuit unit;
400 - service interface unit.
A - active and standby both normal; B - active faulty, standby normal; C - active normal, standby faulty;
D - active and standby both faulty; E - switchover intermediate state;
a - active fault; b - manual switchover initiated; c - manual switchover completed;
d - standby fault recovered; e - standby fault; f - automatic switchover initiated;
g - automatic switchover completed; h - active fault while the active is normal and the standby is faulty;
i - simultaneous fault of active and standby;
Main English abbreviation:
NGN-next generation network;
Softswitch-soft switch;
AG-integrated access media gateway;
HA-high availability.
Embodiment
The invention is further described below with reference to the drawings and an embodiment:
I. Functional units of the device
1. Input/output circuit unit 300
The input/output circuit unit 300 is a hardware module that connects the uplink interface of the device to the high-availability system 10.
2. Service interface unit 400
The service interface unit 400 is a hardware module that connects the service interface boards of the device to the high-availability system 10.
3. High-availability system 10
The high-availability system 10 consists of the active master control unit 100 and the standby master control unit 200. Data arriving from upper-level communication equipment is routed through the input/output circuit unit 300 to the master control unit that is currently in the active state (the active master control unit 100), which processes it. Information arriving from the subscriber service access circuit boards is routed through the service interface unit 400 to the same active master control unit 100 and is processed there.
Active and standby upper-layer task modules 120 and 220
The active upper-layer task module 120 is the working software module of the active master control unit 100. It handles the workflows and tasks of the whole integrated media gateway device and is the concrete implementation of the device's functions.
The standby upper-layer task module 220 is the working software module of the standby master control unit 200 and runs in the standby state. When the active master control unit 100 fails, the standby master control unit 200 takes over as active; the standby upper-layer task module 220 thereby takes over from the active upper-layer task module 120, and the integrated media gateway device keeps working.
Active and standby low-level operation support interfaces 130 and 230
The active low-level operation support interface 130 is the interface for operating the hardware on the active master control unit 100; the standby low-level operation support interface 230 is the interface for operating the hardware on the standby master control unit 200.
Active and standby control modules 110 and 210
1) Active and standby message distribution modules 111 and 211
In Fig. 1, the message distribution modules 111 and 211 forward messages between the modules of the high-availability system 10, including the control messages and data exchanged between modules on the active and standby sides. The other modules of the high-availability system 10 do not pass messages to each other directly.
2) Active and standby synchronous refresh modules 116 and 216
In Fig. 1, the synchronous refresh modules 116 and 216 together carry the critical data and state information between the active and standby upper-layer task modules, keeping active and standby synchronized so that the standby can take over at any time during operation. The synchronous refresh modules 116 and 216 have no interface to the other modules of the high-availability system 10; they are the entity through which the high-availability system 10 achieves active/standby synchronization, and they serve only the upper-layer task modules.
3) Active and standby load management modules 114 and 214
The load management modules 114 and 214 load and start the software modules in the master control units. There are three load modes: active load, standby load, and restart load, used respectively for starting the active master control unit 100, starting the standby master control unit 200, and restarting either master control unit 100 or 200. When a master control unit 100 or 200 starts, it first reads its active/standby state from the hardware logic circuit and then selects the load procedure.
4) Active and standby heartbeat detection modules 113 and 213
The heartbeat detection modules 113 and 213 implement heartbeat detection between the master control units 100 and 200. When a heartbeat detection module detects a heartbeat failure, it notifies the switchover/state management module 115 or 215 to take the corresponding action.
5) Active and standby switchover/state management modules 115 and 215
The switchover/state management modules 115 and 215 handle the state changes of the high-availability system 10 and the active/standby switchover on a fault. Five states of the high-availability system 10 are defined: active and standby both normal (A); active faulty, standby normal (B); active normal, standby faulty (C); active and standby both faulty (D); and switchover intermediate state (E). Two switchover commands are defined: (1) forced switchover and (2) manual switchover. A forced switchover is the automatic switchover that the system performs after a hardware or software fault is detected. A manual switchover is a switchover command issued by the network management system; the device has not necessarily failed at that point.
6) Active and standby task management modules 112 and 212
The task management modules 112 and 212 have the same workflow. Each monitors its local tasks, in one of two ways: (1) it polls the task state periodically through system calls and sends an alarm message to the switchover/state management module when a task is found to be hung or to have terminated abnormally; (2) it monitors tasks through heartbeat messages and sends an alarm message to the switchover/state management module when a task's heartbeat times out. The task management module also carries every interface between the high-availability system 10 and the upper-layer task module, including heartbeat detection, switchover notification, data backup request, and data backup response messages.
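The second monitoring method of the task management module, raising an alarm when a task's heartbeat times out, can be sketched as follows. The class name `TaskMonitor`, the timeout value, and the task names are assumptions made for illustration; timestamps are passed in explicitly so that the behavior is deterministic.

```python
class TaskMonitor:
    """Tracks the last heartbeat of each local task and raises alarms on timeout."""

    def __init__(self, timeout_s, on_alarm):
        self.timeout_s = timeout_s    # maximum allowed silence, in seconds
        self.on_alarm = on_alarm      # stands in for the alarm message sent to
                                      # the switchover/state management module
        self.last_seen = {}           # task name -> time of last heartbeat

    def heartbeat(self, task, now):
        """Record a heartbeat message received from a task at time `now`."""
        self.last_seen[task] = now

    def check(self, now):
        """Raise an alarm for every task whose heartbeat has timed out."""
        for task, seen in self.last_seen.items():
            if now - seen > self.timeout_s:
                self.on_alarm(task)
```

For example, with a 3-second timeout, a task last heard from at t = 0 is reported when `check(4.0)` runs, while a task heard from at t = 2 is not.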
II. State transitions of the high-availability system 10
As shown in Fig. 2, five states of the high-availability system 10 and nine causes of state transitions are defined:
1. States of the high-availability system 10
The five states of the high-availability system 10 are: active and standby both normal (A); active faulty, standby normal (B); active normal, standby faulty (C); active and standby both faulty (D); and switchover intermediate state (E).
2. Causes of state transitions
The nine causes of state transitions are: active fault (a); manual switchover initiated (b); manual switchover completed (c); standby fault recovered (d); standby fault (e); automatic switchover initiated (f); automatic switchover completed (g); simultaneous fault of active and standby (h); and active fault while the active is normal and the standby is faulty (i).
Fig. 2 is described below:
1. Under normal conditions, the high-availability system 10 is in state A (active and standby both normal).
2. In state A, if the active master control unit 100 fails (active fault a), the system goes through the following steps in order:
(1) it enters state B (active faulty, standby normal);
(2) it initiates an automatic switchover (f);
(3) it enters the switchover intermediate state E;
(4) the automatic switchover completes (g);
(5) it enters state C (active normal, standby faulty);
(6) the standby fault is recovered (d);
(7) it returns to state A, and the system again works normally.
3. In state A, if a standby fault (e) occurs, the system enters state C. The system then initiates standby fault recovery (d) and finally returns to state A.
4. In state A, if the active master control unit 100 receives an active/standby switchover command, the system initiates a manual switchover (b) and enters the switchover intermediate state E. When the manual switchover completes (c), the system returns to state A.
5. In state A, if the master control units 100 and 200 fail at the same time (simultaneous fault h), the system enters state D (active and standby both faulty). In state D the system cannot work normally.
6. In state C, the system runs in a degraded ("sub-healthy") condition with no backup master control unit 200. If the active master control unit 100 fails at this point, the system enters state D, in which it cannot work normally.
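The transitions described for Fig. 2 can be collected into a lookup table. The sketch below is one way to write them down; the cause names are illustrative labels chosen here rather than identifiers from the patent, and rejecting unlisted transitions with an exception is an assumption of this sketch.

```python
from enum import Enum


class State(Enum):
    A = "active and standby both normal"
    B = "active faulty, standby normal"
    C = "active normal, standby faulty"
    D = "active and standby both faulty"
    E = "switchover intermediate state"


# (current state, cause) -> next state, as read from the description of Fig. 2.
TRANSITIONS = {
    (State.A, "active_fault"): State.B,
    (State.B, "auto_switchover_initiated"): State.E,
    (State.E, "auto_switchover_completed"): State.C,
    (State.C, "standby_recovered"): State.A,
    (State.A, "standby_fault"): State.C,
    (State.A, "manual_switchover_initiated"): State.E,
    (State.E, "manual_switchover_completed"): State.A,
    (State.A, "simultaneous_fault"): State.D,
    (State.C, "active_fault"): State.D,
}


def step(state: State, cause: str) -> State:
    """Return the next state, rejecting transitions the description does not list."""
    try:
        return TRANSITIONS[(state, cause)]
    except KeyError:
        raise ValueError(f"no transition from {state.name} on {cause!r}") from None


def run(causes, start=State.A):
    """Apply a sequence of causes and return the final state."""
    state = start
    for cause in causes:
        state = step(state, cause)
    return state
```

Feeding `run` the sequence active fault, automatic switchover initiated, automatic switchover completed, standby recovered walks A, B, E, C and back to A, which is the seven-step recovery path of item 2.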
III. State refresh flow of the high-availability system 10
The key to HA is: (1) how to keep critical data and state synchronized between active and standby; (2) how to control the takeover of the active by the standby, that is, the switchover.
Keeping critical data and state synchronized between active and standby means keeping them synchronized between the upper-layer task modules 120 and 220 of the master control units 100 and 200. This is done with a state refresh mechanism. As shown in Fig. 3, state is refreshed between the upper-layer task modules 120 and 220 through the synchronous refresh modules 116 and 216. The flow is, in order: 6A, 600, 601, 602, 603, 6B, 604, 605, 606, 6C.
6A - the internal state of the active upper-layer task module 120 changes during operation.
600 - the active upper-layer task module 120 sends a "state refresh" message, containing the changed data, to the active synchronous refresh module 116.
601 - the active synchronous refresh module 116 sends the "state refresh" message to the standby synchronous refresh module 216.
602 - the standby synchronous refresh module 216 sends the "state refresh" message to the standby upper-layer task module 220.
6B - the standby upper-layer task module 220 updates its own internal state and parameters according to the data in the message.
604 - the standby upper-layer task module 220 sends "state refresh complete" to the standby synchronous refresh module 216.
605 - the standby synchronous refresh module 216 sends "state refresh complete" to the active synchronous refresh module 116.
606 - the active synchronous refresh module 116 sends "state refresh complete" to the active upper-layer task module 120.
6C - the state refresh is complete.
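The refresh handshake of Fig. 3 can be sketched in a few lines. This is a simplified in-process model under stated assumptions: the class names are invented for illustration, the two synchronous refresh modules 116 and 216 are collapsed into one `SyncLink` object, and the data key in the usage note is hypothetical.

```python
REFRESH_COMPLETE = "state refresh complete"


class UpperTask:
    """Stands in for an upper-layer task module (120 or 220)."""

    def __init__(self):
        self.state = {}


class SyncLink:
    """Stands in for the pair of synchronous refresh modules 116 and 216."""

    def __init__(self, standby_task):
        self.standby_task = standby_task

    def refresh(self, changes):
        # 601, 602: the message travels 116 -> 216 -> standby task module 220.
        # 6B: the standby task updates its own copy of the changed data.
        self.standby_task.state.update(changes)
        # 604-606: the acknowledgement travels back 220 -> 216 -> 116 -> 120.
        return REFRESH_COMPLETE


def change_state(active_task, link, changes):
    """Apply a change on the active side and push it to the standby."""
    active_task.state.update(changes)       # 6A: active state changes
    ack = link.refresh(dict(changes))       # 600: send only the changed data
    return ack == REFRESH_COMPLETE          # 6C: refresh finished
```

After `change_state(active, link, {"port_7": "off_hook"})` returns `True`, the standby task holds the same `port_7` entry as the active task, so it could take over with current data.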
IV. Switchover flows of the high-availability system 10
Switchovers between the master control units 100 and 200 fall into two kinds:
(1) Forced switchover:
The automatic switchover performed after the system detects a fault in the active master control unit 100. The fault is detected either through a heartbeat detection failure or through detection of a fault in the active upper-layer task module.
(2) Manual switchover:
A switchover command issued by the network management system; the master control units have not necessarily failed at that point.
1. Workflow of the active and standby heartbeat detection modules 113 and 213
As shown in Fig. 4, active and standby periodically send each other heartbeat request messages and wait for the peer's response. If a heartbeat response is received, the peer is considered to be working normally; otherwise the peer is considered faulty, and a peer failure message is sent to the switchover/state management module for handling. If the failed unit is the active master control unit 100, an active/standby switchover is performed; if it is the standby master control unit 200, the high-availability system 10 enters state C (active normal, standby faulty).
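The decision made after each heartbeat exchange can be written as a small function. This sketch covers only the branching described for Fig. 4; the function name and the returned action labels are illustrative, and timers and message transport are left out.

```python
def handle_heartbeat(local_role: str, response_received: bool) -> str:
    """Decide what a unit does after one heartbeat request to its peer.

    local_role is the role of the unit that sent the request,
    either "active" or "standby".
    """
    if local_role not in ("active", "standby"):
        raise ValueError(f"unknown role: {local_role!r}")
    if response_received:
        return "peer_normal"
    if local_role == "standby":
        # The silent peer is the active unit: perform a switchover.
        return "switchover"
    # The silent peer is the standby unit: keep running without a backup,
    # which corresponds to state C (active normal, standby faulty).
    return "enter_state_C"
```

The asymmetry is the point of the flow: a missed response leads to a switchover only when it is the active unit that has gone silent.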
2. Workflow of a forced switchover initiated by the standby after a heartbeat failure
As shown in Fig. 5, after the standby master control unit 200 detects a heartbeat failure of the active master control unit 100, the switchover proceeds in this order: 7A, 700, 701, 702, 703, 7B, 704, 705, 706, 7C.
7A - the standby heartbeat detection module 213 detects a heartbeat failure of the active.
700 - the standby heartbeat detection module 213 sends a "heartbeat failure" message to the standby switchover/state management module 215.
701 - the standby switchover/state management module 215 sends a "switchover request" to the standby message distribution module 211.
702 - the standby message distribution module 211 sends the "switchover request" to the standby task management module 212.
703 - the standby task management module 212 sends the "switchover request" to the standby upper-layer task module 220.
7B - the standby upper-layer task module 220 performs the switchover.
704 - the standby upper-layer task module 220 sends "switchover complete" to the standby task management module 212.
705 - the standby task management module 212 sends "switchover complete" to the standby message distribution module 211.
706 - the standby message distribution module 211 sends "switchover complete" to the standby switchover/state management module 215.
7C - the standby master control unit 200 has finished switching to active.
3, detect the main pressure switching flow of using after breaking down
As Fig. 6, primary, spare task management module 112,212 has the function of the working condition that detects primary, spare upper strata task module 120,220, when the master detects main fault with upper strata task module 120 with task management module 112 after, to initiate change action, it is main with master control unit 100 that standby master control unit 200 is taken over.Concrete workflow is followed successively by: 8A, 800,801,802,803,804,8B, 805,806,807,808,8C.
8A - The active task management module 112 detects that the active upper-layer task module 120 has failed.
800 - The active task management module 112 sends a "handoff request" to the active switching/state management module 115.
801 - The active switching/state management module 115 sends a "handoff request" to the active message distribution module 111.
802 - The active message distribution module 111 sends a "handoff request" to the standby message distribution module 211.
803 - The standby message distribution module 211 sends a "handoff request" to the standby switching/state management module 215.
804 - The standby switching/state management module 215 sends a "handoff request" to the standby task management module 212.
8B - The standby task management module 212 notifies the standby upper-layer task module 220 to perform the switchover.
805 - After the switchover is finished, the standby task management module 212 sends "switching is finished" to the standby switching/state management module 215.
806 - The standby switching/state management module 215 sends "switching is finished" to the standby message distribution module 211.
807 - The standby message distribution module 211 sends "switching is finished" to the active message distribution module 111.
808 - The active message distribution module 111 sends "switching is finished" to the active switching/state management module 115.
8C - A fault recovery operation, such as a restart, is performed.
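The message path of steps 800 to 808 can be sketched as a list of hops and checked for consistency. This is only an illustrative model: the `Hop` type, the module labels such as `task_mgmt_112`, and the helper functions are hypothetical names chosen here, with each label ending in the reference numeral used in the text. Steps 8A, 8B, and 8C are local actions rather than messages, so they appear only as comments.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hop:
    step: str      # step label from the workflow
    sender: str    # hypothetical module label ending in its reference numeral
    receiver: str
    message: str


REQ = "handoff request"
DONE = "switching is finished"

# 8A: task management 112 detects the fault in upper-layer task module 120.
FORCED_SWITCHOVER = [
    Hop("800", "task_mgmt_112", "switch_state_115", REQ),
    Hop("801", "switch_state_115", "msg_dist_111", REQ),
    Hop("802", "msg_dist_111", "msg_dist_211", REQ),
    Hop("803", "msg_dist_211", "switch_state_215", REQ),
    Hop("804", "switch_state_215", "task_mgmt_212", REQ),
    # 8B: task management 212 has upper-layer task module 220 switch over.
    Hop("805", "task_mgmt_212", "switch_state_215", DONE),
    Hop("806", "switch_state_215", "msg_dist_211", DONE),
    Hop("807", "msg_dist_211", "msg_dist_111", DONE),
    Hop("808", "msg_dist_111", "switch_state_115", DONE),
    # 8C: a fault recovery operation such as a restart follows.
]


def is_contiguous(hops):
    """True if every hop starts at the module where the previous hop ended."""
    return all(a.receiver == b.sender for a, b in zip(hops, hops[1:]))


def crossings(hops):
    """Steps whose message crosses between unit 100 (1xx) and unit 200 (2xx).

    The hundreds digit of the reference numeral is the third character
    from the end of each label.
    """
    return [h.step for h in hops if h.sender[-3] != h.receiver[-3]]


if __name__ == "__main__":
    print(is_contiguous(FORCED_SWITCHOVER))
    print(crossings(FORCED_SWITCHOVER))
```

In this model only the two message distribution modules exchange messages across the units, which matches the role the text assigns to modules 111 and 211.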
4. Manual active/standby switchover flow issued from network management
As shown in Fig. 6, the manual active/standby switchover command is a switching command that the equipment administrator issues through the network management program. The command reaches the active task management module 112 via the active upper-layer task module 120, and the active task management module 112 then initiates the switchover. The workflow proceeds in the following order: 9A, 900, 901, 902, 903, 904, 9B, 905, 906, 907, 908, 909, 9C, 910, 9D.
9A - The active task management module 112 receives the manual switchover request.
900 - The active task management module 112 sends a "handoff request" to the active switching/state management module 115.
901 - The active switching/state management module 115 sends a "handoff request" to the active message distribution module 111.
902 - The active message distribution module 111 sends a "handoff request" to the standby message distribution module 211.
903 - The standby message distribution module 211 sends a "handoff request" to the standby switching/state management module 215.
904 - The standby switching/state management module 215 sends a "handoff request" to the standby task management module 212.
9B - The standby task management module 212 notifies the standby upper-layer task module 220 to perform the switchover.
905 - After the switchover is finished, the standby task management module 212 sends "switching is finished" to the standby switching/state management module 215.
906 - The standby switching/state management module 215 sends "switching is finished" to the standby message distribution module 211.
907 - The standby message distribution module 211 sends "switching is finished" to the active message distribution module 111.
908 - The active message distribution module 111 sends "switching is finished" to the active switching/state management module 115.
909 - The active switching/state management module 115 sends "switching is finished" to the active task management module 112.
9C - The active task management module 112 notifies the active upper-layer task module 120 to perform the switchover.
910 - After the switchover is finished, the active task management module 112 sends "switching is finished" to the active switching/state management module 115.
9D - The manual active/standby switchover is finished.
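The ordering of the manual flow, in which the standby side switches first (9B) and the active side switches only after the completion message has returned (9C), can be illustrated with a small simulation. The `Unit` and `HaPair` classes and the trace strings are hypothetical, and the sketch assumes that the switchover in 9C turns the former active unit into the standby, which the text implies but does not state outright.

```python
from dataclasses import dataclass, field


@dataclass
class Unit:
    name: str
    role: str  # "active" or "standby"


@dataclass
class HaPair:
    active: Unit
    standby: Unit
    trace: list = field(default_factory=list)

    def manual_switchover(self):
        """Replay steps 9A to 9D in order and swap the two roles."""
        old_active, old_standby = self.active, self.standby
        self.trace.append("9A: request received by active task management")
        self.trace.append("900-904: handoff request relayed to standby side")
        old_standby.role = "active"   # 9B: standby upper-layer tasks switch
        self.trace.append("9B: standby upper-layer tasks switched")
        self.trace.append("905-909: completion relayed back to active side")
        old_active.role = "standby"   # 9C: assumed demotion of the old active
        self.trace.append("9C: former active upper-layer tasks switched")
        self.active, self.standby = old_standby, old_active
        self.trace.append("9D: manual switchover finished")


if __name__ == "__main__":
    pair = HaPair(Unit("unit_100", "active"), Unit("unit_200", "standby"))
    pair.manual_switchover()
    print(pair.active.name, pair.standby.name)
    for line in pair.trace:
        print(line)
```

Promoting the standby before demoting the old active means there is a short interval in which both units hold the active role; that interval corresponds to the switching intermediate state in the text.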

Claims (8)

1. An integrated access media gateway device based on a high-availability system, comprising an input/output circuit unit (300) and a service interface unit (400), characterized in that:
A high-availability system (10) is provided, the high-availability system (10) consisting of structurally identical active and standby master control units (100 and 200);
The active and standby master control units (100 and 200) are each connected to the input/output circuit unit (300) and the service interface unit (400);
The active master control unit (100) comprises an active control module (110), an active upper-layer task module (120), and an active underlying operation support interface (130);
The active control module (110) comprises an active message distribution module (111), an active task management module (112), an active heartbeat detection module (113), an active loading management module (114), an active switching/state management module (115), and an active synchronous refresh module (116);
Between the active and standby master control units (100, 200), the active and standby upper-layer task modules (120, 220) exchange state refresh information through the active and standby synchronous refresh modules (116 and 216), and the active and standby control modules (110 and 210) exchange control information through the active and standby message distribution modules (111 and 211).
2. The integrated access media gateway device based on a high-availability system according to claim 1, characterized in that:
The active and standby message distribution modules (111, 211) are responsible for forwarding messages between the modules of the high-availability system (10), including the transfer of control information and data between different modules on the active and standby sides.
3. The integrated access media gateway device based on a high-availability system according to claim 1, characterized in that:
The active and standby synchronous refresh modules (116, 216) are jointly responsible for transferring critical data and state information between the active and standby upper-layer task modules, so that the two sides stay synchronized during operation and the standby can take over from the active at any time.
4. The integrated access media gateway device based on a high-availability system according to claim 1, characterized in that:
The active and standby loading management modules (114, 214) are responsible for loading and starting each software module in the master control unit; the loading modes are active loading, standby loading, and restart loading, which apply respectively to: startup of the active master control unit (100); startup of the standby master control unit (200); and restart of the active or standby master control unit (100, 200).
5. The integrated access media gateway device based on a high-availability system according to claim 1, characterized in that:
The active and standby heartbeat detection modules (113, 213) implement heartbeat detection between the active and standby master control units (100, 200); when a heartbeat detection module detects a heartbeat failure, it notifies the active and standby switching/state management modules (115, 215) to take the corresponding action.
6. The integrated access media gateway device based on a high-availability system according to claim 1, characterized in that:
The active and standby switching/state management modules (115, 215) are responsible for the state changes of the high-availability system (10) and for active/standby switchover when a fault occurs; five states of the high-availability system (10) are defined: active and standby both normal (A); active faulty, standby normal (B); active normal, standby faulty (C); active and standby both faulty (D); switching intermediate state (E); the defined switching commands are: (1) forced switchover and (2) manual switchover.
7. The integrated access media gateway device based on a high-availability system according to claim 1, characterized in that:
The active and standby task management modules (112, 212) are responsible for monitoring local tasks, using the following two detection methods:
1. Task status is queried periodically through system calls; when a task is found to be hung or to have terminated abnormally, an alarm message is sent to the switching/state management module;
2. Tasks are monitored through heartbeat messages; when a task's heartbeat times out, an alarm message is sent to the switching/state management module.
8. The integrated access media gateway device based on a high-availability system according to claim 1, characterized in that the state transitions of the high-availability system (10) are:
1) Under normal operation, the high-availability system (10) is in the active and standby both normal (A) state;
2) In the active and standby both normal (A) state, if the active master control unit (100) fails, that is, an active fault (a) occurs, the system proceeds in the following order: it enters the active faulty, standby normal (B) state; initiates automatic switchover (f); enters the switching intermediate state (E); completes automatic switchover (g); enters the active normal, standby faulty (C) state; performs standby fault recovery (d); and enters the active and standby both normal (A) state, where it again works normally;
3) In the active and standby both normal (A) state, if a standby fault (e) occurs, the system enters the active normal, standby faulty (C) state; the system then initiates standby fault recovery (d) and finally returns to the active and standby both normal (A) state;
4) In the active and standby both normal (A) state, if the active master control unit (100) receives an active/standby switchover command, the system initiates manual switchover (b) and enters the switching intermediate state (E); after the manual switchover is finished (c), the system returns to the active and standby both normal (A) state;
5) In the active and standby both normal (A) state, if the active and standby master control units (100, 200) fail at the same time, that is, a simultaneous active and standby fault (h) occurs, the system enters the active and standby both faulty (D) state; in the active and standby both faulty (D) state the system cannot work normally;
6) In the active normal, standby faulty (C) state, the system runs in a sub-healthy state with no standby master control unit (200) backing it up. If the active master control unit (100) fails at this time, the system enters the active and standby both faulty (D) state; in the active and standby both faulty (D) state the system cannot work normally.
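The state transitions listed in claim 8 can be sketched as a lookup table keyed on the current state and the event letter. The `State` enum, the `TRANSITIONS` table, and the `step` and `run` helpers are hypothetical names for illustration. Item 6) does not assign a letter to the active fault that occurs in state C, so reusing event (a) for it is an assumption made here.

```python
from enum import Enum


class State(Enum):
    A = "active and standby both normal"
    B = "active faulty, standby normal"
    C = "active normal, standby faulty"
    D = "active and standby both faulty"
    E = "switching intermediate state"


# (current state, event letter) -> next state, following items 2) to 6).
TRANSITIONS = {
    (State.A, "a"): State.B,  # active fault
    (State.B, "f"): State.E,  # automatic switchover initiated
    (State.E, "g"): State.C,  # automatic switchover finished
    (State.C, "d"): State.A,  # standby fault recovery
    (State.A, "e"): State.C,  # standby fault
    (State.A, "b"): State.E,  # manual switchover initiated
    (State.E, "c"): State.A,  # manual switchover finished
    (State.A, "h"): State.D,  # simultaneous active and standby fault
    (State.C, "a"): State.D,  # assumption: active fault with no backup
}


def step(state, event):
    """Return the next state, or raise if the claim defines no transition."""
    try:
        return TRANSITIONS[(state, event)]
    except KeyError:
        raise ValueError(f"no transition from {state.name} on {event!r}") from None


def run(events, start=State.A):
    """Apply a sequence of event letters starting from the given state."""
    state = start
    for event in events:
        state = step(state, event)
    return state


if __name__ == "__main__":
    print(run("afgd").name)  # forced switchover followed by recovery
    print(run("bc").name)    # manual switchover
    print(run("ea").name)    # standby fault, then active fault
```

State D has no outgoing entries in the table because the claim describes no recovery path from it.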
CNB2006101252058A 2006-11-30 2006-11-30 Integrated access media gateway device based on the high-available system Active CN100496030C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2006101252058A CN100496030C (en) 2006-11-30 2006-11-30 Integrated access media gateway device based on the high-available system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2006101252058A CN100496030C (en) 2006-11-30 2006-11-30 Integrated access media gateway device based on the high-available system

Publications (2)

Publication Number Publication Date
CN1996969A true CN1996969A (en) 2007-07-11
CN100496030C CN100496030C (en) 2009-06-03

Family

ID=38251905

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2006101252058A Active CN100496030C (en) 2006-11-30 2006-11-30 Integrated access media gateway device based on the high-available system

Country Status (1)

Country Link
CN (1) CN100496030C (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102014029A (en) * 2010-12-25 2011-04-13 中国人民解放军国防科学技术大学 Method for detecting abnormal conditions of external service network in dual active
CN101695069B (en) * 2009-10-22 2012-04-04 南京科远自动化集团股份有限公司 Enetgw communication gateway
CN103067947A (en) * 2013-01-09 2013-04-24 上海大唐移动通信设备有限公司 Front end test device and communication method and automatic drive test system thereof
CN101567804B (en) * 2009-05-21 2013-06-05 华为数字技术(成都)有限公司 Method, equipment and system for realizing protection of system anomaly
CN106850255A (en) * 2016-12-20 2017-06-13 曙光信息产业(北京)有限公司 A kind of implementation method of multi-computer back-up
CN112904754A (en) * 2020-12-30 2021-06-04 国电南京自动化股份有限公司 Main and standby center switching control subsystem and method of integrated monitoring system
CN113946623A (en) * 2021-09-28 2022-01-18 南京莱斯信息技术股份有限公司 Real-time data synchronization system and method for main and standby systems

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101567804B (en) * 2009-05-21 2013-06-05 华为数字技术(成都)有限公司 Method, equipment and system for realizing protection of system anomaly
CN101695069B (en) * 2009-10-22 2012-04-04 南京科远自动化集团股份有限公司 Enetgw communication gateway
CN102014029A (en) * 2010-12-25 2011-04-13 中国人民解放军国防科学技术大学 Method for detecting abnormal conditions of external service network in dual active
CN103067947A (en) * 2013-01-09 2013-04-24 上海大唐移动通信设备有限公司 Front end test device and communication method and automatic drive test system thereof
CN103067947B (en) * 2013-01-09 2016-05-11 上海大唐移动通信设备有限公司 A kind of head end test equipment and communication means thereof, automatic drive test system
CN106850255A (en) * 2016-12-20 2017-06-13 曙光信息产业(北京)有限公司 A kind of implementation method of multi-computer back-up
CN106850255B (en) * 2016-12-20 2021-04-06 曙光信息产业(北京)有限公司 Method for implementing multi-machine backup
CN112904754A (en) * 2020-12-30 2021-06-04 国电南京自动化股份有限公司 Main and standby center switching control subsystem and method of integrated monitoring system
CN112904754B (en) * 2020-12-30 2022-06-28 国电南京自动化股份有限公司 Main and standby center switching control subsystem and method of integrated monitoring system
CN113946623A (en) * 2021-09-28 2022-01-18 南京莱斯信息技术股份有限公司 Real-time data synchronization system and method for main and standby systems
CN113946623B (en) * 2021-09-28 2024-04-09 南京莱斯信息技术股份有限公司 Main and standby system data real-time synchronization system and method

Also Published As

Publication number Publication date
CN100496030C (en) 2009-06-03

Similar Documents

Publication Publication Date Title
CN100496030C (en) Integrated access media gateway device based on the high-available system
US20070288585A1 (en) Cluster system
CN100461697C (en) Service take-over method based on device disaster tolerance, service switching device and backup machine
CN101594383B (en) Method for monitoring service and status of controllers of double-controller storage system
CN101645915B (en) Disk array host channel daughter card, on-line switching system and switching method thereof
CN101212287A (en) Port based redundant backup method
CN101916217A (en) Method, control device and system for switching a plurality of controllers
CN101247568A (en) Data fault-tolerance system and method and intermediate equipment
CN101237413B (en) Method for realizing high-availability of control part under forward and control separated network architecture
CN101547053A (en) Clock-switching method and clock-generating device
CN107040403A (en) The method that Distributed system reliability is improved based on DDS technologies
WO2006005251A1 (en) Method and system for carrying out the function of switching in a communication system
KR100324275B1 (en) Dual State Control Method Of Duplicated Processors
CN102487332B (en) Fault processing method, apparatus thereof and system thereof
CN101247213A (en) Method and system for master/standby rearrangement
JP4781696B2 (en) IP phone system
CN1327727C (en) Method for realizing double homes of media gateway
CN116340058A (en) Master-slave switching method and device
JP5176914B2 (en) Transmission device and system switching method for redundant configuration unit
CN100490343C (en) A method and device for realizing switching between main and backup units in communication equipment
US7577135B2 (en) IP telephone system
CN101146037A (en) Multi-cascading system and service processing method
JP4579018B2 (en) IP phone system
KR950010490B1 (en) Double-using method for the control system of an electronic exchanger
CN217037201U (en) Management network device for storing products and storage system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant