WO2023274164A1

WO2023274164A1 - Automatic main/standby switching method, control plane device, vbras system and storage medium

Info

Publication number: WO2023274164A1
Application number: PCT/CN2022/101589
Authority: WO
Inventors: 刘硕
Original assignee: 中兴通讯股份有限公司
Priority date: 2021-06-28
Filing date: 2022-06-27
Publication date: 2023-01-05
Also published as: CN115604087A

Abstract

An automatic main/standby switching method, a control plane device, a vBRAS system and a storage medium. The automatic main/standby switching method is applied to a second control plane device in a vBRAS system. The vBRAS system further comprises a first control plane device and a forwarding plane device, wherein the first control plane device is provided with a first instance in a main state, the second control plane device is provided with a second instance in a standby state, the first instance communicates with the forwarding plane device by means of a first channel, and the second instance communicates with the forwarding plane device by means of a second channel. The method comprises: receiving fault information, wherein the fault information represents that a failure rate of a first channel is greater than a first preset threshold value (S100); acquiring the current failure rate of a second channel according to the fault information (S200); and when the current failure rate of the second channel is less than or equal to a second preset threshold value, controlling a second instance to switch from a standby state to a main state (S300).

Description

Active-standby automatic switching method, control plane equipment, vBRAS system and storage medium

Cross References to Related Applications

This application is based on a Chinese patent application with application number 202110719886.5 and a filing date of June 28, 2021, and claims the priority of this Chinese patent application. The entire content of this Chinese patent application is hereby incorporated by reference into this application.

technical field

The present application relates to the technical field of communications, and in particular to a master-standby automatic switching method, a control plane device, a vBRAS system, and a computer-readable storage medium.

Background technique

vBRAS (virtual Broadband Remote Access Server, virtual broadband remote access server), as an emerging BRAS (Broadband Remote Access Server, broadband remote access server) equipment form, can be divided into centralized and separation of transfer and control according to the architecture. Mode. Among them, the separated transfer and control vBRAS system refers to a vBRAS system in which forwarding and control are separated, the control plane is virtualized and centralized, and the forwarding plane coexists between virtual and real. Generally, a vBRAS system with separation of forwarding and control includes a control plane device, a forwarding plane device, and a standardized interface between the control plane device and the forwarding plane device.

At present, there is a deficiency in related technologies that control plane devices cannot be switched automatically. Therefore, if a single control plane device fails due to a power outage in the computer room, fire, etc., the peer control plane device will not be able to sense and switch in time, causing the user's service function to fail, thereby affecting the user's network experience.

Contents of the invention

This application aims to solve at least one of the technical problems existing in the prior art. To this end, the present application proposes a master-standby automatic switching method, a control plane device, a vBRAS system, and a computer-readable storage medium.

In the first aspect, the embodiment of the present application provides an active-standby automatic switchover method, which is applied to the second control plane device in the vBRAS system. The vBRAS system also includes a first control plane device and a forwarding plane device. The first The control plane device is provided with a first instance in a master state, the second control plane device is provided with a second instance in a standby state, the first instance communicates with the forwarding plane device through a first channel, and the second The second instance communicates with the forwarding plane device through the second channel, the method includes: receiving fault information, wherein the fault information indicates that the failure rate of the first channel is greater than a first preset threshold; according to the fault information Obtain the current failure rate of the second channel; when the current failure rate of the second channel is less than or equal to a second preset threshold, control the second instance to switch from the standby state to the main state.

In the second aspect, the embodiment of the present application also provides a control plane device, including: a memory, a processor, and a computer program stored in the memory and operable on the processor, and the processor executes the The computer program implements the master-standby automatic switching method as described in the embodiment of the first aspect above.

In a third aspect, the embodiment of the present application further provides a vBRAS system, including the control plane device described in the embodiment of the second aspect above.

In the fourth aspect, the embodiment of the present application also provides a computer-readable storage medium, the computer-readable storage medium stores computer-executable instructions, and the computer-executable instructions are used to make the computer execute the above-mentioned embodiment of the first aspect. The main-standby automatic switching method described above.

Additional aspects and advantages of the application will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the application.

Description of drawings

The accompanying drawings are used to provide a further understanding of the technical solution of the present application, and constitute a part of the specification, and are used together with the embodiments of the present application to explain the technical solution of the present application, and do not constitute a limitation to the technical solution of the present application.

Fig. 1 is the schematic diagram of the system architecture platform of the main-standby automatic switching method provided by one embodiment of the present application;

FIG. 2 is a schematic diagram of an application scenario of an active-standby switching method provided by an embodiment of the present application;

FIG. 3 is a schematic diagram of a network of an active-standby switching method provided by an embodiment of the present application;

Fig. 4 is a schematic diagram of a state transition of a second example provided by an embodiment of the present application;

Fig. 5 is a specific step diagram of an active-standby automatic switching method provided by an embodiment of the present application;

FIG. 6 is a diagram of specific steps of an active-standby automatic switching method provided in another embodiment of the present application;

FIG. 7 is a diagram of specific steps of an active-standby automatic switching method provided by another embodiment of the present application;

FIG. 8 is a diagram of specific steps of an active-standby automatic switching method provided by another embodiment of the present application;

FIG. 9 is a diagram of specific steps of an active-standby automatic switching method provided by another embodiment of the present application;

FIG. 10 is a diagram of specific steps of an active-standby automatic switching method provided by another embodiment of the present application;

FIG. 11 is a diagram of specific steps of a method for automatically switching between master and backup according to another embodiment of the present application;

Fig. 12 is a diagram of specific steps of an active-standby automatic switching method provided by another embodiment of the present application.

detailed description

In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, not to limit the present application.

It should be noted that although the functional modules are divided in the schematic diagram of the device, and the logical sequence is shown in the flowchart, in some cases, it can be executed in a different order than the module division in the device or the flowchart in the flowchart. steps shown or described. The terms "first", "second" and the like in the specification, claims or the above drawings are used to distinguish similar objects, and not necessarily used to describe a specific order or sequence.

As an emerging BRAS equipment form, vBRAS can be mainly divided into centralized and separated transfer and control according to the architecture. Among them, the transfer-control separated vBRAS system refers to the technical ideas of SDN (Software Defined Network, software-defined network) and NFV (Network Functions Virtualization, network function virtualization), combined with CT (Communication technology, communication technology) and IT (Information technology, information technology), according to the actual application scenario requirements of operators, realize the vBRAS system that separates forwarding and control, virtualizes and centralizes the control plane, and coexists virtual and real things on the forwarding plane. Generally, a vBRAS system with separation of forwarding and control includes a control plane device, a forwarding plane device, and a standardized interface between the control plane device and the forwarding plane device.

At present, there is a deficiency in related technologies that automatic switching among multiple control plane devices cannot be performed. Therefore, if a single control plane device fails due to a power outage in the computer room, fire, etc., the peer control plane device will not be able to sense and switch in time, causing the user's service function to fail, thereby affecting the user's network experience.

Based on the foregoing, an embodiment of the present application provides an active-standby automatic switching method, a control plane device, a vBRAS system, and a computer-readable storage medium, wherein the active-standby automatic switching method is applied to the second control plane device in the vBRAS system, The vBRAS system further includes a first control plane device and a forwarding plane device, the first control plane device is provided with a first instance in a master state, and the second control plane device is provided with a second instance in a standby state, The first instance communicates with the forwarding plane device through a first channel, and the second instance communicates with the forwarding plane device through a second channel. The method includes but is not limited to the following steps: receiving fault information, wherein, The failure information indicates that the failure rate of the first channel is greater than a first preset threshold; the current failure rate of the second channel is obtained according to the failure information; when the current failure rate of the second channel is less than or equal to the first threshold Two preset thresholds, controlling the second instance to switch from the standby state to the main state. According to the solution provided by the embodiment of the present application, the second control plane device can sense the failure of the first control plane device in time according to the fault information, and switch the second instance from the standby state when it judges that it has the ability to upgrade to the master In the active state, the active/standby automatic switching between the control plane devices is completed, thereby improving the disaster recovery performance of the vBRAS system and optimizing the user's network experience.

The embodiments of the present application will be further described below in conjunction with the accompanying drawings.

As shown in FIG. 1 , FIG. 1 is a schematic diagram of a system architecture platform for performing an active-standby automatic switching method provided by an embodiment of the present application.

In the example shown in FIG. 1 , the system architecture platform is provided with a processor 100 and a memory 200 , wherein the processor 100 and the memory 200 may be connected via a bus or in other ways. In FIG. 1 , connection via a bus is taken as an example.

As a non-transitory computer-readable storage medium, the memory 200 can be used to store non-transitory software programs and non-transitory computer-executable programs. In addition, the memory 200 may include a high-speed random access memory 200 , and may also include a non-transitory memory 200 , such as at least one disk storage 200 , a flash memory device, or other non-transitory solid-state memory 200 . In some implementations, the memory 200 may optionally include memory 200 located remotely relative to the processor 100, and these remote memories 200 may be connected to the system architecture platform through a network. Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.

Those skilled in the art can understand that the system architecture platform can be applied to 3G communication network systems, LTE communication network systems, 5G communication network systems and subsequent evolved mobile communication network systems, etc., which is not specifically limited in this embodiment.

Those skilled in the art can understand that the system architecture platform shown in FIG. 1 does not constitute a limitation to the embodiment of the present application, and may include more or less components than those shown in the illustration, or combine some components, or have different Part placement.

In the system architecture platform shown in FIG. 1 , the processor 100 can call an information processing program stored in the memory 200 to execute the method for automatically switching between master and backup.

Based on the above-mentioned system architecture platform, various embodiments of the master-standby automatic switching method of the present application are proposed below.

Referring to FIG. 3 , for example, the active-standby automatic switching method is applied to the second control plane device 400 in the vBRAS system. The vBRAS system also includes a first control plane device 300 and a forwarding plane device 500. The first control plane device 300 is provided with The first instance in the master state, the second control plane device 400 is provided with a second instance in the standby state, the first instance communicates with the forwarding plane device 500 through the first channel (not shown in the figure), and the second instance communicates with the forwarding plane device 500 through the second The second channel (not shown in the figure) communicates with the forwarding plane device 500 .

Referring to FIG. 5 , the method for automatic active-standby switching specifically includes but is not limited to the following steps S100 , S200 and S300 .

Step S100: Receive failure information, wherein the failure information indicates that the failure rate of the first channel is greater than a first preset threshold.

It should be noted that the fault information may be generated by the first control plane device 300 and sent to the second control plane device 400, or may be generated by the second control plane device 400 itself according to the information sent by the forwarding plane device 500 , which is not limited in this embodiment.

It should be noted that the first preset threshold is the channel failure rate threshold preset in the first example, and the first preset threshold may be 50, 100, etc., which is not limited in this embodiment.

Step S200: Obtain the current failure rate of the second channel according to the failure information.

It should be noted that when the second control plane device 400 receives fault information indicating that the failure rate of the first channel is greater than the first preset threshold, it means that the first control plane device 300 is in a fault state, that is, the communication with the first instance The forwarding plane device 500 needs other control plane devices to perform the master upgrade operation to take over, so the second control plane device 400 will obtain the current failure rate of the second channel to determine whether the second instance has the master upgrade capability.

Step S300: When the current failure rate of the second channel is less than or equal to the second preset threshold, control the second instance to switch from the standby state to the main state.

It should be noted that when the current failure rate of the second channel is less than or equal to the second preset threshold, it means that the second control plane device 400 itself has the ability to upgrade to the master, and can switch the second instance from the standby state to the master state. Thus, the forwarding plane device 500 communicating with the first instance is taken over.

It should be noted that the second preset threshold is the channel failure rate threshold preset in the second example, and the second preset threshold may be 50, 100, etc., which is not limited in this embodiment.

It can be understood that, through steps S100 to S300, the second control plane device 400 can detect the fault of the first control plane device 300 in time according to the fault information, and when it is judged that it has the ability to upgrade to the master, the second control plane device 400 can The instance is switched from the standby state to the active state, and the automatic switchover between the active and standby devices on the control plane is completed, thereby improving the disaster recovery performance of the vBRAS system and optimizing the user's network experience.

It is worth noting that the active/standby automatic switchover method can be used to manage the active/standby states of two control plane devices, so that active/standby automatic switchover can be performed between the two control plane devices. At the same time, according to the actual application scenario requirements of operators, the automatic switching function of vBRAS system products across control plane devices is realized, thereby improving the disaster recovery performance of vBRAS system products and the reliability of system products. In addition, since the control plane equipment in this technical solution can identify the failure of the peer control plane equipment in time and automatically perform active/standby switchover, the stability of vBRAS system product operation is greatly improved, and the user's network experience is optimized, thereby It reduces the complaint rate of users and reduces the operation and maintenance costs of customers, which is of great significance to the subsequent development of vBRAS system-related technologies.

Referring to FIG. 3 and FIG. 6 , for example, the above step S100 specifically includes but not limited to the following step S110 .

Step S110: Receive the fault information sent from the first control plane device 300 through the heartbeat line, where the first control plane device 300 and the second control plane device 400 communicate through the heartbeat line.

Specifically, when the first control plane device 300 is disconnected from the first channel of the forwarding plane device 500, so that the failure rate of the first channel exceeds the first preset threshold of the first instance, the first control plane device 300 generates a fault information and send the fault information to the second control plane device 400 through the heartbeat line communicatively connected with the second control plane device 400; when the second control plane device 400 receives the fault information through the heartbeat line, it can judge that the first control plane The device 300 is in a fault state, and then the second control plane device 400 judges whether it has the ability to upgrade to the master, and if so, upgrades to the master.

Referring to FIG. 7 , for example, the above step S100 specifically includes but not limited to the following step S120 , step S130 and step S140 .

Step S120: Receive a fault event sent from the forwarding plane device 500 through the second channel, where the fault event indicates that there is a fault in the first channel.

Step S130: Calculate the failure rate of the first channel according to the failure events.

Step S140: When the failure rate of the first channel is greater than a first preset threshold, generate failure information.

Specifically, when the first control plane device 300 suddenly loses power or loses connection, the heartbeat line between the first control plane device 300 and the second control plane device 400 will be disconnected, and the first control plane device 300 and the forwarding plane device The first channel between 500 will also be disconnected. At this time, the second control plane device 400 receives the fault event sent from the forwarding plane device 500, and calculates the failure rate of the first channel according to the fault event, and then judges whether the failure rate of the first channel is greater than the first preset threshold, if so Then it can be judged that the first control plane device 300 is in a fault state, and then the second control plane device 400 judges whether it has the ability to upgrade to the master, and if so, upgrades to the master.

Referring to FIG. 2 to FIG. 4 and FIG. 8 , for example, the vBRAS system further includes a database 600, and the first control plane device 300 and the second control plane device 400 communicate with the database 600 respectively. After the above step S300, it specifically includes but It is not limited to the following steps S400 and S500.

Step S400: Control the second instance to switch from the standby state to the recovering state, where the recovering state is used for the second instance to extract the user data of the first instance from the database 600 .

Step S500: When the second instance finishes extracting the user data, control the second instance to switch from the recovering state to the main state.

Specifically, when the second control plane device 400 executes the master upgrade operation, the second control plane device 400 first controls the state of the second instance to switch from the standby state to the recovering state, and then pulls from the database 600 the information related to the first control plane device. 300 communicates with the online users of the forwarding plane device 500, extracts and restores these online users one by one to the second control plane device 400, and after the second control plane device 400 restores all these online users, restores the second instance from the The medium state switches to the main state.

Referring to FIG. 9 , for example, after the above step S400 , it specifically includes but not limited to the following steps S410 and S420 .

Step S410: Generate a pointing switch instruction.

Step S420: Send a pointing switch command to the forwarding plane device 500, so that the encapsulation and decapsulation table of the forwarding plane device 500 points to the second instance or the channel link of the forwarding plane device 500 points to the second instance.

Specifically, after the second control plane device 400 controls the second instance to switch from the standby state to the recovering state, it generates a pointing switch command and sends the pointing switch command to the forwarding plane device 500, so that the encapsulation of the forwarding plane device 500 is decapsulated. The encapsulation table points to the second instance; or, the pointing switching instruction is sent to the forwarding plane device 500, so that the channel link of the forwarding plane device 500 points to the second instance.

Referring to FIG. 10 , for example, after the above step S300 , it specifically includes but not limited to the following steps S600 and S700 .

Step S600: Generate a state switching instruction.

Step S700: Send a state switching instruction to the first instance, so that the first instance is switched from the master state to the standby state.

Specifically, after the current failure rate of the second channel is less than or equal to the second preset threshold, the second control plane device 400 generates a state switching instruction, and sends the state switching instruction to the first instance, so that the first The instance switches from the primary state to the standby state. It should be noted that at this time, the first control plane device 300 may not receive the state switching instruction due to a failure, but since the first instance and the second instance are independent of each other and do not affect each other, it will not affect the second instance. The control plane device 400 performs an upgrade to master operation.

Referring to FIG. 11 , for example, after the above step S300, it specifically includes but not limited to the following steps S800 and S900.

Step S800: When the failure rate of the first channel returns to less than or equal to the first preset threshold, obtain the first priority of the first instance and the second priority of the second instance, and send the second priority to the first controller Surface device 300.

Step S900: Compare the first priority with the second priority, and control the states of the second instance and the first instance according to the comparison result.

It should be noted that after the second control plane device 400 controls the second instance to switch from the standby state to the master state, because the first control plane device 300 may fail to receive a state switching instruction from the second control plane device 400, Therefore, when the first control plane device 300 returns to normal, both the first instance and the second instance will be in the master state, and the first control plane device 300 and the second control plane device 400 need to negotiate with each other to determine the final master state. Status of the control plane device. Specifically, when the failure rate of the first channel recovers to be less than or equal to the first preset threshold, the second control plane device 400 acquires the first priority of the first instance and the second priority of the second instance, and sends the second The second priority is assigned to the first control plane device 300, and then the first priority is compared with the second priority, and the statuses of the second instance and the first instance are controlled according to the comparison result.

It should be noted that both the first priority and the second priority are preset values, specifically 100, 200, etc., which are not limited in this embodiment.

Referring to FIG. 12 , for example, the above step S900 specifically includes but not limited to the following steps S910 and S920.

Step S910: When the first priority is higher than the second priority, control the second instance to restore from the master state to the standby state, and make the first control plane device 300 maintain the first instance according to the first priority and the second priority The state of is the main state.

Step S920: When the first priority is lower than the second priority, maintain the state of the second instance as the master state, and make the first control plane device 300 control the first instance to be mastered according to the first priority and the second priority. The state switches to the standby state.

Specifically, both the first control plane device 300 and the second control plane device 400 are in a normal working state without failure, and when the first priority is higher than the second priority, the second control plane device 400 controls the second instance by restore the master state to the standby state, and make the first control plane device 300 maintain the state of the first instance as the master state according to the first priority and the second priority; when the first priority is lower than the second priority, the second The control plane device 400 maintains the state of the second instance as the master state, and enables the first control plane device 300 to control the first instance to switch from the master state to the standby state according to the first priority and the second priority.

It can be understood that when both the first control plane device 300 and the second control plane device 400 have turned on the preemptive upgrade main switch, and the second priority is increased so that the second priority is higher than the first priority, then , even if the first control plane device 300 does not fail, the second control plane device 400 will automatically perform the master upgrade operation.

Based on the various embodiments of the above-mentioned master-standby automatic switchover method, an embodiment of the overall master-standby automatic switchover method of the present application is proposed below.

Referring to FIG. 2 and FIG. 3 , the vBRAS system includes a first control plane device 300 , a second control plane device 400 and a forwarding plane device 500 . Exemplarily, as shown in FIG. 3 , CP1 represents the first control plane device 300, CP2 represents the second control plane device 400, and UP1 to UP4 represent forwarding plane devices 500, wherein CP1 and CP2 include the same instance instance1, in Instance1 in CP1 represents the first instance, and instance1 in CP2 represents the second instance. When CP1 is normal, the first instance is in the active state, and the second instance is in the standby state, and UP1 and UP2 are connected through the first channel The first instance in CP1, UP1 and UP2 are connected to the second instance in CP2 through a second channel. Since the forwarding plane device 500 always sends user data to the control plane device in the master state, when the first control plane device 300 is normal, the first control plane device 300 takes over UP1 and UP2. When a user dials up to go online, the online message of the user whose physical network is connected to UP1 and UP2 will be delivered to the first control plane device 300, and the first control plane device 300 will save the user information after processing the user's online message into the database 600.

It should be noted that one control plane device can be configured with multiple geo-backup-instance instances, and each instance has its own independent master and backup state management, as follows.

1. The command supports configuring the geo-backup-instance instance switching mode as automatic: switch-mode auto.

2. The command supports configuring the priority of the geo-backup-instance instance, and the range is 1-254.

3. The command supports configuring the geo-backup-instance instance to determine the upper threshold of the channel failure rate threshold of the forwarding plane device 500 managed by itself, and the range is 1-100.

4. The command supports configuring the delay time delay-time between when the geo-backup-instance instance decision needs to be automatically upgraded to the execution of the upgrade action, and the range is 240-3600 seconds.

5. The command supports configuring whether the geo-backup-instance instance is enabled to preempt the master switch preempt enable/disable.

It should be noted that the forwarding plane device 500 supports reporting the state of the first channel between itself and the first control plane device 300, such as the OpenFlow channel, to the second control plane device 400, so that the second control plane device 400 reports the message through the forwarding plane device 500 To determine whether the current failure rate of the first channel between the first control plane device 300 and the forwarding plane device 500 at the opposite end has exceeded the first preset threshold.

It should be noted that the first control plane device 300 and the second control plane device 400 notify each other of the configuration under the geo-backup-instance instance through the SIB heartbeat line, and the same instance is allowed to be configured under the instance on the two control plane devices The parameters are different.

Exemplarily, two instances, instance1 and instance2, are configured on CP1 and CP2, where instance1 on CP1 is in the active state, instance2 is in the standby state, and instance1 on CP2 is in the standby state, and instance2 is in the active state. The instance1 of CP1 and CP2 is configured in automatic mode, and the OpenFlow channel between CP1 and CP2 and UP is good. The priority of instance1 of CP1 is configured as 200, and the priority of instance2 of CP1 is configured as 100. You can see that instance1 of CP1 is the master (master ), instance1 of CP2 is the backup (slave).

Configure the first preset threshold (threshold) of the OpenFlow channel on CP1 to be 40, and configure the second preset threshold (threshold) of the OpenFlow channel on CP2 to be 40, which means that when the OpenFlow channel of CP1 fails When the rate exceeds 40%, it can be determined that CP1 is abnormal, and CP2 needs to take over the following UP.

The user dials up to go online, and the user whose physical network is connected to UP1 and UP2 will deliver the online message to CP1, and CP1 will write the user information table into the database 600 .

When CP1 needs to be restarted or an unexpected failure occurs (such as CP1 server downtime, computer room power failure, CP1 network outbound interface link failure, etc.), the first channel between CP1 and UP1 and UP2 is disconnected, and CP1 and CP2 The sib heartbeat line between them is also disconnected. At this time, CP2 needs to take over UP1 and UP2 without affecting online users.

When UP1 and UP2 perceive that they are disconnected from the first channel of CP1, they report this event to CP2. After receiving the message reported by UP, CP2 calculates that the failure rate of the first channel of CP1 exceeds the first preset. Threshold (40%), at this time, CP2 knows that its second channel is good by obtaining the current failure rate of the second channel and according to the failure rate being less than or equal to the second preset threshold, so CP2 decides to promote the master, and CP2 Take over UP1 and UP2.

After CP2 performs the upgrade operation, the status of instance1 of CP2 becomes recovery (recovery), indicating that CP2 is recovering data. CP2 sends a state switching command to CP1, commanding the state of instance1 of CP1 to change to the standby state (slave). Although CP1 is down at this time and will not receive this message, it does not affect CP2 to continue to be promoted to master; at the same time, CP2 passes The second channel sends a pointing switch command to UP1 and UP2 to point the NSH encapsulation and decapsulation table to CP2, and make UP1 and UP2 point the channel link to the second instance of CP2. At the same time, after UP1 and UP2 receive the master upgrade message from CP2, they will start aging data such as user tables and network segment routes on UP1 and UP2, and then wait for CP2 to re-deliver service data.

In addition, after CP2 executes the upgrade operation, instance1 of CP2 is in recovery state, pulls the already online users of UP1 and UP2 from the database 600, and restores them to CP2. Moreover, each time CP2 acquires a user, it synchronizes the user with UP1 or UP2, so that UP1 and UP2 stop aging of the user after receiving the user synchronization information. After CP2 restores all users from the database 600, the state of CP2 changes from recovery to master state, that is, the master-standby switchover between CP1 and CP2 is completed.

It can be understood that when instance1 of CP2 is in the recovery state, CP2 needs to control the new users of UP1 and UP2 not to go online, because the resources occupied by the newly online users may be different from those of the instance1 that will be restored from the database 600. User conflict.

It can be understood that when instance1 of CP2 is in the recovery state, due to the existence of the load sharing mechanism, it can ensure that instance2 of CP2 maintains the master state (master) without being affected, and UP3 and UP4 can also go online normally with new users.

It is understandable that when instance1 of CP2 is in the recovery state, the user forwarding tables on UP1 and UP2 still exist, so that the user's uplink traffic and downlink traffic are normal, thereby ensuring that the user's online function and experience are not affected.

It is understandable that if a network failure occurs on the CP1 server, CP1 cannot receive the message from CP2 ordering CP1 to change to standby, so after CP2 becomes the master state, both CP1 and CP2 will be in the master state, resulting in a dual-master phenomenon. Therefore, when the CP1 server network is back to normal, CP1 and CP2 will negotiate to determine the active CP and standby CP, and the CP with the highest priority is determined to be the final active CP.

Based on the master-standby automatic switching method in the embodiment of the first aspect above, various embodiments of the control plane device in the second aspect of the present application are proposed below.

An embodiment of the present application provides a control plane device, which includes: a memory 200 , a processor 100 , and a computer program stored in the memory 200 and operable on the processor 100 .

The processor 100 and the memory 200 may be connected via a bus or in other ways.

It should be noted that the controller in this embodiment may correspond to include the memory 200 and the processor 100 in the embodiment shown in FIG. 1, which can constitute a part of the system architecture platform in the embodiment shown in FIG. 1. Both belong to the same inventive concept, so both have the same realization principle and beneficial effect, and will not be described in detail here.

The non-transitory software programs and instructions required to realize the master-standby automatic switching method of the above-mentioned embodiment are stored in the memory 200, and when executed by the processor 100, the master-standby automatic switchover method of the above-mentioned embodiment is executed, for example, the above description is performed. Method steps S100 to S300 in Fig. 5, method steps S110 in Fig. 6, method steps S120 to S140 in Fig. 7, method steps S400 to S500 in Fig. 8, method steps S410 to S420 in Fig. 9, Fig. The method steps S600 to S700 in FIG. 10 , the method steps S800 to S900 in FIG. 11 , and the method steps S910 to S920 in FIG. 12 .

The device embodiments described above are only illustrative, and the units described as separate components may or may not be physically separated, that is, they may be located in one place, or may be distributed to multiple network units. Part or all of the modules can be selected according to actual needs to achieve the purpose of the solution of this embodiment.

It can be understood that, since the control plane device in the embodiment of the second aspect of the present application and the master-standby automatic switching method in any embodiment of the first aspect above belong to the same inventive concept, the control plane device in the embodiment of the second aspect of the present application For the specific implementation manner and technical effect, reference may be made to the specific implementation manner and technical effect of the master-standby automatic switching method in any embodiment of the first aspect above, and details are not repeated here.

Based on the control plane device in the embodiment of the second aspect above, various embodiments of the vBRAS system in the third aspect of the present application are proposed below.

Specifically, the vBRAS system in this embodiment of the present application is a transfer-control-separated vBRAS system, and the vBRAS system includes the control plane device in each embodiment of the second aspect above, and also includes the forwarding plane device 500 and at least one other control plane device, and the control A standardized interface is provided between the plane device and the forwarding plane device 500 .

It can be understood that since the vBRAS system in the embodiment of the third aspect of the present application and the control plane device in any embodiment of the second aspect above belong to the same inventive concept, the specific implementation manner of the vBRAS system in the embodiment of the second aspect of the present application For details and technical effects, reference may be made to the specific implementation manners and technical effects of the control plane device in any embodiment of the second aspect above, and details are not repeated here.

Based on the master/standby automatic switchover method in the embodiment of the first aspect above, various embodiments of the computer-readable storage medium in the fourth aspect of the present application are proposed below.

The computer-readable storage medium stores computer-executable instructions, and when the computer-executable instructions are used to execute the above-mentioned master-standby automatic switching method, for example, execute the above-described method steps S100 to S300 in FIG. 5 and the method in FIG. 6 Step S110, method steps S120 to S140 in FIG. 7, method steps S400 to S500 in FIG. 8, method steps S410 to S420 in FIG. 9, method steps S600 to S700 in FIG. 10, method step S800 in FIG. 11 Go to S900, the method steps S910 to S920 in FIG. 12 .

The embodiment of the present application includes a master-standby automatic switching method, a control plane device, a vBRAS system, and a computer-readable storage medium, wherein the master-standby automatic switchover method is applied to the second control plane device in the vBRAS system, and the vBRAS system also It includes a first control plane device and a forwarding plane device, the first control plane device is provided with a first instance in a master state, the second control plane device is provided with a second instance in a standby state, and the first instance Communicating with the forwarding plane device through a first channel, the second instance communicating with the forwarding plane device through a second channel, the method includes: receiving fault information, wherein the fault information represents the first channel The failure rate of the second channel is greater than the first preset threshold; the current failure rate of the second channel is obtained according to the failure information; when the current failure rate of the second channel is less than or equal to the second preset threshold, control the second The instance switches from the standby state to the primary state. According to the solution provided by the embodiment of the present application, the second control plane device can sense the failure of the first control plane device in time according to the fault information, and switch the second instance from the standby state when it judges that it has the ability to upgrade to the master In the active state, the active/standby automatic switching between the control plane devices is completed, thereby improving the disaster recovery performance of the vBRAS system and optimizing the user's network experience.

Those skilled in the art can understand that all or some of the steps and systems in the methods disclosed above can be implemented as software, firmware, hardware and an appropriate combination thereof. Some or all of the physical components may be implemented as software executed by a processor 100, such as a central processing unit 100, a digital signal processor 100, or a microprocessor 100, or as hardware, or as an integrated circuit, Such as application specific integrated circuits. Such software may be distributed on computer readable media, which may include computer storage media (or non-transitory media) and communication media (or transitory media). As known to those of ordinary skill in the art, the term computer storage media includes both volatile and nonvolatile media implemented in any method or technology for storage of information, such as computer readable instructions, data structures, program modules, or other data. permanent, removable and non-removable media. Computer storage media including, but not limited to, RAM, ROM, EEPROM, flash memory or other memory 200 technology, CD-ROM, digital versatile disk (DVD) or other optical disk storage, magnetic cartridges, tape, magnetic disk storage or other magnetic storage devices, or Any other medium that can be used to store desired information and that can be accessed by a computer. In addition, as is well known to those of ordinary skill in the art, communication media typically embody computer readable instructions, data structures, program modules, or other data in a modulated data signal such as a carrier wave or other transport mechanism, and may include any information delivery media .

The embodiments of the present application have been described in detail above in conjunction with the accompanying drawings, but the present application is not limited to the above-mentioned embodiments. Within the scope of knowledge of those of ordinary skill in the art, various modifications can be made without departing from the purpose of the present application. kind of change.

Claims

A master-standby automatic switching method, applied to a second control plane device in a vBRAS system, the vBRAS system also includes a first control plane device and a forwarding plane device, and the first control plane device is provided with a first control plane device in a master state In an example, the second control plane device is provided with a second instance in a standby state, the first instance communicates with the forwarding plane device through a first channel, and the second instance communicates with the forwarding plane device through a second channel Surface device communication, the method includes:

receiving fault information, wherein the fault information indicates that the fault rate of the first channel is greater than a first preset threshold;

Acquiring the current failure rate of the second channel according to the failure information;

When the current failure rate of the second channel is less than or equal to a second preset threshold, the second instance is controlled to switch from the standby state to the main state.
The method according to claim 1, wherein said receiving fault information comprises:

receiving fault information sent from the first control plane device through a heartbeat line, wherein communication between the first control plane device and the second control plane device is performed through the heartbeat line.
The method according to claim 1, wherein said receiving fault information comprises:

receiving a fault event sent from the forwarding plane device through the second channel, where the fault event indicates that a fault exists in the first channel;

calculating a failure rate of the first channel according to the failure event;

When the failure rate of the first channel is greater than a first preset threshold, failure information is generated.
The method according to claim 1, wherein the vBRAS system further includes a database, the first control plane device and the second control plane device communicate with the database respectively, and the control of the second instance is performed by The standby state is switched to the main state, including:

controlling the second instance to switch from a standby state to a recovering state, where the recovering state is used for the second instance to extract user data of the first instance from the database;

When the second instance finishes extracting the user data, the second instance is controlled to switch from the recovering state to the main state.
The method according to claim 4, wherein, after controlling the second instance to switch from the standby state to the recovering state, the method further comprises:

Generate pointing switch instructions;

Sending the pointing switching instruction to the forwarding plane device, so that the encapsulation and decapsulation table of the forwarding plane device points to the second instance or makes the channel link of the forwarding plane device point to the second instance.
The method according to claim 1, wherein, after the current failure rate of the second channel is less than or equal to a second preset threshold, the method further comprises:

Generate state switching instructions;

Sending the state switching instruction to the first instance, so that the first instance is switched from the master state to the standby state.
The method according to claim 1, wherein, after the second instance is switched from the standby state to the main state, the method further comprises:

When the failure rate of the first channel recovers to be less than or equal to the first preset threshold, obtain the first priority of the first instance and the second priority of the second instance, and send the first priority second priority to the first control plane device;

comparing the first priority with the second priority, and controlling the states of the second instance and the first instance according to the comparison result.
The method according to claim 7, wherein the controlling the states of the second instance and the first instance according to the comparison result comprises:

When the first priority is higher than the second priority, control the second instance to restore from the master state to the standby state, and make the first control plane device according to the first priority and the The second priority maintains the state of the first instance as the main state;

When the first priority is lower than the second priority, maintain the state of the second instance as the main state, and make the first control plane device according to the first priority and the second The priority controls the switching of the first instance from the master state to the standby state.
A control plane device, comprising: a memory, a processor, and a computer program stored on the memory and operable on the processor, when the processor executes the computer program, the implementation of claims 1 to 8 is achieved. The active-standby automatic switching method described in any one.
A vBRAS system, comprising the control plane device according to claim 9.
A computer-readable storage medium storing computer-executable instructions, the computer-executable instructions being used to execute the master/standby automatic switching method according to any one of claims 1 to 8.