WO2014067330A1 - 一种分布式通信设备软件升级方法及系统 - Google Patents

一种分布式通信设备软件升级方法及系统 Download PDF

Info

Publication number
WO2014067330A1
WO2014067330A1 PCT/CN2013/082411 CN2013082411W WO2014067330A1 WO 2014067330 A1 WO2014067330 A1 WO 2014067330A1 CN 2013082411 W CN2013082411 W CN 2013082411W WO 2014067330 A1 WO2014067330 A1 WO 2014067330A1
Authority
WO
WIPO (PCT)
Prior art keywords
software
main control
control unit
data
standby
Prior art date
Application number
PCT/CN2013/082411
Other languages
English (en)
French (fr)
Inventor
赵真富
陈维
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Priority to EP13851276.9A priority Critical patent/EP2916488B1/en
Publication of WO2014067330A1 publication Critical patent/WO2014067330A1/zh

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/08Configuration management of networks or network elements
    • H04L41/0803Configuration setting
    • H04L41/0813Configuration setting characterised by the conditions triggering a change of settings
    • H04L41/082Configuration setting characterised by the conditions triggering a change of settings the condition being updates or upgrades of network functionality
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/60Software deployment
    • G06F8/65Updates
    • G06F8/656Updates while running
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/08Configuration management of networks or network elements
    • H04L41/085Retrieval of network configuration; Tracking network configuration history
    • H04L41/0859Retrieval of network configuration; Tracking network configuration history by keeping history of different configuration generations or by rolling back to previous configuration versions

Definitions

  • the present invention relates to the field of communications technologies, and in particular, to a method and system for upgrading a distributed communication device software. Background technique
  • a communication device In network communication devices operating on the network, many devices are distributed communication devices based on multi-process operating systems for performance and scalability.
  • a communication device using a multi-process operating system a plurality of business processes are generally used to cooperatively perform routing learning and data forwarding functions of the communication device.
  • the communication device interacts with the external device, learns the network route, and calculates the best routing table used to generate the forwarding plane. Then, it is sent to the forwarding plane, and the forwarding plane then performs routing and forwarding of data according to the optimal routing table.
  • the main purpose of the embodiments of the present invention is to provide a method and system for upgrading a distributed communication device software, which can avoid unnecessary service interruption caused by software upgrade and improve Business reliability and availability to enhance the user experience.
  • a distributed communication device software upgrade method is applied to a distributed communication device software upgrade system including an active main control unit, a standby main control unit, and a line card, and the method includes: the main control unit notifying the standby main control The unit starts the new version of the software and runs in the standby state, and restores the data and data state of the new version software on the standby main control unit through the old version software on the main control unit;
  • the active main control unit and the standby main control unit perform active/standby state switching
  • the line card starts the new version of the control plane and the management surface software and runs in the standby state, and performs data on the new version software on the line card through the control panel and the old version software of the management plane on the line card and Data state recovery;
  • the line card cancels the software of the old version of the control plane and the management plane, and runs the new version of the control plane and the management surface in the active state;
  • the line card refreshes the data and data status of the forwarding surface software.
  • the main control unit notifies the standby main control unit to start the new version software and runs in the standby state, and performs data and data on the new version software on the standby main control unit through the old version software on the main control unit.
  • State recovery including:
  • the main control unit notifies the standby main control unit to start the new version software on the standby main control unit; the old version software on the main control unit performs data and data state recovery on the new version software on the standby main control unit based on the data recovery component. .
  • the line card starts a new version of the control plane and the management plane and runs in a standby state, and performs data and data state recovery on the new version software on the line card by the current active main control unit, or Data and data state recovery of the new version of the software on the line card through the control plane and the old version of the management interface on the line card, including: After the active/standby state switching of the active main control unit and the standby main control unit is completed, the line card starts the new version of the control plane and the management plane on the line card and runs in the standby state;
  • the current version of the main control unit software restores the data and data status of the new version of the software on the line card based on the data recovery component, or restores the state of the control plane and the management plane on the line card;
  • the line card logs out the old version of the control plane and the management plane, and simultaneously runs the new version of the control plane and the management plane in the active state, refreshing the forwarding plane.
  • Software data and data status is required.
  • the method further includes:
  • the forwarding plane software on the line card is upgraded after the data and data status recovery on the line card is completed and before the data and data status of the forwarding plane software is refreshed.
  • the method further includes:
  • the old version software on the original active main control unit is upgraded to the new version software and run in the standby state.
  • a distributed communication device software upgrade system comprising: an active main control unit and a standby main control unit based on physical redundancy, and a line card;
  • the main control unit is configured to notify the standby main control unit to start a new version of the software; and to restore data and data status to the new version of the software on the standby main control unit by using the old version software on the main control unit; After the data and data state recovery is completed, the running state is switched to standby;
  • the standby main control unit is configured to start a new version of the software and run in a standby state; cooperate with the main control unit to perform data and data state recovery on the new version software; when the data and data state is restored, the running state is Switch to main use;
  • the line card is configured to be switched in the active/standby state of the active main control unit and the standby main control unit. After that, start the new version of the control plane and the management side of the line card and run it in the standby state, and restore, or, through the control panel on the line card and the old version of the management plane, the new version on the line card.
  • the version software performs data and data state recovery; after the data and data state recovery on the line card is completed, the old version software of the control plane and the management plane is deregistered, and the new version of the control plane and the management plane is mainly Run with status to refresh the data and data status of the forwarding surface software.
  • the main control unit includes: a first version management component and a first active and standby management component;
  • the standby main control unit includes: a second version management component and a second active and standby management component;
  • Software recovery components are included in the software;
  • the first version management component is configured to notify the second version management component to start the new version software on the standby main control unit; after the data and data state recovery is completed, notify the first active and standby management component to use the main control The operating state of the unit is switched to standby;
  • the second version management component is configured to start a new version of the software on the standby main control unit; after the data and data state recovery is completed, notify the second active/standby management component to switch the running state of the standby main control unit into main;
  • the first active/standby management component is configured to perform active/standby switchover on the running state of the active main control unit
  • the second active/standby management component is configured to perform active/standby switchover on the running status of the standby main control unit;
  • the old version software on the main control unit performs data and data state recovery on the new version of the software on the standby main control unit based on the data recovery component.
  • the line card comprises: a third version management component; the software on the main control unit and the line card each comprise a data recovery component;
  • the third version management component is configured to start a new version of the control plane and the management plane on the line card after the active/standby state switching of the active main control unit and the standby main control unit is completed, and to reserve After the data and data status recovery on the line card is completed, the old version software of the control plane and the management plane is deregistered, and the new version of the control plane and the management plane is run in the active state, and the forwarding is performed. Data and data status of the software;
  • the new version of the current main control unit software restores the data and data status based on the data recovery component to the new version of the software on the line card, or the control layer and the management version of the old version software on the line card are based on
  • the data recovery component performs data and data state recovery on the new version of the software on the line card.
  • the line card is further configured to upgrade the forwarding surface software on the line card after the data and data status on the line card is restored and before the data and data status of the forwarding plane software is refreshed.
  • the main control unit is further configured to upgrade the old version software on the original main control unit to the new version software after the main control unit and the standby main control unit perform the active/standby state switching. Standby state operation.
  • the distributed communication device software upgrade method and system sequentially upgrades the software on the main control unit and the line card, because the main control unit includes the main control unit and the standby main control unit based on physical redundancy, and the line
  • the card can enable the software to run in the master state or the standby state, that is, provide the master/slave switchover of the logic master unit and the standby master unit, and the master and slave running state control of the new version software and the old version software on the line card, so that the new version
  • the software seamlessly replaces the old version of the software into a working state.
  • the software of the control plane and the management plane is upgraded on the online card, and the forwarding surface software of the line card is upgraded if necessary to avoid unnecessary service interruption caused by software upgrade, and improve service reliability and Usability, avoiding business interruptions, causing significant economic losses and improving the user experience.
  • FIG. 1 is a flowchart of a method for upgrading a distributed communication device software according to a first embodiment of the present invention
  • 2 is a flowchart of a method for upgrading a software of a distributed communication device according to a second embodiment of the present invention
  • FIG. 3 is a schematic diagram of a system for upgrading a software of a distributed communication device according to a second embodiment of the present invention
  • FIG. 4 is a schematic diagram of an upgrade sequence of a distributed access gateway software in an application example of the present invention
  • FIG. 5 is a schematic diagram of a state before a distributed access gateway software is upgraded in an application example of the present invention
  • FIG. 7 is a schematic diagram of the operation state switching of the main control 2 of the distributed access gateway in the application example of the present invention, and starting the new version of the line card software for data and data state recovery;
  • FIG. 8 is a schematic diagram of a software for deregistering an old version of a distributed access gateway line card according to an application example of the present invention.
  • FIG. 9 is a schematic diagram of a software running into a new version of a distributed access gateway line card in an application example of the present invention.
  • FIG. 10 is a schematic diagram of upgrading an old version of the software of the distributed access gateway master 1 to a new version of the software in the application example of the present invention. detailed description
  • the embodiment of the invention describes a method for upgrading a distributed communication device software, which is applied to a distributed communication device software upgrade system including an active main control unit, a standby main control unit and a line card, and the method includes the following two stages. : The first phase, the main control unit software upgrade process; the second phase, the line card software upgrade process.
  • the main control unit and the standby main control unit are collectively referred to as a main control unit; wherein, the main control unit is also referred to as a main processing unit, and the function in the communication device is responsible for system management, route calculation, and the like.
  • the main control unit refers to the software running on the main processor running in the main state and providing services to the outside world.
  • the standby main control unit refers to the software running on the main processor in the standby state, which does not provide services to the outside world, and can only provide services to the outside world after being transferred to the main state.
  • the function of the line card in the communication device is to perform services such as forwarding data management and data forwarding.
  • the main control unit and the standby main control unit can be used by a central processing unit (CPU), or a digital signal processor (DSP), or a programmable gate array (FPGA, Field-Programmable). Gate Array) implementation.
  • the active main control unit comprises: a first version management component and a first active and standby management component;
  • the standby main control unit comprises: a second version management component and a second active and standby management component;
  • the line card includes: The third version management component; the software on the main control unit and the line card each include a data recovery component.
  • the method includes the following steps:
  • Step S101 the main control unit notifies the standby main control unit to start the new version software and runs in the standby state, and performs data and data state recovery on the new version software on the standby main control unit by using the old version software on the main control unit.
  • step S101 includes:
  • Step A1 The main control unit notifies the standby main control unit to start a new version of the software on the standby main control unit;
  • Step A2 The old version software on the main control unit performs data and data state recovery on the new version software on the standby main control unit based on the data recovery component.
  • Step A1 is specifically: the first version management component notifies the second version management component in the standby master Start the new version of the software on the unit.
  • step A2 includes:
  • Step A21 A single or multiple communication links are established between the active main control unit and the standby main control unit through respective data recovery components;
  • Step A22 The old version software on the active main control unit synchronizes the data and data status to the new version software on the standby main control unit based on the data recovery component.
  • the new version of the software on the alternate master unit also receives data and data status synchronized by the new version of the software on the master unit based on the data recovery unit.
  • Step S102 The active main control unit and the standby main control unit perform active/standby state switching.
  • the standby main control unit that has completed the software upgrade enters the active working state, and the original primary control unit enters the standby working state, and the software thereon waits for the upgrade.
  • the first version management component notifies the first active/standby management component to switch the running state of the active main control unit to standby, and the second version management component notifies the second The active and standby management components switch the running status of the standby main control unit to the primary.
  • Step S103 the current active main control unit notifies the line card to start the new version of the control plane and the management plane and runs in the standby state, and performs data on the new version software on the line card through the current active main control unit.
  • the new version of the software runs in the standby state; when the data and data state on the line card is restored, the old version of the control plane and the management plane is deregistered, and the new version of the control plane and the management plane is Running in the main state, refreshing the data and data status of the forwarding surface software;
  • the new version of the current master control unit is based on the data recovery component described on the line card
  • the new version of the software restores data and data status, or the status of the control and management planes on the line card is restored.
  • step S103 the current active main control unit performs data and data state recovery on the control panel and the new version software of the management plane on the line card, including:
  • Step B1 A single or multiple communication links are established between the current active main control unit and the line card based on the data recovery component;
  • Step B2 The new version of the current active master unit software synchronizes the data and data status based on the data recovery component to the control plane on the line card and the new version of the management side software.
  • the new version of the control plane and management plane on the line card receives data and data status synchronized by the primary master unit based on the data recovery component.
  • step S103 the data and data state recovery of the new version of the software on the line card is performed by the control plane on the line card and the old version software of the management plane, including:
  • Step C1 establishing a single or multiple communication links based on the data recovery component between the control layer and the management version of the old version software on the line card and the new version of the software on the line card;
  • Step C2 The old version of the control plane and the management plane on the line card synchronizes the data and data status based on the new version of the software on the line card based on the data recovery component.
  • the new version of the control plane and management plane on the line card is based on the data and data status of the data recovery component receiving the control plane on the line card and the old version of the management side.
  • Step S104 after the data and data state recovery on the line card is completed, the line card cancels the old version software of the control plane and the management plane, and simultaneously runs the new version of the control plane and the management plane in the active state. Refresh the data and data status of the forwarding surface software.
  • the third version management component logs out the old version software of the control plane and the management plane, and simultaneously uses the new version of the control plane and the management plane for the main use.
  • the status is running, and the data and data status of the forwarding surface software are refreshed.
  • the method further includes:
  • the main control unit here refers to the main control unit in the main state, that is, the main control unit, because the standby main control unit does not provide services to the system.
  • the embodiment of the present invention further describes a method for upgrading a distributed communication device software, which is applied to a distributed communication device software upgrade system including an active main control unit, a standby main control unit, and a line card, and the method includes the following two Phase: The first phase, the main control unit software upgrade process; the second phase, the line card software upgrade process; as shown in FIG. 2, the method includes the following steps:
  • Step S201 the main control unit notifies the standby main control unit to start the new version software and runs in the standby state, and performs data and data state recovery on the new version software on the standby main control unit through the old version software on the main control unit.
  • step S201 includes:
  • Step A1 The active main control unit notifies the standby main control unit to start a new version of the software on the standby main control unit;
  • Step A2 The old version software on the main control unit performs data and data state recovery on the new version software on the standby main control unit based on the data recovery component.
  • Step A1 is specifically as follows: the first version management component notifies the second version management component to start the new version software on the standby main control unit.
  • step A2 includes:
  • Step A21 A single or multiple communication links are established between the active main control unit and the standby main control unit through respective data recovery components;
  • Step A22 The old version software on the active main control unit synchronizes the data and data status to the new version software on the standby main control unit based on the data recovery component.
  • step S202 the active main control unit and the standby main control unit perform active/standby state switching.
  • the standby main control unit that has completed the software upgrade enters the active working state, and the original primary control unit enters the standby working state, and the software thereon waits for the upgrade.
  • the first version management component notifies the first active/standby management component to switch the running state of the active main control unit to standby, and the second version management component notifies the second The active and standby management components switch the running status of the standby main control unit to the primary.
  • Step S203 After the active main standby unit and the standby main control unit perform the active/standby state switching, upgrade the old version software on the original main control unit to the new version software and run in the standby state.
  • Step S204 the current active main control unit notifies the line card to start the new version of the control plane and the management plane and runs in the standby state, and performs data on the new version software on the line card through the current active main control unit.
  • the new version of the software runs in the standby state; when the data and data state on the line card is restored, the old version of the control plane and the management plane is deregistered, and the new version of the control plane and the management plane is Running in the main state, refreshing the data and data status of the forwarding surface software;
  • the current version of the main master control software restores data and data status based on the data recovery component to the new version of the software on the line card, or the state of the control plane and management plane on the line card is restored.
  • Step S205 after the data and data state recovery on the line card is completed, the line card cancels the old version software of the control plane and the management plane, and simultaneously runs the new version of the control plane and the management plane in the active state. Upgrade the forwarding software on the line card, refresh the data and number of the forwarding software According to the state.
  • the third version management component logs out the old version software of the control plane and the management plane, and simultaneously uses the new version of the control plane and the management plane for the main use.
  • the status is running, the forwarding plane software on the line card is upgraded, and the data and data status of the forwarding plane software are refreshed.
  • the face-to-face management software upgrade process is similar. The reason why the forwarding surface software upgrade is last is because the function of the forwarding plane directly affects the service operation of the communication device.
  • the control plane and management plane software on the line card are not upgraded first.
  • the service is interrupted, and the process of upgrading the forwarding surface software must also go through:
  • the old version of the software on the forwarding side logs out and starts the new version of the software on the forwarding side, and the forwarding on the line card is performed by the new version of the software on the current active main control unit.
  • the new version of the software performs data and data state recovery, and refreshes the data and data status of the new version of the software on the forwarding side, so that the forwarding entries of the new version forwarding plane take effect.
  • the time to upgrade the forwarding software is very short and hardly affects business operations.
  • the method further includes:
  • the main control unit here refers to the main control unit in the main state, that is, the main control unit, because the standby main control unit does not provide services to the system.
  • the embodiment of the present invention further describes a distributed communication device software upgrade system.
  • the system includes a main control unit 10; wherein the main control unit 10 includes an active main control unit 20 based on physical redundancy. And the standby main control unit 30;
  • the main control unit 20 is configured to notify the standby main control unit 30 to start the new version of the software; and the data and data state of the new version of the software on the standby main control unit 30 is restored by the old version software on the main control unit 20; When the data and data state recovery is completed, the running state is inverted. Change to spare.
  • the standby main control unit 30 is configured to start a new version of the software and run in a standby state; cooperate with the main control unit 20 to perform data and data state recovery on the new version of the software; when the data and data state is restored, the running state is Switch to the main use.
  • the main control unit 20 includes: a first version management component 21 and a first active and standby management component 22, and the standby main control unit 30 includes: a second version management component 31 and a second active/standby management component 32.
  • the software on unit 10 includes a data recovery component 11;
  • the first version management component 21 is configured to notify the second version management component 31 to start the new version software on the standby main control unit 30; after the data and data state recovery is completed, notify the first active/standby management component 22 to use the primary
  • the running state of the main control unit 20 is switched to standby, so that the old version software enters the standby working state and waits for the upgrade;
  • the data recovery component 11 the old version software on the main control unit 20 performs data and data state recovery on the new version software on the standby main control unit 30 based on the data recovery component 11;
  • the second version management component 31 is configured to start the new version of the software on the standby main control unit 30. After the data and data state recovery is completed, the second active/standby management component 32 is notified to invert the running state of the standby main control unit 30. Switch to the main use, so that the new version of the software after the upgrade is completed into the main working state;
  • the first active/standby management component 22 is configured to perform active/standby switchover on the running state of the active main control unit 20;
  • the second active/standby management component 32 is configured to perform an active/standby switchover on the running state of the standby main control unit 30.
  • the line card 40 is configured to start a new version of the control plane and the management plane after the main-standby state switching of the active main control unit 20 and the standby main control unit 30 is completed, and run in a standby state, and pass, or pass
  • the control panel on the line card and the old version of the management side software on the line card The version software performs data and data state recovery; after the data and data state recovery on the line card is completed, the old version software of the control plane and the management plane is deregistered, and the new version of the control plane and the management plane is mainly Run with status to refresh the data and data status of the forwarding surface software.
  • the line card 40 comprises: a third version management component 41, the software on the main control unit and the line card each comprise a data recovery component.
  • the third version management component 41 is configured to start the new version software of the control plane and the management plane and run in the standby state after the active/standby state switching of the active main control unit 20 and the standby main control unit 30 is completed; when the line card 40 After the data and data state recovery is completed, the old version software of the control plane and the management plane is deregistered, and the new version of the control plane and the management plane is run in the main state, and the data and data of the forwarding plane software are refreshed. State
  • the data recovery component 11 the new version software of the current active main control unit 20 performs data and data state recovery on the new version software on the line card 40 based on the data recovery component 11 , or the control plane on the line card 40 and
  • the old version of the management software is based on the data recovery component 11 to perform data and data state recovery on the new version of the software on the line card 40.
  • the embodiment of the present invention further describes a distributed communication device software upgrade system.
  • the distributed communication device software upgrade system of this embodiment is substantially the same as the distributed communication device software upgrade system described in FIG. 3 above, and the difference is that the implementation In the system of the example,
  • the main control unit 20 is further configured to upgrade the old version software on the original main control unit 20 to the new version software after the main control unit 20 and the standby main control unit 30 perform the active/standby state switching.
  • the status is running.
  • the line card 40 is further configured to upgrade the forwarding surface software on the line card 40 after the data and data status recovery on the line card 40 is completed and before the data and data status of the forwarding plane software is refreshed.
  • the third version management component 41 on the line card 40 has a similar upgrade process to the forwarding plane software and the control plane and management plane software upgrade process on the line card 40.
  • the level is placed at the end because the function of the forwarding plane directly affects the service operation of the communication device.
  • the upgrade of the control plane and management plane software on the line card 40 does not interrupt the service, and the process of upgrading the software of the forwarding plane must also pass:
  • the old version software of the forwarding plane is deregistered and starts the new version software of the forwarding plane, and the data and data state of the new version software of the forwarding plane on the line card 40 is restored by the new version software on the current main control unit 20.
  • Update the data and data status of the new version of the software on the forwarding plane to make the forwarding entries of the new version forwarding plane take effect.
  • the time to upgrade the forwarding software is very short and hardly affects business operations.
  • Scenario Access gateway software upgrade scenario.
  • FIG 4 is a sequence diagram of the distributed access gateway software upgrade interaction.
  • Figure 5 to Figure 10 show the various states of the distributed access gateway during software upgrade.
  • the access gateway in the figure is the distribution of the user access network.
  • the device is configured with the main and standby main control boards.
  • Figure 5 shows the status of the distributed access gateway software before the upgrade.
  • the processing unit on the main control board is the main control unit. 1.
  • the processing unit on the standby main control board is master control 2.
  • there are 4 line cards configured to complete user access and forwarding processing.
  • There are 80 processes running on the main control unit such as the supporting process, the Border Gateway Protocol (BGP) process, the Routing Information Protocol (RIP) process, and the Open Shortest Path First (OSPF).
  • Border Gateway Protocol Border Gateway Protocol
  • RIP Routing Information Protocol
  • OSPF Open Shortest Path First
  • the line card has 30 processes, including the packet sending and receiving process, the routing and forwarding process, and the user access process. Due to the failure of the running software version, the service will be randomly disconnected. Therefore, you need to consider upgrading your software version to resolve device failures.
  • the four line cards of the access gateway use a line-by-line card upgrade solution. That is to upgrade each line card in turn, first upgrade the line card, then upgrade the line card, then upgrade the line card, and finally upgrade the line card.
  • the access gateway software version upgrade involves the following components: Old version software, new version software, version This management component, the main and standby management components.
  • the old version software and the new version software implement the same functions;
  • the version management component is responsible for managing the software version upgrade, failure rollback, etc.;
  • the active and standby management components are responsible for the active and standby management of the new and old versions of the software, and in the old version to the new
  • the active/standby switchover of the new version of the software is triggered, so that the new version of the software can seamlessly replace the old version and enter the working state, and the service is not interrupted.
  • Phase I the main control software upgrade:
  • S4 The new version of the software on Master 2 is started in the ready state after booting. Before the main control 2 goes into the main state operation, the new version of the software on the main control 2 does not provide services to the outside of the system;
  • the new version of the software on the main control 2 recovers the data and data status through the old version software on the main control 1, so that it can seamlessly take over the services provided by the old version software to the outside world after entering the working state.
  • the communication link for data and data state recovery may be a single or multiple communication links;
  • the VM on the master 1 After receiving the notification of the completion of the new version of the standby master, the VM on the master 1 triggers the master/slave switchover of the master master to the active-standby management component (ASM) on the master 1, as shown in the figure. 7 is shown. At the same time, the VM on the master 2 is triggered to the ASM on the master 2 Use master/slave switching of the master;
  • ASM active-standby management component
  • Phase II, line card software upgrade (line-by-line card upgrade, the following is an example of upgrading the line card as an example, other line cards are similar):
  • the VM on the line card After receiving the new version of the software recovery completion notification, the VM on the line card initiates a logout command to the control plane on the line 1 card and the old version of the management software. At the same time, an active/standby switchover command is initiated to the control plane of the line 1 card and the new version of the software on the management plane;

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Security & Cryptography (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Stored Programmes (AREA)
  • Hardware Redundancy (AREA)

Abstract

本发明公开了一种分布式通信设备软件升级方法,包括:依次在主控单元和线卡上升级软件,使新老版本软件同时运行,通过主用主控单元与备用主控单元的主备倒换、以及线卡上新版本软件和老版本软件的主备运行状态控制,使得新版本软件无缝接替老版软件本进入工作状态。本发明还同时公开了一种分布式通信设备软件升级系统。采用本发明的技术方案,实现了在需要的情况下再升级线卡的转发面软件,以避免软件升级带来不必要的业务中断,提高业务的可靠性和可用性,避免业务中断带来重大的经济损失,提升用户业务体验。

Description

一种分布式通信设备软件升级方法及系统 技术领域
本发明涉及通信技术领域, 尤其涉及一种分布式通信设备软件升级方 法及系统。 背景技术
随着人们对随时随地接入网络以获取信息的需求不断扩大, 各种有线 和无线终端不断涌现并接入到网络中。 为了提升用户体验, 网络中用于承 载用户网络业务的网络设备就显得越来越重要, 网络设备出现异常, 轻则 影响用户接入体验, 导致用户流失, 重则影响企业的电子商务网络的运行, 导致重大的经济损失。
在网络上运行的网络通信设备中, 为了性能和可扩展性等考虑, 很多 设备是基于多进程操作系统实现的分布式的通信设备。 在使用多进程操作 系统的通信设备中, 通常采用多个业务进程协同完成通信设备的路由学习、 数据转发功能。 通信设备与外部设备进行交互, 学习网络路由, 并计算生 成转发面使用的最佳路由表。 然后下发到转发面, 转发面再根据最佳路由 表完成数据的路由转发。
在网络设备运行过程中, 因可能存在软件故障或者新增功能特性, 不 得不面临各种各样的网络设备软件升级。 而目前还没有一种有效的方法能 够避免网络设备在软件升级过程中引起的业务中断。 发明内容
有鉴于此, 本发明实施例的主要目的在于提供一种一种分布式通信设 备软件升级方法及系统, 能够避免软件升级带来不必要的业务中断, 提高 业务的可靠性和可用性, 提升用户业务体验。
为达到上述目的, 本发明实施例的技术方案是这样实现的:
一种分布式通信设备软件升级方法, 应用于包括主用主控单元、 备用 主控单元以及线卡的分布式通信设备软件升级系统中, 所述方法包括: 主用主控单元通知备用主控单元启动新版本软件且以备用状态运行, 并通过主用主控单元上的老版本软件对备用主控单元上新版本软件进行数 据及数据状态恢复;
主用主控单元与备用主控单元进行主备状态倒换;
线卡启动控制面和管理面的新版本软件且以备用状态运行, 并通过当 或者, 通过线卡上的控制面和管理面的老版本软件对线卡上的所述新版本 软件进行数据及数据状态恢复;
线卡对控制面和管理面的老版本软件进行注销, 同时将控制面和管理 面的新版本软件以主用状态运行;
线卡刷新转发面软件的数据及数据状态。
优选地, 所述主用主控单元通知备用主控单元启动新版本软件且以备 用状态运行, 并通过主用主控单元上的老版本软件对备用主控单元上新版 本软件进行数据及数据状态恢复, 包括:
主用主控单元通知备用主控单元在备用主控单元上启动新版本软件; 主用主控单元上的老版本软件基于数据恢复组件对备用主控单元上新 版本软件进行数据及数据状态恢复。
优选地, 所述线卡启动控制面和管理面的新版本软件且以备用状态运 行, 并通过当前的主用主控单元对线卡上的所述新版本软件进行数据及数 据状态恢复, 或者, 通过线卡上的控制面和管理面的老版本软件对线卡上 的所述新版本软件进行数据及数据状态恢复, 包括: 线卡在主用主控单元与备用主控单元的主备状态倒换完成后, 启动线 卡上的控制面和管理面的新版本软件且以备用状态运行;
当前的主用主控单元的新版本软件基于数据恢复组件对线卡上的所述 新版本软件进行数据及数据状态恢复, 或者, 线卡上的控制面和管理面的 状态恢复;
当线卡上的所述数据及数据状态恢复完成后, 线卡对控制面和管理面 的老版本软件进行注销, 同时将控制面和管理面的新版本软件以主用状态 运行, 刷新转发面软件的数据及数据状态。
优选地, 所述方法还包括:
当线卡上的所述数据及数据状态恢复完成之后且在刷新转发面软件的 数据及数据状态之前, 升级线卡上的转发面软件。
优选地, 所述方法还包括:
在主用主控单元与备用主控单元进行主备状态倒换后, 将原主用主控 单元上的老版本软件升级为新版本软件并以备用状态运行。
一种分布式通信设备软件升级系统, 包括: 基于物理冗余的主用主控 单元和备用主控单元、 线卡; 其中,
所述主用主控单元, 配置为通知备用主控单元启动新版本软件; 通过 主用主控单元上的老版本软件对备用主控单元上的新版本软件进行数据及 数据状态恢复; 当所述数据及数据状态恢复完成后, 将运行状态倒换成备 用;
所述备用主控单元, 配置为启动新版本软件并以备用状态运行; 配合 主用主控单元对新版本软件进行数据及数据状态恢复; 当所述数据及数据 状态恢复完成后, 将运行状态倒换成主用;
所述线卡, 配置为在主用主控单元与备用主控单元的主备状态倒换完 成后, 启动线卡上的控制面和管理面的新版本软件且以备用状态运行, 并 恢复, 或者, 通过线卡上的控制面和管理面的老版本软件对线卡上的所述 新版本软件进行数据及数据状态恢复; 当线卡上的所述数据及数据状态恢 复完成后, 对控制面和管理面的老版本软件进行注销, 同时将控制面和管 理面的新版本软件以主用状态运行, 刷新转发面软件的数据及数据状态。
优选地, 所述主用主控单元包括: 第一版本管理组件和第一主备管理 组件; 所述备用主控单元包括: 第二版本管理组件和第二主备管理组件; 主控单元上的软件中均包含数据恢复组件;
所述第一版本管理组件, 配置为通知第二版本管理组件在备用主控单 元上启动新版本软件; 在所述数据及数据状态恢复完成后, 通知第一主备 管理组件将主用主控单元的运行状态倒换成备用;
所述第二版本管理组件, 配置为在备用主控单元上启动新版本软件; 在所述数据及数据状态恢复完成后, 通知第二主备管理组件将备用主控单 元的运行状态倒换成主用;
所述第一主备管理组件, 配置为对主用主控单元的运行状态进行主备 倒换;
所述第二主备管理组件, 配置为对备用主控单元的运行状态进行主备 倒换;
其中, 主用主控单元上的老版本软件基于数据恢复组件对备用主控单 元上新版本软件进行数据及数据状态恢复。
优选地, 所述线卡包括: 第三版本管理组件; 主控单元及线卡上的软 件均包含数据恢复组件;
所述第三版本管理组件, 配置为在主用主控单元与备用主控单元的主 备状态倒换完成后, 启动线卡上的控制面和管理面的新版本软件且以备用 状态运行; 当线卡上的所述数据及数据状态恢复完成后, 对控制面和管理 面的老版本软件进行注销, 同时将控制面和管理面的新版本软件以主用状 态运行, 刷新转发面软件的数据及数据状态;
其中, 当前的主用主控单元的新版本软件基于数据恢复组件对线卡上 的所述新版本软件进行数据及数据状态恢复, 或者, 线卡上的控制面和管 理面的老版本软件基于数据恢复组件对线卡上的所述新版本软件进行数据 及数据状态恢复。
优选地, 所述线卡, 还配置为当线卡上的所述数据及数据状态恢复完 成后且在刷新转发面软件的数据及数据状态之前, 升级线卡上的转发面软 件。
优选地, 所述主用主控单元, 还配置为在主用主控单元与备用主控单 元进行主备状态倒换后, 将原主用主控单元上的老版本软件升级为新版本 软件并以备用状态运行。
本发明实施例的分布式通信设备软件升级方法及系统, 依次在主控单 元和线卡上升级软件, 由于主控单元中包含基于物理冗余的主用主控单元 和备用主控单元, 线卡中可以使软件以主状态或者备状态运行即提供逻辑 用主控单元与备用主控单元的主备倒换、 以及线卡上新版本软件和老版本 软件的主备运行状态控制, 使得新版本软件无缝接替老版软件本进入工作 状态。 另外, 本发明实施例在线卡上升级控制面和管理面的软件, 在需要 的情况下再升级线卡的转发面软件, 以避免软件升级带来不必要的业务中 断, 提高业务的可靠性和可用性, 避免业务中断带来重大的经济损失, 提 升用户业务体验。 附图说明
图 1为本发明第一实施例中的分布式通信设备软件升级方法流程图; 图 2为本发明第二实施例中的分布式通信设备软件升级方法流程图; 图 3 为本发明第二实施例中的分布式通信设备软件升级系统组成示意 图;
图 4为本发明应用实例中分布式接入网关软件升级交互序列图; 图 5为本发明应用实例中分布式接入网关软件升级前的状态示意图; 图 6为本发明应用实例中分布式接入网关主控 2启动新版本软件并进 行数据及数据状态恢复的示意图;
图 7为本发明应用实例中分布式接入网关主控 2的运行状态倒换为主 用并启动线卡新版本软件进行数据及数据状态恢复的示意图;
图 8 为本发明应用实例中分布式接入网关线卡注销老版本软件的示意 图;
图 9为本发明应用实例中分布式接入网关线卡的新版本软件进入工作 状态示意图;
图 10为本发明应用实例中分布式接入网关主控 1的老版本软件升级到 新版本软件的示意图。 具体实施方式
为了能够更加详尽地了解本发明实施例的特点与技术内容, 下面结合 附图对本发明实施例的实现进行详细阐述, 所附附图仅供参考说明之用, 并非用来限定本发明实施例。
本发明实施例记载了一种分布式通信设备软件升级方法, 应用于包括 主用主控单元、 备用主控单元以及线卡的分布式通信设备软件升级系统中 , 所述方法包括以下两个阶段: 第一阶段, 主控单元软件升级过程; 第二阶 段, 线卡软件升级过程。
为了能够更加清楚地理解本发明实施例, 下面对本发明实施例中的相 关概念进行阐述。 主用主控单元、 备用主控单元合称为主控单元; 其中, 主控单元也称 为主处理单元, 在通信设备中的功能是负责系统管理、 路由计算等。
主用主控单元指的是主处理器上的软件运行在主状态, 并对外界提供 服务。
备用主控单元指的是主处理器上的软件运行在备状态, 不对外界提供 服务, 只有转为主状态后才能对外界提供服务。
线卡在通信设备中的功能是执行转发数据管理和数据转发等业务。 其中, 主用主控单元、 备用主控单元在实际应用中, 均可由中央处理 器(CPU )、 或数字信号处理器(DSP, Digital Signal Processor ), 或可编程 门阵列 (FPGA, Field - Programmable Gate Array ) 实现。
优选地, 主用主控单元包括: 第一版本管理组件和第一主备管理组件; 备用主控单元包括: 第二版本管理组件和第二主备管理组件; 主控单元上 软件中均包含数据恢复组件; 线卡包括: 第三版本管理组件; 主控单元及 线卡上的软件均包含数据恢复组件。 述, 如图 1所示, 所述方法包括以下步驟:
步驟 S101 , 主用主控单元通知备用主控单元启动新版本软件且以备用 状态运行, 并通过主用主控单元上的老版本软件对备用主控单元上新版本 软件进行数据及数据状态恢复。
优选地, 步驟 S101 , 包括:
步驟 A1 : 主用主控单元通知备用主控单元在备用主控单元上启动新版 本软件;
步驟 A2: 主用主控单元上的老版本软件基于数据恢复组件对备用主控 单元上新版本软件进行数据及数据状态恢复。
步驟 A1具体为:第一版本管理组件通知第二版本管理组件在备用主控 单元上启动新版本软件。
优选地, 步驟 A2, 包括:
步驟 A21 : 主用主控单元与备用主控单元之间通过各自的数据恢复组 件建立单条或多条通信链路;
步驟 A22: 主用主控单元上的老版本软件基于数据恢复组件向备用主 控单元上的新版本软件同步数据及数据状态。
相应地, 备用主控单元上的新版本软件也基于数据恢复组件接收主用 主控单元上的新版本软件同步过来的数据及数据状态。
步驟 S102, 主用主控单元与备用主控单元进行主备状态倒换。
如此, 使已经完成软件升级的备用主控单元进入主用工作状态, 原主 用主控单元进入备用工作状态, 其上的软件等待升级。
具体地, 在所述数据及数据状态恢复完成后, 第一版本管理组件通知 第一主备管理组件将主用主控单元的运行状态倒换成备用, 同时, 第二版 本管理组件通知第二主备管理组件将备用主控单元的运行状态倒换成主 用。
步驟 S103, 当前的主用主控单元通知线卡启动控制面和管理面的新版 本软件且以备用状态运行, 并通过当前的主用主控单元对线卡上的所述新 版本软件进行数据及数据状态恢复, 或者, 通过线卡上的控制面和管理面 具体地, 第三版本管理组件在主用主控单元与备用主控单元的主备状 态倒换完成后, 启动控制面和管理面的新版本软件且以备用状态运行; 当 线卡上的所述数据及数据状态恢复完成后, 对控制面和管理面的老版本软 件进行注销, 同时将控制面和管理面的新版本软件以主用状态运行, 刷新 转发面软件的数据及数据状态;
当前的主用主控单元的新版本软件基于数据恢复组件对线卡上的所述 新版本软件进行数据及数据状态恢复, 或者, 线卡上的控制面和管理面的 状态恢复。
在步驟 S103中, 所述通过当前的主用主控单元对线卡上的所述控制面 和管理面的新版本软件进行数据及数据状态恢复, 包括:
步驟 B1: 当前的主用主控单元与线卡之间基于数据恢复组件建立单条 或多条通信链路;
步驟 B2: 当前的主用主控单元的新版本软件基于数据恢复组件向线卡 上的控制面和管理面的新版本软件同步数据及数据状态。
相应地, 线卡上的控制面和管理面的新版本软件基于数据恢复组件接 收主用主控单元同步过来的数据及数据状态。
在步驟 S103中, 通过线卡上的控制面和管理面的老版本软件对线卡上 的所述新版本软件进行数据及数据状态恢复, 包括:
步驟 C1: 线卡上的控制面和管理面的老版本软件与线卡上的所述新版 本软件之间基于数据恢复组件建立单条或多条通信链路;
步驟 C2: 线卡上的控制面和管理面的老版本软件基于数据恢复组件向 线卡上的所述新版本软件同步数据及数据状态。
相应地, 线卡上的控制面和管理面的新版本软件基于数据恢复组件接 收线卡上的控制面和管理面的老版本软件同步过来的数据及数据状态。
步驟 S104, 当线卡上的所述数据及数据状态恢复完成后, 线卡对控制 面和管理面的老版本软件进行注销, 同时将控制面和管理面的新版本软件 以主用状态运行, 刷新转发面软件的数据及数据状态。
具体地, 当线卡上的所述数据及数据状态恢复完成后, 第三版本管理 组件对控制面和管理面的老版本软件进行注销, 同时将控制面和管理面的 新版本软件以主用状态运行, 刷新转发面软件的数据及数据状态。 优选地, 在步驟 S101之前, 所述方法还包括:
将新版本软件拷贝到通信设备的主控单元和线卡上; 或者, 将新版本 软件拷贝到通信设备的主控单元, 后续由线卡向主控单元请求获取新版本 软件进行升级。
这里的主控单元指的是处于主用状态的主控单元即主用主控单元, 因 为备用主控单元不对系统提供服务。
本发明实施例还记载了一种分布式通信设备软件升级方法, 应用于包 括主用主控单元、 备用主控单元以及线卡的分布式通信设备软件升级系统 中, 所述方法包括以下两个阶段: 第一阶段, 主控单元软件升级过程; 第 二阶段, 线卡软件升级过程; 如图 2所示, 所述方法包括以下步驟:
步驟 S201 , 主用主控单元通知备用主控单元启动新版本软件且以备用 状态运行, 并通过主用主控单元上的老版本软件对备用主控单元上新版本 软件进行数据及数据状态恢复。
优选地, 步驟 S201 , 包括:
步驟 A1: 主用主控单元通知备用主控单元在备用主控单元上启动新版 本软件;
步驟 A2: 主用主控单元上的老版本软件基于数据恢复组件对备用主控 单元上新版本软件进行数据及数据状态恢复。
步驟 A1具体为:第一版本管理组件通知第二版本管理组件在备用主控 单元上启动新版本软件。
优选地, 步驟 A2, 包括:
步驟 A21 : 主用主控单元与备用主控单元之间通过各自的数据恢复组 件建立单条或多条通信链路;
步驟 A22: 主用主控单元上的老版本软件基于数据恢复组件向备用主 控单元上的新版本软件同步数据及数据状态。 步驟 S202, 主用主控单元与备用主控单元进行主备状态倒换。
如此, 使已经完成软件升级的备用主控单元进入主用工作状态, 原主 用主控单元进入备用工作状态, 其上的软件等待升级。
具体地, 在所述数据及数据状态恢复完成后, 第一版本管理组件通知 第一主备管理组件将主用主控单元的运行状态倒换成备用, 同时, 第二版 本管理组件通知第二主备管理组件将备用主控单元的运行状态倒换成主 用。
步驟 S203 , 在主用主控单元与备用主控单元进行主备状态倒换后, 将 原主用主控单元上的老版本软件升级为新版本软件并以备用状态运行。
步驟 S204, 当前的主用主控单元通知线卡启动控制面和管理面的新版 本软件且以备用状态运行, 并通过当前的主用主控单元对线卡上的所述新 版本软件进行数据及数据状态恢复, 或者, 通过线卡上的控制面和管理面 具体地, 第三版本管理组件在主用主控单元与备用主控单元的主备状 态倒换完成后, 启动控制面和管理面的新版本软件且以备用状态运行; 当 线卡上的所述数据及数据状态恢复完成后, 对控制面和管理面的老版本软 件进行注销, 同时将控制面和管理面的新版本软件以主用状态运行, 刷新 转发面软件的数据及数据状态;
当前的主用主控单元的新版本软件基于数据恢复组件对线卡上的所述 新版本软件进行数据及数据状态恢复, 或者, 线卡上的控制面和管理面的 状态恢复。
步驟 S205 , 当线卡上的所述数据及数据状态恢复完成后, 线卡对控制 面和管理面的老版本软件进行注销, 同时将控制面和管理面的新版本软件 以主用状态运行, 升级线卡上的转发面软件, 刷新转发面软件的数据及数 据状态。
具体地, 当线卡上的所述数据及数据状态恢复完成后, 第三版本管理 组件对控制面和管理面的老版本软件进行注销, 同时将控制面和管理面的 新版本软件以主用状态运行, 升级线卡上的转发面软件, 刷新转发面软件 的数据及数据状态。 面和管理面软件升级过程类似, 之所以把转发面软件升级放在最后, 是因 为转发面的功能直接影响通信设备的业务运行, 先升级线卡上的控制面和 管理面软件并不会对业务造成中断, 转发面软件升级的过程也须经过: 转 发面老版本软件注销并启动转发面的新版本软件, 通过当前的主用主控单 元上的新版本软件对线卡上的所述转发面的新版本软件进行数据及数据状 态恢复, 刷新转发面新版本软件的数据及数据状态, 使新版本转发面的转 发表项生效。 升级转发面软件的时间非常短, 几乎不影响业务运行。
优选地, 在步驟 S201之前, 所述方法还包括:
将新版本软件拷贝到通信设备的主控单元和线卡上; 或者, 将新版本 软件拷贝到通信设备的主控单元, 后续由线卡向主控单元请求获取新版本 软件。
这里的主控单元指的是处于主用状态的主控单元即主用主控单元, 因 为备用主控单元不对系统提供服务。
本发明实施例还记载了一种分布式通信设备软件升级系统, 如图 3 所 示, 所述系统包括主控单元 10; 其中, 主控单元 10包含基于物理冗余的主 用主控单元 20和备用主控单元 30;
主用主控单元 20, 配置为通知备用主控单元 30启动新版本软件; 通过 主用主控单元 20上的老版本软件对备用主控单元 30上的新版本软件进行 数据及数据状态恢复; 当所述数据及数据状态恢复完成后, 将运行状态倒 换成备用。
备用主控单元 30, 配置为启动新版本软件并以备用状态运行; 配合主 用主控单元 20对新版本软件进行数据及数据状态恢复; 当所述数据及数据 状态恢复完成后, 将运行状态倒换成主用。
优选地, 主用主控单元 20包括: 第一版本管理组件 21和第一主备管 理组件 22, 备用主控单元 30包括: 第二版本管理组件 31和第二主备管理 组件 32, 主控单元 10上的软件中均包含数据恢复组件 11; 其中,
第一版本管理组件 21 ,配置为通知第二版本管理组件 31在备用主控单 元 30上启动新版本软件; 在所述数据及数据状态恢复完成后, 通知第一主 备管理组件 22将主用主控单元 20的运行状态倒换成备用, 使老版本软件 进入备用工作状态, 等待升级;
数据恢复组件 11 ,主用主控单元 20上的老版本软件基于数据恢复组件 11对备用主控单元 30上新版本软件进行数据及数据状态恢复;
第二版本管理组件 31 , 配置为在备用主控单元 30上启动新版本软件; 在所述数据及数据状态恢复完成后, 通知第二主备管理组件 32将备用主控 单元 30的运行状态倒换成主用, 使升级完成后的新版本软件进入主用工作 状态;
第一主备管理组件 22,配置为对主用主控单元 20的运行状态进行主备 倒换;
第二主备管理组件 32,配置为对备用主控单元 30的运行状态进行主备 倒换。
线卡 40, 配置为在主用主控单元 20与备用主控单元 30的主备状态倒 换完成后, 启动控制面和管理面的新版本软件且以备用状态运行, 并通过 复, 或者, 通过线卡上的控制面和管理面的老版本软件对线卡上的所述新 版本软件进行数据及数据状态恢复; 当线卡上的所述数据及数据状态恢复 完成后, 对控制面和管理面的老版本软件进行注销, 同时将控制面和管理 面的新版本软件以主用状态运行, 刷新转发面软件的数据及数据状态。
优选地, 线卡 40包括: 第三版本管理组件 41 , 主控单元及线卡上的软 件均包含数据恢复组件。
第三版本管理组件 41 , 配置为在主用主控单元 20与备用主控单元 30 的主备状态倒换完成后, 启动控制面和管理面的新版本软件且以备用状态 运行; 当线卡 40上的所述数据及数据状态恢复完成后, 对控制面和管理面 的老版本软件进行注销, 同时将控制面和管理面的新版本软件以主用状态 运行, 刷新转发面软件的数据及数据状态;
数据恢复组件 11 , 当前的主用主控单元 20的新版本软件基于数据恢复 组件 11对线卡 40上的所述新版本软件进行数据及数据状态恢复, 或者, 线卡 40上的控制面和管理面的老版本软件基于数据恢复组件 11对线卡 40 上的所述新版本软件进行数据及数据状态恢复。
本发明实施例还记载了一种分布式通信设备软件升级系统, 本实施例 的分布式通信设备软件升级系统与上面图 3 所述的分布式通信设备软件升 级系统大致相同, 区别在于, 本实施例的所述系统中,
主用主控单元 20, 还配置为在主用主控单元 20与备用主控单元 30进 行主备状态倒换后, 将原主用主控单元 20上的老版本软件升级为新版本软 件并以备用状态运行。
另外, 线卡 40,还配置为当线卡 40上的所述数据及数据状态恢复完成 后且在刷新转发面软件的数据及数据状态之前, 升级线卡 40上的转发面软 件。
优选地, 线卡 40上的第三版本管理组件 41对转发面软件升级过程与 对线卡 40上的控制面和管理面软件升级过程类似, 之所以把转发面软件升 级放在最后, 是因为转发面的功能直接影响通信设备的业务运行, 先升级 线卡 40上的控制面和管理面软件并不会对业务造成中断, 转发面软件升级 的过程也须经过: 转发面老版本软件注销并启动转发面的新版本软件, 通 过当前的主用主控单元 20上的新版本软件对线卡 40上的所述转发面的新 版本软件进行数据及数据状态恢复, 刷新转发面新版本软件的数据及数据 状态, 使新版本转发面的转发表项生效。 升级转发面软件的时间非常短, 几乎不影响业务运行。
基于上述实施例, 现介绍一个本发明的应用实例, 该应用实例的应用 场景和执行过程如下:
场景: 接入网关软件升级场景。
图 4为分布式接入网关软件升级交互序列图,图 5~图 10分别显示了分 布式接入网关进行软件升级时的各种状态, 图中的接入网关是完成用户接 入网络的分布式通信设备, 出于高可靠性、 高可用性的要求, 设备配置有 主备用主控板, 图 5 为分布式接入网关软件升级前的状态, 主用主控板上 的处理单元为主控 1 , 备用主控板上的处理单元为主控 2。 同时配置有 4块 线卡完成用户的接入和转发处理。 主控单元上运行有 80个进程, 如支撑进 程、 边界网关协议(BGP, Border Gateway Protocol )进程、 路由信息协议 ( RIP, Routing Information Protocol )进程、 开放式最短路径优先( OSPF, Open Shortest Path First )进程、转发信息库 ( FIB, Forward Information Base ) 进程等。 线卡有 30个进程, 包括收发包进程、 路由转发进程、 用户接入进 程等。 由于运行的软件版本存在故障, 会导致业务随机断流。 因此, 需要 考虑进行软件版本升级, 以解决设备故障。 接入网关的 4块线卡采用的是 逐线卡升级方案。 即依次对每块线卡升级, 先升级 1号线卡, 再升级 2号 线卡, 然后升级 3号线卡, 最后升级 4号线卡。
接入网关软件版本升级涉及如下组件: 老版本软件、 新版本软件、 版 本管理组件, 主备管理组件。 其中, 老版本软件和新版本软件实现的功能 相同; 版本管理组件负责管理软件的版本升级、 失败回滚等; 主备管理组 件负责对新老版本软件进行主备管理, 并在老版本向新版本完成数据及数 据状态的主备同步后触发新版本软件的主备倒换, 以实现软件新版本无缝 接替老版本进入工作状态, 完成业务不中断升级;
在网络上实施分布式接入网关软件升级的处理步驟如下:
阶段一、 主控软件升级:
S1 : 在进行软件升级前, 需要将待升级的新版本软件拷贝到接入网关 的主用主控上(设备自动拷贝到备用主控)。 线卡新版本软件通过主控来请 求获取;
S2: 在接入网关的主控 1上的老版本软件以主状态运行, 用户向主控 1 上的版本管理组件(VM, Version Manager )发起主控软件升级指令;
S3: 主控 1上的 VM收到主控软件升级指令后, 发送命令到主控 上 的 VM, 在主控 2上升级并启动新版本软件, 如图 6所示;
S4: 主控 2上的新版本软件启动后以备状态运行。 在主控 2转入主状 态运行前, 主控 2上的新版本软件不对系统外界提供服务;
S5: 主控 2上的新版本软件通过主控 1上的老版本软件进行数据及数 据状态恢复, 以便进入工作状态后能无缝接管老版本软件对外界提供的服 务。 优选地, 数据及数据状态恢复的通信链路, 可以是单条或多条通信链 路;
S6: 主控 2上的新版本软件在数据及数据状态的恢复完成后, 通过主 控 2上的 VM向主控 1上的 VM发起备用主控新版本启动完成通知;
S7: 主控 1上的 VM收到备用主控新版本启动完成通知后, 向主控 1 上的主备管理组件(ASM, Active-Standby Manager )触发主用主控的主备 倒换, 如图 7所示。 同时, 通过主控 2上的 VM向主控 2上的 ASM触发备 用主控的主备倒换;
S8: 主控 1主备倒换成新备用主控, 主控 2主备倒换成新主用主控。 新主用主控上的新版本软件接替老版本软件进入主工作状态, 对外界提供 服务。 新备用主控上的老版本软件不再对外界提供服务;
S9: 优选地, 如图 10所示, 在主控 1倒换成新备用主控的同时, 可以 将主控 1上的老版本软件升级为新版本软件并以备状态启动;
S10: 主控 1上的 VM将软件升级成功的结果反馈给用户, 指示可以进 行后续的线卡软件升级。
阶段二、 线卡软件升级(逐线卡升级, 下面以升级 1 号线卡为例进行 详细说明, 其他线卡类似):
S11: 在主控软件升级成功后, 用户向主控 2上的 VM发起 1号线卡软 件升级指令;
S12: 主控 2上的 VM收到 1号线卡软件升级指令后, 发送线卡软件升 级命令到 1号线卡上的 VM;
S13: 如图 7所示, 1号线卡上的 VM收到线卡软件升级命令后, 从主 控 获取新版本软件, 启动线卡上的控制面和管理面的新版本软件并以备 用状态运行, 在转入主用状态前, 控制面和管理面的新版本软件不对系统 外界提供服务;
S14: 为了避免业务中断, 1号线卡上的控制面和管理面的新版本软件 在进入工作状态前, 需要恢复对外提供服务所需要的数据及数据状态。 接 入网关的 1 号线卡上的新版本软件的数据及数据状态恢复过程采用多通信 链路实现;
S15: 1号线卡上的控制面和管理面的新版本软件的数据及数据状态恢 复通过主控 2进行;
S16: 1号线卡上的控制面和管理面的新版本软件在数据及数据状态的 恢复完成后, 向线卡上的 VM发起数据及数据状态同步完成通知;
S17: 1号线卡上的 VM收到新版本软件恢复完成通知后, 向 1号线卡 上的控制面和管理面的老版本软件发起注销指令。 同时, 向 1 号线卡上的 控制面和管理面的新版本软件发起主备倒换指令;
S18:如图 8所示, 1号线卡上的控制面和管理面的老版本软件收到 VM 的注销指令后, 进行注销处理;
S19:如图 9所示, 1号线卡上的控制面和管理面的新版本软件收到 VM 的主备倒换指令后, 执行主备倒换, 接替控制面和管理面的老版本软件进 入工作状态;
S20: 1号线卡上的控制面和管理面的新版本软件进入工作状态后, 按 需升级转发面的新版本软件, 并触发刷新转发表, 使得新版本转发表项生 效。
S21: 1号线卡上的 VM向主控 上的 VM反馈线卡软件升级结果。 通过具体实施方式的说明, 应当可对本发明为达成预定目的所采取的 技术手段及功效得以更加深入且具体地了解, 然而所附图示仅是提供参考 与说明之用, 并非用来对本发明加以限制。

Claims

权利要求书
1、 一种分布式通信设备软件升级方法, 应用于包括主用主控单元、 备 用主控单元以及线卡的分布式通信设备软件升级系统中, 所述方法包括: 主用主控单元通知备用主控单元启动新版本软件且以备用状态运行, 并通过主用主控单元上的老版本软件对备用主控单元上新版本软件进行数 据及数据状态恢复;
主用主控单元与备用主控单元进行主备状态倒换;
线卡启动控制面和管理面的新版本软件且以备用状态运行, 并通过当 或者, 通过线卡上的控制面和管理面的老版本软件对线卡上的所述新版本 软件进行数据及数据状态恢复;
线卡对控制面和管理面的老版本软件进行注销, 同时将控制面和管理 面的新版本软件以主用状态运行;
线卡刷新转发面软件的数据及数据状态。
2、 根据权利要求 1所述的分布式通信设备软件升级方法, 其中, 所述 主用主控单元通知备用主控单元启动新版本软件且以备用状态运行, 并通 过主用主控单元上的老版本软件对备用主控单元上新版本软件进行数据及 数据状态恢复, 包括:
主用主控单元通知备用主控单元在备用主控单元上启动新版本软件; 主用主控单元上的老版本软件基于数据恢复组件对备用主控单元上新 版本软件进行数据及数据状态恢复。
3、 根据权利要求 1所述的分布式通信设备软件升级方法, 其中, 所述 线卡启动控制面和管理面的新版本软件且以备用状态运行, 并通过当前的 行数据及数据状态恢复, 包括:
线卡在主用主控单元与备用主控单元的主备状态倒换完成后, 启动线 卡上的控制面和管理面的新版本软件且以备用状态运行;
当前的主用主控单元的新版本软件基于数据恢复组件对线卡上的所述 新版本软件进行数据及数据状态恢复, 或者, 线卡上的控制面和管理面的 状态恢复;
当线卡上的所述数据及数据状态恢复完成后, 线卡对控制面和管理面 的老版本软件进行注销, 同时将控制面和管理面的新版本软件以主用状态 运行, 刷新转发面软件的数据及数据状态。
4、 根据权利要求 1所述的分布式通信设备软件升级方法, 其中, 所述 方法还包括:
当线卡上的所述数据及数据状态恢复完成之后且在刷新转发面软件的 数据及数据状态之前, 升级线卡上的转发面软件。
5、 根据权利要求 1至 4任一项所述的分布式通信设备软件升级方法, 所述方法还包括:
在主用主控单元与备用主控单元进行主备状态倒换后, 将原主用主控 单元上的老版本软件升级为新版本软件并以备用状态运行。
6、 一种分布式通信设备软件升级系统, 包括: 主用主控单元和备用主 控单元、 线卡; 其中,
所述主用主控单元, 配置为通知备用主控单元启动新版本软件; 通过 主用主控单元上的老版本软件对备用主控单元上的新版本软件进行数据及 数据状态恢复; 当所述数据及数据状态恢复完成后, 将运行状态倒换成备 用;
所述备用主控单元, 配置为启动新版本软件并以备用状态运行; 配合 主用主控单元对新版本软件进行数据及数据状态恢复; 当所述数据及数据 状态恢复完成后, 将运行状态倒换成主用;
所述线卡, 配置为在主用主控单元与备用主控单元的主备状态倒换完 成后, 启动线卡上的控制面和管理面的新版本软件且以备用状态运行, 并 恢复, 或者, 通过线卡上的控制面和管理面的老版本软件对线卡上的所述 新版本软件进行数据及数据状态恢复; 当线卡上的所述数据及数据状态恢 复完成后, 对控制面和管理面的老版本软件进行注销, 同时将控制面和管 理面的新版本软件以主用状态运行, 刷新转发面软件的数据及数据状态。
7、 根据权利要求 6所述的分布式通信设备软件升级系统, 其中, 所述 主用主控单元包括: 第一版本管理组件和第一主备管理组件; 所述备用主 控单元包括: 第二版本管理组件和第二主备管理组件; 主控单元上的软件 中均包含数据恢复组件;
所述第一版本管理组件, 配置为通知第二版本管理组件在备用主控单 元上启动新版本软件; 在所述数据及数据状态恢复完成后, 通知第一主备 管理组件将主用主控单元的运行状态倒换成备用;
所述第二版本管理组件, 配置为在备用主控单元上启动新版本软件; 在所述数据及数据状态恢复完成后, 通知第二主备管理组件将备用主控单 元的运行状态倒换成主用;
所述第一主备管理组件, 配置为对主用主控单元的运行状态进行主备 倒换;
所述第二主备管理组件, 配置为对备用主控单元的运行状态进行主备 倒换;
其中, 主用主控单元上的老版本软件基于数据恢复组件对备用主控单 元上新版本软件进行数据及数据状态恢复。
8、 根据权利要求 6所述的分布式通信设备软件升级系统, 其中, 所述 线卡包括: 第三版本管理组件; 主控单元及线卡上的软件均包含数据恢复 组件;
所述第三版本管理组件, 配置为在主用主控单元与备用主控单元的主 备状态倒换完成后, 启动线卡上的控制面和管理面的新版本软件且以备用 状态运行; 当线卡上的所述数据及数据状态恢复完成后, 对控制面和管理 面的老版本软件进行注销, 同时将控制面和管理面的新版本软件以主用状 态运行, 刷新转发面软件的数据及数据状态;
其中, 当前的主用主控单元的新版本软件基于数据恢复组件对线卡上 的所述新版本软件进行数据及数据状态恢复, 或者, 线卡上的控制面和管 理面的老版本软件基于数据恢复组件对线卡上的所述新版本软件进行数据 及数据状态恢复。
9、 根据权利要求 6所述的分布式通信设备软件升级系统, 其中, 所述 线卡, 还配置为当线卡上的所述数据及数据状态恢复完成后且在刷新转发 面软件的数据及数据状态之前, 升级线卡上的转发面软件。
10、 根据权利要求 6至 9任一项所述的分布式通信设备软件升级系统, 其中, 所述主用主控单元, 还配置为在主用主控单元与备用主控单元进行 主备状态倒换后, 将原主用主控单元上的老版本软件升级为新版本软件并 以备用状态运行。
PCT/CN2013/082411 2012-11-01 2013-08-27 一种分布式通信设备软件升级方法及系统 WO2014067330A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP13851276.9A EP2916488B1 (en) 2012-11-01 2013-08-27 Method and system for upgrading software of distributed communication device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201210428526.0A CN102937925B (zh) 2012-11-01 2012-11-01 一种分布式通信设备软件升级方法及系统
CN201210428526.0 2012-11-01

Publications (1)

Publication Number Publication Date
WO2014067330A1 true WO2014067330A1 (zh) 2014-05-08

Family

ID=47696824

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2013/082411 WO2014067330A1 (zh) 2012-11-01 2013-08-27 一种分布式通信设备软件升级方法及系统

Country Status (3)

Country Link
EP (1) EP2916488B1 (zh)
CN (1) CN102937925B (zh)
WO (1) WO2014067330A1 (zh)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3304282A4 (en) * 2015-06-05 2019-02-27 Shell International Research Maatschappij B.V. SYSTEM AND METHOD FOR REPLACING AN APPLICATION FOR LIVE CONTROL / ESTIMATION WITH A STAINLESS APPLICATION
US20210119940A1 (en) * 2019-10-21 2021-04-22 Sap Se Dynamic, distributed, and scalable single endpoint solution for a service in cloud platform

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102937925B (zh) * 2012-11-01 2015-08-12 中兴通讯股份有限公司 一种分布式通信设备软件升级方法及系统
CN104410672B (zh) * 2014-11-12 2017-11-24 华为技术有限公司 网络功能虚拟化应用升级的方法、转发业务的方法及装置
CN104486394B (zh) * 2014-12-10 2018-01-12 新华三技术有限公司 不中断业务软件升级方法及装置
CN107580322A (zh) * 2017-08-28 2018-01-12 驭势科技(北京)有限公司 无人驾驶车辆软件系统的升级方法、装置和无人驾驶车辆
CN109728886A (zh) * 2017-10-27 2019-05-07 中兴通讯股份有限公司 一种适于跨版本升级的数据同步方法、装置、设备及存储介质
CN108170464A (zh) * 2017-12-27 2018-06-15 联想(北京)有限公司 一种版本升级方法、服务平台及电子设备
CN110879657B (zh) * 2018-09-05 2023-04-07 中兴通讯股份有限公司 跨版本升级方法、装置及设备、计算机可读存储介质
CN111490935B (zh) * 2020-03-31 2022-05-27 新华三信息安全技术有限公司 分布式架构中的信息同步方法、转发单元及控制单元
EP4198712A1 (de) * 2022-12-16 2023-06-21 Pfeiffer Vacuum Technology AG Vakuumsystem und verfahren zum betreiben eines solchen

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101383724A (zh) * 2008-08-29 2009-03-11 上海华为技术有限公司 网元升级方法及操作维护单元、单板和网元升级系统
CN102081540A (zh) * 2011-01-18 2011-06-01 杭州华三通信技术有限公司 一种分布式双主控网络设备软件升级方法和装置
CN102937925A (zh) * 2012-11-01 2013-02-20 中兴通讯股份有限公司 一种分布式通信设备软件升级方法及系统

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6898189B1 (en) * 2000-08-23 2005-05-24 Cisco Technology, Inc. Restartable spanning tree for high availability network systems
US7177919B1 (en) * 2000-11-28 2007-02-13 Cisco Technology, Inc. Method and system for controlling tasks on network cards
US8806472B2 (en) * 2007-09-27 2014-08-12 Ericsson Ab In-service software upgrade utilizing metadata-driven state translation
CN102195845B (zh) * 2010-03-03 2015-01-14 杭州华三通信技术有限公司 一种实现主控板主备切换的方法、装置和设备
US8495418B2 (en) * 2010-07-23 2013-07-23 Brocade Communications Systems, Inc. Achieving ultra-high availability using a single CPU
US8402454B2 (en) * 2010-09-22 2013-03-19 Telefonaktiebolaget L M Ericsson (Publ) In-service software upgrade on cards of virtual partition of network element that includes directing traffic away from cards of virtual partition

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101383724A (zh) * 2008-08-29 2009-03-11 上海华为技术有限公司 网元升级方法及操作维护单元、单板和网元升级系统
CN102081540A (zh) * 2011-01-18 2011-06-01 杭州华三通信技术有限公司 一种分布式双主控网络设备软件升级方法和装置
CN102937925A (zh) * 2012-11-01 2013-02-20 中兴通讯股份有限公司 一种分布式通信设备软件升级方法及系统

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3304282A4 (en) * 2015-06-05 2019-02-27 Shell International Research Maatschappij B.V. SYSTEM AND METHOD FOR REPLACING AN APPLICATION FOR LIVE CONTROL / ESTIMATION WITH A STAINLESS APPLICATION
US11093235B2 (en) 2015-06-05 2021-08-17 Shell Oil Company System and method for replacing a live control/estimation application with a staged application
US20210119940A1 (en) * 2019-10-21 2021-04-22 Sap Se Dynamic, distributed, and scalable single endpoint solution for a service in cloud platform
US11706162B2 (en) * 2019-10-21 2023-07-18 Sap Se Dynamic, distributed, and scalable single endpoint solution for a service in cloud platform

Also Published As

Publication number Publication date
CN102937925A (zh) 2013-02-20
EP2916488A1 (en) 2015-09-09
CN102937925B (zh) 2015-08-12
EP2916488A4 (en) 2015-11-25
EP2916488B1 (en) 2019-04-10

Similar Documents

Publication Publication Date Title
WO2014067330A1 (zh) 一种分布式通信设备软件升级方法及系统
WO2014067335A1 (zh) 一种分布式通信设备软件升级方法及系统
US7894334B2 (en) Hierarchical redundancy for a distributed control plane
AU2005236835B2 (en) Routing system and method for transparently recovering routing states after a failover or during a software upgrade
WO2014067324A1 (zh) 一种补丁软件升级方法及系统
EP1528735B1 (en) High availability of recources in telecommunications network using synchronized redundancy mechanism
US9992058B2 (en) Redundant storage solution
JPH0363744A (ja) コンピユータ・ネツトワークにおける適用業務セッションの保存方法、制御方法及び保存装置
JP2005535241A (ja) マルチコンピュータ・アーキテクチャにおけるアプリケーション・ソフトウェアの移動方法、前記移動方法を用いて作動の連続性を実現するマルチコンピュータ方法および装置
WO2016063114A1 (en) System and method for disaster recovery of cloud applications
CN102388570B (zh) 一种主备模式下的单板运行方法及系统
WO2013037314A1 (zh) 用于数据处理中心容灾备份的系统及方法
JP7161008B2 (ja) アプリケーション冗長化管理システムおよびアプリケーション冗長化管理方法
CN102420746A (zh) 组播流量的转发方法及网络设备
CN112187523A (zh) 一种网络高可用实现方法及超融合系统
CN109412943B (zh) 一种sdn控制器集群流量处理方法、装置、设备及存储介质
JP2009075710A (ja) 冗長化システム
JP2009246475A (ja) 冗長構成を有する通信システム及び該システムによる系切り替え方法
CN100413261C (zh) 数据恢复的方法和系统
KR101596140B1 (ko) 이중화 구조에 기초한 tcp 기반의 고장 감내 장치 및 방법
KR101401006B1 (ko) 고가용성 시스템에서 소프트웨어 업데이트를 수행하기 위한 방법 및 장치
JP5344712B2 (ja) データ整合方法及びサービス提供装置
JP2015138987A (ja) 通信システムおよび通信システムにおけるサービス復旧方法
JP5614132B2 (ja) 二重化システム、そのデータ処理方法およびデータ処理装置
JP2007274256A (ja) 多重化システム及び通信制御装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13851276

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2013851276

Country of ref document: EP