CN116032731A - Method and device for realizing hot backup of RapidIO network system - Google Patents

Method and device for realizing hot backup of RapidIO network system Download PDF

Info

Publication number
CN116032731A
CN116032731A CN202310308153.1A CN202310308153A CN116032731A CN 116032731 A CN116032731 A CN 116032731A CN 202310308153 A CN202310308153 A CN 202310308153A CN 116032731 A CN116032731 A CN 116032731A
Authority
CN
China
Prior art keywords
host
standby
machine
multicast event
rapidio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202310308153.1A
Other languages
Chinese (zh)
Other versions
CN116032731B (en
Inventor
朱珂
张明伟
徐庆阳
王盼
钟丹
常超
张波
肖峰
闻亮
毛英杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jingxin Microelectronics Technology Tianjin Co Ltd
Original Assignee
Jingxin Microelectronics Technology Tianjin Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jingxin Microelectronics Technology Tianjin Co Ltd filed Critical Jingxin Microelectronics Technology Tianjin Co Ltd
Priority to CN202310308153.1A priority Critical patent/CN116032731B/en
Publication of CN116032731A publication Critical patent/CN116032731A/en
Application granted granted Critical
Publication of CN116032731B publication Critical patent/CN116032731B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

The disclosure provides a method and a device for realizing hot backup of a rapidIO network system. The method comprises the following steps: establishing a multicast event controller transmission path from the host to the standby; the host sends the multicast event control symbol according to fixed interval cycle; the standby machine detects whether the multicast event control symbol sent by the host machine is received periodically; the standby machine judges that the host machine fails according to the time interval of receiving the multicast event control symbol; the standby machine starts a network take-over program and is switched to be a new host machine. In addition, in the rapidIO network protocol, the multicast event controller has the highest priority, and compared with the scheme of carrying out heartbeat communication by adopting data packets such as maintenance packets, doorbell and the like, the instant response capability is more outstanding, and the performance and the reliability degree are obviously improved.

Description

Method and device for realizing hot backup of RapidIO network system
Technical Field
The disclosure relates to an embedded signal processing technology in the technical field of computers and the RapidIO network field, in particular to a method and a device for realizing hot backup of a RapidIO network system.
Background
RapidIO is a high-performance, low-pin-count, packet-switching-based interconnect architecture, which is an open interconnect technology standard designed to meet and future high-performance embedded system requirements. rapidIO is mainly applied to internal interconnection of an embedded system, supports chip-to-chip and board-to-board communication, and can be used as a backboard (Backplane) connection of embedded equipment.
The RapidIO protocol has very high advantages in terms of bandwidth, time delay, complexity, flexibility, reliability and the like of network interconnection for a high-performance embedded communication system, and is the best choice in embedded interconnection technology. Generally, rapidIO networks are composed of processing devices (PE, processing Element), which are mainly responsible for generating, transceiving and processing data packets, and switching devices (SWITCH), which accomplish the reception and forwarding of data packets. In the processing device, there is usually one Host node to complete network maintenance management work such as initial enumeration, route deployment, fault handling and the like of the RapidIO network.
From the reliability point of view, when the Host node itself or the network access of RapidIO fails, a hot backup mechanism is needed to ensure that the service and management of the RapidIO system cannot be out of control. The current mainstream hot standby technology is a dual hot standby system, and the implementation scheme can be divided into whether to rely on a third party arbitration mechanism. It is not difficult to find that the scheme adopting the third party arbitration mechanism is completely established on the premise that the third party arbitration can normally run, and the reliability of the whole system is not further improved; correspondingly, if a heartbeat communication mechanism is required to be established between the main machine and the standby machine without the aid of a third-party arbitration mechanism, when the main machine fails, the standby machine immediately starts a take-over program after sensing, so that the main machine and the standby machine are detected and switched, and the application is common in terms of overall cost and system complexity.
At present, a heartbeat communication mechanism between main computers and a standby computer in a RapidIO network mainly depends on additional hardware connection, such as General-purpose input/output (GPIO), and the like, so that the disadvantage in practical application is obvious, besides the increase of hardware cost, the hardware access of the main computers and the standby computers is difficult to dynamically adjust, and the overall network topology is greatly limited. The other implementation mode is to periodically send and maintain data packets such as a write, a doorbell and the like, and although extra hardware overhead is not required to be introduced, the data packets tend to occupy channel resources in a network, and because data interaction between forwarding and normal processing nodes of the data packets can generate conflict on routing configuration, the forwarding priority of the data packets is difficult to guarantee.
Disclosure of Invention
The disclosure provides a method and a device for realizing hot backup of a rapidIO network system.
In a first aspect, the present disclosure provides a method for implementing hot backup of a RapidIO network system, which is applicable to the RapidIO network system, where the RapidIO network system includes a plurality of switching devices and a plurality of endpoint devices, and the endpoint devices include a host and a standby machine, and the method includes: establishing a multicast event controller transmission path from the host to the standby; the host sends the multicast event control symbol according to fixed interval cycle; the standby machine detects whether the multicast event control symbol sent by the host machine is received periodically; the standby machine judges that the host machine fails according to the time interval of receiving the multicast event control symbol; the standby machine starts a network take-over program and is switched to be a new host machine.
In some alternative embodiments, before the establishing the multicast event controller transmission path from the host to the standby, the method further includes: the method comprises the steps that the host completes equipment detection and discovery of a rapidIO network through network enumeration operation, and a rapidIO channel between the host and other equipment in the rapidIO network is established; and determining the path from the host to the standby according to the network topology data obtained by the network enumeration operation.
In some alternative embodiments, the establishing the multicast event controller transmission path from the host to the standby includes: judging whether the path is empty or not; if not, selecting ports of the first switching device and the adjacent device in the path, starting the multicast event controller to send and enable, deleting the first switching device from the path, and returning to the step of judging whether the path is empty; if yes, confirming that the multicast event controller transmission path from the host to the standby is established.
In some alternative embodiments, the detecting, by the standby device, whether the multicast event controller sent by the host device is periodically received includes: the standby machine receives the first multicast event control Fu Shiji and records the receiving time, starts the standby machine control program, and continuously records the time of receiving the multicast event control symbol at least twice, so as to calculate the arithmetic average value Ta of the time spent by the host machine for transmitting the multicast event control symbol to the standby machine every time, and the arithmetic average value Ta is used as the period for the standby machine to detect the heartbeat of the host machine.
In some alternative embodiments, the backup determines a value not less than Ta, set to a time limit TL for detecting the loss of the heartbeat of the host.
In some alternative embodiments, the standby machine determining that the host machine fails according to the time interval of receiving the multicast event control symbol includes: the standby machine records a time interval Ti adjacent to two times of receiving the multicast event controller; if Ti is larger than the time limit TL or the time interval from the last time of receiving the multicast event controller exceeds the time limit TL, judging that the host fails and stopping detection.
In a second aspect, the present disclosure provides an implementation apparatus for hot backup of a RapidIO network system, which is applicable to the RapidIO network system, where the RapidIO network system includes a plurality of switching devices and a plurality of endpoint devices, and the endpoint devices include a host and a standby machine, and the apparatus includes: a setup module configured to setup a multicast event controller transmission path from the host to the standby; a transmission module configured to cause the host to cyclically transmit multicast event control symbols at fixed intervals; a detection module configured to enable the standby machine to detect whether the multicast event controller sent by the host machine is periodically received; a judging module configured to cause the standby machine to judge that the host machine fails according to a time interval in which the multicast event control symbol is received; and the switching module is configured to enable the standby machine to start a network takeover program and switch to a new host machine.
In some alternative embodiments, the determination module is further configured to: recording a time interval Ti adjacent to two times of receiving the multicast event controller by the standby machine; if Ti is larger than the time limit TL or the time interval from the last time of receiving the multicast event controller exceeds the time limit TL, judging that the host fails and stopping detection.
In a third aspect, the present disclosure provides a computer device comprising: one or more processors; and a storage device, on which one or more programs are stored, which when executed by the one or more processors, cause the one or more processors to implement the method for implementing hot standby of RapidIO network system according to the first aspect.
In a fourth aspect, the present disclosure provides a computer-readable storage medium, on which a computer program is stored, which when executed by one or more processors, implements a method for implementing a hot standby of a RapidIO network system according to the first aspect.
In order to solve the technical problem of performing main and standby heartbeat communication based on external hardware connection or data packets in a rapidIO network, the present disclosure provides a method and a device for implementing hot backup of a rapidIO network system. The method and the system utilize the multicast event controller as a heartbeat communication mode between the main machine and the standby machine, the main machine periodically sends the multicast event controller to the standby machine as a heartbeat according to the appointed time, and the standby machine judges the main machine fault when the main machine does not have the heartbeat, and immediately starts the take over to complete the switching of the main machine and the standby machine. Therefore, no additional hardware connection is needed, so that the cost of hardware is effectively reduced, and the flexibility of network topology design is greatly improved by getting rid of the constraint of hardware connection lines between the main machine and the standby machine.
In addition, in the rapidIO network protocol, the multicast event controller has the highest priority, and compared with the scheme of adopting data packets such as maintenance packets, doorbell and the like to carry out heartbeat communication, the timely response capability is more outstanding, and the performance and the reliability degree are obviously improved.
In addition, the multicast event controller forwarding is independent of a routing table of a data packet, has the characteristic of no coupling with application data service naturally, does not need to consume equipment ID and is independent of a data service routing system, and is perfectly suitable for the scene of the control layer of the system. In addition, the multicast event controller supports port-level forwarding switch control in exchange, so that the physical position of the standby machine is not required to be fixed when the main machine and the standby machine are designed, and different devices can be changed as the standby machine in different time periods according to service requirements.
Therefore, the present disclosure can better solve various problems existing in the existing schemes for performing active/standby heartbeat communication through external hardware connection or data packets.
Drawings
Other features, objects and advantages of the present disclosure will become more apparent upon reading of the detailed description of non-limiting embodiments, made with reference to the following drawings:
FIG. 1 is a flow chart of one embodiment of a method of implementing a hot-standby of a RapidIO network system in accordance with the present disclosure;
FIG. 2 is an exemplary system architecture according to one embodiment of the present disclosure;
FIG. 3 is a block diagram of one embodiment of an implementation apparatus for hot-standby of a RapidIO network system according to the present disclosure;
fig. 4 is a schematic diagram of a hardware composition structure of a computer device according to an embodiment of the present disclosure.
Description of the embodiments
The present disclosure is described in further detail below with reference to the drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the invention and are not limiting of the invention. It should be noted that, for convenience of description, only the portions related to the present invention are shown in the drawings.
In the description of the present disclosure, it should be noted that the terms "center," "upper," "lower," "left," "right," "vertical," "horizontal," "inner," "outer," and the like indicate orientations or positional relationships, which are merely for convenience in describing the present disclosure and simplifying the description, and do not indicate or imply that the devices or elements being referred to must have a particular orientation, be configured and operated in a particular orientation, and thus should not be construed as limiting the present disclosure. Furthermore, the terms "first," "second," and "third" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance.
In the description of the present disclosure, it should be noted that, unless explicitly specified and limited otherwise, the terms "mounted," "connected," and the like are to be construed broadly and may be, for example, fixedly connected, detachably connected, or integrally connected; can be mechanically or electrically connected; can be directly connected or indirectly connected through an intermediate medium. The specific meaning of the terms in this disclosure will be understood by those of ordinary skill in the art as the case may be.
In the description of the present disclosure, it should be noted that, without conflict, embodiments of the present disclosure and features of the embodiments may be combined with each other.
Referring to fig. 1, fig. 1 is a flowchart of one embodiment of a method for implementing a hot-standby of a RapidIO network system according to the present disclosure. The method is suitable for the rapidIO network system, and the rapidIO network system can comprise a plurality of switching devices (i.e. switching devices) and a plurality of endpoint devices (i.e. processing devices), wherein the endpoint devices are connected with the switching devices. Here, no additional hardware connection is required, so there is no constraint on the topology design of RapidIO network, and all devices need to be connected through RapidIO channels. Among the plurality of endpoint devices, one endpoint device can be selected randomly or according to requirements as a main control processing node (i.e. a host), and the host is responsible for initial enumeration, configuration deployment, fault management and the like of the whole rapidIO network; and another endpoint device can be selected as a backup processing node (i.e. a standby machine) randomly or on demand, and similarly, the standby machine is selected without constraint, and the physical distance from the host machine is not required.
As shown in fig. 1, the implementation method of hot backup of the RapidIO network system in the embodiment of the disclosure mainly includes the following steps:
in the initial stage of the system, the host initiates network enumeration operation (or initiates preemptive network enumeration simultaneously) to complete device detection and discovery of the whole RapidIO network, and at this time, rapidIO channels of the host and other devices (including all switching devices and endpoint devices in the standby) in the RapidIO network are established.
Step 12, according to the network topology data obtained by enumeration, a complete path P from the host to the standby is planned (or determined), mainly including intermediate switching devices and corresponding ports of the path, and then, starting from the direct connection of the host to the switching devices, the switching devices included in the path P are configured one by one through maintenance packets, and the port Multicast Event Controller (MECS) of the next switching device (or standby) is started to transmit and enable, so that a multicast event controller transmission path from the host to the standby is established. Here, the step of establishing the multicast event controller transmission path from the host to the standby may include: judging whether the path P is empty or not, if not, selecting ports of a first switching device and adjacent devices in the path P, enabling a multicast event controller forwarding switch, deleting the first switching device from the path P, and returning to the step of judging whether the path P is empty or not, so as to ensure that the path P is empty; if the path P is determined to be empty, the transmission enable of the multicast event controller indicating the port of each switching device in the path is turned on, and it can be confirmed that the transmission path of the multicast event controller from the host to the standby is established.
And step 13, initializing heartbeat communication setting between the host and the standby, and circularly transmitting the multicast event controller as heartbeat by the host according to a fixed interval T.
In step 14, the standby machine records the receiving time T0 from the receiving of the first multicast event control Fu Shiji, and starts the standby machine control program to always detect whether there is a multicast event controller periodically received from the host machine.
Step 15, the standby machine continuously records the time of receiving the multicast event controller at least twice, so as to calculate the arithmetic average value Ta of the time spent by the host machine to transmit the multicast event controller to the standby machine each time, and the arithmetic average value Ta is used as the period for the standby machine to detect the heartbeat of the host machine. Illustratively, the second multicast time controller is received for time T1, and so on, with T2 (time of receipt of the third multicast time controller), T3 (time of receipt of the fourth multicast time controller pair), and so on, and the arithmetic mean Ta of the time taken for the host to send the multicast event controller to the host for three times corresponding to T1, T2, and T3 is calculated and recorded as the period for the host to detect the heartbeat. The initial setting of the heartbeat communication between the main machine and the standby machine is completed.
In step 16, considering that jitter may exist in network transmission, the time limit TL for determining heartbeat loss may be properly widened compared with Ta, and the value may be customized according to a specific application scenario. That is, the standby machine may determine a value not smaller than Ta according to the application scenario, network jitter, and other factors, and the time limit TL for detecting the heartbeat loss of the host machine may be referred to as a heartbeat loss determination time limit.
And step 17, the standby machine always detects the heartbeat information (the multicast event controller) sent by the host machine, and judges whether the host machine fails or not according to the time interval of receiving the multicast event controller. Illustratively, a time interval Ti is recorded adjacent to two times; if Ti does not exceed TL, the host is considered normal, and circulation waiting is continued; otherwise, if Ti is greater than TL or the time interval from the last heartbeat exceeds TL, the host computer is regarded as fault, and the cycle detection is stopped.
And step 18, if the standby machine judges that the host machine has a fault, the standby machine starts a network take-over program, and the role finishes the switching from the standby machine to the host machine, namely, the standby machine is switched to be a new host machine, so that the original host machine is replaced to maintain and manage the whole rapidIO network.
In the following, an application scenario is taken as an example in conjunction with the RapidIO network topology example shown in fig. 2, to further explain the method of the embodiment of the present disclosure. The RapidIO network topology shown in fig. 2 includes a plurality of switching devices (switches, SWs) and a plurality of endpoint devices. The plurality of switching devices are respectively denoted by SW1, SW2, … …, SWn, (n is a positive integer). The plurality of endpoint devices include, but are not limited to, PPC (PowerPC, a Reduced Instruction Set (RISC) architecture central processor), DSP (digital signal processor ), FPGA (Field Programmable Gate Array, programmable array logic), etc. The plurality of endpoint devices include a Host (Host) and a standby.
To highlight implementation of the disclosed solution, the RapidIO network shown in the topology of fig. 2 does not list enough endpoint devices, and the implementation of the disclosed solution is as follows:
1) Initiating network enumeration by a Host to obtain network topology including all switching devices and endpoint devices, wherein a routing path of the Host and any device in the network is provided;
2) Planning a path between a Host and a standby machine based on the network topology obtained in the previous step: p= { ( SW 1,0, 5), ( SW 4,2, 9), ( SW 3,7, 4), ( SW 5,13, 5), ( SW 6,10, 2) }, including all exchanges of Host to the whole path of the standby, and corresponding ingress and egress ports; here, 5 in SW1 in ( SW 1,0, 5) represents port 5, and 2 and 9 in ( SW 4,2, 9) represent port 2 and port 9, respectively, which are not listed one by one;
3) Configuring each switching device in the path P one by one, and respectively starting multicast event controller forwarding enabling of the SW1 port 5, the SW4 port 9, the SW3 port 4, the SW5 port 5 and the SW6 port 2, so that the heartbeat communication transmission path between the main machine and the standby machine is established;
4) Initial setting of heartbeat communication between the main machine and the standby machine is started to be executed:
a) The host circularly transmits the multicast event controller according to a fixed interval T;
b) Recording time T0 when the standby machine receives the first multicast event control symbol, and starting a standby machine control program;
c) The standby machine receives the second multicast time control symbol time T1, and the like, and T2 and T3 are calculated, and an arithmetic average value Ta of time spent by the standby machine for transmitting the multicast event control symbol to the standby machine by the host machine for three times corresponding to the T1, the T2 and the T3 is recorded as a period for detecting the heartbeat of the host machine by the standby machine;
5) The method comprises the steps of integrating network transmission, service scenes and other conditions, and determining heartbeat loss judging time limit TL=2Ta;
6) So far, the host computer has ready all configuration about heartbeat communication, can start to deploy network data service, and the data interaction of the whole rapidIO network application layer is fully developed immediately; as the real-time load and link state of the network may change dynamically, the paths between the host and the network may also fail;
7) The standby machine always detects heartbeat information sent by the host machine and records a time interval Ti adjacent to two times, if the Ti does not exceed TL, the standby machine is regarded as normal to the host machine, and circulation waiting is continued; otherwise, when Ti is larger than TL or the time from the last heartbeat exceeds TL, the host computer is regarded as fault, and the cycle detection is stopped;
8) The standby machine starts a network take-over program, and the role finishes the switching from the standby machine to the host machine, so that the original host machine is replaced to maintain and manage the whole rapidIO network.
Referring to fig. 3, fig. 3 is a block diagram of one embodiment of an implementation apparatus of a hot standby of a RapidIO network system according to the present disclosure. The device is suitable for a rapidIO network system, wherein the rapidIO network system comprises a plurality of switching devices and a plurality of endpoint devices, and the endpoint devices comprise a host and a standby.
As shown in fig. 3, an implementation apparatus 30 for hot backup of RapidIO network system according to an embodiment of the disclosure includes:
a setup module 31 configured to setup a multicast event controller transmission path from a host to a standby;
a transmission module 32 configured to cause the host to cyclically transmit the multicast event control symbol at regular intervals;
a detection module 33 configured to enable the standby to detect whether a multicast event controller sent from the host is periodically received;
a judging module 34 configured to cause the standby machine to judge that the host machine fails according to the time interval of receiving the multicast event controller;
the switching module 35 is configured to cause the standby to start the network takeover procedure and switch to a new host.
In some alternative embodiments, the determination module 34 is further configured to: recording a time interval Ti adjacent to two times of receiving the multicast event controller by the standby machine; if Ti is greater than the time limit TL or the time interval from the last time the multicast event controller is received exceeds the time limit TL, judging that the host fails and stopping detection.
It should be noted that, the implementation details and technical effects of each module in the apparatus of this embodiment may refer to the descriptions of other embodiments in this disclosure, which are not described herein. The implementation scheme in each module of the device has a variety, so long as the purpose of the module can be achieved, and the practical deployment is not limited to the specific implementation scheme.
Referring to fig. 4, fig. 4 is a schematic structural diagram of one embodiment of a computer device for implementing a server according to the present disclosure. As shown in fig. 4, a computer device 400 of the present disclosure may include:
one or more processors 401;
a memory 402 having one or more programs 403 stored thereon;
components such as processor 401 and memory 402 may be coupled together by bus system 404; bus system 404 is used to implement the connected communications between these components;
the one or more programs 403, when executed by the one or more processors 301, cause the one or more processors 401 to implement the implementation method of hot-standby for RapidIO network systems as disclosed in the method embodiments above.
The bus system 404 may include a power bus, a control bus, and a status signal bus in addition to a data bus. Memory 402 may be volatile memory or nonvolatile memory, and may include both volatile and nonvolatile memory. The processor 401 may be an integrated circuit chip with signal processing capability, may be a general purpose processor, a digital signal processor (DSP, digital Signal Processor), or other programmable logic device, discrete gate or transistor logic device, discrete hardware components, or the like.
The disclosed embodiments also provide a computer-readable storage medium having stored thereon a computer program which, when executed by one or more processors, implements a method of implementing a hot backup of a RapidIO network system as disclosed in the method embodiments above.
The present disclosure is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the disclosure. It will be understood that each flow and/or block of the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
It should be understood that the terms "system" and "network" are often used interchangeably herein in this disclosure. The term "and/or" in this disclosure is merely one association relationship describing the associated object, and indicates that three relationships may exist, for example, a and/or B may indicate: a exists alone, A and B exist together, and B exists alone. In addition, the character "/" in the present disclosure generally indicates that the front and rear association objects are an or relationship.
The technical scheme of the present disclosure is described in detail through the specific embodiments. In the foregoing embodiments, the descriptions of the embodiments are each focused, and for those portions of one embodiment that are not described in detail, reference may be made to the related descriptions of other embodiments.
The foregoing description is only of the preferred embodiments of the present disclosure and description of the principles of the technology being employed. It will be appreciated by those skilled in the art that the scope of the invention referred to in this disclosure is not limited to the specific combination of features described above, but encompasses other embodiments in which features described above or their equivalents may be combined in any way without departing from the spirit of the invention. Such as those described above, are mutually substituted with the technical features having similar functions disclosed in the present disclosure (but not limited thereto).

Claims (10)

1. The method for realizing the hot backup of the rapidIO network system is characterized by being suitable for the rapidIO network system, wherein the rapidIO network system comprises a plurality of switching devices and a plurality of endpoint devices, the endpoint devices comprise a host computer and a standby computer, and the method comprises the following steps:
establishing a multicast event controller transmission path from the host to the standby;
the host sends the multicast event control symbol according to fixed interval cycle;
the standby machine detects whether the multicast event control symbol sent by the host machine is received periodically;
the standby machine judges that the host machine fails according to the time interval of receiving the multicast event control symbol;
the standby machine starts a network take-over program and is switched to be a new host machine.
2. The method of claim 1, wherein prior to the establishing the multicast event controller transmission path from the host to the slave, the method further comprises:
the method comprises the steps that the host completes equipment detection and discovery of a rapidIO network through network enumeration operation, and a rapidIO channel between the host and other equipment in the rapidIO network is established;
and determining the path from the host to the standby according to the network topology data obtained by the network enumeration operation.
3. The method of claim 2, wherein the establishing a multicast event controller transmission path from the host to the standby comprises:
judging whether the path is empty or not;
if not, selecting ports of the first switching device and the adjacent device in the path, starting the multicast event controller to send and enable, deleting the first switching device from the path, and returning to the step of judging whether the path is empty;
if yes, confirming that the multicast event controller transmission path from the host to the standby is established.
4. The method of claim 1, wherein the detecting by the standby machine whether the multicast event controller sent by the host machine is received periodically comprises:
the standby machine receives the first multicast event control Fu Shiji and records the receiving time, starts the standby machine control program, and continuously records the time of receiving the multicast event control symbol at least twice, so as to calculate the arithmetic average value Ta of the time spent by the host machine for transmitting the multicast event control symbol to the standby machine every time, and the arithmetic average value Ta is used as the period for the standby machine to detect the heartbeat of the host machine.
5. The method as recited in claim 4, further comprising:
the standby determines a value not smaller than Ta and sets a time limit TL for detecting the heartbeat loss of the host.
6. The method of claim 5, wherein the standby determining that the host failed based on the time interval in which the multicast event control symbol was received comprises:
the standby machine records a time interval Ti adjacent to two times of receiving the multicast event controller;
if Ti is larger than the time limit TL or the time interval from the last time of receiving the multicast event controller exceeds the time limit TL, judging that the host fails and stopping detection.
7. The utility model provides a realization device of rapidIO network system hot backup which characterized in that is applicable to rapidIO network system, rapidIO network system includes a plurality of switching devices and a plurality of endpoint device, endpoint device includes host computer and standby machine, the device includes:
a setup module configured to setup a multicast event controller transmission path from the host to the standby;
a transmission module configured to cause the host to cyclically transmit multicast event control symbols at fixed intervals;
a detection module configured to enable the standby machine to detect whether the multicast event controller sent by the host machine is periodically received;
a judging module configured to cause the standby machine to judge that the host machine fails according to a time interval in which the multicast event control symbol is received;
and the switching module is configured to enable the standby machine to start a network takeover program and switch to a new host machine.
8. The apparatus of claim 7, wherein the determination module is further configured to: recording a time interval Ti adjacent to two times of receiving the multicast event controller by the standby machine; if Ti is larger than the time limit TL or the time interval from the last time of receiving the multicast event controller exceeds the time limit TL, judging that the host fails and stopping detection.
9. A computer device, comprising:
one or more processors;
a storage device having one or more programs stored thereon,
the one or more programs, when executed by the one or more processors, cause the one or more processors to implement the method of implementing a hot standby for a RapidIO network system as claimed in any of claims 1-6.
10. A computer readable storage medium, having stored thereon a computer program which when executed by one or more processors implements a method of implementing a hot standby for a RapidIO network system according to any of claims 1-6.
CN202310308153.1A 2023-03-28 2023-03-28 Method and device for realizing hot backup of RapidIO network system Active CN116032731B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310308153.1A CN116032731B (en) 2023-03-28 2023-03-28 Method and device for realizing hot backup of RapidIO network system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310308153.1A CN116032731B (en) 2023-03-28 2023-03-28 Method and device for realizing hot backup of RapidIO network system

Publications (2)

Publication Number Publication Date
CN116032731A true CN116032731A (en) 2023-04-28
CN116032731B CN116032731B (en) 2023-08-18

Family

ID=86078080

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310308153.1A Active CN116032731B (en) 2023-03-28 2023-03-28 Method and device for realizing hot backup of RapidIO network system

Country Status (1)

Country Link
CN (1) CN116032731B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116232864A (en) * 2023-05-05 2023-06-06 井芯微电子技术(天津)有限公司 Multi-machine hot backup method and system for network system based on event controller

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060109845A1 (en) * 2004-11-23 2006-05-25 Sandy Douglas L Method of transporting a RapidIO packet over an IP packet network
US20070121680A1 (en) * 2005-11-28 2007-05-31 Tundra Semiconductor Corporation Method and system for handling multicast event control symbols
CA2629504A1 (en) * 2007-04-19 2008-10-19 Tundra Semiconductor Corporation Data communications network
CN103746910A (en) * 2013-11-28 2014-04-23 苏州长风航空电子有限公司 RapidIO network recursive enumeration method
CN108183762A (en) * 2017-12-28 2018-06-19 天津芯海创科技有限公司 The method for synchronizing time of RapidIO network systems and RapidIO network systems
CN110704250A (en) * 2019-09-23 2020-01-17 天津津航计算技术研究所 Hot backup device of distributed system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060109845A1 (en) * 2004-11-23 2006-05-25 Sandy Douglas L Method of transporting a RapidIO packet over an IP packet network
US20070121680A1 (en) * 2005-11-28 2007-05-31 Tundra Semiconductor Corporation Method and system for handling multicast event control symbols
CA2629504A1 (en) * 2007-04-19 2008-10-19 Tundra Semiconductor Corporation Data communications network
CN103746910A (en) * 2013-11-28 2014-04-23 苏州长风航空电子有限公司 RapidIO network recursive enumeration method
CN108183762A (en) * 2017-12-28 2018-06-19 天津芯海创科技有限公司 The method for synchronizing time of RapidIO network systems and RapidIO network systems
CN110704250A (en) * 2019-09-23 2020-01-17 天津津航计算技术研究所 Hot backup device of distributed system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
邓豹: "RapidIO 多播传输事务与配置管理研究" *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116232864A (en) * 2023-05-05 2023-06-06 井芯微电子技术(天津)有限公司 Multi-machine hot backup method and system for network system based on event controller

Also Published As

Publication number Publication date
CN116032731B (en) 2023-08-18

Similar Documents

Publication Publication Date Title
EP1982447B1 (en) System and method for detecting and recovering from virtual switch link failures
CN105763359B (en) Distributed bidirectional forwarding detection protocol (D-BFD) for an interleaved fabric switch cluster
US8438307B2 (en) Method and device of load-sharing in IRF stack
CN111865779B (en) Route synchronization method and cross-device link aggregation group
EP2352253B1 (en) Method and apparatus for protecting link aggregation group of ethernet ring
US20100020680A1 (en) Multi-chassis ethernet link aggregation
US10805107B2 (en) Board registration method, board, and forwarding device
US20150365320A1 (en) Method and device for dynamically switching gateway of distributed resilient network interconnect
JP4922972B2 (en) Communications system
JP2007060184A (en) Network repeating installation and its control method
US9054983B2 (en) Centralized control and management planes for different independent switching domains
CN105340230A (en) Virtual chassis topology management
CN116032731B (en) Method and device for realizing hot backup of RapidIO network system
CN109218232B (en) Method, equipment and system for realizing Mux machine
CN103607293A (en) Flow protection method and equipment thereof
CN110912815B (en) Network element ring formation preventing method, device, equipment and readable storage medium
US11258666B2 (en) Method, device, and system for implementing MUX machine
US20150172173A1 (en) Communication system, communication apparatus and path switching method
WO2022217786A1 (en) Cross-network communicaton method, apparatus, and system for multi-bus network, and storage medium
GB2422509A (en) Gateway unit
CN102857423A (en) Service flow transferring method and node in distributed link polymerization system
JP4030880B2 (en) Synchronization clock supply system and synchronization clock supply method
KR102614809B1 (en) Network control methods, devices and systems
JP2008148290A (en) Method and system for verifying connectivity of logical link
CN100574288C (en) The data link layer device that is used for serial communication bus

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant