CN111045863B - Sensor data distribution network fault tolerance architecture and method - Google Patents

Sensor data distribution network fault tolerance architecture and method Download PDF

Info

Publication number
CN111045863B
CN111045863B CN201911134419.5A CN201911134419A CN111045863B CN 111045863 B CN111045863 B CN 111045863B CN 201911134419 A CN201911134419 A CN 201911134419A CN 111045863 B CN111045863 B CN 111045863B
Authority
CN
China
Prior art keywords
fault
switching network
network
backup
sensor data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911134419.5A
Other languages
Chinese (zh)
Other versions
CN111045863A (en
Inventor
姜琳琳
王卫东
韩强
王纯委
张彬
海钰琳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xian Aeronautics Computing Technique Research Institute of AVIC
Original Assignee
Xian Aeronautics Computing Technique Research Institute of AVIC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xian Aeronautics Computing Technique Research Institute of AVIC filed Critical Xian Aeronautics Computing Technique Research Institute of AVIC
Priority to CN201911134419.5A priority Critical patent/CN111045863B/en
Publication of CN111045863A publication Critical patent/CN111045863A/en
Application granted granted Critical
Publication of CN111045863B publication Critical patent/CN111045863B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1479Generic software techniques for error detection or fault masking
    • G06F11/1489Generic software techniques for error detection or fault masking through recovery blocks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Hardware Redundancy (AREA)
  • Maintenance And Management Of Digital Transmission (AREA)

Abstract

Aiming at the requirements of the comprehensive avionics system on the reliability of the comprehensive signal processing subsystem, the invention provides a fault tolerance architecture and a fault tolerance method of a sensor data distribution network. The sensor data distribution network adopts a dual redundancy design, a hot backup working mode is adopted between redundancy, fault detection is carried out through a heartbeat mechanism, and the system controller manages the sensor data distribution network through a dual redundancy control channel. The fault tolerance method of the sensor data distribution network can effectively improve the reliability of the comprehensive signal processing subsystem.

Description

Sensor data distribution network fault tolerance architecture and method
Technical Field
The invention belongs to the technical field of comprehensive core processing system design, and particularly relates to a fault tolerance architecture and a fault tolerance method for a sensor data distribution network.
Background
In the prior signal processing subsystem, various front-end sensors on an airplane are respectively connected with corresponding signal processing units in a point-to-point mode, so that the signal processing subsystem does not have comprehensive processing capability, the system cannot be supported to realize the functions of multi-sensor collaborative detection, multi-sensor data fusion and the like, and when the signal processing unit corresponding to one sensor fails, the system loses the signal processing function of the sensor, and the reliability is lower.
With the application of the comprehensive modularized avionics system technology, the comprehensive signal processing subsystem needs to realize the comprehensive processing of multi-sensor signals, the physical positions of signal processing resources corresponding to each type of sensor are not fixed, when the system task operation mode changes or the comprehensive signal processing unit is subjected to resource reconstruction, the data allocation strategy between each type of sensor and the comprehensive signal processing unit needs to be configured dynamically and in real time, so that higher requirements are provided for a data allocation implementation method, and the prior art cannot meet the comprehensive processing requirements and fault tolerance requirements of the system.
Disclosure of Invention
The invention provides a fault tolerance architecture and a fault tolerance method of a sensor data distribution network, and aims to effectively improve the comprehensive processing capacity and the fault tolerance capacity of a comprehensive signal processing subsystem.
The technical scheme of the invention is as follows:
the invention provides a fault-tolerant architecture of a sensor data distribution network, which comprises two switching networks with identical software and hardware, wherein the two switching networks are respectively used as a working switching network and a backup switching network, the two switching networks adopt a dual redundancy design, a hot backup working mode is adopted between redundancy and the redundancy is crosslinked through a synchronous channel, and a system controller manages the two redundancy through a dual redundancy control channel. The hot backup working mode is that after power-on, two exchange networks run the same application software, the working exchange network outputs the running result, and the backup exchange network does not output the result.
The invention provides a fault tolerance method of a sensor data distribution network, which adopts a fault tolerance architecture of the sensor data distribution network and comprises a fault tolerance detection method and a fault tolerance processing method.
The fault tolerance detection method is that a heartbeat mechanism is adopted between the dual redundancy switching networks to perform fault detection, namely the working switching network periodically sends handshake signals to the backup switching network, and the backup switching network feeds back handshake success signals to the working switching network after receiving the handshake signals.
The fault tolerance processing method comprises three conditions:
firstly, if the backup switching network cannot receive a handshake signal sent by the working switching network and the sensor data received by the rear-end comprehensive signal processing unit is abnormal, the complex fault of the current working switching network is indicated, and a hot backup fault processing strategy is executed at the moment, namely a processor in the backup switching network immediately outputs a current network configuration table of the system to a corresponding digital crossbar switch, normal work is started, and current fault information is reported to a system controller through a control channel;
second, if the backup switching network does not receive the handshake signal sent by the working switching network, but the sensor data received by the back-end integrated signal processing unit is normal, which indicates that the processor circuit in the current working switching network has a fault, the digital crossbar circuit and the photoelectric converter circuit work normally, and two fault processing methods exist at this time:
(1) The processor in the backup exchange network immediately outputs the current network configuration table of the system to the corresponding digital cross switch, at the moment, a hot backup fault processing strategy is executed, the backup exchange network starts to work normally, and the current fault information is reported to the system controller through the control channel;
(2) The processor in the backup exchange network takes over the digital cross switch in the current work exchange network through the processor control bus, at this time, executes the cross management fault processing strategy, keeps the current network configuration table of the system unchanged, continues to work normally, and reports the current fault information to the system controller through the control channel;
thirdly, if the backup switching network normally receives the handshake signal sent by the working switching network, but the sensor data received by the back-end comprehensive signal processing unit is abnormal, which indicates that the digital cross switch circuit or the photoelectric converter circuit in the current working switching network has faults, the processor circuit works normally, and at the moment, a hot backup fault processing strategy or a cross management fault processing strategy can be executed.
The invention has the advantages that:
1. based on the digital crossbar design, the switching speed is high;
2. the hardware circuit is easy to realize, simple and reliable;
3. the fault tolerance speed is high, and the flexibility is high;
4. and the reliability of the comprehensive signal processing subsystem is improved.
Drawings
Fig. 1 is a schematic diagram of an integrated signal processing subsystem.
FIG. 2 is a diagram of a sensor data distribution network design architecture.
Fig. 3 is a schematic diagram of a hot-standby fault handling strategy (complex fault of the switching network).
FIG. 4 is a schematic diagram of a hot-standby fault handling strategy (processor circuit failure).
Fig. 5 is a schematic diagram of a cross-management fault handling strategy (processor circuit fault).
Detailed Description
The invention provides a sensor data distribution network fault tolerance architecture, which comprises the following steps:
1. the front end sensor on the aircraft comprises: communication navigation recognition sensors, radar sensors, photoelectric image sensors and the like, wherein sensor signals are input into a sensor data distribution network of the integrated signal processing subsystem through a fiber channel, as shown in fig. 1;
2. the sensor data distribution network can realize the unobstructed exchange of 40 paths of front-end sensor signals by 40 paths at maximum, and the sensor signals are output to different signal processing modules of the comprehensive signal processing unit according to a network configuration table specified by a system;
3. the sensor data distribution network adopts a dual redundancy design, as shown in fig. 2, the software and hardware of the two exchange networks are completely identical, a hot backup working mode is adopted between redundancy, namely, the two exchange networks operate the same application software, the working exchange network outputs an operation result, and the backup exchange network does not output the result;
4. the dual redundancy exchange networks of the sensor data distribution network adopt a heartbeat mechanism to perform fault detection and system synchronization, namely the working exchange network periodically sends handshake signals to the backup exchange network, and the backup exchange network feeds back handshake success signals to the working exchange network after receiving the handshake signals;
5. the avionics system controller manages the sensor data distribution network through a dual redundancy control channel, and comprises the steps of sending a network control command to the sensor data distribution network, loading a network configuration table, actively detecting faults and the like; the sensor data distribution network sends test results, reports fault information, etc. to the system controller.
The specific implementation method of the architecture is as follows:
step 1, after the system is powered on, the dual redundancy switching networks in the sensor data distribution network are respectively powered on self-detection, after the test is finished, test results are respectively reported to a system controller through a control channel, when the two switching networks are tested normally, the default switching network 1 is a working switching network, and the switching network 2 is a backup switching network;
step 2, if the system controller cannot receive the test result of a certain switching network or the received test result is wrong after the power-on self-detection, the switching network fails, and the system controller performs fault shielding on the switching network to stop working;
step 3, after the power-on self-detection is completed, the processors in the two switching networks respectively load sensor data distribution network management software and a default network configuration table from the local FLASH, and the processor in the working switching network sends the default network configuration information to the corresponding digital crossbar switch through a processor control bus to execute a sensor data distribution strategy defaulted by the system; the processors in the backup exchange network run the same software, but do not output the result;
step 4, when the system task operation mode changes or the comprehensive signal processing unit is reconfigured, the system controller loads a new network configuration table through a control channel, and after a processor in the work switching network receives the new network configuration table, the new network configuration table is sent to a corresponding digital cross switch to execute a new sensor data allocation strategy of the system;
step 5, in the normal operation process of the system, the working exchange network periodically sends a handshake signal to the backup exchange network, the handshake period is 50ms, and after the backup exchange network receives the handshake signal, the backup exchange network feeds back a handshake success signal to the working exchange network;
step 6, if the backup switching network cannot receive the handshake signal sent by the working switching network and the sensor data received by the rear-end comprehensive signal processing unit is abnormal, the complex fault of the current working switching network is indicated, a hot backup fault processing strategy is executed at the moment, as shown in fig. 3, a processor in the backup switching network immediately outputs a current network configuration table of the system to a corresponding digital crossbar switch, normal work is started, and current fault information is reported to a system controller through a control channel;
step 7, if the backup switching network cannot receive the handshake signal sent by the working switching network, but the sensor data received by the back-end integrated signal processing unit is normal, which indicates that the processor circuit in the current working switching network has faults, the digital crossbar circuit and the photoelectric converter circuit work normally, and two fault processing methods exist at the moment:
(1) The processor in the backup exchange network immediately outputs the current network configuration table of the system to the corresponding digital cross switch, and executes a hot backup fault processing strategy at the moment, as shown in fig. 4, the backup exchange network starts to work normally, and reports the current fault information to the system controller through the control channel;
(2) The processor in the backup exchange network takes over the digital cross switch in the current work exchange network through the processor control bus, and executes the cross management fault processing strategy at the moment, as shown in fig. 5, the current network configuration table of the system remains unchanged, and continues to work normally, and the processor in the backup exchange network reports the current fault information to the system controller through the control channel;
after executing the hot backup fault processing strategy in step 8, step 7, and step (1), the possible faults and fault processing methods in the subsequent system operation process are as follows:
(1) If the system controller cannot acquire the state information of the current working switching network through the control channel, the sensor data received by the rear-end integrated signal processing unit is normal, which indicates that the processor circuit in the current working switching network fails, the system controller can continue to work at the moment, and the system sensor data allocation strategy cannot be changed;
(2) If the system controller can acquire the state information of the current working switching network through the control channel, but the sensor data received by the rear-end integrated signal processing unit is abnormal, the digital cross switch circuit or the photoelectric converter circuit in the current working switching network is indicated to have faults, and then the cross management fault processing strategy can be executed again;
(3) If the system controller cannot acquire the state information of the current working switching network through the control channel and the sensor data received by the rear-end integrated signal processing unit is abnormal, the complex fault of the current working switching network is indicated, and at the moment, the integrated signal processing subsystem loses the function of the sensor data distribution network;
and 9, after the cross management fault processing strategy is executed in the step 7 (2), the possible faults and fault processing methods in the subsequent system operation process are as follows:
(1) If the system controller cannot acquire the state information of the current working switching network through the control channel, the sensor data received by the rear-end integrated signal processing unit is normal, which indicates that the processor circuit in the current working switching network fails, the system controller can continue to work at the moment, and the system sensor data allocation strategy cannot be changed;
(2) If the system controller can acquire the state information of the current working switching network through the control channel, but the sensor data received by the rear-end integrated signal processing unit is abnormal, the digital cross switch circuit or the photoelectric converter circuit in the current working switching network is indicated to have faults, at the moment, the processor in the current working switching network can disconnect the cross processor control bus, immediately output the current network configuration table of the system to the corresponding digital cross switch, and continue to work normally;
(3) If the system controller cannot acquire the state information of the current working switching network through the control channel and the sensor data received by the rear-end integrated signal processing unit is abnormal, the complex fault of the current working switching network is indicated, and at the moment, the integrated signal processing subsystem loses the function of the sensor data distribution network;
step 10, if the backup switching network normally receives the handshake signal sent by the working switching network, but the sensor data received by the back-end integrated signal processing unit is abnormal, which indicates that the digital crossbar switch circuit or the photoelectric converter circuit in the current working switching network has faults, the processor circuit works normally, and at the moment, a hot backup fault processing strategy or a cross management fault processing strategy can be executed, and the specific fault processing method is as described in steps 7-9.

Claims (1)

1. The utility model provides a sensor data distribution network fault tolerance method, has adopted a framework, the framework includes two switching networks that the software and hardware is the same entirely, respectively as work switching network, backup switching network, and two above-mentioned switching networks adopt dual redundancy design, adopt hot backup mode of operation and cross-link through the synchronous channel between the redundancy, and the system controller manages these two redundancy through dual redundancy control channel, its characterized in that: the method comprises a fault tolerance detection method and a fault tolerance processing method;
the fault tolerance detection method comprises the steps that a heartbeat mechanism is adopted between the dual redundancy switching networks to perform fault detection, namely, a working switching network periodically sends handshake signals to a backup switching network, and the backup switching network feeds back handshake success signals to the working switching network after receiving the handshake signals;
the fault tolerance processing method comprises three conditions:
firstly, if the backup switching network cannot receive a handshake signal sent by the working switching network and the sensor data received by the rear-end comprehensive signal processing unit is abnormal, the complex fault of the current working switching network is indicated, and a hot backup fault processing strategy is executed at the moment, namely a processor in the backup switching network immediately outputs a current network configuration table of the system to a corresponding digital crossbar switch, normal work is started, and current fault information is reported to a system controller through a control channel;
second, if the backup switching network does not receive the handshake signal sent by the working switching network, but the sensor data received by the back-end integrated signal processing unit is normal, which indicates that the processor circuit in the current working switching network has a fault, the digital crossbar circuit and the photoelectric converter circuit work normally, and two fault processing methods exist at this time:
(1) The processor in the backup exchange network immediately outputs the current network configuration table of the system to the corresponding digital cross switch, at the moment, a hot backup fault processing strategy is executed, the backup exchange network starts to work normally, and the current fault information is reported to the system controller through the control channel;
(2) The processor in the backup exchange network takes over the digital cross switch in the current work exchange network through the processor control bus, at this time, executes the cross management fault processing strategy, keeps the current network configuration table of the system unchanged, continues to work normally, and reports the current fault information to the system controller through the control channel;
thirdly, if the backup switching network normally receives the handshake signal sent by the working switching network, but the sensor data received by the back-end comprehensive signal processing unit is abnormal, which indicates that the digital cross switch circuit or the photoelectric converter circuit in the current working switching network has faults, the processor circuit works normally, and at the moment, a hot backup fault processing strategy or a cross management fault processing strategy can be executed.
CN201911134419.5A 2019-11-19 2019-11-19 Sensor data distribution network fault tolerance architecture and method Active CN111045863B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911134419.5A CN111045863B (en) 2019-11-19 2019-11-19 Sensor data distribution network fault tolerance architecture and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911134419.5A CN111045863B (en) 2019-11-19 2019-11-19 Sensor data distribution network fault tolerance architecture and method

Publications (2)

Publication Number Publication Date
CN111045863A CN111045863A (en) 2020-04-21
CN111045863B true CN111045863B (en) 2023-09-15

Family

ID=70232965

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911134419.5A Active CN111045863B (en) 2019-11-19 2019-11-19 Sensor data distribution network fault tolerance architecture and method

Country Status (1)

Country Link
CN (1) CN111045863B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112468318B (en) * 2020-11-05 2023-03-14 中国航空工业集团公司西安航空计算技术研究所 Dual-redundancy time-triggered network reconstruction method
CN112468328A (en) * 2020-11-13 2021-03-09 中国航空工业集团公司洛阳电光设备研究所 Dual-redundancy FC-AE-1553 network reconstruction method based on switched topology

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101281483A (en) * 2008-05-12 2008-10-08 北京邮电大学 Double-machine redundant tolerant system and redundant switching method thereof
CN105301955A (en) * 2015-10-19 2016-02-03 中国航空无线电电子研究所 System-level reconstruction management application software master-slave switching method
CN106909525A (en) * 2017-01-13 2017-06-30 电子科技大学 A kind of control Switching Module based on VPX buses
CN108768754A (en) * 2018-06-28 2018-11-06 西安微电子技术研究所 A kind of highly reliable tolerant system based on bus network

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004102853A2 (en) * 2003-05-06 2004-11-25 Overture Networks, Inc. Protected switching ring
WO2014038835A1 (en) * 2012-09-05 2014-03-13 삼성에스디에스 주식회사 Network backup device and network system including the device
CN108661823B (en) * 2018-04-28 2020-06-09 西安航天动力研究所 Liquid rocket engine thrust and mixing ratio adjusting redundancy electromechanical servo system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101281483A (en) * 2008-05-12 2008-10-08 北京邮电大学 Double-machine redundant tolerant system and redundant switching method thereof
CN105301955A (en) * 2015-10-19 2016-02-03 中国航空无线电电子研究所 System-level reconstruction management application software master-slave switching method
CN106909525A (en) * 2017-01-13 2017-06-30 电子科技大学 A kind of control Switching Module based on VPX buses
CN108768754A (en) * 2018-06-28 2018-11-06 西安微电子技术研究所 A kind of highly reliable tolerant system based on bus network

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
赵琳.一种双余度FC网络管理方法.《电脑知识与技术》.2019,正文第72-73页. *

Also Published As

Publication number Publication date
CN111045863A (en) 2020-04-21

Similar Documents

Publication Publication Date Title
CN107187465B (en) ATO system architecture of unit-level hot standby redundancy
CN101807076B (en) Duplication redundancy fault-tolerant high-reliability control system having synergistic warm standby function based on PROFIBUS field bus
CN103647781B (en) Mixed redundancy programmable control system based on equipment redundancy and network redundancy
CN111352338B (en) Dual-redundancy flight control computer and redundancy management method
CN111045863B (en) Sensor data distribution network fault tolerance architecture and method
CN106648997A (en) Master-salve switching method based on non-real-time operating system
CN110361979A (en) A kind of safety computer platform in railway signal field
CN102724083A (en) Degradable triple-modular redundancy computer system based on software synchronization
CN110427283B (en) Dual-redundancy fuel management computer system
CN104111881A (en) Arbitration device for double computer redundancy hot backup computer
CN103853622A (en) Control method of dual redundancies capable of being backed up mutually
CN110488597B (en) Dual-redundancy control method for main processing unit of locomotive
CN103425553A (en) Duplicated hot-standby system and method for detecting faults of duplicated hot-standby system
CN112506830B (en) Redundancy synchronous communication method for multi-path transmission data bus
CN103793300A (en) Fast active-standby switching device in hot-standby system and active-standby switching method
CN100479295C (en) Synchronized switching controller and its control for parallel uninterrupted power supply
CN111930573B (en) Task-level dual-machine hot standby system based on management platform and method thereof
CN112201378A (en) Hot standby switching method, system, terminal and medium based on nuclear power plant DCS platform
CN110247809B (en) Communication control method of double-ring network control system
CN203733107U (en) Quick active/standby shifting device in active-standby system
CN116089176A (en) Hot standby dual-redundancy computer control system for AUV
CN109991954B (en) Fault-tolerant control method, system and equipment of motor controller and storage medium
CN113850033B (en) Redundancy system, redundancy management method and readable storage medium
CN112506633B (en) Multi-machine redundancy system and processing method
CN115396752A (en) Redis-based biplane data acquisition method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant