CN116257342A - Configurable fault-tolerant star software task scheduling system based on multi-core processor - Google Patents
Configurable fault-tolerant star software task scheduling system based on multi-core processor Download PDFInfo
- Publication number
- CN116257342A CN116257342A CN202310170290.3A CN202310170290A CN116257342A CN 116257342 A CN116257342 A CN 116257342A CN 202310170290 A CN202310170290 A CN 202310170290A CN 116257342 A CN116257342 A CN 116257342A
- Authority
- CN
- China
- Prior art keywords
- module
- mode
- processor
- core
- fault
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000005192 partition Methods 0.000 claims description 5
- 238000005259 measurement Methods 0.000 claims description 3
- 230000007246 mechanism Effects 0.000 claims description 2
- 238000001514 detection method Methods 0.000 abstract description 2
- 230000003068 static effect Effects 0.000 abstract 1
- 238000012545 processing Methods 0.000 description 4
- 238000013461 design Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 2
- 238000000034 method Methods 0.000 description 2
- 230000005855 radiation Effects 0.000 description 2
- 230000000191 radiation effect Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/48—Program initiating; Program switching, e.g. by interrupt
- G06F9/4806—Task transfer initiation or dispatching
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Hardware Redundancy (AREA)
- Safety Devices In Control Systems (AREA)
Abstract
A configurable fault-tolerant star software task scheduling system based on a multi-core processor is configured into a low-power consumption mode, a triple-modular redundancy mode or a high-performance mode through the static configuration of at least three processors, and the dynamic configuration is realized by performing mode switching according to the satellite running state and the task requirements; the invention can fully utilize the advantages of the multi-core processor in various scenes; the implementation cost is low, no additional hardware is needed to realize arbitration, and the fault detection and the debugging are convenient; the function is flexible, and the working mode can be dynamically configured according to the satellite running state and the remote control instruction.
Description
Technical Field
The invention relates to the field of software and hardware of a star service computer, in particular to a configurable fault-tolerant star service software task scheduling system based on a multi-core processor.
Background
With the continuous improvement of the satellite system on the requirements of control frequency, autonomous operation capability, fault diagnosis capability and the like, the single-core processor is limited by moore's law and influenced by power wall factors, and cannot further improve the computing capability under the condition of limited power, so that the single-core processor gradually becomes the bottleneck of satellite control and information system design. Multi-core processors have greater advantages in terms of computing power and power consumption. Therefore, the use of multi-core processors in satellite control and data processing unit designs is a necessary option to increase the processing power of satellite information systems.
When the satellite is in a space radiation environment, the electronic equipment of the satellite is impacted by various high-energy particles, and a certain radiation effect is generated. The radiation effect is mainly: total dose of radiation, single event latch, single event flip. The failure rate caused by single event upset is approximately 40%, and the single event upset is one of main reasons for satellite on-orbit failure. The fault caused by single event upset belongs to repairability fault, and can be corrected by hardware redundancy, data coding and other error-tolerant technologies. How to design a fault tolerance scheme according to the requirements of a multi-core processor and star software is an important problem worthy of exploration.
The comparison document CN112667450a discloses a fault-tolerant system of a dynamically configurable multi-core processor, which performs a hardware state saving operation on a register of a main core CPU0, saves the checkpoint state saving operation of the CPU1 and the CPU2 to an on-chip SRAM in a software manner, and normally enables the dynamic operation to be transparent to the software, so that the software only needs to execute the checkpoint state saving when receiving a mode switching signal, and reads when recovering. However, all processors are required to be started in all three modes, the power consumption loss is large, and aiming at the granularity of an instruction level, the fault tolerance processing is carried out on each 1bit data stream output by using a three-mode redundancy mode, so that a special voting circuit is required.
Disclosure of Invention
In order to overcome the defects of the prior art, particularly the prior fault-tolerant technology has higher implementation cost and needs additional hardware; the conventional fault-tolerant system is difficult to dynamically configure aiming at the problems of satellite running states and task demands, and the like, and provides a configurable fault-tolerant star software task scheduling system based on a multi-core processor.
The technical scheme of the invention is as follows:
a configurable fault-tolerant star software task scheduling system based on a multi-core processor is characterized in that the system is statically configured into a low-power consumption mode, a three-mode redundancy mode or a high-performance mode through configuration of at least three processors, and the system is dynamically configured by performing mode switching according to satellite running states and task demands;
the low-power mode starts 1 processor core and runs star software with basic functions;
the three-mode redundancy mode enables at least 4 processor cores, wherein 1 processor core controls and supervises the operation of other modules, operates the data acquisition module and operates the arbitration module, and the other 3 processor cores respectively operate each functional module of the service software at the same time, and the arbitration unit arbitrates the calculation result and then transmits the result to each execution mechanism;
and the high-performance mode enables a plurality of processor cores, and each processor core independently runs the star software module distributed on the processor cores by adopting partition scheduling.
The mode switching module is used for switching the task scheduling mode of the configurable fault-tolerant star software according to the instruction when a remote control instruction is received, and automatically/manually switching to a low-power consumption mode when the whole star is in shortage of energy.
The multi-core processor is a 4-core SMP processor and the SMP processor.
The arbitration module adopts a voting system to vote according to the calculation results of the three fault-tolerant processors, and the result with a large number of votes is used as output.
The star software with the basic functions comprises a supervision module, a data acquisition module, a gesture control module, an energy module, a thermal control module, a remote control module, a telemetry module and a ground measurement module.
Compared with the prior art, the invention has the beneficial effects that:
(1) The invention can fully utilize the advantages of the multi-core processor in various scenes; the implementation cost is low, no additional hardware is needed to realize arbitration, and the fault detection and the debugging are convenient; the function is flexible, and the working mode can be dynamically configured according to the satellite running state and the remote control instruction;
(2) The implementation of the application hierarchy is applicable to a variety of processor architectures and operating systems.
(3) The method is characterized in that for module-level (thread-level) granularity, switching is performed based on thread migration and thread starting and stopping, partition scheduling of modules is comprehensively considered based on the module-level granularity, and only the final results of calculation modules except a supervision module and a data acquisition module are subjected to fault tolerance processing by using triple-modular redundancy, and a special voting circuit is not needed.
Drawings
FIG. 1 is a schematic diagram of a low power mode
FIG. 2 is a schematic diagram of a triple modular redundancy scheme
FIG. 3 high performance mode schematic
FIG. 4 is a schematic diagram of mode switching
Detailed Description
The invention is further illustrated in the following figures and examples, which should not be taken to limit the scope of the invention.
1. Three modes
In order to meet different performance and reliability requirements under different satellite running states and task demands, and fully utilize the multi-core processor in various scenes, the task scheduling of the configurable fault-tolerant star software is realized at an application level, and three modes are provided:
1.1 Low Power consumption mode
When the whole star is energy-intense, the power consumption of the multi-core processor needs to be reduced, and the star software only runs the most basic modules. As shown in fig. 1, in this mode, the star software enters a basic function operation mode, each module executes sequentially according to priority, runs on the processor main core, and shuts down the remaining processor cores.
1.2 triple modular redundancy mode
When the satellite runs in a severe orbit in a space environment, the probability of occurrence of a single event effect is increased, and at the moment, the satellite needs to be switched to a triple-modular redundancy mode, so that the reliability of the system is increased. As shown in fig. 2, the triple modular redundancy mode enables at least 4 processor cores.
A supervision module is operated on the Core0 and is responsible for controlling and supervising the operation of other modules; the data acquisition module is operated to respectively input the acquired sensor data into each fault-tolerant Core; and operating an arbitration module, synchronizing and arbitrating the calculation result after each fault-tolerant Core finishes the calculation of one module, and transmitting the arbitrated data to a corresponding executor.
The arbitration module employs a voting system, and the final result can be determined from the vote, since the transient fault belongs to a small probability event. Voting is carried out according to the calculation results of the three fault-tolerant cores, and the result with a large number of votes is output.
Fault tolerant techniques of data encoding may be employed on Core0 to monitor and correct its own operating conditions.
The Core1, the Core2 and the Core3 are fault-tolerant cores, the same functional modules are respectively operated, the functional modules on each Core are controlled by a supervision module operated on the Core0, the modules are sequentially operated according to priority, and after the module operation is completed, the synchronization of the Core0 is waited.
1.3 high Performance modes
When the satellite performs complex tasks requiring increased computing power, it may switch to a high performance mode. As shown in FIG. 3, high performance mode enablement enables some or all of the processor cores on demand for computing power.
And partitioning scheduling is adopted, the modules are distributed to each core according to the calculated amount and constraint relation among the modules, and each core executes the distributed modules according to the FIFO strategy.
2. Mode switching
The switching between modes can be automatically triggered by the change of the satellite running state, and can also be controlled by a remote control instruction. The switching between modes is shown in fig. 4.
2.1 entering Low Power consumption mode
The star software is reset, processor cores of Core1, core2, core3 and the like are closed, a basic function operation supervision module is established on Core0, and the basic function operation supervision module initializes a data acquisition module, a gesture control module, an energy module, a thermal control module, a remote control module, a telemetry module and a ground measurement module under a basic function operation mode. And the respective modules are sequentially executed in the above order at each cycle.
2.2 entering triple modular redundancy mode
When the low power consumption mode is changed into the triple-modular redundancy mode, the star software is reset, and Core1, core2 and Core3 are started. And a supervision module is created on Core0, the supervision module creates a data acquisition module and an arbitration module on Core0, and copies of all the other functional modules are respectively created on Core1, core2 and Core3.
When the high-performance mode is switched into the triple-mode redundancy mode, the supervision module and the data acquisition module are migrated to Core0, and the modules and functions only running in the high-performance mode are closed. All remaining functional modules are migrated to Core1. Functional module copies are created on Core2, core3, respectively.
2.3 entering high Performance mode
When the low power consumption mode is changed into the high performance mode, the star software is reset, and processor cores such as Core1, core2, core3 and the like are started. And creating a supervision module on the Core0 of the main Core, and creating corresponding belonging modules on each Core by the supervision module according to the partition table.
When the three-mode redundancy mode is used for entering the high-performance mode, the supervision module transfers the created modules to the corresponding cores according to the partition table, creates the modules which only operate in the high-performance mode on the corresponding cores, and opens the functions which only operate in the high-performance mode.
Claims (4)
1. A configurable fault-tolerant star software task scheduling system based on a multi-core processor is characterized in that the system is statically configured into a low-power consumption mode, a three-mode redundancy mode or a high-performance mode through configuration of at least three processors, and the system is dynamically configured by performing mode switching according to satellite running states and task demands;
the low-power mode starts 1 processor core and runs star software with basic functions;
the three-mode redundancy mode enables at least 4 processor cores, wherein 1 processor core controls and supervises the operation of other modules, operates a data acquisition module and operates an arbitration module, and the other 3 processor cores respectively operate each functional module of the service software at the same time, and the arbitration unit arbitrates the calculation result and then transmits the result to each execution mechanism;
and the high-performance mode enables a plurality of processor cores, and each processor core independently runs the star software module distributed on the processor cores by adopting partition scheduling.
2. The configurable fault-tolerant star software task scheduling system based on a multi-core processor of claim 1, wherein the multi-core processor is a 4-core and more SMP processor.
3. The configurable fault-tolerant star software task scheduling system based on a multi-core processor according to claim 1, wherein the arbitration module adopts a voting system to vote according to the calculation results of three fault-tolerant processors, and the result with a large number of votes is output.
4. The task scheduling system of configurable fault-tolerant star software based on a multi-core processor according to claim 1, wherein the star software with basic functions comprises a supervision module, a data acquisition module, a gesture control module, an energy module, a thermal control module, a remote control module, a telemetry module and a ground measurement module.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310170290.3A CN116257342A (en) | 2023-02-27 | 2023-02-27 | Configurable fault-tolerant star software task scheduling system based on multi-core processor |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310170290.3A CN116257342A (en) | 2023-02-27 | 2023-02-27 | Configurable fault-tolerant star software task scheduling system based on multi-core processor |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116257342A true CN116257342A (en) | 2023-06-13 |
Family
ID=86687662
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310170290.3A Pending CN116257342A (en) | 2023-02-27 | 2023-02-27 | Configurable fault-tolerant star software task scheduling system based on multi-core processor |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116257342A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118035006A (en) * | 2024-04-12 | 2024-05-14 | 西北工业大学 | Control system capable of being dynamically configured for independent and lockstep operation of three-core processor |
CN118348881A (en) * | 2024-05-20 | 2024-07-16 | 西安格儒电子科技有限公司 | Fly-control computer system with tailorable redundancy |
-
2023
- 2023-02-27 CN CN202310170290.3A patent/CN116257342A/en active Pending
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118035006A (en) * | 2024-04-12 | 2024-05-14 | 西北工业大学 | Control system capable of being dynamically configured for independent and lockstep operation of three-core processor |
CN118348881A (en) * | 2024-05-20 | 2024-07-16 | 西安格儒电子科技有限公司 | Fly-control computer system with tailorable redundancy |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN116257342A (en) | Configurable fault-tolerant star software task scheduling system based on multi-core processor | |
US8032889B2 (en) | Methods and apparatus for managing and controlling power consumption and heat generation in computer systems | |
Lala et al. | Architectural principles for safety-critical real-time applications | |
US8103910B2 (en) | Local rollback for fault-tolerance in parallel computing systems | |
US7797575B2 (en) | Triple voting cell processors for single event upset protection | |
CN102270189B (en) | Inter-core communication method based on FPGA (Field Programmable Gate Array) multi-core system | |
KR20010005956A (en) | Fault tolerant computer system | |
US20110208997A1 (en) | Radiation hard and fault tolerant multicore processor and method for ionizing radiation environment | |
CN101946235A (en) | Method and apparatus for moving threads in a shared processor partitioning environment | |
CN110399034B (en) | Power consumption optimization method of SoC system and terminal | |
CN115168114A (en) | Satellite-borne integrated electronic system based on COTS device | |
US20160335103A1 (en) | Locking/Unlocking CPUs to Operate in Safety Mode or Performance Mode Without Rebooting | |
CN104850530A (en) | CubeSat on-board computer | |
Su et al. | Triple module redundancy reliability framework design based on heterogeneous multi-core processor | |
CN113806290A (en) | High-integrity system-on-chip for comprehensive modular avionics system | |
CN115114224A (en) | Flight control computer hardware system of SOC + FPGA | |
Fuchs et al. | Bringing fault-tolerant gigahertz-computing to space | |
CN115391251A (en) | Unmanned aerial vehicle intelligence flies accuse chip architecture | |
Zhu et al. | Reliability effects of process and thread redundancy on chip multiprocessors | |
Kempf et al. | Leveraging Adaptive Redundancy in Multi-Core Processors for Realizing Adaptive Fault Tolerance in Mixed-Criticality Systems | |
CN113778737B (en) | Method and system for running on-board computer based on redundancy and degradation | |
Tian et al. | Design and implementation of a low-cost fault-tolerant on-board computer for micro-satellite | |
Schwabl et al. | A survey of MARS | |
CN108845971A (en) | Multiprocessor plate reconfiguration system and method | |
Younis et al. | Integrating redundancy management and real-time services for ultra reliable control systems |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |