CN117002562A - Vehicle-mounted platform based on embedded software redundancy - Google Patents

Vehicle-mounted platform based on embedded software redundancy Download PDF

Info

Publication number
CN117002562A
CN117002562A CN202310976451.8A CN202310976451A CN117002562A CN 117002562 A CN117002562 A CN 117002562A CN 202310976451 A CN202310976451 A CN 202310976451A CN 117002562 A CN117002562 A CN 117002562A
Authority
CN
China
Prior art keywords
cpu
main
redundancy
standby
control
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310976451.8A
Other languages
Chinese (zh)
Inventor
卢意
谢鲲
朱卡恩
付祥
邹劲柏
陈大山
许哲谱
陈迪来
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Institute of Technology
Original Assignee
Shanghai Institute of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Institute of Technology filed Critical Shanghai Institute of Technology
Priority to CN202310976451.8A priority Critical patent/CN117002562A/en
Publication of CN117002562A publication Critical patent/CN117002562A/en
Pending legal-status Critical Current

Links

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B61RAILWAYS
    • B61LGUIDING RAILWAY TRAFFIC; ENSURING THE SAFETY OF RAILWAY TRAFFIC
    • B61L15/00Indicators provided on the vehicle or train for signalling purposes
    • B61L15/0063Multiple on-board control systems, e.g. "2 out of 3"-systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Mechanical Engineering (AREA)
  • Hardware Redundancy (AREA)

Abstract

The application provides a vehicle-mounted platform based on embedded software redundancy, which relates to the technical field of vehicle-mounted equipment and comprises a railway special annunciator body and a control board with double CPUs, wherein a set of peripheral circuits are shared, two CPUs are mutually checked by using an application layer software redundancy switching method, one CPU is one master and one slave, the information synchronization is ensured, meanwhile, fault detection is realized, and a disturbance-free switching function can be realized when a fault occurs. The application adds the embedded software redundancy into the double CPU control and applies the embedded software redundancy to the vehicle-mounted platform, thereby enhancing the fault tolerance and anti-interference capability, improving the fault recovery capability, optimizing the delay and throughput and greatly enhancing the reliability and the stability of the vehicle-mounted platform.

Description

Vehicle-mounted platform based on embedded software redundancy
Technical Field
The application relates to the technical field of vehicle-mounted equipment, in particular to a vehicle-mounted platform based on embedded software redundancy.
Background
The vehicle-mounted platform is important equipment for guaranteeing driving safety, realizes communication connection among a train dispatcher, a station attendant and a train driver at any time, enables the train to run under the control of the dispatcher, has very important effects on improving transportation efficiency and guaranteeing driving safety, and can be processed timely if special conditions are met.
However, most of existing vehicle-mounted platform equipment adopts a single-core processor scheme, the system is difficult to complete a specified function when a local fault occurs, the fault processing time is prolonged, the performance is extremely easy to limit under the condition that the vehicle-mounted platform needs to process a plurality of tasks, the performance is poor under the condition that quick response is needed, the overall working efficiency and the response speed are affected, and the reliability and the stability of the vehicle-mounted platform equipment are greatly reduced.
Disclosure of Invention
The application aims to provide a vehicle-mounted platform based on embedded software redundancy, which can enhance the fault tolerance and anti-interference capability of the vehicle-mounted platform during working, improve the fault recovery capability of the vehicle-mounted platform and greatly improve the working efficiency.
In order to achieve the above purpose, the present application provides the following technical solutions: a vehicle-mounted platform based on embedded software redundancy comprises a railway special annunciator body and a control board with double CPUs, wherein a set of peripheral circuits are shared, two CPUs are mutually checked by using an application layer software redundancy switching method, and a master CPU and a slave CPU are mutually checked, so that a disturbance-free switching function can be realized when a fault occurs.
In the first aspect, for the multimedia embedded software redundancy hardware platform part, a modularized design is adopted, an embedded intelligent chip is taken as a core, redundancy is realized between the double main chips, peripheral circuits are shared, a processor module and an input/output module on a control panel are designed by adopting a double CPU redundancy architecture, and all the modules are connected through a motherboard to complete interconnection and intercommunication of internal systems and power supply.
In the second aspect, for the design part of the application layer software redundancy scheme, the two machines are simultaneously on-line and mutually monitored, master control rights are judged and transferred among programs, a dual CPU identity recognition mechanism of 'first start-up as a master' is adopted, the master and standby are switched between the two CPUs, the hot standby CPU is ready at any time, and once the master control CPU fails, the master control rights are immediately acquired to be the master control CPU, so that the undisturbed switching is realized. The method mainly adopts a CPU arbitration and transfer method based on master control rights and a synchronous control method of double CPU modules, and researches a software redundancy subsystem by using a structured programming method.
Compared with the prior art, the application has the technical effects and advantages that:
1. in the application, the software redundancy is added to the vehicle-mounted platform, so that the reliability and stability of the system can be improved. The software redundancy is realized by backing up two identical CPU modules, when the main control CPU fails or is in error, the system can be automatically switched to the hot standby CPU, so that the system is prevented from crashing or stopping working, and the normal operation of the vehicle-mounted station is ensured. The redundancy of the data is increased by using a redundancy coding technology, so that the anti-interference capability of the data is improved, and the probability of data transmission errors is reduced; the fault tolerance can be improved, and even if one CPU module fails or is wrong, the vehicle-mounted platform can still continue to work, which is very important for some application scenes with high requirements on reliability (such as automatic driving, safety monitoring and the like).
2. According to the application, the vehicle-mounted platform with software redundancy can realize rapid recovery and seamless switching of faults, when the main control CPU breaks down, the system can be rapidly switched to the hot standby CPU, and a user can continue to use the vehicle-mounted platform almost without perception; the parallel transmission mode can be used for transmitting data through one communication path, so that the data transmission efficiency is greatly improved; the maintainability of the system can be improved, the vehicle-mounted station can realize hot backup and hot switching of the modules through software redundancy, the downtime of system maintenance and upgrading is reduced, and the maintainability of the system is improved. The vehicle-mounted platform has higher flexibility and expandability, and can be automatically switched to the standby module when the fault occurs through software redundancy, so that system breakdown or stop work is avoided, and user experience and satisfaction are improved.
Drawings
FIG. 1 is a block diagram of a "hot-standby" redundancy scheme;
FIG. 2 is a hardware configuration diagram of a dual CPU redundancy control software scheme;
FIG. 3 is a schematic diagram of a master-slave CPU communication;
FIG. 4 is a schematic diagram of a master/slave CPU synchronization process;
FIG. 5 is a flow chart of a dual CPU control arbitration and transfer process;
FIG. 6 is a flow chart of a dual CPU synchronization control procedure;
Detailed Description
The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application. It will be apparent that the described embodiments are only some, but not all, embodiments of the application.
As shown in fig. 1, the present application provides a technical solution: a vehicle-mounted platform based on embedded software redundancy comprises a railway special annunciator body, 1 control board containing 2 CPUs and sharing one set of peripheral circuits.
The hardware configuration is shown in fig. 2, and adopts a modular design. Under the principle of not increasing a large amount of cost, peripheral circuits are shared, only one set of parallel core processing units is added, and two CPUs are mutually checked by using an application layer software redundancy switching method, namely one main CPU and one standby CPU.
The control board is divided into a motherboard, a processor module, a communication module (CNBR), an input/output module (I/O), a power supply module (PSDA), a redundancy management module and the like, and simultaneously comprises a fault detection sub-module, a data processing sub-module and a redundancy switching sub-module. The fault detection sub-module is responsible for completing fault detection of the CPU module and comprises self-detection of the CPU module and mutual detection of the CPU module; the data processing sub-module completes the CPU output control logic according to the system input acquisition sub-module acquisition information and the fault detection result of the fault monitoring sub-module; the redundancy switching sub-module completes the control of the local load and the remote load according to the output control logic given by the system processing sub-module.
The two CPU control units on the processor module are operating synchronously during system operation. The two control units are identical and mirror images, one is a main control CPU unit, the other is a hot standby CPU unit, and the first started is the main control CPU unit. Redundant interaction channels are arranged between the main CPU control units and the standby CPU control units for data voting, system synchronization, mutual monitoring and the like.
The dual CPU control unit of the processor module has self-diagnosis capability, can detect the running state of the dual CPU control unit during running, and can timely generate a reset signal when faults occur. Meanwhile, the running states of the other parties can be mutually diagnosed between the two CPU control units, when one CPU control unit fails, the system channel can be timely found, the failure alarm can be carried out when the system channel is switched at the first time, and the safe and stable running of the system is ensured.
And the processor module and the I/O module adopt a CAN bus for data communication transmission. And an interface between the CAN protocol controller and the physical bus is added to provide differential transmission capability to the bus and differential reception capability to the CAN controller.
And a watchdog timer is added, so that the reliability and the safety of the system are improved, and the deadlock of the system is prevented. And an independent CAN controller is adopted, so that higher processing capacity and faster data transmission speed are provided, the burden of the main controller is reduced, and the complexity and development cost of the system are reduced. And the independent CAN controller CAN also provide more hardware resources and interfaces, thereby facilitating the expansion of the system and the increase of functions.
The dual-CPU identity recognition mechanism of 'primary start-up' is adopted, so that the main control CPU and the hot standby CPU have complete equality, namely the main control CPU is started up first and the hot standby CPU is started up later. The dual CPU identity recognition mechanism of the' first start-up is realized by adopting a port monitoring method. If the connection is successful, indicating that the other party is started, and marking the local machine as a hot standby CPU; the connection failure marks the local machine as a main control CPU, and the socket is bound to the communication port of the main and standby machines for monitoring. Thus, whenever the hot standby CPU is started, the main control CPU receives the connection request of the hot standby CPU and establishes connection with the hot standby CPU.
The heartbeat signals are adopted between the main CPU and the standby CPU to monitor the working state of each other and serve as an auxiliary means for diagnosing equipment faults. Once the heartbeat signal between the main control CPU and the hot standby CPU fails, the failure of one equipment is indicated, and meanwhile, the cause of the failed equipment can be primarily determined.
As shown in fig. 3, which is a flow of dual CPU communication, in order to further improve the usability of the dual CPU active/standby switching system, the dual channel adds a time stamp when the heartbeat of the other party is received. Under the condition that the heartbeat is invalid, selecting a network which receives the heartbeat finally according to the heartbeat time stamp, checking a remote detection point corresponding to the network, and if the remote detection point can be detected, indicating that the other party is down and offline, wherein the CPU is normal and can be used as a main control CPU to operate; otherwise, the CPU is marked to be down, and the CPU should work in the hot standby CPU at the moment.
As shown in fig. 4, master-slave CPU synchronization is achieved. The synchronization of the master CPU and the slave CPU not only comprises clock synchronization, but also comprises master CPU state and data synchronization. Under the condition that the main CPU and the standby CPU work normally, the main CPU and the standby CPU receive information from the network at the same time, but only the main CPU has authority to send information to the external equipment, and the standby CPU only receives the external information and is in a silent state for information sending. In order for the task initialization of the primary and secondary CPUs to begin synchronously, the primary and secondary CPUs must be powered on simultaneously. And when the main control CPU task starting instruction arrives at the first time, informing the hot standby CPU to start the task, so that the hot standby CPU enters a working mode at the same time.
The main and standby CPU fault detection and switching are carried out by adopting the following 3 measures: (1) a watchdog timer is provided in the host/slave. After the system is powered on, the processor can zero the watchdog counter at fixed time intervals during normal operation. If the main CPU or the standby CPU fails, the watchdog counter value exceeds a certain threshold value, and finally the watchdog generates a reset signal to reset the processor, so that whether the main CPU and the standby CPU fail can be judged. (2) After the main and standby CPUs are powered on, self-checking operation is carried out on the system, including memory detection, I/O equipment detection and the like, and whether the system hardware works normally can be detected when the power is on. (3) In the running process of the system, the main CPU and the standby CPU monitor the working state of each other through the state of the cardiac jumper and the state of the double networks. Because the main CPU and the standby CPU are definitely the main CPU or the standby CPU through identity recognition when the main CPU is powered on, if the main CPU detects the fault of the standby CPU in the running process of the system, the main CPU does not switch, and the main CPU keeps working continuously; if the standby CPU detects the failure of the main CPU, the switching is performed, and the standby CPU takes over the main CPU to continue working.
The working principle of the application
The system adopts a design scheme of double CPU redundancy fault tolerance, the processor module and the input/output module are designed by adopting a double CPU redundancy architecture, and all the modules are connected through a motherboard to complete interconnection and intercommunication of internal systems and power supply. The embedded control board operating system software platform adopts an Android system, and the annunciator operating system software platform adopts an Android SDK.
And placing two CPU modules with the same model on the same control board, and performing redundancy control by utilizing backboard communication. The redundant mode of the software is that only one CPU module is needed to be added, and the state monitoring and master control right transferring of the two CPU modules are realized through programming.
In the interactive communication system, an upper computer transmits data signals to a host computer, two CPUs synchronously receive related data, but only one CPU is responsible for mainly outputting signals to a lower computer, and the other CPU is in a monitoring preparation state and ready to take over work at any time. When the CPU responsible for main output fails, the other CPU takes over work to become main output, and the CPU originally responsible for main output automatically becomes standby output after the fault is repaired and stands by at any time. The method can reduce the switching frequency of the host and improve the real-time performance of the communication system.
As shown in FIG. 5, two CPUs run on line simultaneously, one is in the master control mode and the other is in the hot standby mode, and the arbitration and transfer of master control rights are completed. The CPU with the master control has I/O control, and the hot standby CPU output is forbidden, only data is collected and communication connection is maintained. The two CPU modules mutually monitor the running state and communication condition of each other, and if the main control CPU module is found to be faulty, the hot standby CPU module obtains the main control right.
As shown in fig. 6, the hot standby CPU is ready at any time, and once the master CPU fails, the master control right is immediately acquired to become the master control CPU, so that the synchronous control of the dual-CPU module is completed. The main control CPU must transmit its own state information to the hot standby CPU in real time, and the hot standby CPU must track the change of the main control CPU, keep synchronous with the main control CPU, and realize undisturbed switching when the two CPU modules transfer the main control rights.
The dual-CPU main-standby switching mode realizes real-time software redundancy, can be switched seamlessly when the main CPU fails, and ensures the continuity and reliability of the system. Meanwhile, the configuration of the double CPUs can improve the computing capacity and reliability of the system, and further enhance the effect of software redundancy.
In a word, the software redundancy is applied to the communication of the vehicle-mounted platform, so that the reliability, stability and anti-interference capability of the communication can be improved, the safe transmission of data and the normal operation of a system are ensured, and better service experience is provided for users.
The non-disclosed parts of the application are all prior art, and the specific structure, materials and working principle thereof are not described in detail. Although embodiments of the present application have been shown and described, it will be understood by those skilled in the art that various changes, modifications, substitutions and alterations can be made therein without departing from the principles and spirit of the application, the scope of which is defined in the appended claims and their equivalents.

Claims (6)

1. The utility model provides a vehicle-mounted platform based on embedded software redundancy, includes special signaler body of railway and a control panel that contains two CPUs, and peripheral circuit shares one set, its characterized in that:
(1) The mode of switching between the main CPU and the standby CPU is adopted, the control board comprises two identical CPUs, the identical tasks are executed, the consistency of data is maintained through real-time data synchronization and a decision algorithm, and when one CPU fails or is wrong, the other CPU can take over the functions in time, so that seamless switching is realized;
(2) The hardware part of the software redundancy control board adopts a modularized design, and the peripheral circuits are shared under the principle of not increasing a large amount of cost, and only one set of parallel core processing units are added, so that the embedded intelligent chip is used as a core, and the redundancy between the two main chips is realized.
2. The modular design of claim 1, wherein:
the processor module and the input/output module on the control panel are designed by adopting a double CPU redundancy architecture, and all the modules are connected through a motherboard to complete interconnection and power supply of internal systems.
3. The processor module dual CPU redundancy architecture design of claim 2, wherein:
the dual CPU control unit of the processor module has self-diagnosis capability, can detect the running state of the dual CPU control unit during running, and can timely generate a reset signal when faults occur.
4. A redundancy design as claimed in claim 3, characterized in that:
the dual computers are simultaneously on-line and mutually monitored, master control rights are judged and transferred among programs, a hot standby CPU is ready at any time by utilizing a mode of switching between the main CPU and the standby CPU, and once the main CPU fails, the master control rights are immediately acquired to be the master control CPU, so that the undisturbed switching is realized;
the method comprises the following steps: two CPUs run on line simultaneously, one is in a main control mode, the other is in a hot standby mode, the CPU with the main control right has I/O control right, the hot standby CPU is forbidden to output, and only data are collected and communication connection is kept. When the CPU responsible for main output fails, the other CPU takes over work to become main output, and the CPU originally responsible for main output automatically becomes standby output after the fault is repaired and stands by at any time.
5. The dual CPU active-standby switching mode of claim 4 wherein:
after the primary and secondary CPU is determined, the socket is bound to the communication port of the primary and secondary computers for monitoring so as to monitor the working state of the other party.
6. A method of monitoring the operational status of a counterpart as in claim 5, wherein:
the heartbeat signals are adopted between the main CPU and the standby CPU to monitor the working state of each other and serve as an auxiliary means for diagnosing equipment faults. Once the heartbeat signal between the main control CPU and the hot standby CPU fails, the failure of one equipment is indicated, and meanwhile, the cause of the failed equipment can be primarily determined.
CN202310976451.8A 2023-08-04 2023-08-04 Vehicle-mounted platform based on embedded software redundancy Pending CN117002562A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310976451.8A CN117002562A (en) 2023-08-04 2023-08-04 Vehicle-mounted platform based on embedded software redundancy

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310976451.8A CN117002562A (en) 2023-08-04 2023-08-04 Vehicle-mounted platform based on embedded software redundancy

Publications (1)

Publication Number Publication Date
CN117002562A true CN117002562A (en) 2023-11-07

Family

ID=88575794

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310976451.8A Pending CN117002562A (en) 2023-08-04 2023-08-04 Vehicle-mounted platform based on embedded software redundancy

Country Status (1)

Country Link
CN (1) CN117002562A (en)

Similar Documents

Publication Publication Date Title
CN110361979B (en) Safety computer platform in railway signal field
CN110376876B (en) Double-system synchronous safety computer platform
CN110351174B (en) Module redundancy safety computer platform
CN102103532B (en) Safety redundancy computer system of train control vehicle-mounted equipment
CN100555234C (en) Dual redundant fault-tolerant system and redundancy switching method thereof
CN201909961U (en) Redundancy control system
CN101807076B (en) Duplication redundancy fault-tolerant high-reliability control system having synergistic warm standby function based on PROFIBUS field bus
CN107634855A (en) A kind of double hot standby method of embedded system
CN103647781A (en) Mixed redundancy programmable control system based on equipment redundancy and network redundancy
CN106648997A (en) Master-salve switching method based on non-real-time operating system
CN110427283B (en) Dual-redundancy fuel management computer system
CN103853622A (en) Control method of dual redundancies capable of being backed up mutually
CN111874049B (en) Brake control system for safety computer of train control
CN103425553A (en) Duplicated hot-standby system and method for detecting faults of duplicated hot-standby system
WO2024011906A1 (en) Master-slave redundancy control system and method
CN111186463A (en) Board-level redundant full-electronic computer interlocking system
JPH0934809A (en) Highly reliable computer system
CN110758489A (en) Automatic protection system of train
CN113791937B (en) Data synchronous redundancy system and control method thereof
CN202142052U (en) Safe redundant computer system of train-control vehicle-mounted equipment
CN101741532A (en) Two-computer switching device for redundant server switching
CN113665631A (en) Remote restarting method and device for interlocking host equipment
CN117002562A (en) Vehicle-mounted platform based on embedded software redundancy
CN115408240A (en) Redundant system active/standby method, device, equipment and storage medium
CN115328706A (en) Comprehensive control method and system for dual-CPU redundant architecture

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication