CN111949283B - BMC Flash mirror image self-recovery system and method - Google Patents

BMC Flash mirror image self-recovery system and method Download PDF

Info

Publication number
CN111949283B
CN111949283B CN202010915866.0A CN202010915866A CN111949283B CN 111949283 B CN111949283 B CN 111949283B CN 202010915866 A CN202010915866 A CN 202010915866A CN 111949283 B CN111949283 B CN 111949283B
Authority
CN
China
Prior art keywords
chip
bmc
flash
control chip
mirror image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010915866.0A
Other languages
Chinese (zh)
Other versions
CN111949283A (en
Inventor
孟庆振
赵现普
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202010915866.0A priority Critical patent/CN111949283B/en
Publication of CN111949283A publication Critical patent/CN111949283A/en
Application granted granted Critical
Publication of CN111949283B publication Critical patent/CN111949283B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/60Software deployment
    • G06F8/61Installation
    • G06F8/63Image based installation; Cloning; Build to order
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/602Providing cryptographic facilities or services
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/64Protecting data integrity, e.g. using checksums, certificates or signatures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/70Software maintenance or management
    • G06F8/71Version control; Configuration management

Abstract

The invention provides a BMC Flash mirror image self-recovery system and a method, wherein one of double BMC Flash chips is mounted on a CPLD, and completely same mirror image files are burnt as backups, when the CPLD detects that the BMC chip is not started within a specified time, the data in the backup Flash chip is synchronously updated to a main Flash chip, so that the Flash mirror image file self-recovery is realized by mounting the Flash chip on the CPLD and through a logic circuit.

Description

BMC Flash mirror image self-recovery system and method
Technical Field
The invention relates to the technical field of server design, in particular to a BMC Flash mirror image self-recovery system and a method.
Background
The BMC chip and related components thereof are a relatively independent out-of-band monitoring system on the server, can independently run without depending on other hardware units, BIOS, operating systems and the like, mainly monitors the temperature, voltage, running state and the like of the server, and can complete the control of the on and off of the whole server and the update of firmware such as the BIOS, a control chip and the like. A special Flash memory is hung below the BMC chip, and data interaction is usually carried out between the BMC chip and the Flash based on an SPI bus protocol. In the power-on starting process of the BMC, the configuration of the internal register of the chip and the initialization of each functional unit are carried out by reading binary data which are pre-burned in Flash, and the BMC chip represents the current working state of the BMC chip by outputting watchdog signals with different frequencies outwards. Data in Flash are generally updated in an off-line burning or BMC on-line upgrading mode, and the correctness and stability of the data in Flash are guaranteed to be important for the normal work of the BMC monitoring system. In the current general server system design, in order to improve the safety and reliability of image files in Flash, algorithms such as MD5 are usually used for data encryption and verification in the encoding process, and SPI routing and the like on the motherboard also have strict requirements.
At present, a plurality of servers adopt a design of dual-BMC Flash, namely two identical flashes are mounted on a group of SPI buses output by BMC, data in the two flashes are mutually backed up, and when data in one Flash is abnormal, the Flash can be timely switched to a standby Flash to ensure that the BMC works normally. However, the dual BMC Flash design cannot completely guarantee the security of the mirror image data in practical engineering applications for the following reasons. Firstly, two flashes have the same SPI routing path and similar mainboard environment on a mainboard, the double flashes have poor anti-interference performance of circuit design similar to each other, when data in one Flash has problems, the other Flash is easy to repeat the condition of the former, and more importantly, when the data in one Flash is abnormally distorted, due to the synchronous design of the double-Flash data, for many servers, the Flash serving as a backup can automatically synchronize wrong mirror image versions, so that the double-redundant Flash redundancy does not play a real role. When the BMC Flash image file is damaged, the image cannot be refreshed remotely, and the image can only be updated one by one or returned to a factory for maintenance through a BMC serial port, so that the self-recovery of the BMC Flash image file is particularly important.
Disclosure of Invention
The invention aims to provide a BMC Flash mirror image self-recovery system and a method, and aims to solve the problem of poor anti-interference performance of a dual-BMC Flash design in the prior art, avoid abnormal tampering of data, enhance the capability of a server system against malicious intrusion and improve the running stability of a BMC system.
In order to achieve the technical purpose, the invention provides a BMC Flash mirror image self-recovery system, which comprises:
the device comprises a BMC, a bus switching chip, a control chip, a first Flash chip and a second Flash chip;
the BMC is respectively connected with the bus switching chip and the control chip;
the bus switching chip is respectively connected with the first Flash chip and the control chip;
the control chip is also connected with a second Flash chip;
the first Flash chip and the second Flash chip store the same BMC mirror image file;
when the mirror image in the first Flash chip loaded by the BMC fails, the control chip controls the bus switching chip to gate a line between the control chip and the first Flash chip, and a path between the first Flash chip and the second Flash chip is conducted to complete self-recovery of the mirror image; and after the mirror image is updated, the control chip controls the bus switching chip to gate the line between the BMC and the first Flash chip.
Preferably, the BMC outputs a watchdog signal, and after loading the Flash mirror image, the watchdog signal is changed into a periodic square wave signal with fixed frequency after finishing initialization and normal work.
Preferably, the bus switching chip gates a line between the BMC and the first Flash chip by default.
Preferably, the data communication between the BMC and the bus switching chip and between the bus switching chip and the first Flash chip is SPI bus communication; and the data communication between the control chip and the second Flash chip is SPI bus communication.
Preferably, the system further comprises an external computer, and the external computer is connected with the BMC through a network.
The invention also provides a BMC Flash mirror image self-recovery method, which comprises the following operations:
monitoring a watchdog signal output by the BMC through a control chip, and starting the control chip to actively update the BMC Flash mirror image when the watchdog signal exceeds a periodic signal with a preset frequency within a set time;
the control chip sets a reset signal of the BMC chip to be a low level, forces the BMC to enter a reset state, controls the bus switching chip to gate a line between the control chip and the first Flash chip, conducts the first Flash chip and the second Flash chip, and updates a mirror image file of the second Flash chip to the first Flash chip;
the control chip controls the bus switching chip to gate a circuit between the BMC and the first Flash chip, the control chip releases a reset signal of the BMC chip, and the BMC chip normally loads a mirror image file of the first Flash chip.
Preferably, the method further comprises upgrading the Flash chip image file, specifically:
the image file in the PC is transmitted to the BMC through a network signal, the BMC transmits the image file to the control chip, and the image file is transmitted into the second Flash chip through the control chip, so that the upgrading operation of the image file of the second Flash chip is completed;
and the BMC completes the upgrading operation of the image file in the first Flash chip through a default-gated channel between the BMC and the first Flash chip.
Preferably, the BMC transmits the image file to the control chip through an I2C bus.
The effect provided in the summary of the invention is only the effect of the embodiment, not all the effects of the invention, and one of the above technical solutions has the following advantages or beneficial effects:
compared with the prior art, one of the double-BMC Flash chips is mounted on the control chip, the completely same image file is burnt to be used as a backup, and when the control chip detects that the BMC chip is not started within a specified time, the data in the backup Flash chip is synchronously updated to the main Flash chip, so that the Flash image file is self-recovered by mounting the Flash chip on the control chip and through the logic circuit.
Drawings
FIG. 1 is a block diagram of a BMC Flash mirror image self-recovery system provided in an embodiment of the present invention;
FIG. 2 is a flow chart of a BMC Flash mirror image self-recovery method provided in the embodiment of the present invention;
in the figure, 101-server mainboard, 102-BMC, 103-first Flash chip, 104-control chip, 105-second Flash chip, 106-PC, 107-bus switching chip.
Detailed Description
In order to clearly explain the technical features of the present invention, the following detailed description of the present invention is provided with reference to the accompanying drawings. The following disclosure provides many different embodiments, or examples, for implementing different features of the invention. To simplify the disclosure of the present invention, the components and arrangements of specific examples are described below. Furthermore, the present invention may repeat reference numerals and/or letters in the various examples. This repetition is for the purpose of simplicity and clarity and does not in itself dictate a relationship between the various embodiments and/or configurations discussed. It should be noted that the components illustrated in the figures are not necessarily drawn to scale. Descriptions of well-known components and processing techniques and procedures are omitted so as to not unnecessarily limit the invention.
The following describes a BMC Flash image self-recovery system and method provided by an embodiment of the present invention in detail with reference to the accompanying drawings.
As shown in fig. 1, an embodiment of the present invention discloses a BMC Flash mirror image self-recovery system, which includes:
the device comprises a BMC, a bus switching chip, a control chip, a first Flash chip and a second Flash chip;
the BMC is respectively connected with the bus switching chip and the control chip;
the bus switching chip is respectively connected with the first Flash chip and the control chip;
the control chip is also connected with a second Flash chip;
the first Flash chip and the second Flash chip store the same BMC mirror image file;
when the mirror image in the first Flash chip loaded by the BMC fails, the control chip controls the bus switching chip to gate a line between the control chip and the first Flash chip, and a path between the first Flash chip and the second Flash chip is conducted to complete self-recovery of the mirror image; and after the mirror image is updated, the control chip controls the bus switching chip to gate the line between the BMC and the first Flash chip.
The control chip is a CPLD.
In the figure, a BMC chip realizes out-of-band monitoring functions of the temperature, the voltage, the running state and the like of the whole server, ASPEED AST2500 chips are selected, a configuration file required in a BMC initialization process is stored in a first Flash chip mounted on the BMC chip, the BMC chip performs configuration of an internal register of the chip and initialization of each functional unit by reading binary data which is pre-burned in Flash in the power-on starting process, data communication between the BMC and the first Flash chip is realized through a bus S1-S2, and the bus type includes but is not limited to an SPI type.
The control chip is used for controlling the power-on/power-off time sequence and state monitoring of the whole server, monitoring a watchdog signal output by the BMC, selecting an Intel MAX10 series, loading Flash data by the BMC within a specified time t, finishing initialization and normal work after the BMC loads the Flash data, changing the watchdog signal into a periodic square wave signal with the frequency of xHz, and if a xHz signal sent by the BMC is detected by the control chip within the time period, starting the default BMC normally, and enabling the data in the Flash to be normal; if the time exceeds t, the control chip circuit logic still does not monitor a periodic signal with the frequency of xHz, the control chip will default that the data in the BMC Flash is tampered, the BMC fails to load the data, and the control chip will be started to actively update the flow of the BMC Flash mirror image.
The second Flash chip is mounted under the control chip, the BMC image file which is the same as the first Flash chip is stored in the second Flash chip, and the second Flash chip and the control chip are in data communication through a bus S7, wherein the bus type includes but is not limited to an SPI type. Under the normal working condition of BMC, the data in the second Flash chip can not be read and written so as to keep the stability of the data.
The first Flash chip and the second Flash chip select MXIC MX25L51245GMI-08G chips.
The bus switching chip selects two bus paths S2-S1 and S6-S1 alternatively, the gating switch is controlled by the control chip and gates S2-S1 lines by default, and the type is NEXPERIA 74CBTLV 3257D. After the process of actively updating the BMC Flash by the control chip is started, the control chip sets a chip reset signal S4 of the BMC to be a low level at first to force the BMC to enter a reset state, the control chip controls a bus switching chip to gate an S6-S1 path through an S5 signal, the control chip controls S7 bus communication at the same time, and data in the second Flash chip is updated to the first Flash chip through the S7-S6-S1 path. After the data updating is finished, the control chip controls the bus switching chip to gate the S2-S1 line through the S5, and then the BMC releases the reset signal of the BMC through the control S4 signal, so that the BMC loads the data in the first Flash chip normally.
In conclusion, the self-recovery process of the BMC Flash image file can be completed once.
And for the upgrading of the Flash chip image file, upgrading is carried out in a BMC remote network mode. The image file in the PC is transmitted to the BMC through a network signal S8, the BMC transmits the image file to the control chip through S9, wherein S9 is not limited to I2C and other bus types, and finally the upgrading operation of the image file in the second Flash chip is completed through an S7 bus between the control chip and the Flash chip; in addition, the BMC completes the upgrading operation of the image file in the first Flash chip through an S2-S1 path which is gated by default.
In the embodiment of the invention, one of the double-BMC Flash chips is mounted on the control chip, the completely same image file is burnt to be used as a backup, and when the control chip detects that the BMC chip is not started within a specified time, the data in the backup Flash chip is synchronously updated to the main Flash chip, so that the Flash image file is self-restored by mounting the Flash chip on the control chip and through the logic circuit.
As shown in fig. 2, an embodiment of the present invention further discloses a BMC Flash mirror image self-recovery method implemented by using the system, which specifically includes the following operations:
monitoring a watchdog signal output by the BMC through a control chip, and starting the control chip to actively update the BMC Flash mirror image when the watchdog signal exceeds a periodic signal with a preset frequency within a set time;
the control chip sets a reset signal of the BMC chip to be a low level, forces the BMC to enter a reset state, controls the bus switching chip to gate a line between the control chip and the first Flash chip, conducts the first Flash chip and the second Flash chip, and updates a mirror image file of the second Flash chip to the first Flash chip;
the control chip controls the bus switching chip to gate a circuit between the BMC and the first Flash chip, releases the reset signal of the BMC chip, and normally loads the mirror image file of the first Flash chip.
The control chip is responsible for monitoring a watchdog signal output by the BMC, the watchdog signal is changed into a periodic square wave signal with the frequency of xHz after the BMC loads Flash data to complete initialization and normal work within a specified time t, if the control chip detects a xHz signal sent by the BMC within the specified time, the default BMC is started to be normal, and the data in the Flash is normal; if the time exceeds t, the control chip circuit logic still does not monitor a periodic signal with the frequency of xHz, the control chip will default that the data in the BMC Flash is tampered, the BMC fails to load the data, and the control chip will be started to actively update the flow of the BMC Flash mirror image.
After the process of actively updating the BMC Flash by the control chip is started, the control chip sets a chip reset signal S4 of the BMC to be a low level at first to force the BMC to enter a reset state, the control chip controls a bus switching chip to gate an S6-S1 path through an S5 signal, the control chip controls S7 bus communication at the same time, and data in the second Flash chip is updated to the first Flash chip through the S7-S6-S1 path. After the data updating is finished, the control chip controls the bus switching chip to gate the S2-S1 line through the S5, and then the BMC releases the reset signal of the BMC through the control S4 signal, so that the BMC loads the data in the first Flash chip normally.
And for the upgrading of the Flash chip image file, upgrading is carried out in a BMC remote network mode. The image file in the PC is transmitted to the BMC through a network signal S8, the BMC transmits the image file to the control chip through S9, wherein S9 is not limited to I2C and other bus types, and finally the upgrading operation of the image file in the second Flash chip is completed through an S7 bus between the control chip and the Flash chip; in addition, the BMC completes the upgrading operation of the image file in the first Flash chip through an S2-S1 path which is gated by default.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.

Claims (5)

1. A BMC Flash mirror image self-recovery system, the system comprising:
the device comprises a BMC, a bus switching chip, a control chip, a first Flash chip and a second Flash chip;
the BMC is respectively connected with the bus switching chip and the control chip;
the bus switching chip is respectively connected with the first Flash chip and the control chip; the bus switching chip gates a circuit between the BMC and the first Flash chip by default;
the control chip is also connected with a second Flash chip;
the first Flash chip and the second Flash chip store the same BMC mirror image file;
when the mirror image in the first Flash chip loaded by the BMC fails, the control chip controls the bus switching chip to gate a line between the control chip and the first Flash chip, and a path between the first Flash chip and the second Flash chip is conducted to complete self-recovery of the mirror image; after the mirror image is updated, the control chip controls the bus switching chip to gate a line between the BMC and the first Flash chip;
the system also comprises an external computer, wherein the external computer is connected with the BMC through a network;
the upgrading of the Flash chip image file specifically comprises the following steps:
the image file in the external computer is transmitted to the BMC through a network signal, the BMC transmits the image file to the control chip, and the image file is transmitted into the second Flash chip through the control chip, so that the upgrading operation of the image file of the second Flash chip is completed;
and the BMC completes the upgrading operation of the image file in the first Flash chip through a default-gated channel between the BMC and the first Flash chip.
2. The BMC Flash image self-recovery system according to claim 1, wherein the BMC outputs a watchdog signal and changes to a periodic square wave signal with a fixed frequency after the Flash image is loaded, the initialization is completed and the normal operation is completed.
3. The BMC Flash mirror image self-recovery system according to claim 1, wherein the data communication between the BMC and the bus switch chip, the bus switch chip and the first Flash chip is SPI bus communication; and the data communication between the control chip and the second Flash chip is SPI bus communication.
4. A BMC Flash image self-recovery method implemented by the system of any of claims 1 to 3, characterized in that the method comprises the following operations:
monitoring a watchdog signal output by the BMC through a control chip, and starting the control chip to actively update the BMC Flash mirror image when the watchdog signal exceeds a periodic signal with a preset frequency within a set time;
the control chip sets a reset signal of the BMC chip to be a low level, forces the BMC to enter a reset state, controls the bus switching chip to gate a line between the control chip and the first Flash chip, conducts the first Flash chip and the second Flash chip, and updates a mirror image file of the second Flash chip to the first Flash chip;
the control chip controls the bus switching chip to gate a circuit between the BMC and the first Flash chip, releases the reset signal of the BMC chip, and normally loads the mirror image file of the first Flash chip.
5. The BMC Flash image self-recovery method of claim 4, wherein the BMC transmits the image file to the control chip via an I2C bus.
CN202010915866.0A 2020-09-03 2020-09-03 BMC Flash mirror image self-recovery system and method Active CN111949283B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010915866.0A CN111949283B (en) 2020-09-03 2020-09-03 BMC Flash mirror image self-recovery system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010915866.0A CN111949283B (en) 2020-09-03 2020-09-03 BMC Flash mirror image self-recovery system and method

Publications (2)

Publication Number Publication Date
CN111949283A CN111949283A (en) 2020-11-17
CN111949283B true CN111949283B (en) 2022-08-02

Family

ID=73366575

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010915866.0A Active CN111949283B (en) 2020-09-03 2020-09-03 BMC Flash mirror image self-recovery system and method

Country Status (1)

Country Link
CN (1) CN111949283B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112883384B (en) * 2021-03-04 2023-05-23 中国航空工业集团公司西安航空计算技术研究所 Protection method of embedded computer boot loader with strong robustness
CN114816571B (en) * 2022-04-15 2023-06-16 西安广和通无线通信有限公司 Method, device, equipment and storage medium for plug-in flash memory

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107844330A (en) * 2017-10-25 2018-03-27 郑州云海信息技术有限公司 A kind of method and system of enhancing ARM startup of server code reliabilities
CN108334367A (en) * 2017-08-31 2018-07-27 郑州云海信息技术有限公司 A kind of firmware remote update system and method towards BMC chip

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9842003B2 (en) * 2014-10-07 2017-12-12 Dell Products, L.P. Master baseboard management controller election and replacement sub-system enabling decentralized resource management control

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108334367A (en) * 2017-08-31 2018-07-27 郑州云海信息技术有限公司 A kind of firmware remote update system and method towards BMC chip
CN107844330A (en) * 2017-10-25 2018-03-27 郑州云海信息技术有限公司 A kind of method and system of enhancing ARM startup of server code reliabilities

Also Published As

Publication number Publication date
CN111949283A (en) 2020-11-17

Similar Documents

Publication Publication Date Title
US9798632B2 (en) Providing boot data in a cluster network environment
US9600370B2 (en) Server system
CN1770707B (en) Apparatus and method for quorum-based power-down of unresponsive servers in a computer cluster
JP2608904B2 (en) Multiple redundant false detection system and method of using same
US9015458B2 (en) Computer system and method for updating basic input/output system by switching between local mode and bypass mode through baseboard management controller
CN106201332B (en) Driver array policy control
CN111949283B (en) BMC Flash mirror image self-recovery system and method
WO2018095107A1 (en) Bios program abnormal processing method and apparatus
CN104102559A (en) Redundant heartbeat link and opposite-end restarting link based double-controller storage system
EP2082322A1 (en) Security features in interconnect centric architectures
US10606784B1 (en) Software filtering of redundant sideband device management bus communications
WO2012001780A1 (en) System control device, information processing system, and data migration and restoration method for information processing system
CN111984471B (en) Cabinet power BMC redundancy management system and method
JP3774826B2 (en) Information processing device
US10719310B1 (en) Systems and methods for reducing keyboard, video, and mouse (KVM) downtime during firmware update or failover events in a chassis with redundant enclosure controllers (ECs)
CN111158963A (en) Server firmware redundancy starting method and server
CN111008165A (en) Four-way server BIOS FLASH control device and method
US20030204658A1 (en) Hot swap circuit module
JP3266956B2 (en) System storage
CN215300646U (en) Bypass control device
CN117331863B (en) Power supply information communication method, system, electronic equipment and storage medium
CN112486868B (en) CPLD (Complex programmable logic device) -based storage double-control synchronization system, method and device and storage medium
TWI774464B (en) Expanded availability computing system
CN116722868B (en) Clock holding system, method and storage device
JPS62187901A (en) Method for controlling duplex controller

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant