CN107133186B - Method for simultaneous communication of master and slave TMCs with a BMC (baseboard management controller) - Google Patents

Info

Publication number
CN107133186B
Authority
CN
China
Prior art keywords
management controller
slave
tmc
master
access
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710340285.7A
Other languages
Chinese (zh)
Other versions
CN107133186A (en)
Inventor
王学龙
张兆义
乔英良
黄家明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou Yunhai Information Technology Co Ltd
Original Assignee
Zhengzhou Yunhai Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou Yunhai Information Technology Co Ltd
Priority to CN201710340285.7A
Publication of CN107133186A
Application granted
Publication of CN107133186B
Legal status: Active

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F13/00 Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
    • G06F13/38 Information transfer, e.g. on bus
    • G06F13/42 Bus transfer protocol, e.g. handshake; Synchronisation
    • G06F13/4282 Bus transfer protocol, e.g. handshake; Synchronisation on a serial bus, e.g. I2C bus, SPI bus
    • G06F2213/00 Indexing scheme relating to interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
    • G06F2213/0016 Inter-integrated circuit (I2C)

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Small-Scale Networks (AREA)
  • Computer And Data Communications (AREA)

Abstract

The invention discloses a method for simultaneous communication of master and slave TMCs with a BMC, comprising the following steps: S1, when its process for accessing the baseboard management controller starts, the first box management controller first sends the second box management controller a control command prohibiting access to the baseboard management controller, and the second box management controller, on receiving the command, skips execution of its own process for accessing the baseboard management controller; S2, a delay of T1; S3, the first box management controller performs the operation of accessing the baseboard management controller and then delays for T2; and S4, the first box management controller sends the second box management controller a control command allowing access to the baseboard management controller, and the process on the second box management controller resumes access to the baseboard management controller after determining that the allow command has been received.

Description

Method for simultaneous communication of master and slave TMCs with a BMC (baseboard management controller)
Technical Field
The invention relates to the technical field of server management, and in particular to a method for communication between master and slave TMCs and a BMC that solves the blocking caused when the master and slave server management modules access the I2C bus at the same time.
Background
As users' demand for computing grows, they require ever higher computing performance from a single computer. High-end servers have great advantages over traditional servers in computing performance and reliability, and are becoming more and more popular in fields where real-time behavior, reliability, and availability are critical. A high-end server uses a three-level management structure of BMC, TMC, and RMC: the BMCs that manage the individual nodes are monitored uniformly by the TMC (track Management controller) at the track level, and the TMC generally communicates with the BMCs over the I2C bus. To ensure system reliability, the system adopts a backup design with a master TMC and a slave TMC, in which the slave TMC can perform all functions of the master TMC. When the master TMC is absent or down, the slave TMC takes over from the master TMC and performs all functions of monitoring and managing the BMCs.
Under normal conditions the slave TMC obtains node information from the master TMC through serial port communication. However, operations such as setting the BMC IP, powering on and off, and restarting the BMC must be performed by the master TMC and the slave TMC separately. As a result, while the slave (master) TMC is performing one of these operations, the master (slave) TMC may be occupying the I2C bus to communicate with the BMC, so the slave (master) TMC cannot use the I2C bus normally: the system hangs in the ioctrl() function without returning, which in turn causes the ipmimain process to time out and restart repeatedly, and the system crashes.
Disclosure of Invention
The invention aims to provide a method for simultaneous communication of master and slave TMCs with a BMC (baseboard management controller). The method prevents the blocking and crashes of the server management module that occur when the master server management module (master TMC) and the slave server management module (slave TMC) access the BMC of the same node through the I2C bus at the same time, and thereby greatly improves the reliability of the server management system.
To achieve this purpose, the invention adopts the following technical solution:
A method for simultaneous communication of master and slave TMCs with a BMC comprises the following steps:
S1, when its process for accessing the baseboard management controller starts, the first box management controller first sends the second box management controller a control command prohibiting access to the baseboard management controller; on receiving the command, the second box management controller skips execution of its own process for accessing the baseboard management controller;
S2, the delay time is set to T1, and the method proceeds to step S3 after the delay of T1;
S3, the first box management controller performs the operation of accessing the baseboard management controller, then sets the delay time to T2, and proceeds to step S4 after the delay of T2;
S4, the first box management controller sends the second box management controller a control command allowing access to the baseboard management controller, and the process on the second box management controller resumes access to the baseboard management controller after determining that the allow command has been received.
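The four steps can be sketched from the point of view of the first (initiating) controller. This is a minimal illustration in Python, not the patented implementation: the function name `guarded_bmc_access`, the callables `send_to_peer` and `access_bmc`, and the command names are assumptions introduced here. The delay function is passed in so the ordering can be checked without real waiting, and the `try`/`finally` is an addition of this sketch (the source does not discuss failures) that still sends the allow command if the BMC operation raises.

```python
import time
from typing import Callable, List

PROHIBIT = "PROHIBIT_BMC_ACCESS"  # hypothetical name for the prohibit command
ALLOW = "ALLOW_BMC_ACCESS"        # hypothetical name for the allow command


def guarded_bmc_access(
    send_to_peer: Callable[[str], None],
    access_bmc: Callable[[], None],
    t1: float,
    t2: float,
    sleep: Callable[[float], None] = time.sleep,
) -> None:
    """Run steps S1 to S4 on the first (initiating) controller."""
    send_to_peer(PROHIBIT)   # S1: tell the peer to skip its BMC access
    sleep(t1)                # S2: delay T1 before touching the bus
    try:
        access_bmc()         # S3: perform the BMC operation ...
        sleep(t2)            # ... then delay T2
    finally:
        send_to_peer(ALLOW)  # S4: let the peer resume BMC access


# Usage: record the order of events with stand-in callables.
events: List[str] = []
guarded_bmc_access(
    send_to_peer=lambda cmd: events.append(f"send:{cmd}"),
    access_bmc=lambda: events.append("access"),
    t1=1.0,
    t2=1.0,
    sleep=lambda seconds: events.append(f"sleep:{seconds}"),
)
print(events)
```

The recorded list shows the prohibit command first, the two delays bracketing the access, and the allow command last.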
Further, the first box management controller is the master box management controller, and the second box management controller is the slave box management controller.
Further, the master box management controller sends the control commands to the slave box management controller through a serial port session.
Further, the control command prohibiting access to the baseboard management controller is an OEM command conforming to the IPMI standard, and the command processing function of the slave box management controller sets a flag variable to 1.
Further, the control command allowing access to the baseboard management controller is an OEM command conforming to the IPMI standard, and the command processing function of the slave box management controller sets the flag variable to 0.
Alternatively, the first box management controller is the slave box management controller, and the second box management controller is the master box management controller.
Further, the slave box management controller sends the control commands to the master box management controller through a serial port session.
Further, the control command prohibiting access to the baseboard management controller is an OEM command conforming to the IPMI standard, and the command processing function of the master box management controller sets a flag variable to 1.
Further, the control command allowing access to the baseboard management controller is an OEM command conforming to the IPMI standard, and the command processing function of the master box management controller sets the flag variable to 0.
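The receiving side described above can be sketched as a flag variable that is set by a command processing function and checked by the BMC-access process. The sketch below is illustrative only: the class name `PeerTmc`, the method names, and the command codes `0x01` and `0x02` are hypothetical, since the source says only that the commands are IPMI OEM commands and does not give their codes.

```python
CMD_PROHIBIT = 0x01  # hypothetical OEM command code (not given in the source)
CMD_ALLOW = 0x02     # hypothetical OEM command code (not given in the source)


class PeerTmc:
    """Receiving controller: holds the flag and the BMC-access process."""

    def __init__(self) -> None:
        self.bmc_access_blocked = 0  # flag variable: 1 = skip, 0 = access
        self.accesses = 0            # counts stand-in BMC transactions

    def handle_oem_command(self, cmd: int) -> None:
        """Command processing function: update the flag variable."""
        if cmd == CMD_PROHIBIT:
            self.bmc_access_blocked = 1
        elif cmd == CMD_ALLOW:
            self.bmc_access_blocked = 0
        else:
            raise ValueError(f"unknown OEM command: {cmd:#04x}")

    def bmc_access_cycle(self) -> bool:
        """One pass of the BMC-access process; True if the access ran."""
        if self.bmc_access_blocked == 1:
            return False     # skip: the other controller is using the bus
        self.accesses += 1   # stand-in for the real I2C transaction
        return True


peer = PeerTmc()
results = [peer.bmc_access_cycle()]      # flag is 0, access runs
peer.handle_oem_command(CMD_PROHIBIT)
results.append(peer.bmc_access_cycle())  # flag is 1, access is skipped
peer.handle_oem_command(CMD_ALLOW)
results.append(peer.bmc_access_cycle())  # flag is 0 again, access resumes
print(results, peer.accesses)
```

Keeping the check inside the access process, rather than stopping the process, matches the wording of the source: the process keeps running and simply skips its bus operation while the flag is 1.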
The beneficial effects of the invention are as follows:
The invention provides a method for solving the problem that one or even both TMC systems crash because of the blocking that arises when the master TMC and the slave TMC communicate with the BMC over the I2C bus. While the master (slave) TMC occupies the I2C bus to communicate with the BMC, it sends a command to the slave (master) TMC so that the latter suspends its use of the I2C bus. This avoids the blocking caused by the master and slave TMCs accessing one I2C bus at the same time, and effectively solves the TMC system crashes and hangs caused by operations performed through the TMC such as setting the BMC IP, powering on and off, and restarting the TMC.
Detailed Description
Example one
A method for simultaneous communication of master and slave TMCs with a BMC comprises the following steps:
S1, when its process for accessing the baseboard management controller (BMC) starts, the master box management controller (master TMC) first sends the slave box management controller (slave TMC), through the serial port session, a control command prohibiting access to the BMC. This command is an OEM command conforming to the IPMI standard, and the command processing function of the slave TMC sets the flag variable to 1. After the process on the slave TMC determines that the flag variable is 1, it skips the operation of accessing the BMC through the I2C bus;
S2, a delay of 1 to 5 seconds;
S3, the master TMC performs the operation of accessing the BMC through the I2C bus and then delays for 1 to 5 seconds;
S4, the master TMC sends the slave TMC, through the serial port session, a control command allowing access to the BMC. This command is an OEM command conforming to the IPMI standard, and the command processing function of the slave TMC sets the flag variable to 0. After the process on the slave TMC determines that the flag variable is 0, it resumes access to the BMC through the I2C bus.
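The effect of this embodiment on the shared bus can be illustrated with a small tick-based simulation. It is a simplified model under stated assumptions, not a description of the real firmware: one tick stands for one second, the slave TMC attempts a BMC access on every tick, the serial port commands arrive instantly, and the function names `simulate` and `collisions` are hypothetical.

```python
from collections import Counter
from typing import List, Tuple


def simulate(start: int, t1: int, access: int, t2: int,
             total: int) -> List[Tuple[int, str]]:
    """Return (tick, owner) pairs for every I2C transaction in the run."""
    log: List[Tuple[int, str]] = []
    slave_flag = 0                      # flag variable held by the slave TMC
    access_begin = start + t1           # S2 delay ends, S3 access begins
    access_end = access_begin + access  # S3 access ends, T2 delay begins
    allow_tick = access_end + t2        # S4: allow command is sent
    for tick in range(total):
        if tick == start:
            slave_flag = 1              # S1: prohibit command received
        if tick == allow_tick:
            slave_flag = 0              # S4: allow command received
        if access_begin <= tick < access_end:
            log.append((tick, "master"))  # S3: master talks to the BMC
        if slave_flag == 0:
            log.append((tick, "slave"))   # slave's periodic BMC access
    return log


def collisions(log: List[Tuple[int, str]]) -> List[int]:
    """Ticks at which more than one controller used the bus."""
    counts = Counter(tick for tick, _ in log)
    return sorted(tick for tick, n in counts.items() if n > 1)


log = simulate(start=2, t1=1, access=2, t2=1, total=8)
print(log)
print(collisions(log))
```

In this run the slave uses the bus at ticks 0 and 1, stays off it from tick 2 through tick 5, and resumes at tick 6, while the master's access occupies ticks 3 and 4, so no tick has two owners.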
Example two
A method for simultaneous communication of master and slave TMCs with a BMC comprises the following steps:
S1, when its process for accessing the baseboard management controller (BMC) starts, the slave box management controller (slave TMC) first sends the master box management controller (master TMC), through the serial port session, a control command prohibiting access to the BMC. This command is an OEM command conforming to the IPMI standard, and the command processing function of the master TMC sets the flag variable to 1. After the process on the master TMC determines that the flag variable is 1, it skips the operation of accessing the BMC through the I2C bus;
S2, a delay of 1 to 5 seconds;
S3, the slave TMC performs the operation of accessing the BMC through the I2C bus and then delays for 1 to 5 seconds;
S4, the slave TMC sends the master TMC, through the serial port session, a control command allowing access to the BMC. This command is an OEM command conforming to the IPMI standard, and the command processing function of the master TMC sets the flag variable to 0. After the process on the master TMC determines that the flag variable is 0, it resumes access to the BMC through the I2C bus.
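In both embodiments the peer controller is kept off the bus from the prohibit command in S1 until the allow command in S4, that is, for T1 plus the duration of the BMC operation plus T2 (ignoring serial transmission time). The sketch below works through that arithmetic. The class name `GuardTiming` and the method name are hypothetical, and the 1 to 5 second range check reflects the embodiments only; the claims leave T1 and T2 unspecified.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GuardTiming:
    """Delays T1 and T2, in seconds, used around one guarded BMC access."""
    t1: float
    t2: float

    def __post_init__(self) -> None:
        # Range taken from the embodiments (1 to 5 seconds); the claims
        # do not fix a range, so enforcing it here is an assumption.
        for name in ("t1", "t2"):
            value = getattr(self, name)
            if not 1.0 <= value <= 5.0:
                raise ValueError(f"{name}={value} is outside 1 to 5 seconds")

    def lockout_seconds(self, access_seconds: float) -> float:
        """Time between the prohibit command (S1) and the allow command (S4)."""
        return self.t1 + access_seconds + self.t2


timing = GuardTiming(t1=1.0, t2=5.0)
print(timing.lockout_seconds(access_seconds=0.5))  # 1.0 + 0.5 + 5.0 = 6.5
```

Larger delays give the peer more margin to leave the bus but lengthen the period in which it cannot monitor the BMC, so the choice of T1 and T2 is a trade-off the source leaves to the implementer.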
Although the present invention has been described with reference to the specific embodiments, it should be understood by those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the invention.

Claims (9)

1. A method for simultaneous communication of master and slave TMCs with a BMC, characterized by comprising the following steps:
S1, when its process for accessing the baseboard management controller starts, the first box management controller first sends the second box management controller a control command prohibiting access to the baseboard management controller; on receiving the command, the second box management controller skips execution of its own process for accessing the baseboard management controller;
S2, the delay time is set to T1, and the method proceeds to step S3 after the delay of T1;
S3, the first box management controller performs the operation of accessing the baseboard management controller, then sets the delay time to T2, and proceeds to step S4 after the delay of T2;
S4, the first box management controller sends the second box management controller a control command allowing access to the baseboard management controller, and the process on the second box management controller resumes access to the baseboard management controller after determining that the allow command has been received.
2. The method of claim 1, wherein the first box management controller is a master box management controller and the second box management controller is a slave box management controller.
3. The method of claim 2, wherein the master box management controller sends the control commands to the slave box management controller through a serial port session.
4. The method of claim 3, wherein the control command prohibiting access to the baseboard management controller is an OEM command conforming to the IPMI standard, and the command processing function of the slave box management controller sets a flag variable to 1.
5. The method of claim 3, wherein the control command allowing access to the baseboard management controller is an OEM command conforming to the IPMI standard, and the command processing function of the slave box management controller sets a flag variable to 0.
6. The method of claim 1, wherein the first box management controller is a slave box management controller and the second box management controller is a master box management controller.
7. The method of claim 6, wherein the slave box management controller sends the control commands to the master box management controller through a serial port session.
8. The method of claim 7, wherein the control command prohibiting access to the baseboard management controller is an OEM command conforming to the IPMI standard, and the command processing function of the master box management controller sets a flag variable to 1.
9. The method of claim 7, wherein the control command allowing access to the baseboard management controller is an OEM command conforming to the IPMI standard, and the command processing function of the master box management controller sets a flag variable to 0.
CN201710340285.7A 2017-05-15 2017-05-15 Method for simultaneous communication of master and slave TMCs with a BMC (baseboard management controller) Active CN107133186B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710340285.7A CN107133186B (en) 2017-05-15 2017-05-15 Method for simultaneous communication of master and slave TMCs with a BMC (baseboard management controller)

Publications (2)

Publication Number Publication Date
CN107133186A CN107133186A (en) 2017-09-05
CN107133186B CN107133186B (en) 2020-02-07

Family

ID=59731807

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710340285.7A Active CN107133186B (en) 2017-05-15 2017-05-15 Method for simultaneous communication of master and slave TMCs with a BMC (baseboard management controller)

Country Status (1)

Country Link
CN (1) CN107133186B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108965382B (en) 2018-05-31 2021-06-29 郑州云海信息技术有限公司 File transfer method, device, equipment and medium based on BMC (baseboard management controller)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1558333A (en) * 2004-01-15 2004-12-29 中兴通讯股份有限公司 PCI arbitration mode configurable device and arbitration mode conversion method thereof
CN102446146A (en) * 2010-10-13 2012-05-09 鸿富锦精密工业(深圳)有限公司 Server and method for avoiding bus collision
CN202975735U (en) * 2012-11-16 2013-06-05 广东中钰科技有限公司 12C optical fiber communication driving system used for high-speed camera ball machine
CN104035831A (en) * 2014-07-01 2014-09-10 浪潮(北京)电子信息产业有限公司 High-end fault-tolerant computer management system and method
CN104123099A (en) * 2013-04-26 2014-10-29 株式会社东芝 Access control apparatusand access control method
CN104951251A (en) * 2015-05-29 2015-09-30 浪潮电子信息产业股份有限公司 Cloud server system architecture integrating architecture

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6965816B2 (en) * 2001-10-01 2005-11-15 Kline & Walker, Llc PFN/TRAC system FAA upgrades for accountable remote and robotics control to stop the unauthorized use of aircraft and to improve equipment management and public safety in transportation

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"列车通信网络系统形式化建模与验证方法研究";刘铭;《中国博士学位论文全文数据库 信息科技辑》;20120515;第36-37页 *

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant