CN111352662B - A server startup sequence control method, system, terminal and storage medium - Google Patents
A server startup sequence control method, system, terminal and storage medium
- Publication number
- CN111352662B CN202010094802.9A CN202010094802A
- Authority
- CN
- China
- Prior art keywords
- network port
- server
- bmc
- communication
- storage management
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/4401—Bootstrapping
- G06F9/4416—Network booting; Remote initial program loading [RIPL]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/08—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
- H04L43/0805—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
- H04L43/0817—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking functioning
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Theoretical Computer Science (AREA)
- Computer Security & Cryptography (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Environmental & Geological Engineering (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
The invention provides a server startup sequence control method, system, terminal and storage medium. The method includes: verifying that the control terminal can establish communication connections with the BMC network port, the service network port and the storage management network port of the server to be started; setting, in the BIOS, the power-on auto-start option of the operating system of the server to be started to off and the power-on auto-start option of the storage device to on; periodically sending communication establishment requests to the server's BMC network port, service network port and storage management network port, and obtaining the communication status of the BMC network port, service network port and storage management network port of the server to be started; and, if the communication status shows that both the BMC network port and the storage management network port are connected, starting the operating system of the server to be started with the BMC's low-level ipmitool command. The invention automatically makes the storage device start before the server's operating system and generates a corresponding fault alarm for a faulty component. Compared with existing startup methods, it therefore saves labor and improves the efficiency of starting servers in a machine room.
Description
Technical Field
The present invention relates to the technical field of servers, and in particular to a server startup sequence control method, system, terminal and storage medium.
Background
With the development of new technologies such as cloud computing and big data, servers and storage devices are used more and more, and a single data center machine room usually houses many of both. A storage device cannot provide application services on its own: it must be connected to a server by optical fiber or network cable, and its disks must be mounted under the server's operating system before they can be used. Today one server is usually connected to several storage devices, and one storage device is usually mapped to several servers. Each server mounts multiple storage disks, and many applications running on the operating system depend on those disks for reading and writing data. If the storage disks are not mounted after the server has booted into the operating system, the dependent applications report errors, services go down, and in serious cases data may be lost.
A storage disk usually fails to mount for one of two reasons. In the first, the storage device has a fault and cannot start. In the second, the storage device starts more slowly than the server; once the server's operating system has finished booting, it does not run the boot-time automatic mount command again, so the storage disks are left unmounted.
To avoid the second case, machine room administrators performing maintenance usually start the storage first, log in to the storage management interface to confirm that it has started normally, and only then manually start the front-end servers. The administrator has to wait until the device has fully started, which wastes time. If the machine room loses power unexpectedly and power is then restored, servers and storage restart at the same time, and the storage disks may fail to mount because the storage starts more slowly than the server, as described above. Startup time varies with the type of storage and its hardware configuration, so no fixed delay before starting the server can reliably avoid this situation.
Summary of the Invention
In view of the above deficiencies of the prior art, the present invention provides a server startup sequence control method, system, terminal and storage medium to solve the above technical problems.
In a first aspect, the present invention provides a server startup sequence control method, including:
verifying that the control terminal can establish communication connections with the BMC network port, the service network port and the storage management network port of the server to be started;
setting, in the BIOS, the power-on auto-start option of the operating system of the server to be started to off and the power-on auto-start option of the storage device to on;
periodically sending communication establishment requests to the server's BMC network port, service network port and storage management network port, and obtaining the communication status of the BMC network port, service network port and storage management network port of the server to be started;
if the communication status shows that both the BMC network port and the storage management network port are connected, starting the operating system of the server to be started with the BMC's low-level ipmitool command.
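The start condition in the last step can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names are hypothetical, and the text says only that an ipmitool command is used, so the `chassis power on` subcommand over the `lanplus` interface is an assumption about which standard ipmitool invocation would be chosen.

```python
from typing import List


def should_start_os(bmc_ok: bool, storage_ok: bool) -> bool:
    """Start condition from the method: BMC port and storage management port both connected."""
    return bmc_ok and storage_ok


def power_on_command(bmc_host: str, user: str, password: str) -> List[str]:
    """Build (but do not run) an ipmitool power-on command for the server's BMC.

    Assumption: the server is powered on with `chassis power on` over IPMI-over-LAN.
    """
    return [
        "ipmitool", "-I", "lanplus",
        "-H", bmc_host, "-U", user, "-P", password,
        "chassis", "power", "on",
    ]
```

Returning the argument list instead of executing it keeps the sketch testable without a BMC; a caller could pass the list to `subprocess.run` once `should_start_os` returns `True`.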
Further, verifying that the control terminal can establish communication connections with the BMC network port, the service network port and the storage management network port of the server to be started includes:
powering on and starting the server to be started;
sending communication establishment requests, according to pre-stored IP addresses, to the BMC network port, the service network port and the storage management network port, all of which are in the started state;
obtaining the result of the connection attempts; if communication connections are successfully established with the BMC network port, the service network port and the storage management network port, the verification passes.
Further, periodically sending communication establishment requests to the server's BMC network port, service network port and storage management network port includes:
setting a sending period;
after detecting that the server to be started has been powered on, cyclically sending communication requests, according to the sending period, to the network ports with which no communication connection has yet been established.
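The periodic request loop described above might look like the following sketch. The `probe` callable, the `max_rounds` limit and the injectable `sleep` are assumptions introduced for illustration; the text specifies only a sending period and that requests go to ports not yet connected.

```python
import time
from typing import Callable, Dict


def poll_until_connected(
    ports: Dict[str, str],
    probe: Callable[[str], bool],
    period_s: float,
    max_rounds: int,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, bool]:
    """Probe each port once per period, skipping ports that already answered."""
    connected = {name: False for name in ports}
    for round_no in range(max_rounds):
        pending = [name for name, ok in connected.items() if not ok]
        if not pending:
            break
        for name in pending:
            if probe(ports[name]):
                connected[name] = True
        # Wait one sending period before retrying the ports that are still down.
        if not all(connected.values()) and round_no < max_rounds - 1:
            sleep(period_s)
    return connected
```

Passing `probe` and `sleep` as parameters lets the same loop run against a real reachability check in production and against stubs in a test.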
Further, the method also includes:
if the communication status shows that the service network port is connected but the storage management network port is not, generating a storage startup error alarm;
if the communication status shows that the BMC network port is not connected but the service network port or the storage management network port is, generating a BMC network port fault alarm.
In a second aspect, the present invention provides a server startup sequence control system, including:
a communication verification unit, configured to verify that the control terminal can establish communication connections with the BMC network port, the service network port and the storage management network port of the server to be started;
an option setting unit, configured to set, in the BIOS, the power-on auto-start option of the operating system of the server to be started to off and the power-on auto-start option of the storage device to on;
a request sending unit, configured to periodically send communication establishment requests to the server's BMC network port, service network port and storage management network port, and to obtain the communication status of the BMC network port, service network port and storage management network port of the server to be started;
a startup execution unit, configured to start the operating system of the server to be started with the BMC's low-level ipmitool command if the communication status shows that both the BMC network port and the storage management network port are connected.
Further, the communication verification unit includes:
a startup completion module, configured to power on and start the server to be started;
a request establishment module, configured to send communication establishment requests, according to pre-stored IP addresses, to the BMC network port, the service network port and the storage management network port, all of which are in the started state;
a verification pass module, configured to obtain the result of the connection attempts; if communication connections are successfully established with the BMC network port, the service network port and the storage management network port, the verification passes.
Further, the request sending unit includes:
a period setting module, configured to set a sending period;
a request sending module, configured to cyclically send communication requests, according to the sending period, to the network ports with which no communication connection has yet been established, after detecting that the server to be started has been powered on.
Further, the system also includes:
a first alarm unit, configured to generate a storage startup error alarm if the communication status shows that the service network port is connected but the storage management network port is not;
a second alarm unit, configured to generate a BMC network port fault alarm if the communication status shows that the BMC network port is not connected but the service network port or the storage management network port is.
In a third aspect, a terminal is provided, including:
a processor and a memory, wherein
the memory is used to store a computer program, and
the processor is used to call the computer program from the memory and run it, so that the terminal performs the method described above.
In a fourth aspect, a computer storage medium is provided. The computer-readable storage medium stores instructions which, when run on a computer, cause the computer to perform the methods described in the above aspects.
The beneficial effects of the present invention are as follows.
The server startup sequence control method, system, terminal and storage medium provided by the present invention monitor the status of multiple network ports at a configured polling interval, encode the up/down state of the different network ports in binary to determine the operating state of the devices, and handle the various fault types according to the resulting status code. The invention thus controls the startup order of the server and the storage and also serves as a fault alarm. It automatically makes the storage device start before the server's operating system and generates a corresponding fault alarm for a faulty component. Compared with existing startup methods, it therefore saves labor and improves the efficiency of starting servers in a machine room.
In addition, the design principle of the present invention is reliable and its structure is simple, so it has very broad application prospects.
Brief Description of the Drawings
To illustrate the technical solutions in the embodiments of the present invention or in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Obviously, a person of ordinary skill in the art can obtain other drawings from these drawings without creative effort.
FIG. 1 is a schematic flowchart of a method according to an embodiment of the present invention.
FIG. 2 is a schematic block diagram of a system according to an embodiment of the present invention.
FIG. 3 is a schematic structural diagram of a terminal according to an embodiment of the present invention.
Detailed Description
To help those skilled in the art better understand the technical solutions of the present invention, the technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by a person of ordinary skill in the art from the embodiments of the present invention without creative effort shall fall within the protection scope of the present invention.
FIG. 1 is a schematic flowchart of a method according to an embodiment of the present invention. The method of FIG. 1 may be executed by a server startup sequence control system.
As shown in FIG. 1, the method 100 includes:
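Step 110: verify that the control terminal can establish communication connections with the BMC network port, the service network port and the storage management network port of the server to be started;
Step 120: in the BIOS, set the power-on auto-start option of the operating system of the server to be started to off and the power-on auto-start option of the storage device to on;
Step 130: periodically send communication establishment requests to the server's BMC network port, service network port and storage management network port, and obtain the communication status of the BMC network port, service network port and storage management network port of the server to be started;
Step 140: if the communication status shows that both the BMC network port and the storage management network port are connected, start the operating system of the server to be started with the BMC's low-level ipmitool command.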
To make the present invention easier to understand, the server startup sequence control method is described further below, based on its principle and on the process of controlling the server startup sequence in the embodiment.
Specifically, the server startup sequence control method includes:
S1. Verify that the control terminal can establish communication connections with the BMC network port, the service network port and the storage management network port of the server to be started.
Ensure that the management machine can communicate normally with the server BMC network port, the server service network port and the storage management network port. If the server to be started is already in the management machine's management list, it does not need to be verified every time (periodic verification according to a preset verification period is sufficient). If the server to be started has been newly added to the management list, it is verified as follows:
Power on the server to be started and let it finish starting, then send communication establishment requests, according to the pre-stored IP addresses, to the BMC network port, the service network port and the storage management network port, all of which are in the started state. Obtain the result of the connection attempts; if communication connections are successfully established with the BMC network port, the service network port and the storage management network port, the verification passes.
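The verification rule in S1 can be sketched as two small checks. The names `needs_verification` and `verify_ports`, the use of a last-verified timestamp to represent membership in the management list, and the `probe` callable are all assumptions made for illustration; the text does not say how the list or the verification period is stored.

```python
from typing import Callable, Dict, Optional


def needs_verification(
    last_verified: Optional[float], now: float, verify_period_s: float
) -> bool:
    """A newly added server (no timestamp) is always verified; a known server
    is re-verified only once the preset verification period has elapsed."""
    if last_verified is None:
        return True
    return now - last_verified >= verify_period_s


def verify_ports(ports: Dict[str, str], probe: Callable[[str], bool]) -> bool:
    """Verification passes only if every one of the three ports answers."""
    return all(probe(addr) for addr in ports.values())
```

Here `ports` would map the three port names to the pre-stored IP addresses mentioned in the text.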
S2. In the BIOS, set the power-on auto-start option of the operating system of the server to be started to off and the power-on auto-start option of the storage device to on.
Disable the server's power-on auto-start option in the BIOS; the storage, by contrast, must have power-on auto-start enabled.
S3. Periodically send communication establishment requests to the server's BMC network port, service network port and storage management network port, and obtain the communication status of the BMC network port, service network port and storage management network port of the server to be started.
Configure a scheduled task on the management machine by writing an instruction with a specified time or interval, so that the system automatically runs the script according to the schedule. At the configured query interval, the script periodically queries, in the background, the status of three network ports: the server BMC port, the server service port and the storage management port.
The server BMC network port starts as soon as power is applied, regardless of whether the server itself is switched on, so the BMC port can always be pinged as long as the server has power. The server service network port depends on the operating system and can be pinged only after the operating system has started. The storage management network port can likewise be pinged only after the storage has started normally; it is unreachable when the storage is shut down or has lost power.
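Since the text describes port status in terms of whether a port can be pinged, a reachability probe could be sketched as below. The flags assume the Linux iputils `ping` (`-c` for packet count, `-W` for the reply timeout in seconds); other platforms use different options, and the function names are hypothetical.

```python
import subprocess
from typing import List


def ping_command(host: str, timeout_s: int = 2) -> List[str]:
    """One echo request, waiting at most timeout_s seconds (Linux iputils flags)."""
    return ["ping", "-c", "1", "-W", str(timeout_s), host]


def is_reachable(host: str, timeout_s: int = 2) -> bool:
    """Return True if a single ping to host succeeds."""
    try:
        result = subprocess.run(
            ping_command(host, timeout_s),
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout_s + 1,
        )
    except (subprocess.TimeoutExpired, OSError):
        # A missing ping binary or a hung process is treated as unreachable.
        return False
    return result.returncode == 0
```

A scheduled task on the management machine could call `is_reachable` once per query interval for each of the three port addresses.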
The possible communication states are listed in the table below (Table 1 assumes that the server and the storage share the same input power circuit; if they are powered separately, the corresponding operating states of the server and storage can be inferred in the same way):
Table 1
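The binary status code can be illustrated with a short sketch. The contents of Table 1 do not appear in this text, so the bit weights below are an inference, not a quotation: they are the only assignment consistent with the handling rules that follow, in which codes 0 and 4 mean server and storage both off, codes 2 and 6 mean server on with storage off, and codes 1, 2 and 3 mean the BMC port is down while another port is up.

```python
# Inferred bit weights (assumption): BMC = 4, service = 2, storage management = 1.
BMC_BIT = 0b100
SERVICE_BIT = 0b010
STORAGE_BIT = 0b001


def status_code(bmc_ok: bool, service_ok: bool, storage_ok: bool) -> int:
    """Pack the up/down state of the three ports into a status code from 0 to 7."""
    code = 0
    if bmc_ok:
        code |= BMC_BIT
    if service_ok:
        code |= SERVICE_BIT
    if storage_ok:
        code |= STORAGE_BIT
    return code
```

Under this assignment, a powered server with its operating system off and its storage not yet ready yields code 4, and code 5 is the state in which the storage is ready and the server can be started.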
S4. If the communication status shows that both the BMC network port and the storage management network port are connected, start the operating system of the server to be started with the BMC's low-level ipmitool command. If the communication status shows that the service network port is connected but the storage management network port is not, generate a storage startup error alarm. If the communication status shows that the BMC network port is not connected but the service network port or the storage management network port is, generate a BMC network port fault alarm.
The communication states in Table 1 are handled as follows:
When the status code is "0" or "4", that is, when both the server and the storage are shut down, the system keeps waiting for the storage to start. Once the storage management port is detected to be reachable, the server is started with the BMC's low-level ipmitool command.
When the status code is "2" or "6", that is, when the server is on and the storage is off: because the server's operating system depends on the storage disks to run, this state cannot occur in normal operation. It is therefore judged to be a storage fault. The server is not restarted through the BMC; instead a storage alarm is sent directly to the administrator, who needs to check the storage status.
When the status code is "1", "2" or "3", that is, when the BMC port is unreachable but the server service port or the storage management port is reachable, the cabinet has not lost power. Since the BMC port is normally unreachable only when the server has no power, the server BMC port is judged to be faulty. A server BMC port alarm is sent directly to the administrator, who needs to check the server's BMC network.
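The handling rules above can be collected into one dispatch function. This is a sketch under assumptions: the bit weights (BMC = 4, service = 2, storage management = 1) are inferred from the rules because the table is not reproduced here, the action names are hypothetical, and treating code 7 (everything up) as needing no action is an interpretation rather than something the text states. Code 2 appears in two of the rules, so the function returns a list and reports both alarms for it.

```python
from typing import List


def actions_for(code: int) -> List[str]:
    """Map a 3-bit port status code to the handling described in the text."""
    bmc_ok = bool(code & 0b100)      # inferred weight 4
    service_ok = bool(code & 0b010)  # inferred weight 2
    storage_ok = bool(code & 0b001)  # inferred weight 1

    actions: List[str] = []
    if not bmc_ok and (service_ok or storage_ok):
        actions.append("alarm_bmc_port")      # codes 1, 2, 3
    if service_ok and not storage_ok:
        actions.append("alarm_storage")       # codes 2, 6
    if bmc_ok and storage_ok and not service_ok:
        actions.append("power_on_server")     # code 5: storage ready, OS not started
    if not actions and not storage_ok:
        actions.append("wait_for_storage")    # codes 0, 4
    return actions
```

For example, `actions_for(4)` yields `["wait_for_storage"]`, and once the storage management port comes up the code becomes 5 and the result changes to `["power_on_server"]`.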
As shown in FIG. 2, the system 200 includes:
a communication verification unit 210, configured to verify that the control terminal can establish communication connections with the BMC network port, the service network port and the storage management network port of the server to be started;
an option setting unit 220, configured to set, in the BIOS, the power-on auto-start option of the operating system of the server to be started to off and the power-on auto-start option of the storage device to on;
a request sending unit 230, configured to periodically send communication establishment requests to the server's BMC network port, service network port and storage management network port, and to obtain the communication status of the BMC network port, service network port and storage management network port of the server to be started;
a startup execution unit 240, configured to start the operating system of the server to be started with the BMC's low-level ipmitool command if the communication status shows that both the BMC network port and the storage management network port are connected.
Optionally, as an embodiment of the present invention, the communication verification unit includes:
a startup completion module, configured to power on and start the server to be started;
a request establishment module, configured to send communication establishment requests, according to pre-stored IP addresses, to the BMC network port, the service network port and the storage management network port, all of which are in the started state;
a verification pass module, configured to obtain the result of the connection attempts; if communication connections are successfully established with the BMC network port, the service network port and the storage management network port, the verification passes.
Optionally, as an embodiment of the present invention, the request sending unit includes:
a period setting module, configured to set a sending period;
a request sending module, configured to cyclically send communication requests, according to the sending period, to the network ports with which no communication connection has yet been established, after detecting that the server to be started has been powered on.
Optionally, as an embodiment of the present invention, the system further includes:
a first alarm unit, configured to generate a storage startup error alarm if the communication status shows that the service network port is connected but the storage management network port is not;
a second alarm unit, configured to generate a BMC network port fault alarm if the communication status shows that the BMC network port is not connected but the service network port or the storage management network port is.
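FIG. 3 is a schematic structural diagram of a terminal system 300 according to an embodiment of the present invention. The terminal system 300 can be used to perform the server startup sequence control method provided by the embodiments of the present invention.
The terminal system 300 may include a processor 310, a memory 320 and a communication unit 330. These components communicate over one or more buses. Those skilled in the art will understand that the server structure shown in the figure does not limit the present invention: it may be a bus structure or a star structure, and it may include more or fewer components than shown, combine certain components, or arrange the components differently.
The memory 320 may be used to store instructions executed by the processor 310. The memory 320 may be implemented by any type of volatile or non-volatile storage terminal or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic disk or optical disk. When the instructions in the memory 320 are executed by the processor 310, the terminal 300 is enabled to perform some or all of the steps of the method embodiments described above.
The processor 310 is the control center of the storage terminal. It connects the parts of the whole electronic terminal through various interfaces and lines, and performs the functions of the electronic terminal and/or processes data by running or executing software programs and/or modules stored in the memory 320 and by calling data stored in the memory. The processor may consist of integrated circuits (ICs), for example a single packaged IC or several connected packaged ICs with the same or different functions. For example, the processor 310 may include only a central processing unit (CPU). In the embodiments of the present invention, the CPU may have a single computing core or multiple computing cores.
The communication unit 330 is used to establish a communication channel so that the storage terminal can communicate with other terminals, receiving user data sent by other terminals or sending user data to other terminals.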
The present invention also provides a computer storage medium. The computer storage medium may store a program which, when executed, may perform some or all of the steps of the embodiments provided by the present invention. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), a random access memory (RAM) or the like.
The present invention therefore monitors the status of multiple network ports at a configured polling interval, encodes the up/down state of the different network ports in binary to determine the operating state of the devices, and handles the various fault types according to the resulting status code. It thus controls the startup order of the server and the storage and also serves as a fault alarm. The invention automatically makes the storage device start before the server's operating system and generates a corresponding fault alarm for a faulty component. Compared with existing startup methods, it therefore saves labor and improves the efficiency of starting servers in a machine room. For the technical effects achievable by this embodiment, see the description above; they are not repeated here.
本领域的技术人员可以清楚地了解到本发明实施例中的技术可借助软件加必需的通用硬件平台的方式来实现。基于这样的理解,本发明实施例中的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中如U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质,包括若干指令用以使得一台计算机终端(可以是个人计算机,服务器,或者第二终端、网络终端等)执行本发明各个实施例所述方法的全部或部分步骤。Those skilled in the art can clearly understand that the technology in the embodiments of the present invention can be implemented by means of software plus a necessary general hardware platform. Based on such understanding, the technical solutions in the embodiments of the present invention can be embodied in the form of software products in essence or in the parts that make contributions to the prior art. The computer software products are stored in a storage medium such as a USB flash drive, a mobile Hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program codes, including several instructions to make a computer terminal (It may be a personal computer, a server, or a second terminal, a network terminal, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present invention.
Parts that are the same or similar across the embodiments in this specification may be understood by cross-reference. In particular, because the terminal embodiment is substantially similar to the method embodiment, it is described only briefly; for the related details, refer to the description of the method embodiment.
In the several embodiments provided by the present invention, it should be understood that the disclosed system and method may be implemented in other ways. The system embodiments described above are merely illustrative. For example, the division into units is only a division by logical function, and an actual implementation may divide them differently: multiple units or components may be combined or integrated into another system, and some features may be omitted or not executed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through interfaces, systems, or units, and may be electrical, mechanical, or of another form.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit.
Although the present invention has been described in detail with reference to the accompanying drawings and in connection with the preferred embodiments, it is not limited to them. Without departing from the spirit and essence of the present invention, those of ordinary skill in the art may make various equivalent modifications or substitutions to the embodiments, and all such modifications or substitutions fall within the scope of the present invention. Likewise, any change or substitution that a person skilled in the art could readily conceive within the technical scope disclosed herein falls within the protection scope of the present invention. The protection scope of the present invention is therefore defined by the protection scope of the claims.
Claims (4)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010094802.9A CN111352662B (en) | 2020-02-16 | 2020-02-16 | A server startup sequence control method, system, terminal and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010094802.9A CN111352662B (en) | 2020-02-16 | 2020-02-16 | A server startup sequence control method, system, terminal and storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111352662A (en) | 2020-06-30 |
CN111352662B (en) | 2022-07-08 |
Family
ID=71195735
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010094802.9A Active CN111352662B (en) | 2020-02-16 | 2020-02-16 | A server startup sequence control method, system, terminal and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111352662B (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114385253B (en) * | 2021-12-23 | 2024-12-06 | 新华三云计算技术有限公司 | Server power-on management method, device, equipment and machine-readable storage medium |
CN114238030A (en) * | 2021-12-24 | 2022-03-25 | 中国建设银行股份有限公司 | A kind of host abnormal restart monitoring method and related device |
CN114461286B (en) * | 2022-01-29 | 2023-08-04 | 苏州浪潮智能科技有限公司 | A server startup method, device, electronic equipment and readable storage medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109510742A (en) * | 2019-01-25 | 2019-03-22 | 郑州云海信息技术有限公司 | A kind of server network interface card remote test method, device, terminal and storage medium |
CN110162435A (en) * | 2019-04-24 | 2019-08-23 | 苏州浪潮智能科技有限公司 | A kind of server PXE starting test method, system, terminal and storage medium |
CN110609765A (en) * | 2019-08-16 | 2019-12-24 | 苏州浪潮智能科技有限公司 | Method, system, device and storage medium for automatic random restart of server |
-
2020
- 2020-02-16 CN CN202010094802.9A patent/CN111352662B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN111352662A (en) | 2020-06-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108600029B (en) | A configuration file updating method, device, terminal device and storage medium | |
CN111352662B (en) | A server startup sequence control method, system, terminal and storage medium | |
CN107147540A (en) | Fault Handling Method and Fault Handling Cluster in High Availability System | |
CN114116280B (en) | Interactive BMC self-recovery method, system, terminal and storage medium | |
WO2018095107A1 (en) | Bios program abnormal processing method and apparatus | |
CN1770707B (en) | Apparatus and method for quorum-based power-down of unresponsive servers in a computer cluster | |
CN112463239A (en) | BIOS and BMC interaction option default value modification method, system, terminal and storage medium | |
CN109032888A (en) | A kind of server start-up course fault detection method, device and controlled terminal | |
CN109976926A (en) | Method, circuit, terminal and the storage medium of protection BMC renewal process are restarted in a kind of shielding | |
CN111949216A (en) | Method, system, terminal and storage medium for automatic expansion of cloud platform storage volume | |
CN108737153A (en) | Block chain disaster recovery and backup systems, method, server and computer readable storage medium | |
CN107423620B (en) | Storage server service process management method and device | |
CN111506331A (en) | Server BMC refreshing method, system, terminal and storage medium | |
CN114640709A (en) | Edge node processing method, device and medium | |
CN111090537B (en) | Cluster starting method and device, electronic equipment and readable storage medium | |
CN118656245A (en) | A method, device, electronic device and medium for handling server exceptions | |
CN112035295A (en) | Virtual machine crash event processing method, system, terminal and storage medium | |
CN111488250A (en) | Test method, system, terminal and storage medium for high-density multi-node server | |
CN114461286B (en) | A server startup method, device, electronic equipment and readable storage medium | |
CN116449800A (en) | A control logic switching method, device, equipment and medium | |
CN110058866A (en) | Cluster component mounting method and equipment | |
CN112732486B (en) | Redundant firmware switching method, device, equipment and storage medium | |
CN115756648A (en) | Active recovery method and terminal for double-substrate management controller chip | |
CN110609765B (en) | Method, system, device and storage medium for automatic random restart of server | |
CN115562900A (en) | AMD server system installation power-off processing method, device, equipment and medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||