CN114116330B - Server performance testing method, system, terminal and storage medium - Google Patents


Publication number
CN114116330B
Authority
CN
China
Prior art keywords
log
basic input
options
target
output system
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202111231193.8A
Other languages
Chinese (zh)
Other versions
CN114116330A (en
Inventor
席泽鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/22 Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2273 Test methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/22 Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2268 Logging of test results
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/22 Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2289 Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing by configuration test

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention provides a server performance test method, a system, a terminal and a storage medium, comprising the following steps: restoring default values of options of the basic input and output system; reading target options and corresponding target values in the test configuration file, and modifying the values of the target options of the basic input/output system into the corresponding target values by using an installation and configuration tool; configuring an error injection instruction according to a test requirement, and executing the error injection instruction; and collecting error report logs, analyzing whether the error report logs are matched with executed error injection instructions, and if the error report logs are matched with the executed error injection instructions, passing the test. Compared with the conventional server RAS test, the method greatly saves labor and time and improves the test efficiency.

Description

Server performance testing method, system, terminal and storage medium
Technical Field
The invention relates to the technical field of server testing, in particular to a server performance testing method, a system, a terminal and a storage medium.
Background
As servers have developed, the requirements on RAS performance have kept rising. RAS refers to the Reliability, Availability, and Serviceability of a machine. RAS features are intended to let the whole server operate reliably over long periods, to reduce the probability that a fault takes the machine offline, and to correct repairable errors in time. RAS capability is mainly assessed in several areas: system-level RAS features, CPU RAS features, memory RAS features, I/O RAS features, and so on. RAS performance is therefore a composite indicator, and different business users have different RAS requirements. The RAS features of server memory are the most critical, because almost all of a server's computation runs in memory, and the quality of the memory determines whether the server can function properly. For example, some HP servers have run continuously for twenty years without being shut down once; in a sense, the program resides in memory rather than on the server. According to joint research by Google and several institutions, the memory error rate is actually higher than expected (refer), and memory performance also degrades through aging and other causes, which is useful reference information. According to Bode customer service personnel, some Bode servers ran continuously for five years and were never stopped; only when the equipment was replaced and the work records were checked was it found that the programs had never stopped once started. Many selling points for X86 servers now center on memory, such as memory ECC, Spare Row, Chip Kill, and Single Device Data Correction (SDDC); Chip Kill is a proprietary IBM technology used mainly on high-end servers. The RAS performance of the CPU is also critical.
The Xeon 7500 series processors introduced earlier by Intel brought the reliability, availability, and serviceability (RAS) of the Xeon platform to a new level, porting features such as the Machine Check Architecture (MCA) that had previously been exclusive to RISC architectures, and gave the Xeon 7500 platform a stronger footing for high-end, business-critical applications. The release of the Xeon 7500 was an important step in demonstrating Intel's ambitions in the critical-application market.
Existing RAS testing is mostly manual: the tester first enters Setup to modify BIOS options by hand, then connects an ITP, runs commands in CScripts for test verification, and finally checks the logs of the BMC, the OS, and so on. RAS features are complex, and testing each one is time-consuming and labor-intensive.
Disclosure of Invention
Aiming at the defects in the prior art, the invention provides a server performance testing method, a system, a terminal and a storage medium, so as to solve the technical problems.
In a first aspect, the present invention provides a method for testing server performance, including:
restoring default values of options of the basic input and output system;
Reading target options and corresponding target values in the test configuration file, and modifying the values of the target options of the basic input/output system into the corresponding target values by using an installation and configuration tool;
Configuring an error injection instruction according to a test requirement, and executing the error injection instruction;
and collecting error report logs, analyzing whether the error report logs are matched with executed error injection instructions, and if the error report logs are matched with the executed error injection instructions, passing the test.
Further, restoring the default values of the options of the basic input/output system comprises:
restoring the value of each option of the basic input/output system to its default value by using a basic input/output system initialization instruction of the intelligent platform management interface;
restarting the server so that the basic input/output system loads the default values.
Further, reading the target options and corresponding target values in the test configuration file, and modifying the values of the target options of the basic input output system to the corresponding target values by using the installation and configuration tool comprises:
Taking corresponding options in the basic input and output system as target options according to the characteristics of the test, setting target values of the target options, and writing the target options and the corresponding target values into a test configuration file;
The basic input and output system is modified under the operating system according to the test configuration file by using the installation and configuration tool, and the server is restarted to enable the target value of the target option of the basic input and output system to be effective.
Further, configuring an error injection instruction according to a test requirement, and executing the error injection instruction includes:
configuring a plurality of error injection instructions and an execution sequence of the error injection instructions;
and executing the error injection instructions in sequence according to the execution sequence.
Further, collecting error report logs, analyzing whether the error report logs match the executed error injection instructions, and passing the test if they match, comprises:
clearing the system log and the baseboard management controller log before executing the error injection instructions;
collecting and then clearing the system log and the baseboard management controller log after execution of the error injection instructions is completed, and screening the error report logs from all collected logs;
matching the keywords of the error report logs against the error injection types of the corresponding error injection instructions, and judging that the test is passed if every error injection instruction has a matching error report log.
In a second aspect, the present invention provides a server performance testing system, including:
The option recovery unit is used for recovering default values of options of the basic input/output system;
the option modifying unit is used for reading the target option and the corresponding target value in the test configuration file, and modifying the value of the target option of the basic input/output system into the corresponding target value by using the installation and configuration tool;
The system error injection unit is used for configuring error injection instructions according to test requirements and executing the error injection instructions;
the log analysis unit is used for collecting the error report log, analyzing whether the error report log is matched with the executed error injection instruction, and if the error report log is matched with the executed error injection instruction, the test is passed.
Further, the option restoring unit includes:
The option recovery module is used for recovering the values of all options of the basic input/output system to default values by using a basic input/output system initialization instruction of the intelligent platform management interface;
And the recovery validation module is used for restarting the server to load the default value for the basic input/output system.
Further, the option modifying unit includes:
The test configuration module is used for taking corresponding options in the basic input and output system as target options according to the characteristics of the test, setting target values of the target options, and writing the target options and the corresponding target values into a test configuration file;
And the option modification module is used for modifying the basic input and output system according to the test configuration file under the operating system by using the installation and configuration tool, and restarting the server to enable the target value of the target option of the basic input and output system to be effective.
Further, the system error injection unit is used for:
configuring a plurality of error injection instructions and an execution sequence of the error injection instructions;
and executing the error injection instructions in sequence according to the execution sequence.
Further, the log analysis unit is configured to:
clearing the system log and the baseboard management controller log before executing the error injection instructions;
collecting and then clearing the system log and the baseboard management controller log after execution of the error injection instructions is completed, and screening the error report logs from all collected logs;
matching the keywords of the error report logs against the error injection types of the corresponding error injection instructions, and judging that the test is passed if every error injection instruction has a matching error report log.
In a third aspect, a terminal is provided, including:
A processor, a memory, wherein,
The memory is used for storing a computer program,
The processor is configured to call and run the computer program from the memory, so that the terminal performs the method described above.
In a fourth aspect, there is provided a computer storage medium having instructions stored therein which, when run on a computer, cause the computer to perform the method of the above aspects.
The server performance testing method, system, terminal and storage medium have the following beneficial effects: based on Intel's CScripts tool, five parts (restoring BIOS default values, modifying BIOS options, clearing logs, collecting logs, and analyzing logs) are integrated with the error injection process, so that CScripts is executed automatically and the full flow is automated. Compared with conventional server RAS testing, the method greatly saves labor and time and improves test efficiency.
In addition, the invention has reliable design principle, simple structure and very wide application prospect.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings that are required to be used in the description of the embodiments or the prior art will be briefly described below, and it will be obvious to those skilled in the art that other drawings can be obtained from these drawings without inventive effort.
FIG. 1 is a schematic flow chart of a method of one embodiment of the invention.
FIG. 2 is a schematic block diagram of a system of one embodiment of the present invention.
Fig. 3 is a schematic structural diagram of a terminal according to an embodiment of the present invention.
Detailed Description
In order to make the technical solution of the present invention better understood by those skilled in the art, the technical solution of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is apparent that the described embodiments are only some embodiments of the present invention, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the present invention without making any inventive effort, shall fall within the scope of the present invention.
The following explains key terms appearing in the present invention.
BMC: the baseboard management controller (Baseboard Management Controller), the remote management controller of a server. It can perform operations such as upgrading firmware and inspecting the machine's devices while the machine is powered off. Fully implementing IPMI functionality in a BMC requires a powerful 16-bit or 32-bit microcontroller, RAM for data storage, flash memory for non-volatile data storage, and firmware that provides basic remote manageability: secure remote reboot, secure power cycling, LAN alerting, and system health monitoring. In addition to the basic IPMI and system monitoring functions, the mBMC can also enable BIOS flash selection and protection by using one of the two flash memories to store the previous BIOS. For example, when the system fails to boot after a remote BIOS upgrade, the remote administrator can switch back to the previously working BIOS image to boot the system. Once the BIOS has been upgraded, the BIOS image can be locked, which effectively prevents viruses from tampering with it.
BIOS is the acronym of "Basic Input Output System". On IBM PC compatible systems, it is an industry-standard firmware interface. It is a set of programs stored on a ROM chip on the computer's motherboard, holding the computer's most important basic input/output routines, the power-on self-test program, and the system bootstrap program, and it can read and write specific system settings from CMOS. Its main function is to provide the lowest-level, most direct hardware setup and control for the computer. In addition, the BIOS provides some system parameters to the operating system. Changes in system hardware are hidden by the BIOS, and programs use BIOS functions rather than controlling the hardware directly. Modern operating systems ignore the abstraction layer provided by the BIOS and control the hardware components directly.
CPU: the central processing unit (Central Processing Unit, CPU for short) is the computing and control core of a computer system and the final execution unit for information processing and program execution.
Reliability (Reliability) is mainly realized in industry applications through dual-machine hot backup. Although Itanium minicomputers and high-end servers such as the PR4840R are highly stable, users in some critical applications still prefer a dual-machine deployment. In dual-machine hot backup, one host is the working machine and the other is the backup machine. Under normal conditions the working machine supports the information system while the backup machine monitors the working machine; the working machine also monitors whether the backup machine is healthy. If the backup machine becomes abnormal for some reason, the working machine notifies the system administrator so that the problem can be resolved as soon as possible, ensuring that the next switchover is reliable. When the working machine becomes abnormal and can no longer support the information system, the backup machine actively takes over its work and continues to support the information system, so that the system runs without interruption. Once the working machine has been repaired and is back to normal, it can automatically take back its previous work and return to its earlier normal working state.
Availability (Availability) refers to the likelihood that the system responds quickly and operates correctly when it is requested for use. Many critical services in industry applications cannot be interrupted during operation, including business intelligence analysis, business processing (transaction processing, ERP, and so on), and core databases, particularly those run by government, national defense, security, telecommunications, finance, transportation, medical, and other enterprises and institutions whose work bears on national affairs.
Serviceability (Serviceability) means that, when the volume of data to be processed is large and real-time, dynamic, and efficient processing is required, the hosting platform offers strong and easily expandable high-performance processing capability, for example through memory expansion and hard disk expansion, as in Bode's modular servers and storage servers.
The Intelligent Platform Management Interface (IPMI) is an industry standard adopted for peripheral devices of Intel-architecture enterprise systems. IPMI is also an open, free standard that users can adopt without paying additional fees. IPMI works across different operating systems, firmware, and hardware platforms, and can intelligently monitor, control, and automatically report the operating status of large numbers of servers, reducing the cost of server systems.
FIG. 1 is a schematic flow chart of a method of one embodiment of the invention. The execution body of fig. 1 may be a server performance test system.
As shown in fig. 1, the method includes:
Step 110, recovering default values of options of the basic input/output system;
Step 120, reading the target options and corresponding target values in the test configuration file, and modifying the values of the target options of the basic input/output system to the corresponding target values by using the installation and configuration tool;
step 130, configuring an error injection instruction according to a test requirement, and executing the error injection instruction;
Step 140, collecting error report logs, analyzing whether the error report logs match the executed error injection instructions, and passing the test if they match.
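The four steps above can be pictured as a fixed sequence in which each step must succeed before the next one starts. The following is only a minimal sketch under that assumption; `run_pipeline` and the step names are hypothetical and do not come from the patent, and each step is represented by a placeholder callable rather than a real BIOS, CScripts, or log operation.

```python
from typing import Callable, List, Tuple

# A step is a (name, action) pair; the action returns True on success.
Step = Tuple[str, Callable[[], bool]]


def run_pipeline(steps: List[Step]) -> Tuple[bool, List[str]]:
    """Run steps in order and stop at the first failure.

    Returns (all_succeeded, names_of_completed_steps).
    """
    completed: List[str] = []
    for name, action in steps:
        if not action():
            return False, completed
        completed.append(name)
    return True, completed


# Placeholder actions standing in for steps 110 to 140 (assumed names).
example_steps: List[Step] = [
    ("restore_bios_defaults", lambda: True),
    ("apply_target_options", lambda: True),
    ("inject_errors", lambda: True),
    ("collect_and_match_logs", lambda: True),
]
```

Stopping at the first failed step is a design choice of this sketch: later steps, such as log matching, are meaningless if the BIOS options were never applied.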
To facilitate understanding of the present invention, the server performance testing method is further described below, following its underlying principle, with reference to the process of testing server performance in an embodiment.
CScripts is a debug and validation tool provided by Intel. It is written in Python and can automatically execute Python scripts. Based on Intel's CScripts tool, the invention integrates five parts (restoring BIOS default values, modifying BIOS options, clearing logs, collecting logs, and analyzing logs) with the error injection process, so that CScripts is executed automatically and the full flow is automated. Specifically, the server performance testing method comprises the following steps:
S1, recovering default values of options of the basic input and output system.
The value of each option of the basic input/output system is restored to its default value by using a basic input/output system initialization instruction of the intelligent platform management interface; the server is then restarted so that the basic input/output system loads the default values.
To prevent interference from irrelevant BIOS options, the BIOS defaults are restored first, which is done with an IPMI command. When the command ipmitool -I lanplus -H -U -P chassis bootdev none clear-cmos=yes is executed (the arguments to -H, -U, and -P are not given here), the BIOS loads its default values.
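As an illustration of the restore step, the sketch below assembles the ipmitool argument list quoted above, plus a chassis power cycle to restart the server. It only builds the argument lists and does not execute them. The helper names and the host, user, and password values are assumptions for illustration; the patent does not say how the restart is triggered, so the use of `chassis power cycle` is also an assumption.

```python
from typing import List


def _base(host: str, user: str, password: str) -> List[str]:
    """Common ipmitool arguments for a remote lanplus session."""
    return ["ipmitool", "-I", "lanplus", "-H", host, "-U", user, "-P", password]


def build_clear_cmos_command(host: str, user: str, password: str) -> List[str]:
    """Argument list that asks the BMC to clear CMOS on the next boot."""
    return _base(host, user, password) + ["chassis", "bootdev", "none", "clear-cmos=yes"]


def build_power_cycle_command(host: str, user: str, password: str) -> List[str]:
    """Argument list that power-cycles the server (assumed restart method)."""
    return _base(host, user, password) + ["chassis", "power", "cycle"]
```

The lists could be passed to `subprocess.run(cmd, check=True)` on a machine where ipmitool is installed and the BMC is reachable; passing a list rather than a shell string avoids quoting problems with passwords.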
S2, reading target options and corresponding target values in the test configuration file, and modifying the values of the target options of the basic input/output system into the corresponding target values by using an installation and configuration tool.
Taking corresponding options in the basic input and output system as target options according to the characteristics of the test, setting target values of the target options, and writing the target options and the corresponding target values into a test configuration file; the basic input and output system is modified under the operating system according to the test configuration file by using the installation and configuration tool, and the server is restarted to enable the target value of the target option of the basic input and output system to be effective.
Testing different RAS features requires modifying different BIOS options, so AMI's SCE tool is used to modify the BIOS options from within the OS according to the test requirements. After the modification is complete, the machine is restarted for the changes to take effect.
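The patent does not specify the format of the test configuration file. As a sketch only, the code below assumes a simple "option = value" text format with "#" comments, parses it into target options, and works out which options differ from the current BIOS values. The function names and the option names used in the usage note are hypothetical, and the call to the SCE tool itself is deliberately left out because its command line is not given in the text.

```python
from typing import Dict


def parse_test_config(text: str) -> Dict[str, str]:
    """Parse 'option = value' lines into a dict of target options.

    Assumed format: one option per line, '#' starts a comment.
    """
    options: Dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        name, sep, value = line.partition("=")
        if not sep:
            raise ValueError(f"malformed line: {raw!r}")
        options[name.strip()] = value.strip()
    return options


def pending_changes(current: Dict[str, str], target: Dict[str, str]) -> Dict[str, str]:
    """Return only the target options whose value differs from the current one."""
    return {name: value for name, value in target.items() if current.get(name) != value}
```

For example, with the hypothetical content "PatrolScrub = Enabled", `parse_test_config` would produce one target option, and `pending_changes` would report it only if the current BIOS value is not already "Enabled".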
S3, configuring an error injection instruction according to the test requirement, and executing the error injection instruction.
Configuring a plurality of error injection instructions and an execution sequence of the error injection instructions; and executing the error injection instructions in sequence according to the execution sequence.
The error injection instructions are written according to the RAS features to be tested. For example, to test memory, the memory-related debug instructions are configured.
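The idea of configuring several error injection instructions together with their execution order can be sketched as follows. This is an assumption-laden illustration: the `Injection` class and `run_injections` are hypothetical names, and each `run` callable is a stand-in for whatever CScripts command performs the injection, since no CScripts API is given in the text.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Injection:
    order: int                  # position in the configured execution sequence
    error_type: str             # label later used when matching logs
    run: Callable[[], None]     # placeholder for the real injection command


def run_injections(plan: List[Injection]) -> List[str]:
    """Execute the injections in ascending order and return the types executed."""
    executed: List[str] = []
    for step in sorted(plan, key=lambda s: s.order):
        step.run()
        executed.append(step.error_type)
    return executed
```

Returning the list of executed error types gives the later log analysis a record of exactly which injections need a matching error report.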
S4, collecting error report logs, analyzing whether the error report logs are matched with executed error injection instructions, and if the error report logs are matched with the executed error injection instructions, passing the test.
The system log and the baseboard management controller log are cleared before the error injection instructions are executed; after execution of the error injection instructions is completed, the system log and the baseboard management controller log are collected and then cleared, and the error report logs are screened from all collected logs; the keywords of the error report logs are matched against the error injection types of the corresponding error injection instructions, and the test is judged to have passed if every error injection instruction has a matching error report log.
Before the errors are injected, the logs are emptied so that the logs generated after injection are easy to check. The BMC SEL log is cleared with the command ipmitool sel clear. For logs under Linux, deleting the log file directly may cause logging errors so that logs are no longer generated; the log is therefore emptied with the command cat /dev/null > /var/log/messages. After error injection is completed, error report logs are generated under both the BMC and the OS. The SEL log is collected with the IPMI command ipmitool sel elist, and the logs in the OS are downloaded locally over SFTP. Finally, the collected logs are analyzed by matching the log files with Python's matching module. If the expected keywords are matched, the result is PASS; otherwise it is FAIL.
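The final matching step can be sketched with Python's standard `re` module. The text only says that a Python matching module is used, so the choice of `re`, the function names, and the keyword patterns are assumptions. Each injected error type is mapped to a pattern, and the test passes only if every injected type has at least one matching line in the collected logs.

```python
import re
from typing import Dict, List


def match_injections(expected: Dict[str, str], log_lines: List[str]) -> Dict[str, bool]:
    """For each injected error type, report whether any collected line matches its pattern."""
    results: Dict[str, bool] = {}
    for error_type, pattern in expected.items():
        regex = re.compile(pattern, re.IGNORECASE)
        results[error_type] = any(regex.search(line) for line in log_lines)
    return results


def verdict(results: Dict[str, bool]) -> str:
    """PASS only if at least one injection was checked and every one has a matching line."""
    return "PASS" if results and all(results.values()) else "FAIL"
```

In use, `log_lines` would hold the lines gathered from the SEL listing and the downloaded OS log, and `expected` would be built from the list of injections that were actually executed. Treating an empty result as FAIL is a choice of this sketch, so that a run with no injections cannot pass silently.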
As shown in fig. 2, the system 200 includes:
An option restoring unit 210, configured to restore default values of options of the basic input/output system;
An option modifying unit 220, configured to read the target option and the corresponding target value in the test configuration file, and modify the value of the target option of the basic input output system to the corresponding target value by using the installation and configuration tool;
The system error injection unit 230 is configured to configure an error injection instruction according to a test requirement, and execute the error injection instruction;
The log analysis unit 240 is configured to collect the error log, analyze whether the error log is matched with the executed error injection instruction, and pass the test if the error log is matched with the executed error injection instruction.
Optionally, as an embodiment of the present invention, the option restoring unit includes:
The option recovery module is used for recovering the values of all options of the basic input/output system to default values by using a basic input/output system initialization instruction of the intelligent platform management interface;
And the recovery validation module is used for restarting the server to load the default value for the basic input/output system.
Optionally, as an embodiment of the present invention, the option modifying unit includes:
The test configuration module is used for taking corresponding options in the basic input and output system as target options according to the characteristics of the test, setting target values of the target options, and writing the target options and the corresponding target values into a test configuration file;
And the option modification module is used for modifying the basic input and output system according to the test configuration file under the operating system by using the installation and configuration tool, and restarting the server to enable the target value of the target option of the basic input and output system to be effective.
Optionally, as an embodiment of the present invention, the system error injection unit is configured to:
configuring a plurality of error injection instructions and an execution sequence of the error injection instructions;
and executing the error injection instructions in sequence according to the execution sequence.
Optionally, as an embodiment of the present invention, the log analysis unit is configured to:
clearing the system log and the baseboard management controller log before executing the error injection instructions;
collecting and then clearing the system log and the baseboard management controller log after execution of the error injection instructions is completed, and screening the error report logs from all collected logs;
matching the keywords of the error report logs against the error injection types of the corresponding error injection instructions, and judging that the test is passed if every error injection instruction has a matching error report log.
Fig. 3 is a schematic structural diagram of a terminal 300 according to an embodiment of the present invention, where the terminal 300 may be used to execute the server performance test method according to the embodiment of the present invention.
The terminal 300 may include: a processor 310, a memory 320 and a communication unit 330. The components may communicate via one or more buses, and it will be appreciated by those skilled in the art that the configuration of the server as shown in the drawings is not limiting of the invention, as it may be a bus-like structure, a star-like structure, or include more or fewer components than shown, or may be a combination of certain components or a different arrangement of components.
The memory 320 may be used to store instructions for execution by the processor 310, and the memory 320 may be implemented by any type of volatile or non-volatile memory device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic disk, or optical disk. The instructions in the memory 320, when executed by the processor 310, enable the terminal 300 to perform some or all of the steps in the method embodiments described above.
The processor 310 is the control center of the storage terminal. It connects the various parts of the entire electronic terminal using various interfaces and lines, and performs the functions of the electronic terminal and/or processes data by running or executing software programs and/or modules stored in the memory 320 and invoking data stored in the memory. The processor may be composed of integrated circuits (Integrated Circuit, IC for short), for example a single packaged IC, or multiple packaged ICs with the same or different functions connected together. For example, the processor 310 may include only a central processing unit (Central Processing Unit, CPU for short). In the embodiment of the invention, the CPU may have a single computing core or multiple computing cores.
The communication unit 330 is used to establish a communication channel so that the terminal can communicate with other terminals, receiving user data sent by other terminals or sending user data to other terminals.
The present invention also provides a computer storage medium in which a program may be stored; when executed, the program may perform some or all of the steps in the embodiments provided by the present invention. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), a random access memory (RAM), or the like.
Therefore, building on Intel's CScripts tool, the invention integrates the five stages of the error injection workflow (restoring BIOS default values, modifying BIOS options, clearing logs, collecting logs, and analyzing logs) so that CScripts is executed automatically and the full flow is automated. Compared with conventional server RAS testing, the method greatly saves labor and time and improves test efficiency. The other technical effects achieved are described above and are not repeated here.
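As an illustration only, the staged flow named in the preceding paragraph can be sketched in Python as an ordered list of stages that stops at the first failure. This is a minimal sketch, not the patented implementation: the names run_stages, placeholder, and FIVE_STAGES are hypothetical, and each stage is a stand-in that would have to be replaced by real BIOS, log, and CScripts handling.

```python
from typing import Callable, List, Tuple

# A stage is a (name, action) pair; the action returns True on success.
Stage = Tuple[str, Callable[[], bool]]


def run_stages(stages: List[Stage]) -> Tuple[bool, List[str]]:
    """Run stages in order and stop at the first one that reports failure.

    Returns (overall_success, names_of_completed_stages).
    """
    completed: List[str] = []
    for name, action in stages:
        if not action():
            return False, completed
        completed.append(name)
    return True, completed


def placeholder() -> bool:
    """Hypothetical stand-in for a real stage; always reports success."""
    return True


# The five stages named in the description, in the order they are listed.
# Every action here is a placeholder (assumption), not real tooling.
FIVE_STAGES: List[Stage] = [
    ("restore BIOS defaults", placeholder),
    ("modify BIOS options", placeholder),
    ("clear logs", placeholder),
    ("collect logs", placeholder),
    ("analyze logs", placeholder),
]

if __name__ == "__main__":
    ok, done = run_stages(FIVE_STAGES)
    print("all stages completed" if ok else "stopped after: " + ", ".join(done))
```

Stopping at the first failed stage is a design choice of this sketch; the description itself does not say how a failed stage should be handled.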
It will be apparent to those skilled in the art that the techniques in the embodiments of the present invention may be implemented by software plus a necessary general-purpose hardware platform. Based on this understanding, the technical solution in the embodiments of the present invention, in essence or in the part that contributes to the prior art, may be embodied in the form of a software product. The software product is stored in a storage medium capable of storing program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and includes several instructions for causing a computer terminal (which may be a personal computer, a server, a second terminal, a network terminal, or the like) to execute all or part of the steps of the method described in the embodiments of the present invention.
For identical or similar parts, the embodiments in this specification may be referred to one another. In particular, because the terminal embodiment is substantially similar to the method embodiment, its description is relatively brief; for the relevant points, refer to the description of the method embodiment.
In the several embodiments provided by the present invention, it should be understood that the disclosed systems and methods may be implemented in other ways. For example, the system embodiments described above are merely illustrative: the division into units is only a division by logical function, and other divisions are possible in an actual implementation. For instance, multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, system, or unit, and may be electrical, mechanical, or in another form.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; they may be located in one place or distributed over a plurality of network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional unit in the embodiments of the present invention may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit.
Although the present invention has been described in detail by way of preferred embodiments with reference to the accompanying drawings, the present invention is not limited thereto. Those skilled in the art may make various equivalent modifications and substitutions to the embodiments of the present invention without departing from its spirit and scope, and all such modifications and substitutions are intended to fall within the scope of the present invention as defined by the appended claims. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (9)

1. A method for testing performance of a server, comprising:
restoring default values of options of the basic input/output system;
reading target options and corresponding target values from a test configuration file, and modifying the values of the target options of the basic input/output system to the corresponding target values by using an installation and configuration tool;
configuring an error injection instruction according to a test requirement, and executing the error injection instruction;
collecting error report logs, and analyzing whether the error report logs match the executed error injection instruction, the test passing if they match;
wherein, before the fault is injected, the logs are first cleared so that the logs generated after the fault injection can be checked easily; the BMC SEL log is cleared using the command ipmitool sel clear; for the log under Linux, because directly deleting the log file can cause logging errors so that logs are no longer generated, the command cat /dev/null > /var/log/messages is used to empty the log; after the fault injection is completed, the BMC and the OS generate fault report logs; the SEL log is collected using the IPMI command ipmitool sel elist, and the logs in the OS are downloaded locally using SFTP; finally, the collected log files are analyzed and matched using a Python matching module; the result is PASS if the expected keyword is matched, and FAIL otherwise.
2. The method of claim 1, wherein restoring default values of options of the basic input/output system comprises:
restoring the value of each option of the basic input/output system to its default value by using a basic input/output system initialization instruction of the intelligent platform management interface;
restarting the server so that the basic input/output system loads the default values.
3. The method of claim 1, wherein reading the target options and corresponding target values from the test configuration file, and modifying the values of the target options of the basic input/output system to the corresponding target values by using the installation and configuration tool, comprises:
taking corresponding options in the basic input/output system as target options according to the characteristics of the test, setting target values of the target options, and writing the target options and the corresponding target values into the test configuration file;
modifying the basic input/output system under the operating system according to the test configuration file by using the installation and configuration tool, and restarting the server so that the target values of the target options of the basic input/output system take effect.
4. The method of claim 1, wherein configuring the error injection instruction according to the test requirements, executing the error injection instruction, comprises:
configuring a plurality of error injection instructions and an execution sequence of the error injection instructions;
and executing the error injection instructions in sequence according to the execution sequence.
5. A server performance testing system, comprising:
an option recovery unit, used for restoring default values of options of the basic input/output system;
an option modifying unit, used for reading target options and corresponding target values from a test configuration file, and modifying the values of the target options of the basic input/output system to the corresponding target values by using an installation and configuration tool;
a system error injection unit, used for configuring an error injection instruction according to a test requirement and executing the error injection instruction;
a log analysis unit, used for collecting error report logs and analyzing whether the error report logs match the executed error injection instruction, the test passing if they match;
wherein, before the fault is injected, the logs are first cleared so that the logs generated after the fault injection can be checked easily; the BMC SEL log is cleared using the command ipmitool sel clear; for the log under Linux, because directly deleting the log file can cause logging errors so that logs are no longer generated, the command cat /dev/null > /var/log/messages is used to empty the log; after the fault injection is completed, the BMC and the OS generate fault report logs; the SEL log is collected using the IPMI command ipmitool sel elist, and the logs in the OS are downloaded locally using SFTP; finally, the collected log files are analyzed and matched using a Python matching module; the result is PASS if the expected keyword is matched, and FAIL otherwise.
6. The system of claim 5, wherein the option recovery unit comprises:
an option recovery module, used for restoring the value of each option of the basic input/output system to its default value by using a basic input/output system initialization instruction of the intelligent platform management interface;
a recovery validation module, used for restarting the server so that the basic input/output system loads the default values.
7. The system of claim 5, wherein the option modifying unit comprises:
a test configuration module, used for taking corresponding options in the basic input/output system as target options according to the characteristics of the test, setting target values of the target options, and writing the target options and the corresponding target values into the test configuration file;
an option modification module, used for modifying the basic input/output system under the operating system according to the test configuration file by using the installation and configuration tool, and restarting the server so that the target values of the target options of the basic input/output system take effect.
8. A terminal, comprising:
a processor;
a memory for storing instructions executable by the processor;
wherein the processor is configured to perform the method of any one of claims 1-4.
9. A computer readable storage medium storing a computer program which, when executed by a processor, implements the method of any one of claims 1-4.
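The log analysis recited in claims 1 and 5 can be illustrated with a short Python sketch. It rests on several assumptions that are not in the patent text: the "Python matching module" is taken to be the standard re module, the function names match_expected and verdict are hypothetical, PASS is taken to require every expected pattern (the claims do not say whether one or all keywords must match), and the sample log line is invented for illustration. The three command constants repeat the commands quoted in the claims and are listed for reference only; nothing here executes them.

```python
import re
from typing import Dict, Iterable

# Commands quoted in the claims; listed for reference, never executed here.
CLEAR_SEL_CMD = ["ipmitool", "sel", "clear"]
COLLECT_SEL_CMD = ["ipmitool", "sel", "elist"]
CLEAR_OS_LOG_CMD = "cat /dev/null > /var/log/messages"


def match_expected(log_text: str, expected_patterns: Iterable[str]) -> Dict[str, bool]:
    """Map each expected pattern to whether it occurs anywhere in the log."""
    return {
        pattern: re.search(pattern, log_text, flags=re.IGNORECASE) is not None
        for pattern in expected_patterns
    }


def verdict(log_text: str, expected_patterns: Iterable[str]) -> str:
    """Return "PASS" only if every expected pattern is found, else "FAIL".

    Assumption: an empty pattern list is treated as FAIL, since nothing
    was actually verified.
    """
    results = match_expected(log_text, expected_patterns)
    return "PASS" if results and all(results.values()) else "FAIL"


# Invented sample text standing in for a collected SEL log (assumption).
SAMPLE_LOG = "1 | 10/22/2021 | 10:00:00 | Memory #0x01 | Correctable ECC | Asserted\n"

if __name__ == "__main__":
    print(verdict(SAMPLE_LOG, [r"Correctable ECC"]))
```

In practice the expected patterns would be chosen per error injection instruction, so that each injected fault is checked against the log entries it is supposed to produce.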
CN202111231193.8A 2021-10-22 2021-10-22 Server performance testing method, system, terminal and storage medium Active CN114116330B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111231193.8A CN114116330B (en) 2021-10-22 2021-10-22 Server performance testing method, system, terminal and storage medium

Publications (2)

Publication Number Publication Date
CN114116330A CN114116330A (en) 2022-03-01
CN114116330B CN114116330B (en) 2024-05-24

Family

ID=80376532

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111231193.8A Active CN114116330B (en) 2021-10-22 2021-10-22 Server performance testing method, system, terminal and storage medium

Country Status (1)

Country Link
CN (1) CN114116330B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114978891B (en) * 2022-05-17 2024-04-30 西安易朴通讯技术有限公司 Processing method, device and storage medium for BIOS configuration of network device

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107122277A (en) * 2017-05-09 2017-09-01 郑州云海信息技术有限公司 The wrong test system of PCIERAS notes and method based on PCIE protocol analyzers
CN107562593A (en) * 2017-08-25 2018-01-09 郑州云海信息技术有限公司 A kind of automated testing method and system for verifying internal memory ECC functions
CN109800022A (en) * 2019-01-25 2019-05-24 郑州云海信息技术有限公司 A kind of method, system, terminal and the storage medium of automation comparison BIOS information difference
CN111124780A (en) * 2019-11-30 2020-05-08 苏州浪潮智能科技有限公司 UPI Link speed reduction test method, system, terminal and storage medium
CN111488288A (en) * 2020-04-17 2020-08-04 苏州浪潮智能科技有限公司 Method, device, terminal and storage medium for testing BMC ACD stability
CN112463239A (en) * 2020-12-07 2021-03-09 苏州浪潮智能科技有限公司 BIOS and BMC interaction option default value modification method, system, terminal and storage medium

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9645874B2 (en) * 2015-01-14 2017-05-09 Dell Products L.P. Analyzing OpenManage integration for troubleshooting log to determine root cause

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant