CN114116330B - Server performance testing method, system, terminal and storage medium - Google Patents
Server performance testing method, system, terminal and storage medium
- Publication number
- CN114116330B CN202111231193.8A
- Authority
- CN
- China
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/22—Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
- G06F11/2273—Test methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/22—Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
- G06F11/2268—Logging of test results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/22—Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
- G06F11/2289—Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing by configuration test
Landscapes
- Engineering & Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Computer Hardware Design (AREA)
- Quality & Reliability (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Debugging And Monitoring (AREA)
Abstract
The invention provides a server performance test method, a system, a terminal and a storage medium, comprising the following steps: restoring default values of options of the basic input and output system; reading target options and corresponding target values in the test configuration file, and modifying the values of the target options of the basic input/output system into the corresponding target values by using an installation and configuration tool; configuring an error injection instruction according to a test requirement, and executing the error injection instruction; and collecting error report logs, analyzing whether the error report logs are matched with executed error injection instructions, and if the error report logs are matched with the executed error injection instructions, passing the test. Compared with the conventional server RAS test, the method greatly saves labor and time and improves the test efficiency.
Description
Technical Field
The invention relates to the technical field of server testing, in particular to a server performance testing method, a system, a terminal and a storage medium.
Background
With the development of servers, the requirements on RAS performance keep increasing. RAS refers to the Reliability, Availability, and Serviceability of a machine. RAS features are intended to let the whole server run reliably for long periods, reduce the probability that a fault takes the machine offline, and correct repairable errors in time. RAS capability is assessed mainly in several areas: system-level RAS features, CPU RAS features, memory RAS features, I/O RAS features, and so on. RAS performance is therefore a composite indicator, and different business users have different RAS requirements. The RAS features of server memory are the most critical, because almost all of a server's computation runs in memory, and the quality of the memory determines whether the server can function properly. For example, some HP servers have run continuously for twenty years without being shut down once; in that sense the program is hosted not so much on the server as in memory. According to collaborative research by Google and several institutions, the memory error rate is actually higher than commonly assumed, and memory performance also degrades with aging and similar factors, which is useful reference information. According to Bode customer service personnel, some Bode servers ran continuously for five years and were never shut down until the equipment was replaced; the work records showed that the programs had never stopped once started. Much x86 server marketing now focuses on memory, for example memory ECC, Spare Row, Chip Kill, and Single Device Data Correction (SDDC); Chip Kill is a proprietary IBM technology used mainly on high-end servers. The RAS performance of the CPU is also critical.
The Xeon 7500 series processors introduced earlier by Intel brought the reliability, availability, and serviceability (RAS) of the Xeon platform to a new level by porting features such as the Machine Check Architecture (MCA), which were previously exclusive to RISC architectures. This also gives the Xeon 7500 platform a stronger footing for high-end, business-critical applications. The release of the Xeon 7500 was an important step in demonstrating Intel's ambitions in the critical application market.
Existing RAS testing is mostly manual: the tester first enters BIOS Setup to modify BIOS options by hand, then connects the ITP, runs commands in CScripts for test verification, and finally checks the BMC, OS, and other logs. RAS features are complex, and testing each one is time-consuming and labor-intensive.
Disclosure of Invention
Aiming at the defects in the prior art, the invention provides a server performance testing method, a system, a terminal and a storage medium, so as to solve the technical problems.
In a first aspect, the present invention provides a method for testing server performance, including:
restoring default values of options of the basic input and output system;
Reading target options and corresponding target values in the test configuration file, and modifying the values of the target options of the basic input/output system into the corresponding target values by using an installation and configuration tool;
Configuring an error injection instruction according to a test requirement, and executing the error injection instruction;
and collecting error report logs, analyzing whether the error report logs are matched with executed error injection instructions, and if the error report logs are matched with the executed error injection instructions, passing the test.
Further, restoring the default value of each option of the basic input/output system comprises:
Restoring the value of each option of the basic input/output system to a default value by using a basic input/output system initialization instruction of an intelligent platform management interface;
restarting the server so that the basic input/output system loads the default values.
Further, reading the target options and corresponding target values in the test configuration file, and modifying the values of the target options of the basic input output system to the corresponding target values by using the installation and configuration tool comprises:
Taking corresponding options in the basic input and output system as target options according to the characteristics of the test, setting target values of the target options, and writing the target options and the corresponding target values into a test configuration file;
The basic input and output system is modified under the operating system according to the test configuration file by using the installation and configuration tool, and the server is restarted to enable the target value of the target option of the basic input and output system to be effective.
Further, configuring an error injection instruction according to a test requirement, and executing the error injection instruction includes:
configuring a plurality of error injection instructions and an execution sequence of the error injection instructions;
and executing the error injection instructions in sequence according to the execution sequence.
Further, collecting error report logs, analyzing whether the error report logs are matched with executed error injection instructions, and if the error report logs are matched with the executed error injection instructions, the test is passed, including:
clearing the system log and the baseboard management controller log before executing the error injection instructions;
collecting and then clearing the system log and the baseboard management controller log after execution of the error injection instructions is completed, and screening error report logs from all collected logs;
matching the keywords of the error report logs against the error injection types of the corresponding error injection instructions, and determining that the test is passed if every error injection instruction has a matching error report log.
In a second aspect, the present invention provides a server performance testing system, including:
The option recovery unit is used for recovering default values of options of the basic input/output system;
the option modifying unit is used for reading the target option and the corresponding target value in the test configuration file, and modifying the value of the target option of the basic input/output system into the corresponding target value by using the installation and configuration tool;
The system error injection unit is used for configuring error injection instructions according to test requirements and executing the error injection instructions;
the log analysis unit is used for collecting the error report log, analyzing whether the error report log is matched with the executed error injection instruction, and if the error report log is matched with the executed error injection instruction, the test is passed.
Further, the option restoring unit includes:
The option recovery module is used for recovering the values of all options of the basic input/output system to default values by using a basic input/output system initialization instruction of the intelligent platform management interface;
And the recovery validation module is used for restarting the server to load the default value for the basic input/output system.
Further, the option modifying unit includes:
The test configuration module is used for taking corresponding options in the basic input and output system as target options according to the characteristics of the test, setting target values of the target options, and writing the target options and the corresponding target values into a test configuration file;
And the option modification module is used for modifying the basic input and output system according to the test configuration file under the operating system by using the installation and configuration tool, and restarting the server to enable the target value of the target option of the basic input and output system to be effective.
Further, the system error injection unit is used for:
configuring a plurality of error injection instructions and an execution sequence of the error injection instructions;
and executing the error injection instructions in sequence according to the execution sequence.
Further, the log analysis unit is configured to:
clearing the system log and the baseboard management controller log before executing the error injection instructions;
collecting and then clearing the system log and the baseboard management controller log after execution of the error injection instructions is completed, and screening error report logs from all collected logs;
matching the keywords of the error report logs against the error injection types of the corresponding error injection instructions, and determining that the test is passed if every error injection instruction has a matching error report log.
In a third aspect, a terminal is provided, including:
A processor, a memory, wherein,
The memory is used for storing a computer program,
The processor is configured to call and run the computer program from the memory, so that the terminal performs the method described above.
In a fourth aspect, there is provided a computer storage medium having instructions stored therein which, when run on a computer, cause the computer to perform the method of the above aspects.
The server performance testing method, system, terminal and storage medium have the following beneficial effect: based on Intel's CScripts tool, the five steps of restoring the BIOS defaults, modifying BIOS options, clearing logs, collecting logs, and analyzing logs are integrated with the error injection process, so that CScripts is executed automatically and the full flow is automated. Compared with conventional server RAS testing, the method greatly saves labor and time and improves test efficiency.
In addition, the invention has reliable design principle, simple structure and very wide application prospect.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings that are required to be used in the description of the embodiments or the prior art will be briefly described below, and it will be obvious to those skilled in the art that other drawings can be obtained from these drawings without inventive effort.
FIG. 1 is a schematic flow chart of a method of one embodiment of the invention.
FIG. 2 is a schematic block diagram of a system of one embodiment of the present invention.
Fig. 3 is a schematic structural diagram of a terminal according to an embodiment of the present invention.
Detailed Description
In order to make the technical solution of the present invention better understood by those skilled in the art, the technical solution of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is apparent that the described embodiments are only some embodiments of the present invention, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the present invention without making any inventive effort, shall fall within the scope of the present invention.
The following explains key terms appearing in the present invention.
BMC: the server's remote management controller; the full English name is Baseboard Management Controller. It can perform firmware upgrades, check machine devices, and carry out other operations while the machine is not powered on. Fully implementing IPMI functionality in a BMC requires a powerful 16-bit or 32-bit microcontroller, RAM for data storage, flash memory for non-volatile data storage, and firmware that provides basic remote manageability in terms of secure remote reboot, secure re-power-up, LAN alerting, and system health monitoring. In addition to the basic IPMI and system operation monitoring functions, the mBMC can also enable BIOS flash element selection and protection by storing the previous BIOS in one of the 2 flash memories. For example, when the system fails to boot after a remote BIOS upgrade, the remote administrator can switch back to the previously working BIOS image to boot the system. Once the BIOS has been upgraded, the BIOS image can be locked, which effectively prevents viruses from invading it.
BIOS is the acronym of "Basic Input Output System". On IBM PC compatible systems, it is an industry-standard firmware interface. It is a set of programs stored on a ROM chip on the computer's main board; it holds the computer's most important basic input/output programs, the power-on self-test program, and the system boot program, and it can read and write specific system settings from CMOS. Its main function is to provide the lowest-level, most direct hardware setup and control for the computer. In addition, the BIOS provides some system parameters to the operating system. Changes in system hardware are hidden by the BIOS, and programs use BIOS functions rather than controlling the hardware directly. Modern operating systems ignore the abstraction layer provided by the BIOS and control the hardware components directly.
CPU: the central processing unit (CPU for short) is the operation and control core of the computer system and the final execution unit for information processing and program running.
Reliability: in industry it is mainly realized as dual-machine hot backup. Although Itanium minicomputers and high-end servers such as the PR4840R are highly stable, in some critical applications users still prefer a dual-machine configuration. In dual-machine hot backup, one host is the working machine and the other is the backup machine. Under normal conditions the working machine supports the information system while the backup machine monitors the working machine; the working machine also monitors whether the backup machine is normal. If the backup machine becomes abnormal for some reason, the working machine notifies the system administrator so that the problem can be solved as soon as possible, ensuring that the next switchover is reliable. When the working machine becomes abnormal and can no longer support the information system, the backup machine actively takes over its work and continues to support the information system, so that the information system runs without interruption. After the working machine has been repaired and returns to normal, it automatically takes back its previous work and returns to its previous normal working state.
Availability refers to the likelihood that the system responds quickly and operates correctly when it is requested for use. Many critical businesses in industry applications cannot be interrupted during operation, including business intelligence analysis, business processing (transaction processing, ERP, and the like), and core databases, particularly those operated by enterprises and institutions in government, national defense, security, telecommunications, finance, transportation, medical care, and other sectors of national importance.
Serviceability: when a large volume of data must be processed with real-time, dynamic, and efficient behavior, the hosting platform needs excellent and easily expandable high-performance processing capability, for example memory expansion and hard disk expansion, as in Bode's modular servers and storage servers.
The Intelligent Platform Management Interface (IPMI) is an industry standard adopted for peripheral devices of Intel-architecture enterprise systems. IPMI is also an open, free standard that users can use without paying additional fees. IPMI works across different operating systems, firmware, and hardware platforms, and can intelligently monitor, control, and automatically report the operating status of large numbers of servers, thereby reducing the cost of server systems.
FIG. 1 is a schematic flow chart of a method of one embodiment of the invention. The execution body of fig. 1 may be a server performance test system.
As shown in fig. 1, the method includes:
Step 110, recovering default values of options of the basic input/output system;
Step 120, reading the target options and corresponding target values in the test configuration file, and modifying the values of the target options of the basic input/output system to the corresponding target values by using the installation and configuration tool;
step 130, configuring an error injection instruction according to a test requirement, and executing the error injection instruction;
Step 140, collecting error report logs, analyzing whether the error report logs are matched with executed error injection instructions, and if the error report logs are matched with the executed error injection instructions, passing the test.
To facilitate understanding of the present invention, the server performance testing method is further described below, following its principle, through the process of testing server performance in an embodiment.
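The four steps above can be sketched as a small driver. This is only an illustrative outline under stated assumptions, not the patented implementation: the function name `run_ras_test`, the callable parameters, and the substring-based pass check are all hypothetical, and each step is supplied by the caller so that the control flow can be exercised without real hardware.

```python
from typing import Callable, Dict, List


def run_ras_test(
    restore_defaults: Callable[[], None],
    apply_options: Callable[[Dict[str, str]], None],
    inject_errors: Callable[[], List[str]],
    collect_logs: Callable[[], List[str]],
    target_options: Dict[str, str],
) -> bool:
    """Run steps 110-140 in order and return True when the test passes."""
    restore_defaults()               # step 110: restore BIOS option defaults
    apply_options(target_options)    # step 120: set target options to target values
    injected = inject_errors()       # step 130: returns the injected error types
    logs = collect_logs()            # step 140: returns error report log lines
    # Pass only if every injected error type appears in at least one log line.
    # Note: an empty list of injected errors passes vacuously.
    return all(
        any(err.lower() in line.lower() for line in logs) for err in injected
    )


# Usage with stand-in steps (all values below are made up for illustration).
calls: List[str] = []
result = run_ras_test(
    restore_defaults=lambda: calls.append("restore"),
    apply_options=lambda opts: calls.append("apply"),
    inject_errors=lambda: calls.append("inject") or ["correctable memory error"],
    collect_logs=lambda: calls.append("collect")
    or ["placeholder: Correctable memory error detected"],
    target_options={"Hypothetical Option": "Enabled"},
)
print(result, calls)
```

Passing the steps in as callables keeps the ordering logic separate from the tools that each step relies on.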
CScripts is a debug and validation tool provided by Intel. It is written in Python and can execute Python scripts automatically. The invention is based on Intel's CScripts tool and integrates the five steps of restoring the BIOS defaults, modifying BIOS options, clearing logs, collecting logs, and analyzing logs with the error injection process, so that CScripts is executed automatically and the full flow is automated. Specifically, the server performance test method comprises the following steps:
S1, recovering default values of options of the basic input and output system.
Restoring the value of each option of the basic input/output system to a default value by using a basic input/output system initialization instruction of an intelligent platform management interface; restarting the server so that the basic input/output system loads the default values.
To prevent interference from unrelated BIOS options, the BIOS defaults are restored first, which is done through an IPMI command. When the command ipmitool -I lanplus -H -U -P chassis bootdev none clear-cmos=yes is executed (the host, user name, and password arguments are omitted here), the BIOS will load the default values.
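As a minimal sketch of how this step could be scripted, the following Python builds the ipmitool argument list quoted above and runs it. The helper names, the injectable `run` parameter, and the use of `chassis power cycle` for the restart are assumptions made for illustration; the source does not say how the restart is triggered, and the host and credentials shown in the usage are placeholders.

```python
import subprocess
from typing import Callable, List


def _ipmi_prefix(host: str, user: str, password: str) -> List[str]:
    # Common ipmitool arguments for a LAN (lanplus) session with the BMC.
    return ["ipmitool", "-I", "lanplus", "-H", host, "-U", user, "-P", password]


def build_clear_cmos_cmd(host: str, user: str, password: str) -> List[str]:
    # Same command as in the description: request a CMOS clear at next boot.
    return _ipmi_prefix(host, user, password) + [
        "chassis", "bootdev", "none", "clear-cmos=yes",
    ]


def build_power_cycle_cmd(host: str, user: str, password: str) -> List[str]:
    # Assumption: the restart is done with an IPMI power cycle.
    return _ipmi_prefix(host, user, password) + ["chassis", "power", "cycle"]


def restore_bios_defaults(
    host: str,
    user: str,
    password: str,
    run: Callable[..., object] = subprocess.run,
) -> List[List[str]]:
    """Request the CMOS clear, then restart so the defaults are loaded."""
    commands = [
        build_clear_cmos_cmd(host, user, password),
        build_power_cycle_cmd(host, user, password),
    ]
    for cmd in commands:
        run(cmd, check=True)  # raises CalledProcessError on a non-zero exit
    return commands
```

Against a real BMC this could be called as `restore_bios_defaults("192.0.2.10", "admin", "secret")`, with the arguments replaced by real values. The `run` parameter exists only so that the function can be tried with a stand-in when ipmitool is not installed.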
S2, reading target options and corresponding target values in the test configuration file, and modifying the values of the target options of the basic input/output system into the corresponding target values by using an installation and configuration tool.
Taking corresponding options in the basic input and output system as target options according to the characteristics of the test, setting target values of the target options, and writing the target options and the corresponding target values into a test configuration file; the basic input and output system is modified under the operating system according to the test configuration file by using the installation and configuration tool, and the server is restarted to enable the target value of the target option of the basic input and output system to be effective.
Testing different RAS features requires modifying different BIOS options, so AMI's SCE tool is used to modify the BIOS options under the OS according to the test requirements. After the modification is completed, the machine is restarted for the changes to take effect.
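The source does not specify the format of the test configuration file or the command line of the SCE tool, so the sketch below covers only the reading step and assumes a simple, hypothetical `option = value` text format with `#` comments. The option names in the sample are made up, and the resulting dictionary would then be handed to whatever wrapper drives the installation and configuration tool.

```python
from typing import Dict


def parse_test_config(text: str) -> Dict[str, str]:
    """Parse 'option = value' lines into a {target option: target value} dict."""
    options: Dict[str, str] = {}
    for lineno, raw in enumerate(text.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue  # skip blank and comment-only lines
        if "=" not in line:
            raise ValueError(f"line {lineno}: expected 'option = value', got {raw!r}")
        name, value = (part.strip() for part in line.split("=", 1))
        if not name or not value:
            raise ValueError(f"line {lineno}: empty option name or value")
        options[name] = value
    return options


# Hypothetical configuration content; real option names depend on the BIOS.
sample_config = """
# options required by the memory RAS test (made-up names)
Memory Error Injection = Enabled
Patrol Scrub = Disabled   # trailing comment
"""

print(parse_test_config(sample_config))
```

Rejecting malformed lines early, before the BIOS is touched, avoids a reboot cycle spent on a configuration that could never apply.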
S3, configuring an error injection instruction according to the test requirement, and executing the error injection instruction.
Configuring a plurality of error injection instructions and an execution sequence of the error injection instructions; and executing the error injection instructions in sequence according to the execution sequence.
The error injection instructions are written according to the RAS features to be tested. For example, to test memory, memory-related debug instructions are configured.
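The configuration of several error injection instructions with an execution sequence might be modeled as follows. This is a sketch under assumptions: `InjectionStep` and `run_injections` are hypothetical names, and each `inject` callable is a stand-in for the actual debug instruction, which the source does not list.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class InjectionStep:
    order: int                   # position in the execution sequence
    error_type: str              # label later matched against log keywords
    inject: Callable[[], None]   # stand-in for the real injection instruction


def run_injections(steps: List[InjectionStep]) -> List[str]:
    """Execute the steps in ascending 'order' and return the error types run."""
    executed: List[str] = []
    for step in sorted(steps, key=lambda s: s.order):
        step.inject()
        executed.append(step.error_type)
    return executed


# Usage with placeholder injections that only record that they were called.
trace: List[str] = []
steps = [
    InjectionStep(2, "memory_uncorrectable", lambda: trace.append("uce")),
    InjectionStep(1, "memory_correctable", lambda: trace.append("ce")),
]
print(run_injections(steps))
```

Returning the executed error types gives the later log analysis a list of what it should expect to find.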
S4, collecting error report logs, analyzing whether the error report logs are matched with executed error injection instructions, and if the error report logs are matched with the executed error injection instructions, passing the test.
Clearing the system log and the baseboard management controller log before executing the error injection instructions; collecting and then clearing the system log and the baseboard management controller log after execution of the error injection instructions is completed, and screening error report logs from all collected logs; and matching the keywords of the error report logs against the error injection types of the corresponding error injection instructions, and determining that the test is passed if every error injection instruction has a matching error report log.
Before errors are injected, the logs are cleared so that the logs generated after injection can be checked easily. The BMC SEL log is cleared with the command ipmitool sel clear. For logs under Linux, deleting the file directly may cause logging errors so that logs are no longer generated; the log is therefore emptied with the command cat /dev/null > /var/log/messages. After error injection is completed, error report logs are generated under both the BMC and the OS. The SEL log is collected with the IPMI command ipmitool sel elist, and the logs in the OS are downloaded locally over SFTP. Finally, the collected logs are analyzed, with the log files parsed and matched using a Python matching module. If the expected keywords are matched, the result is PASS; otherwise it is FAIL.
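A minimal sketch of the log handling described above, assuming Python's standard `re` module as the matching module (the source does not name one). `truncate_log` empties a file in place, which has the same effect as `cat /dev/null > file` and keeps the file present for the logging daemon. The keyword patterns and sample log lines are invented placeholders, not real SEL or syslog output.

```python
import re
from typing import Dict, Iterable


def truncate_log(path: str) -> None:
    # Opening in "w" mode truncates the file to zero length without deleting it.
    with open(path, "w"):
        pass


def analyze_logs(
    log_lines: Iterable[str], expected_keywords: Dict[str, str]
) -> Dict[str, bool]:
    """Map each error injection type to whether its keyword pattern was found."""
    lines = list(log_lines)
    results: Dict[str, bool] = {}
    for error_type, pattern in expected_keywords.items():
        regex = re.compile(pattern, re.IGNORECASE)
        results[error_type] = any(regex.search(line) for line in lines)
    return results


def verdict(results: Dict[str, bool]) -> str:
    # PASS only if at least one injection was checked and all of them matched.
    return "PASS" if results and all(results.values()) else "FAIL"


# Placeholder log lines and keyword patterns, for illustration only.
sample_logs = [
    "placeholder: Correctable ECC error logged",
    "placeholder: unrelated event",
]
print(verdict(analyze_logs(sample_logs, {"memory_ce": r"correctable ecc"})))
```

Keeping a per-injection result rather than a single boolean makes it possible to report which injected error had no matching log entry when the verdict is FAIL.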
As shown in fig. 2, the system 200 includes:
An option restoring unit 210, configured to restore default values of options of the basic input/output system;
An option modifying unit 220, configured to read the target option and the corresponding target value in the test configuration file, and modify the value of the target option of the basic input output system to the corresponding target value by using the installation and configuration tool;
The system error injection unit 230 is configured to configure an error injection instruction according to a test requirement, and execute the error injection instruction;
The log analysis unit 240 is configured to collect the error log, analyze whether the error log is matched with the executed error injection instruction, and pass the test if the error log is matched with the executed error injection instruction.
Optionally, as an embodiment of the present invention, the option restoring unit includes:
The option recovery module is used for recovering the values of all options of the basic input/output system to default values by using a basic input/output system initialization instruction of the intelligent platform management interface;
And the recovery validation module is used for restarting the server to load the default value for the basic input/output system.
Optionally, as an embodiment of the present invention, the option modifying unit includes:
The test configuration module is used for taking corresponding options in the basic input and output system as target options according to the characteristics of the test, setting target values of the target options, and writing the target options and the corresponding target values into a test configuration file;
And the option modification module is used for modifying the basic input and output system according to the test configuration file under the operating system by using the installation and configuration tool, and restarting the server to enable the target value of the target option of the basic input and output system to be effective.
Optionally, as an embodiment of the present invention, the system error injection unit is configured to:
configuring a plurality of error injection instructions and an execution sequence of the error injection instructions;
and executing the error injection instructions in sequence according to the execution sequence.
Optionally, as an embodiment of the present invention, the log analysis unit is configured to:
clearing the system log and the baseboard management controller log before executing the error injection instructions;
collecting and then clearing the system log and the baseboard management controller log after execution of the error injection instructions is completed, and screening error report logs from all collected logs;
matching the keywords of the error report logs against the error injection types of the corresponding error injection instructions, and determining that the test is passed if every error injection instruction has a matching error report log.
Fig. 3 is a schematic structural diagram of a terminal 300 according to an embodiment of the present invention, where the terminal 300 may be used to execute the server performance test method according to the embodiment of the present invention.
The terminal 300 may include: a processor 310, a memory 320 and a communication unit 330. The components may communicate via one or more buses. It will be appreciated by those skilled in the art that the structure shown in the drawing does not limit the invention: it may be a bus structure or a star structure, may include more or fewer components than shown, may combine certain components, or may arrange the components differently.
The memory 320 may be used to store instructions for execution by the processor 310. The memory 320 may be implemented by any type of volatile or non-volatile memory device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic disk, or optical disk. When the instructions in the memory 320 are executed by the processor 310, the terminal 300 is enabled to perform some or all of the steps in the method embodiments described in this specification.
The processor 310 is the control center of the terminal. It connects the various parts of the entire terminal using various interfaces and lines, and performs the functions of the terminal and/or processes data by running or executing software programs and/or modules stored in the memory 320 and invoking data stored in the memory. The processor may consist of integrated circuits (Integrated Circuit, IC for short); for example, it may consist of a single packaged IC, or of multiple packaged ICs with the same or different functions connected together. For example, the processor 310 may include only a central processing unit (Central Processing Unit, CPU for short). In the embodiment of the invention, the CPU may have a single computing core or multiple computing cores.
The communication unit 330 is used to establish a communication channel so that the terminal can communicate with other terminals, receiving user data sent by other terminals or sending user data to other terminals.
The present invention also provides a computer storage medium in which a program may be stored; when executed, the program may perform some or all of the steps in the embodiments provided by the present invention. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), a random access memory (RAM), or the like.
Therefore, based on Intel's CScripts tool, the invention integrates the five processes that accompany error injection, namely restoring the BIOS default values, modifying BIOS options, clearing logs, collecting logs and analyzing logs, so that CScripts is executed automatically and the full flow is automated. Compared with conventional server RAS testing, the method greatly saves labor and time and improves test efficiency. The other technical effects achieved by this embodiment are described above and are not repeated here.
It will be apparent to those skilled in the art that the techniques of the embodiments of the present invention may be implemented by software plus a necessary general-purpose hardware platform. Based on this understanding, the technical solution in the embodiments of the present invention, in essence or in the part contributing to the prior art, may be embodied in the form of a software product. The software product is stored in a storage medium capable of storing program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk or an optical disk, and includes several instructions for causing a computer terminal (which may be a personal computer, a server, a second terminal, a network terminal, etc.) to execute all or part of the steps of the method described in the embodiments of the present invention.
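The log clearing and collection steps named in this summary can be illustrated with a short sketch. The claims state that the BMC SEL log is cleared with ipmitool sel clear and collected with ipmitool sel elist, and that a Linux log file is emptied (cat /dev/null > /var/log/messages) rather than deleted. The Python below only builds those two command lines and performs the equivalent in-place truncation; the function names are hypothetical and nothing here runs ipmitool.

```python
def bmc_log_commands():
    """Return the ipmitool argument lists named in the claims for the BMC SEL log."""
    return {
        "clear": ["ipmitool", "sel", "clear"],
        "collect": ["ipmitool", "sel", "elist"],
    }


def empty_log_in_place(path):
    """Truncate a log file to zero length without deleting it.

    Opening in 'w' mode truncates the existing file, so the file itself
    (and its inode) is kept, which mirrors 'cat /dev/null > file'.
    """
    with open(path, "w"):
        pass
```

The argument lists are in the form accepted by subprocess.run, should a caller choose to execute them on a host where ipmitool is installed.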
The same or similar parts between the various embodiments in this specification are referred to each other. In particular, for the terminal embodiment, since it is substantially similar to the method embodiment, the description is relatively simple, and reference should be made to the description in the method embodiment for relevant points.
In the several embodiments provided by the present invention, it should be understood that the disclosed systems and methods may be implemented in other ways. For example, the system embodiments described above are merely illustrative: the division into units is merely a division by logical function, and other divisions are possible in an actual implementation; for example, multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the mutual coupling, direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, systems or units, and may be electrical, mechanical or in other forms.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; they may be located in one place or distributed over a plurality of network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional unit in the embodiments of the present invention may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit.
Although the present invention has been described in detail by way of preferred embodiments with reference to the accompanying drawings, the present invention is not limited thereto. Those skilled in the art may make various equivalent modifications and substitutions to the embodiments of the present invention without departing from its spirit and scope, and all such modifications and substitutions are intended to fall within the scope of the present invention as defined by the appended claims. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.
Claims (9)
1. A method for testing performance of a server, comprising:
restoring default values of options of the basic input and output system;
Reading target options and corresponding target values in the test configuration file, and modifying the values of the target options of the basic input/output system into the corresponding target values by using an installation and configuration tool;
Configuring an error injection instruction according to a test requirement, and executing the error injection instruction;
Collecting error report logs, analyzing whether the error report logs are matched with executed error injection instructions, and if the error report logs are matched with the executed error injection instructions, passing the test;
Wherein, before the error is injected, the logs are first cleared so that the logs generated after error injection can be checked conveniently; the BMC SEL log is cleared using the command ipmitool sel clear; for the logs under Linux, deleting the log file directly can cause logging errors so that logs are no longer generated, and the command cat /dev/null > /var/log/messages is therefore used to empty the log; after error injection is completed, the BMC and the OS generate error report logs; the SEL logs are collected using the IPMI command ipmitool sel elist, and the logs in the OS are downloaded locally using SFTP; finally, the collected logs are analyzed, the log files being parsed and matched using a Python matching module; if the expected keywords are matched, the result is PASS, otherwise FAIL.
2. The method of claim 1, wherein restoring default values of options of the basic input/output system comprises:
Restoring the value of each option of the basic input/output system to a default value by using a basic input/output system initialization instruction of an intelligent platform management interface;
restarting the server so that the basic input/output system loads the default values.
3. The method of claim 1, wherein reading the target options and corresponding target values in the test configuration file, modifying the values of the target options of the basic input output system to the corresponding target values using the installation and configuration tool, comprises:
Taking corresponding options in the basic input and output system as target options according to the characteristics of the test, setting target values of the target options, and writing the target options and the corresponding target values into a test configuration file;
The basic input and output system is modified under the operating system according to the test configuration file by using the installation and configuration tool, and the server is restarted to enable the target value of the target option of the basic input and output system to be effective.
4. The method of claim 1, wherein configuring the error injection instruction according to the test requirements, executing the error injection instruction, comprises:
configuring a plurality of error injection instructions and an execution sequence of the error injection instructions;
and executing the error injection instructions in sequence according to the execution sequence.
5. A server performance testing system, comprising:
The option recovery unit is used for recovering default values of options of the basic input/output system;
the option modifying unit is used for reading the target option and the corresponding target value in the test configuration file, and modifying the value of the target option of the basic input/output system into the corresponding target value by using the installation and configuration tool;
The system error injection unit is used for configuring error injection instructions according to test requirements and executing the error injection instructions;
The log analysis unit is used for collecting error report logs, analyzing whether the error report logs are matched with executed error injection instructions, and if the error report logs are matched with the executed error injection instructions, passing the test;
Wherein, before the error is injected, the logs are first cleared so that the logs generated after error injection can be checked conveniently; the BMC SEL log is cleared using the command ipmitool sel clear; for the logs under Linux, deleting the log file directly can cause logging errors so that logs are no longer generated, and the command cat /dev/null > /var/log/messages is therefore used to empty the log; after error injection is completed, the BMC and the OS generate error report logs; the SEL logs are collected using the IPMI command ipmitool sel elist, and the logs in the OS are downloaded locally using SFTP; finally, the collected logs are analyzed, the log files being parsed and matched using a Python matching module; if the expected keywords are matched, the result is PASS, otherwise FAIL.
6. The system of claim 5, wherein the option restoration unit comprises:
The option recovery module is used for recovering the values of all options of the basic input/output system to default values by using a basic input/output system initialization instruction of the intelligent platform management interface;
And the recovery validation module is used for restarting the server to load the default value for the basic input/output system.
7. The system of claim 5, wherein the option modification unit comprises:
The test configuration module is used for taking corresponding options in the basic input and output system as target options according to the characteristics of the test, setting target values of the target options, and writing the target options and the corresponding target values into a test configuration file;
And the option modification module is used for modifying the basic input and output system according to the test configuration file under the operating system by using the installation and configuration tool, and restarting the server to enable the target value of the target option of the basic input and output system to be effective.
8. A terminal, comprising:
A processor;
A memory for storing execution instructions of the processor;
Wherein the processor is configured to perform the method of any of claims 1-4.
9. A computer readable storage medium storing a computer program, which when executed by a processor implements the method of any one of claims 1-4.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111231193.8A CN114116330B (en) | 2021-10-22 | 2021-10-22 | Server performance testing method, system, terminal and storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114116330A (en) | 2022-03-01 |
CN114116330B (en) | 2024-05-24 |
Family
ID=80376532
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111231193.8A Active CN114116330B (en) | 2021-10-22 | 2021-10-22 | Server performance testing method, system, terminal and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114116330B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114978891B (en) * | 2022-05-17 | 2024-04-30 | 西安易朴通讯技术有限公司 | Processing method, device and storage medium for BIOS configuration of network device |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107122277A (en) * | 2017-05-09 | 2017-09-01 | 郑州云海信息技术有限公司 | The wrong test system of PCIERAS notes and method based on PCIE protocol analyzers |
CN107562593A (en) * | 2017-08-25 | 2018-01-09 | 郑州云海信息技术有限公司 | A kind of automated testing method and system for verifying internal memory ECC functions |
CN109800022A (en) * | 2019-01-25 | 2019-05-24 | 郑州云海信息技术有限公司 | A kind of method, system, terminal and the storage medium of automation comparison BIOS information difference |
CN111124780A (en) * | 2019-11-30 | 2020-05-08 | 苏州浪潮智能科技有限公司 | UPI Link speed reduction test method, system, terminal and storage medium |
CN111488288A (en) * | 2020-04-17 | 2020-08-04 | 苏州浪潮智能科技有限公司 | Method, device, terminal and storage medium for testing BMC ACD stability |
CN112463239A (en) * | 2020-12-07 | 2021-03-09 | 苏州浪潮智能科技有限公司 | BIOS and BMC interaction option default value modification method, system, terminal and storage medium |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9645874B2 (en) * | 2015-01-14 | 2017-05-09 | Dell Products L.P. | Analyzing OpenManage integration for troubleshooting log to determine root cause |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant |