CN107135120B - Automatic test method and device for HPC cluster - Google Patents

Automatic test method and device for HPC cluster Download PDF

Info

Publication number
CN107135120B
CN107135120B CN201710271165.6A CN201710271165A CN107135120B CN 107135120 B CN107135120 B CN 107135120B CN 201710271165 A CN201710271165 A CN 201710271165A CN 107135120 B CN107135120 B CN 107135120B
Authority
CN
China
Prior art keywords
servers
test
configuration information
script
installing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710271165.6A
Other languages
Chinese (zh)
Other versions
CN107135120A (en
Inventor
肖义军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZHONGKE SUGON INFORMATION INDUSTRY CHENGDU Co.,Ltd.
Dawning Information Industry Beijing Co Ltd
Original Assignee
Dawning Information Industry Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dawning Information Industry Beijing Co Ltd filed Critical Dawning Information Industry Beijing Co Ltd
Priority to CN201710271165.6A priority Critical patent/CN107135120B/en
Publication of CN107135120A publication Critical patent/CN107135120A/en
Application granted granted Critical
Publication of CN107135120B publication Critical patent/CN107135120B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/50Testing arrangements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/36Preventing errors by testing or debugging software
    • G06F11/3668Software testing
    • G06F11/3672Test management
    • G06F11/3684Test management for test design, e.g. generating new test cases
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/08Configuration management of networks or network elements

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)
  • Stored Programmes (AREA)

Abstract

The invention provides an automatic test method and device for an HPC cluster, wherein the automatic test method comprises the following steps: receiving order information of the HPC cluster, wherein the order information comprises configuration information of a plurality of servers; according to the order information, building hardware environments of a plurality of servers, and compiling a test plan; and installing and testing the operating systems of the plurality of servers according to the pre-configured scripts, the test plans and the configuration information. According to the invention, the order information of the HPC cluster is received, then the hardware environments of the plurality of servers are built according to the order information, the test plan is compiled, and finally the plurality of servers are installed and tested according to the pre-configured script, the test plan and the configuration information, so that the cluster order test work can be greatly automated, and the work efficiency is improved.

Description

Automatic test method and device for HPC cluster
Technical Field
The invention relates to the field of servers, in particular to an automatic test method and device for an HPC cluster.
Background
At present, before a cluster order for an HPC (High Performance Computing) cluster user leaves a factory, a corresponding operating system needs to be customized according to user requirements, and unified testing is performed through various configured servers in the cluster to eliminate hidden dangers found in the production process of the cluster, so that the quality of the cluster order is improved. Meanwhile, by being deployed in advance before leaving the factory, the time of on-site construction is reduced, and the user satisfaction is improved.
However, at present, when an order is tested, a test engineer needs to deploy an operating system, configure a network environment, install test software, test, check and judge results and the like according to cluster planning provided by a user, and in most test plans, personnel are needed to participate, and the cost for developing a set of software for automatically processing data is high, but because the order is complicated (the requirements of the user are different), the test quantity is large (the number of general orders is more than 30), and manual input cannot meet the requirement of working efficiency, therefore, the prior art has the defects that a series of operations are completely completed by manpower, and thus, the labor is consumed, the efficiency is not high, and errors are easy to occur.
An effective solution to the problems in the related art has not been proposed yet.
Disclosure of Invention
Aiming at the problems in the related art, the invention provides an automatic test method and device for an HPC cluster, which can run a series of existing commands in a script mode based on a script, and do not need excessive operation in the test process, thereby greatly improving the efficiency and reducing the workload of testers.
The technical scheme of the invention is realized as follows:
according to one aspect of the invention, an automated test method for an HPC cluster is provided.
The automatic test method comprises the following steps: receiving order information of the HPC cluster, wherein the order information comprises configuration information of a plurality of servers; according to the order information, building hardware environments of a plurality of servers, and compiling a test plan; and installing and testing the operating systems of the plurality of servers according to the pre-configured scripts, the test plans and the configuration information.
According to one embodiment of the present invention, installing an operating system for a plurality of servers according to a preconfigured script, test plan, and configuration information comprises: and allocating IP addresses to the servers according to the test plans of the servers so as to deploy the network environment of the servers, wherein the test plans comprise the corresponding relations between the servers and the IP addresses.
According to one embodiment of the invention, the test plan is saved to an upload folder.
According to an embodiment of the present invention, installing and testing the operating system for the plurality of servers according to the pre-configured script, test plan and configuration information further comprises: installing operating systems and network card drivers of a plurality of servers according to a pre-configured script and configuration information, wherein the configuration information comprises type information and a version number of the operating system of each server; and installing test software according to the pre-configured script, and testing the operating system through the test software.
According to one embodiment of the invention, the preset script is a Shell script.
According to another aspect of the invention, an automated test equipment for an HPC cluster is provided.
This automatic testing arrangement includes: the HPC cluster server comprises a receiving module, a processing module and a processing module, wherein the receiving module is used for receiving order information of the HPC cluster, and the order information comprises configuration information of a plurality of servers; the compiling module is set up and used for setting up hardware environments of a plurality of servers according to the order information and compiling a test plan; and the installation testing module is used for installing and testing the operating systems of the servers according to the pre-configured scripts, the test plans and the configuration information.
According to one embodiment of the invention, installing a test module comprises: the deployment module is used for allocating IP addresses to the servers according to the test plans of the servers so as to deploy the network environment of the servers, wherein the test plans comprise the corresponding relations between the servers and the IP addresses.
According to an embodiment of the present invention, further comprising: and the storage module is used for storing the test plan to the uploading folder.
According to one embodiment of the invention, installing the test module further comprises: the first installation module is used for installing the operating systems and the network card drivers of the plurality of servers according to the pre-configured scripts and the configuration information, wherein the configuration information comprises the type information and the version number of the operating system of each server; and the second installation module is used for installing test software according to the pre-configured script and testing the operating system through the test software.
According to one embodiment of the invention, the preset script is a Shell script.
The invention has the beneficial technical effects that:
according to the invention, the order information of the HPC cluster is received, then the hardware environment of the plurality of servers is built according to the order information, the test plan is compiled, and finally the plurality of servers are installed and tested according to the pre-configured script, the test plan and the configuration information, so that the existing series of commands are run in a script mode based on the script, excessive operation is not needed in the test process, the cluster order test work can be greatly automated, the work efficiency is improved, in addition, in the whole process, the manual intervention is reduced, and the risk of manual operation errors is greatly reduced.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings needed in the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings without creative efforts.
FIG. 1 is a flow diagram of an automated test method for an HPC cluster according to an embodiment of the present invention;
FIG. 2 is a detailed flow diagram of an automated test method for an HPC cluster according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of a test environment deployment according to an embodiment of the present invention;
FIG. 4 is a block diagram of an apparatus for an automated test method for an HPC cluster according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments that can be derived by one of ordinary skill in the art from the embodiments given herein are intended to be within the scope of the present invention.
According to an embodiment of the invention, an automatic test method for an HPC cluster is provided.
As shown in FIG. 1, the automatic test method for the HPC cluster according to the embodiment of the invention comprises:
step S101, receiving order information of the HPC cluster, wherein the order information comprises configuration information of a plurality of servers;
step S103, building hardware environments of a plurality of servers according to order information, and compiling a test plan;
and step S105, installing and testing the operating systems of the plurality of servers according to the pre-configured scripts, test plans and configuration information.
In this embodiment, with continued reference to fig. 2, first, an order of the HPC cluster is received, where the order includes information such as serial numbers and model numbers of multiple required servers, and types and version numbers of required operating systems, then a test environment is built, that is, a hardware environment of the servers in the order is built, such as server power-on and interconnection of the servers, so as to ensure that the servers in the order can operate, then a plan is written, where the plan refers to building a network environment of the servers, that is, allocating IP addresses to the multiple servers in the order, for example, according to an embodiment of the present invention, the plan may be a table, where a corresponding relationship between a server and an IP address is recorded in the table, allocating an IP address to a server according to the corresponding relationship, and then performing boot network boot to install an operating system of the servers in the order, after the servers are powered on, an operator may select the type of the Operating System of each server according to an order, for example, according to an embodiment of the present invention, the type of the Operating System of the server may be a centros (Community Enterprise Operating System), and the version number of the System is 6.5, it is understood that the type and the version number of the Operating System of the server may also be selected according to actual requirements, and the Operating System of the server may also be a red hat Operating System.
According to the scheme, the order information of the HPC cluster is received, the hardware environments of the servers are built according to the order information, the test plan is compiled, and the servers are installed and tested according to the pre-configured script, the test plan and the configuration information, so that a series of existing commands are operated in a script mode based on the script, excessive operation is not needed in the test process, the cluster order test work can be greatly automated, the work efficiency is improved, manual intervention is reduced in the whole process, and the risk of manual operation errors is greatly reduced.
According to one embodiment of the present invention, installing an operating system for a plurality of servers according to a preconfigured script, test plan, and configuration information comprises: and allocating IP addresses to the servers according to the test plans of the servers so as to deploy the network environment of the servers, wherein the test plans comprise the corresponding relations between the servers and the IP addresses.
According to one embodiment of the invention, the test plan is saved to an upload folder (FTP file).
According to an embodiment of the present invention, installing and testing the operating system for the plurality of servers according to the pre-configured script, test plan and configuration information further comprises: installing operating systems and network card drivers of a plurality of servers according to a pre-configured script and configuration information, wherein the configuration information comprises type information and a version number of the operating system of each server; and installing test software according to the pre-configured script, and testing the operating system through the test software.
In this embodiment, with continued reference to FIG. 3, the automatic deployment of the test environment comprises: the method comprises the steps of automatically deploying an operating system, automatically installing a network card driver and automatically deploying a network environment, wherein the automatically deploying operating system is carried out according to orders, the operating system to be installed in each order is different due to different customer requirements, the type and the version of the corresponding operating system are required to be selected according to the requirements of a user, after the operating system is installed, the network card driver is installed, the network environment is automatically deployed, and an IP address is distributed for each server. In addition, after the operating system of the server is installed, test software can be installed on the server to test the operating system of the server, and fault information and the like are recorded in the test process.
According to one embodiment of the invention, the preset script is a Shell script, wherein the Shell script is a program file which is put into a file in advance by various commands and is convenient to execute at one time, and the Shell file is specifically as follows:
Figure BDA0001277433880000051
Figure BDA0001277433880000061
it is understood that the type of the script may be selected according to actual requirements, for example, the script may be a PHP script according to an embodiment of the present invention.
The following compares the existing test method with the automatic test method of the present invention.
Suppose that: the number of machines is N, the operating system installation time is T, the operating system selection time is M, the deployment single-point network environment is a, the installation infiniband (IB for short, infiniband) drive time is S, and the time used by the cluster order test in the prior art is generally: (a + M) × N + T + S, and the time for using the apparatus is: m × N + T + S, comparing the above two equations, it can be determined that the time that can be saved is about: a x N, therefore, the larger the order size, the more time is saved by the automatic testing method of the present invention.
For better description, the present invention is described below by taking an order of 60-node scale as an example, and the time used by the existing testing method is: the time of the order with the size of 60 nodes is specifically (0.017+0.017) × 60+0.5+0.3 ═ 2.84 (unit: hour), and the time of using the device by the automatic testing method of the invention is as follows: m N + T + S, the time of the order with the scale of 60 nodes is specifically 0.017 60+0.5+0.3 ═ 1.82 (unit: hour), therefore, the automatic testing method of the invention can greatly automate the test work of the cluster order, thereby improving the working efficiency, and in the whole process, the manual intervention is reduced, and the risk of the manual operation error is greatly reduced.
According to an embodiment of the invention, an automatic test device for an HPC cluster is also provided.
As shown in fig. 4, the automatic test apparatus for an HPC cluster according to an embodiment of the present invention includes:
a receiving module 41, configured to receive order information of the HPC cluster, where the order information includes configuration information of the plurality of servers;
the construction and compiling module 42 is used for constructing hardware environments of a plurality of servers according to the order information and compiling a test plan;
and an installation testing module 43, configured to perform installation and testing of the operating system on the plurality of servers according to the pre-configured script, the test plan, and the configuration information.
According to one embodiment of the present invention, installing the test module 43 includes: a deployment module (not shown) configured to allocate IP addresses to the multiple servers according to a test plan of the multiple servers to deploy a network environment of the multiple servers, where the test plan includes a correspondence between the multiple servers and the IP addresses.
According to an embodiment of the present invention, further comprising: a save module (not shown) for saving the test plan to the upload folder.
According to one embodiment of the invention, installing the test module further comprises: a first installation module (not shown) for installing the operating systems and the network card drivers of the plurality of servers according to a pre-configured script and configuration information, wherein the configuration information includes type information and version number of the operating system of each server; and a second installation module (not shown) for installing the test software according to the pre-configured script and testing the operating system through the test software.
According to one embodiment of the invention, the preset script is a Shell script.
In summary, according to the technical scheme of the present invention, by receiving order information of an HPC cluster, then building hardware environments of a plurality of servers according to the order information, compiling a test plan, and finally installing and testing operating systems of the plurality of servers according to a pre-configured script, the test plan and configuration information, an existing series of commands are run in a script manner based on the script, excessive operations are not required in a test process, the cluster order test work can be greatly automated, so that the work efficiency is improved, and in the whole process, manual intervention is reduced, and the risk of manual operation errors is greatly reduced.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims (6)

1. An automated testing method for an HPC cluster, comprising:
receiving order information of the HPC cluster, wherein the order information comprises configuration information of a plurality of servers;
according to the order information, building hardware environments of the servers, and compiling a test plan; and
installing and testing the operating systems of the servers according to a pre-configured script, the test plan and the configuration information;
the installing and testing includes:
allocating IP addresses to the servers according to test plans of the servers so as to deploy a network environment of the servers, wherein the test plans comprise corresponding relations between the servers and the IP addresses, and allocating the IP addresses to the servers in an order according to the corresponding relations;
installing the operating systems and the network card drivers of the plurality of servers according to the pre-configured scripts and the configuration information, wherein the configuration information comprises type information and version numbers of the operating systems of the servers; and
and installing test software according to the pre-configured script, and testing the operating system through the test software.
2. The automated testing method of claim 1, wherein the test plan is saved to an upload folder.
3. The automatic test method according to claim 1, wherein the preset script is a Shell script.
4. An automatic test equipment for an HPC cluster, comprising:
a receiving module, configured to receive order information of the HPC cluster, where the order information includes configuration information of a plurality of servers;
the compiling module is set up and used for setting up hardware environments of the servers according to the order information and compiling a test plan; and
the installation testing module is used for installing and testing the operating systems of the servers according to a preset script, the test plan and the configuration information;
the installation test module includes:
the deployment module is used for allocating IP addresses to the servers according to the test plans of the servers so as to deploy the network environment of the servers, wherein the test plans comprise the corresponding relations between the servers and the IP addresses, and the IP addresses are allocated to the servers in the order according to the corresponding relations;
the first installation module is used for installing the operating systems and the network card drivers of the servers according to the pre-configured scripts and the configuration information, wherein the configuration information comprises type information and version numbers of the operating systems of the servers; and
and the second installation module is used for installing test software according to the pre-configured script and testing the operating system through the test software.
5. The automatic test device of claim 4, further comprising:
and the storage module is used for storing the test plan to an uploading folder.
6. The automatic test device according to claim 4, wherein the preset script is a Shell script.
CN201710271165.6A 2017-04-24 2017-04-24 Automatic test method and device for HPC cluster Active CN107135120B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710271165.6A CN107135120B (en) 2017-04-24 2017-04-24 Automatic test method and device for HPC cluster

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710271165.6A CN107135120B (en) 2017-04-24 2017-04-24 Automatic test method and device for HPC cluster

Publications (2)

Publication Number Publication Date
CN107135120A CN107135120A (en) 2017-09-05
CN107135120B true CN107135120B (en) 2020-11-03

Family

ID=59715941

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710271165.6A Active CN107135120B (en) 2017-04-24 2017-04-24 Automatic test method and device for HPC cluster

Country Status (1)

Country Link
CN (1) CN107135120B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107635024A (en) * 2017-09-15 2018-01-26 郑州云海信息技术有限公司 A kind of automatic test approach of automatic identification server ip
CN109213494A (en) * 2018-08-03 2019-01-15 北京奇安信科技有限公司 A kind of processing method and processing device of automatic deployment cluster test host

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1848787A (en) * 2006-04-30 2006-10-18 西安交通大学 Automatic fast dispositioning method for aggregated server system node
CN102455936A (en) * 2010-11-25 2012-05-16 中标软件有限公司 Trunk quick allocation method
CN103970655A (en) * 2014-05-08 2014-08-06 浪潮电子信息产业股份有限公司 Server automatic cluster testing method based on expect
CN104834546A (en) * 2015-05-12 2015-08-12 浪潮电子信息产业股份有限公司 Method for upgrading cluster system by network
CN105205003A (en) * 2015-10-28 2015-12-30 努比亚技术有限公司 Automated testing method and device based on clustering system
CN106095526A (en) * 2016-06-06 2016-11-09 腾讯科技(深圳)有限公司 Virtual Server Cluster and the dispositions method of server, device and system

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8775507B2 (en) * 2012-01-05 2014-07-08 Microsoft Corporation Rest interface for job submission

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1848787A (en) * 2006-04-30 2006-10-18 西安交通大学 Automatic fast dispositioning method for aggregated server system node
CN102455936A (en) * 2010-11-25 2012-05-16 中标软件有限公司 Trunk quick allocation method
CN103970655A (en) * 2014-05-08 2014-08-06 浪潮电子信息产业股份有限公司 Server automatic cluster testing method based on expect
CN104834546A (en) * 2015-05-12 2015-08-12 浪潮电子信息产业股份有限公司 Method for upgrading cluster system by network
CN105205003A (en) * 2015-10-28 2015-12-30 努比亚技术有限公司 Automated testing method and device based on clustering system
CN106095526A (en) * 2016-06-06 2016-11-09 腾讯科技(深圳)有限公司 Virtual Server Cluster and the dispositions method of server, device and system

Also Published As

Publication number Publication date
CN107135120A (en) 2017-09-05

Similar Documents

Publication Publication Date Title
CN102622298B (en) Software testing system and method
CN102880532B (en) Cloud technology-based test system and method
CN110888414B (en) Test method for upgrading vehicle controller
CN110879712B (en) Cloud data center physical host installation method and related device
CN107566165B (en) Method and system for discovering and deploying available resources of power cloud data center
US20150100832A1 (en) Method and system for selecting and executing test scripts
US20150100829A1 (en) Method and system for selecting and executing test scripts
CN111245898A (en) Network equipment online method, device, server and storage medium
CN110765026B (en) Automatic test method, device, storage medium and equipment
US20150100830A1 (en) Method and system for selecting and executing test scripts
CN101411123A (en) Method, system and computer program for the centralized system management on endpoints of a distributed data processing system
US20150100831A1 (en) Method and system for selecting and executing test scripts
CN103164238A (en) Method for automatically and continuously installing operating system
CN109901985B (en) Distributed test apparatus and method, storage medium, and electronic device
CN112579108A (en) Automatic deployment method and device
CN106897223A (en) A kind of software kit delivery method and device based on continuous integrating
CN104360952A (en) Software test system and software test method
CN107135120B (en) Automatic test method and device for HPC cluster
US20230246913A1 (en) System and method for commissioning a network element
CN116414521A (en) Virtual machine remote command interaction method, device and medium for distributed heterogeneous shooting ranges
CN109660386B (en) Software upgrading method for semiconductor memory aging test system
US9207928B2 (en) Computer-readable medium, apparatus, and methods of automatic capability installation
CN109120433B (en) Method and apparatus for containerized deployment of hosts
CN109684138A (en) A kind of visualization hard disk automatic test approach, device, terminal and storage medium
CN110795118A (en) Cloud platform upgrading tool and upgrading method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20211011

Address after: 100089 building 36, courtyard 8, Dongbeiwang West Road, Haidian District, Beijing

Patentee after: Dawning Information Industry (Beijing) Co.,Ltd.

Patentee after: ZHONGKE SUGON INFORMATION INDUSTRY CHENGDU Co.,Ltd.

Address before: 100193 No. 36 Building, No. 8 Hospital, Wangxi Road, Haidian District, Beijing

Patentee before: Dawning Information Industry (Beijing) Co.,Ltd.

TR01 Transfer of patent right