CN111404780A - Function test method and device for remote direct memory access network cluster - Google Patents
Function test method and device for remote direct memory access network cluster Download PDFInfo
- Publication number
- CN111404780A CN111404780A CN202010174053.0A CN202010174053A CN111404780A CN 111404780 A CN111404780 A CN 111404780A CN 202010174053 A CN202010174053 A CN 202010174053A CN 111404780 A CN111404780 A CN 111404780A
- Authority
- CN
- China
- Prior art keywords
- network
- nodes
- memory access
- direct memory
- remote direct
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000006870 function Effects 0.000 title claims abstract description 93
- 238000010998 test method Methods 0.000 title description 3
- 238000012360 testing method Methods 0.000 claims abstract description 129
- 230000004044 response Effects 0.000 claims abstract description 44
- 238000000034 method Methods 0.000 claims abstract description 43
- 238000011990 functional testing Methods 0.000 claims description 33
- 230000004927 fusion Effects 0.000 abstract description 6
- 238000004891 communication Methods 0.000 description 7
- 238000004590 computer program Methods 0.000 description 6
- 101100268840 Danio rerio chrna1 gene Proteins 0.000 description 4
- PWHVEHULNLETOV-UHFFFAOYSA-N Nic-1 Natural products C12OC2C2(O)CC=CC(=O)C2(C)C(CCC2=C3)C1C2=CC=C3C(C)C1OC(O)C2(C)OC2(C)C1 PWHVEHULNLETOV-UHFFFAOYSA-N 0.000 description 4
- 230000008569 process Effects 0.000 description 3
- 101100388291 Arabidopsis thaliana DTX49 gene Proteins 0.000 description 2
- 101150065731 NIC1 gene Proteins 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000009434 installation Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/08—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
- H04L43/0805—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
- H04L43/0811—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking connectivity
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0654—Management of faults, events, alarms or notifications using network fault recovery
- H04L41/0663—Performing the actions predefined by failover planning, e.g. switching to standby network elements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/08—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
- H04L43/0805—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
- H04L43/0817—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking functioning
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Environmental & Geological Engineering (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
The invention discloses a method and a device for testing functions of a remote direct memory access network cluster, wherein the method comprises the following steps: checking and confirming that a disk performance benchmark test tool exists in the network cluster; circularly detecting whether all nodes in the network cluster are mutually connected on the virtual network by using a disk performance benchmark test tool, and prompting the virtual network card names of the nodes which are not connected; circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are mutually communicated by using a disk performance benchmark test tool; circularly detecting whether all nodes in the network cluster have network blockage by using a disk performance benchmark test tool, and removing the nodes with network blockage from the network cluster; and feeding back the remote direct memory access function test success information to the tester through the external device in response to all the passes. The method and the system can quickly determine whether the RDMA network function in the Microsoft super fusion cluster can be normally connected to work.
Description
Technical Field
The present invention relates to the field of computer technologies, and in particular, to a method and an apparatus for testing a function of a remote direct memory access network cluster.
Background
The Azure Stack HCI can be seamlessly connected to the Azure to realize the functions of cloud backup, site recovery, cloud monitoring and the like. Storage space cut-through uses industry standard servers with locally attached drivers to create highly available, highly scalable software defined storage, at a cost that is only a small fraction of a traditional SAN or NAS array, a converged or hyper-converged architecture fundamentally simplifies procurement and deployment, while caching, storage tier, erasure coding, and other functions, as well as new hardware innovations (such as RDMA networks and NVMe drivers) also represent higher efficiency and performance.
At present, no test method specially aiming at the functional connectivity of RDMA (remote direct memory access) networks in the micro-soft super-converged cluster exists, after the micro-soft super-converged cluster is built, the interoperability of the RDMA networks in all the servers in the cluster cannot be ensured even if the RDMA function of the whole cluster network is started, if all the RDMA networks in the cluster are not successfully connected, the performance of the super-converged cluster is greatly influenced, and the stability of the normal work of the micro-soft super-converged cluster cannot be ensured.
Aiming at the problem that the interoperability test of the RDMA network in the prior art is difficult, no effective solution is available at present.
Disclosure of Invention
In view of this, an object of the embodiments of the present invention is to provide a method and an apparatus for testing a remote direct memory access network cluster function, which can quickly determine whether an RDMA network function in a microsoft super fusion cluster can be connected normally, improve cluster stability and performance, and reduce network delay and connectivity failure problems.
In view of the foregoing, a first aspect of embodiments of the present invention provides a method for testing a function of a remote direct memory access network cluster, including sequentially performing, on an external device connected to the network cluster, the following steps:
checking and confirming that a disk performance benchmark test tool exists in the network cluster;
using a disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on the virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes;
responding to the absence of disconnected nodes, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected or not by using a disk performance benchmark test tool;
responding to the mutual communication of the remote direct memory access functions, circularly detecting whether network congestion exists in all nodes in the network cluster by using a disk performance benchmark test tool, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster to re-execute the functional test;
and feeding back the remote direct memory access function test success information to the tester through the external device in response to all the passes.
In some embodiments, further comprising: in response to the existence of the disconnected remote direct memory access function, counting the number of the virtual network cards of the nodes of which the remote direct memory access function is not connected; in response to the number being 1 or 2, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are communicated with each other; removing nodes from the network cluster for which the remote direct memory access functionality is not connected in response to the number being greater than 2 to re-perform the functional test.
In some embodiments, removing the network-congested node to re-perform the functional test includes: and removing the nodes, adding new nodes for replacing the removed nodes to the network cluster, and re-executing the functional test based on the processed network cluster.
In some embodiments, the method further comprises, prior to testing using the external device, separately and sequentially performing at each node of the network cluster the steps of:
checking whether the node supports remote direct memory access functionality, terminating the test and removing the node from the network cluster to re-execute the functional test in response to the node not supporting remote direct memory access functionality;
the method comprises the steps of responding to the fact that the node supports the remote direct memory access function, checking the starting number of virtual network interfaces of the node, responding to the fact that the starting number is not equal to the number of virtual network cards of the node, terminating the test and prompting to check the virtual network cards.
In some embodiments, all nodes in the network cluster include one or more remote direct memory access network cards; the method further comprises the following steps: all of the network cards in the node are tested in response to testing the node having a plurality of remote direct memory access network cards.
A second aspect of an embodiment of the present invention provides a function testing apparatus for a remote direct memory access network cluster, including:
a processor; and
a memory storing program code executable by the processor, the program code when executed sequentially performing the following steps on an external device connected to the network cluster:
checking and confirming that a disk performance benchmark test tool exists in the network cluster;
using a disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on the virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes;
responding to the absence of disconnected nodes, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected or not by using a disk performance benchmark test tool;
responding to the mutual communication of the remote direct memory access functions, circularly detecting whether network congestion exists in all nodes in the network cluster by using a disk performance benchmark test tool, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster to re-execute the functional test;
and feeding back the remote direct memory access function test success information to the tester through the external device in response to all the passes.
In some embodiments, the steps further comprise: in response to the existence of the disconnected remote direct memory access function, counting the number of the virtual network cards of the nodes of which the remote direct memory access function is not connected; in response to the number being 1 or 2, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are communicated with each other; removing nodes from the network cluster for which the remote direct memory access functionality is not connected in response to the number being greater than 2 to re-perform the functional test.
In some embodiments, removing the network-congested node to re-perform the functional test includes: and removing the nodes, adding new nodes for replacing the removed nodes to the network cluster, and re-executing the functional test based on the processed network cluster.
In some embodiments, the program code, when executed, further performs the following steps separately and in sequence on each node of the network cluster prior to testing with the external device:
checking whether the node supports remote direct memory access functionality, terminating the test and removing the node from the network cluster to re-execute the functional test in response to the node not supporting remote direct memory access functionality;
the method comprises the steps of responding to the fact that the node supports the remote direct memory access function, checking the starting number of virtual network interfaces of the node, responding to the fact that the starting number is not equal to the number of virtual network cards of the node, terminating the test and prompting to check the virtual network cards.
In some embodiments, all nodes in the network cluster include one or more remote direct memory access network cards; the method also comprises the following steps: all of the network cards in the node are tested in response to testing the node having a plurality of remote direct memory access network cards.
The invention has the following beneficial technical effects: according to the method and the device for testing the functions of the remote direct memory access network cluster, provided by the embodiment of the invention, a disk performance benchmark test tool exists in the network cluster through checking and confirming; using a disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on the virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes; responding to the absence of disconnected nodes, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected or not by using a disk performance benchmark test tool; responding to the mutual communication of the remote direct memory access functions, circularly detecting whether network congestion exists in all nodes in the network cluster by using a disk performance benchmark test tool, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster to re-execute the functional test; the technical scheme that successful test information of the remote direct memory access function is fed back to a tester through the external equipment in response to all the passing of the RDMA network function is responded, whether the RDMA network function in the Microsoft super fusion cluster can be connected normally or not can be determined quickly, the cluster stability and performance are improved, and the problems of network delay and connection failure are reduced.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
FIG. 1 is a schematic flow chart illustrating a method for testing the functions of a remote direct memory access network cluster according to the present invention;
fig. 2 is a detailed flowchart of a method for testing the functions of a remote direct memory access network cluster according to the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the following embodiments of the present invention are described in further detail with reference to the accompanying drawings.
It should be noted that all expressions using "first" and "second" in the embodiments of the present invention are used for distinguishing two entities with the same name but different names or different parameters, and it should be noted that "first" and "second" are merely for convenience of description and should not be construed as limitations of the embodiments of the present invention, and they are not described in any more detail in the following embodiments.
In view of the above, a first aspect of the embodiments of the present invention provides an embodiment of a method for quickly determining whether RDMA network functions in microsoft super converged cluster can work normally. Fig. 1 is a schematic flow chart illustrating a method for testing functions of a remote direct memory access network cluster according to the present invention.
The method for testing the function of the remote direct memory access network cluster, as shown in fig. 1, includes the following steps executed in sequence on an external device connected to the network cluster:
step S101: checking and confirming that a disk performance benchmark test tool exists in the network cluster;
step S103: using a disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on the virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes;
step S105: responding to the absence of disconnected nodes, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected or not by using a disk performance benchmark test tool;
step S107: responding to the mutual communication of the remote direct memory access functions, circularly detecting whether network congestion exists in all nodes in the network cluster by using a disk performance benchmark test tool, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster to re-execute the functional test;
step S109: and feeding back the remote direct memory access function test success information to the tester through the external device in response to all the passes.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), a Random Access Memory (RAM), or the like. Embodiments of the computer program may achieve the same or similar effects as any of the preceding method embodiments to which it corresponds.
In some embodiments, further comprising: in response to the existence of the disconnected remote direct memory access function, counting the number of the virtual network cards of the nodes of which the remote direct memory access function is not connected; in response to the number being 1 or 2, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are communicated with each other; removing nodes from the network cluster for which the remote direct memory access functionality is not connected in response to the number being greater than 2 to re-perform the functional test.
In some embodiments, removing the network-congested node to re-perform the functional test includes: and removing the nodes, adding new nodes for replacing the removed nodes to the network cluster, and re-executing the functional test based on the processed network cluster.
In some embodiments, the method further comprises, prior to testing using the external device, separately and sequentially performing at each node of the network cluster the steps of:
checking whether the node supports remote direct memory access functionality, terminating the test and removing the node from the network cluster to re-execute the functional test in response to the node not supporting remote direct memory access functionality;
the method comprises the steps of responding to the fact that the node supports the remote direct memory access function, checking the starting number of virtual network interfaces of the node, responding to the fact that the starting number is not equal to the number of virtual network cards of the node, terminating the test and prompting to check the virtual network cards.
In some embodiments, all nodes in the network cluster include one or more remote direct memory access network cards; the method further comprises the following steps: all of the network cards in the node are tested in response to testing the node having a plurality of remote direct memory access network cards.
The method disclosed according to an embodiment of the present invention may also be implemented as a computer program executed by a CPU (central processing unit), and the computer program may be stored in a computer-readable storage medium. The computer program, when executed by the CPU, performs the above-described functions defined in the method disclosed in the embodiments of the present invention. The above-described method steps and system elements may also be implemented using a controller and a computer-readable storage medium for storing a computer program for causing the controller to implement the functions of the above-described steps or elements.
The following further illustrates embodiments of the invention in terms of specific examples as shown in fig. 2.
1. According to the selection standard of Microsoft Azure stack HCI hardware, selecting server nodes supporting RDMA network cards to form a micro-soft super-fusion cluster;
2. configuring a cluster network:
assuming that the current cluster has 4 server nodes and the number of RDMA network cards of each node is 1, that is, there are now 4 RDMA functional network cards in the cluster, and the following operations are respectively performed under Windows powershell of each server node in the cluster:
PS>New-VMSwitch–Name SETswitch–NetAdapterName“nic1”–EnableEmbeddedTeaming$true
# Create virtual switch, nic1 is RDMA Port name
PS>Add-VMNetworkAdapter–switchname SETswitch–Name SMB_1–managementos
# create virtual network adapter SMB _1
PS>Enable-NetAdapterRDMA“vEthernet(SMB_1)”
RDMA function of # opening virtual network interface vEthernet (SMB _1)
PS>Set-VMNetworkAdapterTeamMapping-VMNetworkAdapterName"SMB_1"–ManagementOS–PhysicalNetAdapterName"NIC1"
# associating SMB _1 configured for RDMA with a Material adapter NIC1 connected to a virtual machine switch
3. Verifying RDMA normal connectivity in the micro-soft super-converged cluster:
PS>Get-NetIPConfiguration-InterfaceAlias"vEthernet(SMB_1)"|
ft InterfaceAlias,InterfaceIndex,IPv4Address
# obtaining Interface Index and IP corresponding to network port
PS C:\>Get-SmbClientNetworkInterface
# View RDMA network card status. If the RDMA capable state is false, the network card can be restarted or the state of the machine to be tested can be updated.
And then entering a cluster external host test stage. Create C: \ TEST directory under C disk, use DiskSpd tool and copy decompress to C: \ TEST path, copy script content at file end to C: \ TEST path of cluster external host TC in powershell script form, named TEST.
PS>C:\TEST\Test.PS1-IfIndex 3-IsRoCE$true-RemoteIpAddress192.168.1.1,192.168.1.2,192.168.1.3,192.168.1.4-PathToDiskspdC:\TEST\Diskspd-v2.0.17\amd64fre\
# verification RDMA connection status Using RoCE function (IP Address is virtual network Port IP Address in Microsoft super fusion Cluster node under test)
After execution, a prompt of connection success or failure occurs, and if success indicates that the RDMA network is normally connected, failure requires checking a specific prompt problem.
It can be seen from the foregoing embodiments that, in the functional test method for accessing a network cluster by a remote direct memory according to the embodiments of the present invention, by checking whether a disk performance benchmark test tool exists in the network cluster, a test is terminated and installation of the disk performance benchmark test tool is prompted in response to the absence of the disk performance benchmark test tool; using a disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on the virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes; circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are mutually communicated by using a disk performance benchmark test tool, and responding to the mutual communication of the remote direct memory access functions and continuing the next step; using a disk performance benchmark test tool to circularly detect whether network congestion exists in all nodes in the network cluster, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster to re-execute the functional test; the technical scheme that successful test information of the remote direct memory access function is fed back to a tester through the external equipment in response to all the passing of the RDMA network function is responded, whether the RDMA network function in the Microsoft super fusion cluster can be connected normally or not can be determined quickly, the cluster stability and performance are improved, and the problems of network delay and connection failure are reduced.
It should be particularly noted that, the steps in the embodiments of the method for testing functions of a remote direct memory access network cluster described above may be mutually intersected, replaced, added, or deleted, and therefore, these reasonable permutations and combinations should also belong to the scope of the present invention, and the scope of the present invention should not be limited to the described embodiments.
In view of the above objects, a second aspect of the embodiments of the present invention provides an embodiment of a device capable of quickly determining whether RDMA network functions in microsoft super converged cluster can work normally in a connected mode. The function test device of the remote direct memory access network cluster comprises:
a processor; and
a memory storing program code executable by the processor, the program code when executed sequentially performing the following steps on an external device connected to the network cluster:
checking and confirming whether a disk performance benchmark test tool exists in the network cluster;
using a disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on the virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes;
responding to the absence of disconnected nodes, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected or not by using a disk performance benchmark test tool;
responding to the mutual communication of the remote direct memory access functions, circularly detecting whether network congestion exists in all nodes in the network cluster by using a disk performance benchmark test tool, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster to re-execute the functional test;
and feeding back the remote direct memory access function test success information to the tester through the external device in response to all the passes.
In some embodiments, the steps further comprise: in response to the existence of the disconnected remote direct memory access function, counting the number of the virtual network cards of the nodes of which the remote direct memory access function is not connected; in response to the number being 1 or 2, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are communicated with each other; removing nodes from the network cluster for which the remote direct memory access functionality is not connected in response to the number being greater than 2 to re-perform the functional test.
In some embodiments, removing the network-congested node to re-perform the functional test includes: and removing the nodes, adding new nodes for replacing the removed nodes to the network cluster, and re-executing the functional test based on the processed network cluster.
In some embodiments, the program code, when executed, further performs the following steps separately and in sequence on each node of the network cluster prior to testing with the external device:
checking whether the node supports remote direct memory access functionality, terminating the test and removing the node from the network cluster to re-execute the functional test in response to the node not supporting remote direct memory access functionality;
the method comprises the steps of responding to the fact that the node supports the remote direct memory access function, checking the starting number of virtual network interfaces of the node, responding to the fact that the starting number is not equal to the number of virtual network cards of the node, terminating the test and prompting to check the virtual network cards.
In some embodiments, all nodes in the network cluster include one or more remote direct memory access network cards; the method also comprises the following steps: all of the network cards in the node are tested in response to testing the node having a plurality of remote direct memory access network cards.
It can be seen from the foregoing embodiments that, in the functional test apparatus for a remote direct memory access network cluster according to an embodiment of the present invention, by checking whether a disk performance benchmark test tool exists in the network cluster, a test is terminated and installation of the disk performance benchmark test tool is prompted in response to the absence of the disk performance benchmark test tool; using a disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on the virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes; circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are mutually communicated by using a disk performance benchmark test tool, and responding to the mutual communication of the remote direct memory access functions and continuing the next step; using a disk performance benchmark test tool to circularly detect whether network congestion exists in all nodes in the network cluster, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster to re-execute the functional test; the technical scheme that successful test information of the remote direct memory access function is fed back to a tester through the external equipment in response to all the passing of the RDMA network function is responded, whether the RDMA network function in the Microsoft super fusion cluster can be connected normally or not can be determined quickly, the cluster stability and performance are improved, and the problems of network delay and connection failure are reduced.
It should be particularly noted that, the above-mentioned embodiment of the functional test apparatus of the remote direct memory access network cluster specifically describes the working process of each module by using the embodiment of the functional test method of the remote direct memory access network cluster, and those skilled in the art can easily think that these modules are applied to other embodiments of the functional test method of the remote direct memory access network cluster. Of course, since the steps in the embodiment of the method for testing the functions of the remote dma network cluster may be mutually intersected, replaced, added, or deleted, these reasonable permutations and combinations should also be changed to the function testing apparatus of the remote dma network cluster, and the scope of the present invention should not be limited to the embodiment.
The foregoing is an exemplary embodiment of the present disclosure, but it should be noted that various changes and modifications could be made herein without departing from the scope of the present disclosure as defined by the appended claims. The functions, steps and/or actions of the method claims in accordance with the disclosed embodiments described herein need not be performed in any particular order. Furthermore, although elements of the disclosed embodiments of the invention may be described or claimed in the singular, the plural is contemplated unless limitation to the singular is explicitly stated.
It should be understood that, as used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context clearly supports the exception. It should also be understood that "and/or" as used herein is meant to include any and all possible combinations of one or more of the associated listed items. The numbers of the embodiments disclosed in the embodiments of the present invention are merely for description, and do not represent the merits of the embodiments.
It will be understood by those skilled in the art that all or part of the steps for implementing the above embodiments may be implemented by hardware, or may be implemented by a program instructing relevant hardware, where the program may be stored in a computer-readable storage medium, and the above-mentioned storage medium may be a read-only memory, a magnetic disk or an optical disk, etc.
Those of ordinary skill in the art will understand that: the discussion of any embodiment above is meant to be exemplary only, and is not intended to intimate that the scope of the disclosure, including the claims, of embodiments of the invention is limited to these examples; within the idea of an embodiment of the invention, also technical features in the above embodiment or in different embodiments may be combined and there are many other variations of the different aspects of an embodiment of the invention as described above, which are not provided in detail for the sake of brevity. Therefore, any omissions, modifications, substitutions, improvements, and the like that may be made without departing from the spirit and principles of the embodiments of the present invention are intended to be included within the scope of the embodiments of the present invention.
Claims (10)
1. A method for functional testing of a remote direct memory access network cluster, comprising performing the following steps in sequence on an external device connected to the network cluster:
checking and confirming that a disk performance benchmark test tool exists in the network cluster;
using the disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on a virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes;
in response to the absence of disconnected nodes, cyclically detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected with each other using the disk performance benchmark test tool;
responding to the remote direct memory access functions are communicated with each other, using the disk performance benchmark test tool to detect whether network congestion exists in all nodes in the network cluster in a circulating mode, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster so as to execute functional test again;
and feeding back the remote direct memory access function test success information to the tester through the external device in response to all of the above-mentioned passes.
2. The method of claim 1, further comprising: in response to the existence of the disconnected remote direct memory access function, counting the number of virtual network cards of the nodes of which the remote direct memory access function is not connected; detecting, in response to the number being 1 or 2, whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected to each other; removing the nodes with disconnected remote direct memory access function from the network cluster to re-execute functional testing in response to the number being greater than 2.
3. The method of claim 1 or 2, wherein removing the network-congested node to re-perform functional testing comprises: removing nodes, adding new nodes for replacing the removed nodes for the network cluster, and re-executing functional test based on the processed network cluster.
4. The method of claim 1, further comprising performing the following steps separately and sequentially at each node of the network cluster prior to testing using external devices:
checking whether the node supports the remote direct memory access function, terminating testing and removing the node from the network cluster to re-execute functional testing in response to the node not supporting the remote direct memory access function;
responding to the node supporting the remote direct memory access function, checking the starting number of the virtual network ports of the node, and responding to the starting number not being equal to the number of the virtual network cards of the node to terminate the test and prompt the check of the virtual network cards.
5. The method of claim 1, wherein all nodes in the network cluster include one or more remote direct memory access network cards;
the method further comprises the following steps: all of the network cards in a node having a plurality of remote direct memory access network cards are tested in response to testing the node.
6. A functional test apparatus for a remote direct memory access network cluster, comprising:
a processor; and
a memory storing processor executable program code which when executed performs the following steps in sequence on an external device connected to the network cluster:
checking and confirming that a disk performance benchmark test tool exists in the network cluster;
using the disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on a virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes;
in response to the absence of disconnected nodes, cyclically detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected with each other using the disk performance benchmark test tool;
responding to the remote direct memory access functions are communicated with each other, using the disk performance benchmark test tool to detect whether network congestion exists in all nodes in the network cluster in a circulating mode, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster so as to execute functional test again;
and feeding back the remote direct memory access function test success information to the tester through the external device in response to all of the above-mentioned passes.
7. The apparatus of claim 6, wherein the steps further comprise: in response to the existence of the disconnected remote direct memory access function, counting the number of virtual network cards of the nodes of which the remote direct memory access function is not connected; detecting, in response to the number being 1 or 2, whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected to each other; removing the nodes with disconnected remote direct memory access function from the network cluster to re-execute functional testing in response to the number being greater than 2.
8. The apparatus of claim 6 or 7, wherein removing the network-congested node to re-perform functional testing comprises: removing nodes, adding new nodes for replacing the removed nodes for the network cluster, and re-executing functional test based on the processed network cluster.
9. The apparatus of claim 6, wherein the program code, when executed, further performs the following steps separately and sequentially at each node of the network cluster before testing with an external device:
checking whether the node supports the remote direct memory access function, terminating testing and removing the node from the network cluster to re-execute functional testing in response to the node not supporting the remote direct memory access function;
responding to the node supporting the remote direct memory access function, checking the starting number of the virtual network ports of the node, and responding to the starting number not being equal to the number of the virtual network cards of the node to terminate the test and prompt the check of the virtual network cards.
10. The apparatus of claim 6, wherein all nodes in the network cluster comprise one or more remote direct memory access network cards; the steps further include: all of the network cards in a node having a plurality of remote direct memory access network cards are tested in response to testing the node.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010174053.0A CN111404780B (en) | 2020-03-13 | 2020-03-13 | Function test method and device for remote direct memory access network cluster |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010174053.0A CN111404780B (en) | 2020-03-13 | 2020-03-13 | Function test method and device for remote direct memory access network cluster |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111404780A true CN111404780A (en) | 2020-07-10 |
CN111404780B CN111404780B (en) | 2022-02-25 |
Family
ID=71428912
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010174053.0A Active CN111404780B (en) | 2020-03-13 | 2020-03-13 | Function test method and device for remote direct memory access network cluster |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111404780B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112905718A (en) * | 2021-03-09 | 2021-06-04 | 东莞中国科学院云计算产业技术创新与育成中心 | Data management method, system, electronic device and medium based on super-fusion architecture |
CN112948229A (en) * | 2021-03-16 | 2021-06-11 | 广州虎牙科技有限公司 | Method and device for determining performance of scheduling cluster, computer equipment and storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106533825A (en) * | 2016-10-25 | 2017-03-22 | 广东浪潮大数据研究有限公司 | Linux system-based SR-IOV function testing method |
CN106899454A (en) * | 2017-03-06 | 2017-06-27 | 郑州云海信息技术有限公司 | A kind of method and system based on automatic test RoCE performances under linux system |
CN110460491A (en) * | 2019-07-05 | 2019-11-15 | 锐捷网络股份有限公司 | Performance test methods and device based on RDMA |
CN110830283A (en) * | 2018-08-10 | 2020-02-21 | 华为技术有限公司 | Fault detection method, device, equipment and system |
-
2020
- 2020-03-13 CN CN202010174053.0A patent/CN111404780B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106533825A (en) * | 2016-10-25 | 2017-03-22 | 广东浪潮大数据研究有限公司 | Linux system-based SR-IOV function testing method |
CN106899454A (en) * | 2017-03-06 | 2017-06-27 | 郑州云海信息技术有限公司 | A kind of method and system based on automatic test RoCE performances under linux system |
CN110830283A (en) * | 2018-08-10 | 2020-02-21 | 华为技术有限公司 | Fault detection method, device, equipment and system |
CN110460491A (en) * | 2019-07-05 | 2019-11-15 | 锐捷网络股份有限公司 | Performance test methods and device based on RDMA |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112905718A (en) * | 2021-03-09 | 2021-06-04 | 东莞中国科学院云计算产业技术创新与育成中心 | Data management method, system, electronic device and medium based on super-fusion architecture |
CN112948229A (en) * | 2021-03-16 | 2021-06-11 | 广州虎牙科技有限公司 | Method and device for determining performance of scheduling cluster, computer equipment and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN111404780B (en) | 2022-02-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111404780B (en) | Function test method and device for remote direct memory access network cluster | |
CN107506295B (en) | Virtual machine backup testing method and device and computer readable storage medium | |
CN105553783A (en) | Automated testing method for switching of configuration two-computer resources | |
CN113300953B (en) | Management method, system and related device for multipath failover group | |
CN107181636B (en) | Health check method and device in load balancing system | |
CN105700975A (en) | Heat removing method and device as well as heat adding method and device for CPUs | |
CN111988169B (en) | Method, system, equipment and medium for cleaning and repairing abnormal disk of cloud platform | |
CN112579007B (en) | Method and device for acquiring storage full link and electronic equipment | |
CN116701518A (en) | Database cluster deployment method and device, computer equipment and storage medium | |
US20200351157A1 (en) | Optimistic and failsafe network configuration | |
CN116743619B (en) | Network service testing method, device, equipment and storage medium | |
CN113377702A (en) | Method and device for starting two-node cluster, electronic equipment and storage medium | |
CN109885420B (en) | PCIe link fault analysis method, BMC and storage medium | |
US9400605B2 (en) | Efficient management of a virtual tape library cluster | |
JP6760086B2 (en) | Setting program, setting method, and setting device | |
US11403036B2 (en) | Recover time improvement mechanism after device path failure in a storage system | |
CN110855526B (en) | Method and device for detecting data source connection, storage medium and electronic equipment | |
CN111147400B (en) | Method and device for command line configuration synchronization and electronic equipment | |
US20150195366A1 (en) | Switching device, non-transitory computer readable medium and zoning setting method | |
CN112131201A (en) | Method, system, equipment and medium for high availability of network additional storage | |
CN109542692A (en) | Greenplum cluster monitoring method and apparatus | |
CN118646595B (en) | Method, equipment and storage medium for repairing server to avoid secret mutual trust | |
US20140280369A1 (en) | Cloning tables and lists while retaining the relationship between entry elements and their assigned identifiers | |
CN117714458A (en) | Batch tenant migration method, system and device based on distribution | |
CN115484164A (en) | Method and system for deploying a production system in a virtualized environment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |