CN111404780A - Function test method and device for remote direct memory access network cluster - Google Patents

Function test method and device for remote direct memory access network cluster Download PDF

Info

Publication number
CN111404780A
CN111404780A CN202010174053.0A CN202010174053A CN111404780A CN 111404780 A CN111404780 A CN 111404780A CN 202010174053 A CN202010174053 A CN 202010174053A CN 111404780 A CN111404780 A CN 111404780A
Authority
CN
China
Prior art keywords
network
nodes
memory access
direct memory
remote direct
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010174053.0A
Other languages
Chinese (zh)
Other versions
CN111404780B (en
Inventor
蒋帅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202010174053.0A priority Critical patent/CN111404780B/en
Publication of CN111404780A publication Critical patent/CN111404780A/en
Application granted granted Critical
Publication of CN111404780B publication Critical patent/CN111404780B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0805Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
    • H04L43/0811Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking connectivity
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • H04L41/0663Performing the actions predefined by failover planning, e.g. switching to standby network elements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0805Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
    • H04L43/0817Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking functioning

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Environmental & Geological Engineering (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention discloses a method and a device for testing functions of a remote direct memory access network cluster, wherein the method comprises the following steps: checking and confirming that a disk performance benchmark test tool exists in the network cluster; circularly detecting whether all nodes in the network cluster are mutually connected on the virtual network by using a disk performance benchmark test tool, and prompting the virtual network card names of the nodes which are not connected; circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are mutually communicated by using a disk performance benchmark test tool; circularly detecting whether all nodes in the network cluster have network blockage by using a disk performance benchmark test tool, and removing the nodes with network blockage from the network cluster; and feeding back the remote direct memory access function test success information to the tester through the external device in response to all the passes. The method and the system can quickly determine whether the RDMA network function in the Microsoft super fusion cluster can be normally connected to work.

Description

Function test method and device for remote direct memory access network cluster
Technical Field
The present invention relates to the field of computer technologies, and in particular, to a method and an apparatus for testing a function of a remote direct memory access network cluster.
Background
The Azure Stack HCI can be seamlessly connected to the Azure to realize the functions of cloud backup, site recovery, cloud monitoring and the like. Storage space cut-through uses industry standard servers with locally attached drivers to create highly available, highly scalable software defined storage, at a cost that is only a small fraction of a traditional SAN or NAS array, a converged or hyper-converged architecture fundamentally simplifies procurement and deployment, while caching, storage tier, erasure coding, and other functions, as well as new hardware innovations (such as RDMA networks and NVMe drivers) also represent higher efficiency and performance.
At present, no test method specially aiming at the functional connectivity of RDMA (remote direct memory access) networks in the micro-soft super-converged cluster exists, after the micro-soft super-converged cluster is built, the interoperability of the RDMA networks in all the servers in the cluster cannot be ensured even if the RDMA function of the whole cluster network is started, if all the RDMA networks in the cluster are not successfully connected, the performance of the super-converged cluster is greatly influenced, and the stability of the normal work of the micro-soft super-converged cluster cannot be ensured.
Aiming at the problem that the interoperability test of the RDMA network in the prior art is difficult, no effective solution is available at present.
Disclosure of Invention
In view of this, an object of the embodiments of the present invention is to provide a method and an apparatus for testing a remote direct memory access network cluster function, which can quickly determine whether an RDMA network function in a microsoft super fusion cluster can be connected normally, improve cluster stability and performance, and reduce network delay and connectivity failure problems.
In view of the foregoing, a first aspect of embodiments of the present invention provides a method for testing a function of a remote direct memory access network cluster, including sequentially performing, on an external device connected to the network cluster, the following steps:
checking and confirming that a disk performance benchmark test tool exists in the network cluster;
using a disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on the virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes;
responding to the absence of disconnected nodes, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected or not by using a disk performance benchmark test tool;
responding to the mutual communication of the remote direct memory access functions, circularly detecting whether network congestion exists in all nodes in the network cluster by using a disk performance benchmark test tool, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster to re-execute the functional test;
and feeding back the remote direct memory access function test success information to the tester through the external device in response to all the passes.
In some embodiments, further comprising: in response to the existence of the disconnected remote direct memory access function, counting the number of the virtual network cards of the nodes of which the remote direct memory access function is not connected; in response to the number being 1 or 2, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are communicated with each other; removing nodes from the network cluster for which the remote direct memory access functionality is not connected in response to the number being greater than 2 to re-perform the functional test.
In some embodiments, removing the network-congested node to re-perform the functional test includes: and removing the nodes, adding new nodes for replacing the removed nodes to the network cluster, and re-executing the functional test based on the processed network cluster.
In some embodiments, the method further comprises, prior to testing using the external device, separately and sequentially performing at each node of the network cluster the steps of:
checking whether the node supports remote direct memory access functionality, terminating the test and removing the node from the network cluster to re-execute the functional test in response to the node not supporting remote direct memory access functionality;
the method comprises the steps of responding to the fact that the node supports the remote direct memory access function, checking the starting number of virtual network interfaces of the node, responding to the fact that the starting number is not equal to the number of virtual network cards of the node, terminating the test and prompting to check the virtual network cards.
In some embodiments, all nodes in the network cluster include one or more remote direct memory access network cards; the method further comprises the following steps: all of the network cards in the node are tested in response to testing the node having a plurality of remote direct memory access network cards.
A second aspect of an embodiment of the present invention provides a function testing apparatus for a remote direct memory access network cluster, including:
a processor; and
a memory storing program code executable by the processor, the program code when executed sequentially performing the following steps on an external device connected to the network cluster:
checking and confirming that a disk performance benchmark test tool exists in the network cluster;
using a disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on the virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes;
responding to the absence of disconnected nodes, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected or not by using a disk performance benchmark test tool;
responding to the mutual communication of the remote direct memory access functions, circularly detecting whether network congestion exists in all nodes in the network cluster by using a disk performance benchmark test tool, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster to re-execute the functional test;
and feeding back the remote direct memory access function test success information to the tester through the external device in response to all the passes.
In some embodiments, the steps further comprise: in response to the existence of the disconnected remote direct memory access function, counting the number of the virtual network cards of the nodes of which the remote direct memory access function is not connected; in response to the number being 1 or 2, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are communicated with each other; removing nodes from the network cluster for which the remote direct memory access functionality is not connected in response to the number being greater than 2 to re-perform the functional test.
In some embodiments, removing the network-congested node to re-perform the functional test includes: and removing the nodes, adding new nodes for replacing the removed nodes to the network cluster, and re-executing the functional test based on the processed network cluster.
In some embodiments, the program code, when executed, further performs the following steps separately and in sequence on each node of the network cluster prior to testing with the external device:
checking whether the node supports remote direct memory access functionality, terminating the test and removing the node from the network cluster to re-execute the functional test in response to the node not supporting remote direct memory access functionality;
the method comprises the steps of responding to the fact that the node supports the remote direct memory access function, checking the starting number of virtual network interfaces of the node, responding to the fact that the starting number is not equal to the number of virtual network cards of the node, terminating the test and prompting to check the virtual network cards.
In some embodiments, all nodes in the network cluster include one or more remote direct memory access network cards; the method also comprises the following steps: all of the network cards in the node are tested in response to testing the node having a plurality of remote direct memory access network cards.
The invention has the following beneficial technical effects: according to the method and the device for testing the functions of the remote direct memory access network cluster, provided by the embodiment of the invention, a disk performance benchmark test tool exists in the network cluster through checking and confirming; using a disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on the virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes; responding to the absence of disconnected nodes, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected or not by using a disk performance benchmark test tool; responding to the mutual communication of the remote direct memory access functions, circularly detecting whether network congestion exists in all nodes in the network cluster by using a disk performance benchmark test tool, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster to re-execute the functional test; the technical scheme that successful test information of the remote direct memory access function is fed back to a tester through the external equipment in response to all the passing of the RDMA network function is responded, whether the RDMA network function in the Microsoft super fusion cluster can be connected normally or not can be determined quickly, the cluster stability and performance are improved, and the problems of network delay and connection failure are reduced.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
FIG. 1 is a schematic flow chart illustrating a method for testing the functions of a remote direct memory access network cluster according to the present invention;
fig. 2 is a detailed flowchart of a method for testing the functions of a remote direct memory access network cluster according to the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the following embodiments of the present invention are described in further detail with reference to the accompanying drawings.
It should be noted that all expressions using "first" and "second" in the embodiments of the present invention are used for distinguishing two entities with the same name but different names or different parameters, and it should be noted that "first" and "second" are merely for convenience of description and should not be construed as limitations of the embodiments of the present invention, and they are not described in any more detail in the following embodiments.
In view of the above, a first aspect of the embodiments of the present invention provides an embodiment of a method for quickly determining whether RDMA network functions in microsoft super converged cluster can work normally. Fig. 1 is a schematic flow chart illustrating a method for testing functions of a remote direct memory access network cluster according to the present invention.
The method for testing the function of the remote direct memory access network cluster, as shown in fig. 1, includes the following steps executed in sequence on an external device connected to the network cluster:
step S101: checking and confirming that a disk performance benchmark test tool exists in the network cluster;
step S103: using a disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on the virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes;
step S105: responding to the absence of disconnected nodes, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected or not by using a disk performance benchmark test tool;
step S107: responding to the mutual communication of the remote direct memory access functions, circularly detecting whether network congestion exists in all nodes in the network cluster by using a disk performance benchmark test tool, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster to re-execute the functional test;
step S109: and feeding back the remote direct memory access function test success information to the tester through the external device in response to all the passes.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), a Random Access Memory (RAM), or the like. Embodiments of the computer program may achieve the same or similar effects as any of the preceding method embodiments to which it corresponds.
In some embodiments, further comprising: in response to the existence of the disconnected remote direct memory access function, counting the number of the virtual network cards of the nodes of which the remote direct memory access function is not connected; in response to the number being 1 or 2, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are communicated with each other; removing nodes from the network cluster for which the remote direct memory access functionality is not connected in response to the number being greater than 2 to re-perform the functional test.
In some embodiments, removing the network-congested node to re-perform the functional test includes: and removing the nodes, adding new nodes for replacing the removed nodes to the network cluster, and re-executing the functional test based on the processed network cluster.
In some embodiments, the method further comprises, prior to testing using the external device, separately and sequentially performing at each node of the network cluster the steps of:
checking whether the node supports remote direct memory access functionality, terminating the test and removing the node from the network cluster to re-execute the functional test in response to the node not supporting remote direct memory access functionality;
the method comprises the steps of responding to the fact that the node supports the remote direct memory access function, checking the starting number of virtual network interfaces of the node, responding to the fact that the starting number is not equal to the number of virtual network cards of the node, terminating the test and prompting to check the virtual network cards.
In some embodiments, all nodes in the network cluster include one or more remote direct memory access network cards; the method further comprises the following steps: all of the network cards in the node are tested in response to testing the node having a plurality of remote direct memory access network cards.
The method disclosed according to an embodiment of the present invention may also be implemented as a computer program executed by a CPU (central processing unit), and the computer program may be stored in a computer-readable storage medium. The computer program, when executed by the CPU, performs the above-described functions defined in the method disclosed in the embodiments of the present invention. The above-described method steps and system elements may also be implemented using a controller and a computer-readable storage medium for storing a computer program for causing the controller to implement the functions of the above-described steps or elements.
The following further illustrates embodiments of the invention in terms of specific examples as shown in fig. 2.
1. According to the selection standard of Microsoft Azure stack HCI hardware, selecting server nodes supporting RDMA network cards to form a micro-soft super-fusion cluster;
2. configuring a cluster network:
assuming that the current cluster has 4 server nodes and the number of RDMA network cards of each node is 1, that is, there are now 4 RDMA functional network cards in the cluster, and the following operations are respectively performed under Windows powershell of each server node in the cluster:
PS>New-VMSwitch–Name SETswitch–NetAdapterName“nic1”–EnableEmbeddedTeaming$true
# Create virtual switch, nic1 is RDMA Port name
PS>Add-VMNetworkAdapter–switchname SETswitch–Name SMB_1–managementos
# create virtual network adapter SMB _1
PS>Enable-NetAdapterRDMA“vEthernet(SMB_1)”
RDMA function of # opening virtual network interface vEthernet (SMB _1)
PS>Set-VMNetworkAdapterTeamMapping-VMNetworkAdapterName"SMB_1"–ManagementOS–PhysicalNetAdapterName"NIC1"
# associating SMB _1 configured for RDMA with a Material adapter NIC1 connected to a virtual machine switch
3. Verifying RDMA normal connectivity in the micro-soft super-converged cluster:
PS>Get-NetIPConfiguration-InterfaceAlias"vEthernet(SMB_1)"|
ft InterfaceAlias,InterfaceIndex,IPv4Address
# obtaining Interface Index and IP corresponding to network port
PS C:\>Get-SmbClientNetworkInterface
# View RDMA network card status. If the RDMA capable state is false, the network card can be restarted or the state of the machine to be tested can be updated.
And then entering a cluster external host test stage. Create C: \ TEST directory under C disk, use DiskSpd tool and copy decompress to C: \ TEST path, copy script content at file end to C: \ TEST path of cluster external host TC in powershell script form, named TEST.
PS>C:\TEST\Test.PS1-IfIndex 3-IsRoCE$true-RemoteIpAddress192.168.1.1,192.168.1.2,192.168.1.3,192.168.1.4-PathToDiskspdC:\TEST\Diskspd-v2.0.17\amd64fre\
# verification RDMA connection status Using RoCE function (IP Address is virtual network Port IP Address in Microsoft super fusion Cluster node under test)
After execution, a prompt of connection success or failure occurs, and if success indicates that the RDMA network is normally connected, failure requires checking a specific prompt problem.
It can be seen from the foregoing embodiments that, in the functional test method for accessing a network cluster by a remote direct memory according to the embodiments of the present invention, by checking whether a disk performance benchmark test tool exists in the network cluster, a test is terminated and installation of the disk performance benchmark test tool is prompted in response to the absence of the disk performance benchmark test tool; using a disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on the virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes; circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are mutually communicated by using a disk performance benchmark test tool, and responding to the mutual communication of the remote direct memory access functions and continuing the next step; using a disk performance benchmark test tool to circularly detect whether network congestion exists in all nodes in the network cluster, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster to re-execute the functional test; the technical scheme that successful test information of the remote direct memory access function is fed back to a tester through the external equipment in response to all the passing of the RDMA network function is responded, whether the RDMA network function in the Microsoft super fusion cluster can be connected normally or not can be determined quickly, the cluster stability and performance are improved, and the problems of network delay and connection failure are reduced.
It should be particularly noted that, the steps in the embodiments of the method for testing functions of a remote direct memory access network cluster described above may be mutually intersected, replaced, added, or deleted, and therefore, these reasonable permutations and combinations should also belong to the scope of the present invention, and the scope of the present invention should not be limited to the described embodiments.
In view of the above objects, a second aspect of the embodiments of the present invention provides an embodiment of a device capable of quickly determining whether RDMA network functions in microsoft super converged cluster can work normally in a connected mode. The function test device of the remote direct memory access network cluster comprises:
a processor; and
a memory storing program code executable by the processor, the program code when executed sequentially performing the following steps on an external device connected to the network cluster:
checking and confirming whether a disk performance benchmark test tool exists in the network cluster;
using a disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on the virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes;
responding to the absence of disconnected nodes, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected or not by using a disk performance benchmark test tool;
responding to the mutual communication of the remote direct memory access functions, circularly detecting whether network congestion exists in all nodes in the network cluster by using a disk performance benchmark test tool, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster to re-execute the functional test;
and feeding back the remote direct memory access function test success information to the tester through the external device in response to all the passes.
In some embodiments, the steps further comprise: in response to the existence of the disconnected remote direct memory access function, counting the number of the virtual network cards of the nodes of which the remote direct memory access function is not connected; in response to the number being 1 or 2, circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are communicated with each other; removing nodes from the network cluster for which the remote direct memory access functionality is not connected in response to the number being greater than 2 to re-perform the functional test.
In some embodiments, removing the network-congested node to re-perform the functional test includes: and removing the nodes, adding new nodes for replacing the removed nodes to the network cluster, and re-executing the functional test based on the processed network cluster.
In some embodiments, the program code, when executed, further performs the following steps separately and in sequence on each node of the network cluster prior to testing with the external device:
checking whether the node supports remote direct memory access functionality, terminating the test and removing the node from the network cluster to re-execute the functional test in response to the node not supporting remote direct memory access functionality;
the method comprises the steps of responding to the fact that the node supports the remote direct memory access function, checking the starting number of virtual network interfaces of the node, responding to the fact that the starting number is not equal to the number of virtual network cards of the node, terminating the test and prompting to check the virtual network cards.
In some embodiments, all nodes in the network cluster include one or more remote direct memory access network cards; the method also comprises the following steps: all of the network cards in the node are tested in response to testing the node having a plurality of remote direct memory access network cards.
It can be seen from the foregoing embodiments that, in the functional test apparatus for a remote direct memory access network cluster according to an embodiment of the present invention, by checking whether a disk performance benchmark test tool exists in the network cluster, a test is terminated and installation of the disk performance benchmark test tool is prompted in response to the absence of the disk performance benchmark test tool; using a disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on the virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes; circularly detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are mutually communicated by using a disk performance benchmark test tool, and responding to the mutual communication of the remote direct memory access functions and continuing the next step; using a disk performance benchmark test tool to circularly detect whether network congestion exists in all nodes in the network cluster, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster to re-execute the functional test; the technical scheme that successful test information of the remote direct memory access function is fed back to a tester through the external equipment in response to all the passing of the RDMA network function is responded, whether the RDMA network function in the Microsoft super fusion cluster can be connected normally or not can be determined quickly, the cluster stability and performance are improved, and the problems of network delay and connection failure are reduced.
It should be particularly noted that, the above-mentioned embodiment of the functional test apparatus of the remote direct memory access network cluster specifically describes the working process of each module by using the embodiment of the functional test method of the remote direct memory access network cluster, and those skilled in the art can easily think that these modules are applied to other embodiments of the functional test method of the remote direct memory access network cluster. Of course, since the steps in the embodiment of the method for testing the functions of the remote dma network cluster may be mutually intersected, replaced, added, or deleted, these reasonable permutations and combinations should also be changed to the function testing apparatus of the remote dma network cluster, and the scope of the present invention should not be limited to the embodiment.
The foregoing is an exemplary embodiment of the present disclosure, but it should be noted that various changes and modifications could be made herein without departing from the scope of the present disclosure as defined by the appended claims. The functions, steps and/or actions of the method claims in accordance with the disclosed embodiments described herein need not be performed in any particular order. Furthermore, although elements of the disclosed embodiments of the invention may be described or claimed in the singular, the plural is contemplated unless limitation to the singular is explicitly stated.
It should be understood that, as used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context clearly supports the exception. It should also be understood that "and/or" as used herein is meant to include any and all possible combinations of one or more of the associated listed items. The numbers of the embodiments disclosed in the embodiments of the present invention are merely for description, and do not represent the merits of the embodiments.
It will be understood by those skilled in the art that all or part of the steps for implementing the above embodiments may be implemented by hardware, or may be implemented by a program instructing relevant hardware, where the program may be stored in a computer-readable storage medium, and the above-mentioned storage medium may be a read-only memory, a magnetic disk or an optical disk, etc.
Those of ordinary skill in the art will understand that: the discussion of any embodiment above is meant to be exemplary only, and is not intended to intimate that the scope of the disclosure, including the claims, of embodiments of the invention is limited to these examples; within the idea of an embodiment of the invention, also technical features in the above embodiment or in different embodiments may be combined and there are many other variations of the different aspects of an embodiment of the invention as described above, which are not provided in detail for the sake of brevity. Therefore, any omissions, modifications, substitutions, improvements, and the like that may be made without departing from the spirit and principles of the embodiments of the present invention are intended to be included within the scope of the embodiments of the present invention.

Claims (10)

1. A method for functional testing of a remote direct memory access network cluster, comprising performing the following steps in sequence on an external device connected to the network cluster:
checking and confirming that a disk performance benchmark test tool exists in the network cluster;
using the disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on a virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes;
in response to the absence of disconnected nodes, cyclically detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected with each other using the disk performance benchmark test tool;
responding to the remote direct memory access functions are communicated with each other, using the disk performance benchmark test tool to detect whether network congestion exists in all nodes in the network cluster in a circulating mode, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster so as to execute functional test again;
and feeding back the remote direct memory access function test success information to the tester through the external device in response to all of the above-mentioned passes.
2. The method of claim 1, further comprising: in response to the existence of the disconnected remote direct memory access function, counting the number of virtual network cards of the nodes of which the remote direct memory access function is not connected; detecting, in response to the number being 1 or 2, whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected to each other; removing the nodes with disconnected remote direct memory access function from the network cluster to re-execute functional testing in response to the number being greater than 2.
3. The method of claim 1 or 2, wherein removing the network-congested node to re-perform functional testing comprises: removing nodes, adding new nodes for replacing the removed nodes for the network cluster, and re-executing functional test based on the processed network cluster.
4. The method of claim 1, further comprising performing the following steps separately and sequentially at each node of the network cluster prior to testing using external devices:
checking whether the node supports the remote direct memory access function, terminating testing and removing the node from the network cluster to re-execute functional testing in response to the node not supporting the remote direct memory access function;
responding to the node supporting the remote direct memory access function, checking the starting number of the virtual network ports of the node, and responding to the starting number not being equal to the number of the virtual network cards of the node to terminate the test and prompt the check of the virtual network cards.
5. The method of claim 1, wherein all nodes in the network cluster include one or more remote direct memory access network cards;
the method further comprises the following steps: all of the network cards in a node having a plurality of remote direct memory access network cards are tested in response to testing the node.
6. A functional test apparatus for a remote direct memory access network cluster, comprising:
a processor; and
a memory storing processor executable program code which when executed performs the following steps in sequence on an external device connected to the network cluster:
checking and confirming that a disk performance benchmark test tool exists in the network cluster;
using the disk performance benchmark test tool to circularly detect whether all nodes in the network cluster are mutually connected on a virtual network, responding to the existence of the unconnected nodes to terminate the test and prompt the names of the virtual network cards of the unconnected nodes;
in response to the absence of disconnected nodes, cyclically detecting whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected with each other using the disk performance benchmark test tool;
responding to the remote direct memory access functions are communicated with each other, using the disk performance benchmark test tool to detect whether network congestion exists in all nodes in the network cluster in a circulating mode, responding to the nodes with the network congestion, terminating the test, and removing the nodes with the network congestion from the network cluster so as to execute functional test again;
and feeding back the remote direct memory access function test success information to the tester through the external device in response to all of the above-mentioned passes.
7. The apparatus of claim 6, wherein the steps further comprise: in response to the existence of the disconnected remote direct memory access function, counting the number of virtual network cards of the nodes of which the remote direct memory access function is not connected; detecting, in response to the number being 1 or 2, whether remote direct memory access functions of all nodes in the network cluster on the virtual network are connected to each other; removing the nodes with disconnected remote direct memory access function from the network cluster to re-execute functional testing in response to the number being greater than 2.
8. The apparatus of claim 6 or 7, wherein removing the network-congested node to re-perform functional testing comprises: removing nodes, adding new nodes for replacing the removed nodes for the network cluster, and re-executing functional test based on the processed network cluster.
9. The apparatus of claim 6, wherein the program code, when executed, further performs the following steps separately and sequentially at each node of the network cluster before testing with an external device:
checking whether the node supports the remote direct memory access function, terminating testing and removing the node from the network cluster to re-execute functional testing in response to the node not supporting the remote direct memory access function;
responding to the node supporting the remote direct memory access function, checking the starting number of the virtual network ports of the node, and responding to the starting number not being equal to the number of the virtual network cards of the node to terminate the test and prompt the check of the virtual network cards.
10. The apparatus of claim 6, wherein all nodes in the network cluster comprise one or more remote direct memory access network cards; the steps further include: all of the network cards in a node having a plurality of remote direct memory access network cards are tested in response to testing the node.
CN202010174053.0A 2020-03-13 2020-03-13 Function test method and device for remote direct memory access network cluster Active CN111404780B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010174053.0A CN111404780B (en) 2020-03-13 2020-03-13 Function test method and device for remote direct memory access network cluster

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010174053.0A CN111404780B (en) 2020-03-13 2020-03-13 Function test method and device for remote direct memory access network cluster

Publications (2)

Publication Number Publication Date
CN111404780A true CN111404780A (en) 2020-07-10
CN111404780B CN111404780B (en) 2022-02-25

Family

ID=71428912

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010174053.0A Active CN111404780B (en) 2020-03-13 2020-03-13 Function test method and device for remote direct memory access network cluster

Country Status (1)

Country Link
CN (1) CN111404780B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112905718A (en) * 2021-03-09 2021-06-04 东莞中国科学院云计算产业技术创新与育成中心 Data management method, system, electronic device and medium based on super-fusion architecture
CN112948229A (en) * 2021-03-16 2021-06-11 广州虎牙科技有限公司 Method and device for determining performance of scheduling cluster, computer equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106533825A (en) * 2016-10-25 2017-03-22 广东浪潮大数据研究有限公司 Linux system-based SR-IOV function testing method
CN106899454A (en) * 2017-03-06 2017-06-27 郑州云海信息技术有限公司 A kind of method and system based on automatic test RoCE performances under linux system
CN110460491A (en) * 2019-07-05 2019-11-15 锐捷网络股份有限公司 Performance test methods and device based on RDMA
CN110830283A (en) * 2018-08-10 2020-02-21 华为技术有限公司 Fault detection method, device, equipment and system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106533825A (en) * 2016-10-25 2017-03-22 广东浪潮大数据研究有限公司 Linux system-based SR-IOV function testing method
CN106899454A (en) * 2017-03-06 2017-06-27 郑州云海信息技术有限公司 A kind of method and system based on automatic test RoCE performances under linux system
CN110830283A (en) * 2018-08-10 2020-02-21 华为技术有限公司 Fault detection method, device, equipment and system
CN110460491A (en) * 2019-07-05 2019-11-15 锐捷网络股份有限公司 Performance test methods and device based on RDMA

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112905718A (en) * 2021-03-09 2021-06-04 东莞中国科学院云计算产业技术创新与育成中心 Data management method, system, electronic device and medium based on super-fusion architecture
CN112948229A (en) * 2021-03-16 2021-06-11 广州虎牙科技有限公司 Method and device for determining performance of scheduling cluster, computer equipment and storage medium

Also Published As

Publication number Publication date
CN111404780B (en) 2022-02-25

Similar Documents

Publication Publication Date Title
CN111404780B (en) Function test method and device for remote direct memory access network cluster
CN107506295B (en) Virtual machine backup testing method and device and computer readable storage medium
CN105553783A (en) Automated testing method for switching of configuration two-computer resources
CN113300953B (en) Management method, system and related device for multipath failover group
CN107181636B (en) Health check method and device in load balancing system
CN105700975A (en) Heat removing method and device as well as heat adding method and device for CPUs
CN111988169B (en) Method, system, equipment and medium for cleaning and repairing abnormal disk of cloud platform
CN112579007B (en) Method and device for acquiring storage full link and electronic equipment
CN116701518A (en) Database cluster deployment method and device, computer equipment and storage medium
US20200351157A1 (en) Optimistic and failsafe network configuration
CN116743619B (en) Network service testing method, device, equipment and storage medium
CN113377702A (en) Method and device for starting two-node cluster, electronic equipment and storage medium
CN109885420B (en) PCIe link fault analysis method, BMC and storage medium
US9400605B2 (en) Efficient management of a virtual tape library cluster
JP6760086B2 (en) Setting program, setting method, and setting device
US11403036B2 (en) Recover time improvement mechanism after device path failure in a storage system
CN110855526B (en) Method and device for detecting data source connection, storage medium and electronic equipment
CN111147400B (en) Method and device for command line configuration synchronization and electronic equipment
US20150195366A1 (en) Switching device, non-transitory computer readable medium and zoning setting method
CN112131201A (en) Method, system, equipment and medium for high availability of network additional storage
CN109542692A (en) Greenplum cluster monitoring method and apparatus
CN118646595B (en) Method, equipment and storage medium for repairing server to avoid secret mutual trust
US20140280369A1 (en) Cloning tables and lists while retaining the relationship between entry elements and their assigned identifiers
CN117714458A (en) Batch tenant migration method, system and device based on distribution
CN115484164A (en) Method and system for deploying a production system in a virtualized environment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant