US10911541B1 - Data transmission and network interface controller - Google Patents

Data transmission and network interface controller Download PDF

Info

Publication number
US10911541B1
US10911541B1 US16/945,621 US202016945621A US10911541B1 US 10911541 B1 US10911541 B1 US 10911541B1 US 202016945621 A US202016945621 A US 202016945621A US 10911541 B1 US10911541 B1 US 10911541B1
Authority
US
United States
Prior art keywords
network interface
interface controller
data packets
rdma network
rdma
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
US16/945,621
Other versions
US20210014307A1 (en
Inventor
Changqing Li
Yinchao ZOU
Peng Wu
Jincan Kong
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Advanced New Technologies Co Ltd
Original Assignee
Advanced New Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CN201910624819.8A external-priority patent/CN110460412B/en
Priority to US16/945,621 priority Critical patent/US10911541B1/en
Application filed by Advanced New Technologies Co Ltd filed Critical Advanced New Technologies Co Ltd
Assigned to ALIBABA GROUP HOLDING LIMITED reassignment ALIBABA GROUP HOLDING LIMITED ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: LI, CHANGQING
Assigned to ADVANTAGEOUS NEW TECHNOLOGIES CO., LTD. reassignment ADVANTAGEOUS NEW TECHNOLOGIES CO., LTD. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: ALIBABA GROUP HOLDING LIMITED
Assigned to Advanced New Technologies Co., Ltd. reassignment Advanced New Technologies Co., Ltd. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: ADVANTAGEOUS NEW TECHNOLOGIES CO., LTD.
Assigned to ALIBABA GROUP HOLDING LIMITED reassignment ALIBABA GROUP HOLDING LIMITED ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: KONG, JINCAN, ZOU, YINCHAO, WU, PENG
Publication of US20210014307A1 publication Critical patent/US20210014307A1/en
Priority to US17/164,678 priority patent/US11115474B2/en
Publication of US10911541B1 publication Critical patent/US10911541B1/en
Application granted granted Critical
Priority to US17/412,866 priority patent/US11736567B2/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/30Definitions, standards or architectural aspects of layered protocol stacks
    • H04L69/32Architecture of open systems interconnection [OSI] 7-layer type protocol stacks, e.g. the interfaces between the data link level and the physical level
    • H04L69/322Intralayer communication protocols among peer entities or protocol data unit [PDU] definitions
    • H04L69/329Intralayer communication protocols among peer entities or protocol data unit [PDU] definitions in the application layer [OSI layer 7]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/12Discovery or management of network topologies
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00Packet switching elements
    • H04L49/90Buffering arrangements
    • H04L49/901Buffering arrangements using storage descriptor, e.g. read or write pointers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00Packet switching elements
    • H04L49/90Buffering arrangements
    • H04L49/9084Reactions to storage capacity overflow
    • H04L49/9089Reactions to storage capacity overflow replacing packets in a storage arrangement, e.g. pushout
    • H04L49/9094Arrangements for simultaneous transmit and receive, e.g. simultaneous reading/writing from/to the storage element
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L5/00Arrangements affording multiple use of the transmission path
    • H04L5/003Arrangements for allocating sub-channels of the transmission path
    • H04L5/0053Allocation of signaling, i.e. of overhead other than pilot signals
    • H04L5/0055Physical resource allocation for ACK/NACK
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/104Peer-to-peer [P2P] networks
    • H04L67/1087Peer-to-peer [P2P] networks using cross-functional networking aspects
    • H04L67/1091Interfacing with client-server systems or between P2P systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/12Protocol engines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/16Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
    • G06F15/163Interprocessor communication
    • G06F15/173Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star, snowflake
    • G06F15/17306Intercommunication techniques
    • G06F15/17331Distributed shared memory [DSM], e.g. remote direct memory access [RDMA]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1097Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]

Definitions

  • Implementations of the present disclosure relate to the field of information technologies, and more specifically, to data transmission methods, remote direct memory access (RDMA) network interface controllers, and computing devices.
  • RDMA remote direct memory access
  • RDMA technology can directly move data from a storage area of one host to a storage area of another host without involving an operating system and a central processing unit (CPU), thereby greatly reducing a data processing delay and improving transmission performance. Accelerating network data transmission by using the RDMA technology is a topic of research.
  • implementations of the present specification provide data transmission methods, RDMA network interface controllers (RNICs), and computing devices.
  • RNICs RDMA network interface controllers
  • an implementation of the present specification provides a data transmission method, where the method is executed by a first RDMA network interface controller of a first host, and the method includes: obtaining m to-be-sent data packets from a host memory of the first host, where m is a positive integer; sending the m data packets to a second RDMA network interface controller of a second host, and backing up the m data packets to a network interface controller memory integrated into the first RDMA network interface controller; if determining that the second RDMA network interface controller does not receive n data packets in the m data packets, obtaining the backed-up n data packets from the network interface controller memory, where n is a positive integer; and retransmitting the backed-up n data packets to the second RDMA network interface controller.
  • an implementation of the present specification provides a data transmission method, where the method is executed by a second RDMA network interface controller of a second host, and the method includes: communicating with a first RDMA network interface controller of a first host, so as to receive m data packets from the first RDMA network interface controller, where the m data packets are backed up by the first RDMA network interface controller to a first network interface controller memory integrated into the first RDMA network interface controller, and m is a positive integer; and if determining that n data packets in the m data packets are not received, performing the following operations: storing received data packets in the m data packets into a second network interface controller memory integrated into the second RDMA network interface controller; waiting to receive the n data packets retransmitted by the first RDMA network interface controller, where the retransmitted n data packets are obtained by the first RDMA network interface controller from the first network interface controller memory, and n is a positive integer; and after the retrans
  • an implementation of the present specification provides an RDMA network interface controller, where the RDMA network interface controller is located in a first host, and the RDMA network interface controller includes a first acquisition unit, a backup unit, a sending unit, and a second acquisition unit; the first acquisition unit is configured to obtain m to-be-sent data packets from a host memory of the first host, where m is a positive integer; the backup unit is configured to back up the m data packets to a network interface controller memory integrated into the RDMA network interface controller; the sending unit is configured to send the m data packets to a second RDMA network interface controller of a second host; the second acquisition unit is configured to: if it is determined that the second RDMA network interface controller does not receive n data packets in the m data packets, obtain the backed-up n data packets from the network interface controller memory, where n is a positive integer; and the sending unit is further configured to retransmit the backed-up n data packets to the second
  • an implementation of the present specification provides an RDMA network interface controller, where the RDMA network interface controller is located in a second host, and the RDMA network interface controller includes: a communications unit, configured to communicate with a first RDMA network interface controller of a first host, so as to receive m data packets from the first RDMA network interface controller, where the m data packets are backed up by the first RDMA network interface controller to a first network interface controller memory integrated into the first RDMA network interface controller, and m is a positive integer; a data packet management unit, configured to: if it is determined that n data packets in the m data packets are not received, perform the following operations: storing received data packets in the m data packets into a second network interface controller memory integrated into the RDMA network interface controller, and waiting to receive the n data packets retransmitted by the first RDMA network interface controller, where the retransmitted n data packets are obtained by the first RDMA network interface controller from the first network
  • an implementation of the present specification provides a computing device, including a top-layer application, a host memory, and an RDMA network interface controller, where the top-layer application stores m to-be-sent data packets into the host memory, and sends instructions associated with the m data packets to the RDMA network interface controller, where m is a positive integer; and the RDMA network interface controller is integrated with a network interface controller memory and implements, based on the instructions, the method executed by the first RDMA network interface controller.
  • an implementation of the present specification provides a computing device, including a top-layer application, a host memory, and an RDMA network interface controller.
  • the RDMA network interface controller is integrated with a network interface controller memory and implements the method executed by the second RDMA network interface controller.
  • the RDMA network interface controller sends, to the top-layer application, instructions associated with m data packets transmitted to the host memory, where m is a positive integer; and the top-layer application processes the m data packets based on the instructions.
  • the RDMA network interface controller can back up the to-be-sent data packet in the network interface controller memory integrated in the RDMA network interface controller, when retransmission is required, the to-be-retransmitted data packet can be quickly obtained without accessing the host memory again. Therefore, not only bandwidth (for example, PCIe bandwidth) occupation of the host can be reduced, but also retransmission efficiency and retransmission performance can be greatly improved. As such, a packet loss tolerance capability of the RDMA network interface controller can be significantly enhanced, so the RDMA technology can be effectively implemented on a lossy network.
  • bandwidth for example, PCIe bandwidth
  • FIG. 1 is a schematic flowchart illustrating a data transmission method, according to an implementation
  • FIG. 2 is a schematic flowchart illustrating a data transmission method, according to an implementation
  • FIG. 3A is a schematic diagram illustrating an example of a data transmission scenario, according to an implementation
  • FIG. 3B is a schematic flowchart illustrating a data transmission process, according to an implementation
  • FIG. 4 is a schematic block diagram illustrating an RDMA network interface controller, according to an implementation
  • FIG. 5 is a schematic block diagram illustrating an RDMA network interface controller, according to an implementation
  • FIG. 6 is a schematic structural diagram illustrating a computing device, according to an implementation
  • FIG. 7 is a schematic structural diagram illustrating a computing device, according to an implementation.
  • the RDMA technology is usually implemented on a hardware network interface controller, and this network interface controller can be referred to as an RDMA network interface controller.
  • the RDMA network interface controller can be usually connected to a host memory by using a peripheral component interconnect express (PCIe), so as to implement direct access to the host memory. Costs of accessing the host memory by the network interface controller by using the PCIe are usually relatively high. Therefore, the RDMA technology currently used in the industry usually requires that a network is lossless (that is, the network does not lose a packet).
  • PCIe peripheral component interconnect express
  • the RDMA network interface controller needs to retransmit all subsequent data packets after the lost data packet, and the RDMA network interface controller needs to obtain these data packets from the host memory again by using the PCIe, which seriously affects data transmission efficiency. Therefore, the probability of such retransmission can be greatly reduced by ensuring that the network is lossless.
  • a host that sends data is referred to as a first host
  • a host that receives data is referred to as a second host.
  • the first host can include a first RDMA network interface controller used to implement the RDMA technology, and can further include a host memory.
  • the first RDMA network interface controller can connect to the host memory of the first host by using a PCIe, so as to access data on the host memory.
  • the first RDMA network interface controller can further have a programmable function, for example, have a field programmable gate array (FPGA).
  • FPGA field programmable gate array
  • the FPGA is programmed, so the first RDMA network interface controller can implement various operations described in the present specification.
  • the second host can also include a second RDMA network interface controller used to implement the RDMA technology and a host memory.
  • the second RDMA network interface controller can connect to the host memory of the second host by using a PCIe, so as to access data on the host memory.
  • the second RDMA network interface controller can also have a programmable function, for example, have an FPGA.
  • the FPGA is programmed, so the second RDMA network interface controller can implement various operations described in the present specification.
  • first RDMA network interface controller and the second RDMA network interface controller can communicate with each other, so as to implement data transmission.
  • the first RDMA network interface controller can be integrated with a memory.
  • the memory is referred to as a network interface controller memory in the present specification.
  • the network interface controller memory can be a dynamic random access memory (DRAM) or any other applicable memory. This is not limited in the present specification.
  • the network interface controller memory of the first RDMA network interface controller can be configured to back up a to-be-sent data packet. As such, when retransmission needs to be performed, the first RDMA network interface controller can directly obtain the backed-up data packet from the network interface controller memory, and does not need to obtain the to-be-retransmitted data packet from the host memory again by using the PCIe.
  • the RDMA network interface controller can back up the to-be-sent data packet in the network interface controller memory integrated in the RDMA network interface controller, when retransmission is required, the to-be-retransmitted data packet can be quickly obtained without accessing the host memory again. Therefore, not only bandwidth (for example, PCIe bandwidth) occupation of the host can be reduced, but also retransmission efficiency and retransmission performance can be greatly improved. As such, a packet loss tolerance capability of the RDMA network interface controller can be significantly enhanced, so the RDMA technology can be effectively implemented on a lossy network.
  • bandwidth for example, PCIe bandwidth
  • a sliding window and a flow control mechanism can be implemented at a hardware level, thereby reducing impact of RDMA traffic on the network and reducing the probability of network congestion.
  • the second RDMA network interface controller can be integrated with a network interface controller memory.
  • the network interface controller memory of the second RDMA network interface controller can be configured to cache a received data packet. For example, when a packet loss occurs, the second RDMA network interface controller can cache the received data packet into the network interface controller memory.
  • the retransmitted data packet is received, the retransmitted data packet and the previously received data packet are transmitted together to the host memory of the second host.
  • the first RDMA network interface controller can retransmit only a lost data packet, and does not need to retransmit all subsequent data packets. Therefore, the number of data packets to be retransmitted can be greatly reduced, thereby effectively shortening a retransmission time and improving retransmission performance.
  • FIG. 1 is a schematic flowchart illustrating a data transmission method, according to an implementation.
  • the method in FIG. 1 can be performed by a first RDMA network interface controller of a first host.
  • the first RDMA network interface controller can obtain m to-be-sent data packets from a host memory of the first host, m is a positive integer.
  • the first RDMA network interface controller can send the m data packets to a second RDMA network interface controller of a second host, and back up the m data packets to a network interface controller memory integrated into the first RDMA network interface controller.
  • step 106 if the first RDMA network interface controller determines that the second RDMA network interface controller does not receive n data packets in the m data packets, the first RDMA network interface controller can obtain the backed-up n data packets from the network interface controller memory, where n is a positive integer.
  • the first RDMA network interface controller can retransmit the backed-up n data packets to the second RDMA network interface controller.
  • the RDMA network interface controller can back up the to-be-sent data packet in the network interface controller memory integrated in the RDMA network interface controller, when retransmission is required, the to-be-retransmitted data packet can be quickly obtained without accessing the host memory again. Therefore, not only bandwidth (for example, PCIe bandwidth) occupation of the host can be reduced, but also retransmission efficiency and retransmission performance can be greatly improved. As such, a packet loss tolerance capability of the RDMA network interface controller can be significantly enhanced, so the RDMA technology can be effectively implemented on a lossy network.
  • bandwidth for example, PCIe bandwidth
  • a top-layer application of the first host can store the m data packets in the host memory of the first host.
  • the top-layer application can send, to the first RDMA network interface controller, instructions about the data packets that need to be sent.
  • the first RDMA network interface controller can obtain the m to-be-sent data packets from the host memory of the first host.
  • the first RDMA network interface controller can send the m data packets to the second RDMA network interface controller of the second host.
  • the first RDMA network interface controller can back up the m data packets to the network interface controller memory integrated into the first RDMA network interface controller.
  • the network interface controller memory can be a DRAM or any other suitable type of memory.
  • the first RDMA network interface controller can determine whether the second RDMA network interface controller receives the m data packets. For example, the first RDMA network interface controller can start a corresponding timer for each sent data packet. If the first RDMA network interface controller receives an Acknowledgement (ACK) response for a corresponding data packet from the second RDMA network interface controller before the timer expires, the first RDMA network interface controller can determine that the second RDMA network interface controller has received the corresponding data packet.
  • ACK Acknowledgement
  • the first RDMA network interface controller can consider that the second RDMA network interface controller does not receive the corresponding data packet.
  • the second RDMA network interface controller can return a Negative-Acknowledgement (NACK) response to the first RDMA network interface controller for a data packet that is not received.
  • NACK Negative-Acknowledgement
  • the first RDMA network interface controller can determine, based on the NACK response, which data packets are not received by the second RDMA network interface controller.
  • the ACK response or NACK response for the data packet can be implemented in multiple forms. For example, one message indicating an ACK or NACK response can be sent for each data packet. Or one message can be sent for several data packets, and the message can include ACK or NACK responses respectively corresponding to the data packets.
  • the first RDMA network interface controller can determine that the second RDMA network interface controller does not receive the n data packets in the m data packets.
  • the first RDMA network interface controller can determine that the second RDMA network interface controller does not receive the n data packets.
  • the predetermined duration can be duration of a timer for each of the n data packets. The predetermined duration can be determined in advance based on various factors such as an actual application scenario.
  • the first RDMA network interface controller can receive NACK responses for the n data packets from the second RDMA network interface controller. As such, the first RDMA network interface controller can determine that the second RDMA network interface controller does not receive the n data packets.
  • the second RDMA network interface controller does not receive the data packets. For example, the data packets are lost in the network transmission process, or the data packets cannot be successfully received due to a bit error or an insufficient receiving capability of the second host.
  • the first RDMA network interface controller needs to retransmit the n data packets that are not successfully received to the second RDMA network interface controller.
  • the first RDMA network interface controller backs up the m data packets that have been sent in the network interface controller memory of the first RDMA network interface controller, the first RDMA network interface controller can directly obtain the n data packets to be retransmitted from the network interface controller memory of the first RDMA network interface controller without needing to access the host memory of the first host again by using, for example, a PCIe.
  • the first RDMA network interface controller can delete the backed-up m data packets from the network interface controller memory.
  • the first RDMA network interface controller can determine that the second RDMA network interface controller has received the m data packets. In this case, the first RDMA network interface controller can delete the backed-up m data packets from the network interface controller memory.
  • the first RDMA network interface controller can confirm that the second RDMA network interface controller has received all the m data packets, and the first RDMA network interface controller can delete the backed-up m data packets from the network interface controller memory.
  • FIG. 1 is described from the perspective of the RDMA network interface controller at the transmitting end. The following describes the operation of the RDMA network interface controller at the receiving end.
  • FIG. 2 is a schematic flowchart illustrating a data transmission method, according to an implementation.
  • the method in FIG. 2 can be performed by a second RDMA network interface controller of a second host.
  • a network interface controller memory integrated into a first RDMA network interface controller of a first host is referred to as a first network interface controller memory
  • a network interface controller memory integrated into the second RDMA network interface controller of the second host is referred to as a second network interface controller memory.
  • the second RDMA network interface controller can communicate with the first RDMA network interface controller of the first host, so as to receive m data packets from the first RDMA network interface controller.
  • the m data packets can be backed up by the first RDMA network interface controller to the first network interface controller memory.
  • the first RDMA network interface controller can back up the m data packets to the first network interface controller memory while sending the m data packets, where m can be a positive integer.
  • step 204 if the second RDMA network interface controller determines that n data packets in the m data packets are not received, the second RDMA network interface controller can store received data packets in the m data packets into the second network interface controller memory, and can wait to receive the n data packets retransmitted by the first RDMA network interface controller.
  • the retransmitted n data packets can be obtained by the first RDMA network interface controller from the first network interface controller memory, and n can be a positive integer.
  • the second RDMA network interface controller can transmit, together with the n data packets, the previously received data packets stored in the second network interface controller memory to a host memory of the second host.
  • the second RDMA network interface controller can store the received data packets in the network interface controller memory when waiting for the to-be-retransmitted data packets, and can send all the received data packets to the host memory after receiving all the data packets.
  • the first RDMA network interface controller can retransmit only data packets that are not received by the second RDMA network interface controller, and does not need to retransmit all data packets subsequent to a lost data packet, thereby effectively reducing the number of data packets to be retransmitted, and effectively improving retransmission efficiency.
  • the first RDMA network interface controller can back up the to-be-sent data packets in the network interface controller memory integrated into the first RDMA network interface controller, it is unnecessary for the first RDMA network interface controller to access the host memory again during retransmission. Therefore, not only bandwidth occupation of the host can be reduced, but also retransmission efficiency and retransmission performance can be greatly improved.
  • a packet loss tolerance capability of the RDMA network interface controller can be greatly enhanced, so the RDMA technology can be effectively implemented on a lossy network.
  • the first network interface controller memory can be a DRAM or any other suitable type of memory.
  • the second network interface controller memory can be a DRAM or any other suitable type of memory.
  • the second RDMA network interface controller can detect, in multiple ways, whether the m data packets are received.
  • the m data packets can have a certain sequence, and each data packet can include a corresponding sequence number.
  • the second RDMA network interface controller can determine, based on sequence numbers in the received data packets, whether the m data packets have been received.
  • the second RDMA network interface controller can determine that the n data packets in the m data packets are not received.
  • the second RDMA network interface controller can send NACK responses for the n data packets to the first RDMA network interface controller.
  • the second RDMA network interface controller may not send ACK responses for the n data packets.
  • the first RDMA network interface controller can determine that no ACK response for a data packet is received from the second RDAM network interface controller within predetermined duration for the data packet, and can consider that the second RDMA network interface controller does not receive the data packet.
  • FIG. 3A is a schematic diagram illustrating an example of a data transmission scenario, according to an implementation.
  • a first host 302 A can include a top-layer application, a host memory, and a first RDMA network interface controller.
  • the top-layer application of the first host can store to-be-sent data packets in the host memory of the first host, and can send instructions about the to-be-sent data packets to the first RDMA network interface controller.
  • the first RDMA network interface controller can obtain the to-be-sent data packets from the host memory of the first host, and can send the obtained data packets to a second RDMA network interface controller.
  • the first RDMA network interface controller can back up the obtained data packets to a DRAM integrated into the first RDMA network interface controller. If retransmission needs to be performed, the first RDMA network interface controller can obtain to-be-retransmitted data packets from the DRAM of the first RDMA network interface controller.
  • a second host 304 A can include a top-layer application, a host memory, and the second RDMA network interface controller.
  • the second RDMA network interface controller can receive data packets from the first RDMA network interface controller. If there is a data packet not received, the second RDMA network interface controller can temporarily store received data packets into a DRAM integrated into the second RDMA network interface controller. Then, the second RDMA network interface controller can receive the retransmitted data packet from the first RDAM network interface controller. After receiving all the data packets, the second RDAM network interface controller can send all the data packets to the host memory of the second host. Then, the second RDAM network interface controller can notify the top-layer application of the second host of completion of data transmission, for example, send an interrupt instruction to the top-layer application. Then, the top-layer application of the second host can process the corresponding data packets in the host memory.
  • first host and the second host are shown as including three parts in the example in FIG. 3A .
  • first host and the second host can further include other related parts that are known in the art. This is not limited in the present specification.
  • FIG. 3B is a schematic flowchart illustrating a data transmission process, according to an implementation.
  • a first RDMA network interface controller can obtain m to-be-sent data packets from a host memory of a first host.
  • the first RDMA network interface controller can back up the m data packets to a DRAM integrated into the first RDMA network interface controller.
  • the first RDMA network interface controller can send the m data packets to a second RDMA network interface controller.
  • step 304 B is first performed and then step 306 B is performed, in different scenarios, steps 304 B and 306 B can be performed in parallel. Or step 306 B can be performed before step 304 B. This is not limited in the present specification.
  • step 308 B the second RDMA network interface controller can determine that n data packets are not received.
  • the second RDMA network interface controller can store received data packets in the m data packets into a DRAM integrated into the second RDMA network interface controller.
  • the second RDMA network interface controller can send ACK responses for the received data packets and NACK responses for the n data packets that are not received to the first RDMA network interface controller.
  • step 310 B is first performed and then step 312 B is performed, in different scenarios, steps 310 B and 312 B can be performed in parallel. Or step 312 B can be performed before step 310 B. This is not limited in the present specification.
  • the first RDMA network interface controller can obtain the n data packets to be retransmitted from the DRAM of the first RDMA network interface controller.
  • step 316 B the first RDMA network interface controller can send the n data packets obtained in step 314 B to the second RDMA network interface controller.
  • the second RDMA network interface controller can determine that the n data packets are received, and send ACK responses for the n data packets to the first RDMA network interface controller.
  • the second RDMA network interface controller can transmit the previously received data packets stored in the DRAM of the second RDMA network interface controller together with the n data packets received in step 316 B to a host memory of a second host.
  • the first RDMA network interface controller can determine that the second RDMA network interface controller has received all the m data packets, and delete the backed-up m data packets from the network interface controller memory of the first RDMA network interface controller.
  • the transmitting end RDMA network interface controller can back up the to-be-sent data packets in the network interface controller memory integrated into the transmitting end RDMA network interface controller, and the receiving end RDMA network interface controller can temporarily store the received data packets in the network interface controller memory integrated into the receiving end RDMA network interface controller when waiting for the to-be-retransmitted data packets. Therefore, the transmitting end RDMA network interface controller can quickly obtain and retransmit the data packets not received by the receiving end RDMA network interface controller, thereby greatly improving retransmission efficiency and retransmission performance.
  • a packet loss tolerance capability of the RDMA network interface controller can be significantly enhanced, so the RDMA technology can be effectively implemented on a lossy network.
  • FIG. 4 is a schematic block diagram illustrating an RDMA network interface controller, according to an implementation.
  • an RDMA network interface controller 400 in FIG. 4 can be the previous first RDMA network interface controller.
  • the RDMA network interface controller 400 can be implemented by combining software and hardware.
  • the RDMA network interface controller 400 can include a first acquisition unit 402 , a sending unit 404 , a backup unit 406 , and a second acquisition unit 408 .
  • the first acquisition unit 402 can obtain m to-be-sent data packets from a host memory of a first host, where m is a positive integer.
  • the sending unit 404 can send the m data packets to a second RDMA network interface controller of a second host.
  • the backup unit 406 can back up the m data packets to a network interface controller memory integrated into the RDMA network interface controller 400 .
  • the second acquisition unit 408 determines that the second RDMA network interface controller does not receive n data packets in the m data packets, the second acquisition unit 408 obtains the backed-up n data packets from the network interface controller memory, where n is a positive integer.
  • the sending unit 404 can retransmit the backed-up n data packets to the second RDMA network interface controller.
  • the RDMA network interface controller can back up the to-be-sent data packet in the network interface controller memory integrated in the RDMA network interface controller, when retransmission is required, the to-be-retransmitted data packet can be quickly obtained without accessing the host memory again. Therefore, not only bandwidth (for example, PCIe bandwidth) occupation of the host can be reduced, but also retransmission efficiency and retransmission performance can be greatly improved. As such, a packet loss tolerance capability of the RDMA network interface controller can be significantly enhanced, so the RDMA technology can be effectively implemented on a lossy network.
  • bandwidth for example, PCIe bandwidth
  • the network interface controller memory can be a DRAM.
  • the second acquisition unit 408 can determine that no ACK response for the n data packets is received from the second RDMA network interface controller within predetermined duration. Or the second acquisition unit 408 can determine that NACK responses for the n data packets are received from the second RDMA network interface controller.
  • the backup unit 406 can delete the backed-up m data packets from the network interface controller memory.
  • Units of the RDMA network interface controller 400 can perform corresponding steps in the implementations shown in FIG. 1 and FIG. 3A and FIG. 3B . Therefore, for brevity of description, specific operations and functions of the units of the RDMA network interface controller 400 are not described here again.
  • FIG. 5 is a schematic block diagram illustrating an RDMA network interface controller, according to an implementation.
  • an RDMA network interface controller 500 in FIG. 5 can be the previous second RDMA network interface controller.
  • the RDMA network interface controller 500 can be implemented by combining software and hardware.
  • the RDMA network interface controller 500 can include a communications unit 502 , a data packet management unit 504 , and a data packet management unit 506 .
  • the communications unit 502 can communicate with a first RDMA network interface controller of a first host, so as to receive m data packets from the first RDMA network interface controller, where the m data packets are backed up by the first RDMA network interface controller to a first network interface controller memory integrated into the first RDMA network interface controller, and m is a positive integer.
  • the data packet management unit 504 can store received data packets in the m data packets into a second network interface controller memory integrated into the RDMA network interface controller 500 , and can wait to receive the n data packets by the first RDMA network interface controller, where the retransmitted n data packets are obtained by the first RDMA network interface controller from the first network interface controller memory, and n is a positive integer.
  • the data packet management unit 504 transmits the received data packets together with the retransmitted n data packets to a host memory of a second host.
  • the receiving end RDMA network interface controller can store the received data packets in the network interface controller memory when waiting for the to-be-retransmitted data packets, and can send all the received data packets to the host memory after receiving all the data packets.
  • the first RDMA network interface controller can retransmit only data packets that are not received, and does not need to retransmit all data packets subsequent to a lost data packet, thereby effectively reducing the number of data packets to be retransmitted, and effectively improving retransmission efficiency.
  • the first RDMA network interface controller can back up the to-be-sent data packets in the network interface controller memory integrated into the first RDMA network interface controller, it is unnecessary for the first RDMA network interface controller to access the host memory again during retransmission. Therefore, not only bandwidth occupation of the host can be reduced, but also retransmission efficiency and retransmission performance can be greatly improved.
  • the first network interface controller memory can be a DRAM.
  • the second network interface controller memory can be a DRAM.
  • the communications unit 502 can send NACK responses for the n data packets to the first RDMA network interface controller, or the communications unit 502 may not send ACK responses for the n data packets to the first RDMA network interface controller.
  • Units of the RDMA network interface controller 500 can perform corresponding steps in the implementations shown in FIG. 2 to FIG. 3B . Therefore, for brevity of description, specific operations and functions of the units of the RDMA network interface controller 500 are not described here again.
  • FIG. 6 is a schematic structural diagram illustrating a computing device, according to an implementation.
  • a computing device 600 can include a top-layer application 602 , a host memory 604 , and an RDMA network interface controller 606 .
  • the computing device 600 can be corresponding to the previous first host.
  • the top-layer application 602 can store m to-be-sent data packets into the host memory 604 , and send instructions associated with the m data packets to the RDMA network interface controller 606 .
  • the top-layer application 602 and the host memory 604 refer to corresponding descriptions in the previous implementations. Details are omitted here for simplicity.
  • the RDMA network interface controller 606 is integrated with a network interface controller memory, and can implement functions of the first RDMA network interface controller based on the previous instructions.
  • the computing device 600 can be implemented in any applicable form in the art, for example, includes but is not limited to a desktop computer, a laptop computer, a smartphone, a tablet computer, a consumer electronic device, a wearable intelligent device, etc.
  • FIG. 7 is a schematic structural diagram illustrating a computing device, according to an implementation.
  • a computing device 700 can include a top-layer application 702 , a host memory 704 , and an RDMA network interface controller 706 .
  • the computing device 700 can be corresponding to the previous second host.
  • the RDMA network interface controller 706 can be integrated with a network interface controller memory, and can implement functions of the previous second RDMA network interface controller.
  • the RDMA network interface controller 706 can send, to the top-layer application, instructions about m data packets transmitted to the host memory 704 .
  • the top-layer application processes the m data packets based on the instructions.
  • top-layer application 702 and the host memory 704 For specific functions of the top-layer application 702 and the host memory 704 , refer to corresponding descriptions in the previous implementations. Details are omitted here for simplicity.
  • the computing device 700 can be implemented in any applicable form in the art, for example, includes but is not limited to a desktop computer, a laptop computer, a smartphone, a tablet computer, a consumer electronic device, a wearable intelligent device, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Computer Security & Cryptography (AREA)
  • Computing Systems (AREA)
  • Communication Control (AREA)
  • Detection And Prevention Of Errors In Transmission (AREA)

Abstract

Implementations of this disclosure provide data transmission operations and network interface controllers. An example method performed by a first RDMA network interface controller includes obtaining m data packets from a host memory of a first host; sending the m data packets to a second RDMA network interface controller of a second host; backing up the m data packets to a network interface controller memory integrated into the first RDMA network interface controller; determining that the second RDMA network interface controller does not receive n data packets of the m data packets; and in response, obtaining the n data packets from the m data packets that have been backed up to the network interface controller memory integrated into the first RDMA network interface controller, and retransmitting the n data packets to the second RDMA network interface controller.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS
This application is a continuation of U.S. patent application Ser. No. 16/809,273, filed on Mar. 4, 2020, with is a continuation of PCT Application No. PCT/CN2020/071728, filed on Jan. 13, 2020, which claims priority to Chinese Patent Application No. 201910624819.8, filed on Jul. 11, 2019, and each application is hereby incorporated by reference in its entirety.
TECHNICAL FIELD
Implementations of the present disclosure relate to the field of information technologies, and more specifically, to data transmission methods, remote direct memory access (RDMA) network interface controllers, and computing devices.
BACKGROUND
RDMA technology can directly move data from a storage area of one host to a storage area of another host without involving an operating system and a central processing unit (CPU), thereby greatly reducing a data processing delay and improving transmission performance. Accelerating network data transmission by using the RDMA technology is a topic of research.
SUMMARY
Considering problems in the existing technology, implementations of the present specification provide data transmission methods, RDMA network interface controllers (RNICs), and computing devices.
According to one aspect, an implementation of the present specification provides a data transmission method, where the method is executed by a first RDMA network interface controller of a first host, and the method includes: obtaining m to-be-sent data packets from a host memory of the first host, where m is a positive integer; sending the m data packets to a second RDMA network interface controller of a second host, and backing up the m data packets to a network interface controller memory integrated into the first RDMA network interface controller; if determining that the second RDMA network interface controller does not receive n data packets in the m data packets, obtaining the backed-up n data packets from the network interface controller memory, where n is a positive integer; and retransmitting the backed-up n data packets to the second RDMA network interface controller.
According to another aspect, an implementation of the present specification provides a data transmission method, where the method is executed by a second RDMA network interface controller of a second host, and the method includes: communicating with a first RDMA network interface controller of a first host, so as to receive m data packets from the first RDMA network interface controller, where the m data packets are backed up by the first RDMA network interface controller to a first network interface controller memory integrated into the first RDMA network interface controller, and m is a positive integer; and if determining that n data packets in the m data packets are not received, performing the following operations: storing received data packets in the m data packets into a second network interface controller memory integrated into the second RDMA network interface controller; waiting to receive the n data packets retransmitted by the first RDMA network interface controller, where the retransmitted n data packets are obtained by the first RDMA network interface controller from the first network interface controller memory, and n is a positive integer; and after the retransmitted n data packets are received, transmitting the received data packets together with the retransmitted n data packets to a host memory of the second host.
According to another aspect, an implementation of the present specification provides an RDMA network interface controller, where the RDMA network interface controller is located in a first host, and the RDMA network interface controller includes a first acquisition unit, a backup unit, a sending unit, and a second acquisition unit; the first acquisition unit is configured to obtain m to-be-sent data packets from a host memory of the first host, where m is a positive integer; the backup unit is configured to back up the m data packets to a network interface controller memory integrated into the RDMA network interface controller; the sending unit is configured to send the m data packets to a second RDMA network interface controller of a second host; the second acquisition unit is configured to: if it is determined that the second RDMA network interface controller does not receive n data packets in the m data packets, obtain the backed-up n data packets from the network interface controller memory, where n is a positive integer; and the sending unit is further configured to retransmit the backed-up n data packets to the second RDMA network interface controller.
According to another aspect, an implementation of the present specification provides an RDMA network interface controller, where the RDMA network interface controller is located in a second host, and the RDMA network interface controller includes: a communications unit, configured to communicate with a first RDMA network interface controller of a first host, so as to receive m data packets from the first RDMA network interface controller, where the m data packets are backed up by the first RDMA network interface controller to a first network interface controller memory integrated into the first RDMA network interface controller, and m is a positive integer; a data packet management unit, configured to: if it is determined that n data packets in the m data packets are not received, perform the following operations: storing received data packets in the m data packets into a second network interface controller memory integrated into the RDMA network interface controller, and waiting to receive the n data packets retransmitted by the first RDMA network interface controller, where the retransmitted n data packets are obtained by the first RDMA network interface controller from the first network interface controller memory, and n is a positive integer; and a transmission unit, configured to: after the communications unit receives the retransmitted n data packets, transmit the received data packets together with the retransmitted n data packets to a host memory of the second host.
According to another aspect, an implementation of the present specification provides a computing device, including a top-layer application, a host memory, and an RDMA network interface controller, where the top-layer application stores m to-be-sent data packets into the host memory, and sends instructions associated with the m data packets to the RDMA network interface controller, where m is a positive integer; and the RDMA network interface controller is integrated with a network interface controller memory and implements, based on the instructions, the method executed by the first RDMA network interface controller.
According to another aspect, an implementation of the present specification provides a computing device, including a top-layer application, a host memory, and an RDMA network interface controller. The RDMA network interface controller is integrated with a network interface controller memory and implements the method executed by the second RDMA network interface controller. The RDMA network interface controller sends, to the top-layer application, instructions associated with m data packets transmitted to the host memory, where m is a positive integer; and the top-layer application processes the m data packets based on the instructions.
In this technical solution, because the RDMA network interface controller can back up the to-be-sent data packet in the network interface controller memory integrated in the RDMA network interface controller, when retransmission is required, the to-be-retransmitted data packet can be quickly obtained without accessing the host memory again. Therefore, not only bandwidth (for example, PCIe bandwidth) occupation of the host can be reduced, but also retransmission efficiency and retransmission performance can be greatly improved. As such, a packet loss tolerance capability of the RDMA network interface controller can be significantly enhanced, so the RDMA technology can be effectively implemented on a lossy network.
BRIEF DESCRIPTION OF DRAWINGS
By describing the implementations of the present specification in more detail with reference to the accompanying drawings, the previous and other objectives, features, and advantages of the implementations of the present specification become clearer. In the implementations of the present specification, the same reference numerals usually represent the same elements.
FIG. 1 is a schematic flowchart illustrating a data transmission method, according to an implementation;
FIG. 2 is a schematic flowchart illustrating a data transmission method, according to an implementation;
FIG. 3A is a schematic diagram illustrating an example of a data transmission scenario, according to an implementation;
FIG. 3B is a schematic flowchart illustrating a data transmission process, according to an implementation;
FIG. 4 is a schematic block diagram illustrating an RDMA network interface controller, according to an implementation;
FIG. 5 is a schematic block diagram illustrating an RDMA network interface controller, according to an implementation;
FIG. 6 is a schematic structural diagram illustrating a computing device, according to an implementation;
FIG. 7 is a schematic structural diagram illustrating a computing device, according to an implementation.
DESCRIPTION OF IMPLEMENTATIONS
The subject matter described in the present specification will be discussed with reference to implementations. It should be understood that these implementations are merely discussed to enable a person skilled in the art to better understand and implement the subject matter described in the present specification, and are not intended to limit the protection scope, applicability, or examples described in the claims. The functions and arrangements of the elements under discussion can be changed without departing from the protection scope of the claims. Various processes or components can be omitted, replaced, or added in the implementations as needed.
In the RDMA technology, because an operating system and a CPU do not need to participate in a data migration process, using the RDMA technology to accelerate network data transmission is gradually becoming one of the hot research topics.
Currently, the RDMA technology is usually implemented on a hardware network interface controller, and this network interface controller can be referred to as an RDMA network interface controller. The RDMA network interface controller can be usually connected to a host memory by using a peripheral component interconnect express (PCIe), so as to implement direct access to the host memory. Costs of accessing the host memory by the network interface controller by using the PCIe are usually relatively high. Therefore, the RDMA technology currently used in the industry usually requires that a network is lossless (that is, the network does not lose a packet). This is because when a packet is lost, the RDMA network interface controller needs to retransmit all subsequent data packets after the lost data packet, and the RDMA network interface controller needs to obtain these data packets from the host memory again by using the PCIe, which seriously affects data transmission efficiency. Therefore, the probability of such retransmission can be greatly reduced by ensuring that the network is lossless.
However, the current objective of ensuring that a network is lossless is generally achieved by enabling a flow control mechanism in a data center. But, enabling flow control greatly increases operation and maintenance costs of the data center, and also brings potential problems (for example, congestion diffusion). In addition, even if flow control is enabled, it is still difficult to fully avoid packet loss due to factors such as bit errors on a physical link. Therefore, it is desirable to improve the tolerance of the RDMA technology to packet loss.
In view of this, the present specification provides a technical solution for data transmission. In the present specification, for ease of description, a host that sends data is referred to as a first host, and a host that receives data is referred to as a second host.
The first host can include a first RDMA network interface controller used to implement the RDMA technology, and can further include a host memory. For example, the first RDMA network interface controller can connect to the host memory of the first host by using a PCIe, so as to access data on the host memory. In addition, the first RDMA network interface controller can further have a programmable function, for example, have a field programmable gate array (FPGA). For example, the FPGA is programmed, so the first RDMA network interface controller can implement various operations described in the present specification.
The second host can also include a second RDMA network interface controller used to implement the RDMA technology and a host memory. For example, the second RDMA network interface controller can connect to the host memory of the second host by using a PCIe, so as to access data on the host memory. The second RDMA network interface controller can also have a programmable function, for example, have an FPGA. For example, the FPGA is programmed, so the second RDMA network interface controller can implement various operations described in the present specification.
In addition, the first RDMA network interface controller and the second RDMA network interface controller can communicate with each other, so as to implement data transmission.
The first RDMA network interface controller can be integrated with a memory. For ease of description, the memory is referred to as a network interface controller memory in the present specification. For example, the network interface controller memory can be a dynamic random access memory (DRAM) or any other applicable memory. This is not limited in the present specification. The network interface controller memory of the first RDMA network interface controller can be configured to back up a to-be-sent data packet. As such, when retransmission needs to be performed, the first RDMA network interface controller can directly obtain the backed-up data packet from the network interface controller memory, and does not need to obtain the to-be-retransmitted data packet from the host memory again by using the PCIe.
It can be seen that in this technical solution, because the RDMA network interface controller can back up the to-be-sent data packet in the network interface controller memory integrated in the RDMA network interface controller, when retransmission is required, the to-be-retransmitted data packet can be quickly obtained without accessing the host memory again. Therefore, not only bandwidth (for example, PCIe bandwidth) occupation of the host can be reduced, but also retransmission efficiency and retransmission performance can be greatly improved. As such, a packet loss tolerance capability of the RDMA network interface controller can be significantly enhanced, so the RDMA technology can be effectively implemented on a lossy network.
In addition, by backing up data in the network interface controller memory of the RDMA network interface controller, a sliding window and a flow control mechanism can be implemented at a hardware level, thereby reducing impact of RDMA traffic on the network and reducing the probability of network congestion.
In addition, the second RDMA network interface controller can be integrated with a network interface controller memory. The network interface controller memory of the second RDMA network interface controller can be configured to cache a received data packet. For example, when a packet loss occurs, the second RDMA network interface controller can cache the received data packet into the network interface controller memory. When the retransmitted data packet is received, the retransmitted data packet and the previously received data packet are transmitted together to the host memory of the second host. As such, the first RDMA network interface controller can retransmit only a lost data packet, and does not need to retransmit all subsequent data packets. Therefore, the number of data packets to be retransmitted can be greatly reduced, thereby effectively shortening a retransmission time and improving retransmission performance.
The following describes the technical solutions of the present specification with reference to specific implementations.
FIG. 1 is a schematic flowchart illustrating a data transmission method, according to an implementation. For example, the method in FIG. 1 can be performed by a first RDMA network interface controller of a first host.
As shown in FIG. 1, in step 102, the first RDMA network interface controller can obtain m to-be-sent data packets from a host memory of the first host, m is a positive integer.
In step 104, the first RDMA network interface controller can send the m data packets to a second RDMA network interface controller of a second host, and back up the m data packets to a network interface controller memory integrated into the first RDMA network interface controller.
In step 106, if the first RDMA network interface controller determines that the second RDMA network interface controller does not receive n data packets in the m data packets, the first RDMA network interface controller can obtain the backed-up n data packets from the network interface controller memory, where n is a positive integer.
In step 108, the first RDMA network interface controller can retransmit the backed-up n data packets to the second RDMA network interface controller.
It can be seen that in this technical solution, because the RDMA network interface controller can back up the to-be-sent data packet in the network interface controller memory integrated in the RDMA network interface controller, when retransmission is required, the to-be-retransmitted data packet can be quickly obtained without accessing the host memory again. Therefore, not only bandwidth (for example, PCIe bandwidth) occupation of the host can be reduced, but also retransmission efficiency and retransmission performance can be greatly improved. As such, a packet loss tolerance capability of the RDMA network interface controller can be significantly enhanced, so the RDMA technology can be effectively implemented on a lossy network.
In this solution, a sliding window and a flow control mechanism are actually implemented at a hardware level, thereby reducing impact of RDMA traffic on a network and reducing the probability of network congestion.
In an implementation, in step 102, when the m data packets need to be sent to the second host, a top-layer application of the first host can store the m data packets in the host memory of the first host. In addition, the top-layer application can send, to the first RDMA network interface controller, instructions about the data packets that need to be sent. When receiving the instruction, the first RDMA network interface controller can obtain the m to-be-sent data packets from the host memory of the first host.
In an implementation, in step 104, the first RDMA network interface controller can send the m data packets to the second RDMA network interface controller of the second host. In addition, the first RDMA network interface controller can back up the m data packets to the network interface controller memory integrated into the first RDMA network interface controller. For example, as described above, the network interface controller memory can be a DRAM or any other suitable type of memory.
Then, the first RDMA network interface controller can determine whether the second RDMA network interface controller receives the m data packets. For example, the first RDMA network interface controller can start a corresponding timer for each sent data packet. If the first RDMA network interface controller receives an Acknowledgement (ACK) response for a corresponding data packet from the second RDMA network interface controller before the timer expires, the first RDMA network interface controller can determine that the second RDMA network interface controller has received the corresponding data packet. When the timer expires, if the first RDMA network interface controller does not receive the ACK response for the corresponding data packet from the second RDMA network interface controller, the first RDMA network interface controller can consider that the second RDMA network interface controller does not receive the corresponding data packet.
For another example, the second RDMA network interface controller can return a Negative-Acknowledgement (NACK) response to the first RDMA network interface controller for a data packet that is not received. As such, the first RDMA network interface controller can determine, based on the NACK response, which data packets are not received by the second RDMA network interface controller.
It should be understood that the ACK response or NACK response for the data packet can be implemented in multiple forms. For example, one message indicating an ACK or NACK response can be sent for each data packet. Or one message can be sent for several data packets, and the message can include ACK or NACK responses respectively corresponding to the data packets.
For example, in the previous technical solution, the first RDMA network interface controller can determine that the second RDMA network interface controller does not receive the n data packets in the m data packets. As described above, in an implementation, if the first RDMA network interface controller does not receive ACK responses for the n data packets from the second RDMA network interface controller within predetermined duration, the first RDMA network interface controller can determine that the second RDMA network interface controller does not receive the n data packets. For example, the predetermined duration can be duration of a timer for each of the n data packets. The predetermined duration can be determined in advance based on various factors such as an actual application scenario.
In another implementation, the first RDMA network interface controller can receive NACK responses for the n data packets from the second RDMA network interface controller. As such, the first RDMA network interface controller can determine that the second RDMA network interface controller does not receive the n data packets.
There can be multiple reasons why the second RDMA network interface controller does not receive the data packets. For example, the data packets are lost in the network transmission process, or the data packets cannot be successfully received due to a bit error or an insufficient receiving capability of the second host.
In this case, the first RDMA network interface controller needs to retransmit the n data packets that are not successfully received to the second RDMA network interface controller. In this case, because the first RDMA network interface controller backs up the m data packets that have been sent in the network interface controller memory of the first RDMA network interface controller, the first RDMA network interface controller can directly obtain the n data packets to be retransmitted from the network interface controller memory of the first RDMA network interface controller without needing to access the host memory of the first host again by using, for example, a PCIe. As such, not only bandwidth occupation on the host can be reduced, but also time required for obtaining the to-be-retransmitted data packet can be greatly shortened, so retransmission efficiency can be improved, and a packet loss tolerance capability of the RDMA network interface controller is effectively enhanced.
In addition, when determining that the second RDMA network interface controller has received the m data packets, the first RDMA network interface controller can delete the backed-up m data packets from the network interface controller memory.
For example, if the first RDMA network interface controller receives ACK responses for the m data packets from the second RDMA network interface controller after the first time of transmission, the first RDMA network interface controller can determine that the second RDMA network interface controller has received the m data packets. In this case, the first RDMA network interface controller can delete the backed-up m data packets from the network interface controller memory.
For another example, after retransmission, if the first RDMA network interface controller receives ACK responses for the n retransmitted data packets from the second RDMA network interface controller, the first RDMA network interface controller can confirm that the second RDMA network interface controller has received all the m data packets, and the first RDMA network interface controller can delete the backed-up m data packets from the network interface controller memory.
It can be seen that the previous technical solution actually provides a sliding window mechanism and a flow control mechanism at a hardware level of the network interface controller, so efficient data retransmission can be implemented.
FIG. 1 is described from the perspective of the RDMA network interface controller at the transmitting end. The following describes the operation of the RDMA network interface controller at the receiving end.
FIG. 2 is a schematic flowchart illustrating a data transmission method, according to an implementation. For example, the method in FIG. 2 can be performed by a second RDMA network interface controller of a second host. In this implementation, for ease of description, a network interface controller memory integrated into a first RDMA network interface controller of a first host is referred to as a first network interface controller memory, and a network interface controller memory integrated into the second RDMA network interface controller of the second host is referred to as a second network interface controller memory.
As shown in FIG. 2, in step 202, the second RDMA network interface controller can communicate with the first RDMA network interface controller of the first host, so as to receive m data packets from the first RDMA network interface controller.
The m data packets can be backed up by the first RDMA network interface controller to the first network interface controller memory. For example, the first RDMA network interface controller can back up the m data packets to the first network interface controller memory while sending the m data packets, where m can be a positive integer.
In step 204, if the second RDMA network interface controller determines that n data packets in the m data packets are not received, the second RDMA network interface controller can store received data packets in the m data packets into the second network interface controller memory, and can wait to receive the n data packets retransmitted by the first RDMA network interface controller.
The retransmitted n data packets can be obtained by the first RDMA network interface controller from the first network interface controller memory, and n can be a positive integer.
In step 206, after receiving the retransmitted n data packets, the second RDMA network interface controller can transmit, together with the n data packets, the previously received data packets stored in the second network interface controller memory to a host memory of the second host.
In this technical solution, because the network interface controller memory is integrated into the second RDMA network interface controller, the second RDMA network interface controller can store the received data packets in the network interface controller memory when waiting for the to-be-retransmitted data packets, and can send all the received data packets to the host memory after receiving all the data packets. As such, the first RDMA network interface controller can retransmit only data packets that are not received by the second RDMA network interface controller, and does not need to retransmit all data packets subsequent to a lost data packet, thereby effectively reducing the number of data packets to be retransmitted, and effectively improving retransmission efficiency.
In addition, because the first RDMA network interface controller can back up the to-be-sent data packets in the network interface controller memory integrated into the first RDMA network interface controller, it is unnecessary for the first RDMA network interface controller to access the host memory again during retransmission. Therefore, not only bandwidth occupation of the host can be reduced, but also retransmission efficiency and retransmission performance can be greatly improved.
Therefore, in this technical solution, a packet loss tolerance capability of the RDMA network interface controller can be greatly enhanced, so the RDMA technology can be effectively implemented on a lossy network.
In an implementation, the first network interface controller memory can be a DRAM or any other suitable type of memory. The second network interface controller memory can be a DRAM or any other suitable type of memory.
The second RDMA network interface controller can detect, in multiple ways, whether the m data packets are received. For example, the m data packets can have a certain sequence, and each data packet can include a corresponding sequence number. The second RDMA network interface controller can determine, based on sequence numbers in the received data packets, whether the m data packets have been received.
For example, the second RDMA network interface controller can determine that the n data packets in the m data packets are not received. In an implementation, the second RDMA network interface controller can send NACK responses for the n data packets to the first RDMA network interface controller. In another implementation, the second RDMA network interface controller may not send ACK responses for the n data packets. For example, the first RDMA network interface controller can determine that no ACK response for a data packet is received from the second RDAM network interface controller within predetermined duration for the data packet, and can consider that the second RDMA network interface controller does not receive the data packet.
It can be seen that the previous technical solution provides a mechanism for efficient data retransmission between RDMA network interface controllers.
The following describes the technical solutions of the present specification with reference to specific examples. It should be understood that the following examples are merely intended to help a person skilled in the art better understand the previous technical solutions, rather than limit their scope.
FIG. 3A is a schematic diagram illustrating an example of a data transmission scenario, according to an implementation.
As shown in FIG. 3A, a first host 302A can include a top-layer application, a host memory, and a first RDMA network interface controller.
For example, the top-layer application of the first host can store to-be-sent data packets in the host memory of the first host, and can send instructions about the to-be-sent data packets to the first RDMA network interface controller. The first RDMA network interface controller can obtain the to-be-sent data packets from the host memory of the first host, and can send the obtained data packets to a second RDMA network interface controller. In addition, the first RDMA network interface controller can back up the obtained data packets to a DRAM integrated into the first RDMA network interface controller. If retransmission needs to be performed, the first RDMA network interface controller can obtain to-be-retransmitted data packets from the DRAM of the first RDMA network interface controller.
A second host 304A can include a top-layer application, a host memory, and the second RDMA network interface controller.
For example, the second RDMA network interface controller can receive data packets from the first RDMA network interface controller. If there is a data packet not received, the second RDMA network interface controller can temporarily store received data packets into a DRAM integrated into the second RDMA network interface controller. Then, the second RDMA network interface controller can receive the retransmitted data packet from the first RDAM network interface controller. After receiving all the data packets, the second RDAM network interface controller can send all the data packets to the host memory of the second host. Then, the second RDAM network interface controller can notify the top-layer application of the second host of completion of data transmission, for example, send an interrupt instruction to the top-layer application. Then, the top-layer application of the second host can process the corresponding data packets in the host memory.
It should be understood that, for ease of description, the first host and the second host are shown as including three parts in the example in FIG. 3A. However, in specific implementation, the first host and the second host can further include other related parts that are known in the art. This is not limited in the present specification.
The following describes specific operations of the first RDMA network interface controller and the second RDMA network interface controller in FIG. 3A with reference to a procedure in FIG. 3B. It should be understood that the procedure shown in FIG. 3B is only an example, and the scope of the technical solution in the present specification is not limited.
FIG. 3B is a schematic flowchart illustrating a data transmission process, according to an implementation.
As shown in FIG. 3B, in step 302B, a first RDMA network interface controller can obtain m to-be-sent data packets from a host memory of a first host.
In step 304B, the first RDMA network interface controller can back up the m data packets to a DRAM integrated into the first RDMA network interface controller.
In step 306B, the first RDMA network interface controller can send the m data packets to a second RDMA network interface controller.
It should be understood that although step 304B is first performed and then step 306B is performed, in different scenarios, steps 304B and 306B can be performed in parallel. Or step 306B can be performed before step 304B. This is not limited in the present specification.
In step 308B, the second RDMA network interface controller can determine that n data packets are not received.
In step 310B, the second RDMA network interface controller can store received data packets in the m data packets into a DRAM integrated into the second RDMA network interface controller.
In step 312B, the second RDMA network interface controller can send ACK responses for the received data packets and NACK responses for the n data packets that are not received to the first RDMA network interface controller.
It should be understood that although step 310B is first performed and then step 312B is performed, in different scenarios, steps 310B and 312B can be performed in parallel. Or step 312B can be performed before step 310B. This is not limited in the present specification.
In step 314B, the first RDMA network interface controller can obtain the n data packets to be retransmitted from the DRAM of the first RDMA network interface controller.
In step 316B, the first RDMA network interface controller can send the n data packets obtained in step 314B to the second RDMA network interface controller.
In step 318B, the second RDMA network interface controller can determine that the n data packets are received, and send ACK responses for the n data packets to the first RDMA network interface controller.
In step 320B, the second RDMA network interface controller can transmit the previously received data packets stored in the DRAM of the second RDMA network interface controller together with the n data packets received in step 316B to a host memory of a second host.
In step 322B, the first RDMA network interface controller can determine that the second RDMA network interface controller has received all the m data packets, and delete the backed-up m data packets from the network interface controller memory of the first RDMA network interface controller.
It can be seen that in this technical solution, the transmitting end RDMA network interface controller can back up the to-be-sent data packets in the network interface controller memory integrated into the transmitting end RDMA network interface controller, and the receiving end RDMA network interface controller can temporarily store the received data packets in the network interface controller memory integrated into the receiving end RDMA network interface controller when waiting for the to-be-retransmitted data packets. Therefore, the transmitting end RDMA network interface controller can quickly obtain and retransmit the data packets not received by the receiving end RDMA network interface controller, thereby greatly improving retransmission efficiency and retransmission performance. In this technical solution, a packet loss tolerance capability of the RDMA network interface controller can be significantly enhanced, so the RDMA technology can be effectively implemented on a lossy network.
FIG. 4 is a schematic block diagram illustrating an RDMA network interface controller, according to an implementation. For example, an RDMA network interface controller 400 in FIG. 4 can be the previous first RDMA network interface controller. For example, the RDMA network interface controller 400 can be implemented by combining software and hardware.
The RDMA network interface controller 400 can include a first acquisition unit 402, a sending unit 404, a backup unit 406, and a second acquisition unit 408.
The first acquisition unit 402 can obtain m to-be-sent data packets from a host memory of a first host, where m is a positive integer. The sending unit 404 can send the m data packets to a second RDMA network interface controller of a second host. The backup unit 406 can back up the m data packets to a network interface controller memory integrated into the RDMA network interface controller 400.
If the second acquisition unit 408 determines that the second RDMA network interface controller does not receive n data packets in the m data packets, the second acquisition unit 408 obtains the backed-up n data packets from the network interface controller memory, where n is a positive integer.
The sending unit 404 can retransmit the backed-up n data packets to the second RDMA network interface controller.
It can be seen that in this technical solution, because the RDMA network interface controller can back up the to-be-sent data packet in the network interface controller memory integrated in the RDMA network interface controller, when retransmission is required, the to-be-retransmitted data packet can be quickly obtained without accessing the host memory again. Therefore, not only bandwidth (for example, PCIe bandwidth) occupation of the host can be reduced, but also retransmission efficiency and retransmission performance can be greatly improved. As such, a packet loss tolerance capability of the RDMA network interface controller can be significantly enhanced, so the RDMA technology can be effectively implemented on a lossy network.
In an implementation, the network interface controller memory can be a DRAM.
In an implementation, the second acquisition unit 408 can determine that no ACK response for the n data packets is received from the second RDMA network interface controller within predetermined duration. Or the second acquisition unit 408 can determine that NACK responses for the n data packets are received from the second RDMA network interface controller.
In an implementation, if the backup unit 406 determines that the second RDMA network interface controller has received the m data packets, the backup unit 406 can delete the backed-up m data packets from the network interface controller memory.
Units of the RDMA network interface controller 400 can perform corresponding steps in the implementations shown in FIG. 1 and FIG. 3A and FIG. 3B. Therefore, for brevity of description, specific operations and functions of the units of the RDMA network interface controller 400 are not described here again.
FIG. 5 is a schematic block diagram illustrating an RDMA network interface controller, according to an implementation. For example, an RDMA network interface controller 500 in FIG. 5 can be the previous second RDMA network interface controller. For example, the RDMA network interface controller 500 can be implemented by combining software and hardware.
The RDMA network interface controller 500 can include a communications unit 502, a data packet management unit 504, and a data packet management unit 506.
The communications unit 502 can communicate with a first RDMA network interface controller of a first host, so as to receive m data packets from the first RDMA network interface controller, where the m data packets are backed up by the first RDMA network interface controller to a first network interface controller memory integrated into the first RDMA network interface controller, and m is a positive integer.
If the data packet management unit 504 determines that the communications unit 502 does not receive n data packets in the m data packets, the data packet management unit 504 can store received data packets in the m data packets into a second network interface controller memory integrated into the RDMA network interface controller 500, and can wait to receive the n data packets by the first RDMA network interface controller, where the retransmitted n data packets are obtained by the first RDMA network interface controller from the first network interface controller memory, and n is a positive integer.
After the communications unit 502 receives the retransmitted n data packets, the data packet management unit 504 transmits the received data packets together with the retransmitted n data packets to a host memory of a second host.
In this technical solution, because the network interface controller memory is integrated into the receiving end RDMA network interface controller, the receiving end RDMA network interface controller can store the received data packets in the network interface controller memory when waiting for the to-be-retransmitted data packets, and can send all the received data packets to the host memory after receiving all the data packets. As such, the first RDMA network interface controller can retransmit only data packets that are not received, and does not need to retransmit all data packets subsequent to a lost data packet, thereby effectively reducing the number of data packets to be retransmitted, and effectively improving retransmission efficiency.
In addition, because the first RDMA network interface controller can back up the to-be-sent data packets in the network interface controller memory integrated into the first RDMA network interface controller, it is unnecessary for the first RDMA network interface controller to access the host memory again during retransmission. Therefore, not only bandwidth occupation of the host can be reduced, but also retransmission efficiency and retransmission performance can be greatly improved.
In an implementation, the first network interface controller memory can be a DRAM. The second network interface controller memory can be a DRAM.
In an implementation, after the data packet management unit 504 determines that the n data packets in the m data packets are not received, the communications unit 502 can send NACK responses for the n data packets to the first RDMA network interface controller, or the communications unit 502 may not send ACK responses for the n data packets to the first RDMA network interface controller.
Units of the RDMA network interface controller 500 can perform corresponding steps in the implementations shown in FIG. 2 to FIG. 3B. Therefore, for brevity of description, specific operations and functions of the units of the RDMA network interface controller 500 are not described here again.
FIG. 6 is a schematic structural diagram illustrating a computing device, according to an implementation. As shown in FIG. 6, a computing device 600 can include a top-layer application 602, a host memory 604, and an RDMA network interface controller 606. For example, the computing device 600 can be corresponding to the previous first host.
The top-layer application 602 can store m to-be-sent data packets into the host memory 604, and send instructions associated with the m data packets to the RDMA network interface controller 606. For specific functions of the top-layer application 602 and the host memory 604, refer to corresponding descriptions in the previous implementations. Details are omitted here for simplicity.
The RDMA network interface controller 606 is integrated with a network interface controller memory, and can implement functions of the first RDMA network interface controller based on the previous instructions.
The computing device 600 can be implemented in any applicable form in the art, for example, includes but is not limited to a desktop computer, a laptop computer, a smartphone, a tablet computer, a consumer electronic device, a wearable intelligent device, etc.
FIG. 7 is a schematic structural diagram illustrating a computing device, according to an implementation. As shown in FIG. 7, a computing device 700 can include a top-layer application 702, a host memory 704, and an RDMA network interface controller 706. For example, the computing device 700 can be corresponding to the previous second host.
The RDMA network interface controller 706 can be integrated with a network interface controller memory, and can implement functions of the previous second RDMA network interface controller.
In addition, the RDMA network interface controller 706 can send, to the top-layer application, instructions about m data packets transmitted to the host memory 704. The top-layer application processes the m data packets based on the instructions.
For specific functions of the top-layer application 702 and the host memory 704, refer to corresponding descriptions in the previous implementations. Details are omitted here for simplicity.
The computing device 700 can be implemented in any applicable form in the art, for example, includes but is not limited to a desktop computer, a laptop computer, a smartphone, a tablet computer, a consumer electronic device, a wearable intelligent device, etc.
The implementations in the present specification are described in a progressive way. For same or similar parts of the implementations, references can be made to the implementations mutually. Each implementation focuses on a difference from other implementations. Especially, an apparatus implementation, a device implementation, a non-volatile computer storage medium implementation are similar to a method implementation, and therefore are described briefly. For related parts, references can be made to the descriptions in the method implementation.
Specific implementations of the present specification are described above. Other implementations fall within the scope of the appended claims. In some situations, the actions or steps described in the claims can be performed in an order different from the order in the implementations and the desired results can still be achieved. In addition, the process depicted in the accompanying drawings does not necessarily need a particular execution order to achieve the desired results. In some implementations, multi-tasking and concurrent processing is feasible or can be advantageous.
It should be understood that, for a person of ordinary skill in the art, various modifications to the implementations in the present specification will be obvious, and the general principles defined in the present specification can be applied to other variations without departing from the protection scope of the claims.

Claims (18)

What is claimed is:
1. A computer-implemented method, comprising:
communicating, by a second RDMA network interface controller of a second host with a first RDMA network interface controller of a first host, to receive m data packets from the first RDMA network interface controller, wherein the m data packets have been backed up by the first RDMA network interface controller to a first network interface controller memory integrated into the first RDMA network interface controller, m being a positive integer; and
in response to determining that n data packets of the m data packets have not been received, n being a positive integer:
storing, by the second RDMA network interface controller, received data packets of the m data packets into a second network interface controller memory integrated into the second RDMA network interface controller;
waiting, by the second RDMA network interface controller, to receive the n data packets having been retransmitted by the first RDMA network interface controller, wherein the retransmitted n data packets are obtained by the first RDMA network interface controller from the first network interface controller memory; and
after the retransmitted n data packets have been received by the second RDMA network interface controller, transmitting the received data packets together with the retransmitted n data packets to a host memory of the second host.
2. The computer-implemented method of claim 1, wherein the first network interface controller memory integrated into the first RDMA network interface controller is a dynamic random access memory.
3. The computer-implemented method of claim 1, wherein the second network interface controller memory integrated into the second RDMA network interface controller is a dynamic random access memory.
4. The computer-implemented method of claim 1, wherein after determining that n data packets of them data packets have not been received, the method further comprises:
sending, by the second RDMA network interface controller, negative acknowledgement (NACK) responses for the n data packets to the first RDMA network interface controller.
5. The computer-implemented method of claim 1, wherein after determining that n data packets of the m data packets have not been received, the method further comprises:
not sending acknowledgement (ACK) responses for the n data packets to the first RDMA network interface controller.
6. The computer-implemented method of claim 1, wherein the first RDMA network interface controller is configured to delete the m data packets from the first network interface controller memory in response to determining that the second RDMA network interface controller has received the m data packets, including the n data packets that have been retransmitted by the first RDMA network interface controller.
7. A non-transitory, computer-readable medium storing one or more instructions executable by a computer system to perform operations comprising:
communicating, by a second RDMA network interface controller of a second host with a first RDMA network interface controller of a first host, to receive m data packets from the first RDMA network interface controller, wherein the m data packets have been backed up by the first RDMA network interface controller to a first network interface controller memory integrated into the first RDMA network interface controller, m being a positive integer; and
in response to determining that n data packets of the m data packets have not been received, n being a positive integer:
storing, by the second RDMA network interface controller, received data packets of the m data packets into a second network interface controller memory integrated into the second RDMA network interface controller;
waiting, by the second RDMA network interface controller, to receive the n data packets having been retransmitted by the first RDMA network interface controller, wherein the retransmitted n data packets are obtained by the first RDMA network interface controller from the first network interface controller memory; and
after the retransmitted n data packets have been received by the second RDMA network interface controller, transmitting the received data packets together with the retransmitted n data packets to a host memory of the second host.
8. The computer-readable medium of claim 7, wherein the first network interface controller memory integrated into the first RDMA network interface controller is a dynamic random access memory.
9. The computer-readable medium of claim 7, wherein the second network interface controller memory integrated into the second RDMA network interface controller is a dynamic random access memory.
10. The computer-readable medium of claim 7, wherein after determining that n data packets of them data packets have not been received, the operations further comprise:
sending, by the second RDMA network interface controller, negative acknowledgement (NACK) responses for the n data packets to the first RDMA network interface controller.
11. The computer-readable medium of claim 7, wherein after determining that n data packets of the m data packets have not been received, the operations further comprise:
not sending acknowledgement (ACK) responses for the n data packets to the first RDMA network interface controller.
12. The computer-readable medium of claim 7, wherein the first RDMA network interface controller is configured to delete the m data packets from the first network interface controller memory in response to determining that the second RDMA network interface controller has received the m data packets, including the n data packets that have been retransmitted by the first RDMA network interface controller.
13. A computer-implemented system, comprising:
one or more computers; and
one or more computer memory devices interoperably coupled with the one or more computers and having tangible, non-transitory, machine-readable media storing one or more instructions that, when executed by the one or more computers, perform one or more operations comprising:
communicating, by a second RDMA network interface controller of a second host with a first RDMA network interface controller of a first host, to receive m data packets from the first RDMA network interface controller, wherein the m data packets have been backed up by the first RDMA network interface controller to a first network interface controller memory integrated into the first RDMA network interface controller, m being a positive integer; and
in response to determining that n data packets of the m data packets have not been received, n being a positive integer:
storing, by the second RDMA network interface controller, received data packets of the m data packets into a second network interface controller memory integrated into the second RDMA network interface controller;
waiting, by the second RDMA network interface controller, to receive the n data packets having been retransmitted by the first RDMA network interface controller, wherein the retransmitted n data packets are obtained by the first RDMA network interface controller from the first network interface controller memory; and
after the retransmitted n data packets have been received by the second RDMA network interface controller, transmitting the received data packets together with the retransmitted n data packets to a host memory of the second host.
14. The computer-implemented system of claim 13, wherein the first network interface controller memory integrated into the first RDMA network interface controller is a dynamic random access memory.
15. The computer-implemented system of claim 13, wherein the second network interface controller memory integrated into the second RDMA network interface controller is a dynamic random access memory.
16. The computer-implemented system of claim 13, wherein after determining that n data packets of the m data packets have not been received, the operations further comprise:
sending, by the second RDMA network interface controller, negative acknowledgement (NACK) responses for the n data packets to the first RDMA network interface controller.
17. The computer-implemented system of claim 13, wherein after determining that n data packets of the m data packets have not been received, the operations further comprise:
not sending acknowledgement (ACK) responses for the n data packets to the first RDMA network interface controller.
18. The computer-implemented system of claim 13, wherein the first RDMA network interface controller is configured to delete the m data packets from the first network interface controller memory in response to determining that the second RDMA network interface controller has received the m data packets, including the n data packets that have been retransmitted by the first RDMA network interface controller.
US16/945,621 2019-07-11 2020-07-31 Data transmission and network interface controller Active US10911541B1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
US16/945,621 US10911541B1 (en) 2019-07-11 2020-07-31 Data transmission and network interface controller
US17/164,678 US11115474B2 (en) 2019-07-11 2021-02-01 Data transmission and network interface controller
US17/412,866 US11736567B2 (en) 2019-07-11 2021-08-26 Data transmission and network interface controller

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
CN201910624819.8 2019-07-11
CN201910624819 2019-07-11
CN201910624819.8A CN110460412B (en) 2019-07-11 2019-07-11 Method and RDMA network card for data transmission
PCT/CN2020/071728 WO2021004056A1 (en) 2019-07-11 2020-01-13 Method for data transmission and rdma network interface card
US16/809,273 US10785306B1 (en) 2019-07-11 2020-03-04 Data transmission and network interface controller
US16/945,621 US10911541B1 (en) 2019-07-11 2020-07-31 Data transmission and network interface controller

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US16/809,273 Continuation US10785306B1 (en) 2019-07-11 2020-03-04 Data transmission and network interface controller

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US17/164,678 Continuation US11115474B2 (en) 2019-07-11 2021-02-01 Data transmission and network interface controller

Publications (2)

Publication Number Publication Date
US20210014307A1 US20210014307A1 (en) 2021-01-14
US10911541B1 true US10911541B1 (en) 2021-02-02

Family

ID=72516768

Family Applications (4)

Application Number Title Priority Date Filing Date
US16/809,273 Active US10785306B1 (en) 2019-07-11 2020-03-04 Data transmission and network interface controller
US16/945,621 Active US10911541B1 (en) 2019-07-11 2020-07-31 Data transmission and network interface controller
US17/164,678 Active US11115474B2 (en) 2019-07-11 2021-02-01 Data transmission and network interface controller
US17/412,866 Active 2040-03-09 US11736567B2 (en) 2019-07-11 2021-08-26 Data transmission and network interface controller

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US16/809,273 Active US10785306B1 (en) 2019-07-11 2020-03-04 Data transmission and network interface controller

Family Applications After (2)

Application Number Title Priority Date Filing Date
US17/164,678 Active US11115474B2 (en) 2019-07-11 2021-02-01 Data transmission and network interface controller
US17/412,866 Active 2040-03-09 US11736567B2 (en) 2019-07-11 2021-08-26 Data transmission and network interface controller

Country Status (1)

Country Link
US (4) US10785306B1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11115474B2 (en) 2019-07-11 2021-09-07 Advanced New Technologies Co., Ltd. Data transmission and network interface controller

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114520711B (en) * 2020-11-19 2024-05-03 迈络思科技有限公司 Selective retransmission of data packets

Citations (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060045108A1 (en) 2004-08-30 2006-03-02 International Business Machines Corporation Half RDMA and half FIFO operations
US20060274748A1 (en) 2005-03-24 2006-12-07 Fujitsu Limited Communication device, method, and program
US20080126509A1 (en) 2006-11-06 2008-05-29 Viswanath Subramanian Rdma qp simplex switchless connection
US20090125604A1 (en) 2004-08-30 2009-05-14 International Business Machines Corporation Third party, broadcast, multicast and conditional rdma operations
US20100082766A1 (en) 2008-09-29 2010-04-01 Cisco Technology, Inc. Reliable reception of messages written via rdma using hashing
US20130262613A1 (en) 2012-03-30 2013-10-03 Mark S. Hefty Efficient distribution of subnet administration data over an rdma network
US20140052808A1 (en) * 2012-08-20 2014-02-20 Advanced Micro Devices, Inc. Speculation based approach for reliable message communications
US20160026605A1 (en) * 2014-07-28 2016-01-28 Emulex Corporation Registrationless transmit onload rdma
US20160212214A1 (en) 2015-01-16 2016-07-21 Avago Technologies General Ip (Singapore) Pte. Ltd. Tunneled remote direct memory access (rdma) communication
CN106487896A (en) 2016-10-14 2017-03-08 北京百度网讯科技有限公司 Method and apparatus for processing remote direct memory access request
US20170075855A1 (en) 2015-09-14 2017-03-16 Cisco Technology, Inc. Low latency remote direct memory access for microservers
US20180004705A1 (en) 2016-06-29 2018-01-04 Mellanox Technologies Ltd. Selective acknowledgment of RDMA packets
US20180069923A1 (en) 2016-09-08 2018-03-08 Toshiba Memory Corporation Remote nvme activation
US20190079897A1 (en) 2017-09-14 2019-03-14 Microsoft Technology Licensing, Llc Remote direct memory access in computing systems
US20190095106A1 (en) 2017-09-27 2019-03-28 Alibaba Group Holding Limited Low-latency lightweight distributed storage system
US20190171612A1 (en) 2013-03-10 2019-06-06 Mellanox Technologies, Ltd. Network adapter with a common queue for both networking and data manipulation work requests
CN109981480A (en) 2017-12-27 2019-07-05 华为技术有限公司 A kind of data transmission method and the first equipment
CN110460412A (en) 2019-07-11 2019-11-15 阿里巴巴集团控股有限公司 Method and RDMA network interface card for data transmission
US10509764B1 (en) 2015-06-19 2019-12-17 Amazon Technologies, Inc. Flexible remote direct memory access
US20200162394A1 (en) * 2018-11-15 2020-05-21 Mellanox Technologies, Ltd. Transmission timeout system
US20200218688A1 (en) * 2017-09-22 2020-07-09 Huawei Technologies Co., Ltd. Data validation method and apparatus, and network interface card

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6031818A (en) * 1997-03-19 2000-02-29 Lucent Technologies Inc. Error correction system for packet switching networks
US8601049B2 (en) 2004-03-04 2013-12-03 The United States Postal Service System and method for providing centralized management and distribution of information to remote users
US8009586B2 (en) * 2004-06-29 2011-08-30 Damaka, Inc. System and method for data transfer in a peer-to peer hybrid communication network
US20060075057A1 (en) * 2004-08-30 2006-04-06 International Business Machines Corporation Remote direct memory access system and method
US7792026B2 (en) * 2006-02-17 2010-09-07 Alcatel-Lucent Usa Inc. Method of calculating a time period to wait for missing data packets
US8824378B2 (en) 2008-02-01 2014-09-02 Maarten Menzo Wentink Unscheduled peer power save mode
US7957273B2 (en) * 2008-06-06 2011-06-07 Redpine Signals, Inc. Packet re-transmission controller for block acknowledgement in a communications system
US9176911B2 (en) * 2012-12-11 2015-11-03 Intel Corporation Explicit flow control for implicit memory registration
US9762355B2 (en) * 2014-07-31 2017-09-12 Qualcomm Incorporated System and method of redundancy based packet transmission error recovery
US9979640B2 (en) * 2014-12-23 2018-05-22 Intel Corporation Reorder resilient transport
US9563431B2 (en) * 2014-12-26 2017-02-07 Intel Corporation Techniques for cooperative execution between asymmetric processor cores
US9996498B2 (en) * 2015-09-08 2018-06-12 Mellanox Technologies, Ltd. Network memory
TWM531105U (en) 2016-03-23 2016-10-21 Lanner Electronic Inc Network card information recognition system
WO2018009468A1 (en) * 2016-07-05 2018-01-11 Idac Holdings, Inc. Latency reduction by fast forward in multi-hop communication systems
TWI635396B (en) 2016-12-23 2018-09-11 英業達股份有限公司 Data transmission system, a data receiving method and a data transmission method using two-stage memories to handle packet data
US11064281B1 (en) * 2017-11-15 2021-07-13 Amazon Technologies, Inc. Sending and receiving wireless data
WO2019107870A1 (en) * 2017-11-28 2019-06-06 Samsung Electronics Co., Ltd. Method and a user equipment (ue) for transport layer optimization using a preemptive cross layer signaling
US11252110B1 (en) * 2018-09-21 2022-02-15 Marvell Asia Pte Ltd Negotiation of alignment mode for out of order placement of data in network devices
US10872173B2 (en) * 2018-09-26 2020-12-22 Marvell Asia Pte, Ltd. Secure low-latency chip-to-chip communication
US10785306B1 (en) * 2019-07-11 2020-09-22 Alibaba Group Holding Limited Data transmission and network interface controller

Patent Citations (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060045108A1 (en) 2004-08-30 2006-03-02 International Business Machines Corporation Half RDMA and half FIFO operations
US20090125604A1 (en) 2004-08-30 2009-05-14 International Business Machines Corporation Third party, broadcast, multicast and conditional rdma operations
US20060274748A1 (en) 2005-03-24 2006-12-07 Fujitsu Limited Communication device, method, and program
US20080126509A1 (en) 2006-11-06 2008-05-29 Viswanath Subramanian Rdma qp simplex switchless connection
US20100082766A1 (en) 2008-09-29 2010-04-01 Cisco Technology, Inc. Reliable reception of messages written via rdma using hashing
US20130262613A1 (en) 2012-03-30 2013-10-03 Mark S. Hefty Efficient distribution of subnet administration data over an rdma network
US20140052808A1 (en) * 2012-08-20 2014-02-20 Advanced Micro Devices, Inc. Speculation based approach for reliable message communications
US20190171612A1 (en) 2013-03-10 2019-06-06 Mellanox Technologies, Ltd. Network adapter with a common queue for both networking and data manipulation work requests
US20160026605A1 (en) * 2014-07-28 2016-01-28 Emulex Corporation Registrationless transmit onload rdma
US20160212214A1 (en) 2015-01-16 2016-07-21 Avago Technologies General Ip (Singapore) Pte. Ltd. Tunneled remote direct memory access (rdma) communication
US10509764B1 (en) 2015-06-19 2019-12-17 Amazon Technologies, Inc. Flexible remote direct memory access
US20170075855A1 (en) 2015-09-14 2017-03-16 Cisco Technology, Inc. Low latency remote direct memory access for microservers
US20180004705A1 (en) 2016-06-29 2018-01-04 Mellanox Technologies Ltd. Selective acknowledgment of RDMA packets
US20180069923A1 (en) 2016-09-08 2018-03-08 Toshiba Memory Corporation Remote nvme activation
CN106487896A (en) 2016-10-14 2017-03-08 北京百度网讯科技有限公司 Method and apparatus for processing remote direct memory access request
US20190079897A1 (en) 2017-09-14 2019-03-14 Microsoft Technology Licensing, Llc Remote direct memory access in computing systems
US20200218688A1 (en) * 2017-09-22 2020-07-09 Huawei Technologies Co., Ltd. Data validation method and apparatus, and network interface card
US20190095106A1 (en) 2017-09-27 2019-03-28 Alibaba Group Holding Limited Low-latency lightweight distributed storage system
CN109981480A (en) 2017-12-27 2019-07-05 华为技术有限公司 A kind of data transmission method and the first equipment
US20200162394A1 (en) * 2018-11-15 2020-05-21 Mellanox Technologies, Ltd. Transmission timeout system
CN110460412A (en) 2019-07-11 2019-11-15 阿里巴巴集团控股有限公司 Method and RDMA network interface card for data transmission

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Crosby et al., "BlockChain Technology: Beyond Bitcoin," Sutardja Center for Entrepreneurship & Technology Technical Report, Oct. 16, 2015, 35 pages.
Fukuda et al., "Caching Memcached at Reconfigurable Network Interface", 24th International Conference on Field Programmable Logic and Applications, Oct. 2014, 6 pages.
Nakamoto, "Bitcoin: A Peer-to-Peer Electronic Cash System," www.bitcoin.org, 2005, 9 pages.
PCT International Search Report and Written Opinion in International Application No. PCT/CN2020/071728, dated Apr. 17, 2020, 16 pages (with partial English machine translation).

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11115474B2 (en) 2019-07-11 2021-09-07 Advanced New Technologies Co., Ltd. Data transmission and network interface controller
US11736567B2 (en) 2019-07-11 2023-08-22 Advanced New Technologies Co., Ltd. Data transmission and network interface controller

Also Published As

Publication number Publication date
US20210014307A1 (en) 2021-01-14
US20210160320A1 (en) 2021-05-27
US10785306B1 (en) 2020-09-22
US11115474B2 (en) 2021-09-07
US11736567B2 (en) 2023-08-22
US20210392187A1 (en) 2021-12-16

Similar Documents

Publication Publication Date Title
TWI734380B (en) Computer-implemented method and system and non-transitory computer-readable storage medium
US10891253B2 (en) Multicast apparatuses and methods for distributing data to multiple receivers in high-performance computing and cloud-based networks
US11251911B2 (en) Data transmission method and related device
US11736567B2 (en) Data transmission and network interface controller
CN104011696A (en) Explicit flow control for implicit memory registration
US9256564B2 (en) Techniques for improving throughput and performance of a distributed interconnect peripheral bus
US20140313996A1 (en) System, apparatus, computer-readable medium and method
US20100217889A1 (en) Accelerated block option for trivial file transfer protocol (tftp)
CN101594308B (en) Method and system for transmitting message
US9954790B2 (en) Method for flow control in network
CN110943878A (en) Heartbeat packet transmission method, terminal and device with storage function
CN111404842B (en) Data transmission method, device and computer storage medium
US8769137B2 (en) Systems and methods for negotiated accelerated block option for trivial file transfer protocol (TFTP)
US8868994B2 (en) High performance virtual Converged Enhanced Ethernet with persistent state flow control
US9628366B2 (en) Methods, systems, and computer readable media for sustaining active control over concurrent session connections
US9450706B2 (en) Communication apparatus and packet transfer method
CN115865830A (en) Method and device for reducing RDMA (remote direct memory Access) network overhead based on connection management
CN111611068B (en) Data writing method in distributed system, server and client
US9742819B2 (en) System and method for reliable messaging between application sessions across volatile networking conditions
CN113765824A (en) Response message sending method and device based on MBIM (multimedia broadcast multicast service) interface, MBB (multimedia broadcast multicast service) equipment and medium
CN117692389A (en) RDMA message information retransmission method, device, electronic equipment and storage medium

Legal Events

Date Code Title Description
FEPP Fee payment procedure

Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

AS Assignment

Owner name: ALIBABA GROUP HOLDING LIMITED, CAYMAN ISLANDS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:LI, CHANGQING;REEL/FRAME:053650/0824

Effective date: 20200306

Owner name: ADVANTAGEOUS NEW TECHNOLOGIES CO., LTD., CAYMAN ISLANDS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:ALIBABA GROUP HOLDING LIMITED;REEL/FRAME:053743/0464

Effective date: 20200826

AS Assignment

Owner name: ADVANCED NEW TECHNOLOGIES CO., LTD., CAYMAN ISLANDS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:ADVANTAGEOUS NEW TECHNOLOGIES CO., LTD.;REEL/FRAME:053754/0625

Effective date: 20200910

AS Assignment

Owner name: ALIBABA GROUP HOLDING LIMITED, CAYMAN ISLANDS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:ZOU, YINCHAO;WU, PENG;KONG, JINCAN;SIGNING DATES FROM 20201218 TO 20201221;REEL/FRAME:054855/0520

STCF Information on status: patent grant

Free format text: PATENTED CASE

CC Certificate of correction