US20020009075A1 - System for reducing bus overhead for communication with a network interface - Google Patents
System for reducing bus overhead for communication with a network interface Download PDFInfo
- Publication number
- US20020009075A1 US20020009075A1 US08/899,435 US89943597A US2002009075A1 US 20020009075 A1 US20020009075 A1 US 20020009075A1 US 89943597 A US89943597 A US 89943597A US 2002009075 A1 US2002009075 A1 US 2002009075A1
- Authority
- US
- United States
- Prior art keywords
- buffer
- computer system
- data
- network interface
- command
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000004891 communication Methods 0.000 title claims description 26
- 239000000872 buffer Substances 0.000 claims abstract description 186
- 238000012546 transfer Methods 0.000 claims abstract description 73
- 238000000034 method Methods 0.000 claims abstract description 39
- 239000012634 fragment Substances 0.000 claims description 57
- 230000007246 mechanism Effects 0.000 claims description 4
- XUIMIQQOPSSXEZ-UHFFFAOYSA-N Silicon Chemical compound [Si] XUIMIQQOPSSXEZ-UHFFFAOYSA-N 0.000 claims description 2
- 229910052710 silicon Inorganic materials 0.000 claims description 2
- 239000010703 silicon Substances 0.000 claims description 2
- 238000013519 translation Methods 0.000 abstract description 11
- 230000014616 translation Effects 0.000 abstract description 11
- 238000012545 processing Methods 0.000 abstract description 7
- 238000006243 chemical reaction Methods 0.000 abstract description 6
- 238000001152 differential interference contrast microscopy Methods 0.000 description 47
- 238000010586 diagram Methods 0.000 description 12
- 230000005540 biological transmission Effects 0.000 description 10
- 230000002093 peripheral effect Effects 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 230000006870 function Effects 0.000 description 4
- 230000009471 action Effects 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 229920000740 poly(D-lysine) polymer Polymers 0.000 description 3
- 238000013461 design Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000003139 buffering effect Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 230000006855 networking Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L12/00—Data switching networks
- H04L12/28—Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
- H04L12/40—Bus networks
- H04L12/40006—Architecture of a communication node
- H04L12/40032—Details regarding a bus interface enhancer
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L12/00—Data switching networks
- H04L12/28—Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
- H04L12/40—Bus networks
- H04L12/40006—Architecture of a communication node
Definitions
- the present invention relates to a device for connecting a computer system to a computer network, and more particularly to a method and an apparatus for reducing bus overhead in communications between a computer system and a network interface device through which the computer system communicates with a high speed packet-switched network.
- NICs network interface controllers
- the interface to the computer system typically connects to a bus within the computer system, such as a PCI bus, through which data is transferred between the computer system memory and the NIC.
- PIO Programmable I/O
- NIC Network Address Translation
- shared memory can be used to facilitate communications between the NIC and the computer system.
- the NIC and the computer system communicate by writing to and reading from a shared memory that exists in both the address space of the computer system and the address space of the NIC. This again leads to a simple hardware and operating system implementation, and a clean interface between the computer system and the NIC.
- DMA direct memory access
- DMA transfers between computer systems and NICs are commonly accomplished using the scatter-gather technique.
- scatter-gather a bus master device in the NIC is first instructed to obtain a command block from the memory of a host computer system.
- the command block contains a list of physical addresses for blocks within the host system memory that are to be copied to the DMA device.
- the command block also contains a count of the number of fragments in the command block and the overall length of the data contained in the fragments pointed to by the command block.
- the DMA device parses the command block, extracting the address of each fragment, and transfers the fragments from the host memory to the DMA device. This process is repeated for each fragment listed in the command block until all of the data described by the command block is copied to the DMA device.
- a significant performance bottleneck in using the scatter-gather technique for transferring data to a high speed network is the translation from virtual to physical addresses.
- Peripheral devices such as a NIC, cannot use virtual memory addresses to effect the transfers, because the hardware to implement the virtual-to-physical address translation is typically located inside the CPU. This means that conversion between virtual and physical addresses must take place before transfers between a computer system and a NIC can take place. This conversion can take a great deal of time and consume a significant amount of the computer system's processing power.
- the driver When data is passed to a device driver for transmission to the NIC, the driver first performs a virtual-to-physical address conversion for each buffer fragment passed down to it from the application layers above.
- each buffer fragment it is possible for each buffer fragment to straddle physical pages of the memory system. Thus, more than one physical address may correspond to each virtual address converted. Consequently, several virtual-to-physical address conversions may be required for each buffer of data that is transferred from the computer system to the NIC. This can be very time-consuming because each virtual-to-physical address translation can take from tens to hundreds of CPU cycles to accomplish.
- Peripheral devices such as NICs typically connect to computer systems through a peripheral interconnect bus, such as the PCI bus.
- a peripheral interconnect bus such as the PCI bus.
- devices connected to the bus contend for control of the bus. Once a device is granted control of the bus, it drives bus signal lines to transfer data to or from the computer system.
- the performance impediment stems from the number of times a NIC must contend for the peripheral interconnect bus when transferring data using the scatter-gather technique.
- bus contention to transfer data between a NIC and an attached computer system will occur three times per buffer transferred: first, when the computer system informs the NIC that a buffer is available for its use; second when the NIC reads the command block describing the buffer; and third when the NIC transfers data to or from the buffer.
- the computer system informs the NIC that a buffer is available for its use
- the NIC reads the command block describing the buffer
- the NIC transfers data to or from the buffer.
- at least two buffer fragments will be described in each command block.
- the present invention provides a method and an apparatus for transferring data between a computer system and a network interface card that avoids virtual-to-physical address translations.
- the computer system allocates blocks of memory during system initialization for storing data in transit between the computer system and the NIC, and the physical addresses of these blocks of memory are stored in a table on the NIC. Consequently, address conversion is performed only once, when the memory is allocated.
- the device driver copies the data from the upper layers into the next available memory block.
- the device driver then formats a command and passes it to the NIC for processing.
- PDC packet descriptor command
- the PDC contains a small ordinal value that indexes a table in the NIC, which includes a set of physical addresses of buffers preallocated by the computer system in the computer system memory. These buffers are used for storing data in transit to the NIC.
- the PDC also contains the length of the buffer to be copied to or from the NIC.
- the present invention also allows for multiple packets to be formatted into buffers and then subsequently transferred to the NIC in a single I/O operation.
- the present invention provides a number of advantages. First, virtual-to-physical address translation is avoided at run time. Second, the formatting of a packet descriptor list is greatly simplified. Third, the amount of control data transferred to the NIC by the computer system is greatly reduced. Finally, multiple packets can be transferred to the NIC in a single I/O operation, thereby making more efficient use of bandwidth on the interconnect bus.
- the present invention incurs additional overhead because the processor must move data from the application program into the data buffers in the computer system's memory before this data is transferred to the NIC. At first glance, this double copy operation appears to incur a great amount of additional processor overhead. However, this additional overhead is considerably smaller than the overhead involved in performing virtual-to-physical address translations. Each translation requires many tens (if not hundreds) of CPU cycles, and many such translations may be required for a single transfer operation. Consequently, the present invention provides a significant performance advantage for small data transfers, which represent a significant percentage of all data transfers. Hundreds of bytes can be moved to the preallocated buffer in the time it takes to perform just one virtual-to-physical address translation. Moreover, as microprocessors move to 64 and 128 bit architectures, their capacity to move data per clock will increase thereby further widening the performance advantage of the present invention over conventional scatter-gather DMA.
- CPU utilization may not be the primary bottleneck.
- bus utilization may be the largest bottleneck.
- favoring bus utilization at the expense of CPU utilization is often a desirable tradeoff to make.
- the present invention can be characterized as an apparatus for facilitating communications between a computer system, including a memory and a bus, and a packet-switched network, comprising: a bus interface coupled to the bus, for communicating across the bus; a transmit buffer, for storing data to be transmitted on the packet-switched network; a transmit data path, coupled to the bus interface and the transmit buffer, for transferring data from the bus interface to the transmit buffer; a receive buffer, for storing data received from the packet-switched network; a receive data path, coupled to the bus interface and the receive buffer, for transferring data from the receive buffer to the bus interface; a buffer address table, coupled to the bus interface, for storing at least one address of at least one buffer in the memory of the computer system, the at least one buffer being preallocated by the computer system and used to store data in transit between the computer system and one of the transmit buffer and the receive buffer; and a controller coupled to the transmit buffer, the receive buffer and the buffer address table, for controlling the transfer of data from
- the apparatus includes: a transmit command queue coupled to the bus interface and the controller, for storing transmit commands from the computer system; and a transmit execution queue, coupled to the bus interface, the transmit command queue and the controller, for storing and processing commands from the transmit command queue, and command blocks from the computer system which are referenced by commands from the transmit command queue.
- the apparatus includes a receive command queue coupled to the bus interface and the controller, for storing receive commands from the computer system; and a receive execution queue, coupled to the bus interface, the receive command queue and the controller, for storing and processing commands from the receive command queue and command blocks from the computer system that are referenced by commands from the receive command queue.
- the controller includes a mechanism to transfer a plurality of packets in a single operation between the at least one buffer preallocated by the computer system and the transmit buffer.
- the controller includes a mechanism to transfer a plurality of packets in a single operation between the receive buffer and the at least one buffer preallocated by the computer system.
- the present invention can also be characterized as a method for transferring data between a computer system and a network interface device, the network interface device being coupled to a packet-switched network, and the computer system including a memory and a communication channel, the communication channel being coupled to the network interface device, the method comprising: receiving at the network interface device at least one address of a preallocated buffer in the memory; storing in the network interface device the at least one address of the preallocated buffer; receiving a command from the computer system through the communication channel, the command indicating that a transfer between the network interface device and the computer system is to take place; retrieving an address from the at least one address of a preallocated buffer stored in the network interface device; using the address to transfer data from the preallocated buffer in the memory to the network interface device if the command is a transmit command; and using the address to transfer data from the network interface device to the preallocated buffer in the memory if the command is a receive command.
- FIG. 1 is a block diagram illustrating some of the major functional components of a system for coupling host computer system 190 with a high speed network 160 in accordance with an aspect of the present invention.
- FIG. 2 is a block diagram illustrating the format of a packet descriptor list in accordance with an aspect of the present invention.
- FIG. 3 is a block diagram illustrating the structure of a simplified packet descriptor command for initiating a transfer of data between a NIC and a computer system in accordance with an aspect of the present invention.
- FIG. 4 is a diagram illustrating the sequence of commands and data transfers involved in transferring data from host computer system 190 to a NIC in accordance with an aspect of the present invention.
- FIG. 5 is a diagram illustrating the sequence of commands and data transfers involved in transferring data from host computer system 150 to a NIC in accordance with an aspect of the present invention.
- FIG. 6 illustrates the sequence of commands and data transfers involved in transferring data from a NIC to a host computer system 150 in accordance with an aspect of the present invention.
- FIG. 7 is a diagram illustrating the sequence of commands and data transfers involved in transferring data from a NIC to a host computer system 150 in accordance with an aspect of the present invention.
- FIG. 8 is a block diagram illustrating some of the major functional components within a NIC in accordance with an aspect of the present invention.
- FIG. 9 is a block diagram illustrating the structure of a preallocated buffer, including a plurality of packets for transmission to a NIC, in accordance with an aspect of the present invention.
- FIG. 1 is a block diagram illustrating some of the major functional components of a host computer system 190 , which connects to a high speed network 160 through network interface 195 , in accordance with an aspect of the present invention.
- Host computer system 190 includes interconnect bus 130 , host bridge 110 , host bus 140 , host processor 120 and memory 180 .
- Host processor 120 connects to host bus 140 , which also connects to host bridge 110 .
- Host processor 120 can be any type of processor system including a device controller, a microprocessor, or a mainframe computing system.
- Host bus 140 is a bus which connects host processor 120 to host bridge 110 .
- Host bridge 110 includes cache controller 112 which connects to memory 180 .
- Memory 180 includes pre-allocated buffers 182 , which are buffers preallocated by host processor 120 during system initialization. These buffers are used to store data in transit between host computer system 190 and high speed network 160 .
- Host bridge 110 additionally connects to interconnect bus 130 .
- Interconnect bus 130 is used to connect host computer system 190 to peripheral devices, such as network interface 195 .
- Interconnect bus 130 may be any type of commonly used interconnection bus, such as a PCI bus.
- High speed network 160 is any type of high speed data network, including 100 megabit and gigabit Ethernet networks for example.
- Network interface 195 may be implemented on a separate computer card, or it may be integrated into a computer system motherboard. It may also be integrated into a single silicon chip.
- Network interface 195 includes physical layer interface 150 and controller 100 .
- Controller 100 is coupled to interconnect bus 130 within host computer system 190 .
- Controller 100 additionally connects to physical layer interface 150 , which connects to high speed network 160 .
- Controller 100 performs the DMA functions involved in transferring data between memory 180 within host computer system 190 , and high speed network 160 .
- Controller 100 includes buffer address table 105 , which includes a plurality of addresses of buffers in preallocated buffers 182 within memory 180 .
- Physical layer interface 150 includes resources for performing communications across high speed network 160 .
- FIG. 2 is a diagram illustrating the structure of a packet descriptor list (PDL) in accordance with an aspect of the present invention.
- the PDL illustrated in FIG. 2 specifies a transmission of data including the transmission of N separate fragments scattered through memory 180 within host computer system 190 .
- the PDL includes status field 200 , which contains information regarding the status of the data transfer corresponding to the PDL. It also includes a number of fragments field 210 , which indicates the number of fragments associated with the data transfer specified by the PDL.
- the PDL also includes a packet length 220 , which indicates the length of the entire transfer, including the plurality of associated fragments.
- the PDL additionally includes a number of address/length pairs.
- Address of fragment 1 230 includes the physical address for fragment 1 within memory 180 of host computer system 190 .
- Length of fragment 1 235 includes the length of fragment 1.
- Address of fragment 2 240 includes the address within memory 180 of fragment 2.
- Length of fragment 2 245 includes the length of fragment 2.
- address of fragment N 250 which contains the address of fragment N within memory 180 .
- the PDL additionally contains length of fragment N 255 , which includes the length of fragment N.
- FIG. 3 illustrates the format for a simplified command sent from host computer system 190 to network interface 195 in accordance with an aspect of the present invention.
- the command is known as a packet description command (PDC) and fits within a single word of memory.
- PDC packet description command
- Options field 300 which is a field indicating the processing options for a transmission between host computer system 190 and network interface 195 .
- block index field 310 which indexes a buffer address table within controller 100 , containing the physical address of a preallocated buffer within memory 180 in host computer system 190 .
- buffer length 320 which is the length of the data within the pre-allocated buffer that is to be transferred between network interface 195 and host computer system 190 .
- this simplified command format does not require a “number of fragments” field because only one fragment is sent. It also does not require a separate address and length for each fragment because multiple fragments are concatenated together within a single pre-allocated buffer to be transferred in a single operation.
- FIG. 4 illustrates the sequence of operations involved in transferring data from host computer system 190 to controller 100 within network interface 195 using a prior art packet descriptor list command format in accordance with an aspect of the present invention.
- host computer system 190 writes a packet descriptor address to controller 100 .
- controller 100 uses this packet descriptor address to retrieve a command block 410 from host computer system 190 .
- controller 100 performs a series of retrieval operations 420 , 430 and 440 , to retrieve individual fragments from host computer system 190 into controller 100 .
- FIG. 5 illustrates the sequence of operations required to move data from host computer system 190 to controller 100 using a packet descriptor command format in accordance with an aspect of the present invention.
- the greatly simplified sequence of operations in this example results from the simplified command format and the preallocation of buffers within memory 180 .
- host computer system 190 writes a packet descriptor command 500 to controller 100 .
- Controller 100 uses the block index 310 within this packet descriptor command as well as buffer length 320 to retrieve a buffer 510 within memory 180 .
- FIG. 6 illustrates the sequence of operations required to transfer data from controller 100 to host computer system 190 using a prior art packet descriptor list command format in accordance with an aspect of the present invention.
- a packet descriptor address is pre-loaded 600 into controller 100 sometime before the incoming data is received at controller 100 from high speed network 160 .
- the packet descriptor address is used to retrieve a command block 610 , including a packet descriptor list from host computer system 190 .
- a series of transfers, 620 , 630 and 640 take place between controller 100 and host computer system 190 to transfer all of the constituent fragments to host computer system 190 .
- FIG. 7 presents a greatly simplified series of transactions required to move data from controller 100 to host computer system 190 using a packet descriptor command format in accordance with an aspect of the present invention.
- a packet descriptor command is preloaded 700 into controller 100 from host computer system 190 before data is received at controller 100 .
- this data is transferred to a buffer within host computer system 190 in a single transaction.
- Controller 100 can buffer data until such time that host computer system 190 loads a packet descriptor command into controller 100 .
- FIG. 8 is a block diagram illustrating some of the major functional components of controller 100 in FIG. 1 in accordance with an aspect of the present invention.
- Controller 100 includes bus master controller 800 , which is coupled to transmit buffer 830 and receive buffer 835 . Transmit buffer 830 and receive buffer 835 are used to store data to be transmitted and received from high speed network 160 pictured in FIG. 1.
- Bus master controller 800 additionally connects to bus interface 820 , which implements bus interface functions for a connection onto interconnect bus 130 within host computer system 190 in FIG. 1.
- Bus master controller 800 includes transmit command FIFO 850 , which stores transmit commands from host computer system 190 . Transmit command FIFO 850 is coupled to transmit execution queue 860 .
- Transmit execution queue 860 contains expanded commands from transmit command FIFO 850 . If the command in transmit command FIFO 850 is a packet descriptor address, the corresponding command block is retrieved and placed into transmit execution queue 860 . If the command is a packet descriptor command, the command is directly transferred to transmit execution queue 860 . Transmit execution queue 860 is additionally coupled to transmit buffer address table 870 , which contains physical addresses of preallocated buffers within memory 180 in host computer system 190 .
- Bus master controller 800 additionally includes receive command FIFO 852 , which contains receive commands from host computer system 190 that are preloaded into receive command FIFO 852 .
- Receive command FIFO 852 is coupled to receive execution queue 862 , which contains expanded commands from receive command FIFO 852 .
- packet descriptor addresses within receive command FIFO 852 are expanded into corresponding command blocks which are loaded into receive execution queue 862 .
- Packet descriptor commands are directly loaded into receive execution queue 862 .
- Bus master controller 800 also includes receive buffer address table 872 , which contains a table of physical addresses of pre-allocated buffers for storing data received from high speed network 160 .
- Bus master controller 800 additionally includes byte aligner endian mode circuitry 840 and byte aligner endian mode circuitry 842 .
- Byte aligner endian mode circuitry 840 is coupled between bus interface 820 and transmit buffer 830 . It performs byte alignment and endian mode reversal functions for control information associated with data transmissions from bus interface 820 to transmit buffer 830 .
- Byte aligner and endian mode circuitry 842 is coupled between receive buffer 835 and bus interface 820 , and provides the same byte alignment and endian mode reversal functions for status information associated with data transmitted from receive buffer 835 and bus interface 820 in accordance with an aspect of the present invention.
- Bus master controller 800 additionally includes bus master state machine 810 , which is coupled to all of the functional components within bus master controller 800 , and is additionally coupled to bus interface 820 , transmit buffer 830 and receive buffer 835 .
- Bus master state machine 810 coordinates actions of the components within FIG. 8 to transfer data between bus interface 820 and transmit and receive buffers 830 and 835 .
- Bus master controller 800 is responsible for data flow between transmit buffer 830 , receive buffer 835 and bus interface 820 . It includes five major components: bus master state machine 810 , byte aligner endian mode circuitry 840 and 842 , transmit buffer address table 870 , receive buffer address table 872 , receive command FIFO 852 with receive execution queue 862 , and transmit command FIFO 850 with transmit execution queue 860 .
- Bus master controller 800 supports three modes of operation: programmed I/O (PIO), packet descriptor list (PDL), and packet descriptor command (PDC).
- PIO programmed I/O
- PDL packet descriptor list
- PDC packet descriptor command
- the PDC mode is also known as PROPULSION(tm) technology.
- Bus master controller 800 decodes and controls transactions and routing of data required by the operating modes. As a shorthand for references to the address of a PDL, the term PDA is used throughout the remainder of the text.
- Command execution queues 860 and 862 are used in the PDL and PDC modes of operation to hold either a packet descriptor address or a packet descriptor command.
- a PDA provides the address where a corresponding packet descriptor list is obtained, while a PDC is used to execute a PROPULSION(tm) transaction.
- the command execution queue contains the loaded PDL and/or PDC instructions from the command FIFO. The PDL and PDC commands are executed from the command execution queues.
- Packet descriptor lists are the data structures used to communicate information about transmit and receive packets. Both transmit and receive PDLs use the same format, shown in FIG. 2. Each PDL contains a packet descriptor header and one or more fragment descriptors describing the location and length of the packet data in host memory. In the case of a transmit PDL, the PDL describes the location and length of fragments that comprise the total packet. Packet length field 220 includes the sum of the length fields in the fragment descriptors. For receive PDLs, the length field is also the sum of the fragment lengths, however this field is overwritten with the actual length of the packet after the packet is received.
- bus master controller 800 When transferring a received packet from receive buffer 835 to host memory 180 , bus master controller 800 scatters the packet across the locations described by each fragment descriptor. Note that the fragment lengths are not overwritten, so the last fragment transferred may contain less data than is indicated by the corresponding fragment length field. The header length field also indicates how many fragments are completely filled and how much data is in the last fragment. If the buffer described by the receive PDL is not large enough to hold the complete packet, a receive overflow error is generated and the remaining data is discarded.
- Transmit execution queue 860 and receive execution queue 862 are independent FIFOs containing 32 and 64 entries, respectively, in a preferred embodiment. They contain either a PDA or a PDC instructions.
- the host software transfers control of the buffer to bus master controller 800 .
- the host computer system 190 writes a PDA or PDC to a command FIFO, the FIFO's command count register is incremented.
- bus master controller 800 has processed the PDL pointed to by a PDA or the PDC, the PDA or the PDC is removed from the ring and the command count register is decremented.
- the host software uses a command FIFO count register to determine how many PDA or PDC commands are currently owned by bus master controller 800 . If the host is capable of writing commands, and thereby transferring control of the PDL/PDC to bus master controller 800 , faster than bus master controller 800 uses the PDLs or PDCs, efficient pipelining of packets occurs and bus transactions overlap with network transactions. Because of the large size of FIFOs 850 and 852 , some of the bursty nature of bus accesses can be smoothed out.
- Bus master controller 800 uses command execution queues 860 and 862 as scratch memory while transferring packet data between host memory 180 and transmit and receive buffers 830 and 835 .
- Bus master controller 800 copies the PDL pointed to by the PDA into a command FIFO in the corresponding command execution queue.
- Bus master controller 800 uses this PDL to program bus interface 820 with the location and length of each fragment to be transferred. Loading the complete PDL into scratch memory improves utilization of the interconnect bus 130 because, in most cases, the complete PDL can be transferred in one bus transaction. If bus master controller 800 were to read each fragment descriptor separately, performance would suffer because each fragment descriptor read would require a separate bus transaction including the associated arbitration latency.
- Bus master controller 800 also uses command execution queues 860 and 862 to hold PDC instructions. PDC instructions are directly transferred from the command FIFO. PDCs are executed out of the command execution queues to maintain PDL/PDC ordering and to maximize the use of the command FIFO.
- Receive and transmit PDC instructions use the same word format, shown in FIG. 3.
- the format contains buffer length 320 , block index 310 , and options field 300 .
- Block index 310 is an address into the corresponding buffer address table which contains the physical address at which data is to be transferred to or from the host memory.
- Buffer length field 320 specifies the number of bytes to be transferred during transmit operations, or the allocated host memory space required for a receive operation. If the packet data for a receive operation is larger than the allocated host memory space, bus master controller 800 fills the allocated space, sets a receive overflow flag, and then discards the remaining amount of the packet.
- Options field 300 is used to communicate special processing options to bus master controller 800 , such as whether or not an interrupt is desired immediately upon the completion of a data transfer between controller 100 and host computer system 190 .
- Bus master state machine 810 coordinates and controls all activity associated with transferring packet data between host memory 180 and transmit and receive buffers 830 and 835 during PDL and PDC modes. Transmit and receive operations are independently described. However, they are actually performed by the same state machine 810 and are interleaved as necessary. Bus master state machine 810 can be configured to give priority to a receive operation. If data reception is occurring fast enough, it will perform up to eight receive cycles for each transmit cycle.
- a transmit PDL transaction is generated by the host software, which creates a PDL describing the packet in host memory, and transfers control to bus master controller 800 by writing the PDL Address (PDA) to transmit command FIFO 850 .
- PDA PDL Address
- Writing the PDA to transmit command FIFO 850 causes the transmit command count register to increment.
- bus master controller 800 extracts the next command from the FIFO. Since every PDL must have at least one fragment, bus master controller 800 programs bus interface 820 to read the PDL header and one fragment into transmit execution queue 860 . If there is more than one fragment, bus master controller 800 reads the header to determine this fact and adjusts the number of fragments to be transferred to the execution queue. Once the complete PDL has been copied into the command execution queue, that PDA is discarded.
- bus master controller 800 checks transmit buffer 830 to ensure there is enough room to load another packet. If there is not enough room, bus master controller 800 waits until transmit buffer 830 has enough room for a new packet.
- Bus master controller 800 then proceeds to interpret each fragment descriptor and programs bus interface 820 to copy each fragment from host memory 180 to transmit buffer 830 . When all fragments have been copied to transmit buffer 830 , bus master controller 800 discards the PDL and checks the command queue to determine if another PDL is available. If so, bus master controller 800 executes the new PDL and repeats the operation described above.
- bus master controller 800 during receive transfers is similar to the transmit case with a few subtle differences.
- the host software creates a receive PDL, which describes the buffer in which to transfer the received data.
- a corresponding PDA is then written to receive command FIFO 852 , transferring control to bus master controller 800 .
- Bus master controller 800 transfers the PDL into receive execution queue 862 , even if no received packets are available in the receive data buffers. By transferring the PDL before it is actually needed, bus master controller 800 attempts to reduce the latency between receiving a packet and transferring it to host memory.
- receive command FIFO 852 can load PDA/PDC instructions in advance to further reduce transaction latency.
- bus master controller 800 uses the preloaded PDL to determine how to scatter the received packet into host memory 180 .
- Bus master controller 800 programs the necessary transactions into bus interface 820 to copy each fragment of the received packet into host memory 180 as described by the PDL fragment descriptors.
- the first word of the received packet in receive buffer 835 contains the total packet length and the receive status. This word is saved by bus master controller 800 and is transferred to the receive PDL in host memory 180 after all packet data has been copied to host memory 180 .
- the receive status field of the PDL becoming non-zero indicates that bus master controller 800 has transferred control of the buffer and associated PDL back to host computer system 190 . This process is repeated until all received packets have been transferred or the PDA receive command FIFO is exhausted.
- Transmit PDC mode provides increased performance by reducing the number of bus acquisitions required. This mode requires that physical addresses of preallocated buffers be loaded into transmit buffer address table 870 during system initialization. An operation starts with the host software gathering data fragments into a predefined contiguous memory space in one of the preallocated buffers. Once this is complete, the information needed for a PDC instruction is known. The PDC instruction is then created and transferred to the transmit command FIFO 850 , and the command count register is incremented. If an instruction is at the beginning of transmit command FIFO 850 , it is transferred to transmit execution queue 860 . The command FIFO count is then decremented and the execution queue count is incremented.
- bus master controller 800 If the execution queue count is non-zero, control passes to bus master controller 800 , which begins execution of the instruction. Bus master controller 800 decodes the instruction for the length and base index information while checking the transmit buffer flags for available packet space. With the proper information and status, bus master controller 800 configures bus interface 820 to commence transfer to the data buffer. Once the transfer is complete, the PDC instruction is discarded, the command queue count is decremented, and a complete transfer flag is set.
- a receive PDC transaction is initiated by the receive buffer 835 .
- receive buffer address table 872 Prior to the data transfer, receive buffer address table 872 , receive execution queue 862 , and receive command FIFO 852 are preloaded. With all of this information loaded in advance, the data transfer occurs with minimal overhead. This information can also be reloaded during period of non-use, or when empty, to better distribute the workload over time.
- bus master controller 800 accesses the PDC instruction on top of receive command execution queue 862 , and examines the block index and length. Bus master controller 800 uses this information to program bus interface 820 to request a bus transaction. At the same time, bus master controller 800 determines if the initial data packet will fit in the allocated memory space. If the data packet will not fit, bus master controller 800 sends data until the host memory space is filled, sets the receive overflow flag, and signals receive buffer 835 to discard the remaining portion of the data packet.
- bus master controller 800 determines if another packet can be transferred, and sends additional packets until the allocated memory space is filled. If the memory space allocated is greater in size than a bus transaction can deliver in one transaction, bus master controller 800 breaks the transfer into multiple transactions. Once the transfer is complete, the command execution count is decremented and another command is loaded. If another packet is ready in receive buffer 835 , this cycle is repeated.
- PIO programmed I/O
- PIO programmed I/O
- This mode requires, very little or no action from bus master controller 800 .
- PIO transfers one double word (32 bits) of data at a time directly to transmit buffer 830 .
- Each transfer requires a separate request for interconnect bus 130 , thus increasing total bus acquisition latency.
- Bus master controller 800 is only responsible for routing data to transmit buffer 830 .
- the host software makes certain that packet space is available, and indicates to transmit buffer 830 that the packet transfer is complete by setting appropriate flags.
- a receive PIO transfer is initiated by the host software.
- the host software ensures that a packet is available prior to transferring the packet to host computer system 190 . Double word transfers are performed across interconnect bus 130 until all packet data has been transferred.
- the host software is also responsible for maintaining packet and data integrity.
- the packet transfer process minimizes the number of interrupts necessary to interact with bus master controller 800 .
- host computer system 190 receives and transfers packets without ever taking an interrupt.
- the host software can program bus master controller 800 to generate an interrupt in the following cases: after each packet has been transferred to the transmit buffer 830 ; when the transmit command FIFO is exhausted; or upon any transmit error.
- Receive interrupts are generated after each receive packet has been transferred to host memory 180 , when the receive command FIFO 852 is exhausted, or when receive errors occurs.
- bus master controller 800 implements a “lying send” transmit policy in which a successful packet transmission is signaled to the host software as soon as possible after bus master controller 800 completes the data transfer between host memory 180 and transmit buffer 830 or receive buffer 835 .
- the packet is considered to be “transmitted” the moment bus master controller 800 has a complete copy of the packet. It is the responsibility of protocols above the driver level to ensure that packets are successfully transmitted to remote stations. If a packet is lost during transmission by bus master controller 800 , the protocol must recognize that the packet is lost and take a corrective action, such as a retransmission.
- Errors such as CRC, runt packet and long packet errors are detected by bus master controller 800 and signaled to the host software by specific bits in the receive PDL header.
- the header also contains additional information bits pertaining to the inbound packet.
- Padding of packets that are shorter than the minimum legal length for transmission is the responsibility of software on host computer system 190 .
- Bus master controller 800 supports the use of PDL and PDC data transfer methods simultaneously.
- software on host computer system 190 indicates the required transfer method by placing an appropriate command in transmit command FIFO 850 or receive command FIFO 852 .
- software on host computer system 190 initiates the process by writing a PDC to the appropriate command FIFO. If host computer system 190 wishes to transmit a packet using conventional bus master DMA, it writes a PDA to the command FIFO instead.
- intermixing PDC and PDL data transfer methods can be accomplished with just one index variable for the PDC queue, one for the PDL queue and a counter variable reflecting the free space in transmit command FIFO 850 .
- This technique works as long as each queue accommodates at least as many entries as transmit command FIFO 850 .
- CMD # of entries that can be accommodated in the command FIFO
- PDL # of entries that can be accommodated in the PDL queue
- CMD FREE+PDL used +PDC used
- the system guarantees the relationship of used PDC/PDL entries to the total number of command FIFO entries with a free count variable. Initially, the counter is set to the size of the command FIFO. Each time a transmit request is submitted to bus master controller 800 , the command FIFO free count is decremented.
- FIG. 9 is a diagram illustrating two packets packed into a single preallocated buffer within memory 180 in accordance with an aspect of the present invention.
- FIG. 9 includes a first packet, including flags 901 and length 902 .
- Flags 901 contain status information for the first packet.
- Length field 902 contains the length of the first packet.
- the first packet also includes data 903 , which contains all of the data associated with the packet.
- the second packet includes flags 911 and length 912 .
- Flags 911 contain status information for the second packet.
- Length field 912 contains the length of the second packet.
- the second packet also includes data 913 , which is the data associated with the second packet. As indicated by the ellipsis, additional packets may be included into a single preallocated buffer.
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Information Transfer Systems (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
- Computer And Data Communications (AREA)
Abstract
Description
- 1. Field of the Invention
- The present invention relates to a device for connecting a computer system to a computer network, and more particularly to a method and an apparatus for reducing bus overhead in communications between a computer system and a network interface device through which the computer system communicates with a high speed packet-switched network.
- 2. Related Art
- The advent of computer networking has given rise to devices that connect computer systems to packet-switched data networks. These devices (known as network interface controllers, or NICs) typically include interfaces to both the computer system and the packet-switched data network, as well as a buffer memory for buffering packets of data in transit between the computer system and the packet-switched data network. The interface to the computer system typically connects to a bus within the computer system, such as a PCI bus, through which data is transferred between the computer system memory and the NIC. As computer networks and NICs greatly increase in performance, communications across this bus can become an impediment to achieving high performance in communications between the computer system and the packet-switched data network.
- Three methods can be used to communicate between a computer system and a device such as a NIC. (1) Programmed I/O (PIO) operates by including explicit I/O commands in the application programs executed by the computer system. PIO can be implemented with a simple hardware and operating system design. However, it places a tremendous burden on the application program to explicitly manage communications between the computer system and the NIC. (2) Shared memory can be used to facilitate communications between the NIC and the computer system. In a shared memory system, the NIC and the computer system communicate by writing to and reading from a shared memory that exists in both the address space of the computer system and the address space of the NIC. This again leads to a simple hardware and operating system implementation, and a clean interface between the computer system and the NIC. However, it again places a burden on the application program to explicitly manage communications between the computer system and the NIC. (3) Finally, direct memory access (DMA) can be used to transfer data between the NIC and the memory of the computer system. DMA operates by allowing the NIC to perform bus operations to directly access the memory of the computer system in order to transfer data between the computer system and the NIC. A DMA system requires considerable complexity in hardware and operating system design. However, it relieves the application program of the burden of explicitly managing communications between the computer system and the NIC.
- DMA transfers between computer systems and NICs are commonly accomplished using the scatter-gather technique. In scatter-gather, a bus master device in the NIC is first instructed to obtain a command block from the memory of a host computer system. At a minimum, the command block contains a list of physical addresses for blocks within the host system memory that are to be copied to the DMA device. The command block also contains a count of the number of fragments in the command block and the overall length of the data contained in the fragments pointed to by the command block. The DMA device parses the command block, extracting the address of each fragment, and transfers the fragments from the host memory to the DMA device. This process is repeated for each fragment listed in the command block until all of the data described by the command block is copied to the DMA device.
- A significant performance bottleneck in using the scatter-gather technique for transferring data to a high speed network is the translation from virtual to physical addresses. Peripheral devices, such as a NIC, cannot use virtual memory addresses to effect the transfers, because the hardware to implement the virtual-to-physical address translation is typically located inside the CPU. This means that conversion between virtual and physical addresses must take place before transfers between a computer system and a NIC can take place. This conversion can take a great deal of time and consume a significant amount of the computer system's processing power. When data is passed to a device driver for transmission to the NIC, the driver first performs a virtual-to-physical address conversion for each buffer fragment passed down to it from the application layers above. It is possible for each buffer fragment to straddle physical pages of the memory system. Thus, more than one physical address may correspond to each virtual address converted. Consequently, several virtual-to-physical address conversions may be required for each buffer of data that is transferred from the computer system to the NIC. This can be very time-consuming because each virtual-to-physical address translation can take from tens to hundreds of CPU cycles to accomplish.
- Another significant performance impediment associated with the scatter-gather technique is its command block nature. Peripheral devices such as NICs typically connect to computer systems through a peripheral interconnect bus, such as the PCI bus. In order to transfer data to or from the computer system, devices connected to the bus contend for control of the bus. Once a device is granted control of the bus, it drives bus signal lines to transfer data to or from the computer system. The performance impediment stems from the number of times a NIC must contend for the peripheral interconnect bus when transferring data using the scatter-gather technique. Under ideal circumstances for scatter-gather, bus contention to transfer data between a NIC and an attached computer system will occur three times per buffer transferred: first, when the computer system informs the NIC that a buffer is available for its use; second when the NIC reads the command block describing the buffer; and third when the NIC transfers data to or from the buffer. In typical scenarios, at least two buffer fragments will be described in each command block. As a result, there will be at least four contentions instead of three. These additional contentions create opportunities for other devices to obtain control of the bus and thus delay transfers initiated by the NIC.
- What is needed is a method for performing DMA between a computer system and a NIC which is free from the overhead of performing virtual to physical address translations and minimizes the number of bus transactions required to initiate the DMA transfer process.
- The present invention provides a method and an apparatus for transferring data between a computer system and a network interface card that avoids virtual-to-physical address translations. The computer system allocates blocks of memory during system initialization for storing data in transit between the computer system and the NIC, and the physical addresses of these blocks of memory are stored in a table on the NIC. Consequently, address conversion is performed only once, when the memory is allocated. When a request to transfer data to the NIC is received from the upper layers, the device driver copies the data from the upper layers into the next available memory block. The device driver then formats a command and passes it to the NIC for processing. Data transfer commands are communicated to the NIC through a packet descriptor command (PDC), which is a 32-bit value subdivided into fields that completely describe the data transfer operation. The PDC contains a small ordinal value that indexes a table in the NIC, which includes a set of physical addresses of buffers preallocated by the computer system in the computer system memory. These buffers are used for storing data in transit to the NIC. The PDC also contains the length of the buffer to be copied to or from the NIC. The present invention also allows for multiple packets to be formatted into buffers and then subsequently transferred to the NIC in a single I/O operation.
- The present invention provides a number of advantages. First, virtual-to-physical address translation is avoided at run time. Second, the formatting of a packet descriptor list is greatly simplified. Third, the amount of control data transferred to the NIC by the computer system is greatly reduced. Finally, multiple packets can be transferred to the NIC in a single I/O operation, thereby making more efficient use of bandwidth on the interconnect bus.
- The present invention incurs additional overhead because the processor must move data from the application program into the data buffers in the computer system's memory before this data is transferred to the NIC. At first glance, this double copy operation appears to incur a great amount of additional processor overhead. However, this additional overhead is considerably smaller than the overhead involved in performing virtual-to-physical address translations. Each translation requires many tens (if not hundreds) of CPU cycles, and many such translations may be required for a single transfer operation. Consequently, the present invention provides a significant performance advantage for small data transfers, which represent a significant percentage of all data transfers. Hundreds of bytes can be moved to the preallocated buffer in the time it takes to perform just one virtual-to-physical address translation. Moreover, as microprocessors move to 64 and 128 bit architectures, their capacity to move data per clock will increase thereby further widening the performance advantage of the present invention over conventional scatter-gather DMA.
- Furthermore, CPU utilization may not be the primary bottleneck. In systems which move around large amounts of data, bus utilization may be the largest bottleneck. Hence, favoring bus utilization at the expense of CPU utilization is often a desirable tradeoff to make.
- Thus, the present invention can be characterized as an apparatus for facilitating communications between a computer system, including a memory and a bus, and a packet-switched network, comprising: a bus interface coupled to the bus, for communicating across the bus; a transmit buffer, for storing data to be transmitted on the packet-switched network; a transmit data path, coupled to the bus interface and the transmit buffer, for transferring data from the bus interface to the transmit buffer; a receive buffer, for storing data received from the packet-switched network; a receive data path, coupled to the bus interface and the receive buffer, for transferring data from the receive buffer to the bus interface; a buffer address table, coupled to the bus interface, for storing at least one address of at least one buffer in the memory of the computer system, the at least one buffer being preallocated by the computer system and used to store data in transit between the computer system and one of the transmit buffer and the receive buffer; and a controller coupled to the transmit buffer, the receive buffer and the buffer address table, for controlling the transfer of data from the computer system to the transmit buffer, and from the receive buffer to the computer system.
- According to an aspect of the present invention, the apparatus includes: a transmit command queue coupled to the bus interface and the controller, for storing transmit commands from the computer system; and a transmit execution queue, coupled to the bus interface, the transmit command queue and the controller, for storing and processing commands from the transmit command queue, and command blocks from the computer system which are referenced by commands from the transmit command queue.
- According to another aspect of the present invention, the apparatus includes a receive command queue coupled to the bus interface and the controller, for storing receive commands from the computer system; and a receive execution queue, coupled to the bus interface, the receive command queue and the controller, for storing and processing commands from the receive command queue and command blocks from the computer system that are referenced by commands from the receive command queue.
- According to another aspect of the present invention, the controller includes a mechanism to transfer a plurality of packets in a single operation between the at least one buffer preallocated by the computer system and the transmit buffer.
- According to another aspect of the present invention, the controller includes a mechanism to transfer a plurality of packets in a single operation between the receive buffer and the at least one buffer preallocated by the computer system.
- The present invention can also be characterized as a method for transferring data between a computer system and a network interface device, the network interface device being coupled to a packet-switched network, and the computer system including a memory and a communication channel, the communication channel being coupled to the network interface device, the method comprising: receiving at the network interface device at least one address of a preallocated buffer in the memory; storing in the network interface device the at least one address of the preallocated buffer; receiving a command from the computer system through the communication channel, the command indicating that a transfer between the network interface device and the computer system is to take place; retrieving an address from the at least one address of a preallocated buffer stored in the network interface device; using the address to transfer data from the preallocated buffer in the memory to the network interface device if the command is a transmit command; and using the address to transfer data from the network interface device to the preallocated buffer in the memory if the command is a receive command.
- FIG. 1 is a block diagram illustrating some of the major functional components of a system for coupling
host computer system 190 with ahigh speed network 160 in accordance with an aspect of the present invention. - FIG. 2 is a block diagram illustrating the format of a packet descriptor list in accordance with an aspect of the present invention.
- FIG. 3 is a block diagram illustrating the structure of a simplified packet descriptor command for initiating a transfer of data between a NIC and a computer system in accordance with an aspect of the present invention.
- FIG. 4 is a diagram illustrating the sequence of commands and data transfers involved in transferring data from
host computer system 190 to a NIC in accordance with an aspect of the present invention. - FIG. 5 is a diagram illustrating the sequence of commands and data transfers involved in transferring data from
host computer system 150 to a NIC in accordance with an aspect of the present invention. - FIG. 6 illustrates the sequence of commands and data transfers involved in transferring data from a NIC to a
host computer system 150 in accordance with an aspect of the present invention. - FIG. 7 is a diagram illustrating the sequence of commands and data transfers involved in transferring data from a NIC to a
host computer system 150 in accordance with an aspect of the present invention. - FIG. 8 is a block diagram illustrating some of the major functional components within a NIC in accordance with an aspect of the present invention.
- FIG. 9 is a block diagram illustrating the structure of a preallocated buffer, including a plurality of packets for transmission to a NIC, in accordance with an aspect of the present invention.
- The following description is presented to enable any person skilled in the art to make and use the invention, and is provided in the context of a particular application and its requirements. Various modifications to the disclosed embodiments will be readily apparent to those skilled in the art, and the general principles defined herein may be applied to other embodiments and applications without departing from the spirit and scope of the present invention. Thus, the present invention is not intended to be limited to the embodiments shown, but is to be accorded the widest scope consistent with the principles and features disclosed herein.
- FIG. 1 is a block diagram illustrating some of the major functional components of a
host computer system 190, which connects to ahigh speed network 160 throughnetwork interface 195, in accordance with an aspect of the present invention.Host computer system 190 includesinterconnect bus 130,host bridge 110, host bus 140,host processor 120 andmemory 180.Host processor 120 connects to host bus 140, which also connects to hostbridge 110.Host processor 120 can be any type of processor system including a device controller, a microprocessor, or a mainframe computing system. Host bus 140 is a bus which connectshost processor 120 tohost bridge 110.Host bridge 110 includescache controller 112 which connects tomemory 180.Memory 180 includespre-allocated buffers 182, which are buffers preallocated byhost processor 120 during system initialization. These buffers are used to store data in transit betweenhost computer system 190 andhigh speed network 160.Host bridge 110 additionally connects to interconnectbus 130.Interconnect bus 130 is used to connecthost computer system 190 to peripheral devices, such asnetwork interface 195.Interconnect bus 130 may be any type of commonly used interconnection bus, such as a PCI bus. -
High speed network 160 is any type of high speed data network, including 100 megabit and gigabit Ethernet networks for example.Network interface 195 may be implemented on a separate computer card, or it may be integrated into a computer system motherboard. It may also be integrated into a single silicon chip.Network interface 195 includesphysical layer interface 150 andcontroller 100.Controller 100 is coupled tointerconnect bus 130 withinhost computer system 190.Controller 100 additionally connects tophysical layer interface 150, which connects tohigh speed network 160.Controller 100 performs the DMA functions involved in transferring data betweenmemory 180 withinhost computer system 190, andhigh speed network 160.Controller 100 includes buffer address table 105, which includes a plurality of addresses of buffers inpreallocated buffers 182 withinmemory 180.Physical layer interface 150 includes resources for performing communications acrosshigh speed network 160. - FIG. 2 is a diagram illustrating the structure of a packet descriptor list (PDL) in accordance with an aspect of the present invention. The PDL illustrated in FIG. 2 specifies a transmission of data including the transmission of N separate fragments scattered through
memory 180 withinhost computer system 190. The PDL includes status field 200, which contains information regarding the status of the data transfer corresponding to the PDL. It also includes a number offragments field 210, which indicates the number of fragments associated with the data transfer specified by the PDL. The PDL also includes apacket length 220, which indicates the length of the entire transfer, including the plurality of associated fragments. The PDL additionally includes a number of address/length pairs. Address offragment 1 230 includes the physical address forfragment 1 withinmemory 180 ofhost computer system 190. Length offragment 1 235 includes the length offragment 1. Address offragment 2 240 includes the address withinmemory 180 offragment 2. Length offragment 2 245 includes the length offragment 2. Next, there are a number of intervening fragments, and then address offragment N 250, which contains the address of fragment N withinmemory 180. The PDL additionally contains length offragment N 255, which includes the length of fragment N. - FIG. 3 illustrates the format for a simplified command sent from
host computer system 190 tonetwork interface 195 in accordance with an aspect of the present invention. The command is known as a packet description command (PDC) and fits within a single word of memory. It includesoptions field 300, which is a field indicating the processing options for a transmission betweenhost computer system 190 andnetwork interface 195. It also includesblock index field 310, which indexes a buffer address table withincontroller 100, containing the physical address of a preallocated buffer withinmemory 180 inhost computer system 190. Finally, it includesbuffer length 320, which is the length of the data within the pre-allocated buffer that is to be transferred betweennetwork interface 195 andhost computer system 190. Note that this simplified command format does not require a “number of fragments” field because only one fragment is sent. It also does not require a separate address and length for each fragment because multiple fragments are concatenated together within a single pre-allocated buffer to be transferred in a single operation. - FIG. 4 illustrates the sequence of operations involved in transferring data from
host computer system 190 tocontroller 100 withinnetwork interface 195 using a prior art packet descriptor list command format in accordance with an aspect of the present invention. First,host computer system 190 writes a packet descriptor address tocontroller 100. Next,controller 100 uses this packet descriptor address to retrieve acommand block 410 fromhost computer system 190. Once this command block is retrieved,controller 100 performs a series of 420, 430 and 440, to retrieve individual fragments fromretrieval operations host computer system 190 intocontroller 100. - FIG. 5 illustrates the sequence of operations required to move data from
host computer system 190 tocontroller 100 using a packet descriptor command format in accordance with an aspect of the present invention. The greatly simplified sequence of operations in this example results from the simplified command format and the preallocation of buffers withinmemory 180. First,host computer system 190 writes apacket descriptor command 500 tocontroller 100.Controller 100 uses theblock index 310 within this packet descriptor command as well asbuffer length 320 to retrieve abuffer 510 withinmemory 180. - FIG. 6 illustrates the sequence of operations required to transfer data from
controller 100 tohost computer system 190 using a prior art packet descriptor list command format in accordance with an aspect of the present invention. First, a packet descriptor address is pre-loaded 600 intocontroller 100 sometime before the incoming data is received atcontroller 100 fromhigh speed network 160. Next, the packet descriptor address is used to retrieve acommand block 610, including a packet descriptor list fromhost computer system 190. Next, when the incoming data is received fromhigh speed network 160, a series of transfers, 620, 630 and 640, take place betweencontroller 100 andhost computer system 190 to transfer all of the constituent fragments tohost computer system 190. - In contrast, FIG. 7 presents a greatly simplified series of transactions required to move data from
controller 100 tohost computer system 190 using a packet descriptor command format in accordance with an aspect of the present invention. First, a packet descriptor command is preloaded 700 intocontroller 100 fromhost computer system 190 before data is received atcontroller 100. Next, when data is finally received atcontroller 100 fromhigh speed network 160, this data is transferred to a buffer withinhost computer system 190 in a single transaction. - Although optimum performance is attained when the
packet descriptor command 700 is preloaded intocontroller 100, this sequence is not a requirement.Controller 100 can buffer data until such time thathost computer system 190 loads a packet descriptor command intocontroller 100. - FIG. 8 is a block diagram illustrating some of the major functional components of
controller 100 in FIG. 1 in accordance with an aspect of the present invention.Controller 100 includesbus master controller 800, which is coupled to transmitbuffer 830 and receivebuffer 835. Transmitbuffer 830 and receivebuffer 835 are used to store data to be transmitted and received fromhigh speed network 160 pictured in FIG. 1.Bus master controller 800 additionally connects tobus interface 820, which implements bus interface functions for a connection ontointerconnect bus 130 withinhost computer system 190 in FIG. 1.Bus master controller 800 includes transmitcommand FIFO 850, which stores transmit commands fromhost computer system 190. Transmitcommand FIFO 850 is coupled to transmitexecution queue 860. Transmitexecution queue 860 contains expanded commands from transmitcommand FIFO 850. If the command in transmitcommand FIFO 850 is a packet descriptor address, the corresponding command block is retrieved and placed into transmitexecution queue 860. If the command is a packet descriptor command, the command is directly transferred to transmitexecution queue 860. Transmitexecution queue 860 is additionally coupled to transmit buffer address table 870, which contains physical addresses of preallocated buffers withinmemory 180 inhost computer system 190. -
Bus master controller 800 additionally includes receivecommand FIFO 852, which contains receive commands fromhost computer system 190 that are preloaded into receivecommand FIFO 852. Receivecommand FIFO 852 is coupled to receiveexecution queue 862, which contains expanded commands from receivecommand FIFO 852. Again, packet descriptor addresses within receivecommand FIFO 852 are expanded into corresponding command blocks which are loaded into receiveexecution queue 862. Packet descriptor commands are directly loaded into receiveexecution queue 862.Bus master controller 800 also includes receive buffer address table 872, which contains a table of physical addresses of pre-allocated buffers for storing data received fromhigh speed network 160. -
Bus master controller 800 additionally includes byte alignerendian mode circuitry 840 and byte alignerendian mode circuitry 842. Byte alignerendian mode circuitry 840 is coupled betweenbus interface 820 and transmitbuffer 830. It performs byte alignment and endian mode reversal functions for control information associated with data transmissions frombus interface 820 to transmitbuffer 830. Byte aligner andendian mode circuitry 842 is coupled between receivebuffer 835 andbus interface 820, and provides the same byte alignment and endian mode reversal functions for status information associated with data transmitted from receivebuffer 835 andbus interface 820 in accordance with an aspect of the present invention. -
Bus master controller 800 additionally includes busmaster state machine 810, which is coupled to all of the functional components withinbus master controller 800, and is additionally coupled tobus interface 820, transmitbuffer 830 and receivebuffer 835. Busmaster state machine 810 coordinates actions of the components within FIG. 8 to transfer data betweenbus interface 820 and transmit and receive 830 and 835.buffers -
Bus master controller 800 is responsible for data flow between transmitbuffer 830, receivebuffer 835 andbus interface 820. It includes five major components: busmaster state machine 810, byte aligner 840 and 842, transmit buffer address table 870, receive buffer address table 872, receiveendian mode circuitry command FIFO 852 with receiveexecution queue 862, and transmitcommand FIFO 850 with transmitexecution queue 860. -
Bus master controller 800 supports three modes of operation: programmed I/O (PIO), packet descriptor list (PDL), and packet descriptor command (PDC). The PDC mode is also known as PROPULSION(tm) technology.Bus master controller 800 decodes and controls transactions and routing of data required by the operating modes. As a shorthand for references to the address of a PDL, the term PDA is used throughout the remainder of the text. -
860 and 862 are used in the PDL and PDC modes of operation to hold either a packet descriptor address or a packet descriptor command. A PDA provides the address where a corresponding packet descriptor list is obtained, while a PDC is used to execute a PROPULSION(tm) transaction. The command execution queue contains the loaded PDL and/or PDC instructions from the command FIFO. The PDL and PDC commands are executed from the command execution queues.Command execution queues - Packet descriptor lists are the data structures used to communicate information about transmit and receive packets. Both transmit and receive PDLs use the same format, shown in FIG. 2. Each PDL contains a packet descriptor header and one or more fragment descriptors describing the location and length of the packet data in host memory. In the case of a transmit PDL, the PDL describes the location and length of fragments that comprise the total packet.
Packet length field 220 includes the sum of the length fields in the fragment descriptors. For receive PDLs, the length field is also the sum of the fragment lengths, however this field is overwritten with the actual length of the packet after the packet is received. When transferring a received packet from receivebuffer 835 to hostmemory 180,bus master controller 800 scatters the packet across the locations described by each fragment descriptor. Note that the fragment lengths are not overwritten, so the last fragment transferred may contain less data than is indicated by the corresponding fragment length field. The header length field also indicates how many fragments are completely filled and how much data is in the last fragment. If the buffer described by the receive PDL is not large enough to hold the complete packet, a receive overflow error is generated and the remaining data is discarded. - Transmit
execution queue 860 and receiveexecution queue 862 are independent FIFOs containing 32 and 64 entries, respectively, in a preferred embodiment. They contain either a PDA or a PDC instructions. - By writing a PDA or PDC to either a transmit
command FIFO 850 or receivecommand FIFO 852, the host software transfers control of the buffer tobus master controller 800. Each time thehost computer system 190 writes a PDA or PDC to a command FIFO, the FIFO's command count register is incremented. Afterbus master controller 800 has processed the PDL pointed to by a PDA or the PDC, the PDA or the PDC is removed from the ring and the command count register is decremented. - The host software uses a command FIFO count register to determine how many PDA or PDC commands are currently owned by
bus master controller 800. If the host is capable of writing commands, and thereby transferring control of the PDL/PDC tobus master controller 800, faster thanbus master controller 800 uses the PDLs or PDCs, efficient pipelining of packets occurs and bus transactions overlap with network transactions. Because of the large size of 850 and 852, some of the bursty nature of bus accesses can be smoothed out.FIFOs -
Bus master controller 800 uses 860 and 862 as scratch memory while transferring packet data betweencommand execution queues host memory 180 and transmit and receive 830 and 835.buffers Bus master controller 800 copies the PDL pointed to by the PDA into a command FIFO in the corresponding command execution queue.Bus master controller 800 uses this PDL toprogram bus interface 820 with the location and length of each fragment to be transferred. Loading the complete PDL into scratch memory improves utilization of theinterconnect bus 130 because, in most cases, the complete PDL can be transferred in one bus transaction. Ifbus master controller 800 were to read each fragment descriptor separately, performance would suffer because each fragment descriptor read would require a separate bus transaction including the associated arbitration latency. -
Bus master controller 800 also uses 860 and 862 to hold PDC instructions. PDC instructions are directly transferred from the command FIFO. PDCs are executed out of the command execution queues to maintain PDL/PDC ordering and to maximize the use of the command FIFO.command execution queues - Receive and transmit PDC instructions use the same word format, shown in FIG. 3. The format contains
buffer length 320,block index 310, andoptions field 300.Block index 310 is an address into the corresponding buffer address table which contains the physical address at which data is to be transferred to or from the host memory.Buffer length field 320 specifies the number of bytes to be transferred during transmit operations, or the allocated host memory space required for a receive operation. If the packet data for a receive operation is larger than the allocated host memory space,bus master controller 800 fills the allocated space, sets a receive overflow flag, and then discards the remaining amount of the packet. Options field 300 is used to communicate special processing options tobus master controller 800, such as whether or not an interrupt is desired immediately upon the completion of a data transfer betweencontroller 100 andhost computer system 190. - Bus
master state machine 810 coordinates and controls all activity associated with transferring packet data betweenhost memory 180 and transmit and receive 830 and 835 during PDL and PDC modes. Transmit and receive operations are independently described. However, they are actually performed by thebuffers same state machine 810 and are interleaved as necessary. Busmaster state machine 810 can be configured to give priority to a receive operation. If data reception is occurring fast enough, it will perform up to eight receive cycles for each transmit cycle. - A transmit PDL transaction is generated by the host software, which creates a PDL describing the packet in host memory, and transfers control to
bus master controller 800 by writing the PDL Address (PDA) to transmitcommand FIFO 850. Writing the PDA to transmitcommand FIFO 850 causes the transmit command count register to increment. When the command count register is greater than zero,bus master controller 800 extracts the next command from the FIFO. Since every PDL must have at least one fragment,bus master controller 800programs bus interface 820 to read the PDL header and one fragment into transmitexecution queue 860. If there is more than one fragment,bus master controller 800 reads the header to determine this fact and adjusts the number of fragments to be transferred to the execution queue. Once the complete PDL has been copied into the command execution queue, that PDA is discarded. - Because the PDL header has the total length of the packet data,
bus master controller 800 checks transmitbuffer 830 to ensure there is enough room to load another packet. If there is not enough room,bus master controller 800 waits until transmitbuffer 830 has enough room for a new packet. -
Bus master controller 800 then proceeds to interpret each fragment descriptor andprograms bus interface 820 to copy each fragment fromhost memory 180 to transmitbuffer 830. When all fragments have been copied to transmitbuffer 830,bus master controller 800 discards the PDL and checks the command queue to determine if another PDL is available. If so,bus master controller 800 executes the new PDL and repeats the operation described above. - The operation of
bus master controller 800 during receive transfers is similar to the transmit case with a few subtle differences. The host software creates a receive PDL, which describes the buffer in which to transfer the received data. A corresponding PDA is then written to receivecommand FIFO 852, transferring control tobus master controller 800.Bus master controller 800 transfers the PDL into receiveexecution queue 862, even if no received packets are available in the receive data buffers. By transferring the PDL before it is actually needed,bus master controller 800 attempts to reduce the latency between receiving a packet and transferring it to host memory. - This procedure repeats until: 1) a receive packet is available or 2) receive
execution queue 862 is full. Even with receiveexecution queue 862 full, receivecommand FIFO 852 can load PDA/PDC instructions in advance to further reduce transaction latency. - When a complete packet is available in receive
buffer 835,bus master controller 800 uses the preloaded PDL to determine how to scatter the received packet intohost memory 180.Bus master controller 800 programs the necessary transactions intobus interface 820 to copy each fragment of the received packet intohost memory 180 as described by the PDL fragment descriptors. - The first word of the received packet in receive
buffer 835 contains the total packet length and the receive status. This word is saved bybus master controller 800 and is transferred to the receive PDL inhost memory 180 after all packet data has been copied to hostmemory 180. The receive status field of the PDL becoming non-zero indicates thatbus master controller 800 has transferred control of the buffer and associated PDL back tohost computer system 190. This process is repeated until all received packets have been transferred or the PDA receive command FIFO is exhausted. - Transmit PDC mode provides increased performance by reducing the number of bus acquisitions required. This mode requires that physical addresses of preallocated buffers be loaded into transmit buffer address table 870 during system initialization. An operation starts with the host software gathering data fragments into a predefined contiguous memory space in one of the preallocated buffers. Once this is complete, the information needed for a PDC instruction is known. The PDC instruction is then created and transferred to the transmit
command FIFO 850, and the command count register is incremented. If an instruction is at the beginning of transmitcommand FIFO 850, it is transferred to transmitexecution queue 860. The command FIFO count is then decremented and the execution queue count is incremented. If the execution queue count is non-zero, control passes tobus master controller 800, which begins execution of the instruction.Bus master controller 800 decodes the instruction for the length and base index information while checking the transmit buffer flags for available packet space. With the proper information and status,bus master controller 800 configuresbus interface 820 to commence transfer to the data buffer. Once the transfer is complete, the PDC instruction is discarded, the command queue count is decremented, and a complete transfer flag is set. - A receive PDC transaction is initiated by the receive
buffer 835. Prior to the data transfer, receive buffer address table 872, receiveexecution queue 862, and receivecommand FIFO 852 are preloaded. With all of this information loaded in advance, the data transfer occurs with minimal overhead. This information can also be reloaded during period of non-use, or when empty, to better distribute the workload over time. - When receive
buffer 835 indicates a packet has been received fromhigh speed network 160, control is transferred tobus master controller 800.Bus master controller 800 accesses the PDC instruction on top of receivecommand execution queue 862, and examines the block index and length.Bus master controller 800 uses this information toprogram bus interface 820 to request a bus transaction. At the same time,bus master controller 800 determines if the initial data packet will fit in the allocated memory space. If the data packet will not fit,bus master controller 800 sends data until the host memory space is filled, sets the receive overflow flag, and signals receivebuffer 835 to discard the remaining portion of the data packet. If the memory space is greater than one packet,bus master controller 800 determines if another packet can be transferred, and sends additional packets until the allocated memory space is filled. If the memory space allocated is greater in size than a bus transaction can deliver in one transaction,bus master controller 800 breaks the transfer into multiple transactions. Once the transfer is complete, the command execution count is decremented and another command is loaded. If another packet is ready in receivebuffer 835, this cycle is repeated. - The simplest form of transmit occurs during programmed I/O (PIO) mode (PIO) transfers. This mode requires, very little or no action from
bus master controller 800. PIO transfers one double word (32 bits) of data at a time directly to transmitbuffer 830. Each transfer requires a separate request forinterconnect bus 130, thus increasing total bus acquisition latency.Bus master controller 800 is only responsible for routing data to transmitbuffer 830. The host software makes certain that packet space is available, and indicates to transmitbuffer 830 that the packet transfer is complete by setting appropriate flags. - A receive PIO transfer is initiated by the host software. The host software ensures that a packet is available prior to transferring the packet to
host computer system 190. Double word transfers are performed acrossinterconnect bus 130 until all packet data has been transferred. The host software is also responsible for maintaining packet and data integrity. - The packet transfer process minimizes the number of interrupts necessary to interact with
bus master controller 800. In many cases,host computer system 190 receives and transfers packets without ever taking an interrupt. The host software can programbus master controller 800 to generate an interrupt in the following cases: after each packet has been transferred to the transmitbuffer 830; when the transmit command FIFO is exhausted; or upon any transmit error. Receive interrupts are generated after each receive packet has been transferred tohost memory 180, when the receivecommand FIFO 852 is exhausted, or when receive errors occurs. - In one embodiment,
bus master controller 800 implements a “lying send” transmit policy in which a successful packet transmission is signaled to the host software as soon as possible afterbus master controller 800 completes the data transfer betweenhost memory 180 and transmitbuffer 830 or receivebuffer 835. The packet is considered to be “transmitted” the momentbus master controller 800 has a complete copy of the packet. It is the responsibility of protocols above the driver level to ensure that packets are successfully transmitted to remote stations. If a packet is lost during transmission bybus master controller 800, the protocol must recognize that the packet is lost and take a corrective action, such as a retransmission. - Errors such as CRC, runt packet and long packet errors are detected by
bus master controller 800 and signaled to the host software by specific bits in the receive PDL header. The header also contains additional information bits pertaining to the inbound packet. - Padding of packets that are shorter than the minimum legal length for transmission is the responsibility of software on
host computer system 190. -
Bus master controller 800 supports the use of PDL and PDC data transfer methods simultaneously. When transferring data, software onhost computer system 190 indicates the required transfer method by placing an appropriate command in transmitcommand FIFO 850 or receivecommand FIFO 852. For packet transmission using the PDC data transfer method, software onhost computer system 190 initiates the process by writing a PDC to the appropriate command FIFO. Ifhost computer system 190 wishes to transmit a packet using conventional bus master DMA, it writes a PDA to the command FIFO instead. - From the perspective of
host computer system 190, intermixing PDC and PDL data transfer methods can be accomplished with just one index variable for the PDC queue, one for the PDL queue and a counter variable reflecting the free space in transmitcommand FIFO 850. This technique works as long as each queue accommodates at least as many entries as transmitcommand FIFO 850. Mathematically, we say that - CMD=# of entries that can be accommodated in the command FIFO
- PDL=# of entries that can be accommodated in the PDL queue
- PDC=# of entries that can be accommodated in the PDC queue
- FREE=# of unused entries in the command FIFO
- CMD=PDL=PDC
- CMD=FREE+PDL used+PDCused
- The PDL and PDC variables with the “used” subscript indicate the entries in the respective rings that contain transmit requests.
- The system guarantees the relationship of used PDC/PDL entries to the total number of command FIFO entries with a free count variable. Initially, the counter is set to the size of the command FIFO. Each time a transmit request is submitted to
bus master controller 800, the command FIFO free count is decremented. - When the counter reaches zero, the host software reloads the counter from a bus master status register, thereby obtaining the most recent free count. Since the system guarantees the relationship in the above equation, we see that FREE=CMD−PDL used−PDCused.
- FIG. 9 is a diagram illustrating two packets packed into a single preallocated buffer within
memory 180 in accordance with an aspect of the present invention. FIG. 9 includes a first packet, includingflags 901 andlength 902.Flags 901 contain status information for the first packet.Length field 902 contains the length of the first packet. The first packet also includesdata 903, which contains all of the data associated with the packet. The second packet includesflags 911 andlength 912.Flags 911 contain status information for the second packet.Length field 912 contains the length of the second packet. The second packet also includesdata 913, which is the data associated with the second packet. As indicated by the ellipsis, additional packets may be included into a single preallocated buffer. - The foregoing description of embodiments of the invention have been presented for purposes of illustration and description only. They are not intended to be exhaustive or to limit the invention to the forms disclosed. Obviously, many modifications and variations will be apparent to practitioners skilled in the art.
Claims (23)
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US08/899,435 US6397316B2 (en) | 1997-07-24 | 1997-07-24 | System for reducing bus overhead for communication with a network interface |
| AU75973/98A AU7597398A (en) | 1997-07-24 | 1998-05-26 | System for reducing bus overhead for communications with a network interface |
| PCT/US1998/010652 WO1999005809A1 (en) | 1997-07-24 | 1998-05-26 | System for reducing bus overhead for communications with a network interface |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US08/899,435 US6397316B2 (en) | 1997-07-24 | 1997-07-24 | System for reducing bus overhead for communication with a network interface |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20020009075A1 true US20020009075A1 (en) | 2002-01-24 |
| US6397316B2 US6397316B2 (en) | 2002-05-28 |
Family
ID=25410967
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US08/899,435 Expired - Lifetime US6397316B2 (en) | 1997-07-24 | 1997-07-24 | System for reducing bus overhead for communication with a network interface |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US6397316B2 (en) |
| AU (1) | AU7597398A (en) |
| WO (1) | WO1999005809A1 (en) |
Cited By (18)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20030086485A1 (en) * | 2001-11-08 | 2003-05-08 | John Lin | Master to multi-slave asynchronous transmit fifo |
| US20030236929A1 (en) * | 2002-06-10 | 2003-12-25 | Nath Raman Viswa | Buffer management methods and apparatus for network device drivers |
| US6721835B1 (en) * | 2000-09-19 | 2004-04-13 | Intel Corporation | Method and apparatus for minimizing bus contention for I/O controller read operations |
| US20040123056A1 (en) * | 2002-12-23 | 2004-06-24 | Intel Corporation | Interleaving memory access |
| US6963946B1 (en) | 2003-10-01 | 2005-11-08 | Advanced Micro Devices, Inc. | Descriptor management systems and methods for transferring data between a host and a peripheral |
| US20070070901A1 (en) * | 2005-09-29 | 2007-03-29 | Eliezer Aloni | Method and system for quality of service and congestion management for converged network interface devices |
| US20070223483A1 (en) * | 2005-11-12 | 2007-09-27 | Liquid Computing Corporation | High performance memory based communications interface |
| US20070283054A1 (en) * | 1997-12-17 | 2007-12-06 | Src Computers, Inc. | Switch/network adapter port incorporating shared memory resources selectively accessible by a direct execution logic element and one or more dense logic devices in a fully buffered dual in-line memory module format (fb-dimm) |
| US20070294435A1 (en) * | 2006-06-19 | 2007-12-20 | Liquid Computing Corporation | Token based flow control for data communication |
| US20080148291A1 (en) * | 2006-10-30 | 2008-06-19 | Liquid Computing Corporation | Kernel functions for inter-processor communications in high performance multi-processor systems |
| US7533154B1 (en) | 2004-02-04 | 2009-05-12 | Advanced Micro Devices, Inc. | Descriptor management systems and methods for transferring data of multiple priorities between a host and a network |
| US7689738B1 (en) | 2003-10-01 | 2010-03-30 | Advanced Micro Devices, Inc. | Peripheral devices and methods for transferring incoming data status entries from a peripheral to a host |
| US7826614B1 (en) | 2003-11-05 | 2010-11-02 | Globalfoundries Inc. | Methods and apparatus for passing initialization vector information from software to hardware to perform IPsec encryption operation |
| US8937940B2 (en) | 2011-08-12 | 2015-01-20 | International Business Machines Corporation | Optimized virtual function translation entry memory caching |
| US8954704B2 (en) | 2011-08-12 | 2015-02-10 | International Business Machines Corporation | Dynamic network adapter memory resizing and bounding for virtual function translation entry storage |
| US20160364179A1 (en) * | 2014-02-14 | 2016-12-15 | Micron Technology, Inc. | Command queuing |
| US9824004B2 (en) | 2013-10-04 | 2017-11-21 | Micron Technology, Inc. | Methods and apparatuses for requesting ready status information from a memory |
| US10108372B2 (en) | 2014-01-27 | 2018-10-23 | Micron Technology, Inc. | Methods and apparatuses for executing a plurality of queued tasks in a memory |
Families Citing this family (45)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2001013583A2 (en) | 1999-08-16 | 2001-02-22 | Iready Corporation | Internet jack |
| WO2001020416A2 (en) * | 1999-09-14 | 2001-03-22 | Siemens Aktiengesellschaft | Serial data transmission via a bus system |
| US7039717B2 (en) * | 2000-11-10 | 2006-05-02 | Nvidia Corporation | Internet modem streaming socket method |
| US20020083150A1 (en) * | 2000-12-27 | 2002-06-27 | Linden Minnick | Accessing information from memory |
| US7379475B2 (en) * | 2002-01-25 | 2008-05-27 | Nvidia Corporation | Communications processor |
| US7447795B2 (en) * | 2001-04-11 | 2008-11-04 | Chelsio Communications, Inc. | Multi-purpose switching network interface controller |
| US20020184566A1 (en) * | 2001-06-01 | 2002-12-05 | Michael Catherwood | Register pointer trap |
| US7454456B2 (en) * | 2002-02-14 | 2008-11-18 | International Business Machines Corporation | Apparatus and method of improving network performance using virtual interfaces |
| US20030208630A1 (en) * | 2002-04-19 | 2003-11-06 | Twomey John M. | Pre-assembled message buffers for a network processor |
| EP1552409B1 (en) * | 2002-08-19 | 2013-07-24 | Broadcom Corporation | One-shot rdma |
| WO2004068356A1 (en) * | 2003-01-29 | 2004-08-12 | Fujitsu Limited | Data communication system, data communication method, and data communication program |
| US20050132081A1 (en) * | 2003-12-11 | 2005-06-16 | International Business Machines Corporation | Efficient protocol processing to increase bandwidth in interrupt mode |
| US7260631B1 (en) | 2003-12-19 | 2007-08-21 | Nvidia Corporation | System and method for receiving iSCSI protocol data units |
| US8065439B1 (en) | 2003-12-19 | 2011-11-22 | Nvidia Corporation | System and method for using metadata in the context of a transport offload engine |
| US8549170B2 (en) * | 2003-12-19 | 2013-10-01 | Nvidia Corporation | Retransmission system and method for a transport offload engine |
| US7899913B2 (en) * | 2003-12-19 | 2011-03-01 | Nvidia Corporation | Connection management system and method for a transport offload engine |
| US8176545B1 (en) | 2003-12-19 | 2012-05-08 | Nvidia Corporation | Integrated policy checking system and method |
| US7624198B1 (en) | 2003-12-19 | 2009-11-24 | Nvidia Corporation | Sequence tagging system and method for transport offload engine data lists |
| US7249306B2 (en) * | 2004-02-20 | 2007-07-24 | Nvidia Corporation | System and method for generating 128-bit cyclic redundancy check values with 32-bit granularity |
| US7206872B2 (en) * | 2004-02-20 | 2007-04-17 | Nvidia Corporation | System and method for insertion of markers into a data stream |
| GB0404696D0 (en) * | 2004-03-02 | 2004-04-07 | Level 5 Networks Ltd | Dual driver interface |
| US7698413B1 (en) | 2004-04-12 | 2010-04-13 | Nvidia Corporation | Method and apparatus for accessing and maintaining socket control information for high speed network connections |
| GB0408876D0 (en) * | 2004-04-21 | 2004-05-26 | Level 5 Networks Ltd | User-level stack |
| US7831745B1 (en) | 2004-05-25 | 2010-11-09 | Chelsio Communications, Inc. | Scalable direct memory access using validation of host and scatter gather engine (SGE) generation indications |
| US7957379B2 (en) * | 2004-10-19 | 2011-06-07 | Nvidia Corporation | System and method for processing RX packets in high speed network applications using an RX FIFO buffer |
| US7831749B2 (en) * | 2005-02-03 | 2010-11-09 | Solarflare Communications, Inc. | Including descriptor queue empty events in completion events |
| US7562366B2 (en) * | 2005-02-03 | 2009-07-14 | Solarflare Communications, Inc. | Transmit completion event batching |
| JP2006259898A (en) * | 2005-03-15 | 2006-09-28 | Toshiba Corp | I / O controller, signal processing system, and data transfer method |
| US7819763B2 (en) * | 2005-04-21 | 2010-10-26 | Campbell Steven S | Baseball batting trainer |
| US7496699B2 (en) * | 2005-06-17 | 2009-02-24 | Level 5 Networks, Inc. | DMA descriptor queue read and cache write pointer arrangement |
| US7715436B1 (en) | 2005-11-18 | 2010-05-11 | Chelsio Communications, Inc. | Method for UDP transmit protocol offload processing with traffic management |
| US7660264B1 (en) | 2005-12-19 | 2010-02-09 | Chelsio Communications, Inc. | Method for traffic schedulign in intelligent network interface circuitry |
| US7660306B1 (en) | 2006-01-12 | 2010-02-09 | Chelsio Communications, Inc. | Virtualizing the operation of intelligent network interface circuitry |
| US7724658B1 (en) | 2005-08-31 | 2010-05-25 | Chelsio Communications, Inc. | Protocol offload transmit traffic management |
| US7616563B1 (en) | 2005-08-31 | 2009-11-10 | Chelsio Communications, Inc. | Method to implement an L4-L7 switch using split connections and an offloading NIC |
| US7596644B2 (en) * | 2006-01-11 | 2009-09-29 | Solarflare Communications, Inc. | Transmit rate pacing system and method |
| US7760733B1 (en) | 2005-10-13 | 2010-07-20 | Chelsio Communications, Inc. | Filtering ingress packets in network interface circuitry |
| US7984180B2 (en) | 2005-10-20 | 2011-07-19 | Solarflare Communications, Inc. | Hashing algorithm for network receive filtering |
| US7721023B2 (en) * | 2005-11-15 | 2010-05-18 | International Business Machines Corporation | I/O address translation method for specifying a relaxed ordering for I/O accesses |
| US8935406B1 (en) | 2007-04-16 | 2015-01-13 | Chelsio Communications, Inc. | Network adaptor configured for connection establishment offload |
| US7826350B1 (en) | 2007-05-11 | 2010-11-02 | Chelsio Communications, Inc. | Intelligent network adaptor with adaptive direct data placement scheme |
| US8589587B1 (en) | 2007-05-11 | 2013-11-19 | Chelsio Communications, Inc. | Protocol offload in intelligent network adaptor, including application level signalling |
| US8060644B1 (en) | 2007-05-11 | 2011-11-15 | Chelsio Communications, Inc. | Intelligent network adaptor with end-to-end flow control |
| US7831720B1 (en) | 2007-05-17 | 2010-11-09 | Chelsio Communications, Inc. | Full offload of stateful connections, with partial connection offload |
| US8412862B2 (en) | 2008-12-18 | 2013-04-02 | International Business Machines Corporation | Direct memory access transfer efficiency |
Family Cites Families (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4419728A (en) | 1981-06-22 | 1983-12-06 | Bell Telephone Laboratories, Incorporated | Channel interface circuit providing virtual channel number translation and direct memory access |
| JPH0787461B2 (en) * | 1987-06-19 | 1995-09-20 | 株式会社東芝 | Local Area Network System |
| US4888691A (en) | 1988-03-09 | 1989-12-19 | Prime Computer, Inc. | Method for disk I/O transfer |
| CA1338639C (en) * | 1989-09-26 | 1996-10-08 | Seiichi Kubo | Communication control device |
| US5315708A (en) * | 1990-02-28 | 1994-05-24 | Micro Technology, Inc. | Method and apparatus for transferring data through a staging memory |
| US5379296A (en) * | 1992-12-31 | 1995-01-03 | Unisys Corporation | Method and apparatus for interfacing a workstation to a plurality of computer platforms |
| US5522050A (en) | 1993-05-28 | 1996-05-28 | International Business Machines Corporation | Bus-to-bus bridge for a multiple bus information handling system that optimizes data transfers between a system bus and a peripheral bus |
| JP3531074B2 (en) * | 1994-11-28 | 2004-05-24 | 富士通株式会社 | Storage device control system |
| US5893926A (en) * | 1995-12-08 | 1999-04-13 | International Business Machines Corporation | Data buffering technique in computer system |
| US5983332A (en) * | 1996-07-01 | 1999-11-09 | Sun Microsystems, Inc. | Asynchronous transfer mode (ATM) segmentation and reassembly unit virtual address translation unit architecture |
| US5937436A (en) * | 1996-07-01 | 1999-08-10 | Sun Microsystems, Inc | Network interface circuit including an address translation unit and flush control circuit and method for checking for invalid address translations |
-
1997
- 1997-07-24 US US08/899,435 patent/US6397316B2/en not_active Expired - Lifetime
-
1998
- 1998-05-26 AU AU75973/98A patent/AU7597398A/en not_active Abandoned
- 1998-05-26 WO PCT/US1998/010652 patent/WO1999005809A1/en active Application Filing
Cited By (43)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20070283054A1 (en) * | 1997-12-17 | 2007-12-06 | Src Computers, Inc. | Switch/network adapter port incorporating shared memory resources selectively accessible by a direct execution logic element and one or more dense logic devices in a fully buffered dual in-line memory module format (fb-dimm) |
| US7680968B2 (en) * | 1997-12-17 | 2010-03-16 | Src Computers, Inc. | Switch/network adapter port incorporating shared memory resources selectively accessible by a direct execution logic element and one or more dense logic devices in a fully buffered dual in-line memory module format (FB-DIMM) |
| US6721835B1 (en) * | 2000-09-19 | 2004-04-13 | Intel Corporation | Method and apparatus for minimizing bus contention for I/O controller read operations |
| US20030086485A1 (en) * | 2001-11-08 | 2003-05-08 | John Lin | Master to multi-slave asynchronous transmit fifo |
| US20030236929A1 (en) * | 2002-06-10 | 2003-12-25 | Nath Raman Viswa | Buffer management methods and apparatus for network device drivers |
| US6826634B2 (en) * | 2002-06-10 | 2004-11-30 | Sun Microsystems, Inc. | Extended message block for network device drivers |
| US6944728B2 (en) * | 2002-12-23 | 2005-09-13 | Intel Corporation | Interleaving memory access |
| US20040123056A1 (en) * | 2002-12-23 | 2004-06-24 | Intel Corporation | Interleaving memory access |
| US7689738B1 (en) | 2003-10-01 | 2010-03-30 | Advanced Micro Devices, Inc. | Peripheral devices and methods for transferring incoming data status entries from a peripheral to a host |
| US6963946B1 (en) | 2003-10-01 | 2005-11-08 | Advanced Micro Devices, Inc. | Descriptor management systems and methods for transferring data between a host and a peripheral |
| US8504728B1 (en) | 2003-11-05 | 2013-08-06 | Globalfoundries Inc. | Network interface with secondary data and packet information storage and memory control systems to accommodate out-of-order data processing and split transactions on a host system bus |
| US7826614B1 (en) | 2003-11-05 | 2010-11-02 | Globalfoundries Inc. | Methods and apparatus for passing initialization vector information from software to hardware to perform IPsec encryption operation |
| US7533154B1 (en) | 2004-02-04 | 2009-05-12 | Advanced Micro Devices, Inc. | Descriptor management systems and methods for transferring data of multiple priorities between a host and a network |
| US8660137B2 (en) * | 2005-09-29 | 2014-02-25 | Broadcom Israel Research, Ltd. | Method and system for quality of service and congestion management for converged network interface devices |
| US20070070901A1 (en) * | 2005-09-29 | 2007-03-29 | Eliezer Aloni | Method and system for quality of service and congestion management for converged network interface devices |
| USRE47756E1 (en) | 2005-11-12 | 2019-12-03 | Iii Holdings 1, Llc | High performance memory based communications interface |
| US8284802B2 (en) | 2005-11-12 | 2012-10-09 | Liquid Computing Corporation | High performance memory based communications interface |
| US20110087721A1 (en) * | 2005-11-12 | 2011-04-14 | Liquid Computing Corporation | High performance memory based communications interface |
| US20070223483A1 (en) * | 2005-11-12 | 2007-09-27 | Liquid Computing Corporation | High performance memory based communications interface |
| US7773630B2 (en) * | 2005-11-12 | 2010-08-10 | Liquid Computing Corportation | High performance memory based communications interface |
| US20070294435A1 (en) * | 2006-06-19 | 2007-12-20 | Liquid Computing Corporation | Token based flow control for data communication |
| US20070291778A1 (en) * | 2006-06-19 | 2007-12-20 | Liquid Computing Corporation | Methods and systems for reliable data transmission using selective retransmission |
| US7908372B2 (en) | 2006-06-19 | 2011-03-15 | Liquid Computing Corporation | Token based flow control for data communication |
| US7664026B2 (en) | 2006-06-19 | 2010-02-16 | Liquid Computing Corporation | Methods and systems for reliable data transmission using selective retransmission |
| US20070294426A1 (en) * | 2006-06-19 | 2007-12-20 | Liquid Computing Corporation | Methods, systems and protocols for application to application communications |
| US20070299970A1 (en) * | 2006-06-19 | 2007-12-27 | Liquid Computing Corporation | Secure handle for intra- and inter-processor communications |
| US8631106B2 (en) | 2006-06-19 | 2014-01-14 | Kaiyuan Huang | Secure handle for intra- and inter-processor communications |
| US7873964B2 (en) | 2006-10-30 | 2011-01-18 | Liquid Computing Corporation | Kernel functions for inter-processor communications in high performance multi-processor systems |
| US20080148291A1 (en) * | 2006-10-30 | 2008-06-19 | Liquid Computing Corporation | Kernel functions for inter-processor communications in high performance multi-processor systems |
| US8937940B2 (en) | 2011-08-12 | 2015-01-20 | International Business Machines Corporation | Optimized virtual function translation entry memory caching |
| US8954704B2 (en) | 2011-08-12 | 2015-02-10 | International Business Machines Corporation | Dynamic network adapter memory resizing and bounding for virtual function translation entry storage |
| US8959310B2 (en) | 2011-08-12 | 2015-02-17 | International Business Machines Corporation | Dynamic network adapter memory resizing and bounding for virtual function translation entry storage |
| US11151027B2 (en) | 2013-10-04 | 2021-10-19 | Micron Technology, Inc. | Methods and apparatuses for requesting ready status information from a memory |
| US10445228B2 (en) | 2013-10-04 | 2019-10-15 | Micron Technology, Inc. | Methods and apparatuses for requesting ready status information from a memory |
| US9824004B2 (en) | 2013-10-04 | 2017-11-21 | Micron Technology, Inc. | Methods and apparatuses for requesting ready status information from a memory |
| US10108372B2 (en) | 2014-01-27 | 2018-10-23 | Micron Technology, Inc. | Methods and apparatuses for executing a plurality of queued tasks in a memory |
| US11023167B2 (en) | 2014-01-27 | 2021-06-01 | Micron Technology, Inc. | Methods and apparatuses for executing a plurality of queued tasks in a memory |
| US10146477B2 (en) * | 2014-02-14 | 2018-12-04 | Micron Technology, Inc. | Command queuing |
| US10884661B2 (en) | 2014-02-14 | 2021-01-05 | Micron Technology, Inc. | Command queuing |
| US20160364179A1 (en) * | 2014-02-14 | 2016-12-15 | Micron Technology, Inc. | Command queuing |
| US11494122B2 (en) | 2014-02-14 | 2022-11-08 | Micron Technology, Inc. | Command queuing |
| US11954370B2 (en) | 2014-02-14 | 2024-04-09 | Lodestar Licensing Group Llc | Command queuing |
| US12366996B2 (en) | 2014-02-14 | 2025-07-22 | Lodestar Licensing Group Llc | Command queuing |
Also Published As
| Publication number | Publication date |
|---|---|
| AU7597398A (en) | 1999-02-16 |
| US6397316B2 (en) | 2002-05-28 |
| WO1999005809A1 (en) | 1999-02-04 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US6397316B2 (en) | System for reducing bus overhead for communication with a network interface | |
| US6065087A (en) | Architecture for a high-performance network/bus multiplexer interconnecting a network and a bus that transport data using multiple protocols | |
| KR100437146B1 (en) | Intelligent network interface device and system for accelerating communication | |
| US7464199B2 (en) | Method, system, and program for handling Input/Output commands | |
| US5752078A (en) | System for minimizing latency data reception and handling data packet error if detected while transferring data packet from adapter memory to host memory | |
| EP1358562B1 (en) | Method and apparatus for controlling flow of data between data processing systems via a memory | |
| JP2539614B2 (en) | Apparatus and method for generating pointer address | |
| JP3863912B2 (en) | Automatic start device for data transmission | |
| US6049842A (en) | Efficient data transfer mechanism for input/output devices | |
| US5745684A (en) | Apparatus and method for providing a generic interface between a host system and an asynchronous transfer mode core functional block | |
| US6754735B2 (en) | Single descriptor scatter gather data transfer to or from a host processor | |
| US6823403B2 (en) | DMA mechanism for high-speed packet bus | |
| US20020196806A1 (en) | Method and apparatus for multi context direct memory access | |
| EP0141742A2 (en) | Buffer system for input/output portion of digital data processing system | |
| EP0498201A2 (en) | Generic high bandwidth adapter architecture | |
| US6128674A (en) | Method of minimizing host CPU utilization in driving an adapter by residing in system memory a command/status block a soft interrupt block and a status block queue | |
| JPH08288941A (en) | Computer system and message transfer method | |
| CN110188059B (en) | Flow control type FIFO (first in first out) cache device and method for unified configuration of data valid bits | |
| EP0752665B1 (en) | Method and apparatus for coordinating data transfer between hardware and software | |
| US5581741A (en) | Programmable unit for controlling and interfacing of I/O busses of dissimilar data processing systems | |
| US20060018330A1 (en) | Method, system, and program for managing memory requests by devices | |
| US6856619B1 (en) | Computer network controller | |
| EP0772831B1 (en) | Bidirectional parallel signal interface | |
| US5923852A (en) | Method and system for fast data transmissions in a processing system utilizing interrupts | |
| US6801963B2 (en) | Method, system, and program for configuring components on a bus for input/output operations |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: JATO TECHNOLOGIES, INC., TEXAS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:FESAS, NESTOR A., JR.;REEL/FRAME:009001/0908 Effective date: 19980127 Owner name: JATO TECHNOLOGIES, INC., TEXAS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:FESAS, NESTOR A., JR.;REEL/FRAME:009106/0295 Effective date: 19980127 |
|
| AS | Assignment |
Owner name: LEVEL ONE COMMUNICATIONS, INCORPORATED, CALIFORNIA Free format text: SECURITY AGREEMENT;ASSIGNOR:JATO TECHNOLOGIES, INC.;REEL/FRAME:009565/0351 Effective date: 19981021 |
|
| AS | Assignment |
Owner name: LEVEL ONE COMMUNICATIONS, INCORPORATED, CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:JATO TECHNOLOGIES, INC.;REEL/FRAME:010898/0036 Effective date: 20000519 |
|
| FEPP | Fee payment procedure |
Free format text: PAT HOLDER NO LONGER CLAIMS SMALL ENTITY STATUS, ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: STOL); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
| REFU | Refund |
Free format text: REFUND - SURCHARGE, PETITION TO ACCEPT PYMT AFTER EXP, UNINTENTIONAL (ORIGINAL EVENT CODE: R2551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
| FEPP | Fee payment procedure |
Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
| FPAY | Fee payment |
Year of fee payment: 4 |
|
| FEPP | Fee payment procedure |
Free format text: PETITION RELATED TO MAINTENANCE FEES FILED (ORIGINAL EVENT CODE: PMFP); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Free format text: PETITION RELATED TO MAINTENANCE FEES GRANTED (ORIGINAL EVENT CODE: PMFG); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
| REMI | Maintenance fee reminder mailed | ||
| REIN | Reinstatement after maintenance fee payment confirmed | ||
| FP | Lapsed due to failure to pay maintenance fee |
Effective date: 20100528 |
|
| FPAY | Fee payment |
Year of fee payment: 8 |
|
| PRDP | Patent reinstated due to the acceptance of a late maintenance fee |
Effective date: 20101018 |
|
| STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
| SULP | Surcharge for late payment | ||
| FPAY | Fee payment |
Year of fee payment: 12 |