CN110690991B - Non-blocking network reduction computing device and method based on logic tree - Google Patents

Non-blocking network reduction computing device and method based on logic tree Download PDF

Info

Publication number
CN110690991B
CN110690991B CN201910852825.9A CN201910852825A CN110690991B CN 110690991 B CN110690991 B CN 110690991B CN 201910852825 A CN201910852825 A CN 201910852825A CN 110690991 B CN110690991 B CN 110690991B
Authority
CN
China
Prior art keywords
reduction
calculation
network
data packet
matching
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910852825.9A
Other languages
Chinese (zh)
Other versions
CN110690991A (en
Inventor
高剑刚
卢宏生
任秀江
曹志强
周建毅
冯冬明
王梦嘉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuxi Jiangnan Computing Technology Institute
Original Assignee
Wuxi Jiangnan Computing Technology Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuxi Jiangnan Computing Technology Institute filed Critical Wuxi Jiangnan Computing Technology Institute
Priority to CN201910852825.9A priority Critical patent/CN110690991B/en
Publication of CN110690991A publication Critical patent/CN110690991A/en
Application granted granted Critical
Publication of CN110690991B publication Critical patent/CN110690991B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/08Configuration management of networks or network elements
    • H04L41/0893Assignment of logical groups to network elements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/08Configuration management of networks or network elements
    • H04L41/0803Configuration setting
    • H04L41/0823Configuration setting characterised by the purposes of a change of settings, e.g. optimising configuration for enhancing reliability
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/104Peer-to-peer [P2P] networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

A non-blocking network reduction computing device based on a logic tree belongs to the technical field of hardware integrated circuits. The device comprises a network packet receiving module, a network packet matching module and a data packet transmitting module, wherein the network packet receiving module is used for receiving the reduction data packet transmitted on the cache network and transmitting the reduction data packet to the network packet matching module; the network packet matching module is used for matching the control information of the reduction data packet with the integrated message state record, and after the control information of the reduction data packet is successfully matched with the integrated message state record, the reduction data packet is sent to the reduction calculation module and triggers the reduction calculation module to start calculation; the reduction calculation module is used for performing local reduction calculation and network reduction calculation; and the network packet sending module is used for sending the reduction calculation result after the calculation to the reduction communication indication object. The method is realized by adopting the device. The invention can automatically complete the functions of set ID matching, reduction data calculation, reduction result sending and the like in the reduction communication process, can accelerate the processing of set reduction communication, reduces the disturbance of the set reduction communication to a CPU of a processor, and improves the performance of the set reduction communication.

Description

Non-blocking network reduction computing device and method based on logic tree
Technical Field
The invention belongs to the technical field of hardware integrated circuits, and particularly relates to a non-blocking network reduction computing device and method based on a logic tree.
Background
In a high performance computing system, there is aggregate communication in which multiple nodes participate, in addition to point-to-point communication. The number of nodes participating in the collective communication is uncertain, and is determined by the task operation, and the characteristic makes the hardware implementation of the collective communication more difficult than the point-to-point communication.
In the collective communication, there is also a communication type that needs to calculate the communication data, that is, after calculating the data on all the nodes, the result is returned to all the nodes, and this communication is called reduction communication. The reduction communication is characterized in that the communication data on each communication node is the same in size, and after all the data are accumulated or logically operated, a piece of result data is finally formed and needs to be returned to all the communication nodes.
In a high-performance interconnection network, reduction communication participated by a plurality of communication nodes is converted into point-to-point communication between every two nodes on a software level, and then CPU processors in the nodes complete calculation of reduction data. When the data volume is only suitable for large data volume, in a high-performance computing system, the data volume of collective communication is not large in most times, when the data volume is small, the efficiency of a software method adopting point-to-point communication is low, the data computing requirement can interrupt the work of a CPU, and the operating efficiency of a project is influenced. Therefore, the reduction communication with small data volume is subjected to hardware conversion, and the method has important significance for improving the operation efficiency of a high-performance computing system.
The invention patent application CN91105946.6 discloses a reduction processor, and in particular discloses that the reduction processor is controlled by a program having a structure and is adapted to simplify said structure by including several reduction steps of different reduction types, a first-stage processor of the type comprising a fast memory (1, 2) comprising in turn a plurality of fast memory cells, each of which is likely to cause the execution of a reduction operation, and b a communication network informing all memory cells connected to said result of each reduction.
Disclosure of Invention
Aiming at the problems in the prior art, the invention provides a non-blocking network reduction computing device and method based on a logic tree, which are suitable for reduction communication computing of big data and small data, can remarkably accelerate set reduction communication processing, reduce disturbance of set reduction communication to a CPU (central processing unit) of a processor, and improve set reduction communication performance.
The invention is realized by the following technical scheme:
the invention provides a non-blocking network reduction computing device based on a logic tree, which comprises:
the network packet receiving module is used for receiving the reduction data packet transmitted on the cache network and sending the reduction data packet to the network packet matching module;
the network packet matching module is used for matching the control information of the reduction data packet with the integrated message state record, and after the control information of the reduction data packet is successfully matched with the integrated message state record, the reduction data packet is sent to the reduction calculation module and triggers the reduction calculation module to start calculation;
the reduction calculation module is used for performing local reduction calculation and network reduction calculation;
and the network packet sending module is used for sending the reduction calculation result after the calculation to the reduction communication indication object.
The invention can automatically receive the reduction data packet on the network according to the type of the set operation reduction calculation and can complete the reduction calculation between the network reduction data packet and the local data packet.
Preferably, the network packet receiving module includes:
a receiving unit, configured to receive a reduction packet transmitted on a cache network;
the verification unit is used for detecting whether the target ID information of the reduction data packet is matched with the local node, and if so, the received reduction data packet is sent to the network packet matching module; otherwise, the received reduction data packet is discarded.
Preferably, the network packet matching module includes:
the matching logic unit is used for receiving the matching request and retrieving the set message state records of the set message state recording unit based on the set message ID in the matching request;
the integrated message state recording unit stores integrated message state records;
the matching unit is used for matching the control information of the reduction data packet with the integrated message state record, and if the matching is successful, the reduction data packet is sent to the reduction calculation module; if the matching is not successful, the reduction data packet is discarded.
Preferably, the reduction calculation module includes:
the local reduction calculation engine unit is used for carrying out reduction calculation on the data of the local node and the network data of the calculation data buffer unit;
the network reduction calculation engine unit is used for carrying out reduction calculation on the reduction data packet and the network data of the calculation data buffer unit;
and the calculation data buffer unit is used for storing a first reduction data packet of the local node assembly message, a reduction calculation result of the network reduction calculation engine unit and a reduction calculation result of the local reduction calculation engine unit.
Preferably, the calculation data buffer unit is a dual-port structure and is respectively connected with the local reduction calculation engine unit and the network reduction calculation engine unit.
A non-blocking network reduction computing method based on a logic tree is realized by adopting a network reduction computing device, and the method comprises the following steps:
step S01, receiving the reduction data packet transmitted on the cache network, matching the control information of the reduction data packet with the status record of the aggregate message, and after matching is successful, performing reduction calculation of the aggregate message;
step S02, performs local reduction calculation and network reduction calculation, and sends the reduction calculation result to the reduction communication instruction object.
Preferably, the step S01 further includes: before matching the reduction data packet, detecting whether the target ID information of the reduction data packet is matched with the local node, if so, matching the control information of the reduction data packet with the integrated message state record, and if not, discarding the received reduction data packet.
Preferably, the process of matching the control information of the reduction packet with the aggregate message status record in step S01 includes:
retrieving the state record of the aggregate message according to the aggregate message ID in the matching request;
matching the control information of the reduction data packet with the state record of the aggregate message, and if the matching is successful, carrying out reduction calculation on the aggregate message; if the matching is not successful, the reduction data packet is discarded.
Preferably, the retrieving the aggregated message status record according to the aggregated message ID in the matching request includes:
when the retrieved aggregate message state record is an empty entry, writing the aggregate message ID and the reduction data packet number in the matching request, and setting the entry to be valid;
and when the retrieved aggregate message state record is a valid entry, executing a matching step of the control information of the reduction data packet and the aggregate message state record.
Preferably, the network reduction calculation process in step S02 includes:
if the reduction data packet is the first data packet of the set message at the local node, storing the reduction data packet into a corresponding entry in the calculation data buffer;
if the reduction data packet is a middle data packet of the set message at the local node, carrying out reduction calculation on the reduction data packet and network data of the calculation data buffer unit, storing a reduction calculation result into a calculation data buffer, and updating the state of the set message;
and if the reduction data packet is the last data packet of the aggregate message at the local node, carrying out reduction calculation on the reduction data packet and the network data of the calculation data buffer unit, and generating a sending signal after the calculation is finished.
The invention has the following beneficial effects:
the invention relates to a non-blocking network reduction computing device and method based on a logic tree, which can automatically complete functions of set ID matching, reduction data computing, reduction result sending and the like in a reduction communication process, wherein two computing engines are arranged in the device, and can simultaneously complete reduction computing between local node data and network data packets and between local node data and network data packets without blocking; the supported reduction calculation types include logic operation, bitwise operation, comparison operation and the like with various byte lengths, can remarkably accelerate the processing of the set reduction communication, reduce the disturbance of the set reduction communication to a CPU (central processing unit) of the processor, and improve the performance of the set reduction communication.
Drawings
FIG. 1 is a schematic block diagram of a non-blocking network reduction computing device based on a logic tree according to the present invention;
FIG. 2 is a schematic structural diagram of a reduction calculation module in a non-blocking network reduction calculation apparatus based on a logic tree according to the present invention;
fig. 3 is a flowchart of a non-blocking network reduction calculation method based on a logic tree according to the present invention.
Detailed Description
The following are specific embodiments of the present invention and are further described with reference to the drawings, but the present invention is not limited to these embodiments.
Referring to fig. 1, the non-blocking network reduction computing device based on the logic tree of the present invention includes a network packet receiving module, a network packet matching module, a reduction computing module, and a network packet sending module. The network packet receiving module is used for receiving the reduction data packet transmitted on the cache network and sending the reduction data packet to the network packet matching module. And the network packet matching module is used for matching the control information of the reduction data packet with the integrated message state record, and after the control information of the reduction data packet is successfully matched with the integrated message state record, sending the reduction data packet to the reduction calculation module and triggering the reduction calculation module to start calculation. And the reduction calculation module is used for performing local reduction calculation and network reduction calculation. And the network packet sending module is used for sending the reduction calculation result after the calculation to the reduction communication indication object.
The invention relates to a non-blocking network reduction computing device based on a logic tree, which mainly comprises the following working procedures of receiving reduction data packets from a network, checking and matching the reduction data packets with local information, storing the reduction data packets in a corresponding cache, completing the computation of corresponding data, wherein the corresponding data comprises other reduction data packets received on the network and reduction data packets of local nodes, and submitting and sending the data packets according to reduction communication instructions of the local nodes after all reduction computations are completed.
Specifically, the network packet receiving module includes a receiving unit and a checking unit. The receiving unit is used for receiving the reduction data packet transmitted on the cache network. The check unit is used for detecting whether the target ID information of the reduction data packet is matched with the local node, and if so, the check unit sends the received reduction data packet to the network packet matching module; otherwise, the received reduction data packet is discarded. For example, when the local node 4 detects that the destination ID information sent by the reduction data packet does not include the local node 4, it considers that the destination ID information does not match, that is, the reduction data packet is not sent to the local node 4, the received reduction data packet is discarded, subsequent reduction calculation is not performed, and then the reduction data packet continues to be received, and the foregoing check is performed on the reduction data packet received each time.
Specifically, the network packet matching module includes a matching logic unit, an aggregate message state recording unit, and a matching unit. The matching logic unit is used for receiving the matching request and retrieving the set message state records of the set message state recording unit based on the set message ID in the matching request. The aggregate message state recording unit stores an aggregate message state record, and the aggregate message state record is stored corresponding to the index information. The matching unit is used for matching the control information of the reduction data packet with the integrated message state record, and if the matching is successful, the reduction data packet is sent to the reduction calculation module; if the matching is not successful, the reduction data packet is discarded. Wherein, the matching content comprises information such as a reduction operation ID, an operation type, a data length and the like.
The specific matching process is as follows: the matching logic unit receives a matching request submitted by the network interface, and retrieves the state record of the aggregate message according to the low order of the aggregate message ID in the matching request: if the entry is empty, writing the aggregate message ID and the network reduction packet number in the matching request, and concatenating the entry to be valid; if the entry is a valid entry, reading out the recorded aggregate message ID in the entry and comparing and matching the aggregate message ID carried in the network reduction packet: if the network reduction packet is matched with the network reduction packet, recording the number of the network reduction packet, and receiving and submitting the network reduction packet to a next-stage calculation control module; if not, the network reduction packet is discarded, and a matching error response is generated for the source node of the network reduction packet.
As shown in fig. 2, the reduction calculation module includes a local reduction calculation engine unit, a network reduction calculation engine unit, and a calculation data buffer unit. The local reduction calculation engine unit is used for carrying out reduction calculation on the data of the local node and the network data of the calculation data buffer unit. And the network reduction calculation engine unit is used for carrying out reduction calculation on the reduction data packet and the network data of the calculation data buffer unit. The calculation data buffer unit is used for storing a first reduction data packet of the local node assembly message, a reduction calculation result of the network reduction calculation engine unit and a reduction calculation result of the local reduction calculation engine unit.
The calculation data buffer unit is a dual-port structure and is respectively connected with the local reduction calculation engine unit and the network reduction calculation engine unit, namely, the port 0 is connected with the local reduction calculation engine unit, and the port 1 is connected with the network reduction calculation engine unit. The bit width of the port is 2 times of that of the data path, and both ports can carry out read-write operation. The local reduction calculation engine unit is responsible for reading out the data of the local node and performing reduction calculation with the network data buffered by the calculation data. The network reduction calculation engine unit is responsible for carrying out reduction calculation on the reduction data packet data and the data in the calculation data buffer.
The specific processing flow of the network reduction calculation engine unit is as follows:
1. if the reduction data packet is the first data packet of the set message at the local node, directly storing the reduction data packet into a corresponding entry in the calculation data buffer;
2. if the reduction data packet is an intermediate data packet, submitting the data packet to a network computing engine, and storing the result in a computing data buffer after computing operation is carried out on the data in the computing buffer and the network reduction computing engine unit; meanwhile, updating the state of the set message, and recording the receiving processing of the reduction data packet;
3. if the reduction data packet is the last data packet of the set message at the node, generating a sending signal after the calculation is finished; and after the calculation is finished, submitting the reduction result data packet to a network packet sending module.
Referring to fig. 3, the present invention provides a non-blocking network reduction calculation method based on a logical tree by using the non-blocking network reduction calculation apparatus based on a logical tree. The method comprises the following steps:
step S01, receiving the reduction data packet transmitted on the cache network, matching the control information of the reduction data packet with the status record of the aggregate message, and after matching is successful, performing reduction calculation of the aggregate message;
step S02, performs local reduction calculation and network reduction calculation, and sends the reduction calculation result to the reduction communication instruction object.
The step S01 further includes: before matching the reduction data packet, detecting whether the target ID information of the reduction data packet is matched with the local node, if so, matching the control information of the reduction data packet with the integrated message state record, and if not, discarding the received reduction data packet. This step is used to detect whether the reduction packet is sent to the receiving node.
As shown in Table one, the parts in the aggregate message state mean as follows:
aggregate message ID: the node is used for distinguishing different messages operated by the node;
local node attribute: the local nodes are divided into leaf nodes, father nodes and root nodes. The leaf node only sends the data of the node to the node of the previous level without calculation; the father node receives the network reduction packet, calculates the network reduction packet with local data and sends the father node to the previous node; the root node receives the network reduction packet and generates a reduction result after calculating with the local node;
the number of child nodes is as follows: the node is valid when the node is a root node or a father node and is used for indicating the number of the network reduction packets which need to be received by the node in the current set message;
child node vector: the number of the child nodes receiving the network reduction packet is accurately recorded, and the number is used for eliminating the repeated network reduction packet; the number of child node vector bits is related to the maximum number of child nodes supported.
Table one: schematic diagram of aggregate message state coding
Figure DEST_PATH_IMAGE002
The process of matching the control information of the reduction packet with the aggregate message status record in step S01 includes: retrieving the state record of the aggregate message according to the aggregate message ID in the matching request; matching the control information of the reduction data packet with the state record of the aggregate message, and if the matching is successful, carrying out reduction calculation on the aggregate message; if the matching is not successful, the reduction data packet is discarded.
Wherein, the process of retrieving the aggregated message state record according to the aggregated message ID in the matching request comprises:
when the retrieved aggregate message state record is an empty entry, writing the aggregate message ID and the reduction data packet number in the matching request, and setting the entry to be valid;
and when the retrieved aggregate message state record is a valid entry, executing a matching step of the control information of the reduction data packet and the aggregate message state record.
After the matching information is submitted, matching is carried out according to the matching index information and the reduction control information stored in the information suspension, the matching content comprises information comparison of reduction operation ID, operation type, data length and the like, and if the information is not matched, the suspension processing returns a matching error response; and if the information is matched successfully, recording the matched source information, and simultaneously informing the reduction calculation module to start calculation. Under the control of the calculation control module, the matched reduction data packet completes the calculation processing of the message according to the record in the state of the integrated message, and the processing content has the following conditions:
1. if the reduction data packet is the first data packet of the set message at the local node, storing the reduction data packet into a corresponding entry in the calculation data buffer;
2. if the reduction data packet is a middle data packet of the set message at the local node, carrying out reduction calculation on the reduction data packet and network data of the calculation data buffer unit, storing a reduction calculation result into a calculation data buffer, and updating the state of the set message;
3. and if the reduction data packet is the last data packet of the aggregate message at the local node, carrying out reduction calculation on the reduction data packet and the network data of the calculation data buffer unit, and generating a sending signal after the calculation is finished.
It will be appreciated by persons skilled in the art that the embodiments of the invention described above and shown in the drawings are given by way of example only and are not limiting of the invention. The objects of the present invention have been fully and effectively accomplished. The functional and structural principles of the present invention have been shown and described in the examples, and any variations or modifications of the embodiments of the present invention may be made without departing from the principles.

Claims (4)

1. A logical tree based non-blocking network reduction computing device, comprising:
the network packet receiving module is used for receiving the reduction data packet transmitted on the cache network and sending the reduction data packet to the network packet matching module; the network packet receiving module includes:
a receiving unit, configured to receive a reduction packet transmitted on a cache network;
the verification unit is used for detecting whether the target ID information of the reduction data packet is matched with the local node, and if so, the received reduction data packet is sent to the network packet matching module; otherwise, discarding the received reduction data packet;
the network packet matching module is used for matching the control information of the reduction data packet with the integrated message state record, and after the control information of the reduction data packet is successfully matched with the integrated message state record, the reduction data packet is sent to the reduction calculation module and triggers the reduction calculation module to start calculation; the network packet matching module comprises:
the matching logic unit is used for receiving the matching request and retrieving the set message state records of the set message state recording unit based on the set message ID in the matching request;
the integrated message state recording unit stores integrated message state records;
the matching unit is used for matching the control information of the reduction data packet with the integrated message state record, and if the matching is successful, the reduction data packet is sent to the reduction calculation module; if the matching is unsuccessful, discarding the reduction data packet;
the reduction calculation module is used for performing local reduction calculation and network reduction calculation; the reduction calculation module comprises:
the local reduction calculation engine unit is used for carrying out reduction calculation on the data of the local node and the network data of the calculation data buffer unit;
the network reduction calculation engine unit is used for carrying out reduction calculation on the reduction data packet and the network data of the calculation data buffer unit;
the calculation data buffer unit is used for storing a first reduction data packet of the local node assembly message, a reduction calculation result of the network reduction calculation engine unit and a reduction calculation result of the local reduction calculation engine unit;
and the network packet sending module is used for sending the reduction calculation result after the calculation to the reduction communication indication object.
2. The apparatus as claimed in claim 1, wherein the calculation data buffer unit is a dual-port structure, and is connected to the local reduction calculation engine unit and the network reduction calculation engine unit respectively.
3. A non-blocking network reduction computing method based on a logic tree is characterized by being realized by adopting a network reduction computing device, and the method comprises the following steps:
step S01, receiving the reduction data packet transmitted on the cache network, matching the control information of the reduction data packet with the status record of the aggregate message, and after matching is successful, performing reduction calculation of the aggregate message; the step S01 further includes: before matching reduction data packets, detecting whether target ID information of the reduction data packets is matched with a local node, if so, performing a step of matching control information of the reduction data packets with an aggregation message state record, and if not, discarding the received reduction data packets, wherein the step of matching the control information of the reduction data packets with the aggregation message state record in the step S01 comprises the following steps:
retrieving the state record of the aggregate message according to the aggregate message ID in the matching request;
matching the control information of the reduction data packet with the state record of the aggregate message, and if the matching is successful, carrying out reduction calculation on the aggregate message; if the matching is unsuccessful, discarding the reduction data packet;
step S02, carrying out local reduction calculation and network reduction calculation, and sending the reduction calculation result to the reduction communication indication object, wherein the network reduction calculation process in step S02 comprises:
if the reduction data packet is the first data packet of the set message at the local node, storing the reduction data packet into a corresponding entry in the calculation data buffer unit;
if the reduction data packet is a middle data packet of the aggregation message at the local node, carrying out reduction calculation on the reduction data packet and network data of the calculation data buffer unit, storing a reduction calculation result into the calculation data buffer unit, and updating the aggregation message state;
and if the reduction data packet is the last data packet of the aggregate message at the local node, carrying out reduction calculation on the reduction data packet and the network data of the calculation data buffer unit, and generating a sending signal after the calculation is finished.
4. The method of claim 3, wherein retrieving the aggregate message status record according to the aggregate message ID in the matching request comprises:
when the retrieved aggregate message state record is an empty entry, writing the aggregate message ID and the reduction data packet number in the matching request, and setting the entry to be valid;
and when the retrieved aggregate message state record is a valid entry, executing a matching step of the control information of the reduction data packet and the aggregate message state record.
CN201910852825.9A 2019-09-10 2019-09-10 Non-blocking network reduction computing device and method based on logic tree Active CN110690991B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910852825.9A CN110690991B (en) 2019-09-10 2019-09-10 Non-blocking network reduction computing device and method based on logic tree

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910852825.9A CN110690991B (en) 2019-09-10 2019-09-10 Non-blocking network reduction computing device and method based on logic tree

Publications (2)

Publication Number Publication Date
CN110690991A CN110690991A (en) 2020-01-14
CN110690991B true CN110690991B (en) 2021-03-19

Family

ID=69107958

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910852825.9A Active CN110690991B (en) 2019-09-10 2019-09-10 Non-blocking network reduction computing device and method based on logic tree

Country Status (1)

Country Link
CN (1) CN110690991B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1062426A (en) * 1990-08-02 1992-07-01 卡尔斯特电子公司 Reduction processor
CN1493040A (en) * 2001-02-24 2004-04-28 �Ҵ���˾ Global tree network for computing structures
CN106371919A (en) * 2016-08-24 2017-02-01 上海交通大学 Shuffle data caching method based on mapping-reduction calculation model
CN109947933A (en) * 2017-11-29 2019-06-28 阿里巴巴集团控股有限公司 Method and device for classifying to log

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8229955B2 (en) * 2006-12-05 2012-07-24 International Business Machines Corporation Database query optimizer that takes network choice into consideration
US7953684B2 (en) * 2007-01-31 2011-05-31 International Business Machines Corporation Method and system for optimal parallel computing performance
US8655962B2 (en) * 2009-09-28 2014-02-18 International Business Machines Corporation Shared address collectives using counter mechanisms
CN108052621A (en) * 2017-12-15 2018-05-18 佛山租我科技有限公司 A kind of new big data order-preserving matching filtered based on similarity and searching algorithm

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1062426A (en) * 1990-08-02 1992-07-01 卡尔斯特电子公司 Reduction processor
CN1493040A (en) * 2001-02-24 2004-04-28 �Ҵ���˾ Global tree network for computing structures
CN106371919A (en) * 2016-08-24 2017-02-01 上海交通大学 Shuffle data caching method based on mapping-reduction calculation model
CN109947933A (en) * 2017-11-29 2019-06-28 阿里巴巴集团控股有限公司 Method and device for classifying to log

Also Published As

Publication number Publication date
CN110690991A (en) 2020-01-14

Similar Documents

Publication Publication Date Title
US11989154B2 (en) Hybrid remote direct memory access
CN107608750B (en) Device for pattern recognition
US7934025B2 (en) Content terminated DMA
US20060047849A1 (en) Apparatus and method for packet coalescing within interconnection network routers
EP3827356A1 (en) Unified address space for multiple hardware accelerators using dedicated low latency links
US10761781B2 (en) Apparatus and methods for a distributed memory system including memory nodes
US11816052B2 (en) System, apparatus and method for communicating telemetry information via virtual bus encodings
US20140344492A1 (en) Methods and systems for reducing spurious interrupts in a data storage system
CN114710224A (en) Frame synchronization method and device, computer readable medium and electronic device
CN111309700B (en) Control method and system for multi-sharing directory tree
CN110928717A (en) Complex time sequence event detection method and device
CN110910249B (en) Data processing method and device, node equipment and storage medium
CN112040001A (en) Request processing method and device based on distributed storage
CN110690991B (en) Non-blocking network reduction computing device and method based on logic tree
WO2022001133A1 (en) Method and system for improving soft copy read performance, terminal, and storage medium
US6895493B2 (en) System and method for processing data in an integrated circuit environment
WO2023231391A1 (en) Data identification method and apparatus and electronic device
US9811403B1 (en) Method, apparatus and system for performing matching operations in a computing system
CN112416820B (en) Data packet classification storage method and system
CN101185056B (en) Data pipeline management system and method for using the system
US7194573B1 (en) CAM-based search engine devices having advanced search and learn instruction handling
US8760941B2 (en) Data transfer circuit and data transfer method
CN111860793A (en) Data transmission technology in neural network system and neural network system
CN113076282B (en) Deadlock processing method for processor network-on-chip
CN116016397B (en) Method and circuit for quickly searching message transmission buffer area

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB03 Change of inventor or designer information
CB03 Change of inventor or designer information

Inventor after: Gao Jiangang

Inventor after: Lu Hongsheng

Inventor after: Ren Xiujiang

Inventor after: Cao Zhiqiang

Inventor after: Zhou Jianyi

Inventor after: Feng Dongming

Inventor after: Wang Mengjia

Inventor before: Lu Hongsheng

Inventor before: Ren Xiujiang

Inventor before: Cao Zhiqiang

Inventor before: Zhou Jianyi

Inventor before: Feng Dongming

Inventor before: Wang Mengjia

GR01 Patent grant
GR01 Patent grant