WO2009155762A1 - 一种阵列处理器结构 - Google Patents
一种阵列处理器结构 Download PDFInfo
- Publication number
- WO2009155762A1 WO2009155762A1 PCT/CN2008/073179 CN2008073179W WO2009155762A1 WO 2009155762 A1 WO2009155762 A1 WO 2009155762A1 CN 2008073179 W CN2008073179 W CN 2008073179W WO 2009155762 A1 WO2009155762 A1 WO 2009155762A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- processor
- unit
- array
- routing
- data
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
Definitions
- the invention belongs to the field of integrated circuit design, and in particular relates to an array processor structure. ⁇ Background technique ⁇
- the main technical problem to be solved by the present invention is to provide an array processor structure, which can flexibly adapt to the differences between different algorithms and greatly improve the design efficiency of the integrated system chip.
- An array processor structure comprising a plurality of processor units constituting a processor array, wherein the adjacent processor units are connected by an interconnect bus, and further comprising at least one routing unit, wherein each of the routing units is respectively connected to the interconnect bus Connecting two of the processor units; the routing unit receives a data packet transmitted by the source processor unit, and passes the data body in the data packet according to the addressing information of the destination processor unit attached to the data packet The transmission path is forwarded to the destination processor unit.
- adjacent routing units are connected by an interconnect bus to form an array junction. 3 ⁇ 4.
- the processor array is a two-dimensional grid structure arranged in rows and columns
- the routing array is also a two-dimensional grid structure arranged in rows and columns, and each processor unit is connected to at least one routing unit.
- an input unit that provides program and instruction loading to the processor array, the input unit being coupled to the routing array, the processor unit and routing unit being provided with a load boot module pre-loaded for data frames The form of the loader and the data of the bootloader.
- the data packet transmitted by the processor unit to the routing unit and the data packet transmitted between the routing unit includes identification information of an active processor unit and addressing information of a destination processor unit, where the routing unit
- the data packet transmitted by the processor unit contains identification information of the active processor unit.
- the processor unit and/or the routing unit internally has a stop module that performs a function to stop its own operation, and a wake-up module that is triggered periodically or a wake-up module that is triggered by external input information.
- the processor unit and the routing unit internally have a clock generation module for providing an independent clock signal.
- the processor unit is a digital signal processor.
- the present invention realizes a serialized architecture with configurable scale and function, can support ASIC implementations processed by different specific algorithms, improves design efficiency and effect, and therefore does not need to be in every
- the secondary development of the ASIC has been carried out several times in the development of the film, in order to meet the product differentiation and time-to-market constraints, but also greatly reduce the cost of design and development.
- FIG. 2 is a partial enlarged view of the structure of the array processor of FIG. 1;
- FIG. 3 is a functional block diagram of a routing unit in an embodiment of the present invention.
- FIG. 4 is a functional block diagram of a DSP in an embodiment of the present invention.
- FIG. 6 is a schematic diagram of an input/output 10 channel according to an embodiment of the present invention.
- the array processor structure includes a plurality of processor units and a plurality of routing units (identified by R in the figure).
- the processor unit in this embodiment uses a DSP (identified by D in the figure).
- Each DSP is arranged in a square grid structure of the DSP array by rows and columns in turn, and each routing unit R is also arranged in a row grid array by a row and a column adjacent to the interconnected bus. The rows and columns of the row and column of the DSP array are distributed.
- each DSP has five input/output ports, and the input/output ports of the DSP on the non-array boundary are five, four of which are connected to the corresponding ports of the adjacent four DSPs, and the rest A corresponding port that connects to an adjacent routing unit R.
- Each routing unit R has eight input/output ports, and the number of input/output ports of the routing unit R on the non-array boundary is eight, four of which are connected to the corresponding ports of the adjacent four DSPs, and the other four are adjacent to each other. Corresponding ports of the four routing units R. The connection of each unit to other units under the condition of the array boundary is correspondingly reduced, and the number of ports set thereon can also be reduced according to actual needs.
- each DSP can have other numbers of input/output ports, connected to adjacent DSPs and routing units R, such as four or six.
- routing unit R can have other numbers of input/output ports connected to adjacent DSPs or other routing units. For example six.
- the routing unit receives the data packet transmitted by the source processor unit, and the data packet includes a packet header and a data body.
- the addressing information is included in the packet header, and the routing unit forwards the data body in the data packet to the destination processor unit through the transmission path according to the addressing information attached in the data packet.
- the data packet transmitted by the processor unit to the routing unit and the header of the data packet transmitted between the routing units include identification information of the active processor unit and addressing information of the destination processor unit, and data transmitted by the routing unit to the processor unit
- the packet header contains the identification information of the active processor unit. If the routing unit is directly connected to the destination processor unit, the routing unit can forward the data packet directly from the source processor unit to the destination processor unit. If the routing unit and the destination processor unit are not directly connected, but are connected through other routing units, the routing unit may select an appropriate path to forward the data packet to the destination processor unit through the other routing unit.
- the routing unit R By connecting to the DSP array by the interconnect bus, the routing unit R provides communication routes for data transmission between the DSPs, increasing the data transmission path, and the data transmission is no longer limited to transmission between the processor units, and the source processor unit transmits the data. Data can be routed through the routing unit and transmitted directly or indirectly to the destination processor unit, making data transmission more flexible and fast.
- Figure 2 shows a partial enlarged view of the structure of the array processor.
- the input interface and output interface of the DSP and the routing unit R are independent, and the interconnection of the input and output interfaces can also be implemented.
- the DSP mainly carries the computing function
- the routing unit R mainly carries the data transmission function between the DSPs.
- the routing unit can implement direct forwarding of data frames by using the Internet between the state machine control interfaces, and can also implement storage and forwarding of data frames by using a program controlled by a processor architecture.
- the array processor structure of the embodiment preferably adopts heterogeneous serial communication.
- the input/output ports on the DSP and the routing unit R are different serial input/output ports, in the DSP array and the routing array.
- Each unit uses an isochronous serial input/output port to communicate with other units via the interconnect bus.
- the data packets transmitted between each DSP and routing unit are in the form of data frames. All data frames include a frame header and a frame body, and the frame body is mainly data to be transmitted; the frame header is based on the DSP and the routing unit.
- the connection relationship between R is divided into at least the following types:
- the frame header includes a length of the pilot code, on the one hand for activating the destination DSP (if it enters a sleep state), and on the other hand for causing the destination DSP to generate the received peer information, and further, the frame header further includes an indication frame body. Data length information.
- the frame header includes, in addition to the frame header information of the data frame transmitted between the adjacent DSPs, the identification information of the source DSP and the identification information (addressing information) of the destination DSP, which is intended to enable the routing unit to According to the identification and planning of the path and passed to the destination DSP;
- the data frame can be transmitted by means of broadcast and packet broadcast.
- the data frame transmitted by the routing unit to the routing unit is substantially the data frame transmission between the routing units on the path through which the source DSP transmits the data frame to the non-adjacent destination DSP.
- the frame header also includes the identification information of the source DSP and the destination DSP, so that the routing unit that receives the data frame can continue to forward according to the identification information of the destination DSP;
- the DSP receiving the data frame from the routing unit is the destination DSP, and the frame header of the data frame may include only the identification information of the source DSP in addition to the frame header information of the data frame transmitted between the adjacent DSPs.
- the absolute address information from the source DSP to the destination DSP is obtained based on the identification information of the destination DSP.
- the relative address can also be used for routing.
- the relative address information indicates the addressing information of the next-level transmission path of the current location of the transmitted data.
- heterogeneous serial communication not only helps to reduce the communication and wiring complexity inside the array chip, but also helps improve clock management and fault-tolerant management, as well as the flexibility to configure the array to support different algorithms.
- the routing unit includes an independent communication controller and an input/output interface.
- the input and output interfaces are eight different serial input/output interfaces (the number of interfaces varies according to the actual conditions under the array boundary conditions), four of which are The interfaces are respectively connected to the corresponding interfaces of the other four DSPs, and the other four interfaces are respectively connected to the corresponding interfaces of the other four routing units Router.
- the communication controller preferably further includes a DSP communication controller and a routing unit controller for respectively performing communication management on the corresponding heterogeneous serial interface.
- the main function of the routing unit is to select the appropriate path to forward the data frame to the destination DSP according to the destination DSP identifier contained in the frame header, including the isochronous serial input/output interface list. Modules for DSP identification, routing, and forwarding control of the element and frame header.
- the DSP includes a DSP core, an independent communication controller, and an input and output interface.
- the DSP core is responsible for data processing and control, and the input and output interfaces are five different serial input and output ports (the number of ports under the array boundary condition is based on The actual situation has changed.
- Four of the ports are connected to the other four DSPs, and the other port is connected to a routing unit Router.
- the communication controller preferably further includes a DSP communication controller and a routing unit Router communication controller for respectively performing communication management on the corresponding heterogeneous serial interface.
- Each level of routing unit forwards it to the next-level routing unit or to the DSP of the current level according to the address information of the received data frame.
- the DSP first starts its own loading and guiding program.
- the function of the program is to receive the loading data frame sent by the routing unit, and move the data frame to the instruction register according to the frame header flag information (program loading or data loading). In the data register, the loading of the program or data is completed.
- Figure 6 shows the input and output of the array processor structure processing data, which can be directly entered into the DSP's isochronous serial interface through 10 channels.
Landscapes
- Engineering & Computer Science (AREA)
- Computer Hardware Design (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Multi Processors (AREA)
Description
Claims
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2008100681271A CN101320364A (zh) | 2008-06-27 | 2008-06-27 | 一种阵列处理器结构 |
CN200810068127.1 | 2008-06-27 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2009155762A1 true WO2009155762A1 (zh) | 2009-12-30 |
Family
ID=40180421
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2008/073179 WO2009155762A1 (zh) | 2008-06-27 | 2008-11-24 | 一种阵列处理器结构 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN101320364A (zh) |
WO (1) | WO2009155762A1 (zh) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101320364A (zh) * | 2008-06-27 | 2008-12-10 | 北京大学深圳研究生院 | 一种阵列处理器结构 |
CN101706767B (zh) * | 2009-08-13 | 2012-08-08 | 北京大学深圳研究生院 | 一种阵列处理器 |
CN102122275A (zh) * | 2010-01-08 | 2011-07-13 | 上海芯豪微电子有限公司 | 一种可配置处理器 |
CN101882127B (zh) * | 2010-06-02 | 2011-11-09 | 湖南大学 | 一种多核心处理器 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040111590A1 (en) * | 2002-07-23 | 2004-06-10 | Klein Robert C. | Self-configuring processing element |
CN1761954A (zh) * | 2003-01-27 | 2006-04-19 | 皮科芯片设计有限公司 | 处理器阵列 |
CN1849598A (zh) * | 2003-09-09 | 2006-10-18 | 皇家飞利浦电子股份有限公司 | 具有多个可编程处理器的集成数据处理电路 |
CN101320364A (zh) * | 2008-06-27 | 2008-12-10 | 北京大学深圳研究生院 | 一种阵列处理器结构 |
-
2008
- 2008-06-27 CN CNA2008100681271A patent/CN101320364A/zh active Pending
- 2008-11-24 WO PCT/CN2008/073179 patent/WO2009155762A1/zh active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040111590A1 (en) * | 2002-07-23 | 2004-06-10 | Klein Robert C. | Self-configuring processing element |
CN1761954A (zh) * | 2003-01-27 | 2006-04-19 | 皮科芯片设计有限公司 | 处理器阵列 |
CN1849598A (zh) * | 2003-09-09 | 2006-10-18 | 皇家飞利浦电子股份有限公司 | 具有多个可编程处理器的集成数据处理电路 |
CN101320364A (zh) * | 2008-06-27 | 2008-12-10 | 北京大学深圳研究生院 | 一种阵列处理器结构 |
Also Published As
Publication number | Publication date |
---|---|
CN101320364A (zh) | 2008-12-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11256656B2 (en) | Hybrid programmable many-core device with on-chip interconnect | |
US9256575B2 (en) | Data processor chip with flexible bus system | |
US7595659B2 (en) | Logic cell array and bus system | |
US20220294454A1 (en) | Embedded network on chip accessible to programmable logic fabric of programmable logic device in multi-dimensional die systems | |
JP4547198B2 (ja) | 演算装置、演算装置の制御方法、プログラム及びコンピュータ読取り可能記録媒体 | |
US6275975B1 (en) | Scalable mesh architecture with reconfigurable paths for an on-chip data transfer network incorporating a network configuration manager | |
JP2012080379A (ja) | 半導体データ処理装置及びデータ処理システム | |
US11730325B2 (en) | Dual mode interconnect | |
WO2009155762A1 (zh) | 一种阵列处理器结构 | |
JP5170579B2 (ja) | 再構成可能なデバイス | |
Konishi et al. | PCA-1: A fully asynchronous, self-reconfigurable LSI | |
US8190856B2 (en) | Data transfer network and control apparatus for a system with an array of processing elements each either self- or common controlled | |
JP2001117858A (ja) | データ処理装置 | |
US20050251646A1 (en) | Network with programmable interconnect nodes adapted to large integrated circuits | |
EP4123469A2 (en) | Scalable computing device | |
US20090085626A1 (en) | Semiconductor integrated circuit and method for controlling semiconductor integrated circuit | |
EP2266046B1 (en) | A control apparatus for fast inter processing unit data exchange in a processor architecture with processing units of different bandwidth connection to a pipelined ring bus | |
US20230280907A1 (en) | Computer System Having Multiple Computer Devices Each with Routing Logic and Memory Controller and Multiple Computer Devices Each with Processing Circuitry | |
US20230283547A1 (en) | Computer System Having a Chip Configured for Memory Attachment and Routing | |
US9626325B2 (en) | Array processor having a segmented bus system | |
JP2010033336A (ja) | 信号処理装置及び信号処理方法 | |
CN117093536A (zh) | 集成芯片及业务处理方法 | |
WO2007092747A2 (en) | Multi-core architecture with hardware messaging |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 08874761 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 08874761 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 23/05/2011) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 08874761 Country of ref document: EP Kind code of ref document: A1 |