CN110347638A - ASIC array, data processing board, and block mining method and apparatus - Google Patents
ASIC array, data processing board, and block mining method and apparatus Download PDFInfo
- Publication number
- CN110347638A CN110347638A CN201811504035.3A CN201811504035A CN110347638A CN 110347638 A CN110347638 A CN 110347638A CN 201811504035 A CN201811504035 A CN 201811504035A CN 110347638 A CN110347638 A CN 110347638A
- Authority
- CN
- China
- Prior art keywords
- block
- asic
- network
- chip
- node
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000012545 processing Methods 0.000 title claims description 33
- 238000000034 method Methods 0.000 title claims description 24
- 238000005065 mining Methods 0.000 title abstract description 3
- 238000004891 communication Methods 0.000 claims description 14
- 238000009412 basement excavation Methods 0.000 claims description 10
- 238000003491 array Methods 0.000 claims description 5
- 230000005540 biological transmission Effects 0.000 claims description 4
- 238000012544 monitoring process Methods 0.000 claims description 3
- 238000001816 cooling Methods 0.000 claims 1
- 238000004364 calculation method Methods 0.000 abstract description 4
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 56
- 238000010586 diagram Methods 0.000 description 22
- 230000015654 memory Effects 0.000 description 14
- 230000000875 corresponding effect Effects 0.000 description 7
- 230000006870 function Effects 0.000 description 6
- 230000003139 buffering effect Effects 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 230000008859 change Effects 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 238000013461 design Methods 0.000 description 3
- 230000002596 correlated effect Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 241001269238 Data Species 0.000 description 1
- 101100203322 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SKS1 gene Proteins 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 230000001902 propagating effect Effects 0.000 description 1
- 229910052709 silver Inorganic materials 0.000 description 1
- 239000004332 silver Substances 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
- G05B19/04—Programme control other than numerical control, i.e. in sequence controllers or logic controllers
- G05B19/042—Programme control other than numerical control, i.e. in sequence controllers or logic controllers using digital processors
- G05B19/0423—Input/output
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
- G06F15/163—Interprocessor communication
- G06F15/173—Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star, snowflake
- G06F15/17356—Indirect interconnection networks
- G06F15/17368—Indirect interconnection networks non hierarchical topologies
- G06F15/17381—Two dimensional, e.g. mesh, torus
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored program computers
- G06F15/78—Architectures of general purpose stored program computers comprising a single central processing unit
- G06F15/7807—System on chip, i.e. computer system on a single chip; System in package, i.e. computer system on one or more chips in a single package
- G06F15/7825—Globally asynchronous, locally synchronous, e.g. network on chip
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored program computers
- G06F15/78—Architectures of general purpose stored program computers comprising a single central processing unit
- G06F15/7828—Architectures of general purpose stored program computers comprising a single central processing unit without memory
- G06F15/7832—Architectures of general purpose stored program computers comprising a single central processing unit without memory on one IC chip (single chip microprocessors)
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored program computers
- G06F15/78—Architectures of general purpose stored program computers comprising a single central processing unit
- G06F15/7867—Architectures of general purpose stored program computers comprising a single central processing unit with reconfigurable architecture
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/20—Pc systems
- G05B2219/25—Pc structure of the system
- G05B2219/25257—Microcontroller
Landscapes
- Engineering & Computer Science (AREA)
- Computer Hardware Design (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Microelectronics & Electronic Packaging (AREA)
- Automation & Control Theory (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Computing Systems (AREA)
- Multi Processors (AREA)
Abstract
The invention relates to an ASIC array used for carrying out block mining, wherein an ASIC chip of the ASIC array is packaged with a plurality of bare chips provided with a network on chip, the network on chip comprises M multiplied by N computing nodes, the NoC addresses of the computing nodes are (M, N), the computing nodes adjacent to the NoC addresses are connected with each other, and the network on chip adopts a two-dimensional folding ring topology structure, so that the computing nodes adjacent to the NoC addresses have the same physical distance. The ASIC array of the invention adopts a two-dimensional folding ring topology structure, the forward and backward paths of the calculation nodes in the network on chip are staggered, and the links between the adjacent nodes in different directions have the same buffer and delay characteristics, thereby greatly simplifying the physical realization of the bare chip and the forwarding strategy of the network on chip.
Description
Technical field
The present invention relates to block chain technical fields, more particularly to a kind of method and apparatus excavated for block, and
Using the ASIC array and data processing board of the method that block excavates.
Background technique
Ether mill (Ethereum) is the block platform chain of a completely new evolution, it allows anyone to establish in platform
With use the decentralization application by block chain technical operation.The core in ether mill is point-to-point (P2P) network, ether
Mill block chain database is safeguarded and is updated by numerous nodes for being connected to network.
Fig. 1 is that the ether mill block of the prior art excavates schematic diagram.As shown in Figure 1, tranaction costs are collected by node, user
(Miner) 12 be exactly the node for collecting in ether mill network 10, propagating, confirming and execute transaction.Transaction is packaged as by users
Block, and compete with one another for, so that their block can be added on next block chain, this process is referred to as to dig mine.Mine
Block is excavated the business subcontract use inner to mine pond (Miningpool) on new block by pool server 11 (Pool server)
Family, it includes that block prefix hashed value, user it is expected the random number range and difficulty or ease that search that each, which digs parameter that miner makees,
Degree.
As block chain network, users successfully " are dug " by solving the problems, such as the task of complex mathematical to area
Block, this is referred to as " proof of work ".One operational problem needs more if algorithmically solved than verifying solution
The resource of the order of magnitude, then it is exactly the splendid selection of proof of work.It is special to prevent from having occurred and that in block chain network
Centralization phenomenon caused by hardware (such as application-specific IC), ether mill have selected to lay particular emphasis on the fortune for consuming more memories
Calculation problem.If problem needs memory and CPU, ideal hardware is exactly common computer.This just makes the workload in ether mill
Prove that there is the characteristic of anti-application-specific IC and this block chain phase excavated by specialised hardware control block of block chain
Than ether mill can bring being securely distributed for more decentralization.
The process of complex mathematical and the cpu performance zero correlation of block excavating equipment are solved the problems, such as, with block excavating equipment
Memory size and memory bandwidth are positively correlated, it means that the block excavation of those large scale deployments by way of shared drive is set
It is standby, linear or superlinearity (super-linear) growth can not be generated on block digging efficiency.
Patent application " a kind of modified ring topologies and its application method ", international publication number
WO2015176243A9 provides the modified ring topologies and its application method in a kind of photoelectric communication field.It is described to change
It is constituted into type ring topologies by m*n with the routing node that mesh topology is connected, including two close rings: the
The 1st routing node of K row is connect with n-th of routing node of K+1 row, the 1st road of the 2nd n-th of routing node of row and m-1 row
It is connected by node, so that all routing nodes of the 2nd row into m-1 row form the 1st close ring, 1 < K < m-1;J column
M-th of routing node arranges the 1st routing node with J+1 and connect, and the 2nd arranges the 1st routing node and m-th of the (n-1)th column routing
Node connection, so that all routing nodes of the 2nd column into the (n-1)th column form the 2nd close ring, 1 < J < n-1.More than
Technological means, it is possible to reduce the average communication transmission path of interaction between node reduces the transmission delay between node.
If but foregoing invention does not consider the unequal problem of the physical length of link between the node of network-on-chip, chain
The physical length on road is unequal may to need additional buffering for the information that will lead in network-on-chip transmitting in particular directions
And causing inconsistent delay, the forwarding strategy complexity of the physical Design and network-on-chip that lead to bare die rises.
Summary of the invention
Keep information transmitting delay in network-on-chip inconsistent for solution link physical length is unequal, and then leads to bare die
The problem of physical Design and the forwarding strategy complexity of network-on-chip rise, the invention discloses a kind of ASIC arrays, use this
The data processing plate of ASIC array, and use the data processing plate block excavating equipment and corresponding block method for digging.
Specifically, the invention discloses a kind of ASIC arrays, for carrying out block excavation, the ASIC core of the ASIC array
Piece is packaged with multiple bare dies for being provided with network-on-chip, which includes M × N number of calculate node, the NoC of the calculate node
Address is (m, n), and the adjacent calculate node in the address NoC is connected with each other by piece upper channel, which is folded using two dimension
Ring topology, wherein the two dimension folded coil topological structure be the line n of the network-on-chip calculate node physically
Location be followed successively by (0, n), (1, n) ..., (M-1, n) when, then the corresponding address NoC of the line n calculate node be followed successively by (0, n),
(M-1, n), (2, n), (M-2, n) ..., (N), the m of the network-on-chip arrange the physical address of the calculate node according to
It is secondary for (m, 0), (m, 1) ..., (m, N-1) when, then m arrange the corresponding address NoC of the calculate node be followed successively by (m, 0), (m,
N-1), (m, 1), (m, N-2) ..., (m,), wherein 0≤m≤M-1,0≤n≤N-1, M >=2, N >=2, and M, N, m, n
For integer.
Further, which includes 6 × 12 calculate nodes, which includes 2 bare dies, PCB electricity
4 × 4 asic chips are arranged in the one side of road plate, and 4 × 4 asic chips are arranged in the another side of the PCB circuit board.
The invention also discloses a kind of block method for digging, carry out block excavation, feature using above-mentioned ASIC array
It is, comprising: step 1, obtain the random data set for being excavated to current block;Step 2, which is drawn
It is divided into multiple subsets and is distributed the calculate node for being stored in the ASIC array;Step 3, a certain calculate node institute is arbitrarily chosen
A certain random number in the subset of preservation carries out first round address arithmetic to obtain destination address, with the destination address, according to this
Two-dimentional folded coil topological structure finds the correspondence calculate node of the destination address, and the son saved from the correspondence calculate node
Input of the correspondence random number of concentration as next round address arithmetic;It is random to be obtained after the address arithmetic of default wheel number
Number is target random number;Step 4, to the target random number carry out hash calculate to obtain target value, if the target value be less than or
Equal to difficulty threshold value, then using the target random number as the block random number of current block, using the target value as the area of current block
Block hashed value;It is on the contrary then abandon the target random number, and it re-execute the steps 3;Step 5, by the block random number and the block
The current block is written in hashed value, and the current block is broadcasted to block chain network;Step 6, when verifier receives broadcast
The current block when, verify the legitimacy of the current block, and enter block chain for legal current block chain is verified.
Further, which includes: current block head and current block body, wherein deserving before proparea build includes
Block hashed value, current block random number, current block hashed value and difficulty threshold value of one block.
The invention also discloses a kind of data processing plates excavated for block, including above-mentioned ASIC array, further includes:
The controller unit being connect with the ASIC array, for monitoring the ASIC array processing, to the ASIC array input data and defeated
The result that the ASIC array obtains out;Temperature management unit, including radiator and temperature sensor, wherein the temperature sensor is used
In the temperature for detecting the data processing plate and it is sent to the controller unit and/or the radiator, or controls the work of the power supply
State;The radiator is used to cool down for the data processing plate;Guidance unit is connect with the controller unit, for the control
The starting of device unit guides;Debugging unit is connect with the controller unit, for the controller unit to be debugged and provided
Tuning parameter.
Further, which is field programmable gate array.
The invention also discloses a kind of equipment excavated for block, including at least one above-mentioned data processing plate, also
It include: network communication module, for connecting block chain network to carry out data receiver and transmission;Task allocating module, being used for will
The data distribution obtained from the network gives the data processing plate, and the result that the data processing plate is obtained passes through the network communication
Module is sent to the network.
The two-dimentional folded coil topological structure that ASIC array of the invention uses, in network-on-chip calculate node it is preceding to it is rear
Interlock to path, it, can be great in different directions close to the buffering having the same of the link between node and delay feature
Simplify physics realization and the network-on-chip forwarding strategy of bare die.
Detailed description of the invention
Fig. 1 is that the ether mill block of the prior art excavates schematic diagram.
Fig. 2 is the ether mill block structure schematic diagram of the prior art.
Fig. 3 is block method for digging flow chart of the invention.
Fig. 4 is the ASIC array structure schematic diagram of first embodiment of the invention.
Fig. 5 is the network-on-chip two-dimensional annular topological structure schematic diagram of first embodiment of the invention.
Fig. 6 is the on piece channel direction schematic diagram between the calculate node of first embodiment of the invention.
Fig. 7 is access diagram between piece between the bare die of first embodiment of the invention.
Fig. 8 is access diagram between piece between the asic chip of first embodiment of the invention.
Fig. 9 A, 9B are one-dimensional foldable circle face topological structure schematic diagrames.
Figure 10 is the two-dimentional foldable circle face topological structure schematic diagram of first embodiment of the invention.
Figure 11 is the structural schematic diagram of the data processing plate based on ASIC array of second embodiment of the invention.
Figure 12 is the block excavating equipment structural schematic diagram of third embodiment of the invention.
Wherein, appended drawing reference are as follows:
10: ether mill 11: mine pool server
12: user's 100:ASIC array
110:ASIC chip 120: bare die
130: network-on-chip 140: node
140-1: calculate node 140-2: functional node
141: memory 142: mixed to calculate device
143: node communication module 144: piece upper channel
150: reflector node 160: channel between piece
161: link pair 162: link
170:PCB circuit board 180: serial parallel channel
200: data processing plate 210: controller unit
220: power supply 230: temperature sensor
240: fan 300: task allocating module
400: network communication module 500: block excavating equipment
RX: receiver TX: transmitter
E, S, W, N: port position
Specific embodiment
Below in conjunction with attached drawing, the present invention will be described in detail.In the accompanying drawings, identical label typicallys represent identical or function
It can similar component.In addition, the leftmost number of appended drawing reference shows the volume of that width attached drawing when the appended drawing reference first appears
Number.
Subject description discloses one or more embodiments comprising feature of the present invention.Disclosed embodiment is used only for lifting
Example explanation.Protection scope of the present invention is not limited to the disclosed embodiments.The present invention is defined by the following claims.Explanation
It is directed to the reference of " one embodiment ", " embodiment ", " example embodiment " etc. in book, refers to that the embodiment of description may include
Specific feature, structure or characteristic, each embodiment of but not must include these a particular feature, structure, or characteristics.In addition,
Such statement not refers to the same embodiment.Further, specific feature, structure or characteristic are being described in conjunction with the embodiments
When, regardless of either with or without specific description, it has been shown that such feature, structure or characteristic are integrated in other embodiments be
In the knowledge of those skilled in the art.
Some vocabulary is used in specification and following claims to censure specific components or component, this field
The member of ordinary skill, it is to be appreciated that technology user or manufacturer can be different noun or term come call the same component or
Component.This specification and following claims not in such a way that the difference of title is as component or component is distinguished, and
It is the criterion with component or component difference functionally as differentiation.In specification in the whole text and subsequent claim
Mentioned " comprising " and "comprising" is an open term, therefore should be construed to " including but not limited to ".In addition, " connection "
One word includes any direct and indirect means of electrical connection herein.Indirect means of electrical connection include by other devices into
Row connection.It should be noted that in the description of the present invention, term " transverse direction ", " longitudinal direction ", "upper", "lower", "front", "rear",
The orientation or positional relationship of the instructions such as "left", "right", "vertical", "horizontal", "top", "bottom", "inner", "outside" is based on attached drawing institute
The orientation or positional relationship shown, is merely for convenience of description of the present invention and simplification of the description, and is not the dress of indication or suggestion meaning
It sets or element must have a particular orientation, be constructed and operated in a specific orientation, therefore should not be understood as to limit of the invention
System.
Block chain is a kind of calculating and storage architecture, specifically a kind of distributed accounting system of decentralization, should
Node in system safeguards a account book without trusting each other, by unified common recognition mechanism jointly.And ether mill is a kind of energy
Enough it is reprogrammed the single block chain to realize arbitrarily complicated computing function.Fig. 2 is the ether mill block structure of the prior art
Schematic diagram.As shown in Fig. 2, the block structure in ether mill includes block head and block body, wherein block head includes that previous block dissipates
Train value, the block hashed value of this block, the block random number of this block, the difficulty threshold value for excavating this block etc., block body includes
The signature packet for the message that all external accounts issue in this block establishment process.The common recognition mechanism in ether mill is PoW
(Proof of Work proof of work mechanism), using Ethash algorithm, this algorithm is to Dagger-Hashimoto algorithm
Modified version, using a large amount of pseudo-random data, the set of these pseudo-random datas be referred to as " data set (dataset) " or
Referred to as DAG.The characteristics of Ethash algorithm is that the efficiency that block excavates and memory size and memory bandwidth are positively correlated, process packet
Include: for each block, calculating a seed (seed) first, seed only and current block it is information-related;Then basis
Seed generates the random data set (cache) of a 32M;(had according to the random data set DAG that cache generates a 1GB size
To acyclic graph), DAG is a complete search space, and the process that block excavates is exactly the random selection element (class from DAG
It is similar to search suitable Nonce in the block excavation of block chain) hash operations are carried out again, it is specified that DAG can be quickly calculated from Cache
The element of position, and then Hash verifying requires to periodically update Cache and DAG, every 1000 blocks update once, and
Provide the size linear increase as time goes by of DAG.
Since the proof of work in ether mill has the characteristic of anti-application-specific IC, the digital goods in ether mill
Coin --- in the excavation of ether coin, for ASIC relative to the common computer for using memory and CPU, there is no excellent in speed and efficiency
Gesture.Fig. 3 is block method for digging flow chart of the invention.As shown in figure 3, making the invention discloses a kind of block method for digging
The block excavation in ether mill can be used ASIC array and obtain higher speed and efficiency, specifically, block of the invention
Method for digging includes:
Step S1, the user in ether mill obtain the block body of current block, and for being excavated to current block
Random data set DAG;Wherein DAG is to be obtained by technological means known in Ethash algorithm, such as pass through such as keccak256
The hash of algorithm, which calculates, obtains seed, is calculated by the hash of such as keccak512 algorithm and generates cache according to seed, by such as
The hash of keccak512 algorithm, which is calculated, generates DAG etc. according to cache, and the present invention is not limited thereto;
Random data set DAG is divided into multiple random data set subsets by step S2, and by subset in the form of one-to-one
Distribution is stored in the memory of calculate node, so that the memory of each calculate node is stored with a pseudo random number subset;
Step S3 chooses input of a certain random number as first run address arithmetic in a certain subset, passes through address arithmetic
It is obtained the result is that a destination address, output of the corresponding random number of this destination address as epicycle address arithmetic, i.e.,
The random number of this output is for epicycle address arithmetic as a result, wherein the random number of this output is likely to be at the random number institute of input
Subset, it is also possible to be in another subset;With the destination address, which is found according to the two dimension folded coil topological structure
The correspondence calculate node of address is marked, and as next round from the correspondence random number in the subset that the correspondence calculate node is saved
The input of location operation carries out next round address arithmetic;After carrying out R wheel address arithmetic, using the address arithmetic result of acquisition as target
Random number;Wherein R is preset wheel number;Hash is carried out to the target random number of acquisition to calculate to obtain target value;
The target value of acquisition is compared by step S4 with the difficulty threshold value that current block excavates, if target value be less than or
Equal to difficulty threshold value, then using the target random number of acquisition as the block random number of current block, the target value with acquisition is current
The block hashed value of block;If target value is greater than difficulty threshold value, give up the corresponding target random number of this target value, and again
Execute step S3;
Step S5, by the block head of the step S4 block random number obtained and block hashed value write-in current block, to obtain
To complete current block, complete current block is broadcasted to ether mill.
Step S6, the other users (verifier) in ether mill are in the current block for receiving broadcast, according to current block
Block head included in the block hashed value of previous block, the block random number of current block, current block block dissipate
Train value and difficulty threshold value etc. verify the legitimacy of current block, and enter block chain for legal current block chain is verified.
According to above-mentioned block method for digging, a kind of ASIC array of the disclosed (ASIC Array) of the present invention, institute of the present invention
Disclosed ASIC array is connected between asic chip by high-speed link, and further includes multiple bare dies on each asic chip
(die), each bare die includes a network-on-chip (NoC, Network on Chip), and network-on-chip NoC has multiple calculating to save
Point, network-on-chip NoC have multiple calculate nodes, and each calculate node includes mixed calculation device, node communication module, memory etc..
Fig. 4 is the ASIC array structure schematic diagram of first embodiment of the invention.As shown in figure 4, real in of the invention first
Example is applied, ASIC array 100 includes PCB circuit board 170 and asic chip 110, and wherein asic chip 110 is set to PCB circuit board
170 upper bottom surface and bottom surface, in the upper bottom surface of PCB circuit board 170, asic chip 110 is 16, uniform in 4 × 4 array
Distribution, in the bottom surface of PCB circuit board 170, mirror image is dispersed with same amount of asic chip 110;Each asic chip 110 wraps
Include 2 bare dies 120 arranged side by side;Each bare die 120 includes a network-on-chip 130;Network-on-chip 130 includes 72 calculate nodes
(Node) 140 and 6 reflector nodes (Reflector) 150;Calculate node 140 includes memory (SRAM) 141, mixed calculation device
(Mixer) 142 and node communication module 143, each calculate node 140 has the memory of rMB.In this way, ASIC gusts entire
Column 100 then have a calculating section of a bare die 120,4608 (64 × 72) of a asic chip 110,64 (2 × 32) in 32 (2 × 4 × 4)
Point.
Fig. 5 is the two-dimensional annular topological structure schematic diagram of the network-on-chip of first embodiment of the invention.As shown in figure 5,72
A calculate node 140 is uniformly distributed in 6 × 12 arrays, and definition X is line direction, and Y is column direction, and network-on-chip 130 6 arranges totally, 12
Row also has 1 reflector node 150 in the bottommost of each column respectively, between calculate node 140-1, reflector node 150 and meter
Differential lines between operator node 140-1 by network-on-chip are formed by the interconnection of piece upper channel 144, have network-on-chip
Two-dimensional annular topological structure, wherein each calculate node 140-1 has 4 directly connected calculate node 140-1 to make
For close to node.And reflector node 150 is connected in the column of network-on-chip 130, only there are two close to node, without with it is other
Column connection.Fig. 6 is the on piece channel direction schematic diagram between the calculate node of first embodiment of the invention.As shown in fig. 6, with side
The piece that its immediate node of calculate node 140-1 in the two-dimensional annular topological structure of network-on-chip connects is indicated to E, W, S, N
The port position of upper channel 144, then reflector node 150 only has the direction N, S.
Fig. 7 is access diagram between piece between the bare die of first embodiment of the invention.As shown in fig. 7, between bare die 120
Piece between channel 160 include both links to (pair) 161, both links respectively correspond two directions toward each other to 161,
A transmitter TX and a receiver RX, direction corresponding to link pair 161 is respectively set in the endpoint of each link pair 161
It is determined by the position where transmitter TX and receiver RX.Each link pair 161 has both links 162, in this way, each
Between channel 160 include 4 links 162.Fig. 8 is access diagram between piece between the asic chip of first embodiment of the invention.
As shown in figure 8, being connected with each other between bare die 120 also by channel 160 between piece.Since each asic chip 110 includes 2 naked
Piece 120, therefore channel 160 is attached requiring 4 silvers between any between of chip 110.
It is emphasized that and be connected with each other between 2 bare dies 120 in asic chip 110 by channel 160 between 2 silvers,
Wherein channel 160 can be used as the spare interface channel between 2 bare dies 120 between 1 silver.
In the first embodiment of the present invention, bare die 120 is connected with full-mesh topology (full mesh topology) structure,
Channel 160 is directly connected to every other bare die 120 between i.e. each bare die 120 passes through piece.Bare die 120 further includes for operation
Task generates and the functional module of operational data insertion, and the serial parallel for calculating data insertion and calculated result output is logical
Road (SerDes) 180.Referring once again to Fig. 5, as shown in figure 5, in the first embodiment of the present invention, 72 calculating of network-on-chip
There are 64 calculate node 140-1 connection serial parallel channels 144, remaining 8 calculate node 140-1 linkage functions in node 140-1
Module.In Fig. 5, the functional node 140-2 that dark node on behalf is connected with functional module, light node on behalf and series-parallel
The calculate node 140-1 that channel is connected.
The invention proposes a kind of two-dimentional foldable circle face topological structures.One-dimensional foldable circle face topological structure is first introduced below,
So as to more be readily understood by the details of two-dimentional foldable circle face topological structure.Fig. 9 A, B are one-dimensional foldable circle face topological structures
Schematic diagram.As shown in Figure 9 A, there are the network-on-chips of an one-dimensional ring topology, and wherein most calculate node is with identical
The link channel of length be connected to they close to node, but the connection between fringe node then can only be by passing through all middle nodes
The link channel connection of point, so that the link channel length between its immediate node of not all calculate node is all equal, it will
Will lead to information transmitting may need additional buffering in particular directions and cause inconsistent delay, lead to the physics of bare die
Design and the forwarding strategy complexity of network-on-chip rise.The present invention uses a kind of topological structure of foldable circle face, such as Fig. 9 B institute
Show, so that calculate node forward and backward path interlocks in network-on-chip, although in this way, linkage length increases between node
Add, but then there are identical buffering and delay feature in different directions close to the link between node.
Figure 10 is the two-dimentional foldable circle face topological structure schematic diagram of first embodiment of the invention.As shown in Figure 10, Yu Benfa
Bright first embodiment hands over calculate node 140-1 forward and backward path in network-on-chip 130 using two-dimentional foldable circle face
It is wrong.After folding mode, calculate node logic and calculate node number change, and lead to the object of calculate node 140-1
Reason position changes.The link channel in topological structure in all directions of calculate node 140-1 is indicated with direction E, W, S, N
Port position, logically and physically layout between these it is inconsistent can by optionally link exchange selection signal, and
Any difference physically is not needed in node structure.Rotation right side half node is being partially disposed in link channel close to piece
In the node of upper network edge.Using exchanging mechanism identical with folding, this can be hidden from logical view and be physically rotated.It folds
The topological structure of anchor ring will lead to introduced between the physical structure of anchor ring and its logical construction it is inconsistent, for example, certain end of node
The logical direction of mouth is " E ", and the variation of position, but the logical direction of this corresponding port may occur after physical structure variation,
Still this port is referred to as " E ".The topological structure of two-dimentional foldable circle face through the invention, so that calculate node in network-on-chip
Front, rear, top, and bottom path interlocks, under the premise of ensuring that linkage length is not excessively increased between node, make different directions close to
Link buffering having the same and delay feature between node can simplify the physics realization and network-on-chip forwarding plan of bare die
Slightly.
Specifically, in the first embodiment of the present invention, the topological structure of network-on-chip 130 includes 6 × 12 calculating
Node 140-1, if node NoC address number is identified with columns m and line number n respectively, such as in line n the address node NoC according to
It is secondary be (0, n), (1, n), (2, n), (3, n), (4, n), (5, n), on m column the address node NoC be followed successively by (m, 0), (m,
1), (m, 2), (m, 3), (m, 4), (m, 5), (m, 6), (m, 7), (m, 8), (m, 9), (m, 10), (m, 11) carries out folding operation
Afterwards, the address node NoC becomes (0, n), (5, n), (1, n), (4, n), (2, n), (3, n), the node on m column in line n
The address NoC becomes (m, 0), (m, 11), (m, 1), (m, 10), (m, 2), (m, 9), (m, 3), (m, 8), (m, 4), (m, 7), (m
5)、(m,6)。
In addition, in network-on-chip 130 include linkage function module 8 functional node 140-2, each not with functional module
The calculate node 140-1 of connection has a functional node 140-2 to be used as close to node.Referring to Figure 10, such as Figure 10 institute
Show, using two-dimentional foldable circle face topological structure of the invention, the physical distribution of functional node 140-2 can be remained unchanged, and
Each calculate node 140-1 still has logical neighbors of the functional node 140-2 as connection.
Fig. 9 is a kind of structural schematic diagram of data processing plate based on ASIC array of second embodiment of the invention.Such as Fig. 9
It is shown, in the second embodiment of the present invention, a kind of data processing plate based on ASIC array is provided, which includes
ASIC array 100, controller unit (Controller) 210, power supply (Power) 220 and temperature management unit (thermal
Management), wherein temperature management unit includes temperature sensor (Sensors) 230, radiator 240.ASIC array 100
For receiving data that controller unit 210 transmits and carrying out hash calculating to it, after the result met the requirements, by it
Send controller unit 210 to;Controller unit 210 is connect with ASIC array 100 and temperature management unit respectively, for monitoring
The hash of ASIC array calculates, and is transferred to ASIC array after receiving external data, and exports the result of ASIC array acquisition;
Power supply 220 is used to DC input voitage being converted to operating voltage, and operating voltage is supplied to the ASIC battle array of data processing plate
Column 100, controller unit 210 and other modules;Temperature management unit can will pass through at the data that temperature sensor 230 obtains
The temperature information of reason plate is sent to controller unit 210, can also receive the instruction of controller unit 210, temperature management unit
Be sent to radiator 240 according to the instruction of controller unit 210, the fan governor of radiator 240 to start or close fan,
Or adjustment rotation speed of the fan, power work state can also be controlled in the case where not being related to system software, such as work as temperature sensing
Device 230 detects that data processing plate 200 controls when the temperature is excessively high and cuts off the power.In the second embodiment of the present invention, controller
Unit 210 can be using the Zynq ZU3CG field programmable gate array with 4GB RAM of operation (SuSE) Linux OS
FPGA。
It further include guidance unit, debugging unit and status lamp, wherein guidance unit is used in the second embodiment of the present invention
In controller unit starting guide, e.g. μ SD storage card or other have the storage medium of guiding function, the present invention is not
As limit;Debugging unit may have access to the serial console and JTAG of controller unit, for debugging to controller unit
And tuning parameter is provided, e.g. USB full-speed device etc., the present invention is not limited thereto;Status lamp be used for through color and/or
Brightness change shows the working condition of data processing plate, the elements such as e.g. LED.
ASIC array disclosed by the invention and the data processing plate for using ASIC array, for being hashed using e.g. SHA3
The hash of function calculates, and meets the big memory requirements of the workload verification algorithm Pow in ether mill.Figure 10 is that third of the present invention is implemented
The block excavating equipment structural schematic diagram of example.As shown in Figure 10, it in the third embodiment of the present invention, discloses a kind of for ether
The block excavating equipment 500 that mill block excavates, including at network communication module 400, task allocating module 300 and data above-mentioned
Plate 200 is managed, wherein network communication module 400 is for the data communication with ether mill, the e.g. equipment such as High_speed NIC;Task point
The ether mill block mining data for being used to will acquire with module 400 is distributed to data processing plate, and data processing plate is obtained
As a result ether mill is sent to by network communication module;Ether mill block excavating equipment 500 includes one or more data processings
Plate 200, the data that data processing plate 200 distributes task allocating module 400 are handled, to carry out block excavation.The present invention
The block excavating equipment 500 of 3rd embodiment further include power supply module, radiating module, cabinet etc., belong to block excavation applications
In common technique, therefore which is not described herein again.
Although the present invention is disclosed as above with embodiment, and is not intended to limit the invention, the skill of any the art
Art personnel can carry out equivalent modifications or change to it, be intended to be limited solely by the present invention in without departing from spirit and scope of the invention
Claims protection scope in.
Claims (11)
1. a kind of ASIC array, for carrying out block excavation, which is characterized in that the asic chip of the ASIC array is packaged with multiple
It is provided with the bare die of network-on-chip, which includes M × N number of calculate node, and the address NoC of the calculate node is (m, n),
The adjacent calculate node in the address NoC is connected with each other by piece upper channel, which uses two-dimentional folded coil topological structure,
Wherein the two dimension folded coil topological structure be the line n of the network-on-chip calculate node physical address be followed successively by (0, n),
(1, n) ..., (M-1, n) when, then the corresponding address NoC of the line n calculate node be followed successively by (0, n), (M-1, n), (2, n),
(M-2, n) ...,The physical address that the m of the network-on-chip arranges the calculate node be followed successively by (m, 0),
(m, 1) ..., (m, N-1) when, then m arrange the corresponding address NoC of the calculate node be followed successively by (m, 0), (m, N-1), (m, 1),
(m, N-2) ...,Wherein 0≤m≤M-1,0≤n≤N-1, M >=2, N >=2, and M, N, m, n are integer.
2. ASIC array as described in claim 1, which is characterized in that the network-on-chip includes 6 × 12 calculate nodes.
3. ASIC array as described in claim 1, which is characterized in that the asic chip includes 2 bare dies.
4. ASIC array as described in claim 1, which is characterized in that 4 × 4 ASIC are arranged in the one side of the PCB circuit board
4 × 4 asic chips are arranged in the another side of chip, the PCB circuit board.
5. a kind of block method for digging carries out block excavation using the described in any item ASIC arrays of such as Claims 1 to 4,
It is characterized in that, comprising:
Step 1, the random data set for being excavated to current block is obtained;
Step 2, which is divided into multiple subsets and carries out the calculate node that distribution is stored in the ASIC array;
Step 3, a certain random number arbitrarily chosen in the subset that a certain calculate node is saved carries out first round address arithmetic
To obtain destination address, with the destination address, according to the two dimension folded coil topological structure find the destination address to accrued
Operator node, and the input from the correspondence random number in the subset that the correspondence calculate node is saved as next round address arithmetic;
Using the random number obtained after the address arithmetic of default wheel number as target random number;
Step 4, it carries out hash to the target random number to calculate to obtain target value, if the target value is less than or equal to difficulty threshold
Value, then using the target random number as the block random number of current block, using the target value as the block hashed value of current block;Instead
Then abandon the target random number, and re-execute the steps 3;
Step 5, the current block is written into the block random number and the block hashed value, and the current block is broadcasted to block
Chain network.
6. block method for digging as claimed in claim 5, which is characterized in that further include:
Step 6, when verifier receives the current block of broadcast, the legitimacy of the current block is verified, and verifying is closed
The current block chain of method enters block chain.
7. block method for digging as claimed in claim 5, which is characterized in that the current block includes: current block head and works as
Proparea block, wherein deserving block hashed value, current block random number, the current block hash that proparea build includes previous block
Value and the difficulty threshold value.
8. a kind of data processing plate including the described in any item ASIC arrays of Claims 1 to 4 is excavated for block, special
Sign is, further includes: the controller unit being connect with the ASIC array, for monitoring the ASIC array processing, to the ASIC gusts
Column input data and the result for exporting ASIC array acquisition.
9. data processing plate as claimed in claim 8, which is characterized in that further include:
Temperature management unit, including radiator and temperature sensor, wherein the temperature sensor is for detecting the data processing plate
Temperature and be sent to the controller unit and/or the radiator, or control the working condition of the power supply;The radiator is for being
Data processing plate cooling;
Guidance unit is connect with the controller unit, for the starting guidance to the controller unit;
Debugging unit is connect with the controller unit, for being debugged to the controller unit and providing tuning parameter.
10. data processing plate as claimed in claim 8, which is characterized in that the controller unit is field programmable gate array.
11. a kind of block excavating equipment, including at least one is such as the described in any item data processing plates of claim 8~10, uses
It is excavated in block, which is characterized in that further include:
Network communication module, for connecting network to carry out data receiver and transmission;
Task allocating module, data distribution for will acquire from the network give the data processing plate, and by the data processing plate
Obtained result is sent to the network by the network communication module.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810306317 | 2018-04-08 | ||
CN2018103063176 | 2018-04-08 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110347638A true CN110347638A (en) | 2019-10-18 |
Family
ID=68174022
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811503862.0A Pending CN110347637A (en) | 2018-04-08 | 2018-12-10 | ASIC array, data processing board, and block mining method and apparatus |
CN201811504033.4A Active CN110347071B (en) | 2018-04-08 | 2018-12-10 | ASIC array, data processing board, and block mining method and apparatus |
CN201811504035.3A Pending CN110347638A (en) | 2018-04-08 | 2018-12-10 | ASIC array, data processing board, and block mining method and apparatus |
Family Applications Before (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811503862.0A Pending CN110347637A (en) | 2018-04-08 | 2018-12-10 | ASIC array, data processing board, and block mining method and apparatus |
CN201811504033.4A Active CN110347071B (en) | 2018-04-08 | 2018-12-10 | ASIC array, data processing board, and block mining method and apparatus |
Country Status (1)
Country | Link |
---|---|
CN (3) | CN110347637A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113642275A (en) * | 2020-05-11 | 2021-11-12 | 阿特里斯公司 | System and method for integration of network on chip without deadlock transformations |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115587018B (en) * | 2022-11-22 | 2023-03-10 | 中科声龙科技发展(北京)有限公司 | Calculation force service data set storage method, calculation force calculation device and calculation force service equipment |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100401946B1 (en) * | 2001-08-10 | 2003-10-17 | 박종원 | A method of address calculation and routing and a conflict-free memory system using the method |
US9864582B2 (en) * | 2014-11-14 | 2018-01-09 | Cavium, Inc. | Code processor to build orthogonal execution blocks for programmable network devices |
CN106407008B (en) * | 2016-08-31 | 2019-12-03 | 北京比特大陆科技有限公司 | Dig mine method for processing business, device and system |
US20180082290A1 (en) * | 2016-09-16 | 2018-03-22 | Kountable, Inc. | Systems and Methods that Utilize Blockchain Digital Certificates for Data Transactions |
CN107329926A (en) * | 2017-07-10 | 2017-11-07 | 常州天能博智能系统科技有限公司 | A kind of computing board and its troubleshooting methodology |
-
2018
- 2018-12-10 CN CN201811503862.0A patent/CN110347637A/en active Pending
- 2018-12-10 CN CN201811504033.4A patent/CN110347071B/en active Active
- 2018-12-10 CN CN201811504035.3A patent/CN110347638A/en active Pending
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113642275A (en) * | 2020-05-11 | 2021-11-12 | 阿特里斯公司 | System and method for integration of network on chip without deadlock transformations |
Also Published As
Publication number | Publication date |
---|---|
CN110347071A (en) | 2019-10-18 |
CN110347637A (en) | 2019-10-18 |
CN110347071B (en) | 2022-07-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20100026408A1 (en) | Signal transfer for ultra-high capacity circuits | |
Scott et al. | The blackwidow high-radix clos network | |
CN112416853A (en) | Stacked programmable integrated circuit system with intelligent memory | |
US11656662B2 (en) | Layered super-reticle computing : architectures and methods | |
CN110347638A (en) | ASIC array, data processing board, and block mining method and apparatus | |
CN107567622A (en) | photonic circuit design system | |
CN102780628B (en) | On-chip interconnection network routing method oriented to multi-core microprocessor | |
CN104077138A (en) | Multiple core processor system for integrating network router, and integrated method and implement method thereof | |
CN105763962A (en) | Optical network-on-chip, optical router and signal transmission method | |
Pothiraj et al. | Floor planning of 3D IC design using hybrid multi-verse optimizer | |
Seo | Three-dimensional Petersen-torus network: a fixed-degree network for massively parallel computers | |
Balamurugan et al. | Roadmap for machine learning based network-on-chip (M/L NoC) technology and its analysis for researchers | |
Lee et al. | System-level design framework for insertion-loss-minimized optical network-on-chip router architectures | |
US20210240543A1 (en) | Efficient and more advanced implementation of ring-allreduce algorithm for distributed parallel deep learning | |
US20230153164A1 (en) | Efficient inter-chip interconnect topology for distributed parallel deep learning | |
Fusella et al. | Lattice-based turn model for adaptive routing | |
CN111563583A (en) | Optical computing chip design method based on neural ordinary differential equation and optical computing chip | |
Sanju et al. | Design and implementation of a network on chip-based simulator: a performance study | |
CN103500240B (en) | The method that silicon through hole is carried out Dynamic Programming wiring | |
Cai et al. | Deadlock‐free adaptive 3D network‐on‐chips routing algorithm with repetitive turn concept | |
Bahrami et al. | A demand-based structure for the architecture of wireless networks on chip | |
CN104408017A (en) | Task mapping algorithm and developing method applied to grid architecture NoC (network-on-chip) system | |
Jewajinda et al. | A cooperative approach to compact genetic algorithm for evolvable hardware | |
KR20130071331A (en) | Method for automatically synthesizing tile routing structures for designing field programalbe gate array routing architecture | |
Underwood et al. | Exploiting communication and packaging locality for cost-effective large scale networks |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |