CN113674772B - Three-dimensional integrated chip, construction method thereof, data processing method and electronic equipment - Google Patents

Three-dimensional integrated chip, construction method thereof, data processing method and electronic equipment Download PDF

Info

Publication number
CN113674772B
CN113674772B CN202111237509.4A CN202111237509A CN113674772B CN 113674772 B CN113674772 B CN 113674772B CN 202111237509 A CN202111237509 A CN 202111237509A CN 113674772 B CN113674772 B CN 113674772B
Authority
CN
China
Prior art keywords
memory
dimensional integrated
logic
unit
access
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202111237509.4A
Other languages
Chinese (zh)
Other versions
CN113674772A (en
Inventor
周小锋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xian Unilc Semiconductors Co Ltd
Original Assignee
Xian Unilc Semiconductors Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xian Unilc Semiconductors Co Ltd filed Critical Xian Unilc Semiconductors Co Ltd
Priority to CN202111237509.4A priority Critical patent/CN113674772B/en
Publication of CN113674772A publication Critical patent/CN113674772A/en
Application granted granted Critical
Publication of CN113674772B publication Critical patent/CN113674772B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C5/00Details of stores covered by group G11C11/00
    • G11C5/02Disposition of storage elements, e.g. in the form of a matrix array
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/08Error detection or correction by redundancy in data representation, e.g. by using checking codes
    • G06F11/10Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's
    • G06F11/1008Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's in individual solid state devices
    • G06F11/1068Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's in individual solid state devices in sector programmable memories, e.g. flash disk
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C29/00Checking stores for correct operation ; Subsequent repair; Testing stores during standby or offline operation
    • G11C29/04Detection or location of defective memory elements, e.g. cell constructio details, timing of test signals
    • G11C29/08Functional testing, e.g. testing during refresh, power-on self testing [POST] or distributed testing
    • G11C29/12Built-in arrangements for testing, e.g. built-in self testing [BIST] or interconnection details
    • G11C29/38Response verification devices
    • G11C29/42Response verification devices using error correcting codes [ECC] or parity check
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C5/00Details of stores covered by group G11C11/00
    • G11C5/06Arrangements for interconnecting storage elements electrically, e.g. by wiring
    • HELECTRICITY
    • H01ELECTRIC ELEMENTS
    • H01LSEMICONDUCTOR DEVICES NOT COVERED BY CLASS H10
    • H01L25/00Assemblies consisting of a plurality of individual semiconductor or other solid state devices ; Multistep manufacturing processes thereof
    • H01L25/18Assemblies consisting of a plurality of individual semiconductor or other solid state devices ; Multistep manufacturing processes thereof the devices being of types provided for in two or more different subgroups of the same main group of groups H01L27/00 - H01L33/00, or in a single subclass of H10K, H10N

Abstract

The invention relates to a three-dimensional integrated chip, a construction method thereof, a data processing method and electronic equipment. The present invention relates to a high-performance and super-computing system realized by three-dimensional heterogeneous integrated logic wafers or logic chips and memory wafers or memory chips, which can realize higher memory bandwidth, lower memory power consumption and lower latency, and can also realize expansion of memory capacity and memory bandwidth by vertically stacking multiple layers of memory wafers.

Description

Three-dimensional integrated chip, construction method thereof, data processing method and electronic equipment
Technical Field
The present invention relates to the field of memory, and more particularly, to the field of High Performance Computing (HPC) and supercomputing Computing systems for three-dimensional integrated (3D-IC). More particularly, the present invention relates to a three-dimensional integrated chip, a method of constructing the same, a data processing method, and an electronic device.
Background
High Bandwidth Memory (HBM) is known in the prior art, and is a new type of Memory implemented by stacked Memory technology. Fig. 1 shows a system configuration diagram of a high bandwidth memory in the prior art. In high bandwidth memory, memory dies are first stacked and then connected together using Through-silicon Vias (TSVs), thereby providing higher bandwidth to the CPU/GPU in a manner that increases memory access I/O and speed. The high bandwidth memory standard is implemented in a 2.5D package, with I/O of the high bandwidth memory interconnected to I/O of the CPU/GPU through a silicon interposer (interposer). Interconnecting the I/O of the high bandwidth memory to the I/O of the CPU/GPU through an interposer (interposer) has significant advantages in terms of speed and power consumption over interconnecting memory to the CPU/GPU through a PCB in conventional memory.
The technical scheme of the high-bandwidth memory in the prior art mainly has two defects: first, in a high bandwidth Memory, a Dynamic Random Access Memory (DRAM) is interconnected with a GPU/CPU/SoC using a Redistribution Layer (Redistribution Layer) of micro-bumps (micro-bumps). However, the pitch of the microbumps is about 40 microns. This highlights the problem of low integration density, limiting further increases in I/O and bandwidth. Second, interconnecting stacked memory dies by means of through-silicon vias creates large parasitic capacitances and thermal resistances. These large parasitic parameters result in high transmission delay and high power consumption, limiting further upgrades in system bandwidth and power consumption.
Therefore, there is a need to solve the above technical problems in the prior art.
Disclosure of Invention
The present invention relates to a system that can achieve higher memory bandwidth, lower memory power consumption, and lower access latency. High-performance computing systems and supercomputing computing systems are realized by three-dimensional heterogeneous integrated logic wafers or logic chips and memory wafers or memory chips. In addition, expansion of memory capacity and memory bandwidth may also be achieved by stacking multiple layers of memory wafers or memory chips vertically.
According to a first aspect of the present invention, there is provided a three-dimensional integrated chip comprising at least a memory cell and a logic cell,
the memory cell includes at least one memory array and at least one first three-dimensional integrated port, wherein:
the memory array comprises at least one memory bank;
each memory bank corresponds to a first three-dimensional integrated port;
the logic unit comprises at least one logic subunit, the logic subunit at least comprises a second three-dimensional integrated port, at least one computing core and a memory controller, wherein:
the second three-dimensional integrated port is connected with the at least one first three-dimensional integrated port, so that the memory unit and the logic unit are connected in a three-dimensional heterogeneous integration manner;
the computing core is configured to access the at least one memory bank and implement a corresponding computing function using the accessed data;
the memory controller is coupled to the computing core and selectively coupled to the second three-dimensional integrated port, the memory controller configured to control memory access between the computing core and the at least one memory bank via the coupling of the second three-dimensional integrated port and the at least one first three-dimensional integrated port.
According to a preferred embodiment of the three-dimensional integrated chip of the present invention, the memory controller implements access to the corresponding memory banks by the computation core through connection of the second three-dimensional integrated port and the first three-dimensional integrated port corresponding to the access address based on the access address in the access instruction received from the computation core.
According to a preferred embodiment of the three-dimensional integrated chip of the present invention, the logic unit further comprises:
a test and repair unit selectively connected with the second three-dimensional integration port, the test and repair unit configured to test and repair the at least one memory bank via connection of the second three-dimensional integration port and the first three-dimensional integration port.
According to a preferred embodiment of the three-dimensional integrated chip of the present invention, the logic unit further comprises:
a multiplexer switch connecting the memory controller and the test and repair unit, the multiplexer switch configured to selectively connect the memory controller with the second three-dimensional integrated port to enable memory access or the test and repair unit with the second three-dimensional integrated port to enable testing and repair.
According to a preferred embodiment of the three-dimensional integrated chip of the present invention, the three-dimensional integrated chip comprises at least two memory arrays and at least two logic subunits, and the logic subunits further comprise at least one routing unit configured to connect the at least two logic subunits;
wherein the routing unit in the source logical subunit is configured to implement, based on a target access address in the access instruction received from the computation core in the source logical subunit, access to the corresponding target access address by the computation core in the source logical subunit at least through a connection of the routing unit in the source logical subunit and the routing unit in the target logical subunit.
According to a preferred embodiment of the three-dimensional integrated chip of the present invention, the target access address is an address of a computing core in a target logical subunit, wherein
The routing unit in the source logical subunit is configured to enable access of the compute core in the source logical subunit to the compute core in the target logical subunit through at least a connection of the routing unit in the source logical subunit and the routing unit in the target logical subunit based on an address of the compute core in the target logical subunit in the access instruction received from the compute core in the source logical subunit.
According to a preferred embodiment of the three-dimensional integrated chip of the present invention, the target access address is an address of a bank of a target memory array, wherein
The routing unit in the source logic subunit is configured to implement access of the computational core in the source logic subunit to the memory bank of the target memory array at least through connection of the routing unit in the source logic subunit, the routing unit in the logic subunit corresponding to the target memory array, and the second three-dimensional integration port to the first three-dimensional integration port corresponding to the memory bank of the target memory array based on the address of the memory bank of the target memory array in the access instruction received from the computational core in the source logic subunit.
According to a preferred embodiment of the three-dimensional integrated chip of the present invention, the memory unit further includes:
at least one Error Correction Code (ECC) cell, wherein the at least one memory bank each has an ECC cell configured to detect and correct errors of data stored in each memory bank.
According to a preferred embodiment of the three-dimensional integrated chip of the present invention, the logic subunit further comprises: the buffer module is configured to convert the working voltage of the logic unit into the working voltage of the memory unit or convert the working voltage of the memory unit into the working voltage of the logic unit when the logic unit performs storage access on the memory unit.
According to a preferred embodiment of the three-dimensional integrated chip of the present invention, the logic unit is a high performance computational logic unit or an ultra-computational logic unit.
According to a second aspect of the present invention, there is provided a method of building a three-dimensional integrated chip including at least a memory cell and a logic cell, the method comprising:
building the memory cell, the memory cell comprising at least one storage array and at least one first three-dimensional integrated port, wherein:
the memory array comprises at least one memory bank;
each memory bank corresponds to a first three-dimensional integrated port;
constructing the logic unit, wherein the logic unit comprises at least one logic subunit, and the logic subunit at least comprises a second three-dimensional integrated port, at least one computing core and a memory controller, wherein:
the second three-dimensional integrated port is connected with the at least one first three-dimensional integrated port;
the computing core is configured to access the at least one memory bank and implement a corresponding computing function using the accessed data;
the memory controller is coupled to the computing core and selectively coupled to the second three-dimensional integrated port, the memory controller configured to control memory access between the computing core and the at least one memory bank via the coupling of the second three-dimensional integrated port and the at least one first three-dimensional integrated port.
According to a third aspect of the invention, there is provided an electronic device comprising the three-dimensional integrated chip according to the first aspect of the invention.
According to a fourth aspect of the present invention, there is provided a data processing method of a three-dimensional integrated chip, the data processing method being based on the three-dimensional integrated chip provided in the first aspect, the method comprising: receiving an access instruction from a computing core, and acquiring an access address based on the access instruction; and controlling storage access between the computing core and the at least one storage body through the connection of the second three-dimensional integrated port and the at least one first three-dimensional integrated port.
Drawings
The invention will be more readily understood by the following description in conjunction with the accompanying drawings, in which:
fig. 1 shows a system configuration diagram of a high bandwidth memory in the prior art.
Fig. 2 schematically shows an embodiment of a three-dimensional integrated chip according to the invention.
Fig. 3 schematically shows another embodiment of a three-dimensional integrated chip according to the invention.
Fig. 4 schematically shows an embodiment of a logic cell of a three-dimensional integrated chip according to the invention.
Figure 5 schematically shows a typical memory cell used in the present invention.
Figure 6 schematically shows an embodiment of a memory network according to the invention.
Detailed Description
Embodiments of the present invention will be described in further detail below with reference to the accompanying drawings.
Fig. 2 schematically shows an embodiment of a three-dimensional integrated chip according to the invention.
The three-dimensional integrated chip shown in fig. 2 includes one logic wafer 210 and two memory wafers 220. The logic wafer 210 is a wafer for implementing logic functions, and the memory wafer 220 is a wafer for implementing memory functions.
As shown in fig. 2, a logic wafer 210, two memory wafers 220 are stacked and integrated in a vertical direction. The "vertical direction" herein refers to the thickness direction of the logic wafer 210 and the memory wafer 220.
In addition, two adjacent memory wafers 220 and the logic wafer 210 are bonded and connected in a three-dimensional heterogeneous integration manner, so that the three-dimensional heterogeneous integration of the logic wafer 210 and the two memory wafers 220 is realized.
Specifically, the memory wafer 220 near the logic wafer 210 (i.e., the memory wafer 220 below in fig. 2) is bonded by a hybrid bonding technique (shown schematically as:infig. 2) "
Figure 663890DEST_PATH_IMAGE001
") directly to the logic wafer 210; while memory wafers 220 remote from the logic wafer 210 (i.e., the upper memory wafer 220 in fig. 2) are first processed by hybrid bonding techniques (shown schematically as:infig. 2) "
Figure 794657DEST_PATH_IMAGE001
") to a memory wafer 220 near the logic wafer 210 (i.e., the lower memory wafer 220 in fig. 2), and then through-silicon vias TSVs 240 (shown schematically in fig. 2 as ″) of the memory wafer 220 near the logic wafer 210 (i.e., the lower memory wafer 220 in fig. 2)"
Figure 933514DEST_PATH_IMAGE002
") and then by hybrid bonding techniques (shown schematically in fig. 2 as ″) via a memory wafer 220 close to the logic wafer 210 (i.e., the lower memory wafer 220 in fig. 2)"
Figure 516943DEST_PATH_IMAGE001
") to logic wafer 210.
Thus, the logic wafer 210 enables independent storage access to the upper memory wafer 220 and the lower memory wafer 220 in fig. 2.
As can also be seen in fig. 2, the logic wafer 210 is connected to the substrate 230 by a Bump (Bump) process. Bump processes include, but are not limited to, BOPCOA, BOAC, HOTROD as known in the art.
Additionally, depicted in FIG. 2 is a logic wafer and a memory wafer. It is known that wafers are basic raw materials for manufacturing semiconductor devices, and semiconductors of extremely high purity are prepared into wafers through processes of pulling, slicing, and the like. The wafer is processed through a series of semiconductor manufacturing processes to form a tiny circuit structure, and then is cut, packaged and tested to form a chip. The invention is not limited to logic wafers and memory wafers, and in fact, the logic wafer 210 may also be a logic chip, and the memory wafer 220 may also be a memory chip. Preferably, the memory wafer 220 may be a dynamic random access memory wafer or a dynamic random access memory chip.
Shown in fig. 2 are two memory wafers 220, the two memory wafers 220 each constituting a two-layer memory wafer. It should be understood that to meet bandwidth and storage capacity requirements, the memory die 220 may be more layers, including but not limited to 4 layers, 6 layers, and the like. In addition, a plurality of memory wafers may be disposed in the same layer of the multi-layer memory wafer. For example, two memory wafers are disposed in the same memory wafer layer, and the two memory wafers are independent of each other.
In addition, the multi-layer memory wafer may be disposed on two sides of the logic wafer respectively. Fig. 3 schematically illustrates such an embodiment, taking two memory wafers 220 as an example. As shown in fig. 3, one memory wafer 220 is disposed above the logic wafer 210, and another memory wafer 220 is disposed below the logic wafer 210. Therefore, the memory access path from the logic wafer 210 to the memory wafer 220 is shortened, and the memory access efficiency is improved.
Specifically, memory wafers 220 on both sides of logic wafer 210 are formed by hybrid bonding techniques (shown schematically in FIG. 3 as "
Figure 828975DEST_PATH_IMAGE001
") directly to the logic wafer 210; and logic wafer 210 is first processed by a hybrid bonding technique (shown schematically as "in fig. 3)"
Figure 814249DEST_PATH_IMAGE001
") is connected to the underlying memory wafer 220 and then through the through-silicon vias TSV 240 (shown schematically in fig. 3) of the underlying memory wafer 220"
Figure 124007DEST_PATH_IMAGE002
") and then connected to the substrate 230 by a bumping process.
Fig. 4 schematically shows an embodiment of a logic cell of a three-dimensional integrated chip according to the invention. Fig. 5 schematically illustrates a typical memory cell (i.e., memory array) used in the present invention. The three-dimensional integrated chip at least comprises a logic unit as shown in fig. 4 and a memory unit as shown in fig. 5, and the logic unit and the memory unit are stacked and integrated, preferably by bonding or more preferably by hybrid bonding.
As shown in fig. 4, the logic unit includes at least a compute core 410 and a memory controller 420 connected to the compute core 410. In addition, the logic unit also comprises an input and output three-dimensional heterogeneous integrated port area
Figure 758513DEST_PATH_IMAGE003
(i.e., hybrid bonded ports, through two rows in the figure) "
Figure 811920DEST_PATH_IMAGE004
Schematically shown), these three-dimensional heterogeneous integrated port regions
Figure 651700DEST_PATH_IMAGE003
The logic cells are connected with memory cells (e.g., the memory cells shown in fig. 5). In FIG. 4, the input-output three-dimensional heterogeneous integrated port region
Figure 194677DEST_PATH_IMAGE003
(i.e., hybrid bonded ports, through two rows in the figure) "
Figure 487118DEST_PATH_IMAGE004
Schematically shown) is located in a cache module 460 (two rows) "
Figure 406532DEST_PATH_IMAGE004
"box at hand). The illustration in fig. 4 is only schematic, and the input and output three-dimensional heterogeneous integrated port area can be located in other modules. The buffer module 460 is configured to perform voltage conversion, and is used to convert the operating voltage of the logic unit into the operating voltage of the memory unit or convert the operating voltage of the memory unit into the operating voltage of the logic unit when the logic unit performs a storage access to the memory unit.
As shown in fig. 5, a typical memory cell used in the present invention is schematically shown. The memory cell shown in FIG. 5 also includes an input-output three-dimensional heterogeneous integrated port region
Figure 100819DEST_PATH_IMAGE005
Figure 752380DEST_PATH_IMAGE006
(i.e., hybrid bonded ports, also through the two rows in the figure) "
Figure 125593DEST_PATH_IMAGE004
Schematically shown), these three-dimensional heterogeneous integrated port regions
Figure 724064DEST_PATH_IMAGE005
Figure 367797DEST_PATH_IMAGE006
Logic cells (e.g., the logic cells shown in fig. 4) are connected with memory cells.
Three-dimensional heterogeneous integrated port regions of logic cells for the logic cells of FIG. 4 and the memory cells of FIG. 5
Figure 455839DEST_PATH_IMAGE003
(i.e., the port of the hybrid bond,through two rows in the figure "
Figure 722872DEST_PATH_IMAGE004
Schematically shown) corresponding to three-dimensional heterogeneous integrated port regions of connected memory cells
Figure 656193DEST_PATH_IMAGE005
Figure 387389DEST_PATH_IMAGE006
(i.e., hybrid bonded ports, through two rows in the figure) "
Figure 646332DEST_PATH_IMAGE004
"shown schematically). It should be noted that the Memory unit in fig. 5 has 8 Memory banks (Memory banks), each Memory Bank corresponds to a Memory space of 128Mb and a corresponding data bit width of 128bits, each Memory Bank corresponds to an independent three-dimensional heterogeneous integrated port region, and the three-dimensional heterogeneous integrated port regions corresponding to the 8 Memory banks are respectively
Figure 462978DEST_PATH_IMAGE005
Figure 465569DEST_PATH_IMAGE006
. Three-dimensional heterogeneous integrated port region of logic cell in FIG. 4
Figure 988954DEST_PATH_IMAGE003
Three-dimensional heterogeneous integrated port region corresponding to memory cell of FIG. 5
Figure 949957DEST_PATH_IMAGE005
Figure 191583DEST_PATH_IMAGE006
That is, the logic unit in fig. 4 is connected to 8 banks of the memory unit in fig. 5, so that the memory access of the logic unit to the 8 banks can be realized.
In addition, 8 banks are shown in fig. 5. It should be understood that to meet bandwidth and storage capacity requirements, the memory banks in a memory unit may also be 16, 32, etc., and the storage space per memory bank may also be 64Mb, 32Mb, etc. The invention is not limited in this respect.
Furthermore, it is also shown in fig. 5 that each bank comprises error correction code ECC units with a capacity of 8 Mb. These ECC cells are used to store additional check bits to detect and correct errors in the data stored in each bank. These ECC units may be implemented using ECC algorithms known in the art. In addition, the storage capacity of the ECC unit is not limited to 8 Mb. The storage capacity of the ECC unit may vary depending on the bandwidth and storage capacity of the memory bank, the ECC algorithm employed, etc.
Referring back to FIG. 4, in the normal operating mode of the three-dimensional integrated chip, the compute core 410 in the logic unit has direct access to the three-dimensional heterogeneous integrated port region through it
Figure 732285DEST_PATH_IMAGE003
Connected memory banks.
For example, when the computing core 410 receives an access instruction, and the access address carried by the access instruction is a memory bank with a memory space of 128Mb at the upper left corner in the memory unit shown in fig. 5, the memory controller 420 passes through the three-dimensional heterogeneous integrated port region of the logic unit based on the access address
Figure 110177DEST_PATH_IMAGE003
Three-dimensional heterogeneous integrated port region corresponding to memory bank at upper left corner
Figure 552003DEST_PATH_IMAGE005
The physical connection port formed accesses the memory bank in the upper left corner. Specifically, the access instruction includes, for example, a read instruction and a write instruction. When the access instruction is a read instruction, memory controller 420 passes through the region based on the read instruction
Figure 15345DEST_PATH_IMAGE003
And region
Figure 359739DEST_PATH_IMAGE005
The physical connection port is formed to read data from the upper left bank. When the access command is a write command, the access command also includes data to be written, and the memory controller 420 passes through the region based on the write command
Figure 654454DEST_PATH_IMAGE003
And region
Figure 160521DEST_PATH_IMAGE005
The formed physical connection port writes the data to be written into the memory bank at the upper left corner.
For another example, when the computing core 410 receives an access instruction, and the access address carried by the access instruction is a memory bank with a memory space of 128Mb in the upper right corner of the memory unit shown in fig. 5, the memory controller 420 passes through the three-dimensional heterogeneous integrated port region of the logic unit based on the access address
Figure 376739DEST_PATH_IMAGE007
Three-dimensional heterogeneous integrated port region corresponding to memory bank at upper right corner
Figure 259244DEST_PATH_IMAGE008
The physical connection port formed accesses the upper right bank. Specifically, the access instruction includes, for example, a read instruction and a write instruction. When the access instruction is a read instruction, memory controller 420 passes through the region based on the read instruction
Figure 674045DEST_PATH_IMAGE009
And region
Figure 351014DEST_PATH_IMAGE010
The physical connection port is formed to read data from the upper right bank. When the access command is a write command, the access command also includes data to be written, and the memory controller 420 passes through the region based on the write command
Figure 788949DEST_PATH_IMAGE011
And region
Figure 803041DEST_PATH_IMAGE010
The formed physical connection port writes the data to be written into the memory bank at the upper right corner.
Therefore, in the three-dimensional integrated chip of the invention, the logic unit can perform independent memory access on different memory banks of the memory unit, and the memory access processes among different memory banks are not interfered with each other. Therefore, the three-dimensional integrated chip of the invention can further improve the bandwidth and the access efficiency.
In addition, as shown in FIG. 4, the logic unit further includes a test and repair unit 440 for testing and repairing at least one memory cell. In addition, the logic unit also includes a multiplexer switch 450.
Under the normal working mode of the three-dimensional integrated chip, the multi-way selection switch 450 will calculate the core 410, the memory controller 420 and the three-dimensional heterogeneous integrated port region
Figure 10032DEST_PATH_IMAGE011
Connected so as to directly access at least one bank of at least one memory unit. The specific procedure is as described above.
In the test mode of the three-dimensional integrated chip, the multiplexer 450 connects the test and repair unit 440 to the three-dimensional heterogeneous integrated port region
Figure 123481DEST_PATH_IMAGE011
Connected to test and repair at least one bank of at least one memory cell.
Specifically, when memory needs to be tested and repaired, the multiplexer 450 switches off the compute core 410, the memory controller 420 to the three-dimensional heterogeneous integrated port region
Figure 346914DEST_PATH_IMAGE011
And turn on the test and repair unit 440 to the three-dimensional heterogeneous integrationPort area
Figure 571222DEST_PATH_IMAGE011
The path between them. At this time, the test and repair unit 440 tests and repairs each bank of the memory cells. For example, the test and repair unit 440 passes through the three-dimensional heterogeneous integrated port region
Figure 898298DEST_PATH_IMAGE011
And three-dimensional heterogeneous integrated port region
Figure 182649DEST_PATH_IMAGE005
The formed physical connection port tests and repairs the memory bank at the upper left corner of the memory unit; as another example, test and repair unit 440 passes through a three-dimensional heterogeneous integrated port region
Figure 391914DEST_PATH_IMAGE011
And three-dimensional heterogeneous integrated port region
Figure 419913DEST_PATH_IMAGE010
The physical connection port is formed to test and repair the bank in the upper right corner of the memory cell.
In addition, as shown in fig. 4, the logic unit further includes a routing unit 430, the routing unit 430 connects the multiplexing switch 450 and the three-dimensional heterogeneous integrated port region
Figure 601495DEST_PATH_IMAGE011
. The routing unit 430 is used to interconnect a plurality of memories to form a system on chip.
Figure 6 schematically shows an embodiment of a memory network, i.e. a system on chip, according to the invention. The system on chip shown in fig. 6 is a system on chip in which a plurality of logic subunits and a plurality of memory subunits are integrated via a routing unit. The logic sub-unit in fig. 6 may be constituted by the logic unit in fig. 4, and the memory sub-unit in fig. 6 may be constituted by the memory unit in fig. 5. The system on chip is formed by routing cell interconnections located in logical subunits.
The ability of the compute core 410 in the logic unit to directly access the memory banks connected through its three-dimensional heterogeneous integrated port region was discussed above with respect to fig. 4 and 5. In the case of forming the system on chip shown in fig. 6, the computational core in one logical subunit in the three-dimensional integrated chip can also indirectly access the computational core in another logical subunit or the memory bank to which another logical subunit belongs through the routing unit 430.
For example, when the compute core in the upper left-hand logical subunit (hereinafter "first logical subunit") shown in FIG. 6 receives an access instruction, and the access address carried by the access instruction is the memory bank of the memory unit of the upper right logical subunit (hereinafter referred to as "second logical subunit") shown in fig. 6, the memory controller of the first logical subunit based on the access address, an access path formed by a routing cell in the first logical sub-cell (hereinafter referred to as "first routing cell"), a routing cell in an intermediate logical sub-cell in the first row (hereinafter referred to as "intermediate routing cell"), a routing cell in the second logical sub-cell (hereinafter referred to as "second routing cell"), and through a three-dimensional heterogeneous integrated port region in the second logical sub-cell.
Figure 56747DEST_PATH_IMAGE003
Three-dimensional heterogeneous integrated port region of memory bank corresponding to access address
Figure 18887DEST_PATH_IMAGE005
Figure 584998DEST_PATH_IMAGE006
And the formed physical connection port accesses the memory bank corresponding to the access address.
For another example, when the compute core in the first logic subunit shown in fig. 6 receives the access instruction, and the access address carried by the access instruction is the compute core in the second logic subunit shown in fig. 6, the memory controller in the first logic subunit accesses the compute core in the second logic subunit through the access path formed by the first routing unit in the first logic subunit, the intermediate routing unit in the intermediate logic subunit, the second routing unit in the second logic subunit, and the memory controller based on the access address.
The memory network of the mesh topology is only schematically shown in fig. 6. The memory networks of the present invention include, but are not limited to, bus, star, ring, tree, mesh, and hybrid topologies.
Applications of the present invention may include, but are not limited to, high performance computing systems, supercomputing systems, and the like.
When the logic wafer 210 is a high performance computing logic wafer, the obtained corresponding three-dimensional integrated chip is a three-dimensional integrated high performance computing chip.
In the prior art, the performance of high performance computing systems is limited by memory bandwidth and power consumption. However, in the invention, since the high-performance computing logic wafer and the memory wafer are stacked and integrated in a hybrid bonding mode, and the hybrid bonding mode can realize higher-density integration, the realization of higher memory bandwidth is facilitated; in addition, since the high performance computing system of the present invention has a lower parasitic parameter (capacitance), lower memory access power consumption can be achieved. In addition, since the memory wafers can also be stacked vertically in multiple layers, expansion of memory capacity and memory bandwidth can be further achieved.
Therefore, the three-dimensional integrated chip obtained by the invention can realize higher memory bandwidth, higher memory capacity and lower memory access power consumption, and can meet the requirements of a high-performance computing system.
In addition, different memory banks in the memory array connected to the three-dimensional heterogeneous integrated port region of the high-performance computation logic unit (i.e., the memory array to which the high-performance computation core belongs) respectively have corresponding three-dimensional heterogeneous integrated port regions, so that the high-performance computation core in the high-performance computation logic unit can perform independent memory access on the different memory banks in the memory array to which the high-performance computation core belongs, memory access processes among the different memory banks are not interfered with each other, and access bandwidth and access efficiency can be remarkably improved.
For example, when the high-performance computing core receives an access instruction, and an access address carried by the access instruction is a memory bank at the upper right corner of the memory unit shown in fig. 5, the memory controller in the high-performance logic unit accesses the memory bank at the upper right corner through a physical connection port formed by the three-dimensional heterogeneous integrated port region of the high-performance logic unit and the three-dimensional heterogeneous integrated port region corresponding to the memory bank at the upper right corner based on the access address. For another example, when the high-performance computing core receives the access instruction and the access address carried by the access instruction is the lower left bank in the memory unit shown in fig. 5, the memory controller in the high-performance logic unit accesses the lower left bank through the physical connection port formed by the three-dimensional heterogeneous integrated port region of the high-performance logic unit and the three-dimensional heterogeneous integrated port region corresponding to the lower left bank based on the access address. That is to say, since the upper right and lower left banks have corresponding three-dimensional heterogeneous integrated port regions, the high-performance computation core can perform the storage access of the upper right bank and the lower left bank in parallel and independently, which can greatly improve the access bandwidth and the access efficiency.
When the logic wafer 210 is an overcomputing logic wafer, the obtained corresponding three-dimensional integrated chip is a three-dimensional integrated overcomputing chip.
In the prior art, the performance of a supercomputing system is limited by memory bandwidth and access latency. However, in the invention, the super-computation logic wafer and the memory wafer are stacked and integrated in a hybrid bonding mode, and the hybrid bonding mode can realize higher-density integration, so that higher memory bandwidth is realized; in addition, the distributed super-computation core improves the computation parallelism, thereby further improving the computation speed of the super-computation chip.
The super computation logic unit comprises a super computation core, an instruction cache, a data cache, a memory controller and a cache module. Similar to the above description, there may be an input-output three-dimensional heterogeneous integrated port region in the cache module, and the cache module is configured for voltage conversion. And the memory controller directly accesses the memory array to which the super computation logic unit belongs by connecting the input/output three-dimensional heterogeneous integrated port area in the cache module with the input/output three-dimensional heterogeneous integrated port area in the memory array.
For example, when the supercomputing computing core receives an access instruction, the supercomputing computing core first determines whether the access instruction and the access data are stored in the instruction cache and the data cache. If the access instruction and the access data are stored in the instruction cache and the data cache, the access results are returned from the instruction cache and the data cache to the supercomputing computational core. And if the access instruction and the access data are not stored in the instruction cache and the data cache, the memory controller accesses the storage array and returns an access result to the super computing core through a physical connection port formed by the input/output three-dimensional heterogeneous integrated port area in the cache module and the input/output three-dimensional heterogeneous integrated port area in the storage array based on an access address carried in the access instruction.
In the super computation logic unit, the caching strategies of the instruction cache and the data cache are established according to the access frequency or the writing time sequence of the super computation cores. And writing the access instruction and the access data into an instruction cache and a data cache or a storage array according to the cache policies of the instruction cache and the data cache. The presence of an instruction cache and a data cache in the supercomputing logic unit further increases the computation speed.
In addition, in the case where the three-dimensional integrated supercomputing chip is formed in a manner similar to that shown in fig. 6, the supercomputing logic unit described above may form a supercomputing logic sub-unit of the three-dimensional integrated supercomputing chip. Similar to what is described with respect to fig. 6, a supercomputing computational core in one supercomputing-logic subunit can indirectly access a supercomputing computational core in another supercomputing-logic subunit or a memory bank to which the other supercomputing-logic subunit belongs through a routing unit.
For example, when the access instruction is received by the supercomputing computational core in the source supercomputing logic subunit, and the access address carried by the access instruction is the memory bank of the memory unit of the target supercomputing logic subunit, the memory controller of the source supercomputing logic subunit accesses the memory bank corresponding to the access address through the access path formed by the routing unit in the source supercomputing logic subunit, the routing unit in one or more intermediate supercomputing logic subunits, and the routing unit in the target supercomputing logic subunit, based on the access address, and through the physical connection port formed by the three-dimensional heterogeneous integrated port area in the target supercomputing logic subunit and the three-dimensional heterogeneous integrated port area of the memory bank corresponding to the access address.
For another example, when the super computation core in the source super computation logic subunit receives the access instruction and the access address carried by the access instruction is the super computation core in the target super computation logic subunit, the memory controller in the source super computation logic subunit accesses the super computation core in the target super computation logic subunit through an access path formed by the routing unit in the source super computation logic subunit, the intermediate routing unit in the one or more intermediate super computation logic subunits, the routing unit in the target super computation logic subunit, and the memory controller based on the access address.
By forming the system on supercomputer in a manner similar to that shown in fig. 6, each supercomputer has independent access to its described storage array and independent computation in the manner described above, or has access to another supercomputer or the storage array to which the supercomputer belongs via a routing unit, the computation speed is significantly increased.
The three-dimensional integrated chips proposed in the present invention are memory chips or memory dies (e.g., ROM, SDRAM, RAM, DRAM, SRAM, FLASH, EPROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices) for storing data and/or computer code. The three-dimensional integrated chip may be or include non-transitory volatile memory or non-volatile memory, etc.
The invention also provides a data processing method based on the three-dimensional integrated chip in any of the embodiments, and the method comprises the following steps: receiving an access instruction from a computing core, and acquiring an access address based on the access instruction; and controlling storage access between the computing core and the at least one storage body through the connection of the second three-dimensional integrated port and the at least one first three-dimensional integrated port. It can be understood that the data processing method further includes functions that can be implemented by the three-dimensional integrated chip, and details are not described herein.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design many alternative embodiments without departing from the scope of the appended claims. It is to be understood that the scope of the invention is defined by the claims.

Claims (12)

1. A three-dimensional integrated chip is characterized in that the three-dimensional integrated chip at least comprises a memory unit and a logic unit,
the memory cell includes at least one memory array and at least one first three-dimensional integrated port, wherein:
the memory array comprises at least one memory bank;
each memory bank corresponds to a first three-dimensional integrated port;
the logic unit comprises at least one logic subunit, the logic subunit at least comprises a second three-dimensional integrated port, at least one computing core and a memory controller, wherein:
the second three-dimensional integrated port is connected with the at least one first three-dimensional integrated port, so that the memory unit and the logic unit are connected in a three-dimensional heterogeneous integration manner;
the computing core is configured to access the at least one memory bank and implement a corresponding computing function using the accessed data;
the memory controller is coupled to the computing core and selectively coupled to the second three-dimensional integrated port, the memory controller configured to control memory access between the computing core and the at least one memory bank via the coupling of the second three-dimensional integrated port and the at least one first three-dimensional integrated port;
wherein the logic unit further comprises: a test and repair unit selectively connected with the second three-dimensional integration port, the test and repair unit configured to test and repair the at least one memory bank via connection of the second three-dimensional integration port and the first three-dimensional integration port.
2. The three-dimensional integrated chip of claim 1, wherein the memory controller is configured to enable access to the corresponding memory bank by the compute core through the connection of the second three-dimensional integrated port and the first three-dimensional integrated port corresponding to the access address based on the access address in the access instruction received from the compute core.
3. The three-dimensional integrated chip of claim 1, wherein the logic unit further comprises:
a multiplexer switch connecting the memory controller and the test and repair unit, the multiplexer switch configured to selectively connect the memory controller with the second three-dimensional integrated port to enable memory access or the test and repair unit with the second three-dimensional integrated port to enable testing and repair.
4. The three-dimensional integrated chip according to claim 1 or 2, wherein the three-dimensional integrated chip comprises at least two memory arrays and at least two logic sub-units, and the logic sub-units further comprise at least one routing unit configured to connect the at least two logic sub-units;
wherein the routing unit in the source logical subunit is configured to implement, based on a target access address in the access instruction received from the computation core in the source logical subunit, access to the corresponding target access address by the computation core in the source logical subunit at least through a connection of the routing unit in the source logical subunit and the routing unit in the target logical subunit.
5. The three-dimensional integrated chip of claim 4, wherein the target access address is an address of a compute core in a target logical subunit, wherein
The routing unit in the source logical subunit is configured to enable access of the compute core in the source logical subunit to the compute core in the target logical subunit through at least a connection of the routing unit in the source logical subunit and the routing unit in the target logical subunit based on an address of the compute core in the target logical subunit in the access instruction received from the compute core in the source logical subunit.
6. The three-dimensional integrated chip of claim 4, wherein the target access address is an address of a bank of a target memory array, wherein
The routing unit in the source logic subunit is configured to implement access of the computational core in the source logic subunit to the memory bank of the target memory array at least through connection of the routing unit in the source logic subunit, the routing unit in the logic subunit corresponding to the target memory array, and the second three-dimensional integration port to the first three-dimensional integration port corresponding to the memory bank of the target memory array based on the address of the memory bank of the target memory array in the access instruction received from the computational core in the source logic subunit.
7. The three-dimensional integrated chip of claim 1 or 2, wherein the memory unit further comprises:
at least one Error Correction Code (ECC) cell, wherein the at least one memory bank each has an ECC cell configured to detect and correct errors of data stored in each memory bank.
8. The three-dimensional integrated chip of claim 1, wherein the logic subunit further comprises:
the buffer module is configured to convert the working voltage of the logic unit into the working voltage of the memory unit or convert the working voltage of the memory unit into the working voltage of the logic unit when the logic unit performs storage access on the memory unit.
9. The three-dimensional integrated chip according to claim 1 or 2, wherein the logic unit is a high performance computational logic unit or a super computational logic unit.
10. A method for constructing a three-dimensional integrated chip, wherein the three-dimensional integrated chip at least comprises a memory unit and a logic unit, the method comprising:
building the memory cell, the memory cell comprising at least one storage array and at least one first three-dimensional integrated port, wherein:
the memory array comprises at least one memory bank;
each memory bank corresponds to a first three-dimensional integrated port;
constructing the logic unit, wherein the logic unit comprises at least one logic subunit, and the logic subunit at least comprises a second three-dimensional integrated port, at least one computing core and a memory controller, wherein:
the second three-dimensional integrated port is connected with the at least one first three-dimensional integrated port;
the computing core is configured to access the at least one memory bank and implement a corresponding computing function using the accessed data;
the memory controller is coupled to the computing core and selectively coupled to the second three-dimensional integrated port, the memory controller configured to control memory access between the computing core and the at least one memory bank via the coupling of the second three-dimensional integrated port and the at least one first three-dimensional integrated port;
wherein the logic unit further comprises: a test and repair unit selectively connected with the second three-dimensional integration port, the test and repair unit configured to test and repair the at least one memory bank via connection of the second three-dimensional integration port and the first three-dimensional integration port.
11. An electronic device, characterized in that it comprises a three-dimensional integrated chip according to any one of claims 1-9.
12. A data processing method of a three-dimensional integrated chip, wherein the data processing method is based on the three-dimensional integrated chip of any one of claims 1 to 9, and the method comprises:
receiving an access instruction from a computing core, and acquiring an access address based on the access instruction;
and controlling storage access between the computing core and the at least one storage body through the connection of the second three-dimensional integrated port and the at least one first three-dimensional integrated port.
CN202111237509.4A 2021-10-25 2021-10-25 Three-dimensional integrated chip, construction method thereof, data processing method and electronic equipment Active CN113674772B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111237509.4A CN113674772B (en) 2021-10-25 2021-10-25 Three-dimensional integrated chip, construction method thereof, data processing method and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111237509.4A CN113674772B (en) 2021-10-25 2021-10-25 Three-dimensional integrated chip, construction method thereof, data processing method and electronic equipment

Publications (2)

Publication Number Publication Date
CN113674772A CN113674772A (en) 2021-11-19
CN113674772B true CN113674772B (en) 2022-04-12

Family

ID=78550985

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111237509.4A Active CN113674772B (en) 2021-10-25 2021-10-25 Three-dimensional integrated chip, construction method thereof, data processing method and electronic equipment

Country Status (1)

Country Link
CN (1) CN113674772B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114497033A (en) * 2022-01-27 2022-05-13 上海燧原科技有限公司 Three-dimensional chip
CN114709205B (en) * 2022-06-02 2022-09-09 西安紫光国芯半导体有限公司 Three-dimensional stacked chip and data processing method thereof

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10666264B1 (en) * 2018-12-13 2020-05-26 Micron Technology, Inc. 3D stacked integrated circuits having failure management
CN111863784A (en) * 2020-07-30 2020-10-30 长江存储科技有限责任公司 Semiconductor structure and manufacturing method thereof
CN112384978A (en) * 2020-10-12 2021-02-19 长江先进存储产业创新中心有限责任公司 Novel integration scheme for bonding to 3D XPOINT chip by adopting CPU

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109148498B (en) * 2018-08-14 2021-06-15 武汉新芯集成电路制造有限公司 Structure of three-dimensional bonding sensor with high storage capacity and manufacturing method thereof
CN110945652A (en) * 2019-04-15 2020-03-31 长江存储科技有限责任公司 Stacked three-dimensional heterogeneous memory device and forming method thereof
US10923450B2 (en) * 2019-06-11 2021-02-16 Intel Corporation Memory arrays with bonded and shared logic circuitry
JP7211296B2 (en) * 2019-07-19 2023-01-24 日本ポリプロ株式会社 Branched polypropylene polymer
US11171097B2 (en) * 2020-01-28 2021-11-09 Sandisk Technologies Llc Bonded assembly containing metal-organic framework bonding dielectric and methods of forming the same
CN112164674A (en) * 2020-09-24 2021-01-01 芯盟科技有限公司 Stacked high bandwidth memory
CN112214448B (en) * 2020-10-10 2024-04-09 声龙(新加坡)私人有限公司 Data dynamic reconstruction circuit and method of heterogeneous integrated workload proving operation chip
CN112614853B (en) * 2020-12-01 2023-05-12 长江存储科技有限责任公司 Three-dimensional memory device and forming method thereof
CN112558889B (en) * 2021-02-26 2021-05-28 北京微核芯科技有限公司 Stacked Cache system based on SEDRAM, control method and Cache device
CN113505091B (en) * 2021-09-10 2021-12-14 西安紫光国芯半导体有限公司 Stack device based on SEDRAM and stack system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10666264B1 (en) * 2018-12-13 2020-05-26 Micron Technology, Inc. 3D stacked integrated circuits having failure management
CN111863784A (en) * 2020-07-30 2020-10-30 长江存储科技有限责任公司 Semiconductor structure and manufacturing method thereof
CN112384978A (en) * 2020-10-12 2021-02-19 长江先进存储产业创新中心有限责任公司 Novel integration scheme for bonding to 3D XPOINT chip by adopting CPU

Also Published As

Publication number Publication date
CN113674772A (en) 2021-11-19

Similar Documents

Publication Publication Date Title
US11145384B2 (en) Memory devices and methods for managing error regions
US10147479B2 (en) Memory module and memory system
US11508693B2 (en) High capacity memory module including wafer-section memory circuit
CN113674772B (en) Three-dimensional integrated chip, construction method thereof, data processing method and electronic equipment
US9287268B2 (en) Dynamic random access memory (DRAM) and production method, semiconductor packaging component and packaging method
TWI786738B (en) Dram chiplet structure and method for manufacturing the same
US11810640B2 (en) Memory interface with configurable high-speed serial data lanes for high bandwidth memory
JP2017514263A (en) Memory with controller with local memory
US20150325521A1 (en) Semiconductor device and semiconductor chip
US10748877B2 (en) Integrated wafer-level processing system
TW201417219A (en) Through-silicon via self-routing circuit and routing method thereof
Beyne et al. 3D SoC integration, beyond 2.5 D chiplets
CN113505091B (en) Stack device based on SEDRAM and stack system
CN106782666B (en) Three-dimensional stacked memory
WO2023030051A1 (en) Stacked chip
CN113971370A (en) Three-dimensional integrated chip, construction method thereof, data processing method and electronic equipment
CN111952298B (en) Neural network intelligent chip and forming method thereof
US20240088084A1 (en) Tightly-coupled random access memory interface shim die
Tam et al. Breaking the memory wall for AI chip with a new dimension
CN105742277A (en) Three-dimensional expansion method for high-capacity three-dimensional integrated SRAM memory
CN112447202A (en) Neural network intelligent chip and forming method thereof
CN116601705A (en) Memory, using method thereof, manufacturing method thereof and electronic equipment
WO2020000183A1 (en) Wafer-level packaging method for semiconductor and semiconductor package
CN114497033A (en) Three-dimensional chip
CN112446475A (en) Neural network intelligent chip and forming method thereof

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant