WO2024001962A1 - Memory, chip stack structure, chip package structure, and electronic device - Google Patents

Memory, chip stack structure, chip package structure, and electronic device Download PDF

Info

Publication number
WO2024001962A1
WO2024001962A1 PCT/CN2023/102132 CN2023102132W WO2024001962A1 WO 2024001962 A1 WO2024001962 A1 WO 2024001962A1 CN 2023102132 W CN2023102132 W CN 2023102132W WO 2024001962 A1 WO2024001962 A1 WO 2024001962A1
Authority
WO
WIPO (PCT)
Prior art keywords
memory
storage area
chip
bandwidth
banks
Prior art date
Application number
PCT/CN2023/102132
Other languages
French (fr)
Chinese (zh)
Inventor
汪浩
王腾
王永森
刘宇
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Publication of WO2024001962A1 publication Critical patent/WO2024001962A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0602Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/0625Power saving in storage systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0602Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/061Improving I/O performance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0629Configuration or reconfiguration of storage systems
    • G06F3/0631Configuration or reconfiguration of storage systems by allocating resources to storage systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0655Vertical data movement, i.e. input-output transfer; data movement between one or more hosts and one or more storage devices
    • G06F3/0656Data buffering arrangements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671In-line storage system
    • G06F3/0683Plurality of storage devices
    • G06F3/0688Non-volatile semiconductor memory arrays
    • HELECTRICITY
    • H01ELECTRIC ELEMENTS
    • H01LSEMICONDUCTOR DEVICES NOT COVERED BY CLASS H10
    • H01L23/00Details of semiconductor or other solid state devices
    • H01L23/48Arrangements for conducting electric current to or from the solid state body in operation, e.g. leads, terminal arrangements ; Selection of materials therefor
    • HELECTRICITY
    • H01ELECTRIC ELEMENTS
    • H01LSEMICONDUCTOR DEVICES NOT COVERED BY CLASS H10
    • H01L23/00Details of semiconductor or other solid state devices
    • H01L23/48Arrangements for conducting electric current to or from the solid state body in operation, e.g. leads, terminal arrangements ; Selection of materials therefor
    • H01L23/481Internal lead connections, e.g. via connections, feedthrough structures
    • HELECTRICITY
    • H01ELECTRIC ELEMENTS
    • H01LSEMICONDUCTOR DEVICES NOT COVERED BY CLASS H10
    • H01L23/00Details of semiconductor or other solid state devices
    • H01L23/52Arrangements for conducting electric current within the device in operation from one component to another, i.e. interconnections, e.g. wires, lead frames
    • H01L23/538Arrangements for conducting electric current within the device in operation from one component to another, i.e. interconnections, e.g. wires, lead frames the interconnection structure between a plurality of semiconductor chips being formed on, or in, insulating substrates
    • HELECTRICITY
    • H01ELECTRIC ELEMENTS
    • H01LSEMICONDUCTOR DEVICES NOT COVERED BY CLASS H10
    • H01L23/00Details of semiconductor or other solid state devices
    • H01L23/52Arrangements for conducting electric current within the device in operation from one component to another, i.e. interconnections, e.g. wires, lead frames
    • H01L23/538Arrangements for conducting electric current within the device in operation from one component to another, i.e. interconnections, e.g. wires, lead frames the interconnection structure between a plurality of semiconductor chips being formed on, or in, insulating substrates
    • H01L23/5386Geometry or layout of the interconnection structure
    • HELECTRICITY
    • H01ELECTRIC ELEMENTS
    • H01LSEMICONDUCTOR DEVICES NOT COVERED BY CLASS H10
    • H01L25/00Assemblies consisting of a plurality of individual semiconductor or other solid state devices ; Multistep manufacturing processes thereof
    • H01L25/03Assemblies consisting of a plurality of individual semiconductor or other solid state devices ; Multistep manufacturing processes thereof all the devices being of a type provided for in the same subgroup of groups H01L27/00 - H01L33/00, or in a single subclass of H10K, H10N, e.g. assemblies of rectifier diodes
    • H01L25/04Assemblies consisting of a plurality of individual semiconductor or other solid state devices ; Multistep manufacturing processes thereof all the devices being of a type provided for in the same subgroup of groups H01L27/00 - H01L33/00, or in a single subclass of H10K, H10N, e.g. assemblies of rectifier diodes the devices not having separate containers
    • H01L25/065Assemblies consisting of a plurality of individual semiconductor or other solid state devices ; Multistep manufacturing processes thereof all the devices being of a type provided for in the same subgroup of groups H01L27/00 - H01L33/00, or in a single subclass of H10K, H10N, e.g. assemblies of rectifier diodes the devices not having separate containers the devices being of a type provided for in group H01L27/00
    • H01L25/0657Stacked arrangements of devices
    • HELECTRICITY
    • H10SEMICONDUCTOR DEVICES; ELECTRIC SOLID-STATE DEVICES NOT OTHERWISE PROVIDED FOR
    • H10BELECTRONIC MEMORY DEVICES
    • H10B80/00Assemblies of multiple devices comprising at least one memory device covered by this subclass

Definitions

  • the present application relates to the field of memory technology, and in particular to a memory, a chip stack structure, a chip packaging structure and electronic equipment.
  • multiple separate, low-capacity chips can be implemented into a high-capacity single chip.
  • Stacking multiple identical memory chips in a single package to achieve higher storage density packaging can increase storage capacity, but this can only solve the problem of insufficient storage capacity and cannot increase the bandwidth of the memory chip.
  • Embodiments of the present application provide a memory, a chip stack structure, a chip packaging structure and an electronic device to improve the problem that existing main memories cannot meet the requirements for bandwidth, capacity and power consumption.
  • a memory including: a first storage area and a second storage area, the first storage area and the second storage area including a plurality of storage banks, each storage bank including one or more arrays, each The arrays have the same number of input and output ports; wherein the number of arrays of the bank located in the first storage area is different from the number of arrays of the bank located in the second storage area, so that the banks of the first storage area and the banks of the second storage area have different The number of input and output ports and the amount of data that can be read and written in unit time are different, so the bandwidth of the first storage area is different from the bandwidth of the second storage area.
  • the first storage area and the second storage area can be suitable for different bandwidth requirements. Application scenarios.
  • the memory provided by the embodiments of the present application has different bandwidth sizes between the first storage area and the second storage area, which can meet the requirements. Different bandwidth requirements, while only increasing the bandwidth of a part of the storage area, can avoid the increase in overall power consumption of the memory due to the increase in bandwidth; when expanding the memory capacity, the memory capacity can also be expanded according to the bandwidth requirements or capacity requirements, for example, if the memory If you need to meet the requirements of smaller bandwidth and larger storage capacity, you can increase the area of the storage area with smaller bandwidth in the first storage area and the second storage area. This can also prevent the memory from expanding the memory capacity due to the larger overall bandwidth. This results in an increase in the overall power consumption of the memory.
  • the number of arrays of banks located in the first storage area is greater than the number of arrays of banks in the second storage area, and each array has the same number of input and output ports.
  • the first storage area The number of arrays of the bank in the second storage area is greater than the number of arrays in the bank in the second storage area.
  • the bank in the first storage area has more input and output ports and can be read and written per unit time. There is more data, so the bandwidth of the first storage area is greater than the bandwidth of the second storage area.
  • the first storage area can be suitable for large-bandwidth storage requirements, and the second storage area can be suitable for low-bandwidth storage requirements.
  • the number of arrays of banks located in the first storage area is 2 n times the number of arrays of banks located in the second storage area, and n is a positive integer, such as the number of arrays of banks in the first storage area. It is twice the number of bank arrays in the second storage area, so that the bandwidth of the first storage area can also be twice the bandwidth of the second storage area.
  • the number of banks in the first storage area is smaller than the number of banks in the second storage area, and the capacity of any two banks is the same, so that the storage capacity of the first storage area is smaller than the storage capacity of the second storage area. Capacity, due to the increase in bandwidth, will lead to an increase in power consumption.
  • the bandwidth of the first storage area in the memory provided by the embodiment of the present application is greater than the second storage area, and the number of banks in the first storage area is less than the number of banks in the second storage area. , that is, the capacity of the first storage area is smaller than the capacity of the second storage area, which can avoid excessive power consumption.
  • the embodiment of the present application Provides memory with lower power consumption.
  • the memory includes a first data bus and a second data bus; the first data bus is used to read and write data to the first storage area, and the second data bus is used to read and write data to the second storage area.
  • the first storage area and the second storage area can operate independently. When the first storage area is working, the second storage area does not need to work. When the second storage area is working, the first storage area does not need to work. This can reduce the memory cost. of power consumption.
  • inventions of the present application provide a chip stack structure.
  • the chip stack structure includes a system and a chip SoC and one or more memories.
  • the SoC and one or more memories are stacked in sequence; At least one memory provided for any implementation of the first aspect.
  • the stacking method includes one or more of the following methods: through silicon via TSV connection, chip on chip, chip on wafer, chip on wafer, wafer on wafer Yuan wafer on wafer.
  • embodiments of the present application provide a chip packaging structure, including a packaging substrate and a chip stack structure provided in any implementation manner of the second aspect.
  • the chip stack structure is disposed on the packaging substrate.
  • embodiments of the present application provide a chip packaging structure, including a packaging substrate, a system-on-chip SoC, and a memory provided in any implementation manner of the first aspect;
  • the packaging substrate includes a first plane, and the first plane includes The first area and the second area, the SoC is arranged in the first area, the memory is arranged in the second area, and the memory is electrically connected to the SoC through connecting lines.
  • embodiments of the present application provide a chip packaging structure, including a packaging substrate, a system-level chip, and multiple memories; the packaging substrate includes a first plane, and the first plane includes a first region and a second region that do not want to intersect.
  • the SoC is disposed in the first area; a plurality of memory stacks are disposed in the second area to form a chip stack structure, at least one of the plurality of memories is a memory provided by any implementation method of the first aspect; the chip stack structure is connected to the chip stack structure through connecting lines. SoC electrical connections.
  • embodiments of the present application provide an electronic device, including a printed circuit board and a chip packaging structure as provided in any one of the second to fifth aspects; the packaging substrate and the printed circuit board in the chip packaging structure Electrical connection.
  • Figure 1 is a schematic diagram of data exchange between various sensors and processors
  • Figure 2 is a schematic diagram of the pyramid structure of multi-level storage
  • Figure 3 is a schematic diagram of a computer system provided by an embodiment of the present application.
  • Figure 4 is a schematic diagram of a storage unit provided by an embodiment of the present application.
  • Figure 5 is a schematic diagram of a memory provided by an embodiment of the present application.
  • Figure 6 is a schematic diagram of an array provided by an embodiment of the present application.
  • Figure 7 is a schematic structural diagram of a memory provided by an embodiment of the present application.
  • Figure 8 is a schematic diagram of main memory power consumption in mobile devices
  • Figures 9 and 10 are schematic diagrams of the main parameters of the main memory of the mobile device.
  • Figure 11 is a schematic diagram of a memory package
  • Figure 12 is a schematic diagram of a solution for improving main memory frequency
  • Figure 13 is a schematic structural diagram of a memory provided by an embodiment of the present application.
  • Figure 14 is a schematic diagram of the storage library divided into multiple arrays
  • Figure 15 is a schematic diagram of the distribution of the first storage area and the second storage area
  • Figure 16 is a schematic diagram of a chip stack structure provided by an embodiment of the present application.
  • Figure 17 is a schematic diagram of a chip packaging structure provided by an embodiment of the present application.
  • Figure 18 is a schematic diagram of another chip packaging structure provided by an embodiment of the present application.
  • Figure 19 is a schematic diagram of another chip packaging structure provided by an embodiment of the present application.
  • Figure 20 is a schematic diagram of another chip stack structure provided by an embodiment of the present application.
  • Figure 21 is a schematic diagram of another chip packaging structure provided by an embodiment of the present application.
  • Figure 22 is a schematic diagram of another chip packaging structure provided by an embodiment of the present application.
  • Figure 23 is a schematic diagram of another chip packaging structure provided by an embodiment of the present application.
  • Figure 24 is a schematic layout diagram of an SoC and memory provided by an embodiment of the present application.
  • connection should be understood in a broad sense.
  • connection can be a fixed connection, a detachable connection, or an integral body; it can be a direct connection, They can also be connected indirectly through intermediaries.
  • electrical connection may be a direct electrical connection or an indirect electrical connection through an intermediary.
  • module usually refers to a functional structure divided according to logic.
  • the “module” can be implemented by pure hardware, or a combination of software and hardware.
  • "and/or” describes the association of associated objects, indicating that there can be three relationships. For example, A and/or B can represent: A exists alone, B exists alone, and A and B exist simultaneously. situation.
  • the Von Neumann computer consists of five major components: input device, output device, memory, arithmetic unit and controller.
  • the arithmetic unit is used to complete arithmetic operations and logical operations, and the intermediate results of the operations are temporarily stored in the arithmetic unit;
  • the controller is used to control and direct the input, operation and processing of operation results of programs and data;
  • the memory is used to store data and programs;
  • input Devices are used to convert information forms that people are familiar with into information forms that computers can recognize.
  • CPU central processing unit
  • I/O devices input/output equipment
  • Memory is a memory component used to store programs and various data information. From different dimensions, memories in computer systems can be divided into different types.
  • memory can be divided into read-only memory (read-only-memory) and random-access memory (random-access memory, RAM) according to different working principles.
  • ROM works in a non-destructive reading mode. It can only read information but cannot write it. Once the information is written, it is fixed. Even if the power is cut off, the information will not be lost, so it is also called fixed memory.
  • RAM is a readable/writable memory with fast reading and writing speeds. It is usually used as a temporary data storage medium for the operating system or other running programs. RAM can be written or read from any specified address at any time when working. The biggest difference between data and ROM is the volatility of data, that is, the stored data will be lost once the power is turned off.
  • RAM is used in computers or digital systems to temporarily store programs, data and intermediate results.
  • RAM can be divided into dynamic random access memory (dynamic random access memory, DRAM) and static random access memory (static random access memory, SRAM).
  • memory can be divided into main memory, auxiliary memory and cache.
  • Figure 1 shows a schematic diagram of data exchange between main memory, auxiliary memory and cache and CPU.
  • Main memory also known as main memory, is also called memory.
  • Main memory can directly exchange data or information with the CPU. It is used to temporarily store the CPU's operation data, data exchanged with auxiliary storage, etc.
  • Main memory is the bridge between the CPU and auxiliary memory. All computer programs run in the main memory. The performance of the main memory will affect the overall performance of the computer. While the computer is running, The operating system will transfer the data that needs to be processed from the main memory to the CPU for calculation, and the computer cannot operate normally without the main memory.
  • auxiliary memory also known as auxiliary memory
  • auxiliary memory is also called external memory. It refers to memory other than computer memory and CPU cache. Compared with memory, the read and write speed is slower. This type of memory can still save data after a power outage. It is generally used Stores programs and data that are used less frequently. Auxiliary storage cannot directly exchange data with the CPU. Data needs to be exchanged with the CPU through main memory. Common auxiliary storage includes hard disks, floppy disks, optical disks, U disks, flash memory, etc.
  • Cache or buffer memory (cache) is a technology used to solve the speed mismatch between the CPU and the main memory.
  • the cache is a small-capacity memory between the CPU and the main memory and has a faster reading speed.
  • Figure 2 shows the storage hierarchy pyramid structure of a computer system. From top to bottom, it is: register file (RF), cache (cache), main memory (main memory) and auxiliary storage (storage). Storage from top to bottom. The capacity is getting bigger, but the access speed is getting slower and slower.
  • RF register file
  • cache cache
  • main memory main memory
  • auxiliary storage storage
  • the register file is an array of multiple registers in the processor. It usually consists of dozens of 32/64bits registers. It can be used to temporarily store instructions, data, addresses, etc.
  • the registers are generally integrated in the CPU. For mobile For end devices, they are usually integrated on a system on chip (SoC).
  • SoC system on chip
  • the register has a read and write speed close to that of the processor, but the cost is higher, so the capacity is generally smaller.
  • Cache also known as cache memory
  • cache memory is a small-capacity but high-speed memory located between the CPU and the main memory.
  • the capacity is usually in the order of MB.
  • the cache generally does not use DRAM technology, but it is expensive to use. But faster SRAM technology. Since the speed of the CPU is much higher than that of the main memory, the CPU has to wait for a certain period of time to directly access data from the main memory. Therefore, a cache is set up to solve the problem of speed mismatch between the CPU and the main memory.
  • the cache stores the data that the CPU has just used or A part of the data used in the cycle can be directly called from the cache when the CPU uses this part of the data again. This reduces the waiting time of the CPU and improves the efficiency of the system.
  • the setting of the cache is an important factor in the high performance of all modern computer systems. one.
  • Main memory is mainly used to store programs and data that need to be run. There is a big gap between the speed of main memory and the speed of the CPU. In order to match the speed of the main memory and the CPU, a speed ratio of the main memory is inserted between the main memory and the CPU. Faster and smaller cache memory. Main memory generally uses DRAM technology. Its capacity restricts the number of programs that the device can run at the same time, which directly affects the performance of the device. It is an important storage device in computers. The storage capacity of main memory can reach the GB level. For mobile devices For end devices, the main memory usually does not belong to the same chip (die) as the SoC.
  • the last level of large-capacity auxiliary storage is used to store data, such as images, videos, etc.
  • the read and write speed of this level of memory is slow, but the capacity can be very large. For example, the capacity can reach the GB or even TB level.
  • the data stored in the memory is loaded into the main memory for processing.
  • the multi-level structure of the storage system is mainly reflected in the two storage levels of main memory-cache and main memory-auxiliary memory.
  • the main memory-cache level is mainly used to solve the speed problem of the storage system.
  • the speed of main memory does not match the speed of the CPU. Since the speed of main memory is lower and the speed of cache is higher than that of main memory, you only need to transfer the data to be used by the CPU into the cache, and the CPU can directly obtain the data from the cache, thereby increasing the access speed; main memory -
  • the auxiliary storage level mainly solves the capacity problem of the storage system.
  • the speed of auxiliary storage is lower than that of main memory, and it cannot directly exchange data with the CPU. However, its capacity is much larger than that of main memory. When the CPU needs data in auxiliary storage, it will After being loaded into the main memory, it is used by the CPU.
  • the main memory can exchange data with the CPU, cache and auxiliary memory, and is an important part of the storage system.
  • Various parameters of the main memory such as capacity, bandwidth, cost, etc., restrict the development of this computer system.
  • the computer system referred to in the embodiments of this application can be a computer, and of course it can also refer to a mobile device, such as a mobile phone, a tablet, and other devices.
  • Figure 3 shows a schematic architectural diagram of a computer system provided by an embodiment of the present application.
  • the computer system 100 may at least include a processor 101, a memory controller 102 and a memory 103.
  • the memory controller 102 may be integrated into the processor 101 and the memory 103 may be main memory.
  • the computer system 100 may also include other devices such as communication interfaces and disks as auxiliary storage, which are not limited here.
  • the processor 101 is the computing core and control unit of the computer system 100.
  • Processor 101 may include multiple cores 104 .
  • An operating system and other software programs are installed in the processor 101, so that the processor 101 can access the memory 103, cache and disk.
  • the core 104 in the processor 101 may be a central processing unit (CPU), an artificial intelligence (artificial intelligence, AI) processor, a digital signal processor (digital signal processor), or a neural network processor.
  • a network processor can also be other application specific integrated circuit (ASIC), etc.
  • the memory controller 102 is a bus circuit controller that controls the memory 103 internally in the computer system 100 and is used to manage and plan data transmission from the memory 103 to the core 104.
  • Memory controller 102 may be a separate chip and connected to core 104 via a system bus.
  • the memory controller 102 may also be integrated into the processor 101 or built into the northbridge.
  • the embodiments of the present application do not limit the specific location of the memory controller 102. In practical applications, the memory controller 102 can control necessary logic to write data to the memory 103 or read data from the memory 103 .
  • the memory 103 is the main memory of the computer system 100 and is usually used to store various running software in the operating system, input and output data, and information exchanged with external memory. Dynamic random access memory (DRAM) is usually used as the memory 103.
  • the processor 101 can access the memory 103 at high speed through the memory controller 102, and perform read operations and write operations on any storage unit in the memory 103.
  • the memory 103 is a DRAM as an example for description. Unless otherwise specified, the main memory referred to in the embodiment of the present application is also called DRAM.
  • the smallest unit used to store data in DRAM is called a memory cell (MC).
  • a memory cell can store 1 bit of data.
  • the memory unit of DRAM is usually composed of a transistor and a capacitor. If it contains two transistors and two capacitors, it is called 2T2C; if it contains two transistors and one capacitor, it is called 2T1C; if it contains one transistor and one capacitor, it is called 2T1C.
  • the capacitor is called 1T1C.
  • the transistor can be a metal-oxide-semiconductor field effect transistor (MOSFET).
  • MOSFET metal-oxide-semiconductor field effect transistor
  • the transistor is divided into two types: N (negative, negative) type transistor and P (positive, positive) type transistor.
  • a transistor includes a source, a drain, and a gate.
  • the transistor By controlling the level of the input transistor gate, the transistor can be turned on or off. When the transistor is turned on, the source and drain are turned on, generating a conduction current. Moreover, when the gate levels of the transistors are different, the magnitude of the conduction current generated between the source and the drain is also different; the transistor is When turned off, the source and drain will not conduct, and no current will flow.
  • the gate of the transistor is also called the control terminal, the source is called the first terminal, and the drain is called the second terminal; or, the gate is called the control terminal, and the drain is called the second terminal. It is called the first terminal and the source is called the second terminal.
  • Figure 4 shows a schematic structural diagram of a memory unit of DRAM, including a transistor T and a capacitor C.
  • the first end of the transistor T is connected to the bit line (BL), and the second end of the transistor T is connected to the bit line (BL).
  • the control end of the transistor T is connected to the word line (WL), and the second end of the capacitor C is connected to the source line (SL), which can be connected to A specific voltage (such as ground).
  • Capacitor C is used to store data. For example, when the voltage at the first terminal of capacitor C is high level and the voltage at the second terminal is low, it stores 1.
  • the memory cells in the memory 103 are arranged and distributed into a matrix. This matrix is called a sub-array.
  • the memory 103 may include a command decoder 110, a control logic circuit 120, and a sub-array 130. , input/output circuit 140, specific embodiments are not limited thereto and may include a smaller or larger number of constituent components.
  • the command decoder 110 may receive a command CMD (command, CMD) from the memory controller 102 and may decode the received command CMD. For example, write commands (write, WR), read commands (read, RD), etc., the memory controller 102 can locate any storage unit in the bank through the corresponding row-column decoder, that is, locate any bit of data.
  • an array can include multiple sub-arrays.
  • storage units in the same array share row decoders, column decoders, and interfaces (such as input output (IO) ports).
  • IO input output
  • the array shown in Figure 6 has 64 parallel IO port. The number of IO ports is related to the bandwidth of the entire memory, so manufacturers generally increase the number of IO ports in the array to the maximum extent within the allowable range of the process.
  • DRAM uses the amount of electricity stored in the capacitor to represent data 0 and 1. Due to leakage in capacitors, if there is insufficient charge in the capacitor, errors will occur in the stored data. Therefore, every once in a while, the memory controller 102 refreshes the data in the DRAM 103 to prevent the DRAM from losing data. Moreover, DRAM is volatile. When the computer system 100 is powered off, the information in the DRAM will no longer be saved.
  • the memory unit from the processor to the DRAM is divided into channels from large to small according to the hierarchy. ), storage column (rank), storage bank (bank), storage array (array), row or column (row/column).
  • a channel is an independent accessible storage space, and the storage space may include one or a storage rank.
  • a channel includes a certain capacity of storage space and a hardware circuit for accessing the storage space.
  • the hardware circuit may include control logic, interfaces and other related circuits.
  • Channels can be composed of storage columns (ranks).
  • the memory rank refers to the memory particles (chips) connected to the same chip select signal.
  • the memory particles are also called chips; because these chips are connected to the same chip select signal, the memory controller 102 can Write operations are performed on chips in the same storage column, and chips in the same storage column also share the same control signal.
  • Each storage column may include one or more storage banks (banks). For example, a storage column may include 4 banks, or may also include 8 banks.
  • a DRAM bank includes one or more storage arrays.
  • Each storage array includes multiple memory cells (memory cells) distributed in rows and columns, or each storage array can also include multiple sub-arrays. (sub-array), the sub-array includes multiple storage cells distributed in rows and columns.
  • Figure 11 shows a memory in which data signals, control signals and addresses are connected through interconnection terminals and interconnection networks by stacking multiple main memory chips (for example, memory 1 and memory 2) on a packaging substrate in one package.
  • Signal implements multiple separate, low-capacity main memory chips into a high-capacity single chip, and achieves higher storage density packaging by stacking multiple identical memory chips in a single package.
  • the speed of reading data is one of the reasons that affects the computing power of the processor.
  • the processor reads data from the main memory for processing, so it is particularly important to increase the transmission bandwidth of the main memory.
  • Bandwidth refers to the amount of data that can be read or written per unit time.
  • one possible implementation method is to increase the number of main memory channels or expand the data bit width of each channel. bandwidth.
  • the storage controller includes a traffic statistics unit and a clock controller.
  • a frequency function is set in the storage control module. According to the statistics of the traffic statistics unit According to the traffic demand, the frequency function is used to determine the target frequency according to the traffic demand, and the frequency output by the clock controller is adjusted through the storage driver.
  • the clock controller is controlled to output a low-frequency clock signal to reduce the read and write frequency of the main memory chip, reduce the read and write bandwidth, and also reduce power consumption; when the bandwidth requirement is high, , controlling the clock controller to output a high-frequency clock signal to increase the read and write frequency of the main memory chip and increase the read and write bandwidth.
  • the high energy efficiency area of the main memory particles is located at high frequency. Running at low frequency will cause poor main memory energy efficiency and increase the access delay and power consumption of the SoC.
  • embodiments of the present application provide a memory to simultaneously meet the mobile device's requirements for main memory in terms of power consumption, bandwidth, and capacity.
  • an embodiment of the present application provides a memory, including: a first storage area and a second storage area; the first storage area and the second storage area can each be provided with one or more channels, so that the first storage area and the second storage area can each be provided with one or more channels.
  • the first storage area and the second storage area can be accessed or read and write data independently.
  • channel 1 abbreviated as ch1
  • ch2 are set in the first storage area
  • ch3 to ch8 are set in the second storage area.
  • Each channel includes one or more storage banks (banks).
  • ch1 includes 4 banks such as b1, b2, b3, and b4.
  • ch4 includes 4 banks such as b5, b6, b7, and b8.
  • Each bank includes one or more arrays, where each array has the same number of input and output ports (IO).
  • the number of arrays in the bank located in the first storage area is different from the number of arrays located in the second storage area. , in this way, the number of IO ports of the bank in the first storage area is different from the number of IO ports of the bank in the second storage area, that is to say, the bank in the first storage area and the bank in the second storage area can read and write in unit time.
  • the amount of data is different, so the first storage area and the second storage area may have different read and write bandwidths.
  • the memory can be adapted to usage scenarios with different bandwidth requirements.
  • the read and write bandwidth of the first storage area is greater than the read and write bandwidth of the second storage area.
  • the first storage area can serve usage scenarios with large bandwidth requirements, such as neural networks, artificial intelligence, etc. Intelligence, graphics processing, etc.; the second storage area can serve usage scenarios with low bandwidth requirements, such as ordinary program running, etc.
  • the first storage area and the second storage area can be accessed independently, which can meet the diverse demands for bandwidth under different operating states of the SoC.
  • the first storage area includes 2 channels: ch1, ch2; the second storage area includes 6 channels: ch3, ch4, ch5, ch6, ch7, and ch8.
  • Each channel includes multiple banks.
  • ch1 includes 4 banks such as b1, b2, b3, and b4.
  • ch4 includes b5, b6, b7, and b8. etc. 4 banks, and the number of arrays of the bank located in the first storage area is greater than the number of arrays of the bank located in the second storage area.
  • b1 of the first storage area includes 2 arrays
  • b5 of the second storage area includes 1 array.
  • the banks of the first storage area are divided into finer granularities, and more IO ports can be used.
  • the number of IO ports of the bank in the storage area is greater than the number of IO ports of the bank in the second storage area.
  • the amount of data that the bank in the first storage area can read and write is greater than the amount of data that the bank in the second storage area can read and write.
  • the read and write bandwidth of the first storage area is greater than the read and write bandwidth of the second storage area.
  • the number of IO ports per array is generally 2 n .
  • the most commonly used configuration is that each array has 64 IO ports.
  • the number of arrays in each bank can be 1, 2, 4 or 8, etc. That is, the number of arrays in each bank can be 20 , 21 , 22 , 23 , etc. Based on this, In the embodiment of the present application, the number of bank arrays located in the first storage area is 2 n times the number of bank arrays located in the second storage area.
  • each bank (taking b5 as an example) of the second storage area includes 1 array, and each array includes 64 IO ports.
  • each bank of the second storage area has 64 IO ports.
  • the number of arrays in each bank (taking b1 as an example) in the first storage area is 21 times the number of arrays in the bank in the second storage area.
  • Each array includes 64 IO ports, so that the number of arrays in the first storage area is
  • Each bank has 128 IO ports, the number of which is 2.1 times the number of IO ports of each bank in the first storage area.
  • the bandwidth of each channel in the first storage area is that of the second storage area. 2 1 times the bandwidth of each channel in the area.
  • each bank The number of arrays in each bank can be determined according to actual needs and process level. For example, as shown in Figure 14, a, b, c, and d in Figure 14 show one bank respectively. Schematic diagram of the arrays divided into 20 , 21 , 22 , and 23 arrays respectively. Since the capacity of each bank is the same, the more arrays the bank includes, the smaller the capacity of each array, and the more IO ports the bank has.
  • the shapes of the first storage area and the second storage area can be diverse.
  • the channels in the first storage area can be distributed horizontally within the memory chip plane, or as shown in Figure 15, they can also be vertically distributed. Straight distribution.
  • the memory provided by the embodiment of the present application has a first storage area The number of banks is less than the number of banks in the second storage area.
  • the first storage area of the memory 103 shown in Figure 13 includes two channels ch1 and ch2, each channel includes 4 banks, and the first storage area has a total of 8 banks; if The capacity of each bank is 256Mb, so the capacity of the first storage area is 2048Mb.
  • the second storage area includes 6 channels: ch3 ⁇ ch8. Each channel includes 4 banks. The second storage area has a total of 24 banks. If the capacity of each bank is 256Mb, the capacity of the second storage area is 6114Mb. 3 times the capacity of a storage area.
  • the capacity of each bank is basically the same, the number of banks in the first storage area is smaller than the number of banks in the second storage area, and the capacity of the first storage area is smaller than the capacity of the second storage area; the first storage area is suitable for large bandwidth , low storage capacity usage requirements, while avoiding excessive power consumption; the second storage area is suitable for low bandwidth, large storage capacity usage requirements, which can not only ensure large-capacity storage (second storage area), but also ensure that the memory has Large read and write bandwidth (first storage area).
  • the first storage area and the second storage area can independently read and write data.
  • the first storage area can only work in large-bandwidth usage scenarios.
  • the first storage area can be configured as The closed state can reduce the power consumption of the memory; in addition, the number of banks in the first storage area is smaller than the number of banks in the second storage area, and the scale of the first storage area is smaller.
  • this application implements The memory provided in the example can achieve lower reading and writing power consumption.
  • the memory provided by the embodiment of the present application includes a first storage area and a second storage area.
  • the first storage area and the second storage area can be read and written independently.
  • the first storage area and the second storage area include multiple storage areas with the same capacity.
  • bank where the number of banks in the first storage area is less than the number of banks in the second storage area, and the banks located in the first storage area are divided into finer granularities and have more arrays; the banks located in the second storage area have With a smaller number of arrays, the same memory chip simultaneously provides a large-bandwidth, small-capacity first storage area and a small-bandwidth, large-capacity second storage area, taking into account the bandwidth and capacity of the main memory of the mobile device SoC.
  • the first storage area and the second storage area can be read and written independently.
  • the memory provided by the embodiment of the present application only improves the read and write bandwidth of the first storage area, and
  • the first storage area only occupies a smaller part of the entire memory area, so it can achieve lower reading and writing power consumption.
  • the main memory and the SoC are usually on different dies.
  • the SoC and the main memory are usually packaged in the same package.
  • Figure 16 shows a chip stack structure provided by an embodiment of the present application, including an SoC and a memory 103.
  • the SoC and the memory 103 are stacked.
  • the SoC and the memory 103 are connected through a contact (bump) and a silicon pass.
  • Through-silicon-via (TSV) technology realizes the transmission of data signals, control signals and address signals, so that the SoC can read data from the memory 103 or write data to the memory 103.
  • TSV Through-silicon-via
  • the memory 103 includes a first storage area and a second storage area.
  • the first storage area and the second storage area can be read and written independently.
  • the first storage area and the second storage area include a plurality of banks with the same capacity, where The number of banks in the first storage area is smaller than the number of banks in the second storage area, and the banks located in the first storage area are divided into finer granularities, with more arrays and more IO ports, so the first storage area A higher density of contacts is required between the second storage area and the SoC; the bank located in the second storage area has a smaller number of arrays and fewer IO ports, so a lower density of contacts is required between the second storage area and the SoC.
  • the same memory 103 chip simultaneously provides a first storage area with large bandwidth and small capacity and a second storage area with small bandwidth and large capacity, taking into account the main memory bandwidth and capacity requirements of the mobile device SoC.
  • the first storage area and the second storage area can each be provided with one or more channels, so that the first storage area and the second storage area can be independently accessed or read and write data.
  • the memory 103 is provided with a first data bus, a third an address bus and a first control bus; the memory 103 is also provided with a second data bus, a second address bus and a second control bus, wherein the first data bus, the first control bus and the first address bus are used to implement the first storage
  • the second data bus, the second control bus and the second address bus are used to read and write data in the second storage area.
  • a first memory controller and a second memory controller are respectively provided in the SoC corresponding to the first storage area and the second storage area of the memory 103.
  • the first memory controller connects the first data bus, the first control bus and the third memory controller.
  • An address bus is connected to the first storage area for realizing read and write control of the first storage area of the memory 103.
  • the second memory controller communicates with the second storage area through the second data bus, the second control bus and the second address bus. The connection is used to realize read and write control of the second storage area of the memory 103.
  • SoC and main memory can adopt a variety of packaging forms, such as 2.5D packaging, 3D packaging, multichip module (MCM) packaging, and package on package (PoP) packaging.
  • packaging forms such as 2.5D packaging, 3D packaging, multichip module (MCM) packaging, and package on package (PoP) packaging.
  • MCM multichip module
  • PoP package on package
  • Figure 17 shows a chip packaging structure provided by the embodiment of the present application.
  • the chip packaging structure includes a packaging substrate and the above-mentioned chip stacking structure.
  • the chip stacking structure is arranged on the packaging substrate. This packaging form in which all chips are stacked together is called 3D packaging.
  • the chip stack structure here includes SoC and memory 103.
  • the SoC can be close to the packaging substrate and the memory 103 is far away from the packaging substrate; as shown in Figure 17, the memory 103 can also be close to the packaging substrate. , the SoC is far away from the packaging substrate.
  • Figure 19 shows another chip packaging structure provided by an embodiment of the present application, including a packaging substrate, an SoC and a memory 103.
  • the packaging substrate includes a first plane, and the first plane includes a first area and a first area that do not want to intersect.
  • the SoC is arranged in the first area
  • the memory 103 is arranged in the second area.
  • the memory 103 is electrically connected to the SoC through connecting lines.
  • MCM multichip module
  • the memory 103 simultaneously provides a first storage area with large bandwidth and small capacity and a second storage area with small bandwidth and large capacity, taking into account the bandwidth and capacity requirements of the main memory of the mobile device SoC.
  • the memory 103 The capacity is limited and cannot meet the SoC's demand for main memory capacity. In this way, the capacity can be expanded by stacking multiple memory chips.
  • FIG. 20 shows another chip stack structure provided by an embodiment of the present application, including an SoC and multiple memories. At least one memory among the multiple memories here is provided by the previous embodiment of the present application.
  • the memory 103 includes a first storage area with large bandwidth and small capacity, and a second storage area with small bandwidth and large capacity.
  • the chip stack structure includes an SoC, a memory 103 and a memory 105.
  • the memory 103 includes a first storage area and a second storage area.
  • the first storage area and the second storage area can be read and written independently.
  • the first storage area and the second storage area can be read and written independently.
  • the second storage area includes multiple banks with the same capacity.
  • the number of banks in the first storage area is smaller than the number of banks in the second storage area, and the banks located in the first storage area are divided into finer granularities and have more arrays.
  • the bank located in the second storage area has a smaller number of arrays, so that the same memory 103 chip provides both a large bandwidth and small capacity first storage area and a small bandwidth and large capacity second storage area, taking into account Mobile device SoC’s requirements for main memory bandwidth and capacity.
  • the number of arrays of each bank in the storage area of the memory 105 is the same, and the bandwidth provided is also the same.
  • the number of arrays of each bank in the storage area of the memory 105 is the same as the number of arrays of each bank in the second storage area of the memory 103. , so that the memory 105 can have the same read and write bandwidth as the second storage area of the memory 103 .
  • this embodiment of the present application provides another chip packaging structure, including a packaging substrate, an SoC and multiple memories.
  • a packaging substrate including a packaging substrate, an SoC and multiple memories.
  • the multiple memories here, at least one memory is provided by the previous embodiment of the present application.
  • the memory 103 includes a first storage area with large bandwidth and small capacity, and a second storage area with small bandwidth and large capacity.
  • the chip packaging structure includes a packaging substrate, SoC, memory 103 and memory 105, wherein the memory 103 includes a first storage area with large bandwidth and small capacity and a second storage area with small bandwidth and large capacity, wherein the The number of banks in one storage area is smaller than the number of banks in the second storage area.
  • the banks located in the first storage area are divided into finer granularities and have more arrays; the banks located in the second storage area have fewer arrays. , taking into account the bandwidth and capacity requirements of mobile device SoC for main memory.
  • the structure of memory 105 is different from that of memory 103.
  • the number of arrays of each bank in the storage area of memory 105 is the same.
  • the number of arrays of each bank in the storage area of memory 105 is the same as that of each bank in the second storage area of memory 103. number of arrays, so that the second memory of memory 105 can be combined with the second memory of memory 103
  • the storage area has the same read and write bandwidth.
  • the chip packaging structure includes a packaging substrate, an SoC, a memory 103 and a memory 105.
  • the memory 103 includes a large-bandwidth, small-capacity first memory. area and a second storage area with small bandwidth and large capacity.
  • the number of banks in the first storage area is smaller than the number of banks in the second storage area.
  • the banks located in the first storage area are divided into finer granularities and have more arrays. ;
  • the bank located in the second storage area has a smaller number of arrays, taking into account the main memory bandwidth and capacity requirements of the mobile device SoC.
  • the structure of the memory 105 is the same as that of the memory 103, which is equivalent to doubling the storage capacity of the large-bandwidth storage area and the low-bandwidth storage area provided by the memory 103 at the same time.
  • the above is only an explanation of the solution for expanding the main memory capacity in a stacking manner provided by the embodiment of the present application.
  • multiple memory chips can be stacked to expand the capacity of the main memory.
  • the stacked multiple memory chips may include at least one memory 103 provided in the previous embodiments of the present application, that is, a memory including a large-bandwidth, low-capacity first storage area and a low-bandwidth, large-capacity second storage area.
  • stacking All of the multiple memory chips may be the memory 103 provided in the previous embodiments of the present application.
  • the stacking methods of each chip stack structure include one or more of the following methods, such as: through-silicon-via (TSV) connection, chip on chip (Chip on chip) Chip), chip on wafer (Chip on Wafer), wafer on wafer (wafer on wafer), and of course other stacking methods are also possible.
  • TSV through-silicon-via
  • Chip on chip Chip on chip
  • Chip on Wafer chip on Wafer
  • wafer on wafer wafer on wafer
  • MCM packaging encapsulates multiple chips on the same packaging substrate.
  • memory chips can also be stacked to expand the capacity of main memory.
  • the embodiment of the present application provides another chip packaging structure, including a packaging substrate, SoC and multiple memories; the packaging substrate includes a first plane, and the first plane includes an unintended The first area and the second area, the SoC is disposed in the first area; a plurality of memory stacks are disposed in the second area to form a chip stack structure, and at least one of the plurality of memories is the memory provided in the previous embodiment of the present application, that is, it includes a large bandwidth , a low-capacity first storage area and a low-bandwidth, large-capacity second storage area memory 103.
  • the multiple stacked memory chips may all be the memories 103 provided in the previous embodiments of the present application.
  • the SoC can independently access the first storage area and the second storage area, that is to say, taking the memory 103 and the memory 105 as an example, the memory 103 includes the first storage area and the second storage area, and the bank of the first storage area The number of arrays is different from the number of arrays of banks in the second storage area. For example, the number of arrays of banks in the first storage area is greater than the number of arrays of banks in the second storage area.
  • the SoC is provided with a first memory controller and a second memory controller.
  • the first memory controller is used to perform read and write control on the first storage area
  • the second memory controller is used to perform read and write control on the second storage area
  • the number of arrays of each bank in the storage area of the memory 103 is the same, for example, The number of arrays of each bank in the storage area of the memory 103 is the same as the number of banks in the second storage area of the memory 103. In this way, the bandwidth of the memory 105 and the second storage area of the memory 103 can be the same, and the SoC can use the second memory controller to The memory 105 performs read and write control.
  • the memory 103 Since the memory 103 has two independent storage areas, the first storage area and the second storage area need to be provided with an address bus, a control bus and a data bus respectively; while the memory 105 only has one overall storage area, so there is no connection between the memory 103 and the SoC.
  • the interconnection lines used for read and write control are more and more complex, and the interconnection lines used for read and write control between the memory 105 and the SoC are fewer and simpler, and in the case of stacking settings, part of the interconnection lines between the memory 103 and the SoC can be used.
  • the memory with the first storage area and the second storage area is located on the side closer to the SoC among the multiple stacked memories to reduce the length of the interconnect lines and reduce the wiring.
  • Capacitors reduce the complexity of read and write control circuits such as address decoders, and further reduce read and write power consumption.
  • the memory 103 includes a first storage area and a second storage area.
  • the first storage area has a larger read and write bandwidth and can meet the high bandwidth requirements of high computing power modules in the SoC.
  • the high computing power modules here generally include graphics. processor (graphic processing unit, GPU) or network processor (neural-network processing unit, NPU), etc., then when the SoC is stacked on the memory, for example, see Figure 24, the high computing power module of the SoC can be combined with the third A storage area is aligned in the stacking direction.
  • the projection of the SoC's high computing power module on the packaging substrate and the projection of the first storage area on the packaging substrate are located in the same area. This can also reduce the length of the interconnection line between the two. Reducing the connection capacitance will help reduce reading and writing power consumption.
  • An embodiment of the present application also provides a storage device.
  • the storage device includes a printed circuit board (PCB) and a memory connected to the printed circuit board.
  • the memory can be any of the memories provided above.
  • the printed circuit board is used to provide electrical connections for electronic components included in the memory.
  • the storage device can be a computer, Different types of user devices or terminal devices such as mobile phones, tablets, wearable devices, and vehicle-mounted devices.
  • the storage device may further include a packaging substrate, the packaging substrate is fixed on the printed circuit board PCB through solder balls, the memory is fixed on the packaging substrate through solder balls, and the packaging substrate is used to package the memory.
  • a storage device in another aspect of the present application, includes a first controller, a second controller and a memory.
  • the first controller and the second controller are used to control reading and writing in the memory.
  • the memory may be the memory 103 provided in the embodiment of this application.
  • the storage device can be applied to mobile devices, such as mobile phones, tablet computers, etc.
  • the first controller can be a first memory controller
  • the second controller can be a second memory controller
  • the memory includes large bandwidth, low
  • the first memory controller is used to control the reading and writing of the first storage area
  • the second memory controller is used to control the reading and writing of the second storage area.
  • an electronic device in another aspect of the present application, includes a printed circuit board (PCB) and a chip packaging structure as provided in Figures 17 to 23 in the above embodiments.
  • the chip packaging structure Including memory, SoC and packaging substrate, the chip packaging structure is connected to the printed circuit board, for example, the packaging substrate and the printed circuit board can be electrically connected.
  • the electronic device is different types of user equipment or terminal equipment such as computer systems, mobile phones, tablets, wearable devices, and vehicle-mounted equipment. It should be noted that the above-mentioned relevant descriptions of the memory can be correspondingly cited in the storage device and electronic equipment provided in the present application, and the embodiments of the present application will not be repeated here.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Microelectronics & Electronic Packaging (AREA)
  • Power Engineering (AREA)
  • Computer Hardware Design (AREA)
  • Human Computer Interaction (AREA)
  • General Engineering & Computer Science (AREA)
  • Condensed Matter Physics & Semiconductors (AREA)
  • Geometry (AREA)
  • Memory System (AREA)
  • Dram (AREA)
  • Semiconductor Memories (AREA)

Abstract

The present application relates to the technical field of memories, and provides a memory, a chip stack structure, a chip package structure, and an electronic device, capable of providing storage spaces of different bandwidths on a same memory, and meeting the bandwidth, power consumption and capacity requirements. The memory comprises a first storage region and a second storage region; the first storage region and the second storage region each comprise multiple banks, each bank comprises one or more arrays, and each array has the same number of input and output ports; the number of arrays of banks located in the first storage region is different from the number of arrays of banks located in the second storage region, for example, the number of arrays of banks in the first storage region is greater than the number of arrays of banks in the second storage region, and the number of banks in the first storage region is less than the number of banks in the second storage region. In this way, the first storage region can have a greater bandwidth; excessive power consumption can be avoided; and the capacity of the second storage region is larger, and therefore, the memory can simultaneously meet the bandwidth, power consumption and capacity requirements.

Description

存储器、芯片堆叠结构、芯片封装结构及电子设备Memory, chip stack structure, chip packaging structure and electronic equipment
本申请要求于2022年06月29日提交国家知识产权局、申请号为202210753276.1、申请名称为“存储器、芯片堆叠结构、芯片封装结构及电子设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application requires the priority of the Chinese patent application submitted to the State Intellectual Property Office on June 29, 2022, with the application number 202210753276.1 and the application name "Memory, chip stack structure, chip packaging structure and electronic equipment", and all its contents have been approved This reference is incorporated into this application.
技术领域Technical field
本申请涉及存储器技术领域,尤其涉及一种存储器、芯片堆叠结构、芯片封装结构及电子设备。The present application relates to the field of memory technology, and in particular to a memory, a chip stack structure, a chip packaging structure and electronic equipment.
背景技术Background technique
手机、平板电脑等电子设备对主存(main memory)有三方面的需求:大容量、低功耗以及高带宽,但现有的解决方案往往只能满足其中一个方面的需求,无法同时做到三者兼顾。提高存储密度,增大容量的方案经常导致读写带宽的降低和功耗的增加;而增加带宽的设计往往不利于存储容量的增加和功耗的降低。Electronic devices such as mobile phones and tablets have three requirements for main memory: large capacity, low power consumption and high bandwidth. However, existing solutions can often only meet the needs of one of them and cannot achieve all three at the same time. Both. Solutions to increase storage density and capacity often lead to a reduction in read and write bandwidth and an increase in power consumption; designs that increase bandwidth are often not conducive to an increase in storage capacity and a reduction in power consumption.
通过将多个存储芯片进行堆叠,通过“互联端子”和“互联网络”连接数据信号、控制信号和地址信号,可以将多个分离的、低容量的芯片实现为高容量的单芯片,通过在单个封装中堆叠多个相同的存储芯片实现更高存储密度的封装,可以增大存储容量,但这样仅仅能够解决存储容量不足的问题,无法提高存储芯片的带宽。By stacking multiple memory chips and connecting data signals, control signals and address signals through "interconnect terminals" and "internet networks", multiple separate, low-capacity chips can be implemented into a high-capacity single chip. Stacking multiple identical memory chips in a single package to achieve higher storage density packaging can increase storage capacity, but this can only solve the problem of insufficient storage capacity and cannot increase the bandwidth of the memory chip.
或者还可以通过将存储芯片的粒度划分为更细,这样来得到更多的数据传输通道,以此来增加带宽,但这样的方式无法解决容量不足的问题,还会增加功耗。Or you can divide the memory chip into finer granularities to get more data transmission channels to increase bandwidth. However, this method cannot solve the problem of insufficient capacity and will also increase power consumption.
发明内容Contents of the invention
本申请的实施例提供了一种存储器、芯片堆叠结构、芯片封装结构及电子设备,以改善现有的主存无法满足带宽、容量与功耗的需求的问题。Embodiments of the present application provide a memory, a chip stack structure, a chip packaging structure and an electronic device to improve the problem that existing main memories cannot meet the requirements for bandwidth, capacity and power consumption.
第一方面,提供一种存储器,包括:第一存储区域与第二存储区域,第一存储区域与第二存储区域包括多个存储库bank,每个存储库包括一个或多个阵列,每一个阵列具有相同数量的输入输出端口;其中,位于第一存储区域的bank的阵列数量与位于第二存储区域的bank的阵列数量不同,从而第一存储区域的bank与第二存储区域的bank具有不同的输入输出端口数量,在单位时间内能够读写的数据量不同,因此第一存储区域的带宽与第二存储区域的带宽不同,第一存储区域与第二存储区域可以适用于不同带宽需求的应用场景。传统的提升带宽的方案大多是提升存储器的整体带宽,这样会导致存储器的整体功耗增大,本申请的实施例提供的存储器,第一存储区域与第二存储区域的带宽大小不同,能够满足不同的带宽需求,同时仅提升一部分存储区域的带宽,可以避免因为带宽增加而导致存储器整体功耗增大;在扩展存储器容量时,也能按照带宽需求或者容量需求扩展存储器的容量,例如若存储器需要满足较小带宽、较大存储容量的需求,那么可以提高第一存储区域与第二存储区域中带宽较小的存储区域的面积,这样也能够避免存储器在扩展存储器容量因为整体带宽较大而导致存储器整体的功耗增加。In a first aspect, a memory is provided, including: a first storage area and a second storage area, the first storage area and the second storage area including a plurality of storage banks, each storage bank including one or more arrays, each The arrays have the same number of input and output ports; wherein the number of arrays of the bank located in the first storage area is different from the number of arrays of the bank located in the second storage area, so that the banks of the first storage area and the banks of the second storage area have different The number of input and output ports and the amount of data that can be read and written in unit time are different, so the bandwidth of the first storage area is different from the bandwidth of the second storage area. The first storage area and the second storage area can be suitable for different bandwidth requirements. Application scenarios. Most of the traditional solutions for increasing bandwidth are to increase the overall bandwidth of the memory, which will lead to an increase in the overall power consumption of the memory. The memory provided by the embodiments of the present application has different bandwidth sizes between the first storage area and the second storage area, which can meet the requirements. Different bandwidth requirements, while only increasing the bandwidth of a part of the storage area, can avoid the increase in overall power consumption of the memory due to the increase in bandwidth; when expanding the memory capacity, the memory capacity can also be expanded according to the bandwidth requirements or capacity requirements, for example, if the memory If you need to meet the requirements of smaller bandwidth and larger storage capacity, you can increase the area of the storage area with smaller bandwidth in the first storage area and the second storage area. This can also prevent the memory from expanding the memory capacity due to the larger overall bandwidth. This results in an increase in the overall power consumption of the memory.
在一种可能的实现方式中,位于第一存储区域的bank的阵列数量与大于第二存储区域的bank的阵列数量,而每一个阵列具有的输入输出端口的数量是相同的,第一存储区域的bank的阵列数量大于第二存储区域的bank的阵列数量,位于第一存储区域的bank相比位于第二存储区域的bank而言,具有更多的输入输出端口,单位时间内能够读写的数据更多,因此第一存储区域的带宽大于第二存储区域的带宽,第一存储区域可以适用于大带宽的存储需求,第二存储区域可以适用于低带宽的存储需求。In a possible implementation, the number of arrays of banks located in the first storage area is greater than the number of arrays of banks in the second storage area, and each array has the same number of input and output ports. The first storage area The number of arrays of the bank in the second storage area is greater than the number of arrays in the bank in the second storage area. Compared with the bank in the second storage area, the bank in the first storage area has more input and output ports and can be read and written per unit time. There is more data, so the bandwidth of the first storage area is greater than the bandwidth of the second storage area. The first storage area can be suitable for large-bandwidth storage requirements, and the second storage area can be suitable for low-bandwidth storage requirements.
在一种可能的实现方式中,位于第一存储区域的bank的阵列数量为位于第二存储区域的bank的阵列的数量2n倍,n为正整数,例如第一存储区域的bank的阵列数量为第二存储区域的bank的阵列数量的2倍,这样第一存储区域的带宽也可以达到第二存储区域的带宽的2倍。In a possible implementation, the number of arrays of banks located in the first storage area is 2 n times the number of arrays of banks located in the second storage area, and n is a positive integer, such as the number of arrays of banks in the first storage area. It is twice the number of bank arrays in the second storage area, so that the bandwidth of the first storage area can also be twice the bandwidth of the second storage area.
在一种可能的实现方式中,第一存储区域的bank数量小于第二存储区域的bank数量,而任意两个bank的容量大小相同,这样第一存储区域的存储容量小于第二存储区域的存储容量,由于带宽增加,会导致功耗增大,本申请实施例提供的存储器中第一存储区域的带宽大于第二存储区域,而第一存储区域的bank数量少于第二存储区域的bank数量,即第一存储区域的容量小于第二存储区域的容量,这样可以避免功耗过大,相比于第一存储区域与第二存储区域即存储器整体提高带宽的方式而言,本申请实施例提供的存储器功耗更低。 In a possible implementation, the number of banks in the first storage area is smaller than the number of banks in the second storage area, and the capacity of any two banks is the same, so that the storage capacity of the first storage area is smaller than the storage capacity of the second storage area. Capacity, due to the increase in bandwidth, will lead to an increase in power consumption. The bandwidth of the first storage area in the memory provided by the embodiment of the present application is greater than the second storage area, and the number of banks in the first storage area is less than the number of banks in the second storage area. , that is, the capacity of the first storage area is smaller than the capacity of the second storage area, which can avoid excessive power consumption. Compared with the first storage area and the second storage area, that is, the overall bandwidth of the memory, the embodiment of the present application Provides memory with lower power consumption.
在一种可能的实现方式中,存储器包括第一数据总线与第二数据总线;第一数据总线用于向第一存储区域读写数据,第二数据总线用于向第二存储区域读写数据,第一存储区域与第二存储区域可以独立运行,在第一存储区域工作时,第二存储区域可以不工作,在第二存储区域工作时,第一存储区域可以不工作,这样可以降低存储器的功耗。In a possible implementation, the memory includes a first data bus and a second data bus; the first data bus is used to read and write data to the first storage area, and the second data bus is used to read and write data to the second storage area. , the first storage area and the second storage area can operate independently. When the first storage area is working, the second storage area does not need to work. When the second storage area is working, the first storage area does not need to work. This can reduce the memory cost. of power consumption.
第二方面,本申请的实施例提供了一种芯片堆叠结构,芯片堆叠结构包括系统及芯片SoC与一个或多个存储器,SoC与一个或多个存储器依次堆叠设置;一个或多个存储器中的至少一个为第一方面任一种实现方式提供的存储器。In the second aspect, embodiments of the present application provide a chip stack structure. The chip stack structure includes a system and a chip SoC and one or more memories. The SoC and one or more memories are stacked in sequence; At least one memory provided for any implementation of the first aspect.
在一种可能的实现方式中,堆叠的方式包括以下各种方式中的一种或多种:硅通孔TSV连接,芯片上芯片Chip on Chip,晶元上芯片Chip on Wafer,晶元上晶元wafer on wafer。In a possible implementation, the stacking method includes one or more of the following methods: through silicon via TSV connection, chip on chip, chip on wafer, chip on wafer, wafer on wafer Yuan wafer on wafer.
第三方面,本申请的实施例提供了一种芯片封装结构,包括封装基板以及如第二方面任一种实现方式提供的芯片堆叠结构,芯片堆叠结构设置在封装基板上。In a third aspect, embodiments of the present application provide a chip packaging structure, including a packaging substrate and a chip stack structure provided in any implementation manner of the second aspect. The chip stack structure is disposed on the packaging substrate.
第四方面,本申请的实施例提供了一种芯片封装结构,包括封装基板、系统级芯片SoC以及如第一方面任一种实现方式提供的存储器;封装基板包括第一平面,第一平面包括第一区域与第二区域,SoC设置于第一区域,存储器设置于第二区域,存储器通过连接线与SoC电连接。In the fourth aspect, embodiments of the present application provide a chip packaging structure, including a packaging substrate, a system-on-chip SoC, and a memory provided in any implementation manner of the first aspect; the packaging substrate includes a first plane, and the first plane includes The first area and the second area, the SoC is arranged in the first area, the memory is arranged in the second area, and the memory is electrically connected to the SoC through connecting lines.
第五方面,本申请的实施例提供了一种芯片封装结构,包括封装基板、系统级芯片以及多个存储器;封装基板包括第一平面,第一平面包括不想交的第一区域与第二区域,SoC设置于第一区域;多个存储器堆叠设置于第二区域形成芯片堆叠结构,多个存储器中的至少一个为如第一方面任一种实现方式提供的存储器;芯片堆叠结构通过连接线与SoC电连接。In the fifth aspect, embodiments of the present application provide a chip packaging structure, including a packaging substrate, a system-level chip, and multiple memories; the packaging substrate includes a first plane, and the first plane includes a first region and a second region that do not want to intersect. , the SoC is disposed in the first area; a plurality of memory stacks are disposed in the second area to form a chip stack structure, at least one of the plurality of memories is a memory provided by any implementation method of the first aspect; the chip stack structure is connected to the chip stack structure through connecting lines. SoC electrical connections.
第六方面,本申请的实施例提供了一种电子设备,包括印制电路板以及如第二方面至第五方面任一种提供的芯片封装结构;芯片封装结构中的封装基板与印刷电路板电连接。In a sixth aspect, embodiments of the present application provide an electronic device, including a printed circuit board and a chip packaging structure as provided in any one of the second to fifth aspects; the packaging substrate and the printed circuit board in the chip packaging structure Electrical connection.
附图说明Description of drawings
图1为各类传感器与处理器的数据交换示意图;Figure 1 is a schematic diagram of data exchange between various sensors and processors;
图2为多级存储的金字塔结构示意图;Figure 2 is a schematic diagram of the pyramid structure of multi-level storage;
图3为本申请实施例提供的计算机系统的示意图;Figure 3 is a schematic diagram of a computer system provided by an embodiment of the present application;
图4为本申请实施例提供的存储单元的示意图;Figure 4 is a schematic diagram of a storage unit provided by an embodiment of the present application;
图5为本申请实施例提供的一种存储器的示意图;Figure 5 is a schematic diagram of a memory provided by an embodiment of the present application;
图6为本申请实施例提供的阵列的示意图;Figure 6 is a schematic diagram of an array provided by an embodiment of the present application;
图7为本申请实施例提供的存储器的结构示意图;Figure 7 is a schematic structural diagram of a memory provided by an embodiment of the present application;
图8为移动端设备中主存功耗占比示意图;Figure 8 is a schematic diagram of main memory power consumption in mobile devices;
图9、图10为移动端设备主存的主要参数的示意图;Figures 9 and 10 are schematic diagrams of the main parameters of the main memory of the mobile device;
图11为一种存储器的封装示意图;Figure 11 is a schematic diagram of a memory package;
图12为一种改善主存频率的方案示意图;Figure 12 is a schematic diagram of a solution for improving main memory frequency;
图13为本申请实施例提供的一种存储器的结构示意图;Figure 13 is a schematic structural diagram of a memory provided by an embodiment of the present application;
图14为存储库划分多个阵列的示意图;Figure 14 is a schematic diagram of the storage library divided into multiple arrays;
图15为第一存储区域与第二存储区域的分布示意图;Figure 15 is a schematic diagram of the distribution of the first storage area and the second storage area;
图16为本申请的实施例提供的一种芯片堆叠结构的示意图;Figure 16 is a schematic diagram of a chip stack structure provided by an embodiment of the present application;
图17为本申请的实施例提供的一种芯片封装结构的示意图;Figure 17 is a schematic diagram of a chip packaging structure provided by an embodiment of the present application;
图18为本申请的实施例提供的另一种芯片封装结构的示意图;Figure 18 is a schematic diagram of another chip packaging structure provided by an embodiment of the present application;
图19为本申请的实施例提供的另一种芯片封装结构的示意图;Figure 19 is a schematic diagram of another chip packaging structure provided by an embodiment of the present application;
图20为本申请实施例提供的另一种芯片堆叠结构的示意图;Figure 20 is a schematic diagram of another chip stack structure provided by an embodiment of the present application;
图21为本申请的实施例提供的另一种芯片封装结构的示意图;Figure 21 is a schematic diagram of another chip packaging structure provided by an embodiment of the present application;
图22为本申请的实施例提供的另一种芯片封装结构的示意图;Figure 22 is a schematic diagram of another chip packaging structure provided by an embodiment of the present application;
图23为本申请的实施例提供的另一种芯片封装结构的示意图;Figure 23 is a schematic diagram of another chip packaging structure provided by an embodiment of the present application;
图24为本申请的实施例提供的一种SoC与存储器的布局示意图。Figure 24 is a schematic layout diagram of an SoC and memory provided by an embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行描述,显然,所描述 的实施例仅仅是本申请一部分实施例,而不是全部的实施例。The technical solutions in the embodiments of the present application will be described below with reference to the accompanying drawings in the embodiments of the present application. Obviously, the described The embodiments are only part of the embodiments of this application, rather than all the embodiments.
以下,术语“第一”、“第二”等仅用于描述方便,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”等的特征可以明示或者隐含地包括一个或者更多个该特征。在本申请的描述中,除非另有说明,“多个”的含义是两个或两个以上。例如,多个处理单元是指两个或两个以上的处理单元。Hereinafter, the terms "first", "second", etc. are only used for convenience of description and cannot be understood as indicating or implying relative importance or implicitly indicating the number of indicated technical features. Thus, features defined by "first," "second," etc. may explicitly or implicitly include one or more of such features. In the description of this application, unless otherwise stated, "plurality" means two or more. For example, multiple processing units refers to two or more processing units.
此外,本申请实施例中,“上”、“下”、“左”以及“右”不限于相对附图中的部件示意置放的方位来定义的,应当理解到,这些方向性术语可以是相对的概念,它们用于相对于的描述和澄清,其可以根据附图中部件附图所放置的方位的变化而相应地发生变化。在附图中,为了清楚起见,夸大了层和区域的厚度,图示中的各部分之间的尺寸比例关系并不反映实际的尺寸比例关系。In addition, in the embodiments of the present application, "upper", "lower", "left" and "right" are not limited to being defined relative to the schematically placed orientations of the components in the drawings. It should be understood that these directional terms may be Relative concepts, which are used for relative description and clarification, may change accordingly according to changes in the orientation of the components in the drawings. In the drawings, the thickness of layers and regions are exaggerated for clarity, and the dimensional proportions between the various parts in the illustrations do not reflect actual dimensional proportions.
本申请实施例中,除非另有明确的规定和限定,术语“连接”应做广义理解,例如,“连接”可以是固定连接,也可以是可拆卸连接,或成一体;可以是直接相连,也可以通过中间媒介间接相连。此外,术语“电连接”可以是直接的电性连接,也可以通过中间媒介间接的电性连接。In the embodiments of this application, unless otherwise clearly stated and limited, the term "connection" should be understood in a broad sense. For example, "connection" can be a fixed connection, a detachable connection, or an integral body; it can be a direct connection, They can also be connected indirectly through intermediaries. In addition, the term "electrical connection" may be a direct electrical connection or an indirect electrical connection through an intermediary.
本申请实施例中,术语“模块”通常是按照逻辑划分的功能性结构,该“模块”可以由纯硬件实现,或者,软硬件结合实现。本申请实施例中,“和/或”描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B可以表示:单独存在A,单独存在B,同时存在A和B这三种情况。In the embodiments of this application, the term "module" usually refers to a functional structure divided according to logic. The "module" can be implemented by pure hardware, or a combination of software and hardware. In the embodiment of this application, "and/or" describes the association of associated objects, indicating that there can be three relationships. For example, A and/or B can represent: A exists alone, B exists alone, and A and B exist simultaneously. situation.
本申请实施例中,“示例性的”或者“例如”等词用于表示作例子、例证或说明。本申请实施例中被描述为“示例性的”或者“例如”的任何实施例或设计方案不应被解释为比其它实施例或设计方案更优选或更具优势。确切而言,使用“示例性的”或“例如”等词旨在以具体方式呈现相关概念。In the embodiments of this application, words such as "exemplary" or "for example" are used to represent examples, illustrations or explanations. Any embodiment or design described as "exemplary" or "such as" in the embodiments of the present application is not to be construed as preferred or advantageous over other embodiments or designs. Rather, use of the words "exemplary" or "such as" is intended to present the concept in a concrete manner.
1945年冯·诺依曼提出的“存储程序”的概念,以此概念为基础的各类计算机统称为冯·诺依曼计算机。冯·诺依曼计算机由输入设备、输出设备、存储器、运算器和控制器五大部件组成。运算器用来完成算数运算和逻辑运算,并将运算的中间结果暂时存储在运算器内;控制器用来控制、指挥程序和数据的输入、运行以及处理运算结果;存储器用来存放数据和程序;输入设备用来将人们熟悉的信息形式转换为计算机能识别的信息形式,常见的有键盘、鼠标、麦克风、扫描仪等;输出设备可将计算机的运算结果转换为人们熟悉的信息形式,如显示器、打印机、音响等。现阶段计算机的控制器和运算器合二为一,统称为中央处理器(central processing unit,CPU),而输入设备与输出设备简称为I/O设备(input/output equipment)。In 1945, von Neumann proposed the concept of "stored program". Various computers based on this concept are collectively called von Neumann computers. The Von Neumann computer consists of five major components: input device, output device, memory, arithmetic unit and controller. The arithmetic unit is used to complete arithmetic operations and logical operations, and the intermediate results of the operations are temporarily stored in the arithmetic unit; the controller is used to control and direct the input, operation and processing of operation results of programs and data; the memory is used to store data and programs; input Devices are used to convert information forms that people are familiar with into information forms that computers can recognize. Common examples include keyboards, mice, microphones, scanners, etc.; output devices can convert computer calculation results into information forms that people are familiar with, such as monitors, Printers, speakers, etc. At this stage, the controller and arithmetic unit of the computer are combined into one, collectively called the central processing unit (CPU), while the input devices and output devices are referred to as I/O devices (input/output equipment).
而随着计算机的发展,CPU的速度变得很高,而存储器的存取速度很难和它匹配,这就使得计算机的运行速度很大程度上受存储器的制约,即所谓的“内存墙问题”,这也使得存储器的地位更为突出。With the development of computers, the speed of the CPU has become very high, and the access speed of the memory is difficult to match. This makes the running speed of the computer largely restricted by the memory, which is the so-called "memory wall problem". ”, which also makes the status of memory more prominent.
存储器是用来存储程序和各种数据信息的记忆部件,从不同的维度,计算机系统中的存储器可以分为不同的种类。Memory is a memory component used to store programs and various data information. From different dimensions, memories in computer systems can be divided into different types.
例如,根据工作原理的不同可以将存储器分为只读存储器(read-only-memory)和随机存取存储器(random-access memory,RAM)。ROM以非破坏性读出方式工作,只能读出无法写入信息,信息一旦写入后就固定下来,即使切断电源,信息也不会丢失,所以又称固定存储器。RAM是一种可读/可写的存储器,读写速度快,通常作为操作系统或其他正在运行中的程序的临时数据存储介质,RAM工作时可以随时从任何一个指定的地址写入或读出数据,它与ROM最大的区别是数据的易失性,即一旦断电所存储的数据将随之丢失。RAM在计算机或数字系统中用来暂时存储程序、数据和中间结果。RAM又可以分为动态随机存取存储器(dynamic random access memory,DRAM)和静态随机存取存储器(static random access memory,SRAM)等。For example, memory can be divided into read-only memory (read-only-memory) and random-access memory (random-access memory, RAM) according to different working principles. ROM works in a non-destructive reading mode. It can only read information but cannot write it. Once the information is written, it is fixed. Even if the power is cut off, the information will not be lost, so it is also called fixed memory. RAM is a readable/writable memory with fast reading and writing speeds. It is usually used as a temporary data storage medium for the operating system or other running programs. RAM can be written or read from any specified address at any time when working. The biggest difference between data and ROM is the volatility of data, that is, the stored data will be lost once the power is turned off. RAM is used in computers or digital systems to temporarily store programs, data and intermediate results. RAM can be divided into dynamic random access memory (dynamic random access memory, DRAM) and static random access memory (static random access memory, SRAM).
根据用途的不同,存储器可以分为主存、辅存和缓存。图1示出了主存、辅存和缓存与CPU的数据交换示意图。According to different uses, memory can be divided into main memory, auxiliary memory and cache. Figure 1 shows a schematic diagram of data exchange between main memory, auxiliary memory and cache and CPU.
主存,即主存储器(main memory),也被称为内存,主存可以和CPU直接交换数据或信息,用于暂时存放CPU的运算数据、与辅存交换的数据等。主存是CPU与辅存沟通的桥梁,计算机所有的程序都在主存中运行,主存性能的强弱会影响计算机整体性能的水平。计算机运行过程中, 操作系统会将需要处理的数据由主存调到CPU进行计算,计算机无法在失去主存的情况下正常运行。Main memory, also known as main memory, is also called memory. Main memory can directly exchange data or information with the CPU. It is used to temporarily store the CPU's operation data, data exchanged with auxiliary storage, etc. Main memory is the bridge between the CPU and auxiliary memory. All computer programs run in the main memory. The performance of the main memory will affect the overall performance of the computer. While the computer is running, The operating system will transfer the data that needs to be processed from the main memory to the CPU for calculation, and the computer cannot operate normally without the main memory.
辅存,即辅助存储器,也被称为外存,指计算机内存与CPU缓存以外的存储器,相对内存而言读写速度较慢,这一类存储器在断电后仍然能够保存数据,一般用来存放使用频率较低的程序和数据,辅存不能直接与CPU交换数据,需要通过主存与CPU交换数据,常见的辅存有硬盘、软盘、光盘、U盘、闪存等。Auxiliary memory, also known as auxiliary memory, is also called external memory. It refers to memory other than computer memory and CPU cache. Compared with memory, the read and write speed is slower. This type of memory can still save data after a power outage. It is generally used Stores programs and data that are used less frequently. Auxiliary storage cannot directly exchange data with the CPU. Data needs to be exchanged with the CPU through main memory. Common auxiliary storage includes hard disks, floppy disks, optical disks, U disks, flash memory, etc.
缓存,即缓冲存储器(cache),是为了解决CPU与主存之间的速度不匹配而采用的一项技术,缓存是介于CPU与主存之间的小容量存储器,读取速度较快。Cache, or buffer memory (cache), is a technology used to solve the speed mismatch between the CPU and the main memory. The cache is a small-capacity memory between the CPU and the main memory and has a faster reading speed.
存储器有三个主要的指标:速度、容量与成本。一般而言,速度越高,成本越大;容量越大,速度便会降低。对于计算机系统而言,一般采用多级存储系统,将各种不同存储容量、读写速度和成本的存储器按照层次结构组成多级存储器,并通过管理软件和辅助硬件有机的组合成为一个整体,使所存放的程序和数据按照层次分布在各种存储器中。There are three main indicators of memory: speed, capacity and cost. Generally speaking, the higher the speed, the greater the cost; the higher the capacity, the slower the speed. For computer systems, multi-level storage systems are generally used. Memories with different storage capacities, reading and writing speeds, and costs are composed into multi-level memories according to a hierarchical structure. They are organically combined into a whole through management software and auxiliary hardware, so that The stored programs and data are distributed hierarchically in various memories.
图2为计算机系统的存储层次金字塔结构,从上到下依次为:寄存器堆(register file,RF)、缓存(cache)、主存(main memory)和辅存(storage),从上到下存储容量越来越大,但是访问速度越来越慢。Figure 2 shows the storage hierarchy pyramid structure of a computer system. From top to bottom, it is: register file (RF), cache (cache), main memory (main memory) and auxiliary storage (storage). Storage from top to bottom. The capacity is getting bigger, but the access speed is getting slower and slower.
寄存器堆(register file)是处理器中多个寄存器组成的阵列,通常由几十个32/64bits的寄存器组成,可以用来暂存指令、数据和地址等,寄存器一般集成在CPU中,对于移动端设备而言,通常集成在系统级芯片(system on chip,SoC)上,寄存器具有与处理器接近的读写速度,但成本较高,因此一般容量比较小。The register file is an array of multiple registers in the processor. It usually consists of dozens of 32/64bits registers. It can be used to temporarily store instructions, data, addresses, etc. The registers are generally integrated in the CPU. For mobile For end devices, they are usually integrated on a system on chip (SoC). The register has a read and write speed close to that of the processor, but the cost is higher, so the capacity is generally smaller.
缓存(cache),也被称为高速缓冲存储器,是位于CPU与主存间的一种容量较小但速度很高的存储器,容量通常为MB量级,缓存一般不使用DRAM技术,而使用昂贵但较快速的SRAM技术。由于CPU的速度远高于主存,CPU直接从主存中存取数据要等待一定时间周期,因此设置缓存来解决CPU与主存的速度不匹配的问题,缓存中保存着CPU刚用过或循环使用的一部分数据,当CPU再次使用该部分数据时可从缓存中直接调用,这样就减少了CPU的等待时间,提高了系统的效率,缓存的设置是所有现代计算机系统发挥高性能的重要因素之一。Cache, also known as cache memory, is a small-capacity but high-speed memory located between the CPU and the main memory. The capacity is usually in the order of MB. The cache generally does not use DRAM technology, but it is expensive to use. But faster SRAM technology. Since the speed of the CPU is much higher than that of the main memory, the CPU has to wait for a certain period of time to directly access data from the main memory. Therefore, a cache is set up to solve the problem of speed mismatch between the CPU and the main memory. The cache stores the data that the CPU has just used or A part of the data used in the cycle can be directly called from the cache when the CPU uses this part of the data again. This reduces the waiting time of the CPU and improves the efficiency of the system. The setting of the cache is an important factor in the high performance of all modern computer systems. one.
主存主要用于存放要参与运行的程序和数据,主存的速度与CPU的速度差距较大,为了使主存与CPU的速度匹配,因此在主存与CPU之间插入了比主存速度更快容量更小的高速缓冲存储器。主存一般采用DRAM技术,其容量的大小制约了设备可以同时运行的程序的数量,直接影响设备的性能,是计算机中较为重要的存储器件,主存的存储容量可以达到GB量级,对于移动端设备而言,主存通常和SoC不属于同一个芯片(die)。Main memory is mainly used to store programs and data that need to be run. There is a big gap between the speed of main memory and the speed of the CPU. In order to match the speed of the main memory and the CPU, a speed ratio of the main memory is inserted between the main memory and the CPU. Faster and smaller cache memory. Main memory generally uses DRAM technology. Its capacity restricts the number of programs that the device can run at the same time, which directly affects the performance of the device. It is an important storage device in computers. The storage capacity of main memory can reach the GB level. For mobile devices For end devices, the main memory usually does not belong to the same chip (die) as the SoC.
最后一级的大容量的辅存用来储存资料,例如图像、视频等。这一级存储器的读写速度较慢,但容量可以做到很大,例如容量可以达到GB甚至TB量级,设备运行时由将储存器中存储的数据加载至主存进行处理。The last level of large-capacity auxiliary storage is used to store data, such as images, videos, etc. The read and write speed of this level of memory is slow, but the capacity can be very large. For example, the capacity can reach the GB or even TB level. When the device is running, the data stored in the memory is loaded into the main memory for processing.
存储系统的多级结构主要体现在主存-缓存与主存-辅存这两个存储层次上,主存-缓存层次主要用于解决存储系统的速度问题,主存与CPU的速度不匹配,由于主存的速度较低,而缓存的速度比主存的速度高,因此只需要将CPU要使用的数据调入缓存,CPU即可直接从缓存中获取数据,从而提高访问速度;主存-辅存层次主要解决存储系统的容量问题,辅存的速度低于主存,并且不能直接与CPU交换数据,但容量远大与主存的容量,当CPU需要辅存中的数据时,将这些数据调入主存后供CPU使用。The multi-level structure of the storage system is mainly reflected in the two storage levels of main memory-cache and main memory-auxiliary memory. The main memory-cache level is mainly used to solve the speed problem of the storage system. The speed of main memory does not match the speed of the CPU. Since the speed of main memory is lower and the speed of cache is higher than that of main memory, you only need to transfer the data to be used by the CPU into the cache, and the CPU can directly obtain the data from the cache, thereby increasing the access speed; main memory - The auxiliary storage level mainly solves the capacity problem of the storage system. The speed of auxiliary storage is lower than that of main memory, and it cannot directly exchange data with the CPU. However, its capacity is much larger than that of main memory. When the CPU needs data in auxiliary storage, it will After being loaded into the main memory, it is used by the CPU.
主存能够与CPU、缓存与辅存之间交换数据,是存储系统中较为重要的一环,主存的各项参数:例如容量、带宽、成本等制约这计算机系统的发展。本申请实施例所指的计算机系统,可以是电脑,当然也可以指移动端设备,例如手机,平板等设备。图3示出了本申请的实施例提供的计算机系统的架构示意图,如图3所示,计算机系统100至少可以包括处理器(processor)101、存储器控制器(memory controller)102以及存储器103。通常,存储器控制器102可以集成在处理器101中,存储器103可以为主存。需要说明的是,本申请的实施例提供的计算机系统中,除了图3所示的器件外,计算机系统100还可以包括通信接口以及作为辅存的磁盘等其他器件,在此不做限制。 The main memory can exchange data with the CPU, cache and auxiliary memory, and is an important part of the storage system. Various parameters of the main memory: such as capacity, bandwidth, cost, etc., restrict the development of this computer system. The computer system referred to in the embodiments of this application can be a computer, and of course it can also refer to a mobile device, such as a mobile phone, a tablet, and other devices. Figure 3 shows a schematic architectural diagram of a computer system provided by an embodiment of the present application. As shown in Figure 3, the computer system 100 may at least include a processor 101, a memory controller 102 and a memory 103. Typically, the memory controller 102 may be integrated into the processor 101 and the memory 103 may be main memory. It should be noted that in the computer system provided by the embodiment of the present application, in addition to the devices shown in FIG. 3 , the computer system 100 may also include other devices such as communication interfaces and disks as auxiliary storage, which are not limited here.
处理器101是计算机系统100的运算核心和控制单元(control unit)。处理器101可以包括多个核(core)104。在处理器101中安装有操作系统和其他软件程序,从而处理器101能够实现对存储器103、缓存及磁盘的访问。在本申请的实施例中,处理器101中的core 104可以是中央处理器(central processing unit,CPU)、人工智能(artificial intelligence,AI)处理器、数字信号处理器(digital signal processor)和神经网络处理器,还可以是其他特定集成电路(application specific integrated circuit,ASIC)等。存储器控制器102是计算机系统100内部控制存储器103并用于管理与规划从存储器103到core 104间的数据传输的总线电路控制器。通过存储器控制器102,存储器103与core 104之间可以进行数据交换。存储器控制器102可以是一个单独的芯片,并通过系统总线与core 104连接。存储器控制器102也可以被集成到处理器101中,或者被内置于北桥中。本申请的实施例不对存储器控制器102的具体位置进行限定。实际应用中,存储器控制器102可以控制必要的逻辑以将数据写入存储器103或从存储器103中读取数据。The processor 101 is the computing core and control unit of the computer system 100. Processor 101 may include multiple cores 104 . An operating system and other software programs are installed in the processor 101, so that the processor 101 can access the memory 103, cache and disk. In the embodiment of the present application, the core 104 in the processor 101 may be a central processing unit (CPU), an artificial intelligence (artificial intelligence, AI) processor, a digital signal processor (digital signal processor), or a neural network processor. A network processor can also be other application specific integrated circuit (ASIC), etc. The memory controller 102 is a bus circuit controller that controls the memory 103 internally in the computer system 100 and is used to manage and plan data transmission from the memory 103 to the core 104. Through the memory controller 102, data can be exchanged between the memory 103 and the core 104. Memory controller 102 may be a separate chip and connected to core 104 via a system bus. The memory controller 102 may also be integrated into the processor 101 or built into the northbridge. The embodiments of the present application do not limit the specific location of the memory controller 102. In practical applications, the memory controller 102 can control necessary logic to write data to the memory 103 or read data from the memory 103 .
存储器103是计算机系统100的主存,通常用来存放操作系统中各种正在运行的软件、输入和输出数据以及与外存交换的信息等。通常采用动态随机存取存储器(dynamic random access memory,DRAM)作为存储器103。处理器101能够通过存储器控制器102高速访问存储器103,对存储器103中的任意一个存储单元进行读操作和写操作。在本申请的实施例中,以存储器103为DRAM为例进行描述,除特殊说明外,本申请实施例所称的主存,也被称为DRAM。The memory 103 is the main memory of the computer system 100 and is usually used to store various running software in the operating system, input and output data, and information exchanged with external memory. Dynamic random access memory (DRAM) is usually used as the memory 103. The processor 101 can access the memory 103 at high speed through the memory controller 102, and perform read operations and write operations on any storage unit in the memory 103. In the embodiment of the present application, the memory 103 is a DRAM as an example for description. Unless otherwise specified, the main memory referred to in the embodiment of the present application is also called DRAM.
DRAM中用于存储数据的最小单元称为存储单元(memory cell,MC),通常,一个存储单元可以存储1位(bit)数据。DRAM的存储单元通常由晶体管与电容构成,如果含有两个晶体管(transistor)与两个电容(capacitor)则称为2T2C;如果含有两个晶体管与一个电容则称为2T1C;如果含有一个晶体管与一个电容则称为1T1C。晶体管可以采用金属-氧化物半导体场效应晶体管(metal-oxide-semiconductor field effect transistor,MOSFET),晶体管分为N(negative,负)型晶体管和P(positive,正)型晶体管两种类型。晶体管包括源极(source)、漏极(drain)以及栅极(gate),通过控制输入晶体管栅极的电平可以控制晶体管的导通或关断。晶体管在导通时,源极和漏极导通,产生导通电流,并且,在晶体管的栅极电平不同时,源极与漏极之间产生的导通电流的大小也不同;晶体管在关断时,源极和漏极不会导通,不会产生电流。在本申请的实施例中,晶体管的栅极也被称为控制端,源极被称为第一端,漏极被称为第二端;或者,栅极被称为控制端,漏极被称为第一端,源极被称为第二端。The smallest unit used to store data in DRAM is called a memory cell (MC). Usually, a memory cell can store 1 bit of data. The memory unit of DRAM is usually composed of a transistor and a capacitor. If it contains two transistors and two capacitors, it is called 2T2C; if it contains two transistors and one capacitor, it is called 2T1C; if it contains one transistor and one capacitor, it is called 2T1C. The capacitor is called 1T1C. The transistor can be a metal-oxide-semiconductor field effect transistor (MOSFET). The transistor is divided into two types: N (negative, negative) type transistor and P (positive, positive) type transistor. A transistor includes a source, a drain, and a gate. By controlling the level of the input transistor gate, the transistor can be turned on or off. When the transistor is turned on, the source and drain are turned on, generating a conduction current. Moreover, when the gate levels of the transistors are different, the magnitude of the conduction current generated between the source and the drain is also different; the transistor is When turned off, the source and drain will not conduct, and no current will flow. In the embodiment of the present application, the gate of the transistor is also called the control terminal, the source is called the first terminal, and the drain is called the second terminal; or, the gate is called the control terminal, and the drain is called the second terminal. It is called the first terminal and the source is called the second terminal.
以1T1C的结构为例,图4示出了DRAM的一个存储单元的结构示意图,包括晶体管T与电容C,晶体管T的第一端连接位线(bit line,BL),晶体管T的第二端连接至电容C的第一端,晶体管T的控制端连接字线(word line,WL),电容C的第二端连接到源极线(source line,SL),可以通过源极线SL连接到特定的电压(例如地电压)。电容C用于存储数据,例如电容C的第一端的电压为高电平、第二端的电压为低电平时储存1,第一端的电压为低电平、第二端的电压为高点平时储存0;或者,电容C的第一端的电压为高电平、第二端的电压为低电平时储存0,第一端的电压为低电平、第二端的电压为高点平时储存1。Taking the structure of 1T1C as an example, Figure 4 shows a schematic structural diagram of a memory unit of DRAM, including a transistor T and a capacitor C. The first end of the transistor T is connected to the bit line (BL), and the second end of the transistor T is connected to the bit line (BL). Connected to the first end of the capacitor C, the control end of the transistor T is connected to the word line (WL), and the second end of the capacitor C is connected to the source line (SL), which can be connected to A specific voltage (such as ground). Capacitor C is used to store data. For example, when the voltage at the first terminal of capacitor C is high level and the voltage at the second terminal is low, it stores 1. When the voltage at the first terminal is low level and the voltage at the second terminal is high, it stores 1. Store 0; or, store 0 when the voltage at the first end of the capacitor C is high level and the voltage at the second end is low level, store 1 when the voltage at the first end is low level and the voltage at the second end is high.
存储器103中的存储单元被排列分布成一个矩阵,这个矩阵我们称之为子阵列(sub-array),参照图5所示,存储器103可以包括命令解码器110、控制逻辑电路120,子阵列130、输入/输出电路140,具体的实施例中不限于此并且可以包含更少或更大数量的构成组件。命令解码器110可以从存储器控制器102接收命令CMD(command,CMD)并且可以解码接收的命令CMD。例如写入命令(write,WR)、读取命令(read,RD)等,存储器控制器102通过相应的行列译码器可以定位到bank中的任意一个存储单元,即定位任意一个bit的数据。The memory cells in the memory 103 are arranged and distributed into a matrix. This matrix is called a sub-array. Referring to Figure 5, the memory 103 may include a command decoder 110, a control logic circuit 120, and a sub-array 130. , input/output circuit 140, specific embodiments are not limited thereto and may include a smaller or larger number of constituent components. The command decoder 110 may receive a command CMD (command, CMD) from the memory controller 102 and may decode the received command CMD. For example, write commands (write, WR), read commands (read, RD), etc., the memory controller 102 can locate any storage unit in the bank through the corresponding row-column decoder, that is, locate any bit of data.
一个或多个子阵列形成一个阵列核块(array core tile),简称为阵列(array)。如图6所示,array可以包括多个sub-array。在一些实现方式中,同一个array内的存储单元共用行译码器、列译码器以及接口(例如输入输出(input output,IO)端口),例如,图6所示的array具有64个并行的IO端口。IO端口的数量关系着整个存储器的带宽,因此生产厂商一般会在工艺允许范围内最大限度地增加array的IO端口数量,可以理解的是,不同的生产厂商可能具有不同的工艺水平,因此不同的生产厂商生产的array的IO端口数量可能会不同;但对于同一个生产厂商而言,工艺水平是稳定的,也就是说,生产的array的IO端口数量基本是相同的。 One or more subarrays form an array core tile, referred to as an array. As shown in Figure 6, an array can include multiple sub-arrays. In some implementations, storage units in the same array share row decoders, column decoders, and interfaces (such as input output (IO) ports). For example, the array shown in Figure 6 has 64 parallel IO port. The number of IO ports is related to the bandwidth of the entire memory, so manufacturers generally increase the number of IO ports in the array to the maximum extent within the allowable range of the process. It is understandable that different manufacturers may have different process levels, so different The number of IO ports in the arrays produced by the manufacturer may be different; but for the same manufacturer, the process level is stable, that is to say, the number of IO ports in the arrays produced is basically the same.
DRAM利用电容存储电量的多少来代表数据0和1。由于电容存在漏电现象,如果电容中的电荷不足,会导致存储的数据出错。因此,每隔一段时间,存储器控制器102会刷新DRAM103中的数据,以防止DRAM丢失数据。并且,DRAM是易失性的,当计算机系统100关闭电源后,DRAM中的信息将不再保存。DRAM uses the amount of electricity stored in the capacitor to represent data 0 and 1. Due to leakage in capacitors, if there is insufficient charge in the capacitor, errors will occur in the stored data. Therefore, every once in a while, the memory controller 102 refreshes the data in the DRAM 103 to prevent the DRAM from losing data. Moreover, DRAM is volatile. When the computer system 100 is powered off, the information in the DRAM will no longer be saved.
由于DRAM与处理器通常处于不同的芯片(die),存储单元需要进行封装和组合之后才可以与处理器连结,从处理器到DRAM的存储单元之间依据层级从大到小为:通道(channel)、存储列(rank)、存储库(bank)、存储阵列(array)、行或者列(row/column)。Since DRAM and the processor are usually on different chips (die), the memory unit needs to be packaged and combined before it can be connected to the processor. The memory unit from the processor to the DRAM is divided into channels from large to small according to the hierarchy. ), storage column (rank), storage bank (bank), storage array (array), row or column (row/column).
如图7所示,示出了一种DRAM的示意图,通道(channel)是一个独立的可访问的存储空间,该存储空间中可以包括一个或者存储列(rank)。通常,一个通道包括一定容量的存储空间、以及用于访问该存储空间的硬件电路,该硬件电路可以包括控制逻辑和接口等相关的电路。As shown in Figure 7, a schematic diagram of a DRAM is shown. A channel is an independent accessible storage space, and the storage space may include one or a storage rank. Usually, a channel includes a certain capacity of storage space and a hardware circuit for accessing the storage space. The hardware circuit may include control logic, interfaces and other related circuits.
通道可以由存储列(rank)组成。存储列(rank)指的是连接到同一个片选(chip select)信号的内存颗粒(chip),内存颗粒也被称为芯片;由于这些芯片连接同一个片选信号,因此存储器控制器102能够对同一个存储列中的芯片进行写操作,而在同一个存储列的芯片也共享同样的控制信号。Channels can be composed of storage columns (ranks). The memory rank refers to the memory particles (chips) connected to the same chip select signal. The memory particles are also called chips; because these chips are connected to the same chip select signal, the memory controller 102 can Write operations are performed on chips in the same storage column, and chips in the same storage column also share the same control signal.
每个存储列可以包括一个或多个存储库(bank),例如一个存储列可以包括4个bank,或者还可以包括8个bank。DRAM的bank包括一个或多个存储阵列(array),每一个存储阵列包括呈行(row)列(column)分布的多个存储单元(memory cell),或者每一个存储阵列还可以包括多个子阵列(sub-array),子阵列包括呈行(row)列(column)分布的多个存储单元。Each storage column may include one or more storage banks (banks). For example, a storage column may include 4 banks, or may also include 8 banks. A DRAM bank includes one or more storage arrays. Each storage array includes multiple memory cells (memory cells) distributed in rows and columns, or each storage array can also include multiple sub-arrays. (sub-array), the sub-array includes multiple storage cells distributed in rows and columns.
随着手机、平板电脑等移动端的SoC的发展,其对主存有三方面的需求:大容量、低功耗以及高带宽。首先随着移动端的软件规模越来越大、用户数据越来越多,移动端设备需要更大容量的主存来保存程序和数据,如图8所示,主存在移动端设备中的功耗占比较大;此外移动端设备对续航、发热等性能有较高的要求,因此要求主存具有低的读写功耗;最后对于视频图像处理、人工智能(artificial intelligence,AI)等场景,主存需要能提供足够高的带宽来满足其巨大的算力需求。因此需要找到一种既能满足移动端SoC对大容量主存的需求,又可以实现较低的读写功耗,还能提供足够带宽来支撑算力需求的解决方案。With the development of mobile SoCs such as mobile phones and tablets, they have three requirements for main memory: large capacity, low power consumption and high bandwidth. First of all, as the scale of mobile software becomes larger and larger and user data increases, mobile devices require larger-capacity main memory to save programs and data. As shown in Figure 8, the power consumption of main memory in mobile devices It accounts for a large proportion; in addition, mobile devices have high requirements for battery life, heat generation and other performance, so the main memory is required to have low read and write power consumption; finally, for scenarios such as video image processing and artificial intelligence (AI), the main The storage needs to provide high enough bandwidth to meet its huge computing power needs. Therefore, it is necessary to find a solution that can not only meet the mobile SoC's demand for large-capacity main memory, but also achieve lower read and write power consumption and provide sufficient bandwidth to support the computing power needs.
然而,现有的解决方案往往只能满足其中一个方面的需求,无法同时做到大容量、低功耗和高带宽。如图9所示,提高存储密度,增大存储容量的方案经常导致读写带宽的降低和功耗的增加;如图10所示,增加带宽的设计往往不利于存储容量的增加和功耗的降低,例如通过增大读写频率来增加带宽,这样会导致功耗的增加。However, existing solutions often only meet the needs of one aspect and cannot achieve large capacity, low power consumption and high bandwidth at the same time. As shown in Figure 9, solutions to increase storage density and storage capacity often lead to reductions in read and write bandwidth and increases in power consumption; as shown in Figure 10, designs that increase bandwidth are often detrimental to increases in storage capacity and power consumption. Reducing, for example, increasing the bandwidth by increasing the read and write frequency will lead to an increase in power consumption.
图11示出了一种存储器,在一个封装中,通过将多个主存芯片(例如,存储器1与存储器2)堆叠在封装基板上,通过互联端子和互联网络连接数据信号、控制信号与地址信号,将多个分离的、低容量的主存芯片实现为高容量的单芯片,通过在单个封装中堆叠多个相同的存储芯片实现更高存储密度的封装。Figure 11 shows a memory in which data signals, control signals and addresses are connected through interconnection terminals and interconnection networks by stacking multiple main memory chips (for example, memory 1 and memory 2) on a packaging substrate in one package. Signal, implements multiple separate, low-capacity main memory chips into a high-capacity single chip, and achieves higher storage density packaging by stacking multiple identical memory chips in a single package.
但通过堆叠多个相同的主存芯片实现更高存储密度仅仅能够解决存储容量不足的问题,对于提高传输带宽并无帮助,再考虑到堆叠多个主存芯片,部分主存芯片的读写线路长度会增加,还会导致读写功耗增大。However, achieving higher storage density by stacking multiple identical main memory chips can only solve the problem of insufficient storage capacity and does not help improve transmission bandwidth. Considering stacking multiple main memory chips, the read and write lines of some main memory chips The length will increase, which will also lead to increased read and write power consumption.
读取数据的速度是影响处理器计算能力原因之一,处理器从主存读取数据进行处理,提高主存的传输带宽就变得尤为重要。带宽是指单位时间内能够读取或写入的数据的多少,为了提高主存的带宽,一种可能的实现方式是通过扩充主存通道数或者扩充各个通道的数据位宽来提高主存的带宽。The speed of reading data is one of the reasons that affects the computing power of the processor. The processor reads data from the main memory for processing, so it is particularly important to increase the transmission bandwidth of the main memory. Bandwidth refers to the amount of data that can be read or written per unit time. In order to improve the bandwidth of main memory, one possible implementation method is to increase the number of main memory channels or expand the data bit width of each channel. bandwidth.
然而,扩充整体主存带宽能满足视频图像处理、AI等场景大带宽需求,但移动终端90%的场景带宽需求较小,该方案存在带宽浪费;并且提高整个主存带宽会导致更高功耗,由于移动应用的散热设计功耗(thermal design power,TDP)约束,会使得在高带宽下的运行时长受限。However, expanding the overall main memory bandwidth can meet the large bandwidth requirements of scenarios such as video image processing and AI. However, 90% of mobile terminal scenarios have small bandwidth requirements. This solution wastes bandwidth; and increasing the entire main memory bandwidth will lead to higher power consumption. , due to the thermal design power (TDP) constraints of mobile applications, the running time under high bandwidth will be limited.
另一种可能的方式是通过调节主存的读写频率来调节主存的读写带宽,例如,参阅图12,提供了一种利用软件和硬件结合的方式来解决对主存的带宽、功耗等的需求。为了降低功耗,提高传输带宽,软件根据数据流量实时调节读写频率,存储控制器包括流量统计单元与时钟控制器,而在软件层面,存储控制模块内设置有频率函数,根据流量统计单元统计到的流量需求,利用频率函数根据流量需求确定目标频率,通过存储驱动对时钟控制器输出的频率进行调节。在数据流 量要求小,带宽需求低时,控制时钟控制器输出低频率的时钟信号,以此来降低主存芯片的读写频率,降低读写带宽,同时也能够降低功耗;而在高带宽需求时,控制时钟控制器输出高频率的时钟信号,以此来提高主存芯片的读写频率,提高读写带宽。然而,主存颗粒的高能效区位于高频处,运行在低频会造成主存能效差,并且增加SoC的访问延迟和功耗。Another possible way is to adjust the read and write bandwidth of the main memory by adjusting the read and write frequency of the main memory. For example, see Figure 12, which provides a way to use a combination of software and hardware to solve the problem of bandwidth and function of the main memory. Consumption and other needs. In order to reduce power consumption and increase transmission bandwidth, the software adjusts the read and write frequency in real time according to the data flow. The storage controller includes a traffic statistics unit and a clock controller. At the software level, a frequency function is set in the storage control module. According to the statistics of the traffic statistics unit According to the traffic demand, the frequency function is used to determine the target frequency according to the traffic demand, and the frequency output by the clock controller is adjusted through the storage driver. in data flow When the memory requirement is small and the bandwidth requirement is low, the clock controller is controlled to output a low-frequency clock signal to reduce the read and write frequency of the main memory chip, reduce the read and write bandwidth, and also reduce power consumption; when the bandwidth requirement is high, , controlling the clock controller to output a high-frequency clock signal to increase the read and write frequency of the main memory chip and increase the read and write bandwidth. However, the high energy efficiency area of the main memory particles is located at high frequency. Running at low frequency will cause poor main memory energy efficiency and increase the access delay and power consumption of the SoC.
移动端设备的应用场景复杂,因此对主存有着大容量、高带宽和低功耗三个方面的需求,而现有的主存解决方案往往只能解决上述一个方面的需求,无法同时兼顾上述三个方面。基于此,本申请的实施例提供了一种存储器,以同时满足移动端设备对主存的功耗、带宽及容量这三个方面的需求。The application scenarios of mobile devices are complex, so there are three requirements for main memory: large capacity, high bandwidth, and low power consumption. However, existing main memory solutions can often only solve the needs of one of the above aspects and cannot take into account the above requirements at the same time. three aspects. Based on this, embodiments of the present application provide a memory to simultaneously meet the mobile device's requirements for main memory in terms of power consumption, bandwidth, and capacity.
参阅图13,本申请的实施例提供了一种存储器,包括:第一存储区域与第二存储区域;第一存储区域与第二存储区域可以分别设置一个或多个通道(channel),这样第一存储区域与第二存储区域可以独立的被访问或者读写数据。例如,第一存储区域设置通道1(channel1,简写为ch1)、ch2;第二存储区域设置ch3~ch8。每一个通道包括一个或多个存储库(bank),例如,对于第一存区域而言,以ch1为例,ch1包括b1、b2、b3、b4等4个bank,对于第二存储区域而言,以ch4为例,ch4包括b5、b6、b7、b8等4个bank。每个bank均包括一个或多个阵列(array),其中,每一个array具有相同数量的输入输出端口(IO),位于第一存储区域的bank的阵列数量与位于第二存储区域的阵列数量不同,这样第一存储区域的bank的IO端口数量与第二存储区域的bank的IO端口数量不同,也即是说,第一存储区域的bank与第二存储区域的bank在单位时间内可以读写的数据量不同,因此第一存储区域与第二存储区域可以具有不同的读写带宽。由于移动端设备的工作场景复杂,存在不同的带宽需求,例如在运行高算力程序时,对主存的带宽需求大;运行普通程序时对存主存的带宽需求小,本申请实施例提供的存储器可以适应于不同带宽需求的使用场景,例如第一存储区域的读写带宽大于第二存储区域的读写带宽,第一存储区域可以服务于大带宽需求的使用场景,例如神经网络、人工智能、图形处理等;第二存储区域可以服务于低带宽需求的使用场景,例如普通的程序运行等等。第一存储区域与第二存储区域可以被独立访问,这样可以满足SoC不同的运行状态下对带宽的多样化需求。Referring to Figure 13, an embodiment of the present application provides a memory, including: a first storage area and a second storage area; the first storage area and the second storage area can each be provided with one or more channels, so that the first storage area and the second storage area can each be provided with one or more channels. The first storage area and the second storage area can be accessed or read and write data independently. For example, channel 1 (abbreviated as ch1) and ch2 are set in the first storage area; ch3 to ch8 are set in the second storage area. Each channel includes one or more storage banks (banks). For example, for the first storage area, taking ch1 as an example, ch1 includes 4 banks such as b1, b2, b3, and b4. For the second storage area, , taking ch4 as an example, ch4 includes 4 banks such as b5, b6, b7, and b8. Each bank includes one or more arrays, where each array has the same number of input and output ports (IO). The number of arrays in the bank located in the first storage area is different from the number of arrays located in the second storage area. , in this way, the number of IO ports of the bank in the first storage area is different from the number of IO ports of the bank in the second storage area, that is to say, the bank in the first storage area and the bank in the second storage area can read and write in unit time. The amount of data is different, so the first storage area and the second storage area may have different read and write bandwidths. Due to the complex working scenarios of mobile devices, there are different bandwidth requirements. For example, when running high-computing power programs, the bandwidth requirements for the main memory are large; when running ordinary programs, the bandwidth requirements for the main memory are small. The embodiments of this application provide The memory can be adapted to usage scenarios with different bandwidth requirements. For example, the read and write bandwidth of the first storage area is greater than the read and write bandwidth of the second storage area. The first storage area can serve usage scenarios with large bandwidth requirements, such as neural networks, artificial intelligence, etc. Intelligence, graphics processing, etc.; the second storage area can serve usage scenarios with low bandwidth requirements, such as ordinary program running, etc. The first storage area and the second storage area can be accessed independently, which can meet the diverse demands for bandwidth under different operating states of the SoC.
例如,以图13示出的存储器103为例,第一存储区域包括2个通道:ch1、ch2;第二存储区域包括6个通道:ch3、ch4、ch5、ch6、ch7、ch8。每一个通道包括多个bank,例如以ch1为例,ch1包括b1、b2、b3、b4等4个bank,对于第二存储区域而言,以ch4为例,ch4包括b5、b6、b7、b8等4个bank,而位于第一存储区域的bank的阵列数量大于位于第二存储区域的bank的阵列数量。例如,第一存储区域的b1包括2个阵列,第二存储区域的b5包括1个阵列,这样第一存储区域的bank被划分的粒度更细,可以用有更多的IO端口数量,第一存储区域的bank的IO端口数量大于第二存储区域的bank的IO端口数量,在单位时间内,第一存储区域的bank能够读写的数据量大于第二存储区域的bank可以读写的数据量,第一存储区域的读写带宽大于第二存储区域的读写带宽。For example, taking the memory 103 shown in FIG. 13 as an example, the first storage area includes 2 channels: ch1, ch2; the second storage area includes 6 channels: ch3, ch4, ch5, ch6, ch7, and ch8. Each channel includes multiple banks. For example, taking ch1 as an example, ch1 includes 4 banks such as b1, b2, b3, and b4. For the second storage area, taking ch4 as an example, ch4 includes b5, b6, b7, and b8. etc. 4 banks, and the number of arrays of the bank located in the first storage area is greater than the number of arrays of the bank located in the second storage area. For example, b1 of the first storage area includes 2 arrays, and b5 of the second storage area includes 1 array. In this way, the banks of the first storage area are divided into finer granularities, and more IO ports can be used. The number of IO ports of the bank in the storage area is greater than the number of IO ports of the bank in the second storage area. In unit time, the amount of data that the bank in the first storage area can read and write is greater than the amount of data that the bank in the second storage area can read and write. , the read and write bandwidth of the first storage area is greater than the read and write bandwidth of the second storage area.
一般而言,每一个array的IO端口数量一般为2n个,示例性的,最常用的配置为每一个array具有64个IO端口。每一个bank的阵列数量可以为1个、2个、4个或者8个等,即每一个bank的阵列数量可以为20个、21个、22个、23个等,基于此,本申请的实施例中,位于第一存储区域的bank的阵列数量为位于第二存储区域的bank的阵列数量的2n倍。Generally speaking, the number of IO ports per array is generally 2 n . For example, the most commonly used configuration is that each array has 64 IO ports. The number of arrays in each bank can be 1, 2, 4 or 8, etc. That is, the number of arrays in each bank can be 20 , 21 , 22 , 23 , etc. Based on this, In the embodiment of the present application, the number of bank arrays located in the first storage area is 2 n times the number of bank arrays located in the second storage area.
以图13示出的存储器为例,第二存储区域的每个bank(以b5为例)包括1个阵列,每个阵列包括64个IO端口,这样第二存储区域的每个bank具有64个IO端口;第一存储区域的每个bank(以b1为例)的阵列数量为第二存储区域的bank的阵列数量的21倍,每个阵列包括64个IO端口,这样第一存储区域的每个bank具备128个IO端口,数量为第一存储区域的每个bank的IO端口数量的21倍,在读写频率相同的情况下,第一存储区域的各个通道的带宽为第二存储区域的各个通道的带宽的21倍。Taking the memory shown in Figure 13 as an example, each bank (taking b5 as an example) of the second storage area includes 1 array, and each array includes 64 IO ports. In this way, each bank of the second storage area has 64 IO ports. IO ports; the number of arrays in each bank (taking b1 as an example) in the first storage area is 21 times the number of arrays in the bank in the second storage area. Each array includes 64 IO ports, so that the number of arrays in the first storage area is Each bank has 128 IO ports, the number of which is 2.1 times the number of IO ports of each bank in the first storage area. When the read and write frequencies are the same, the bandwidth of each channel in the first storage area is that of the second storage area. 2 1 times the bandwidth of each channel in the area.
上述示例并非是对各个bank的限制,每一个bank的阵列数量可以根据实际的需求以及工艺水平确定,例如,如图14所示,图14中a、b、c、d分别示出了一个bank分别划分为20个、21个、22个、23个阵列的示意图。由于每一个bank的容量是相同的,bank包括的阵列数量越多,每一个阵列的容量越小,bank具有的IO端口数目越多。以一个256Mb的bank为例,划分为1 个阵列时,该阵列的容量为256Mb,具有64个IO端口,bank具有的IO端口数量为64;划分为2个阵列时,每个阵列的容量为128Mb,bank具有的IO端口数量为128;划分为4个阵列时,每个阵列的容量为64Mb,bank具有的IO端口数量为256;划分为8个阵列时,每个阵列的容量为32Mb,bank具有的IO端口数量为512。由此可见,当bank的阵列数量越多时,IO端口数量也越多,存储器的带宽也更大。The above example is not a limitation on each bank. The number of arrays in each bank can be determined according to actual needs and process level. For example, as shown in Figure 14, a, b, c, and d in Figure 14 show one bank respectively. Schematic diagram of the arrays divided into 20 , 21 , 22 , and 23 arrays respectively. Since the capacity of each bank is the same, the more arrays the bank includes, the smaller the capacity of each array, and the more IO ports the bank has. Take a 256Mb bank as an example, divided into 1 When divided into two arrays, the capacity of each array is 128Mb, and the number of IO ports of the bank is 128; When divided into 4 arrays, the capacity of each array is 64Mb, and the number of IO ports of the bank is 256; when divided into 8 arrays, the capacity of each array is 32Mb, and the number of IO ports of the bank is 512. It can be seen that when the number of bank arrays is larger, the number of IO ports is also larger, and the memory bandwidth is also larger.
第一存储区域与第二存储区域的形状可以是多样的,例如,如图13所示,第一存储区域的各个通道可以在存储器芯片面内水平分布,或者如图15所示,也可以竖直分布。The shapes of the first storage area and the second storage area can be diverse. For example, as shown in Figure 13, the channels in the first storage area can be distributed horizontally within the memory chip plane, or as shown in Figure 15, they can also be vertically distributed. Straight distribution.
移动端设备仅仅在部分场景下有大带宽需求,例如视频图像处理、人工智能等场景,在其他的应用场景,移动终端的带宽需求较小,而增大带宽,单位时间内读写的数据增多,势必会增加读写功耗,采用存储器整体扩充带宽的方案存在带宽浪费。由于大带宽的需求较小,对大带宽存储区域的容量要求较低;低带宽的需求较多,对低带宽的存储区域的容量要求较高,本申请实施例提供的存储器,第一存储区域的bank数量小于第二存储区域的bank数量,例如图13所示的存储器103第一存储区域包括两个通道ch1、ch2,每个通道包括4个bank,第一存储区域共8个bank;若每一个bank的容量为256Mb,那么第一存储区域的容量为2048Mb。第二存储区域包括6个通道:ch3~ch8,每个通道包括4个bank,第二存储区域共24个bank,若每一个bank的容量为256Mb,第二存储区域的容量为6114Mb,是第一存储区域的容量的3倍。由于每一个bank的容量是基本相同的,这样第一存储区域的bank数量小于第二存储区域的bank数量,第一存储区域的容量小于第二存储区域的容量;第一存储区域适用于大带宽、低存储容量的使用需求,同时避免功耗过大;第二存储区域适用于低带宽、大存储容量的使用需求,这样既可以保障大容量存储(第二存储区域),也能够保障存储器具有较大的读写带宽(第一存储区域)。Mobile devices only have large bandwidth requirements in some scenarios, such as video image processing, artificial intelligence and other scenarios. In other application scenarios, the bandwidth requirements of mobile terminals are smaller. When the bandwidth is increased, the data read and written per unit time increases. , will inevitably increase the read and write power consumption, and the solution of using the overall memory to expand the bandwidth will waste bandwidth. Since the demand for large bandwidth is small, the capacity requirements for the large-bandwidth storage area are low; the demand for low bandwidth is greater, and the capacity requirements for the low-bandwidth storage area are higher. The memory provided by the embodiment of the present application has a first storage area The number of banks is less than the number of banks in the second storage area. For example, the first storage area of the memory 103 shown in Figure 13 includes two channels ch1 and ch2, each channel includes 4 banks, and the first storage area has a total of 8 banks; if The capacity of each bank is 256Mb, so the capacity of the first storage area is 2048Mb. The second storage area includes 6 channels: ch3~ch8. Each channel includes 4 banks. The second storage area has a total of 24 banks. If the capacity of each bank is 256Mb, the capacity of the second storage area is 6114Mb. 3 times the capacity of a storage area. Since the capacity of each bank is basically the same, the number of banks in the first storage area is smaller than the number of banks in the second storage area, and the capacity of the first storage area is smaller than the capacity of the second storage area; the first storage area is suitable for large bandwidth , low storage capacity usage requirements, while avoiding excessive power consumption; the second storage area is suitable for low bandwidth, large storage capacity usage requirements, which can not only ensure large-capacity storage (second storage area), but also ensure that the memory has Large read and write bandwidth (first storage area).
第一存储区域与第二存储区域可以独立的进行数据的读写,例如第一存储区域可以仅在大带宽使用场景下工作,在低带宽需求的使用场景下,第一存储区域可以被配置为关闭状态,可以降低存储器的功耗;此外,第一存储区域的bank数量小于第二存储区域的bank数量,第一存储区域的规模较小,与提高存储器整体的带宽方案相比,本申请实施例提供的存储器可以实现更低的读写功耗。The first storage area and the second storage area can independently read and write data. For example, the first storage area can only work in large-bandwidth usage scenarios. In low-bandwidth demand usage scenarios, the first storage area can be configured as The closed state can reduce the power consumption of the memory; in addition, the number of banks in the first storage area is smaller than the number of banks in the second storage area, and the scale of the first storage area is smaller. Compared with the solution to increase the overall bandwidth of the memory, this application implements The memory provided in the example can achieve lower reading and writing power consumption.
本申请实施例提供的存储器包括第一存储区域与第二存储区域,第一存储区域与第二存储区域可以独立的进行读写,第一存储区域与第二存储区域包括多个容量大小相同的bank,其中第一存储区域的bank的数量小于第二存储区域的bank的数量,而位于第一存储区域的bank划分的粒度更细,具有更多的阵列数量;位于第二存储区域的bank具有更少的阵列数量,这样在同一个存储器芯片中同时提供了大带宽、小容量的第一存储区域和小带宽大容量的第二存储区域,兼顾了移动端设备SoC对主存的带宽、容量的需求,并且第一存储区域和第二存储区域可以独立进行读写,相比与提高整个存储器的带宽而言,本申请实施例提供的存储器仅提高了第一存储区域的读写带宽,而第一存储区域在整个存储器中仅仅占用了较小的部分面积,因此能够实现更低的读写功耗。The memory provided by the embodiment of the present application includes a first storage area and a second storage area. The first storage area and the second storage area can be read and written independently. The first storage area and the second storage area include multiple storage areas with the same capacity. bank, where the number of banks in the first storage area is less than the number of banks in the second storage area, and the banks located in the first storage area are divided into finer granularities and have more arrays; the banks located in the second storage area have With a smaller number of arrays, the same memory chip simultaneously provides a large-bandwidth, small-capacity first storage area and a small-bandwidth, large-capacity second storage area, taking into account the bandwidth and capacity of the main memory of the mobile device SoC. requirements, and the first storage area and the second storage area can be read and written independently. Compared with increasing the bandwidth of the entire memory, the memory provided by the embodiment of the present application only improves the read and write bandwidth of the first storage area, and The first storage area only occupies a smaller part of the entire memory area, so it can achieve lower reading and writing power consumption.
在移动端设备中,主存和SoC通常处于不同的die,但为了减少设备内部空间的占用,简化连线等,SoC与主存通常被封装在同一个封装内。In mobile devices, the main memory and the SoC are usually on different dies. However, in order to reduce the internal space occupation of the device and simplify wiring, the SoC and the main memory are usually packaged in the same package.
示例性的,参阅图16,图16示出了本申请实施例提供的一种芯片堆叠结构,包括SoC与存储器103,SoC与存储器103堆叠设置,SoC与103通过触点(bump)和硅通孔技术(through-silicon-via,TSV)实现数据信号、控制信号与地址信号的传输,从而SoC可以从存储器103读出数据或者向存储器103写入数据。For example, refer to Figure 16. Figure 16 shows a chip stack structure provided by an embodiment of the present application, including an SoC and a memory 103. The SoC and the memory 103 are stacked. The SoC and the memory 103 are connected through a contact (bump) and a silicon pass. Through-silicon-via (TSV) technology realizes the transmission of data signals, control signals and address signals, so that the SoC can read data from the memory 103 or write data to the memory 103.
其中,存储器103包括第一存储区域与第二存储区域,第一存储区域与第二存储区域可以独立的进行读写,第一存储区域与第二存储区域包括多个容量大小相同的bank,其中第一存储区域的bank的数量小于第二存储区域的bank的数量,而位于第一存储区域的bank划分的粒度更细,具有更多的阵列数量,更多的IO端口,因此第一存储区域与SoC之间需要触点的密度更高;位于第二存储区域的bank具有更少的阵列数量,更少的IO端口,因此第二存储区域与SoC之间需要触点的密度更少。这样在同一个存储器103芯片中同时提供了大带宽、小容量的第一存储区域和小带宽大容量的第二存储区域,兼顾了移动端设备SoC对主存的带宽、容量的需求。 The memory 103 includes a first storage area and a second storage area. The first storage area and the second storage area can be read and written independently. The first storage area and the second storage area include a plurality of banks with the same capacity, where The number of banks in the first storage area is smaller than the number of banks in the second storage area, and the banks located in the first storage area are divided into finer granularities, with more arrays and more IO ports, so the first storage area A higher density of contacts is required between the second storage area and the SoC; the bank located in the second storage area has a smaller number of arrays and fewer IO ports, so a lower density of contacts is required between the second storage area and the SoC. In this way, the same memory 103 chip simultaneously provides a first storage area with large bandwidth and small capacity and a second storage area with small bandwidth and large capacity, taking into account the main memory bandwidth and capacity requirements of the mobile device SoC.
第一存储区域与第二存储区域可以分别设置一个或多个通道,这样第一存储区域与第二存储区域可以独立的被访问或者读写数据,例如,存储器103设置有第一数据总线、第一地址总线、第一控制总线;存储器103还设置有第二数据总线、第二地址总线与第二控制总线,其中第一数据总线、第一控制总线与第一地址总线用于实现第一存储区域的数据读写;第二数据总线、第二控制总线与第二地址总线用于实现第二存储区域的数据读写。The first storage area and the second storage area can each be provided with one or more channels, so that the first storage area and the second storage area can be independently accessed or read and write data. For example, the memory 103 is provided with a first data bus, a third an address bus and a first control bus; the memory 103 is also provided with a second data bus, a second address bus and a second control bus, wherein the first data bus, the first control bus and the first address bus are used to implement the first storage The second data bus, the second control bus and the second address bus are used to read and write data in the second storage area.
SoC中对应存储器103的第一存储区域与第二存储区域分别设置有第一内存控制器与第二内存控制器,其中第一内存控制器通过上述的第一数据总线、第一控制总线与第一地址总线与第一存储区域连接,用于实现存储器103的第一存储区域的读写控制,第二内存控制器通过第二数据总线、第二控制总线与第二地址总线与第二存储区域连接,用于实现存储器103的第二存储区域的读写控制。A first memory controller and a second memory controller are respectively provided in the SoC corresponding to the first storage area and the second storage area of the memory 103. The first memory controller connects the first data bus, the first control bus and the third memory controller. An address bus is connected to the first storage area for realizing read and write control of the first storage area of the memory 103. The second memory controller communicates with the second storage area through the second data bus, the second control bus and the second address bus. The connection is used to realize read and write control of the second storage area of the memory 103.
SoC与主存可以采用多种封装形式,例如2.5D封装、3D封装、多芯片模块(multichip module,MCM)封装、封装体层叠(package on package,PoP)形式的封装。示例性的,在图16的基础上,参阅图17,本申请实施例提供的一种芯片封装结构。芯片封装结构包括封装基板以及上述的芯片堆叠结构,芯片堆叠结构设置在封装基板上,这种所有的芯片全部堆叠在一起的封装形式被称为3D封装。SoC and main memory can adopt a variety of packaging forms, such as 2.5D packaging, 3D packaging, multichip module (MCM) packaging, and package on package (PoP) packaging. For example, on the basis of Figure 16, refer to Figure 17, which shows a chip packaging structure provided by the embodiment of the present application. The chip packaging structure includes a packaging substrate and the above-mentioned chip stacking structure. The chip stacking structure is arranged on the packaging substrate. This packaging form in which all chips are stacked together is called 3D packaging.
这里的芯片堆叠结构包括SoC与存储器103,在芯片封装结构中,如图18所示,可以是SoC靠近封装基板、存储器103远离封装基板;如图17所示,也可以是存储器103靠近封装基板,SoC远离封装基板。The chip stack structure here includes SoC and memory 103. In the chip packaging structure, as shown in Figure 18, the SoC can be close to the packaging substrate and the memory 103 is far away from the packaging substrate; as shown in Figure 17, the memory 103 can also be close to the packaging substrate. , the SoC is far away from the packaging substrate.
结合图19,图19示出了本申请实施例提供的另一种芯片封装结构,包括封装基板、SoC以及存储器103,其中封装基板包括第一平面,第一平面包括不想交的第一区域与第二区域,SoC设置于第一区域,存储器103设置于第二区域,存储器103通过连接线与SoC电连接。这类将多个芯片组装在同一块封装基板上的封装形式称为多芯片模块(multichip module,MCM)封装。Combined with Figure 19, Figure 19 shows another chip packaging structure provided by an embodiment of the present application, including a packaging substrate, an SoC and a memory 103. The packaging substrate includes a first plane, and the first plane includes a first area and a first area that do not want to intersect. In the second area, the SoC is arranged in the first area, and the memory 103 is arranged in the second area. The memory 103 is electrically connected to the SoC through connecting lines. This type of packaging that assembles multiple chips on the same packaging substrate is called multichip module (MCM) packaging.
存储器103同时提供了大带宽、小容量的第一存储区域和小带宽大容量的第二存储区域,兼顾了移动端设备SoC对主存的带宽、容量的需求,但在一些情况下,存储器103的容量有限,无法满足SoC对主存容量的需求,这样还可以通过堆叠多个存储器芯片来扩大容量。The memory 103 simultaneously provides a first storage area with large bandwidth and small capacity and a second storage area with small bandwidth and large capacity, taking into account the bandwidth and capacity requirements of the main memory of the mobile device SoC. However, in some cases, the memory 103 The capacity is limited and cannot meet the SoC's demand for main memory capacity. In this way, the capacity can be expanded by stacking multiple memory chips.
示例性的,参阅图20,图20示出了本申请实施例提供的另一种芯片堆叠结构,包括SoC与多个存储器,这里的多个存储器中的至少一个存储器为本申请前述实施例提供的存储器103,即包括大带宽、小容量的第一存储区域和小带宽大容量的第二存储区域。Exemplarily, refer to FIG. 20 , which shows another chip stack structure provided by an embodiment of the present application, including an SoC and multiple memories. At least one memory among the multiple memories here is provided by the previous embodiment of the present application. The memory 103 includes a first storage area with large bandwidth and small capacity, and a second storage area with small bandwidth and large capacity.
例如,芯片堆叠结构包括SoC、存储器103与存储器105,其中存储器103包括第一存储区域与第二存储区域,第一存储区域与第二存储区域可以独立的进行读写,第一存储区域与第二存储区域包括多个容量大小相同的bank,其中第一存储区域的bank的数量小于第二存储区域的bank的数量,而位于第一存储区域的bank划分的粒度更细,具有更多的阵列数量;位于第二存储区域的bank具有更少的阵列数量,这样在同一个存储器103芯片中同时提供了大带宽、小容量的第一存储区域和小带宽大容量的第二存储区域,兼顾了移动端设备SoC对主存的带宽、容量的需求。存储器105的存储区域中各个bank的阵列数量是相同的,提供的带宽也是相同的,例如存储器105的存储区域中各个bank的阵列数量与存储器103中第二存储区域的各个bank具有相同的阵列数量,这样存储器105的可以与存储器103的第二存储区域具有相同的读写带宽。For example, the chip stack structure includes an SoC, a memory 103 and a memory 105. The memory 103 includes a first storage area and a second storage area. The first storage area and the second storage area can be read and written independently. The first storage area and the second storage area can be read and written independently. The second storage area includes multiple banks with the same capacity. The number of banks in the first storage area is smaller than the number of banks in the second storage area, and the banks located in the first storage area are divided into finer granularities and have more arrays. Quantity; the bank located in the second storage area has a smaller number of arrays, so that the same memory 103 chip provides both a large bandwidth and small capacity first storage area and a small bandwidth and large capacity second storage area, taking into account Mobile device SoC’s requirements for main memory bandwidth and capacity. The number of arrays of each bank in the storage area of the memory 105 is the same, and the bandwidth provided is also the same. For example, the number of arrays of each bank in the storage area of the memory 105 is the same as the number of arrays of each bank in the second storage area of the memory 103. , so that the memory 105 can have the same read and write bandwidth as the second storage area of the memory 103 .
基于图20所示的芯片堆叠结构,本申请实施例提供了另一种芯片封装结构,包括封装基板、SoC以及多个存储器,这里的多个存储器中,至少一个存储器为本申请前述实施例提供的存储器103,即包括大带宽、小容量的第一存储区域和小带宽大容量的第二存储区域。Based on the chip stack structure shown in Figure 20, this embodiment of the present application provides another chip packaging structure, including a packaging substrate, an SoC and multiple memories. Among the multiple memories here, at least one memory is provided by the previous embodiment of the present application. The memory 103 includes a first storage area with large bandwidth and small capacity, and a second storage area with small bandwidth and large capacity.
示例性的,参阅图21,芯片封装结构包括封装基板、SoC、存储器103与存储器105,其中存储器103包括大带宽、小容量的第一存储区域和小带宽大容量的第二存储区域,其中第一存储区域的bank的数量小于第二存储区域的bank的数量,位于第一存储区域的bank划分的粒度更细,具有更多的阵列数量;位于第二存储区域的bank具有更少的阵列数量,兼顾了移动端设备SoC对主存的带宽、容量的需求。存储器105与存储器103的结构不同,存储器105的存储区域中各个bank的阵列数量是相同的,例如存储器105的存储区域中各个bank的阵列数量与存储器103中第二存储区域的各个bank具有相同的阵列数量,这样存储器105的可以与存储器103的第二存 储区域具有相同的读写带宽。For example, referring to Figure 21, the chip packaging structure includes a packaging substrate, SoC, memory 103 and memory 105, wherein the memory 103 includes a first storage area with large bandwidth and small capacity and a second storage area with small bandwidth and large capacity, wherein the The number of banks in one storage area is smaller than the number of banks in the second storage area. The banks located in the first storage area are divided into finer granularities and have more arrays; the banks located in the second storage area have fewer arrays. , taking into account the bandwidth and capacity requirements of mobile device SoC for main memory. The structure of memory 105 is different from that of memory 103. The number of arrays of each bank in the storage area of memory 105 is the same. For example, the number of arrays of each bank in the storage area of memory 105 is the same as that of each bank in the second storage area of memory 103. number of arrays, so that the second memory of memory 105 can be combined with the second memory of memory 103 The storage area has the same read and write bandwidth.
示例性的,参阅图22,本申请实施例还提供了另一种芯片封装结构,芯片封装结构包括封装基板、SoC、存储器103与存储器105,其中存储器103包括大带宽、小容量的第一存储区域和小带宽大容量的第二存储区域,其中第一存储区域的bank的数量小于第二存储区域的bank的数量,位于第一存储区域的bank划分的粒度更细,具有更多的阵列数量;位于第二存储区域的bank具有更少的阵列数量,兼顾了移动端设备SoC对主存的带宽、容量的需求。存储器105的结构与存储器103的结构相同,这样相当于同时将存储器103提供的大带宽存储区域与低带宽存储区域的存储容量翻倍。Illustratively, referring to Figure 22, this embodiment of the present application also provides another chip packaging structure. The chip packaging structure includes a packaging substrate, an SoC, a memory 103 and a memory 105. The memory 103 includes a large-bandwidth, small-capacity first memory. area and a second storage area with small bandwidth and large capacity. The number of banks in the first storage area is smaller than the number of banks in the second storage area. The banks located in the first storage area are divided into finer granularities and have more arrays. ; The bank located in the second storage area has a smaller number of arrays, taking into account the main memory bandwidth and capacity requirements of the mobile device SoC. The structure of the memory 105 is the same as that of the memory 103, which is equivalent to doubling the storage capacity of the large-bandwidth storage area and the low-bandwidth storage area provided by the memory 103 at the same time.
上述仅仅是对本申请实施例提供的以堆叠的方式扩充主存容量的方案进行说明,在任意一种封装形式中,都可以堆叠多个存储器芯片以扩展主存的容量。堆叠的多个存储器芯片中,可以包括至少一个本申请前述实施例提供的存储器103,即包括大带宽、低容量的第一存储区域与低带宽、大容量的第二存储区域的存储器,当然堆叠的多个存储器芯片中也可以全部为本申请前述实施例提供的存储器103。The above is only an explanation of the solution for expanding the main memory capacity in a stacking manner provided by the embodiment of the present application. In any packaging form, multiple memory chips can be stacked to expand the capacity of the main memory. The stacked multiple memory chips may include at least one memory 103 provided in the previous embodiments of the present application, that is, a memory including a large-bandwidth, low-capacity first storage area and a low-bandwidth, large-capacity second storage area. Of course, stacking All of the multiple memory chips may be the memory 103 provided in the previous embodiments of the present application.
本申请实施例提供的各个芯片堆叠结构,其堆叠的方式包括以下各种方式中的一种或多种,例如:硅通孔(through-silicon-via,TSV)连接、芯片上芯片(Chip on Chip)、晶元上芯片(Chip on Wafer)、晶元上晶元(wafer on wafer),当然也还可以是其他的堆叠方式。The stacking methods of each chip stack structure provided by the embodiments of the present application include one or more of the following methods, such as: through-silicon-via (TSV) connection, chip on chip (Chip on chip) Chip), chip on wafer (Chip on Wafer), wafer on wafer (wafer on wafer), and of course other stacking methods are also possible.
MCM封装将多个芯片封装在同一块封装基板上,在此基础上,同样可以对存储器芯片进行堆叠,以扩大主存的容量。例如,在图19的基础上,参阅图23,本申请实施例提供了另外一种芯片封装结构,包括封装基板、SoC以及多个存储器;封装基板包括第一平面,第一平面包括不想交的第一区域与第二区域,SoC设置于第一区域;多个存储器堆叠设置于第二区域形成芯片堆叠结构,多个存储器中的至少一个为本申请前述实施例提供的存储器,即包括大带宽、低容量的第一存储区域与低带宽、大容量的第二存储区域的存储器103,当然堆叠的多个存储器芯片中也可以全部为本申请前述实施例提供的存储器103。MCM packaging encapsulates multiple chips on the same packaging substrate. On this basis, memory chips can also be stacked to expand the capacity of main memory. For example, on the basis of Figure 19, referring to Figure 23, the embodiment of the present application provides another chip packaging structure, including a packaging substrate, SoC and multiple memories; the packaging substrate includes a first plane, and the first plane includes an unintended The first area and the second area, the SoC is disposed in the first area; a plurality of memory stacks are disposed in the second area to form a chip stack structure, and at least one of the plurality of memories is the memory provided in the previous embodiment of the present application, that is, it includes a large bandwidth , a low-capacity first storage area and a low-bandwidth, large-capacity second storage area memory 103. Of course, the multiple stacked memory chips may all be the memories 103 provided in the previous embodiments of the present application.
此外,由于SoC可以独立访问第一存储区域与第二存储区域,也就是说,以存储器103与存储器105为例,存储器103包括第一存储区域与第二存储区域,第一存储区域的bank的阵列数量与第二存储区域的bank的阵列数量不同,例如第一存储区域的bank的阵列数量大于第二存储区域的bank的阵列数量,SoC设置有第一内存控制器与第二内存控制器,利用第一内存控制器对第一存储区域进行读写控制,利用第二内存控制器对第二存储区域进行读写控制;存储器103的存储区域中各个bank的阵列的数量是相同的,例如,存储器103的存储区域中各个bank的阵列数量与存储器103的第二存储区域的bank的数量相同,这样存储器105与存储器103的第二存储区域的带宽可以相同,SoC可以利用第二内存控制器对存储器105进行读写控制。In addition, since the SoC can independently access the first storage area and the second storage area, that is to say, taking the memory 103 and the memory 105 as an example, the memory 103 includes the first storage area and the second storage area, and the bank of the first storage area The number of arrays is different from the number of arrays of banks in the second storage area. For example, the number of arrays of banks in the first storage area is greater than the number of arrays of banks in the second storage area. The SoC is provided with a first memory controller and a second memory controller. The first memory controller is used to perform read and write control on the first storage area, and the second memory controller is used to perform read and write control on the second storage area; the number of arrays of each bank in the storage area of the memory 103 is the same, for example, The number of arrays of each bank in the storage area of the memory 103 is the same as the number of banks in the second storage area of the memory 103. In this way, the bandwidth of the memory 105 and the second storage area of the memory 103 can be the same, and the SoC can use the second memory controller to The memory 105 performs read and write control.
由于存储器103具有两个独立的存储区域,第一存储区域与第二存储区域需要分别设置地址总线、控制总线与数据总线;而存储器105仅具有一个整体的存储区域,因此存储器103与SoC之间用于读写控制的互联线更多更复杂,存储器105与SoC之间用于读写控制的互联线更少更简单,并且在堆叠设置的情况下,可以沿用一部分存储器103与SoC之间的互联线,因此在具备多个存储器堆叠的情况下,具有第一存储区域与第二存储区域的存储器处于多个堆叠的存储器中更靠近SoC的一侧,以减少互联线长度,减小连线电容,降低地址解码器等读写控制电路的复杂程度,进一步可以降低读写功耗。Since the memory 103 has two independent storage areas, the first storage area and the second storage area need to be provided with an address bus, a control bus and a data bus respectively; while the memory 105 only has one overall storage area, so there is no connection between the memory 103 and the SoC. The interconnection lines used for read and write control are more and more complex, and the interconnection lines used for read and write control between the memory 105 and the SoC are fewer and simpler, and in the case of stacking settings, part of the interconnection lines between the memory 103 and the SoC can be used. Interconnect lines, so in the case of multiple memory stacks, the memory with the first storage area and the second storage area is located on the side closer to the SoC among the multiple stacked memories to reduce the length of the interconnect lines and reduce the wiring. Capacitors reduce the complexity of read and write control circuits such as address decoders, and further reduce read and write power consumption.
此外,存储器103包括第一存储区域与第二存储区域,第一存储区域具有更大的读写带宽,可以满足SoC中的高算力模块的高带宽需求,这里的高算力模块一般包括图形处理器(graphic processing unit,GPU)或者网络处理器(neural-network processing unit,NPU)等,那么在SoC于存储器堆叠的情况下,例如,参阅图24,可以将SoC的高算力模块与第一存储区域在堆叠方向上对齐,例如SoC的高算力模块在封装基板上的投影与第一存储区域在封装基板上的投影位于同一区域,这样同样可以减少二者之间的互联线长度,减小连线电容,有利于降低读写功耗。In addition, the memory 103 includes a first storage area and a second storage area. The first storage area has a larger read and write bandwidth and can meet the high bandwidth requirements of high computing power modules in the SoC. The high computing power modules here generally include graphics. processor (graphic processing unit, GPU) or network processor (neural-network processing unit, NPU), etc., then when the SoC is stacked on the memory, for example, see Figure 24, the high computing power module of the SoC can be combined with the third A storage area is aligned in the stacking direction. For example, the projection of the SoC's high computing power module on the packaging substrate and the projection of the first storage area on the packaging substrate are located in the same area. This can also reduce the length of the interconnection line between the two. Reducing the connection capacitance will help reduce reading and writing power consumption.
本申请实施例还提供一种存储装置,该存储设备包括印制电路板(printed circuit board,PCB)、以及与印制电路板连接的存储器,该存储器可以为上文所提供的任一种存储器。其中,该印制电路板用于为该存储器中所包括的电子元器件提供电气连接。可选的,该存储设备可以为计算机、 手机、平板电脑、可穿戴设备和车载设备等不同类型的用户设备或者终端设备。An embodiment of the present application also provides a storage device. The storage device includes a printed circuit board (PCB) and a memory connected to the printed circuit board. The memory can be any of the memories provided above. . Wherein, the printed circuit board is used to provide electrical connections for electronic components included in the memory. Optionally, the storage device can be a computer, Different types of user devices or terminal devices such as mobile phones, tablets, wearable devices, and vehicle-mounted devices.
可选的,该存储设备还可以包括封装基板,该封装基板通过焊球固定于印制电路板PCB上,该存储器通过焊球固定于封装基板上,该封装基板用于封装该存储器。Optionally, the storage device may further include a packaging substrate, the packaging substrate is fixed on the printed circuit board PCB through solder balls, the memory is fixed on the packaging substrate through solder balls, and the packaging substrate is used to package the memory.
在本申请的另一方面,还提供一种存储装置,该存储装置包括第一控制器、第二控制器和存储器,第一控制器、第二控制器用于控制该存储器中的读写,该存储器可以为本申请实施例提供的存储器103。示例性的,存储装置可以应用于移动端设备,例如手机、平板电脑等,第一控制器可以为第一内存控制器,第二控制器可以为第二内存控制器,存储器包括大带宽、低容量的第一存储区域和低带宽、大容量的第二存储区域,第一内存控制器用于控制第一存储区域的读写,第二内存控制器用于控制第二存储区域的读写。In another aspect of the present application, a storage device is also provided. The storage device includes a first controller, a second controller and a memory. The first controller and the second controller are used to control reading and writing in the memory. The memory may be the memory 103 provided in the embodiment of this application. For example, the storage device can be applied to mobile devices, such as mobile phones, tablet computers, etc., the first controller can be a first memory controller, the second controller can be a second memory controller, and the memory includes large bandwidth, low A first storage area with a large capacity and a second storage area with a low bandwidth and a large capacity, the first memory controller is used to control the reading and writing of the first storage area, and the second memory controller is used to control the reading and writing of the second storage area.
在本申请的另一方面,还提供一种电子设备,该电子设备包括印制电路板(print circuit board,PCB)以及如上述实施例中图17~图23提供的芯片封装结构,芯片封装结构包括存储器、SoC与封装基板,芯片封装结构与印制电路板连接,例如可以是封装基板与印刷电路板电连接。In another aspect of the present application, an electronic device is also provided. The electronic device includes a printed circuit board (PCB) and a chip packaging structure as provided in Figures 17 to 23 in the above embodiments. The chip packaging structure Including memory, SoC and packaging substrate, the chip packaging structure is connected to the printed circuit board, for example, the packaging substrate and the printed circuit board can be electrically connected.
可选的,该电子设备为计算机系统、手机、平板电脑、可穿戴设备和车载设备等不同类型的用户设备或者终端设备。需要说明的是,需要说明的是,上述关于该存储器的相关描述,均可对应援引到本申请所提供的存储装置和电子设备中,本申请实施例在此不再赘述。Optionally, the electronic device is different types of user equipment or terminal equipment such as computer systems, mobile phones, tablets, wearable devices, and vehicle-mounted equipment. It should be noted that the above-mentioned relevant descriptions of the memory can be correspondingly cited in the storage device and electronic equipment provided in the present application, and the embodiments of the present application will not be repeated here.
最后应说明的是:以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何在本申请揭露的技术范围内的变化或替换,都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以所述权利要求的保护范围为准。 Finally, it should be noted that the above are only specific implementation modes of the present application, but the protection scope of the present application is not limited thereto. Any changes or substitutions within the technical scope disclosed in the present application shall be covered by this application. within the scope of protection applied for. Therefore, the protection scope of this application should be subject to the protection scope of the claims.

Claims (12)

  1. 一种存储器,其特征在于,包括:第一存储区域与第二存储区域;A memory, characterized in that it includes: a first storage area and a second storage area;
    所述第一存储区域与所述第二存储区域均包括多个存储库bank,每个bank包括一个或多个阵列array,每一个所述阵列具有相同数量的输入输出端口;The first storage area and the second storage area each include a plurality of storage banks, each bank includes one or more array arrays, and each of the arrays has the same number of input and output ports;
    其中,位于所述第一存储区域的bank的阵列的数量与位于所述第二存储区域的bank的阵列的数量不同。Wherein, the number of bank arrays located in the first storage area is different from the number of bank arrays located in the second storage area.
  2. 根据权利要求1所述的存储器,其特征在于,位于所述第一存储区域的bank的阵列的数量大于所述第二存储区域的bank的阵列的数量。The memory according to claim 1, wherein the number of bank arrays located in the first storage area is greater than the number of bank arrays in the second storage area.
  3. 根据权利要求1所述的存储器,其特征在于,所述位于所述第一存储区域的bank的阵列的数量为位于所述第二存储区域的bank的阵列的数量2n倍,n为正整数。The memory according to claim 1, wherein the number of bank arrays located in the first storage area is 2 n times the number of bank arrays located in the second storage area, and n is a positive integer. .
  4. 根据权利要求2所述的存储器,其特征在于,所述第一存储区域的bank的数量小于所述第二存储区域的bank的数量。The memory of claim 2, wherein the number of banks in the first storage area is smaller than the number of banks in the second storage area.
  5. 根据权利要求1~4任一项所述的存储器,其特征在于,任意两个bank的容量大小相同。The memory according to any one of claims 1 to 4, characterized in that any two banks have the same capacity.
  6. 根据权利要求1~5任一项所述的存储器,其特征在于,所述存储器包括连接所述第一存储区域的第一数据总线与连接所述第二存储区域的第二数据总线;The memory according to any one of claims 1 to 5, characterized in that the memory includes a first data bus connecting the first storage area and a second data bus connecting the second storage area;
    所述第一数据总线用于向所述第一存储区域读写数据,所述第二数据总线用于向所述第二存储区域读写数据。The first data bus is used to read and write data to the first storage area, and the second data bus is used to read and write data to the second storage area.
  7. 一种芯片堆叠结构,其特征在于,所述芯片堆叠结构包括系统及芯片SoC与一个或多个存储器,所述SoC与所述一个或多个存储器依次堆叠设置;A chip stack structure, characterized in that the chip stack structure includes a system and a chip SoC and one or more memories, and the SoC and the one or more memories are stacked in sequence;
    所述一个或多个存储器中的至少一个为如权利要求1~6任一项所述的存储器。At least one of the one or more memories is the memory according to any one of claims 1 to 6.
  8. 根据权利要求7所述的芯片堆叠结构,其特征在于,所述堆叠的方式包括以下各种方式中的一种或多种:硅通孔TSV连接、芯片上芯片、晶元上芯片、晶元上晶元。The chip stack structure according to claim 7, wherein the stacking method includes one or more of the following methods: through silicon via TSV connection, chip on chip, chip on wafer, wafer Upper wafer.
  9. 一种芯片封装结构,其特征在于,包括封装基板以及如权利要求7~8任一项所述的芯片堆叠结构,所述芯片堆叠结构设置在所述封装基板上。A chip packaging structure, characterized by comprising a packaging substrate and the chip stack structure according to any one of claims 7 to 8, the chip stack structure being arranged on the packaging substrate.
  10. 一种芯片封装结构,其特征在于,包括封装基板、系统级芯片SoC以及如权利要求1~6任一项所述的存储器;A chip packaging structure, characterized in that it includes a packaging substrate, a system-on-chip SoC, and the memory according to any one of claims 1 to 6;
    所述封装基板包括第一平面,所述第一平面包括第一区域与第二区域,所述SoC设置于所述第一区域,所述存储器设置于所述第二区域,所述存储器通过连接线与所述SoC电连接。The packaging substrate includes a first plane, the first plane includes a first area and a second area, the SoC is disposed in the first area, the memory is disposed in the second area, and the memory is connected through lines are electrically connected to the SoC.
  11. 一种芯片封装结构,其特征在于,包括封装基板、系统级芯片以及多个存储器;A chip packaging structure, characterized by including a packaging substrate, a system-level chip and multiple memories;
    所述封装基板包括第一平面,所述第一平面包括第一区域与第二区域,所述SoC设置于所述第一区域;The packaging substrate includes a first plane, the first plane includes a first area and a second area, and the SoC is disposed in the first area;
    多个所述存储器堆叠设置于所述第二区域形成芯片堆叠结构,多个所述存储器中的至少一个为如权利要求1~6任一项所述的存储器;A plurality of the memory stacks are arranged in the second area to form a chip stack structure, and at least one of the plurality of memories is the memory according to any one of claims 1 to 6;
    所述芯片堆叠结构通过连接线与所述SoC电连接。The chip stack structure is electrically connected to the SoC through connecting wires.
  12. 一种电子设备,其特征在于,包括印制电路板以及如权利要求9~11任一项所述的芯片封装结构;An electronic device, characterized by comprising a printed circuit board and a chip packaging structure as claimed in any one of claims 9 to 11;
    所述芯片封装结构中的封装基板与所述印刷电路板电连接。 The packaging substrate in the chip packaging structure is electrically connected to the printed circuit board.
PCT/CN2023/102132 2022-06-29 2023-06-25 Memory, chip stack structure, chip package structure, and electronic device WO2024001962A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210753276.1 2022-06-29
CN202210753276.1A CN117389459A (en) 2022-06-29 2022-06-29 Memory, chip stacking structure, chip packaging structure and electronic equipment

Publications (1)

Publication Number Publication Date
WO2024001962A1 true WO2024001962A1 (en) 2024-01-04

Family

ID=89383213

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/102132 WO2024001962A1 (en) 2022-06-29 2023-06-25 Memory, chip stack structure, chip package structure, and electronic device

Country Status (2)

Country Link
CN (1) CN117389459A (en)
WO (1) WO2024001962A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102800352A (en) * 2011-05-27 2012-11-28 三星电子株式会社 Integrated circuit memory device
CN109840223A (en) * 2017-11-29 2019-06-04 三星电子株式会社 Memory device, including its electronic device and electronic device operating method
CN110047522A (en) * 2017-12-28 2019-07-23 三星电子株式会社 Storage equipment and electronic equipment including isomery volatile memory chip
CN110870011A (en) * 2017-06-12 2020-03-06 美光科技公司 Fine granularity dynamic random access memory

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102800352A (en) * 2011-05-27 2012-11-28 三星电子株式会社 Integrated circuit memory device
CN110870011A (en) * 2017-06-12 2020-03-06 美光科技公司 Fine granularity dynamic random access memory
CN109840223A (en) * 2017-11-29 2019-06-04 三星电子株式会社 Memory device, including its electronic device and electronic device operating method
CN110047522A (en) * 2017-12-28 2019-07-23 三星电子株式会社 Storage equipment and electronic equipment including isomery volatile memory chip

Also Published As

Publication number Publication date
CN117389459A (en) 2024-01-12

Similar Documents

Publication Publication Date Title
US11145384B2 (en) Memory devices and methods for managing error regions
US9293170B2 (en) Configurable bandwidth memory devices and methods
US7929368B2 (en) Variable memory refresh devices and methods
JP2024028937A (en) Memory system that combines high-density, low-bandwidth memory and low-density, high-bandwidth memory
KR101429869B1 (en) Memory circuit system and method
US8456880B2 (en) Multiple layers of memory implemented as different memory technology
US11789644B2 (en) Memory centric system incorporating computational memory
TW202022615A (en) Memory system
CN116737617B (en) Access controller
WO2024001962A1 (en) Memory, chip stack structure, chip package structure, and electronic device
US20230307030A1 (en) Adaptive Wordline Refresh
CN216119560U (en) LLC chip and cache system
CN114610665A (en) Memory expansion card
CN117149700B (en) Data processing chip, manufacturing method thereof and data processing system
US20240170038A1 (en) Adaptive Refresh Staggering
CN114038491A (en) 3D dynamic storage device, data reading method, data writing method and memory equipment
CN117453619A (en) Data processing chip, manufacturing method thereof and data processing system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23830140

Country of ref document: EP

Kind code of ref document: A1