CN116110450A - Method for realizing analog and multi-value content addressable memory based on pulse width modulation ferroelectric field effect transistor - Google Patents

Method for realizing analog and multi-value content addressable memory based on pulse width modulation ferroelectric field effect transistor Download PDF

Info

Publication number
CN116110450A
CN116110450A CN202310129169.6A CN202310129169A CN116110450A CN 116110450 A CN116110450 A CN 116110450A CN 202310129169 A CN202310129169 A CN 202310129169A CN 116110450 A CN116110450 A CN 116110450A
Authority
CN
China
Prior art keywords
search
pulse width
fefet
vdd
matching
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310129169.6A
Other languages
Chinese (zh)
Inventor
黄如
徐伟凯
黄芊芊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Peking University
Original Assignee
Peking University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Peking University filed Critical Peking University
Priority to CN202310129169.6A priority Critical patent/CN116110450A/en
Publication of CN116110450A publication Critical patent/CN116110450A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C11/00Digital stores characterised by the use of particular electric or magnetic storage elements; Storage elements therefor
    • G11C11/21Digital stores characterised by the use of particular electric or magnetic storage elements; Storage elements therefor using electric elements
    • G11C11/22Digital stores characterised by the use of particular electric or magnetic storage elements; Storage elements therefor using electric elements using ferroelectric elements
    • G11C11/223Digital stores characterised by the use of particular electric or magnetic storage elements; Storage elements therefor using electric elements using ferroelectric elements using MOS with ferroelectric gate insulating film
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C11/00Digital stores characterised by the use of particular electric or magnetic storage elements; Storage elements therefor
    • G11C11/21Digital stores characterised by the use of particular electric or magnetic storage elements; Storage elements therefor using electric elements
    • G11C11/22Digital stores characterised by the use of particular electric or magnetic storage elements; Storage elements therefor using electric elements using ferroelectric elements
    • G11C11/225Auxiliary circuits
    • G11C11/2253Address circuits or decoders
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C11/00Digital stores characterised by the use of particular electric or magnetic storage elements; Storage elements therefor
    • G11C11/21Digital stores characterised by the use of particular electric or magnetic storage elements; Storage elements therefor using electric elements
    • G11C11/22Digital stores characterised by the use of particular electric or magnetic storage elements; Storage elements therefor using electric elements using ferroelectric elements
    • G11C11/225Auxiliary circuits
    • G11C11/2275Writing or programming circuits or methods
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Semiconductor Memories (AREA)

Abstract

The invention provides a method for realizing an analog and multi-value content addressable memory based on a pulse width modulation ferroelectric field effect transistor, belonging to the technical field of novel storage and calculation. Compared with the traditional implementation modes of ACAM and MCAM, the invention does not need additional transistors and complex analog peripheral circuits, can be realized by only sharing a few logic gates, has a simpler implementation mode and lower hardware cost, only needs one voltage amplitude VDD in the search process, and is more compatible with a digital system. In addition, the encoding mode of the search query and the memory entry based on pulse width ensures that the memory range is not limited by the memory window of the FeFET, and MCAM with more bits can be obtained, so that the memory density and the search energy efficiency of the CAM are further improved, and the method has wide application prospect.

Description

基于脉冲宽度调制铁电场效应晶体管实现模拟和多值内容可寻址存储器的方法A method for implementing analog and multi-valued content-addressable memory based on pulse-width modulated ferroelectric field-effect transistors

技术领域technical field

本发明涉及新型存储与计算技术领域,具体涉及一种基于脉冲宽度调制铁电场效应晶体管的模拟和多值内容可寻址存储器设计。The invention relates to the technical field of novel storage and computing, in particular to an analog and multi-valued content addressable memory design based on a pulse width modulation ferroelectric field effect transistor.

背景技术Background technique

内容可寻址存储器(CAM)可以高效且并行地执行搜索操作,被广泛应用在路由器、数据库搜索、存内计算以及神经形态计算等高效机器学习模型中。CAM是一种用于并行搜索的特殊类型的存储器,其除了具有常规存储器的读操作和写操作之外,还可以执行独特的搜索操作。CAM最初是用于加速网络路由器中的数据包转发和分类等相关的查表操作,由于CAM可以在一个时钟周期内完成整个搜索操作,相较于其他基于硬件或软件的搜索系统具有显著的加速效果,在CAM的基础上进一步发展了具有存储掩码“X”状态能力的三态内容可寻址存储器(TCAM),TCAM可以实现精确匹配或者模糊匹配,提高查表效率。在大数据时代,由于CAM可以在一个搜索周期内完成输入向量(query)与所有存储向量(entry)的匹配操作,并根据不匹配程度进行基于距离度量的特征检索,在处理模式匹配、视频与图像处理等边缘端机器学习任务中具有极大的吸引力。Content-addressable memory (CAM) can perform search operations efficiently and in parallel, and is widely used in efficient machine learning models such as routers, database search, in-memory computing, and neuromorphic computing. CAM is a special type of memory for parallel search that can perform unique search operations in addition to the read and write operations of conventional memory. CAM was originally used to accelerate table lookup operations related to packet forwarding and classification in network routers. Since CAM can complete the entire search operation within one clock cycle, it has significant acceleration compared to other hardware or software-based search systems. As a result, on the basis of CAM, a three-state content addressable memory (TCAM) with the ability to store the state of the mask "X" has been further developed. TCAM can realize exact matching or fuzzy matching, and improve the efficiency of table lookup. In the era of big data, since CAM can complete the matching operation between the input vector (query) and all storage vectors (entry) in one search cycle, and perform feature retrieval based on distance measures according to the degree of mismatch, it is very useful in processing pattern matching, video and It is extremely attractive in edge-end machine learning tasks such as image processing.

基于传统静态随机存取存储器(SRAM)的CAM设计占用巨大的单元面积,限制了其对于计算密集型算法映射的存储密度,并且由此带来的较大寄生电容还会进一步增大搜索延时与功耗。基于各种新兴的非易失性存储器,例如电阻式随机存取存储器(RRAM)、相变存储器(PCM)以及铁电场效应晶体管(FeFET)等,设计的CAM具有降低的单元面积以及搜索延时和能耗。此外,进一步利用新兴非易失性存储器的多值存储能力设计的模拟内容可寻址存储器(ACAM),不仅可以通过量化为多值内容可寻址存储器(MCAM)提高CAM的存储密度,还可以基于其独特的范围匹配操作用于决策树、深度随机森林等更加广泛的应用场景。但是目前基于新兴非易失性存储器的ACAM和MCAM设计需要通过额外的硬件开销实现,即通过额外的控制晶体管或者附加外围复杂电路,实现模拟输入与存储范围之间的匹配操作,相应的编程以及搜索方式也更加的复杂。结合上述分析,实现更为紧凑并且操作简洁的ACAM和MCAM设计,具有十分显著的意义。The CAM design based on traditional static random access memory (SRAM) occupies a huge cell area, which limits its storage density for computing-intensive algorithm mapping, and the resulting large parasitic capacitance will further increase the search delay. and power consumption. Based on various emerging non-volatile memories, such as resistive random access memory (RRAM), phase change memory (PCM) and ferroelectric field effect transistor (FeFET), the designed CAM has reduced cell area and search delay and energy consumption. In addition, the analog content addressable memory (ACAM) designed by further utilizing the multi-value storage capability of emerging non-volatile memory can not only improve the storage density of CAM by quantizing it as multi-value content addressable memory (MCAM), but also can Based on its unique range matching operation, it is used in a wider range of application scenarios such as decision trees and deep random forests. However, the current ACAM and MCAM designs based on emerging non-volatile memories need to be implemented through additional hardware overhead, that is, through additional control transistors or additional peripheral complex circuits, to achieve matching operations between analog inputs and storage ranges, corresponding programming and The search method is also more complicated. Combined with the above analysis, it is of great significance to realize a more compact and easy-to-operate ACAM and MCAM design.

发明内容Contents of the invention

针对以上现有技术中存在的问题,本发明提出了一种基于脉冲宽度调制FeFET的ACAM和MCAM设计,相较于目前的ACAM和MCAM设计,不需要额外的控制晶体管以及复杂的外围电路,基于脉冲宽度的搜索query与存储entry的编码方式,使得基于脉冲宽度调制FeFET的ACAM的匹配范围不受FeFET存储窗口的限制,可以量化为更多bit数量的MCAM,提高了CAM的存储密度和搜索能效,并且在搜索过程中仅需要一个电压幅值,与数字系统更容易兼容。Aiming at the problems existing in the above prior art, the present invention proposes a kind of ACAM and MCAM design based on pulse width modulation FeFET. Compared with the current ACAM and MCAM design, no additional control transistor and complicated peripheral circuits are needed, based on The encoding method of pulse width search query and storage entry makes the matching range of ACAM based on pulse width modulation FeFET not limited by the storage window of FeFET, and can be quantized into MCAM with more bits, which improves the storage density and search energy efficiency of CAM , and only one voltage amplitude is needed in the search process, which is more compatible with digital systems.

本发明的技术方案如下:Technical scheme of the present invention is as follows:

一种基于脉冲宽度调制FeFET实现模拟内容可寻址存储器ACAM的方法,其特征在于,包括由CAM单元构成的CAM阵列,每个CAM单元由两个N型FeFET组成,所述FeFET的漏极连接两条匹配线ML和MLb,所述FeFET的栅极连接两条搜索线SL和SLb,所述FeFET的源极接地,在CAM阵列中,一行CAM单元共享两条匹配线ML和MLb,ML和MLb作为行共用的与门的两个输入,与门的输出为该行的匹配结果Vout;一列CAM单元共享两条搜索线SL和SLb,每一列的搜索信号SL和参考信号Ref作为列共用的异或门的两个输入,异或门的输出为SLb;预充电路用于给每一行的两条匹配线充电;在编程ACAM存储entry的阶段,分别对两个FeFET的栅极施加一定的编程电压,将其编程为一定的阈值电压VTH1和VTH2,当在两个FeFET的栅极施加电压VDD,通过预充电路预充到VDD的ML1和ML2表现出与FeFET阈值电压相关的放电速度,其放电到VDD/2的时间分别为t1和t2,在搜索阶段,先通过预充电路将ML1和ML2预充到VDD,SL端施加具有一定脉冲宽度TSL且幅值为VDD的搜索电压VSL,TSL的大小与搜索query的大小相对应,参考信号Ref的宽度为TRef,则SLb端的信号为脉冲宽度为TRef-TSL且幅值为VDD的搜索电压VSLb,当VSL的脉冲宽度TSL小于其作用的FeFET的放电时间t1,并且VSLb的脉冲宽度TRef-TSL小于其作用的FeFET的放电时间t2,在搜索的过程中,ML1和ML2的电压不会放电到VDD/2之下,与门的输出结果Vout保持高电平,表示匹配,因此可以得到匹配范围是TSL位于TRef-t2到t1之间,通过分别独立地编程两个FeFET,可以获得不同的放电时间t1和t2,进而得到不同的匹配范围。如果VSL的脉冲宽度较大,会使得TSL大于t1,使得VSL作用的FeFET的ML1在搜索阶段降到VDD/2之下,而如果VSL的脉冲宽度较小,会使TSL小于TRef-t2,会使VSLb作用的FeFET的ML2在搜索阶段降到VDD/2之下,两种情况都会使输出结果Vout在搜索阶段变为低电平,表示不匹配,实现ACAM单元的范围匹配操作。A method for realizing analog content addressable memory ACAM based on pulse width modulation FeFET, characterized in that it includes a CAM array composed of CAM cells, each CAM cell is composed of two N-type FeFETs, and the drains of the FeFETs are connected to Two matching lines ML and ML b , the gate of the FeFET is connected to two search lines SL and SL b , the source of the FeFET is grounded, in a CAM array, a row of CAM cells share two matching lines ML and ML b , ML and ML b are used as the two inputs of the AND gate shared by the row, and the output of the AND gate is the matching result Vout of the row; a column of CAM cells shares two search lines SL and SL b , and the search signal SL and reference signal of each column Ref is used as the two inputs of the XOR gate shared by the column, and the output of the XOR gate is SL b ; the precharge circuit is used to charge the two matching lines of each row; in the stage of programming the ACAM storage entry, the two FeFETs are respectively charged A certain programming voltage is applied to the gate, and it is programmed to a certain threshold voltage V TH1 and V TH2 . When the voltage VDD is applied to the gates of the two FeFETs, ML 1 and ML 2 are pre-charged to VDD through the pre-charging circuit. The discharge speed related to the threshold voltage of the FeFET is shown, and the discharge time to VDD/2 is t 1 and t 2 respectively. In the search phase, ML 1 and ML 2 are pre-charged to VDD through the pre-charging circuit first, and the SL terminal is applied with For a search voltage V SL with a certain pulse width T SL and an amplitude of VDD, the size of T SL corresponds to the size of the search query, and the width of the reference signal Ref is T Ref , then the signal at the SLb terminal has a pulse width of T Ref -T SL And the amplitude of the search voltage V SLb is VDD, when the pulse width T SL of V SL is less than the discharge time t 1 of the FeFET it acts on, and the pulse width T Ref -T SL of V SLb is less than the discharge time t of the FeFET it acts on 2. During the search process, the voltage of ML 1 and ML 2 will not be discharged below VDD/2, and the output result Vout of the AND gate remains high, indicating a match, so the matching range can be obtained when T SL is located at T Ref Between -t 2 and t 1 , by programming the two FeFETs independently, different discharge times t 1 and t 2 can be obtained, and thus different matching ranges can be obtained. If the pulse width of V SL is large, it will make T SL larger than t 1 , so that the ML 1 of the FeFET that V SL acts on falls below VDD/2 during the search phase, and if the pulse width of V SL is small, it will make T If SL is less than T Ref -t 2 , the ML 2 of the FeFET that V SLb acts on will drop below VDD/2 during the search phase. In both cases, the output result Vout will become low during the search phase, indicating a mismatch. Implements range matching operations for ACAM cells.

进一步,通过将阵列中的CAM单元编程为多个不重叠的脉冲宽度匹配范围,即可以将模拟内容可寻址存储器ACAM量化为多值内容可寻址存储器MCAM,多个分立的脉冲宽度匹配范围代表MCAM的多级entry状态,与entry相对应的量化的多级搜索query的脉冲宽度可以取存储范围的中间值,则在搜索操作时,根据搜索query施加相应脉冲宽度的VSL,只有当多级搜索query与多级存储entry一致时,VSL的脉冲宽度才处于匹配范围内,使得VSL和VSLb各自作用在FeFET上的时间均小于相应FeFET的放电时间,使得在搜索过程中ML1和ML2均大于VDD/2,使得输出结果Vout为高电平,表示匹配。否则,当多级搜索query与多级存储entry不一致时,会使VSL的脉冲宽度处于匹配范围之外,使得ML1或ML2在搜索阶段降到VDD/2之下,使得输出结果Vout变为低电平,表示不匹配,实现MCAM匹配操作。Further, by programming the CAM cells in the array to multiple non-overlapping pulse width matching ranges, the analog content addressable memory ACAM can be quantized into a multi-valued content addressable memory MCAM, and multiple discrete pulse width matching ranges Represents the multi-level entry state of MCAM, the pulse width of the quantized multi-level search query corresponding to the entry can take the middle value of the storage range, then in the search operation, apply the V SL of the corresponding pulse width according to the search query, only when the multi-level When the level search query is consistent with the multi-level storage entry, the pulse width of V SL is within the matching range, so that the time that V SL and V SLb act on the FeFET is shorter than the discharge time of the corresponding FeFET, so that ML 1 during the search process and ML 2 are greater than VDD/2, so that the output result Vout is a high level, indicating a match. Otherwise, when the multi-level search query is inconsistent with the multi-level storage entry, the pulse width of V SL will be out of the matching range, so that ML 1 or ML 2 will drop below VDD/2 during the search phase, and the output result Vout will become If it is low level, it means that it does not match, and realizes MCAM matching operation.

参考信号Ref的脉冲宽度TRef可以根据编程后FeFET的放电时间进行调整,FeFET可以编程为低阈值状态和高阈值状态,使其在栅极输入VDD电压时分别处于完全导通状态和截止状态,对应的放电时间为非常短的时间以及接近无穷的时间,因此可以通过调制FeFET的阈值电压以及选择适当的Ref,使得基于脉冲宽度调制FeFET的ACAM的存储范围从非常小的脉冲宽度范围到理论上无穷大的脉冲宽度范围,若用于MCAM则可以进一步提高可量化的状态数。The pulse width T Ref of the reference signal Ref can be adjusted according to the discharge time of the FeFET after programming, and the FeFET can be programmed into a low-threshold state and a high-threshold state, so that it is fully on and off when the gate is input with VDD voltage, The corresponding discharge time is very short time and close to infinite time. Therefore, by modulating the threshold voltage of FeFET and selecting an appropriate Ref, the storage range of ACAM based on pulse width modulation FeFET is from a very small pulse width range to a theoretical The infinite pulse width range, if used in MCAM, can further increase the number of quantifiable states.

综上所述,利用脉冲宽度的互补信息实现了基于脉冲宽度调制FeFET的ACAM以及MCAM设计,相较于传统ACAM的实现方式,其在搜索过程中仅需要一个电压幅值VDD,与数字系统兼容,此外,基于脉冲宽度的搜索query与存储entry的编码方式,使其存储范围不受FeFET的存储窗口限制,可以获得更多bit数量的MCAM,进一步提高CAM的密度和搜索能效。In summary, the ACAM and MCAM design based on pulse width modulation FeFET is realized by using the complementary information of pulse width. Compared with the traditional ACAM implementation, it only needs a voltage amplitude VDD in the search process, which is compatible with digital systems , In addition, the pulse width-based encoding method of search query and storage entry makes its storage range not limited by the storage window of FeFET, and can obtain MCAM with more bits, further improving the density and search energy efficiency of CAM.

本发明提出的基于脉冲宽度调制FeFET的ACAM以及MCAM设计,其中的铁电材料需要采用HfO2掺Zr(HZO)、HfO2掺Al(HfAlO)等各类HfO2掺杂型多畴铁电材料,器件栅叠层可以基于MFMIS、MFIS、MFS等多种结构。The ACAM and MCAM design based on the pulse width modulation FeFET proposed by the present invention, the ferroelectric material needs to adopt various HfO doped multi-domain ferroelectric materials such as HfO doped with Zr (HZO), HfO doped with Al (HfAlO), etc. , The device gate stack can be based on various structures such as MFMIS, MFIS, and MFS.

本发明的技术效果如下:Technical effect of the present invention is as follows:

1、本发明提出的基于脉冲宽度调制FeFET的ACAM设计,利用脉冲宽度的互补信息实现了基于脉冲宽度调制FeFET的ACAM,相较于传统ACAM的实现方式,在单元内不需要额外的晶体管以及复杂的模拟外围电路,仅需要可以共用的几个逻辑门即可实现,具有更加简洁的实现方式与更低的硬件开销,并且搜索过程中仅需要一个电压幅值VDD,更易与数字系统兼容。1. The ACAM design based on the pulse width modulation FeFET proposed by the present invention uses the complementary information of the pulse width to realize the ACAM based on the pulse width modulation FeFET. Compared with the traditional ACAM implementation, no additional transistors and complex The analog peripheral circuit can be realized with only a few logic gates that can be shared, which has a more concise implementation method and lower hardware overhead, and only one voltage amplitude VDD is needed during the search process, which is more compatible with digital systems.

2、本发明提出的基于脉冲宽度调制FeFET的MCAM设计,基于脉冲宽度的搜索query与存储entry的编码方式,使其存储范围不受FeFET的存储窗口限制,可以获得更多bit数量的MCAM,以进一步提高CAM的存储密度和搜索能效。2. The MCAM design based on the pulse width modulation FeFET proposed by the present invention, the encoding method of the search query and the storage entry based on the pulse width, makes its storage range not limited by the storage window of the FeFET, and can obtain MCAM with more bits, so as to Further improve the storage density and search energy efficiency of CAM.

附图说明Description of drawings

图1为本发明基于脉冲宽度调制FeFET的CAM单元结构示意图;Fig. 1 is the CAM unit structure schematic diagram based on pulse width modulation FeFET of the present invention;

图2为本发明基于脉冲宽度调制FeFET的ACAM功能实现原理图;图中:A)为每个CAM单元的两个FeFET的ID-VG特性示意图;B)为每个CAM单元的两个FeFET的放电特性示意图;C)为每个CAM单元的搜索信号波形示意图;D)为每个CAM单元的输出结果示意图;Fig. 2 is the ACAM function realization schematic diagram based on pulse width modulation FeFET of the present invention; Among the figure: A) is the ID - V G characteristic schematic diagram of two FeFETs of each CAM unit; B) is two of each CAM unit Schematic diagram of discharge characteristics of FeFET; C) is a schematic diagram of the search signal waveform of each CAM unit; D) is a schematic diagram of the output result of each CAM unit;

图3为本发明基于脉冲宽度调制FeFET的MCAM功能实现原理图。FIG. 3 is a schematic diagram of the realization of the MCAM function based on the pulse width modulation FeFET of the present invention.

具体实施方式Detailed ways

下面结合附图,通过具体实施例,进一步清楚、完整地阐述本发明。The present invention will be further clearly and completely set forth below through specific embodiments in conjunction with the accompanying drawings.

本发明基于脉冲宽度调制FeFET的CAM阵列结构如图1所示,CAM单元由两个N型FeFET(F1和F2)、行共用的二输入异或门、列共用的二输入与门以及预充电路组成,F1和F2的栅极分别连接两条搜索线SL和SLb,SL和参考信号Ref作为异或门的两个输入,异或门的输出为SLb,F1和F2的漏极连接两条匹配线ML1和ML2,ML1和ML2作为与门的两个输入,与门的输出为该行的匹配结果Vout,在搜索操作时,先通过预充电路将两条匹配线ML1和ML2充电到VDD,然后施加搜索电压,若在搜索过程中Vout保持高电平,则表示匹配,否则表示不匹配。The structure of the CAM array based on the pulse width modulation FeFET of the present invention is shown in Figure 1. The CAM unit consists of two N-type FeFETs (F 1 and F 2 ), a row-shared two-input XOR gate, a column-shared two-input AND gate, and The precharge circuit is composed of the gates of F 1 and F 2 connected to two search lines SL and SLb respectively, SL and the reference signal Ref are used as the two inputs of the XOR gate, and the output of the XOR gate is SLb, F 1 and F 2 The drain is connected to two matching lines ML 1 and ML 2 , ML 1 and ML 2 are used as the two inputs of the AND gate, and the output of the AND gate is the matching result Vout of this line. The two matching lines ML 1 and ML 2 are charged to VDD, and then a search voltage is applied. If Vout remains high during the search, it indicates a match, otherwise it indicates a mismatch.

图2为本实施例基于脉冲宽度调制FeFET的ACAM功能实现原理图,在编程ACAM存储entry的阶段,分别对F1和F2的栅极施加大于矫顽电压的编程电压,将其阈值电压编程为VTH1和VTH2,当在F1和F2的栅极施加电压VDD,通过预充电路预充到VDD的ML1和ML2表现出与F1和F2阈值电压相关的放电速度,其放电到VDD/2的时间分别为t1和t2;在搜索阶段,先通过预充电路将ML1和ML2预充到VDD,SL端施加与搜索query相应的脉冲宽度为TSL的搜索电压VSL,参考信号Ref的宽度为TRef,则SLb端的信号为脉冲宽度TSLb为TRef-TSL,只有当VSL的脉冲宽度TSL小于F1的放电时间t1,并且VSLb的脉冲宽度TSLb小于F2的放电时间t2,才可以保证在搜索的过程中,ML1和ML2的电压不会放电到VDD/2之下,与门的输出结果Vout保持高电平,表示匹配,即匹配范围是TSL位于TRef-t2到t1之间。如果VSL的脉冲宽度较大,会使得TSL大于t1,使得ML1在搜索阶段通过F1放电降到VDD/2之下,而如果VSL的脉冲宽度较小,会使TSL小于TRef-t2,会使ML2在搜索阶段通过F2放电降到VDD/2之下,使输出结果Vout在搜索阶段变为低电平,表示不匹配。Figure 2 is a schematic diagram of the implementation of the ACAM function based on pulse width modulation FeFET in this embodiment. In the stage of programming the ACAM storage entry, a programming voltage greater than the coercive voltage is applied to the gates of F1 and F2 respectively, and their threshold voltages are programmed. For V TH1 and V TH2 , when the voltage VDD is applied to the gates of F 1 and F 2 , ML 1 and ML 2 pre-charged to VDD through the pre-charging circuit exhibit a discharge rate related to the threshold voltage of F 1 and F 2 , The time to discharge to VDD/2 is t 1 and t 2 respectively; in the search phase, ML 1 and ML 2 are pre-charged to VDD through the pre-charging circuit first, and the pulse width T SL corresponding to the search query is applied to the SL terminal Search voltage V SL , the width of reference signal Ref is T Ref , then the signal at SLb end is pulse width T SLb is T Ref -T SL , only when the pulse width T SL of V SL is less than the discharge time t 1 of F 1 , and V The pulse width T SLb of SLb is less than the discharge time t 2 of F 2 to ensure that the voltage of ML 1 and ML 2 will not discharge below VDD/2 during the search process, and the output result Vout of the AND gate remains high. Ping means matching, that is, the matching range is that T SL is between T Ref -t 2 and t 1 . If the pulse width of V SL is large, it will make T SL larger than t 1 , making ML 1 discharge through F 1 to drop below VDD/2 during the search phase, and if the pulse width of V SL is small, it will make T SL smaller than T Ref -t 2 , will cause ML 2 to drop below VDD/2 through F 2 during the search phase, and make the output result Vout become low level during the search phase, indicating a mismatch.

如图3所示,通过将上述阵列中的CAM单元编程为多个不重叠的脉冲宽度匹配范围,即可以将ACAM量化为MCAM,多个分立的脉冲宽度匹配范围代表MCAM的多级entry状态,与entry相对应的量化的多级搜索query的脉冲宽度可以取脉冲宽度匹配范围的中间值,以获得更大的检测裕度与鲁棒性。在搜索操作时,根据搜索query施加相应脉冲宽度的VSL,当多级搜索query与多级存储entry一致时,VSL的脉冲宽度处于匹配范围内,使得VSL和VSLb各自作用在FeFET上的时间均小于相应FeFET的放电时间,在搜索过程中ML1和ML2均大于VDD/2,输出结果Vout为高电平,表示匹配;否则,当多级搜索query与多级存储entry不一致时,VSL的脉冲宽度处于匹配范围之外,ML1或ML2在搜索阶段降到VDD/2之下,输出结果Vout变为低电平,表示不匹配。As shown in Figure 3, by programming the CAM units in the above array into multiple non-overlapping pulse width matching ranges, ACAM can be quantized into MCAM, and multiple discrete pulse width matching ranges represent the multi-level entry state of MCAM, The pulse width of the quantized multi-level search query corresponding to the entry can take the middle value of the pulse width matching range to obtain greater detection margin and robustness. During the search operation, V SL of the corresponding pulse width is applied according to the search query. When the multi-level search query is consistent with the multi-level storage entry, the pulse width of V SL is within the matching range, so that V SL and V SLb act on the FeFET respectively. The time is less than the discharge time of the corresponding FeFET. During the search process, ML 1 and ML 2 are both greater than VDD/2, and the output result Vout is high, indicating a match; otherwise, when the multi-level search query is inconsistent with the multi-level storage entry , the pulse width of V SL is outside the matching range, ML 1 or ML 2 drops below VDD/2 during the search phase, and the output result Vout becomes low level, indicating a mismatch.

本实施例完整、详细地阐述了基于脉冲宽度调制FeFET的ACAM和MCAM的实现方式,利用脉冲宽度的互补信息实现ACAM功能,相较于传统ACAM的实现方式,在单元内不需要额外的晶体管以及复杂的模拟外围电路,并且基于脉冲宽度的搜索query与存储entry的编码方式,使其可以量化为更多bit数的MCAM,以提高CAM的存储密度和搜索能效。This embodiment fully and details the implementation of ACAM and MCAM based on pulse width modulation FeFET, using the complementary information of pulse width to realize the ACAM function. Compared with the traditional ACAM implementation, no additional transistors and MCAM are required in the unit. The complex analog peripheral circuit, and the encoding method of search query and storage entry based on pulse width can be quantized into MCAM with more bits to improve the storage density and search energy efficiency of CAM.

最后,需要注意的是,公布实施例的目的在于帮助进一步理解本发明,但是本领域的技术人员可以理解:在不脱离本发明及所附的权利要求的精神和范围内,各种替换和修改都是可能的。因此,本发明不应局限于实施例所公开的内容,本发明要求保护的范围以权利要求书界定的范围为准。Finally, it should be noted that the purpose of the disclosed embodiments is to help further understand the present invention, but those skilled in the art can understand that various replacements and modifications can be made without departing from the spirit and scope of the present invention and the appended claims. It's all possible. Therefore, the present invention should not be limited to the content disclosed in the embodiments, and the protection scope of the present invention is subject to the scope defined in the claims.

Claims (4)

1. Realizing analog content addressable memoryA method for storing ACAM is characterized by comprising a CAM array composed of CAM cells, each CAM cell is composed of two N-type FeFETs, and the drain electrode of each FeFET is connected with two matching lines ML and ML b The grid of the FeFET is connected with two search lines SL and SL b The source of the FeFET is grounded, and in the CAM array, a row of CAM cells shares two match lines ML and ML b ML and ML b As two inputs of an and gate shared by the rows, the output of the and gate is the matching result Vout of the row; one column of CAM cells shares two search lines SL and SL b The search signal SL and the reference signal Ref of each column are taken as two inputs of an exclusive OR gate shared by the columns, and the output of the exclusive OR gate is SL b The method comprises the steps of carrying out a first treatment on the surface of the The precharge circuit is used for charging the two match lines of each row; in the programming of ACAM memory entry, a certain programming voltage is applied to the gates of two FeFETs of a CAM cell, and the two FEFETs are programmed to a certain threshold voltage V TH1 And V TH2 When the voltage VDD is applied to the gates of the two FeFETs, ML is precharged to VDD by a precharge circuit 1 And ML (ML) 2 Exhibits a discharge rate related to the FeFET threshold voltage, and discharges to VDD/2 for a time t, respectively 1 And t 2 In the searching stage, ML is first performed by a pre-charge circuit 1 And ML (ML) 2 Pre-charge to VDD, SL end application with a certain pulse width T SL And the search voltage V with the amplitude of VDD SL ,T SL Corresponding to the size of the search query, the reference signal Ref has a width T Ref The signal at SLb end is pulse width T Ref -T SL And the search voltage V with the amplitude of VDD SLb When V SL Pulse width T of (2) SL Less than the discharge time t of the FeFET it acts on 1 And V is SLb Pulse width T of (2) Ref -T SL Less than the discharge time t of the FeFET it acts on 2 During the search, ML 1 And ML (ML) 2 The voltage of (2) will not discharge below VDD/2 and the output of AND gate Vout will remain high, indicating a match, resulting in a match range of T SL At T Ref -t 2 To t 1 Between them; if V is SL Is large in pulse width, T SL Greater than t 1 So that V SL ML of acting FeFET 1 Falling below VDD/2 during the search phase, or if V SL Is small in pulse width, T SL Less than T Ref -t 2 ,V SLb ML of acting FeFET 2 When the search stage falls below VDD/2, the output result Vout becomes low level in the search stage, which indicates no match, and ACAM matching operation is realized.
2. A method for realizing multi-value content addressable memory (ACAM) is characterized in that the ACAM as claimed in claim 1 is adopted to program CAM cells in an array into a plurality of non-overlapping pulse width matching ranges, the plurality of discrete pulse width matching ranges represent multi-stage entry states of MCAM, and V with corresponding pulse width is applied according to search query during search operation SL When the multi-level search query is consistent with the multi-level storage entry, V SL Is within a matching range such that V SL And V SLb The respective time of action on the FeFETs is less than the discharge time of the corresponding FeFET, ML during search 1 And ML (ML) 2 Are all larger than VDD/2, and the output result Vout is high level and indicates matching; otherwise, when the multi-level search query is inconsistent with the multi-level storage entry, V SL Is outside the matching range, ML 1 Or ML (ML) 2 In the search stage, the output result Vout goes low, indicating a mismatch, and the MCAM matching operation is implemented.
3. The method of claim 1 or 2, wherein the FeFET is a MFMIS, MFIS, MFS-based device.
4. The method of claim 3, wherein the ferroelectric material of the FeFET is HfO 2 Zr-doped or HfO-doped 2 Al is doped.
CN202310129169.6A 2023-02-08 2023-02-08 Method for realizing analog and multi-value content addressable memory based on pulse width modulation ferroelectric field effect transistor Pending CN116110450A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310129169.6A CN116110450A (en) 2023-02-08 2023-02-08 Method for realizing analog and multi-value content addressable memory based on pulse width modulation ferroelectric field effect transistor

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310129169.6A CN116110450A (en) 2023-02-08 2023-02-08 Method for realizing analog and multi-value content addressable memory based on pulse width modulation ferroelectric field effect transistor

Publications (1)

Publication Number Publication Date
CN116110450A true CN116110450A (en) 2023-05-12

Family

ID=86254072

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310129169.6A Pending CN116110450A (en) 2023-02-08 2023-02-08 Method for realizing analog and multi-value content addressable memory based on pulse width modulation ferroelectric field effect transistor

Country Status (1)

Country Link
CN (1) CN116110450A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2024244312A1 (en) * 2023-05-26 2024-12-05 北京超弦存储器研究院 Method for implementing universal content addressable memory on basis of n-type and p-type ferroelectric field effect transistors

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2024244312A1 (en) * 2023-05-26 2024-12-05 北京超弦存储器研究院 Method for implementing universal content addressable memory on basis of n-type and p-type ferroelectric field effect transistors

Similar Documents

Publication Publication Date Title
Aziz et al. Computing with ferroelectric FETs: Devices, models, systems, and applications
Chen et al. Design and optimization of FeFET-based crossbars for binary convolution neural networks
Hu et al. In-memory computing with associative memories: A cross-layer perspective
CN111126579B (en) In-memory computing device suitable for binary convolutional neural network computation
CN113053434B (en) High-energy-efficiency TCAM based on FeFET structure and operation method thereof
Yang et al. An ultra-high-density and energy-efficient content addressable memory design based on 3D-NAND flash
CN116110450A (en) Method for realizing analog and multi-value content addressable memory based on pulse width modulation ferroelectric field effect transistor
US4799192A (en) Three-transistor content addressable memory
US11587623B2 (en) Content-addressable memory and operation method thereof
CN114758695A (en) A method for realizing multi-value content addressable memory MCAM based on ferroelectric tunneling field effect transistor FeTFET
WO2024244312A1 (en) Method for implementing universal content addressable memory on basis of n-type and p-type ferroelectric field effect transistors
Reis et al. In-memory computing accelerators for emerging learning paradigms
Gou et al. 2T1C DRAM based on semiconducting MoS2 and semimetallic graphene for in-memory computing
TWI849433B (en) Computing device, memory controller, and method for performing an in-memory computation
Tseng et al. A hybrid in-memory-searching and in-memory-computing architecture for NVM based AI accelerator
Sheikholeslami et al. A multiple-valued ferroelectric content-addressable memory
Reis et al. Exploiting fefets via cross-layer design from in-memory computing circuits to meta-learning applications
CN114898792A (en) Multi-bit memory inner product and exclusive OR unit, exclusive OR vector and operation method
Tseng et al. An analog in-memory-search solution based on 3D-NAND flash memory for brain-inspired computing
US20230027768A1 (en) Neural network computing device and computing method thereof
TW202036389A (en) Computing cell for performing xnor operation, neural network and method for performing digital xnor operation
TWI844013B (en) Content addressable memory device, content addressable memory cell and method for data searching and comparing thereof
EP4459621A1 (en) Architecture for computational memory and memory systems
US11875850B2 (en) Content addressable memory device, content addressable memory cell and method for data searching with a range or single-bit data
US12224008B2 (en) Non-volatile static random access memory

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination