WO2021056677A1 - Dual-phase coefficient adjustable analog multiplication calculation circuit for convolutional neural network - Google Patents

Dual-phase coefficient adjustable analog multiplication calculation circuit for convolutional neural network Download PDF

Info

Publication number
WO2021056677A1
WO2021056677A1 PCT/CN2019/114107 CN2019114107W WO2021056677A1 WO 2021056677 A1 WO2021056677 A1 WO 2021056677A1 CN 2019114107 W CN2019114107 W CN 2019114107W WO 2021056677 A1 WO2021056677 A1 WO 2021056677A1
Authority
WO
WIPO (PCT)
Prior art keywords
multiplication
analog
input
digital
phase coefficient
Prior art date
Application number
PCT/CN2019/114107
Other languages
French (fr)
Chinese (zh)
Inventor
刘波
沈泽昱
孙煜昊
黄乐朋
朱文涛
杨军
Original Assignee
东南大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 东南大学 filed Critical 东南大学
Publication of WO2021056677A1 publication Critical patent/WO2021056677A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/38Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
    • G06F7/48Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
    • G06F7/52Multiplying; Dividing
    • G06F7/523Multiplying only
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M1/00Analogue/digital conversion; Digital/analogue conversion
    • H03M1/66Digital/analogue converters

Definitions

  • the invention discloses a dual-phase coefficient adjustable analog multiplication calculation circuit oriented to a convolutional neural network, which relates to digital-analog hybrid integrated circuit technology and belongs to the technical field of calculation, calculation and counting.
  • analog multipliers instead of digital multipliers.
  • the traditional analog multiplier does not make full use of the scaling ratio of the CMOS process.
  • the design of the multiplication circuit is more difficult.
  • the digital-to-analog conversion circuit requires higher requirements. Therefore, the advantages of analog circuits cannot be fully utilized, and the computational power consumption that can be reduced is very limited.
  • the present invention provides a neural network-oriented two-phase coefficient adjustable analog multiplication circuit, which can convert the digital signal in the multiplication operation into For analog signals, the discrete-time circuit design is used to design analog multiplication calculation circuits, which can reduce the calculation power consumption of irregular network layers and achieve high linearity robustness.
  • the design of dual-phase coefficient switching circuits can have a wide frequency response tuning range.
  • a neural network-oriented dual-phase coefficient adjustable analog multiplier includes a current network digital-to-analog conversion module, a dual-phase coefficient adjustable analog multiplication array, a pipelined analog-to-digital conversion module and calculation Unit control module.
  • the current-type network digital-to-analog conversion module converts the characteristic data read from the storage module into an analog voltage, and uses the analog voltage as the input voltage of the two-phase coefficient adjustable analog multiplication array.
  • the calculation unit control module reads the weight data from the storage module, and controls the switching state of the analog multiplication unit in the dual-phase coefficient adjustable analog multiplication array in combination with the size of the convolution kernel, and completes the setting of the coefficient and the working state.
  • the two-phase coefficient adjustable analog multiplication array is an array composed of analog multiplication units, which is used to realize the multiplication operation of various network layers in the neural network.
  • the pipelined analog-to-digital conversion module converts the output voltage of the two-phase coefficient adjustable analog multiplication array into a digital signal. Finally, the digital signal output by the pipelined analog-to-digital conversion module is stored in the storage module.
  • This patent proposes a two-phase coefficient adjustable analog multiplication calculation circuit for convolutional neural networks, which uses discrete-time circuit structure to realize the multiplication calculation of the neural network layer, and adds a signed multiplier design to provide forward control And negative control can realize multiplication with sign bit and provide a wider range of voltage amplitude.
  • the dual-phase coefficient adjustable analog multiplication array is composed of multiple coefficient adjustable analog multiplication circuit units, and each coefficient adjustable analog multiplication circuit unit is composed of a dual-phase sample and hold buffer circuit and a switch-controlled analog multiplication circuit.
  • the sample-and-hold buffer circuit converts the input analog voltage into a signed multiplier, and adjusts the multiplication coefficient by controlling the switch circuit structure in the analog multiplier circuit, so that the analog voltage representing the signed multiplier is the input voltage of the analog multiplier,
  • the analog voltage output by the analog multiplication circuit can be superimposed to achieve a multiplication with a coefficient between 0 and 1 and a precision of 2-6 . Low power consumption and high linearity can also be maintained at low power supply voltages.
  • the current-type network digital-to-analog conversion module precharges the output signal terminal to the analog voltage proportional to the input value, so as to achieve the purpose of digital-to-analog conversion. It has good linearity and mismatch, and uses multiplexing.
  • the device generates an input pulse for each data, reducing area overhead and signal routing.
  • the pipeline analog-to-digital conversion module adopts a parallel structure, which can process multiple sampled data at the same time.
  • the signal processing speed is high, while maintaining high precision, it requires low power consumption, and has good linearity and low power consumption. Offset characteristics, so it can achieve high-speed and high-resolution conversion.
  • Figure 1 is a schematic diagram of the overall architecture of the present invention.
  • Figure 2 is the current-type network digital-to-analog conversion module of the present invention.
  • FIG. 3 shows the analog multiplication unit and its two-phase circuit structure of the present invention.
  • Figure 4 shows the pipelined analog-to-digital conversion module of the present invention.
  • the two-phase coefficient adjustable analog multiplication calculation circuit for convolutional neural networks performs deep convolution, point-by-point convolution, activation layer, pooling layer and batch normalization in neural networks under the control and scheduling between internal modules Multiplication in the transformation layer.
  • the neural network-oriented dual-phase coefficient adjustable analog multiplier includes: current network digital-to-analog conversion module, dual-phase coefficient adjustable analog multiplication array, pipelined analog-to-digital conversion module, and calculation unit control module.
  • the current-mode network digital-to-analog conversion module is composed of an input pulse generation module and a cascaded PMOS constant current source.
  • the input pulse generation module is composed of an 8:1 multiplexer with 8 timing signals. The purpose is to generate an input pulse for each input value while reducing area overhead and signal routing. And can design multiplexers according to different precision requirements, such as 6:1 or 10:1 multiplexers.
  • the constant current source cascaded PMOS three PMOS transistors (M P1, M P2, M P3) and an NMOS transistor M N (negative channel metal oxide semiconductor field-effect) transistors.
  • the charging current time of the output signal terminal is proportional to the input value.
  • This digital-to-analog conversion module architecture has better linearity and mismatch than the binary weighted PMOS charging digital-to-analog conversion module.
  • the pulse width of the timing signal usually has a smaller change, so it has good stability.
  • the input characteristic data is read from the storage module, and the input characteristic data enters the first-in-first-out memory of the current-type network digital-to-analog conversion module.
  • the input data is 6bit
  • the 3 most significant bits of the data are used to select the first half of the charging pulse width, and the 3 least significant bits of the input data are used to determine the second half of the charging pulse width, and then the charging pulse is input to the current-type network digital-to-analog conversion module to convert it into an analog voltage.
  • the analog voltage output by the current network digital-to-analog conversion module is transferred to the two-phase coefficient adjustable analog multiplication calculation array as the input voltage.
  • the weight data is read from the storage module, and the weight data enters the calculation unit control module.
  • the calculation unit control module combines the size of the convolution kernel and the weight data to configure each analog multiplication calculation unit with a 2-bit control signal and a 6-bit multiplication coefficient value.
  • the 2-bit control signal controls the switch 7 and the switch 8 of each analog multiplication calculation unit to realize the forward control or the reverse control of the input signal
  • the 6-bit multiplication coefficient value controls the switches 1 to 6 to realize the adjustment of the multiplication coefficient value.
  • the size of the convolution kernel is 3 ⁇ 3. At this time, the coefficient-adjustable analog multiplication array will mobilize 3 ⁇ 3 computing units in the array to complete the task.
  • the analog multiplication circuit array with adjustable coefficients can perform 4 sets of arithmetic operations at the same time.
  • the analog multiplication circuit array with adjustable coefficients can perform 16 sets of arithmetic operations at the same time.
  • the analog multiplication circuit array with adjustable coefficients can simultaneously perform a set of arithmetic operations.
  • the size of the convolution kernel is N ⁇ N, and N is greater than 4, multiple analog multiplication circuit arrays with adjustable coefficients can be used for parallel calculation.
  • the dual-phase coefficient adjustable analog multiplication array is composed of 4 by 4, a total of 16 coefficient adjustable analog multiplication circuit units.
  • Each coefficient adjustable analog multiplication circuit unit is composed of a dual-phase sample and hold buffer circuit and a switch-controlled analog multiplication circuit.
  • the dual-phase sample and hold buffer circuit is composed of a common source amplifier.
  • To control the input of the signed multiplier use the signed multiplier as the input signal of the switch-controlled analog multiplication circuit, and combine with the adjustment of the multiplication coefficient to realize the multiplication with the coefficient between 0 and 1 and the accuracy of 2-6 .
  • low power consumption and high linearity can be maintained under low power supply voltage.
  • the analog multiplication calculation unit As shown in Figure 3, after the input voltage of the adjustable analog multiplication calculation array enters the analog multiplication calculation unit, it is positively controlled when the switch 7 is closed and the switch 8 is open.
  • the analog multiplication calculation unit is closed when the switch 7 is open and the switch 8 is closed.
  • the analog multiplication calculation unit stops when the switches 7 and 8 are both open.
  • the input voltage is stabilized by the sample and hold buffer circuit. After the input voltage is stabilized, it is used as the input voltage of the six parallel switch branches. There are 6 branches in total from switch 1 to switch 6, and the switch on each branch is connected in series with a capacitor corresponding to the value of the multiplication coefficient of one bit.
  • Switch 1 is connected in series with a 10fF capacitor
  • switch 2 is connected in series with a 30fF capacitor
  • switch 3 is connected in series with a 40fF capacitor
  • switch 4 is connected in series with a 10fF capacitor
  • switch 5 is connected in series with a 20fF capacitor
  • switch 6 is connected in series with a 40fF capacitor.
  • the opening and closing of switches 1 to 6 are controlled by the control module of the calculation unit. Switches 1 to 6 respectively correspond to the lowest to highest position of the 6-bit coefficient value. If the corresponding bit is 1, the switch is closed and the corresponding capacitor is charged; if If the corresponding bit is 0, the switch is turned on and the corresponding capacitor is discharged.
  • switch 1 branch, switch 2 branch, switch 3 branch and a 10fF capacitor in parallel to main branch 1 connect switch 4 branch, switch 5 branch, and switch 6 branch in parallel to main branch 2, then
  • the main branch 1, an 800/7fF capacitor, and the main branch 2 will be connected in series in sequence.
  • the terminal voltage of the main branch 2 is the output voltage.
  • the output voltage will increase by 8/569 (about 1/64) of the input voltage; if switch 2 is closed, the output voltage will increase by 24/569 (about 1/32) of the input voltage; if switch 3 is closed , The output voltage will increase by about 32/569 (1/16) of the input voltage; if the switch 4 is closed, the output voltage will increase by 72/575 (about 1/8) of the input voltage; if the switch 5 is closed, the output voltage will increase There is an increase of 144/575 (about 1/4) of the input voltage; if the switch 6 is closed, the output voltage will have an increase of 288/575 (about 1/2) of the input voltage. It can be seen that the analog multiplication unit uses a discrete-time switched capacitor circuit.
  • the adjustable high-order narrow bandwidth programmable filter is realized. Then the digital circuit controls the closing of the 6 switches, and the support generated when the capacitor is charged on the switch circuit is controlled.
  • the output voltage of the analog multiplication unit can be obtained by superposing the circuit voltage.
  • the pipelined analog-to-digital converter can achieve high-speed and high-resolution conversion, and meet the requirements of low power consumption and small area chip design.
  • the pipelined analog-to-digital converter is mainly composed of multiple cascaded circuits, and each stage includes a Sample/Hold (S/H) amplifier, low-precision ADC, DAC, and summing circuit.
  • S/H Sample/Hold
  • the input analog quantity is converted into a 3bit digital quantity by a 3-bit coarse-precision ADC, which is the high 3 bits of the output data, and the 3bit digital quantity is converted into an analog quantity by the DAC.
  • the S/H amplifier samples the 3bit digital quantity output by the ADC and performs summation or difference operation with the analog quantity converted by the DAC, thereby deleting the analog signal corresponding to the 3bit digital quantity from the input signal, and the difference is obtained by amplifying Output the low 3 digits of the data and send the low 3 digits of the output data to the next-level circuit for processing. According to the accuracy requirements of the digital quantity, the participating signals are converted and finally high-precision n-bit output data is obtained.
  • Pipeline ADCs need to use digital error correction technology to reduce the accuracy requirements of the internal comparator. If the upper-level comparator has a large offset and the input voltage is at the comparison point, it will generate an incorrect output value, resulting in a difference The difference.
  • pipeline ADC is a parallel structure, can process multiple sampled data at the same time, high signal processing speed, low power consumption while maintaining high precision, and has good linearity and Low offset characteristics, so it can achieve high-speed and high-resolution conversion.
  • Step 1 Input characteristic data and read it from the storage module.
  • the characteristic data enters the first-in-first-out memory of the current-type network digital-to-analog conversion module.
  • the digital-to-analog conversion module converts the characteristic data into an analog voltage and transmits it to the dual-phase coefficient adjustable analog Multiply the calculation array as the input voltage.
  • Step 2 The weight data is read from the storage module, and the weight data enters the calculation unit control module.
  • the calculation unit control module combines the size of the convolution kernel and the weight data to control the eight switches in each analog multiplication calculation unit, and set the multiplication coefficients And the working mode of the two-phase symbol selector (positive control, negative control, stop).
  • Step 3 The input voltage of the analog multiplication calculation unit passes through the two-phase sign selector in positive control or negative control mode to complete the sign bit operation; then, the signed multiplier passes through the sample and hold buffer circuit to maintain the input voltage value. To prevent its attenuation from affecting the calculation results, at the same time, switches 1 to 6 are switched to the closed or open state according to the six-bit digital signal of the coefficient value. The capacitor on the branch where the closed switch is located is charged, and the capacitor on the branch where the open switch is located is charged. Discharge, when the charge and discharge process is over, the multiplication calculation result can be determined according to the difference of the capacitance and the circuit structure.
  • the contribution of the capacitance of different switch branches to the output voltage is different, and the contribution of switch 1 to the output voltage is about 1/
  • the contribution of switch 2 to the output voltage is about 1/32 of the input voltage
  • the contribution of switch 3 to the output voltage is about 1/16 of the input voltage
  • the contribution of switch 4 to the output voltage is about 1/8 Input voltage
  • the contribution of switch 5 to the output voltage is about 1/4 of the input voltage
  • the contribution of switch 6 to the output voltage is about 1/2 of the input voltage
  • Step 4 The output voltage of the analog multiplier is finally transferred to the pipeline analog-to-digital converter to obtain the output value, which is stored in the memory and waits for the next read instruction.

Abstract

A dual-phase coefficient adjustable analog multiplication circuit for a convolutional neural network, which relates to the technical fields of calculating, estimating and counting. The multiplication calculation circuit comprises a current-type network digital-to-analog conversion module, a dual-phase coefficient adjustable analog multiplication array, a pipeline-type analog-to-digital conversion module and a calculation unit control module. Multiplication calculation of a neural network layer is achieved by using a discrete time circuit structure, and a signed multiplier design is newly added to provide positive and negative control, signed multiplication may be achieved and a wider range of voltage amplitude is provided.

Description

一种面向卷积神经网络的双相系数可调模拟乘法计算电路A dual-phase coefficient adjustable analog multiplication calculation circuit oriented to convolutional neural network 技术领域Technical field
本发明公开了一种面向卷积神经网络的双相系数可调模拟乘法计算电路,涉及数模混合集成电路技术,属于计算、推算、计数的技术领域。The invention discloses a dual-phase coefficient adjustable analog multiplication calculation circuit oriented to a convolutional neural network, which relates to digital-analog hybrid integrated circuit technology and belongs to the technical field of calculation, calculation and counting.
背景技术Background technique
当今卷积神经网络的卷积层已经存在很多较好的优化设计,在功耗、面积和能效方面效果显著。例如,在数据存储方面运用了量化、压缩等方法,实现了卷积神经网络的二值化;在计算电路方面采用异或非门作为近似乘法器进行卷积运算等。因此,在缩减网络层以及数字电路领域对卷积层的进一步优化设计已经到达了瓶颈期,卷积神经网络的计算量和参数量大,对硬件加速器的要求较高,想要进一步优化,降低卷积运算的功耗,节省电路成本,有很多研究提出了将模拟电路与数字电路结合在一块芯片中的方案。比如用模拟乘法器替代数字乘法器。但与数字乘法器相比,传统模拟乘法器并未充分利用CMOS工艺的缩放比例,在实际中乘法电路设计较为困难,为了达到可靠的精度,对数模转换电路要求较高。因此无法充分利用模拟电路的优势,能降低的计算功耗十分有限。There are many better optimized designs for the convolutional layer of today's convolutional neural networks, which have significant effects in terms of power consumption, area and energy efficiency. For example, in the aspect of data storage, methods such as quantization and compression are used to realize the binarization of the convolutional neural network; in the calculation circuit, the exclusive NOR gate is used as an approximate multiplier for convolution operations. Therefore, the further optimization design of the convolutional layer in the field of reducing the network layer and the digital circuit has reached the bottleneck period. The calculation volume and parameter volume of the convolutional neural network are large, and the requirements for hardware accelerators are high. I want to further optimize and reduce The power consumption of convolution operation saves circuit cost. Many studies have proposed a scheme of combining analog circuits and digital circuits in a chip. For example, use analog multipliers instead of digital multipliers. However, compared with the digital multiplier, the traditional analog multiplier does not make full use of the scaling ratio of the CMOS process. In practice, the design of the multiplication circuit is more difficult. In order to achieve reliable accuracy, the digital-to-analog conversion circuit requires higher requirements. Therefore, the advantages of analog circuits cannot be fully utilized, and the computational power consumption that can be reduced is very limited.
发明内容Summary of the invention
为了解决现有神经网络卷积层的进一步优化设计已经到达了瓶颈期的问题,本发明提供了一种面向神经网络的双相系数可调模拟乘法电路,能将乘法运算中的数字信号转换为模拟信号,采用了离散时间电路的方案设计模拟乘法计算电路,可以降低非规则网络层的计算功耗,实现高线性度的稳健性,双相系数开关电路的设计可以宽频率响应调谐范围。In order to solve the problem that the further optimization design of the existing neural network convolutional layer has reached the bottleneck period, the present invention provides a neural network-oriented two-phase coefficient adjustable analog multiplication circuit, which can convert the digital signal in the multiplication operation into For analog signals, the discrete-time circuit design is used to design analog multiplication calculation circuits, which can reduce the calculation power consumption of irregular network layers and achieve high linearity robustness. The design of dual-phase coefficient switching circuits can have a wide frequency response tuning range.
本发明为实现上述发明目的采用如下技术方案:面向神经网络的双相系数可调模拟乘法器包括电流型网络数模转换模块、双相系数可调模拟乘法阵列、流水线型模数转换模块和计算单元控制模块。电流型网络数模转换模块将从存储模块中读取的特征数据转换为模拟电压,将模拟电压作为双相系数可调模拟乘法阵列的输入电压。计算单元控制模块从存储模块中读取权重数据,结合卷积核大小控制双相系数可调模拟乘法阵列中模拟乘法单元的开关状态,完成系数和工作状态的设置。双相系数可调模拟乘法阵列为由模拟乘法单元组成的阵列,用于实现神经网络中各种网络层的乘法运算。流水线型模数转换模块将双相系数可调模拟乘法阵列的输出电压转换为数字信号。最 后将流水线型模数转换模块输出的数字信号存储在存储模块中。The present invention adopts the following technical solutions to achieve the above-mentioned purpose of the invention: A neural network-oriented dual-phase coefficient adjustable analog multiplier includes a current network digital-to-analog conversion module, a dual-phase coefficient adjustable analog multiplication array, a pipelined analog-to-digital conversion module and calculation Unit control module. The current-type network digital-to-analog conversion module converts the characteristic data read from the storage module into an analog voltage, and uses the analog voltage as the input voltage of the two-phase coefficient adjustable analog multiplication array. The calculation unit control module reads the weight data from the storage module, and controls the switching state of the analog multiplication unit in the dual-phase coefficient adjustable analog multiplication array in combination with the size of the convolution kernel, and completes the setting of the coefficient and the working state. The two-phase coefficient adjustable analog multiplication array is an array composed of analog multiplication units, which is used to realize the multiplication operation of various network layers in the neural network. The pipelined analog-to-digital conversion module converts the output voltage of the two-phase coefficient adjustable analog multiplication array into a digital signal. Finally, the digital signal output by the pipelined analog-to-digital conversion module is stored in the storage module.
本发明采用上述技术方案,具有以下有益效果:The present invention adopts the above technical scheme and has the following beneficial effects:
(1)本专利提出的一种面向卷积神经网络的双相系数可调模拟乘法计算电路采用离散时间电路结构实现神经网络层的乘法计算,并新增了有符号乘数设计提供正向控制和负向控制,能实现带符号位的乘法,提供更宽范围的电压幅度。(1) This patent proposes a two-phase coefficient adjustable analog multiplication calculation circuit for convolutional neural networks, which uses discrete-time circuit structure to realize the multiplication calculation of the neural network layer, and adds a signed multiplier design to provide forward control And negative control can realize multiplication with sign bit and provide a wider range of voltage amplitude.
(2)双相系数可调模拟乘法阵列多个系数可调模拟乘法电路单元构成,每个系数可调模拟乘法电路单元由双相采样和保持缓冲电路和开关控制的模拟乘法电路组成,双相采样和保持缓冲电路将输入的模拟电压转换为有符号的乘数,通过控制模拟乘法电路中的开关切换电路结构调节乘法系数,以表征有符号乘数的模拟电压为模拟乘法电路的输入电压,叠加模拟乘法电路输出的模拟电压即可实现系数在0到1之间、精度为2 -6的乘法,在低电源电压下也能保持低功耗和高线性度。 (2) The dual-phase coefficient adjustable analog multiplication array is composed of multiple coefficient adjustable analog multiplication circuit units, and each coefficient adjustable analog multiplication circuit unit is composed of a dual-phase sample and hold buffer circuit and a switch-controlled analog multiplication circuit. The sample-and-hold buffer circuit converts the input analog voltage into a signed multiplier, and adjusts the multiplication coefficient by controlling the switch circuit structure in the analog multiplier circuit, so that the analog voltage representing the signed multiplier is the input voltage of the analog multiplier, The analog voltage output by the analog multiplication circuit can be superimposed to achieve a multiplication with a coefficient between 0 and 1 and a precision of 2-6 . Low power consumption and high linearity can also be maintained at low power supply voltages.
(3)电流型网络数模转换模块将输出信号端预充电到模拟电压与输入数值成比例,以此达到数模转换的目的,具有很好的线性度和失配度,采用多路复用器为每个数据生成一个输入脉冲,减少面积开销和信号路由。(3) The current-type network digital-to-analog conversion module precharges the output signal terminal to the analog voltage proportional to the input value, so as to achieve the purpose of digital-to-analog conversion. It has good linearity and mismatch, and uses multiplexing. The device generates an input pulse for each data, reducing area overhead and signal routing.
(4)流水型模数转换模块采用的是并行结构,可以同时对多个采样数据进行处理,信号处理速度高,在保持高精度的同时所需功耗低,并具有良好的线性度和低失调特点,因此能够实现高速及高分辨率的转换。(4) The pipeline analog-to-digital conversion module adopts a parallel structure, which can process multiple sampled data at the same time. The signal processing speed is high, while maintaining high precision, it requires low power consumption, and has good linearity and low power consumption. Offset characteristics, so it can achieve high-speed and high-resolution conversion.
附图说明Description of the drawings
图1为本发明的整体架构示意图。Figure 1 is a schematic diagram of the overall architecture of the present invention.
图2为本发明的电流型网络数模转换模块。Figure 2 is the current-type network digital-to-analog conversion module of the present invention.
图3为本发明的模拟乘法单元及其双相电路结构。Figure 3 shows the analog multiplication unit and its two-phase circuit structure of the present invention.
图4为本发明的流水线型模数转换模块。Figure 4 shows the pipelined analog-to-digital conversion module of the present invention.
具体实施方式detailed description
下面结合具体实施例进一步阐明本发明,应理解这些实施例仅用于说明本发明而不用于限制本发明的范围,在阅读了本发明之后,本领域技术人员对本发明的各种等价形式的修改均落于本申请所附权利要求所限定的范围。The present invention will be further clarified below in conjunction with specific examples. It should be understood that these examples are only used to illustrate the present invention and not to limit the scope of the present invention. After reading the present invention, those skilled in the art will understand various equivalent forms of the present invention. All modifications fall within the scope defined by the appended claims of this application.
面向卷积神经网络的双相系数可调模拟乘法计算电路在内部各模块之间的控制和调度下进行神经网络中的深度卷积、逐点卷积、激活层、池化层和批归一化层中的乘法运算。如图1所示,面向神经网络的双相系数可调模拟乘法器包括:电流型网络数模转换模块、双相系数可调模拟乘法阵列、流水线型模数转换模块和计算单元控制 模块。The two-phase coefficient adjustable analog multiplication calculation circuit for convolutional neural networks performs deep convolution, point-by-point convolution, activation layer, pooling layer and batch normalization in neural networks under the control and scheduling between internal modules Multiplication in the transformation layer. As shown in Figure 1, the neural network-oriented dual-phase coefficient adjustable analog multiplier includes: current network digital-to-analog conversion module, dual-phase coefficient adjustable analog multiplication array, pipelined analog-to-digital conversion module, and calculation unit control module.
如图2所示,电流型网络数模转换模块由输入脉冲生成模块和级联PMOS恒流源组成。输入脉冲生成模块由具有8个定时信号的8:1多路复用器构成,目的是为每个输入数值生成一个输入脉冲,同时减少面积开销和信号路由。并且可以根据不同的精度需求设计多路复用器,如6:1或10:1多路复用器等。级联PMOS恒流源由3个PMOS晶体管(M P1、M P2、M P3)和1个NMOS管M N(负沟道金属氧化物半导体场效应)晶体管组成。输出信号端充电电流时间与输入数值成正比,这种数模转换模块架构比二进制加权PMOS充电数模转换模块具有更好的线性度和失配度。此外,与由PMOS的阈值电压失配产生的信号相比,定时信号的脉冲宽度通常具有较小的变化,因此具有很好的稳定性。 As shown in Figure 2, the current-mode network digital-to-analog conversion module is composed of an input pulse generation module and a cascaded PMOS constant current source. The input pulse generation module is composed of an 8:1 multiplexer with 8 timing signals. The purpose is to generate an input pulse for each input value while reducing area overhead and signal routing. And can design multiplexers according to different precision requirements, such as 6:1 or 10:1 multiplexers. The constant current source cascaded PMOS three PMOS transistors (M P1, M P2, M P3) and an NMOS transistor M N (negative channel metal oxide semiconductor field-effect) transistors. The charging current time of the output signal terminal is proportional to the input value. This digital-to-analog conversion module architecture has better linearity and mismatch than the binary weighted PMOS charging digital-to-analog conversion module. In addition, compared with the signal generated by the mismatch of the threshold voltage of the PMOS, the pulse width of the timing signal usually has a smaller change, so it has good stability.
输入特征数据从存储模块中读出,输入特征数据进入电流型网络数模转换模块的先进先出存储器中,输入数据为6bit时,先将输入数据通过6:1多路复用器,使用输入数据的3个最高有效位来选择前半个充电脉冲宽度,使用输入数据的3个最低有效位确定后半个充电脉冲宽度,然后将充电脉冲输入电流型网络数模转换模块,转换为模拟电压。电流型网络数模转换模块输出的模拟电压传递至双相系数可调模拟乘法计算阵列作为输入电压。The input characteristic data is read from the storage module, and the input characteristic data enters the first-in-first-out memory of the current-type network digital-to-analog conversion module. When the input data is 6bit, first pass the input data through the 6:1 multiplexer, and use the input The 3 most significant bits of the data are used to select the first half of the charging pulse width, and the 3 least significant bits of the input data are used to determine the second half of the charging pulse width, and then the charging pulse is input to the current-type network digital-to-analog conversion module to convert it into an analog voltage. The analog voltage output by the current network digital-to-analog conversion module is transferred to the two-phase coefficient adjustable analog multiplication calculation array as the input voltage.
权重数据从存储模块中读出,权重数据进入计算单元控制模块。计算单元控制模块结合卷积核大小和权重数据给每个模拟乘法计算单元配置2位的控制信号和6位的乘法系数值。2位控制信号控制每个模拟乘法计算单元的开关7和开关8进而实现输入信号的正向控制或反向控制,6位的乘法系数值控制开关1到6进而实现乘法系数取值的调整。通常情况下,卷积核尺寸大小为3×3,这时系数可调模拟乘法阵列会调动阵列中的3×3个计算单元完成任务。当卷积核尺寸为2×2时,系数可调模拟乘法电路阵列可同时进行4组运算操作。当卷积核尺寸为1×1时,系数可调模拟乘法电路阵列可同时进行16组运算操作。当卷积核尺寸为4×4时,系数可调模拟乘法电路阵列可同时进行1组运算操作。当卷积核尺寸为N×N,且N大于4时,可以采用多个系数可调模拟乘法电路阵列并行计算。The weight data is read from the storage module, and the weight data enters the calculation unit control module. The calculation unit control module combines the size of the convolution kernel and the weight data to configure each analog multiplication calculation unit with a 2-bit control signal and a 6-bit multiplication coefficient value. The 2-bit control signal controls the switch 7 and the switch 8 of each analog multiplication calculation unit to realize the forward control or the reverse control of the input signal, and the 6-bit multiplication coefficient value controls the switches 1 to 6 to realize the adjustment of the multiplication coefficient value. Under normal circumstances, the size of the convolution kernel is 3×3. At this time, the coefficient-adjustable analog multiplication array will mobilize 3×3 computing units in the array to complete the task. When the size of the convolution kernel is 2×2, the analog multiplication circuit array with adjustable coefficients can perform 4 sets of arithmetic operations at the same time. When the size of the convolution kernel is 1×1, the analog multiplication circuit array with adjustable coefficients can perform 16 sets of arithmetic operations at the same time. When the size of the convolution kernel is 4×4, the analog multiplication circuit array with adjustable coefficients can simultaneously perform a set of arithmetic operations. When the size of the convolution kernel is N×N, and N is greater than 4, multiple analog multiplication circuit arrays with adjustable coefficients can be used for parallel calculation.
双相系数可调模拟乘法阵列由4乘4共16个系数可调模拟乘法电路单元构成。每个系数可调模拟乘法电路单元由双相采样和保持缓冲电路和开关控制的模拟乘法电路组成,双相采样和保持缓冲电路由一个公共源放大器构成,通过对输入信号的正向控制或反向控制实现有符号乘数的输入,以有符号的乘数作为开关控制的模拟乘法 电路的输入信号,再结合乘法系数的调节实现系数在0到1之间、精度为2 -6的乘法,同时在低电源电压下也能保持低功耗和高线性度。 The dual-phase coefficient adjustable analog multiplication array is composed of 4 by 4, a total of 16 coefficient adjustable analog multiplication circuit units. Each coefficient adjustable analog multiplication circuit unit is composed of a dual-phase sample and hold buffer circuit and a switch-controlled analog multiplication circuit. The dual-phase sample and hold buffer circuit is composed of a common source amplifier. To control the input of the signed multiplier, use the signed multiplier as the input signal of the switch-controlled analog multiplication circuit, and combine with the adjustment of the multiplication coefficient to realize the multiplication with the coefficient between 0 and 1 and the accuracy of 2-6 . At the same time, low power consumption and high linearity can be maintained under low power supply voltage.
如图3所示,可调模拟乘法计算阵列的输入电压进入模拟乘法计算单元后,在开关7闭合而开关8断开时为正向控制,模拟乘法计算单元在开关7断开而开关8闭合时为负向控制,模拟乘法计算单元在开关7和8都断开时停止。选择工作模式以后,输入电压经过采样和保持缓冲电路稳定电压。输入电压经过稳定后作为六个并联开关支路的输入电压,开关1到开关6共6条支路,每个支路上的开关串联一个对应一比特位乘法系数值大小的电容。开关1串联10fF电容,开关2串联30fF电容,开关3串联40fF电容,开关4串联10fF电容,开关5串联20fF电容,开关6串联40fF电容。开关1至开关6的开启和闭合由计算单元控制模块控制,开关1到开关6分别对应6比特系数值的最低位到最高位,若对应比特位为1则开关闭合,对应的电容充电;若对应比特位为0则开关打开,对应的电容放电。将开关1支路、开关2支路、开关3支路和一个10fF的电容并联为主支路1,将开关4支路、开关5支路、开关6支路并联为主支路2,然后将按顺序将主支路1、一个800/7fF的电容、主支路2串联。主支路2的端点电压即是输出电压。若开关1闭合,输出电压会有8/569(约1/64)输入电压的增加;若开关2闭合,输出电压会有24/569(约1/32)输入电压的增加;若开关3闭合,输出电压会有32/569约(1/16)输入电压的增加;若开关4闭合,输出电压会有72/575(约1/8)输入电压的增加;若开关5闭合,输出电压会有144/575(约1/4)输入电压的增加;若开关6闭合,输出电压会有288/575(约1/2)输入电压的增加。可见,模拟乘法单元使用的是离散时间开关电容电路,根据开关电路实现可调节的高阶窄带宽可编程滤波,于是由数字电路控制6个开关的闭合,将开关电路上充电电容时产生的支路电压叠加就能得到模拟乘法单元的输出电压。As shown in Figure 3, after the input voltage of the adjustable analog multiplication calculation array enters the analog multiplication calculation unit, it is positively controlled when the switch 7 is closed and the switch 8 is open. The analog multiplication calculation unit is closed when the switch 7 is open and the switch 8 is closed. When the time is negative control, the analog multiplication calculation unit stops when the switches 7 and 8 are both open. After selecting the operating mode, the input voltage is stabilized by the sample and hold buffer circuit. After the input voltage is stabilized, it is used as the input voltage of the six parallel switch branches. There are 6 branches in total from switch 1 to switch 6, and the switch on each branch is connected in series with a capacitor corresponding to the value of the multiplication coefficient of one bit. Switch 1 is connected in series with a 10fF capacitor, switch 2 is connected in series with a 30fF capacitor, switch 3 is connected in series with a 40fF capacitor, switch 4 is connected in series with a 10fF capacitor, switch 5 is connected in series with a 20fF capacitor, and switch 6 is connected in series with a 40fF capacitor. The opening and closing of switches 1 to 6 are controlled by the control module of the calculation unit. Switches 1 to 6 respectively correspond to the lowest to highest position of the 6-bit coefficient value. If the corresponding bit is 1, the switch is closed and the corresponding capacitor is charged; if If the corresponding bit is 0, the switch is turned on and the corresponding capacitor is discharged. Connect switch 1 branch, switch 2 branch, switch 3 branch and a 10fF capacitor in parallel to main branch 1, connect switch 4 branch, switch 5 branch, and switch 6 branch in parallel to main branch 2, then The main branch 1, an 800/7fF capacitor, and the main branch 2 will be connected in series in sequence. The terminal voltage of the main branch 2 is the output voltage. If switch 1 is closed, the output voltage will increase by 8/569 (about 1/64) of the input voltage; if switch 2 is closed, the output voltage will increase by 24/569 (about 1/32) of the input voltage; if switch 3 is closed , The output voltage will increase by about 32/569 (1/16) of the input voltage; if the switch 4 is closed, the output voltage will increase by 72/575 (about 1/8) of the input voltage; if the switch 5 is closed, the output voltage will increase There is an increase of 144/575 (about 1/4) of the input voltage; if the switch 6 is closed, the output voltage will have an increase of 288/575 (about 1/2) of the input voltage. It can be seen that the analog multiplication unit uses a discrete-time switched capacitor circuit. According to the switch circuit, the adjustable high-order narrow bandwidth programmable filter is realized. Then the digital circuit controls the closing of the 6 switches, and the support generated when the capacitor is charged on the switch circuit is controlled. The output voltage of the analog multiplication unit can be obtained by superposing the circuit voltage.
流水线型模数转换器能够实现高速及高分辨率的转换,并且满足低功耗和面积小的芯片设计要求。如图4所示,流水线型模数转换器主要由多个级联电路组成,每一级包括一个采样/保持(Sample/Hold,S/H)放大器、低精度ADC、DAC以及求和电路。输入模拟量经过一个3位粗精度ADC转换为3bit数字量,该数字量则为输出数据的高3位,3bit数字量经DAC转换为模拟量。S/H放大器对ADC输出的3bit数字量进行采样后与DAC转换得到的模拟量进行求和或求差操作,从而将3bit数字量对应的模拟信号从输入信号中删除,所得差值经放大得到输出数据的低3位数值并将输出数据的低3位数值送往下一级电路处理。根据数字量的精度需求,对参与信号进行 转换并最终得到高精度的n位输出数据。流水型ADC需要采用数字误差校正技术来降低内部比较器的精度要求,如果上一级的比较器存在较大的失调且输入电压正处于该比较点,则会生成错误的输出值,从而产生不同的差值。经过放大器后,可以恢复至正确的ADC结果。与其它模数转换器相比,流水型ADC是并行结构,可以同时对多个采样数据进行处理,信号处理速度高,在保持高精度的同时所需功耗低,并具有良好的线性度和低失调特点,因此能够实现高速及高分辨率的转换。The pipelined analog-to-digital converter can achieve high-speed and high-resolution conversion, and meet the requirements of low power consumption and small area chip design. As shown in Figure 4, the pipelined analog-to-digital converter is mainly composed of multiple cascaded circuits, and each stage includes a Sample/Hold (S/H) amplifier, low-precision ADC, DAC, and summing circuit. The input analog quantity is converted into a 3bit digital quantity by a 3-bit coarse-precision ADC, which is the high 3 bits of the output data, and the 3bit digital quantity is converted into an analog quantity by the DAC. The S/H amplifier samples the 3bit digital quantity output by the ADC and performs summation or difference operation with the analog quantity converted by the DAC, thereby deleting the analog signal corresponding to the 3bit digital quantity from the input signal, and the difference is obtained by amplifying Output the low 3 digits of the data and send the low 3 digits of the output data to the next-level circuit for processing. According to the accuracy requirements of the digital quantity, the participating signals are converted and finally high-precision n-bit output data is obtained. Pipeline ADCs need to use digital error correction technology to reduce the accuracy requirements of the internal comparator. If the upper-level comparator has a large offset and the input voltage is at the comparison point, it will generate an incorrect output value, resulting in a difference The difference. After passing through the amplifier, the correct ADC result can be restored. Compared with other analog-to-digital converters, pipeline ADC is a parallel structure, can process multiple sampled data at the same time, high signal processing speed, low power consumption while maintaining high precision, and has good linearity and Low offset characteristics, so it can achieve high-speed and high-resolution conversion.
本申请公开的双相系数可调模拟乘法计算电路的整个功能的实现包括如下6个步骤。The realization of the entire function of the dual-phase coefficient adjustable analog multiplication calculation circuit disclosed in the present application includes the following 6 steps.
步骤1:输入特征数据从存储模块中读出,特征数据进入电流型网络数模转换模块的先进先出存储器中,数模转换模块将特征数据转换为模拟电压后传递至双相系数可调模拟乘法计算阵列作为输入电压。Step 1: Input characteristic data and read it from the storage module. The characteristic data enters the first-in-first-out memory of the current-type network digital-to-analog conversion module. The digital-to-analog conversion module converts the characteristic data into an analog voltage and transmits it to the dual-phase coefficient adjustable analog Multiply the calculation array as the input voltage.
步骤2:权重数据从存储模块中读出,权重数据进入计算单元控制模块,计算单元控制模块结合卷积核大小和权重数据控制每个模拟乘法计算单元中的八个开关,设置好乘法的系数以及双相符号选择器的工作模式(正向控制,负向控制,停止)。Step 2: The weight data is read from the storage module, and the weight data enters the calculation unit control module. The calculation unit control module combines the size of the convolution kernel and the weight data to control the eight switches in each analog multiplication calculation unit, and set the multiplication coefficients And the working mode of the two-phase symbol selector (positive control, negative control, stop).
步骤3:模拟乘法计算单元的输入电压经过正向控制或负向控制模式的双相符号选择器,完成符号位的运算;然后,有符号的乘数经过采样和保持缓冲电路维持输入电压数值,防止其衰减影响到计算结果,同时,开关1到开关6根据系数值的六比特数字信号转换为闭合或者打开状态,闭合的开关所在支路上的电容进行充电,打开的开关所在支路上的电容进行放电,当充放电过程结束后,按照电容大小的不同和电路结构的不同即可确定乘法计算结果,不同开关支路上的电容对输出电压的贡献不同,开关1对输出电压的贡献是约1/64的输入电压,开关2对输出电压的贡献是约1/32的输入电压,开关3对输出电压的贡献是约1/16的输入电压,开关4对输出电压的贡献是约1/8的输入电压;开关5对输出电压的贡献是约1/4的输入电压,开关6对输出电压的贡献是约1/2的输入电压;最后,根据开关的闭合情况,就能得到对应的输出电压,即为模拟乘法器的输出。Step 3: The input voltage of the analog multiplication calculation unit passes through the two-phase sign selector in positive control or negative control mode to complete the sign bit operation; then, the signed multiplier passes through the sample and hold buffer circuit to maintain the input voltage value. To prevent its attenuation from affecting the calculation results, at the same time, switches 1 to 6 are switched to the closed or open state according to the six-bit digital signal of the coefficient value. The capacitor on the branch where the closed switch is located is charged, and the capacitor on the branch where the open switch is located is charged. Discharge, when the charge and discharge process is over, the multiplication calculation result can be determined according to the difference of the capacitance and the circuit structure. The contribution of the capacitance of different switch branches to the output voltage is different, and the contribution of switch 1 to the output voltage is about 1/ With an input voltage of 64, the contribution of switch 2 to the output voltage is about 1/32 of the input voltage, the contribution of switch 3 to the output voltage is about 1/16 of the input voltage, and the contribution of switch 4 to the output voltage is about 1/8 Input voltage; the contribution of switch 5 to the output voltage is about 1/4 of the input voltage, and the contribution of switch 6 to the output voltage is about 1/2 of the input voltage; finally, according to the closing of the switch, the corresponding output voltage can be obtained , Which is the output of the analog multiplier.
步骤4:模拟乘法器的输出电压最后传递至流水型模数转换器得到输出数值,保存在存储器中,等待下一次的读取指令。Step 4: The output voltage of the analog multiplier is finally transferred to the pipeline analog-to-digital converter to obtain the output value, which is stored in the memory and waits for the next read instruction.

Claims (10)

  1. 一种面向卷积神经网络的双相系数可调模拟乘法计算电路,其特征在于,包括:A dual-phase coefficient adjustable analog multiplication calculation circuit for convolutional neural network, which is characterized in that it comprises:
    数模转换模块,将读取的特征数据转换为模拟电压后输出至双相系数可调乘法阵列,The digital-to-analog conversion module converts the read characteristic data into an analog voltage and outputs it to the dual-phase coefficient adjustable multiplication array,
    计算单元控制模块,根据读取的权重数据并结合卷积核大小输出双相系数可调乘法阵列的工作状态控制信号及乘法系数控制信号,The calculation unit control module outputs the working state control signal and the multiplication coefficient control signal of the two-phase coefficient adjustable multiplication array according to the read weight data and combined with the size of the convolution kernel,
    双相系数可调乘法阵列,每个乘法单元在其工作状态控制信号及乘法系数控制信号的作用下将输入的模拟电压转换为有符号的乘数并选择对应乘法系数的电路结构,输出乘法运算结果,Two-phase coefficient adjustable multiplication array, each multiplication unit converts the input analog voltage into a signed multiplier under the action of its working state control signal and multiplication coefficient control signal, selects the circuit structure corresponding to the multiplication coefficient, and outputs the multiplication operation result,
    模数转换模块,对双向系数可调乘法阵列输出的乘法运算结果进行模数转换。The analog-to-digital conversion module performs analog-to-digital conversion on the multiplication result output by the bidirectional coefficient adjustable multiplication array.
  2. 根据权利要求1所述一种面向卷积神经网络的双相系数可调模拟乘法计算电路,其特征在于,每个乘法单元包括:The two-phase coefficient adjustable analog multiplication calculation circuit for convolutional neural network according to claim 1, wherein each multiplication unit comprises:
    双相采样和保持缓冲电路,其正向输入端和反向输入端各串联有一开关,两个开关的控制端接计算单元控制模块输出的工作状态控制信号,对输入的模拟电压进行正向控制或反向控制后输出表征有符号乘数的模拟电压,及,The two-phase sample and hold buffer circuit has a switch in series with the forward input terminal and the reverse input terminal. The control terminals of the two switches are connected to the working state control signal output by the control module of the calculation unit, and the input analog voltage is forwardly controlled. Or output an analog voltage that represents a signed multiplier after reverse control, and,
    开关控制的模拟乘法电路,由多个并联的电容支路组成,每个电容支路上串接一个受控于乘法系数控制信号的开关,各电容支路正极板一端连接形成的输入端接双相采样和保持缓冲电路的输出端,各电容支路在乘法系数控制信号的作用下切换至充电状态或放电状态,表征有符号乘数的模拟电压作用于处于充电状态的电容支路,各电容支路负极板一端连接后形成的输出端输出表征乘法运算结果的模拟电压。The switch-controlled analog multiplication circuit is composed of multiple capacitor branches connected in parallel. Each capacitor branch is connected in series with a switch controlled by the multiplication coefficient control signal. The input terminal formed by connecting one end of the positive plate of each capacitor branch is connected to two-phase At the output end of the sample and hold buffer circuit, each capacitor branch is switched to the charging state or the discharging state under the action of the multiplication coefficient control signal, and the analog voltage representing the signed multiplier acts on the capacitor branch in the charging state, and each capacitor branch The output terminal formed after one end of the negative plate of the circuit is connected to output an analog voltage representing the result of the multiplication operation.
  3. 根据权利要求1所述一种面向卷积神经网络的双相系数可调模拟乘法计算电路,其特征在于,所述数模转换模块包括:The bi-phase coefficient adjustable analog multiplication calculation circuit for convolutional neural network according to claim 1, wherein the digital-to-analog conversion module comprises:
    输入脉冲生成模块,对输入的特征数据多路复用后映射至级联PMOS恒流源的输入端,及,Input pulse generation module, which multiplexes the input characteristic data and maps it to the input terminal of the cascaded PMOS constant current source, and,
    级联PMOS恒流源,将输入端映射的特征数据正比例转换为模拟电压输出端的充电电流时间。The cascaded PMOS constant current source converts the characteristic data mapped at the input terminal into the charging current time of the analog voltage output terminal in direct proportion.
  4. 根据权利要求1所述一种面向卷积神经网络的双相系数可调模拟乘法计算电路,其特征在于,所述模数转换模块由多个级联模块组成,每个级联模块包括:The two-phase coefficient adjustable analog multiplication calculation circuit for convolutional neural network according to claim 1, wherein the analog-to-digital conversion module is composed of a plurality of cascaded modules, and each cascaded module includes:
    低精度ADC,对其输入端的乘法运算结果进行模数转换得到输出数字信号的高位数据,A low-precision ADC performs analog-to-digital conversion on the multiplication result of its input to obtain the high-bit data of the output digital signal.
    DAC,对其输入端的输出数字信号的高位数据进行模数转换后输出高位数据对应的模拟信号,DAC, which performs analog-to-digital conversion on the high-order data of the output digital signal at its input and outputs the analog signal corresponding to the high-order data.
    采样/保持放大器,对输入端的乘法运算结果进行采样保持后输出,及,A sample/hold amplifier, which samples and holds the result of the multiplication operation at the input and outputs it, and,
    求和电路,对采样/保持放大器输出的乘法运算结果及DAC输出的高位数据对应的模拟信号进行求和或求差操作,输出删除高位数据的乘法运算结果至下一个级联模块。The summing circuit performs the summation or difference operation on the multiplication result output by the sample/hold amplifier and the analog signal corresponding to the high-order data output by the DAC, and outputs the multiplication operation result with the high-order data deleted to the next cascade module.
  5. 根据权利要求3所述一种面向卷积神经网络的双相系数可调模拟乘法计算电路,其特征在于,所述输入脉冲生成模块为多路复用器,该多路复用器为每个特征数据生成一个输入脉冲。The two-phase coefficient adjustable analog multiplication calculation circuit for convolutional neural network according to claim 3, wherein the input pulse generation module is a multiplexer, and the multiplexer is each The characteristic data generates an input pulse.
  6. 根据权利要求4所述一种面向卷积神经网络的双相系数可调模拟乘法计算电路,其特征在于,所述模数转换模块还包括对各级联模块输出信号进行校正后拼接得到要求精度数字量的数字校正和对齐电路。The dual-phase coefficient adjustable analog multiplication calculation circuit for convolutional neural network according to claim 4, wherein the analog-to-digital conversion module further comprises correcting the output signals of the multi-level connection modules and then splicing them to obtain the required accuracy Digital correction and alignment circuit for digital quantities.
  7. 面向卷积神经网络的双相系数可调模拟乘法,其特征在于,将特征数据转换为模拟电压后作为双相系数可调乘法阵列的输入量,根据权重数据并结合卷积核大小控制双相系数可调乘法阵列中各乘法单元的工作状态及乘法系数,对双向系数可调乘法阵列输出的乘法运算结果进行模数转换得到计算结果。The dual-phase coefficient adjustable analog multiplication for convolutional neural networks is characterized in that the characteristic data is converted into an analog voltage as the input of the dual-phase coefficient adjustable multiplication array, and the dual-phase is controlled according to the weight data and combined with the size of the convolution kernel. The working state and multiplication coefficient of each multiplication unit in the coefficient adjustable multiplication array are converted from analog to digital on the multiplication operation result output by the bidirectional coefficient adjustable multiplication array to obtain the calculation result.
  8. 根据权利要求7所述面向卷积神经网络的双相系数可调模拟乘法,其特征在于,根据权重数据并结合卷积核大小控制双相系数可调乘法阵列中各乘法单元的工作状态及乘法系数的具体方法为:根据权重数据并结合卷积核大小生成对各乘法单元输入端的模拟电压进行正向控制或反向控制的工作状态控制信号以及选择各乘法单元电路结构以实现不同乘法系数的乘法系数控制信号。The dual-phase coefficient adjustable analog multiplication oriented to convolutional neural network according to claim 7, characterized in that the working status and multiplication of each multiplication unit in the dual-phase coefficient adjustable multiplication array are controlled according to the weight data and combined with the size of the convolution kernel The specific method of the coefficient is: according to the weight data and combined with the size of the convolution kernel to generate a working state control signal for forward control or reverse control of the analog voltage at the input of each multiplication unit, and select the circuit structure of each multiplication unit to achieve different multiplication coefficients. Multiplication coefficient control signal.
  9. 根据权利要求7所述面向卷积神经网络的双相系数可调模拟乘法,其特征在于,将特征数据转换为模拟电压后作为双相系数可调乘法阵列的输入量的具体方法为:将特征数据映射为正比例转换为级联PMOS恒流源模拟电压输出端的充电电流时间。The two-phase coefficient adjustable analog multiplication method for convolutional neural network according to claim 7, wherein the specific method of converting the characteristic data into an analog voltage as the input quantity of the two-phase coefficient adjustable multiplication array is: The data is mapped as a proportional conversion to the charging current time of the analog voltage output terminal of the cascaded PMOS constant current source.
  10. 根据权利要求7所述面向卷积神经网络的双相系数可调模拟乘法,其特征在于,采用流水线型模数转换方法对双向系数可调乘法阵列输出的乘法运算结果进行模数转换得到计算结果。The two-phase coefficient adjustable analog multiplication oriented to the convolutional neural network according to claim 7, characterized in that a pipelined analog-to-digital conversion method is used to perform analog-to-digital conversion on the multiplication operation result output by the bidirectional coefficient adjustable multiplication array to obtain the calculation result .
PCT/CN2019/114107 2019-09-27 2019-10-29 Dual-phase coefficient adjustable analog multiplication calculation circuit for convolutional neural network WO2021056677A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910922468.9 2019-09-27
CN201910922468.9A CN110750231B (en) 2019-09-27 2019-09-27 Convolution neural network-oriented biphase coefficient adjustable analog multiplication circuit

Publications (1)

Publication Number Publication Date
WO2021056677A1 true WO2021056677A1 (en) 2021-04-01

Family

ID=69277274

Family Applications (2)

Application Number Title Priority Date Filing Date
PCT/CN2019/114107 WO2021056677A1 (en) 2019-09-27 2019-10-29 Dual-phase coefficient adjustable analog multiplication calculation circuit for convolutional neural network
PCT/CN2020/079253 WO2021056980A1 (en) 2019-09-27 2020-03-13 Convolutional neural network oriented two-phase coefficient adjustable analog multiplication calculation circuit

Family Applications After (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/079253 WO2021056980A1 (en) 2019-09-27 2020-03-13 Convolutional neural network oriented two-phase coefficient adjustable analog multiplication calculation circuit

Country Status (2)

Country Link
CN (1) CN110750231B (en)
WO (2) WO2021056677A1 (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110750231B (en) * 2019-09-27 2021-09-28 东南大学 Convolution neural network-oriented biphase coefficient adjustable analog multiplication circuit
WO2022061867A1 (en) * 2020-09-28 2022-03-31 深圳市大疆创新科技有限公司 Data processing method and apparatus, and computer-readable storage medium
CN113741857A (en) * 2021-07-27 2021-12-03 北京大学 Multiply-accumulate operation circuit
CN116366066A (en) * 2021-12-27 2023-06-30 圣邦微电子(北京)股份有限公司 Arithmetic circuit
CN114491395A (en) * 2022-01-04 2022-05-13 电子科技大学 Current domain system design applied to analog front-end signal processing based on FFT algorithm
CN114117986B (en) * 2022-01-29 2022-07-19 深圳市芯茂微电子有限公司 Arithmetic unit
CN117829224A (en) * 2024-03-05 2024-04-05 南京大学 Bionic neural network circuit system based on composite dielectric gate structure

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107679622A (en) * 2017-09-06 2018-02-09 清华大学 A kind of simulation towards neural network algorithm perceives computing architecture
CN108416438A (en) * 2018-05-30 2018-08-17 济南浪潮高新科技投资发展有限公司 A kind of convolutional neural networks hardware module dispositions method
US20180300627A1 (en) * 2017-04-14 2018-10-18 International Business Machines Corporation Update management for rpu array
CN109344964A (en) * 2018-08-08 2019-02-15 东南大学 A kind of multiply-add calculation method and counting circuit suitable for neural network
CN110008440A (en) * 2019-04-15 2019-07-12 合肥恒烁半导体有限公司 A kind of convolution algorithm and its application based on analog matrix arithmetic element

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9473250B2 (en) * 2014-09-03 2016-10-18 Mitsubishi Electric Research Laboratories, Inc. System and method for recovering carrier phase in optical communications
US20170193136A1 (en) * 2015-12-30 2017-07-06 Sharat C. Prasad On-chip and system-area multi-processor interconnection networks in advanced processes for maximizing performance minimizing cost and energy
CN105654729B (en) * 2016-03-28 2018-01-02 南京邮电大学 A kind of short-term traffic flow forecast method based on convolutional neural networks
CN107229598B (en) * 2017-04-21 2021-02-26 东南大学 Low-power-consumption voltage-adjustable convolution operation module for convolution neural network
CN107545305B (en) * 2017-09-15 2020-01-03 中国科学技术大学 CMOS (complementary metal oxide semiconductor) process-based digital-analog mixed charge domain neuron circuit
CN107797962B (en) * 2017-10-17 2021-04-16 清华大学 Neural network based computational array
CN108446631B (en) * 2018-03-20 2020-07-31 北京邮电大学 Deep learning intelligent spectrogram analysis method based on convolutional neural network
CN109359269A (en) * 2018-08-27 2019-02-19 北京大学 It carries out the system of matrix-vector multiplication operation and carries out neural network computing method
CN109800876B (en) * 2019-01-18 2021-06-01 合肥恒烁半导体有限公司 Data operation method of neural network based on NOR Flash module
CN110188685B (en) * 2019-05-30 2021-01-05 燕山大学 Target counting method and system based on double-attention multi-scale cascade network
CN110750231B (en) * 2019-09-27 2021-09-28 东南大学 Convolution neural network-oriented biphase coefficient adjustable analog multiplication circuit

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180300627A1 (en) * 2017-04-14 2018-10-18 International Business Machines Corporation Update management for rpu array
CN107679622A (en) * 2017-09-06 2018-02-09 清华大学 A kind of simulation towards neural network algorithm perceives computing architecture
CN108416438A (en) * 2018-05-30 2018-08-17 济南浪潮高新科技投资发展有限公司 A kind of convolutional neural networks hardware module dispositions method
CN109344964A (en) * 2018-08-08 2019-02-15 东南大学 A kind of multiply-add calculation method and counting circuit suitable for neural network
CN110008440A (en) * 2019-04-15 2019-07-12 合肥恒烁半导体有限公司 A kind of convolution algorithm and its application based on analog matrix arithmetic element

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
LIANG SHUANG: "Research on Key Technologies of Reconfigurable Neural Network Accelerator Design", CHINESE DOCTORAL DISSERTATIONS FULL-TEXT DATABASE, UNIVERSITY OF CHINESE ACADEMY OF SCIENCES, CN, 1 November 2017 (2017-11-01), CN, XP055794737, ISSN: 1674-022X *

Also Published As

Publication number Publication date
WO2021056980A1 (en) 2021-04-01
CN110750231B (en) 2021-09-28
CN110750231A (en) 2020-02-04

Similar Documents

Publication Publication Date Title
WO2021056980A1 (en) Convolutional neural network oriented two-phase coefficient adjustable analog multiplication calculation circuit
CN101277112B (en) Low-power consumption assembly line a/d converter by sharing operation amplifier
CN102594353B (en) Digital-to-analog converter and successive approximation storage converter
US8902093B1 (en) Parallel analog to digital converter architecture with charge redistribution and method thereof
US7414562B2 (en) Analog-to-digital conversion using asynchronous current-mode cyclic comparison
CN104967451A (en) Successive approximation type analog-to-digital converter
CN108809310B (en) Passive time-interleaved SAR ADC-based band-pass Delta-Sigma modulator
US7443333B2 (en) Single stage cyclic analog to digital converter with variable resolution
TW201301773A (en) Successive approximation register analog to digital converter and conversion method thereof
CN104242935A (en) SAR ADC segmented capacitor mismatch correction method
CN112564709B (en) Noise shaping successive approximation analog-to-digital converter based on error feedback
US8264393B2 (en) Current reduction in a single stage cyclic analog to digital converter with variable resolution
CN1561000B (en) Pipeline structure analogue/digital converter of controlling input common-mode drift
CN106341133A (en) Dual-channel time interleaved asynchronous assembly line flash analog-to-digital converter
CN109462402B (en) Mixed type assembly line ADC structure
CN116318161B (en) Multi-step type monoclinic analog-to-digital conversion circuit for image sensor and control method
CN103916126A (en) Pipelined ADC circuit with digital correction module
US11509320B2 (en) Signal converting apparatus and related method
CN113922819B (en) One-step two-bit successive approximation type analog-to-digital converter based on background calibration
CN114244369A (en) Successive approximation analog-to-digital conversion transpose
CN113672854A (en) Memory operation method based on current mirror and storage unit, convolution operation method and device and application of convolution operation method and device
CN113162623A (en) Conversion circuit and digital-to-analog converter based on resistance voltage division and capacitance integration
WO2021137686A1 (en) Interfacing circuit and analog to digital converter for battery monitoring applications and a method thereof
Ghoshal et al. Design of a Modified 8-bit Semiflash Analog to Digital Converter
JPH07202700A (en) A/d converter

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19946335

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19946335

Country of ref document: EP

Kind code of ref document: A1