WO2020103615A1 - 光电计算单元、光电计算阵列及光电计算方法 - Google Patents
光电计算单元、光电计算阵列及光电计算方法Info
- Publication number
- WO2020103615A1 WO2020103615A1 PCT/CN2019/111513 CN2019111513W WO2020103615A1 WO 2020103615 A1 WO2020103615 A1 WO 2020103615A1 CN 2019111513 W CN2019111513 W CN 2019111513W WO 2020103615 A1 WO2020103615 A1 WO 2020103615A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- area
- input
- data
- output
- readout
- Prior art date
Links
- 238000004364 calculation method Methods 0.000 title claims abstract description 604
- 230000008878 coupling Effects 0.000 claims abstract description 71
- 238000010168 coupling process Methods 0.000 claims abstract description 71
- 238000005859 coupling reaction Methods 0.000 claims abstract description 71
- 239000004065 semiconductor Substances 0.000 claims abstract description 56
- 239000011159 matrix material Substances 0.000 claims description 274
- 239000013598 vector Substances 0.000 claims description 274
- 230000003287 optical effect Effects 0.000 claims description 232
- 239000000758 substrate Substances 0.000 claims description 152
- 239000000969 carrier Substances 0.000 claims description 147
- 230000005693 optoelectronics Effects 0.000 claims description 91
- 238000004422 calculation algorithm Methods 0.000 claims description 51
- 238000000034 method Methods 0.000 claims description 51
- 238000011176 pooling Methods 0.000 claims description 50
- 238000006243 chemical reaction Methods 0.000 claims description 39
- 230000008859 change Effects 0.000 claims description 37
- 238000007667 floating Methods 0.000 claims description 36
- 230000006870 function Effects 0.000 claims description 29
- 238000002955 isolation Methods 0.000 claims description 25
- 238000013528 artificial neural network Methods 0.000 claims description 24
- 230000008569 process Effects 0.000 claims description 22
- 238000003491 array Methods 0.000 claims description 20
- 238000009825 accumulation Methods 0.000 claims description 18
- 230000001133 acceleration Effects 0.000 claims description 11
- 238000012360 testing method Methods 0.000 claims description 11
- 238000003384 imaging method Methods 0.000 claims description 7
- 238000010521 absorption reaction Methods 0.000 claims description 4
- 230000000694 effects Effects 0.000 claims description 4
- 238000012886 linear function Methods 0.000 claims description 3
- 206010034960 Photophobia Diseases 0.000 claims description 2
- 230000002776 aggregation Effects 0.000 claims description 2
- 238000004220 aggregation Methods 0.000 claims description 2
- 208000013469 light sensitivity Diseases 0.000 claims description 2
- 230000026676 system process Effects 0.000 claims 2
- 230000002159 abnormal effect Effects 0.000 claims 1
- 230000005611 electricity Effects 0.000 claims 1
- 238000006386 neutralization reaction Methods 0.000 claims 1
- 238000010586 diagram Methods 0.000 description 52
- 238000003860 storage Methods 0.000 description 45
- 239000008186 active pharmaceutical agent Substances 0.000 description 43
- 230000008901 benefit Effects 0.000 description 39
- 230000010354 integration Effects 0.000 description 30
- 239000000463 material Substances 0.000 description 11
- 238000012549 training Methods 0.000 description 10
- 239000013307 optical fiber Substances 0.000 description 8
- 238000004088 simulation Methods 0.000 description 8
- 238000005468 ion implantation Methods 0.000 description 7
- 230000005684 electric field Effects 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 5
- 239000000835 fiber Substances 0.000 description 5
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 4
- 230000009471 action Effects 0.000 description 4
- 239000003990 capacitor Substances 0.000 description 4
- 230000007423 decrease Effects 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 238000009795 derivation Methods 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- XUIMIQQOPSSXEZ-UHFFFAOYSA-N Silicon Chemical compound [Si] XUIMIQQOPSSXEZ-UHFFFAOYSA-N 0.000 description 2
- 238000005265 energy consumption Methods 0.000 description 2
- 238000005530 etching Methods 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 230000004298 light response Effects 0.000 description 2
- 230000014759 maintenance of location Effects 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 229910052710 silicon Inorganic materials 0.000 description 2
- 239000010703 silicon Substances 0.000 description 2
- 235000012239 silicon dioxide Nutrition 0.000 description 2
- 239000000377 silicon dioxide Substances 0.000 description 2
- 238000012935 Averaging Methods 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 230000004888 barrier function Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000002301 combined effect Effects 0.000 description 1
- 238000002591 computed tomography Methods 0.000 description 1
- 238000013170 computed tomography imaging Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 239000002019 doping agent Substances 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 230000035515 penetration Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000003325 tomography Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000002604 ultrasonography Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06E—OPTICAL COMPUTING DEVICES; COMPUTING DEVICES USING OTHER RADIATIONS WITH SIMILAR PROPERTIES
- G06E3/00—Devices not provided for in group G06E1/00, e.g. for processing analogue or hybrid data
- G06E3/001—Analogue devices in which mathematical operations are carried out with the aid of optical or electro-optical elements
- G06E3/005—Analogue devices in which mathematical operations are carried out with the aid of optical or electro-optical elements using electro-optical or opto-electronic means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G06N3/067—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using optical means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored program computers
- G06F15/78—Architectures of general purpose stored program computers comprising a single central processing unit
- G06F15/7896—Modular architectures, e.g. assembled from a number of identical packages
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
-
- H—ELECTRICITY
- H01—ELECTRIC ELEMENTS
- H01L—SEMICONDUCTOR DEVICES NOT COVERED BY CLASS H10
- H01L27/00—Devices consisting of a plurality of semiconductor or other solid-state components formed in or on a common substrate
- H01L27/14—Devices consisting of a plurality of semiconductor or other solid-state components formed in or on a common substrate including semiconductor components sensitive to infrared radiation, light, electromagnetic radiation of shorter wavelength or corpuscular radiation and specially adapted either for the conversion of the energy of such radiation into electrical energy or for the control of electrical energy by such radiation
- H01L27/144—Devices controlled by radiation
- H01L27/146—Imager structures
- H01L27/14601—Structural or functional details thereof
- H01L27/14609—Pixel-elements with integrated switching, control, storage or amplification elements
- H01L27/14612—Pixel-elements with integrated switching, control, storage or amplification elements involving a transistor
-
- H—ELECTRICITY
- H01—ELECTRIC ELEMENTS
- H01L—SEMICONDUCTOR DEVICES NOT COVERED BY CLASS H10
- H01L29/00—Semiconductor devices specially adapted for rectifying, amplifying, oscillating or switching and having potential barriers; Capacitors or resistors having potential barriers, e.g. a PN-junction depletion layer or carrier concentration layer; Details of semiconductor bodies or of electrodes thereof ; Multistep manufacturing processes therefor
- H01L29/02—Semiconductor bodies ; Multistep manufacturing processes therefor
- H01L29/06—Semiconductor bodies ; Multistep manufacturing processes therefor characterised by their shape; characterised by the shapes, relative sizes, or dispositions of the semiconductor regions ; characterised by the concentration or distribution of impurities within semiconductor regions
- H01L29/0684—Semiconductor bodies ; Multistep manufacturing processes therefor characterised by their shape; characterised by the shapes, relative sizes, or dispositions of the semiconductor regions ; characterised by the concentration or distribution of impurities within semiconductor regions characterised by the shape, relative sizes or dispositions of the semiconductor regions or junctions between the regions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- H—ELECTRICITY
- H01—ELECTRIC ELEMENTS
- H01L—SEMICONDUCTOR DEVICES NOT COVERED BY CLASS H10
- H01L27/00—Devices consisting of a plurality of semiconductor or other solid-state components formed in or on a common substrate
- H01L27/02—Devices consisting of a plurality of semiconductor or other solid-state components formed in or on a common substrate including semiconductor components specially adapted for rectifying, oscillating, amplifying or switching and having potential barriers; including integrated passive circuit elements having potential barriers
- H01L27/04—Devices consisting of a plurality of semiconductor or other solid-state components formed in or on a common substrate including semiconductor components specially adapted for rectifying, oscillating, amplifying or switching and having potential barriers; including integrated passive circuit elements having potential barriers the substrate being a semiconductor body
- H01L27/10—Devices consisting of a plurality of semiconductor or other solid-state components formed in or on a common substrate including semiconductor components specially adapted for rectifying, oscillating, amplifying or switching and having potential barriers; including integrated passive circuit elements having potential barriers the substrate being a semiconductor body including a plurality of individual components in a repetitive configuration
- H01L27/105—Devices consisting of a plurality of semiconductor or other solid-state components formed in or on a common substrate including semiconductor components specially adapted for rectifying, oscillating, amplifying or switching and having potential barriers; including integrated passive circuit elements having potential barriers the substrate being a semiconductor body including a plurality of individual components in a repetitive configuration including field-effect components
- H01L27/1057—Devices consisting of a plurality of semiconductor or other solid-state components formed in or on a common substrate including semiconductor components specially adapted for rectifying, oscillating, amplifying or switching and having potential barriers; including integrated passive circuit elements having potential barriers the substrate being a semiconductor body including a plurality of individual components in a repetitive configuration including field-effect components comprising charge coupled devices [CCD] or charge injection devices [CID]
Definitions
- the invention relates to a photoelectric calculation unit, a photoelectric calculation array and a photoelectric calculation method. More specifically, the present invention combines some technologies in the computing field and the semiconductor device field, and the technical solutions of the present invention can perform calculations independently or in combination with current electronic computing technologies.
- Typical storage and computing integrated devices are mainly RRAM (memristor) and FLASH (flash memory).
- RRAM can save the resistance value affected by the input amount of its electrical input terminal for a long time after power off, but RRAM does not support
- the production of standard CMOS process the yield and uniformity of the device can not be guaranteed, which is unacceptable in the neural network algorithm that must use a large number of memory-calculation integrated devices to form a network to accelerate.
- FLASH as a memory-calculation integrated device, it means that a single floating gate tube must store more than one bit of data, that is, multi-value storage. This is for traditional FLASH that can only use two methods of erasing and programming to change the threshold. Difficult to do.
- the known optical calculation methods are mostly pure optical calculations that use light propagation rules to interact between light and optical devices.
- an optoelectronic computing device that uses the optoelectronic properties of semiconductor materials and uses external input optical signals to modulate electrical signals transmitted in the semiconductor materials to implement adders, multipliers, and some advanced operations .
- the device can realize a high-precision storage-calculation integrated function, and a single device can store the optical signal of the optical input end and store it for a long time after the light is cut off.
- a new photoelectric calculation method which uses the photoelectric properties of semiconductor materials and uses input optical signals to modulate the electrical signals transmitted in the semiconductor materials to implement basic operations such as adders and multipliers. New mechanism.
- the invention uses the photoelectric performance of semiconductor materials to design a photoelectric computing device, and discloses various adders, multipliers and algorithm accelerators composed of the photoelectric computing device, and their corresponding photoelectric calculation methods. It can be seen that the present invention utilizes the optoelectronic characteristics of semiconductor materials and the extended application of the technology that has been commonly used in the traditional optical field in the field of computing, and proposes a brand new optoelectronic computing device and a class of optoelectronic computing methods that can achieve high precision
- the integrated memory-calculation function a single device can both store the optical signal at the optical input and save it for a long time after the light is cut off, and can achieve a single device to complete the multiplication operation, which is very suitable for accelerating a class of needs represented by neural network algorithms. Store parameters "algorithm.
- FIG. 1 is a schematic diagram showing the basic structure of a multifunctional area of a photoelectric computing unit according to the present invention.
- FIG. 2 is a front view showing the photoelectric calculation unit according to the first embodiment of the present invention.
- FIG. 3 is a schematic perspective view showing the photoelectric calculation unit according to the first embodiment of the present invention.
- FIG. 4 is a configuration diagram showing a multi-function area of the photoelectric calculation unit according to the first embodiment of the present invention.
- FIG. 5 is an electrical model showing the photoelectric calculation unit according to the first embodiment of the present invention.
- FIG. 6 is a front view showing a photoelectric calculation unit according to a second embodiment of the present invention.
- FIG. 7 is a 3D schematic diagram showing an optoelectronic calculation unit according to a second embodiment of the present invention.
- FIG. 8 is a configuration diagram showing a multi-function area of a photoelectric calculation unit according to a second embodiment of the present invention.
- FIG. 9 is a front view showing a photoelectric calculation unit according to a third embodiment of the present invention.
- FIG. 10 is a schematic diagram showing the photoelectric calculation unit 3D according to the third embodiment of the present invention.
- FIG. 11 is a configuration diagram showing the multi-function area of the photoelectric calculation unit according to the third embodiment of the present invention.
- FIG. 12 is an electrical model showing a photoelectric calculation unit according to a third embodiment of the present invention.
- FIG. 13 is a schematic diagram showing the structure of a photoelectric calculation unit according to a fourth embodiment of the present invention.
- FIG. 14 is a schematic diagram showing the structure of the multifunctional area of the photoelectric calculation unit according to the fourth embodiment of the present invention.
- 15 is a schematic diagram showing a direct projection scheme according to the present invention.
- 16 is a schematic diagram showing the integration of the light emitting unit and the photoelectric calculation unit according to the present invention.
- FIG. 17 is a schematic diagram showing a lens light input scheme according to the present invention.
- FIG. 18 is a schematic diagram showing an optical fiber cone light input scheme according to the present invention.
- FIG. 19 is a schematic diagram showing a funnel-shaped optical fiber cone scheme according to the present invention.
- FIG. 20 is a structural diagram showing an example of a multi-control gate structure using a photoelectric calculation unit according to the present invention.
- FIG. 21 is a structural diagram showing an example of a multi-control gate structure using a photoelectric calculation unit according to the present invention.
- FIG. 22 is a structural diagram showing an example of a multi-control gate structure using a photoelectric calculation unit according to the present invention.
- FIG. 23 is a diagram showing an example of one of the adders according to the present invention.
- 24 is a diagram showing an example of one of the multipliers according to the present invention.
- FIG. 25 is a diagram showing an example of one of the multipliers according to the present invention.
- FIG. 26 is a diagram showing an example of one of the multipliers according to the present invention.
- FIG. 27 is a diagram showing an example of a vector adder according to the present invention.
- FIG. 28 is a diagram showing an example of a high-bit width multiplier according to the present invention.
- FIG. 29 is a schematic diagram showing a serial matrix vector multiplier according to the present invention.
- FIG. 30 is a schematic diagram showing the calculation of the parallel matrix vector multiplier according to the present invention.
- FIG. 31 is a schematic diagram showing a parallel matrix vector multiplier according to the present invention.
- FIG. 32 is a schematic diagram showing the convolution operation according to the present invention.
- FIG. 33 is a schematic diagram showing an array of convolution operation units according to the present invention, in which a 3 * 3 convolution kernel is targeted.
- Fig. 34 is a schematic diagram showing an ALEXnet network according to the present invention.
- FIG. 35 is a schematic diagram showing RELU function images according to the present invention.
- 36 is a schematic diagram showing an X-ray imaging method and a CT imaging method according to the present invention.
- FIG. 37 is a schematic diagram showing a CT algorithm according to the present invention, in which the ith ray passes through the jth pixel.
- Fig. 38 is a schematic diagram showing a CT algorithm according to the present invention.
- 39 is a schematic diagram showing an array of serial CT algorithm accelerators according to the present invention.
- FIG. 40 is a schematic diagram showing digital control logic according to the present invention.
- FIG. 41 is a graph showing the light response curve of the photoelectric calculation unit according to the first embodiment of the present invention.
- FIG. 42 is a schematic diagram showing an AlexNet-like network for simulation according to the present invention.
- the optoelectronic computing device unit according to the present invention includes a semiconductor multifunctional region structure, wherein the semiconductor multifunctional region structure includes a carrier control region, a coupling region, and a photogenerated carrier collection and reading Out of the zone, and the multifunctional zone can be a multi-layer structure, or any layer or zone structure that achieves the same photoelectric effect and control through the arrangement and transformation of multiple spaces.
- the photoelectric optoelectronic calculation unit according to the present invention will be described in detail with reference to the accompanying drawings.
- the photoelectric optoelectronic calculation unit according to the first embodiment of the present invention will be described with reference to FIGS. 2 to 5.
- the photoelectric optoelectronic computing unit of FIGS. 2 and 3 there is a P-type semiconductor substrate as the photogenerated carrier collection area and readout area divided into a left side collection area and a right side reading The output area, wherein the left-side collection area is used to apply a pulse with a negative voltage range on the substrate, or a pulse with a positive voltage range on the control gate, so that The depletion layer collected by the photoelectrons, and the number of collected photoelectrons is read out through the right reading area as the input amount of the light input terminal.
- the right readout area includes shallow trench isolation, N-type drain and N-type source.
- the shallow trench isolation is located in the middle of the collection region and the readout region in the middle of the semiconductor substrate.
- the shallow trench isolation is formed by etching and filling with silicon dioxide to isolate the electrical signals of the collection region and the readout region .
- the N-type source terminal is located on the side near the bottom dielectric layer in the readout area, and is formed by doping by ion implantation.
- the N-type drain end is located on the other side of the semiconductor substrate close to the bottom dielectric layer opposite to the N-type source end, and is also formed by the doping method by ion implantation.
- a positive voltage is applied to the control gate to form a conductive channel between the N-type source terminal and the N-type drain terminal of the collection region, and then a bias pulse is applied between the N-type source terminal and the N-type drain terminal.
- the voltage accelerates the electrons in the conductive channel to form a current between the source and drain.
- Carriers that form a current in the channel between the source and drain are subjected to the control gate voltage, the voltage between the source and drain, and the number of photoelectrons collected in the collection area to act as electrons that are combined by the light input and the electric input.
- the output is in the form of current, where the control gate voltage and the source-drain voltage can be used as the electrical input of the device, and the number of photoelectrons is the optical input of the device.
- the coupling region to connect the collection region and the read-out region, so that the depletion region in the collection region substrate begins to collect photoelectrons, and the surface potential of the substrate in the collection region will be collected Quantity influence;
- the surface potential of the semiconductor substrate in the readout area is affected by the surface potential of the semiconductor substrate in the collection area, which in turn affects the current between the source and drain in the readout area, so that To read the number of photoelectrons collected in the collection area;
- control gate as the carrier control region for applying a pulse voltage thereon, so that a depletion region for exciting photoelectrons is generated in the read-out region of the P-type semiconductor substrate.
- the photoelectric calculation unit includes a control gate as the carrier control region, a charge-coupled layer as the coupling region, and the photogeneration A P-type substrate for carrier collection and readout regions, and a bottom dielectric layer for isolation are provided between the P-type semiconductor substrate and the charge-coupled layer, and a top dielectric layer for isolation , Is disposed between the charge-coupled layer and the control gate.
- left side, right side, upper side, and lower side mentioned in this article only represent that the relative position observed under the viewing angle shown in the figure changes with the viewing angle, and is not to be understood as a limitation on the specific structure.
- FIG. 5 is an electrical model of the photoelectric calculation unit according to the first embodiment of the present invention, and the principle of the photoelectric calculation unit is described in detail based on the electrical model shown in FIG. 5.
- the left collecting area is equivalent to a capacitor of MOS capacitor
- the right readout area is equivalent to a standard floating gate MOS tube. Due to the design, the capacitance C2 is much smaller than C1, so the effect of the readout area on the photosensitive area when the device is working is negligible.
- the electric potential in a MOS-capacitor Si can be obtained by solving the following Poisson equation:
- ⁇ SI is the dielectric constant of silicon and ⁇ is the bulk charge density of the P-type substrate.
- the x direction is perpendicular to the downward direction of the underlying dielectric layer
- x d is the depth of the depletion region
- q is the amount of electron charge
- V is the potential at the depth x.
- E S is the surface electric field strength, assuming that the substrate voltage is set to 0V, so that the control gate potential during the photosensitive process is:
- V G is the electric potential of the control gate
- the available depletion region depth x d is:
- the total amount of charge on the control gate Q CG N A + Q, Q is the amount of signal charge (e- / cm2), because this signal charge is collected in the collection area under the action of the electric field between the control gate and the P-type substrate, and because the recombination of carriers in the semiconductor substrate requires a certain time, plus The presence of thermally excited carriers in the upper depletion zone, therefore, this signal charge will still be stored in the arithmetic unit for a long time after the light is cut off, realizing the integrated function of storage and calculation.
- V Q is the sum of the potentials generated by the signal charge:
- the channel current I d can be expressed as:
- V DS is the source-drain voltage
- V FG is the charge-coupled layer potential
- its size is affected by the control gate potential V G and the surface potential V s of the P-type substrate, which can be expressed as :
- equation (1-6) can be simplified as:
- Formula (1-11) brings into (1-3), that is, the surface potential V S of the P-type substrate and the control gate potential V G and the sum of the potentials generated by the signal charge V Q are approximately equal, that is:
- t is the exposure time
- X photon is the number of photons incident per unit time
- ⁇ is the device quantum efficiency
- the most basic structure of the photoelectric calculation unit contains only one output terminal, but if the MOSFET on the right side is divided into multiple parallel small MOSFETs with independent source and drain, and the device parameters are equal, then Expanding the number of output terminals, if the same V DS is given to the plurality of small MOSFETs, multiple identical output quantities of the photoelectric calculation unit can be obtained.
- the photoelectric calculation unit based on the solution described in the first embodiment described above will also be described later.
- the photoelectric calculation unit according to the second embodiment of the present invention will be described with reference to FIGS. 6 to 8.
- an N-type semiconductor substrate as a photogenerated carrier collection and readout area, which is divided into a left-side collection area and a right-side readout area.
- the left readout region is used to apply a pulse with a positive voltage range on the substrate, or a pulse with a negative voltage range on the control gate, so that a light hole is generated in the collection region substrate
- the depletion layer is collected, and the collected amount of photohole charge is read through the right readout region;
- the right readout region includes shallow trench isolation, a P-type drain terminal, and a P-type source terminal.
- the shallow trench isolation is located in the middle of the semiconductor substrate in the middle of the collection area and the readout area, and is formed by etching and filling with silicon dioxide to isolate the electrical signals of the collection area and the readout area.
- the P-type source terminal is located on the side near the bottom dielectric layer in the readout area, and is formed by doping by ion implantation.
- the P-type drain end is located on the other side of the semiconductor substrate near the bottom dielectric layer opposite to the P-type source end, and is also formed by an ion implantation method using a doping method.
- a negative pulse voltage is applied to the control gate to form a conductive channel between the P-type source and the P-type drain, and then a bias pulse voltage is applied between the P-type source and the P-type drain , So that the holes in the conductive channel accelerate to form a current between the source and drain.
- Carriers that form a current in the channel between the source and drain are subjected to the control gate pulse voltage, the voltage between the source and drain, and the number of light holes collected in the collection area, as a combined effect of the light input and the electrical input Carriers are output in the form of current, where the control gate voltage and the source-drain voltage can be used as the electrical input of the device, and the number of light holes is the optical input of the device.
- a charge-coupled layer as a coupling region to connect the collection region and the readout region, so that the depletion region in the collection region substrate begins to collect light holes, the surface potential of the collection region substrate will be affected by the collection region The influence of the number of light holes; through the connection of the charge-coupled layer, the surface potential of the semiconductor substrate in the readout area is affected by the surface potential of the semiconductor substrate in the readout area, which in turn affects the source-drain current in the readout area, thereby determining the readout area The current between source and drain is used to read the number of light holes collected in the collection area.
- control gate as a carrier control region for applying a negative pulse voltage thereon, so that a depletion region for exciting light holes is generated in the readout region of the N-type semiconductor substrate. It can be used as an electrical input terminal to input one bit of calculation.
- the photoelectric calculation unit includes a control gate as the carrier control region, a charge-coupled layer as the coupling region, and as the photogenerated An N-type substrate for carrier collection and readout regions, and a bottom dielectric layer for isolation are provided between the N-type semiconductor substrate and the charge-coupled layer, and a top dielectric layer for isolation , Is disposed between the charge-coupled layer and the control gate.
- this second embodiment differs in that the P-type substrate used in the device unit is replaced with an N-type, and the N-type source and drain terminals of the readout MOSFET are replaced by The P-type and other structures are unchanged. Therefore, the principle-based derivation process is similar to the process described in the first embodiment, and the similar parts are not described in detail.
- formula (1-7) points out that when the voltage difference between the control gate and the substrate is unchanged, the higher the doping concentration of the substrate before the photon is incident, the shallower the depth of the depletion region , And too shallow depletion region will cause the computing device to receive light input, the maximum number of photons that can be received is too small, the input range of the optical input terminal becomes smaller, which affects the performance of the computing unit; and, according to related theory, it is too high
- the doping concentration of the substrate will cause the thermal excitation of the carrier to become larger, which will affect the storage time of the data at the optical input end in the integrated device.
- the wafer is born with a low concentration of P-type doping, this doping can be used directly as a substrate condition for the production of P-type substrate devices; and if an N-type substrate device needs to be produced, ion implantation is required
- the first way is to make an N-well, and then make an N-type substrate device in the N-well. Therefore, compared to the N-type substrate device, the P-type substrate device is easier to obtain lower substrate doping, so in the above two embodiments, the solution described in the first embodiment is often compared to the second implementation
- the example solution has more advantages.
- the most basic structure of the photoelectric calculation unit in the second embodiment includes only one output terminal, but if the MOSFET on the right readout area is divided into a plurality of parallel independent sources and drains, In addition, the number of small MOSFETs with equal device parameters can expand the number of output terminals. If the same V DS is given to the multiple small MOSFETs, the same output of multiple channels of the photoelectric calculation unit can be obtained.
- the photoelectric calculation unit based on the solution described in the second embodiment described above will also be described later.
- the photoelectric calculation unit according to the third embodiment of the present invention will be described with reference to FIGS. 9 to 12.
- a P-type semiconductor substrate as the photogenerated carrier collection and readout area, which can undertake both photoreception and readout work, including An N-type drain and an N-type source.
- the N-type source terminal is located on the side near the bottom dielectric layer in the readout area, and is formed by doping by ion implantation.
- the N-type drain end is located on the other side of the semiconductor substrate close to the bottom dielectric layer opposite to the N-type source end, and is also formed by the doping method by ion implantation.
- a pulse with a negative voltage range is applied to the P-type semiconductor substrate, and a pulse with a positive voltage range is applied to the control gate as the carrier control region, so that A depletion layer for photoelectron collection is generated in the substrate, and the electrons generated in the depletion region are accelerated by the electric field between the control gate and the P-type substrate, and sufficient energy is obtained upon arrival.
- the control gate When receiving light, a pulse with a negative voltage range is applied to the P-type semiconductor substrate, and a pulse with a positive voltage range is applied to the control gate as the carrier control region, so that A depletion layer for photoelectron collection is generated in the substrate, and the electrons generated in the depletion region are accelerated by the electric field between the control gate and the P-type substrate, and sufficient energy is obtained upon arrival.
- the control gate Through the barrier of the underlying dielectric layer between the P-type substrate and the charge-coupled layer, it enters the charge-coupled layer and is stored there. The amount of charge in the charge-
- a pulse voltage accelerates the electrons in the conductive channel to form a current between the source and drain.
- the current between the source and drain is controlled by the pulse voltage of the control gate, the voltage between the source and drain, and the number of electrons stored in the charge-coupled layer. As the electrons that are combined by the light input and the electric input, the current takes the form of current.
- the output where the control gate voltage and the source-drain voltage can be used as the electrical input of the device, and the number of photoelectrons stored in the charge-coupled layer is the optical input of the device.
- a charge-coupled layer as the coupling region for storing photoelectrons entering it, and changing the threshold value of the device during reading, thereby affecting the current between the source and drain of the readout region, thereby determining the source-drain between the readout region
- the current is used to read out the number of photoelectrons generated when light is received and enters the charge-coupled layer.
- control gate as the carrier control region for applying a pulse voltage thereon, so that a depletion region for exciting photoelectrons is generated in the read-out region of the P-type semiconductor substrate.
- an electrical input terminal input one of the operation quantities.
- the photoelectric calculation unit includes a control gate as the carrier control region, a charge-coupled layer as the coupling region, and as the photogenerated P-type substrate for carrier collection and readout area, wherein a bottom dielectric layer for isolation is provided between the P-type semiconductor substrate and the charge-coupled layer, and a top dielectric layer for isolation , Is disposed between the charge-coupled layer and the control gate.
- FIG. 12 is an electrical model of the photoelectric calculation unit according to the third embodiment of the present invention, and the principle of the photoelectric calculation unit is described in detail based on the electrical model shown in FIG. 5.
- the structure of the photoelectric computing unit unit is roughly equivalent to the floating gate device.
- the topmost gate is the control gate, which is completely separated from the middle charge-coupled layer.
- Floating gate is the capacitance between the floating gate and the control gate, the source terminal, the substrate, and the drain terminal, respectively.
- V FG is the electric potential on the floating gate
- V CG is the electric potential on the control gate
- V S , V D , V B are the electric potential of the source end, drain end and substrate respectively.
- the coupling coefficient of the electrode J can be any one of the control gate G, the drain terminal D, the source terminal S, and the substrate B, and then the potential V FG of the floating gate can be expressed by the coupling coefficient as:
- V FG ⁇ G V GS + ⁇ D V DS + ⁇ S V S + ⁇ B V B (2-2)
- V GS and V DS are the gate-source voltage and source-drain voltage, respectively
- ⁇ G , ⁇ S , ⁇ D , and ⁇ B are the coupling coefficients of the gate, source, drain, and substrate, respectively. It can be seen that the potential of the floating gate is not only related to the control gate, but also related to the potential of the source, drain and substrate. If both the source and the substrate are grounded,
- the threshold voltage V T and the conduction coefficient ⁇ can be derived from the formula of a common MOS device:
- the floating gate potential when the device reaches the threshold is For the control gate potential when the device reaches the threshold, ⁇ CG is the body conduction coefficient for the control gate, and ⁇ FG is the body conduction coefficient for the floating gate.
- Formulas (2-3), (2-5), (2-7) are:
- V T is the same as Are directly related, while the formula of V T may be to change ⁇ V T can be expressed as:
- V T0 is the threshold when there is no charge in the floating gate.
- a gate voltage pulse is applied to the control gate of the photoelectric calculation unit shown in FIGS. 9 and 10, and a negative pulse voltage is applied to the substrate to form a depletion layer in the substrate semiconductor.
- photons representing the amount of light input enter the depletion region of the semiconductor substrate, and the Si semiconductor substrate absorbs a photon and excites an electron-hole pair.
- the photoelectron accelerates to the channel and obtains sufficiently high energy. If the energy is high enough, it can enter the charge-coupled layer under the action of the gate oxygen electric field to complete charge storage. After storing photoelectrons in the charge-coupled layer, the drain current and threshold voltage of the floating gate MOSFET will change during reading.
- ⁇ V T is the change in threshold voltage
- Q e is the amount of single electron charge
- C CG is the capacitance from the control gate to the floating gate
- N elec is the number of photoelectrons in the storage layer.
- the number of photoelectrons in the optoelectronic storage layer can be estimated by measuring the change in threshold voltage before and after exposure.
- the formula is as follows:
- W and L respectively represent the gate width and gate length of the floating gate device
- H is the thickness of the floating gate
- t IPD is the thickness between the floating gate and the gate in the device unit
- ⁇ 0 is the vacuum dielectric constant
- ⁇ 0x is Relative permittivity
- ⁇ I DS can be expressed as follows:
- the storage number of photoelectrons can also be obtained by measuring the change of the drain current in the linear region.
- drain-source current I d in the final readout area is:
- the drain-source current I d in the readout area as the readout is simultaneously affected by N as the optical input, V G and V DS as the electrical input, and naturally contains multiplication
- the operation relationship described above can be used to design an operation device that can realize various functions.
- the biggest difference of the third embodiment lies in: because the storage carrier photoelectron in the device unit of the optical input of this solution is stored in the isolated charge-coupled layer, it has Very long holding time, up to 10 years, and the optical input signals of the solutions described in the first and second embodiments above can only be maintained for seconds, so as an integrated storage and computing device, it has a greater Advantage.
- the most basic structure of the photoelectric calculation unit includes only one output terminal, but if the substrate under the charge-coupled layer is divided into a plurality of parallel independent sources and drains, In addition, the number of small MOSFETs with equal device parameters can expand the number of output terminals. If the same V DS is given to the multiple small MOSFETs, the same output of multiple channels of the photoelectric calculation unit can be obtained.
- the photoelectric calculation unit based on the solution described in the second embodiment described above will also be described later.
- the photoelectric calculation unit according to the fourth embodiment of the present invention will be described with reference to FIGS. 13 and 14.
- FIG. 13 there is a photodiode and readout tube as the photogenerated carrier collection and readout area, where the photodiode is formed by ion doping and is responsible for light sensitivity.
- the N area of the photodiode is connected to the control gate of the readout tube and the source end of the reset tube through the optoelectronic coupling lead as the coupling area, and a positive voltage pulse is applied to the drain end of the readout tube as the readout current Driving voltage; before exposure, the reset tube is opened, and the drain voltage of the reset tube is applied to the photodiode, so that the photodiode as the collection area is in a reverse bias state, and a depletion layer is generated; during exposure, the reset tube is turned off, The photodiode is electrically isolated.
- the control gate potential begins to decrease, which in turn affects the electron concentration in the channel of the readout tube.
- the read tube is responsible for reading, and a positive pulse voltage is applied to its drain end, and the source end is connected to the drain end of the address tube. During reading, the address tube is opened, and a current is generated in the read tube.
- the drain voltage of the reset tube, the drain voltage of the read tube and the number of incident photons affect the electrons in the channel of the read tube as the electrons that are combined by the light input and the electric input, and are output in the form of current, in which reset
- the tube drain voltage and the read tube drain voltage can be used as the electrical input of the device, and the number of incident photons is the optical input of the device.
- an optoelectronic coupling lead as a coupling region for connecting the photodiode as a collection region in the photogenerated carrier collection and readout region and the readout tube as a readout region, applying the photodiode N region potential to the readout Tube control grid.
- a reset tube as the so-called carrier control area.
- a positive voltage is applied to the photodiode through its drain terminal.
- the reset tube When the reset tube is turned on, the positive voltage will act on the photodiode, causing the photodiode to generate
- the depletion area is sensitive and can also be used as an electrical input terminal to input one bit of calculation.
- the addressing tube is used to control the output of the entire arithmetic device as the output current of the output.
- the photoelectric calculation unit includes a reset tube as the carrier control area, an optoelectronic coupling lead as the coupling area, and as the photogenerated load
- the photodiode and the readout tube in the flow collection and readout area in addition, also include an addressing tube, which is used to select the rows and columns when the photoelectric computing units are arrayed.
- the drain end of the reset tube is connected to the power supply, and the power supply voltage V d1 .
- a high voltage is applied to the gate of the reset tube to turn on the reset tube, and the power supply voltage V d1 is applied to the photodiode.
- the anode of the photodiode is grounded, and the voltage V PD across the photodiode is:
- V PD V d1 (3-1)
- K is a constant related to the diode parameters
- V bi is the built-in electric field
- m j depends on whether the diode type is abrupt junction or slow bias junction.
- R ph is the sensitivity of the photodiode
- L 0 is the cross-sectional area
- A is the light intensity. Because the photodiode is isolated, the photoelectrons will accumulate in the depletion region, which has the following ordinary differential equation:
- m j is a constant. It can be seen that the voltage across the photodiode gradually decreases with the increase in the number of incident photons. When the conventional parameters of the photodiode are brought into it, the curve of the voltage drop between the two ends with time has good linearity. Simplify the above formula to:
- V (t) (V d1 ) -K * X photo (3-5)
- X photo is the number of incident photons representing the amount of light input
- K is the slope of the fitted straight line.
- V ′ T is the threshold of the read tube itself
- V d2 is the voltage between the drain and the source of the read tube
- ⁇ is the channel mobility
- W and L are the gate width and gate length, respectively.
- the source leakage current I d of the readout area as the readout amount is simultaneously affected by X photo as the light input amount and V d1 and V d2 as the electric input amount, and is naturally generated
- the operation relationship including multiplication and addition can be used to design an operation device that can realize various functions.
- the biggest difference of the optoelectronic operation device unit described in the fourth embodiment is that the unit area is larger, and it needs one photodiode and three MOS tubes to achieve it, and the integration level is low.
- the most basic structure of the optoelectronic computing unit contains only one output terminal, but if one readout tube is expanded to multiple readouts with multiple gates connected, all device parameters are equal
- the number of output tubes can be expanded by adding the same number of addressing tubes. If the same V DS is given to the multiple readout tubes, the same output of multiple channels of the photoelectric calculation unit can be obtained.
- the photoelectric calculation unit based on the solution described in the second embodiment described above will also be described later.
- the photons entering the photoelectric calculation unit may come from the light emitting unit optically corresponding to the photoelectric calculation unit, or may come from other light sources, such as natural light sources or scenes of objects.
- the photons entering the photoelectric calculation unit may come from the light emitting unit optically corresponding to the photoelectric calculation unit, or may come from other light sources, such as natural light sources or scenes of objects.
- a combination scheme of a light-emitting unit array and an optoelectronic computing unit array which includes one or more light-emitting units and one or more optoelectronic computing units.
- the optoelectronic computing unit and the light-emitting unit are optically One-to-one correspondence to achieve accurate light input to a single optoelectronic computing unit in the array.
- the light-emitting unit array can be implemented using a high-density small pixel LED array.
- the light-emitting unit and the calculation unit correspond optically, that is, the light emitted by the light-emitting unit is accurately irradiated to the calculation unit corresponding to the light-emitting unit.
- the light emitted by the light emitting unit is irradiated to the calculation unit. If a light emitting unit such as 10 * 10 is used to form a light emitting array and the same number of calculation units are used to form a calculation array, the light emitted by each light emitting unit in the light emitting array needs to be calculated according to specific calculations According to the demand, the corresponding one or more calculation units are accurately irradiated. If the calculation function realized by this array is matrix vector multiplication, the light emitted by each light-emitting unit is required to be accurately irradiated to each calculation unit. This precise light input can be achieved through the following four preferred embodiments:
- one method is to directly make the light-emitting unit array close to the surface of the device array.
- the light-emitting array uses a small pixel LED screen, as shown in FIG. 15.
- the ideal light-emitting unit emits a spherical wave.
- the distance is close enough, it can be considered that the light emitted by the light-emitting unit is only transmitted to the surface of the device directly below it, thus achieving a one-to-one correspondence between the light source and the device.
- the optical structure between the luminous array and the computing array that realizes the focusing function can be a lens.
- the most common solution is to use a lens.
- the optical one-to-one correspondence between the light-emitting unit and the photoelectric calculation unit can also be achieved in this way, as shown in FIG. 17.
- the optical structure that realizes the focusing function between the light-emitting array and the computing array may also be an optical fiber cone.
- the fiber cone is a microstructure that can realize one-to-one correspondence between the light-emitting unit and the photoelectric calculation unit, and its function is similar to that of optical fiber.
- the optical fiber can be understood as an optical fiber array formed by multiple strands of dense light. If a fiber cone array is used to link the light emitting unit and the optoelectronic computing unit, a one-to-one correspondence between the light emitting unit and the optoelectronic computing unit can be well achieved.
- the general structure is shown in the figure. 18.
- the transfer function is high, and the one-to-one correspondence of the fiber cone can effectively suppress the optical crosstalk.
- the size of a single photoelectric calculation unit will be as small as possible while taking into account other indicators, while the size of the LED pixel is currently only about 8um.
- a funnel-shaped optical fiber cone can be used to connect the two units. The general structure is shown in FIG. 19.
- the driving of the light-emitting unit is controlled by the light input control part in the digital control system.
- the light-emitting unit is driven by a constant current sent by the driver, keeping the light intensity unchanged, and adjusting the light-emitting time to realize the input of different amounts of light input. If there is only one calculation unit and one light-emitting unit, the light input control part converts the data required to be input into the calculation unit through the light input into the pulse width of the light-emitting unit's light-emitting time, depending on the type of calculation unit used, for example, if Using the specific calculation unit as described in the first embodiment above, the larger the light input amount, the shorter the light emission duration of the driven light emitting unit should be.
- a single photoelectric calculation unit can implement addition or multiplication operations. If multiple photoelectric calculations are combined into an array, the above light-emitting units corresponding to the photoelectric calculation unit are also formed into an array , You can complete one or more sets of addition or multiplication operations. At the same time, through the connection of the leads, if the output ends of the two photoelectric calculation units are connected to make the output current converge into a current, it is equivalent to the realization of one addition .
- the connection of the lead and the arrangement of the photoelectric calculation unit can be changed to create a calculation array that implements specific operations.
- the present invention proposes a variety of specific embodiments of photoelectric calculation devices and photoelectric calculation methods.
- the light-emitting unit and the photoelectric calculation unit described above including the preferred first to fourth embodiments
- two Addition of bit addends are used.
- the biggest advantage of the adder according to the present invention is that it only needs a single photoelectric calculation unit and a light-emitting unit to realize the addition operation of two addends, and the integration is relatively high.
- the number of output terminals of this adder depends on the number of output terminals of the photoelectric calculation unit used, for example, if the photoelectric calculation unit with two output terminals described above is used, the adder also has two output terminals As an example, the photoelectric calculation unit with one output terminal is adopted by default in the four schemes described in detail below.
- V G is the voltage on the control gate representing the carrier control region
- t is the voltage on the control gate representing the carrier control region
- ⁇ is the voltage on the control gate representing the carrier control region
- q is the relationship between X photo and V G .
- X photo represents the input quantity of the optical input terminal, which is the first addend
- V G represents the input quantity of the electrical input terminal, which is the second addend
- the output current I d in the readout area of the carrier collection and readout area is the result of the addition operation, which is equivalent to the calculation described in (3-1-2):
- a, b, k and c are constants.
- the biggest difference of the solution 2 based on the photoelectric calculation unit of the second embodiment is that the P-substrate device is replaced by an N-substrate device, so it serves as the control gate of the carrier control region
- the applied voltage on the pole changed from positive to negative.
- the voltage applied to the N-type substrate as the carrier collection and readout area during exposure changed from negative to positive, but the control gate voltage and incidence
- the number of photons is still a pair of addition and subtraction. Therefore, it is only necessary to make a few changes when modulating the optical input signal and the electrical input signal, and an addition operation that is substantially similar to the solution of the first embodiment can still be achieved.
- N elec is the number of electrons entering the charge-coupled layer as the coupling region
- V G is the voltage on the control gate as the carrier control region
- N elec represents the input quantity at the optical input terminal, which is the first addend
- V G represents the input quantity at the electrical input terminal, which is the second addend
- the voltage between the drain and the source in the readout area of the carrier collection and readout area V DS to be a constant value, the carrier-collection area and the read area is read out of the output current I D is the result of the addition operation, i.e., were calculated as equal to (3-3-2) according to the formula:
- a, b, k and c are constants.
- V d1 is the drain voltage of the reset tube as the carrier control area
- K is the slope of the fitted straight line
- V d2 is the source drain of the read tube Time voltage.
- X photo represents the input quantity of the optical input terminal, which is the first addend;
- V d1 represents the input quantity of the electrical input terminal, which is the second addend; at the same time, it serves as a readout tube in the readout area of the carrier collection and readout area If the drain terminal voltage V DS is added to a constant value, then the output current I d in the readout area of the carrier collection and readout area is the result of the addition operation, which is equal to the process described in (3-4-2) The calculation:
- a, b, k and c are constants.
- the optical input data has storage characteristics, which can be stored in the device for a long time after the light is cut off, and there is no need to re-enter the optical input during the next operation.
- the present invention proposes a variety of specific implementation schemes of photoelectric calculation devices and photoelectric calculation methods.
- a light-emitting unit and a multi-control area photoelectric calculation unit By using a light-emitting unit and a multi-control area photoelectric calculation unit, at least two-digit addition operation is realized.
- the biggest advantage of this adder is that it not only needs a single photoelectric calculation unit to realize the addition operation, and the number of input addends is not limited to two, but it needs the support of the process, especially the use of the above first, second,
- the multi-control gate parameters must have high uniformity.
- the number of output terminals of this adder depends on the number of output terminals of the photoelectric calculation unit used. For example, if the above photoelectric calculation unit with two output terminals is used, the adder also has two output terminals. Among the four schemes described, the photoelectric calculation unit with one output terminal is taken by default as an example.
- V G1 to V Gn represent the input voltages on the n control gates respectively, which are the electrical input quantities of multiple electrical input terminals, and k 1 to k n are the multi-gate input weights related to the area of the n control gates, respectively. It is not difficult to see from (4-1-2) that the voltage on each control gate and the amount of light input X photo are naturally a relationship of addition and subtraction, so the modulation of X photo and V G1 to V Gn can be used Addition.
- X photo represents the input quantity of the optical input terminal, which is the first addend;
- V G1 to V Gn represent the multiple input quantities of the electrical input terminal, which is the second to nth addend; at the same time, the carrier collection and readout area If the voltage V DS between the drain and the source of the readout area is given a constant value, the output current I d of the carrier collection and readout area in the readout area is the result of the addition operation, which is equal to (3-1-2 )
- a, b, k and c are constants.
- Option 2 the biggest difference between Option 2 is that the P substrate device is replaced by an N substrate device, so the voltage applied to the multi-gate control gate as the carrier control region changes from a positive voltage For negative pressure, the voltage applied to the N-type substrate as the carrier collection and readout area during exposure changes from negative pressure to positive pressure, but the multiple voltages of the multi-gate and the number of incident photons are still one
- it is only necessary to make some changes in the modulation of the optical input signal and the electrical input signal and it is still possible to implement the addition operation of multiple addends that are generally similar to Scheme 1.
- V G1 to V Gn represent the input voltages on the n control gates respectively, which are the electrical input quantities of multiple electrical input terminals, and k 1 to k n are the multi-gate input weights related to the area of the n control gates, respectively. It is not difficult to see from (4-2-1) that the voltage on each control gate and the amount of charge N that the photoelectrons enter into the charge coupling layer representing the coupling region are inherently a relationship of addition and subtraction. For the modulation of G1 to V Gn , the above relationship is used for addition.
- N represents the input quantity of the optical input terminal, which is the first addend
- V G1 to V Gn represent the multiple input quantities of the electrical input terminal, which is the second to nth addend;
- the carrier collection and reading area read be a constant value of the voltage V DS between the drain-source region, the carrier collection region and a readout output current I D read-out zone is the result of operation, as carried out it is equal to (4-3-2)
- a, b, k and c are constants.
- V d1 to V dn represent the voltages of the drain terminals of n reset tubes, respectively, and are the electrical input quantities of multiple electrical input terminals
- k 1 to k n are the multi-gate input weights related to the channel resistance of the n reset tubes, respectively. It is not difficult to see from (4-4-1) that the voltage at the drain end of each reset tube and the light input amount X photo are naturally in the relationship of addition and subtraction, so the modulation of X photo and V d1 to V dn can be used to Addition.
- X photo represents the input quantity of the optical input terminal, which is the first addend;
- V d1 to V dn represent the multiple input quantities of the electrical input terminal, which is the second to nth addend;
- the carrier collection and readout area drain-source region of the read-out value to be a constant voltage V DS, and the results of the read-collection region of the read-out zone is the output current I D then adding the carrier, i.e. made equal to (4-4-2 )
- a, b, k and c are constants.
- the optical input data has storage characteristics, which can be stored in the device for a long time after the light is cut off. There is no need to re-enter the optical input for the next calculation.
- the present invention proposes a variety of specific implementation schemes of photoelectric calculation devices and photoelectric calculation methods.
- the addition of at least two digits is realized Operation.
- the biggest advantage of the scheme of this adder is that it uses the characteristics of high optical input accuracy.
- An optoelectronic computing unit only undertakes the input of one optical signal, and the electrical signal is only given a constant value, which is conducive to the improvement of the calculation uniformity; in addition, if there is Fixed calculation errors similar to fixed image noise or device uniformity can also be corrected by changing the constant value at the electrical input.
- the number of output terminals of this adder depends on the number of output terminals of the photoelectric calculation unit used. For example, if the photoelectric calculation unit with two output terminals described above is used, the adder also has two output terminals. In the following detailed description, a photoelectric calculation unit with an output terminal is taken as an example by default.
- X photon is the number of effective photons incident on the photoelectric calculation unit. If the output ends of two units with the same parameters are connected in parallel, the output current is converged and the two photoelectric calculation units are given different light input amounts X photo , But given the same electrical input terminals V G and V DS , the above formula is changed to:
- X photo1 and X photo2 are the input quantities of the optical input terminals of the two units whose output terminals are connected in parallel. It is not difficult to see from (5-1-2) that the data of the optical input terminals of the two units is naturally a relationship of addition and subtraction. Therefore, by modulating X photo1 and X photo2 , addition can be performed using the above relationship.
- X photo1 and X photo2 represent the first and second digit addends of the optical input terminal, respectively, while the control grid V G of the two cell units as the carrier control region and the carrier collection and readout region be the drain-source voltage of the read-out zone V DS constant value, then the total output current I D through the aggregated total, and then after the AD conversion, into the control system, to obtain the result of the addition, carried out as equal to (5 -1-3)
- control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- the biggest difference compared to the above scheme 1 is that the P substrate device is replaced by an N substrate device, so the voltage applied to the control gate of the carrier control region is changed from positive pressure It becomes a negative pressure, and the voltage applied to the N-type substrate as the carrier collection and readout area during exposure changes from a negative pressure to a positive pressure, but the data of the optical input terminals of multiple units connected in parallel is still.
- the relationship of a pair of addition and subtraction so only need to make some changes when the optical input signal and the electrical input constant value modulation, can still achieve the addition operation of multiple addends that are generally similar to Scheme 1.
- two photoelectric calculation units and light-emitting units are used, as shown in FIG. 23, where a box unit marked with a V character represents a photoelectric calculation unit adopting this scheme 3. .
- N is the photoelectron that enters the charge-coupled layer as the coupling region.
- N 1 and N 2 are the input quantities of the optical input terminals of the two output terminals connected in parallel. It is not difficult to see from (5-3-2) that the data of the optical input terminals of the two units is naturally a relationship of addition and subtraction. Therefore, by modulating N 1 and N 2 , addition can be performed using the above relationship.
- N 1 and N 2 respectively represent the first and second addends of the optical input terminal, and at the same time, the two cells of the control gate V G as the carrier control area and the carrier collection and readout area be the drain-source voltage of the read-out zone V DS constant value, then the total output current I D through the aggregated total, and then after the AD conversion, into the control system, to obtain the result of the addition, carried out as equal to (5 -3-3)
- control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- X photo is the photoelectrons collected in the photodiode as the readout area in the carrier collection and readout area, if the output ends of two units with the same various parameters are connected in parallel, the output current will converge , And give two photoelectric computing units different light input X photo , but given the same electrical input terminal input V d1 and V d2 , the above formula is changed to:
- X photo1 and X photo2 represent the first and second digit addends of the optical input terminal, respectively, and the drain voltage V d1 of the reset tube as the carrier control area of the two units, and as the carrier collection and reading drain voltage V d2 is read out of the tube in the region of the read-out zone to be constant, then the total output current I D through the aggregated total, and then after AD conversion, into the control system, to obtain the result of the addition, It is equal to the calculation as described in (5-4-3):
- control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- the number of addends can be freely selected.
- the optical input data has storage characteristics, which can be stored in the device for a long time after the light is cut off, and the optical input does not need to be re-entered in the next calculation.
- the present invention proposes specific implementation schemes of various photoelectric calculation devices and photoelectric calculation methods.
- a light-emitting unit and the photoelectric calculation unit described in the above embodiment a multiplication operation of a two-digit multiplier is realized.
- the biggest advantage of the solution of this multiplier is its high degree of integration.
- a single device can achieve multiplication, but it only supports multiplication of two inputs, and dual analog inputs have limited calculation accuracy.
- the number of output terminals of this multiplier depends on the number of output terminals of the photoelectric calculation unit used, for example, if the photoelectric calculation unit described in the above embodiment having two output terminals is used, the multiplier also has two outputs In the detailed description below, the photoelectric calculation unit with an output terminal is adopted by default.
- the source-drain output current meets the formula:
- V DS is the drain voltage of the P-type substrate representing the carrier collection and output regions, because the relationship between them is inherently multiplicative Therefore, by modulating X photo and V DS , the relationship between the two can be used to perform multiplication:
- X photo represents the input quantity of the light input terminal, which is the first multiplier
- V DS represents the input quantity of the electrical input terminal, which is the second multiplier
- the control gate voltage V G of the carrier control region is given a constant value
- I is the output current carrier collection results and a readout of the read-out zone in the region D of the multiplication, i.e. equivalent was calculated as (6-1-2) according to the formula:
- a, b, and k are constants.
- the second scheme the biggest difference compared to the first scheme is that the P substrate device is replaced by an N substrate device, so the voltage applied to the control gate of the carrier control region is changed from positive pressure Becomes a negative pressure, the voltage applied to the N-type substrate as the carrier collection and readout area during exposure changes from a negative pressure to a positive pressure, but the drain voltage of the carrier readout area and the number of incident photons It is still a relationship of a pair of multiplications, so you only need to change the control gate voltage and the N-type substrate voltage, and you can still achieve a multiplication operation that is similar to the first scheme.
- the source-drain output current meets the formula:
- N elec is the photoelectron collected in the charge-coupled layer as the coupling region
- V DS is the drain voltage of the P-type substrate representing the carrier collection and output regions
- N elec represents the input quantity of the optical input terminal, which is the first multiplier
- V DS represents the input quantity of the electrical input terminal, which is the second multiplier
- the control gate voltage V G of the carrier control region is given a constant value, then I is the output current carrier collection results and a readout of the read-out zone in the region D of the multiplication, i.e. equivalent was calculated as (6-3-2) according to the formula:
- a, b, and k are constants.
- the source-drain output current meets the formula:
- X photo represents the input quantity of the optical input terminal, which is the first multiplier
- V d2 represents the input quantity of the electrical input terminal, which is the second multiplier
- the reset terminal drain voltage V d1 of the carrier control area is constant
- output current I D is the result of the multiplication of the carrier collection region and read out read-out zone, i.e., were calculated as equal to (6-4-3) according to the formula:
- a, b, and k are constants.
- the optical input data has storage characteristics, which can be stored in the device for a long time after the light is cut off, and there is no need to re-enter the optical input during the next operation.
- the present invention proposes specific implementation schemes of various photoelectric calculation devices and photoelectric calculation methods.
- a light-emitting unit and the photoelectric calculation unit described in the above embodiment a multiplication operation of a two-digit multiplier is realized.
- the biggest advantage of this multiplier solution is that the electrical input is changed to a digital serial input, which has a high calculation accuracy.
- the disadvantage is that the serial input and output of data affect the calculation speed, and the control system needs to participate in auxiliary operations.
- the number of output terminals of this multiplier depends on the number of output terminals of the photoelectric calculation unit used, for example, if the photoelectric calculation unit described in the above embodiment having two output terminals is used, the multiplier also has two outputs In the detailed description below, the photoelectric calculation unit with an output terminal is adopted by default.
- FIG. 24 the calculation diagram is shown in Figure 24.
- the box unit marked with the V character in the figure represents the photoelectric calculation unit using the first scheme, where A is input through the electrical input terminal and W is through the light Input input.
- A A 0 A 1 A 2 ?? A m-1 (7-1-1)
- m depends on the bit width of the electrical input data.
- the binary data of A is bit-wise, and the n binary data are serially input into the control gate as the carrier control region in the form of a modulated voltage.
- the source-drain output current meets the formula:
- X photo represents the input quantity of the optical input terminal, which is the first multiplier;
- the serial input V G represents the input quantity of the electrical input terminal, which is the binarized data of the second multiplier; at the same time, it is collected and read out as a carrier
- the source-drain voltage V DS of the P-type substrate in the region is given a constant value.
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- the second scheme the biggest difference compared to the first scheme is that the P substrate device is replaced by an N substrate device, so the voltage applied to the control gate of the carrier control region is changed from positive pressure Becomes a negative pressure, the voltage applied to the N-type substrate as the carrier collection and readout area during exposure changes from a negative pressure to a positive pressure, but on the control gate of the carrier control area
- the output binarization voltage and the number of incident photons are still a pair of multiplications. Therefore, only a slight change is needed to the control gate voltage and the N-type substrate voltage, and the multiplication operation similar to the first scheme can still be achieved.
- FIG. 24 The box unit marked with the V character in the figure represents the photoelectric calculation unit adopting the third scheme, where A is input through the electrical input terminal and W is through the optical Input input.
- A A 0 A 1 A 2 ?? A m-1 (7-3-1)
- m depends on the bit width of the electrical input data.
- the binary data of A is bit-wise, and the n binary data are serially input into the control gate as the carrier control region in the form of a modulated voltage.
- the source-drain output current meets the formula:
- N elec is the number of photoelectrons in the charge-coupled layer
- V G is the voltage representing the control gate.
- N elec represents the input quantity of the optical input terminal, which is the first multiplier;
- the serial input V G represents the input quantity of the electrical input terminal, which is the binarized data of the second multiplier; at the same time, it is collected and read out as a carrier
- the source-drain voltage V DS of the P-type substrate in the region is given a constant value.
- R kW (A 0 * 2 0 + A 1 * 2 1 + A 2 * 2 2 +... + A m * 2 m-1 + a) (7-3-3)
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- the box unit marked with the V character in the figure represents the photoelectric calculation unit adopting the fourth scheme, in which A is input through the electrical input terminal, W is through the light Input input.
- A A 0 A 1 A 2 ?? A m-1 (7-4-1)
- m depends on the bit width of the electrical input data.
- the binary data of A is bitwise, and the n binary data are serially input in the form of a modulated voltage to the drain end of the reset tube as the carrier control area .
- the source-drain output current meets the formula:
- V d1 is the voltage representing the control gate, when the control gate input binarization data is 0, it is equal to whether such that the input data X photo optical input terminal is equal to the number, it is sufficient such that the output current I D 0 is equal to the voltage value; when the control gate input binary data is 1, i.e. equal to the input voltage of the constant control gate.
- V d1 Because when V d1 is 0, there is no conductive channel in the readout tube, the current is 0, and the output result is 0, which is consistent with the multiplication result of the electrical input data 0 and the optical input data X photo ; when V d1 is equal to a When the voltage of the channel generated in the output tube, if V d2 is also given a constant value, the output result only depends on the optical input data X photo , and the output result still conforms to the constant value 1 and the result of the optical input input X photo multiplication.
- X photo represents the input quantity of the optical input terminal, which is the first multiplier;
- the serial input V d1 represents the input quantity of the electrical input terminal, which is the binarized data of the second multiplier; at the same time, it is collected and read out as a carrier drain voltage V d2 readout tube area reading zone given a constant value, as the serial input V d1, carrier collection region and a serial read output of the read output current I D through the region
- AD conversion it is sent to the control system, and the shift and accumulation operations are performed according to the bits input from the electrical input terminal in the control system, and the operation result of multiplication A * W can be obtained. That is equal to the calculation as described in (7-4-3):
- R kW (A 0 * 2 0 + A 1 * 2 1 + A 2 * 2 2 +... + A m * 2 m-1 + a) (7-4-3)
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- the optical input data has storage characteristics, which can be stored in the device for a long time after the light is cut off, and there is no need to re-enter the optical input during the next operation.
- the present invention proposes specific implementation schemes of various photoelectric calculation devices and photoelectric calculation methods.
- a multiplication operation of a two-digit multiplier is realized.
- the biggest advantage of this multiplier solution is that the electrical input is changed to a digital parallel input, which has a higher calculation accuracy and a higher calculation speed than the second multiplier mentioned above; the disadvantage is that the parallel input of data needs more There are many photoelectric calculation units, and the control system needs to participate in auxiliary operations.
- the number of output terminals of this multiplier depends on the number of output terminals of the photoelectric calculation unit used, for example, if the photoelectric calculation unit described in the above embodiment having two output terminals is used, the multiplier also has two outputs In the detailed description below, the photoelectric calculation unit with an output terminal is adopted by default.
- a box unit marked with a V character in the figure represents a photoelectric calculation unit adopting the first scheme, in which A is input through an electrical input terminal, W is input through the optical input terminal.
- A A 0 A 1 A 2 ?? A m-1 (8-1-1)
- m is equivalent to the number of units used, depending on the bit width of the data at the electrical input.
- the source-drain output current meets the formula:
- X photo represents the input quantity of the optical input terminal, which is the first multiplier;
- V G represents the input quantity of the electrical input terminal, which is the binarized data of the second multiplier; at the same time, it is used as the carrier collection and reading area
- the source-drain voltage V DS of the P-type substrate is given a constant value.
- the parallel output carrier collection and the output current I D in the read-out area of the read-out area undergo AD conversion before sending.
- the control system in the control system according to the bit input at the electrical input terminal to shift and accumulate operations, you can get the multiplication A * W operation results. That is equal to the calculation as described in (8-1-3):
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- the second scheme the biggest difference compared to the first scheme is that the P substrate device is replaced by an N substrate device, so the voltage applied to the control gate of the carrier control region is changed from positive pressure Becomes a negative pressure, the voltage applied to the N-type substrate as the carrier collection and readout area during exposure changes from a negative pressure to a positive pressure, but on the control gate of the carrier control area
- the output binarization voltage and the number of incident photons are still a pair of multiplications. Therefore, only a slight change is needed to the control gate voltage and the N-type substrate voltage, and the multiplication operation similar to the first scheme can still be achieved.
- a box unit marked with a V character in the figure represents an optoelectronic calculation unit adopting the third scheme, where A is input through the electrical input terminal. W is input through the optical input terminal.
- A A 0 A 1 A 2 ?? A m-1 (8-3-1)
- m is equivalent to the number of units used, depending on the bit width of the data at the electrical input.
- the source-drain output current meets the formula:
- N elec is the number of photoelectrons in the charge-coupled layer
- V G is the voltage representing the control gate.
- N elec represents the input quantity of the optical input terminal, which is the first multiplier
- the parallel input V G represents the input quantity of the electrical input terminal, which is the binarized data of the second multiplier; at the same time, it is used as the carrier collection and reading area
- the source-drain voltage V DS of the P-type substrate is given a constant value.
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- a box unit marked with a V character in the figure represents a photoelectric calculation unit using the fourth scheme, where A is input through the electrical input terminal. W is input through the optical input terminal.
- A A 0 A 1 A 2 ?? A m-1 (8-4-1)
- m is equivalent to the number of units used, depending on the bit width of the data at the electrical input.
- the source-drain output current meets the formula:
- V d1 is the voltage representing the control gate, when the control gate input binarization data is 0, it is equal to whether such that the input data X photo optical input terminal is equal to the number, it is sufficient such that the output current I D 0 is equal to the voltage value; when the control gate input binary data is 1, i.e. equal to the input voltage of the constant control gate.
- V d1 Because when V d1 is 0, there is no conductive channel in the readout tube, the current is 0, and the output result is 0, which is consistent with the multiplication result of the electrical input data 0 and the optical input data X photo ; when V d1 is equal to a When the voltage of the channel generated in the output tube, if V d2 is also given a constant value, the output result only depends on the optical input data X photo , and the output result still conforms to the constant value 1 and the result of the optical input input X photo multiplication.
- X photo represents the input quantity of the optical input terminal, which is the first multiplier;
- the parallel input V d1 represents the input quantity of the electrical input terminal, which is the binarized data of the second multiplier; at the same time, it serves as the carrier collection and readout area
- the drain voltage V d2 of the read tube in the middle read zone is given a constant value.
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- the optical input data has storage characteristics, which can be stored in the device for a long time after the light is cut off, and there is no need to re-enter the optical input during the next operation.
- the present invention proposes specific implementation schemes of various photoelectric calculation devices and photoelectric calculation methods.
- a multiplication operation of a two-digit multiplier is realized.
- the biggest advantage of this multiplier solution is that it implements the bit weight participation operation.
- the disadvantage is that it is essentially the multiplication of two analog inputs. The accuracy will be lower than the above second and third multiplier solutions.
- the number of output terminals of this multiplier depends on the number of output terminals of the photoelectric calculation unit used, for example, if the photoelectric calculation unit described in the above embodiment having two output terminals is used, the multiplier also has two outputs In the detailed description below, the photoelectric calculation unit with an output terminal is adopted by default.
- FIG. 26 A box unit marked with a V character in the figure represents a photoelectric calculation unit adopting the first scheme, in which A is input through an electrical input terminal, W is input through the optical input terminal.
- A A 0 A 1 A 2 ?? A m-1 (9-1-1)
- m is equivalent to the number of units used, depending on the bit width of the data at the electrical input.
- the source-drain output current meets the formula:
- V G is the voltage representing the control gate
- V DS is the source-drain voltage of the P-type substrate as the carrier control and output region.
- V DS and V G and X photo are inherently multiplied. Therefore, the binarized data on the control gate of the P-type substrate between the source and drain input and the parallel input corresponds to
- the bit weights corresponding to the bits 2 0 , 2 1 , 2 2 ... 2 m-1 are equivalent to the shift operation, and then the accumulation operation is completed directly by the current convergence method, without the need to control the system operation. Complete a complete multiplication operation.
- X photo represents the input quantity of the optical input terminal, which is the first multiplier
- the parallel input V G represents the input quantity of the electrical input terminal, which is the binarized data of the second multiplier; at the same time, it is used as the carrier collection and reading area
- the source-drain voltage V DS of the P-type substrate gives a modulated voltage equivalent to the corresponding bit weight of the binarized data.
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- the second scheme the biggest difference compared to the first scheme is that the P substrate device is replaced by an N substrate device, so the voltage applied to the control gate of the carrier control region is changed from positive pressure Becomes a negative pressure, the voltage applied to the N-type substrate as the carrier collection and readout area during exposure changes from a negative pressure to a positive pressure, but on the control gate of the carrier control area
- the output binarization voltage and the number of incident photons are still a pair of multiplications. Therefore, only a slight change is needed to the control gate voltage and the N-type substrate voltage, and the multiplication operation similar to the first scheme can still be achieved.
- a box unit marked with a V character in the figure represents an optoelectronic calculation unit adopting the third scheme, where A is input through the electrical input terminal, W is input through the optical input terminal.
- m is equivalent to the number of units used, depending on the bit width of the data at the electrical input.
- the source-drain output current meets the formula:
- N elec is the number of photoelectrons in the charge-coupled layer
- V G is the voltage representing the control gate.
- V DS and V G and N elec are inherently multiplied. Therefore, the binarized data on the control gate of the P-type substrate between the source and drain input and the parallel input corresponds to
- the bit weights corresponding to the bits 2 0 , 2 1 , 2 2 ... 2 m-1 are equivalent to the shift operation, and then the accumulation operation is completed directly by the current convergence method, without the need to control the system operation. Complete a complete multiplication operation.
- N elec represents the input quantity of the optical input terminal, which is the first multiplier
- the parallel input V G represents the input quantity of the electrical input terminal, which is the binarized data of the second multiplier; at the same time, it is used as the carrier collection and reading area
- the source-drain voltage V DS of the P-type substrate gives a modulated voltage equivalent to the corresponding bit weight of the binarized data.
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- FIG. 26 A box unit marked with a V character in the figure represents a photoelectric calculation unit adopting the fourth scheme, in which A is input through an electrical input terminal, W is input through the optical input terminal.
- m is equivalent to the number of units used, depending on the bit width of the data at the electrical input.
- the source-drain output current meets the formula:
- V d1 is the voltage representing the control gate, when the control gate input binarization data is 0, it is equal to whether such that the input data X photo optical input terminal is equal to the number, it is sufficient such that the output current I D 0 is equal to the voltage value; when the control gate input binary data is 1, i.e. equal to the input voltage of the constant control gate.
- V d1 Because when V d1 is 0, there is no conductive channel in the readout tube, the current is 0, and the output result is 0, which is consistent with the multiplication result of the electrical input data 0 and the optical input data X photo ; when V d1 is equal to a When the voltage of the channel generated in the output tube, if V d2 is also given a constant value, the output result only depends on the optical input data X photo , and the output result still conforms to the constant value 1 and the result of the optical input input X photo multiplication.
- V DS and V G and X photo are naturally multiplied, so the bit corresponding to the binarized data on the control gate of the drain input of the readout tube and the parallel input Bit weight 2 0 , 2 1 , 2 2 ... 2 m-1 corresponding to the bit, which is equivalent to performing a shift operation, and then the accumulation operation is completed directly by current convergence, which can be completed without the control system operation.
- a complete multiplication operation
- X photo represents the input quantity of the optical input terminal, which is the first multiplier;
- V d1 represents the input quantity of the electrical input terminal, which is the binarized data of the second multiplier; at the same time, it serves as the carrier collection and readout area
- the drain voltage V d2 of the read tube in the middle read area gives a modulated voltage equivalent to the corresponding bit weight of the binarized data.
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- the optical input data has storage characteristics, which can be stored in the device for a long time after the light is cut off, and there is no need to re-enter the optical input during the next operation.
- the present invention proposes specific implementation schemes of various photoelectric calculation devices and photoelectric calculation methods.
- a plurality of photoelectric calculation adders described in one of the first, second, and third adders at least two dimensions are at least Vector addition of two vectors.
- R is the addition operation result
- X photo is the input quantity of the light input end
- Y is the input quantity of the electrical input end
- d, a, b and c are constants related to the unit parameters.
- Vector addition that is, adding elements corresponding to the serial number of two to-be-added vectors with the same dimension one by one to obtain a result vector with the same dimension as the to-be-added vector.
- the vector addition operation can be split into k separate addition operations with two two-bit inputs, so using k above-mentioned first adders can form a vector adder, as shown in Figure 27 As shown, where each box with "V adder" written in the middle represents a single first adder as described above.
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- R is the addition operation result
- X photo optical input terminal input Y 1 to Y n is the multi-carrier control area electrical input terminal input
- b, a, c and k 1 to k n are sum adders Constants related to unit parameters.
- Vector addition that is, adding elements corresponding to the sequence numbers of two to-be-added vectors of the same dimension one by one to obtain a result vector with the same dimension as the to-be-added vector, and the addition operation of n to-be-added vectors of dimension m is A 1 + A 2 ... + A n as an example:
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- R is the result of the addition operation
- X 1 to X n are the input quantities of the optical input terminals of multiple individual photoelectric calculation units
- n is equal to the number of parallel photoelectric calculation units used
- a, c and k are the parameters of the adder unit Related constants.
- the above-mentioned third adder is the same as the above-mentioned second adder, and can perform addition operation with an addend greater than 2, so the vector adder composed of the above-mentioned third adder and the vector composed of the above-mentioned second adder are used.
- the adder scheme is similar and will not be repeated.
- the input data has storage characteristics, and can be stored in the device for a long time after the light is cut off. There is no need to re-enter the light input during the next calculation.
- the present invention proposes specific implementation schemes of various photoelectric calculation devices and photoelectric calculation methods.
- a vector dot product operation of two vectors with at least two dimensions is realized .
- R is the result of multiplication operation
- X is the input quantity of the optical input end
- Y is the input quantity of the electrical input end
- c, a, b are constants related to the unit parameters.
- Vector dot multiplication that is, multiplying the elements of the corresponding serial numbers of two to-be-multiplied vectors with the same dimension to obtain a result vector with the same dimension as the to-be-multiplied vector, and the multiplication operation of the to-be-multiplied vector with two dimensions of k B as an example:
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- the above-mentioned second, third and fourth multipliers are similar to the above-mentioned first multiplier, support multiplication operation of two-digit multiplier input, use the above three multipliers to form a vector dot multiplier scheme and use the first multiplier
- the composition of the vector dot multiplier is similar, and will not be repeated again.
- the optical input data has storage characteristics, which can be stored in the device for a long time after the light is cut off, and there is no need to re-enter the optical input during the next operation.
- the present invention proposes specific implementation schemes of various photoelectric calculation devices and photoelectric calculation methods.
- the photoelectric calculation multiplier described in one of the above-mentioned various multipliers the multiplication of two high-bit width multipliers is realized.
- R is the result of multiplication operation
- X is the input quantity of the optical input end
- Y is the input quantity of the electrical input end
- c, a, b are constants related to the unit parameters.
- High-bit width multiplication means splitting the two high-bit width multipliers into bits first, and then multiplying them in sequence according to the high and low bits, and then shifting and adding the results to complete the complete high-bit width multiplication operation. Take the two high-bit width multipliers A * B as an example, split the high-bit width multiplier into multiple low-bit width multipliers of bit width k, and then multiply the high and low bits:
- each box with "V multiplier" in the figure represents one of the above-mentioned first multipliers, and the number of shifts that the output of this multiplier needs to complete in the control system.
- the solid line represents the data Input, dashed line indicates data accumulation.
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- the above-mentioned second, third and fourth multipliers are similar to the above-mentioned first multiplier, and support multiplication operation of two-digit multiplier input.
- the above three multipliers are used to form a high-bit width multiplier and the first multiplication is used.
- the composition of the high-bit width multiplier is similar and will not be repeated again.
- the optical input data has storage characteristics, which can be stored in the device for a long time after the light is cut off, and there is no need to re-enter the optical input during the next operation.
- the present invention proposes specific implementation schemes of various photoelectric calculation devices and photoelectric calculation methods.
- a matrix and a vector whose dimensions conform to the matrix vector multiplication rule are realized Multiplication.
- this matrix-vector multiplier depends on the number of outputs of the photoelectric calculation unit used. For example, if the photoelectric calculation unit described in the above embodiment with two outputs is used, the matrix vector multiplier also With two output terminals, the photoelectric calculation unit with one output terminal is adopted by default in the detailed description below.
- the number of photoelectric calculation units to be used in the present invention should be equal to the number of elements in the matrix to be multiplied by default.
- the matrix contains vectors, that is, if the matrix is 3 rows and 1 column, the number of photoelectric calculation units used is 3, but if The number of photoelectric calculation units is greater than the number of elements in the matrix. If six photoelectric calculation units are arranged in 3 rows and 2 columns, the operation is not affected.
- the calculation diagram is shown in Figure 29.
- the box unit marked with the V character represents the photoelectric calculation unit adopting the first scheme, in which the elements in the vector A are input through the electrical input terminal, and the elements in the matrix W are input through the optical input terminal.
- each element of A is binary converted in the control system:
- k depends on the bit width of a single element in the vector.
- the photoelectric calculation units according to the first embodiment described above will be arranged in an array as shown in FIG. 29, where the number of rows of the array is n and the number of columns is m, and the photoelectric calculation of all the same rows of the array
- the control gates of the cell as the carrier control area are all connected and input the same electrical input data; all the photoelectric calculation units of the same column of the array are used as P of the carrier collection and readout area
- the output terminals of the type substrate are connected, so that the output currents are concentrated and added.
- m * n data in the matrix are sequentially input into m * n photoelectric calculation units through the optical input terminal; the elements in the vector are serially input from the control gate connected to the peer unit, the same element is different
- the binary data of the bits is input sequentially in time-sharing.
- the binary data of the lowest bit of the element in the matrix and the element of the vector is multiplied by the corresponding bit, which is equal to Performed an operation (13-1-3):
- the calculation result (13-1-5) is AD converted, it is input to the control system. Because it is the lowest bit, it is shifted to the left by 0 bits, and then the second lower bit of the vector is used as the electrical input data input to the control gate to obtain The second lowest bit of the vector and the matrix vector multiplication result of the matrix are shifted to the left by 1 bit after input into the control system, and vector addition is performed with the lowest bit of the vector and the matrix multiplication result described above, and so on. After the binary data of all bits are sequentially shifted and accumulated after the control system, the final matrix vector operation result is obtained, which is equivalent to the following operations:
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- the second, third, and fourth schemes are similar to the first scheme.
- the multiplication of two multiplier inputs can also be completed through serial input, as described in the second multiplier above, so the previous
- the matrix vector multiplier composed of the photoelectric calculation unit of the first scheme is used instead of the matrix vector multiplier composed of the photoelectric calculation units of the second, third and fourth schemes.
- the only difference is that:
- the collection of the P-type substrate becomes the collection of the charge-coupled layer due to the change of the light input mode, so the light input quantity needs to be re-modulated.
- the parallel carrier control area is no longer the control gate, but the drain end of the reset tube.
- the optical input data has storage characteristics, which can be stored in the device for a long time after the light is cut off, and there is no need to re-enter the optical input during the next operation.
- the present invention proposes specific implementation schemes of various photoelectric calculation devices and photoelectric calculation methods.
- a matrix and a vector whose dimensions conform to the matrix vector multiplication rule are realized Multiplication.
- the difference between the implementation scheme proposed by the present invention and the above-mentioned serial matrix vector multiplier is that more photoelectric calculation units and light-emitting units are used to form an array, and the binary data of the elements in the vector is input through the parallel input mode, and the operation speed is higher. But more units are needed.
- this matrix-vector multiplier depends on the number of outputs of the photoelectric calculation unit used. For example, if the photoelectric calculation unit described in the above embodiment with two outputs is used, the matrix vector multiplier also With two output terminals, the photoelectric calculation unit with one output terminal is adopted by default in the detailed description below.
- the number of photoelectric calculation units to be used in the present invention should be equal to the result of multiplying the number of elements in the matrix to be multiplied by the bit width of a single element by default.
- the matrix contains vectors, but if the number of photoelectric calculation units is greater than the number of elements in the matrix , Does not affect the operation.
- each element of A is binary converted in the control system:
- k depends on the bit width of a single element in the vector.
- the photoelectric calculation unit according to the above first embodiment will be used, and a total of k * m * n units will be used.
- the above units will be divided into k groups, with m * n units in each group, and the units in each group will be arranged as
- the above serial matrix vector multiplier array is the same array, that is, arranged in an array according to the form shown in FIG.
- each array has n rows and m columns, and all the groups All the photoelectric calculation units in the same row of all the arrays are connected to the control gate as the carrier control area, input the same electrical input data; all the same column photoelectric calculation units of all the arrays of all groups The output ends of the P-type substrate as the carrier collection and readout regions are connected, so that the output currents are converged and added.
- the other 1st to k-1th arrays input the binarized data from the 1st bit to the k-1th bit of the vector from the control gate of each row, and output them separately.
- the corresponding matrix vector multiplication result is output from the output terminal, and then the k group of calculation results are AD converted into the control system, and all elements of the i-th array result vector are shifted left by i bits, and then the shift will be completed in the control system.
- the output results of all subsequent groups are accumulated according to the rule of vector addition, that is, the final matrix vector operation result is obtained, which is equivalent to the following operation:
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- the second, third, and fourth schemes are similar to the first scheme, and the multiplication operation of the two multiplier inputs can also be completed through parallel input, as described in the third multiplier above, so the use described above In the first scheme, the matrix vector multiplier composed of the photoelectric calculation unit is changed to use the matrix vector multiplier composed of the photoelectric calculation unit in the second, third, and fourth schemes.
- the only difference is that:
- the collection of the P-type substrate becomes the collection of the charge-coupled layer due to the change of the light input mode, so the light input quantity needs to be re-modulated.
- the parallel carrier control area is no longer the control gate, but the drain end of the reset tube.
- the optical input data has storage characteristics, which can be stored in the device for a long time after the light is cut off, and there is no need to re-enter the optical input during the next operation.
- the present invention proposes specific implementation schemes of various photoelectric calculation devices and photoelectric calculation methods.
- a vector dot multiplier and one of the photoelectric matrix vector multipliers described in 12 an average pooling operation is realized.
- A is the vector input terminal, that is, the vector data input from the electrical input terminal, is input in a serial manner
- W is the matrix input terminal, that is, the data input from the electrical input terminal
- the output result is a vector of m * 1 dimension.
- Pooling operations include many types of operations, such as average pooling and maximum pooling.
- the pooling operator described in the present invention is only for average pooling operations.
- Average pooling that is, averaging, for example (15-1-2)
- a vector dot multiplier as described above suitable for an input matrix with a data dimension of n columns and 1 row matrix (vector) and n * 1 photoelectric calculation units can be used to complete the above calculation.
- control system determines the number of elements in the matrix to be pooled, and then in the control system, all the elements of the matrix to be pooled are disassembled, and then recombined into a one-dimensional vector, input from the vector input of the matrix vector multiplier , And then input the same optical input data equivalent to the reciprocal of the number of elements in the matrix to all the elements in the matrix vector multiplier through the optical input end, an output of the output of the matrix vector multiplier is the matrix to be pooled Average pooling operation result.
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- the above high-bit width multiplier is similar to the above vector dot multiplier, the only difference is that the data at the vector input is parallel input, the operation speed is faster but more photoelectric calculation units are needed. If the above high-bit width multiplier is used to calculate the formula (15-1-3), 4 * K photoelectric calculation units are needed, K is the bit width of the elements in the matrix A to be pooled, and using the above vector dot multiplier to calculate requires only 4 photoelectric calculation units.
- the optical input data has storage characteristics, and can be stored in the device for a long time after the light is cut off. There is no need to re-enter the optical input during the next operation. It is particularly advantageous for the pooling operation to calculate the average pooling denominator unchanged for multiple operations.
- the present invention proposes specific implementation schemes of various photoelectric calculation devices and photoelectric calculation methods. By using multiple light-emitting units and the photoelectric calculation units described in the above embodiments, the convolution operation of the matrix is realized.
- the number of output terminals of this convolution operator depends on the number of output terminals of the photoelectric calculation unit used, for example, if the photoelectric calculation unit described in the above embodiment having two output terminals is used, the convolution operator also With two output terminals, the photoelectric calculation unit with one output terminal is adopted by default in the detailed description below.
- the rule of the convolution operation is that the to-be-convolved matrix plays one by one under the mapping of the convolution kernel and the elements in the convolution kernel, and then moves the convolution kernel in accordance with the corresponding step size for the next mapping, as shown in FIG.
- the following steps are required:
- the initial position of the convolution kernel coincides with the upper left corner of the matrix A, that is, the 3 rows and 3 columns of the convolution kernel a correspond to the 0 , 1, 2 rows, and 0, 1, 2 columns of the matrix A 0 , and then
- the elements in the product kernel and the elements in the matrix A 0 corresponding to the convolution kernel are multiplied one by one, as in the formula (16-1-3), it becomes 9 multiplication results, and then all the 9 multiplication results Accumulate to get the result of the convolution operation at the current position of the convolution kernel, called R 00 , which is to complete the operation described in (16-1-4):
- the above-mentioned matrix R which is the convolution matrix A, is the result of a convolution operation with a step size of 1 under the action of the convolution kernel a.
- the convolution operation is an operation of multiplying and accumulating the corresponding elements of two matrices multiple times, wherein the two matrices of the elements are multiplied two by one, and one of the matrices is
- the convolution kernel is an amount that is constant in multiple operations
- the other matrix is an element with a convolution matrix corresponding to the position of the convolution kernel, which is an amount that changes in multiple operations, so the invention described in invention 1 can be used
- the photoelectric calculation unit adopting the first type of photoelectric calculation unit scheme takes advantage of the advantage that the optical input storage can store data.
- the optical input terminal is used to input convolution kernel data, and the electrical input terminal is input with convolution matrix data for convolution operation. , Which can greatly improve the energy efficiency ratio and calculation speed. Therefore, the electrical input terminal of the unit is the data input terminal of the convolution matrix to be convolved, and the optical input terminal is the input terminal of the convolution kernel.
- the convolution operator can also be divided into serial input and parallel input.
- the main difference is the number of units used and the data input method of the electrical input terminal.
- the serial input scheme is as follows:
- the photoelectric calculation unit of the first scheme is used in an amount equal to the number of elements in the convolution kernel, the units are arranged in an array with the same dimension as the convolution kernel, and the carriers are collected and read out.
- the output terminals of the middle readout area are all connected, and the addition is completed by aggregation.
- FIG. 33 it is a cell array with a convolution kernel dimension of 3 * 3.
- a box with V in the middle in Fig. 33 represents an optoelectronic computing unit using the first scheme.
- the convolution kernel data is input into the unit one by one through the optical input terminal, and then the data corresponding to the position of the current convolution kernel in the matrix is converted into binary, and then serially from the carrier control area
- the control grid is input into the array, and the output results are aggregated and added to the control system after AD conversion, and then shifted and accumulated, that is, the convolution operation result of the current convolution kernel position is obtained, and then the convolution kernel is moved, Input the pre-stored convolution kernel data using the previous light, and directly re-enter the electrical input data to obtain the convolution operation result corresponding to the next convolution kernel position, and so on, until the convolution kernel facilitates the entire convolution matrix, Then the output convolution results are recombined into a result matrix, that is, all convolution operations are completed.
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- the second, third, and fourth schemes are similar to the first scheme, and the convolution operation can also be completed by serial or parallel line input. Therefore, the convolution composed of the first scheme photoelectric calculation unit is used as described above.
- the arithmetic unit instead of using the matrix vector multiplier composed of the photoelectric calculation units described in the second, third, and fourth schemes, can also perform calculations. The only difference is that:
- the collection of the P-type substrate becomes the collection of the charge-coupled layer due to the change of the light input mode, so the light input quantity needs to be re-modulated.
- the optical input data has storage characteristics, and can be stored in the device for a long time after the light is cut off. There is no need to re-enter the optical input during the next operation. It is particularly advantageous for the convolution operation to be performed multiple times in the convolution operation.
- the present invention proposes specific implementation schemes of various photoelectric calculation devices and photoelectric calculation methods.
- serial and parallel matrix vector multipliers By using the above-mentioned serial and parallel matrix vector multipliers, pooling operators and convolution operators, together with corresponding control systems, Speed up the work of neural network algorithm inference.
- the reasoning of the neural network algorithm is composed of a convolution layer and a fully connected layer, and can perform tasks such as face recognition.
- the detailed structure of the network is shown in Figure 34, where the convolution layer contains The operations include convolution operations, pooling operations, and nonlinear function operations; the operations included in the fully connected layer include matrix vector multiplication operations, pooling operations, and nonlinear function operations. You can see that in addition to nonlinear function operations, other operations are in The corresponding optoelectronic operation accelerator solutions have been mentioned in the present invention. There are many ways to operate non-linear functions. The most common one is the RELU function. Its function image is shown in Figure 35.
- the biggest advantage of using optoelectronic computing units for neural network acceleration is the storage characteristics of the optical input.
- the ALEXnet network is still used as an example.
- the dimensions of the output data of each layer are fixed values, such as the above-mentioned convolution operator
- the storage advantage of the optical input data can be used to complete multiple or even a single light All the convolution operations work, which greatly reduces the time and energy consumption required for data exchange between the storage unit and the optoelectronic calculation unit.
- the existence of a large number of matrix vector multiplications is the most powerless part of the traditional calculation method.
- the matrix data are fixed weights obtained through training. Once training is completed, the weights The value will not change anymore, so when performing inference operation, the weight value is also input into the photoelectric calculation unit through the light input method, which can greatly improve the operation efficiency.
- the input data of the ALEXnet network is 227 * 227 * 3 three-dimensional matrix data, which first passes through the convolution layer 1, as described in FIG. 34.
- convolution layer 1 the size of the convolution kernel is 11 * 11, the number is 96, and the step size is 4 for the convolution operation.
- the convolution operator scheme as described above, a minimum of 96 convolutions as above Convolution operator with kernel size 11 * 11.
- the pooling operation in convolutional layer 1 uses average pooling, because the kernel size is 3 * 3, so the number of 9 is an average of 1, so at least one of the 3 * 3 for the pooling operator described above is required. Pooling operator for matrix input.
- convolutional layer 2 At least 256 convolution operators with a convolution kernel size of 5 * 5 as described above are required, and at least one of the 3 * 3 matrix inputs as described in the pooling operator described above is required. Pooling operator.
- convolutional layer 5 a minimum of 256 convolution operators as described above for the convolution kernel size of 3 * 3, and at least one pooling operator as described above for the 3 * 3 matrix input .
- a matrix vector multiplier supporting 4096 * 9216 scale matrix and 1 * 9216 scale vector as described in the above serial and parallel matrix vector multiplier is required at least.
- a matrix vector multiplier supporting 4096 * 4096 scale matrix and 1 * 4096 scale vector as described in the above serial and parallel matrix vector multiplier is required at least.
- a matrix vector multiplier supporting a matrix of 1000 * 4096 scale and a vector of 1 * 4096 scale as described in the above serial and parallel matrix vector multiplier is required at least.
- ALEXnet network accelerator a complete ALEXnet network accelerator can be formed. If you need to increase the calculation speed, you can consider using parallel The input method consumes more calculators and obtains higher calculation speed.
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- the input of the above calculation amount by optical input can be used The storage characteristics of the optical input are maximized.
- the present invention proposes specific implementation solutions of various photoelectric calculation devices and photoelectric calculation methods.
- the acceleration work of the CT algorithm is realized.
- the number of output terminals of the CT algorithm accelerator depends on the number of output terminals of the photoelectric calculation unit used. For example, if the photoelectric calculation unit described in the above embodiment having two output terminals is used, the CT algorithm accelerator also has two One output terminal, the photoelectric calculation unit with one output terminal is adopted by default in the detailed description below.
- CT that is, electronic computer tomography, which uses precise collimated X-ray beams, gamma rays, ultrasound, etc., together with a highly sensitive calculator, to make one after another around a part of the human body
- Cross-section scanning has the characteristics of fast scanning time and clear image.
- the CT shooting mode and X-ray shooting mode are quite different. As shown in FIG. 36, the X-ray and CT shooting modes for an object are respectively from top to bottom.
- CT photography that is, the method of judging the internal material division of the fault through the received X-ray intensity that passes through a fault of the object to be observed from different angles, and the received one-dimension of multiple sets of different faults along different angles
- the algorithm that converts the incident X-ray intensity into two-dimensional multiple sets of two-dimensional material division pictures of different faults is the CT algorithm.
- the general content of the CT algorithm is as follows. As shown in Figure 37, the cross-sectional view of this tomographic image of an irregularly shaped object in the figure, multiple X-rays pass through it from different angles, of which the ith article passes through it The ray is the ray incident at the angle shown in the figure, and the cross-sectional view of the object is the tomogram that is to be restored by CT shooting and the CT algorithm.
- This tomogram is divided into pixels, from the first row to the first column. At the beginning of the count, the first pixel in the first row is the first pixel, a total of N pixels, and the j-th pixel is just passed by the i-th X-ray.
- X-rays are absorbed when passing through objects, and the amount of absorption varies according to the type of substance (water, cell tissue, bones, etc.). By determining the amount of absorption, you can indirectly determine what the object is.
- different pixel positions correspond to different types of substances, so X-rays will be absorbed to varying degrees after passing through these pixels, and X-rays incident along different angles will pass through different substances , The energy remaining after it exits the object is also different. Definition, the gray value of each pixel in the cross section of the object shown in FIG.
- Equation (18-1-2) is a multivariate equation system
- x is the grayscale value of the tomographic pixel to be reconstructed, that is, the amount to be solved, and the others are all known quantities.
- L is greater than or equal to N . Then this equation has a unique solution, that is, the tomographic image can be restored.
- equation (18-1-2) is not an equation system composed of L equations with N unknowns, but an equation system composed of 2 equations with 2 unknowns, then these two equations can be expressed as two The two lines in the dimensional plane, because the equation has a solution, the two lines must have an intersection point, and the coordinates of the intersection point is the solution of the system of equations.
- the method to quickly solve the solution of the system of equations is:
- the actual physical meaning of is whether the ray passes through the pixel, if it passes, it is 1, and if it does not pass, it is 0, and the emission angle of the CT machine ray is mostly a fixed angle. Therefore, for multiple CT shooting ,
- the system matrix ⁇ is mostly a fixed value, therefore, by using the storage characteristics of the optical input end of the photoelectric calculation unit described in the first scheme of invention 1 to input the data in the system matrix, the energy efficiency ratio and the operation speed will be greatly improved .
- the core of formula (18-1-3) is vector-vector multiplication Therefore, using the cell array as shown in FIG. 39 can speed up the calculation of this part.
- Each box with V in the middle of the figure represents an optoelectronic computing unit using the first scheme, with a system matrix size of N * L, CT algorithm with the number of X-rays as an example, assuming that the electrical input terminal data is serial input, the number of photoelectric calculation units to be used is N * L, and the number of photoelectric calculation units are arranged in N rows Column L, and the output ends of the P-type substrates as the carrier collection and readout area of all cells in the same column are connected together, so that the operation results are aggregated and added;
- the control gates of the zones are independent of each other.
- the control system may be a digital circuit, or various logic control units such as a computer, a single-chip microcomputer, and an FPGA.
- the second, third, and fourth schemes are similar to the first scheme.
- the CT algorithm can also be accelerated by serial or parallel line input. Therefore, the first scheme uses the photoelectric calculation unit composed of the first scheme.
- the CT algorithm accelerator is changed to use the matrix vector multiplier composed of the photoelectric calculation units described in the second, third, and fourth schemes. The only difference is that:
- the collection of the P-type substrate becomes the collection of the charge-coupled layer due to the change of the light input mode, so the light input quantity needs to be re-modulated.
- the present invention proposes specific implementation schemes of various photoelectric calculation devices and photoelectric calculation methods.
- the above-mentioned high-bit width multiplier and the photoelectric adder described in one of the first, second, and third adders two Multiplication of single precision floating point numbers.
- Single-precision floating-point number that is, to express a real number with a fractional part in a manner similar to scientific notation
- a single-precision floating-point number has a bit width of 32 bits, of which 1 sign bit is expressed by a binary number; 8-bit exponent Digits, the size of the part on the left of the decimal point is expressed by 8-bit binary numbers; the 23-bit mantissa, the size of the number of the right part of the decimal point is expressed by 23 binary numbers, as in formula (19-1-1):
- A (A) sign bit (A) exponent bit (A) mantissa bit
- the optoelectronic single-precision floating-point multiplier described above requires three operations of sign bit multiplication, exponent bit addition, and mantissa bit multiplication.
- the sign bit only needs to be judged by the general logic, and the addition of the exponent 8-bit wide addend is subtracted by 01111111, and only the first, second, and three adder operations as above are required; and the mantissa bit
- the multiplication of the two 23-bit wide multipliers because the multiplier bit width is large, it is necessary to use the above high-bit width multiplier for operation.
- the photoelectric calculation multiplier described in one of the above multipliers is usually applicable At the optical input terminal, the input precision is about 8 bits, and the multiplier of the multiplier input with a maximum of 16 bits is not exceeded.
- the control system divides the two single-precision floating-point numbers to be multiplied into a sign bit, an exponent bit, and a mantissa bit, and the sign bit is judged by the control system to be positive and negative.
- the mantissa is input to the two addend input ends of the photoelectric adder.
- the results of the three parts are returned to the control system, and recombined into a single-precision floating-point number in the control system, that is, a complete single-precision floating-point multiplication is completed.
- the invention utilizes the photoelectric characteristics of semiconductor materials, and discloses a basic photoelectric hybrid operation method and operation device. Since the semiconductor material can have a higher sensitivity to incident photons, a longer optical signal storage time, and itself is easier to improve the integration, the present invention has a substantial improvement in computing technology.
- the optical input data has storage characteristics, which can be stored in the device for a long time after the light is cut off, and there is no need to re-enter the optical input during the next operation.
- the invention provides a digital logic control system of a photoelectric calculation module, which is used to control the state and data input and output of the photoelectric calculation module.
- bit width of each element in the matrix W is 8 bits
- at least 8 groups of n columns and m rows of photoelectric computing arrays are required to form a parallel input matrix vector multiplication module capable of computing A * W.
- the digital control array of the optoelectronic calculation module is divided into the following parts: data input part, optical input control part, light receiving control part, electric input receiving control part, output control part and self-check control part.
- the controlled objects are: Eight groups of n * m photoelectric operation arrays and power modules that supply power to these arrays, and drivers that drive the light-emitting arrays.
- the power supply module can provide various voltages required by each functional area of the optoelectronic arithmetic array in various states such as receiving optical signals, receiving electrical signals, arithmetic and output, and resetting optical signals.
- -3V needs to be added to the P-type substrate when receiving optical signals
- 4V or 0V needs to be added to the control gate when receiving electrical signals
- P-type substrate when outputting
- the read-out area MOSFET needs to add 0.5V between the source and drain, and the substrate needs to add 1V during reset, then the power supply module that supplies it needs to provide at least -3V, 0V, 1V, 4V voltage, and in Under the control of digital control logic, the voltage is given to the corresponding parts of the cells in the array at the required moment.
- the matrix data W and the vector data A are input to the data input section, and the data input section is sent to the optical input control section and the electrical input reception control section and pre-stored in the register.
- Each element in the matrix data W in the register of the light input control part is converted into the time for the light emitting unit in the light emitting array to emit light through the light input control part, and sent to the driver of the light emitting array, the driver is converted into a pulse to drive the light emitting array Illuminate to realize light input.
- the light receiving control part sends a status signal corresponding to the received light signal to the power supply module.
- the power supply module changes the power supply voltage to put the cells in the computing array into the light receiving state.
- the power module gives the P-type substrate -3V, the source and drain of the readout area are floating, and the control gate is 0V, so that a depletion layer is generated in the P-type substrate When a photon is incident again, it will absorb the photon to generate photo-generated carriers to complete the light input.
- the driver is controlled by the light input control part to stop the light; then the electric input receiving part sends the state signal of the electric input to the power supply module, so that the units in the computing array enter the electric input state, and the Vector data A is input into the carrier control area of the computing array in parallel.
- the power module needs to give the control gate a voltage of 0V or 3V at this time. The specific value of 0V or 3V is controlled by the electrical input receiving control section.
- the input amount should be 0, then 0V voltage is applied to the control gate; if the electrical input amount is 1, then 3V voltage is applied to the control gate, at the same time, the P-type substrate remains applied -3V unchanged, the source and drain of the readout area are still Floating.
- the output control section sends a status signal that outputs the calculation result to the power module, so that the photoelectric calculation unit enters the output state.
- the power module needs to give The voltage difference between the source and drain of the readout region is 0.5V, and the voltage of 0V / 3V on the P-type substrate -3V and the control gate is maintained. After the output current as the operation result is converged, it first enters AD conversion, and the output control part sends a signal to start AD conversion to the AD converter.
- the power module After the AD converter completes the conversion, the converted result and the conversion end signal are output from the output terminal to the output control Part, after receiving the conversion end signal, the output control part sends the received conversion result to the shifter and accumulator for shifting and accumulating to obtain the final A * W operation result vector and store the result in the register , And send the status signal of the end of the operation to the power module, the power module ends the operation.
- the power module needs to cancel the 0.5V voltage difference between the source and drain of the readout area and the 0V / 3V electrical input on the control gate at the end of the operation. Value signal, but keep the -3V of the P-type substrate unchanged to maintain the "storage" of the optical input signal, waiting for the next operation.
- the optical input data reset signal is sent to the power module through the output control section, and the power module performs light on the cells in the computing array Input data reset.
- the power module needs to give the P-type substrate 1V at this time and control the gate at 0V, while keeping the source and drain of the readout area floating.
- the output control part sends a reset completion signal to the power supply module. The power supply module stops supplying power and waits for the next light input.
- the self-checking process occurs before the operation of the computing module, and is used to check whether the cells in the computing array are damaged.
- the self-test control part sends a status signal to the power module.
- the power module starts self-test on the first row of all columns of all groups. For example, if the photoelectric device of the first embodiment of invention 1 is used For the calculation unit, the power module needs to first give the control gate a voltage of 3V, a voltage of 0.5V between the source and drain in the readout area, and the output current is output to the self-test control part through the output terminal of each column.
- the first cell of this column is judged to be damaged; after that, the voltage of 3V on the control gate is removed, and the source and drain of the readout area remain at 0.5V, and the output of each column is judged by the self-test control part For the current, if it is found that there is still a current after the 3V voltage of the control gate is cut off at the output end of the column, the first cell in the column is judged to be damaged.
- the self-check control part controls the line feed, and starts the self-check on the second line.
- the self-check conditions are the same until the self-check of all lines is completed, that is, all the self-checks are completed.
- digital control logic can be a variety of digital circuits, microcontrollers, FPGAs and so on.
- This embodiment provides a single measured photoresponse curve using the photoelectric calculation unit according to the first embodiment described above, and with the aid of the photoresponse curve measured on the machine as a model of a single photoelectric calculation unit, built as described above
- the model of the parallel input matrix vector multiplier and the model of the convolution operator as described above, and with the aid of the built model components, become a complete neural network accelerator built using the optoelectronic calculation unit according to the first embodiment described above.
- use simulation software to try to simulate a complete AlexNet-like network (different from the standard AlexNet network mentioned in the above neural network accelerator solution) to classify and predict the pictures in the CIFAR-10 data set. And evaluate the accuracy of the operation results.
- a single photoelectric computing unit measured results and network simulation analysis
- the measured light response curve of the photoelectric calculation unit according to the first embodiment used is shown in FIG. 41, where the abscissa is the number of incident photons X characterized by the exposure time, and the ordinate is the carrier collection and In the P-type substrate of the readout area, the operation result size of the output end of the MOSFET in the readout area is output in the form of current.
- the control gate of the carrier control area that is, V G plus 3V voltage, P-type substrate plus -3V voltage, and give the appropriate output voltage when the source-drain voltage corresponds to the output result.
- the structure included in the simulated AlexNet-like network model is shown in FIG. 42.
- This type of AlexNet network model is composed of six convolutional layers, five pooling layers, and two fully connected layers, and uses ReLU as the activation function.
- the pooling layer does not use the above pooling operator, but directly assumes that general logic is used to perform the pooling operation.
- the function realized by this network is target recognition, and the picture data set used is the CIFAR-10 data set.
- the data set has a total of 60,000 color images, and the size of these images is 32 * 32 * 3, which is divided into 10 categories, each with 6000 images.
- the accuracy is high or low. Obviously, there are two factors that lead to the inaccuracy of the final recognition result.
- a network itself, and the recognition errors caused by the imperfect weights obtained through training are not related to the accuracy of inference calculation; the other is because
- the neural network accelerator model built by the single-tube model of the computing device has a calculation error and
- the parallel matrix vector multiplier shown in FIG. 31 there are k arrays of size m * n in total, assuming that each array has one additional array under each column AD, and the input value range of the optical input terminal is (-127,127), and the positive optical input value and the negative optical input value are respectively input into different matrices, the AD bit width is nbit, the matrix vector is multiplied by the row of the matrix If the number is m lines, the AD conversion accuracy is:
- 127 represents the maximum output value of a single photoelectric calculation unit, that is, the result of multiplying the maximum value 127 of the optical input terminal and the binarized maximum value 1 of the electrical input terminal.
- the smallest unit that can be recognized after AD conversion is 508, and outputs smaller than this value will be truncated, resulting in a certain decrease in accuracy.
- the convolution operator will also suffer from similar accuracy degradation caused by AD, and will not be repeated.
- the weight is the matrix data, and the source of the weight is the training of the network.
- the accuracy of the weight can be customized during training.
- the accuracy of the weight is considered in equation (21-2) It is 8bit, that is, the range is (-127,127).
- the results of the weight accuracy on the ideal network without calculation errors are as follows. The result is the accuracy of the target classification when performing inference:
- the classification accuracy of the neural network accelerator model is 85.4%, which is only 3% less than the ideal accuracy result of 88% without any calculation error, which is less than 3%, which is a higher
- the precision is enough to accelerate the work of neural network.
- the delay of a single AD is 20ns
- the operation delay of each layer of the network is 0.164ms
- the time required to complete a complete network inference is 1.312ms, which is a few seconds less than the optical input data.
- the retention time of many years (using the photoelectric calculation unit according to the third embodiment above) is already a fairly short time, even a few seconds of retention time is enough to operate within the time window of one light input Thousands of complete network reasoning.
- the inference time of the complete network of 1.312ms can easily meet the real-time video surveillance of hundreds of frames, and to achieve this index, regardless of the peripheral logic circuit, at least a total of only Need to use about 2 million photoelectric calculation units, assuming that the area of a photoelectric calculation unit is 3um * 3um, the chip size is less than 5mm * 5mm; according to the actual measurement results of a single photoelectric calculation unit, each photoelectric calculation unit is in the read state
- the power is only 0.1uW level, the entire network inference takes 1.312ms, each unit only needs to run at most one-eighth time, the leakage current when not running is negligible, so the total power of the chip operation is only To 0.05W. Regardless of power consumption or chip area, with the same computing power, this is unmatched by using GPU to accelerate neural network inference.
- the following table is a comparison chart of the estimated power consumption, chip area, computing power and manufacturing process of the integrated photoelectric storage and integrated chip and Google's TPU chip.
- the parameters and performance indicators of the photoelectric storage and integrated chip are derived from theoretical reasoning and Simulation results.
- the number of operations per second is still much higher than that of the TPU chip, which is mainly due to the computing array in the optoelectronic storage and computing integrated chip
- the multiplication operation can be completed, and the current collection has completed an addition operation, so a single unit can contribute two operands in a mechanical cycle, which is far superior to the TPU chip, but also leads to the chip
- the area is also smaller than that of the TPU chip; and another major advantage of the integrated photoelectric storage and calculation chip: the storage characteristics of the optical input, which leads to the photoelectric storage and calculation integrated chip will be much lower in power consumption than the TPU chip; in addition, the above parameters are Based on the 65nm process, Google TPU is the product of the 28nm process, which in turn creates the possibility for the optoelectronic storage and computing core to improve the performance of the device through the reduction of process nodes in the future; finally, it can be
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Power Engineering (AREA)
- Mathematical Physics (AREA)
- Microelectronics & Electronic Packaging (AREA)
- Computer Hardware Design (AREA)
- General Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Nonlinear Science (AREA)
- Optics & Photonics (AREA)
- Condensed Matter Physics & Semiconductors (AREA)
- Artificial Intelligence (AREA)
- Computing Systems (AREA)
- Molecular Biology (AREA)
- Software Systems (AREA)
- General Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Evolutionary Computation (AREA)
- Data Mining & Analysis (AREA)
- Ceramic Engineering (AREA)
- Neurology (AREA)
- Electromagnetism (AREA)
- Light Receiving Elements (AREA)
- Optical Modulation, Optical Deflection, Nonlinear Optics, Optical Demodulation, Optical Logic Elements (AREA)
- Photo Coupler, Interrupter, Optical-To-Optical Conversion Devices (AREA)
- Optical Communication System (AREA)
- Complex Calculations (AREA)
Abstract
Description
8bit(权值) | 7bit(权值) | 6bit(权值) | 5bit(权值) | 4bit(权值) | |
8bit(激励) | 90.830 | 90.820 | 90.730 | 90.380 | 88.040 |
Claims (49)
- 一种光电计算单元,采用光输入和电输入两种方式输入运算量,包括一个半导体多功能区结构,其中,所述半导体多功能区结构包括至少一个载流子控制区、至少一个耦合区、以及至少一个光生载流子收集区和读出区,其中:采用光输入的运算量,即光输入量,通过入射光子转化为光生载流子的方式来完成输入;采用电输入的运算量,即电输入量,通过直接注入载流子的方式来完成输入;所述载流子控制区,被设置为控制并调制所述光电计算单元内的载流子,并且作为所述光电计算单元的电输入端口,输入其中一个运算量作为电输入量;或者被设置为只控制并调制所述光电计算单元内的载流子,通过其他区域输入电输入量;所述耦合区,被设置为连接光生载流子收集区和读出区中的收集区和读出区,使得光子入射产生的光生载流子作用于所述光电计算单元内的载流子,形成运算关系;所述光生载流子收集区和读出区,其中收集区被设置为吸收入射的光子并收集产生的光生载流子,并且作为所述光电计算单元的光输入端口,输入其中一个运算量作为光输入量;读出区被设置为作为所述光电计算单元的电输入端口,输入其中一个运算量作为电输入量,并且作为所述光电计算单元的输出端口,输出被光输入量和电输入量作用后的载流子作为单元输出量;或者通过其他区域输入电输入量,读出区只作为所述光电计算单元的输出端口,输出被光输入量和电输入量作用后的载流子,作为单元输出量。
- 如权利要求1所述的光电计算单元,其中:包括作为所述载流子控制区的控制栅极、作为所述耦合区的电荷耦合层,以及作为所述光生载流子收集区和读出区的P型衬底,其中:所述作为光生载流子收集区和读出区的P型半导体衬底,包括左侧收集区和右侧读出区,所述左侧收集区被设置为用以产生用于光电子收集的耗尽层,并通过右侧读出区读出收集的光电子电荷量,作为光输入端的输入量;所述右侧读出区,包含浅槽隔离、N型漏端和N型源端,被设置为用于读出,同时 也可以作为电输入端,输入其中一位运算量;所述作为耦合区的电荷耦合层,被设置用作连接光生载流子收集和读出区中的收集区和读出区,使收集区衬底内耗尽区开始收集光电子以后,收集区衬底表面势就会受到收集的光电子数量影响;并且通过电荷耦合层的连接,使得读出区半导体衬底表面势受到收集区半导体衬底表面势影响,进而影响读出区源漏间电流大小,从而通过判断读出区源漏间电流来读出收集区收集的光电子数量;所述作为载流子控制区的控制栅,被设置为用以在其上施加一个脉冲电压,使得在P型半导体衬底读出区中产生用于激发光电子的耗尽区,同时也可以作为电输入端,输入其中一个运算量;用于隔离的底层介质层,被设置在所述P型半导体衬底和所述电荷耦合层之间;用于隔离的顶层介质层,被设置在电荷耦合层和所述控制栅之间。
- 如权利要求1所述的光电计算单元,其中:包括作为所述载流子控制区的控制栅极、作为所述耦合区的电荷耦合层,以及作为所述光生载流子收集区和读出区的N型衬底,其中:所述作为光生载流子收集区和读出区的N型半导体衬底,包括左侧收集区和右侧读出区,所述左侧收集区被设置用以产生用于光空穴收集的耗尽层,并通过右侧读出区读出收集的光空穴电荷量,作为光输入端的输入量;所述右侧读出区,包含浅槽隔离、P型漏端和P型源端,被设置为用于读出,同时也可以作为电输入端,输入其中一位运算量;所述作为耦合区的电荷耦合层,用以连接光生载流子收集区和读出区中的收集区和读出区,被设置为使收集区衬底内耗尽区开始收集光空穴以后,收集区衬底表面势就会受到收集的光空穴数量影响;通过电荷耦合层的连接,使得读出区半导体衬底表面势受到收集区半导体衬底表面势影响,进而影响读出区源漏间电流,从而通过判断读出区源漏间电流来读出收集区收集的光空穴数量;所述作为载流子控制区的控制栅,被设置为用以在其上施加一个负脉冲电压,使得在N型半导体衬底读出区中产生用于激发光空穴的耗尽区,同时也可以作为电输入端,输入其中一位运算量;用于隔离的底层介质层,被设置在所述N型半导体衬底和所述电荷耦合层之间;用于隔离的顶层介质层,被设置在电荷耦合层和所述控制栅之间。
- 如权利要求1所述的光电计算单元,其中:包括作为所述载流子控制区的控制栅极、作为所述耦合区的电荷耦合层,以及作为所述光生载流子收集区和读出区的P型衬底,其中:所述作为光生载流子收集区和读出区的P型半导体衬底,包含一个N型漏端和一个N型源端,被设置为同时承担感光和读出的工作,同时也可以作为电输入端,输入其中一位运算量;所述作为耦合区的电荷耦合层,被设置为用以储存进入其中的光电子,并改变读出时单元阈值大小,进而影响读出区源漏间电流,从而通过判断读出区源漏间电流来读出感光时产生并且进入电荷耦合层中的光电子数量;所述作为载流子控制区的控制栅,被设置为用以在其上施加一个脉冲电压,使得在P型半导体衬底读出区中产生用于激发光电子的耗尽区,同时也可以作为电输入端,输入其中一个运算量;用于隔离的底层介质层,被设置在所述P型半导体衬底和所述电荷耦合层之间;用于隔离的顶层介质层,被设置在所述电荷耦合层和所述控制栅之间。
- 如权利要求1所述的光电计算单元,其中:包括作为所述载流子控制区的复位管、作为所述耦合区的光电子耦合引线,以及作为所述光生载流子收集区和读出区的光电二极管和读出管,此外,还包括选址管,用于将所述光电计算单元组成阵列时行列选址使用,其中:所述作为光生载流子收集区和读出区的光电二极管和读出管,其中,光电二极管被设置为负责感光,所述光电二极管的N区通过作为耦合区的所述光电子耦合引线连接到读出管的控制栅和复位管的源端上;所述读出管,其源端和选址管漏端连接,被设置为用于读出,同时也可以作为电输入端,输入其中一位运算量;所述作为耦合区的光电子耦合引线,被设置为连接作为光生载流子收集区和读出区中的收集区和读出区的光电二极管和作为读出区的读出管,将光电二极管N区电势施加到读出管控制栅上;所述所谓载流子控制区的复位管,被设置为通过其漏端输入一个正电压作用于光电二极管,当复位管打开时,所述正电压即会作用在光电二极管上,使光电二极管产生耗尽区并感光,同时也可以作为电输入端,输入其中一位运算量;所述选址管,被设置为用于控制整个光电计算单元的输出。
- 如权利要求1所述的光电计算单元,其中,还包括一个在光学上与光电计算单元相对应的发光单元,所述发光单元被设置为发出的光在所述光电计算单元中产生光生载流子,被作为光电计算单元的光输入量,并且与所述光电计算单元中电输入端输入的电输入量相互作用,所得结果被设置为光电运算结果。
- 根据权力要求6所述的光电计算单元,其中,所述发光单元由一个信号转换驱动器进行驱动,所述信号转换驱动器被设置为将数字信号转化为发光单元的驱动电流脉冲脉宽,同时驱动整片多个发光单元组成的发光阵列,或通过选址来驱动特定的相关的发光单元,使得所述相关的发光单元产生相应时长的光信号,所述光信号被设置为作为相应光电计算单元的光学输入量。
- 如权利要求6所述的光电计算单元,被设置为组成二维或三维阵列,形成光电计算模块,实现各种特定的运算功能。
- 一种光电计算阵列,由多个如权利要求6所述的光电计算单元组成,其中,所述发光阵列和光电计算阵列之间带有一层或多层光学结构,用以实现来自所述发光阵列的光被照射到所述光电计算阵列预定位置的对焦方式,实现发光阵列和光电计算阵列之间的光学对应。
- 一种由光电计算单元执行的光电计算方法,所述光电计算单元包括至少一个发光单元和至少一个光电计算单元,所述光电计算单元包括一个半导体多功能区结构,所述多功能区结构包括载流子控制区、耦合区、以及光生载流子收集区和读出区,其中所述方法包括:将所述发光单元设置为发出光,并且所述光照射到所述光电计算单元,并经所述载流子控制区控制,在光生载流子收集区和读出区中的收集区中产生光生载流子,作为光电计算单元的第一个运算量;在所述多功能区中的其中一个区产生电运算量并输入相应的载流子,所述载流子作为光电计算单元的第二个运算量;将代表所述第一个运算量的光生载流子与代表所述第二个运算量的载流子共同作用于光生载流子读出区的载流子,被作用后的载流子作为所述光电运算的结果;将作为光电运算结果的载流子,在所述光生载流子收集区和读出区的读出区的输出端输出。
- 如权利要求6所述的光电计算单元,被用于作为加法器,包含一个所述发光单元以及一个光电计算单元,所述光电计算单元至少包含载流子控制区、耦合区和光生载流子收集区和读出区,其中:所述发光单元,被设置为发出代表第一个加数的光信号;所述载流子控制区,用于控制并调制所述光电计算单元内的载流子,并被设置为光电计算单元的电输入端,用于输入第二个加数;所述耦合区,被设置为连接光生载流子收集区和读出区,并使光生载流子收集区和读出区中收集区的光生载流子作用于所述读出区中的载流子;所述光生载流子收集区和读出区,包含一个光输入端和至少一个结果输出端,其中:所述光生载流子收集区和读出区中的收集区,被设置为吸收发光单元发出的光,产生并收集光生载流子,并且在所述光输入端输入第一个加数,所述第一个加数和所述第二个加数共同作用于光生载流子收集区和读出区中读出区里的载流子,并且被作用后的载流子经结果输出端被作为加法器的结果输出。
- 如权利要求10所述的光电计算方法,被用于进行加法运算,其中:所述发光单元发出的光子所产生的光生载流子被设置为第一个加数;在所述载流子控制区注入载流子,并且将所述载流子作为第二个加数;在所述光生载流子收集区和读出区中,使代表所述第一加数的光生载流子 和代表所述第二加数的载流子共同作用于光生载流子收集区和读出区中读出区的载流子,并且被作用后的所述载流子被作为所述加法运算的结果,在光电子收集区和读出区被输出。
- 如权利要求6所述的光电计算单元,被用于作为多个加数同时相加的加法器,包含一个所述发光单元一个光电计算单元,所述光电计算单元至少包含载流子控制区、耦合区、以及光生载流子收集区和读出区,其中:所述发光单元,被设置为发出代表第一个加数的光信号;所述载流子控制区,被设置为采用并排的多控制区结构,以用于控制并调制单元内的载流子,并作为加法器的电输入端,输入其他多个加数;所述耦合区,被设置为连接光生载流子收集区和读出区,并使光生载流子收集区和读出区中收集区的光生载流子作用于所述读出区中的载流子;所述光生载流子收集区和读出区,包含一个光输入端和至少一个结果输出端,其中,光生载流子收集区和读出区中的收集区,被设置为吸收发光单元发出的光,产生光生载流子并收集,并作为所述加法器的光输入端口,被输入其中一个加数,由电输入端和光输入端共同输入的多个加数共同作用于光生载流子收集区和读出区中读出区里的载流子,并且所述光生载流子读出区中的载流子经结果输出端被作为加法器的结果输出。
- 如权利要求10所述的光电计算方法,用于进行多个加数的加法运算,其中:所述发光单元发出的光子所产生的光生载流子被设置为第一个加数;所述载流子控制区被设置为多栅极结构,并且将从多栅极输入的载流子作为其他多个加数;在所述光生载流子收集区和读出区中,使代表所述第一加数的光生载流子和代表所述其他加数的载流子共同作用于光生载流子收集区和读出区中读出区的载流子,并且被作用后的载流子作为所述加法运算的结果,在光电子收集区和读出区的输出端被输出。
- 如权利要求6所述的光电计算单元,被用于作为至少两个加数的加法 器,包含至少两个所述发光单元和至少两个所述光电计算单元,所述光电计算单元,至少包含载流子控制区,耦合区和光生载流子收集区和读出区,其特征为:至少两个所述发光单元,被设置为发出代表至少两个加数的光信号;至少两个光电计算单元的输出端,被设置为彼此相连,其中所述至少两个光电计算单元的载流子控制区,被设置为输入一个恒定电压值,用于控制并驱动单元内的载流子;所述至少两个光电计算单元的耦合区,负责连接光生载流子收集区和读出区,并被设置为使光生载流子收集区和读出区中收集区的光生载流子作用于所述读出区中的载流子;所述至少两个光电计算单元的光生载流子收集区和读出区,每个各包含一个光输入端和至少一个结果输出端,其中所述至少两个光电计算单元的光输入端,被设置为分别接受至少两个发光单元发出的所述光信号,所述光信号分别被作为加法器的加数;所述至少两个结果输出端,被设置为输出受光输入端输入量影响的光生载流子收集区和读出区中读出区里的载流子,又在固定电压的驱动下以电流的形式输出,并经汇聚后得到最终结果,所述最终结果被作为加法运算的结果输出。
- 如权利要求10所述的光电计算方法,用于进行至少两个加数的加法运算,其中:所述至少两个发光单元发出的光信号,作为加法器的至少两个加数,对应地照射到至少两个对应的光电计算单元;在所述至少两个光生载流子收集区和读出区中,使代表相应加数的光生载流子作用于相应的光生载流子收集区和读出区中读出区的载流子,并且被作用后的所述载流子在一个恒定电压的驱动下,以电流的形式输出,再流经相连的输出端完成电流的汇聚,所述汇聚后的电流值被作为加法器的结果输出。
- 如权利要求6所述的光电计算单元,被用于作为乘法器,包含所述一个发光单元以及一个光电计算单元,所述光电计算单元至少包含载流子控制区、耦合区和光生载流子收集区和读出区,其特征为:所述发光单元,在被设置为发出代表第一个乘数的光信号;所述载流子控制区,被设置为输入恒定的电压值,用于控制并驱动单元内的载流子;所述耦合区,连接光生载流子收集区和读出区,并被设置为使光生载流子收集区和读出区中收集区的光生载流子作用于所述读出区中的载流子;所述光生载流子收集区和读出区,包含一个光输入端、一个电输入端和至少一个结果输出端,其中,光生载流子收集区和读出区中的收集区,被设置为吸收发光单元发出的光,产生光生载流子并收集,为乘法器的光输入端口,输入第一个乘数;所述电输入端口,被设置为输入乘法器的第二位乘数,并且所述光生载流子收集区和读出区中读出区内的被第一位乘数和第二位乘数共同作用的载流子,作为光输入量和电输入量作用后的载流子,在光电子收集区和读出区的输出端作为结果被输出。
- 如权利要求10所述的光电计算方法,被用于进行乘法运算,其中:所述发光单元发出的光子所产生的光生载流子被设置为乘法器的第一个乘数;在所述光生载流子收集区和读出区,从读出区的电输入端口输入的载流子被设置为乘法器的第二个乘数,使代表所述第一乘数的光生载流子和代表所述第二加数的载流子共同作用于光生载流子收集区和读出区中读出区内的载流子,并且被作用后的所述载流子被作为乘法器的结果,在光电子收集区和读出区的输出端被输出。
- 如权利要求6所述的光电计算单元,被用于作为乘法器,包含一个所述发光单元一个光电计算单元,所述光电计算单元至少包含载流子控制区,耦合区和光生载流子收集区和读出区,其特征为:所述发光单元,被设置为发出代表第一个乘数的光信号;所述载流子控制区,被设置为控制并调制单元内的载流子,并作为光电计算单元的电输入端,串行的按照高低位的顺序输入经过二进制转换的第二个乘数;所述耦合区,被设置为负责连接光生载流子收集区和读出区,使光生载流 子收集区和读出区中收集区的光生载流子作用于所述读出区中的载流子;所述光生载流子收集区和读出区,包含一个光输入端和至少一个结果输出端,其中,光生载流子收集区和读出区中的收集区,被设置为吸收发光单元发出的光,产生光生载流子并收集,为乘法器的光输入端口,输入第一个乘数;所述第一个乘数和所述第二个乘数的二进制各比特位上的二值化输入量共同作用于光生载流子收集区和读出区中读出区里的载流子,并且所述读出区中的载流子经结果输出端被作为第一个乘数和第二个乘数各比特位上的值的乘法结果被依次串行地输出,再经过移位和拼接工作,得到最终的乘法结果。
- 如权利要求10所述的光电计算方法,被用于进行乘法运算,其中:所述发光单元发出的光子所产生的光生载流子被设置为乘法器的光输入端数据,为第一个乘数;将被代表为第二位乘数的量被转化为二进制,并按照比特位高低串行地从所述载流子控制区以被调制过的载流子形式注入,且所述被调制的载流子被作为乘法器的第二个乘数,从电输入端输入;在所述载流子收集区和读出区中,使代表所述第一个乘法量的光生载流子和代表所述第二个乘法量的串行输入地载流子控制区载流子依次共同作用于相应的光生载流子收集区和读出区中读出区的载流子;在光电子收集区和读出区的输出端的输出量,作为结果被输出,再依次被移位和累加,所得结果为最终的乘法运算结果。
- 如权利要求6所述的光电计算单元,用于作为乘法器,包含至少两个所述发光单元和至少两个所述光电计算单元,所述光电计算单元至少包含载流子控制区,耦合区和光生载流子收集区和读出区,其中还包括:所述至少两个发光单元,被设置为发出数值相同的光信号,所述光信号被作为第一个乘数;至少两个光电计算单元,被设置为并行排列,但不改变发光单元在光学上与之对应的关系,其中所述至少两个光电计算单元的载流子控制区,被设置用于控制并调制单元内的载流子,并且按照比特位高低,将经过二进制转化后的第二个乘数的二值数据,并行地输入到所述至少两个并行排列的光电计算单 元上,作为乘法器的电输入端数据,其组合被作为第二个乘数;所述至少两个光电计算单元的耦合区,连接光生载流子收集区和读出区,被设置为使光生载流子收集区和读出区中收集区的光生载流子作用于所述读出区中的载流子;所述至少两个光电计算单元的光生载流子收集区和读出区,每个各包含一个光输入端和至少一个结果输出端,其中所述至少两个光电计算单元的光输入端,被设置为接受至少两个发光单元发出的所述光信号;所述至少两个结果输出端,被设置为输出受第一个乘数和第二个乘数各个比特位数据共同影响的光生载流子收集区和读出区中读出区里的载流子,并且在至少两个光电子收集区和读出区的输出端输出,进行移位和累加的操作,其结果被作为乘法器的结果被输出。
- 如权利要求10所述的光电计算方法,被用于进行乘法运算,其中:所述至少两个发光单元发出的光子所产生的光生载流子被设置为乘法器的光输入端数据,作为第一个乘数;将代表第二个乘数的量转化为二进制,并按照比特位高低并行地从所述至少两个载流子控制区以被调制过的载流子形式分别注入不同的单元中,且所述被调制的载流子,被作为第二个乘数;在所述至少两个载流子收集区和读出区中,使代表所述第一个乘法量的光生载流子和代表所述第二个乘法量不同比特位二值数据的载流子控制区载流子,分别作用于相应的至少两个光生载流子收集区和读出区中读出区的载流子,并且被作用后的所述载流子在光电子收集区和读出区的输出端作为结果被输出,再经过移位和累加操作,被作为最终的乘法器运算结果。
- 如权利要求6所述的光电计算单元,被用于作为乘法器,包含至少两个所述发光单元和至少两个所述光电计算单元,所述光电计算单元至少包含载流子控制区,耦合区和光生载流子收集区和读出区,其中还包括:所述至少两个发光单元,被设置为发出数值相同的光信号,所述光信号被作为第一个乘数;至少两个光电计算单元,被设置为并行排列,但不改变发光单元在光学上 与之的对应关系,并将输出端相连,其中,所述至少两个光电计算单元的载流子控制区,被设置用于控制并调制单元内的载流子,并且按照比特位高低,将经过二进制转化后的第二个乘数的二值数据并行的输入到所述至少两个并行排列的光电计算单元上,作为乘法器的电输入端数据,为第二个乘数;所述至少两个光电计算单元的耦合区负责连接光生载流子收集区和读出区,被设置为使光生载流子收集区和读出区中收集区的光生载流子作用于所述读出区中的载流子;所述至少两个光电计算单元的光生载流子收集区和读出区,每个各包含一个光输入端、一个电输入端和至少一个结果输出端,其中,所述光电计算单元的光输入端被设置为接受至少两个发光单元发出的所述光信号;所述电输入端,被设置为按照载流子控制区上输入数据比特位的高低,输入代表所述比特位位权的载流子;所述至少两个结果输出端,输出受第一个乘数、第二个乘数的各个比特位数据和各个比特位位权共同影响的光生载流子收集区和读出区中读出区里的载流子,又在固定电压的驱动下以电流的形式输出,并经汇聚后输出最后的乘法结果。
- 如权利要求10所述的光电计算方法,被用于进行乘法运算,其中:所述至少两个发光单元发出的光子所产生的光生载流子被设置为乘法器的光输入端数据,为第一个乘数;将被作为第二个乘数的量被转化为二进制,并按照比特位高低并行地从所述至少两个载流子控制区以被调制过的载流子形式分别注入不同的单元中,且所述被调制的载流子,被作为第二个乘数;在所述至少两个载流子收集区和读出区中,通过电输入端输入被设置为和载流子控制区上输入数据比特位位权相当的载流子,并且使代表所述第一个乘法量的光生载流子和代表所述第二个乘法量不同比特位的二值数据的载流子控制区的载流子,以及代表位权的载流子收集区和读出区的电输入端载流子,共同作用于相应的至少两个光生载流子收集区和读出区中读出区的载流子,并且被作用后的所述载流子以电流的形式输出,并加以汇聚,其结果被作为所述乘法器的结果输出。
- 如权利要求11、13、15中任一项所述的光电计算单元,被用于组成光电计算向量加法器,用于进行至少两组维度至少为二的向量的加法运算,其中,所述光电计算加法器包含至少两个加数输入端和结果输出端,其特征还在于:至少两个所述光电计算加法器,被设置为并行排列;每一个所述加法器的输入端,被设置为输入至少两位加数,代表至少两个待加向量相同序号的对应元素,其中,使用的加法器输入端的数量,不少于所述待加向量的数量;每一个所述加法器的输出端,被设置为输出两个向量相同序号对应元素相加的结果,所述至少两个结果被组合,拼接成一个完整的向量,所述完整的向量为所述向量加法器的运算结果。
- 如权利要求12、14、16中任一项所述的光电计算方法,被用于进行向量加法运算,用于进行至少两组维度至少为二的向量的加法运算,其中:将所述至少两个待加向量按照维度进行拆分,形成多组独立的加数;将每一组独立的加数输入到每一个加法器的加数输入端,其中使用的加法器输入端的数量不少于待加向量的数量;将所述至少两个加法器输出端的输出结果,按照输入的向量元素序号,重新拼接成一个完整的向量,所述完整的向量即为至少两个待加向量加法运算后的结果向量。
- 如权利要求19、21、23、25中任一项所述的光电计算单元,被用于组成光电计算向量点乘器,用于进行维度至少为二的向量的点乘运算,其中,所述光电计算乘法器包含两个乘数输入端和结果输出端,其中:至少两个所述光电计算乘法器,被设置为独立并行排列;每一个乘法器的输入端,被设置为输入待乘向量相同序号的对应元素的乘数;每一个乘法器的输出端,被设置为输出两个待乘向量相同序号对应元素相乘的结果,所述至少两个结果,拼接成一个完整的向量,所述完整向量为所述向量点乘器的运算结果。
- 如权利要求20、22、24、26中任一项所述的光电计算方法,被用于进行向量点乘运算,用于进行维度至少为二的向量的点乘运算,其中:将两个待乘向量按照维度进行拆分,形成多组独立的乘数;将每一组独立的乘数输入到每一个乘法器的乘数输入端;将所述至少两个乘法器输出端的输出结果,按照输入的向量元素序号,重新拼接成一个完整的向量,所述完整的向量即为两个待乘向量点乘后的结果向量。
- 如权利要求19、21、23、25中任一项所述的光电计算单元,被用于组成高位宽乘法器,其中每个所述光电计算乘法器包含两个乘数输入端和结果输出端,其中还包括:至少四个所述光电计算乘法器,被设置为并行排列;所述至少四个乘法器的输入端,其输入量被设置为经过高低位拆分后的待乘数部分数据的乘数;所述至少四个乘法器的输出端,被设置为输出两个待乘数相应高低位相乘后的结果,其中至少四个乘法器输出的结果,被按照输入数据的位权进行相应的移位和累加,得到一个完整的高位宽数,并作为最终的乘法结果。
- 如权利要求20、22、24、26中任一项所述的光电计算方法,被用于进行高位宽乘法运算,其中:将两个待乘高位宽数按照比特位进行高低位拆分,将两个高位宽乘数拆分成两组低位宽乘数,所述高位宽乘数拆成的份数取决于高位宽乘数的具体位宽;拆分后的两组低位宽乘数按照两两相乘的组合规则,分别输入到至少四个乘法器的乘数输入端,其中,使用的乘法器的数量,取决于待乘高位宽乘数具体位宽;将所述至少四个乘法器输出端的输出结果按照输入的乘数的位宽高低,进行相应的移位操作,再将移位后的结果累加,最终的累加结果即为两个高位宽乘数相乘后的结果。
- 如权利要求6所述的光电计算单元,包含多个所述发光单元和所述光电计算单元,组成串行矩阵向量乘法器;所述光电计算单元至少包含载流子控制区,耦合区和光生载流子收集区和读出区,其特征为:所述发光单元,被设置为发出待乘矩阵中数据的光信号;所述光电计算单元,被设置为排列成和待乘矩阵行列数相同的单元阵列,但不改变发光单元在光学上与之的对应关系,其中所述单元阵列每一列的所有单元的光生载流子收集区和读出区的输出端彼此相连,所述单元阵列每一行的所有单元的载流子控制区彼此相连;所述单元阵列中每一行单元的载流子控制区,被设置用于控制并调制单元内的载流子,并且按列输入被设置为代表向量中各个元素的载流子,为矩阵向量乘法器的向量数据输入端;其中,所述向量中各个元素的数据被设置为转化为二进制后,按照比特位串行的将代表二值化后数据的载流子输入到各个行的载流子控制区;所述光电计算单元的耦合区,负责连接光生载流子收集区和读出区,被设置为使光生载流子收集区和读出区中收集区的光生载流子作用于所述读出区中的载流子;所述多个光电计算单元的光生载流子收集区和读出区,每个单元包含一个光输入端和至少一个结果输出端,其中,所述光电计算单元的光输入端被设置为接受相应的发光单元发出的光信号并输入矩阵中的数据,被设置为矩阵向量乘法器的矩阵数据输入端;所述结果输出端,被设置为输出受矩阵数据和向量数据共同作用的光生载流子收集区和读出区中读出区里的载流子,又在固定电压的驱动下以电流的形式输出,再按列汇聚,并且将结果进行输出;所输出的结果再按输入的比特位进行移位操作,然后累加,即得到最终的结果向量。
- 如权利要求10所述的光电计算方法,用于进行矩阵向量乘法运算,其中:所述发光单元发出的光子所产生的光生载流子被设置为矩阵向量乘法器的光输入端数据,为待乘的矩阵数据;将所述光电计算单元排列成和待乘矩阵行列数相同的单元阵列,但不改变 发光单元在光学上与之的对应关系,其中,所述单元阵列每一列的所有单元的光生载流子收集区和读出区的输出端彼此相连,所述单元阵列每一行的所有单元的载流子控制区彼此相连;向量数据中的每个元素被转化为二进制,并按照比特位高低串行地从被设置为同行相连的载流子控制区以被调制过的载流子形式,将不代表不同元素的数据分别注入不同的行中,为待乘的向量数据;在所述载流子收集区和读出区中,使代表所述矩阵数据的光生载流子和代表所述向量数据不同比特位二值数据的载流子控制区载流子共同作用于相应的光生载流子收集区和读出区中读出区的载流子,并且被作用后的所述载流子在恒定电压的驱动下,以电流的形式输出,并按列汇聚,再完成移位和累加操作,即得到最终乘法结果向量。
- 如权利要求6所述的光电计算单元,被用于组成并行矩阵向量乘法器,包含多个所述发光单元和多个所述光电计算单元,所述光电计算单元至少包含载流子控制区,耦合区和光生载流子收集区和读出区,其特征为:所述发光单元被设置为发出待乘矩阵中数据的光信号;所述光电计算单元,被设置得分成多组,每一组单元再排列成和待乘矩阵行列数相同的单元阵列,但不改变发光单元在光学上与之的对应关系,其中,所述单元阵列每一列的所有单元的光生载流子收集区和读出区的输出端彼此相连,所述单元阵列每一行的所有单元的载流子控制区彼此相连;每一组单元阵列中每一行单元的载流子控制区,被设置为用于控制并调制单元内的载流子,并且并行地按组按列输入被设置为代表向量中各个元素二值化后对应比特位数据的载流子,为矩阵向量乘法器的向量数据输入端;多个所述光电计算单元的耦合区负责连接光生载流子收集区和读出区,被设置为使光生载流子收集区和读出区中收集区的光生载流子作用于所述读出区中的载流子;多个所述光电计算单元的光生载流子收集区和读出区,每个单元包含一个光输入端和至少一个结果输出端,其中,多个所述光电计算单元的光输入端,被设置为接受多个发光单元发出的所述光信号,输入矩阵中的数据,为矩阵向量乘法器的矩阵数据输入端;所述多个结果输出端,被设置为输出受矩阵数据 和向量数据共同作用的光生载流子收集区和读出区中读出区里的载流子,又在固定电压的驱动下以电流的形式输出,再按列汇聚后按输入的比特位进行移位操作最后累加,形成最终的结果向量。
- 如权利要求10所述的光电计算方法,被用于进行矩阵向量乘法运算,其中:所述多个发光单元发出的光子所产生的光生载流子被设置为矩阵向量乘法器的光输入端数据,为矩阵数据;将多个光电计算单元分成多组,每一组单元再排列成和待乘矩阵行列数相同的单元阵列,但不改变发光单元在光学上与之的对应关系,其中,所述单元阵列每一列的所有单元的光生载流子收集区和读出区的输出端都相连,所述单元阵列每一行的所有单元的载流子控制区都相连;向量数据中的每个元素被转化为二进制,并行地从相应组的被设置为同行相连的载流子控制区以被调制过的载流子形式,将代表不同元素不同比特位的数据分别注入不同组的不同的行中,为向量数据;在多个所述载流子收集区和读出区中,使代表所述矩阵数据的光生载流子和代表所述向量数据不同比特位二值数据的载流子控制区载流子分别共同作用于相应的多个光生载流子收集区和读出区中读出区的载流子,并且被作用后的所述载流子在恒定电压的驱动下,以电流的形式输出,并按列汇聚,再经过移位和累加操作,即得到最终结果向量。
- 如权利要求31、33中任一项所述的光电计算单元,被用于组成平均池化运算器,用于至少包含两个元素的矩阵的池化运算,所述光电矩阵向量乘法器包含一个矩阵输入端、一个向量输入端和结果输出端,其中包括:光电矩阵向量乘法器,列数为1,行数和待池化矩阵元素个数相当,作为待池化矩阵的平均池化运算器;所述光电矩阵向量乘法器的向量输入端,作为电输入端,输入待池化矩阵中的不同元素,为所述池化运算器的待池化矩阵输入端;矩阵输入端,作为光输入端,输入列数为1,行数和待池化矩阵元素个数相同,且每个元素都为待池化矩阵元素个数的倒数的矩阵,为平均值分母输入端;所述结果输出端,被设置为输出待池化矩阵最终的平均池化结果。
- 如权利要求32、34中任一项所述的光电计算方法,被用于进行平均池化运算,用于进行元素个数至少为2的矩阵的平均池化运算,其中:将待池化矩阵拆分成一个个独立的元素,再将拆分后的所有元素重新组成维度和待池化矩阵元素个数相当的向量;所述矩阵向量乘法器,适用于进行待乘矩阵行数和待池化矩阵元素个数相等、列数为1的矩阵向量运算;将所述待池化矩阵拆分后重组的向量,作为矩阵向量乘法器的向量输入端,即电输入端数据,输入到矩阵向量乘法器中;将维度和所使用的矩阵向量乘法器相同,每一个元素都为待池化矩阵元素个数的倒数的矩阵作为矩阵向量乘法器的矩阵输入端,即光输入端数据,输入到矩阵向量乘法器中,作为平均池化运算中求平均值的分母;所述结果输出端输出结果即为待池化矩阵中每一个元素除以元素个数再相加的结果,即待池化矩阵平均池化后的结果。
- 如权利要求6所述的光电计算单元,组成串行卷积运算器,所述光电计算单元至少包含载流子控制区,耦合区和光生载流子收集区和读出区,其特征为:所述发光单元,被设置为发出代表卷积核中数据的光信号;所述光电计算单元,被设置为排列成和卷积核行列数相等的阵列,但不改变发光单元在光学上与之的对应关系,其中,所述单元阵列的所有单元的光生载流子收集区和读出区的输出端彼此相连,汇总成一个输出端;所述阵列中每个光电计算单元的载流子控制区,被设置用于控制并调制单元内的载流子,并且串行输入被设置为代表经过补0操作后的待卷积矩阵中和初始卷积核位置对应的小矩阵中各个元素相应比特位数据的载流子,为卷积运算器的待卷积矩阵数据输入端;其中,所述小矩阵中数据被设置为转化为二进制后,按照比特位串行的将代表二值化后数据的载流子输入到各个单元的载流子控制区;所述阵列中光电计算单元的耦合区,连接光生载流子收集区和读出区,被 设置为使光生载流子收集区和读出区中收集区的光生载流子作用于所述读出区中的载流子;所述阵列中每一个光电计算单元的光生载流子收集区和读出区,包含一个光输入端和至少一个结果输出端,其中,所述光输入端被设置为接受发光单元发出的所述光信号,输入卷积核中的数据,为卷积运算器的卷积核输入端;所述每一个单元的输出端,被设置为输出受待卷积矩阵分割出的小矩阵相应比特位数据和卷积核数据共同影响的光生载流子收集区和读出区中读出区里的载流子,又在固定电压的驱动下以电流的形式输出,再经并联的输出端使得阵列中全部光电计算单元的输出电流全部汇聚后,最后按输入的比特位进行移位操作然后累加,得到当前卷积核位置对应的卷积运算结果;按卷积运算所需步长移动卷积核的位置,重新输入当前卷积核位置对应的待卷积矩阵数据分割出的小矩阵数据,得到输出值后继续移动卷积核,直到完成全部卷积运算;将所述全部输出值重新组成相应维度的矩阵,即得到最终卷积运算的结果。
- 如权利要求10所述的光电计算方法,用于进行卷积运算,其中:所述发光单元发出的光子所产生的光生载流子被设置为卷积运算器的光输入端数据,为卷积核数据;将光电计算单元排列成和所述卷积运算卷积核行列数相同的单元阵列,但不改变发光单元在光学上与之的对应关系,其中,所述单元阵列中的每一个光电计算单元的所有载流子收集区和读出区中读出区的输出端都相连,汇聚成一个输出端;将待卷积矩阵经过补零操作后,按照卷积核当前的位置,分割出维度和卷积核大小相当的小矩阵,并将小矩阵中的每个元素转化为二进制,并按照比特位高低串行地从所述载流子控制区以被调制过的载流子形式输入到每一个单元当中,为待卷积矩阵数据;在所述阵列中单元的载流子收集区和读出区中,使代表所述待卷积矩阵中分割出的小矩阵相应比特位数据的光生载流子和代表所述卷积核数据的载流子控制区载流子分别共同作用于相应的光生载流子收集区和读出区中读出区的载流子,并且被作用后的所述载流子在恒定电压的驱动下,以电流的形式输出,并全部汇聚,再完成移位和累加操作,得到当前卷积核位置对应的卷积运 算结果;按卷积运算所需步长移动卷积核的位置,重新输入当前卷积核位置对应的待卷积矩阵数据分割出的小矩阵数据,得到输出值后继续移动卷积核,直到完成全部卷积运算;将所述全部输出值重新组成相应维度的矩阵,即得到最终卷积运算的结果。
- 如权利要求6所述的光电计算单元,被用于组成并行卷积运算器,所述光电计算单元至少包含载流子控制区,耦合区和光生载流子收集区和读出区,其特征为:所述发光单元,被设置为发出代表卷积核中数据的光信号;所述数量的光电计算单元,被设置为分成多组,并且每一组都排列成行列数和卷积核行列数相等的阵列,但不改变发光单元在光学上与之的对应关系,其中,所述每一组阵列的所有单元的光生载流子收集区和读出区的输出端都相连,汇总成一个输出端;所述每一组阵列中每个光电计算单元的载流子控制区,被设置用于控制并调制单元内的载流子,并且并行输入被设置为代表经过补0操作后的待卷积矩阵中和初始卷积核位置对应的小矩阵中各个元素相应比特位数据的载流子,为卷积运算器的待卷积矩阵数据输入端;其中,所述小矩阵中数据被设置为转化为二进制后,将相应比特位对应的数据输入到相应组的单元阵列中;所述各个组阵列中光电计算单元的耦合区负责连接光生载流子收集区和读出区,被设置为使光生载流子收集区和读出区中收集区的光生载流子作用于所述读出区中的载流子;所述各个组阵列中每一个光电计算单元的光生载流子收集区和读出区,包含一个光输入端和至少一个结果输出端,其中,所述光输入端,用以接受发光单元发出的所述光信号,输入卷积核中的数据,为卷积运算器的卷积核输入端,不同组的阵列接收相同的光输入数据;所述每一个单元的输出端,被设置为输出受待卷积矩阵分割出的小矩阵相应比特位数据和卷积核数据共同影响的光生载流子收集区和读出区中读出区里的载流子,又在固定电压的驱动下以电流的形式输出,再经并联的输出端使得阵列中全部光电计算单元的输出电流全部汇聚,最后按输入的比特位进行移位操作然后累加,得到当前卷积核 位置对应的卷积运算结果;按卷积运算所需步长移动卷积核的位置,重新输入当前卷积核位置对应的待卷积矩阵数据分割出的小矩阵数据,得到输出值后继续移动卷积核,直到完成全部卷积运算;将所述全部输出值重新组成相应维度的矩阵,即得到最终卷积运算的结果。
- 如权利要求10所述的光电计算方法,被用于进行卷积运算,其中:所述发光单元发出的光子所产生的光生载流子被设置为卷积运算器的光输入端数据,为卷积核数据;将光电计算单元分成多组,每一组都排列成行列数和卷积核行列数相等的阵列,但不改变发光单元在光学上与之的对应关系,其中,所述每一组单元阵列中的每一个光电计算单元的所有载流子收集区和读出区中读出区的输出端都相连,汇聚成一个输出端;将待卷积矩阵经过补零操作后,按照卷积核当前的位置,分割出维度和卷积核大小相当的小矩阵,再将小矩阵中的每个元素转化为二进制,并将相应比特位数据并行地输入相应组阵列的载流子控制区,以被调制过的载流子形式输入到每一个单元当中,为待卷积矩阵数据;在所述光电计算单元的载流子收集区和读出区中,使代表所述待卷积矩阵中分割出的小矩阵相应比特位数据的光生载流子和代表所述卷积核数据的载流子控制区载流子分别共同作用于相应的光生载流子收集区和读出区中读出区的载流子,并且被作用后的所述载流子在恒定电压的驱动下,以电流的形式输出,一个组阵列的所有输出电流全部汇聚,再完成移位和累加操作,即得到当前卷积核位置对应的卷积运算结果;按卷积运算所需步长移动卷积核的位置,重新输入当前卷积核位置对应的待卷积矩阵数据分割出的小矩阵数据,得到输出值后继续移动卷积核,直到完成全部卷积运算;将所述全部输出值重新组成相应维度的矩阵,即得到最终卷积运算的结果。
- 一种神经网络算法加速装置,包括如权利要求31或33所述的光电矩阵向量乘法器、如权利要求35所述的平均池化运算器以及如权利要求37或39所述的卷积运算器,其中:所述矩阵向量乘法器的矩阵输入端,被设置为输入网络权值,从向量输入端输入上一级网络的输出数据或初始数据;所述平均池化运算器的平均值分母输入端,被设置为输入待池化矩阵中元素个数的倒数;待池化矩阵输入端,被设置为输入待池化数据;所述卷积运算器的卷积核输入端,被设置为输入卷积核中的数据;从待卷积矩阵输入端输入卷积核当前位置对应的待卷积矩阵中的小矩阵数据,以及非线性函数模块,由电学计算元件组成,被设置为运算非线性函数;通用逻辑运算模块,包括电学运算器和/或基于权利要求1所述光电计算单元,被设置为用于整合并控制所述矩阵向量乘法器、平均池化运算器以及卷积运算器的运算功能。
- 一种神经网络运算的加速方法,采用如权利要求32或34所述的矩阵向量乘法的光电计算方法、如权利要求36所述的池化运算的光电计算方法以及如权利要求38或40所述的卷积的光电计算方法,其特征还在于:从有关的矩阵向量乘法器的矩阵输入端输入网络权值,从所述矩阵向量乘法器的向量输入端输入上一级网络的输出数据或初始数据;从有关的平均池化运算器的平均值分母输入端输入待池化矩阵中元素个数的倒数,从所述平均池化运算器的待池化矩阵输入端输入待池化数据;从有关的卷积运算器的卷积核输入端输入卷积核中的数据,从所述卷积运算器的待卷积矩阵输入端输入卷积核当前位置对应的待卷积矩阵中的小矩阵数据;采用由电学计算元件组成的非线性函数模块,用于运算非线性函数;采用通用逻辑运算模块,所述通用逻辑运算模块包括电学运算器和/或基于权利要求1所述光电计算单元,用于整合并控制所述矩阵向量乘法器、平均池化运算器以及卷积运算器的运算功能。
- 如权利要求6所述的光电计算单元,被用于组成以代数重建算法为基础的CT算法加速器,所述光电计算单元至少包含载流子控制区,耦合区和光生载流子收集区和读出区,其特征为:所述发光单元,被设置为发出CT算法中系统矩阵中数据的光信号;所述光电计算单元,被设置为排列成和系统矩阵行列数相等的阵列,但不改变发光单元在光学上与之的对应关系,其中,所述单元阵列的同一列的所有光电计算单元的载流子收集区和读出区的读出区的输出端彼此相连,汇总成一个输出端,不同列的输出端相互独立;所述阵列中每个光电计算单元的载流子控制区,被设置用于控制并调制单元内的载流子,并且串行输入被设置为和当前迭代次数对应的预测图像像素数据二值化后相应比特位数据的载流子,为CT算法加速器的预测图像数据输入端;所述阵列中光电计算单元的耦合区负责连接光生载流子收集区和读出区,被设置为使光生载流子收集区和读出区中收集区的光生载流子作用于所述读出区中的载流子;所述阵列中每一个光电计算单元的光生载流子收集区和读出区,包含一个光输入端和至少一个结果输出端,其中,所述光输入端用以接受发光单元发出的所述光信号,输入系统矩阵中的数据,为CT算法加速器的系统矩阵输入端;所述每一个单元的输出端,输出受当前迭代预测图像像素数据相应比特位数据和系统矩阵数据共同影响的光生载流子收集区和读出区中读出区里的载流子,又在固定电压的驱动下以电流的形式输出,再经同列并联的输出端使得阵列中全部光电计算单元的输出电流全部汇聚后,最后按输入的比特位进行移位操作然后累加,再在控制系统中完成其他非矩阵向量乘的运算内容后,进入下一次迭代,并且,初始迭代时,第一次迭代的预测图像像素数据输入所述阵列第一列中单元的载流子控制区,从第一列获得输出结果后经控制系统处理进入第二次迭代,将第二次迭代的预测图像像素数据输入所述阵列的第二列中单元的载流子控制区,以此类推,完成所有列的迭代后输出结果并返回第一列继续迭代,完成全部迭代后输出数据给控制系统后再送达显示系统成像,获得处理过的CT图片。
- 如权利要求10所述的光电计算方法,被用于进行CT算法加速,其中:将所述发光单元发出的光子所产生的光生载流子设置为CT算法加速器的光输入端数据,为系统矩阵数据;将光电计算单元排列成和所述系统矩阵行列数相同的单元阵列,但不改变 发光单元在光学上与之的对应关系,其中,所述单元阵列中同一列的每一个光电计算单元的所有载流子收集区和读出区中读出区的输出端都相连,汇聚成一个输出端;按照当前的迭代次数,将对应预测图像像素数据向量中的每个元素转化为二进制,并按照比特位高低串行地从所述载流子控制区以被调制过的载流子形式输入到每一个单元当中,为预测图像数据;在所述阵列中单元的载流子收集区和读出区中,使代表所述当前迭代次数预测图像像素相应比特位数据的光生载流子和代表所述系统矩阵数据的载流子控制区载流子分别共同作用于相应的光生载流子收集区和读出区中读出区的载流子,并且被作用后的所述载流子在恒定电压的驱动下,以电流的形式输出,并按列汇聚,完成移位和累加操作,再在控制系统中完成其他非矩阵向量乘的运算内容后,进入下一次迭代;初始迭代时,第一次迭代的预测图像像素数据输入所述阵列第一列中单元的载流子控制区,从第一列获得输出结果后经控制系统处理进入第二次迭代,将第二次迭代的预测图像像素数据输入所述阵列的第二列中单元的载流子控制区,以此类推,完成所有列的迭代后输出结果并返回第一列继续迭代,完成全部迭代后输出数据给控制系统后再送达显示系统成像,获得处理过的CT图片。
- 如权利要求6所述的光电计算单元,被用于组成以代数重建算法为基础CT算法加速器,所述光电计算单元至少包含载流子控制区,耦合区和光生载流子收集区和读出区,其特征为:所述发光单元被设置为发出CT算法中系统矩阵中数据的光信号;所述光电计算单元被设置为分成多组,并且每一组都排列成行列数和系统矩阵行列数相等的阵列,但不改变发光单元在光学上与之的对应关系,其中,所述各个组阵列的同一列的所有光电计算单元的载流子收集区和读出区的读出区的输出端都相连,汇总成一个输出端,不同组不同列的输出端相互独立;所述每一组阵列中每个光电计算单元的载流子控制区,被设置用于控制并调制单元内的载流子,并且并行输入被设置为和当前迭代次数对应的预测图像像素数据二值化后相应比特位数据的载流子,为CT算法加速器的预测图像 数据输入端,其中,所述预测图像数据被设置为转化为二进制后,将相应比特位对应的数据输入到相应组的单元阵列中;所述阵列中光电计算单元的耦合区负责连接光生载流子收集区和读出区,被设置为使光生载流子收集区和读出区中收集区的光生载流子作用于所述读出区中的载流子;所述阵列中每一个光电计算单元的光生载流子收集区和读出区,包含一个光输入端和至少一个结果输出端,其中,所述的光输入端,用以接受发光单元发出的所述光信号,输入系统矩阵中的数据,为CT算法加速器的系统矩阵输入端,不同组的阵列接收相同的光输入数据;所述的每一个单元的输出端,输出受当前迭代预测图像像素数据相应比特位数据和系统矩阵数据共同影响的光生载流子收集区和读出区中读出区里的载流子,又在固定电压的驱动下以电流的形式输出,再经同列并联的输出端使得阵列中全部光电计算单元的输出电流全部汇聚后,最后按输入的比特位进行移位操作然后累加,再在控制系统中完成其他非矩阵向量乘的运算内容后,进入下一次迭代,并且,初始迭代时,第一次迭代的预测图像像素数据输入所述各个组阵列第一列中单元的载流子控制区,从第一列获得输出结果后经控制系统处理进入第二次迭代,将第二次迭代的预测图像像素数据输入所述各个组阵列的第二列中单元的载流子控制区,以此类推,完成所有列的迭代后输出结果并返回第一列继续迭代,完成全部迭代后输出数据给控制系统后再送达显示系统成像,获得处理过的CT图片。
- 如权利要求10所述的光电计算方法,被用于进行CT算法加速,其中:将所述发光单元发出的光子所产生的光生载流子设置为CT算法加速器的光输入端数据,为系统矩阵数据;将光电计算单元分成多组,并且每一组都排列成行列数和系统矩阵行列数相等的阵列,但不改变发光单元在光学上与之的对应关系,其中,所述单元阵列中同一列的每一个光电计算单元的所有载流子收集区和读出区中读出区的输出端都相连,汇聚成一个输出端;按照当前的迭代次数,将对应预测图像像素数据向量中的每个元素转化为二进制,并将相应比特位数据并行地输入相应组阵列的载流子控制区,以被调 制过的载流子形式输入到每一个单元当中,为预测图像数据;在所述阵列中单元的载流子收集区和读出区中,使代表所述当前迭代次数预测图像像素相应比特位数据的光生载流子和代表所述系统矩阵数据的载流子控制区载流子分别共同作用于相应的光生载流子收集区和读出区中读出区的载流子,并且被作用后的所述载流子在恒定电压的驱动下,以电流的形式输出,并按列汇聚,完成移位和累加操作,再在控制系统中完成其他非矩阵向量乘的运算内容后,进入下一次迭代;初始迭代时,第一次迭代的预测图像像素数据输入所述各个组阵列第一列中单元的载流子控制区,从第一列获得输出结果后经控制系统处理进入第二次迭代,将第二次迭代的预测图像像素数据输入所述各个组阵列的第二列中单元的载流子控制区,以此类推,完成所有列的迭代后输出结果并返回第一列继续迭代,完成全部迭代后输出数据给控制系统后再送达显示系统成像,获得处理过的CT图片。
- 一种单精度浮点乘法器,包括如权利要求29所述的光电计算单元和权利要求11、13、15、17中任一项所述的光电计算单元,用于进行单精度浮点数的乘法运算,其中,所述高位宽光电计算乘法器,包含两个高位宽乘数输入端和结果输出端;所述光电加法器,包含两个加数输入端和结果输出端,其中:所述高位宽光电计算乘法器的两个高位宽乘数输入端,被设置为尾数输入端,将经过加一操作的两个待乘单精度浮点数的尾数位数据输入高位宽乘法器的两个高位宽输入端,完成相乘后,输出尾数位运算结果给控制系统,所述光电加法器的两个加数输入端,被设置为指数输入端,将两个待乘的单精度浮点数的指数位数据输入加数输入端,完成相加后输出指数位运算结果给控制系统,所述控制系统,被设置为完成两个待乘单精度浮点数的符号位判断操作,输出相乘后的符号位数据,再与被输出到控制系统的尾数位运算结果、指数位运算结果重新组合成浮点数,最终得到的结果即两个待乘单精度浮点数的乘法结果。
- 一种单精度浮点乘法计算方法,采用如权利要求30所述的高位宽乘法 的光电计算方法以及权利要求12、14、16、18中任一项所述的光电计算方法,其中:将两个待乘单精度浮点数拆分为两个指数位数据,两个符号位数据和两个尾数位数据,并对所述两个尾数位数据进行加一操作,所述两个指数位数据输入光电计算加法器的两个加数输入端,输出结果作指数位运算结果,所述两个经过加一操作的尾数位数据输入高位宽乘法器的两个高位宽乘数输入端,输出结果作为尾数位运算结果,所述两个符号位数据由一个控制系统完成正负判断后,和尾数位运算结果、指数位运算结果重新组合成浮点数,所获得的两个待乘单精度浮点数,被作为最终乘法结果。
- 一种光电计算模块的数字逻辑控制方法,用于如权利要求9所述的光电运算模块的控制,其中:通过数字控制逻辑的数据输入部分接收需要进行运算的被运算量,并将需要通过光来输入的数据发送给光输入控制部分,将需要通过电来输入的数据发送给电输入接收控制部分,通过数字控制逻辑的光输入控制部分控制发光阵列的驱动装置,使发光阵列产生代表所述光输入量的光子,并入射光电计算单元,通过数字控制逻辑的光接收控制部分,控制计算阵列中的光电计算单元,根据具体运算步骤,使需要接收光输入数据的光电计算单元的相应功能区,处于接收光信号状态,并接收入射的代表所述光输入量的光子,完成光输入,通过数字控制逻辑的电输入接收控制部分,控制计算阵列中的光电计算单元,根据具体运算步骤,使需要接收电输入数据的光电计算单元的相应功能区,处于接收电信号状态,并接收输入的代表所述电输入量的载流子,完成电输入,进行运算,通过数字控制逻辑的输出控制部分,将计算产生的输出数据按照不同的计算方法进行相应的处理,得到输出所需的计算结果,通过数字控制逻辑的自检控制部分,将自检信号输入待自检的光电计算单元,接收光电计算单元的返回信号后得到自检结果,判断待自检的光电计算单 元是否异常。
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020217012261A KR102608628B1 (ko) | 2018-11-22 | 2019-10-16 | 광전자 컴퓨팅 유닛, 광전자 컴퓨팅 어레이 및 광전자 컴퓨팅 방법 |
JP2021540360A JP7224065B2 (ja) | 2018-11-22 | 2019-10-16 | 光電計算ユニット、光電計算アレイ及び光電計算方法 |
EP19887726.8A EP3839694A4 (en) | 2018-11-22 | 2019-10-16 | PHOTOELECTRIC CALCULATION UNIT, PHOTOELECTRIC CALCULATION NETWORK AND PHOTOELECTRIC CALCULATION PROCESS |
US17/278,567 US20210382516A1 (en) | 2018-11-22 | 2019-10-16 | Photoelectric computing unit, photoelectric computing array, and photoelectric computing method |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811398206.9 | 2018-11-22 | ||
CN201811398206.9A CN111208865B (zh) | 2018-11-22 | 2018-11-22 | 光电计算单元、光电计算阵列及光电计算方法 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020103615A1 true WO2020103615A1 (zh) | 2020-05-28 |
Family
ID=70773658
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2019/111513 WO2020103615A1 (zh) | 2018-11-22 | 2019-10-16 | 光电计算单元、光电计算阵列及光电计算方法 |
Country Status (7)
Country | Link |
---|---|
US (1) | US20210382516A1 (zh) |
EP (1) | EP3839694A4 (zh) |
JP (1) | JP7224065B2 (zh) |
KR (1) | KR102608628B1 (zh) |
CN (1) | CN111208865B (zh) |
TW (1) | TWI750541B (zh) |
WO (1) | WO2020103615A1 (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115420226A (zh) * | 2022-09-29 | 2022-12-02 | 成都理工大学 | 基于脉冲宽度的射线作用位置定位装置及定位方法 |
CN116094882A (zh) * | 2022-11-07 | 2023-05-09 | 南京大学 | 基于模拟存内计算的调制、解调方法及系统 |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109960310B (zh) * | 2019-04-02 | 2023-05-02 | 南京威派视半导体技术有限公司 | 一种光电计算单元 |
CN111723337B (zh) * | 2020-06-16 | 2023-02-07 | 上海交通大学 | 用于神经网络训练的光子张量核集成电路架构及其神经网络训练方法 |
CN112929016B (zh) * | 2021-01-28 | 2024-02-20 | 北京科技大学 | 一种垂直结构的光电逻辑开关 |
CN112836816B (zh) * | 2021-02-04 | 2024-02-09 | 南京大学 | 一种适用于光电存算一体处理单元串扰的训练方法 |
CN113363271B (zh) * | 2021-05-31 | 2023-12-22 | 武汉新芯集成电路制造有限公司 | 感光阵列及成像设备 |
CN113220268B (zh) * | 2021-06-11 | 2022-08-02 | 上海交通大学 | 光电混合乘累加计算结构 |
KR102656904B1 (ko) | 2022-03-30 | 2024-04-12 | 광주과학기술원 | 무전원 다기능성 광전 변환 논리소자, 이의 제조방법 및 이의 동작방법 |
CN114897800A (zh) * | 2022-04-22 | 2022-08-12 | 南京航空航天大学 | 基于SiC神经网络芯片的超高速X射线图像识别方法及装置 |
CN114626011B (zh) * | 2022-05-12 | 2022-08-09 | 飞诺门阵(北京)科技有限公司 | 光子计算神经网络运算加速方法、装置、设备及存储介质 |
WO2024106784A1 (ko) * | 2022-11-16 | 2024-05-23 | 경북대학교 산학협력단 | 광전자 모듈의 클러스터링을 통한 광 신경망의 구현 방법 및 클러스터링된 광전자 모듈을 통한 광신경망 장치 |
CN117391148A (zh) * | 2022-12-14 | 2024-01-12 | 北京登临科技有限公司 | 卷积计算单元、ai运算阵列及相关设备 |
KR20240127783A (ko) * | 2023-02-16 | 2024-08-23 | 서울대학교산학협력단 | 발광소자가 내장된 뉴로모픽 소자 및 이를 이용한 뉴로모픽 시스템 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0490015A (ja) * | 1990-08-02 | 1992-03-24 | Mitsubishi Electric Corp | 光ニューロコンピュータ |
CN102938409A (zh) * | 2012-11-07 | 2013-02-20 | 南京大学 | 基于复合介质栅mosfet的双晶体管光敏探测器及其信号读取办法 |
CN107180844A (zh) * | 2017-06-26 | 2017-09-19 | 南京大学 | 一种复合介质栅电容耦合变增益光敏探测器及其工作方法 |
CN107658321A (zh) * | 2016-07-25 | 2018-02-02 | 南京大学 | 基于复合介质栅的双器件光敏探测单元、探测器及其方法 |
CN110275569A (zh) * | 2019-05-25 | 2019-09-24 | 南京惟心光电系统有限公司 | 光电计算单元工作状态的控制方法 |
CN110276046A (zh) * | 2019-05-25 | 2019-09-24 | 南京惟心光电系统有限公司 | 一种光电计算单元的控制方法 |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5130563A (en) * | 1989-11-30 | 1992-07-14 | Washington Research Foundation | Optoelectronic sensory neural network |
US5321639A (en) * | 1992-03-02 | 1994-06-14 | Regents Of The University Of California | Dual-scale topology optoelectronic matrix algebraic processing system |
WO1996000913A2 (de) * | 1994-06-28 | 1996-01-11 | Matthias Budil | Holografischer neurochip |
US7113967B2 (en) * | 2001-05-29 | 2006-09-26 | Magiq Technologies, Inc | Efficient quantum computing operations |
AU2003291293A1 (en) * | 2002-11-05 | 2004-06-07 | Lightfleet Corporation | Optical fan-out and broadcast interconnect |
CN100510836C (zh) * | 2007-05-11 | 2009-07-08 | 中国科学院光电技术研究所 | 用于自适应光学系统波前控制运算的脉动阵列处理电路 |
CN101527010B (zh) * | 2008-03-06 | 2011-12-07 | 上海理工大学 | 人工神经网络算法的硬件实现方法及其系统 |
US8316073B1 (en) * | 2009-10-01 | 2012-11-20 | Lockheed Martin Corporation | Optical processor |
SG11201608265XA (en) * | 2014-03-06 | 2016-11-29 | Progress Inc | Neural network and method of neural network training |
JP7026057B2 (ja) * | 2016-06-02 | 2022-02-25 | マサチューセッツ インスティテュート オブ テクノロジー | 光ニューラルネットワークのための装置および方法 |
US10338630B2 (en) * | 2017-04-03 | 2019-07-02 | International Business Machines Corporation | Optical computing system |
-
2018
- 2018-11-22 CN CN201811398206.9A patent/CN111208865B/zh active Active
-
2019
- 2019-10-16 EP EP19887726.8A patent/EP3839694A4/en active Pending
- 2019-10-16 US US17/278,567 patent/US20210382516A1/en active Pending
- 2019-10-16 KR KR1020217012261A patent/KR102608628B1/ko active IP Right Grant
- 2019-10-16 JP JP2021540360A patent/JP7224065B2/ja active Active
- 2019-10-16 WO PCT/CN2019/111513 patent/WO2020103615A1/zh unknown
- 2019-11-22 TW TW108142659A patent/TWI750541B/zh active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0490015A (ja) * | 1990-08-02 | 1992-03-24 | Mitsubishi Electric Corp | 光ニューロコンピュータ |
CN102938409A (zh) * | 2012-11-07 | 2013-02-20 | 南京大学 | 基于复合介质栅mosfet的双晶体管光敏探测器及其信号读取办法 |
CN107658321A (zh) * | 2016-07-25 | 2018-02-02 | 南京大学 | 基于复合介质栅的双器件光敏探测单元、探测器及其方法 |
CN107180844A (zh) * | 2017-06-26 | 2017-09-19 | 南京大学 | 一种复合介质栅电容耦合变增益光敏探测器及其工作方法 |
CN110275569A (zh) * | 2019-05-25 | 2019-09-24 | 南京惟心光电系统有限公司 | 光电计算单元工作状态的控制方法 |
CN110276046A (zh) * | 2019-05-25 | 2019-09-24 | 南京惟心光电系统有限公司 | 一种光电计算单元的控制方法 |
Non-Patent Citations (1)
Title |
---|
See also references of EP3839694A4 |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115420226A (zh) * | 2022-09-29 | 2022-12-02 | 成都理工大学 | 基于脉冲宽度的射线作用位置定位装置及定位方法 |
CN116094882A (zh) * | 2022-11-07 | 2023-05-09 | 南京大学 | 基于模拟存内计算的调制、解调方法及系统 |
CN116094882B (zh) * | 2022-11-07 | 2023-09-22 | 南京大学 | 基于模拟存内计算的调制、解调方法及系统 |
Also Published As
Publication number | Publication date |
---|---|
EP3839694A1 (en) | 2021-06-23 |
JP7224065B2 (ja) | 2023-02-17 |
TW202020599A (zh) | 2020-06-01 |
TWI750541B (zh) | 2021-12-21 |
KR102608628B1 (ko) | 2023-11-30 |
CN111208865B (zh) | 2021-10-08 |
KR20210062682A (ko) | 2021-05-31 |
CN111208865A (zh) | 2020-05-29 |
US20210382516A1 (en) | 2021-12-09 |
EP3839694A4 (en) | 2021-10-27 |
JP2022500806A (ja) | 2022-01-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2020103615A1 (zh) | 光电计算单元、光电计算阵列及光电计算方法 | |
TWI774147B (zh) | 脈衝卷積神經網路演算法、積體電路及其製造方法、運算裝置及存儲介質 | |
JP7335990B2 (ja) | 撮像装置 | |
WO2019012370A1 (ja) | 撮像装置および電子機器 | |
JP2005260706A (ja) | 放射線撮像装置及びその制御方法 | |
CN110009102B (zh) | 一种基于光电计算阵列的深度残差网络的加速方法 | |
CN110244817B (zh) | 一种基于光电计算阵列的偏微分方程求解器及其方法 | |
CN110263295B (zh) | 一种基于光电计算阵列的矩阵向量乘法器的运算优化方法 | |
CN110276046B (zh) | 一种光电计算单元的控制方法 | |
US7688947B2 (en) | Method for reducing sensitivity modulation and lag in electronic imagers | |
CN109993283B (zh) | 基于光电计算阵列的深度卷积生成式对抗网络的加速方法 | |
US11650335B2 (en) | Method for the position and energy determination in scintillation detectors | |
CN110275569B (zh) | 光电计算单元工作状态的控制方法 | |
JP7171563B2 (ja) | 撮像装置 | |
CN110276048B (zh) | 一种矩阵向量乘阵列的控制方法 | |
CN110262774B (zh) | 一种光电乘法器的计算方法 | |
CN110276047B (zh) | 一种利用光电计算阵列进行矩阵向量乘运算的方法 | |
CN109976441B (zh) | 一种可实现高精度光输入的光电计算装置 | |
CN114843295A (zh) | 基于复合介质栅结构的光电一体器件、阵列及其方法 | |
Gouin-Ferland et al. | Combining optimized quantization and machine learning for real-time data reduction at the edge | |
CN110288078B (zh) | 一种针对GoogLeNet模型的加速器及其方法 | |
CN110045781B (zh) | 一种利用光学结构输入的光电计算阵列 | |
Meo et al. | GEANT4 simulation for modelling the optics of LaBr scintillation imagers | |
Gach et al. | C-BLUE 3 PC: a photon counting multimegapixel visible CMOS camera | |
CN117148051A (zh) | 基于硅光电倍增管阵列的放电弱光三维成像方法及装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19887726 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2021540360 Country of ref document: JP Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 2019887726 Country of ref document: EP Effective date: 20210319 |
|
ENP | Entry into the national phase |
Ref document number: 20217012261 Country of ref document: KR Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |