WO2021143066A1 - Target recognition method, device, and system, and computer readable storage medium - Google Patents

Target recognition method, device, and system, and computer readable storage medium Download PDF

Info

Publication number
WO2021143066A1
WO2021143066A1 PCT/CN2020/099443 CN2020099443W WO2021143066A1 WO 2021143066 A1 WO2021143066 A1 WO 2021143066A1 CN 2020099443 W CN2020099443 W CN 2020099443W WO 2021143066 A1 WO2021143066 A1 WO 2021143066A1
Authority
WO
WIPO (PCT)
Prior art keywords
pulse
layer
neurons
target
inference
Prior art date
Application number
PCT/CN2020/099443
Other languages
French (fr)
Chinese (zh)
Inventor
黄铁军
赵君伟
田永鸿
余肇飞
Original Assignee
北京大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京大学 filed Critical 北京大学
Publication of WO2021143066A1 publication Critical patent/WO2021143066A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/246Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/06Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
    • G06N3/061Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using biological neurons, e.g. biological neurons connected to an integrated circuit
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence

Definitions

  • the present invention relates to the technical field of target recognition. More specifically, the present invention is a target recognition method, device, system, and computer-readable storage medium.
  • the present invention innovatively provides a target recognition method, device, system and computer-readable storage medium.
  • the neural network performs noise filtering and pulse enhancement on the original input pulse, which can achieve accurate and efficient identification of the target to be identified, and can also significantly reduce the amount of calculation and power consumption.
  • the present invention discloses a target recognition method, and the method includes the following steps:
  • the area where the target to be identified is located is retrieved by screening each pulse in the pulse sequence, the pulse sampling window is determined according to the area where the target to be identified is located, and the pulses in the pulse sampling window are input to the pulse neural network, the pulse neural network Including excitation layer and reasoning layer;
  • the pulse When the first preset condition is met, the pulse is transmitted along the firing neurons of each layer other than the final layer in turn, and the pulse is transmitted to the inference layer along the firing neuron of the last layer when the second preset condition is met, the reasoning layer Contains multiple layers of inference neurons connected in sequence;
  • the pulse is transmitted in sequence along each layer of inference neurons except the last layer of inference neurons;
  • the recognition result is determined according to the activity of the last layer of inference neurons.
  • the acquired raw pulse data of the target to be recognized comes from a bionic vision sensor, and the bionic vision sensor is used to photograph the target to be recognized.
  • the first preset condition is that the change in membrane potential of each layer of stimulated neurons other than the final layer of stimulated neurons exceeds its own first excitation threshold.
  • the second preset condition is that the change in membrane potential of the final layer of stimulated neurons exceeds its own second threshold for excitation, and the proportion of the number of activated neurons in the final layer of stimulated neurons is greater than the first preset Proportion.
  • the third preset condition is that the change in membrane potential of each layer of inference neurons other than the last layer of inference neurons exceeds its own third excitation threshold.
  • the pulse sampling window further includes the step of recording the time interval when the target to be identified appears in the pulse sampling window twice.
  • the original pulse data includes multiple pulse sequences, and the recognition results are output only when the recognition results of all the pulse sequences after the above-mentioned processing are consistent.
  • the present invention also discloses a target recognition device, and the device includes a pulse acquisition module, a sampling window module, a pulse mapping module, a neuron excitation module, a neuron inference module, and a recognition result determination module;
  • the pulse acquisition module is used to acquire the original pulse data of the target to be identified that is photographed; the original pulse data includes at least one pulse sequence;
  • the sampling window module is used to retrieve the area where the target to be recognized is located by filtering each pulse in the pulse sequence, and is used to determine the pulse sampling window according to the area where the target to be recognized is located, and to sample the pulse in the window
  • the pulses of are input to the impulse neural network;
  • the impulse neural network includes an excitation layer and an inference layer;
  • the pulse mapping module is used to map the pulses input to the pulse neural network so that the number of the first layer of excitation neurons in the excitation layer corresponds to the size of the pulse sampling window; the excitation layer contains multiple layers of sequentially connected excitation nerves Yuan;
  • the neuron excitation module is used to enable the pulse to be transmitted along the firing neurons in each layer other than the terminal layer when the first preset condition is satisfied, and to enable the pulse to fire the nerve along the terminal layer when the second preset condition is satisfied Elements are transferred to the inference layer;
  • the inference layer includes multiple layers of inference neurons connected in sequence;
  • the neuron inference module is used to enable pulses to be sequentially transmitted along each layer of inference neurons other than the last layer of inference neurons when the third preset condition is satisfied;
  • the recognition result determination module is used to determine the recognition result according to the activity of the last layer of inference neurons.
  • the present invention also discloses a target recognition system, which includes a bionic vision sensor and the above-mentioned target recognition device.
  • the bionic vision sensor is used to photograph the target to be recognized to obtain the target to be recognized.
  • the raw pulse data is used to obtain the target to be recognized.
  • the present invention also discloses a computer-readable storage medium on which a computer program is stored, and the computer program is executed by a processor to implement any of the above-mentioned target identification methods.
  • the beneficial effects of the present invention are as follows: Compared with the prior art, the present invention can realize the accurate and fast recognition of the target to be recognized, and can be better applied to the target with high moving speed.
  • the present invention can take into account the accurate recognition at the same time.
  • the invention can also realize the measurement of the rotation speed of the high-speed rotating target, thereby providing a better solution for the rotation speed calibration of the rotating body.
  • the invention can record the dynamic change information of the high-speed moving target through the bionic vision sensor with high time resolution, and process the pulse sequence output by the bionic vision sensor through the pulse neural network, and can realize the noise filtering and pulse enhancement of the original input pulse to compare
  • the pulse input of the inference layer can be reduced by controlling the activation ratio of the pulse neuron in the excitation layer, thereby reducing the calculation amount and power consumption of the entire pulse processing algorithm;
  • the secondary measurement mechanism can significantly improve the recognition accuracy of the pulsed neural network; the present invention realizes the rotation speed measurement of the high-speed rotating target by analyzing the characteristics of the circular motion and adopts a fixed sampling window method, thereby providing a realization method for the rotation speed calibration of the rotating body.
  • FIG. 1 is a schematic flowchart of a target recognition method in some embodiments of the present invention.
  • Figure 2 is a schematic diagram of the composition of a target recognition system in some embodiments of the present invention.
  • FIG. 3 is a schematic diagram of the working principle of the pulse recognition process in some embodiments of the present invention taking a plurality of stimulated neurons with a local connection relationship as an example.
  • FIG. 4 is a schematic diagram of the working principle of the pulse recognition process in other embodiments of the present invention.
  • Figure 5 shows the effect of filtering and enhancing the pulse sequence of the region where the character "P" is located, which is the target to be recognized through the excitation layer.
  • Figure 6 shows the effect of noise filtering and pulse enhancement on the pulse sequence of the region where the target to be recognized—the character "C" is located through the excitation layer.
  • Fig. 7 shows the effect of noise filtering and pulse enhancement on the pulse sequence of the region where the target to be recognized—the character "L" is located through the excitation layer.
  • FIG. 1 is a schematic flowchart of a target recognition method in some embodiments of the present invention.
  • This embodiment provides a target recognition method.
  • the target recognition method may include the following steps.
  • Step 100 the target to be recognized can be photographed by the bionic vision sensor to obtain the original pulse data of the target to be recognized.
  • the bionic vision sensor used is an electronic device manufactured by imitating the vision sensing principle of the eyes of a living body. To provide information input for subsequent brain-like neural network algorithms, bionic vision sensors often have high time resolution and can record dynamic change information of high-speed moving targets.
  • the bionic vision sensors used in the present invention include but are not limited to the following three types : Differential sensor, integral sensor and event sensor; among them, the most representative differential sensor is the Dynamic Vision Sensor (DVS).
  • DVS Dynamic Vision Sensor
  • DVS only outputs the pixel address and information whose light intensity changes, so only Sensitive to moving objects, but not to static objects;
  • the integral sensor is currently mainly represented by the retinal sensor (Vidar) led by Professor Huang Tiejun of Peking University, which simulates the cell connection structure and integral distribution in the fovea area of the retina.
  • the principle of pulse is to output the pulses emitted by all pixels at each moment in the form of an array.
  • the present invention may also be a target recognition method based on a bionic vision sensor, and may also be a method for rapid target recognition. In the target recognition scheme, the present invention can make the bionic vision sensor play the role of "eyes".
  • Step 101 Obtain the original pulse data of the photographed target to be identified.
  • the original pulse data includes at least one pulse sequence.
  • one pulse sequence may include one frame of pulses or multiple frames of pulses, and multiple pulse sequences or multiple frames.
  • the pulse can be used for the subsequent multiple measurement mechanism; as a preferred embodiment, the acquired raw pulse data of the target to be identified comes from a bionic vision sensor, and the bionic vision sensor is used to photograph the target to be identified.
  • Step 102 retrieve the area where the target to be identified is located by filtering each pulse in the pulse sequence, and determine the pulse sampling window according to the area where the target to be identified is located, so that the area where the target is located is used as the range of the pulse sampling window.
  • the present invention only inputs the pulse data in the pulse sampling window into the excitation layer for processing in real time, and does not process the pulses outside the pulse sampling window. Therefore, the present invention can greatly reduce the calculation amount of the inference layer in the pulse neural network. Therefore, the target recognition speed can be significantly accelerated under the same hardware conditions.
  • the pulse sampling window after the pulse sampling window is determined, it further includes the step of recording the time interval between two adjacent occurrences of the target to be identified in the pulse sampling window.
  • the time interval here may be more than one time interval.
  • the present invention can use the above-mentioned fixed sampling window and recording time interval to complete the measurement of the speed of the high-speed rotating target, and then it is the speed of the rotating body. Calibration provides a new way to achieve.
  • the present invention uses a spiking neural network, which relates to the field of brain-like computing.
  • the spiking neural network (SNN) is called the third-generation artificial neural network, and its neuron model is closer to the characteristics of biological neurons.
  • the processing mechanism also borrows more from the brain, using pulses as the medium of information transmission, including time information and spatial information, so the pulse neural network is the most representative algorithm in the field of brain-like computing, and with deep learning
  • the connection mode and application scenarios have also changed.
  • the Spike Neural Network has developed from the early shallow complex connections to the deep simple connections.
  • the present invention can apply the spiking neural network to various pattern recognition problems, specifically it can be the recognition of high-speed rotating characters in the present invention. Compared with algorithms such as deep learning and machine learning, the spiking neural network has more biological likelihood.
  • Spike neuron is the basic component unit and information processing unit of spiking neural network, and it is a kind of biological neuron.
  • the neuron model in the pulsed neural network of the present invention may include an integral excitation model (Integrate and Fire, IF), a leakage current integral excitation model (Leaky Integrate and Fire, LIF), and pulse Response model (Spike Response Model, SRM), etc.
  • the pulses in the pulse sampling window are input to the pulse neural network.
  • the pulse neural network in the present invention may include an excitation layer and an inference layer.
  • Step 103 the input pulse is processed by the pulse neural network below.
  • this embodiment maps the pulse input to the pulse neural network so that the number of excited neurons in the first layer of the excitation layer corresponds to the size of the pulse sampling window.
  • the mapping method can be hash mapping, hexadecimal conversion, etc.
  • the excitation layer of this embodiment includes multiple layers of excitation neurons connected in sequence through synapses, and the connection between adjacent excitation neurons includes but is not limited to local connections. , Fully connected or dynamically randomly connected, the excitation layer of this embodiment is composed of (Leaky Integrate and Fire) LIF neurons, the first layer of excitation neurons is the input impulse neurons, and the other layers of excitation neurons are the excitation impulse neurons .
  • the excited neuron of the present invention receives a pulse, its membrane potential will change.
  • Step 104 When the first preset condition is satisfied, the pulse is transmitted along the excitation neurons of each layer except the last layer of excitation neurons.
  • the current excitation neuron is the next layer of excitation neuron or the inference layer emits the pulse, the membrane potential of itself Reset to a resting state.
  • the first preset condition is that the change in membrane potential of each layer other than the last layer of stimulated neurons exceeds its own first excitation threshold.
  • the first The excitation threshold is 1.0, so as to achieve noise filtering and pulse enhancement of the pulse input to the pulse neural network; and when the second preset condition is met, the pulse is transmitted to the inference layer along the final layer of the excitation neuron.
  • the preset conditions can be set into two types: 1. The change in membrane potential of the terminal stimulated neuron exceeds its own second excitation threshold; 2. The change in membrane potential of the terminal stimulated neuron exceeds its own second excitation. Threshold and the proportion of the number of activated neurons in the last layer of stimulated neurons (that is, the ratio of the number of stimulated neurons to the number of all neurons in the last layer) is greater than the first preset ratio, the first preset in this embodiment The ratio is 15%.
  • the second condition can adjust the pulse input of the inference layer by controlling the second preset ratio, and the less the pulse input of the inference layer, the amount of calculation required And the lower the power consumption, so the present invention also has the advantage of being able to reduce the calculation amount and power consumption of the pulse processing algorithm.
  • the inference layer includes multiple layers of inference neurons connected in sequence through synapses.
  • Figure 3 is a diagram of multiple excitations with local connections in some embodiments of the present invention.
  • FIG. 4 is a schematic diagram of the working principle of the pulse recognition process in some other embodiments of the present invention.
  • the excitation layer does not emit pulses to the inference layer, thereby reducing the calculation amount of the entire pulse recognition algorithm and greatly reducing the calculation energy consumption.
  • the reasoning layer in this embodiment is composed of (Integrate and Fire) IF neurons. Based on the disclosed content of the present invention, the number of layers of the reasoning layer and the number of neurons in each layer can be adjusted reasonably and wisely according to the actual situation.
  • the inference neurons of each layer are connected by synapses, and the synaptic connection methods include but are not limited to convolutional connection, partial connection or full connection. Synaptic strength can be obtained by using existing neural network training methods. Training methods include but It is not limited to gradient-based backpropagation learning algorithms, reinforcement learning algorithms, and pulse time-dependent synaptic plasticity (Spiking Time Dependent Plasticity, STDP) algorithms, etc., and will not be repeated in the present invention. After the inference neuron of the present invention receives the pulse, its membrane potential will change.
  • Step 105 When the third preset condition is satisfied, the pulse is transmitted along each layer of inference neurons other than the last layer of inference neurons.
  • the third preset condition is each layer of inference neurons other than the last layer of inference neurons.
  • the change in the membrane potential of the cell itself exceeds its own third excitation threshold, and the third preset value may be 1.0.
  • “sequential transmission” refers to the neuron set in the front transmitting the pulse to the neuron set in the back, as shown in FIG. 3 and FIG. 4.
  • Step 106 Determine the recognition result according to the activity of the last layer of inference neurons. For example, if the IF neurons in the last layer compare their respective activities to obtain the recognition results, that is, if the activities of the inference neurons in the last layer reach various set thresholds, then the recognition can be considered successful, otherwise, the recognition failed, or, In this embodiment, it is also possible to judge whether the recognition is successful based on whether the sum of the activity of each inference neuron in the final layer reaches a preset value.
  • a multiple measurement mechanism can be used, that is, the above steps 101 to 106 can be executed multiple times synchronously or asynchronously, preferably synchronously.
  • the target recognition method can simultaneously use time information and space information to complete rapid target recognition.
  • FIG. 2 is a schematic diagram of the composition of a target recognition system in some embodiments of the present invention.
  • the present invention provides a target recognition system.
  • the target recognition system includes a bionic vision sensor and the following target recognition device.
  • the target recognition device can be integrated on a pulse computing platform to implement a pulse processing algorithm, and the bionic vision sensor is used for shooting The target to be identified to obtain the original pulse data (that is, the recorded data) of the target to be identified, and then the recorded data can be output to the pulse calculation platform in real time.
  • the pulse calculation platform can receive the data output by the bionic vision sensor in real time ,
  • the original sensor data needs to be encoded and generally transmitted through the bus, so from the pulse event issuance to the algorithm processing platform to receive, this process produces a small time difference, which can be measured by the current time recording module, which is recorded as ⁇ t1.
  • the pulse computing platform used in the present invention includes but is not limited to the following types: (1) server, workstation, desktop host or mobile computer, (2) embedded computing platform, such as Multi-Processor System-on-Chip , MPSoC) development board, central processing unit (CPU) development board, graphics processing unit (GPU) development board, single-chip Microcomputer (SCM), etc., (3) field programmable gate Array (Field Programmable Gate Array, FPGA), or Application Specific Integrated Circuit (ASIC), etc., (4) Cloud computing platform, etc.
  • the pulse calculation platform is used to run pulse processing algorithms, which can include system control algorithms and pulse recognition algorithms.
  • some embodiments of the present invention provide a target recognition device, which includes a pulse acquisition module, a sampling window module, a pulse mapping module, a neuron excitation module, a neuron inference module, and a recognition result determination module.
  • the pulse acquisition module is used to acquire the original pulse data of the target to be identified; the original pulse data contains at least one pulse sequence, where one pulse sequence can include one frame of pulse or multiple frames of pulse, multiple pulse sequences or multiple frames
  • the pulse can be used for subsequent multiple measurement mechanisms.
  • the sampling window module is used to retrieve the area of the target to be identified by filtering each pulse in the pulse sequence, and is used to determine the pulse sampling window according to the area of the target to be identified, and to sample the pulses in the pulse sampling window.
  • Input to the impulse neural network the impulse neural network includes an excitation layer and an inference layer. From the algorithm processing platform receiving the pulse data, to the sampling window module filtering out the pulse data in the sampling window and inputting it into the excitation layer, this process produces a slight time difference, which can be measured by the current moment recording module and recorded as ⁇ t2.
  • the pulse mapping module is used to map the pulses input to the pulse neural network so that the number of excited neurons in the first layer of the excitation layer corresponds to the size of the pulse sampling window.
  • the pulse mapping module can be understood as the pulse recognition algorithm used in the present invention
  • the impulse event mapping layer of the impulse neural network can use hash mapping, binary conversion and other methods to map impulse events to the input impulse neurons of the excitation layer; the excitation layer contains multiple layers of excitation nerves connected in sequence through synapses Meta, the connection mode of the synapse can include partial connection, full connection or dynamic random connection, etc.
  • the following content takes the local connection mode as an example for specific description.
  • the number of input pulse neurons in the above excitation layer needs to be consistent with the size of the pulse sampling window.
  • the sampling window size is N*N
  • the input pulse in the sampling window is actually corresponding to the position of N*N bionic vision sensors.
  • the photoreceptor neuron is excited, and the input pulse neuron that reaches the excitation layer has a time delay of ( ⁇ t1+ ⁇ t2).
  • each input impulse neuron is locally connected to multiple excitation impulse neurons.
  • the local connection mode is: the input impulse neuron's own position as the center, the pulse in the surrounding neighborhood with R as the radius Neurons are connected (if the input neuron is located at the boundary of the sampling window, only the impulse neurons within the range of N*N are connected).
  • the neuron excitation module is used to enable the pulse to be transmitted along the firing neurons in each layer other than the terminal layer when the first preset condition is met, and to enable the pulse to be transmitted along the terminal layer to stimulate the neuron when the second preset condition is met
  • the inference layer contains multiple layers of inference neurons connected in sequence.
  • the first preset condition is that the change in membrane potential of each layer other than the final layer of stimulated neurons exceeds its own first excitation threshold
  • the second preset condition is that the membrane potential of the final layer of stimulated neurons itself The amount of change exceeds its second excitation threshold and the proportion of the number of activated neurons in the final layer of excitation neurons is greater than the first preset ratio.
  • the membrane potential of the excitation impulse neuron connected to it and located at the same position increases by ⁇ 1 (that is, the connected synapse strength is ⁇ 1), and the other excitation impulse neurons connected to it have an increase in membrane potential.
  • the membrane potential increases by ⁇ 2 (that is, the connected synaptic strength is ⁇ 2), where ⁇ 1> ⁇ 2, when there is no pulse input, the membrane potential of the stimulated impulse neuron connected to it and located at the same position attenuates ⁇ 3 (that is, the membrane of the LIF neuron
  • the potential attenuation factor is ⁇ 3); among them, the values of ⁇ 1, ⁇ 2, and ⁇ 3 can be preset as fixed values or can be dynamically adjusted.
  • the neuron inference module is used to make the pulses be transmitted sequentially along each layer of inference neurons except the last layer of inference neurons when the third preset condition is satisfied.
  • the third preset condition is that the change in membrane potential of each layer of inference neurons other than the last layer of inference neurons exceeds its own third excitation threshold.
  • the IF neurons in the inference layer do not have the characteristics of membrane potential attenuation. After the IF neurons in the inference layer receive a pulse input, their own membrane potential increases, and the value of the increase is equal to the total pulse input If the membrane potential of the IF neuron exceeds its own excitation threshold, a pulse will be emitted and transmitted to the next layer of IF neurons connected to it. At the same time, the membrane potential of the IF neuron will be reset, and the pulse will go from layer to layer. After spreading, until the last layer.
  • the recognition result determination module is used to determine the recognition result according to the activity of the last layer of inference neurons. For example, if the IF neurons in the last layer compare their respective activities to obtain the recognition results, that is, if the activities of the inference neurons in the last layer reach various set thresholds, then the recognition can be considered successful, otherwise, the recognition failed, or, In this embodiment, it is also possible to judge whether the recognition is successful based on whether the sum of the activity of each inference neuron in the final layer reaches a preset value.
  • the target identification device may further include a rotational speed measurement module and a current moment recording module.
  • the rotational speed measurement module is used to record that the target to be identified appears twice in the pulse sampling window after the pulse sampling window is determined.
  • the rotational speed measurement module fixes the sampling window at a position within the camera's field of view according to the rotation radius of the rotating target, and then accumulates the time interval during which the same target appears in the sampling window multiple times, and can reduce the error by statistical methods , Calculate the time required for the target to rotate one circle, and then realize the measurement of the target speed of high-speed rotation.
  • the current time recording module is used to accurately obtain the current time. The specific form of the time can be selected reasonably and wisely according to needs.
  • the present invention can make the delay of time accuracy as small as possible.
  • the invention adopts the pulse neural network to process the pulse sequence output by the bionic vision sensor, and the processing method simultaneously utilizes the time information and the space information, can realize the rapid identification of the target, and is still applicable under the condition of high movement speed.
  • a computer-readable storage medium may also be provided, and a computer program is stored on the computer-readable storage medium, and the computer program may be executed by a processor, so as to realize the The target recognition method, the present invention can make the above-mentioned computer program run on the impulse computing platform.
  • This embodiment uses the above-mentioned integral bionic vision sensor Vidar, the sensor has a time resolution of 25 microseconds, and a display resolution of 400*250 pixels.
  • the sensor is used to shoot at a fine noon on a high-speed rotating industrial fan.
  • the center distance between the fan and the sensor is about 60cm, and the fan radius is about 20cm.
  • the blade rotation speed is about 2500R/min
  • the captured pulse data is packaged on the data acquisition card and then transmitted to the desktop computer through the USB3.0 bus for processing, and the desktop computer receives and obtains Vidar in real time
  • the output data, the sampling window setting module filters the pulse data, and the pulse event mapping layer maps the filtered pulse data to the input pulse neurons of the excitation layer one by one.
  • the position of the sampling window is set according to the position of the English character in the Vidar field of view
  • the size of the sampling window depends on the pixel size of the English characters in Vidar's field of view. In this embodiment, the coordinates (y, x) of the upper left corner of the sampling window are (50, 180), and the size of the sampling window is 40*40 pixels.
  • the excitation layer contains 40*40 input pulse neurons.
  • the pulse processing algorithm is effective for bionic vision.
  • the pulse data input by the sensor is processed.
  • the number of pulse input neurons in the excitation layer is the same as the sampling window size, which is 40*40.
  • the pulse event mapping module realizes the pulse input in the sampling window and the pulse input neuron One-to-one mapping.
  • Each impulse input neuron is connected to 9 excitation impulse neurons by synapses.
  • the synapses are connected as follows: excitation in the surrounding eight neighborhoods centered on the input impulse neuron's own position
  • the impulse neurons are connected (if the input impulse neuron is located at the boundary of the sampling window, only the impulse neurons within the range of 40*40 are connected).
  • the picture on the left is the pulse input of the bionic vision sensor, and the picture on the right is the effect of filtering and enhancement using the excitation layer of the present invention.
  • the white part represents the position of the pulse input and the black part It represents the position where there is no pulse input
  • the box area represents the pulse sampling window; among them,
  • Figure 5 is the effect of filtering and enhancing the pulse sequence of the region where the character "P" is located through the excitation layer.
  • Figure 6 shows the effect of the pulse sequence in the region where the character "P" is located through the excitation layer.
  • the effect of noise filtering and pulse enhancement on the pulse sequence of the area where the target to be recognized—the character "C" is performed by the layer.
  • Figure 7 shows the pulse sequence of the area where the target to be recognized—the character "L” is passed through the excitation layer for noise filtering and pulse
  • the enhanced effect is that only when the number of activated excitation pulse neurons exceeds a certain proportion (15%), the excitation pulses are tiled into one dimension (1*1600) and then uniformly input to the inference layer.
  • the input in the sampling window is background noise; as shown in Figure 4, the inference layer is composed of 3 layers of IF neurons, and the number of IF neurons in each layer is 512, 1024, and 3.
  • the IF neurons between each layer realize synaptic connection through a fully connected way.
  • the IF neuron After the IF neuron receives the pulse input, its own membrane potential increases, and the increased value is equal to the sum of the synaptic strength values of all the pulse inputs. If the membrane potential of an IF neuron exceeds its own excitation threshold (1.0), a pulse is sent and transmitted to the next layer of IF neurons connected to it, and its membrane potential is reset to a resting state (0). The pulse propagates back layer by layer until it reaches the last layer. The IF neuron of the last layer obtains the recognition result by comparing the activity of each neuron.
  • the multiple measurement mechanism of the present invention for example, only when the recognition results outputted by the inference layer for 5 consecutive times are consistent, the final recognition result can be obtained.
  • This method can significantly improve the recognition accuracy and recognition speed, and realize the measurement of the rotation speed of high-speed rotating characters.
  • the invention significantly improves the recognition speed through the optimization method of multi-threaded parallel computing, splits the computing task to multiple processors of the desktop computer for simultaneous calculation, and records the time interval when the same character appears in the sampling window twice in succession. Calculate the time required for the character to rotate one circle, and then realize the measurement of the speed of the high-speed rotating fan.
  • this embodiment means to combine the embodiments
  • the specific features, structures, materials or characteristics described by the examples are included in at least one embodiment or example of the present invention.
  • the schematic representations of the above terms do not necessarily refer to the same embodiment or example.
  • the described specific features, structures, materials or characteristics can be combined in any one or more embodiments or examples in a suitable manner.
  • those skilled in the art can combine and combine the different embodiments or examples and the features of the different embodiments or examples described in this specification without contradicting each other.
  • first and second are only used for descriptive purposes, and cannot be understood as indicating or implying relative importance or implicitly indicating the number of indicated technical features. Therefore, the features defined with “first” and “second” may explicitly or implicitly include at least one of the features. In the description of the present invention, “plurality” means at least two, such as two, three, etc., unless otherwise specifically defined.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • General Physics & Mathematics (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Neurology (AREA)
  • Microelectronics & Electronic Packaging (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Image Analysis (AREA)

Abstract

Disclosed in the present invention are a target recognition method, device, and system, and a computer readable storage medium. The method comprises the following steps: obtaining original pulse data; determining a pulse sampling window; inputting pulse in the pulse sampling window into a pulse neural network; mapping the pulse that is inputted into the pulse neural network; in the case that the conditions are satisfied, sequentially transmitting the pulse along each layer of excitation neurons excluding a last layer of excitation neurons, transmitting the pulse to an inference layer along the last layer of excitation neurons, and sequentially transmitting the pulse along each layer of inference neurons excluding the last layer of inference neurons; and determining a recognition result. The device comprises a pulse obtaining module, a sampling window module, a pulse mapping module, a neuron excitation module, a neuron inferencing module, and a recognition result determining module. The system comprises the foregoing device. The present invention can achieve the accurate and quick recognition of a target to be recognized, can be well suitable for a target having a high movement speed, and can consider recognition accuracy and the amount of calculation.

Description

一种目标识别方法、装置、系统及计算机可读存储介质Target recognition method, device, system and computer readable storage medium 技术领域Technical field
本发明涉及目标识别技术领域,更为具体来说,本发明为一种目标识别方法、装置、系统及计算机可读存储介质。The present invention relates to the technical field of target recognition. More specifically, the present invention is a target recognition method, device, system, and computer-readable storage medium.
背景技术Background technique
近年来,人工智能领域发展迅猛,在算法、硬件、芯片等各方面均有突破,特别是以计算机视觉为代表的图像领域,更是将诸多研究成果广泛商用化、民用化,极大的方便了人们的日常生活。但是,当前的人工智能算法还是以第二代人工神经网络为主要核心,该类算法早在上世纪八十年即被提出,随着多年来的不断研究,目前在学术界已经陷入研究瓶颈。所以人们把重点转移到了第三代人工神经网络(即脉冲神经网络),但由于现有技术存在的局限,对于计算机视觉应用,特别是目标识别方面,往往存在识别准确率较低、识别速度慢、计算量较大、对硬件要求过高及功耗过大等问题。In recent years, the field of artificial intelligence has developed rapidly, with breakthroughs in algorithms, hardware, chips, etc., especially in the field of imagery represented by computer vision, and it has widely commercialized and civilianized many research results, which is extremely convenient. To improve people’s daily life. However, the current artificial intelligence algorithms still take the second-generation artificial neural network as the main core. This type of algorithm was proposed as early as 80 years in the last century. With years of continuous research, the current academic community has fallen into a research bottleneck. Therefore, people have shifted the focus to the third-generation artificial neural network (ie, spiking neural network). However, due to the limitations of the existing technology, for computer vision applications, especially target recognition, there are often low recognition accuracy and slow recognition speed. , The amount of calculation is large, the hardware requirements are too high, and the power consumption is too large.
因此,如何有效提高目标识别准确率和速度、降低目标识别对计算量和功耗的要求,成为了本领域技术人员亟待解决的技术问题和始终研究的重点。Therefore, how to effectively improve the accuracy and speed of target recognition, and reduce the requirements for calculation and power consumption of target recognition, has become a technical problem to be solved urgently and a focus of research by those skilled in the art.
发明内容Summary of the invention
为解决现有目标识别技术广泛存在的识别效果较差、无法对高速运动的目标进行有效识别等问题,本发明创新提供了一种目标识别方法、装置、系统及计算机可读存储介质,通过脉冲神经网络对原始输入脉冲进行噪声滤波和脉冲增强,能够实现对待识别目标的准确识别和高效识别,而且还能够显著降低计算量和功耗。In order to solve the problems of poor recognition effect and the inability to effectively recognize high-speed moving targets in the existing target recognition technology, the present invention innovatively provides a target recognition method, device, system and computer-readable storage medium. The neural network performs noise filtering and pulse enhancement on the original input pulse, which can achieve accurate and efficient identification of the target to be identified, and can also significantly reduce the amount of calculation and power consumption.
为实现上述技术目的,本发明公开了一种目标识别方法,且所述方法包括如下步骤;In order to achieve the above technical purpose, the present invention discloses a target recognition method, and the method includes the following steps:
获取被拍摄的待识别目标的原始脉冲数据,所述原始脉冲数据中包含 至少一个脉冲序列;Acquiring raw pulse data of the photographed target to be identified, where the raw pulse data includes at least one pulse sequence;
通过对脉冲序列中的各脉冲进行筛选的方式检索待识别目标所在的区域,根据待识别目标所在的区域确定脉冲采样窗口,将脉冲采样窗口内的脉冲输入至脉冲神经网络,所述脉冲神经网络包括激发层和推理层;The area where the target to be identified is located is retrieved by screening each pulse in the pulse sequence, the pulse sampling window is determined according to the area where the target to be identified is located, and the pulses in the pulse sampling window are input to the pulse neural network, the pulse neural network Including excitation layer and reasoning layer;
对输入至脉冲神经网络的脉冲进行映射,以使激发层的首层激发神经元数目与脉冲采样窗口尺寸对应,所述激发层中包含多层依次连接的激发神经元;Mapping the pulses input to the spiking neural network so that the number of firing neurons in the first layer of the firing layer corresponds to the size of the pulse sampling window, and the firing layer includes multiple layers of firing neurons connected in sequence;
在满足第一预设条件下令脉冲沿末层激发神经元以外的各层激发神经元依次传递,并在满足第二预设条件下令脉冲沿末层激发神经元传递至推理层,所述推理层包含多层依次连接的推理神经元;When the first preset condition is met, the pulse is transmitted along the firing neurons of each layer other than the final layer in turn, and the pulse is transmitted to the inference layer along the firing neuron of the last layer when the second preset condition is met, the reasoning layer Contains multiple layers of inference neurons connected in sequence;
在满足第三预设条件下令脉冲沿末层推理神经元以外的各层推理神经元依次传递;When the third preset condition is satisfied, the pulse is transmitted in sequence along each layer of inference neurons except the last layer of inference neurons;
根据末层推理神经元的活跃度确定识别结果。The recognition result is determined according to the activity of the last layer of inference neurons.
进一步地,获取的待识别目标的原始脉冲数据来源于仿生视觉传感器,利用仿生视觉传感器拍摄待识别目标。Further, the acquired raw pulse data of the target to be recognized comes from a bionic vision sensor, and the bionic vision sensor is used to photograph the target to be recognized.
进一步地,所述第一预设条件为末层激发神经元以外的各层激发神经元自身的膜电位变化量超过自身的第一激发阈值。Further, the first preset condition is that the change in membrane potential of each layer of stimulated neurons other than the final layer of stimulated neurons exceeds its own first excitation threshold.
进一步地,所述第二预设条件为末层激发神经元自身的膜电位变化量超过自身的第二激发阈值且末层激发神经元中被激活的神经元数目的占比大于第一预设比例。Further, the second preset condition is that the change in membrane potential of the final layer of stimulated neurons exceeds its own second threshold for excitation, and the proportion of the number of activated neurons in the final layer of stimulated neurons is greater than the first preset Proportion.
进一步地,所述第三预设条件为末层推理神经元以外的各层推理神经元自身的膜电位变化量超过自身的第三激发阈值。Further, the third preset condition is that the change in membrane potential of each layer of inference neurons other than the last layer of inference neurons exceeds its own third excitation threshold.
进一步地,在确定脉冲采样窗口后,还包括记录待识别目标相邻两次出现在脉冲采样窗口的时间间隔的步骤。Further, after the pulse sampling window is determined, it further includes the step of recording the time interval when the target to be identified appears in the pulse sampling window twice.
进一步地,所述原始脉冲数据中包含多个脉冲序列,并对所有脉冲序列同时进行上述处理后的识别结果均一致时才对识别结果进行输出。Further, the original pulse data includes multiple pulse sequences, and the recognition results are output only when the recognition results of all the pulse sequences after the above-mentioned processing are consistent.
为实现上述的技术目的,本发明还公开了一种目标识别装置,且所述 装置包括脉冲获取模块、采样窗口模块、脉冲映射模块、神经元激发模块、神经元推理模块及识别结果确定模块;In order to achieve the above technical purpose, the present invention also discloses a target recognition device, and the device includes a pulse acquisition module, a sampling window module, a pulse mapping module, a neuron excitation module, a neuron inference module, and a recognition result determination module;
所述脉冲获取模块,用于获取被拍摄的待识别目标的原始脉冲数据;所述原始脉冲数据中包含至少一个脉冲序列;The pulse acquisition module is used to acquire the original pulse data of the target to be identified that is photographed; the original pulse data includes at least one pulse sequence;
所述采样窗口模块,用于通过对脉冲序列中的各脉冲进行筛选的方式检索待识别目标所在的区域,并用于根据待识别目标所在的区域确定脉冲采样窗口,以及用于将脉冲采样窗口内的脉冲输入至脉冲神经网络;所述脉冲神经网络包括激发层和推理层;The sampling window module is used to retrieve the area where the target to be recognized is located by filtering each pulse in the pulse sequence, and is used to determine the pulse sampling window according to the area where the target to be recognized is located, and to sample the pulse in the window The pulses of are input to the impulse neural network; the impulse neural network includes an excitation layer and an inference layer;
所述脉冲映射模块,用于对输入至脉冲神经网络的脉冲进行映射,以使激发层的首层激发神经元数目与脉冲采样窗口尺寸对应;所述激发层中包含多层依次连接的激发神经元;The pulse mapping module is used to map the pulses input to the pulse neural network so that the number of the first layer of excitation neurons in the excitation layer corresponds to the size of the pulse sampling window; the excitation layer contains multiple layers of sequentially connected excitation nerves Yuan;
所述神经元激发模块,用于在满足第一预设条件下令脉冲沿末层激发神经元以外的各层激发神经元依次传递,并用于在满足第二预设条件下令脉冲沿末层激发神经元传递至推理层;所述推理层包含多层依次连接的推理神经元;The neuron excitation module is used to enable the pulse to be transmitted along the firing neurons in each layer other than the terminal layer when the first preset condition is satisfied, and to enable the pulse to fire the nerve along the terminal layer when the second preset condition is satisfied Elements are transferred to the inference layer; the inference layer includes multiple layers of inference neurons connected in sequence;
所述神经元推理模块,用于在满足第三预设条件下令脉冲沿末层推理神经元以外的各层推理神经元依次传递;The neuron inference module is used to enable pulses to be sequentially transmitted along each layer of inference neurons other than the last layer of inference neurons when the third preset condition is satisfied;
所述识别结果确定模块,用于根据末层推理神经元的活跃度确定识别结果。The recognition result determination module is used to determine the recognition result according to the activity of the last layer of inference neurons.
为实现上述的技术目的,本发明还公开了一种目标识别系统,该目标识别系统包括仿生视觉传感器和上述的目标识别装置,所述仿生视觉传感器用于拍摄待识别目标,以得到待识别目标的原始脉冲数据。In order to achieve the above-mentioned technical purpose, the present invention also discloses a target recognition system, which includes a bionic vision sensor and the above-mentioned target recognition device. The bionic vision sensor is used to photograph the target to be recognized to obtain the target to be recognized. The raw pulse data.
为实现上述的技术目的,本发明还公开了一种计算机可读存储介质,其上存储有计算机程序,该计算机程序被处理器执行,以实现如上所述的任一种目标识别方法。In order to achieve the above technical objectives, the present invention also discloses a computer-readable storage medium on which a computer program is stored, and the computer program is executed by a processor to implement any of the above-mentioned target identification methods.
本发明的有益效果为:与现有技术相比,本发明能够实现对待识别的目标的准确识别和快速识别,而且能够较好地适用于运动速度较高的目标, 本发明能够同时兼顾准确识别与计算量、功耗的问题。本发明还能够实现对高速旋转目标的转速测量,进而为旋转体转速标定提供一种较佳的解决方案。The beneficial effects of the present invention are as follows: Compared with the prior art, the present invention can realize the accurate and fast recognition of the target to be recognized, and can be better applied to the target with high moving speed. The present invention can take into account the accurate recognition at the same time. The problem with the amount of calculation and power consumption. The invention can also realize the measurement of the rotation speed of the high-speed rotating target, thereby providing a better solution for the rotation speed calibration of the rotating body.
本发明能够通过具有高时间分辨率的仿生视觉传感器记录高速运动目标的动态变化信息,通过脉冲神经网络处理仿生视觉传感器输出的脉冲序列,能够实现对原始输入脉冲进行噪声滤波和脉冲增强,以较好地实现对目标的准确识别和快速识别,具体可通过控制激发层脉冲神经元的激活比例值减少推理层的脉冲输入量,进而降低整个脉冲处理算法的计算量和功耗;本发明通过多次测量机制可显著提升脉冲神经网络的识别准确率;本发明通过分析圆周运动的特点、采用固定采样窗口的方法实现高速旋转目标的转速测量,进而为旋转体转速标定提供了一种实现方法。The invention can record the dynamic change information of the high-speed moving target through the bionic vision sensor with high time resolution, and process the pulse sequence output by the bionic vision sensor through the pulse neural network, and can realize the noise filtering and pulse enhancement of the original input pulse to compare To achieve accurate and rapid identification of the target, specifically, the pulse input of the inference layer can be reduced by controlling the activation ratio of the pulse neuron in the excitation layer, thereby reducing the calculation amount and power consumption of the entire pulse processing algorithm; The secondary measurement mechanism can significantly improve the recognition accuracy of the pulsed neural network; the present invention realizes the rotation speed measurement of the high-speed rotating target by analyzing the characteristics of the circular motion and adopts a fixed sampling window method, thereby providing a realization method for the rotation speed calibration of the rotating body.
附图说明Description of the drawings
图1为本发明一些实施例中的目标识别方法的流程示意图。FIG. 1 is a schematic flowchart of a target recognition method in some embodiments of the present invention.
图2为本发明一些实施例中的目标识别系统的组成示意图。Figure 2 is a schematic diagram of the composition of a target recognition system in some embodiments of the present invention.
图3为本发明一些实施例中以具有局部连接关系的多个激发神经元为例的脉冲识别过程的工作原理示意图。FIG. 3 is a schematic diagram of the working principle of the pulse recognition process in some embodiments of the present invention taking a plurality of stimulated neurons with a local connection relationship as an example.
图4为本发明另一些实施例中脉冲识别过程的工作原理示意图。FIG. 4 is a schematic diagram of the working principle of the pulse recognition process in other embodiments of the present invention.
图5为通过激发层对待识别目标—字符“P”所在的区域的脉冲序列进行滤波和增强后的效果。Figure 5 shows the effect of filtering and enhancing the pulse sequence of the region where the character "P" is located, which is the target to be recognized through the excitation layer.
图6为通过激发层对待识别目标—字符“C”所在的区域的脉冲序列进行噪声滤波和脉冲增强后的效果。Figure 6 shows the effect of noise filtering and pulse enhancement on the pulse sequence of the region where the target to be recognized—the character "C" is located through the excitation layer.
图7为通过激发层对待识别目标—字符“L”所在的区域的脉冲序列进行噪声滤波和脉冲增强后的效果。Fig. 7 shows the effect of noise filtering and pulse enhancement on the pulse sequence of the region where the target to be recognized—the character "L" is located through the excitation layer.
具体实施方式Detailed ways
下面结合说明书附图对本发明提供的一种目标识别方法、装置、系统及计算机可读存储介质进行详细的解释和说明。The following is a detailed explanation and description of a target recognition method, device, system, and computer-readable storage medium provided by the present invention with reference to the accompanying drawings of the specification.
请参阅图1,图1为本发明一些实施例中的目标识别方法的流程示意图。本实施例提供了一种目标识别方法,具体地,该目标识别方法可包括 如下步骤。Please refer to FIG. 1. FIG. 1 is a schematic flowchart of a target recognition method in some embodiments of the present invention. This embodiment provides a target recognition method. Specifically, the target recognition method may include the following steps.
步骤100,本实施例可以通过仿生视觉传感器拍摄待识别目标,从而得到待识别目标的原始脉冲数据,其中使用的仿生视觉传感器是模仿生物体眼睛的视觉传感原理而制造出的电子器件,可以为后续的类脑神经网络算法提供信息输入,仿生视觉传感器往往具有很高的时间分辨率、可记录高速运动目标的动态变化信息,本发明使用的仿生视觉传感器包括但不限于以下的三种类型:差分型传感器、积分型传感器及事件型传感器;其中,差分型传感器最具代表性的是动态视觉传感器(Dynamic Vision Sensor,DVS),DVS仅输出光强发生变化的像素地址和信息,因此只对运动物体敏感,但是对静态物体不敏感;积分型传感器目前主要以北京大学黄铁军教授牵头研制的仿视网膜传感器(Vidar)为代表,该传感器模拟了视网膜中央凹区域的细胞连接结构和积分发放脉冲的原理,将所有像素在每个时刻发放的脉冲以阵列的形式输出,具有全时、异步、高速等特点,对动态物体和静态物体均有灵敏的感知能力;事件型传感器将DVS与传统相机相结合,在输出脉冲数据流的同时,也输出帧图像,可以为ATIS(Asynchronous Time-based Image Sensor)、DAVIS(Dynamic and Active-pixel Vision Sensor)或CeleX等。所以本发明还可以是一种基于仿生视觉传感器的目标识别方法,而且可以是一种对目标快速识别的方法。在目标识别方案中,本发明可以令仿生视觉传感器担当“眼睛”的角色。Step 100: In this embodiment, the target to be recognized can be photographed by the bionic vision sensor to obtain the original pulse data of the target to be recognized. The bionic vision sensor used is an electronic device manufactured by imitating the vision sensing principle of the eyes of a living body. To provide information input for subsequent brain-like neural network algorithms, bionic vision sensors often have high time resolution and can record dynamic change information of high-speed moving targets. The bionic vision sensors used in the present invention include but are not limited to the following three types : Differential sensor, integral sensor and event sensor; among them, the most representative differential sensor is the Dynamic Vision Sensor (DVS). DVS only outputs the pixel address and information whose light intensity changes, so only Sensitive to moving objects, but not to static objects; the integral sensor is currently mainly represented by the retinal sensor (Vidar) led by Professor Huang Tiejun of Peking University, which simulates the cell connection structure and integral distribution in the fovea area of the retina. The principle of pulse is to output the pulses emitted by all pixels at each moment in the form of an array. It has the characteristics of full-time, asynchronous, high-speed, etc., and has sensitive perception of dynamic and static objects; event-based sensors combine DVS with traditional The camera is combined to output a pulsed data stream while also outputting frame images, which can be ATIS (Asynchronous Time-based Image Sensor), DAVIS (Dynamic and Active-pixel Vision Sensor) or CeleX. Therefore, the present invention may also be a target recognition method based on a bionic vision sensor, and may also be a method for rapid target recognition. In the target recognition scheme, the present invention can make the bionic vision sensor play the role of "eyes".
步骤101,获取被拍摄的待识别目标的原始脉冲数据,原始脉冲数据中包含至少一个脉冲序列,本实施例中,一个脉冲序列可包含一帧脉冲或多帧脉冲,多个脉冲序列或多帧脉冲能够用于后续的多次测量机制;作为较佳的实施例,获取的待识别目标的原始脉冲数据来源于仿生视觉传感器,利用仿生视觉传感器拍摄待识别目标。Step 101: Obtain the original pulse data of the photographed target to be identified. The original pulse data includes at least one pulse sequence. In this embodiment, one pulse sequence may include one frame of pulses or multiple frames of pulses, and multiple pulse sequences or multiple frames. The pulse can be used for the subsequent multiple measurement mechanism; as a preferred embodiment, the acquired raw pulse data of the target to be identified comes from a bionic vision sensor, and the bionic vision sensor is used to photograph the target to be identified.
步骤102,通过对脉冲序列中的各脉冲进行筛选的方式检索待识别目标所在的区域,根据待识别目标所在的区域确定脉冲采样窗口,以将目标所在区域作为脉冲采样窗口的范围,相比现有技术,本发明只将脉冲采样 窗口内的脉冲数据实时输入激发层进行处理,而对脉冲采样窗口范围外的脉冲不做处理,所以本发明能够极大减少脉冲神经网络中推理层计算量,从而在相同硬件条件下能够明显加快目标识别速度。在一些较佳的实施例中,在确定脉冲采样窗口后,还包括记录待识别目标相邻两次出现在脉冲采样窗口的时间间隔的步骤,应当理解的是,此处的时间间隔可以是多个时间间隔的均值或者是随机相邻两次的时间间隔,对于高速旋转目标,本发明能够利用上述固定采样窗口且记录时间间隔的方式完成对高速旋转目标的转速测量,进而为旋转体转速的标定提供了一种全新实现方法。本发明使用了脉冲神经网络,涉及类脑计算领域,脉冲神经网络(Spiking Neural Network,SNN)被称为第三代人工神经网络,其神经元模型更接近生物神经元特性,在连接方式和信息处理机制上也更多地借鉴了大脑,以脉冲作为信息传递的媒介,将时间信息和空间信息均包含其中,所以脉冲神经网络是类脑计算领域最具代表性的算法,而且随着深度学习的发展,脉冲神经网络在连接方式和应用场景上也发生了变化,比如,在连接方式上,脉冲神经网络由早期的浅层次复杂连接发展出深层次简单连接,比如,在应用场景上,本发明可以将脉冲神经网络应用到各类模式识别问题上,具体可以为本发明中的高速旋转字符的识别,相比于深度学习和机器学习等算法,脉冲神经网络更具生物似然性,而且依托其对脉冲的异步响应和处理机制,使得其计算能耗更低,处理速度更快,脉冲神经元(Spiking Neuron)是脉冲神经网络的基本组成单元和信息处理单元,是对生物神经元电化学特性简化后的一种数学公式表达,本发明脉冲神经网络中的神经元模型可包括积分激发模型(Integrate and Fire,IF)、漏电流积分激发模型(Leaky Integrate and Fire,LIF)和脉冲响应模型(Spike Response Model,SRM)等。在本步骤中,将脉冲采样窗口内的脉冲输入至脉冲神经网络,本发明中的脉冲神经网络可以包括激发层和推理层。Step 102: Retrieve the area where the target to be identified is located by filtering each pulse in the pulse sequence, and determine the pulse sampling window according to the area where the target to be identified is located, so that the area where the target is located is used as the range of the pulse sampling window. With technology, the present invention only inputs the pulse data in the pulse sampling window into the excitation layer for processing in real time, and does not process the pulses outside the pulse sampling window. Therefore, the present invention can greatly reduce the calculation amount of the inference layer in the pulse neural network. Therefore, the target recognition speed can be significantly accelerated under the same hardware conditions. In some preferred embodiments, after the pulse sampling window is determined, it further includes the step of recording the time interval between two adjacent occurrences of the target to be identified in the pulse sampling window. It should be understood that the time interval here may be more than one time interval. The average value of each time interval or the time interval between two adjacent times at random. For high-speed rotating targets, the present invention can use the above-mentioned fixed sampling window and recording time interval to complete the measurement of the speed of the high-speed rotating target, and then it is the speed of the rotating body. Calibration provides a new way to achieve. The present invention uses a spiking neural network, which relates to the field of brain-like computing. The spiking neural network (SNN) is called the third-generation artificial neural network, and its neuron model is closer to the characteristics of biological neurons. The processing mechanism also borrows more from the brain, using pulses as the medium of information transmission, including time information and spatial information, so the pulse neural network is the most representative algorithm in the field of brain-like computing, and with deep learning With the development of the Spike Neural Network, the connection mode and application scenarios have also changed. For example, in the connection mode, the Spike Neural Network has developed from the early shallow complex connections to the deep simple connections. For example, in the application scenarios, The present invention can apply the spiking neural network to various pattern recognition problems, specifically it can be the recognition of high-speed rotating characters in the present invention. Compared with algorithms such as deep learning and machine learning, the spiking neural network has more biological likelihood. Moreover, relying on its asynchronous response to impulse and processing mechanism, it makes its calculation energy consumption lower and processing speed faster. Spike neuron is the basic component unit and information processing unit of spiking neural network, and it is a kind of biological neuron. A simplified mathematical expression of electrochemical characteristics. The neuron model in the pulsed neural network of the present invention may include an integral excitation model (Integrate and Fire, IF), a leakage current integral excitation model (Leaky Integrate and Fire, LIF), and pulse Response model (Spike Response Model, SRM), etc. In this step, the pulses in the pulse sampling window are input to the pulse neural network. The pulse neural network in the present invention may include an excitation layer and an inference layer.
步骤103,下面通过脉冲神经网络对输入的脉冲进行处理,具体地,本实施例对输入至脉冲神经网络的脉冲进行映射,以使激发层的首层激发 神经元数目与脉冲采样窗口尺寸对应,映射方式可以采用哈希映射、进制转换等方式,本实施例的激发层中包含多层通过突触依次连接的激发神经元,相邻激发神经元之间的连接方式包括但不限于局部连接、全连接或者动态随机连接,本实施例的激发层由(Leaky Integrate and Fire)LIF神经元组成,激发层的首层激发神经元为输入脉冲神经元、其它层激发神经元为激发脉冲神经元。在具体的工作中,本发明的激发神经元在接收到脉冲以后,自身膜电位会发生变化。Step 103, the input pulse is processed by the pulse neural network below. Specifically, this embodiment maps the pulse input to the pulse neural network so that the number of excited neurons in the first layer of the excitation layer corresponds to the size of the pulse sampling window. The mapping method can be hash mapping, hexadecimal conversion, etc. The excitation layer of this embodiment includes multiple layers of excitation neurons connected in sequence through synapses, and the connection between adjacent excitation neurons includes but is not limited to local connections. , Fully connected or dynamically randomly connected, the excitation layer of this embodiment is composed of (Leaky Integrate and Fire) LIF neurons, the first layer of excitation neurons is the input impulse neurons, and the other layers of excitation neurons are the excitation impulse neurons . In specific work, after the excited neuron of the present invention receives a pulse, its membrane potential will change.
步骤104,在满足第一预设条件下令脉冲沿末层激发神经元以外的各层激发神经元依次传递,当前激发神经元向下一层激发神经元或者推理层发放脉冲后、自身的膜电位复位到静息状态,本实施例中,第一预设条件为末层激发神经元以外的各层激发神经元自身的膜电位变化量超过自身的第一激发阈值,本实施例中的第一激发阈值为1.0,从而实现对输入至脉冲神经网络中的脉冲进行噪声滤波和脉冲增强;并在满足第二预设条件下令脉冲沿末层激发神经元传递至推理层,本实施例中,第二预设条件可以设置成两种:一、末层激发神经元自身的膜电位变化量超过自身的第二激发阈值,二、末层激发神经元自身的膜电位变化量超过自身的第二激发阈值且末层激发神经元中被激活的神经元数目的占比(即被激发的神经元数目与末层所有神经元数目比值)大于第一预设比例,本实施例中的第一预设比例为15%,与第一种条件相比,第二种条件能够通过控制第二预设比例大小的方式调节推理层的脉冲输入量,而推理层的脉冲输入量越少,需要的计算量和功耗便越低,所以本发明还具有能够降低脉冲处理算法的计算量和功耗的优点。推理层包含多层通过突触依次连接的推理神经元,在本发明优选的一些实施例中,请参阅图3、4,图3为本发明一些实施例中以具有局部连接关系的多个激发神经元为例的脉冲识别过程的工作原理示意图,图4为本发明另一些实施例中脉冲识别过程的工作原理示意图。仅当激发层中激活的脉冲神经元数目超过一定比例值时,才将发放的脉冲平铺成一维后统一输入到推理层,当采样窗口中没有目标时,如采样 窗口中输入的是背景噪声,通过调节上述的比例值,能够实现当只有背景噪声输入时,激发层不向推理层发放脉冲,进而降低整个脉冲识别算法的计算量,极大地降低计算能耗。Step 104: When the first preset condition is satisfied, the pulse is transmitted along the excitation neurons of each layer except the last layer of excitation neurons. The current excitation neuron is the next layer of excitation neuron or the inference layer emits the pulse, the membrane potential of itself Reset to a resting state. In this embodiment, the first preset condition is that the change in membrane potential of each layer other than the last layer of stimulated neurons exceeds its own first excitation threshold. In this embodiment, the first The excitation threshold is 1.0, so as to achieve noise filtering and pulse enhancement of the pulse input to the pulse neural network; and when the second preset condition is met, the pulse is transmitted to the inference layer along the final layer of the excitation neuron. In this embodiment, the first Second, the preset conditions can be set into two types: 1. The change in membrane potential of the terminal stimulated neuron exceeds its own second excitation threshold; 2. The change in membrane potential of the terminal stimulated neuron exceeds its own second excitation. Threshold and the proportion of the number of activated neurons in the last layer of stimulated neurons (that is, the ratio of the number of stimulated neurons to the number of all neurons in the last layer) is greater than the first preset ratio, the first preset in this embodiment The ratio is 15%. Compared with the first condition, the second condition can adjust the pulse input of the inference layer by controlling the second preset ratio, and the less the pulse input of the inference layer, the amount of calculation required And the lower the power consumption, so the present invention also has the advantage of being able to reduce the calculation amount and power consumption of the pulse processing algorithm. The inference layer includes multiple layers of inference neurons connected in sequence through synapses. In some preferred embodiments of the present invention, please refer to Figures 3 and 4. Figure 3 is a diagram of multiple excitations with local connections in some embodiments of the present invention. A schematic diagram of the working principle of the pulse recognition process using a neuron as an example. FIG. 4 is a schematic diagram of the working principle of the pulse recognition process in some other embodiments of the present invention. Only when the number of activated pulse neurons in the excitation layer exceeds a certain ratio, the emitted pulses are tiled into one dimension and then input to the inference layer uniformly. When there is no target in the sampling window, such as background noise input in the sampling window By adjusting the above-mentioned ratio value, it is possible to realize that when only background noise is input, the excitation layer does not emit pulses to the inference layer, thereby reducing the calculation amount of the entire pulse recognition algorithm and greatly reducing the calculation energy consumption.
本实施例中的推理层由(Integrate and Fire)IF神经元组成,在本发明已公开内容的基础上,推理层的层数以及各层神经元数目可根据实际情况进行合理而明智的调整,各层推理神经元之间以突触相连接,且突触连接方式包括但不限于卷积连接、局部连接或全连接,突触强度可采用已有的神经网络训练方式获得,训练方法包括但不限于基于梯度的反向传播学习算法、强化学习算法和脉冲时间依赖的突触可塑性算法(Spiking Time Dependent Plasticity,STDP)等,本发明不再赘述。本发明的推理神经元在接收到脉冲以后,自身膜电位会发生变化。The reasoning layer in this embodiment is composed of (Integrate and Fire) IF neurons. Based on the disclosed content of the present invention, the number of layers of the reasoning layer and the number of neurons in each layer can be adjusted reasonably and wisely according to the actual situation. The inference neurons of each layer are connected by synapses, and the synaptic connection methods include but are not limited to convolutional connection, partial connection or full connection. Synaptic strength can be obtained by using existing neural network training methods. Training methods include but It is not limited to gradient-based backpropagation learning algorithms, reinforcement learning algorithms, and pulse time-dependent synaptic plasticity (Spiking Time Dependent Plasticity, STDP) algorithms, etc., and will not be repeated in the present invention. After the inference neuron of the present invention receives the pulse, its membrane potential will change.
步骤105,在满足第三预设条件下令脉冲沿末层推理神经元以外的各层推理神经元依次传递,本实施例中,第三预设条件为末层推理神经元以外的各层推理神经元自身的膜电位变化量超过自身的第三激发阈值,第三预设值可以为1.0。在本发明提供的各实施例中,应理解的是,“依次传递”指的是在前设置的神经元将脉冲传递至在后设置的神经元,如图3和图4所示。Step 105: When the third preset condition is satisfied, the pulse is transmitted along each layer of inference neurons other than the last layer of inference neurons. In this embodiment, the third preset condition is each layer of inference neurons other than the last layer of inference neurons. The change in the membrane potential of the cell itself exceeds its own third excitation threshold, and the third preset value may be 1.0. In the various embodiments provided by the present invention, it should be understood that "sequential transmission" refers to the neuron set in the front transmitting the pulse to the neuron set in the back, as shown in FIG. 3 and FIG. 4.
步骤106,根据末层推理神经元的活跃度确定识别结果。比如,最后一层的IF神经元通过比较各自活跃度得出识别结果,即末层各推理神经元活跃度分别达到各种设定的阈值,则可认定识别成功、反之则识别失败,或者,本实施例也可通过末层各推理神经元活跃度之和是否达到预设值判断是否识别成功。作为一些优选的技术方案,本发明的一些实施例中可以使用多次测量机制,即可将上述的步骤101~步骤106可同步或异步执行多次,优选为同步执行,在101步骤中获取的是仿真视觉传感器输出的一段脉冲序列(仿生视觉传感器打破了传统相机帧的概念,普遍具有很高的时间分辨率),则原始脉冲数据中包含多个脉冲序列,脉冲识别算法计算的结果也不止一个、而是连续多个,本实施例对所有脉冲序列同时进行上 述处理后的多个识别结果均一致时才对识别结果进行最终输出,上述改进后的方案能够显著提升识别准确率。所以本发明上述各实施例中提供的目标识别方法可以同时利用时间信息和空间信息完成对目标的快速识别。Step 106: Determine the recognition result according to the activity of the last layer of inference neurons. For example, if the IF neurons in the last layer compare their respective activities to obtain the recognition results, that is, if the activities of the inference neurons in the last layer reach various set thresholds, then the recognition can be considered successful, otherwise, the recognition failed, or, In this embodiment, it is also possible to judge whether the recognition is successful based on whether the sum of the activity of each inference neuron in the final layer reaches a preset value. As some preferred technical solutions, in some embodiments of the present invention, a multiple measurement mechanism can be used, that is, the above steps 101 to 106 can be executed multiple times synchronously or asynchronously, preferably synchronously. It is a pulse sequence output by the simulated vision sensor (the bionic vision sensor breaks the concept of the traditional camera frame and generally has a high time resolution), the original pulse data contains multiple pulse sequences, and the calculation results of the pulse recognition algorithm are not limited One is one, but there are multiple consecutive ones. In this embodiment, the recognition results are finally output only when the multiple recognition results after the above-mentioned processing are performed on all pulse sequences at the same time are consistent. The above-mentioned improved solution can significantly improve the recognition accuracy. Therefore, the target recognition method provided in the foregoing embodiments of the present invention can simultaneously use time information and space information to complete rapid target recognition.
请参阅图2,图2为本发明一些实施例中的目标识别系统的组成示意图。本发明提供了一种目标识别系统,该目标识别系统包括仿生视觉传感器和下述的目标识别装置,目标识别装置可集成在脉冲计算平台上、用于实现脉冲处理算法,仿生视觉传感器用于拍摄待识别目标,以得到待识别目标的原始脉冲数据(即录制的数据),然后可将录制的数据实时输出至脉冲计算平台,本发明实施时,脉冲计算平台可实时接收仿生视觉传感器输出的数据,传感器原始数据需要编码后一般通过总线传输出来,所以从脉冲事件发放到算法处理平台接收,这个过程产生了一个微小时间差,可通过当前时刻记录模块测量,记为Δt1。Please refer to FIG. 2, which is a schematic diagram of the composition of a target recognition system in some embodiments of the present invention. The present invention provides a target recognition system. The target recognition system includes a bionic vision sensor and the following target recognition device. The target recognition device can be integrated on a pulse computing platform to implement a pulse processing algorithm, and the bionic vision sensor is used for shooting The target to be identified to obtain the original pulse data (that is, the recorded data) of the target to be identified, and then the recorded data can be output to the pulse calculation platform in real time. When the present invention is implemented, the pulse calculation platform can receive the data output by the bionic vision sensor in real time , The original sensor data needs to be encoded and generally transmitted through the bus, so from the pulse event issuance to the algorithm processing platform to receive, this process produces a small time difference, which can be measured by the current time recording module, which is recorded as Δt1.
本发明中使用的脉冲计算平台,包括但不限于以下类型:(1)服务器、工作站、台式主机或移动电脑,(2)嵌入式计算平台,如多核片上系统(Multi-Processor System-on-Chip,MPSoC)开发板、中央处理器(Central Processing Unit,CPU)开发板、图形处理器(Graphics Processing Unit,GPU)开发板、单片机(Single-Chip Microcomputer,SCM)等,(3)现场可编程门阵列(Field Programmable Gate Array,FPGA),或专用集成电路芯片(Application Specific Integrated Circuit,ASIC)等,(4)云计算平台等。脉冲计算平台用于运行脉冲处理算法,脉冲处理算法可以包括系统控制算法和脉冲识别算法。The pulse computing platform used in the present invention includes but is not limited to the following types: (1) server, workstation, desktop host or mobile computer, (2) embedded computing platform, such as Multi-Processor System-on-Chip , MPSoC) development board, central processing unit (CPU) development board, graphics processing unit (GPU) development board, single-chip Microcomputer (SCM), etc., (3) field programmable gate Array (Field Programmable Gate Array, FPGA), or Application Specific Integrated Circuit (ASIC), etc., (4) Cloud computing platform, etc. The pulse calculation platform is used to run pulse processing algorithms, which can include system control algorithms and pulse recognition algorithms.
具体地,本发明一些实施例中提供了一种目标识别装置,该装置包括脉冲获取模块、采样窗口模块、脉冲映射模块、神经元激发模块、神经元推理模块及识别结果确定模块。Specifically, some embodiments of the present invention provide a target recognition device, which includes a pulse acquisition module, a sampling window module, a pulse mapping module, a neuron excitation module, a neuron inference module, and a recognition result determination module.
脉冲获取模块,用于获取被拍摄的待识别目标的原始脉冲数据;原始脉冲数据中包含至少一个脉冲序列,其中,一个脉冲序列可包含一帧脉冲或多帧脉冲,多个脉冲序列或多帧脉冲能够用于后续的多次测量机制。The pulse acquisition module is used to acquire the original pulse data of the target to be identified; the original pulse data contains at least one pulse sequence, where one pulse sequence can include one frame of pulse or multiple frames of pulse, multiple pulse sequences or multiple frames The pulse can be used for subsequent multiple measurement mechanisms.
采样窗口模块,用于通过对脉冲序列中的各脉冲进行筛选的方式检索待识别目标所在的区域,并用于根据待识别目标所在的区域确定脉冲采样窗口,以及用于将脉冲采样窗口内的脉冲输入至脉冲神经网络;脉冲神经网络包括激发层和推理层。从算法处理平台接收到脉冲数据,到采样窗口模块筛选出采样窗口内的脉冲数据并输入激发层,这个过程产生了一个微小的时间差,可通过当前时刻记录模块测量,记为Δt2。The sampling window module is used to retrieve the area of the target to be identified by filtering each pulse in the pulse sequence, and is used to determine the pulse sampling window according to the area of the target to be identified, and to sample the pulses in the pulse sampling window. Input to the impulse neural network; the impulse neural network includes an excitation layer and an inference layer. From the algorithm processing platform receiving the pulse data, to the sampling window module filtering out the pulse data in the sampling window and inputting it into the excitation layer, this process produces a slight time difference, which can be measured by the current moment recording module and recorded as Δt2.
脉冲映射模块,用于对输入至脉冲神经网络的脉冲进行映射,以使激发层的首层激发神经元数目与脉冲采样窗口尺寸对应,脉冲映射模块可理解为是本发明使用的脉冲识别算法中的脉冲神经网络的脉冲事件映射层,可采用哈希映射、进制转换等方法,将脉冲事件映射到激发层的输入脉冲神经元;激发层中包含多层通过突触方式依次连接的激发神经元,突触的连接方式可包括局部连接、全连接或动态随机连接等,下述的内容以局部连接方式为例进行具体地说明。The pulse mapping module is used to map the pulses input to the pulse neural network so that the number of excited neurons in the first layer of the excitation layer corresponds to the size of the pulse sampling window. The pulse mapping module can be understood as the pulse recognition algorithm used in the present invention The impulse event mapping layer of the impulse neural network can use hash mapping, binary conversion and other methods to map impulse events to the input impulse neurons of the excitation layer; the excitation layer contains multiple layers of excitation nerves connected in sequence through synapses Meta, the connection mode of the synapse can include partial connection, full connection or dynamic random connection, etc. The following content takes the local connection mode as an example for specific description.
首先,上述的激发层的输入脉冲神经元数目需要与脉冲采样窗口尺寸需保持一致,假设采样窗口尺寸为N*N,则采样窗口内输入的脉冲实际是由N*N个仿生视觉传感器对应位置的感光神经元激发产生,到达激发层的输入脉冲神经元时产生了(Δt1+Δt2)时间延迟。如图3所示,每个输入脉冲神经元与多个激发脉冲神经元局部连接,其局部连接方式为:以输入脉冲神经元自身位置为中心,以R为半径的周围邻域范围内的脉冲神经元相连接(若输入神经元位于采样窗口边界处,则仅连接位于N*N范围内的脉冲神经元)。First of all, the number of input pulse neurons in the above excitation layer needs to be consistent with the size of the pulse sampling window. Assuming that the sampling window size is N*N, the input pulse in the sampling window is actually corresponding to the position of N*N bionic vision sensors. The photoreceptor neuron is excited, and the input pulse neuron that reaches the excitation layer has a time delay of (Δt1+Δt2). As shown in Figure 3, each input impulse neuron is locally connected to multiple excitation impulse neurons. The local connection mode is: the input impulse neuron's own position as the center, the pulse in the surrounding neighborhood with R as the radius Neurons are connected (if the input neuron is located at the boundary of the sampling window, only the impulse neurons within the range of N*N are connected).
神经元激发模块,用于在满足第一预设条件下令脉冲沿末层激发神经元以外的各层激发神经元依次传递,并用于在满足第二预设条件下令脉冲沿末层激发神经元传递至推理层;推理层包含多层依次连接的推理神经元。具体地,第一预设条件为末层激发神经元以外的各层激发神经元自身的膜电位变化量超过自身的第一激发阈值,第二预设条件为末层激发神经元自身的膜电位变化量超过自身的第二激发阈值且末层激发神经元中被激活 的神经元数目的占比大于第一预设比例。具体地,当输入脉冲神经元发放脉冲时,与之相连且位于相同位置的激发脉冲神经元的膜电位增加Δ1(即连接的突触强度为Δ1),其余与之相连的激发脉冲神经元的膜电位增加Δ2(即连接的突触强度为Δ2),其中Δ1>Δ2,当没有脉冲输入时,与之相连且位于相同位置的激发脉冲神经元的膜电位衰减Δ3(即LIF神经元的膜电位衰减因子为Δ3);其中,Δ1、Δ2和Δ3的值可预先设定为固定值,也可动态调节。当激发脉冲神经元的膜电位超过自身的激发阈值时,则该神经元激活并发放一个脉冲,与此同时将自身膜电位复位至静息状态。The neuron excitation module is used to enable the pulse to be transmitted along the firing neurons in each layer other than the terminal layer when the first preset condition is met, and to enable the pulse to be transmitted along the terminal layer to stimulate the neuron when the second preset condition is met To the inference layer; the inference layer contains multiple layers of inference neurons connected in sequence. Specifically, the first preset condition is that the change in membrane potential of each layer other than the final layer of stimulated neurons exceeds its own first excitation threshold, and the second preset condition is that the membrane potential of the final layer of stimulated neurons itself The amount of change exceeds its second excitation threshold and the proportion of the number of activated neurons in the final layer of excitation neurons is greater than the first preset ratio. Specifically, when the input impulse neuron emits a pulse, the membrane potential of the excitation impulse neuron connected to it and located at the same position increases by Δ1 (that is, the connected synapse strength is Δ1), and the other excitation impulse neurons connected to it have an increase in membrane potential. The membrane potential increases by Δ2 (that is, the connected synaptic strength is Δ2), where Δ1>Δ2, when there is no pulse input, the membrane potential of the stimulated impulse neuron connected to it and located at the same position attenuates Δ3 (that is, the membrane of the LIF neuron The potential attenuation factor is Δ3); among them, the values of Δ1, Δ2, and Δ3 can be preset as fixed values or can be dynamically adjusted. When the membrane potential of a stimulated neuron exceeds its own excitation threshold, the neuron activates and emits a pulse, and at the same time resets its membrane potential to a resting state.
神经元推理模块,用于在满足第三预设条件下令脉冲沿末层推理神经元以外的各层推理神经元依次传递。具体地,第三预设条件为末层推理神经元以外的各层推理神经元自身的膜电位变化量超过自身的第三激发阈值。具体地,与激发层的LIF神经元不同,推理层的IF神经元在于没有膜电位衰减的特性,推理层的IF神经元接收到脉冲输入后,自身膜电位增加,增加的值等于全部脉冲输入的突触强度值的加和;如果IF神经元膜电位超过自身的激发阈值,则发放一个脉冲并传递给下一层与之相连的IF神经元,同时自身的膜电位复位,脉冲逐层向后传播,直到最后一层。The neuron inference module is used to make the pulses be transmitted sequentially along each layer of inference neurons except the last layer of inference neurons when the third preset condition is satisfied. Specifically, the third preset condition is that the change in membrane potential of each layer of inference neurons other than the last layer of inference neurons exceeds its own third excitation threshold. Specifically, unlike the LIF neurons in the excitation layer, the IF neurons in the inference layer do not have the characteristics of membrane potential attenuation. After the IF neurons in the inference layer receive a pulse input, their own membrane potential increases, and the value of the increase is equal to the total pulse input If the membrane potential of the IF neuron exceeds its own excitation threshold, a pulse will be emitted and transmitted to the next layer of IF neurons connected to it. At the same time, the membrane potential of the IF neuron will be reset, and the pulse will go from layer to layer. After spreading, until the last layer.
识别结果确定模块,用于根据末层推理神经元的活跃度确定识别结果。比如,最后一层的IF神经元通过比较各自活跃度得出识别结果,即末层各推理神经元活跃度分别达到各种设定的阈值,则可认定识别成功、反之则识别失败,或者,本实施例也可通过末层各推理神经元活跃度之和是否达到预设值判断是否识别成功。The recognition result determination module is used to determine the recognition result according to the activity of the last layer of inference neurons. For example, if the IF neurons in the last layer compare their respective activities to obtain the recognition results, that is, if the activities of the inference neurons in the last layer reach various set thresholds, then the recognition can be considered successful, otherwise, the recognition failed, or, In this embodiment, it is also possible to judge whether the recognition is successful based on whether the sum of the activity of each inference neuron in the final layer reaches a preset value.
在本发明的一些改进的实施例中,目标识别装置还可以包括转速测量模块和当前时刻记录模块,转速测量模块用于在确定脉冲采样窗口后记录待识别目标相邻两次出现在脉冲采样窗口的时间间隔,转速测量模块根据旋转目标的旋转半径将采样窗口固定在相机视野内的一个位置,然后累积记录同一个目标连续多次出现在采样窗口的时间间隔,并可通过统计方法减小误差,计算出目标旋转一周所需要的时间,进而实现对高速旋转目标 转速的测量。当前时刻记录模块用于精准获取当前时刻,时刻的具体形式可以根据需要进行合理而明智的选择,本发明可令时间精度的延迟尽可能地小。本发明采用脉冲神经网络来处理仿生视觉传感器输出的脉冲序列,处理方法同时利用时间信息和空间信息,可以实现对目标的快速识别,在运动速度较高的情况下依然适用。In some improved embodiments of the present invention, the target identification device may further include a rotational speed measurement module and a current moment recording module. The rotational speed measurement module is used to record that the target to be identified appears twice in the pulse sampling window after the pulse sampling window is determined. The rotational speed measurement module fixes the sampling window at a position within the camera's field of view according to the rotation radius of the rotating target, and then accumulates the time interval during which the same target appears in the sampling window multiple times, and can reduce the error by statistical methods , Calculate the time required for the target to rotate one circle, and then realize the measurement of the target speed of high-speed rotation. The current time recording module is used to accurately obtain the current time. The specific form of the time can be selected reasonably and wisely according to needs. The present invention can make the delay of time accuracy as small as possible. The invention adopts the pulse neural network to process the pulse sequence output by the bionic vision sensor, and the processing method simultaneously utilizes the time information and the space information, can realize the rapid identification of the target, and is still applicable under the condition of high movement speed.
在本发明的一些实施例中,还可以提供一种计算机可读存储介质,且计算机可读存储介质上存储有计算机程序,该计算机程序可以被处理器执行,从而能够实现上述各个实施例中的目标识别方法,本发明能够令上述计算机程序运行于脉冲计算平台上。In some embodiments of the present invention, a computer-readable storage medium may also be provided, and a computer program is stored on the computer-readable storage medium, and the computer program may be executed by a processor, so as to realize the The target recognition method, the present invention can make the above-mentioned computer program run on the impulse computing platform.
下面以具体的实验过程对本发明工作过程及其实际带来的显著效果作进一步详细说明。本实施例采用上述积分型仿生视觉传感器Vidar,该传感器时间分辨率为25微秒,显示分辨率为400*250像素。用该传感器正对着高速旋转的工业风扇在晴朗的正午进行拍摄,风扇与传感器的中心间距约为60cm,风扇半径约20cm,风扇的叶片上贴有3个黑底白色英文字符(‘P’,‘C’,‘L’),叶片旋转速度约2500R/min,拍摄的脉冲数据在数据采集卡上打包后通过USB3.0总线传输到台式电脑上进行处理,并台式电脑实时接收和获取Vidar输出的数据,采样窗口设定模块筛选脉冲数据,脉冲事件映射层将筛选出的脉冲数据一一映射到激发层的输入脉冲神经元,采样窗口位置的设定根据英文字符在Vidar视野中的位置而定,采样窗口的大小根据英文字符在Vidar视野中的像素尺寸而定。本实施例中采样窗口左上角位点的坐标(y,x)为(50,180),采样窗口尺寸为40*40像素,即激发层包含40*40个输入脉冲神经元,脉冲处理算法对仿生视觉传感器输入的脉冲数据进行处理,如图4所示,激发层的脉冲输入神经元数目与采样窗口尺寸一致,为40*40,脉冲事件映射模块把采样窗口内输入的脉冲与脉冲输入神经元实现一一映射,每个脉冲输入输入神经元均与9个激发脉冲神经元以突触相连接,突触的连接方式为:以输入脉冲神经元自身位置为中心的周围八邻域范围内的激发脉冲神经元相连接(若输入脉冲神经元位于采样窗口边界处,则仅连接位于40*40范围内的激发脉冲 神经元)。当输入脉冲神经元发放脉冲时,与之相连且位于相同位置的激发脉冲神经元膜电位增加1.4(Δ1),其余与之相连的激发脉冲神经元膜电位增加0.5(Δ2)。当没有脉冲输入时,与之相连且位于相同位置的激发脉冲神经元膜电位衰减0.25(Δ3)。当激发脉冲神经元的膜电位超过自身的激发阈值(1.0)时,该激发脉冲神经元激活并发放一个脉冲,请参阅图5至图7,是利用本发明提供的激发层对脉冲序列进行噪声滤波和脉冲增强的效果展示,左侧图为仿生视觉传感器的脉冲输入,右侧图为利用本发明激发层进行滤波和增强后的效果,具体地,白色部分表示有脉冲输入的位置、黑色部分表示无脉冲输入的位置,方框区域表示脉冲采样窗口;其中,图5为通过激发层对待识别目标—字符“P”所在的区域的脉冲序列进行滤波和增强后的效果,图6为通过激发层对待识别目标—字符“C”所在的区域的脉冲序列进行噪声滤波和脉冲增强后的效果,图7为通过激发层对待识别目标—字符“L”所在的区域的脉冲序列进行噪声滤波和脉冲增强后的效果,仅当激活的激发脉冲神经元数目超过一定比例值(15%)时,才将激发的脉冲平铺成一维(1*1600)后统一输入到推理层。当旋转字符没有出现在采样窗口中时,采样窗口中输入的是背景噪声;如图4所示,推理层由3层IF神经元组成,各层IF神经元数目分别为512、1024、3,每层间IF神经元通过全连接的方式实现突触连接。IF神经元接收到脉冲输入后,自身膜电位增加,增加的值等于全部脉冲输入的突触强度值的加和。如果IF神经元膜电位超过自身的激发阈值(1.0),则发放一个脉冲并传递给下一层与之相连的IF神经元,同时自身的膜电位复位至静息状态(0)。脉冲逐层向后传播,直到到达最后一层,最后一层IF神经元通过比较各个神经元的活跃度得出识别结果。本发明多次测量机制,比如仅当推理层连续5次输出的识别结果一致时,才得出最终识别结果,该方式能够显著提升识别准确率和识别速度,并实现对高速旋转字符转速的测量。本发明通过多线程并行计算的优化方法显著提升识别速度,将计算任务拆分给台式电脑的多个处理器同时计算,而且通过记录同一个字符连续两次出现在采样窗口的时间间隔,便能计算出字符旋转一周所需要的时间,进而实现对高速旋转风扇转速的测量。In the following, the working process of the present invention and the significant effects actually brought by it will be further described in detail with the specific experimental process. This embodiment uses the above-mentioned integral bionic vision sensor Vidar, the sensor has a time resolution of 25 microseconds, and a display resolution of 400*250 pixels. The sensor is used to shoot at a fine noon on a high-speed rotating industrial fan. The center distance between the fan and the sensor is about 60cm, and the fan radius is about 20cm. There are 3 white English characters on black background ('P'). ,'C','L'), the blade rotation speed is about 2500R/min, the captured pulse data is packaged on the data acquisition card and then transmitted to the desktop computer through the USB3.0 bus for processing, and the desktop computer receives and obtains Vidar in real time The output data, the sampling window setting module filters the pulse data, and the pulse event mapping layer maps the filtered pulse data to the input pulse neurons of the excitation layer one by one. The position of the sampling window is set according to the position of the English character in the Vidar field of view The size of the sampling window depends on the pixel size of the English characters in Vidar's field of view. In this embodiment, the coordinates (y, x) of the upper left corner of the sampling window are (50, 180), and the size of the sampling window is 40*40 pixels. That is, the excitation layer contains 40*40 input pulse neurons. The pulse processing algorithm is effective for bionic vision. The pulse data input by the sensor is processed. As shown in Figure 4, the number of pulse input neurons in the excitation layer is the same as the sampling window size, which is 40*40. The pulse event mapping module realizes the pulse input in the sampling window and the pulse input neuron One-to-one mapping. Each impulse input neuron is connected to 9 excitation impulse neurons by synapses. The synapses are connected as follows: excitation in the surrounding eight neighborhoods centered on the input impulse neuron's own position The impulse neurons are connected (if the input impulse neuron is located at the boundary of the sampling window, only the impulse neurons within the range of 40*40 are connected). When the input impulse neuron emits a pulse, the membrane potential of the excitation impulse neuron connected to it and located at the same position increases by 1.4 (Δ1), and the membrane potential of the other excitation impulse neurons connected to it increases by 0.5 (Δ2). When there is no pulse input, the membrane potential of the stimulated pulse neuron connected to it and located at the same position attenuates by 0.25 (Δ3). When the membrane potential of the excitation pulse neuron exceeds its own excitation threshold (1.0), the excitation pulse neuron activates and emits a pulse. Please refer to Figures 5 to 7, which uses the excitation layer provided by the present invention to perform noise on the pulse sequence. The effect of filtering and pulse enhancement is shown. The picture on the left is the pulse input of the bionic vision sensor, and the picture on the right is the effect of filtering and enhancement using the excitation layer of the present invention. Specifically, the white part represents the position of the pulse input and the black part It represents the position where there is no pulse input, and the box area represents the pulse sampling window; among them, Figure 5 is the effect of filtering and enhancing the pulse sequence of the region where the character "P" is located through the excitation layer. Figure 6 shows the effect of the pulse sequence in the region where the character "P" is located through the excitation layer. The effect of noise filtering and pulse enhancement on the pulse sequence of the area where the target to be recognized—the character "C" is performed by the layer. Figure 7 shows the pulse sequence of the area where the target to be recognized—the character "L" is passed through the excitation layer for noise filtering and pulse The enhanced effect is that only when the number of activated excitation pulse neurons exceeds a certain proportion (15%), the excitation pulses are tiled into one dimension (1*1600) and then uniformly input to the inference layer. When the rotating character does not appear in the sampling window, the input in the sampling window is background noise; as shown in Figure 4, the inference layer is composed of 3 layers of IF neurons, and the number of IF neurons in each layer is 512, 1024, and 3. The IF neurons between each layer realize synaptic connection through a fully connected way. After the IF neuron receives the pulse input, its own membrane potential increases, and the increased value is equal to the sum of the synaptic strength values of all the pulse inputs. If the membrane potential of an IF neuron exceeds its own excitation threshold (1.0), a pulse is sent and transmitted to the next layer of IF neurons connected to it, and its membrane potential is reset to a resting state (0). The pulse propagates back layer by layer until it reaches the last layer. The IF neuron of the last layer obtains the recognition result by comparing the activity of each neuron. The multiple measurement mechanism of the present invention, for example, only when the recognition results outputted by the inference layer for 5 consecutive times are consistent, the final recognition result can be obtained. This method can significantly improve the recognition accuracy and recognition speed, and realize the measurement of the rotation speed of high-speed rotating characters. . The invention significantly improves the recognition speed through the optimization method of multi-threaded parallel computing, splits the computing task to multiple processors of the desktop computer for simultaneous calculation, and records the time interval when the same character appears in the sampling window twice in succession. Calculate the time required for the character to rotate one circle, and then realize the measurement of the speed of the high-speed rotating fan.
在本说明书的描述中,参考术语“本实施例”、“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不必须针对的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任一个或多个实施例或示例中以合适的方式结合。此外,在不相互矛盾的情况下,本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。In the description of this specification, the description with reference to the terms "this embodiment", "one embodiment", "some embodiments", "examples", "specific examples", or "some examples", etc. means to combine the embodiments The specific features, structures, materials or characteristics described by the examples are included in at least one embodiment or example of the present invention. In this specification, the schematic representations of the above terms do not necessarily refer to the same embodiment or example. Moreover, the described specific features, structures, materials or characteristics can be combined in any one or more embodiments or examples in a suitable manner. In addition, those skilled in the art can combine and combine the different embodiments or examples and the features of the different embodiments or examples described in this specification without contradicting each other.
此外,术语“第一”、“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括至少一个该特征。在本发明的描述中,“多个”的含义是至少两个,例如两个,三个等,除非另有明确具体的限定。In addition, the terms "first" and "second" are only used for descriptive purposes, and cannot be understood as indicating or implying relative importance or implicitly indicating the number of indicated technical features. Therefore, the features defined with "first" and "second" may explicitly or implicitly include at least one of the features. In the description of the present invention, "plurality" means at least two, such as two, three, etc., unless otherwise specifically defined.
以上所述仅为本发明的较佳实施例而已,并不用以限制本发明,凡在本发明实质内容上所作的任何修改、等同替换和简单改进等,均应包含在本发明的保护范围之内。The above are only the preferred embodiments of the present invention and are not intended to limit the present invention. Any modification, equivalent replacement and simple improvement made in the essence of the present invention should be included in the protection scope of the present invention. Inside.

Claims (10)

  1. 一种目标识别方法,其特征在于:所述方法包括如下步骤;A target recognition method, characterized in that: the method includes the following steps;
    获取被拍摄的待识别目标的原始脉冲数据,所述原始脉冲数据中包含至少一个脉冲序列;Acquiring raw pulse data of the photographed target to be identified, where the raw pulse data includes at least one pulse sequence;
    通过对脉冲序列中的各脉冲进行筛选的方式检索待识别目标所在的区域,根据待识别目标所在的区域确定脉冲采样窗口,将脉冲采样窗口内的脉冲输入至脉冲神经网络,所述脉冲神经网络包括激发层和推理层;The area where the target to be identified is located is retrieved by screening each pulse in the pulse sequence, the pulse sampling window is determined according to the area where the target to be identified is located, and the pulses in the pulse sampling window are input to the pulse neural network, the pulse neural network Including excitation layer and reasoning layer;
    对输入至脉冲神经网络的脉冲进行映射,以使激发层的首层激发神经元数目与脉冲采样窗口尺寸对应,所述激发层中包含多层依次连接的激发神经元;Mapping the pulses input to the spiking neural network so that the number of firing neurons in the first layer of the firing layer corresponds to the size of the pulse sampling window, and the firing layer includes multiple layers of firing neurons connected in sequence;
    在满足第一预设条件下令脉冲沿末层激发神经元以外的各层激发神经元依次传递,并在满足第二预设条件下令脉冲沿末层激发神经元传递至推理层,所述推理层包含多层依次连接的推理神经元;When the first preset condition is met, the pulse is transmitted along the firing neurons of each layer other than the final layer in turn, and the pulse is transmitted to the inference layer along the firing neuron of the last layer when the second preset condition is met, the reasoning layer Contains multiple layers of inference neurons connected in sequence;
    在满足第三预设条件下令脉冲沿末层推理神经元以外的各层推理神经元依次传递;When the third preset condition is satisfied, the pulse is transmitted in sequence along each layer of inference neurons except the last layer of inference neurons;
    根据末层推理神经元的活跃度确定识别结果。The recognition result is determined according to the activity of the last layer of inference neurons.
  2. 根据权利要求1所述的目标识别方法,其特征在于:获取的待识别目标的原始脉冲数据来源于仿生视觉传感器,利用仿生视觉传感器拍摄待识别目标。The target recognition method according to claim 1, wherein the acquired raw pulse data of the target to be recognized comes from a bionic vision sensor, and the bionic vision sensor is used to photograph the target to be recognized.
  3. 根据权利要求1或2所述的目标识别方法,其特征在于:所述第一预设条件为末层激发神经元以外的各层激发神经元自身的膜电位变化量超过自身的第一激发阈值。The target recognition method according to claim 1 or 2, characterized in that: the first preset condition is that the change in membrane potential of each layer of stimulated neurons other than the final layer of stimulated neurons exceeds its own first excitation threshold .
  4. 根据权利要求3所述的目标识别方法,其特征在于:所述第二预设条件为末层激发神经元自身的膜电位变化量超过自身的第二激发阈值且末层激发神经元中被激活的神经元数目的占比大于第一预设比例。The target recognition method according to claim 3, characterized in that: the second preset condition is that the change in membrane potential of the final layer of stimulated neurons exceeds its own second excitation threshold and the final layer of stimulated neurons is activated The proportion of the number of neurons is greater than the first preset proportion.
  5. 根据权利要求1、2或4中任一权利要求所述的目标识别方法,其特征在于:所述第三预设条件为末层推理神经元以外的各层推理神经元自身的膜电位变化量超过自身的第三激发阈值。The target recognition method according to any one of claims 1, 2 or 4, wherein the third preset condition is the change in membrane potential of each layer of inference neurons other than the last layer of inference neurons. Exceeds its own third excitation threshold.
  6. 根据权利要求1所述的目标识别方法,其特征在于:在确定脉冲采样窗口后,还包括记录待识别目标相邻两次出现在脉冲采样窗口的时间间隔的步骤。The target recognition method according to claim 1, characterized in that: after the pulse sampling window is determined, it further comprises the step of recording the time interval when the target to be recognized appears in the pulse sampling window twice.
  7. 根据权利要求1、2、4或6中任一权利要求所述的目标识别方法,其特征在于:所述原始脉冲数据中包含多个脉冲序列,并对所有脉冲序列同时进行上述处理后的识别结果均一致时才对识别结果进行输出。The target recognition method according to any one of claims 1, 2, 4, or 6, wherein the original pulse data contains a plurality of pulse sequences, and all the pulse sequences are simultaneously identified after the above processing The recognition result is output only when the results are consistent.
  8. 一种目标识别装置,其特征在于:所述装置包括脉冲获取模块、采样窗口模块、脉冲映射模块、神经元激发模块、神经元推理模块及识别结果确定模块;A target recognition device, characterized in that: the device includes a pulse acquisition module, a sampling window module, a pulse mapping module, a neuron excitation module, a neuron inference module, and a recognition result determination module;
    所述脉冲获取模块,用于获取被拍摄的待识别目标的原始脉冲数据;所述原始脉冲数据中包含至少一个脉冲序列;The pulse acquisition module is used to acquire the original pulse data of the target to be identified that is photographed; the original pulse data includes at least one pulse sequence;
    所述采样窗口模块,用于通过对脉冲序列中的各脉冲进行筛选的方式检索待识别目标所在的区域,并用于根据待识别目标所在的区域确定脉冲采样窗口,以及用于将脉冲采样窗口内的脉冲输入至脉冲神经网络;所述脉冲神经网络包括激发层和推理层;The sampling window module is used to retrieve the area where the target to be recognized is located by filtering each pulse in the pulse sequence, and is used to determine the pulse sampling window according to the area where the target to be recognized is located, and to sample the pulse in the window The pulses of are input to the impulse neural network; the impulse neural network includes an excitation layer and an inference layer;
    所述脉冲映射模块,用于对输入至脉冲神经网络的脉冲进行映射,以使激发层的首层激发神经元数目与脉冲采样窗口尺寸对应;所述激发层中包含多层依次连接的激发神经元;The pulse mapping module is used to map the pulses input to the pulse neural network so that the number of the first layer of excitation neurons in the excitation layer corresponds to the size of the pulse sampling window; the excitation layer contains multiple layers of sequentially connected excitation nerves Yuan;
    所述神经元激发模块,用于在满足第一预设条件下令脉冲沿末层激发神经元以外的各层激发神经元依次传递,并用于在满足第二预设条件下令脉冲沿末层激发神经元传递至推理层;所述推理层包含多层依次连接的推理神经元;The neuron excitation module is used to enable the pulse to be transmitted along the firing neurons in each layer other than the terminal layer when the first preset condition is satisfied, and to enable the pulse to fire the nerve along the terminal layer when the second preset condition is satisfied Elements are transferred to the inference layer; the inference layer includes multiple layers of inference neurons connected in sequence;
    所述神经元推理模块,用于在满足第三预设条件下令脉冲沿末层推理神经元以外的各层推理神经元依次传递;The neuron inference module is used to enable pulses to be sequentially transmitted along each layer of inference neurons other than the last layer of inference neurons when the third preset condition is satisfied;
    所述识别结果确定模块,用于根据末层推理神经元的活跃度确定识别结果。The recognition result determination module is used to determine the recognition result according to the activity of the last layer of inference neurons.
  9. 一种目标识别系统,其特征在于:该目标识别系统包括仿生视觉 传感器和权利要求8所述的目标识别装置,所述仿生视觉传感器用于拍摄待识别目标,以得到待识别目标的原始脉冲数据。A target recognition system, characterized in that: the target recognition system comprises a bionic vision sensor and the target recognition device according to claim 8, the bionic vision sensor is used to photograph the target to be recognized to obtain the original pulse data of the target to be recognized .
  10. 一种计算机可读存储介质,其上存储有计算机程序,其特征在于:该计算机程序被处理器执行,以实现如权利要求1-7中任一权利要求所述的目标识别方法。A computer-readable storage medium with a computer program stored thereon, characterized in that the computer program is executed by a processor to implement the target identification method according to any one of claims 1-7.
PCT/CN2020/099443 2020-01-19 2020-06-30 Target recognition method, device, and system, and computer readable storage medium WO2021143066A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010062434.X 2020-01-19
CN202010062434.XA CN111275742B (en) 2020-01-19 2020-01-19 Target identification method, device and system and computer readable storage medium

Publications (1)

Publication Number Publication Date
WO2021143066A1 true WO2021143066A1 (en) 2021-07-22

Family

ID=71003474

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/099443 WO2021143066A1 (en) 2020-01-19 2020-06-30 Target recognition method, device, and system, and computer readable storage medium

Country Status (2)

Country Link
CN (1) CN111275742B (en)
WO (1) WO2021143066A1 (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114723038A (en) * 2022-03-03 2022-07-08 北京大学 Model conversion method, device, equipment and storage medium based on target detection
CN114998996A (en) * 2022-06-14 2022-09-02 中国电信股份有限公司 Signal processing method, device and equipment with motion attribute information and storage
CN115048979A (en) * 2022-04-29 2022-09-13 贵州大学 Robot touch pulse data classification method based on regularization
CN115048954A (en) * 2022-05-23 2022-09-13 北京大学 Retina-imitating target detection method and device, storage medium and terminal
CN115100458A (en) * 2022-06-02 2022-09-23 西安电子科技大学 Image classification method and related device
CN115841142A (en) * 2023-02-20 2023-03-24 鹏城实验室 Visual cortex simulation method based on deep pulse neural network and related equipment
CN115862338A (en) * 2023-03-01 2023-03-28 天津大学 Airport traffic flow prediction method, system, electronic device and medium
CN116011563A (en) * 2023-03-28 2023-04-25 之江实验室 High-performance pulse transmission simulation method and device for pulse relay
US20230196102A1 (en) * 2021-01-07 2023-06-22 Lynxi Technologies Co., Ltd. Method for Training Spiking Neuron Network, Method for Processing Data, Electronic Device, and Medium
CN116663631A (en) * 2023-07-28 2023-08-29 之江实验室 Task execution method and device, storage medium and electronic equipment
CN117252239A (en) * 2023-09-18 2023-12-19 北京邮电大学 Method and system for detecting target of pulse signal

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111275742B (en) * 2020-01-19 2022-01-11 北京大学 Target identification method, device and system and computer readable storage medium
CN113554726B (en) * 2021-06-04 2023-10-20 北京大学 Image reconstruction method and device based on pulse array, storage medium and terminal
CN114202068B (en) * 2022-02-17 2022-06-28 浙江大学 Self-learning implementation system for brain-like computing chip
CN114936331A (en) * 2022-04-18 2022-08-23 北京大学 Position prediction method, position prediction device, electronic equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190122110A1 (en) * 2017-10-23 2019-04-25 Samsung Electronics Co., Ltd. Neural network apparatus and method
CN109871940A (en) * 2019-01-31 2019-06-11 清华大学 A kind of multilayer training algorithm of impulsive neural networks
CN110210563A (en) * 2019-06-04 2019-09-06 北京大学 The study of pattern pulse data space time information and recognition methods based on Spike cube SNN
CN110659666A (en) * 2019-08-06 2020-01-07 广东工业大学 Image classification method of multilayer pulse neural network based on interaction
CN111275742A (en) * 2020-01-19 2020-06-12 北京大学 Target identification method, device and system and computer readable storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130325766A1 (en) * 2012-06-04 2013-12-05 Csaba Petre Spiking neuron network apparatus and methods
CN106295799B (en) * 2015-05-12 2018-11-02 核工业北京地质研究院 A kind of implementation method of deep learning multilayer neural network
CN108470190B (en) * 2018-03-09 2019-01-29 北京大学 Image-recognizing method based on FPGA customization impulsive neural networks

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190122110A1 (en) * 2017-10-23 2019-04-25 Samsung Electronics Co., Ltd. Neural network apparatus and method
CN109871940A (en) * 2019-01-31 2019-06-11 清华大学 A kind of multilayer training algorithm of impulsive neural networks
CN110210563A (en) * 2019-06-04 2019-09-06 北京大学 The study of pattern pulse data space time information and recognition methods based on Spike cube SNN
CN110659666A (en) * 2019-08-06 2020-01-07 广东工业大学 Image classification method of multilayer pulse neural network based on interaction
CN111275742A (en) * 2020-01-19 2020-06-12 北京大学 Target identification method, device and system and computer readable storage medium

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11941526B2 (en) * 2021-01-07 2024-03-26 Lynxi Technologies Co., Ltd. Methods, electronic devices, and computer-readable media for training, and processing data through, a spiking neuron network
US20230196102A1 (en) * 2021-01-07 2023-06-22 Lynxi Technologies Co., Ltd. Method for Training Spiking Neuron Network, Method for Processing Data, Electronic Device, and Medium
CN114723038A (en) * 2022-03-03 2022-07-08 北京大学 Model conversion method, device, equipment and storage medium based on target detection
CN114723038B (en) * 2022-03-03 2024-04-23 北京大学 Model conversion method, device, equipment and storage medium based on target detection
CN115048979A (en) * 2022-04-29 2022-09-13 贵州大学 Robot touch pulse data classification method based on regularization
CN115048954A (en) * 2022-05-23 2022-09-13 北京大学 Retina-imitating target detection method and device, storage medium and terminal
CN115100458A (en) * 2022-06-02 2022-09-23 西安电子科技大学 Image classification method and related device
CN114998996A (en) * 2022-06-14 2022-09-02 中国电信股份有限公司 Signal processing method, device and equipment with motion attribute information and storage
CN114998996B (en) * 2022-06-14 2024-04-05 中国电信股份有限公司 Signal processing method, device and equipment with motion attribute information and storage
CN115841142A (en) * 2023-02-20 2023-03-24 鹏城实验室 Visual cortex simulation method based on deep pulse neural network and related equipment
CN115862338A (en) * 2023-03-01 2023-03-28 天津大学 Airport traffic flow prediction method, system, electronic device and medium
CN116011563A (en) * 2023-03-28 2023-04-25 之江实验室 High-performance pulse transmission simulation method and device for pulse relay
CN116663631A (en) * 2023-07-28 2023-08-29 之江实验室 Task execution method and device, storage medium and electronic equipment
CN116663631B (en) * 2023-07-28 2024-05-14 之江实验室 Task execution method and device, storage medium and electronic equipment
CN117252239A (en) * 2023-09-18 2023-12-19 北京邮电大学 Method and system for detecting target of pulse signal

Also Published As

Publication number Publication date
CN111275742A (en) 2020-06-12
CN111275742B (en) 2022-01-11

Similar Documents

Publication Publication Date Title
WO2021143066A1 (en) Target recognition method, device, and system, and computer readable storage medium
EP4074247A1 (en) Health status test method and device, and computer storage medium
CN112699956B (en) Neuromorphic visual target classification method based on improved impulse neural network
Huang et al. 1000× faster camera and machine vision with ordinary devices
Parks et al. Augmented saliency model using automatic 3d head pose detection and learned gaze following in natural scenes
CN106796449A (en) Eye-controlling focus method and device
CN110516685A (en) Lenticular opacities degree detecting method based on convolutional neural networks
Kolla et al. Efficient classification of diabetic retinopathy using binary cnn
Posch Bio-inspired vision
Qi et al. A new heterogeneous neural network model and its application in image enhancement
CN107242876A (en) A kind of computer vision methods for state of mind auxiliary diagnosis
CN105930793A (en) Human body detection method based on SAE characteristic visual learning
Feng et al. Enhancing scene structure in prosthetic vision using iso-disparity contour perturbance maps
WO2023103145A1 (en) Head pose truth value acquisition method, apparatus and device, and storage medium
Taher et al. An extended eye movement tracker system for an electric wheelchair movement control
Mathews et al. Residual networks and deep-densely connected networks for the classification of retinal OCT images
Al-Shakarchy et al. Open and closed eyes classification in different lighting conditions using new convolution neural networks architecture
CN108416788B (en) Edge detection method based on receptive field and light and shade adaptability thereof
Tang et al. Unity Human Eye Model for Gaze Tracking with a Query-Driven Dynamic Vision Sensor
Zhao et al. A Tongue Color Classification Method in TCM Based on Transfer Learning
Wang et al. Research on an effective human action recognition model based on 3D CNN
Miao et al. Research on deep learning in the detection and classification of diabetic retinopathy
Zhao Research and application of deep learning in image recognition
CN116246067B (en) CoA Unet-based medical image segmentation method
CN117252239B (en) Method and system for detecting target of pulse signal

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20914627

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20914627

Country of ref document: EP

Kind code of ref document: A1