WO2022199360A1 - Moving object positioning method and apparatus, electronic device, and storage medium - Google Patents

Moving object positioning method and apparatus, electronic device, and storage medium Download PDF

Info

Publication number
WO2022199360A1
WO2022199360A1 PCT/CN2022/079340 CN2022079340W WO2022199360A1 WO 2022199360 A1 WO2022199360 A1 WO 2022199360A1 CN 2022079340 W CN2022079340 W CN 2022079340W WO 2022199360 A1 WO2022199360 A1 WO 2022199360A1
Authority
WO
WIPO (PCT)
Prior art keywords
event
threshold
thresholds
events
target
Prior art date
Application number
PCT/CN2022/079340
Other languages
French (fr)
Chinese (zh)
Inventor
吴臻志
马欣
祝夭龙
Original Assignee
北京灵汐科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京灵汐科技有限公司 filed Critical 北京灵汐科技有限公司
Publication of WO2022199360A1 publication Critical patent/WO2022199360A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/246Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • G06T7/73Determining position or orientation of objects or cameras using feature-based methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30241Trajectory
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/44Event detection

Definitions

  • the present disclosure relates to the technical field of image recognition, and in particular, to a method, an apparatus, an electronic device, and a computer-readable storage medium for locating a moving object.
  • the image recognition technology usually extracts the acquired video image from the global image through the image classification model, and judges whether there is a moving object in the image according to the extracted image features, and determines whether the moving object exists in the image. position.
  • Embodiments of the present disclosure provide a method, apparatus, electronic device, and computer-readable storage medium for locating a moving object, so as to locate the moving object in an image.
  • an embodiment of the present disclosure provides a method for locating a moving object, where the method includes:
  • the number of pixels corresponding to at least two event number thresholds in the event number threshold set is obtained, and the pixel points corresponding to the event number threshold are correspondingly generated events greater than or equal to the event number threshold.
  • an embodiment of the present disclosure provides a positioning device for a moving object, and the positioning device includes:
  • an event frame acquisition module used for acquiring event stream information through a dynamic vision sensor, and acquiring sampling event frames according to the event stream information
  • a threshold acquisition module configured to acquire, according to the sampled event frame, the number of pixels corresponding to at least two event number thresholds in the event number threshold set respectively, where the pixel points corresponding to the event number threshold are correspondingly generated event numbers greater than or Pixels equal to the threshold for the number of events; determine the threshold for the number of events of interest according to the number of pixels corresponding to the at least two thresholds for the number of events;
  • a location area acquisition module configured to determine a target pixel point in the sampled event frame whose number of events is greater than or equal to the target event number threshold, and determine a location area of a moving object according to the target pixel point.
  • an embodiment of the present disclosure provides an electronic device, the electronic device comprising:
  • processors one or more processors
  • memory for storing one or more programs
  • the one or more processors When the one or more programs are executed by the one or more processors, the one or more processors implement the method for locating a moving object according to any embodiment of the present disclosure.
  • an embodiment of the present disclosure further provides a computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, implements the method for locating a moving object described in any embodiment of the present disclosure.
  • the target event number threshold is determined according to the number of pixels corresponding to the at least two event number thresholds, and the corresponding generated event number is greater than or equal to the number of events.
  • the target pixel points of the target event threshold and finally determine the position area of the moving object according to all the target pixel points, so as to realize the positioning of the moving object, and there is no need to perform image feature extraction and calculation process when positioning the moving object, which effectively saves the calculation.
  • the recognition efficiency of moving objects is improved, and the accurate positioning of moving objects of small volume can be effectively realized.
  • FIG. 1 is a schematic flowchart of a method for locating a moving object according to an embodiment of the present disclosure
  • FIG. 2 is a schematic flowchart of another method for locating a moving object according to an embodiment of the present disclosure
  • FIG. 3 is a schematic flowchart of another method for locating a moving object according to an embodiment of the present disclosure
  • FIG. 4 is a schematic flowchart of another method for locating a moving object according to an embodiment of the present disclosure
  • FIG. 5 is a structural block diagram of a device for positioning a moving object according to an embodiment of the present disclosure
  • FIG. 6 is a schematic structural diagram of an electronic device according to an embodiment of the present disclosure.
  • FIG. 1 is a schematic flowchart of a method for locating a moving object provided by an embodiment of the present disclosure.
  • the positioning method provided by the embodiment of the present disclosure can be used to locate a moving object in a video image.
  • the device may be implemented by software and/or hardware, and integrated into an electronic device, and the method may include the following steps: Steps S110 to S140.
  • Step S110 Acquire event stream information through a dynamic vision sensor, and acquire sampled event frames according to the event stream information.
  • Dynamic Vision Sensor is an image acquisition device that adopts pixel asynchronous mechanism and is based on address and event expression (AER); , and sequentially read all the pixel information in each "frame", DVS does not need to read all the pixels in the picture, but only needs to obtain the address and information of the pixels whose light intensity changes.
  • AER address and event expression
  • an event signal of the pixel point is sent out.
  • the light intensity change is a positive change, that is, the brightness of the pixel jumps from low brightness to high brightness, an event signal represented by "+1" is sent out and marked as a positive event;
  • the light intensity changes as Negative change that is, the pixel jumps from high brightness to low brightness, an event signal represented by "-1" is sent, and it is marked as a negative event;
  • no event signal is sent. , marked as no events.
  • the dynamic vision sensor forms event flow information by marking the event signal generated by each pixel point, and the event flow information records the event situation generated by each pixel point in the picture collected by the dynamic vision sensor.
  • the light intensity of the corresponding pixels in the area where the moving object passes in the picture will change to different degrees.
  • the light intensity will increase significantly.
  • the light intensity of the pixels in the disappearing area of the moving object will be significantly reduced. Therefore, according to the event stream information, it can be determined which pixels in the picture may have moving objects.
  • the event stream information collected by the dynamic vision sensor may be sampled according to a preset sampling period to obtain sampled event frames.
  • the pixel may be a pixel related to a moving object.
  • the sampling event frame is an image frame displayed after summarizing all the labeling events of each pixel in the captured picture within the preset sampling period, which is used to describe the events (such as positive events or negative events) that occur at all pixels in the picture. ).
  • the preset sampling period can be set according to actual needs. For example, in order to improve the detection efficiency of moving objects in video images, the preset sampling period can be set to a lower value; in order to reduce the processing pressure of the sampled images, the preset sampling period can be set to a higher value; especially Yes, due to the high detection accuracy of DVS, the detection efficiency of the event signal of the pixel point can reach the nanosecond level (for example, 1000 nanoseconds, that is, the event signal of the pixel point can be obtained once every 1000 nanoseconds), and the preset The sampling period can usually be set to the millisecond level (for example, 10 milliseconds). Therefore, in one sampling period, the light intensity of a pixel may experience multiple changes, that is, the DVS sends out multiple event signals for a pixel, That is to say, a pixel generates multiple events.
  • the nanosecond level for example, 1000 nanoseconds, that is, the event signal of the pixel point can be obtained once every 1000 nanoseconds
  • the preset The sampling period can
  • Step S120 Acquire the number of pixels corresponding to at least two event number thresholds in the event number threshold set according to the sampled event frame.
  • Step S130 Determine the target event number threshold according to the number of pixels corresponding to the at least two event number thresholds respectively; wherein, in the at least two event number thresholds, the pixel corresponding to each event number threshold is the corresponding generated event number Pixels greater than or equal to the threshold of the number of events.
  • the pixel is a pixel corresponding to the threshold of the number of events.
  • the threshold for the number of events refers to the minimum number of times that the DVS sends an event signal for the same pixel in a sampling period. For example, if there are two event count thresholds in the event count threshold set, one of the event count thresholds is configured to be 5, and the other event count threshold is configured to be 6, then in step S120, obtain the event signal generated in the sampling event frame within the sampling period. The number of pixel points whose times are greater than or equal to 5 times, and the number of pixel points whose times are greater than or equal to 6 times in the sampling event frame in the acquisition sampling period.
  • the higher the threshold of the number of events the fewer the number of pixels whose number of events is greater than or equal to the threshold of the number of events. The number is small, so it may not be able to accurately describe the actual location area of the moving object; and the lower the event count threshold, the more the number of pixels with the event count greater than or equal to the event count threshold, the more likely the area where these pixels are located. Noise points (that is, falsely detected interference points), but due to the large number of these pixel points, it can more accurately describe the actual motion area of the moving object.
  • the event count threshold set is a preconfigured set consisting of at least two event count thresholds, and the minimum event count threshold in the event count threshold set may be preset according to actual needs, for example, the minimum event count threshold may be If it is set to 1, the threshold of the minimum number of events can also be set to other smaller values.
  • the threshold of the maximum number of events in the event number threshold set can be preset according to actual needs. For example, the threshold of the maximum number of events can be set to a larger value. Numeric value (eg, 50).
  • the event number threshold set may include multiple consecutive event number thresholds.
  • the event number threshold set includes 1 to 50, for a total of 50 event number thresholds, that is, at In step S130, among the above-mentioned 50 event times thresholds, a target event times threshold is determined.
  • Step S140 Determine the target pixel points in the sampling event frame with the corresponding event times greater than or equal to the target event times threshold, and determine the location area of the moving object according to the target pixel points.
  • step S140 after determining the target pixel points in the sampling event frame whose number of events is greater than or equal to the threshold of the number of target events, all target pixels are divided into one or more densely distributed areas according to the proximity principle, wherein, if There is only one moving object in the sampling event frame, then there is a dense distribution area of target pixels in the sampling event frame. If there are multiple moving objects in the sampling event frame, then there are multiple target pixels in the sampling event frame. area; by connecting the outer edge pixels of the densely distributed area of the target pixels, the real contour information of the moving objects in the area can be obtained, so that the position area of the moving objects can be determined.
  • the target event number threshold is determined according to the number of pixels corresponding to at least two event number thresholds respectively, and then the corresponding generated event number threshold is obtained.
  • the target pixels whose number of events is greater than or equal to the threshold of the number of target events will finally determine the location area of the moving object according to all the target pixels, so as to realize the positioning of the moving object, and there is no need to perform image feature extraction and
  • the calculation process effectively saves computing resources, improves the recognition efficiency of moving objects, and can effectively achieve accurate positioning for small-volume moving objects.
  • FIG. 2 is a schematic flowchart of another method for locating a moving object according to an embodiment of the present disclosure.
  • at least two events in the event count threshold set are obtained according to sampled event frames.
  • the positioning method may further include: step S111 and step S112.
  • Step S111 according to the sampled event frame, determine a candidate pixel point having at least one event.
  • Step S112 Determine the maximum matching event number threshold according to the number of candidate pixels to determine the event number threshold set.
  • the sampling event frame records events (such as positive events or negative events) that occur at all pixel points in the picture collected by the dynamic vision sensor, so according to the sampling event frame, all corresponding pixel points that generate at least one event can be determined, as an alternative pixel.
  • events such as positive events or negative events
  • the larger the number of candidate pixel points the larger the position area occupied by the target moving object in the image, or the larger the sum of the position areas occupied by multiple moving objects in the image.
  • the maximum number of events threshold in the event number threshold set can be set to a small value to obtain as many pixels as possible; The smaller the number, the smaller the location area occupied by the target moving object in the image, or the smaller the sum of the location area occupied by multiple moving objects in the image.
  • only a smaller number of pixels are needed to plan the moving object.
  • the maximum event number threshold in the event number threshold set can be set to a larger value to reduce the occurrence of noise points.
  • the matching maximum event number threshold is obtained, thereby determining the event number threshold set, which can effectively improve the acquisition efficiency of the target event number threshold.
  • the event number threshold set can be configured to include from 1 Consecutive values up to the maximum number of events threshold.
  • the step of determining the maximum number of events threshold for matching according to the number of candidate pixels may further include: obtaining, according to a predetermined correspondence between the number of candidate pixels and the threshold for the maximum number of events, obtaining Threshold for the maximum number of events to match the number of candidate pixels.
  • the correspondence between the number of candidate pixel points and the threshold of the maximum number of events may be obtained through a pixel threshold comparison table or a preset calculation rule.
  • the pixel threshold comparison table is used to describe the corresponding relationship between the number of candidate pixels and the maximum event threshold. After the number of candidate pixels is obtained, the number interval in which the number of candidate pixels is located can be used to pass the pixel threshold. Look up the corresponding maximum number of events threshold according to the table; you can also take the number of candidate pixel points as a known parameter and bring it into the pre-built calculation formula according to the preset calculation rule to obtain the corresponding maximum number of events threshold, which calculates The formula can be set according to actual needs.
  • the step of determining the target event number threshold according to the number of pixels corresponding to the at least two event number thresholds respectively may further include: according to the at least two event number thresholds respectively corresponding to The number of pixel points, determine the threshold of the number of critical events, and use the threshold of the number of critical events as the threshold of the number of target events.
  • the step of determining the target event number threshold according to the number of pixels corresponding to the at least two event number thresholds respectively may further include: according to the at least two event number thresholds respectively corresponding to The number of pixel points determines the threshold of the number of critical events; according to the threshold of the maximum number of events and the threshold of the number of critical events in the event number threshold set, the threshold of the number of intermediate events is determined, and the threshold of the number of intermediate events is used as the target event number threshold.
  • the threshold for the number of critical events that is, the threshold for the number of events before the number of pixels increases substantially, the threshold for the number of critical events may be used as the threshold for the number of target events.
  • the intermediate event count threshold between the critical event count threshold and the maximum event count threshold in the set of event count thresholds may also be used as a screening condition
  • the intermediate event number threshold is used as the target event number threshold to obtain target pixels corresponding to the intermediate event number threshold, and then determine the location area of the moving object; for example, the critical event number threshold is 7, and the maximum event number threshold is 11 , correspondingly, the intermediate event number threshold (ie, 9) between the two is selected as the filtering condition for obtaining the target pixel.
  • the step of determining a critical event number threshold according to the number of pixels corresponding to the at least two event number thresholds may further include: setting each adjacent two event number thresholds in the event number threshold set The number of corresponding pixel points is subjected to difference operation, and the difference result is obtained; in each difference result, the target difference result with the largest value is selected, and the difference between the two event times thresholds corresponding to the target difference result is compared. A large value is used as the threshold for the number of critical events. Wherein, for every two adjacent event number thresholds in the event number threshold set, the result of the difference between the number of pixels corresponding to the two adjacent event number thresholds is the number of pixels corresponding to the adjacent two event number thresholds The absolute value of the difference.
  • the event number threshold set includes multiple consecutive event number thresholds, by obtaining the number of pixels corresponding to each event number threshold, the difference between the number of pixels corresponding to each adjacent two event number thresholds is calculated. , and according to the difference results of the above statistics, obtain two event count thresholds related to the maximum difference result, and select the larger one among the above two event count thresholds as the critical event count threshold;
  • the count threshold set includes 8 event count thresholds. The values of the 8 event count thresholds are 11, 10, 9, 8, 7, 6, 5, and 4, respectively, and the corresponding pixel numbers are 80,000 and 100,000 respectively.
  • the difference between the number of pixels corresponding to the thresholds for the number of adjacent events is 20,000, 20,000, 30,000, 30,000, 20,000, 70,000, and 30,000.
  • the difference with the largest value is 70,000, and the corresponding two event thresholds are 6 and 5, respectively. Therefore, the event threshold 6 is determined as the critical event threshold. .
  • the above adjacent number of pixels is determined.
  • the larger value of the two event count thresholds is used as the critical event count threshold.
  • the step of determining the threshold for the number of critical events according to the number of pixels corresponding to the at least two thresholds for the number of events may further include: if any adjacent two thresholds for the number of events are acquired If the difference calculation result of the number of pixels corresponding to the event number thresholds is greater than or equal to the preset number threshold, the larger value of the two adjacent event number thresholds is used as the critical event number threshold.
  • the calculation result of the difference between the numbers of pixels corresponding to two adjacent event times thresholds is the absolute value of the difference between the numbers of pixels corresponding to two adjacent event times thresholds.
  • the step of determining the critical event count threshold according to the number of pixels corresponding to the at least two event count thresholds may further include: if any adjacent two event count threshold sets are acquired When the ratio between the difference calculation result of the number of pixels corresponding to the thresholds of the number of events of each event and the total number of pixels of the sampled event frame is greater than or equal to the preset percentage threshold, the adjacent two thresholds of the number of events will be divided into The larger value of , as the threshold for the number of critical events.
  • the preset number threshold is 50,000, and for the above-mentioned adjacent event number threshold 6 and event number threshold 5, the difference between the number of pixels corresponding to the two is 70,000, and the difference is greater than 70,000.
  • the preset number threshold is 50,000. Therefore, the larger of the event number threshold 6 and the event number threshold 5, that is, 6 is used as the critical event number threshold. At this time, there is no need to perform other adjacent event number thresholds.
  • the corresponding pixels The difference operation between the number of points or the ratio operation between the difference operation result and the total number of pixel points reduces the amount of data calculation and improves the acquisition speed of the threshold for the number of critical events.
  • the preset percentage threshold is 10%. Therefore, the larger of the event number threshold 6 and the event number threshold 5, that is, 6 is used as the critical event number threshold. At this time, there is no need to perform other adjacent event number thresholds.
  • the corresponding pixels The difference operation between the number of points or the ratio operation between the difference operation result and the total number of pixel points reduces the amount of data calculation and improves the acquisition speed of the threshold for the number of critical events.
  • FIG. 3 is a schematic flowchart of another method for locating a moving object according to an embodiment of the present disclosure.
  • a candidate having at least one event is determined.
  • the positioning method further includes: step S113.
  • Step S113 perform side suppression processing on the region where the candidate pixel points are located in the sampling event frame.
  • the step of obtaining the number of pixels corresponding to at least two event count thresholds in the event count threshold set may further include: obtaining the event according to the sampled event frame after the side suppression processing. The number of pixels corresponding to at least two event count thresholds in the count threshold set.
  • lateral inhibition is the inhibitory effect that occurs between adjacent neurons, that is, when a neuron is stimulated and excited, the adjacent neurons are stimulated again, and the latter (that is, the above-mentioned similar neurons) will occur.
  • the inhibitory effect of the excitation on the former that is, the above-mentioned certain neuron
  • the lateral inhibition is essentially the phenomenon of mutual inhibition between adjacent receptors; in some embodiments of the present disclosure, the area where the candidate pixel points are located After the side suppression processing is performed, the display effect of the candidate pixels can be enhanced, and the background pixels in the area can be suppressed.
  • determining the location area of the moving object according to the target pixel point may further include: marking the location area of the moving object through a region of interest frame according to the target pixel point.
  • the region of interest is a box, circle, ellipse and polygon to outline the area that needs to be processed, because the acquired contour information of the moving object is usually an irregular figure, which is inconvenient in the image.
  • the smallest square that also includes the outline of the moving object can be marked in the image by means of a square marking frame, and the area within the square marking frame and the square marking frame is the area of the moving object. location area.
  • FIG. 4 is a schematic flowchart of another method for locating a moving object according to an embodiment of the present disclosure.
  • the positioning method further includes: step S141.
  • Step S141 determining the moving trajectory of the moving object according to the position regions of the moving object in the multiple sampling event frames, and determining whether the moving trajectory is the target trajectory through the trained image classification model.
  • the image classification model is a classification model that is pre-trained based on sample images. Its function is to extract image features and obtain feature vectors for the input image information, and then output the corresponding image classification probability according to the obtained feature vectors.
  • Image classification The probability represents the probability that the input image information is a positive sample or a negative sample, and then classify according to the image classification probability (ie binary classification) to determine whether the input image is a target trajectory; among them, the type of the target trajectory is determined by the positive sample image.
  • the trajectory type determines, for example, the high-altitude parabolic trajectory is used as the target trajectory, and whether the moving trajectory of the moving object in the image is a high-altitude parabolic trajectory is determined to determine whether the moving trajectory of the moving object in the sampling event frame is a high-altitude parabolic trajectory.
  • the positioning method before judging whether the moving track is a target track through the image classification model completed by training, the positioning method further includes: constructing an initial image classification model based on a convolutional neural network, and pairing the image with a sample image set.
  • the initial image classification model performs image recognition and classification training to obtain a trained image classification model.
  • CNN Convolutional Neural Networks
  • DNN Deep Learning
  • Convolutional Neural Networks is a feedforward neural network (Feedforward Neural Networks) with deep structure including convolution calculation in Deep Learning (Deep Learning), which is characterized in that the application of convolution operation improves the The extraction accuracy of image features, and the application of pooling layer reduces the computational complexity of image features.
  • the positive sample image is a high-altitude parabolic trajectory image, and the output value of the positive sample image is 1;
  • the negative sample image is image information that does not include high-altitude parabolic trajectory or high-altitude falling object trajectory, for example, the flight trajectory of birds.
  • images such as images of silhouette flashing trajectory images, pixel blank images, and moving trajectory images of upper paraboloids.
  • the output value of negative sample images is 0.
  • the initial image classification model is trained by a sample image set composed of positive sample images and negative sample images, so that the trained image classification model has image recognition and classification capabilities.
  • the target event number threshold is determined according to the number of pixels corresponding to at least two event number thresholds respectively, and then the corresponding generated event number threshold is obtained.
  • the target pixels whose number of events is greater than or equal to the threshold of the number of target events will finally determine the location area of the moving object according to all the target pixels, so as to realize the positioning of the moving object, and there is no need to perform image feature extraction and
  • the calculation process effectively saves computing resources, improves the recognition efficiency of moving objects, and can effectively achieve accurate positioning for small-volume moving objects.
  • FIG. 5 is a structural block diagram of a device for positioning a moving object according to an embodiment of the present disclosure.
  • the positioning device 200 specifically includes: an event frame obtaining module 210 , a threshold value obtaining module 220 and a location area obtaining module 230 .
  • the event frame obtaining module 210 is configured to obtain event flow information through the dynamic vision sensor, and obtain sampled event frames according to the event flow information.
  • the threshold obtaining module 220 is used to: obtain the number of pixels corresponding to at least two event times thresholds in the event times threshold set according to the sampled event frame, wherein the pixel corresponding to the event times threshold is the corresponding generated event times greater than or equal to the number of pixels. Pixels of the event count threshold; determine the target event count threshold according to the number of pixels corresponding to at least two event count thresholds respectively.
  • the location area acquisition module 230 is configured to determine the target pixel points in the sampled event frame with the corresponding event times greater than or equal to the target event times threshold, and determine the location area of the moving object according to the target pixel points.
  • the target event number threshold is determined according to the number of pixels corresponding to at least two event number thresholds respectively, and then the corresponding generated event number threshold is determined.
  • the target pixels whose number of events is greater than or equal to the threshold of the number of target events will finally determine the location area of the moving object according to all the target pixels, so as to realize the positioning of the moving object, and there is no need to perform image feature extraction and
  • the calculation process effectively saves computing resources, improves the recognition efficiency of moving objects, and can effectively achieve accurate positioning for small-volume moving objects.
  • the apparatus 200 for positioning a moving object may further include: a candidate pixel point acquisition module and a threshold set determination module.
  • the candidate pixel point acquiring module is configured to determine candidate pixel points having at least one event according to the sampled event frame.
  • the threshold set determination module is configured to determine the maximum matching event number threshold according to the number of candidate pixels, so as to determine the event number threshold set.
  • the threshold set determination module is configured to obtain the maximum event matching the number of candidate pixels according to the predetermined correspondence between the number of candidate pixels and the threshold of the maximum number of events number of thresholds.
  • the threshold obtaining module 220 is configured to determine the threshold of the number of critical events according to the number of pixels corresponding to the at least two thresholds of the number of events, and use the threshold of the number of critical events as the target event number of thresholds.
  • the threshold value acquisition module 220 is used for: according to the number of pixel points corresponding to at least two event times thresholds respectively, determine the critical event times threshold; according to the maximum event times threshold and the critical event times threshold in the event times threshold set, determine the intermediate event Threshold for the number of times, and use the threshold for the number of intermediate events as the threshold for the number of target events.
  • the threshold value obtaining module 220 specifically includes: a difference value result obtaining unit and a threshold value obtaining unit.
  • the difference result obtaining unit is configured to perform a difference operation on the number of pixels corresponding to every two adjacent event times thresholds in the event times threshold set, and obtain a difference result.
  • the threshold obtaining unit is used to select the target difference result with the largest value among the difference results, and use the larger value of the two event count thresholds corresponding to the target difference result as the critical event count threshold.
  • the threshold obtaining module 220 is configured to obtain a difference operation result of the number of pixels corresponding to any two adjacent event count thresholds in the event count threshold set being greater than or equal to
  • the threshold acquisition module 220 is configured to obtain the ratio of the difference calculation result of the number of pixels corresponding to any two adjacent event number thresholds in the event number threshold set to the total number of pixels in the sampling event frame. , when it is greater than or equal to the preset percentage threshold, the larger value of the two adjacent event times thresholds is used as the critical event times threshold.
  • the location area acquisition module 230 is configured to mark the location area of the moving object through the area of interest frame according to the target pixel point.
  • the apparatus 200 for positioning a moving object further includes: a side restraint processing execution module.
  • the side suppression processing execution module is used for performing side suppression processing on the region where the candidate pixel points are located in the sampling event frame.
  • the threshold obtaining module 220 is configured to obtain the number of pixels corresponding to at least two event count thresholds in the event count threshold set according to the sampled event frame after side suppression processing.
  • the apparatus 200 for positioning a moving object further includes: a movement track acquisition module.
  • the moving track acquisition module is used to determine the moving track of the moving object according to the position area of the moving object in the multiple sampling event frames, and judge whether the moving track is the target track through the image classification model completed by training.
  • the moving object positioning apparatus 200 further includes: an image classification model acquisition module.
  • the image classification model acquisition module is used to construct an initial image classification model based on the convolutional neural network, and perform image recognition and classification training on the initial image classification model through the sample image set to obtain the trained image classification model.
  • the above apparatus can execute the method for positioning a moving object provided by any embodiment of the present disclosure, and has functional modules and beneficial effects corresponding to the execution method.
  • the above apparatus can execute the method for positioning a moving object provided by any embodiment of the present disclosure, and has functional modules and beneficial effects corresponding to the execution method.
  • FIG. 6 is a schematic structural diagram of an electronic device according to an embodiment of the present disclosure.
  • FIG. 6 shows a block diagram of an exemplary electronic device 12 suitable for use in implementing embodiments of the present disclosure.
  • the electronic device 12 shown in FIG. 6 is only an example, and should not impose any limitation on the function and scope of use of the embodiments of the present disclosure.
  • the electronic device 12 takes the form of a general-purpose computing device.
  • Components of electronic device 12 may include, but are not limited to, one or more processors or processing units 16 , memory 28 , and a bus 18 connecting various system components including memory 28 and processing unit 16 .
  • Bus 18 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, a graphics acceleration port, a processor, or a local bus using any of a variety of bus structures.
  • these architectures include, but are not limited to, Industry Standard Architecture (ISA) bus, Micro Channel Architecture (MAC) bus, Enhanced ISA bus, Video Electronics Standards Association (VESA) local bus, and Peripheral Component Interconnect ( PCI) bus.
  • Electronic device 12 typically includes a variety of computer system readable media. These media can be any available media that can be accessed by electronic device 12, including both volatile and non-volatile media, removable and non-removable media.
  • Memory 28 may include computer system readable media in the form of volatile memory, such as random access memory (RAM) 30 and/or cache memory 32 .
  • Electronic device 12 may further include other removable/non-removable, volatile/non-volatile computer system storage media.
  • storage system 34 may be used to read and write to non-removable, non-volatile magnetic media (not shown in FIG. 3, commonly referred to as a "hard disk drive”).
  • a disk drive may be provided for reading and writing to removable non-volatile magnetic disks (eg "floppy disks"), as well as removable non-volatile optical disks (eg CD-ROM, DVD-ROM) or other optical media) to read and write optical drives.
  • each drive may be connected to bus 18 through one or more data media interfaces.
  • Memory 28 may include at least one program product having a set (eg, at least one) of program modules configured to perform the functions of various embodiments of the present disclosure.
  • a program/utility 40 having a set (at least one) of program modules 42, which may be stored, for example, in memory 28, such program modules 42 including, but not limited to, an operating system, one or more application programs, other program modules, and program data , each or some combination of these examples may include an implementation of a network environment.
  • Program modules 42 generally perform the functions and/or methods of the embodiments described in this disclosure.
  • the electronic device 12 may also communicate with one or more external devices 14 (eg, a keyboard, pointing device, display 24, etc.), with one or more devices that enable a user to interact with the electronic device 12, and/or with Any device (eg, network card, modem, etc.) that enables the electronic device 12 to communicate with one or more other computing devices. Such communication may take place through input/output (I/O) interface 22 . Also, the electronic device 12 may communicate with one or more networks (eg, a local area network (LAN), a wide area network (WAN), and/or a public network such as the Internet) through a network adapter 20 . As shown, network adapter 20 communicates with other modules of electronic device 12 via bus 18 . It should be understood that, although not shown, other hardware and/or software modules may be used in conjunction with electronic device 12, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives and data backup storage systems.
  • the processing unit 16 executes various functional applications and data processing by running the programs stored in the memory 28, for example, implementing the method for positioning a moving object provided by any embodiment of the present disclosure.
  • An embodiment of the present disclosure also provides a computer-readable storage medium, on which a computer program is stored, and when the program is executed by a processor, implements the method for locating a moving object according to any embodiment of the present disclosure.
  • the computer storage medium of the embodiments of the present disclosure may adopt any combination of one or more computer-readable media.
  • the computer-readable medium may be a computer-readable signal medium or a computer-readable storage medium.
  • a computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above.
  • a computer-readable storage medium can be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a propagated data signal in baseband or as part of a carrier wave, with computer-readable program code embodied thereon. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • a computer-readable signal medium can also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device .
  • Program code embodied on a computer readable medium may be transmitted using any suitable medium, including - but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
  • Computer program code for carrying out operations of the present disclosure may be written in one or more programming languages, including object-oriented programming languages—such as Java, Smalltalk, C++, but also conventional Procedural programming language - such as the "C" language or similar programming language.
  • the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server.
  • the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (eg, using an Internet service provider through Internet connection).
  • LAN local area network
  • WAN wide area network

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Image Analysis (AREA)

Abstract

A moving object positioning method, comprising: acquiring event stream information by means of a dynamic vision sensor, and acquiring a sampling event frame according to the event stream information; acquiring, according to the sampling event frame, the number of pixels respectively corresponding to at least two event count thresholds in an event count threshold set; determining a target event count threshold according to the number of pixels respectively corresponding to the at least two event count thresholds; and determining a target pixel the number of corresponding generated events of which is greater than or equal to the target event count threshold, and determining a location area of a moving object according to the target pixel. Also disclosed are a moving object positioning apparatus, an electronic device, and a storage medium.

Description

运动物体的定位方法、装置、电子设备及存储介质Positioning method, device, electronic device and storage medium for moving objects 技术领域technical field
本公开涉及图像识别技术领域,尤其涉及一种运动物体的定位方法、装置、电子设备及计算机可读存储介质。The present disclosure relates to the technical field of image recognition, and in particular, to a method, an apparatus, an electronic device, and a computer-readable storage medium for locating a moving object.
背景技术Background technique
随着科技的不断进步,图像识别技术得到了迅速发展,被广泛应用于各个领域,其中对于图像中运动物体的定位,成为了图像识别技术的重要分支。With the continuous advancement of science and technology, image recognition technology has developed rapidly and is widely used in various fields. The positioning of moving objects in images has become an important branch of image recognition technology.
在相关技术中,图像识别技术,通常是将获取到的视频图像,通过图像分类模型,在全局图像中进行特征提取,并根据提取到的图像特征判断图像中是否存在运动物体,并对运动物体定位。In the related art, the image recognition technology usually extracts the acquired video image from the global image through the image classification model, and judges whether there is a moving object in the image according to the extracted image features, and determines whether the moving object exists in the image. position.
但是这样的图像识别方式,图像特征的提取计算量极大,导致对运动物体的定位速度较慢,难以满足运动物体的实时定位,尤其对于体积较小的运动物体,定位效果较差。However, in such an image recognition method, the extraction of image features requires a large amount of calculation, resulting in a slow positioning speed of moving objects, and it is difficult to meet the real-time positioning of moving objects, especially for small moving objects, the positioning effect is poor.
发明内容SUMMARY OF THE INVENTION
本公开实施例提供了一种运动物体的定位方法、装置、电子设备及计算机可读存储介质,以定位图像中的运动物体。Embodiments of the present disclosure provide a method, apparatus, electronic device, and computer-readable storage medium for locating a moving object, so as to locate the moving object in an image.
第一方面,本公开实施例提供了一种运动物体的定位方法,该定位方法包括:In a first aspect, an embodiment of the present disclosure provides a method for locating a moving object, where the method includes:
通过动态视觉传感器获取事件流信息,并根据所述事件流信息获取采样事件帧;Obtain event flow information through a dynamic vision sensor, and obtain sampled event frames according to the event flow information;
根据所述采样事件帧,获取事件次数阈值集合中至少两个事件次数阈值分别对应的像素点的数量,所述事件次数阈值对应的像素点为对应产生的事件次数大于或等于所述事件次数阈值的像素点;According to the sampled event frame, the number of pixels corresponding to at least two event number thresholds in the event number threshold set is obtained, and the pixel points corresponding to the event number threshold are correspondingly generated events greater than or equal to the event number threshold. pixel point;
根据所述至少两个事件次数阈值分别对应的像素点的数量,确定目标事件次数阈值;Determine the target event number threshold according to the number of pixels corresponding to the at least two event number thresholds respectively;
确定所述采样事件帧中对应产生的事件次数大于或等于所述目标事件次数阈值的目标像素点,并根据所述目标像素点确定运动物体的位置区域。Determine the target pixel points in the sampled event frame with the corresponding event times greater than or equal to the target event times threshold, and determine the position area of the moving object according to the target pixel points.
第二方面,本公开实施例提供了一种运动物体的定位装置,该定位装置包括:In a second aspect, an embodiment of the present disclosure provides a positioning device for a moving object, and the positioning device includes:
事件帧获取模块,用于通过动态视觉传感器获取事件流信息,并根据所述事件流信息获取采样事件帧;an event frame acquisition module, used for acquiring event stream information through a dynamic vision sensor, and acquiring sampling event frames according to the event stream information;
阈值获取模块,用于根据所述采样事件帧,获取事件次数阈值集合中至少两个事件次数阈值分别对应的像素点的数量,所述事件次数阈值对应的像素点为对应产生的事件次数大于或等于所述事件次数阈值的像素点;根据所述至少两个事件次数阈值分别对应的像素点的数量,确定目标事件次数阈值;A threshold acquisition module, configured to acquire, according to the sampled event frame, the number of pixels corresponding to at least two event number thresholds in the event number threshold set respectively, where the pixel points corresponding to the event number threshold are correspondingly generated event numbers greater than or Pixels equal to the threshold for the number of events; determine the threshold for the number of events of interest according to the number of pixels corresponding to the at least two thresholds for the number of events;
位置区域获取模块,用于确定所述采样事件帧中对应产生的事件次数大于或等于所述目标事件次数阈值的目标像素点,并根据所述目标像素点确定运动物体的位置区域。A location area acquisition module, configured to determine a target pixel point in the sampled event frame whose number of events is greater than or equal to the target event number threshold, and determine a location area of a moving object according to the target pixel point.
第三方面,本公开实施例提供了一种电子设备,该电子设备包括:In a third aspect, an embodiment of the present disclosure provides an electronic device, the electronic device comprising:
一个或多个处理器;one or more processors;
存储器,用于存储一个或多个程序,memory for storing one or more programs,
当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现本公开任意实施例所述的运动物体的定位方法。When the one or more programs are executed by the one or more processors, the one or more processors implement the method for locating a moving object according to any embodiment of the present disclosure.
第四方面,本公开实施例还提供了一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现本公开任意实施例所述的运动物体的定位方法。In a fourth aspect, an embodiment of the present disclosure further provides a computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, implements the method for locating a moving object described in any embodiment of the present disclosure.
根据本公开实施例公开的技术方案,在获取到采样事件帧后,根据至少两个事件次数阈值分别对应的像素点的数量,确定目标事件次数阈值,进而获取对应产生的事件次数大于或等于该目标事件阈值的目标像素点,最终根据所有目标像素点确定运动物体的位置区域,实现对运动物体的定位,且在对运动物体进行定位时无需进行图像特征的提取及计算过程,有效节省了计算资源,同时提高了运动物体的识别效率,能够有效实现针对较小体积运动物体的准确定位。According to the technical solutions disclosed in the embodiments of the present disclosure, after the sampling event frame is acquired, the target event number threshold is determined according to the number of pixels corresponding to the at least two event number thresholds, and the corresponding generated event number is greater than or equal to the number of events. The target pixel points of the target event threshold, and finally determine the position area of the moving object according to all the target pixel points, so as to realize the positioning of the moving object, and there is no need to perform image feature extraction and calculation process when positioning the moving object, which effectively saves the calculation. At the same time, the recognition efficiency of moving objects is improved, and the accurate positioning of moving objects of small volume can be effectively realized.
附图说明Description of drawings
图1为本公开实施例提供的一种运动物体的定位方法的流程示意图;1 is a schematic flowchart of a method for locating a moving object according to an embodiment of the present disclosure;
图2为本公开实施例提供的另一种运动物体的定位方法的流程示意图;FIG. 2 is a schematic flowchart of another method for locating a moving object according to an embodiment of the present disclosure;
图3为本公开实施例提供的另一种运动物体的定位方法的流程示意图;3 is a schematic flowchart of another method for locating a moving object according to an embodiment of the present disclosure;
图4为本公开实施例提供的另一种运动物体的定位方法的流程示意图;4 is a schematic flowchart of another method for locating a moving object according to an embodiment of the present disclosure;
图5为本公开实施例提供的一种运动物体的定位装置的结构框图;5 is a structural block diagram of a device for positioning a moving object according to an embodiment of the present disclosure;
图6为本公开实施例提供的一种电子设备的结构示意图。FIG. 6 is a schematic structural diagram of an electronic device according to an embodiment of the present disclosure.
具体实施方式Detailed ways
下面结合附图和实施例对本公开作进一步的详细说明。可以理解的是,此处所描述的 具体实施例仅仅用于解释本公开,而非对本公开的限定。另外还需要说明的是,为了便于描述,附图中仅示出了与本公开相关的部分而非全部结构。The present disclosure will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present disclosure, but not to limit the present disclosure. In addition, it should be noted that, for the convenience of description, the drawings only show some but not all structures related to the present disclosure.
图1为本公开实施例提供的一种运动物体的定位方法的流程示意图,本公开实施例提供的定位方法可用于对视频图像中运动物体的定位,该方法可以由本公开实施例中的运动物体的定位装置来执行,该装置可以通过软件和/或硬件实现,并集成在电子设备中,该方法可以包括如下步骤:步骤S110~步骤S140。1 is a schematic flowchart of a method for locating a moving object provided by an embodiment of the present disclosure. The positioning method provided by the embodiment of the present disclosure can be used to locate a moving object in a video image. The device may be implemented by software and/or hardware, and integrated into an electronic device, and the method may include the following steps: Steps S110 to S140.
步骤S110、通过动态视觉传感器获取事件流信息,并根据事件流信息获取采样事件帧。Step S110: Acquire event stream information through a dynamic vision sensor, and acquire sampled event frames according to the event stream information.
其中,动态视觉传感器(Dynamic Vision Sensor,DVS),是一种采用像素异步机制,并基于地址和事件表达(AER)的图像采集装置;区别于相关技术中以固定频率采集的“帧”为基础,并依次读取各“帧”中所有的像素信息的方式,DVS不需要对画面中的所有像素点进行读取,仅需要获取光强度变化的像素点的地址和信息。Among them, Dynamic Vision Sensor (DVS) is an image acquisition device that adopts pixel asynchronous mechanism and is based on address and event expression (AER); , and sequentially read all the pixel information in each "frame", DVS does not need to read all the pixels in the picture, but only needs to obtain the address and information of the pixels whose light intensity changes.
对于动态视觉传感器,当动态视觉传感器检测到某个像素点的光强度变化大于等于预设门限数值时,则发出该像素点的事件信号。其中,如果该光强度变化为正向变化,即该像素点的亮度由低亮度跳变至高亮度,则发出采用“+1”表示的事件信号,并标注为正事件;如果该光强度变化为负向变化,即该像素点由高亮度跳变至低亮度,则发出采用“-1”表示的事件信号,并标注为负事件;如果光强度变化小于预设门限数值,则不发出事件信号,标注为无事件。动态视觉传感器通过对各像素点产生的事件信号进行的事件标注,以构成事件流信息,事件流信息记录了动态视觉传感器采集的画面中各像素点产生的事件情况。For the dynamic vision sensor, when the dynamic vision sensor detects that the light intensity change of a certain pixel point is greater than or equal to a preset threshold value, an event signal of the pixel point is sent out. Among them, if the light intensity change is a positive change, that is, the brightness of the pixel jumps from low brightness to high brightness, an event signal represented by "+1" is sent out and marked as a positive event; if the light intensity changes as Negative change, that is, the pixel jumps from high brightness to low brightness, an event signal represented by "-1" is sent, and it is marked as a negative event; if the light intensity change is less than the preset threshold value, no event signal is sent. , marked as no events. The dynamic vision sensor forms event flow information by marking the event signal generated by each pixel point, and the event flow information records the event situation generated by each pixel point in the picture collected by the dynamic vision sensor.
相比于光亮强度变化较小的背景图像,画面中运动物体经过的区域,其对应的像素点的光亮强度会存在不同程度的变化,例如,运动物体出现时,运动物体出现区域的像素点的光亮强度会显著增加,运动物体消失时,运动物体消失区域的像素点的光亮强度会显著降低,因此,根据事件流信息,可以确定画面中哪些像素点可能存在运动物体。Compared with the background image with small changes in the light intensity, the light intensity of the corresponding pixels in the area where the moving object passes in the picture will change to different degrees. The light intensity will increase significantly. When the moving object disappears, the light intensity of the pixels in the disappearing area of the moving object will be significantly reduced. Therefore, according to the event stream information, it can be determined which pixels in the picture may have moving objects.
在一些实施例中,可以根据预设采样周期对动态视觉传感器采集的事件流信息进行采样,以获取采样事件帧。In some embodiments, the event stream information collected by the dynamic vision sensor may be sampled according to a preset sampling period to obtain sampled event frames.
在预设采样周期内,如果某个像素点的事件流信息中包括正事件或负事件,则该像素点可能是与运动物体相关的像素点。采样事件帧是在预设采样周期内,对采集的画面中每个像素点的所有标注事件进行汇总后显示的图像帧,用于描述画面中所有像素点发生的事件(如正事件或负事件)。Within the preset sampling period, if the event flow information of a certain pixel includes a positive event or a negative event, the pixel may be a pixel related to a moving object. The sampling event frame is an image frame displayed after summarizing all the labeling events of each pixel in the captured picture within the preset sampling period, which is used to describe the events (such as positive events or negative events) that occur at all pixels in the picture. ).
其中,可以根据实际需要设定预设采样周期。例如,为了提高视频图像中运动物体的检测效率,可以将预设采样周期设定为较低数值;为了降低对采样图像的处理压力,则可以将预设采样周期设定为较高数值;特别的,由于DVS的检测精度较高,对于像素点的事件信号的检测效率可以达到纳秒级(例如,1000纳秒,即每间隔1000纳秒可获取一次像素点的事件信号),而预设采样周期通常可以设定为毫秒级(例如,10毫秒),因此,在一个采样周期内,一个像素点的光强度可能经历了多次变化,即DVS针对一个像素点发出了多个事件信号,也即表示一个像素点产生多次事件。The preset sampling period can be set according to actual needs. For example, in order to improve the detection efficiency of moving objects in video images, the preset sampling period can be set to a lower value; in order to reduce the processing pressure of the sampled images, the preset sampling period can be set to a higher value; especially Yes, due to the high detection accuracy of DVS, the detection efficiency of the event signal of the pixel point can reach the nanosecond level (for example, 1000 nanoseconds, that is, the event signal of the pixel point can be obtained once every 1000 nanoseconds), and the preset The sampling period can usually be set to the millisecond level (for example, 10 milliseconds). Therefore, in one sampling period, the light intensity of a pixel may experience multiple changes, that is, the DVS sends out multiple event signals for a pixel, That is to say, a pixel generates multiple events.
步骤S120、根据采样事件帧,获取事件次数阈值集合中至少两个事件次数阈值分别对应的像素点的数量。Step S120: Acquire the number of pixels corresponding to at least two event number thresholds in the event number threshold set according to the sampled event frame.
步骤S130、根据至少两个事件次数阈值分别对应的像素点的数量,确定目标事件次数阈值;其中,在至少两个事件次数阈值中,每个事件次数阈值对应的像素点为对应产生的事件次数大于或等于该事件次数阈值的像素点。Step S130: Determine the target event number threshold according to the number of pixels corresponding to the at least two event number thresholds respectively; wherein, in the at least two event number thresholds, the pixel corresponding to each event number threshold is the corresponding generated event number Pixels greater than or equal to the threshold of the number of events.
在采样事件帧中,当一个像素点对应产生的事件次数大于或等于一个事件次数阈值时,则该像素点为该事件次数阈值对应的像素点。In the sampling event frame, when the number of events corresponding to a pixel is greater than or equal to a threshold of the number of events, the pixel is a pixel corresponding to the threshold of the number of events.
在本公开实施例中,事件次数阈值,是指在一个采样周期内,DVS针对同一个像素点发出事件信号的最少次数。例如,事件次数阈值集合中有两个事件次数阈值,其中一个事件次数阈值配置为5,另一个事件次数阈值配置为6,则在步骤S120中,获取采样周期内采样事件帧中产生事件信号的次数大于或等于5次的像素点的个数,以及获取采样周期内采样事件帧中产生事件信号的次数大于或等于6次的像素点的个数。In this embodiment of the present disclosure, the threshold for the number of events refers to the minimum number of times that the DVS sends an event signal for the same pixel in a sampling period. For example, if there are two event count thresholds in the event count threshold set, one of the event count thresholds is configured to be 5, and the other event count threshold is configured to be 6, then in step S120, obtain the event signal generated in the sampling event frame within the sampling period. The number of pixel points whose times are greater than or equal to 5 times, and the number of pixel points whose times are greater than or equal to 6 times in the sampling event frame in the acquisition sampling period.
需要说明的是,事件次数阈值越高,产生的事件次数大于或等于该事件次数阈值的像素点数量越少,这些像素点所在区域越有可能为实际的运动物体所在区域,但是由于这些像素点数量较少,因此可能无法准确描述运动物体的实际位置区域;而事件次数阈值越低,产生的事件次数大于或等于该事件次数阈值的像素点数量越多,这些像素点所在区域越有可能存在噪声点(即误检测到的干扰点),但是由于这些像素点的数量较多,因此更能准确的描述运动物体实际的运动区域。因此,需要在至少两个事件次数阈值中确定出目标事件次数阈值,通过目标事件次数阈值确定所需的像素点,以实现能够通过较多的像素点描述出运动物体实际的运动区域,且能够有效减少噪声点的出现。It should be noted that the higher the threshold of the number of events, the fewer the number of pixels whose number of events is greater than or equal to the threshold of the number of events. The number is small, so it may not be able to accurately describe the actual location area of the moving object; and the lower the event count threshold, the more the number of pixels with the event count greater than or equal to the event count threshold, the more likely the area where these pixels are located. Noise points (that is, falsely detected interference points), but due to the large number of these pixel points, it can more accurately describe the actual motion area of the moving object. Therefore, it is necessary to determine the threshold of the number of target events in at least two thresholds of the number of events, and determine the required pixel points through the threshold of the number of target events, so that the actual motion area of the moving object can be described by more pixels, and it can be Effectively reduce the appearance of noise spots.
在本公开实施例中,事件次数阈值集合是预先配置的由至少两个事件次数阈值组成的集合,事件次数阈值集合中的最小事件次数阈值可以根据实际需要预先设定,例如最小事件次数阈值可以配置为1,最小事件次数阈值也可以配置为其他较小的数值,事件次数阈 值集合中的最大事件次数阈值可以根据实际需要预先设定,例如,最大事件次数阈值可以设定为一个较大的数值(例如,50)。在一些实施例中,事件次数阈值集合可以包括多个连续的事件次数阈值,例如,最大事件次数阈值为50时,事件次数阈值集合包括1至50,总共50个事件次数阈值,也即,在步骤S130中,在上述50个事件次数阈值中,确定出目标事件次数阈值。In this embodiment of the present disclosure, the event count threshold set is a preconfigured set consisting of at least two event count thresholds, and the minimum event count threshold in the event count threshold set may be preset according to actual needs, for example, the minimum event count threshold may be If it is set to 1, the threshold of the minimum number of events can also be set to other smaller values. The threshold of the maximum number of events in the event number threshold set can be preset according to actual needs. For example, the threshold of the maximum number of events can be set to a larger value. Numeric value (eg, 50). In some embodiments, the event number threshold set may include multiple consecutive event number thresholds. For example, when the maximum event number threshold is 50, the event number threshold set includes 1 to 50, for a total of 50 event number thresholds, that is, at In step S130, among the above-mentioned 50 event times thresholds, a target event times threshold is determined.
步骤S140、确定采样事件帧中对应产生的事件次数大于或等于目标事件次数阈值的目标像素点,并根据目标像素点确定运动物体的位置区域。Step S140: Determine the target pixel points in the sampling event frame with the corresponding event times greater than or equal to the target event times threshold, and determine the location area of the moving object according to the target pixel points.
在步骤S140中,在确定采样事件帧中对应产生的事件次数大于或等于目标事件次数阈值的目标像素点之后,根据就近原则将所有目标像素点划分为一个或多个密集分布区域,其中,如果采样事件帧中只存在一个运动物体,那么采样事件帧中对应存在一个目标像素点密集分布区域,如果采样事件帧中存在多个运动物体,那么采样事件帧中则存在多个目标像素点密集分布区域;将目标像素点密集分布区域的外侧边沿像素点进行连接,即可获取到该区域内的运动物体的真实轮廓信息,从而可以确定出运动物体的位置区域。In step S140, after determining the target pixel points in the sampling event frame whose number of events is greater than or equal to the threshold of the number of target events, all target pixels are divided into one or more densely distributed areas according to the proximity principle, wherein, if There is only one moving object in the sampling event frame, then there is a dense distribution area of target pixels in the sampling event frame. If there are multiple moving objects in the sampling event frame, then there are multiple target pixels in the sampling event frame. area; by connecting the outer edge pixels of the densely distributed area of the target pixels, the real contour information of the moving objects in the area can be obtained, so that the position area of the moving objects can be determined.
根据本公开实施例提供的运动物体的定位方法的技术方案,在获取到采样事件帧后,根据至少两个事件次数阈值分别对应的像素点的数量,确定目标事件次数阈值,进而获取对应产生的事件次数大于或等于该目标事件次数阈值的目标像素点,最终根据所有目标像素点确定运动物体的位置区域,实现对运动物体的定位,且在对运动物体进行定位时无需进行图像特征的提取及计算过程,有效节省了计算资源,同时提高了对运动物体的识别效率,能够有效实现针对较小体积运动物体的准确定位。According to the technical solution of the method for locating a moving object provided by the embodiment of the present disclosure, after the sampling event frame is acquired, the target event number threshold is determined according to the number of pixels corresponding to at least two event number thresholds respectively, and then the corresponding generated event number threshold is obtained. The target pixels whose number of events is greater than or equal to the threshold of the number of target events will finally determine the location area of the moving object according to all the target pixels, so as to realize the positioning of the moving object, and there is no need to perform image feature extraction and The calculation process effectively saves computing resources, improves the recognition efficiency of moving objects, and can effectively achieve accurate positioning for small-volume moving objects.
图2为本公开实施例提供的另一种运动物体的定位方法的流程示意图,在本公开的一些实施例中,如图2所示,在根据采样事件帧,获取事件次数阈值集合中至少两个事件次数阈值分别对应的像素点数的量之前,即在步骤S120之前,该定位方法还可以进一步包括:步骤S111和步骤S112。FIG. 2 is a schematic flowchart of another method for locating a moving object according to an embodiment of the present disclosure. In some embodiments of the present disclosure, as shown in FIG. 2 , at least two events in the event count threshold set are obtained according to sampled event frames. Before the number of pixels corresponding to the thresholds of the number of events, that is, before step S120, the positioning method may further include: step S111 and step S112.
步骤S111、根据采样事件帧,确定具有至少一个事件的备选像素点。Step S111 , according to the sampled event frame, determine a candidate pixel point having at least one event.
步骤S112、根据备选像素点的数量,确定匹配的最大事件次数阈值,以确定事件次数阈值集合。Step S112: Determine the maximum matching event number threshold according to the number of candidate pixels to determine the event number threshold set.
在步骤S111中,采样事件帧记录了动态视觉传感器采集的画面中所有像素点发生的事件(如正事件或者负事件),因此根据采样事件帧,可以确定所有对应产生至少一个事件的像素点,以作为备选像素点。In step S111, the sampling event frame records events (such as positive events or negative events) that occur at all pixel points in the picture collected by the dynamic vision sensor, so according to the sampling event frame, all corresponding pixel points that generate at least one event can be determined, as an alternative pixel.
在本公开的一些实施例中,备选像素点的数量越多,表明图像中目标运动物体占据的 位置区域越大,或者图像中多个运动物体占据的位置区域总和越大,相应的,需要通过较多数量的像素点来规划运动物体的实际位置区域,因此,事件次数阈值集合中最大事件次数阈值可以设定为较小数值,以尽可能获取更多的像素点;备选像素点的数量越少,表明图像中目标运动物体占据的位置区域越小,或者图像中多个运动物体占据的位置区域总和越小,相应的,仅需要较少数量的像素点即可规划出运动物体的实际位置区域,因此,事件次数阈值集合中最大事件次数阈值可以设定为较大数值,以减少噪声点的出现。在步骤S112中,根据备选像素点的数量不同,获取匹配的最大事件次数阈值,从而确定事件次数阈值集合,可以有效提高目标事件次数阈值的获取效率,例如可以配置事件次数阈值集合包括从1至最大事件次数阈值的连续的多个数值。In some embodiments of the present disclosure, the larger the number of candidate pixel points, the larger the position area occupied by the target moving object in the image, or the larger the sum of the position areas occupied by multiple moving objects in the image. Correspondingly, it is necessary to The actual location area of the moving object is planned by a larger number of pixels. Therefore, the maximum number of events threshold in the event number threshold set can be set to a small value to obtain as many pixels as possible; The smaller the number, the smaller the location area occupied by the target moving object in the image, or the smaller the sum of the location area occupied by multiple moving objects in the image. Correspondingly, only a smaller number of pixels are needed to plan the moving object. The actual location area, therefore, the maximum event number threshold in the event number threshold set can be set to a larger value to reduce the occurrence of noise points. In step S112, according to the difference in the number of candidate pixel points, the matching maximum event number threshold is obtained, thereby determining the event number threshold set, which can effectively improve the acquisition efficiency of the target event number threshold. For example, the event number threshold set can be configured to include from 1 Consecutive values up to the maximum number of events threshold.
在本公开的一些实施例中,根据备选像素点的数量,确定匹配的最大事件次数阈值的步骤,可以进一步包括:根据预先确定的备选像素点数量与最大事件次数阈值的对应关系,获取与备选像素点的数量匹配的最大事件次数阈值。In some embodiments of the present disclosure, the step of determining the maximum number of events threshold for matching according to the number of candidate pixels may further include: obtaining, according to a predetermined correspondence between the number of candidate pixels and the threshold for the maximum number of events, obtaining Threshold for the maximum number of events to match the number of candidate pixels.
在本公开的一些实施例中,可以通过像素阈值对照表或者预设计算规则获取备选像素点数量与最大事件次数阈值的对应关系。其中,像素阈值对照表用于描述备选像素点的数量与最大事件阈值的对应关系,在获取到备选像素点的数量后,可以通过备选像素点的数量所在的数量区间,通过像素阈值对照表查找对应的最大事件次数阈值;还可以根据预设计算规则,将备选像素点的数量作为已知参数,带入预先构建的计算公式中,以获取对应的最大事件次数阈值,其中计算公式是可以根据实际需要进行设置的。In some embodiments of the present disclosure, the correspondence between the number of candidate pixel points and the threshold of the maximum number of events may be obtained through a pixel threshold comparison table or a preset calculation rule. Among them, the pixel threshold comparison table is used to describe the corresponding relationship between the number of candidate pixels and the maximum event threshold. After the number of candidate pixels is obtained, the number interval in which the number of candidate pixels is located can be used to pass the pixel threshold. Look up the corresponding maximum number of events threshold according to the table; you can also take the number of candidate pixel points as a known parameter and bring it into the pre-built calculation formula according to the preset calculation rule to obtain the corresponding maximum number of events threshold, which calculates The formula can be set according to actual needs.
在本公开的一些实施例中,根据至少两个事件次数阈值分别对应的像素点的数量,确定目标事件次数阈值的步骤,即步骤S130,可以进一步包括:根据至少两个事件次数阈值分别对应的像素点的数量,确定临界事件次数阈值,并将临界事件次数阈值作为目标事件次数阈值。In some embodiments of the present disclosure, the step of determining the target event number threshold according to the number of pixels corresponding to the at least two event number thresholds respectively, that is, step S130, may further include: according to the at least two event number thresholds respectively corresponding to The number of pixel points, determine the threshold of the number of critical events, and use the threshold of the number of critical events as the threshold of the number of target events.
在本公开的一些实施例中,根据至少两个事件次数阈值分别对应的像素点的数量,确定目标事件次数阈值的步骤,即步骤S130,可以进一步包括:根据至少两个事件次数阈值分别对应的像素点的数量,确定临界事件次数阈值;根据事件次数阈值集合中的最大事件次数阈值和临界事件次数阈值,确定中间事件次数阈值,并将中间事件次数阈值作为目标事件次数阈值。In some embodiments of the present disclosure, the step of determining the target event number threshold according to the number of pixels corresponding to the at least two event number thresholds respectively, that is, step S130, may further include: according to the at least two event number thresholds respectively corresponding to The number of pixel points determines the threshold of the number of critical events; according to the threshold of the maximum number of events and the threshold of the number of critical events in the event number threshold set, the threshold of the number of intermediate events is determined, and the threshold of the number of intermediate events is used as the target event number threshold.
当事件次数阈值低于该临界事件次数阈值时,表征物体运动区域的像素点的数量虽然会出现较大幅度增长,但由于噪声点的影响,获取到的运动物体的位置区域存在较大误差,因此,在一些实施例中,临界事件次数阈值也即像素点的数量出现较大幅度增长之前的事 件次数阈值,可以将临界事件次数阈值作为目标事件次数阈值。When the threshold of the number of events is lower than the threshold of the critical number of events, although the number of pixel points representing the motion area of the object will increase significantly, due to the influence of noise points, there is a large error in the obtained position area of the moving object. Therefore, in some embodiments, the threshold for the number of critical events, that is, the threshold for the number of events before the number of pixels increases substantially, the threshold for the number of critical events may be used as the threshold for the number of target events.
特别的,通过临界事件次数阈值获取的像素点,实质上也存在着一定程度的噪声,只是相比于数值更低的其它事件次数阈值,其噪声点数量并未显著增加。因此,在另一些实施例中,为了进一步降低噪声点运动物体位置区域的影响,还可以将临界事件次数阈值和事件次数阈值集合中的最大事件次数阈值之间的中间事件次数阈值作为筛选条件,即将该中间事件次数阈值作为目标事件次数阈值,以获取与该中间事件次数阈值对应的目标像素点,进而确定出运动物体的位置区域;例如,临界事件次数阈值为7,最大事件次数阈值为11,相应的,选择二者之间的中间事件次数阈值(即9),作为获取目标像素点的筛选条件。In particular, the pixel points obtained through the threshold of the critical event times also have a certain degree of noise, but the number of noise points does not increase significantly compared with other thresholds of event times with lower values. Therefore, in other embodiments, in order to further reduce the influence of the noise point moving object location area, the intermediate event count threshold between the critical event count threshold and the maximum event count threshold in the set of event count thresholds may also be used as a screening condition, The intermediate event number threshold is used as the target event number threshold to obtain target pixels corresponding to the intermediate event number threshold, and then determine the location area of the moving object; for example, the critical event number threshold is 7, and the maximum event number threshold is 11 , correspondingly, the intermediate event number threshold (ie, 9) between the two is selected as the filtering condition for obtaining the target pixel.
在本公开的一些实施例中,根据至少两个事件次数阈值分别对应的像素点的数量,确定临界事件次数阈值的步骤,可以进一步包括:将事件次数阈值集合中每相邻两个事件次数阈值对应的像素点的数量进行差值运算,并获取差值结果;在各差值结果中,选择数值最大的目标差值结果,并将与目标差值结果对应的两个事件次数阈值中的较大值,作为临界事件次数阈值。其中,针对事件次数阈值集合中每相邻两个事件次数阈值,该相邻两个事件次数阈值对应的像素点的数量的差值结果是该相邻两个事件次数阈值对应的像素点的数量的差值的绝对值。In some embodiments of the present disclosure, the step of determining a critical event number threshold according to the number of pixels corresponding to the at least two event number thresholds may further include: setting each adjacent two event number thresholds in the event number threshold set The number of corresponding pixel points is subjected to difference operation, and the difference result is obtained; in each difference result, the target difference result with the largest value is selected, and the difference between the two event times thresholds corresponding to the target difference result is compared. A large value is used as the threshold for the number of critical events. Wherein, for every two adjacent event number thresholds in the event number threshold set, the result of the difference between the number of pixels corresponding to the two adjacent event number thresholds is the number of pixels corresponding to the adjacent two event number thresholds The absolute value of the difference.
由于事件次数阈值集合包括了多个数值连续的事件次数阈值,通过获取每个事件次数阈值对应的像素点的数量,统计每相邻两个事件次数阈值对应的像素点的数量之间的差值,并根据上述统计的各差值结果,获取与最大差值结果相关的两个事件次数阈值,并在上述两个事件次数阈值中选择数值较大的一个,作为临界事件次数阈值;例如,事件次数阈值集合包括8个事件次数阈值,该8个事件次数阈值的数值分别为11、10、9、8、7、6、5和4,对应的像素点的个数分别为8万、10万、12万、15万、18万、20万、27万和30万,则每相邻两个事件次数阈值对应的像素点的数量之间的差值分别为2万、2万、3万、3万、2万、7万和3万,显然,其中数值最大的差值结果为7万,对应的两个事件次数阈值分别为6和5,因此将事件次数阈值6确定为临界事件次数阈值。Since the event number threshold set includes multiple consecutive event number thresholds, by obtaining the number of pixels corresponding to each event number threshold, the difference between the number of pixels corresponding to each adjacent two event number thresholds is calculated. , and according to the difference results of the above statistics, obtain two event count thresholds related to the maximum difference result, and select the larger one among the above two event count thresholds as the critical event count threshold; The count threshold set includes 8 event count thresholds. The values of the 8 event count thresholds are 11, 10, 9, 8, 7, 6, 5, and 4, respectively, and the corresponding pixel numbers are 80,000 and 100,000 respectively. , 120,000, 150,000, 180,000, 200,000, 270,000, and 300,000, then the difference between the number of pixels corresponding to the thresholds for the number of adjacent events is 20,000, 20,000, 30,000, 30,000, 20,000, 70,000, and 30,000. Obviously, the difference with the largest value is 70,000, and the corresponding two event thresholds are 6 and 5, respectively. Therefore, the event threshold 6 is determined as the critical event threshold. .
在本公开的一些实施例中,当获取到事件次数阈值集合中任意相邻的两个事件次数阈值对应的像素点的数量,出现明显的像素点数量增多或减少时,则将上述相邻的两个事件次数阈值中数值较大的一个,作为临界事件次数阈值。In some embodiments of the present disclosure, when the number of pixels corresponding to any two adjacent event number thresholds in the event number threshold set is obtained, and there is an obvious increase or decrease in the number of pixels, the above adjacent number of pixels is determined. The larger value of the two event count thresholds is used as the critical event count threshold.
在本公开的一些实施例中,根据至少两个事件次数阈值分别对应的像素点的数量,确定临界事件次数阈值的步骤,可以进一步包括:若获取到事件次数阈值集合中任意相邻的 两个事件次数阈值分别对应的像素点的数量的差值运算结果大于或等于预设数量阈值,则将该相邻的两个事件次数阈值中的较大值,作为临界事件次数阈值。其中,相邻的两个事件次数阈值对应的像素点的数量的差值运算结果为相邻的两个事件次数阈值对应的像素点的数量的差值的绝对值。In some embodiments of the present disclosure, the step of determining the threshold for the number of critical events according to the number of pixels corresponding to the at least two thresholds for the number of events may further include: if any adjacent two thresholds for the number of events are acquired If the difference calculation result of the number of pixels corresponding to the event number thresholds is greater than or equal to the preset number threshold, the larger value of the two adjacent event number thresholds is used as the critical event number threshold. The calculation result of the difference between the numbers of pixels corresponding to two adjacent event times thresholds is the absolute value of the difference between the numbers of pixels corresponding to two adjacent event times thresholds.
在本公开的另一些实施例中,根据至少两个事件次数阈值分别对应的像素点的数量,确定临界事件次数阈值的步骤,可以进一步包括:若获取到事件次数阈值集合中任意相邻的两个事件次数阈值分别对应的像素点的数量的差值运算结果,与采样事件帧的像素点总数之间的比值,大于等于预设百分比阈值时,则将该相邻的两个事件次数阈值中的较大值,作为临界事件次数阈值。In other embodiments of the present disclosure, the step of determining the critical event count threshold according to the number of pixels corresponding to the at least two event count thresholds may further include: if any adjacent two event count threshold sets are acquired When the ratio between the difference calculation result of the number of pixels corresponding to the thresholds of the number of events of each event and the total number of pixels of the sampled event frame is greater than or equal to the preset percentage threshold, the adjacent two thresholds of the number of events will be divided into The larger value of , as the threshold for the number of critical events.
示例性的,预设数量阈值为5万,对于上述相邻的事件次数阈值6和事件次数阈值5,二者分别对应的像素点的数量之间的差值为7万个,该差值大于预设数量阈值5万,因此,将事件次数阈值6和事件次数阈值5中的较大值,即将6作为临界事件次数阈值,此时,无需再进行其它相邻的事件次数阈值分别对应的像素点的数量之间的差值运算或差值运算结果与像素点总数的比值运算,减少了数据计算量,提高了临界事件次数阈值的获取速度。Exemplarily, the preset number threshold is 50,000, and for the above-mentioned adjacent event number threshold 6 and event number threshold 5, the difference between the number of pixels corresponding to the two is 70,000, and the difference is greater than 70,000. The preset number threshold is 50,000. Therefore, the larger of the event number threshold 6 and the event number threshold 5, that is, 6 is used as the critical event number threshold. At this time, there is no need to perform other adjacent event number thresholds. The corresponding pixels The difference operation between the number of points or the ratio operation between the difference operation result and the total number of pixel points reduces the amount of data calculation and improves the acquisition speed of the threshold for the number of critical events.
示例性的,预设百分比阈值为10%,采样事件帧的像素点总数可以根据动态视觉传感器的分辨率确定,例如采样事件帧的像素点总数为60万个,对于上述相邻的事件次数阈值6和事件次数阈值5,二者分别对应的像素点的数量之间的差值为7万个,该差值与采样事件帧的像素点总数的比值为7÷60=11.7%,该比值大于预设百分比阈值10%,因此,将事件次数阈值6和事件次数阈值5中的较大值,即将6作为临界事件次数阈值,此时,无需再进行其它相邻的事件次数阈值分别对应的像素点的数量之间的差值运算或差值运算结果与像素点总数的比值运算,减少了数据计算量,提高了临界事件次数阈值的获取速度。Exemplarily, the preset percentage threshold is 10%, and the total number of pixels in the sampling event frame can be determined according to the resolution of the dynamic vision sensor. For example, the total number of pixels in the sampling event frame is 600,000. For the above threshold of adjacent events 6 and the number of events threshold 5, the difference between the number of pixels corresponding to the two is 70,000, and the ratio of the difference to the total number of pixels in the sampled event frame is 7÷60=11.7%, the ratio is greater than The preset percentage threshold is 10%. Therefore, the larger of the event number threshold 6 and the event number threshold 5, that is, 6 is used as the critical event number threshold. At this time, there is no need to perform other adjacent event number thresholds. The corresponding pixels The difference operation between the number of points or the ratio operation between the difference operation result and the total number of pixel points reduces the amount of data calculation and improves the acquisition speed of the threshold for the number of critical events.
图3为本公开实施例提供的另一种运动物体的定位方法的流程示意图,在本公开的一些实施例中,如图3所示,在根据采样事件帧,确定具有至少一个事件的备选像素点的步骤之后,即在步骤S111之后,该定位方法还包括:步骤S113。FIG. 3 is a schematic flowchart of another method for locating a moving object according to an embodiment of the present disclosure. In some embodiments of the present disclosure, as shown in FIG. 3 , according to a sampled event frame, a candidate having at least one event is determined. After the step of pixel point, that is, after step S111, the positioning method further includes: step S113.
步骤S113、对采样事件帧中备选像素点所在的区域,进行侧抑制处理。Step S113 , perform side suppression processing on the region where the candidate pixel points are located in the sampling event frame.
进一步的,根据采样事件帧,获取事件次数阈值集合中至少两个事件次数阈值分别对应的像素点的数量的步骤,即步骤S120,可以进一步包括:根据侧抑制处理后的采样事件帧,获取事件次数阈值集合中至少两个事件次数阈值分别对应的像素点的数量。Further, according to the sampled event frame, the step of obtaining the number of pixels corresponding to at least two event count thresholds in the event count threshold set, that is, step S120, may further include: obtaining the event according to the sampled event frame after the side suppression processing. The number of pixels corresponding to at least two event count thresholds in the count threshold set.
其中,侧抑制是相近的神经元彼此之间发生的抑制作用,即在某个神经元受到刺激而产生兴奋时,再刺激相近的神经元,则后者(即上述相近的神经元)所发生的兴奋对前者(即上述某个神经元)产生的抑制作用,侧抑制实质上是相邻的感受器之间互相抑制的现象;在本公开的一些实施例中,对备选像素点所在的区域进行侧抑制处理后,可以加强备选像素点的显示效果,对该区域内的背景像素点进行抑制。Among them, lateral inhibition is the inhibitory effect that occurs between adjacent neurons, that is, when a neuron is stimulated and excited, the adjacent neurons are stimulated again, and the latter (that is, the above-mentioned similar neurons) will occur. The inhibitory effect of the excitation on the former (that is, the above-mentioned certain neuron), the lateral inhibition is essentially the phenomenon of mutual inhibition between adjacent receptors; in some embodiments of the present disclosure, the area where the candidate pixel points are located After the side suppression processing is performed, the display effect of the candidate pixels can be enhanced, and the background pixels in the area can be suppressed.
在本公开的一些实施例中,在步骤S140中,根据目标像素点确定运动物体的位置区域,可以进一步包括:根据目标像素点,通过感兴趣区域框标注运动物体的位置区域。In some embodiments of the present disclosure, in step S140 , determining the location area of the moving object according to the target pixel point may further include: marking the location area of the moving object through a region of interest frame according to the target pixel point.
其中,感兴趣区域(Region Of Interest,ROI)是以方框、圆、椭圆和多边形等方式勾勒出来需要处理的区域,由于获取的运动物体的轮廓信息通常为不规则图形,在图像中不便于定位,在本公开的一些实施例中,可以通过正方形标注框的方式,在图像中标注出同时包含运动物体轮廓的最小正方形,而正方形标注框及正方形标注框内的区域,即为运动物体的位置区域。Among them, the region of interest (Region Of Interest, ROI) is a box, circle, ellipse and polygon to outline the area that needs to be processed, because the acquired contour information of the moving object is usually an irregular figure, which is inconvenient in the image. Positioning, in some embodiments of the present disclosure, the smallest square that also includes the outline of the moving object can be marked in the image by means of a square marking frame, and the area within the square marking frame and the square marking frame is the area of the moving object. location area.
图4为本公开实施例提供的另一种运动物体的定位方法的流程示意图,在本公开的一些实施例中,如图4所示,在根据目标像素点确定运动物体的位置区域的步骤之后,即在步骤S140之后,该定位方法还包括:步骤S141。FIG. 4 is a schematic flowchart of another method for locating a moving object according to an embodiment of the present disclosure. In some embodiments of the present disclosure, as shown in FIG. 4 , after the step of determining the location area of the moving object according to the target pixel point , that is, after step S140, the positioning method further includes: step S141.
步骤S141、根据多个采样事件帧中运动物体的位置区域,确定运动物体的移动轨迹,并通过训练完成的图像分类模型,判断移动轨迹是否为目标轨迹。Step S141 , determining the moving trajectory of the moving object according to the position regions of the moving object in the multiple sampling event frames, and determining whether the moving trajectory is the target trajectory through the trained image classification model.
在每个采样事件帧的运动物体的位置区域中,将该位置区域的中心点作为运动物体的运动点,将多个连续的采样事件帧进行叠加后,即可获取到由多个运动点组成该运动物体的移动轨迹。图像分类模型,是基于样本图像预先训练完成的分类模型,其作用在于针对输入的图像信息,进行图像特征的提取并获取特征向量,然后根据获取到的特征向量输出对应的图像分类概率,图像分类概率表示了输入的图像信息为正样本或负样本的概率,进而根据该图像分类概率进行分类(即二值分类),确定输入图像是否为目标轨迹;其中,目标轨迹的类型由正样本图像的轨迹类型决定,例如,将高空抛物轨迹作为目标轨迹,判断图像中运动物体的移动轨迹是否为高空抛物轨迹,以确定采样事件帧中运动物体的移动轨迹是否为高空抛物轨迹。In the position area of the moving object in each sampling event frame, the center point of the position area is used as the moving point of the moving object, and after multiple consecutive sampling event frames are superimposed, the composition of multiple moving points can be obtained. The movement trajectory of the moving object. The image classification model is a classification model that is pre-trained based on sample images. Its function is to extract image features and obtain feature vectors for the input image information, and then output the corresponding image classification probability according to the obtained feature vectors. Image classification The probability represents the probability that the input image information is a positive sample or a negative sample, and then classify according to the image classification probability (ie binary classification) to determine whether the input image is a target trajectory; among them, the type of the target trajectory is determined by the positive sample image. The trajectory type determines, for example, the high-altitude parabolic trajectory is used as the target trajectory, and whether the moving trajectory of the moving object in the image is a high-altitude parabolic trajectory is determined to determine whether the moving trajectory of the moving object in the sampling event frame is a high-altitude parabolic trajectory.
在本公开的一些实施例中,在通过训练完成的图像分类模型,判断移动轨迹是否为目标轨迹之前,该定位方法还包括:基于卷积神经网络构建初始图像分类模型,并通过样本图像集合对初始图像分类模型进行图像识别及分类训练,以获取训练完成的图像分类模型。In some embodiments of the present disclosure, before judging whether the moving track is a target track through the image classification model completed by training, the positioning method further includes: constructing an initial image classification model based on a convolutional neural network, and pairing the image with a sample image set. The initial image classification model performs image recognition and classification training to obtain a trained image classification model.
其中,卷积神经网络(Convolutional Neural Networks,CNN)是深度学习(Deep Learning)中包含卷积计算且具有深度结构的前馈神经网络(Feedforward Neural Networks),其特点在于卷积运算的应用提高了图像特征的提取精度,池化层的应用则降低了图像特征的计算复杂度。Among them, Convolutional Neural Networks (CNN) is a feedforward neural network (Feedforward Neural Networks) with deep structure including convolution calculation in Deep Learning (Deep Learning), which is characterized in that the application of convolution operation improves the The extraction accuracy of image features, and the application of pooling layer reduces the computational complexity of image features.
示例性的,样本图像集合中,正样本图像为高空抛物轨迹图像,正样本图像的输出值为1;负样本图像为不包括高空抛物轨迹或高空坠物轨迹的图像信息,例如,飞鸟飞行轨迹图像、人影闪过轨迹图像、像素空白图像以及上抛物体移动轨迹图像等多种类型,负样本图像的输出值为0。Exemplarily, in the sample image set, the positive sample image is a high-altitude parabolic trajectory image, and the output value of the positive sample image is 1; the negative sample image is image information that does not include high-altitude parabolic trajectory or high-altitude falling object trajectory, for example, the flight trajectory of birds. There are various types of images, such as images of silhouette flashing trajectory images, pixel blank images, and moving trajectory images of upper paraboloids. The output value of negative sample images is 0.
通过正样本图像和负样本图像组成的样本图像集对初始图像分类模型的训练,使得训练完成的图像分类模型具备了图像识别及分类能力。The initial image classification model is trained by a sample image set composed of positive sample images and negative sample images, so that the trained image classification model has image recognition and classification capabilities.
根据本公开实施例提供的运动物体的定位方法的技术方案,在获取到采样事件帧后,根据至少两个事件次数阈值分别对应的像素点的数量,确定目标事件次数阈值,进而获取对应产生的事件次数大于或等于该目标事件次数阈值的目标像素点,最终根据所有目标像素点确定运动物体的位置区域,实现对运动物体的定位,且在对运动物体进行定位时无需进行图像特征的提取及计算过程,有效节省了计算资源,同时提高了运动物体的识别效率,能够有效实现针对较小体积运动物体的准确定位。According to the technical solution of the method for locating a moving object provided by the embodiment of the present disclosure, after the sampling event frame is acquired, the target event number threshold is determined according to the number of pixels corresponding to at least two event number thresholds respectively, and then the corresponding generated event number threshold is obtained. The target pixels whose number of events is greater than or equal to the threshold of the number of target events will finally determine the location area of the moving object according to all the target pixels, so as to realize the positioning of the moving object, and there is no need to perform image feature extraction and The calculation process effectively saves computing resources, improves the recognition efficiency of moving objects, and can effectively achieve accurate positioning for small-volume moving objects.
图5为本公开实施例提供的一种运动物体的定位装置的结构框图,该定位装置200具体包括:事件帧获取模块210、阈值获取模块220和位置区域获取模块230。FIG. 5 is a structural block diagram of a device for positioning a moving object according to an embodiment of the present disclosure. The positioning device 200 specifically includes: an event frame obtaining module 210 , a threshold value obtaining module 220 and a location area obtaining module 230 .
其中,事件帧获取模块210用于通过动态视觉传感器获取事件流信息,并根据事件流信息获取采样事件帧。The event frame obtaining module 210 is configured to obtain event flow information through the dynamic vision sensor, and obtain sampled event frames according to the event flow information.
阈值获取模块220用于:根据采样事件帧,获取事件次数阈值集合中至少两个事件次数阈值分别对应的像素点的数量,其中事件次数阈值对应的像素点为对应产生的事件次数大于或等于该事件次数阈值的像素点;根据至少两个事件次数阈值分别对应的像素点的数量,确定目标事件次数阈值。The threshold obtaining module 220 is used to: obtain the number of pixels corresponding to at least two event times thresholds in the event times threshold set according to the sampled event frame, wherein the pixel corresponding to the event times threshold is the corresponding generated event times greater than or equal to the number of pixels. Pixels of the event count threshold; determine the target event count threshold according to the number of pixels corresponding to at least two event count thresholds respectively.
位置区域获取模块230用于确定采样事件帧中对应产生的事件次数大于或等于目标事件次数阈值的目标像素点,并根据目标像素点确定运动物体的位置区域。The location area acquisition module 230 is configured to determine the target pixel points in the sampled event frame with the corresponding event times greater than or equal to the target event times threshold, and determine the location area of the moving object according to the target pixel points.
根据本公开实施例提供的运动物体的定位装置的技术方案,在获取到采样事件帧后,根据至少两个事件次数阈值分别对应的像素点的数量,确定目标事件次数阈值,进而获取对应产生的事件次数大于或等于该目标事件次数阈值的目标像素点,最终根据所有目标像素点确定运动物体的位置区域,实现对运动物体的定位,且在对运动物体进行定位时无需 进行图像特征的提取及计算过程,有效节省了计算资源,同时提高了运动物体的识别效率,能够有效实现针对较小体积运动物体的准确定位。According to the technical solution of the device for positioning a moving object provided by the embodiment of the present disclosure, after the sampling event frame is acquired, the target event number threshold is determined according to the number of pixels corresponding to at least two event number thresholds respectively, and then the corresponding generated event number threshold is determined. The target pixels whose number of events is greater than or equal to the threshold of the number of target events will finally determine the location area of the moving object according to all the target pixels, so as to realize the positioning of the moving object, and there is no need to perform image feature extraction and The calculation process effectively saves computing resources, improves the recognition efficiency of moving objects, and can effectively achieve accurate positioning for small-volume moving objects.
在一些实施例中,在上述技术方案的基础上,运动物体的定位装置200,还可以包括:备选像素点获取模块和阈值集合确定模块。In some embodiments, on the basis of the above technical solutions, the apparatus 200 for positioning a moving object may further include: a candidate pixel point acquisition module and a threshold set determination module.
备选像素点获取模块用于根据所述采样事件帧,确定具有至少一个事件的备选像素点。The candidate pixel point acquiring module is configured to determine candidate pixel points having at least one event according to the sampled event frame.
阈值集合确定模块用于根据备选像素点的数量,确定匹配的最大事件次数阈值,以确定事件次数阈值集合。The threshold set determination module is configured to determine the maximum matching event number threshold according to the number of candidate pixels, so as to determine the event number threshold set.
在一些实施例中,在上述技术方案的基础上,阈值集合确定模块用于根据预先确定的备选像素点数量与最大事件次数阈值的对应关系,获取与备选像素点的数量匹配的最大事件次数阈值。In some embodiments, on the basis of the above technical solutions, the threshold set determination module is configured to obtain the maximum event matching the number of candidate pixels according to the predetermined correspondence between the number of candidate pixels and the threshold of the maximum number of events number of thresholds.
在一些实施例中,在上述技术方案的基础上,阈值获取模块220用于根据至少两个事件次数阈值分别对应的像素点的数量,确定临界事件次数阈值,并将临界事件次数阈值作为目标事件次数阈值。或者,阈值获取模块220用于:根据至少两个事件次数阈值分别对应的像素点的数量,确定临界事件次数阈值;根据事件次数阈值集合中的最大事件次数阈值和临界事件次数阈值,确定中间事件次数阈值,并将中间事件次数阈值作为目标事件次数阈值。In some embodiments, based on the above technical solutions, the threshold obtaining module 220 is configured to determine the threshold of the number of critical events according to the number of pixels corresponding to the at least two thresholds of the number of events, and use the threshold of the number of critical events as the target event number of thresholds. Or, the threshold value acquisition module 220 is used for: according to the number of pixel points corresponding to at least two event times thresholds respectively, determine the critical event times threshold; according to the maximum event times threshold and the critical event times threshold in the event times threshold set, determine the intermediate event Threshold for the number of times, and use the threshold for the number of intermediate events as the threshold for the number of target events.
在一些实施例中,在上述技术方案的基础上,阈值获取模块220,具体包括:差值结果获取单元和阈值获取单元。In some embodiments, based on the above technical solutions, the threshold value obtaining module 220 specifically includes: a difference value result obtaining unit and a threshold value obtaining unit.
差值结果获取单元用于将事件次数阈值集合中每相邻两个事件次数阈值分别对应的像素点的数量进行差值运算,并获取差值结果。The difference result obtaining unit is configured to perform a difference operation on the number of pixels corresponding to every two adjacent event times thresholds in the event times threshold set, and obtain a difference result.
阈值获取单元用于在各差值结果中,选择数值最大的目标差值结果,并将与目标差值结果对应的两个事件次数阈值中的较大值,作为临界事件次数阈值。The threshold obtaining unit is used to select the target difference result with the largest value among the difference results, and use the larger value of the two event count thresholds corresponding to the target difference result as the critical event count threshold.
在一些实施例中,在上述技术方案的基础上,阈值获取模块220用于若获取到事件次数阈值集合中任意相邻两个事件次数阈值分别对应的像素点的数量的差值运算结果大于等于预设数量阈值时,则将该相邻的两个事件次数阈值中的较大值,作为临界事件次数阈值。或者,阈值获取模块220用于若获取到事件次数阈值集合中任意相邻的两个事件次数阈值分别对应的像素点的数量的差值运算结果,与采样事件帧的像素点总数之间的比值,大于等于预设百分比阈值时,则将该相邻的两个事件次数阈值中的较大值,作为临界事件次数阈值。In some embodiments, on the basis of the above technical solutions, the threshold obtaining module 220 is configured to obtain a difference operation result of the number of pixels corresponding to any two adjacent event count thresholds in the event count threshold set being greater than or equal to When the number threshold is preset, the larger value of the two adjacent event number thresholds is used as the critical event number threshold. Alternatively, the threshold acquisition module 220 is configured to obtain the ratio of the difference calculation result of the number of pixels corresponding to any two adjacent event number thresholds in the event number threshold set to the total number of pixels in the sampling event frame. , when it is greater than or equal to the preset percentage threshold, the larger value of the two adjacent event times thresholds is used as the critical event times threshold.
在一些实施例中,在上述技术方案的基础上,位置区域获取模块230用于根据目标像素点,通过感兴趣区域框标注运动物体的位置区域。In some embodiments, on the basis of the above technical solutions, the location area acquisition module 230 is configured to mark the location area of the moving object through the area of interest frame according to the target pixel point.
在一些实施例中,在上述技术方案的基础上,运动物体的定位装置200,还包括:侧抑制处理执行模块。In some embodiments, based on the above technical solutions, the apparatus 200 for positioning a moving object further includes: a side restraint processing execution module.
侧抑制处理执行模块用于对采样事件帧中备选像素点所在的区域,进行侧抑制处理。The side suppression processing execution module is used for performing side suppression processing on the region where the candidate pixel points are located in the sampling event frame.
在一些实施例中,在上述技术方案的基础上,阈值获取模块220用于根据侧抑制处理后的采样事件帧,获取事件次数阈值集合中至少两个事件次数阈值分别对应的像素点的数量。In some embodiments, based on the above technical solutions, the threshold obtaining module 220 is configured to obtain the number of pixels corresponding to at least two event count thresholds in the event count threshold set according to the sampled event frame after side suppression processing.
在一些实施例中,在上述技术方案的基础上,运动物体的定位装置200,还包括:移动轨迹获取模块。In some embodiments, on the basis of the above technical solutions, the apparatus 200 for positioning a moving object further includes: a movement track acquisition module.
移动轨迹获取模块用于根据多个采样事件帧中运动物体的位置区域,确定运动物体的移动轨迹,并通过训练完成的图像分类模型,判断移动轨迹是否为目标轨迹。The moving track acquisition module is used to determine the moving track of the moving object according to the position area of the moving object in the multiple sampling event frames, and judge whether the moving track is the target track through the image classification model completed by training.
在一些实施例中,在上述技术方案的基础上,运动物体的定位装置200,还包括:图像分类模型获取模块。In some embodiments, based on the above technical solutions, the moving object positioning apparatus 200 further includes: an image classification model acquisition module.
图像分类模型获取模块用于基于卷积神经网络构建初始图像分类模型,并通过样本图像集合对初始图像分类模型进行图像识别及分类训练,以获取训练完成的图像分类模型。The image classification model acquisition module is used to construct an initial image classification model based on the convolutional neural network, and perform image recognition and classification training on the initial image classification model through the sample image set to obtain the trained image classification model.
上述装置可执行本公开任意实施例所提供的运动物体的定位方法,具备执行方法相应的功能模块和有益效果。未在本实施例中详尽描述的技术细节,可参见本公开任意实施例提供的方法。The above apparatus can execute the method for positioning a moving object provided by any embodiment of the present disclosure, and has functional modules and beneficial effects corresponding to the execution method. For technical details not described in detail in this embodiment, reference may be made to the method provided by any embodiment of the present disclosure.
图6为本公开实施例提供的一种电子设备的结构示意图。图6示出了适于用来实现本公开实施方式的示例性电子设备12的框图。图6显示的电子设备12仅仅是一个示例,不应对本公开实施例的功能和使用范围带来任何限制。FIG. 6 is a schematic structural diagram of an electronic device according to an embodiment of the present disclosure. FIG. 6 shows a block diagram of an exemplary electronic device 12 suitable for use in implementing embodiments of the present disclosure. The electronic device 12 shown in FIG. 6 is only an example, and should not impose any limitation on the function and scope of use of the embodiments of the present disclosure.
如图6所示,电子设备12以通用计算设备的形式表现。电子设备12的组件可以包括但不限于:一个或者多个处理器或者处理单元16,存储器28,连接不同系统组件(包括存储器28和处理单元16)的总线18。As shown in FIG. 6, the electronic device 12 takes the form of a general-purpose computing device. Components of electronic device 12 may include, but are not limited to, one or more processors or processing units 16 , memory 28 , and a bus 18 connecting various system components including memory 28 and processing unit 16 .
总线18表示几类总线结构中的一种或多种,包括存储器总线或者存储器控制器,外围总线,图形加速端口,处理器或者使用多种总线结构中的任意总线结构的局域总线。举例来说,这些体系结构包括但不限于工业标准体系结构(ISA)总线,微通道体系结构(MAC)总线,增强型ISA总线、视频电子标准协会(VESA)局域总线以及外围组件互连(PCI)总线。 Bus 18 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, a graphics acceleration port, a processor, or a local bus using any of a variety of bus structures. By way of example, these architectures include, but are not limited to, Industry Standard Architecture (ISA) bus, Micro Channel Architecture (MAC) bus, Enhanced ISA bus, Video Electronics Standards Association (VESA) local bus, and Peripheral Component Interconnect ( PCI) bus.
电子设备12典型地包括多种计算机系统可读介质。这些介质可以是任何能够被电子设备12访问的可用介质,包括易失性和非易失性介质,可移动的和不可移动的介质。 Electronic device 12 typically includes a variety of computer system readable media. These media can be any available media that can be accessed by electronic device 12, including both volatile and non-volatile media, removable and non-removable media.
存储器28可以包括易失性存储器形式的计算机系统可读介质,例如随机存取存储器(RAM)30和/或高速缓存存储器32。电子设备12可以进一步包括其它可移动/不可移动的、易失性/非易失性计算机系统存储介质。仅作为举例,存储系统34可以用于读写不可移动的、非易失性磁介质(图3未显示,通常称为“硬盘驱动器”)。尽管图3中未示出,可以提供用于对可移动非易失性磁盘(例如“软盘”)读写的磁盘驱动器,以及对可移动非易失性光盘(例如CD-ROM,DVD-ROM或者其它光介质)读写的光盘驱动器。在这些情况下,每个驱动器可以通过一个或者多个数据介质接口与总线18相连。存储器28可以包括至少一个程序产品,该程序产品具有一组(例如至少一个)程序模块,这些程序模块被配置以执行本公开各实施例的功能。 Memory 28 may include computer system readable media in the form of volatile memory, such as random access memory (RAM) 30 and/or cache memory 32 . Electronic device 12 may further include other removable/non-removable, volatile/non-volatile computer system storage media. For example only, storage system 34 may be used to read and write to non-removable, non-volatile magnetic media (not shown in FIG. 3, commonly referred to as a "hard disk drive"). Although not shown in Figure 3, a disk drive may be provided for reading and writing to removable non-volatile magnetic disks (eg "floppy disks"), as well as removable non-volatile optical disks (eg CD-ROM, DVD-ROM) or other optical media) to read and write optical drives. In these cases, each drive may be connected to bus 18 through one or more data media interfaces. Memory 28 may include at least one program product having a set (eg, at least one) of program modules configured to perform the functions of various embodiments of the present disclosure.
具有一组(至少一个)程序模块42的程序/实用工具40,可以存储在例如存储器28中,这样的程序模块42包括但不限于操作系统、一个或者多个应用程序、其它程序模块以及程序数据,这些示例中的每一个或某种组合中可能包括网络环境的实现。程序模块42通常执行本公开所描述的实施例中的功能和/或方法。A program/utility 40 having a set (at least one) of program modules 42, which may be stored, for example, in memory 28, such program modules 42 including, but not limited to, an operating system, one or more application programs, other program modules, and program data , each or some combination of these examples may include an implementation of a network environment. Program modules 42 generally perform the functions and/or methods of the embodiments described in this disclosure.
电子设备12也可以与一个或多个外部设备14(例如键盘、指向设备、显示器24等)通信,还可与一个或者多个使得用户能与该电子设备12交互的设备通信,和/或与使得该电子设备12能与一个或多个其它计算设备进行通信的任何设备(例如网卡,调制解调器等等)通信。这种通信可以通过输入/输出(I/O)接口22进行。并且,电子设备12还可以通过网络适配器20与一个或者多个网络(例如局域网(LAN),广域网(WAN)和/或公共网络,例如因特网)通信。如图所示,网络适配器20通过总线18与电子设备12的其它模块通信。应当明白,尽管图中未示出,可以结合电子设备12使用其它硬件和/或软件模块,包括但不限于:微代码、设备驱动器、冗余处理单元、外部磁盘驱动阵列、RAID系统、磁带驱动器以及数据备份存储系统等。The electronic device 12 may also communicate with one or more external devices 14 (eg, a keyboard, pointing device, display 24, etc.), with one or more devices that enable a user to interact with the electronic device 12, and/or with Any device (eg, network card, modem, etc.) that enables the electronic device 12 to communicate with one or more other computing devices. Such communication may take place through input/output (I/O) interface 22 . Also, the electronic device 12 may communicate with one or more networks (eg, a local area network (LAN), a wide area network (WAN), and/or a public network such as the Internet) through a network adapter 20 . As shown, network adapter 20 communicates with other modules of electronic device 12 via bus 18 . It should be understood that, although not shown, other hardware and/or software modules may be used in conjunction with electronic device 12, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives and data backup storage systems.
处理单元16通过运行存储在存储器28中的程序,从而执行各种功能应用以及数据处理,例如实现本公开任意实施例提供的运动物体的定位方法。本公开实施例还提供了一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现如本公开任意实施例所述的运动物体的定位方法。The processing unit 16 executes various functional applications and data processing by running the programs stored in the memory 28, for example, implementing the method for positioning a moving object provided by any embodiment of the present disclosure. An embodiment of the present disclosure also provides a computer-readable storage medium, on which a computer program is stored, and when the program is executed by a processor, implements the method for locating a moving object according to any embodiment of the present disclosure.
本公开实施例的计算机存储介质,可以采用一个或多个计算机可读的介质的任意组合。计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质。计算机可读存 储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子(非穷举的列表)包括:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本文件中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。The computer storage medium of the embodiments of the present disclosure may adopt any combination of one or more computer-readable media. The computer-readable medium may be a computer-readable signal medium or a computer-readable storage medium. A computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above. More specific examples (a non-exhaustive list) of computer readable storage media include: electrical connections having one or more wires, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), Erasable programmable read only memory (EPROM or flash memory), optical fiber, portable compact disk read only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the foregoing. In this document, a computer-readable storage medium can be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
计算机可读的信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读的信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。A computer-readable signal medium may include a propagated data signal in baseband or as part of a carrier wave, with computer-readable program code embodied thereon. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing. A computer-readable signal medium can also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device .
计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括——但不限于无线、电线、光缆、RF等等,或者上述的任意合适的组合。Program code embodied on a computer readable medium may be transmitted using any suitable medium, including - but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
可以以一种或多种程序设计语言或其组合来编写用于执行本公开操作的计算机程序代码,所述程序设计语言包括面向对象的程序设计语言—诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言—诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括局域网(LAN)或广域网(WAN)—连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。Computer program code for carrying out operations of the present disclosure may be written in one or more programming languages, including object-oriented programming languages—such as Java, Smalltalk, C++, but also conventional Procedural programming language - such as the "C" language or similar programming language. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (eg, using an Internet service provider through Internet connection).
注意,上述仅为本公开的较佳实施例及所运用技术原理。本领域技术人员会理解,本公开不限于这里所述的特定实施例,对本领域技术人员来说能够进行各种明显的变化、重新调整和替代而不会脱离本公开的保护范围。因此,虽然通过以上实施例对本公开进行了较为详细的说明,但是本公开不仅仅限于以上实施例,在不脱离本公开构思的情况下,还可以包括更多其他等效实施例,而本公开的范围由所附的权利要求范围决定。Note that the above are only preferred embodiments of the present disclosure and applied technical principles. Those skilled in the art will understand that the present disclosure is not limited to the specific embodiments described herein, and various obvious changes, readjustments and substitutions can be made by those skilled in the art without departing from the scope of protection of the present disclosure. Therefore, although the present disclosure has been described in detail through the above embodiments, the present disclosure is not limited to the above embodiments, and can also include more other equivalent embodiments without departing from the concept of the present disclosure. The scope is determined by the scope of the appended claims.

Claims (13)

  1. 一种运动物体的定位方法,其特征在于,包括:A method for positioning a moving object, comprising:
    通过动态视觉传感器获取事件流信息,并根据所述事件流信息获取采样事件帧;Obtain event flow information through a dynamic vision sensor, and obtain sampled event frames according to the event flow information;
    根据所述采样事件帧,获取事件次数阈值集合中至少两个事件次数阈值分别对应的像素点的数量,所述事件次数阈值对应的像素点为对应产生的事件次数大于或等于所述事件次数阈值的像素点;According to the sampled event frame, the number of pixels corresponding to at least two event number thresholds in the event number threshold set is obtained, and the pixel points corresponding to the event number threshold are correspondingly generated events greater than or equal to the event number threshold. pixel point;
    根据所述至少两个事件次数阈值分别对应的像素点的数量,确定目标事件次数阈值;Determine the target event number threshold according to the number of pixels corresponding to the at least two event number thresholds respectively;
    确定所述采样事件帧中对应产生的事件次数大于或等于所述目标事件次数阈值的目标像素点,并根据所述目标像素点确定运动物体的位置区域。Determine the target pixel points in the sampled event frame with the corresponding event times greater than or equal to the target event times threshold, and determine the position area of the moving object according to the target pixel points.
  2. 根据权利要求1所述的方法,其特征在于,在所述根据所述采样事件帧,获取事件次数阈值集合中至少两个事件次数阈值分别对应的像素点的数量之前,所述方法还包括:The method according to claim 1, characterized in that, before acquiring the number of pixels corresponding to at least two event number thresholds in the event number threshold set according to the sampled event frame, the method further comprises:
    根据所述采样事件帧,确定具有至少一个事件的备选像素点;According to the sampled event frame, determine a candidate pixel point having at least one event;
    根据所述备选像素点的数量,确定匹配的最大事件次数阈值,以确定所述事件次数阈值集合。According to the number of the candidate pixel points, a matching maximum event number threshold is determined to determine the event number threshold set.
  3. 根据权利要求2所述的方法,其特征在于,所述根据所述备选像素点的数量,确定匹配的最大事件次数阈值,包括:The method according to claim 2, wherein the determining a threshold of the maximum number of events matched according to the number of the candidate pixels, comprising:
    根据预先确定的备选像素点数量与最大事件次数阈值的对应关系,获取与所述备选像素点的数量匹配的最大事件次数阈值。According to the corresponding relationship between the predetermined number of candidate pixel points and the maximum number of events threshold, the maximum number of events threshold matching the number of candidate pixels is acquired.
  4. 根据权利要求1所述的方法,其特征在于,所述根据所述至少两个事件次数阈值分别对应的像素点的数量,确定目标事件次数阈值,包括:The method according to claim 1, wherein the determining the threshold for the number of target events according to the number of pixels corresponding to the at least two thresholds for the number of events comprises:
    根据所述至少两个事件次数阈值分别对应的像素点的数量,确定临界事件次数阈值,并将所述临界事件次数阈值作为所述目标事件次数阈值。A critical event number threshold is determined according to the number of pixels corresponding to the at least two event number thresholds respectively, and the critical event number threshold is used as the target event number threshold.
  5. 根据权利要求1所述的方法,其特征在于,所述根据所述至少两个事件次数阈值分别对应的像素点的数量,确定目标事件次数阈值,包括:The method according to claim 1, wherein the determining the threshold for the number of target events according to the number of pixels corresponding to the at least two thresholds for the number of events comprises:
    根据所述至少两个事件次数阈值分别对应的像素点的数量,确定临界事件次数阈值;Determine the critical event number threshold according to the number of pixels corresponding to the at least two event number thresholds respectively;
    根据所述事件次数阈值集合中的最大事件次数阈值和所述临界事件次数阈值,确定中间事件次数阈值,并将所述中间事件次数阈值作为所述目标事件次数阈值。An intermediate event number threshold is determined according to the maximum event number threshold and the critical event number threshold in the event number threshold set, and the intermediate event number threshold is used as the target event number threshold.
  6. 根据权利要求4或5所述的方法,其特征在于,所述根据所述至少两个事件次数阈值分别对应的像素点的数量,确定临界事件次数阈值,包括:The method according to claim 4 or 5, wherein the determining the threshold of the number of critical events according to the number of pixels corresponding to the at least two thresholds of the number of events, comprises:
    将所述事件次数阈值集合中每相邻两个事件次数阈值分别对应的像素点的数量进行差值运算,并获取差值结果;Perform a difference operation on the number of pixels corresponding to every two adjacent event times thresholds in the event number threshold set, and obtain a difference result;
    在各所述差值结果中选择数值最大的目标差值结果,并将与所述目标差值结果对应的两个事件次数阈值中的较大值,作为所述临界事件次数阈值。The target difference result with the largest value is selected from each of the difference results, and the larger value of the two event times thresholds corresponding to the target difference result is used as the critical event times threshold.
  7. 根据权利要求4或5所述的方法,其特征在于,所述根据所述至少两个事件次数阈值分别对应的像素点的数量,确定临界事件次数阈值,包括:The method according to claim 4 or 5, wherein the determining the threshold of the number of critical events according to the number of pixels corresponding to the at least two thresholds of the number of events, comprises:
    若获取到所述事件次数阈值集合中任意相邻的两个事件次数阈值分别对应的像素点的数量的差值运算结果大于等于预设数量阈值时,则将所述相邻的两个事件次数阈值中的较大值作为所述临界事件次数阈值;或者,If the difference calculation result of the number of pixels corresponding to any two adjacent event number thresholds in the event number threshold set is greater than or equal to the preset number threshold, the adjacent two event number thresholds are set to The larger value of the thresholds is used as the threshold for the number of critical events; or,
    若获取到所述事件次数阈值集合中任意相邻的两个事件次数阈值分别对应的像素点的数量的差值运算结果,与所述采样事件帧的像素点总数之间的比值,大于等于预设百分比阈值时,则将所述相邻的两个事件次数阈值中的较大值作为所述临界事件次数阈值。If the difference calculation result of the number of pixels corresponding to any two adjacent event number thresholds in the event number threshold set is obtained, and the ratio between the total number of pixels in the sampled event frame, is greater than or equal to the predetermined value. When the percentage threshold is set, the larger value of the two adjacent event number thresholds is used as the critical event number threshold.
  8. 根据权利要求1所述的方法,其特征在于,所述根据所述目标像素点确定运动物体的位置区域,包括:The method according to claim 1, wherein the determining the position area of the moving object according to the target pixel point comprises:
    根据所述目标像素点,通过感兴趣区域框标注运动物体的位置区域。According to the target pixel point, the position area of the moving object is marked by the area of interest frame.
  9. 根据权利要求2所述的方法,其特征在于,在所述根据所述采样事件帧,确定具有至少一个事件的备选像素点之后,所述方法还包括:The method according to claim 2, wherein after determining the candidate pixel point having at least one event according to the sampled event frame, the method further comprises:
    对所述采样事件帧中所述备选像素点所在的区域,进行侧抑制处理;Perform side suppression processing on the area where the candidate pixel points are located in the sampling event frame;
    所述根据所述采样事件帧,获取事件次数阈值集合中至少两个事件次数阈值分别对应的像素点的数量,包括:According to the sampled event frame, acquiring the number of pixels corresponding to at least two event number thresholds in the event number threshold set, including:
    根据侧抑制处理后的所述采样事件帧,获取所述至少两个事件次数阈值分别对应的像素点的数量。According to the sampled event frame after the side suppression processing, the number of pixels corresponding to the at least two event times thresholds respectively is acquired.
  10. 根据权利要求1所述的方法,其特征在于,在所述根据所述目标像素点确定运动物体的位置区域之后,所述方法还包括:The method according to claim 1, characterized in that after determining the position area of the moving object according to the target pixel, the method further comprises:
    根据多个所述采样事件帧中运动物体的位置区域,确定所述运动物体的移动轨迹,并通过训练完成的图像分类模型,判断所述移动轨迹是否为目标轨迹。Determine the moving trajectory of the moving object according to the position areas of the moving objects in the plurality of sampling event frames, and determine whether the moving trajectory is the target trajectory through the image classification model that has been trained.
  11. 一种运动物体的定位装置,其特征在于,包括:A positioning device for a moving object, comprising:
    事件帧获取模块,用于通过动态视觉传感器获取事件流信息,并根据所述事件流信息获取采样事件帧;an event frame acquisition module, used for acquiring event stream information through a dynamic vision sensor, and acquiring sampling event frames according to the event stream information;
    阈值获取模块,用于根据所述采样事件帧,获取事件次数阈值集合中至少两个事件次数阈值分别对应的像素点的数量,所述事件次数阈值对应的像素点为对应产生的事件 次数大于或等于所述事件次数阈值的像素点;根据所述至少两个事件次数阈值分别对应的像素点的数量,确定目标事件次数阈值;A threshold acquisition module, configured to acquire, according to the sampled event frame, the number of pixels corresponding to at least two event number thresholds in the event number threshold set respectively, where the pixel points corresponding to the event number threshold are correspondingly generated event numbers greater than or Pixels equal to the threshold for the number of events; determine the threshold for the number of events of interest according to the number of pixels corresponding to the at least two thresholds for the number of events;
    位置区域获取模块,用于确定所述采样事件帧中对应产生的事件次数大于或等于所述目标事件次数阈值的目标像素点,并根据所述目标像素点确定运动物体的位置区域。A location area acquisition module, configured to determine a target pixel point in the sampled event frame whose number of events is greater than or equal to the target event number threshold, and determine a location area of a moving object according to the target pixel point.
  12. 一种电子设备,其特征在于,所述电子设备包括:An electronic device, characterized in that the electronic device comprises:
    一个或多个处理器;one or more processors;
    存储器,用于存储一个或多个程序,memory for storing one or more programs,
    当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现如权利要求1-10中任一所述的运动物体的定位方法。When the one or more programs are executed by the one or more processors, the one or more processors implement the method for locating a moving object according to any one of claims 1-10.
  13. 一种计算机可读存储介质,其上存储有计算机程序,其特征在于,该程序被处理器执行时实现如权利要求1-10中任一所述的运动物体的定位方法。A computer-readable storage medium on which a computer program is stored, characterized in that, when the program is executed by a processor, the method for locating a moving object according to any one of claims 1-10 is implemented.
PCT/CN2022/079340 2021-03-23 2022-03-04 Moving object positioning method and apparatus, electronic device, and storage medium WO2022199360A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110309463.6A CN113012200B (en) 2021-03-23 2021-03-23 Method and device for positioning moving object, electronic equipment and storage medium
CN202110309463.6 2021-03-23

Publications (1)

Publication Number Publication Date
WO2022199360A1 true WO2022199360A1 (en) 2022-09-29

Family

ID=76405564

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/079340 WO2022199360A1 (en) 2021-03-23 2022-03-04 Moving object positioning method and apparatus, electronic device, and storage medium

Country Status (2)

Country Link
CN (1) CN113012200B (en)
WO (1) WO2022199360A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115457484A (en) * 2022-11-10 2022-12-09 梁山华鲁专用汽车制造有限公司 Control method and device for automatic unloading of semitrailer
CN116095914A (en) * 2023-04-10 2023-05-09 同方德诚(山东)科技股份公司 Intelligent building illumination adjusting method and system based on big data

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113012200B (en) * 2021-03-23 2023-01-13 北京灵汐科技有限公司 Method and device for positioning moving object, electronic equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180098082A1 (en) * 2016-09-30 2018-04-05 Intel Corporation Motion estimation using hybrid video imaging system
WO2019117247A1 (en) * 2017-12-14 2019-06-20 オムロン株式会社 Pupil detection device and detection system
CN112153309A (en) * 2019-06-26 2020-12-29 三星电子株式会社 Vision sensor, image processing apparatus, and method of operating vision sensor
CN112399032A (en) * 2019-08-13 2021-02-23 天津大学青岛海洋技术研究院 Optical flow acquisition method of pulse type image sensor based on detector
CN113012200A (en) * 2021-03-23 2021-06-22 北京灵汐科技有限公司 Method and device for positioning moving object, electronic equipment and storage medium

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA3031040C (en) * 2015-07-16 2021-02-16 Blast Motion Inc. Multi-sensor event correlation system
KR102308435B1 (en) * 2017-01-31 2021-10-05 삼성전자 주식회사 Apparatus and method for managing the object in wireless communication system
JP6912324B2 (en) * 2017-08-30 2021-08-04 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカPanasonic Intellectual Property Corporation of America Information processing method, information processing device and information processing program
US20190188111A1 (en) * 2019-02-26 2019-06-20 Intel Corporation Methods and apparatus to improve performance data collection of a high performance computing application
KR20200115881A (en) * 2019-03-28 2020-10-08 삼성전자주식회사 Dynamic vision sensor configured to calibrate event signals using optical black region and method of operating the same
US10867495B1 (en) * 2019-09-11 2020-12-15 Motorola Solutions, Inc. Device and method for adjusting an amount of video analytics data reported by video capturing devices deployed in a given location
CN110942011B (en) * 2019-11-18 2021-02-02 上海极链网络科技有限公司 Video event identification method, system, electronic equipment and medium
CN112037266B (en) * 2020-11-05 2021-02-05 北京软通智慧城市科技有限公司 Falling object identification method and device, terminal equipment and storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180098082A1 (en) * 2016-09-30 2018-04-05 Intel Corporation Motion estimation using hybrid video imaging system
WO2019117247A1 (en) * 2017-12-14 2019-06-20 オムロン株式会社 Pupil detection device and detection system
CN112153309A (en) * 2019-06-26 2020-12-29 三星电子株式会社 Vision sensor, image processing apparatus, and method of operating vision sensor
CN112399032A (en) * 2019-08-13 2021-02-23 天津大学青岛海洋技术研究院 Optical flow acquisition method of pulse type image sensor based on detector
CN113012200A (en) * 2021-03-23 2021-06-22 北京灵汐科技有限公司 Method and device for positioning moving object, electronic equipment and storage medium

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115457484A (en) * 2022-11-10 2022-12-09 梁山华鲁专用汽车制造有限公司 Control method and device for automatic unloading of semitrailer
CN116095914A (en) * 2023-04-10 2023-05-09 同方德诚(山东)科技股份公司 Intelligent building illumination adjusting method and system based on big data
CN116095914B (en) * 2023-04-10 2023-08-25 同方德诚(山东)科技股份公司 Intelligent building illumination adjusting method and system based on big data

Also Published As

Publication number Publication date
CN113012200A (en) 2021-06-22
CN113012200B (en) 2023-01-13

Similar Documents

Publication Publication Date Title
US20220383535A1 (en) Object Tracking Method and Device, Electronic Device, and Computer-Readable Storage Medium
WO2022135511A1 (en) Method and apparatus for positioning moving object, and electronic device and storage medium
WO2022199360A1 (en) Moving object positioning method and apparatus, electronic device, and storage medium
US11643076B2 (en) Forward collision control method and apparatus, electronic device, program, and medium
US11182592B2 (en) Target object recognition method and apparatus, storage medium, and electronic device
US11430265B2 (en) Video-based human behavior recognition method, apparatus, device and storage medium
WO2021103868A1 (en) Method for structuring pedestrian information, device, apparatus and storage medium
WO2021031954A1 (en) Object quantity determination method and apparatus, and storage medium and electronic device
CN113378770B (en) Gesture recognition method, device, equipment and storage medium
Liu et al. Real-time facial expression recognition based on cnn
CN111226226A (en) Motion-based object detection method, object detection device and electronic equipment
KR20220126264A (en) Video jitter detection method and device, electronic equipment and storage medium
CN111783639A (en) Image detection method and device, electronic equipment and readable storage medium
JP2020009442A (en) Systems, methods, and programs for real-time end-to-end capturing of ink strokes from video
CN110796108B (en) Method, device and equipment for detecting face quality and storage medium
CN109241942B (en) Image processing method and device, face recognition equipment and storage medium
CN114461078B (en) Man-machine interaction method based on artificial intelligence
CN110163032B (en) Face detection method and device
CN113762027B (en) Abnormal behavior identification method, device, equipment and storage medium
CN115457620A (en) User expression recognition method and device, computer equipment and storage medium
CN113762017B (en) Action recognition method, device, equipment and storage medium
CN114640807A (en) Video-based object counting method and device, electronic equipment and storage medium
CN114200934A (en) Robot target following control method and device, electronic equipment and storage medium
US11847823B2 (en) Object and keypoint detection system with low spatial jitter, low latency and low power usage
CN111061367B (en) Method for realizing gesture mouse of self-service equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22774022

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 13-02-2024)