WO2020151172A1 - Moving object detection method and apparatus, computer device, and storage medium - Google Patents

Moving object detection method and apparatus, computer device, and storage medium Download PDF

Info

Publication number
WO2020151172A1
WO2020151172A1 PCT/CN2019/091905 CN2019091905W WO2020151172A1 WO 2020151172 A1 WO2020151172 A1 WO 2020151172A1 CN 2019091905 W CN2019091905 W CN 2019091905W WO 2020151172 A1 WO2020151172 A1 WO 2020151172A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
real
moving target
time video
bounding box
Prior art date
Application number
PCT/CN2019/091905
Other languages
French (fr)
Chinese (zh)
Inventor
王健宗
彭俊清
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2020151172A1 publication Critical patent/WO2020151172A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition

Definitions

  • This application relates to the field of image recognition technology, and in particular to a moving target detection method, device, computer equipment and storage medium.
  • This application provides a moving target detection method, device, computer equipment and storage medium to improve the detection speed and accuracy of moving targets.
  • this application provides a method for detecting a moving target, the method including:
  • this application also provides a moving target detection device, the device including:
  • An obtaining and determining unit configured to obtain real-time video, and determine the moving target in the real-time video
  • An information extraction unit configured to extract a bounding box of the moving target and data information corresponding to the bounding box, the data information including position information and size information of the bounding box in the real-time video;
  • a recognition detection unit configured to input the image in the bounding box into a pre-trained target recognition model for recognition and detection according to the data information, so as to output a classification category corresponding to the moving target;
  • the target labeling unit is configured to label the moving target in the real-time video recording according to the classification category.
  • the present application also provides a computer device, the computer device includes a memory and a processor; the memory is used to store a computer program; the processor is used to execute the computer program and when executing the computer program The computer program realizes the above-mentioned moving target detection method.
  • the present application also provides a computer-readable storage medium that stores a computer program, and when the computer program is executed by a processor, the processor realizes the above-mentioned moving target detection method.
  • This application discloses a moving object detection method, device, equipment and storage medium, which can quickly identify and classify moving objects, such as identifying car logos and car models corresponding to moving vehicles, etc., which can reduce the amount of calculation when identifying and classifying, thereby Provides the recognition efficiency of moving targets and is suitable for real-time detection and recognition.
  • FIG. 1 is a schematic flowchart of a method for training a target recognition model provided by an embodiment of the present application
  • FIG. 2 is a schematic diagram of an application scenario of a moving target detection method provided by an embodiment of the present application
  • FIG. 3 is a schematic flowchart of a moving target detection method provided by an embodiment of the present application.
  • FIG. 4 is a schematic flowchart of sub-steps of the moving target detection method in FIG. 3;
  • FIG. 5 is a schematic flowchart of steps for determining a moving target provided by an embodiment of the present application
  • FIG. 6 is a schematic block diagram of a model training device provided by an embodiment of the application.
  • FIG. 7 is a schematic block diagram of a moving target detection device provided by an embodiment of the application.
  • FIG. 8 is a schematic block diagram of another moving target detection device provided by an embodiment of the application.
  • FIG. 9 is a schematic block diagram of the structure of a computer device according to an embodiment of the application.
  • the embodiments of the application provide a moving target detection method, device, computer equipment, and storage medium.
  • the moving target detection method can be applied to a terminal or a server to quickly and accurately identify the classification information of the moving target.
  • the moving target detection method is used to identify and classify moving vehicles on the road, and of course it can be used to identify other moving targets, such as non-motorized vehicles, animals, or pedestrians.
  • the following embodiments will take a moving vehicle as a moving target for detailed introduction.
  • FIG. 1 is a schematic flowchart of a method for training a target recognition model provided by an embodiment of the present application.
  • the target recognition model is obtained by model training based on a convolutional neural network.
  • a convolutional neural network Of course, other networks can also be used for training.
  • GoogLeNet is used for model training to obtain the target recognition model.
  • other networks may also be used, such as AlexNet or VGGNet.
  • GoogLeNet is used for model training to obtain the target recognition model.
  • the training method of the target recognition model is used to train the target recognition model for application in the moving target detection method.
  • the training method includes step S101 to step S105.
  • the target pictures are pictures of multiple target objects taken from different angles.
  • the target object is a vehicle, including vehicles of different models under the same vehicle label. Of course, it may also be a non-motorized vehicle, a pedestrian, or an animal. Selecting a vehicle includes selecting cars with different logos and models, and taking pictures taken from different angles of the car as the target picture.
  • the target picture constitutes a picture set for training the target recognition model.
  • S102 Mark the target picture according to the category identifier corresponding to the category category.
  • the classification category includes vehicle logo and vehicle model
  • the corresponding category identification includes vehicle logo identification and vehicle model identification.
  • the car logo includes: Ferrari, Lamborghini, Bentley, Aston Martin, Mercedes-Benz, BMW, Audi, Chevrolet, Volkswagen or BYD, etc.
  • model logos include: small cars, mini cars, compact cars, medium cars, high-end cars , luxury models, sedan models or SUV models.
  • the target pictures are marked according to the vehicle logo identifier and the vehicle type identifier corresponding to the classification category, so that each target picture has marking information, that is, each target picture includes the vehicle logo and the vehicle model.
  • sample data in order to quickly train the target recognition model, after marking each target picture, sample data can be constructed, and step S105 is executed according to the constructed sample data to perform model training.
  • S103 Perform an image processing operation on the target picture to change the picture parameters of the target picture, and use the target picture whose picture parameters are changed as a new target picture.
  • image processing operations include: size adjustment, cropping, rotation, image algorithm processing, etc.
  • image algorithm processing includes: color temperature adjustment algorithm, exposure adjustment algorithm, contrast adjustment algorithm, highlight recovery algorithm, low light compensation algorithm, white balance Algorithm, adjustment of definition algorithm, fogging algorithm index, adjustment of natural saturation algorithm.
  • the picture parameters include size information, pixel size, color temperature parameters, exposure, contrast, white balance, sharpness, fogging parameters, and natural saturation.
  • performing an image processing operation on the target picture to change the picture parameters of the target picture, and using the target picture whose picture parameters are changed as a new target picture refers to performing the aforementioned multiple image processing operations on the target picture respectively One or more of them are combined to change the picture parameters of the target picture.
  • the diversity of the samples is increased, and the samples are more representative of the real environment, thereby improving the recognition accuracy of the model.
  • the target picture whose picture parameters are changed is saved as a new target picture, and the new target picture and the original target picture are combined to form sample data. This increases the number of samples and at the same time increases the diversity of samples.
  • S105 Based on the convolutional neural network, perform model training according to the sample data to obtain a target recognition model, and use the obtained target recognition model as a pre-trained target recognition model.
  • the constructed sample data is used for model training through GoogLeNet.
  • GoogLeNet Specifically, directional propagation training can be used.
  • the convolutional layer and pooling layer of GoogLeNet are used to extract features from the input sample data, and the fully connected layer is used as a classifier.
  • the output of this classifier is the probability value of different car logos and models.
  • the convolutional neural network takes the trained sample data as input and goes through the forward propagation step (convolution, ReLU activation and pooling operations to forward propagation in the fully connected layer) , And finally get the output probability of each category.
  • the terminal can be an electronic device such as a mobile phone, a tablet computer, a notebook computer, a desktop computer, a personal digital assistant, and a wearable device
  • the server can be an independent server or a server cluster.
  • the compression processing specifically includes pruning processing, quantization processing, and Huffman encoding processing on the target recognition model, etc., to reduce the size of the target recognition model, and thereby facilitate storage in a terminal with a smaller capacity.
  • the training method provided by the above-mentioned embodiments uses image processing operations to process the target pictures to increase the diversity of sample data by shooting target pictures with multiple target objects at different angles; based on the convolutional neural network, the training is performed according to the constructed sample data Model training is used to obtain a target recognition model, and the obtained target recognition model is used as a pre-trained target recognition model in the moving target recognition method, thereby improving the recognition accuracy of the moving target.
  • FIG. 2 is a schematic diagram of an application scenario of the moving target detection method provided by an embodiment of the present application.
  • This application scenario includes servers, terminals, and traffic monitoring equipment, and traffic monitoring equipment includes cameras.
  • the server is used to train the target recognition model, and save the trained target recognition model in the terminal or save it after compression;
  • the camera is used to collect real-time video of moving vehicles on the traffic road, and send the collected real-time video to the terminal ;
  • the terminal is used to implement the moving target detection method to identify the category of the detected moving vehicle.
  • FIG. 3 is a schematic flowchart of a method for detecting a moving target provided by an embodiment of the present application.
  • the moving object detection method can be applied to a terminal or a server, and quickly identify the category of the detected moving object from the real-time video with a small amount of calculation.
  • the moving target detection method specifically includes steps S201 to S204, which will be described in detail below in conjunction with FIG. 2.
  • real-time video recording is, for example, a camera in a traffic monitoring device that captures a video of a moving vehicle on a traffic road in real time.
  • determine the moving target in the real-time video recording such as a moving vehicle
  • determine the moving target in the real-time video recording such as a moving vehicle
  • use the inter-frame difference method to detect the real-time video to determine the moving vehicle.
  • other detection methods can also be used, such as image recognition to determine the moving vehicle.
  • Shape recognition of moving vehicles in real-time video can also be used, such as image recognition to determine the moving vehicle.
  • S202 Extract a bounding box of the moving target and data information corresponding to the bounding box.
  • the data information includes position information and size information of the bounding box in the real-time video recording. Extracting the bounding box of the moving target and the data information corresponding to the bounding box includes: determining the bounding box of the video frame image of the moving target in real-time recording; extracting the position of the bounding box in the real-time recording Information and size information.
  • step S202 includes sub-steps S202a and S202b.
  • S202a Determine a bounding box corresponding to the moving target according to the horizontal bandwidth and vertical length of the moving target in real-time video recording;
  • S202b Extract the horizontal bandwidth and vertical length as the size information, and the bounding box As the position information.
  • the corresponding bounding box is determined according to the maximum horizontal bandwidth and vertical length of the moving target in real-time recording; and the maximum horizontal bandwidth and vertical length are extracted as size information, and the center coordinate value of the bounding box is obtained as According to the position information, the size and position information of the bounding box can be obtained, and the size and position information of the bounding box is the data information corresponding to the bounding box.
  • a frame of image in real-time recording may include multiple moving targets, such as multiple moving vehicles, each moving vehicle corresponds to a bounding box, so the real-time recording video frame may correspond to multiple Bounding box.
  • the image in the bounding box can be determined according to the data information of the bounding box, and then the image in the bounding box is input to a pre-trained target recognition model for prediction, so as to output the classification category corresponding to the moving target.
  • the target recognition model may recognize that the classification category of the moving vehicle includes information such as car logo and model. Specifically, as shown in Figure 2, the predicted logo and model of the sports vehicle are Audi And the car.
  • S204 Mark the moving target in the real-time video recording according to the classification category.
  • marking the moving targets in the real-time recording according to the classification category includes displaying the classification category output by the model at the moving target in the real-time recording.
  • the bounding box can also be displayed in the real-time video, and then the classification category can be displayed in the bounding box.
  • other labeling methods may also be used to label the moving target in the real-time video recording. Therefore, by marking the moving target, it is convenient for the user to locate or track the moving vehicle.
  • each moving target needs to be marked separately for the user to recognize.
  • the method for recognizing moving objects can quickly recognize and classify moving objects, such as recognizing car logos and car models corresponding to moving vehicles. Specifically, after determining the moving target in real-time video; extracting the bounding box of the moving target and the data information corresponding to the bounding box; determining the image in the bounding box according to the data information corresponding to the bounding box, and then inputting the image in the bounding box To the pre-trained target recognition model to output the classification category of the moving target. This realizes the recognition and classification of moving targets in real-time video. This method can reduce the amount of calculation during classification, thereby improving the recognition efficiency of moving targets, and is suitable for real-time detection and recognition.
  • FIG. 5 is a schematic flowchart of steps for determining a moving target provided by an embodiment of the present application.
  • the steps of determining the moving target specifically include the following:
  • S301 Determine a current frame image from the real-time video recording, and use the current frame image as a reference image.
  • the current frame image is determined from the real-time recording, and the corresponding video picture can be selected as the current frame image according to the user in the real-time recording. For example, when the real-time video is played, the user clicks to select the currently played video, and the video frame selected by the user can be used as the current frame image. Of course, the user can also specify the corresponding video frame as the current frame image.
  • the determined current frame image is taken as the reference image, and the reference image is expressed as f k (i, j), where k represents the current frame image of the k-th video frame in the real-time recorded image sequence, where k is a positive integer, (i, j) are expressed as discrete image coordinates in the video frame.
  • the moving speed of the moving target can be determined first, and then the corresponding preset number of frames is selected according to the moving speed, where different moving speeds correspond to different numbers of presets The number of frames.
  • the movement speed is a range value, of course, it can also be a specific value.
  • the movement speed range value is, for example, 90 to 110km/h; the specific movement speed value is, for example, 100km/h.
  • the moving speed of the moving target to be determined may be measured by a speed measuring instrument, such as a laser speedometer.
  • a speed measuring instrument such as a laser speedometer.
  • the moving speed of the moving target can also be calculated based on two images with a certain number of frames in the interval.
  • the speed and accuracy of moving target recognition are improved.
  • the moving speed of the moving target to be determined can be determined according to the environmental parameters of the moving target.
  • the number of delayed preset frames is set according to the motion speed. For example, a vehicle in the leftmost lane on an expressway moves faster, and its corresponding delay preset number of frames is less. For example, set the preset number of frames to delay 1 or 2 frames; on an expressway For vehicles in the middle lane, the speed is also relatively fast. Set the default frame number to 4 or 5 frames later; for vehicles in the rightmost lane on the expressway, the speed is relatively fast, so set the default frame number to delay 7 frames or 8 frames; the speed of vehicles on urban roads is relatively slow, and the corresponding delay preset frame number can be set to a larger number of frames, such as 9 or 10 frames.
  • the preset frame number corresponding to the acquired motion speed range is determined, which can be changed according to the actual situation of the moving target, thereby quickly and accurately determining real-time recording Sports goals in.
  • the moving speed of the moving vehicle is approximately 110km/h or more, and the obtained moving speed is determined according to the preset correspondence between the moving speed range and the preset number of frames
  • the preset number of frames corresponding to the range is specifically 2 frames.
  • the reference image is expressed as f k (i, j).
  • the predetermined number of frames is determined to be 2 frames, and then an image that is 2 frames behind the reference image can be extracted
  • the delayed frame image is expressed as f k+2 (i, j).
  • the deferred frame image and the current frame image are subtracted by a difference method to obtain a difference image, and the difference image is expressed as:
  • D k represents a differential image
  • f k (i, j) represents a reference image
  • f k+2 (i, j) represents a delayed frame image
  • (i, j) represents a discrete image coordinate
  • S306 Perform threshold processing on the difference image to obtain a binary image corresponding to the difference image.
  • the performing threshold processing on the differential image to obtain the binary image corresponding to the differential image includes: determining pixels in the differential image with pixel values greater than a preset threshold; The pixel points determine the binary image corresponding to the difference image.
  • S k (i, j) represents a binary image
  • T is a preset threshold
  • (i, j) represents the coordinates of a discrete image
  • D k represents a differential image
  • greater than or equal to the preset threshold is represented as 1, and less than the preset threshold.
  • the determining the moving target in the real-time video recording according to the binary image includes: setting the area corresponding to S k (i, j) of 1 in the binary image as the moving area; passing through the moving area Morphological processing and connectivity analysis remove noise to determine the moving target in the real-time video.
  • the area corresponding to S k (i, j) of 1 in the binary image is set as the motion area, and then the motion area is processed by morphological processing and connectivity analysis to remove noise, so as to obtain effective motion aims.
  • FIG. 6 is a schematic block diagram of a model training device provided by an embodiment of the present application.
  • the model training device may be configured in a server and used to execute the aforementioned target recognition model training method.
  • the model training device 400 includes: a picture acquisition unit 401, a picture labeling unit 402, a parameter changing unit 403, a data construction unit 404, and a model training unit 405.
  • the picture acquiring unit 401 is configured to acquire a target picture, where the target picture is a picture of multiple target objects taken from different angles.
  • the picture marking unit 402 is configured to mark the target picture according to the category identifier corresponding to the classification category.
  • the parameter changing unit 403 is configured to perform an image processing operation on the target picture to change the picture parameters of the target picture, and use the target picture whose picture parameters are changed as a new target picture.
  • the image processing operations include: size adjustment, cropping processing, rotation processing, image algorithm processing, etc.
  • the image algorithm processing includes: color temperature adjustment algorithm, exposure adjustment algorithm, contrast adjustment algorithm, highlight restoration algorithm, low light compensation algorithm , White balance algorithm, sharpness adjustment algorithm, fogging algorithm index, natural saturation adjustment algorithm.
  • the data construction unit 404 is configured to construct sample data according to the new target picture and the target picture.
  • the model training unit 405 is configured to perform model training according to the sample data based on the convolutional neural network to obtain a target recognition model, and use the obtained target recognition model as a pre-trained target recognition model.
  • FIG. 7 is a schematic block diagram of a moving target detection device provided in an embodiment of the present application, and the moving target detection device is used to execute the aforementioned moving target detection method.
  • the moving target detection device can be configured in a server or a terminal.
  • the moving target detection device 500 includes: an acquisition and determination unit 501, an information extraction unit 502, an identification and detection unit 503, and a target labeling unit 504.
  • the obtaining and determining unit 501 is configured to obtain real-time video and determine the moving target in the real-time video.
  • the information extraction unit 502 is configured to extract a bounding box of the moving target and data information corresponding to the bounding box, the data information including position information and size information of the bounding box in the real-time video recording.
  • the information extraction unit 502 is specifically configured to determine the bounding box corresponding to the moving target according to the horizontal broadband and vertical length of the moving target in real-time video recording; extract the horizontal broadband and vertical length as the size Information, and the center coordinates of the bounding box as the position information.
  • the recognition and detection unit 503 is configured to input the image in the bounding box into a pre-trained target recognition model for recognition and detection according to the data information, so as to output the classification category corresponding to the moving target;
  • the target labeling unit 504 is configured to label the moving target in the real-time video recording according to the classification category.
  • the acquisition and determination unit 501 includes: a reference determination unit 5011, a speed determination unit 5012, a frame number determination unit 5013, an image extraction unit 5014, an image subtraction unit 5015, and an image processing unit 5016.
  • the reference determining unit 5011 is configured to determine a current frame image from the real-time video recording, and use the current frame image as a reference image.
  • the speed determining unit 5012 is used to obtain the moving speed of the moving target to be determined, where different moving speeds correspond to different numbers of preset frames.
  • the frame number determining unit 5013 is configured to determine the preset frame number corresponding to the acquired motion speed range according to the preset correspondence between the motion speed range and the preset frame number.
  • the image extraction unit 5014 is configured to extract a delayed frame image that is delayed by a preset number of frames relative to the reference image.
  • the image subtraction unit 5015 is configured to subtract the delayed frame image and the current frame image to obtain a difference image.
  • the image processing unit 5016 is configured to perform threshold processing on the difference image to obtain a binary image corresponding to the difference image.
  • the above-mentioned apparatus may be implemented in the form of a computer program, and the computer program may run on the computer device as shown in FIG. 9.
  • FIG. 9 is a schematic block diagram of the structure of a computer device according to an embodiment of the present application.
  • the computer equipment can be a server or a terminal.
  • the computer device includes a processor, a memory, and a network interface connected through a system bus, where the memory may include a non-volatile storage medium and an internal memory.
  • the non-volatile storage medium can store an operating system and a computer program.
  • the computer program includes program instructions, and when the program instructions are executed, the processor can execute any moving target detection method.
  • the processor is used to provide calculation and control capabilities and support the operation of the entire computer equipment.
  • the internal memory provides an environment for the running of the computer program in the non-volatile storage medium.
  • the processor can execute any method for detecting moving objects.
  • the network interface is used for network communication, such as sending assigned tasks.
  • the network interface is used for network communication, such as sending assigned tasks.
  • FIG. 9 is only a block diagram of part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied.
  • the specific computer device may Including more or less parts than shown in the figure, or combining some parts, or having a different part arrangement.
  • the processor may be a central processing unit (Central Processing Unit, CPU), and the processor may also be other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), and application specific integrated circuits (Application Specific Integrated Circuits). Circuit, ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components, etc.
  • the general-purpose processor may be a microprocessor or the processor may also be any conventional processor.
  • the embodiments of the present application also provide a computer-readable storage medium, the computer-readable storage medium stores a computer program, the computer program includes program instructions, and the processor executes the program instructions to implement the present application Any of the moving target detection methods provided by the embodiments.
  • the computer-readable storage medium may be the internal storage unit of the computer device described in the foregoing embodiment, such as the hard disk or memory of the computer device.
  • the computer-readable storage medium may also be an external storage device of the computer device, such as a plug-in hard disk, a smart media card (SMC), or a secure digital (Secure Digital, SD) equipped on the computer device. ) Card, Flash Card, etc.

Abstract

A moving object detection method and apparatus, a device, and a storage medium. The method comprises: acquiring a real-time video, and firstly determining a moving object in the real-time video; extracting a bounding box of the moving object and data information corresponding to the bounding box; inputting the image in the bounding box into a pre-trained target recognition model according to the data information to carry out recognition detection so as to obtain a classification category corresponding to the moving object; and labeling the moving object in the real-time video according to the classification category.

Description

运动目标检测方法、装置、计算机设备及存储介质Moving target detection method, device, computer equipment and storage medium
本申请要求于2019年1月23日提交中国专利局、申请号为201910065021.4、发明名称为“运动目标检测方法、装置、计算机设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on January 23, 2019, the application number is 201910065021.4, and the invention title is "moving target detection method, device, computer equipment and storage medium", the entire content of which is incorporated by reference Incorporated in this application.
技术领域Technical field
本申请涉及图像识别技术领域,尤其涉及一种运动目标检测方法、装置、计算机设备及存储介质。This application relates to the field of image recognition technology, and in particular to a moving target detection method, device, computer equipment and storage medium.
背景技术Background technique
在传统的目标检测方法中,需要通过图片或视频放入神经网络的卷积层中进行卷积运算,再进行分割并逐个寻找检测目标,该过程通过遍历整张图片找到相关目标,这样的方法比较耗费算力,在一些实际场景应用中,比如交通监测的场景下,在检测车辆的过程中,一般都是监测实时视频,对效率的要求非常高,而传统的目标检测方法很难得做到这一点。因此,有必要提供一种运动目标检测方法以解决上述问题。In the traditional target detection method, it is necessary to put pictures or videos into the convolutional layer of the neural network for convolution operation, and then segment and find the detection targets one by one. This process finds the relevant targets by traversing the entire picture. This method It consumes more computing power. In some practical scenarios, such as traffic monitoring scenarios, in the process of detecting vehicles, real-time video is generally monitored, which requires very high efficiency, while traditional target detection methods are difficult to achieve at this point. Therefore, it is necessary to provide a moving target detection method to solve the above problems.
发明内容Summary of the invention
本申请提供了一种运动目标检测方法、装置、计算机设备及存储介质,以提高运动目标的检测速度和准确性。This application provides a moving target detection method, device, computer equipment and storage medium to improve the detection speed and accuracy of moving targets.
第一方面,本申请提供了一种运动目标检测方法,所述方法包括:In the first aspect, this application provides a method for detecting a moving target, the method including:
获取实时录像,确定所述实时录像中的运动目标;Acquiring real-time video, and determining the moving target in the real-time video;
提取所述运动目标的边界框以及所述边界框对应的数据信息,所述数据信息包括所述边界框在所述实时录像中的位置信息和尺寸信息;Extracting a bounding box of the moving target and data information corresponding to the bounding box, where the data information includes position information and size information of the bounding box in the real-time video recording;
根据所述数据信息将所述边界框中的图像输入至预先训练的目标识别模型进行识别检测,以输出所述运动目标对应的分类类别;Inputting the image in the bounding box to a pre-trained target recognition model for recognition and detection according to the data information, so as to output a classification category corresponding to the moving target;
根据所述分类类别对所述实时录像中的运动目标进行标注。Marking the moving target in the real-time video recording according to the classification category.
第二方面,本申请还提供了一种运动目标检测装置,所述装置包括:In the second aspect, this application also provides a moving target detection device, the device including:
获取确定单元,用于获取实时录像,确定所述实时录像中的运动目标;An obtaining and determining unit, configured to obtain real-time video, and determine the moving target in the real-time video;
信息提取单元,用于提取所述运动目标的边界框以及所述边界框对应的数据信息,所述数据信息包括所述边界框在所述实时录像中的位置信息和尺寸信息;An information extraction unit, configured to extract a bounding box of the moving target and data information corresponding to the bounding box, the data information including position information and size information of the bounding box in the real-time video;
识别检测单元,用于根据所述数据信息将所述边界框中的图像输入至预先训练的目标识别模型进行识别检测,以输出所述运动目标对应的分类类别;A recognition detection unit, configured to input the image in the bounding box into a pre-trained target recognition model for recognition and detection according to the data information, so as to output a classification category corresponding to the moving target;
目标标注单元,用于根据所述分类类别对所述实时录像中的运动目标进行标注。The target labeling unit is configured to label the moving target in the real-time video recording according to the classification category.
第三方面,本申请还提供了一种计算机设备,所述计算机设备包括存储器和处理器;所述存储器用于存储计算机程序;所述处理器,用于执行所述计算机程序并在执行所述计算机程序时实现如上述的运动目标检测方法。In the third aspect, the present application also provides a computer device, the computer device includes a memory and a processor; the memory is used to store a computer program; the processor is used to execute the computer program and when executing the computer program The computer program realizes the above-mentioned moving target detection method.
第四方面,本申请还提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时使所述处理器实现如上述的运动目标检测方法。In a fourth aspect, the present application also provides a computer-readable storage medium that stores a computer program, and when the computer program is executed by a processor, the processor realizes the above-mentioned moving target detection method.
本申请公开了一种运动目标检测方法、装置、设备及存储介质,可以快速对运动物体进行识别分类,比如识别运动车辆对应的车标和车型等,可以减小识别分类时的计算量,进而提供运动目标的识别效率,适用于实时检测识别。This application discloses a moving object detection method, device, equipment and storage medium, which can quickly identify and classify moving objects, such as identifying car logos and car models corresponding to moving vehicles, etc., which can reduce the amount of calculation when identifying and classifying, thereby Provides the recognition efficiency of moving targets and is suitable for real-time detection and recognition.
附图说明Description of the drawings
为了更清楚地说明本申请实施例技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly describe the technical solutions of the embodiments of the present application, the following will briefly introduce the drawings used in the description of the embodiments. Obviously, the drawings in the following description are some embodiments of the present application. Ordinary technical personnel can obtain other drawings based on these drawings without creative work.
图1是本申请的实施例提供的一种目标识别模型的训练方法的示意流程图;FIG. 1 is a schematic flowchart of a method for training a target recognition model provided by an embodiment of the present application;
图2是本申请的实施例提供的运动目标检测方法的应用场景示意图;FIG. 2 is a schematic diagram of an application scenario of a moving target detection method provided by an embodiment of the present application;
图3是本申请的实施例提供的一种运动目标检测方法的示意流程图;FIG. 3 is a schematic flowchart of a moving target detection method provided by an embodiment of the present application;
图4是图3中的运动目标检测方法的子步骤示意流程图;4 is a schematic flowchart of sub-steps of the moving target detection method in FIG. 3;
图5是本申请的实施例提供的确定运动目标的步骤示意流程图;FIG. 5 is a schematic flowchart of steps for determining a moving target provided by an embodiment of the present application;
图6为本申请实施例提供的一种模型训练装置的示意性框图;FIG. 6 is a schematic block diagram of a model training device provided by an embodiment of the application;
图7为本申请实施例提供的一种运动目标检测装置的示意性框图;FIG. 7 is a schematic block diagram of a moving target detection device provided by an embodiment of the application;
图8为本申请实施例提供的另一种运动目标检测装置的示意性框图;FIG. 8 is a schematic block diagram of another moving target detection device provided by an embodiment of the application;
图9为本申请一实施例提供的一种计算机设备的结构示意性框图。FIG. 9 is a schematic block diagram of the structure of a computer device according to an embodiment of the application.
具体实施方式detailed description
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The following will clearly and completely describe the technical solutions in the embodiments of the present application in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of the embodiments. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of this application.
附图中所示的流程图仅是示例说明,不是必须包括所有的内容和操作/步骤,也不是必须按所描述的顺序执行。例如,有的操作/步骤还可以分解、组合或部分合并,因此实际执行的顺序有可能根据实际情况改变。The flowchart shown in the drawings is only an example, and does not necessarily include all contents and operations/steps, nor does it have to be executed in the described order. For example, some operations/steps can also be decomposed, combined or partially combined, so the actual execution order may be changed according to actual conditions.
本申请的实施例提供了一种运动目标检测方法、装置、计算机设备及存储介质。其中,该运动目标检测方法可以应用于终端或服务器中,以快速准确地识别检测运动目标的分类信息。The embodiments of the application provide a moving target detection method, device, computer equipment, and storage medium. Among them, the moving target detection method can be applied to a terminal or a server to quickly and accurately identify the classification information of the moving target.
例如,运动目标检测方法用于对道路上运动车辆进行识别分类,当然可以用于对其他运动目标的识别,比如非机动车、动物或行人等。但为了便于理解,以下实施例将以运动车辆为运动目标进行详细介绍。For example, the moving target detection method is used to identify and classify moving vehicles on the road, and of course it can be used to identify other moving targets, such as non-motorized vehicles, animals, or pedestrians. However, for ease of understanding, the following embodiments will take a moving vehicle as a moving target for detailed introduction.
下面结合附图,对本申请的一些实施方式作详细说明。在不冲突的情况下,下述的实施例及实施例中的特征可以相互组合。In the following, some embodiments of the present application will be described in detail with reference to the accompanying drawings. In the case of no conflict, the following embodiments and features in the embodiments can be combined with each other.
请参阅图1,图1是本申请的实施例提供的一种目标识别模型的训练方法的示意流程图。该目标识别模型是基于卷积神经网络进行模型训练得到的,当然也可以采用其他网络进行训练得到。Please refer to FIG. 1. FIG. 1 is a schematic flowchart of a method for training a target recognition model provided by an embodiment of the present application. The target recognition model is obtained by model training based on a convolutional neural network. Of course, other networks can also be used for training.
需要说明的是,在本实施例中,使用GoogLeNet进行模型训练以得到目标识别模型,当然也可以采用其他网络,比如采用AlexNet或VGGNet等。以下将以GoogLeNet为例进行介绍。It should be noted that in this embodiment, GoogLeNet is used for model training to obtain the target recognition model. Of course, other networks may also be used, such as AlexNet or VGGNet. The following will introduce GoogLeNet as an example.
如图1所示,该目标识别模型的训练方法,用于训练出目标识别模型以便应用在运动目标检测方法上。其中,该训练方法包括步骤S101至步骤S105。As shown in Figure 1, the training method of the target recognition model is used to train the target recognition model for application in the moving target detection method. Wherein, the training method includes step S101 to step S105.
S101、获取目标图片。S101. Obtain a target picture.
其中,所述目标图片为从不同角度拍摄的多个目标物体的图片。在本实施例中,目标物体为车辆,包括相同车标下的不同车型的车辆,当然也可以为非机动车辆、行人或动物等。选取车辆包括选取不同车标、车型的汽车,并从汽 车的不同角度去拍摄的图片作为目标图片,该目标图片构成图片集,用于训练目标识别模型。Wherein, the target pictures are pictures of multiple target objects taken from different angles. In this embodiment, the target object is a vehicle, including vehicles of different models under the same vehicle label. Of course, it may also be a non-motorized vehicle, a pedestrian, or an animal. Selecting a vehicle includes selecting cars with different logos and models, and taking pictures taken from different angles of the car as the target picture. The target picture constitutes a picture set for training the target recognition model.
S102、根据分类类别对应的类别标识对所述目标图片进行标记。S102: Mark the target picture according to the category identifier corresponding to the category category.
其中,分类类别包括车标和车型等,对应的类别标识包括车标标识和车型标识。其中,车标标识包括:法拉利、兰博基尼、宾利、阿斯顿马丁、奔驰、宝马、奥迪、雪佛兰、大众或比亚迪等等;车型标识包括:小型车、微型车、紧凑车型、中等车型、高级车型、豪华车型、三厢车型或SUV车型。Among them, the classification category includes vehicle logo and vehicle model, and the corresponding category identification includes vehicle logo identification and vehicle model identification. Among them, the car logo includes: Ferrari, Lamborghini, Bentley, Aston Martin, Mercedes-Benz, BMW, Audi, Chevrolet, Volkswagen or BYD, etc.; model logos include: small cars, mini cars, compact cars, medium cars, high-end cars , Luxury models, sedan models or SUV models.
具体地,根据分类类别对应的车标标识和车型标识对所述目标图片进行标记,使得每个目标图片均有带有标记信息,即每个目标图片均包括车标和车型。Specifically, the target pictures are marked according to the vehicle logo identifier and the vehicle type identifier corresponding to the classification category, so that each target picture has marking information, that is, each target picture includes the vehicle logo and the vehicle model.
在一个实施例中,为了快速训练出目标识别模型,在对每个目标图片进行标记后,即可构建样本数据,并根据构建的样本数据执行步骤S105,进行模型训练。In one embodiment, in order to quickly train the target recognition model, after marking each target picture, sample data can be constructed, and step S105 is executed according to the constructed sample data to perform model training.
S103、对所述目标图片进行图像处理操作以改变所述目标图片的图片参数,将改变图片参数的目标图片作为新的目标图片。S103: Perform an image processing operation on the target picture to change the picture parameters of the target picture, and use the target picture whose picture parameters are changed as a new target picture.
为了提高目标识别模型的准确度,在对每个目标图片进行标记完后,还需对每个目标图片进行图像处理操作以改变所述目标图片的图片参数。In order to improve the accuracy of the target recognition model, after marking each target picture, it is necessary to perform an image processing operation on each target picture to change the picture parameters of the target picture.
其中,图像处理操作包括:尺寸调整、裁剪处理、旋转处理和图像算法处理等等;图像算法处理包括:调整色温算法、调整曝光算法、调整对比度算法、高光恢复算法、低光补偿算法、白平衡算法、调整清晰度算法、雾化算法索引、调整自然饱和度算法。通过这些图像处理操作可以增加样本数据的多样性,使得样本数据更贴近真实拍摄的图片。Among them, image processing operations include: size adjustment, cropping, rotation, image algorithm processing, etc.; image algorithm processing includes: color temperature adjustment algorithm, exposure adjustment algorithm, contrast adjustment algorithm, highlight recovery algorithm, low light compensation algorithm, white balance Algorithm, adjustment of definition algorithm, fogging algorithm index, adjustment of natural saturation algorithm. Through these image processing operations, the diversity of the sample data can be increased, making the sample data closer to the real pictures.
相应地,图片参数包括尺寸信息、像素大小、色温参数、曝光度、对比度、白平衡、清晰度、雾化参数和自然饱和度等。Correspondingly, the picture parameters include size information, pixel size, color temperature parameters, exposure, contrast, white balance, sharpness, fogging parameters, and natural saturation.
需要说明的是,对所述目标图片进行图像处理操作以改变所述目标图片的图片参数,将改变图片参数的目标图片作为新的目标图片,是指分别对目标图片进行上述多种图像处理操作中的一种或几种结合以改变所述目标图片的图片参数。进而增加样本的多样性,同时使得样本更能代表现实环境,由此提高了模型的识别准确度。It should be noted that performing an image processing operation on the target picture to change the picture parameters of the target picture, and using the target picture whose picture parameters are changed as a new target picture, refers to performing the aforementioned multiple image processing operations on the target picture respectively One or more of them are combined to change the picture parameters of the target picture. In turn, the diversity of the samples is increased, and the samples are more representative of the real environment, thereby improving the recognition accuracy of the model.
S104、根据新的目标图片与所述目标图片构建样本数据。S104. Construct sample data according to the new target picture and the target picture.
具体地,保存改变图片参数的目标图片作为新的目标图片,将该新的目标图片和原来的目标图片一起构成样本数据。进而增加样本数量,同时又增加了 样本的多样性。Specifically, the target picture whose picture parameters are changed is saved as a new target picture, and the new target picture and the original target picture are combined to form sample data. This increases the number of samples and at the same time increases the diversity of samples.
S105、基于卷积神经网络,根据所述样本数据进行模型训练以得到目标识别模型,并将得到的目标识别模型作为预先训练的目标识别模型。S105: Based on the convolutional neural network, perform model training according to the sample data to obtain a target recognition model, and use the obtained target recognition model as a pre-trained target recognition model.
具体地,使用构建的样本数据,通过GoogLeNet进行模型训练,具体可以采用方向传播训练,使用GoogLeNet的卷积层和池化层从输入样本数据中提取特征,使用完全连接层用来做分类器,该分类器的输出是不同车标和车型的概率值。Specifically, the constructed sample data is used for model training through GoogLeNet. Specifically, directional propagation training can be used. The convolutional layer and pooling layer of GoogLeNet are used to extract features from the input sample data, and the fully connected layer is used as a classifier. The output of this classifier is the probability value of different car logos and models.
用随机值初始化所有过滤器和参数/权重;卷积神经网络将训练的样本数据作为输入,经过前向传播步骤(卷积,ReLU激活和池化操作以在完全连接层中的前向传播),最终得到每个类别的输出概率。Initialize all filters and parameters/weights with random values; the convolutional neural network takes the trained sample data as input and goes through the forward propagation step (convolution, ReLU activation and pooling operations to forward propagation in the fully connected layer) , And finally get the output probability of each category.
将上述样本数据中的部分图片作为标定数据(ground truth),利用准备的样本数据通过大规模迭代训练,让卷积神经网络在学习图片语义信息后输出每个类别的输出概率,使用输出概率与标定数据(ground truth)的定义损失函数(loss),在模型训练中尽量缩小损失函数(loss),来保证模型的准确度,以完成模型训练。Take part of the pictures in the above sample data as the ground truth, and use the prepared sample data through large-scale iterative training to let the convolutional neural network output the output probability of each category after learning the semantic information of the picture, using the output probability and Define the loss function (loss) of the calibration data (ground truth), and minimize the loss function (loss) in the model training to ensure the accuracy of the model to complete the model training.
由于,运动目标检测方法可以应用于终端或服务器中,因此需要将训练好的模型保存在终端或服务器中。其中,该终端可以是手机、平板电脑、笔记本电脑、台式电脑、个人数字助理和穿戴式设备等电子设备;服务器可以为独立的服务器,也可以为服务器集群。Since the moving target detection method can be applied to the terminal or server, the trained model needs to be stored in the terminal or server. Among them, the terminal can be an electronic device such as a mobile phone, a tablet computer, a notebook computer, a desktop computer, a personal digital assistant, and a wearable device; the server can be an independent server or a server cluster.
如果是应用于终端中,为了保证该终端的正常运行以及快速识别检测出运动目标的类别,还需要对训练得到的目标识别模型进行压缩处理,将压缩处理后的模型保存在终端。If it is applied to a terminal, in order to ensure the normal operation of the terminal and to quickly identify the category of the detected moving target, it is also necessary to compress the target recognition model obtained by training, and save the compressed model in the terminal.
其中,该压缩处理具体包括对目标识别模型进行剪枝处理、量化处理和哈夫曼编码处理等,以减小目标识别模型的大小,进而方便保存在容量较小的终端中。Wherein, the compression processing specifically includes pruning processing, quantization processing, and Huffman encoding processing on the target recognition model, etc., to reduce the size of the target recognition model, and thereby facilitate storage in a terminal with a smaller capacity.
上述实施例提供的训练方法,通过拍摄多个目标物体位于不同角度的目标图片,利用图像处理操作对目标图片进行处理以增加样本数据的多样性;基于卷积神经网络,根据构建的样本数据进行模型训练以得到目标识别模型,并将得到的目标识别模型作为预先训练的目标识别模型应用于运动目标识别方法中,由此可提高运动目标的识别准确度。The training method provided by the above-mentioned embodiments uses image processing operations to process the target pictures to increase the diversity of sample data by shooting target pictures with multiple target objects at different angles; based on the convolutional neural network, the training is performed according to the constructed sample data Model training is used to obtain a target recognition model, and the obtained target recognition model is used as a pre-trained target recognition model in the moving target recognition method, thereby improving the recognition accuracy of the moving target.
请参阅图2,图2是本申请的实施例提供的运动目标检测方法的应用场景示 意图。该应用场景包括服务器、终端和交通监控设备,交通监控设备包括摄像头。服务器用于训练目标识别模型,并将训练好的目标识别模型保存在终端中或压缩后保存在终端;摄像头用于采集交通道路上的运动车辆的实时录像,并将采集的实时录像发送给终端;终端用于执行运动目标检测方法,以识别检测出运动车辆的类别。Please refer to Fig. 2, which is a schematic diagram of an application scenario of the moving target detection method provided by an embodiment of the present application. This application scenario includes servers, terminals, and traffic monitoring equipment, and traffic monitoring equipment includes cameras. The server is used to train the target recognition model, and save the trained target recognition model in the terminal or save it after compression; the camera is used to collect real-time video of moving vehicles on the traffic road, and send the collected real-time video to the terminal ; The terminal is used to implement the moving target detection method to identify the category of the detected moving vehicle.
请参阅图3,图3是本申请的实施例提供的一种运动目标检测方法的示意流程图。该运动目标检测方法可以应用在终端或服务器中,以较小的计算量快速地从实时录像中识别检测出运动物体的类别。Please refer to FIG. 3, which is a schematic flowchart of a method for detecting a moving target provided by an embodiment of the present application. The moving object detection method can be applied to a terminal or a server, and quickly identify the category of the detected moving object from the real-time video with a small amount of calculation.
如图3所示,该运动目标检测方法,具体包括步骤S201至步骤S204,以下将结合图2进行详细介绍。As shown in FIG. 3, the moving target detection method specifically includes steps S201 to S204, which will be described in detail below in conjunction with FIG. 2.
S201、获取实时录像,确定所述实时录像中的运动目标。S201. Acquire real-time video, and determine a moving target in the real-time video.
具体地,实时录像比如为交通监控设备中摄像头实时拍下交通道路上运动车辆的录像视频。Specifically, real-time video recording is, for example, a camera in a traffic monitoring device that captures a video of a moving vehicle on a traffic road in real time.
其中,确定实时录像中的运动目标,运动目标比如为运动车辆,具体采用帧间差分法对实时录像进行检测以确定运动车辆,当然也可以采用其他检测方式,比如,图像识别方式来根据车辆的形状识别实时录像中的运动车辆。Among them, determine the moving target in the real-time video recording, such as a moving vehicle, and specifically use the inter-frame difference method to detect the real-time video to determine the moving vehicle. Of course, other detection methods can also be used, such as image recognition to determine the moving vehicle. Shape recognition of moving vehicles in real-time video.
S202、提取所述运动目标的边界框以及所述边界框对应的数据信息。S202: Extract a bounding box of the moving target and data information corresponding to the bounding box.
其中,所述数据信息包括所述边界框在所述实时录像中的位置信息和尺寸信息。提取所述运动目标的边界框以及所述边界框对应的数据信息,包括:确定所述运动目标在实时录像中的视频帧图像的边界框;提取所述边界框在所述实时录像中的位置信息和尺寸信息。Wherein, the data information includes position information and size information of the bounding box in the real-time video recording. Extracting the bounding box of the moving target and the data information corresponding to the bounding box includes: determining the bounding box of the video frame image of the moving target in real-time recording; extracting the position of the bounding box in the real-time recording Information and size information.
在一个实施例中,提取边界框和数据信息具体过程,如图4所示,即步骤S202包括子步骤S202a和S202b。In one embodiment, the specific process of extracting the bounding box and data information is shown in FIG. 4, that is, step S202 includes sub-steps S202a and S202b.
S202a、根据所述运动目标在实时录像中的水平宽带和竖直长度确定所述运动目标对应的边界框;S202b、提取所述水平宽带和竖直长度作为所述尺寸信息,以及所述边界框的中心坐标作为所述位置信息。S202a. Determine a bounding box corresponding to the moving target according to the horizontal bandwidth and vertical length of the moving target in real-time video recording; S202b. Extract the horizontal bandwidth and vertical length as the size information, and the bounding box As the position information.
具体地,根据运动目标在实时录像中的最大的水平宽带和竖直长度确定其对应的边界框;并提取最大的水平宽带和竖直长度作为尺寸信息,以及获取该边界框的中心坐标值作为所述位置信息,进而可得到边界框的大小和位置信息,该边界框的大小和位置信息即为边界框对应的数据信息。Specifically, the corresponding bounding box is determined according to the maximum horizontal bandwidth and vertical length of the moving target in real-time recording; and the maximum horizontal bandwidth and vertical length are extracted as size information, and the center coordinate value of the bounding box is obtained as According to the position information, the size and position information of the bounding box can be obtained, and the size and position information of the bounding box is the data information corresponding to the bounding box.
需要说明的是:实时录像中的一帧图像可能包括多个运动目标,比如包括 多个运动车辆,每个运动车辆均会对应一个边界框,因此实时录像的视频帧中可能会对应的多个边界框。It should be noted that: a frame of image in real-time recording may include multiple moving targets, such as multiple moving vehicles, each moving vehicle corresponds to a bounding box, so the real-time recording video frame may correspond to multiple Bounding box.
S203、根据所述数据信息将所述边界框中的图像输入至预先训练的目标识别模型进行识别检测,以输出所述运动目标对应的分类类别。S203. Input the image in the bounding box to a pre-trained target recognition model for recognition and detection according to the data information, so as to output a classification category corresponding to the moving target.
具体地,可以根据边界框的数据信息确定边界框中的图像,再将边界框中的图像输入至预先训练好的目标识别模型进行预测,以输出该运动目标对应的分类类别。Specifically, the image in the bounding box can be determined according to the data information of the bounding box, and then the image in the bounding box is input to a pre-trained target recognition model for prediction, so as to output the classification category corresponding to the moving target.
比如,运动目标为运动车辆,则该目标识别模型可能识别出运动车辆的分类类别包括车标和车型等信息,具体地,如图2所示,预测的运动车辆的车标和车型分别为奥迪和小轿车。For example, if the moving target is a moving vehicle, the target recognition model may recognize that the classification category of the moving vehicle includes information such as car logo and model. Specifically, as shown in Figure 2, the predicted logo and model of the sports vehicle are Audi And the car.
S204、根据所述分类类别对所述实时录像中的运动目标进行标注。S204: Mark the moving target in the real-time video recording according to the classification category.
具体地,根据分类类别对实时录像中的运动目标进行标注,包括在实时录像中的运动目标处显示模型输出的分类类别。当然也可以在实时录像中显示边界框,再在边界框中显示分类类别。或者,也可以采用其他的标注方式,对所述实时录像中的运动目标进行标注。由此通过对运动目标进行标注,方便用户对该运动车辆定位或跟踪。Specifically, marking the moving targets in the real-time recording according to the classification category includes displaying the classification category output by the model at the moving target in the real-time recording. Of course, the bounding box can also be displayed in the real-time video, and then the classification category can be displayed in the bounding box. Alternatively, other labeling methods may also be used to label the moving target in the real-time video recording. Therefore, by marking the moving target, it is convenient for the user to locate or track the moving vehicle.
需要说明的是,如果实时录像中包括多个运动目标,需要分别对每个运动目标进行标注,以便用户进行识别。It should be noted that if multiple moving targets are included in the real-time recording, each moving target needs to be marked separately for the user to recognize.
上述实施例提供的运动目标的识别方法,可快速地对运动物体进行识别分类,比如识别运动车辆对应的车标和车型等。具体是通过在确定实时录像中的运动目标后;提取该运动目标的边界框以及边界框对应的数据信息;根据边界框对应的数据信息确定边界框中的图像,再将边界框中的图像输入至预先训练的目标识别模型以输出运动目标的分类类别。由此实现了对实时录像中的运动目标进行识别分类。该方法可以减小分类时的计算量,进而提供运动目标的识别效率,适用于实时检测识别。The method for recognizing moving objects provided in the above embodiments can quickly recognize and classify moving objects, such as recognizing car logos and car models corresponding to moving vehicles. Specifically, after determining the moving target in real-time video; extracting the bounding box of the moving target and the data information corresponding to the bounding box; determining the image in the bounding box according to the data information corresponding to the bounding box, and then inputting the image in the bounding box To the pre-trained target recognition model to output the classification category of the moving target. This realizes the recognition and classification of moving targets in real-time video. This method can reduce the amount of calculation during classification, thereby improving the recognition efficiency of moving targets, and is suitable for real-time detection and recognition.
请参阅图5,图5是本申请的实施例提供的确定运动目标的步骤示意流程图。为了快速准确地确定所述实时录像中的运动目标,如图5所示,确定运动目标的步骤,具体包括以下内容:Please refer to FIG. 5, which is a schematic flowchart of steps for determining a moving target provided by an embodiment of the present application. In order to quickly and accurately determine the moving target in the real-time recording, as shown in Figure 5, the steps of determining the moving target specifically include the following:
S301、从所述实时录像中确定当前帧图像,将所述当前帧图像作为基准图像。S301: Determine a current frame image from the real-time video recording, and use the current frame image as a reference image.
其中,从所述实时录像中确定当前帧图像,可以根据用户在实时录像中选 择相应的视频图片作为当前帧图像。比如,在播放该实时录像时,用户点击选择了当前播放的视频,则可根据用户选择的视频帧作为当前帧图像。当然,也可以由用户指定相应的视频帧作为当前帧图像。Wherein, the current frame image is determined from the real-time recording, and the corresponding video picture can be selected as the current frame image according to the user in the real-time recording. For example, when the real-time video is played, the user clicks to select the currently played video, and the video frame selected by the user can be used as the current frame image. Of course, the user can also specify the corresponding video frame as the current frame image.
具体地,将确定的当前帧图像作为基准图像,将基准图像表示为f k(i,j),k表示该实时录像的图像序列中第k视频帧的当前帧图像,其中k为正整数,(i,j)表示为视频帧中的离散图像坐标。 Specifically, the determined current frame image is taken as the reference image, and the reference image is expressed as f k (i, j), where k represents the current frame image of the k-th video frame in the real-time recorded image sequence, where k is a positive integer, (i, j) are expressed as discrete image coordinates in the video frame.
S302、获取待确定的运动目标的运动速度。S302. Acquire the moving speed of the moving target to be determined.
在本实施例中,为了提高确定运动目标的效率和准确度,可先确定该运动目标的运动速度,再根据运动速度选择相应的预设帧数,其中不同的运动速度对应不同数量的预设帧数。In this embodiment, in order to improve the efficiency and accuracy of determining the moving target, the moving speed of the moving target can be determined first, and then the corresponding preset number of frames is selected according to the moving speed, where different moving speeds correspond to different numbers of presets The number of frames.
具体地,该运动速度是一个范围值,当然也可以是一个具体值。运动速度范围值,比如为90至110km/h;运动速度具体值,比如为100km/h。Specifically, the movement speed is a range value, of course, it can also be a specific value. The movement speed range value is, for example, 90 to 110km/h; the specific movement speed value is, for example, 100km/h.
在一个实施例中,获取待确定的运动目标的运动速度,可以通过速度测量仪测量待确定的运动目标的运动速度,比如采用激光测速仪等。当然,获取待确定的运动目标的运动速度,也可以根据隔间一定帧数的两个图像来计算运动目标的运动速度。In one embodiment, to obtain the moving speed of the moving target to be determined, the moving speed of the moving target to be determined may be measured by a speed measuring instrument, such as a laser speedometer. Of course, to obtain the moving speed of the moving target to be determined, the moving speed of the moving target can also be calculated based on two images with a certain number of frames in the interval.
在一个实施例中,为了节省终端的计算量,提高运动目标识别速度和准确度。获取待确定的运动目标的运动速度,可以根据运动目标所处的环境参数来确定待确定的运动目标的运动速度。In an embodiment, in order to save the calculation amount of the terminal, the speed and accuracy of moving target recognition are improved. To obtain the moving speed of the moving target to be determined, the moving speed of the moving target to be determined can be determined according to the environmental parameters of the moving target.
譬如,先确定车辆在高速公路的哪一条道上,由此可根据具体道路确定运动车辆的大致范围。比如,车辆在最右侧车道,根据最右侧车道限速范围是60km/h~90km/h,可以确定运动目标的运动速度大致为60km/h~90km/h;相应地,中间车道限速范围是90km/h~110km/h;最左边车道是超车道,最低时速要高于110km/h。再比如,城市道路中同方向只有1条机动车道,限速为每小时50公里,如果运动目标在城市道路中,则可以确定运动速度大致为50km/h。For example, first determine which lane of the highway the vehicle is on, so that the approximate range of the moving vehicle can be determined according to the specific road. For example, if the vehicle is in the rightmost lane, according to the speed limit range of 60km/h~90km/h in the rightmost lane, it can be determined that the moving target's moving speed is roughly 60km/h~90km/h; accordingly, the speed limit in the middle lane The range is 90km/h~110km/h; the leftmost lane is the overtaking lane, and the minimum speed is higher than 110km/h. For another example, there is only one motor vehicle lane in the same direction on a city road, and the speed limit is 50 kilometers per hour. If the moving target is on a city road, the moving speed can be determined to be approximately 50 km/h.
S303、根据运动速度范围与预设帧数之间的预设对应关系,确定获取的运动速度范围对应的预设帧数。S303: According to the preset correspondence between the motion speed range and the preset frame number, determine the preset frame number corresponding to the acquired motion speed range.
具体地,延后预设帧数根据运动速度进行设定。比如,在高速公路上最左边车道的车辆,车辆运动速度较快,其对应的延后预设帧数较少,比如将预设帧数设为延后1帧或2帧;在高速公路上中间车道的车辆,车速也比较快,将预设帧数设为延后4帧或5帧;在高速公路上最右边车道的车辆,车速也相对 较快,将预设帧数设为延后7帧或8帧;在城市道路上的车辆,车速相对较慢,可将其对应的延后预设帧数设为较多帧数,比如9帧或10帧等。Specifically, the number of delayed preset frames is set according to the motion speed. For example, a vehicle in the leftmost lane on an expressway moves faster, and its corresponding delay preset number of frames is less. For example, set the preset number of frames to delay 1 or 2 frames; on an expressway For vehicles in the middle lane, the speed is also relatively fast. Set the default frame number to 4 or 5 frames later; for vehicles in the rightmost lane on the expressway, the speed is relatively fast, so set the default frame number to delay 7 frames or 8 frames; the speed of vehicles on urban roads is relatively slow, and the corresponding delay preset frame number can be set to a larger number of frames, such as 9 or 10 frames.
因此,根据运动速度范围与预设帧数之间的预设对应关系,确定获取的运动速度范围对应的预设帧数,可以根据运动目标的实际情况而变化,由此快速准确地确定实时录像中的运动目标。Therefore, according to the preset correspondence between the motion speed range and the preset frame number, the preset frame number corresponding to the acquired motion speed range is determined, which can be changed according to the actual situation of the moving target, thereby quickly and accurately determining real-time recording Sports goals in.
例如,车辆在高速公路上最左边车道,则确定该运动车辆的运动速度大致为110km/h以上,由此根据运动速度范围与预设帧数之间的预设对应关系,确定获取的运动速度范围对应的预设帧数具体为2帧。For example, if the vehicle is in the leftmost lane of the expressway, it is determined that the moving speed of the moving vehicle is approximately 110km/h or more, and the obtained moving speed is determined according to the preset correspondence between the moving speed range and the preset number of frames The preset number of frames corresponding to the range is specifically 2 frames.
S304、提取相对所述基准图像延后预设帧数的延后帧图像。S304. Extract a delayed frame image that is delayed by a preset number of frames relative to the reference image.
具体地,基准图像表示为f k(i,j),比如车辆在高速公路上最左边车道,则确定的预设帧数为2帧,则可以提取相对所述基准图像延后2帧的图像作为延后帧图像,由此延后帧图像表示为f k+2(i,j)。 Specifically, the reference image is expressed as f k (i, j). For example, if the vehicle is in the leftmost lane on the expressway, the predetermined number of frames is determined to be 2 frames, and then an image that is 2 frames behind the reference image can be extracted As the delayed frame image, the delayed frame image is expressed as f k+2 (i, j).
S305、将所述延后帧图像与所述当前帧图像相减以得到差分图像。S305: Subtracting the delayed frame image and the current frame image to obtain a difference image.
具体地,通过差分法将所述延后帧图像与所述当前帧图像相减以得到差分图像,差分图像表示为:Specifically, the deferred frame image and the current frame image are subtracted by a difference method to obtain a difference image, and the difference image is expressed as:
D k=|f k+2(i,j)-f(i,j)|    (1) D k =|f k+2 (i,j)-f(i,j)| (1)
其中,式(1)中,D k表示差分图像,f k(i,j)表示基准图像,f k+2(i,j)表示延后帧图像,(i,j)表示离散图像坐标。 Among them, in formula (1), D k represents a differential image, f k (i, j) represents a reference image, f k+2 (i, j) represents a delayed frame image, and (i, j) represents a discrete image coordinate.
S306、对所述差分图像进行阈值处理得到所述差分图像对应的二值图像。S306: Perform threshold processing on the difference image to obtain a binary image corresponding to the difference image.
具体地,所述对所述差分图像进行阈值处理得到所述差分图像对应的二值图像,包括:确定所述差分图像中像素值大于预设阈值的像素点;根据大于所述预设阈值的像素点确定所述差分图像对应的二值图像。Specifically, the performing threshold processing on the differential image to obtain the binary image corresponding to the differential image includes: determining pixels in the differential image with pixel values greater than a preset threshold; The pixel points determine the binary image corresponding to the difference image.
其中,所述二值图像表示为:Wherein, the binary image is expressed as:
Figure PCTCN2019091905-appb-000001
Figure PCTCN2019091905-appb-000001
其中,S k(i,j)表示二值图像,T为预设阈值,(i,j)表示离散图像的坐标,D k表示差分图像;大于或等于预设阈值表示为1,小于该预设阈值表示为0。 Among them, S k (i, j) represents a binary image, T is a preset threshold, (i, j) represents the coordinates of a discrete image, and D k represents a differential image; greater than or equal to the preset threshold is represented as 1, and less than the preset threshold. Let the threshold be represented as 0.
S307、根据所述二值图像确定所述实时录像中的运动目标。S307: Determine a moving target in the real-time video recording according to the binary image.
其中,所述根据所述二值图像确定所述实时录像中的运动目标,包括:将二值图像中S k(i,j)为1对应的区域设为运动区域;对所述运动区域通过形态学处理和连通性分析去除噪点,以确定所述实时录像中的运动目标。 Wherein, the determining the moving target in the real-time video recording according to the binary image includes: setting the area corresponding to S k (i, j) of 1 in the binary image as the moving area; passing through the moving area Morphological processing and connectivity analysis remove noise to determine the moving target in the real-time video.
具体地,将二值图像中S k(i,j)为1对应的区域设为运动区域,然后再对该运 动区域通过形态学处理和连通性分析处理以去除噪点,进而可获得有效的运动目标。 Specifically, the area corresponding to S k (i, j) of 1 in the binary image is set as the motion area, and then the motion area is processed by morphological processing and connectivity analysis to remove noise, so as to obtain effective motion aims.
请参阅图6,图6是本申请一实施例提供的一种模型训练装置的示意性框图,该模型训练装置可以配置于服务器中,用于执行前述的目标识别模型的训练方法。Please refer to FIG. 6. FIG. 6 is a schematic block diagram of a model training device provided by an embodiment of the present application. The model training device may be configured in a server and used to execute the aforementioned target recognition model training method.
如图6所示,该模型训练装置400,包括:图片获取单元401、图片标记单元402、参数改变单元403、数据构建单元404和模型训练单元405。As shown in FIG. 6, the model training device 400 includes: a picture acquisition unit 401, a picture labeling unit 402, a parameter changing unit 403, a data construction unit 404, and a model training unit 405.
图片获取单元401,用于获取目标图片,所述目标图片为从不同角度拍摄的多个目标物体的图片。The picture acquiring unit 401 is configured to acquire a target picture, where the target picture is a picture of multiple target objects taken from different angles.
图片标记单元402,用于根据分类类别对应的类别标识对所述目标图片进行标记。The picture marking unit 402 is configured to mark the target picture according to the category identifier corresponding to the classification category.
参数改变单元403,用于对所述目标图片进行图像处理操作以改变所述目标图片的图片参数,将改变图片参数的目标图片作为新的目标图片。The parameter changing unit 403 is configured to perform an image processing operation on the target picture to change the picture parameters of the target picture, and use the target picture whose picture parameters are changed as a new target picture.
其中,所述图像处理操作包括:尺寸调整、裁剪处理、旋转处理和图像算法处理等;所述图像算法处理包括:调整色温算法、调整曝光算法、调整对比度算法、高光恢复算法、低光补偿算法、白平衡算法、调整清晰度算法、雾化算法索引、调整自然饱和度算法。Wherein, the image processing operations include: size adjustment, cropping processing, rotation processing, image algorithm processing, etc.; the image algorithm processing includes: color temperature adjustment algorithm, exposure adjustment algorithm, contrast adjustment algorithm, highlight restoration algorithm, low light compensation algorithm , White balance algorithm, sharpness adjustment algorithm, fogging algorithm index, natural saturation adjustment algorithm.
数据构建单元404,用于根据新的目标图片与所述目标图片构建样本数据。The data construction unit 404 is configured to construct sample data according to the new target picture and the target picture.
模型训练单元405,用于基于卷积神经网络,根据所述样本数据进行模型训练以得到目标识别模型,并将得到的目标识别模型作为预先训练的目标识别模型。The model training unit 405 is configured to perform model training according to the sample data based on the convolutional neural network to obtain a target recognition model, and use the obtained target recognition model as a pre-trained target recognition model.
请参阅图7,图7是本申请的实施例还提供一种运动目标检测装置的示意性框图,该运动目标检测装置用于执行前述的运动目标检测方法。其中,该运动目标检测装置可以配置于服务器或终端中。Please refer to FIG. 7. FIG. 7 is a schematic block diagram of a moving target detection device provided in an embodiment of the present application, and the moving target detection device is used to execute the aforementioned moving target detection method. Wherein, the moving target detection device can be configured in a server or a terminal.
如图7所示,该运动目标检测装置500,包括:获取确定单元501、信息提取单元502、识别检测单元503和目标标注单元504。As shown in FIG. 7, the moving target detection device 500 includes: an acquisition and determination unit 501, an information extraction unit 502, an identification and detection unit 503, and a target labeling unit 504.
获取确定单元501,用于获取实时录像,确定所述实时录像中的运动目标。The obtaining and determining unit 501 is configured to obtain real-time video and determine the moving target in the real-time video.
信息提取单元502,用于提取所述运动目标的边界框以及所述边界框对应的数据信息,所述数据信息包括所述边界框在所述实时录像中的位置信息和尺寸信息。The information extraction unit 502 is configured to extract a bounding box of the moving target and data information corresponding to the bounding box, the data information including position information and size information of the bounding box in the real-time video recording.
其中,信息提取单元502,具体用于,根据所述运动目标在实时录像中的水 平宽带和竖直长度确定所述运动目标对应的边界框;提取所述水平宽带和竖直长度作为所述尺寸信息,以及所述边界框的中心坐标作为所述位置信息。Wherein, the information extraction unit 502 is specifically configured to determine the bounding box corresponding to the moving target according to the horizontal broadband and vertical length of the moving target in real-time video recording; extract the horizontal broadband and vertical length as the size Information, and the center coordinates of the bounding box as the position information.
识别检测单元503,用于根据所述数据信息将所述边界框中的图像输入至预先训练的目标识别模型进行识别检测,以输出所述运动目标对应的分类类别;The recognition and detection unit 503 is configured to input the image in the bounding box into a pre-trained target recognition model for recognition and detection according to the data information, so as to output the classification category corresponding to the moving target;
目标标注单元504,用于根据所述分类类别对所述实时录像中的运动目标进行标注。The target labeling unit 504 is configured to label the moving target in the real-time video recording according to the classification category.
在一个实施例中,如图8所示,该获取确定单元501,包括:基准确定单元5011、速度确定单元5012、帧数确定单元5013、图像提取单元5014、图像相减单元5015和图像处理单元5016。In one embodiment, as shown in FIG. 8, the acquisition and determination unit 501 includes: a reference determination unit 5011, a speed determination unit 5012, a frame number determination unit 5013, an image extraction unit 5014, an image subtraction unit 5015, and an image processing unit 5016.
基准确定单元5011,用于从所述实时录像中确定当前帧图像,将所述当前帧图像作为基准图像。The reference determining unit 5011 is configured to determine a current frame image from the real-time video recording, and use the current frame image as a reference image.
速度确定单元5012,用于获取待确定的运动目标的运动速度,其中不同的运动速度对应不同数量的预设帧数。The speed determining unit 5012 is used to obtain the moving speed of the moving target to be determined, where different moving speeds correspond to different numbers of preset frames.
帧数确定单元5013,用于根据运动速度范围与预设帧数之间的预设对应关系,确定获取的运动速度范围对应的预设帧数。The frame number determining unit 5013 is configured to determine the preset frame number corresponding to the acquired motion speed range according to the preset correspondence between the motion speed range and the preset frame number.
图像提取单元5014,用于提取相对所述基准图像延后预设帧数的延后帧图像。The image extraction unit 5014 is configured to extract a delayed frame image that is delayed by a preset number of frames relative to the reference image.
图像相减单元5015,用于将所述延后帧图像与所述当前帧图像相减以得到差分图像。The image subtraction unit 5015 is configured to subtract the delayed frame image and the current frame image to obtain a difference image.
图像处理单元5016,用于对所述差分图像进行阈值处理得到所述差分图像对应的二值图像。The image processing unit 5016 is configured to perform threshold processing on the difference image to obtain a binary image corresponding to the difference image.
需要说明的是,所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,上述描述的装置和各单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。It should be noted that those skilled in the art can clearly understand that for the convenience and conciseness of description, the specific working process of the device and each unit described above can refer to the corresponding process in the foregoing method embodiment, and it will not be omitted here. Repeat.
上述的装置可以实现为一种计算机程序的形式,该计算机程序可以在如图9所示的计算机设备上运行。The above-mentioned apparatus may be implemented in the form of a computer program, and the computer program may run on the computer device as shown in FIG. 9.
请参阅图9,图9是本申请实施例提供的一种计算机设备的结构示意性框图。该计算机设备可以是服务器或终端。Please refer to FIG. 9, which is a schematic block diagram of the structure of a computer device according to an embodiment of the present application. The computer equipment can be a server or a terminal.
参阅图9,该计算机设备包括通过系统总线连接的处理器、存储器和网络接口,其中,存储器可以包括非易失性存储介质和内存储器。Referring to FIG. 9, the computer device includes a processor, a memory, and a network interface connected through a system bus, where the memory may include a non-volatile storage medium and an internal memory.
非易失性存储介质可存储操作系统和计算机程序。该计算机程序包括程序 指令,该程序指令被执行时,可使得处理器执行任意一种运动目标检测方法。The non-volatile storage medium can store an operating system and a computer program. The computer program includes program instructions, and when the program instructions are executed, the processor can execute any moving target detection method.
处理器用于提供计算和控制能力,支撑整个计算机设备的运行。The processor is used to provide calculation and control capabilities and support the operation of the entire computer equipment.
内存储器为非易失性存储介质中的计算机程序的运行提供环境,该计算机程序被处理器执行时,可使得处理器执行任意一种运动目标检测方法。The internal memory provides an environment for the running of the computer program in the non-volatile storage medium. When the computer program is executed by the processor, the processor can execute any method for detecting moving objects.
该网络接口用于进行网络通信,如发送分配的任务等。本领域技术人员可以理解,图9中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定,具体的计算机设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。The network interface is used for network communication, such as sending assigned tasks. Those skilled in the art can understand that the structure shown in FIG. 9 is only a block diagram of part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied. The specific computer device may Including more or less parts than shown in the figure, or combining some parts, or having a different part arrangement.
应当理解的是,处理器可以是中央处理单元(Central Processing Unit,CPU),该处理器还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。其中,通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。It should be understood that the processor may be a central processing unit (Central Processing Unit, CPU), and the processor may also be other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), and application specific integrated circuits (Application Specific Integrated Circuits). Circuit, ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components, etc. Among them, the general-purpose processor may be a microprocessor or the processor may also be any conventional processor.
本申请的实施例中还提供一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序中包括程序指令,所述处理器执行所述程序指令,实现本申请实施例提供的任一项运动目标检测方法。The embodiments of the present application also provide a computer-readable storage medium, the computer-readable storage medium stores a computer program, the computer program includes program instructions, and the processor executes the program instructions to implement the present application Any of the moving target detection methods provided by the embodiments.
其中,所述计算机可读存储介质可以是前述实施例所述的计算机设备的内部存储单元,例如所述计算机设备的硬盘或内存。所述计算机可读存储介质也可以是所述计算机设备的外部存储设备,例如所述计算机设备上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。Wherein, the computer-readable storage medium may be the internal storage unit of the computer device described in the foregoing embodiment, such as the hard disk or memory of the computer device. The computer-readable storage medium may also be an external storage device of the computer device, such as a plug-in hard disk, a smart media card (SMC), or a secure digital (Secure Digital, SD) equipped on the computer device. ) Card, Flash Card, etc.
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到各种等效的修改或替换,这些修改或替换都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以权利要求的保护范围为准。The above are only specific implementations of this application, but the protection scope of this application is not limited to this. Anyone familiar with the technical field can easily think of various equivalents within the technical scope disclosed in this application. Modifications or replacements, these modifications or replacements shall be covered within the protection scope of this application. Therefore, the protection scope of this application should be subject to the protection scope of the claims.

Claims (20)

  1. 一种运动目标检测方法,包括:A method for detecting moving targets includes:
    获取实时录像,确定所述实时录像中的运动目标;Acquiring real-time video, and determining the moving target in the real-time video;
    提取所述运动目标的边界框以及所述边界框对应的数据信息,所述数据信息包括所述边界框在所述实时录像中的位置信息和尺寸信息;Extracting a bounding box of the moving target and data information corresponding to the bounding box, where the data information includes position information and size information of the bounding box in the real-time video recording;
    根据所述数据信息将所述边界框中的图像输入至预先训练的目标识别模型进行识别检测,以输出所述运动目标对应的分类类别;Inputting the image in the bounding box to a pre-trained target recognition model for recognition and detection according to the data information, so as to output a classification category corresponding to the moving target;
    根据所述分类类别对所述实时录像中的运动目标进行标注。Marking the moving target in the real-time video recording according to the classification category.
  2. 根据权利要求1所述的检测方法,其中,所述确定所述实时录像中的运动目标,包括:The detection method according to claim 1, wherein said determining the moving target in the real-time video recording comprises:
    从所述实时录像中确定当前帧图像,将所述当前帧图像作为基准图像;Determine a current frame image from the real-time video, and use the current frame image as a reference image;
    提取相对所述基准图像延后预设帧数的延后帧图像;Extracting a delayed frame image that is delayed by a preset number of frames relative to the reference image;
    将所述延后帧图像与所述当前帧图像相减以得到差分图像;Subtracting the delayed frame image and the current frame image to obtain a difference image;
    对所述差分图像进行阈值处理得到所述差分图像对应的二值图像;以及Performing threshold processing on the difference image to obtain a binary image corresponding to the difference image; and
    根据所述二值图像确定所述实时录像中的运动目标。The moving target in the real-time video recording is determined according to the binary image.
  3. 根据权利要求2所述的检测方法,其中,所述提取相对所述基准图像延后预设帧数的延后帧图像之前,还包括:2. The detection method according to claim 2, wherein before extracting a delayed frame image that is delayed by a preset number of frames relative to the reference image, the method further comprises:
    获取待确定的运动目标的运动速度,其中不同的运动速度对应不同数量的预设帧数;Acquiring the movement speed of the moving target to be determined, where different movement speeds correspond to different numbers of preset frames;
    根据运动速度范围与预设帧数之间的预设对应关系,确定获取的运动速度范围对应的预设帧数。According to the preset correspondence between the motion speed range and the preset frame number, determine the preset frame number corresponding to the acquired motion speed range.
  4. 根据权利要求2或3所述的检测方法,其中,所述对所述差分图像进行阈值处理得到所述差分图像对应的二值图像,包括:The detection method according to claim 2 or 3, wherein the threshold processing on the difference image to obtain the binary image corresponding to the difference image comprises:
    确定所述差分图像中像素值大于预设阈值的像素点;Determining pixels in the differential image with pixel values greater than a preset threshold;
    根据大于所述预设阈值的像素点确定所述差分图像对应的二值图像。The binary image corresponding to the difference image is determined according to pixels larger than the preset threshold.
  5. 根据权利要求4所述的检测方法,其中,所述二值图像表示为:The detection method according to claim 4, wherein the binary image is expressed as:
    Figure PCTCN2019091905-appb-100001
    Figure PCTCN2019091905-appb-100001
    其中,S k(i,j)表示二值图像,T为预设阈值,(i,j)表示离散图像的坐标,D k 表示差分图像; Among them, S k (i, j) represents a binary image, T is a preset threshold, (i, j) represents the coordinates of a discrete image, and D k represents a differential image;
    所述根据所述二值图像确定所述实时录像中的运动目标,包括:The determining the moving target in the real-time video recording according to the binary image includes:
    将二值图像中S k(i,j)为1对应的区域设为运动区域; Set the area corresponding to S k (i, j) as 1 in the binary image as the motion area;
    对所述运动区域通过形态学处理和连通性分析去除噪点,以确定所述实时录像中的运动目标。Morphological processing and connectivity analysis are performed on the moving area to remove noise to determine the moving target in the real-time video.
  6. 根据权利要求1所述的检测方法,其中,所述提取所述运动目标的边界框以及所述边界框对应的数据信息,包括:The detection method according to claim 1, wherein said extracting the bounding box of the moving target and the data information corresponding to the bounding box comprises:
    根据所述运动目标在实时录像中的水平宽带和竖直长度确定所述运动目标对应的边界框;Determine the bounding box corresponding to the moving target according to the horizontal bandwidth and vertical length of the moving target in real-time video recording;
    提取所述水平宽带和竖直长度作为所述尺寸信息,以及所述边界框的中心坐标作为所述位置信息。Extract the horizontal bandwidth and vertical length as the size information, and the center coordinates of the bounding box as the position information.
  7. 根据权利要求1所述的检测方法,还包括:The detection method according to claim 1, further comprising:
    获取目标图片,所述目标图片为从不同角度拍摄的多个目标物体的图片;Acquiring a target picture, the target picture being pictures of multiple target objects taken from different angles;
    根据分类类别对应的类别标识对所述目标图片进行标记,以构建样本数据;Marking the target picture according to the category identifier corresponding to the classification category to construct sample data;
    基于卷积神经网络,根据所述样本数据进行模型训练以得到目标识别模型,并将得到的目标识别模型作为预先训练的目标识别模型。Based on the convolutional neural network, model training is performed according to the sample data to obtain a target recognition model, and the obtained target recognition model is used as a pre-trained target recognition model.
  8. 一种运动目标检测装置,包括:A moving target detection device includes:
    获取确定单元,用于获取实时录像,确定所述实时录像中的运动目标;An obtaining and determining unit, configured to obtain real-time video, and determine the moving target in the real-time video;
    信息提取单元,用于提取所述运动目标的边界框以及所述边界框对应的数据信息,所述数据信息包括所述边界框在所述实时录像中的位置信息和尺寸信息;An information extraction unit, configured to extract a bounding box of the moving target and data information corresponding to the bounding box, the data information including position information and size information of the bounding box in the real-time video;
    识别检测单元,用于根据所述数据信息将所述边界框中的图像输入至预先训练的目标识别模型进行识别检测,以输出所述运动目标对应的分类类别;A recognition detection unit, configured to input the image in the bounding box into a pre-trained target recognition model for recognition and detection according to the data information, so as to output a classification category corresponding to the moving target;
    目标标注单元,用于根据所述分类类别对所述实时录像中的运动目标进行标注。The target labeling unit is configured to label the moving target in the real-time video recording according to the classification category.
  9. 一种计算机设备,其中,所述计算机设备包括存储器和处理器;A computer device, wherein the computer device includes a memory and a processor;
    所述存储器用于存储计算机程序;The memory is used to store computer programs;
    所述处理器,用于执行所述计算机程序并在执行所述计算机程序时实现如下步骤:The processor is configured to execute the computer program and implement the following steps when executing the computer program:
    获取实时录像,确定所述实时录像中的运动目标;Acquiring real-time video, and determining the moving target in the real-time video;
    提取所述运动目标的边界框以及所述边界框对应的数据信息,所述数据信息包括所述边界框在所述实时录像中的位置信息和尺寸信息;Extracting a bounding box of the moving target and data information corresponding to the bounding box, where the data information includes position information and size information of the bounding box in the real-time video recording;
    根据所述数据信息将所述边界框中的图像输入至预先训练的目标识别模型进行识别检测,以输出所述运动目标对应的分类类别;Inputting the image in the bounding box to a pre-trained target recognition model for recognition and detection according to the data information, so as to output a classification category corresponding to the moving target;
    根据所述分类类别对所述实时录像中的运动目标进行标注。Marking the moving target in the real-time video recording according to the classification category.
  10. 根据权利要求9所述计算机设备,其中,所述处理器在实现所述确定所述实时录像中的运动目标时,用于实现:9. The computer device according to claim 9, wherein the processor is configured to realize the following when realizing the determination of the moving target in the real-time video recording:
    从所述实时录像中确定当前帧图像,将所述当前帧图像作为基准图像;Determine a current frame image from the real-time video, and use the current frame image as a reference image;
    提取相对所述基准图像延后预设帧数的延后帧图像;Extracting a delayed frame image that is delayed by a preset number of frames relative to the reference image;
    将所述延后帧图像与所述当前帧图像相减以得到差分图像;Subtracting the delayed frame image and the current frame image to obtain a difference image;
    对所述差分图像进行阈值处理得到所述差分图像对应的二值图像;以及Performing threshold processing on the difference image to obtain a binary image corresponding to the difference image; and
    根据所述二值图像确定所述实时录像中的运动目标。The moving target in the real-time video recording is determined according to the binary image.
  11. 根据权利要求10所述的计算机设备,其中,所述处理器在实现所述提取相对所述基准图像延后预设帧数的延后帧图像之前,还用于实现:The computer device according to claim 10, wherein the processor is further configured to implement: before implementing the extraction of the delayed frame image delayed by a preset number of frames relative to the reference image:
    获取待确定的运动目标的运动速度,其中不同的运动速度对应不同数量的预设帧数;Acquiring the movement speed of the moving target to be determined, where different movement speeds correspond to different numbers of preset frames;
    根据运动速度范围与预设帧数之间的预设对应关系,确定获取的运动速度范围对应的预设帧数。According to the preset correspondence between the motion speed range and the preset frame number, determine the preset frame number corresponding to the acquired motion speed range.
  12. 根据权利要求10或11所述的计算机设备,其中,所述处理器在实现所述对所述差分图像进行阈值处理得到所述差分图像对应的二值图像时,用于实现:The computer device according to claim 10 or 11, wherein, when the processor implements the threshold processing on the differential image to obtain the binary image corresponding to the differential image, the processor is configured to implement:
    确定所述差分图像中像素值大于预设阈值的像素点;Determining pixels in the differential image with pixel values greater than a preset threshold;
    根据大于所述预设阈值的像素点确定所述差分图像对应的二值图像。The binary image corresponding to the difference image is determined according to pixels larger than the preset threshold.
  13. 根据权利要求12所述的计算机设备,其中,所述二值图像表示为:The computer device according to claim 12, wherein the binary image is represented as:
    Figure PCTCN2019091905-appb-100002
    Figure PCTCN2019091905-appb-100002
    其中,S k(i,j)表示二值图像,T为预设阈值,(i,j)表示离散图像的坐标,D k表示差分图像; Among them, S k (i, j) represents a binary image, T is a preset threshold, (i, j) represents the coordinates of a discrete image, and D k represents a differential image;
    所述处理器在实现所述根据所述二值图像确定所述实时录像中的运动目标时,用于实现:When the processor realizes the determination of the moving target in the real-time video recording according to the binary image, it is used to realize:
    将二值图像中S k(i,j)为1对应的区域设为运动区域; Set the area corresponding to S k (i, j) as 1 in the binary image as the motion area;
    对所述运动区域通过形态学处理和连通性分析去除噪点,以确定所述实时录像中的运动目标。Morphological processing and connectivity analysis are performed on the moving area to remove noise to determine the moving target in the real-time video.
  14. 根据权利要求9所述的计算机设备,其中,所述处理器在实现所述提取所述运动目标的边界框以及所述边界框对应的数据信息时,用于实现:The computer device according to claim 9, wherein the processor is configured to implement the following when extracting the bounding box of the moving target and the data information corresponding to the bounding box:
    根据所述运动目标在实时录像中的水平宽带和竖直长度确定所述运动目标对应的边界框;Determine the bounding box corresponding to the moving target according to the horizontal bandwidth and vertical length of the moving target in real-time video recording;
    提取所述水平宽带和竖直长度作为所述尺寸信息,以及所述边界框的中心坐标作为所述位置信息。Extract the horizontal bandwidth and vertical length as the size information, and the center coordinates of the bounding box as the position information.
  15. 一种计算机可读存储介质,其中,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时使所述处理器实现如下步骤:A computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the processor implements the following steps:
    获取实时录像,确定所述实时录像中的运动目标;Acquiring real-time video, and determining the moving target in the real-time video;
    提取所述运动目标的边界框以及所述边界框对应的数据信息,所述数据信息包括所述边界框在所述实时录像中的位置信息和尺寸信息;Extracting a bounding box of the moving target and data information corresponding to the bounding box, where the data information includes position information and size information of the bounding box in the real-time video recording;
    根据所述数据信息将所述边界框中的图像输入至预先训练的目标识别模型进行识别检测,以输出所述运动目标对应的分类类别;Inputting the image in the bounding box to a pre-trained target recognition model for recognition and detection according to the data information, so as to output a classification category corresponding to the moving target;
    根据所述分类类别对所述实时录像中的运动目标进行标注。Marking the moving target in the real-time video recording according to the classification category.
  16. 根据权利要求15所述的计算机可读存储介质,其中,所述处理器在实现所述确定所述实时录像中的运动目标时,用于实现:15. The computer-readable storage medium according to claim 15, wherein the processor is configured to realize:
    从所述实时录像中确定当前帧图像,将所述当前帧图像作为基准图像;Determine a current frame image from the real-time video, and use the current frame image as a reference image;
    提取相对所述基准图像延后预设帧数的延后帧图像;Extracting a delayed frame image that is delayed by a preset number of frames relative to the reference image;
    将所述延后帧图像与所述当前帧图像相减以得到差分图像;Subtracting the delayed frame image and the current frame image to obtain a difference image;
    对所述差分图像进行阈值处理得到所述差分图像对应的二值图像;以及Performing threshold processing on the difference image to obtain a binary image corresponding to the difference image; and
    根据所述二值图像确定所述实时录像中的运动目标。The moving target in the real-time video recording is determined according to the binary image.
  17. 根据权利要求16所述的计算机可读存储介质,其中,所述处理器在实现所述提取相对所述基准图像延后预设帧数的延后帧图像之前,还用于实现:The computer-readable storage medium according to claim 16, wherein the processor is further configured to implement: before implementing the extraction of the delayed frame image delayed by a preset number of frames relative to the reference image:
    获取待确定的运动目标的运动速度,其中不同的运动速度对应不同数量的预设帧数;Acquiring the movement speed of the moving target to be determined, where different movement speeds correspond to different numbers of preset frames;
    根据运动速度范围与预设帧数之间的预设对应关系,确定获取的运动速度范围对应的预设帧数。According to the preset correspondence between the motion speed range and the preset frame number, determine the preset frame number corresponding to the acquired motion speed range.
  18. 根据权利要求16或17所述的计算机可读存储介质,其中,所述处理器在实现所述对所述差分图像进行阈值处理得到所述差分图像对应的二值图像时,用于实现:The computer-readable storage medium according to claim 16 or 17, wherein, when the processor implements the threshold processing on the differential image to obtain the binary image corresponding to the differential image, the processor is configured to implement:
    确定所述差分图像中像素值大于预设阈值的像素点;Determining pixels in the differential image with pixel values greater than a preset threshold;
    根据大于所述预设阈值的像素点确定所述差分图像对应的二值图像。The binary image corresponding to the difference image is determined according to pixels larger than the preset threshold.
  19. 根据权利要求18所述的计算机可读存储介质,其中,所述二值图像表示为:The computer-readable storage medium according to claim 18, wherein the binary image is represented as:
    Figure PCTCN2019091905-appb-100003
    Figure PCTCN2019091905-appb-100003
    其中,S k(i,j)表示二值图像,T为预设阈值,(i,j)表示离散图像的坐标,D k表示差分图像; Among them, S k (i, j) represents a binary image, T is a preset threshold, (i, j) represents the coordinates of a discrete image, and D k represents a differential image;
    所述处理器在实现所述根据所述二值图像确定所述实时录像中的运动目标时,用于实现:When the processor realizes the determination of the moving target in the real-time video recording according to the binary image, it is used to realize:
    将二值图像中S k(i,j)为1对应的区域设为运动区域; Set the area corresponding to S k (i, j) as 1 in the binary image as the motion area;
    对所述运动区域通过形态学处理和连通性分析去除噪点,以确定所述实时录像中的运动目标。Morphological processing and connectivity analysis are performed on the moving area to remove noise to determine the moving target in the real-time video.
  20. 根据权利要求15所述的计算机可读存储介质,其中,所述处理器在实现所述提取所述运动目标的边界框以及所述边界框对应的数据信息时,用于实现:15. The computer-readable storage medium according to claim 15, wherein, when the processor implements the extraction of the bounding box of the moving target and the data information corresponding to the bounding box, it is configured to implement:
    根据所述运动目标在实时录像中的水平宽带和竖直长度确定所述运动目标对应的边界框;Determine the bounding box corresponding to the moving target according to the horizontal bandwidth and vertical length of the moving target in real-time video recording;
    提取所述水平宽带和竖直长度作为所述尺寸信息,以及所述边界框的中心坐标作为所述位置信息。Extract the horizontal bandwidth and vertical length as the size information, and the center coordinates of the bounding box as the position information.
PCT/CN2019/091905 2019-01-23 2019-06-19 Moving object detection method and apparatus, computer device, and storage medium WO2020151172A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910065021.4 2019-01-23
CN201910065021.4A CN109919008A (en) 2019-01-23 2019-01-23 Moving target detecting method, device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2020151172A1 true WO2020151172A1 (en) 2020-07-30

Family

ID=66960695

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/091905 WO2020151172A1 (en) 2019-01-23 2019-06-19 Moving object detection method and apparatus, computer device, and storage medium

Country Status (2)

Country Link
CN (1) CN109919008A (en)
WO (1) WO2020151172A1 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111881854A (en) * 2020-07-31 2020-11-03 上海商汤临港智能科技有限公司 Action recognition method and device, computer equipment and storage medium
CN112101134A (en) * 2020-08-24 2020-12-18 深圳市商汤科技有限公司 Object detection method and device, electronic device and storage medium
CN112149546A (en) * 2020-09-16 2020-12-29 珠海格力电器股份有限公司 Information processing method and device, electronic equipment and storage medium
CN112465868A (en) * 2020-11-30 2021-03-09 浙江大华汽车技术有限公司 Target detection tracking method and device, storage medium and electronic device
CN112733741A (en) * 2021-01-14 2021-04-30 苏州挚途科技有限公司 Traffic signboard identification method and device and electronic equipment
CN113379591A (en) * 2021-06-21 2021-09-10 中国科学技术大学 Speed determination method, speed determination device, electronic device, and storage medium
CN113537207A (en) * 2020-12-22 2021-10-22 腾讯科技(深圳)有限公司 Video processing method, model training method and device and electronic equipment
CN113838110A (en) * 2021-09-08 2021-12-24 重庆紫光华山智安科技有限公司 Target detection result verification method and device, storage medium and electronic equipment
CN112149546B (en) * 2020-09-16 2024-05-03 珠海格力电器股份有限公司 Information processing method, device, electronic equipment and storage medium

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110532859B (en) * 2019-07-18 2021-01-22 西安电子科技大学 Remote sensing image target detection method based on deep evolution pruning convolution net
CN111723634B (en) * 2019-12-17 2024-04-16 中国科学院上海微系统与信息技术研究所 Image detection method and device, electronic equipment and storage medium
CN111222423A (en) * 2019-12-26 2020-06-02 深圳供电局有限公司 Target identification method and device based on operation area and computer equipment
CN113129331B (en) * 2019-12-31 2024-01-30 中移(成都)信息通信科技有限公司 Target movement track detection method, device, equipment and computer storage medium
CN111461209B (en) * 2020-03-30 2024-04-09 深圳市凯立德科技股份有限公司 Model training device and method
CN111582377A (en) * 2020-05-09 2020-08-25 济南浪潮高新科技投资发展有限公司 Edge end target detection method and system based on model compression
CN111866449B (en) * 2020-06-17 2022-03-29 中国人民解放军国防科技大学 Intelligent video acquisition system and method
CN112055172B (en) * 2020-08-19 2022-04-19 浙江大华技术股份有限公司 Method and device for processing monitoring video and storage medium
WO2022240363A2 (en) * 2021-05-12 2022-11-17 Nanyang Technological University Robotic meal-assembly systems and robotic methods for real-time object pose estimation of high-resemblance random food items
CN113205068A (en) * 2021-05-27 2021-08-03 苏州魔视智能科技有限公司 Method for monitoring sprinkler head, electronic equipment and vehicle
CN113192109B (en) * 2021-06-01 2022-01-11 北京海天瑞声科技股份有限公司 Method and device for identifying motion state of object in continuous frames
CN113344967B (en) * 2021-06-07 2023-04-07 哈尔滨理工大学 Dynamic target identification tracking method under complex background
CN113822137A (en) * 2021-07-23 2021-12-21 腾讯科技(深圳)有限公司 Data annotation method, device and equipment and computer readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103559498A (en) * 2013-09-24 2014-02-05 北京环境特性研究所 Rapid man and vehicle target classification method based on multi-feature fusion
CN104700430A (en) * 2014-10-05 2015-06-10 安徽工程大学 Method for detecting movement of airborne displays
CN106991668A (en) * 2017-03-09 2017-07-28 南京邮电大学 A kind of evaluation method of day net camera shooting picture
CN108022249A (en) * 2017-11-29 2018-05-11 中国科学院遥感与数字地球研究所 A kind of remote sensing video satellite moving vehicle target region of interest extraction method
CN109035287A (en) * 2018-07-02 2018-12-18 广州杰赛科技股份有限公司 Foreground image extraction method and device, moving vehicle recognition methods and device

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106878674B (en) * 2017-01-10 2019-08-30 哈尔滨工业大学深圳研究生院 A kind of parking detection method and device based on monitor video
CN109117794A (en) * 2018-08-16 2019-01-01 广东工业大学 A kind of moving target behavior tracking method, apparatus, equipment and readable storage medium storing program for executing

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103559498A (en) * 2013-09-24 2014-02-05 北京环境特性研究所 Rapid man and vehicle target classification method based on multi-feature fusion
CN104700430A (en) * 2014-10-05 2015-06-10 安徽工程大学 Method for detecting movement of airborne displays
CN106991668A (en) * 2017-03-09 2017-07-28 南京邮电大学 A kind of evaluation method of day net camera shooting picture
CN108022249A (en) * 2017-11-29 2018-05-11 中国科学院遥感与数字地球研究所 A kind of remote sensing video satellite moving vehicle target region of interest extraction method
CN109035287A (en) * 2018-07-02 2018-12-18 广州杰赛科技股份有限公司 Foreground image extraction method and device, moving vehicle recognition methods and device

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111881854A (en) * 2020-07-31 2020-11-03 上海商汤临港智能科技有限公司 Action recognition method and device, computer equipment and storage medium
CN112101134B (en) * 2020-08-24 2024-01-02 深圳市商汤科技有限公司 Object detection method and device, electronic equipment and storage medium
CN112101134A (en) * 2020-08-24 2020-12-18 深圳市商汤科技有限公司 Object detection method and device, electronic device and storage medium
CN112149546A (en) * 2020-09-16 2020-12-29 珠海格力电器股份有限公司 Information processing method and device, electronic equipment and storage medium
CN112149546B (en) * 2020-09-16 2024-05-03 珠海格力电器股份有限公司 Information processing method, device, electronic equipment and storage medium
CN112465868A (en) * 2020-11-30 2021-03-09 浙江大华汽车技术有限公司 Target detection tracking method and device, storage medium and electronic device
CN112465868B (en) * 2020-11-30 2024-01-12 浙江华锐捷技术有限公司 Target detection tracking method and device, storage medium and electronic device
CN113537207A (en) * 2020-12-22 2021-10-22 腾讯科技(深圳)有限公司 Video processing method, model training method and device and electronic equipment
CN113537207B (en) * 2020-12-22 2023-09-12 腾讯科技(深圳)有限公司 Video processing method, training method and device of model and electronic equipment
CN112733741A (en) * 2021-01-14 2021-04-30 苏州挚途科技有限公司 Traffic signboard identification method and device and electronic equipment
CN113379591A (en) * 2021-06-21 2021-09-10 中国科学技术大学 Speed determination method, speed determination device, electronic device, and storage medium
CN113379591B (en) * 2021-06-21 2024-02-27 中国科学技术大学 Speed determination method, speed determination device, electronic device and storage medium
CN113838110B (en) * 2021-09-08 2023-09-05 重庆紫光华山智安科技有限公司 Verification method and device for target detection result, storage medium and electronic equipment
CN113838110A (en) * 2021-09-08 2021-12-24 重庆紫光华山智安科技有限公司 Target detection result verification method and device, storage medium and electronic equipment

Also Published As

Publication number Publication date
CN109919008A (en) 2019-06-21

Similar Documents

Publication Publication Date Title
WO2020151172A1 (en) Moving object detection method and apparatus, computer device, and storage medium
Zhang et al. CCTSDB 2021: a more comprehensive traffic sign detection benchmark
Anandhalli et al. A novel approach in real-time vehicle detection and tracking using Raspberry Pi
Pavlic et al. Classification of images in fog and fog-free scenes for use in vehicles
US10970824B2 (en) Method and apparatus for removing turbid objects in an image
CN111435446A (en) License plate identification method and device based on L eNet
CN114418998A (en) Vehicle part defect detection method and system and electronic equipment
Dhatbale et al. Deep learning techniques for vehicle trajectory extraction in mixed traffic
CN115493612A (en) Vehicle positioning method and device based on visual SLAM
CN113688839B (en) Video processing method and device, electronic equipment and computer readable storage medium
CN117218622A (en) Road condition detection method, electronic equipment and storage medium
CN111009136A (en) Method, device and system for detecting vehicles with abnormal running speed on highway
CN111127358A (en) Image processing method, device and storage medium
CN112435278B (en) Visual SLAM method and device based on dynamic target detection
CN110154896B (en) Method and equipment for detecting obstacle
Ali et al. IRUVD: a new still-image based dataset for automatic vehicle detection
CN110765940B (en) Target object statistical method and device
Lashkov et al. Edge-computing-facilitated nighttime vehicle detection investigations with CLAHE-enhanced images
Yaghoobi Ershadi Improving vehicle tracking rate and speed estimation in dusty and snowy weather conditions with a vibrating camera
CN113902047B (en) Image element matching method, device, equipment and storage medium
Muniruzzaman et al. Deterministic algorithm for traffic detection in free-flow and congestion using video sensor
Xiong et al. Fast and robust approaches for lane detection using multi‐camera fusion in complex scenes
CN116052090A (en) Image quality evaluation method, model training method, device, equipment and medium
CN108985233B (en) High-precision vehicle tracking method based on digital image correlation
Jehad et al. Developing and validating a real time video based traffic counting and classification

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19911806

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19911806

Country of ref document: EP

Kind code of ref document: A1