CN112001320B - Gate detection method based on video - Google Patents

Gate detection method based on video

Info

Publication number
CN112001320B
CN112001320B
Authority
CN
China
Prior art keywords
gate
top line
detected
detection
video
Prior art date
Legal status
Active
Application number
CN202010864191.1A
Other languages
Chinese (zh)
Other versions
CN112001320A (en)
Inventor
薛超
高旭麟
刘琰
陈澎祥
孙雅彬
Current Assignee
Tiandy Technologies Co Ltd
Original Assignee
Tiandy Technologies Co Ltd
Priority date
Filing date
Publication date
Application filed by Tiandy Technologies Co Ltd filed Critical Tiandy Technologies Co Ltd
Priority to CN202010864191.1A priority Critical patent/CN112001320B/en
Publication of CN112001320A publication Critical patent/CN112001320A/en
Application granted granted Critical
Publication of CN112001320B publication Critical patent/CN112001320B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/40 Scenes; Scene-specific elements in video content
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/50 Context or environment of the image
    • G06V20/52 Surveillance or monitoring of activities, e.g. for recognising suspicious objects

Abstract

The invention provides a video-based gate detection method, characterized by comprising the following steps: S1, establishing and training a deep learning model for gate detection; S2, setting a gate detection rule; S3, preprocessing an image to be detected; S4, inputting the preprocessed image into the deep learning model for gate detection. The method detects the gate state by analyzing video images, achieves real-time, automated gate state detection, saves a great amount of labor and time cost, effectively improves identification accuracy, reduces the false alarm rate, and is applicable to a variety of scenes.

Description

Gate detection method based on video
Technical Field
The invention belongs to the technical field of video monitoring, and particularly relates to a gate detection method based on video.
Background
With the wide application of video monitoring systems in the water conservancy industry, such systems play an increasingly important role in the daily management of water conservancy departments: staff can monitor each water conservancy facility in real time from a monitoring center or a working computer and observe the current state of water resources, which greatly improves working efficiency. Gates in hydraulic engineering are varied; because a gate bears great stress during opening and closing, it is easily damaged, and incomplete opening or closing, jamming and similar situations often occur. The existing solution is a gate open/close detection switch, but the gate sometimes needs to operate in a half-closed state, which such a switch cannot easily detect.
Disclosure of Invention
In view of the above, the present invention is directed to a video-based gate detection method, so as to solve the problem that the gate state is difficult to detect during opening and closing. The method can analyze the video stream transmitted by an existing camera and achieves a good detection effect on various gates in videos of differing definition. Because the detected gate position differs between the open and closed states, the gate state can be analyzed from the positional relationship between the top line of the current gate position and the configured open-state and closed-state top lines, so the gate can be monitored automatically and continuously.
In order to achieve the above purpose, the technical scheme of the invention is realized as follows:
A video-based gate detection method, comprising the steps of:
S1, establishing and training a deep learning model for gate detection;
S2, setting a gate detection rule;
s3, preprocessing an image to be detected;
S4, inputting the preprocessed image into a deep learning model for gate detection to detect.
Further, the deep learning model gate detection in step S4 includes: gate state determination, gate movement direction determination, and gate opening percentage calculation.
Further, the training of the deep learning model in step S1 adopts a stochastic gradient descent method to iteratively train the model, each iteration reducing the loss function. The loss function used is as follows:

$$
\begin{aligned}
Loss = {}& \lambda_{coord}\sum_{i=1}^{s^{2}}\sum_{j=1}^{B}\mathbb{1}_{ij}^{obj}\left[(x_i-\hat{x}_i)^{2}+(y_i-\hat{y}_i)^{2}\right] \\
&+ \lambda_{coord}\sum_{i=1}^{s^{2}}\sum_{j=1}^{B}\mathbb{1}_{ij}^{obj}\left[\left(\sqrt{\omega_i}-\sqrt{\hat{\omega}_i}\right)^{2}+\left(\sqrt{h_i}-\sqrt{\hat{h}_i}\right)^{2}\right] \\
&+ \sum_{i=1}^{s^{2}}\sum_{j=1}^{B}\mathbb{1}_{ij}^{obj}\left(C_i-\hat{C}_i\right)^{2} \\
&+ \lambda_{noobj}\sum_{i=1}^{s^{2}}\sum_{j=1}^{B}\mathbb{1}_{ij}^{noobj}\left(C_i-\hat{C}_i\right)^{2}
\end{aligned}
$$

where $i$ indexes the $i$-th grid cell and $j$ the $j$-th bbox, and $\mathbb{1}_{ij}^{obj}=1$ if the $j$-th bbox of the $i$-th grid cell contains a gate ($\mathbb{1}_{ij}^{noobj}$ is its complement). The first two terms are the coordinate prediction: $(\hat{x}_i,\hat{y}_i)$ are the predicted gate center point coordinates, $(x_i,y_i)$ is the labeled gate center point, $\omega_i$, $h_i$ are the labeled gate box width and height, and $\hat{\omega}_i$, $\hat{h}_i$ are the predicted width and height. $C_i$ represents the confidence of the labeled gate box and $\hat{C}_i$ the predicted confidence; the third term is the confidence of boxes predicted to contain a gate, and the fourth term covers predicted boxes that do not contain a gate. $\lambda_{coord}$ and $\lambda_{noobj}$ are weight coefficients, $B$ is the number of anchor boxes, and $s^{2}$ is the total number of cells on the feature map, i.e. the grid number.
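As a concrete illustration of the four-term loss described above, the following is a minimal sketch in plain Python. The data layout (per-cell lists of box dictionaries and an object mask) is a hypothetical format chosen for readability, not the patent's or DarkNet's actual implementation.

```python
import math

def yolo_gate_loss(preds, labels, obj_mask, lam_coord=5.0, lam_noobj=0.5):
    """Toy version of the four-term YOLO-style loss described above.

    preds/labels: preds[i][j] and labels[i][j] are dicts with keys
    x, y, w, h, C for bbox j of grid cell i; obj_mask[i][j] is True when
    that bbox is responsible for a gate. Illustrative sketch only.
    """
    coord = size = conf_obj = conf_noobj = 0.0
    for i, row in enumerate(obj_mask):
        for j, has_gate in enumerate(row):
            p, t = preds[i][j], labels[i][j]
            if has_gate:
                # Terms 1-2: center and (square-rooted) size errors.
                coord += (t["x"] - p["x"]) ** 2 + (t["y"] - p["y"]) ** 2
                size += (math.sqrt(t["w"]) - math.sqrt(p["w"])) ** 2 \
                      + (math.sqrt(t["h"]) - math.sqrt(p["h"])) ** 2
                # Term 3: confidence error for boxes containing a gate.
                conf_obj += (t["C"] - p["C"]) ** 2
            else:
                # Term 4: confidence error for boxes without a gate.
                conf_noobj += (t["C"] - p["C"]) ** 2
    return lam_coord * (coord + size) + conf_obj + lam_noobj * conf_noobj
```

A perfect prediction yields a loss of zero, and any deviation in a gate box's coordinates raises it, which is what stochastic gradient descent drives down across iterations.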
Further, the detection setting in step S2 mainly includes setting the top line position when the gate is fully open and the top line position when the gate is fully closed. The open top line is the position of the gate's upper edge when the gate is fully open, and the closed top line is the position of the upper edge when the gate is fully closed; both are marked by drawing a line segment. The current state of the gate can then be judged from the positional relationship between the detected upper edge of the gate's current position and these two top lines.
Further, in the step S3, a gaussian filtering method is adopted to preprocess the image, so as to reduce image noise.
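The Gaussian preprocessing of step S3 can be sketched as below. In practice a library routine (e.g. OpenCV's GaussianBlur) would be used; this pure-Python version, with its hypothetical `gaussian_kernel` and `gaussian_filter` names, only illustrates the principle of smoothing pixel noise before detection.

```python
import math

def gaussian_kernel(size=5, sigma=1.0):
    """Build a normalized 2-D Gaussian kernel."""
    half = size // 2
    k = [[math.exp(-(x * x + y * y) / (2 * sigma * sigma))
          for x in range(-half, half + 1)]
         for y in range(-half, half + 1)]
    s = sum(map(sum, k))
    return [[v / s for v in row] for row in k]

def gaussian_filter(img, size=5, sigma=1.0):
    """Convolve a grayscale image (list of lists of floats) with the
    Gaussian kernel, clamping indices at the borders."""
    k = gaussian_kernel(size, sigma)
    half = size // 2
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = 0.0
            for dy in range(-half, half + 1):
                for dx in range(-half, half + 1):
                    yy = min(max(y + dy, 0), h - 1)
                    xx = min(max(x + dx, 0), w - 1)
                    acc += k[dy + half][dx + half] * img[yy][xx]
            out[y][x] = acc
    return out
```

Because the kernel is normalized, smooth regions keep their brightness while isolated noise pixels are averaged away.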
Further, in step S4 the deep learning model detects the image obtained in step S3 and obtains a score corresponding to the detected gate position, where the score calculation formula is as follows:

$$h_{\theta}(x)=\frac{1}{1+e^{-\theta^{T}x}}$$

where $\theta$ is a parameter vector and, for a given input $x$, $h_{\theta}(x)$ represents the probability that the corresponding class label belongs to the positive case, i.e., the score referred to in the text.
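The logistic score formula can be checked with a few lines of Python. The function name `gate_score` and the flat feature vector are assumptions for illustration; the real $x$ is produced by the detector.

```python
import math

def gate_score(theta, x):
    """h_theta(x) = 1 / (1 + exp(-theta . x)): probability that the
    detected box is a gate (the detection score)."""
    z = sum(t * v for t, v in zip(theta, x))
    return 1.0 / (1.0 + math.exp(-z))
```

The score always lies strictly between 0 and 1, with 0.5 at the decision boundary, which is why a fixed threshold such as 0.8 can be applied to it.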
Further, the gate status judging process is as follows:
If the gate target is not detected by the continuous multi-frame in the step S4, the gate is considered to be closed; if the gate target is detected in step S4, the positional relationship between the top line of the gate position detected in step S4 and the top line when the set gate is completely closed is analyzed, and if the top line of the current gate position is above the set closing top line, the gate is considered to be opened, otherwise the gate is considered to be closed.
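The state judgment above can be sketched as follows. This assumes image coordinates where a smaller y value means higher in the frame, and uses `None` for frames with no detected gate target; the function name and the five-frame miss limit are illustrative choices, not values from the patent.

```python
def gate_state(detected_toplines, closed_topline_y, miss_limit=5):
    """Decide open/closed from recent per-frame top-line y values.

    detected_toplines: most recent frames first-to-last; None means no
    gate target was detected in that frame.
    """
    recent = detected_toplines[-miss_limit:]
    # No gate target over several consecutive frames: fully closed.
    if len(recent) == miss_limit and all(t is None for t in recent):
        return "closed"
    last = next((t for t in reversed(detected_toplines) if t is not None),
                None)
    if last is None:
        return "closed"
    # Current top line above the configured closed top line: open.
    return "open" if last < closed_topline_y else "closed"
```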
Further, the gate movement direction judging process is as follows:
In the judging of the gate movement direction, the top line of the gate position detected in the step S4 of two continuous frames is stored, and if the top line of the gate position of the current frame is above the top line of the gate of the previous frame, the gate is considered to move upwards; if the top line of the gate position of the current frame coincides with the top line of the gate position of the previous frame, the gate is considered to be stationary; if the top line of the gate position of the current frame is below the top line position of the gate of the previous frame, the gate is considered to be moving downwards.
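The direction rule compares only the stored top lines of two consecutive frames; a minimal sketch (again assuming image coordinates where smaller y is higher):

```python
def gate_direction(prev_topline_y, curr_topline_y):
    """Movement direction from the detected top line of two consecutive
    frames: above the previous line means moving up, below means down,
    coincident means stationary."""
    if curr_topline_y < prev_topline_y:
        return "up"
    if curr_topline_y > prev_topline_y:
        return "down"
    return "stationary"
```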
Further, the gate opening percentage is calculated as follows: calculate the distance d between the top line when the gate is fully open and the top line when the gate is fully closed, then calculate the distance Δd between the top line of the gate position detected in step S4 and the top line when the gate is fully closed; Δd/d is the gate opening percentage.
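The Δd/d calculation is a one-liner once the two configured top lines are known; the sketch below assumes all three positions are y coordinates on the same vertical axis.

```python
def opening_percentage(open_topline_y, closed_topline_y, curr_topline_y):
    """Opening percentage = 100 * delta_d / d, where d is the full travel
    between the fully-open and fully-closed top lines and delta_d is the
    current top line's distance from the fully-closed top line."""
    d = abs(closed_topline_y - open_topline_y)        # full travel
    delta_d = abs(closed_topline_y - curr_topline_y)  # current travel
    return 100.0 * delta_d / d
```

For example, a gate whose top line sits halfway between the two configured lines reports 50 percent open.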
Further, in step S4, the process of detecting the image obtained in step S3 is as follows: the whole image is detected by the trained YOLO model, and the position and score of each detected target are recorded. If a detected target's score is greater than 0.8, it is considered a valid target and its movement direction is judged; if the detection score is below 0.8, the gate is considered to be in the closed state.
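The 0.8 score filter can be expressed as below. The `(box, score)` tuple format is a hypothetical representation of the YOLO detector's output chosen for this sketch.

```python
def filter_detections(detections, threshold=0.8):
    """Keep only detections whose score exceeds the threshold; when no
    valid gate target remains, treat the gate as closed."""
    valid = [(box, s) for box, s in detections if s > threshold]
    state = "detected" if valid else "closed"
    return valid, state
```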
Compared with the prior art, the gate detection method based on the video has the following advantages:
(1) According to the gate detection method based on the video, the gate state is detected by analyzing the video image, so that the real-time performance and the automation of the gate state detection are realized;
(2) The gate detection method based on the video, disclosed by the invention, has the advantages that a great amount of labor cost and time cost are saved, the identification accuracy is effectively improved, and the false alarm rate is reduced;
(3) The gate detection method based on the video has the applicability of various scenes.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the invention. In the drawings:
FIG. 1 is a training flow chart of the deep learning model of the video-based gate detection method according to an embodiment of the present invention;
fig. 2 is a gate detection flow chart of a gate detection method based on video according to an embodiment of the present invention.
Detailed Description
It should be noted that, without conflict, the embodiments of the present invention and features of the embodiments may be combined with each other.
In the description of the present invention, it should be understood that the terms "center", "longitudinal", "lateral", "upper", "lower", "front", "rear", "left", "right", "vertical", "horizontal", "top", "bottom", "inner", "outer", etc. indicate orientations or positional relationships based on the orientations or positional relationships shown in the drawings, are merely for convenience in describing the present invention and simplifying the description, and do not indicate or imply that the devices or elements referred to must have a specific orientation, be configured and operated in a specific orientation, and thus should not be construed as limiting the present invention. Furthermore, the terms "first," "second," and the like, are used for descriptive purposes only and are not to be construed as indicating or implying a relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defining "a first", "a second", etc. may explicitly or implicitly include one or more such feature. In the description of the present invention, unless otherwise indicated, the meaning of "a plurality" is two or more.
In the description of the present invention, it should be noted that, unless explicitly specified and limited otherwise, the terms "mounted," "connected," and "connected" are to be construed broadly, and may be either fixedly connected, detachably connected, or integrally connected, for example; can be mechanically or electrically connected; can be directly connected or indirectly connected through an intermediate medium, and can be communication between two elements. The specific meaning of the above terms in the present invention can be understood by those of ordinary skill in the art in a specific case.
The invention will be described in detail below with reference to the drawings in connection with embodiments.
Referring to fig. 1 and 2, the present invention provides a technical solution: a video-based gate detection method, comprising the steps of:
And S1, training a deep learning model. This specifically comprises sample collection, data enhancement, sample labeling, and model training under the DarkNet framework. Sample collection needs to cover the various possible appearances of the target in the application scene, and must include scenes without a gate but containing objects easily mistaken for a gate, so as to reduce the false detection rate. After collection, image enhancement is applied to the samples: brightness, angle, contrast and similar properties are adjusted to increase sample diversity and improve the robustness of the model. After data enhancement, the gates in the samples are labeled; labeling requires that the marked target positions be accurate. After labeling is complete, the YOLO model is trained under the DarkNet framework.
And S2, setting detection rules. The regular arrangement mainly comprises arrangement of the top line position when the gate is opened and arrangement of the top line position when the gate is closed. When the gate is opened, the top line is the position of the upper edge of the gate, and the upper edge position is marked by drawing a line segment; when the gate is closed, the top line is the position of the upper edge of the gate, the upper edge position is marked by drawing a line segment, and the current state of the gate can be judged by judging the position relationship between the detected upper edge position of the current position of the gate and the two top lines.
And S3, preprocessing the image. The image preprocessing is to smooth and remove the noise of the image to be processed before detection so as to achieve a better detection effect. In the method, the Gaussian filtering method is used for preprocessing the image, so that the image noise is reduced.
And S4, detecting a gate by the deep learning model. And (3) detecting the whole image by using the YOLO model trained in the step (S1), recording the position and the score of the detected target, and if the target score is greater than 0.8, considering the target as an effective target.
And judging the state of the gate. Since the gate cannot be seen after the gate is completely closed in some scenes, if the gate target is not detected in the consecutive multi-frame in step S4, the gate is considered to be closed; if the gate target is detected in step S4, the positional relationship between the top line of the gate position detected in step S4 and the top line when the set gate is completely closed is analyzed, and if the top line of the current gate position is above the set closing top line, the gate is considered to be opened, otherwise the gate is considered to be closed.
And judging the movement direction of the gate. Storing the top line of the gate position detected in the step S4 of two continuous frames, and if the top line of the gate position of the current frame is above the top line position of the gate of the previous frame, considering that the gate moves upwards; if the top line of the gate position of the current frame coincides with the top line of the gate position of the previous frame, the gate is considered to be stationary; if the top line of the gate position of the current frame is below the top line position of the gate of the previous frame, the gate is considered to be moving downwards.
And calculating the gate opening percentage. Calculate the distance d between the top line when the gate is fully open and the top line when the gate is fully closed, then calculate the distance Δd between the top line of the gate position detected in step S4 and the top line when the gate is fully closed; Δd/d is the gate opening percentage.
The working procedure of this embodiment is as follows:
The detection model is trained in advance. Samples of various gates in various scenes are collected first and labeled, i.e., the real position coordinates are marked according to the position of the gate in each image. The model is then trained by repeated iteration using a stochastic gradient descent method, each iteration reducing the loss function. The loss function used is as follows:

$$
\begin{aligned}
Loss = {}& \lambda_{coord}\sum_{i=1}^{s^{2}}\sum_{j=1}^{B}\mathbb{1}_{ij}^{obj}\left[(x_i-\hat{x}_i)^{2}+(y_i-\hat{y}_i)^{2}\right] \\
&+ \lambda_{coord}\sum_{i=1}^{s^{2}}\sum_{j=1}^{B}\mathbb{1}_{ij}^{obj}\left[\left(\sqrt{\omega_i}-\sqrt{\hat{\omega}_i}\right)^{2}+\left(\sqrt{h_i}-\sqrt{\hat{h}_i}\right)^{2}\right] \\
&+ \sum_{i=1}^{s^{2}}\sum_{j=1}^{B}\mathbb{1}_{ij}^{obj}\left(C_i-\hat{C}_i\right)^{2} \\
&+ \lambda_{noobj}\sum_{i=1}^{s^{2}}\sum_{j=1}^{B}\mathbb{1}_{ij}^{noobj}\left(C_i-\hat{C}_i\right)^{2}
\end{aligned}
$$

where $i$ indexes the $i$-th grid cell and $j$ the $j$-th bbox, and $\mathbb{1}_{ij}^{obj}=1$ if the $j$-th bbox of the $i$-th grid cell contains a gate ($\mathbb{1}_{ij}^{noobj}$ is its complement). The first two terms are the coordinate prediction: $(\hat{x}_i,\hat{y}_i)$ are the predicted gate center point coordinates, $(x_i,y_i)$ is the labeled gate center point, $\omega_i$, $h_i$ are the labeled gate box width and height, and $\hat{\omega}_i$, $\hat{h}_i$ are the predicted width and height. $C_i$ represents the confidence of the labeled gate box and $\hat{C}_i$ the predicted confidence; the third term is the confidence of boxes predicted to contain a gate, and the fourth term covers predicted boxes that do not contain a gate. $\lambda_{coord}$ and $\lambda_{noobj}$ are weight coefficients, $B$ is the number of anchor boxes, and $s^{2}$ is the total number of cells on the feature map, i.e. the grid number.
Through continuous iteration, the box error becomes smaller and smaller and the prediction more and more accurate.
And finally, determining the specific position of the gate in the image by using a YOLO model with the best detection effect.
Starting detection;
step S2, setting the top line of the gate when it is completely open and the top line of the gate when it is completely closed;
S3, preprocessing the video image, and removing image noise points; the Gaussian filter is used for preprocessing the image, so that noise can be effectively restrained, and the image can be smoothed;
and S4, detecting the image obtained in step S3 with the deep learning YOLO model to obtain a score corresponding to the detected gate position, where the score calculation formula is as follows:

$$h_{\theta}(x)=\frac{1}{1+e^{-\theta^{T}x}}$$

where $\theta$ is a parameter vector and, for a given input $x$, $h_{\theta}(x)$ represents the probability that the corresponding class label belongs to the positive case, i.e., the score in the text. The score ranges from 0 at the lowest to 1 at the highest; results with a score below 0.8 are filtered out, leaving the correct detection results;
Finally, the top line position of the valid gate target obtained in step S4 for each video frame is compared with the top line position of the gate when fully open and the top line position when fully closed, so as to judge the current state, movement direction, and opening percentage of the gate.
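The three per-frame judgments can be combined into one routine; the sketch below assumes image coordinates (smaller y is higher in the frame) and a hypothetical function name.

```python
def analyze_frame(prev_topline_y, curr_topline_y,
                  open_topline_y, closed_topline_y):
    """Combine gate state, movement direction, and opening percentage for
    one frame, given the configured fully-open / fully-closed top lines."""
    # State: current top line above the closed top line means open.
    state = "open" if curr_topline_y < closed_topline_y else "closed"
    # Direction: compare against the previous frame's top line.
    if curr_topline_y < prev_topline_y:
        direction = "up"
    elif curr_topline_y > prev_topline_y:
        direction = "down"
    else:
        direction = "stationary"
    # Opening percentage: delta_d / d.
    travel = abs(closed_topline_y - open_topline_y)
    percent = 100.0 * abs(closed_topline_y - curr_topline_y) / travel
    return {"state": state, "direction": direction, "percent_open": percent}
```

For instance, a top line at y=200 between an open line at y=100 and a closed line at y=300, having risen from y=220 in the previous frame, yields an open gate moving up at 50 percent opening.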
The foregoing description of the preferred embodiments of the invention is not intended to be limiting, but rather is intended to cover all modifications, equivalents, alternatives, and improvements that fall within the spirit and scope of the invention.

Claims (5)

1. A video-based gate detection method, comprising the steps of:
S1, building and training a YOLO model for gate detection;
S2, setting a gate detection rule;
The detection setting in the step S2 mainly comprises the setting of the top line position when the gate is opened and the setting of the top line position when the gate is closed;
s3, preprocessing an image to be detected;
S4, inputting the preprocessed image into a YOLO model for gate detection for detection;
The YOLO model gate detection in step S4 includes: gate state determination, gate movement direction determination, and gate opening percentage calculation;
The gate state judging process is as follows:
If the gate target is not detected by the continuous multi-frame in the step S4, the gate is considered to be closed; if the gate target is detected in the step S4, analyzing the position relation between the top line of the gate position detected in the step S4 and the top line when the set gate is completely closed, if the top line of the current gate position is above the set closing top line, the gate is considered to be opened, otherwise, the gate is considered to be closed;
the gate movement direction judging process is as follows:
Judging the gate movement direction, storing the top line of the gate position detected in the step S4 of two continuous frames, and if the top line of the gate position of the current frame is above the top line position of the gate of the previous frame, considering that the gate moves upwards; if the top line of the gate position of the current frame coincides with the top line of the gate position of the previous frame, the gate is considered to be stationary; if the top line of the gate position of the current frame is below the top line of the gate position of the previous frame, the gate is considered to move downwards;
The method for calculating the gate opening percentage comprises the following steps: calculating the distance d between the top line when the gate is fully open and the top line when the gate is fully closed, then calculating the distance Δd between the top line of the gate position detected in step S4 and the top line when the gate is fully closed, wherein Δd/d is the gate opening percentage.
2. The video-based gate detection method of claim 1, wherein:
In step S1, the model is trained by repeated iteration using a stochastic gradient descent method, with the following loss function:

$$
\begin{aligned}
Loss = {}& \lambda_{coord}\sum_{i=1}^{s^{2}}\sum_{j=1}^{B}\mathbb{1}_{ij}^{obj}\left[(x_i-\hat{x}_i)^{2}+(y_i-\hat{y}_i)^{2}\right] \\
&+ \lambda_{coord}\sum_{i=1}^{s^{2}}\sum_{j=1}^{B}\mathbb{1}_{ij}^{obj}\left[\left(\sqrt{\omega_i}-\sqrt{\hat{\omega}_i}\right)^{2}+\left(\sqrt{h_i}-\sqrt{\hat{h}_i}\right)^{2}\right] \\
&+ \sum_{i=1}^{s^{2}}\sum_{j=1}^{B}\mathbb{1}_{ij}^{obj}\left(C_i-\hat{C}_i\right)^{2} \\
&+ \lambda_{noobj}\sum_{i=1}^{s^{2}}\sum_{j=1}^{B}\mathbb{1}_{ij}^{noobj}\left(C_i-\hat{C}_i\right)^{2}
\end{aligned}
$$

where $i$ indexes the $i$-th grid and $j$ the $j$-th bbox, and $\mathbb{1}_{ij}^{obj}=1$ if the $j$-th bbox of the $i$-th grid is a gate ($\mathbb{1}_{ij}^{noobj}$ is its complement). The first two terms are the coordinate prediction: $(\hat{x}_i,\hat{y}_i)$ are the predicted gate center point coordinates, $(x_i,y_i)$ is the labeled gate center point, $\omega_i$, $h_i$ are the labeled gate bbox width and height, and $\hat{\omega}_i$, $\hat{h}_i$ are the predicted width and height of the output gate bbox. $C_i$ represents the confidence of a bbox containing the gate and $\hat{C}_i$ the predicted confidence; the third term is the predicted confidence of bboxes containing gates, and the fourth term covers predicted bboxes that do not contain gates. $\lambda_{coord}$ and $\lambda_{noobj}$ are weight coefficients, $B$ is the number of anchor boxes, and $s^{2}$ is the total grid number on the feature map.
3. The video-based gate detection method of claim 1, wherein: in the step S3, the image is preprocessed by adopting a Gaussian filtering method, so that the image noise is reduced.
4. The video-based gate detection method of claim 1, wherein: in step S4, the YOLO model gate detection detects the image obtained in step S3, and a score corresponding to the detected gate position is obtained, where the score calculation formula is as follows:

$$h_{\theta}(x)=\frac{1}{1+e^{-\theta^{T}x}}$$

where $\theta$ is a parameter vector and, for a given input $x$, $h_{\theta}(x)$ represents the probability that the corresponding class label belongs to the positive case, i.e., the score in the text.
5. The video-based gate detection method of claim 1, wherein: in step S4, the process of detecting the image obtained in step S3 includes: detecting the whole image through the trained YOLO model, and recording the position and score of the detected target; if the detected target is larger than 0.8, the detected target is considered as an effective target, and the moving direction of the effective target is judged; if the detection target is smaller than 0.8, the gate is in a closed state.
CN202010864191.1A 2020-08-25 2020-08-25 Gate detection method based on video Active CN112001320B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010864191.1A CN112001320B (en) 2020-08-25 2020-08-25 Gate detection method based on video

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010864191.1A CN112001320B (en) 2020-08-25 2020-08-25 Gate detection method based on video

Publications (2)

Publication Number Publication Date
CN112001320A CN112001320A (en) 2020-11-27
CN112001320B true CN112001320B (en) 2024-04-23

Family

ID=73471492

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010864191.1A Active CN112001320B (en) 2020-08-25 2020-08-25 Gate detection method based on video

Country Status (1)

Country Link
CN (1) CN112001320B (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103973999A (en) * 2013-02-01 2014-08-06 佳能株式会社 Imaging apparatus and control method therefor
CN204010121U (en) * 2014-07-11 2014-12-10 山东新北洋信息技术股份有限公司 The position detecting mechanism of gate and paper money processing machine
CN108318581A (en) * 2017-12-08 2018-07-24 中国兵器科学研究院宁波分院 A kind of arc surface workpiece ultrasonic C-scanning automatic testing method without Set and Positioning
CN109405808A (en) * 2018-10-19 2019-03-01 天津英田视讯科技有限公司 A kind of hydrological monitoring spherical camera
CN110516615A (en) * 2019-08-29 2019-11-29 广西师范大学 Human and vehicle shunting control method based on convolutional neural networks
CN110999765A (en) * 2018-10-08 2020-04-14 台湾积体电路制造股份有限公司 Irrigation system for irrigating land
CN111209811A (en) * 2019-12-26 2020-05-29 的卢技术有限公司 Method and system for detecting eyeball attention position in real time

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7415126B2 (en) * 1992-05-05 2008-08-19 Automotive Technologies International Inc. Occupant sensing system
US20200005468A1 (en) * 2019-09-09 2020-01-02 Intel Corporation Method and system of event-driven object segmentation for image processing

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103973999A (en) * 2013-02-01 2014-08-06 佳能株式会社 Imaging apparatus and control method therefor
CN204010121U (en) * 2014-07-11 2014-12-10 山东新北洋信息技术股份有限公司 The position detecting mechanism of gate and paper money processing machine
CN108318581A (en) * 2017-12-08 2018-07-24 中国兵器科学研究院宁波分院 A kind of arc surface workpiece ultrasonic C-scanning automatic testing method without Set and Positioning
CN110999765A (en) * 2018-10-08 2020-04-14 台湾积体电路制造股份有限公司 Irrigation system for irrigating land
CN109405808A (en) * 2018-10-19 2019-03-01 天津英田视讯科技有限公司 A kind of hydrological monitoring spherical camera
CN110516615A (en) * 2019-08-29 2019-11-29 广西师范大学 Human and vehicle shunting control method based on convolutional neural networks
CN111209811A (en) * 2019-12-26 2020-05-29 的卢技术有限公司 Method and system for detecting eyeball attention position in real time

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
YOLOv3: An Incremental Improvement; Joseph Redmon et al.; arXiv; 1-6 *
Design of an ARM-based embedded intelligent gate measurement and control instrument; Tu Conggang; China Master's Theses Full-text Database, Engineering Science and Technology II (No. 07); C030-5 *
Research on gate maintenance automation and fault diagnosis technology for water control projects; Xiao Zhihuai; China Doctoral and Master's Dissertations Full-text Database (Doctoral), Engineering Science and Technology II (No. 02); C037-32 *

Also Published As

Publication number Publication date
CN112001320A (en) 2020-11-27

Similar Documents

Publication Publication Date Title
CN110059694B (en) Intelligent identification method for character data in complex scene of power industry
CN109644255B (en) Method and apparatus for annotating a video stream comprising a set of frames
CN112199993A (en) Method for identifying transformer substation insulator infrared image detection model in any direction based on artificial intelligence
CN113139521B (en) Pedestrian boundary crossing monitoring method for electric power monitoring
CN111652225A (en) Non-invasive camera reading method and system based on deep learning
CN106952293A (en) A kind of method for tracking target based on nonparametric on-line talking
CN112257527B (en) Mobile phone detection method based on multi-target fusion and space-time video sequence
CN111476160A (en) Loss function optimization method, model training method, target detection method, and medium
CN109583295A (en) A kind of notch of switch machine automatic testing method based on convolutional neural networks
CN112597928A (en) Event detection method and related device
CN112561885B (en) YOLOv 4-tiny-based gate valve opening detection method
CN115035328A (en) Converter image increment automatic machine learning system and establishment training method thereof
CN114879628A (en) Multi-mode industrial process fault diagnosis method based on local maximum mean difference resistance
CN112001320B (en) Gate detection method based on video
CN113657151A (en) Water traffic violation detection method based on YOLO target detection algorithm
CN111626104B (en) Cable hidden trouble point detection method and device based on unmanned aerial vehicle infrared thermal image
CN116205905B (en) Power distribution network construction safety and quality image detection method and system based on mobile terminal
CN116385485B (en) Video tracking method and system for long-strip-shaped tower crane object
CN115995056A (en) Automatic bridge disease identification method based on deep learning
CN116543327A (en) Method, device, computer equipment and storage medium for identifying work types of operators
CN114694090A (en) Campus abnormal behavior detection method based on improved PBAS algorithm and YOLOv5
CN114492657A (en) Plant disease classification method and device, electronic equipment and storage medium
CN110363109B (en) Computer vision-based coverage type oil leakage detection system and method
CN113673534A (en) RGB-D image fruit detection method based on fast RCNN
CN112906593A (en) Sluice image identification method based on fast RCNN

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant