CN113408365B - Safety helmet identification method and device under complex scene - Google Patents

Safety helmet identification method and device under complex scene

Info

Publication number
CN113408365B
CN113408365B (application CN202110579308.6A)
Authority
CN
China
Prior art keywords
safety helmet
picture
color
adopting
worn
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110579308.6A
Other languages
Chinese (zh)
Other versions
CN113408365A (en)
Inventor
邹祥波
秦士伟
饶睦敏
叶骥
王群
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Energy Group Science And Technology Research Institute Co ltd
Original Assignee
Guangdong Energy Group Science And Technology Research Institute Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Energy Group Science And Technology Research Institute Co ltd filed Critical Guangdong Energy Group Science And Technology Research Institute Co ltd
Priority to CN202110579308.6A priority Critical patent/CN113408365B/en
Publication of CN113408365A publication Critical patent/CN113408365A/en
Application granted granted Critical
Publication of CN113408365B publication Critical patent/CN113408365B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
    • G06F18/24133Distances to prototypes
    • G06F18/24137Distances to cluster centroïds
    • G06F18/2414Smoothing the distance, e.g. radial basis function networks [RBFN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Abstract

The application discloses a safety helmet identification method and device under a complex scene, wherein the method comprises the following steps: acquiring a safety helmet wearing state picture in a complex scene, and performing data annotation on the safety helmet wearing state picture to obtain an annotated picture; preprocessing the annotated picture by adopting a picture processing method of illumination equalization to obtain a preprocessed picture; training a neural network model with the preprocessed picture, and obtaining a safety helmet recognition model by changing the network structure of the neural network model and adding an attention mechanism; and acquiring a picture to be identified in a complex scene, inputting the picture to be identified into the safety helmet identification model, and identifying the wearing state of the safety helmet of a worker in the picture to be identified by adopting a TTA method. The embodiment of the application can effectively reduce the influence of actual environmental factors on the identification result in a complex scene, and can improve the accuracy of safety helmet identification by modifying the network structure of the YOLOv5 neural network and fusing the attention mechanism.

Description

Safety helmet identification method and device under complex scene
Technical Field
The application relates to the technical field of target detection, in particular to a safety helmet identification method and device under a complex scene.
Background
A safety helmet provides buffering and shock absorption and is an indispensable safety tool for production workers and high-altitude operators in various industries. Operational safety in a complex scene is closely tied to whether safety helmets are worn. Existing safety helmet identification methods mainly focus on comprehensively combining mainstream algorithm models to raise the helmet detection rate, and related research mostly targets helmet identification in simple scenes without considering the influence of the many factors present on an actual construction site, so the wearing condition of the safety helmet in a complex scene is difficult to identify.
Disclosure of Invention
The application provides a safety helmet identification method and device under a complex scene, which are used for solving the problem that the conventional safety helmet identification method does not consider the influence of various factors of an actual site on safety helmet identification, so that the wearing condition of the safety helmet under the complex scene is difficult to identify.
The first embodiment of the application provides a safety helmet identification method under a complex scene, which comprises the following steps:
acquiring a safety helmet wearing state picture in a complex scene, and performing data annotation on the safety helmet wearing state picture to obtain an annotation picture;
preprocessing the marked picture by adopting a picture processing method of illumination equalization to obtain a preprocessed picture;
training a neural network model by adopting the preprocessing picture, and obtaining a safety helmet recognition model by changing the network structure of the neural network model and adding an attention mechanism;
and acquiring a picture to be identified in a complex scene, inputting the picture to be identified into the safety helmet identification model, and identifying the wearing state of the safety helmet of the worker in the picture to be identified by adopting a TTA method.
Further, in the steps of acquiring a helmet wearing state picture under a complex scene, performing data annotation on the helmet wearing state picture to obtain an annotation picture, and preprocessing the annotation picture by adopting an illumination equalization picture processing method to obtain a preprocessed picture, the method further comprises:
and carrying out cluster analysis on the detection frame of the marked picture by using a k-means clustering method, and randomly erasing the picture region of the marked picture by using a random-serving data enhancement method.
Further, the method for processing the picture by adopting illumination equalization carries out pretreatment on the marked picture to obtain a pretreated picture, which specifically comprises the following steps:
performing brightness equalization processing on the marked picture, reading three RGB color channels of the marked picture, and converting the color channels into YUV color space;
selecting Y channel information of the YUV color space, counting Y channel values of each pixel, and calculating according to the Y channel values to obtain probability of occurrence of preset brightness;
and obtaining a brightness histogram according to the occurrence probability of each brightness, and carrying out normalization processing on the brightness histogram to obtain a preprocessed picture.
Further, the neural network is a Yolov5 model, the preprocessing picture is adopted to train the neural network model, and the safety helmet recognition model is obtained by changing the network structure of the neural network model and adding an attention mechanism, specifically:
and adding a layer of SElayer into the network structure of the neural network model, and adding a backstene fused with an attention mechanism to obtain the safety helmet recognition model.
Further, the method further comprises:
generating a voice message reminder when the safety helmet wearing state is recognized as the state of not wearing the safety helmet;
and when the wearing state of the safety helmet is identified as the worn safety helmet, classifying the worn safety helmet by adopting a machine learning method to obtain the color type of the worn safety helmet.
Further, the method for classifying the worn safety helmet by adopting the machine learning method obtains the color category of the worn safety helmet, which is specifically as follows:
detecting the position of a safety helmet in the preprocessed picture;
manufacturing color class templates of a plurality of safety helmets;
selecting the upper half part of the worn safety helmet according to the position of the safety helmet, converting the upper half part of the worn safety helmet into the YUV color space, and respectively calculating the Euclidean distance from the upper half part of the worn safety helmet to a plurality of color class templates;
and respectively comparing the Euclidean distances with a distance threshold range, and obtaining the color class of the safety helmet according to the comparison result.
Further, the comparing the euclidean distances with the distance threshold ranges respectively, and obtaining the color class of the safety helmet according to the comparison result, specifically includes:
comparing the Euclidean distances with a distance threshold range respectively, and if at least one Euclidean distance is in the threshold range, selecting a color class template corresponding to the minimum Euclidean distance in the Euclidean distances as a final calculation result to obtain the color class of the safety helmet;
and if all Euclidean distances are not in the distance threshold range, judging that the safety helmet is of other color types.
A second embodiment of the present application provides a helmet recognition device in a complex scene, including:
the data labeling module is used for acquiring a safety helmet wearing state picture in a complex scene and labeling the safety helmet wearing state picture with data to obtain a labeling picture;
the preprocessing module is used for preprocessing the marked picture by adopting a picture processing method of illumination equalization to obtain a preprocessed picture;
the model training module is used for training the neural network model by adopting the preprocessing picture, and obtaining a safety helmet recognition model by changing the network structure of the neural network model and adding an attention mechanism;
the identification module is used for acquiring pictures to be identified in a complex scene, inputting the pictures to be identified into the safety helmet identification model, and identifying the wearing state of the safety helmet of workers in the pictures to be identified by adopting a TTA method.
According to the embodiment of the application, the influence of factors such as strong light, weak light and occlusion in a complex scene on safety helmet state recognition is fully considered, and data preprocessing is performed by adopting the image processing method of illumination equalization, so that the influence of actual environmental factors on the recognition result in a complex scene can be effectively reduced and safety helmet recognition made more accurate; according to the embodiment of the application, the network structure of the YOLOv5 neural network is modified and the attention mechanism is fused, so that the spatial attention of the model is more concentrated and the accuracy of identification is improved; the reliability of the safety helmet recognition model can be improved by applying the TTA method.
Furthermore, after the worker in the complex scene is identified to wear the safety helmet, the color type of the safety helmet can be further distinguished by manufacturing different color type templates of the safety helmet and calculating the Euclidean distance between the position of the safety helmet and the color type templates, so that the management efficiency of the complex scene on the wearing state of the safety helmet is improved.
Drawings
Fig. 1 is a schematic flow chart of a method for identifying a helmet in a complex scene according to an embodiment of the present application;
FIG. 2 is a schematic diagram of a neural network model according to an embodiment of the present application;
fig. 3 is another flow chart of a method for identifying a helmet in a complex scenario according to an embodiment of the present application;
fig. 4 is a schematic structural diagram of a safety helmet recognition device under a complex scene according to an embodiment of the present application.
Detailed Description
The following description of the embodiments of the present application will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present application, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.
In the description of the present application, it should be understood that the terms "first," "second," and the like are used for descriptive purposes only and are not to be construed as indicating or implying a relative importance or number of technical features indicated. Thus, a feature defining "a first" or "a second" may explicitly or implicitly include one or more such feature. In the description of the present application, unless otherwise indicated, the meaning of "a plurality" is two or more.
In the description of the present application, it should be noted that, unless explicitly specified and limited otherwise, the terms "mounted," "connected," and "connected" are to be construed broadly, and may be either fixedly connected, detachably connected, or integrally connected, for example; can be mechanically or electrically connected; can be directly connected or indirectly connected through an intermediate medium, and can be communication between two elements. The specific meaning of the above terms in the present application will be understood in specific cases by those of ordinary skill in the art.
Referring to fig. 1-3, in a first embodiment of the present application, the first embodiment of the present application provides a method for identifying a helmet in a complex scenario shown in fig. 1, including:
s1, acquiring a safety helmet wearing state picture in a complex scene, and marking the safety helmet wearing state picture with data to obtain a marked picture;
in the embodiment of the application, a safety helmet wearing state picture in a complex scene is acquired as a safety helmet data set in step S1, wherein the safety helmet wearing state picture comprises a worn safety helmet picture and an unworn safety helmet picture. According to the method, the data are marked on the wearing state picture of the safety helmet, so that the data balance is ensured.
S2, preprocessing the marked picture by adopting a picture processing method of illumination equalization to obtain a preprocessed picture;
The illumination equalization method can effectively improve the contrast of the picture and bring out its details, resist the influence of illumination changes in a complex environment, and effectively improve the recognition accuracy of the model in strong-light or weak-light environments.
S3, training a neural network model by adopting a preprocessing picture, and obtaining a safety helmet recognition model by changing the network structure of the neural network model and adding an attention mechanism;
according to the embodiment of the application, the network structure of the neural network model is modified and the attention mechanism is fused, so that the safety helmet recognition model obtained through training is more concentrated in space, and the accuracy and the efficiency of safety helmet wearing state recognition can be improved.
S4, acquiring a picture to be identified in a complex scene, inputting the picture to be identified into a safety helmet identification model, and identifying the wearing state of the safety helmet of a worker in the picture to be identified by adopting a TTA method.
Specifically, when the picture to be identified is submitted for wearing state recognition, the safety helmet identification model applies the TTA method to randomly flip and scale the picture, and the final wearing state result is obtained by comprehensively analysing the multiple predictions, which effectively improves the reliability of safety helmet wearing state identification.
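The TTA step described above can be sketched as follows; the model interface and the use of mean-score aggregation are assumptions for illustration, since the patent publishes no code:

```python
import numpy as np

def tta_predict(model, image, scales=(0.8, 1.0, 1.2)):
    """Test-time augmentation: average predictions over flips and
    scales, as described for the recognition step. `model` is any
    callable mapping an HxWx3 array to a class-score vector
    (hypothetical interface; the patent gives no API)."""
    scores = []
    for s in scales:
        h, w = image.shape[:2]
        # nearest-neighbour rescale via index mapping (keeps the
        # sketch dependency-free; a real pipeline would interpolate)
        ys = (np.arange(int(h * s)) / s).astype(int).clip(0, h - 1)
        xs = (np.arange(int(w * s)) / s).astype(int).clip(0, w - 1)
        scaled = image[ys][:, xs]
        for view in (scaled, scaled[:, ::-1]):  # original + horizontal flip
            scores.append(model(view))
    # comprehensive analysis of the multiple results: mean score
    return np.mean(scores, axis=0)
```

Any aggregation that reconciles the augmented predictions (mean, vote, weighted fusion) fits the description; the mean is the simplest choice.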
As a specific implementation manner of the embodiment of the application, in the steps of collecting the wearing state picture of the safety helmet in the complex scene, performing data annotation on the wearing state picture of the safety helmet to obtain an annotation picture, and preprocessing the annotation picture by adopting a picture processing method of illumination equalization to obtain a preprocessed picture, the method further comprises:
and carrying out cluster analysis on the detection frame of the marked picture by using a k-means clustering method, and randomly erasing the picture region of the marked picture by using a random-serving data enhancement method.
In the embodiment of the application, the k-means clustering method is used to cluster the detection frames of the annotated pictures and obtain detection frame sizes suited to identifying the wearing state of the safety helmet, thereby improving recognition accuracy. In addition, the embodiment of the application adopts the random-erasing data augmentation method to randomly erase picture regions of the annotated pictures, which effectively improves the anti-occlusion capability of the model.
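A minimal sketch of these two data-preparation steps, assuming plain k-means over (width, height) pairs (YOLOv5's own anchor search uses an IoU-based variant) and a simple rectangular erase:

```python
import random
import numpy as np

def kmeans_anchor_boxes(wh, k=9, iters=50, seed=0):
    """Cluster labelled box (width, height) pairs with k-means to
    pick anchor sizes suited to helmet detection (illustrative
    Euclidean variant, not the patent's exact procedure)."""
    wh = np.asarray(wh, dtype=float)
    rng = np.random.default_rng(seed)
    centers = wh[rng.choice(len(wh), size=k, replace=False)]
    for _ in range(iters):
        # assign each box to its nearest centre, then recompute centres
        d = np.linalg.norm(wh[:, None] - centers[None], axis=2)
        assign = d.argmin(axis=1)
        for j in range(k):
            if np.any(assign == j):
                centers[j] = wh[assign == j].mean(axis=0)
    return centers

def random_erase(img, max_frac=0.3):
    """Random-erasing augmentation: blank a random rectangle so the
    model learns to tolerate occlusion."""
    h, w = img.shape[:2]
    eh = random.randint(1, int(h * max_frac))
    ew = random.randint(1, int(w * max_frac))
    y, x = random.randint(0, h - eh), random.randint(0, w - ew)
    out = img.copy()
    out[y:y + eh, x:x + ew] = 0
    return out
```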
As a specific implementation manner of the embodiment of the application, the method for processing the picture by adopting illumination equalization is used for preprocessing the marked picture to obtain a preprocessed picture, and the specific steps are as follows:
performing brightness equalization processing on the marked picture, reading three RGB color channels of the marked picture, and converting the color channels into YUV color space;
YUV is a color coding method with three components: "Y" represents luminance, while "U" and "V" represent chrominance, describing the color and saturation of a pixel. The embodiment of the application converts the annotated picture into the YUV color space, which facilitates equalizing its luminance information and effectively reduces the influence of the various factors in a complex environment on safety helmet state identification.
Selecting Y channel information of a YUV color space, counting Y channel values of each pixel, and calculating according to the Y channel values to obtain probability of occurrence of preset brightness;
and obtaining a brightness histogram according to the occurrence probability of each brightness, and carrying out normalization processing on the brightness histogram to obtain a preprocessed picture.
Specifically, for a discrete picture {x}, let n_i denote the number of pixels with luminance i; the probability that a pixel of luminance i occurs in the picture is then:

p_x(i) = n_i / n,  0 <= i < L

where L is the number of luminance levels in the picture (typically 256), n is the total number of pixels, and p_x(i) is in effect the image histogram for luminance value i, normalized to [0, 1].

The cumulative distribution function corresponding to p_x is defined as:

cdf_x(i) = sum_{j=0}^{i} p_x(j)

A transform of the form y = T(x) is then created so that the cumulative distribution of y is linear over the whole value range, i.e.:

cdf_y(i) = iK

where K is a constant.
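The Y-channel equalization described by the preceding formulas can be sketched as follows; the BT.601 RGB/YUV coefficients are an assumption, since the patent does not name the conversion variant:

```python
import numpy as np

def equalize_luminance(rgb, levels=256):
    """Equalize the Y (luma) channel of an RGB picture, leaving the
    chroma channels untouched, per the histogram-equalization
    formulas above."""
    rgb = rgb.astype(np.float64)
    r, g, b = rgb[..., 0], rgb[..., 1], rgb[..., 2]
    # BT.601 RGB -> YUV (assumed variant)
    y = 0.299 * r + 0.587 * g + 0.114 * b
    u = -0.14713 * r - 0.28886 * g + 0.436 * b
    v = 0.615 * r - 0.51499 * g - 0.10001 * b
    # p_x(i) = n_i / n over the quantized luminance
    yi = np.clip(y, 0, levels - 1).astype(int)
    hist = np.bincount(yi.ravel(), minlength=levels)
    cdf = hist.cumsum() / yi.size          # cdf_x(i)
    y_eq = cdf[yi] * (levels - 1)          # y = T(x): linearized cdf
    # YUV -> RGB inverse (BT.601)
    r2 = y_eq + 1.13983 * v
    g2 = y_eq - 0.39465 * u - 0.58060 * v
    b2 = y_eq + 2.03211 * u
    return np.clip(np.stack([r2, g2, b2], axis=-1), 0, 255).astype(np.uint8)
```

Operating only on Y stretches the brightness histogram while preserving hue, which is the property the preprocessing step relies on.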
as a specific implementation manner of the embodiment of the application, the neural network is a Yolov5 model, the neural network model is trained by adopting a preprocessing picture, and a safety helmet recognition model is obtained by changing the network structure of the neural network model and adding an attention mechanism, which is specifically as follows:
and adding a layer of SElayer into the network structure of the neural network model, and adding a backstage fused with an attention mechanism to obtain the safety helmet recognition model.
Specifically, a layer of SElayer is added into the network structure of the neural network model when the preprocessed picture is input, so as to weigh the importance of the different channel features. The added SElayer performs average pooling and a linear layer in turn, then learns the correlation among the different channels through a ReLU activation function and a second linear layer, so that channel-wise attention can be applied.
Referring to fig. 2, in the embodiment of the present application, the preprocessed picture sequentially passes through the Focus, CBL, CSP1_1, CBL, CSP1_3, CBL, CSP1_3, CBL and SPP modules; the SElayer is added after the last layer of the BackBone, and by processing the convolved feature map a one-dimensional vector with as many elements as there are channels is obtained as the evaluation score of each channel, and the evaluation scores are then applied to the corresponding channels respectively. After the preprocessed picture passes through the BackBone fused with the attention mechanism, the feature map is passed into the YOLOv5 Neck structure to obtain the safety helmet recognition model.
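The per-channel reweighting that the SElayer performs can be sketched as follows, following the standard Squeeze-and-Excitation block (the final sigmoid gate is part of that standard design rather than stated in the text, and the weight matrices are placeholders for learned parameters):

```python
import numpy as np

def se_layer(feat, w1, w2):
    """Squeeze-and-Excitation sketch for a CxHxW feature map, per
    the description above: global average pooling, a channel-reducing
    linear layer + ReLU, a channel-restoring linear layer, then a
    sigmoid gate rescaling each channel. w1 (C/r x C) and w2
    (C x C/r) stand in for learned weights."""
    c = feat.shape[0]
    squeeze = feat.reshape(c, -1).mean(axis=1)      # squeeze: average pool
    hidden = np.maximum(w1 @ squeeze, 0.0)          # excitation: linear + ReLU
    gate = 1.0 / (1.0 + np.exp(-(w2 @ hidden)))     # channel scores in (0, 1)
    return feat * gate[:, None, None]               # reweight each channel
```

The `gate` vector is exactly the "one-dimensional vector with as many elements as there are channels" that the description applies to the corresponding channels.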
As a specific implementation manner of the embodiment of the present application, the method further includes:
generating a voice message reminder when the safety helmet wearing state is recognized as the state of not wearing the safety helmet;
and when the wearing state of the safety helmet is identified as the worn safety helmet, classifying the worn safety helmet by adopting a machine learning method to obtain the color type of the worn safety helmet.
Illustratively, the color classification of the headgear includes red, white, yellow, and blue.
As a specific implementation manner of the embodiment of the application, the worn safety helmet is classified by adopting a machine learning method, and the color categories of the worn safety helmet are obtained, specifically:
detecting the position of a safety helmet in the preprocessed picture;
manufacturing color class templates of a plurality of safety helmets;
the embodiment of the application selects four full-white, full-red, full-yellow and full-blue pictures as color category templates
Selecting the upper half part of the worn safety helmet according to the position of the safety helmet, converting the upper half part of the worn safety helmet into a YUV color space, and respectively calculating the Euclidean distance from the upper half part of the worn safety helmet to a plurality of color class templates;
and respectively comparing the Euclidean distances with the distance threshold ranges, and obtaining the color category of the safety helmet according to the comparison result.
According to the embodiment of the application, four full-white, full-red, full-yellow and full-blue pictures are selected as the color class templates, and the color class of the safety helmet is accurately identified according to Euclidean distances between a plurality of color class templates and the upper half part of the position of the safety helmet.
As a specific implementation manner of the embodiment of the present application, comparing a plurality of euclidean distances with a distance threshold range, and obtaining a color class of a safety helmet according to a comparison result, specifically includes:
comparing the Euclidean distances with a distance threshold range respectively, and if at least one Euclidean distance is in the threshold range, selecting a color class template corresponding to the minimum Euclidean distance in the Euclidean distances as a final calculation result to obtain the color class of the safety helmet;
and if all Euclidean distances are not in the range of the distance threshold value, judging that the safety helmet is of other color types.
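The template-matching logic described above can be sketched as follows; the template values and the distance threshold are illustrative assumptions, not figures from the patent:

```python
import numpy as np

def rgb_to_yuv(rgb):
    # BT.601 RGB -> YUV (assumed conversion variant)
    r, g, b = rgb
    return (0.299 * r + 0.587 * g + 0.114 * b,
            -0.14713 * r - 0.28886 * g + 0.436 * b,
            0.615 * r - 0.51499 * g - 0.10001 * b)

# Pure-color templates in YUV, from the four full-color template pictures
TEMPLATES = {
    "red": rgb_to_yuv((255, 0, 0)),
    "white": rgb_to_yuv((255, 255, 255)),
    "yellow": rgb_to_yuv((255, 255, 0)),
    "blue": rgb_to_yuv((0, 0, 255)),
}

def classify_helmet_color(top_rgb, threshold=100.0):
    """Classify the upper half of a detected helmet by Euclidean
    distance from its mean YUV value to the color templates;
    `threshold` is an assumed value, not from the patent."""
    mean_yuv = np.array(rgb_to_yuv(top_rgb.reshape(-1, 3).mean(axis=0)))
    dists = {name: float(np.linalg.norm(mean_yuv - np.array(t)))
             for name, t in TEMPLATES.items()}
    best = min(dists, key=dists.get)
    # if no template is within the threshold range -> other color type
    return best if dists[best] <= threshold else "other"
```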
Fig. 3 is another flow chart of a method for identifying a helmet in a complex scenario according to an embodiment of the present application.
The embodiment of the application has the following beneficial effects:
according to the embodiment of the application, the influence of factors such as strong light, weak light and shielding in a complex scene on the safety helmet state recognition is fully considered, and the data preprocessing is performed by adopting the image processing method of illumination equalization, so that the influence of actual environmental factors on the recognition result in the complex scene can be effectively reduced, and the safety helmet recognition can be more accurate; according to the embodiment of the application, the network structure of the Yolov neural network is modified and the attention mechanism is fused, so that the attention of the model in space is more concentrated, and the accuracy of identification is improved; the reliability of the safety helmet recognition model can be improved by adding the TTA method.
Furthermore, after the worker in the complex scene is identified to wear the safety helmet, the color type of the safety helmet can be further distinguished by manufacturing different color type templates of the safety helmet and calculating the Euclidean distance between the position of the safety helmet and the color type templates, so that the management efficiency of the complex scene on the wearing state of the safety helmet is improved.
Referring to fig. 4, a second embodiment of the present application provides a helmet recognition device under a complex scene, including:
the data labeling module 10 is used for acquiring a wearing state picture of the safety helmet in a complex scene, and labeling the wearing state picture of the safety helmet with data to obtain a labeling picture;
in the embodiment of the application, a safety helmet wearing state picture in a complex scene is acquired as a safety helmet data set in step S1, wherein the safety helmet wearing state picture comprises a worn safety helmet picture and an unworn safety helmet picture. According to the method, the data are marked on the wearing state picture of the safety helmet, so that the data balance is ensured.
The preprocessing module 20 is configured to preprocess the labeling picture by using a photo processing method of illumination equalization to obtain a preprocessed picture;
The illumination equalization method can effectively improve the contrast of the picture and bring out its details, resist the influence of illumination changes in a complex environment, and effectively improve the recognition accuracy of the model in strong-light or weak-light environments.
The model training module 30 is configured to train the neural network model by using the preprocessed image, and obtain a safety helmet recognition model by changing a network structure of the neural network model and adding an attention mechanism;
according to the embodiment of the application, the network structure of the neural network model is modified and the attention mechanism is fused, so that the safety helmet recognition model obtained through training is more concentrated in space, and the accuracy and the efficiency of safety helmet wearing state recognition can be improved.
The recognition module 40 is configured to collect a picture to be recognized in a complex scene, input the picture to be recognized into the helmet recognition model, and recognize the wearing state of the helmet of the worker in the picture to be recognized by using a TTA method.
Specifically, when the picture to be identified is submitted for wearing state recognition, the safety helmet identification model applies the TTA method to randomly flip and scale the picture, and the final wearing state result is obtained by comprehensively analysing the multiple predictions, which effectively improves the reliability of safety helmet wearing state identification.
As a specific implementation of the embodiment of the present application, the preprocessing module 20 is further configured to:
and carrying out cluster analysis on the detection frame of the marked picture by using a k-means clustering method, and randomly erasing the picture region of the marked picture by using a random-serving data enhancement method.
In the embodiment of the application, the k-means clustering method is used to cluster the detection frames of the annotated pictures and obtain detection frame sizes suited to identifying the wearing state of the safety helmet, thereby improving recognition accuracy. In addition, the embodiment of the application adopts the random-erasing data augmentation method to randomly erase picture regions of the annotated pictures, which effectively improves the anti-occlusion capability of the model.
As a specific implementation of the embodiment of the present application, the preprocessing module 20 is further configured to:
performing brightness equalization processing on the marked picture, reading three RGB color channels of the marked picture, and converting the color channels into YUV color space;
YUV is a color coding method with three components: "Y" represents luminance, while "U" and "V" represent chrominance, describing the color and saturation of a pixel. The embodiment of the application converts the annotated picture into the YUV color space, which facilitates equalizing its luminance information and effectively reduces the influence of the various factors in a complex environment on safety helmet state identification.
Selecting Y channel information of a YUV color space, counting Y channel values of each pixel, and calculating according to the Y channel values to obtain probability of occurrence of preset brightness;
and obtaining a brightness histogram according to the occurrence probability of each brightness, and carrying out normalization processing on the brightness histogram to obtain a preprocessed picture.
Specifically, for a discrete picture {x}, let n_i denote the number of occurrences of luminance i; the probability that a pixel of the picture has luminance i is then:

p_x(i) = n_i / n, 0 ≤ i < L

where L is the number of luminance levels in the picture (typically 256), n is the total number of pixels in the picture, and p_x(i) is in effect the image histogram at pixel value i, normalized to [0, 1].

The cumulative distribution function corresponding to p_x is defined as:

cdf_x(i) = Σ_{j=0}^{i} p_x(j)

Equivalently, a transform of the form y = T(x) is constructed such that the cumulative probability function of y is linear over the whole value range; the transformation is defined by:

cdf_y(i) = iK

where K is a constant.
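Putting the formulas above together, the illumination-equalization preprocessing can be sketched in numpy as below. This is a hedged illustration rather than the patented code: the BT.601 RGB/YUV conversion coefficients are standard, but the function name and overall structure are assumptions.

```python
import numpy as np

def equalize_luminance(rgb):
    """Equalize the luminance histogram of an RGB picture by converting
    to YUV, equalizing only the Y channel, and converting back."""
    rgb = rgb.astype(np.float32)
    # BT.601 RGB -> YUV
    y = 0.299 * rgb[..., 0] + 0.587 * rgb[..., 1] + 0.114 * rgb[..., 2]
    u = -0.147 * rgb[..., 0] - 0.289 * rgb[..., 1] + 0.436 * rgb[..., 2]
    v = 0.615 * rgb[..., 0] - 0.515 * rgb[..., 1] - 0.100 * rgb[..., 2]
    L = 256
    yi = np.clip(y, 0, 255).astype(np.uint8)
    hist = np.bincount(yi.ravel(), minlength=L)   # n_i
    p = hist / hist.sum()                         # p_x(i) = n_i / n
    cdf = np.cumsum(p)                            # cdf_x(i)
    y_eq = (L - 1) * cdf[yi]                      # y = T(x), linearized cdf
    # YUV -> RGB
    r = y_eq + 1.140 * v
    g = y_eq - 0.395 * u - 0.581 * v
    b = y_eq + 2.032 * u
    return np.clip(np.stack([r, g, b], axis=-1), 0, 255).astype(np.uint8)
```

Because only the Y channel is remapped, the chrominance of the picture is preserved while strong-light and weak-light regions are pushed toward a uniform luminance distribution.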
as a specific implementation of the embodiment of the present application, the neural network is a Yolov5 model, and the model training module 30 is specifically configured to:
and adding a layer of SElayer into the network structure of the neural network model to obtain a BackBone fused with an attention mechanism, so as to obtain the safety helmet recognition model.
Specifically, when the preprocessed picture is input, a layer of SElayer is added to the network structure of the neural network model so as to attend to the importance of different channel features. The added SElayer performs average pooling and a linear layer in sequence, and then learns the correlation among different channels through a ReLU activation function and a second linear layer, so that channel attention can be screened.
Referring to fig. 2, in the embodiment of the present application, the preprocessed picture passes in sequence through the Focus, CBL, CSP1_1, CBL, CSP1_3, CBL, CSP1_3, CBL, and SPP modules; the SElayer is added to the last layer of the BackBone. By processing the convolved feature map, a one-dimensional vector with as many elements as there are channels is obtained as an evaluation score for each channel, and the evaluation scores are then applied to the corresponding channels respectively. After the preprocessed picture passes through the BackBone fused with the attention mechanism, the feature map is passed into the YOLOv5-Neck structure to obtain the safety helmet recognition model.
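The SElayer described above (squeeze via global average pooling, excitation via two small linear layers with ReLU and sigmoid, then channel-wise rescaling) can be illustrated with a plain numpy sketch. The weight matrices `w1`, `w2` and the channel-reduction ratio are hypothetical; in the actual model this would be a trainable module inside the YOLOv5 BackBone.

```python
import numpy as np

def se_layer(feature_map, w1, w2):
    """Squeeze-and-Excitation over a (C, H, W) feature map:
    global average pooling (squeeze), a reducing linear layer + ReLU,
    an expanding linear layer + sigmoid (excitation), and finally
    per-channel rescaling by the resulting evaluation scores."""
    s = feature_map.mean(axis=(1, 2))           # squeeze: (C,)
    z = np.maximum(w1 @ s, 0.0)                 # FC + ReLU: (C/r,)
    scale = 1.0 / (1.0 + np.exp(-(w2 @ z)))     # FC + sigmoid: (C,)
    return feature_map * scale[:, None, None]   # reweight each channel
```

The one-dimensional vector `scale` plays the role of the per-channel evaluation scores mentioned above: channels judged informative are amplified and the rest are suppressed before the feature map enters the Neck.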
As a specific implementation of the embodiment of the present application, the identification module 40 is further configured to:
generating a voice message reminder when the safety helmet wearing state is recognized as the state of not wearing the safety helmet;
and when the wearing state of the safety helmet is identified as the worn safety helmet, classifying the worn safety helmet by adopting a machine learning method to obtain the color type of the worn safety helmet.
Illustratively, the color classes of the safety helmet include red, white, yellow and blue.
As a specific implementation manner of the embodiment of the application, the worn safety helmet is classified by adopting a machine learning method to obtain the color class of the worn safety helmet, specifically:
detecting the position of a safety helmet in the preprocessed picture;
manufacturing color class templates of a plurality of safety helmets;
the embodiment of the application selects four full-white, full-red, full-yellow and full-blue pictures as color category templates
Selecting the upper half part of the worn safety helmet according to the position of the safety helmet, converting the upper half part of the worn safety helmet into a YUV color space, and respectively calculating the Euclidean distance from the upper half part of the worn safety helmet to a plurality of color class templates;
and respectively comparing the Euclidean distances with the distance threshold ranges, and obtaining the color category of the safety helmet according to the comparison result.
According to the embodiment of the application, four full-white, full-red, full-yellow and full-blue pictures are selected as the color class templates, and the color class of the safety helmet is accurately identified according to Euclidean distances between a plurality of color class templates and the upper half part of the position of the safety helmet.
As a specific implementation manner of the embodiment of the present application, comparing a plurality of euclidean distances with a distance threshold range, and obtaining a color class of a safety helmet according to a comparison result, specifically includes:
comparing the Euclidean distances with a distance threshold range respectively, and if at least one Euclidean distance is in the threshold range, selecting a color class template corresponding to the minimum Euclidean distance in the Euclidean distances as a final calculation result to obtain the color class of the safety helmet;
and if all Euclidean distances are not in the range of the distance threshold value, judging that the safety helmet is of other color types.
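A minimal sketch of this template-matching color classifier is given below, assuming a BT.601 RGB-to-YUV conversion. The template set mirrors the four solid-color pictures described above, while the threshold value of 120 and the function names are illustrative assumptions, not values from the patent.

```python
import numpy as np

def rgb_to_yuv_mean(rgb_patch):
    """Mean YUV vector of an RGB patch (BT.601 coefficients)."""
    m = np.array([[0.299, 0.587, 0.114],
                  [-0.147, -0.289, 0.436],
                  [0.615, -0.515, -0.100]])
    return rgb_patch.reshape(-1, 3).astype(np.float32).mean(axis=0) @ m.T

# Solid-color template pictures, as described in the embodiment
TEMPLATE_RGB = {"white": (255, 255, 255), "red": (255, 0, 0),
                "yellow": (255, 255, 0), "blue": (0, 0, 255)}
TEMPLATES = {name: rgb_to_yuv_mean(np.array(c, np.uint8).reshape(1, 1, 3))
             for name, c in TEMPLATE_RGB.items()}

def classify_helmet_color(top_patch, threshold=120.0):
    """Compare the upper half of the detected helmet to each template in
    YUV space.  If at least one Euclidean distance is within the
    threshold, return the class of the nearest template; otherwise
    report "other".  The threshold is an illustrative assumption."""
    feat = rgb_to_yuv_mean(top_patch)
    dists = {name: float(np.linalg.norm(feat - t))
             for name, t in TEMPLATES.items()}
    best = min(dists, key=dists.get)
    label = best if dists[best] <= threshold else "other"
    return label, dists
```

For example, a patch that is pure red is nearest the full-red template at distance 0, so it is classified as "red"; a patch far from all four templates falls through to "other".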
Fig. 3 is another flow chart of a method for identifying a helmet in a complex scenario according to an embodiment of the present application.
The embodiment of the application has the following beneficial effects:
According to the embodiment of the application, the influence of factors such as strong light, weak light and occlusion in a complex scene on safety helmet state recognition is fully considered, and data preprocessing is performed by adopting an image processing method of illumination equalization, so that the influence of actual environmental factors on the recognition result in a complex scene can be effectively reduced and the safety helmet can be recognized more accurately; by modifying the network structure of the Yolov5 neural network and fusing an attention mechanism, the spatial attention of the model is made more concentrated, improving the accuracy of identification; and adding the TTA method can improve the reliability of the safety helmet recognition model.
Furthermore, after the worker in the complex scene is identified to wear the safety helmet, the color type of the safety helmet can be further distinguished by manufacturing different color type templates of the safety helmet and calculating the Euclidean distance between the position of the safety helmet and the color type templates, so that the management efficiency of the complex scene on the wearing state of the safety helmet is improved.
The foregoing is a preferred embodiment of the present application. It should be noted that those skilled in the art may make modifications and adaptations without departing from the principles of the present application, and such modifications and adaptations are also to be regarded as falling within the scope of protection of the present application.

Claims (4)

1. The safety helmet identification method under the complex scene is characterized by comprising the following steps of:
acquiring a safety helmet wearing state picture in a complex scene, and performing data annotation on the safety helmet wearing state picture to obtain an annotation picture;
randomly erasing the picture area of the marked picture by using a random-erasing data enhancement method;
preprocessing the marked picture by adopting a picture processing method of illumination equalization to obtain a preprocessed picture;
training a neural network model by adopting the preprocessing picture, and obtaining a safety helmet recognition model by changing the network structure of the neural network model and adding an attention mechanism;
acquiring a picture to be identified in a complex scene, inputting the picture to be identified into the safety helmet identification model, and identifying the wearing state of the safety helmet of a worker in the picture to be identified by adopting a TTA method;
wherein the preprocessing of the marked picture by adopting the picture processing method of illumination equalization to obtain a preprocessed picture specifically comprises:
performing brightness equalization processing on the marked picture, reading three RGB color channels of the marked picture, and converting the color channels into YUV color space;
selecting Y channel information of the YUV color space, counting Y channel values of each pixel, and calculating according to the Y channel values to obtain probability of occurrence of preset brightness;
obtaining a brightness histogram according to the occurrence probability of each brightness, and carrying out normalization processing on the brightness histogram to obtain a preprocessed picture;
the neural network is a Yolov5 model;
generating a voice message reminder when the safety helmet wearing state is recognized as the state of not wearing the safety helmet;
when the wearing state of the safety helmet is identified as the worn safety helmet, classifying the worn safety helmet by adopting a machine learning method to obtain the color class of the worn safety helmet;
wherein the classifying of the worn safety helmet by adopting the machine learning method to obtain the color class of the worn safety helmet specifically comprises:
detecting the position of a safety helmet in the preprocessed picture;
manufacturing color class templates of a plurality of safety helmets;
selecting the upper half part of the worn safety helmet according to the position of the safety helmet, converting the upper half part of the worn safety helmet into the YUV color space, and respectively calculating the Euclidean distance from the upper half part of the worn safety helmet to a plurality of color class templates;
comparing the Euclidean distances with a distance threshold range respectively, and obtaining the color class of the safety helmet according to the comparison result;
wherein the comparing of the plurality of Euclidean distances with the distance threshold range respectively and obtaining the color class of the safety helmet according to the comparison result specifically comprises:
comparing the Euclidean distances with a distance threshold range respectively, and if at least one Euclidean distance is in the threshold range, selecting a color class template corresponding to the minimum Euclidean distance in the Euclidean distances as a final calculation result to obtain the color class of the safety helmet;
and if all Euclidean distances are not in the distance threshold range, judging that the safety helmet is of other color types.
2. The method for identifying a safety helmet in a complex scene according to claim 1, wherein, between "acquiring a safety helmet wearing state picture in the complex scene, and performing data annotation on the safety helmet wearing state picture to obtain an annotation picture" and "preprocessing the annotation picture by adopting a picture processing method of illumination equalization to obtain a preprocessed picture", the method further comprises:
and carrying out cluster analysis on the detection frame of the marked picture by using a k-means clustering method.
3. The method for identifying the safety helmet in the complex scene according to claim 1, wherein the training of the neural network model by using the preprocessed image is performed by changing a network structure of the neural network model and adding an attention mechanism, so as to obtain the safety helmet identification model, which is specifically:
and adding a layer of SElayer into the network structure of the neural network model to obtain a BackBone fused with an attention mechanism, so as to obtain the safety helmet recognition model.
4. A helmet recognition device in a complex scene, comprising:
the data labeling module is used for acquiring a safety helmet wearing state picture in a complex scene and labeling the safety helmet wearing state picture with data to obtain a labeling picture;
the preprocessing module is used for preprocessing the marked picture by adopting a picture processing method of illumination equalization to obtain a preprocessed picture; the random-erasing data enhancement method is also used for randomly erasing the picture area of the marked picture;
the model training module is used for training the neural network model by adopting the preprocessing picture, and obtaining a safety helmet recognition model by changing the network structure of the neural network model and adding an attention mechanism;
the identification module is used for acquiring pictures to be identified in a complex scene, inputting the pictures to be identified into the safety helmet identification model, and identifying the wearing state of the safety helmet of workers in the pictures to be identified by adopting a TTA method; the voice message reminding device is also used for generating a voice message reminding when the fact that the safety helmet is not worn in the wearing state is recognized; when the wearing state of the safety helmet is identified as the worn safety helmet, classifying the worn safety helmet by adopting a machine learning method to obtain the color class of the worn safety helmet;
wherein the preprocessing of the marked picture by adopting the picture processing method of illumination equalization to obtain a preprocessed picture specifically comprises:
performing brightness equalization processing on the marked picture, reading three RGB color channels of the marked picture, and converting the color channels into YUV color space;
selecting Y channel information of the YUV color space, counting Y channel values of each pixel, and calculating according to the Y channel values to obtain probability of occurrence of preset brightness;
obtaining a brightness histogram according to the occurrence probability of each brightness, and carrying out normalization processing on the brightness histogram to obtain a preprocessed picture;
the neural network is a Yolov5 model;
wherein the classifying of the worn safety helmet by adopting the machine learning method to obtain the color class of the worn safety helmet specifically comprises:
detecting the position of a safety helmet in the preprocessed picture;
manufacturing color class templates of a plurality of safety helmets;
selecting the upper half part of the worn safety helmet according to the position of the safety helmet, converting the upper half part of the worn safety helmet into the YUV color space, and respectively calculating the Euclidean distance from the upper half part of the worn safety helmet to a plurality of color class templates;
comparing the Euclidean distances with a distance threshold range respectively, and obtaining the color class of the safety helmet according to the comparison result;
wherein the comparing of the plurality of Euclidean distances with the distance threshold range respectively and obtaining the color class of the safety helmet according to the comparison result specifically comprises:
comparing the Euclidean distances with a distance threshold range respectively, and if at least one Euclidean distance is in the threshold range, selecting a color class template corresponding to the minimum Euclidean distance in the Euclidean distances as a final calculation result to obtain the color class of the safety helmet;
and if all Euclidean distances are not in the distance threshold range, judging that the safety helmet is of other color types.
CN202110579308.6A 2021-05-26 2021-05-26 Safety helmet identification method and device under complex scene Active CN113408365B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110579308.6A CN113408365B (en) 2021-05-26 2021-05-26 Safety helmet identification method and device under complex scene

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110579308.6A CN113408365B (en) 2021-05-26 2021-05-26 Safety helmet identification method and device under complex scene

Publications (2)

Publication Number Publication Date
CN113408365A CN113408365A (en) 2021-09-17
CN113408365B true CN113408365B (en) 2023-09-08

Family

ID=77675373

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110579308.6A Active CN113408365B (en) 2021-05-26 2021-05-26 Safety helmet identification method and device under complex scene

Country Status (1)

Country Link
CN (1) CN113408365B (en)

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103325114A (en) * 2013-06-13 2013-09-25 同济大学 Target vehicle matching method based on improved vision attention model
KR101715001B1 (en) * 2015-12-30 2017-03-23 연세대학교 산학협력단 Display system for safety evaluation in construction sites using of wearable device, and thereof method
CN109766869A (en) * 2019-01-23 2019-05-17 中国建筑第八工程局有限公司 A kind of artificial intelligent detecting method of safety cap wearing based on machine vision
CN109886325A (en) * 2019-02-01 2019-06-14 辽宁工程技术大学 A kind of stencil-chosen and acceleration matching process of non linear color space classification
CN110070033A (en) * 2019-04-19 2019-07-30 山东大学 Safety cap wearing state detection method in a kind of power domain dangerous work region
CN110188724A (en) * 2019-06-05 2019-08-30 中冶赛迪重庆信息技术有限公司 The method and system of safety cap positioning and color identification based on deep learning
WO2020019673A1 (en) * 2018-07-25 2020-01-30 深圳云天励飞技术有限公司 Construction site monitoring method and device based on image analysis, and readable storage medium
CN111680682A (en) * 2020-06-12 2020-09-18 哈尔滨理工大学 Method for identifying safety helmet in complex scene
CN112184773A (en) * 2020-09-30 2021-01-05 华中科技大学 Helmet wearing detection method and system based on deep learning
CN112232307A (en) * 2020-11-20 2021-01-15 四川轻化工大学 Method for detecting wearing of safety helmet in night vision environment
CN112381005A (en) * 2020-11-17 2021-02-19 温州大学 Safety helmet detection system for complex scene
CN112560741A (en) * 2020-12-23 2021-03-26 中国石油大学(华东) Safety wearing detection method based on human body key points

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170132821A1 (en) * 2015-11-06 2017-05-11 Microsoft Technology Licensing, Llc Caption generation for visual media

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103325114A (en) * 2013-06-13 2013-09-25 同济大学 Target vehicle matching method based on improved vision attention model
KR101715001B1 (en) * 2015-12-30 2017-03-23 연세대학교 산학협력단 Display system for safety evaluation in construction sites using of wearable device, and thereof method
WO2020019673A1 (en) * 2018-07-25 2020-01-30 深圳云天励飞技术有限公司 Construction site monitoring method and device based on image analysis, and readable storage medium
CN109766869A (en) * 2019-01-23 2019-05-17 中国建筑第八工程局有限公司 A kind of artificial intelligent detecting method of safety cap wearing based on machine vision
CN109886325A (en) * 2019-02-01 2019-06-14 辽宁工程技术大学 A kind of stencil-chosen and acceleration matching process of non linear color space classification
CN110070033A (en) * 2019-04-19 2019-07-30 山东大学 Safety cap wearing state detection method in a kind of power domain dangerous work region
CN110188724A (en) * 2019-06-05 2019-08-30 中冶赛迪重庆信息技术有限公司 The method and system of safety cap positioning and color identification based on deep learning
CN111680682A (en) * 2020-06-12 2020-09-18 哈尔滨理工大学 Method for identifying safety helmet in complex scene
CN112184773A (en) * 2020-09-30 2021-01-05 华中科技大学 Helmet wearing detection method and system based on deep learning
CN112381005A (en) * 2020-11-17 2021-02-19 温州大学 Safety helmet detection system for complex scene
CN112232307A (en) * 2020-11-20 2021-01-15 四川轻化工大学 Method for detecting wearing of safety helmet in night vision environment
CN112560741A (en) * 2020-12-23 2021-03-26 中国石油大学(华东) Safety wearing detection method based on human body key points

Also Published As

Publication number Publication date
CN113408365A (en) 2021-09-17

Similar Documents

Publication Publication Date Title
CN111126325B (en) Intelligent personnel security identification statistical method based on video
CN104063722B (en) A kind of detection of fusion HOG human body targets and the safety cap recognition methods of SVM classifier
CN108009473A (en) Based on goal behavior attribute video structural processing method, system and storage device
CN111881730A (en) Wearing detection method for on-site safety helmet of thermal power plant
CN112149761B (en) Electric power intelligent construction site violation detection method based on YOLOv4 improved algorithm
CN110378324B (en) Quality dimension-based face recognition algorithm evaluation method
CN106384117B (en) A kind of vehicle color identification method and device
CN109740572A (en) A kind of human face in-vivo detection method based on partial color textural characteristics
CN108108760A (en) A kind of fast human face recognition
CN111522951A (en) Sensitive data identification and classification technical method based on image identification
CN107729940A (en) A kind of user bill big data base station connection information customer relationship estimates method
CN115035088A (en) Helmet wearing detection method based on yolov5 and posture estimation
CN111260645A (en) Method and system for detecting tampered image based on block classification deep learning
CN113221667B (en) Deep learning-based face mask attribute classification method and system
CN115690234A (en) Novel optical fiber color line sequence detection method and system
CN108510483B (en) Method for generating color image tampering detection by adopting VLAD coding and SVM calculation
CN113408365B (en) Safety helmet identification method and device under complex scene
CN113297913A (en) Method for identifying dressing specification of distribution network field operating personnel
CN115273150A (en) Novel identification method and system for wearing safety helmet based on human body posture estimation
CN109766860A (en) Method for detecting human face based on improved Adaboost algorithm
CN113450369B (en) Classroom analysis system and method based on face recognition technology
CN113076916B (en) Dynamic facial expression recognition method and system based on geometric feature weighted fusion
CN114972888A (en) Communication maintenance tool identification method based on YOLO V5
CN111414825B (en) Method for detecting wearing of safety helmet
CN113642473A (en) Mining coal machine state identification method based on computer vision

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant