CN116912770A - Public place smoking detection method based on improved YOLOv8 - Google Patents
- Publication number
- CN116912770A (application number CN202310848825.8A)
- Authority
- CN
- China
- Prior art keywords
- yolov8
- smoking
- improved
- model
- convolution
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/50—Context or environment of the image
- G06V20/52—Surveillance or monitoring of activities, e.g. for recognising suspicious objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/082—Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/25—Determination of region of interest [ROI] or a volume of interest [VOI]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/774—Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/80—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
- G06V10/806—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/44—Event detection
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/46—Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P90/00—Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
- Y02P90/30—Computing systems specially adapted for manufacturing
Abstract
The invention discloses a public place smoking detection method based on an improved YOLOv8, used to detect smoking behaviors in mall surveillance video, where smoking behaviors include holding a cigarette in the mouth and exhaling its smoke, and holding a lit cigarette in the hand; the improved YOLOv8 model addresses problems such as a high false detection rate and low accuracy. The method comprises the following steps: S1, acquire smoking images to form a data set, label the images with LabelImg, and divide the data set; S2, adopt the YOLOv8 model and add a small-target detection layer; S3, improve the YOLOv8 backbone network by replacing Darknet53 with the lightweight network MobileNetV3; S4, introduce the CBAM attention mechanism into the Neck of the YOLOv8 model; S5, optimize the loss function by replacing CIoU with EIoU; S6, input the images of a preset test set into the trained improved YOLOv8 model and detect each image to obtain the target detection results. By improving the YOLOv8 model, the method improves accuracy, detection speed, and precision.
Description
Technical Field
The invention relates to a smoking detection method, in particular to a public place smoking detection method based on an improved YOLOv8, and belongs to the technical field of computer vision.
Background
Target detection is an important application direction in computer vision. It detects semantic objects of specific categories in images and videos, and is widely applied in face recognition, intelligent transportation, medical diagnosis, security monitoring, and other fields. Smoking detection is a key research topic in security monitoring; its goal is to quickly and accurately identify and locate smoking, record the detection results, and help issue timely smoking alarms. Experience shows that the harm caused by smoking in public places is considerable: it not only affects the health of others but can also cause fires, air pollution, and other problems, making it an urgent public health issue, so research on smoking detection methods has great practical significance. Because smoking detection is a small-target detection task, cigarettes occupy only a small proportion of the image, their color is hard to distinguish from the surrounding environment, and they are difficult to capture, leading to high missed-detection and false-detection rates.
Traditional smoking detection methods include smoke-sensor detection based on smoke concentration, biological detection based on sampled indicators, and detection based on visual monitoring systems; these methods suffer from low accuracy and long detection times.
Disclosure of Invention
To solve these problems, the invention provides a public place smoking detection method based on an improved YOLOv8. By applying and improving the YOLOv8 algorithm, which performs object identification and localization with a deep neural network, the method addresses the high false detection rate and low accuracy of smoking-behavior detection in mall surveillance video.
The technical solution of the invention is realized as follows:
a public place smoking detection method based on improved YOLOv8 comprises the following steps:
s1, acquiring a smoking image to form a data set, and dividing the data set by using Labellmg label images, wherein the smoking image refers to an image for detecting and labeling smoking behaviors in a monitoring environment by a monitoring video, and the smoking behaviors comprise holding cigarettes on the mouth and exhaling the smoke of the cigarettes, and holding the lighted cigarettes in the hands;
s2, adopting a YOLOv8 model, adding a small target detection layer, continuing the operations of a C2f module, up-sampling and the like on the original Neck module, and carrying out feature fusion convolution on the obtained feature map and a first layer feature map of a backbone network to obtain a feature map with the size of 160 x 160; and carrying out feature fusion convolution on the obtained feature map and a second layer feature map of the backbone network so as to obtain 80 x 80 feature maps. Because the size of the cigarette is too small, the sampling multiple of the YOLOv8 is large, the characteristic information of the cigarette is difficult to identify by the original characteristic image, and the improved YOLOv8 is added with a small target detection layer with the size of 160 x 160, so that the problem that the cigarette is difficult to detect is solved, the false detection rate is reduced, and the detection effect is improved;
s3, improving a YOLOv8 backbone network, replacing a Darknet53 network with a lightweight network MobileNet V3, specifically replacing C2f and a part Cnov in the YOLOv8 with Bneck, and adopting a depth separable convolution to reduce the calculated amount by a lightweight YOLOv8 model; the inverse residual structure is adopted, the original 1×1 convolution dimension reduction, 3×3 convolution and 1×1 convolution dimension increase are changed into 1×1 convolution dimension increase, 3×3 convolution and 1×1 convolution dimension reduction are changed into a structure with wider middle and narrower two ends, the channel number is improved, and the calculated amount is reduced; the attention mechanism SE is introduced, a large weight is given to important channels, a small weight is given to unimportant channels, and the task processing efficiency and accuracy are improved; the calculated amount of the lightweight YOLOv8 model is smaller, and the accuracy is higher;
s4, introducing an attention mechanism CBAM into a Neck of the YOLOv8 model, calculating attention force diagrams from two different dimensions of a channel and a space in sequence by a CBAM module, after introducing the attention mechanism CBAM into the Neck, focusing on cigarette information, ignoring other useless information, covering the characteristics to more positions of the cigarettes, and improving the accuracy;
s5, optimizing a Loss function, replacing the CIoU with the Loss function of the EIoU, wherein certain ambiguity exists in the aspect ratio of the CIoU, the EIoU respectively calculates a wide difference value and a high difference value to replace the aspect ratio on the basis of the CIoU, and meanwhile, focal-Loss is introduced to solve the problem of sample unbalance;
s6, inputting the images into a trained improved YOLOv8 model based on a preset test set, and detecting each image to obtain a target detection result.
Compared with the prior art, the invention has the following characteristics:
1. a small-target detection layer is added, addressing the high false detection rate caused by the small size of cigarettes;
2. the YOLOv8 backbone network is improved: the lightweight network MobileNetV3 replaces Darknet53, reducing parameters and computation;
3. the CBAM attention mechanism is introduced, assigning greater weight to cigarette-related information and improving accuracy;
4. smoking behavior in surveillance video is monitored in real time;
5. the loss function is improved: EIoU replaces CIoU, and the sample-imbalance problem is addressed.
Drawings
The invention is illustrated in the following two figures.
FIG. 1 is a diagram of the overall network architecture of YOLOv8 of the present invention;
fig. 2 is a diagram of the overall network architecture of the improved YOLOv8 of the present invention.
Detailed Description
As shown in figs. 1 and 2, the public place smoking detection method based on the improved YOLOv8 comprises the following steps:
s1, acquiring a smoking image to form a data set, and dividing the data set by using Labellmg label images, wherein the smoking image refers to an image for detecting and labeling smoking behaviors in a monitoring environment by a monitoring video, and the smoking behaviors comprise holding cigarettes on the mouth and exhaling the smoke of the cigarettes, and holding the lighted cigarettes in the hands;
s2, adopting a YOLOv8 model, adding a small target detection layer, continuing the operations of a C2f module, up-sampling and the like on the original Neck module, and carrying out feature fusion convolution on the obtained feature map and a first layer feature map of a backbone network to obtain a feature map with the size of 160 x 160; and carrying out feature fusion convolution on the obtained feature map and a second layer feature map of the backbone network so as to obtain 80 x 80 feature maps. Because the size of the cigarette is too small, the sampling multiple of the YOLOv8 is large, the characteristic information of the cigarette is difficult to identify by the original characteristic image, and the improved YOLOv8 is added with a small target detection layer with the size of 160 x 160, so that the problem that the cigarette is difficult to detect is solved, the false detection rate is reduced, and the detection effect is improved;
s3, improving a YOLOv8 backbone network, replacing a Darknet53 network with a lightweight network MobileNet V3, specifically replacing C2f and a part Cnov in the YOLOv8 with Bneck, and adopting a depth separable convolution to reduce the calculated amount by a lightweight YOLOv8 model; the inverse residual structure is adopted, the original 1×1 convolution dimension reduction, 3×3 convolution and 1×1 convolution dimension increase are changed into 1×1 convolution dimension increase, 3×3 convolution and 1×1 convolution dimension reduction are changed into a structure with wider middle and narrower two ends, the channel number is improved, and the calculated amount is reduced; the attention mechanism SE is introduced, a large weight is given to important channels, a small weight is given to unimportant channels, and the task processing efficiency and accuracy are improved; the calculated amount of the lightweight YOLOv8 model is smaller, and the accuracy is higher;
s4, introducing an attention mechanism CBAM into a Neck of the YOLOv8 model, calculating attention force diagrams from two different dimensions of a channel and a space in sequence by a CBAM module, after introducing the attention mechanism CBAM into the Neck, focusing on cigarette information, ignoring other useless information, covering the characteristics to more positions of the cigarettes, and improving the accuracy;
s5, optimizing a Loss function, replacing the CIoU with the Loss function of the EIoU, wherein certain ambiguity exists in the aspect ratio of the CIoU, the EIoU respectively calculates a wide difference value and a high difference value to replace the aspect ratio on the basis of the CIoU, and meanwhile, focal-Loss is introduced to solve the problem of sample unbalance;
s6, inputting the images into a trained improved YOLOv8 model based on a preset test set, and detecting each image to obtain a target detection result.
Claims (1)
1. The public place smoking detection method based on the improved YOLOv8 is characterized by comprising the following steps of:
s1, acquiring a smoking image to form a data set, and dividing the data set by using Labellmg label images, wherein the smoking image refers to an image for detecting and labeling smoking behaviors in a monitoring environment by a monitoring video, and the smoking behaviors comprise holding cigarettes on the mouth and exhaling the smoke of the cigarettes, and holding the lighted cigarettes in the hands;
s2, adopting a YOLOv8 model, adding a small target detection layer, continuing the operations of a C2f module, up-sampling and the like on the original Neck module, and carrying out feature fusion convolution on the obtained feature map and a first layer feature map of a backbone network to obtain a feature map with the size of 160 x 160; and carrying out feature fusion convolution on the obtained feature map and a second layer feature map of the backbone network so as to obtain 80 x 80 feature maps. Because the size of the cigarette is too small, the sampling multiple of the YOLOv8 is large, the characteristic information of the cigarette is difficult to identify by the original characteristic image, and the improved YOLOv8 is added with a small target detection layer with the size of 160 x 160, so that the problem that the cigarette is difficult to detect is solved, the false detection rate is reduced, and the detection effect is improved;
s3, improving a YOLOv8 backbone network, replacing a Darknet53 network with a lightweight network MobileNet V3, specifically replacing C2f and a part Cnov in the YOLOv8 with Bneck, and adopting a depth separable convolution to reduce the calculated amount by a lightweight YOLOv8 model; the inverse residual structure is adopted, the original 1×1 convolution dimension reduction, 3×3 convolution and 1×1 convolution dimension increase are changed into 1×1 convolution dimension increase, 3×3 convolution and 1×1 convolution dimension reduction are changed into a structure with wider middle and narrower two ends, the channel number is improved, and the calculated amount is reduced; the attention mechanism SE is introduced, a large weight is given to important channels, a small weight is given to unimportant channels, and the task processing efficiency and accuracy are improved; the calculated amount of the lightweight YOLOv8 model is smaller, and the accuracy is higher;
s4, introducing an attention mechanism CBAM into a Neck of the YOLOv8 model, calculating attention force diagrams from two different dimensions of a channel and a space in sequence by a CBAM module, after introducing the attention mechanism CBAM into the Neck, focusing on cigarette information, ignoring other useless information, covering the characteristics to more positions of the cigarettes, and improving the accuracy;
s5, optimizing a Loss function, replacing the CIoU with the Loss function of the EIoU, wherein certain ambiguity exists in the aspect ratio of the CIoU, the EIoU respectively calculates a wide difference value and a high difference value to replace the aspect ratio on the basis of the CIoU, and meanwhile, focal-Loss is introduced to solve the problem of sample unbalance;
s6, inputting the images into a trained improved YOLOv8 model based on a preset test set, and detecting each image to obtain a target detection result.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310848825.8A CN116912770A (en) | 2023-07-10 | 2023-07-10 | Public place smoking detection method based on improved YOLOv8 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310848825.8A CN116912770A (en) | 2023-07-10 | 2023-07-10 | Public place smoking detection method based on improved YOLOv8 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116912770A true CN116912770A (en) | 2023-10-20 |
Family
ID=88366048
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310848825.8A Pending CN116912770A (en) | 2023-07-10 | 2023-07-10 | Public place smoking detection method based on improved YOLOv8 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116912770A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117237741A (en) * | 2023-11-08 | 2023-12-15 | 烟台持久钟表有限公司 | Campus dangerous behavior detection method, system, device and storage medium |
CN117237741B (en) * | 2023-11-08 | 2024-02-13 | 烟台持久钟表有限公司 | Campus dangerous behavior detection method, system, device and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111400547B (en) | Human-computer cooperation video anomaly detection method | |
CN110147763B (en) | Video semantic segmentation method based on convolutional neural network | |
CN111726586A (en) | Production system operation standard monitoring and reminding system | |
CN107729363B (en) | Bird population identification analysis method based on GoogLeNet network model | |
CN111814661A (en) | Human behavior identification method based on residual error-recurrent neural network | |
CN109034092A (en) | Accident detection method for monitoring system | |
CN116912770A (en) | Public place smoking detection method based on improved YOLOv8 | |
CN108229407A (en) | A kind of behavioral value method and system in video analysis | |
CN110133049A (en) | Tea grades fast non-destructive detection method based on electronic nose and machine vision | |
CN111144321B (en) | Concentration detection method, device, equipment and storage medium | |
CN112132009A (en) | Classroom behavior analysis method and system and electronic equipment | |
CN111145222A (en) | Fire detection method combining smoke movement trend and textural features | |
CN113963399A (en) | Personnel trajectory retrieval method and device based on multi-algorithm fusion application | |
CN116206112A (en) | Remote sensing image semantic segmentation method based on multi-scale feature fusion and SAM | |
CN109117774A (en) | A kind of multi-angle video method for detecting abnormality based on sparse coding | |
CN114241423A (en) | Intelligent detection method and system for river floaters | |
CN110991341A (en) | Method and device for detecting face image | |
CN111191498A (en) | Behavior recognition method and related product | |
CN112712008A (en) | Water environment early warning judgment method based on 3D convolutional neural network | |
CN115083229B (en) | Intelligent recognition and warning system of flight training equipment based on AI visual recognition | |
CN115953832A (en) | Semantic decoupling-based combined action recognition method of self-attention model | |
CN114550032A (en) | Video smoke detection method of end-to-end three-dimensional convolution target detection network | |
CN114005054A (en) | AI intelligence system of grading | |
CN113327236A (en) | Identification method and system of novel coronavirus antibody rapid detection reagent | |
CN116665016B (en) | Single-frame infrared dim target detection method based on improved YOLOv5 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||