CN115270839A - Industrial scene QR Code detection and identification method based on PPYOLOv2 model - Google Patents

Industrial scene QR Code detection and identification method based on PPYOLOv2 model

Info

Publication number
CN115270839A
Authority
CN
China
Prior art keywords
image
code
dimensional code
transformation
detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210883094.6A
Other languages
Chinese (zh)
Inventor
苗庆伟
陈骋
贺文强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Henan Alson Intelligent Technology Co ltd
Original Assignee
Henan Alson Intelligent Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Henan Alson Intelligent Technology Co ltd filed Critical Henan Alson Intelligent Technology Co ltd
Priority to CN202210883094.6A priority Critical patent/CN115270839A/en
Publication of CN115270839A publication Critical patent/CN115270839A/en
Pending legal-status Critical Current

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06K GRAPHICAL DATA READING; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K7/00 Methods or arrangements for sensing record carriers, e.g. for reading patterns
    • G06K7/10 Methods or arrangements for sensing record carriers, e.g. for reading patterns by electromagnetic radiation, e.g. optical sensing; by corpuscular radiation
    • G06K7/14 Methods or arrangements for sensing record carriers, e.g. for reading patterns by electromagnetic radiation, e.g. optical sensing; by corpuscular radiation using light without selection of wavelength, e.g. sensing reflected white light
    • G06K7/1404 Methods for optical code recognition
    • G06K7/1408 Methods for optical code recognition the method being specifically adapted for the type of code
    • G06K7/1417 2D bar codes
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/10 Segmentation; Edge detection
    • G06T7/13 Edge detection
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/10 Segmentation; Edge detection
    • G06T7/136 Segmentation; Edge detection involving thresholding
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/20 Image preprocessing
    • G06V10/25 Determination of region of interest [ROI] or a volume of interest [VOI]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/20 Image preprocessing
    • G06V10/26 Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20081 Training; Learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20084 Artificial neural networks [ANN]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Electromagnetism (AREA)
  • Data Mining & Analysis (AREA)
  • Toxicology (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to the technical field of intelligent manufacturing, and in particular to an industrial-scene QR Code detection and identification method based on the PPYOLOv2 model. It addresses the problems that current defects are of many types, irregular in shape, position and area, and extremely fine, that traditional algorithms have low accuracy, and that the required detection speed for a single picture is high, and proposes the following scheme. The detection method comprises the following steps. S1: the three rectangles located at the upper left, lower left and upper right are position-detection patterns; the position of the first encoded character to the right of the upper-left rectangle is located precisely and the position of the two-dimensional code is detected; the image is input, and the minimum quadrilateral enclosing the two-dimensional code is returned together with the function call status. The invention aims to overcome the influence of poor industrial conditions such as insufficient lighting, dust and light reflection on two-dimensional code identification, to improve the performance of terminal deployment hardware while reducing cost, and to solve the problem of QR Code identification in industrial scenes.

Description

Industrial scene QR Code detection and identification method based on PPYOLOv2 model
Technical Field
The invention relates to the technical field of intelligent manufacturing, in particular to a PPYOLOv2 model-based industrial scene QR Code detection and identification method.
Background
In recent years, two-dimensional bar code technology has been widely applied in the field of industrial control; many industrial parts, electronic chips and product outer packages are marked with QR Codes to support quality management, product traceability and similar functions.
However, in current industrial control environments the bar code is often printed among other patterns, so its position must first be detected against the background. Industrial environments also impose constraints such as insufficient lighting, airborne dust and light reflection; defects are of many types, irregular in shape, location and area, and extremely fine; traditional algorithms have low accuracy while the required per-image detection speed is high; the performance and cost of terminal deployment hardware must be balanced; and the number of samples for each defect type is small, limiting the material available for model training. The industrial-scene QR Code detection and identification method based on the PPYOLOv2 model is therefore proposed to solve these problems.
Disclosure of Invention
The invention aims to solve the problems that defects are of many types, irregular in shape, position and area, and extremely fine, that traditional algorithms have low accuracy, and that the required detection speed for a single picture is high, and provides an industrial-scene QR Code detection and identification method based on the PPYOLOv2 model.
In order to achieve the purpose, the invention adopts the following technical scheme:
a PPYOLOv2 model-based industrial scene QR Code detection and identification method comprises the following steps:
S1: the three rectangles located at the upper left, lower left and upper right are position-detection patterns; the position of the first encoded character to the right of the upper-left rectangle is located precisely and the position of the two-dimensional code is detected; the image is input, the minimum quadrilateral enclosing the two-dimensional code is returned together with the function call status, and on a successful call the two-dimensional code image is obtained;
S2: the two-dimensional code image is corrected; the irregular two-dimensional code is projection-mapped using computer perspective transformation, and a complete two-dimensional code image is obtained through two transformations;
S3: the camera scans the black-and-white two-dimensional code; the mobile phone converts the acquired image into a binary image using the threshold theory of point operations, binarizes the image through a gray-value calculation formula to obtain a binarized image, then applies a dilation operation to the binarized image, and performs edge detection on the dilated image to obtain the outline of the bar code region;
S4: grid sampling is carried out, and image pixels are sampled at each intersection of the grid;
S5: the photoelectric converter receives the reflected light and generates an analog electrical signal, which is amplified, filtered and shaped into a square-wave signal;
S6: the decoder determines whether each module of the two-dimensional code is dark ("1") or light ("0") according to a threshold value so as to obtain the original binary sequence of the two-dimensional code, error-corrects and decodes the data, and finally converts the original data into character data according to the logical encoding rule of the bar code;
Preferably, in S1, the two-dimensional code position is detected with the YOLOv2 algorithm. Borrowing the anchor boxes of the RPN network, YOLOv2 predicts offsets of the bounding box relative to the prior box: the actual center position (x, y) of the bounding box is calculated from the predicted coordinate offsets (t_x, t_y), the scale of the prior box (w_a, h_a) and its center coordinates (x_a, y_a):
x = (t_x × w_a) - x_a
y = (t_y × h_a) - y_a
However, the above formula is unconstrained, so the predicted bounding box can easily shift in any direction: when t_x = 1 the box shifts right by one prior-box width, and when t_x = -1 it shifts left by one prior-box width. The box predicted at any position can therefore fall anywhere in the picture, which makes the model unstable and requires a long training time before correct offsets are predicted. YOLOv2 instead follows the approach of YOLOv1 and predicts the offset of the bounding-box center relative to the upper-left corner of the corresponding cell. To constrain the center point within the current cell, the offset is passed through a sigmoid function so that the predicted value lies in the range (0, 1), the scale of each cell being taken as 1. In summary, from the 4 predicted offsets t_x, t_y, t_w, t_h, the actual position and size of the bounding box are calculated according to the following formula:
b_x = σ(t_x) + c_x
b_y = σ(t_y) + c_y
b_w = p_w × e^(t_w)
b_h = p_h × e^(t_h)
where (c_x, c_y) are the coordinates of the upper-left corner of the cell; each cell is taken to have scale 1 in the calculation, so the coordinates of the upper-left corner of the current cell are (1, 1). Owing to the sigmoid function, the center of the bounding box is constrained within the current cell, which prevents excessive offsets. p_w and p_h are the width and height of the prior box, expressed relative to the feature map, in which each cell has width and height 1. Writing the size of the feature map as (W, H), the position and size of the bounding box relative to the whole picture (all 4 values between 0 and 1) can be calculated:
b_x = (σ(t_x) + c_x) / W
b_y = (σ(t_y) + c_y) / H
b_w = p_w × e^(t_w) / W
b_h = p_h × e^(t_h) / H
the final position and size of the bounding box can be obtained by multiplying the above 4 values by the width and length (pixel point values) of the picture, respectively.
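As an illustration only (not part of the claimed method), the decoding step above can be written compactly in code. The following minimal sketch converts the predicted offsets t_x, t_y, t_w, t_h of one cell into a pixel-space box; the feature-map size, prior-box size and 416 × 416 input resolution used in the example call are assumed values:

    import numpy as np

    def sigmoid(t):
        return 1.0 / (1.0 + np.exp(-t))

    def decode_yolov2_box(tx, ty, tw, th, cx, cy, pw, ph, W, H, img_w, img_h):
        """Convert the YOLOv2 offsets of one cell into a pixel-space box (center x, center y, w, h).
        cx, cy: upper-left coordinates of the cell on the feature map;
        pw, ph: prior (anchor) box size in feature-map cells; W, H: feature-map size."""
        bx = (sigmoid(tx) + cx) / W   # normalized center x in (0, 1)
        by = (sigmoid(ty) + cy) / H   # normalized center y in (0, 1)
        bw = pw * np.exp(tw) / W      # normalized width
        bh = ph * np.exp(th) / H      # normalized height
        # scale back to pixel coordinates of the input picture
        return bx * img_w, by * img_h, bw * img_w, bh * img_h

    # example: offsets for the cell at (1, 1) with an assumed 3.625 x 4.875 prior on a 13 x 13 map
    print(decode_yolov2_box(0.2, -0.1, 0.05, 0.11, 1, 1, 3.625, 4.875, 13, 13, 416, 416))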
Preferably, in S2, the original picture coordinates (u, v) are mapped to the transformed picture coordinates (x, y), where x = x'/w' and y = y'/w'; w for the source coordinates is identically 1. The general transformation formula is:
[x', y', w'] = [u, v, w] · [[a11, a12, a13], [a21, a22, a23], [a31, a32, a33]]
The transformation matrix [[a11, a12, a13], [a21, a22, a23], [a31, a32, a33]] can be split into 4 parts: [[a11, a12], [a21, a22]] represents a linear transformation, [a31, a32] represents the translation, and [a13, a23]^T generates the perspective effect. When the mapped view plane is not parallel to the original plane, the image after perspective transformation is generally not a parallelogram.
In the code we define several auxiliary variables for the perspective transformation (Δx1 corresponds to dx1 in the code, and so on):
Δx1 = x1 - x2, Δx2 = x3 - x2, Δx3 = x0 - x1 + x2 - x3
Δy1 = y1 - y2, Δy2 = y3 - y2, Δy3 = y0 - y1 + y2 - y3
(1) When Δx3 and Δy3 are both 0, the transformed plane is parallel to the original plane, and we obtain:
a11 = x1 - x0
a21 = x2 - x1
a31 = x0
a12 = y1 - y0
a22 = y2 - y1
a32 = y0
a13 = 0
a23 = 0
(2) When they are not both 0, we obtain:
a11 = x1 - x0 + a13·x1
a21 = x3 - x0 + a23·x3
a31 = x0
a12 = y1 - y0 + a13·y1
a22 = y3 - y0 + a23·y3
a32 = y0
a13 = (Δx3·Δy2 - Δx2·Δy3) / (Δx1·Δy2 - Δx2·Δy1)
a23 = (Δx1·Δy3 - Δx3·Δy1) / (Δx1·Δy2 - Δx2·Δy1)
The solved transformation matrix transforms a square into a quadrilateral. Conversely, with the target coordinates (x, y) as the unknowns and the values a11 to a33 known, the earlier transformation formula can be rewritten as:
x = (a11·u + a21·v + a31) / (a13·u + a23·v + a33)
y = (a12·u + a22·v + a32) / (a13·u + a23·v + a33)
Through two transformations, quadrilateral-to-square followed by square-to-quadrilateral, any quadrilateral can therefore be transformed into any other quadrilateral.
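In practice this two-step rectification is often delegated to a library routine that solves for the same 3 × 3 matrix. The sketch below only illustrates that idea using OpenCV and is not the patent's own implementation; the corner ordering and the output side length of 330 pixels are assumptions:

    import cv2
    import numpy as np

    def rectify_qr(image, corners, side=330):
        """Warp the quadrilateral QR region described by `corners` into a square image.
        corners: 4x2 array ordered top-left, top-right, bottom-right, bottom-left (assumed order);
        side: edge length in pixels of the rectified output (assumed value)."""
        src = np.asarray(corners, dtype=np.float32)
        dst = np.array([[0, 0], [side - 1, 0],
                        [side - 1, side - 1], [0, side - 1]], dtype=np.float32)
        # getPerspectiveTransform solves for the same 3x3 matrix a11..a33 discussed above
        M = cv2.getPerspectiveTransform(src, dst)
        return cv2.warpPerspective(image, M, (side, side))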
Preferably, in S3, objects of different colors reflect visible light of different wavelengths: a white object reflects visible light of all wavelengths, while a black object absorbs it. Therefore, when the camera scans the black-and-white two-dimensional code, the mobile phone converts the acquired image into a binary image using the threshold theory of point operations, i.e. the image is binarized to obtain a binarized image; a dilation operation is then applied to the binarized image, and edge detection on the dilated image yields the outline of the bar code region.
Preferably, in S3, the camera scans the black-and-white two-dimensional code, the mobile phone converts the acquired image into a binary image using the threshold theory of point operations, and the image is binarized through a gray-value calculation formula. A global threshold T is set and divides the image data into two parts: the pixels greater than T and the pixels less than T. The pixels greater than T are set to white and the pixels less than T are set to black. The whole image may also be divided into N windows according to a certain rule; for each of the N windows the pixels are divided into two parts by a per-window threshold T and binarized locally, where a parametric equation over local features such as the mean E of the window pixels, the variance P of the pixels and the root-mean-square value Q of the pixels is set up to compute the threshold.
Preferably, in S3, the camera scans the black-and-white two-dimensional code, the mobile phone converts the acquired image into a binary image using the threshold theory of point operations, the image is binarized through a gray-value calculation formula to obtain a binarized image, a dilation operation is applied to the binarized image, and edge detection is performed on the dilated image to obtain the outline of the bar code region. For the dilation, a 3 × 3 structuring element is scanned over each pixel of the image and an OR operation is performed between the structuring element and the binary image it covers: if all covered pixels are 0, the corresponding pixel of the result image is 0; otherwise it is 1.
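Purely as an illustration of this S3 chain (binarization, dilation, edge detection, contour extraction), the following sketch uses OpenCV; the global threshold, the window size of the local threshold and the Canny parameters are assumed values, not ones specified by the patent:

    import cv2
    import numpy as np

    def extract_code_contour(gray, T=None, win=31):
        """Binarize a grayscale image, dilate it and return the largest contour found."""
        if T is not None:
            # global threshold T: pixels above T become white, the rest black
            _, binary = cv2.threshold(gray, T, 255, cv2.THRESH_BINARY)
        else:
            # local thresholding: the threshold of each pixel comes from its win x win window
            binary = cv2.adaptiveThreshold(gray, 255, cv2.ADAPTIVE_THRESH_MEAN_C,
                                           cv2.THRESH_BINARY, win, 5)
        # dilation with a 3x3 structuring element (ORs each pixel with its covered neighborhood)
        dilated = cv2.dilate(binary, np.ones((3, 3), np.uint8), iterations=1)
        edges = cv2.Canny(dilated, 50, 150)          # edge detection on the dilated image
        contours, _ = cv2.findContours(edges, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
        return max(contours, key=cv2.contourArea) if contours else None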
Preferably, in S5, the photoelectric converter receives the reflected light and generates an analog electrical signal, which is amplified, filtered and shaped into a square-wave signal; a median filtering technique is adopted.
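For illustration, a minimal sketch of median filtering a sampled 1-D signal and shaping it into a binary square wave is given below; the kernel size of 5 and the mean-based decision level are assumptions:

    import numpy as np
    from scipy.signal import medfilt

    def shape_to_square_wave(signal, kernel_size=5):
        """Median-filter a sampled analog signal and shape it into a binary square wave."""
        filtered = medfilt(np.asarray(signal, dtype=float), kernel_size=kernel_size)
        level = filtered.mean()                     # assumed decision level
        return (filtered > level).astype(np.uint8)  # 1 above the level, 0 below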
Preferably, in S6, whether each module is dark ("1") or light ("0") is determined according to a threshold value so as to obtain the original binary sequence of the two-dimensional code; the data are error-corrected and decoded, and finally the original data are converted into character data according to the logical encoding rule of the bar code. For error correction, a term is found such that, multiplied by the generator polynomial, the product has the same leading term as the message polynomial; the result is XOR-ed with the message polynomial in the first step, or with the remainder in every subsequent step. These steps are performed n times, where n is the number of terms in the message polynomial; after the division of the two polynomials a remainder is left whose coefficients are the error-correction codewords. The obtained error-correction codewords are filled into the two-dimensional code grid according to the standard to obtain the data.
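The polynomial division described above runs over GF(256). The sketch below illustrates how the error-correction codewords (the remainder) can be computed; the primitive polynomial 0x11d and generator roots starting at α^0 follow the QR code convention, while the number of error-correction codewords nsym and the sample data bytes in the example are assumptions:

    # GF(256) log/antilog tables for the QR primitive polynomial x^8 + x^4 + x^3 + x^2 + 1 (0x11d)
    GF_EXP, GF_LOG = [0] * 512, [0] * 256
    _x = 1
    for _i in range(255):
        GF_EXP[_i] = _x
        GF_LOG[_x] = _i
        _x <<= 1
        if _x & 0x100:
            _x ^= 0x11d
    for _i in range(255, 512):
        GF_EXP[_i] = GF_EXP[_i - 255]

    def gf_mul(a, b):
        return 0 if a == 0 or b == 0 else GF_EXP[GF_LOG[a] + GF_LOG[b]]

    def gf_poly_mul(p, q):
        r = [0] * (len(p) + len(q) - 1)
        for j in range(len(q)):
            for i in range(len(p)):
                r[i + j] ^= gf_mul(p[i], q[j])
        return r

    def rs_generator_poly(nsym):
        """g(x) = (x - a^0)(x - a^1)...(x - a^(nsym-1))"""
        g = [1]
        for i in range(nsym):
            g = gf_poly_mul(g, [1, GF_EXP[i]])
        return g

    def rs_remainder(msg, nsym):
        """Divide the message polynomial by the generator polynomial;
        the remainder is the list of nsym error-correction codewords."""
        gen = rs_generator_poly(nsym)
        rem = list(msg) + [0] * nsym
        for i in range(len(msg)):
            coef = rem[i]
            if coef != 0:
                for j in range(1, len(gen)):
                    rem[i + j] ^= gf_mul(gen[j], coef)
        return rem[len(msg):]

    # example with illustrative data bytes and an assumed nsym of 10
    print(rs_remainder([0x40, 0xd2, 0x75, 0x47, 0x76, 0x17, 0x32, 0x06], 10))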
Compared with the prior art, the invention has the beneficial effects that:
1. The irregular image is restored through image-correction technology, which improves the recognition accuracy of QR Codes in industrial scenes; the code carries a large amount of information, has high reliability and low use cost, and can be widely applied in the field of industrial control.
2. The PPYOLOv2 model achieves higher positioning accuracy with high reliability, fuses SPP, and improves sample richness through a data-augmentation strategy.
The invention aims to overcome the influence of poor industrial conditions such as insufficient lighting, dust and light reflection on two-dimensional code identification, to improve the performance of terminal deployment hardware while reducing cost, and to solve the problem of QR Code identification in industrial scenes.
Drawings
Fig. 1 is a flowchart of an industrial scene QR Code detection and identification method based on a PPYOLOv2 model according to the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments.
Example one
Referring to fig. 1, a method for detecting and identifying an industrial scene QR Code based on a PPYOLOv2 model includes the following steps:
S1: the three rectangles located at the upper left, lower left and upper right are position-detection patterns; the position of the first encoded character to the right of the upper-left rectangle is located precisely and the position of the two-dimensional code is detected; the image is input, the minimum quadrilateral enclosing the two-dimensional code is returned together with the function call status, and on a successful call the two-dimensional code image is obtained;
S2: the two-dimensional code image is corrected; the irregular two-dimensional code is projection-mapped using computer perspective transformation, and a complete two-dimensional code image is obtained through two transformations;
S3: the camera scans the black-and-white two-dimensional code; the mobile phone converts the acquired image into a binary image using the threshold theory of point operations, binarizes the image through a gray-value calculation formula to obtain a binarized image, then applies a dilation operation to the binarized image, and performs edge detection on the dilated image to obtain the outline of the bar code region;
S4: grid sampling is carried out, and image pixels are sampled at each intersection of the grid;
S5: the photoelectric converter receives the reflected light and generates an analog electrical signal, which is amplified, filtered and shaped into a square-wave signal;
S6: the decoder determines whether each module of the two-dimensional code is dark ("1") or light ("0") according to a threshold value so as to obtain the original binary sequence of the two-dimensional code, error-corrects and decodes the data, and finally converts the original data into character data according to the logical encoding rule of the bar code;
In this embodiment, the two-dimensional code position is detected with the YOLOv2 algorithm. Borrowing the anchor boxes of the RPN network, YOLOv2 predicts offsets of the bounding box relative to the prior box: the actual center position (x, y) of the bounding box is calculated from the predicted coordinate offsets (t_x, t_y), the scale of the prior box (w_a, h_a) and its center coordinates (x_a, y_a):
x = (t_x × w_a) - x_a
y = (t_y × h_a) - y_a
However, the above formula is unconstrained, so the predicted bounding box can easily shift in any direction: when t_x = 1 the box shifts right by one prior-box width, and when t_x = -1 it shifts left by one prior-box width. The box predicted at any position can therefore fall anywhere in the picture, which makes the model unstable and requires a long training time before correct offsets are predicted. YOLOv2 instead follows the approach of YOLOv1 and predicts the offset of the bounding-box center relative to the upper-left corner of the corresponding cell. To constrain the center point within the current cell, the offset is passed through a sigmoid function so that the predicted value lies in the range (0, 1), the scale of each cell being taken as 1. In summary, from the 4 predicted offsets t_x, t_y, t_w, t_h, the actual position and size of the bounding box are calculated according to the following formula:
b_x = σ(t_x) + c_x
b_y = σ(t_y) + c_y
b_w = p_w × e^(t_w)
b_h = p_h × e^(t_h)
where (c_x, c_y) are the coordinates of the upper-left corner of the cell; each cell is taken to have scale 1 in the calculation, so the coordinates of the upper-left corner of the current cell are (1, 1). Owing to the sigmoid function, the center of the bounding box is constrained within the current cell, which prevents excessive offsets. p_w and p_h are the width and height of the prior box, expressed relative to the feature map, in which each cell has width and height 1. Writing the size of the feature map as (W, H), the position and size of the bounding box relative to the whole picture (all 4 values between 0 and 1) can be calculated:
b_x = (σ(t_x) + c_x) / W
b_y = (σ(t_y) + c_y) / H
b_w = p_w × e^(t_w) / W
b_h = p_h × e^(t_h) / H
the final position and size of the bounding box can be obtained by multiplying the above 4 values by the width and length (pixel point values) of the picture, respectively.
In this embodiment, the original picture coordinates (u, v) are mapped to the transformed picture coordinates (x, y), where x = x'/w' and y = y'/w'; w for the source coordinates is identically 1. The general transformation formula is:
[x', y', w'] = [u, v, w] · [[a11, a12, a13], [a21, a22, a23], [a31, a32, a33]]
The transformation matrix [[a11, a12, a13], [a21, a22, a23], [a31, a32, a33]] can be split into 4 parts: [[a11, a12], [a21, a22]] represents a linear transformation, [a31, a32] represents the translation, and [a13, a23]^T generates the perspective effect. When the mapped view plane is not parallel to the original plane, the image after perspective transformation is generally not a parallelogram.
In the code we define several auxiliary variables for the perspective transformation (Δx1 corresponds to dx1 in the code, and so on):
Δx1 = x1 - x2, Δx2 = x3 - x2, Δx3 = x0 - x1 + x2 - x3
Δy1 = y1 - y2, Δy2 = y3 - y2, Δy3 = y0 - y1 + y2 - y3
(1) When Δx3 and Δy3 are both 0, the transformed plane is parallel to the original plane, and we obtain:
a11 = x1 - x0
a21 = x2 - x1
a31 = x0
a12 = y1 - y0
a22 = y2 - y1
a32 = y0
a13 = 0
a23 = 0
(2) When they are not both 0, we obtain:
a11 = x1 - x0 + a13·x1
a21 = x3 - x0 + a23·x3
a31 = x0
a12 = y1 - y0 + a13·y1
a22 = y3 - y0 + a23·y3
a32 = y0
a13 = (Δx3·Δy2 - Δx2·Δy3) / (Δx1·Δy2 - Δx2·Δy1)
a23 = (Δx1·Δy3 - Δx3·Δy1) / (Δx1·Δy2 - Δx2·Δy1)
The solved transformation matrix transforms a square into a quadrilateral. Conversely, with the target coordinates (x, y) as the unknowns and the values a11 to a33 known, the earlier transformation formula can be rewritten as:
x = (a11·u + a21·v + a31) / (a13·u + a23·v + a33)
y = (a12·u + a22·v + a32) / (a13·u + a23·v + a33)
Through two transformations, quadrilateral-to-square followed by square-to-quadrilateral, any quadrilateral can therefore be transformed into any other quadrilateral.
In this embodiment, objects of different colors reflect visible light of different wavelengths: a white object reflects visible light of all wavelengths, while a black object absorbs it. Therefore, when the camera scans the black-and-white two-dimensional code, the mobile phone converts the acquired image into a binary image using the threshold theory of point operations, i.e. binarizes the image to obtain a binarized image; a dilation operation is then applied to the binarized image, and edge detection on the dilated image yields the outline of the bar code region.
In this embodiment, the camera scans the black-and-white two-dimensional code, the mobile phone converts the acquired image into a binary image using the threshold theory of point operations, and the image is binarized through a gray-value calculation formula. A global threshold T is set and divides the image data into two parts: the pixels greater than T and the pixels less than T. The pixels greater than T are set to white and the pixels less than T are set to black. The whole image may also be divided into N windows according to a certain rule; for each of the N windows the pixels are divided into two parts by a per-window threshold T and binarized locally, where a parametric equation over local features such as the mean E of the window pixels, the variance P of the pixels and the root-mean-square value Q of the pixels is set up to compute the threshold.
In this embodiment, the camera scans the black-and-white two-dimensional code, the mobile phone converts the acquired image into a binary image using the threshold theory of point operations, the image is binarized through a gray-value calculation formula to obtain a binarized image, a dilation operation is applied to the binarized image, and edge detection is performed on the dilated image to obtain the outline of the bar code region. For the dilation, a 3 × 3 structuring element is scanned over each pixel of the image and an OR operation is performed between the structuring element and the binary image it covers: if all covered pixels are 0, the corresponding pixel of the result image is 0; otherwise it is 1.
In this embodiment, whether each module is dark ("1") or light ("0") is determined according to the threshold value so as to obtain the original binary sequence of the two-dimensional code; the data are error-corrected and decoded, and finally the original data are converted into character data according to the logical encoding rule of the bar code. For error correction, a term is found such that, multiplied by the generator polynomial, the product has the same leading term as the message polynomial; the result is XOR-ed with the message polynomial in the first step, or with the remainder in every subsequent step. These steps are performed n times, where n is the number of terms in the message polynomial; after the division of the two polynomials a remainder is left whose coefficients are the error-correction codewords. The obtained error-correction codewords are filled into the two-dimensional code grid according to the standard to obtain the data.
In this embodiment, the photoelectric converter receives the reflected light and generates an analog electrical signal, which is amplified, filtered and shaped into a square-wave signal; a median filtering technique is adopted.
The above description concerns only the preferred embodiment of the present invention, but the protection scope of the present invention is not limited thereto; any modification or substitution of the technical solutions and the inventive concept thereof that can be readily conceived by a person skilled in the art within the technical scope disclosed by the present invention shall fall within the protection scope of the present invention.

Claims (8)

1. A QR Code detection and identification method based on a PPYOLOv2 model in an industrial scene is characterized by comprising the following steps:
S1: the three rectangles located at the upper left, lower left and upper right are position-detection patterns; the position of the first encoded character to the right of the upper-left rectangle is located precisely and the position of the two-dimensional code is detected; the image is input, the minimum quadrilateral enclosing the two-dimensional code is returned together with the function call status, and on a successful call the two-dimensional code image is obtained;
S2: the two-dimensional code image is corrected; the irregular two-dimensional code is projection-mapped using computer perspective transformation, and a complete two-dimensional code image is obtained through two transformations;
S3: the camera scans the black-and-white two-dimensional code; the mobile phone converts the acquired image into a binary image using the threshold theory of point operations, binarizes the image through a gray-value calculation formula to obtain a binarized image, then applies a dilation operation to the binarized image, and performs edge detection on the dilated image to obtain the outline of the bar code region;
S4: grid sampling is carried out, and image pixels are sampled at each intersection of the grid;
S5: the photoelectric converter receives the reflected light and generates an analog electrical signal, which is amplified, filtered and shaped into a square-wave signal;
S6: the decoder determines whether each module of the two-dimensional code is dark ("1") or light ("0") according to a threshold value so as to obtain the original binary sequence of the two-dimensional code, error-corrects and decodes the data, and finally converts the original data into character data according to the logical encoding rule of the bar code.
2. The method for QR Code detection and identification of industrial scenes based on the PPYOLOv2 model according to claim 1, wherein in S1 the position of the two-dimensional code is detected with the YOLOv2 algorithm; borrowing the anchor boxes of the RPN network, YOLOv2 predicts offsets of the bounding box relative to the prior box, and the actual center position (x, y) of the bounding box is calculated from the predicted coordinate offsets (t_x, t_y), the scale of the prior box (w_a, h_a) and its center coordinates (x_a, y_a):
x = (t_x × w_a) - x_a
y = (t_y × h_a) - y_a
YOLOv2 adopts the method of YOLOv1 and predicts the offset of the bounding-box center relative to the upper-left corner of the corresponding cell; from the 4 predicted offsets t_x, t_y, t_w, t_h, the actual position and size of the bounding box are calculated according to the following formula:
b_x = σ(t_x) + c_x
b_y = σ(t_y) + c_y
b_w = p_w × e^(t_w)
b_h = p_h × e^(t_h)
where (c_x, c_y) are the coordinates of the upper-left corner of the cell; each cell is taken to have scale 1 in the calculation, so the coordinates of the upper-left corner of the current cell are (1, 1); owing to the sigmoid function, the center position of the bounding box is constrained within the current cell; p_w and p_h are the width and height of the prior box; writing the size of the feature map as (W, H), the position and size of the bounding box relative to the whole picture are calculated:
b_x = (σ(t_x) + c_x) / W
b_y = (σ(t_y) + c_y) / H
b_w = p_w × e^(t_w) / W
b_h = p_h × e^(t_h) / H
multiplying the above 4 values by the width and length of the picture, respectively, yields the final position and size of the bounding box.
3. The method for detecting and identifying the QR Code of the industrial scene based on the PPYOLOv2 model according to claim 1, wherein in S2 the original picture coordinates (u, v) are mapped to the transformed picture coordinates (x, y), where x = x'/w' and y = y'/w'; w for the source coordinates is identically 1, and the general transformation formula is:
[x', y', w'] = [u, v, w] · [[a11, a12, a13], [a21, a22, a23], [a31, a32, a33]]
The transformation matrix [[a11, a12, a13], [a21, a22, a23], [a31, a32, a33]] can be split into 4 parts: [[a11, a12], [a21, a22]] represents a linear transformation, [a31, a32] represents the translation, and [a13, a23]^T generates the perspective effect; when the mapped view plane is not parallel to the original plane, the image after perspective transformation is generally not a parallelogram. In the code we define several auxiliary variables for the perspective transformation (Δx1 corresponds to dx1 in the code, and so on):
Δx1 = x1 - x2, Δx2 = x3 - x2, Δx3 = x0 - x1 + x2 - x3
Δy1 = y1 - y2, Δy2 = y3 - y2, Δy3 = y0 - y1 + y2 - y3
When Δx3 and Δy3 are both 0, the transformed plane is parallel to the original plane, and we obtain:
a11 = x1 - x0
a21 = x2 - x1
a31 = x0
a12 = y1 - y0
a22 = y2 - y1
a32 = y0
a13 = 0
a23 = 0
When they are not both 0, we obtain:
a11 = x1 - x0 + a13·x1
a21 = x3 - x0 + a23·x3
a31 = x0
a12 = y1 - y0 + a13·y1
a22 = y3 - y0 + a23·y3
a32 = y0
a13 = (Δx3·Δy2 - Δx2·Δy3) / (Δx1·Δy2 - Δx2·Δy1)
a23 = (Δx1·Δy3 - Δx3·Δy1) / (Δx1·Δy2 - Δx2·Δy1)
The solved transformation matrix transforms a square into a quadrilateral. Conversely, with the target coordinates (x, y) as the unknowns and the values a11 to a33 known, the earlier transformation formula can be rewritten as:
x = (a11·u + a21·v + a31) / (a13·u + a23·v + a33)
y = (a12·u + a22·v + a32) / (a13·u + a23·v + a33)
Through two transformations, quadrilateral-to-square followed by square-to-quadrilateral, any quadrilateral can therefore be transformed into any other quadrilateral.
4. The method as claimed in claim 1, wherein in S3, objects of different colors reflect visible light of different wavelengths: white objects reflect visible light of all wavelengths, while black objects absorb it; therefore, when the camera scans the black-and-white two-dimensional code, the mobile phone converts the acquired image into a binary image using the threshold theory of point operations, i.e. binarizes the image to obtain a binarized image, then applies a dilation operation to the binarized image, and performs edge detection on the dilated image to obtain the outline of the bar code region.
5. The method as claimed in claim 4, wherein in S3, the camera scans the black-and-white two-dimensional code, the mobile phone converts the acquired image into a binary image using the threshold theory of point operations, and the image is binarized by a gray-value calculation formula; a global threshold T is set and divides the image data into two parts: the pixels greater than T and the pixels less than T; the pixels greater than T are set to white and the pixels less than T are set to black; the whole image may also be divided into N windows according to a certain rule, for each of the N windows the pixels are divided into two parts by a per-window threshold T and binarized locally, and a parametric equation over local features such as the mean E of the window pixels, the variance P of the pixels and the root-mean-square value Q of the pixels is set up to compute the threshold.
6. The method as claimed in claim 5, wherein in S3, the camera scans the black-and-white two-dimensional code, the mobile phone converts the acquired image into a binary image using the threshold theory of point operations, the image is binarized through a gray-value calculation formula to obtain a binarized image, a dilation operation is applied to the binarized image, and edge detection is performed on the dilated image to obtain the outline of the bar code region; a 3 × 3 structuring element is scanned over each pixel of the image and an OR operation is performed between the structuring element and the binary image it covers: if all covered pixels are 0, the corresponding pixel of the result image is 0, otherwise it is 1.
7. The QR Code detection and identification method based on the PPYOLOv2 model for industrial scenes as claimed in claim 6, wherein in S6, whether each module is dark ("1") or light ("0") is determined according to the threshold value so as to obtain the original binary sequence of the two-dimensional code; the data are error-corrected and decoded, and finally the original data are converted into text data according to the logical encoding rule of the bar code; a term is found such that, multiplied by the generator polynomial, the product has the same leading term as the message polynomial; the result is XOR-ed with the message polynomial in the first step, or with the remainder in every subsequent step; these steps are performed n times, where n is the number of terms in the message polynomial; after the division of the two polynomials a remainder is left whose coefficients are the error-correction codewords; the obtained error-correction codewords are filled into the two-dimensional code grid according to the standard to obtain the data.
8. The QR Code detection and identification method based on the PPYOLOv2 model for industrial scenes as claimed in claim 1, wherein in S5, the photoelectric converter receives the reflected light and generates an analog electrical signal, which is amplified, filtered and shaped into a square-wave signal; a median filtering technique is adopted.
CN202210883094.6A 2022-07-21 2022-07-21 Industrial scene QR Code detection and identification method based on PPYOLOv2 model Pending CN115270839A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210883094.6A CN115270839A (en) 2022-07-21 2022-07-21 Industrial scene QR Code detection and identification method based on PPYOLOv2 model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210883094.6A CN115270839A (en) 2022-07-21 2022-07-21 Industrial scene QR Code detection and identification method based on PPYOLOv2 model

Publications (1)

Publication Number Publication Date
CN115270839A true CN115270839A (en) 2022-11-01

Family

ID=83769188

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210883094.6A Pending CN115270839A (en) 2022-07-21 2022-07-21 Industrial scene QR Code detection and identification method based on PPYOLOv2 model

Country Status (1)

Country Link
CN (1) CN115270839A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116167394A (en) * 2023-02-21 2023-05-26 深圳牛图科技有限公司 Bar code recognition method and system
CN116882433A (en) * 2023-09-07 2023-10-13 无锡维凯科技有限公司 Machine vision-based code scanning identification method and system
CN116882433B (en) * 2023-09-07 2023-12-08 无锡维凯科技有限公司 Machine vision-based code scanning identification method and system

Similar Documents

Publication Publication Date Title
CN115270839A (en) Industrial scene QR Code detection and identification method based on PPYOLOv2 model
JP2832646B2 (en) Method and apparatus for determining a fine azimuth of a barcode symbol in a two-dimensional CDD image
US7764835B2 (en) Method and apparatus for recognizing code
US7780084B2 (en) 2-D barcode recognition
EP3309703A1 (en) Method and system for decoding qr code based on weighted average grey method
US20070272755A1 (en) Two-directional bar code symbol and its encoding & decoding method
CN105989317B (en) Two-dimensional code identification method and device
CN110647795B (en) Form identification method
CN110175997B (en) Display screen dead pixel detection method and device, computer equipment and storage medium
CN110569683B (en) Bar code detection method and bar code detection system
US9769454B2 (en) Method for generating a depth map, related system and computer program product
US8538191B2 (en) Image correction apparatus and method for eliminating lighting component
CN105844277B (en) Label identification method and device
US9396422B1 (en) Methods and systems for localizing aztec codes
CN113392669B (en) Image information detection method, detection device and storage medium
US20060210164A1 (en) Image processing device
CN111222355A (en) Method and system for positioning bar code on PCB
CN116976372A (en) Picture identification method, device, equipment and medium based on square reference code
CN111428534B (en) Decryption identification method based on dot matrix hidden writing information coding
CN113222895A (en) Electrode defect detection method and system based on artificial intelligence
CN110598697A (en) Container number positioning method based on thickness character positioning
Nguyen et al. Symmetrical-symbol pattern based structured light for accurate decoding
Sakharov et al. Research and Development of Information Technology for Determining Shoe Size by Image
Wu et al. A Distortion Rectification Method for Distorted QR Code Images Based on Self-Adapting Structural Element
CN109584287B (en) Spatial coding structured light decoding method based on deep learning target detection

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination