CN115439675A - Complex feature target identification and classification method based on machine vision

Complex feature target identification and classification method based on machine vision

Info

Publication number
CN115439675A
Authority
CN
China
Prior art keywords
image
pixel
identifying
loss
machine vision
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210935073.4A
Other languages
Chinese (zh)
Inventor
翟石磊
王杰
姜贵轩
孙安玉
陈旺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhuhai Technology Suzhou Co., Ltd.
Original Assignee
Zhuhai Technology Suzhou Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhuhai Technology Suzhou Co., Ltd.
Priority to CN202210935073.4A
Publication of CN115439675A
Legal status: Pending

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06V - IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V 10/00 - Arrangements for image or video recognition or understanding
    • G06V 10/20 - Image preprocessing
    • G06V 10/26 - Segmentation of patterns in the image field; cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; detection of occlusion
    • G06V 10/30 - Noise filtering
    • G06V 10/34 - Smoothing or thinning of the pattern; morphological operations; skeletonisation
    • G06V 10/36 - Applying a local operator, i.e. means to operate on image points situated in the vicinity of a given point; non-linear local filtering operations, e.g. median filtering
    • G06V 10/70 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V 10/762 - using clustering, e.g. of similar faces in social networks
    • G06V 10/763 - non-hierarchical techniques, e.g. based on statistics of modelling distributions
    • G06V 10/764 - using classification, e.g. of video objects
    • G06V 10/82 - using neural networks
    • G06V 2201/00 - Indexing scheme relating to image or video recognition or understanding
    • G06V 2201/07 - Target detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Databases & Information Systems (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Health & Medical Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Nonlinear Science (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to a method for identifying and classifying complex-feature targets based on machine vision. It combines a composite-feature-based target identification algorithm with a convolutional neural network to achieve rapid identification and classification of complex-feature targets against complex backgrounds, and it has good generality. Machine learning and deep learning algorithms are introduced into the target detection and classification process, improving target identification accuracy as far as possible while greatly improving the efficiency of the algorithm.

Description

Complex feature target identification and classification method based on machine vision
Technical Field
The invention relates to the technical field of industrial intelligent detection, in particular to a complex characteristic target identification and classification method based on machine vision.
Background
With progress in related technologies such as computer science and image processing, machine-vision-based image detection methods are increasingly used in industrial surface-inspection applications. However, for complex image content, and especially for detection targets with complex feature information, the accuracy and efficiency of current identification and classification algorithms are unsatisfactory, which greatly restricts the application of artificial intelligence in the related fields.
To address these problems, the invention provides a method for identifying and classifying complex-feature targets based on machine vision that achieves rapid identification and classification of such targets against complex backgrounds, has good generality, improves target identification accuracy as far as possible, and greatly improves the efficiency of the algorithm.
Disclosure of Invention
The invention aims to overcome the above problems in the prior art by providing a method for identifying and classifying complex-feature targets based on machine vision.
To achieve this technical purpose and its technical effect, the invention is realized through the following technical scheme:
a method for identifying and classifying complex feature targets based on machine vision comprises the following steps:
step S1: preprocessing the image: denoising it and compensating the brightness of regions with uneven illumination;
step S2: performing image threshold segmentation with a composite-feature-based target identification algorithm, the resulting pixel subsets forming regions that correspond to real-scene targets; reducing or eliminating image noise; removing the sets of non-edge pixels so that only candidate edges remain; and applying threshold hysteresis with high and low thresholds;
step S3: performing image clustering to label the features efficiently;
step S4: identifying and classifying the features to accomplish target detection.
Further, in step S1, the image preprocessing method is as follows: first apply Gaussian filtering to denoise the image, then process the filtered image locally to compensate for uneven illumination; wherein,
the Gaussian denoising uses a two-dimensional zero-mean discrete Gaussian function as the smoothing filter, applied in three passes; the two-dimensional Gaussian function is:

G(x,y) = \frac{1}{2\pi\sigma^2} e^{-\frac{x^2+y^2}{2\sigma^2}}

where σ is the standard deviation;
after the three passes of Gaussian denoising, the filtered image is processed locally to compensate for uneven illumination: the source image is divided into N×M sub-block regions; the average gray value of each sub-block is computed to obtain the sub-block brightness matrix of the source image; the sub-block brightness-difference matrix is enlarged to the size of the source image by cubic interpolation to obtain the brightness distribution matrix of the original image; finally, the brightness distribution matrix is subtracted from the original image to obtain an image free of uneven illumination.
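For illustration, a minimal Python/OpenCV sketch of this preprocessing stage follows. The three-pass smoothing mirrors the description above; the 8×8 sub-block grid, kernel size, and σ value are assumptions the patent does not fix, and the sub-block brightness difference is taken against the global mean brightness.

```python
import cv2
import numpy as np

def preprocess(image, passes=3, ksize=5, sigma=1.0, blocks=(8, 8)):
    """Sketch of step S1: three passes of Gaussian denoising followed by
    uneven-illumination compensation. Parameter values are assumptions."""
    # Three passes of smoothing with a zero-mean 2-D Gaussian kernel.
    denoised = image.astype(np.float32)
    for _ in range(passes):
        denoised = cv2.GaussianBlur(denoised, (ksize, ksize), sigma)

    # Divide the image into N x M sub-blocks; area interpolation approximates
    # each block's average gray value (the sub-block brightness matrix).
    n, m = blocks
    h, w = denoised.shape[:2]
    brightness = cv2.resize(denoised, (m, n), interpolation=cv2.INTER_AREA)

    # Brightness-difference matrix, enlarged back to the source size with
    # cubic interpolation -> brightness distribution matrix.
    diff = brightness - brightness.mean()
    distribution = cv2.resize(diff, (w, h), interpolation=cv2.INTER_CUBIC)

    # Subtract the brightness distribution from the image to remove the
    # uneven-illumination component.
    corrected = np.clip(denoised - distribution, 0, 255).astype(np.uint8)
    return corrected
```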
Further, in step S2, the target identification algorithm based on the composite feature includes the following steps:
step S201: image threshold segmentation: the set of pixels in the image is partitioned by gray level, and the resulting subsets form regions corresponding to real-scene targets:

g(x,y) = \begin{cases} 1, & f(x,y) \ge T \\ 0, & f(x,y) < T \end{cases}

where T is the set gray-value threshold;
step S202: a Gaussian smoothing filter is used to reduce or eliminate image noise; with f(x,y) the input image, the Gaussian function is defined as:

G(x,y) = \frac{1}{2\pi\sigma^2} e^{-\frac{x^2+y^2}{2\sigma^2}}

where σ is the standard deviation, and the Gaussian-smoothed output image is f_s(x,y) = G(x,y) * f(x,y);
step S203: compute the gradient magnitude and direction, apply non-maximum suppression, and remove the sets of non-edge pixels so that only candidate edges remain;
step S204: apply threshold hysteresis with high and low thresholds, as follows: if the magnitude at a pixel position in the image exceeds the set high threshold, the pixel is kept as an edge pixel; if the magnitude is below the set low threshold, the pixel is not identified as an edge pixel; if the magnitude lies between the two thresholds, the pixel is kept as an edge pixel only if it is connected to a pixel whose magnitude exceeds the high threshold.
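Read together, steps S201-S204 amount to gray-level thresholding followed by Canny-style edge extraction (Gaussian smoothing, gradient computation, non-maximum suppression, hysteresis). A compact sketch under that reading is given below; all threshold values are illustrative assumptions, and cv2.Canny performs steps S203 and S204 internally.

```python
import cv2

def composite_feature_edges(gray, t_seg=128, t_low=50, t_high=150):
    """Sketch of steps S201-S204: threshold segmentation plus Canny-style
    edge extraction. Threshold values are illustrative assumptions."""
    # S201: threshold segmentation -- pixels >= T map to 255, others to 0.
    _, segmented = cv2.threshold(gray, t_seg, 255, cv2.THRESH_BINARY)

    # S202: Gaussian smoothing f_s = G * f to suppress noise.
    smoothed = cv2.GaussianBlur(gray, (5, 5), 1.0)

    # S203 + S204: cv2.Canny computes gradient magnitude and direction,
    # applies non-maximum suppression, and performs hysteresis with the
    # given low and high thresholds.
    edges = cv2.Canny(smoothed, t_low, t_high)
    return segmented, edges
```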
Further, in step S3, the method for clustering images includes the following steps:
step S301: randomly selecting K central points;
step S302: assigning each data point to its nearest center point;
step S303: recalculating each cluster center as the mean of the data points assigned to it;
step S304: reassigning each data point to its nearest center point;
step S305: repeating step S303 and step S304 until the assignments no longer change or the maximum number of iterations is reached.
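Steps S301-S305 describe standard K-means clustering. A minimal NumPy sketch follows; the number of clusters K, the iteration cap, and the random seed are illustrative assumptions.

```python
import numpy as np

def kmeans(points, k=3, max_iter=100, seed=0):
    """Minimal K-means over feature vectors (steps S301-S305).
    k, max_iter, and seed are illustrative assumptions."""
    rng = np.random.default_rng(seed)
    points = np.asarray(points, dtype=float)
    # S301: randomly select K center points from the data.
    centers = points[rng.choice(len(points), size=k, replace=False)].copy()
    labels = np.full(len(points), -1)
    for _ in range(max_iter):
        # S302/S304: assign every point to its nearest center.
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        new_labels = dists.argmin(axis=1)
        # S305: stop once the assignments no longer change.
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
        # S303: recompute each center as the mean of its assigned points.
        for j in range(k):
            members = points[labels == j]
            if len(members) > 0:
                centers[j] = members.mean(axis=0)
    return labels, centers
```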
Further, in step S4, the target identification and classification algorithm includes the following steps:
step S401: preset anchor frames of different widths and heights, using t_x, t_y to represent the predicted object position and t_w, t_h to represent the predicted object size;
step S402: find the anchor frame closest in shape to the detected object, together with the transformation from that anchor frame to the object's bounding rectangle;
step S403: compute the classification loss with the cross-entropy function; let the object class label be y and the neural network output be \hat{y}; then:

L_{obj} = -\left[ y \log \hat{y} + (1-y)\log(1-\hat{y}) \right]

where L_{obj} is the classification loss value;
step S404: compute the localization loss with the mean-square-error function; let the annotated position be t^* and the neural network output be \hat{t}; then:

L_{xy} = (\hat{t}_x - t^*_x)^2 + (\hat{t}_y - t^*_y)^2
L_{wh} = (\hat{t}_w - t^*_w)^2 + (\hat{t}_h - t^*_h)^2

where t^*_x, t^*_y are the position parameters, t^*_w, t^*_h are the size parameters, L_{xy} is the position loss, and L_{wh} is the size loss;
step S405: multiply each loss function by its corresponding weight and sum them to obtain the cost function of the network:

L = \lambda_{obj} L_{obj} + \lambda_{xy} L_{xy} + \lambda_{wh} L_{wh}

where the λ terms are the weight parameters.
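As a worked illustration of steps S403-S405, the sketch below evaluates the combined cost for a single prediction. The λ weight values are assumptions, since the patent leaves them unspecified, and the t vectors are ordered (t_x, t_y, t_w, t_h) for convenience.

```python
import numpy as np

def detection_cost(y, y_hat, t_star, t_hat,
                   lam_obj=1.0, lam_xy=5.0, lam_wh=5.0):
    """Cost function of steps S403-S405. The lambda weights are
    illustrative assumptions; t vectors are (t_x, t_y, t_w, t_h)."""
    eps = 1e-7  # guard against log(0)
    # S403: cross-entropy classification loss L_obj.
    l_obj = -(y * np.log(y_hat + eps) + (1 - y) * np.log(1 - y_hat + eps))
    # S404: mean-square-error position and size losses L_xy and L_wh.
    l_xy = (t_hat[0] - t_star[0]) ** 2 + (t_hat[1] - t_star[1]) ** 2
    l_wh = (t_hat[2] - t_star[2]) ** 2 + (t_hat[3] - t_star[3]) ** 2
    # S405: weighted sum of the individual losses gives the network cost.
    return lam_obj * l_obj + lam_xy * l_xy + lam_wh * l_wh
```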
The beneficial effects of the invention are:
the method combines the target recognition algorithm based on the composite features with the convolutional neural network, realizes the rapid recognition and classification of the complex feature targets in the complex background, has good universality, introduces the machine learning and deep learning algorithm in the target detection and classification process, improves the target recognition accuracy as much as possible, and greatly improves the efficiency of the algorithm.
Drawings
FIG. 1 is a block flow diagram of the present invention;
FIG. 2 is a block flow diagram of image pre-processing in an embodiment of the invention;
FIG. 3 is a block diagram of a target recognition algorithm based on composite features according to an embodiment of the present invention.
Detailed Description
The present invention will be described in detail below with reference to the accompanying drawings in conjunction with embodiments.
As shown in fig. 1, a method for identifying and classifying complex feature targets based on machine vision includes the following steps:
step S1: preprocessing the image: denoising it and compensating the brightness of regions with uneven illumination;
step S2: performing image threshold segmentation with a composite-feature-based target identification algorithm, the resulting pixel subsets forming regions that correspond to real-scene targets; reducing or eliminating image noise; removing the sets of non-edge pixels so that only candidate edges remain; and applying threshold hysteresis with high and low thresholds;
step S3: performing image clustering to label the features efficiently;
step S4: identifying and classifying the features to accomplish target detection.
As shown in fig. 2, in step S1, the image preprocessing method is as follows: first apply Gaussian filtering to denoise the image, then process the filtered image locally to compensate for uneven illumination; wherein,
the Gaussian denoising uses a two-dimensional zero-mean discrete Gaussian function as the smoothing filter, applied in three passes; the two-dimensional Gaussian function is:

G(x,y) = \frac{1}{2\pi\sigma^2} e^{-\frac{x^2+y^2}{2\sigma^2}}

where σ is the standard deviation;
after the three passes of Gaussian denoising, the filtered image is processed locally to compensate for uneven illumination: the source image is divided into N×M sub-block regions; the average gray value of each sub-block is computed to obtain the sub-block brightness matrix of the source image; the sub-block brightness-difference matrix is enlarged to the size of the source image by cubic interpolation to obtain the brightness distribution matrix of the original image; finally, the brightness distribution matrix is subtracted from the original image to obtain an image free of uneven illumination.
As shown in fig. 3, in step S2, the target identification algorithm based on the composite feature includes the following steps:
step S201: image threshold segmentation: the set of pixels in the image is partitioned by gray level, and the resulting subsets form regions corresponding to real-scene targets:

g(x,y) = \begin{cases} 1, & f(x,y) \ge T \\ 0, & f(x,y) < T \end{cases}

where T is the set gray-value threshold;
step S202: a Gaussian smoothing filter is used to reduce or eliminate image noise; with f(x,y) the input image, the Gaussian function is defined as:

G(x,y) = \frac{1}{2\pi\sigma^2} e^{-\frac{x^2+y^2}{2\sigma^2}}

where σ is the standard deviation, and the Gaussian-smoothed output image is f_s(x,y) = G(x,y) * f(x,y);
step S203: compute the gradient magnitude and direction, apply non-maximum suppression, and remove the sets of non-edge pixels so that only candidate edges remain;
step S204: apply threshold hysteresis with high and low thresholds, as follows: if the magnitude at a pixel position in the image exceeds the set high threshold, the pixel is kept as an edge pixel; if the magnitude is below the set low threshold, the pixel is not identified as an edge pixel; if the magnitude lies between the two thresholds, the pixel is kept as an edge pixel only if it is connected to a pixel whose magnitude exceeds the high threshold.
In step S3, the image clustering method includes the following steps:
step S301: randomly selecting K central points;
step S302: assigning each data point to its nearest center point;
step S303: recalculating each cluster center as the mean of the data points assigned to it;
step S304: reassigning each data point to its nearest center point;
step S305: repeating step S303 and step S304 until the assignments no longer change or the maximum number of iterations is reached.
In step S4, the target identification and classification algorithm includes the following steps:
step S401: preset anchor frames of different widths and heights, using t_x, t_y to represent the predicted object position and t_w, t_h to represent the predicted object size;
step S402: find the anchor frame closest in shape to the detected object, together with the transformation from that anchor frame to the object's bounding rectangle;
step S403: compute the classification loss with the cross-entropy function; let the object class label be y and the neural network output be \hat{y}; then:

L_{obj} = -\left[ y \log \hat{y} + (1-y)\log(1-\hat{y}) \right]

where L_{obj} is the classification loss value;
step S404: compute the localization loss with the mean-square-error function; let the annotated position be t^* and the neural network output be \hat{t}; then:

L_{xy} = (\hat{t}_x - t^*_x)^2 + (\hat{t}_y - t^*_y)^2
L_{wh} = (\hat{t}_w - t^*_w)^2 + (\hat{t}_h - t^*_h)^2

where t^*_x, t^*_y are the position parameters, t^*_w, t^*_h are the size parameters, L_{xy} is the position loss, and L_{wh} is the size loss;
step S405: multiply each loss function by its corresponding weight and sum them to obtain the cost function of the network:

L = \lambda_{obj} L_{obj} + \lambda_{xy} L_{xy} + \lambda_{wh} L_{wh}

where the λ terms are the weight parameters.
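Putting the stages together, a hypothetical end-to-end invocation of the sketches above might look like this; the file name, cluster count, and the stubbed step S4 are assumptions, since the patent does not specify a network architecture.

```python
import cv2
import numpy as np

# Hypothetical wiring of the earlier sketches; "part.png" is a placeholder.
image = cv2.imread("part.png", cv2.IMREAD_GRAYSCALE)
corrected = preprocess(image)                          # step S1
segmented, edges = composite_feature_edges(corrected)  # step S2
ys, xs = np.nonzero(edges)                             # edge pixels as features
points = np.stack([xs, ys], axis=1).astype(float)
labels, centers = kmeans(points, k=3)                  # step S3
# Step S4 would pass the labeled regions to the CNN, training it with
# detection_cost as the loss; omitted here, as no architecture is specified.
```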
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (5)

1. A method for identifying and classifying complex feature targets based on machine vision is characterized by comprising the following steps:
step S1: preprocessing the image: denoising it and compensating the brightness of regions with uneven illumination;
step S2: performing image threshold segmentation with a composite-feature-based target identification algorithm, the resulting pixel subsets forming regions that correspond to real-scene targets; reducing or eliminating image noise; removing the sets of non-edge pixels so that only candidate edges remain; and applying threshold hysteresis with high and low thresholds;
step S3: performing image clustering to label the features efficiently;
step S4: identifying and classifying the features to accomplish target detection.
2. The method for identifying and classifying complex-feature targets based on machine vision according to claim 1, wherein in step S1 the image preprocessing method is: first apply Gaussian filtering to denoise the image, then process the filtered image locally to compensate for uneven illumination; wherein,
the Gaussian denoising uses a two-dimensional zero-mean discrete Gaussian function as the smoothing filter, applied in three passes; the two-dimensional Gaussian function is:

G(x,y) = \frac{1}{2\pi\sigma^2} e^{-\frac{x^2+y^2}{2\sigma^2}}

where σ is the standard deviation;
after the three passes of Gaussian denoising, the filtered image is processed locally to compensate for uneven illumination: the source image is divided into N×M sub-block regions; the average gray value of each sub-block is computed to obtain the sub-block brightness matrix of the source image; the sub-block brightness-difference matrix is enlarged to the size of the source image by cubic interpolation to obtain the brightness distribution matrix of the source image; finally, the brightness distribution matrix is subtracted from the source image to obtain an image free of uneven illumination.
3. The method for identifying and classifying complex-feature objects based on machine vision according to claim 2, wherein in the step S2, the object identification algorithm based on composite features comprises the following steps:
step S201: image threshold segmentation: the set of pixels in the image is partitioned by gray level, and the resulting subsets form regions corresponding to real-scene targets:

g(x,y) = \begin{cases} 1, & f(x,y) \ge T \\ 0, & f(x,y) < T \end{cases}

where T is the set gray-value threshold;
step S202: a Gaussian smoothing filter is used to reduce or eliminate image noise; with f(x,y) the input image, the Gaussian function is defined as:

G(x,y) = \frac{1}{2\pi\sigma^2} e^{-\frac{x^2+y^2}{2\sigma^2}}

where σ is the standard deviation, and the Gaussian-smoothed output image is f_s(x,y) = G(x,y) * f(x,y);
step S203: compute the gradient magnitude and direction, apply non-maximum suppression, and remove the sets of non-edge pixels so that only candidate edges remain;
step S204: apply threshold hysteresis with high and low thresholds, as follows: if the magnitude at a pixel position in the image exceeds the set high threshold, the pixel is kept as an edge pixel; if the magnitude is below the set low threshold, the pixel is not identified as an edge pixel; if the magnitude lies between the two thresholds, the pixel is kept as an edge pixel only if it is connected to a pixel whose magnitude exceeds the high threshold.
4. The method for identifying and classifying complex-feature targets based on machine vision according to claim 3, wherein in the step S3, the method for clustering images comprises the following steps:
step S301: randomly selecting K central points;
step S302: assigning each data point to its nearest center point;
step S303: recalculating each cluster center as the mean of the data points assigned to it;
step S304: reassigning each data point to its nearest center point;
step S305: repeating step S303 and step S304 until the assignments no longer change or the maximum number of iterations is reached.
5. The method for identifying and classifying complex-feature targets based on machine vision according to claim 4, wherein in step S4, the target identification and classification algorithm comprises the following steps:
step S401: presetting anchor frames of different widths and heights, while using t_x, t_y to represent the predicted object position and t_w, t_h to represent the predicted object size;
step S402: finding the anchor frame closest in shape to the detected object, together with the transformation from that anchor frame to the object's bounding rectangle;
step S403: calculating the classification loss with the cross-entropy function; letting the object class label be y and the neural network output be \hat{y}, then:

L_{obj} = -\left[ y \log \hat{y} + (1-y)\log(1-\hat{y}) \right]

where L_{obj} is the classification loss value;
step S404: calculating the localization loss with the mean-square-error function; letting the annotated position be t^* and the neural network output be \hat{t}, then:

L_{xy} = (\hat{t}_x - t^*_x)^2 + (\hat{t}_y - t^*_y)^2
L_{wh} = (\hat{t}_w - t^*_w)^2 + (\hat{t}_h - t^*_h)^2

where t^*_x, t^*_y are the position parameters, t^*_w, t^*_h are the size parameters, L_{xy} is the position loss, and L_{wh} is the size loss;
step S405: multiplying each loss function by its corresponding weight and summing them to obtain the cost function of the network:

L = \lambda_{obj} L_{obj} + \lambda_{xy} L_{xy} + \lambda_{wh} L_{wh}

where the λ terms are the weight parameters.
CN202210935073.4A 2022-08-05 2022-08-05 Complex feature target identification and classification method based on machine vision Pending CN115439675A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210935073.4A CN115439675A (en) 2022-08-05 2022-08-05 Complex feature target identification and classification method based on machine vision

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210935073.4A CN115439675A (en) 2022-08-05 2022-08-05 Complex feature target identification and classification method based on machine vision

Publications (1)

Publication Number Publication Date
CN115439675A 2022-12-06

Family

ID=84242679

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210935073.4A Pending CN115439675A (en) 2022-08-05 2022-08-05 Complex feature target identification and classification method based on machine vision

Country Status (1)

Country Link
CN (1) CN115439675A (en)

Similar Documents

Publication Publication Date Title
CN112819772B (en) High-precision rapid pattern detection and recognition method
CN109035274B (en) Document image binarization method based on background estimation and U-shaped convolution neural network
CN1885317A (en) Adaptive edge detection method based on morphology and information entropy
CN103996209A (en) Infrared vessel object segmentation method based on salient region detection
CN108537816B (en) Salient object segmentation method based on superpixel and background connection prior
CN111583279A (en) Super-pixel image segmentation method based on PCBA
CN116071374B (en) Lane line instance segmentation method and system
CN112686248A (en) Certificate increase and decrease type detection method and device, readable storage medium and terminal
CN110634142B (en) Complex vehicle road image boundary optimization method
CN109308709B (en) Vibe moving target detection algorithm based on image segmentation
CN111027564A (en) Low-illumination imaging license plate recognition method and device based on deep learning integration
CN113205494B (en) Infrared small target detection method and system based on adaptive scale image block weighting difference measurement
CN109636822A (en) A kind of improvement Canny adaptive edge extracting method based on new building subordinating degree function
CN111914749A (en) Lane line recognition method and system based on neural network
CN115439675A (en) Complex feature target identification and classification method based on machine vision
CN113284232B (en) Optical flow tracking method based on quadtree
Aung et al. Study for license plate detection
CN114549649A (en) Feature matching-based rapid identification method for scanned map point symbols
Kumar et al. Pixel-based skin color classifier: A review
CN112419337A (en) Detection method for robot grabbing position under complex background
CN113378876B (en) Method for self-adaptively determining size of target and size of receptive field
Nair et al. A survey of techniques for license plate detection and recognition
Yuan et al. Single Image Defogging Method based on Deep Learning
Kumar et al. Computationally Efficient Scaled Clustering for Perceptually Invisible Image Intensity Tuning
Xiao et al. Road detection based on superpixels and anisotropic heat diffusion

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination