CN112966786A - Automatic marking method for convolutional neural network training data

Info

Publication number: CN112966786A
Application number: CN202110405677.3A
Authority: CN (China)
Prior art keywords: image, neural network, convolutional neural network, training data
Legal status: Pending (the legal status is an assumption and is not a legal conclusion; Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed)
Other languages: Chinese (zh)
Inventors: Li Jingya (李静雅), Wang Dongjie (王东杰), Guo Zhipeng (郭志鹏), Fan Hao (樊昊)
Current assignee: Ningbo Jiuhuan Shichuang Technology Co., Ltd. (the listed assignee may be inaccurate; Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list)
Original assignee: Ningbo Jiuhuan Shichuang Technology Co., Ltd.
Priority date: 2021-04-15 (the priority date is an assumption and is not a legal conclusion)
Filing date: 2021-04-15
Publication date: 2021-06-15

Classifications

    • G06F 18/22 - Pattern recognition; Analysing; Matching criteria, e.g. proximity measures
    • G06F 18/214 - Pattern recognition; Design or setup of recognition systems or techniques; Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G06N 3/045 - Computing arrangements based on biological models; Neural networks; Architecture; Combinations of networks
    • G06N 3/08 - Computing arrangements based on biological models; Neural networks; Learning methods


Abstract

The invention provides an automatic labeling method for convolutional neural network training data, comprising the following steps. Step S1: extract basic defect features. Step S2: feature extraction and sample cropping, including: from the feature mask obtained in step S1, combined with an isolated-domain method, obtain the center position and shape size of each defect body; taking the points in this set as centers, apply basic transform operations to the image to increase the number of samples while cropping samples of a preset specification. Step S3: extract the feature contour of each sample obtained in step S2 and label its category. Step S4: parallel optimization based on the shared-memory parallel system OpenMP.

Description

Automatic marking method for convolutional neural network training data
Technical Field
The invention relates to the technical field of neural network training, in particular to an automatic marking method for convolutional neural network training data.
Background
Training the convolutional neural network models that dominate deep learning requires, in principle, a large volume of labeled images as input data (on the order of tens of thousands or more) to obtain reliable results. In current practice, the input image samples are generally labeled manually, which carries enormous labor and time costs and cannot keep pace with the machine's actual training throughput. This efficiency gap means that the turnaround of deep learning training results is often limited by the speed of manual labeling. Moreover, no quantifiable quality standard exists for manually labeled data: the data volume is huge, rechecking is expensive, and the variability between samples produced by different annotators leads to poor convergence in the actual training computation.
Disclosure of Invention
The object of the present invention is to address at least one of the technical drawbacks described above.
To this end, the invention proposes an automated labeling method for convolutional neural network training data.
In order to achieve the above object, an embodiment of the present invention provides an automatic labeling method for convolutional neural network training data, including the following steps:
step S1, extracting basic defect features;
step S2, feature extraction and sample cropping, including: according to the feature mask obtained in step S1, combined with an isolated-domain method, obtaining the center position and shape size of each defect body, and, taking the points in this set as centers, applying basic transform operations to the image to increase the number of samples while cropping samples of a preset specification;
step S3, extracting the feature contour of each sample obtained in step S2 and labeling its category;
step S4, parallel optimization based on the shared-memory parallel system OpenMP.
Further, the step S1 includes the following steps:
(1) reading in and compressing the image data;
(2) Gaussian filtering: applying Gaussian smoothing to the image data;
(3) image interpolation: interpolating the Gaussian-filtered image data;
(4) image enhancement: enhancing the interpolated image data;
(5) adaptive binarization of the image;
(6) image cleanup.
Further, in the step S2, the isolated-domain method employs skeleton extraction and the watershed method.
Further, in the step S3, the binary-map boundary of each sample obtained in step S2 is obtained directly at the cost of a single iteration, and the defect features are given a contour description and category label and written into the corresponding configuration file.
According to the automatic labeling method for convolutional neural network training data of the embodiments of the invention, conspicuous defect features are extracted automatically and in batches based on Gaussian filtering and the USM method; at the same time, uniform defect training samples are generated at high speed, establishing a fully automatic sample generation mechanism and matching sample labeling efficiency to training computation efficiency. Gaussian filtering is a common noise-smoothing operator; combined with unsharp-mask (USM) sharpening, it can push the contrast of local boundaries to the limit, a property the invention exploits to cleanly pick out conspicuous foreign bodies within an object. In addition, the invention builds on an OpenMP parallel model whose algorithmic core, using cascaded Gaussian filtering, unsharp-mask sharpening, and related methods, can process grayscale images with an average defect size of 3 pixels or more. Through repeated optimization of the data processing framework and the data storage layout, testing shows parallel efficiency scaling essentially linearly up to 16 cores; effective labeling efficiency improves by four orders of magnitude over traditional image labeling.
Additional aspects and advantages of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.
Drawings
The above and/or additional aspects and advantages of the present invention will become apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:
FIG. 1 is a flow diagram of an automated labeling method for convolutional neural network training data, in accordance with an embodiment of the present invention.
Detailed Description
Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the drawings are illustrative and intended to be illustrative of the invention and are not to be construed as limiting the invention.
As shown in fig. 1, the automatic labeling method for convolutional neural network training data according to the embodiment of the present invention includes the following steps:
step S1, extracting basic defect features, which includes the following steps:
(1) reading in and compressing the image data;
(2) Gaussian filtering: applying Gaussian smoothing to the image data;
(3) image interpolation: interpolating the Gaussian-filtered image data;
(4) image enhancement: enhancing the interpolated image data;
(5) adaptive binarization of the image;
(6) image cleanup.
Specifically, the enhancement is based on the unsharp mask (USM) and Gaussian methods; the basic model is as follows:
$$f(x, y) = A \exp\!\left( -\frac{(x - x_0)^2}{2\sigma_x^2} - \frac{(y - y_0)^2}{2\sigma_y^2} \right)$$

$$g(x_0, y_0) = \iint_{(x, y) \in \mathrm{Kernel}} f(x, y)\, O\big( (x_0, y_0) - (x, y) \big)\, dx\, dy$$

$$u(x_0, y_0) = \frac{O(x_0, y_0) - \mathit{weight} \cdot g(x_0, y_0)}{1 - \mathit{weight}}$$

wherein $A$ is the amplitude, $(x_0, y_0)$ is the center-point position, $(\sigma_x, \sigma_y)$ is the variance, $O(x_0, y_0)$ is the original image value at the center position, $g(x_0, y_0)$ is the Gaussian-filtered image value, $u(x_0, y_0)$ is the image value after enhancement, $\mathit{weight}$ is the enhancement ratio, and $\mathrm{Kernel}$ is the convolution kernel.
The convolution kernel radius $r$ is calculated as follows:

$$r_x = \sigma_x \cdot (\log \epsilon)^2 + 1$$
wherein ε = 0.01. In practice it is not necessary to worry about the size of the convolution kernel: a downscaling-and-interpolation scheme is used to obtain the best precision, i.e. the data is first reduced, and then Gaussian blurring and interpolation are applied in sequence. Since image noise generally depends on the environment and the machine itself, its size does not correlate positively with resolution; when the actual samples to be processed have a higher resolution, this scheme simultaneously yields a better denoising effect.
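Concretely, this downscale-filter-interpolate sequence might look as follows in C++ with OpenCV. This is a minimal sketch, not the patent's implementation: the 0.5 scale factor and the sigma default are assumptions, and the radius helper reads the log above as log10 (with ε = 0.01 this gives the familiar r ≈ 4σ + 1 rule; the base of the logarithm is not stated in the text).

    // Sketch of the downscale -> Gaussian -> interpolate sequence described
    // above. The scale factor and sigma are illustrative assumptions.
    #include <cmath>
    #include <opencv2/opencv.hpp>

    // Kernel radius from the formula r_x = sigma_x * (log eps)^2 + 1, reading
    // log as log10; with eps = 0.01, log10(eps) = -2, so r_x = 4*sigma_x + 1.
    int kernelRadius(double sigma, double eps = 0.01) {
        return static_cast<int>(sigma * std::pow(std::log10(eps), 2.0)) + 1;
    }

    cv::Mat smoothViaDownscale(const cv::Mat& gray, double sigma = 1.5) {
        // 1. Reduce the data first: shrink with area averaging, which is
        //    itself a mild denoiser.
        cv::Mat small;
        cv::resize(gray, small, cv::Size(), 0.5, 0.5, cv::INTER_AREA);

        // 2. Gaussian filtering on the smaller image: the kernel stays
        //    compact, so its size never becomes a practical concern.
        int r = kernelRadius(sigma);
        cv::Mat blurred;
        cv::GaussianBlur(small, blurred, cv::Size(2 * r + 1, 2 * r + 1), sigma);

        // 3. Interpolate back to the original resolution.
        cv::Mat restored;
        cv::resize(blurred, restored, gray.size(), 0, 0, cv::INTER_CUBIC);
        return restored;
    }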
As weight tends to 1, u(x_0, y_0) evidently tends to ∞. All emphasized features then concentrate at the image maxima, i.e. where the most visible defects are located. A mask for extracting the defect tissue can therefore be obtained by simple background segmentation, for example binarization methods such as iterative self-organizing analysis (the ImageJ IsoData classifier).
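As a sketch of this enhancement and segmentation step (parameter defaults are assumptions; Otsu's method stands in for the IsoData-style classifier named above, both being histogram-based global binarization schemes):

    // Sketch of the USM enhancement u = (O - weight*g) / (1 - weight)
    // followed by background segmentation into a defect mask.
    #include <opencv2/opencv.hpp>

    cv::Mat usmDefectMask(const cv::Mat& gray, double weight = 0.6,
                          double sigma = 1.5) {
        CV_Assert(weight < 1.0);  // weight -> 1 drives u toward infinity

        cv::Mat o, g;
        gray.convertTo(o, CV_32F);
        cv::GaussianBlur(o, g, cv::Size(0, 0), sigma);  // g(x0, y0)

        // u = (O - weight*g) / (1 - weight): addWeighted computes exactly
        // this linear combination of the original and the blurred image.
        cv::Mat u;
        cv::addWeighted(o, 1.0 / (1.0 - weight), g, -weight / (1.0 - weight),
                        0.0, u);

        // Emphasized features concentrate at the image maxima, so a simple
        // global threshold separates defect pixels from background.
        cv::Mat u8, mask;
        cv::normalize(u, u8, 0, 255, cv::NORM_MINMAX, CV_8U);
        cv::threshold(u8, mask, 0, 255, cv::THRESH_BINARY | cv::THRESH_OTSU);
        return mask;
    }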
Step S2, feature extraction and sample cropping, includes: according to the feature mask acquired in step S1, combined with an isolated-domain method, the center position and shape size of each defect body are acquired; taking the points in this set as centers, basic transform operations are applied to the image to increase the number of samples while samples of a preset specification are cropped.
Specifically, most current mainstream convolutional neural networks require the parameters of the fully connected layers to be fixed, and therefore need a uniform input image size. When the size of the source images and the positions of the defects are not fixed, cropping the images by the usual manual methods costs a great deal of time.
In this step, with the feature mask successfully obtained in step S1, the isolated-domain method yields the center position and shape size of each defect body. Taking the points in this set as centers, basic transform operations can be applied to the image to increase the number of samples while the required sample size is cropped directly.
In step S2, the isolated-domain method employs skeleton extraction and the watershed method.
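A minimal sketch of this step follows, with connected-component analysis used as a simpler stand-in for the skeleton-extraction/watershed isolated-domain method, and an assumed 64x64 sample specification:

    // Sketch of step S2: isolate each defect body, then cut fixed-size
    // samples centered on it, applying a flip as one basic transform to
    // multiply the sample count.
    #include <vector>
    #include <opencv2/opencv.hpp>

    std::vector<cv::Mat> cropDefectSamples(const cv::Mat& gray,
                                           const cv::Mat& mask,
                                           int sample = 64) {
        cv::Mat labels, stats, centroids;
        int n = cv::connectedComponentsWithStats(mask, labels, stats, centroids);

        std::vector<cv::Mat> samples;
        for (int i = 1; i < n; ++i) {  // label 0 is the background
            // Center position of this defect body; its shape size (bounding
            // box and area) is available in the corresponding row of stats.
            cv::Point2d c(centroids.at<double>(i, 0), centroids.at<double>(i, 1));

            // Preset-specification window centered on the defect, clipped to
            // the image bounds so the crop is always valid.
            cv::Rect win(static_cast<int>(c.x) - sample / 2,
                         static_cast<int>(c.y) - sample / 2, sample, sample);
            win &= cv::Rect(0, 0, gray.cols, gray.rows);
            if (win.area() == 0) continue;

            cv::Mat crop = gray(win).clone();
            samples.push_back(crop);

            // Basic transform to increase the number of samples; rotations
            // or flips along other axes could be added the same way.
            cv::Mat flipped;
            cv::flip(crop, flipped, 1);  // horizontal flip
            samples.push_back(flipped);
        }
        return samples;
    }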
In step S3, feature contour extraction is performed on the samples obtained in step S2, and the category is labeled.
Specifically, to match the data-reading interface of an actual training network, the defect features usually need a contour description and a category label. From the computation result of step S2, the binary-map boundary is obtained directly at the cost of a single iteration and written into the corresponding configuration file, avoiding manual drawing work.
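A sketch of this single-pass contour extraction and labeling (the plain-text configuration file layout is an assumption, since the patent does not specify one):

    // Sketch of step S3: trace the binary-map boundary of a cropped sample
    // and write the contour plus category label to a sidecar file.
    #include <fstream>
    #include <string>
    #include <vector>
    #include <opencv2/opencv.hpp>

    void writeContourLabel(const cv::Mat& sampleMask, int category,
                           const std::string& cfgPath) {
        // findContours traces the binary-map boundary in one pass over the
        // image, matching the single-iteration cost described above.
        cv::Mat work = sampleMask.clone();  // findContours may modify input
        std::vector<std::vector<cv::Point>> contours;
        cv::findContours(work, contours, cv::RETR_EXTERNAL,
                         cv::CHAIN_APPROX_SIMPLE);

        std::ofstream cfg(cfgPath);
        cfg << "category " << category << "\n";
        for (const auto& contour : contours) {
            cfg << "contour";
            for (const auto& p : contour) cfg << " " << p.x << "," << p.y;
            cfg << "\n";
        }
    }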
Step S4, parallel optimization based on the shared-memory parallel system OpenMP.
When the defect data originate from the same data volume, the processing parameters are typically shared between images, and the main computational load concentrates in the repeated convolution calculations on each image. Since these per-image computations need no communication with one another, using OpenMP greatly reduces the difficulty and complexity of the actual programming.
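A minimal sketch of this arrangement (the parameter struct and the helper are illustrative placeholders for the steps above):

    // Sketch of step S4: shared read-only parameters, one independent task
    // per image, and no communication between tasks. processOneImage stands
    // for the S1-S3 pipeline sketched above. Compile with -fopenmp.
    #include <string>
    #include <vector>
    #include <opencv2/opencv.hpp>

    struct SharedParams {     // processing parameters shared across the volume
        double weight = 0.6;  // USM enhancement ratio
        double sigma  = 1.5;  // Gaussian standard deviation
    };

    void processOneImage(const std::string& path, const SharedParams& p) {
        cv::Mat img = cv::imread(path, cv::IMREAD_GRAYSCALE);
        if (img.empty()) return;
        cv::Mat blurred;
        cv::GaussianBlur(img, blurred, cv::Size(0, 0), p.sigma);
        // ... remaining S1-S3 steps (USM, binarization, cropping, labeling) ...
    }

    void labelDataset(const std::vector<std::string>& paths,
                      const SharedParams& params) {
        // Each iteration touches only its own image, so a plain parallel-for
        // suffices: no locks and no message passing between threads.
        #pragma omp parallel for schedule(dynamic)
        for (int i = 0; i < static_cast<int>(paths.size()); ++i) {
            processOneImage(paths[i], params);
        }
    }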
In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
Although embodiments of the present invention have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present invention, and that variations, modifications, substitutions and alterations can be made in the above embodiments by those of ordinary skill in the art without departing from the principle and spirit of the present invention. The scope of the invention is defined by the appended claims and equivalents thereof.

Claims (4)

1. An automated labeling method for convolutional neural network training data, comprising the steps of:
step S1, extracting basic defect features;
step S2, feature extraction and sample cropping, including: according to the feature mask obtained in step S1, combined with an isolated-domain method, obtaining the center position and shape size of each defect body, and, taking the points in this set as centers, applying basic transform operations to the image to increase the number of samples while cropping samples of a preset specification;
step S3, extracting the feature contour of each sample obtained in step S2 and labeling its category;
step S4, parallel optimization based on the shared-memory parallel system OpenMP.
2. The automated labeling method for convolutional neural network training data of claim 1, wherein said step S1 comprises the steps of:
(1) reading in and compressing the image data;
(2) Gaussian filtering: applying Gaussian smoothing to the image data;
(3) image interpolation: interpolating the Gaussian-filtered image data;
(4) image enhancement: enhancing the interpolated image data;
(5) adaptive binarization of the image;
(6) image cleanup.
3. The automated labeling method for convolutional neural network training data of claim 1, wherein in said step S2, said isolated-domain method employs skeleton extraction and the watershed method.
4. The automated labeling method for convolutional neural network training data of claim 1, wherein in said step S3, the binary-map boundary of each sample obtained in said step S2 is obtained directly at the cost of a single iteration, and the defect features are given a contour description and category label and written into the corresponding configuration file.
Priority and Publication

Application number: CN202110405677.3A
Title: Automatic marking method for convolutional neural network training data
Priority date / filing date: 2021-04-15
Publication: CN112966786A, published 2021-06-15 (status: pending)
Family ID: 76281455
Country: CN

Citations

* Cited by examiner, † Cited by third party

Patent citations (3):

CN106780491A * (priority 2017-01-23, published 2017-05-31), 天津大学 (Tianjin University): Initial contour generation method for GVF-based segmentation of CT pelvis images
WO2019169772A1 * (priority 2018-03-06, published 2019-09-12), 平安科技(深圳)有限公司 (Ping An Technology (Shenzhen) Co., Ltd.): Picture processing method, electronic apparatus, and storage medium
CN109118466A * (priority 2018-08-29, published 2019-01-01), 电子科技大学 (University of Electronic Science and Technology of China): Processing method for fusing infrared and visible-light images

Non-patent citations (3):

LOVEFIVE55 *: "OpenCV sharpening enhancement algorithm (USM)" (《Opencv-锐化增强算法(USM)》), https://blog.csdn.net/weixin_41709536/article/details/100889849
丑的睡不着 *: "USM sharpening in image processing" (《图像处理之USM锐化》), https://blog.csdn.net/weixin_42026802/article/details/80117403
野犬1998 *: "OpenCV study notes 7 (gradient operators, sharpening)" (《Opencv学习笔记七(梯度算子、锐化)》), https://blog.csdn.net/qq_42319367/article/details/97509807

Similar Documents

CN107341499B (en) Fabric defect detection and classification method based on unsupervised segmentation and ELM
US20200364842A1 (en) Surface defect identification method and apparatus
CN108562589B (en) Method for detecting surface defects of magnetic circuit material
Sammons et al. Segmenting delaminations in carbon fiber reinforced polymer composite CT using convolutional neural networks
CN106709421B (en) Cell image identification and classification method based on transform domain features and CNN
Bong et al. Vision-based inspection system for leather surface defect detection and classification
CN109241867B (en) Method and device for recognizing digital rock core image by adopting artificial intelligence algorithm
CN110827260A (en) Cloth defect classification method based on LBP (local binary pattern) features and convolutional neural network
CN111369526B (en) Multi-type old bridge crack identification method based on semi-supervised deep learning
CN111476794A (en) UNET-based cervical pathological tissue segmentation method
CN113673482B (en) Cell antinuclear antibody fluorescence recognition method and system based on dynamic label distribution
CN113516619B (en) Product surface flaw identification method based on image processing technology
CN116012291A (en) Industrial part image defect detection method and system, electronic equipment and storage medium
TW202202831A (en) A computer implemented process to enhance edge defect detection and other defects in ophthalmic lenses
CN111879972A (en) Workpiece surface defect detection method and system based on SSD network model
CN112200789B (en) Image recognition method and device, electronic equipment and storage medium
CN112381140B (en) Abrasive particle image machine learning identification method based on new characteristic parameters
CN112581483A (en) Self-learning-based plant leaf vein segmentation method and device
CN112966786A (en) Automatic marking method for convolutional neural network training data
CN116433978A (en) Automatic generation and automatic labeling method and device for high-quality flaw image
CN110889858A (en) Automobile part segmentation method and device based on point regression
CN110930369A (en) Pathological section identification method based on group equal variation neural network and conditional probability field
CN110264463A (en) A kind of material counting method based on matlab image procossing
CN115082416A (en) Lens flaw detection method, device, equipment and storage medium
Mathiyalagan et al. Image fusion using convolutional neural network with bilateral filtering

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB03 Change of inventor or designer information

Inventor after: Li Jingya

Inventor after: Wang Dongjie

Inventor after: Fan Hao

Inventor before: Li Jingya

Inventor before: Wang Dongjie

Inventor before: Guo Zhipeng

Inventor before: Fan Hao

CB03 Change of inventor or designer information
RJ01 Rejection of invention patent application after publication

Application publication date: 20210615

RJ01 Rejection of invention patent application after publication