CN110598752A

CN110598752A - Image classification model training method and system for automatically generating training data set

Info

Publication number: CN110598752A
Application number: CN201910759146.7A
Authority: CN
Inventors: 刘骏; 张啸宇
Original assignee: Shenzhen Yujun Vision Intelligent Technology Co Ltd
Current assignee: Shenzhen Yujun Vision Intelligent Technology Co Ltd
Priority date: 2019-08-16
Filing date: 2019-08-16
Publication date: 2019-12-20
Anticipated expiration: 2039-08-16
Also published as: CN110598752B

Abstract

The invention discloses an image classification model training method and system for automatically generating a training data set, which comprises the following steps: building a key feature learning model, and training the feature learning model by using an original training set consisting of pictures only labeled with key features of objects; capturing target features of each picture in the original training set, and automatically labeling the captured features according to the classification of the captured features to generate key feature data; training key characteristic data; and after the key characteristic data are trained, feeding back the pictures with the wrong classification to the characteristic learning model, and putting the pictures with the wrong classification into a training set of the characteristic learning model for iterative upgrade. The invention solves the problems of deep learning model training at present, and overcomes the problems of irrelevant key direction guiding of data to model training, disordered data labeling, wrong data classification, low data sorting efficiency, difficulty in meeting the production requirement of model accuracy rate and the like under the large data training at present.

Description

Image classification model training method and system for automatically generating training data set

Technical Field

The invention relates to the technical field of image vision, in particular to an image classification model training method and system for automatically generating a training data set.

Background

With the continuous development of industrial production and the continuous increase of material demand of people, the yield of various electronic consumer products is increased year by year, and the annual yield of some enterprise consumer electronic products exceeds 2 hundred million. With the continuous expansion of the yield, the enterprise has stronger and stronger requirements on automatic product appearance detection. In addition to traditional automation enterprises, a small number of enterprises related to the AI also start to be laid out on the appearance detection technology, but because the development of the AI is still in a starting stage, the accuracy of the current image recognition of the whole industry is about 90%, and the generalization is poor, so that the industry is difficult to fall to the ground.

The existing deep learning model needs big data to train the model, data collection, data arrangement and data labeling in the early stage need to be completed manually, a large amount of human resources and time are consumed, and the situations of classification errors and labeling errors occur sometimes. And because the training data amount is huge, most of the training data of the model only simply classify and label the original pictures. The above situations directly result in poor final training effect, research and development personnel need to spend a lot of time to collect more data for labeling, and repeated experiments prove that consumed resources are quite huge. Even if the test requirement is finally met, the product price is high due to the large research and development cost in the early stage, and the popularization of the AI technology is not facilitated.

Disclosure of Invention

In view of the above-mentioned defects of the prior art, the technical problem to be solved by the present invention is to provide an image classification model training method and system for automatically generating a training data set, wherein a batch of key image features are screened out by using the experience of an expert, then a key feature learning model is used to capture key features of original data, and finally the generated accurate training data set is used for training an image classification model, so that the model can quickly learn the key features in a real sense and improve the weight value thereof, thereby achieving the purpose of quickly and highly completing the training task of the model.

In order to achieve the above object, the present invention provides an image classification model training method for automatically generating a training data set, comprising:

step 1, building a key feature learning model, and training the feature learning model by using an original training set consisting of pictures only labeled with key features of objects;

step 2, capturing target features of each picture in the original training set, and automatically labeling the captured features according to the classification of the captured features to generate key feature data;

step 3, training key characteristic data;

and 4, after the key feature data are trained, feeding back the pictures with the wrong classification to the feature learning model, and putting the pictures with the wrong classification into a training set of the feature learning model for iterative upgrading.

Further, the step 1 specifically comprises:

1) performing convolution operation on the image by utilizing a WideResNet network to extract image characteristics, wherein the image characteristics are used for subsequent candidate area screening and classification;

2) screening a foreground candidate area with the maximum probability based on image characteristics extracted by a 3 x 3 convolution layer and a softmax full link layer on the basis of WideResNet;

3) classifying the candidate regions by using the image features extracted in the step 1);

4) and finishing the correction of the position of the candidate region in a linear regression mode.

Further, the step 2 specifically includes:

1) and (3) automatic generation: returning coordinate axes of the key feature region relative to the upper left corner (x 1, y 1) and the lower right corner (x2, y2) of the original image by the feature learning model, and automatically generating a local key feature picture according to the coordinate axes;

2) automatic labeling: automatically labeling the original picture and the local key feature picture according to the identified key feature category to realize automatic labeling of training data;

3) automatic data expansion: according to the quantity comparison condition of the generated various training data, automatic data expansion is carried out on the classification with less data so as to achieve the following conditions that the various data 1: 1, automatic data expansion is performed on the original picture in a random cutting mode.

Further, while training the key feature data in step 3, the method further includes: and storing the updated weights after all batches are trained, restoring the weight of each batch to key feature learning, evaluating the weight by using pre-imported test data, recording the accuracy corresponding to the weight of each batch, and automatically recommending the weight which is ranked in the front after all batches are tested.

Further, in the step 4: and automatically adding the pictures judged to be wrong into a training set in the process of normally grabbing the key features, automatically training on the basis of the model with the current optimal weight, comparing whether the accuracy of the trained model exceeds the accuracy of the current weight, and if so, automatically importing the latest model and continuously iterating the model.

An image classification model training system that automatically generates a training data set, comprising:

the key feature learning module is used for training the feature learning model by utilizing an original training set consisting of pictures only labeled with key features of the object;

the key feature data automatic generation module is used for capturing the target features of each picture in the original training set, automatically labeling the captured features according to the classification of the captured features and generating key feature data;

the model automatic training module is used for training the key characteristic data;

and the key characteristic learning model automatic iteration module is used for feeding back the pictures with the wrong classification to the characteristic learning model after the key characteristic data is trained, and putting the pictures with the wrong classification into a characteristic learning model training set for iterative upgrade.

Further, the key feature learning module is specifically configured to:

Further, the key feature data automatic generation module is specifically configured to:

Further, the model auto-training module is further configured to:

and storing the updated weights after all batches are trained, restoring the weight of each batch to key feature learning, evaluating the weight by using pre-imported test data, recording the accuracy corresponding to the weight of each batch, and automatically recommending the weight which is ranked in the front after all batches are tested.

Further, the key feature learning model automatic iteration module is specifically configured to: and automatically adding the pictures judged to be wrong into a training set in the process of normally grabbing the key features, automatically training on the basis of the model with the current optimal weight, comparing whether the accuracy of the trained model exceeds the accuracy of the current weight, and if so, automatically importing the latest model and continuously iterating the model.

The invention has the beneficial effects that:

according to the invention, by building a model training system which guides and learns key features of images and automatically generates a precise training data set, a plurality of problems of deep learning model training at present are solved in principle, and the problems that under the large data training at present, data guide to the direction of a key irrelevant to the model training, data labeling is disordered, data classification is wrong, data sorting efficiency is low, the model accuracy rate hardly meets the production requirement and the like are solved; meanwhile, due to the sustainable self-iterative upgrade, the early data processing work of the system can theoretically achieve zero errors. Training data is greatly optimized, and meanwhile, the system is based on modular (SDK) management, is convenient to integrate, simple to popularize and wide in application range.

The conception, the specific structure and the technical effects of the present invention will be further described with reference to the accompanying drawings to fully understand the objects, the features and the effects of the present invention.

Drawings

FIG. 1 is a flow diagram of a key feature learning module of the present invention;

FIG. 2 is an architecture diagram of the key feature learning module of the present invention;

FIG. 3 is a system architecture diagram of the present invention;

FIG. 4 is a flow diagram of a key feature learning model of the present invention;

FIG. 5 is a schematic diagram illustrating the correction of the position of the candidate region by linear regression according to the present invention.

Detailed Description

The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it should be understood that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments, and it should be understood that the present application is not limited to the example embodiments disclosed and described herein. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.

The invention provides an image classification model training method for automatically generating a training data set, which comprises the following steps:

step 3, training key characteristic data;

Wherein, the step 1 specifically comprises the following steps:

Wherein, the step 2 specifically comprises the following steps:

Wherein, when training the key feature data in step 3, the method further comprises: and storing the updated weights after all batches are trained, restoring the weight of each batch to key feature learning, evaluating the weight by using pre-imported test data, recording the accuracy corresponding to the weight of each batch, and automatically recommending the weight which is ranked in the front after all batches are tested.

Wherein, in the step 4: and automatically adding the pictures judged to be wrong into a training set in the process of normally grabbing the key features, automatically training on the basis of the model with the current optimal weight, comparing whether the accuracy of the trained model exceeds the accuracy of the current weight, and if so, automatically importing the latest model and continuously iterating the model.

The invention also provides an image classification model training system for automatically generating a training data set, which comprises:

Wherein the key feature learning module is specifically configured to:

The key feature data automatic generation module is specifically configured to:

Wherein the model auto-training module is further configured to: and storing the updated weights after all batches are trained, restoring the weight of each batch to key feature learning, evaluating the weight by using pre-imported test data, recording the accuracy corresponding to the weight of each batch, and automatically recommending the weight which is ranked in the front after all batches are tested.

The key feature learning model automatic iteration module is specifically configured to: and automatically adding the pictures judged to be wrong into a training set in the process of normally grabbing the key features, automatically training on the basis of the model with the current optimal weight, comparing whether the accuracy of the trained model exceeds the accuracy of the current weight, and if so, automatically importing the latest model and continuously iterating the model.

The following specifically discusses the principles of the image classification model training method and system for automatically generating a training data set according to the present invention:

the whole process comprises the steps of firstly screening some key features of an image by using the experience of an expert, enabling a feature learning model to learn the key features (small data amount learning (500-. The training system can enable the feature learning model to automatically generate key feature training data, guide the trained image classification model to quickly learn the real key feature images, extract more fine key features by matching with the self characteristics of the deep learning model, reasonably distribute weighted values of various features, quickly converge and greatly improve the accuracy.

In order to achieve the purpose, the scheme of the invention is as follows: a key feature learning model (KFSM) is built on the basis of fast RCNN + WideResNet, as shown in FIG. 4. A small number (500) of pictures marked with key features of objects (in this example, crush injury, poor clamping, exposed inner cores and good products) are used for training the model, so that the model learns the correlation between the key feature classification and the picture features, and the model has the capability of identifying and extracting the appearance key features. And then, the feature learning model is utilized to capture the target features of each image in the original training set, the captured features are automatically labeled according to the classification of the captured features, and finally, the learning model randomly rotates the original data and the feature data, adjusts the contrast and the brightness, and finally, the image quality is adjusted according to the following steps of 1: the ratio of 2 automatically generates the final training data, and then automatically calls the model for training.

After model training is completed, an evaluation system is used for comparing the accuracy of test data of the model, the wrongly classified pictures are fed back to the KFSM model, and the KFSM model can place the wrongly classified pictures into a training set of the KFSM model for iterative upgrading. The technology realizes the closed loop of key feature learning, key feature capturing, automatic data labeling and training and iterative whole model training, and the whole process is automatically completed by the system.

As shown in fig. 3, the present invention is composed of the following modules:

1. a key feature learning module; 2. a key characteristic data automatic generation module; 3. a model automatic training module; 4. the key feature learning model is an automatic iteration module, and each module is introduced below.

1. Key feature learning module

The core of the module is a key feature learning model built based on a FasterRCNN target detection model and a WideResNet classification model, the model consists of a candidate region generation module, four modules of feature extraction, classification and position correction, the flow chart of the model is shown in figure 1, and the architecture diagram is shown in figure 2.

1) The image is convolved with a wideResNet (CNN 1-CNN 13 layer) network to extract image features, which are used for subsequent candidate region screening and classification.

The principle of the method is that a large number of object pictures marked with types are used for training a model, picture features are extracted layer by layer through a plurality of layers of CNNs (convolutional neural networks), loss values are obtained through difference values of prediction marks and training marks, the weight causing the maximum loss is measured and updated according to reverse derivation of the loss values, the model gradually converges to obtain the optimal weight of each feature value in a large number of cyclic training, and training is completed. The model can carry out convolution one by one on the input image in work, and a new feature map can be formed and output after the features are found.

2) And screening 300 foreground candidate regions with the highest probability based on image features extracted by the WideResNet through the 3 x 3 convolution layer and the softmax full link layer.

3) And classifying the candidate regions by using the image features extracted in the first step.

4) The position of the candidate region is corrected by linear regression, as shown in fig. 5. The target detection adopts a rectangular box to screen out the target, so that four-dimensional vector expressions (x, y, w, h) are used for respectively representing the coordinates of the central point of the box, the width and the height. The sparse dotted line box represents the original check area, the dense dotted line box represents the correct area, and the original check area is adjusted through translation and scaling to be closer to the correct area. The original region is O (Ox, Oy, Ow, Oh), the correct region is T (Tx, Ty, Tw, Th), O gets the target region O1 through a linear transformation C, let O1 approach T infinitely,

C(Ox,Oy,Ow,Oh)=(O1x,O1y,O1w,O1h)≈(Tx,Ty,Tw,Th)；

translation formula: o1x = oxdx (O) + Ox, O1y = oydy (O) + Oy;

scaling formula: o1w = Owexp (dw (O)), O1h = oxexp (dh (O));

the key points are that dx (O), dy (O), dw (O) and dh (O) are obtained by linear regression learning.

Inputting a feature vector = X (features extracted by convolution), learning parameters = Z (transformation amount (tx, ty, tw, th) from an original region to a correct region), outputting = Y (dx (O), dy (O), dw (O), dh (O)), and Y = ZX.

In the training process of the model, all classified and labeled key feature images in a specified file are read (pre-screened by experts), then pictures in various formats are converted into JPG formats with uniform sizes (300 x 300 in the example), and the blank spaces of the reduced pictures are automatically filled with black. And writing the information of each pixel point of the converted picture into a record file, recording the corresponding marking information of the record file, and determining the weight value of each key feature by training a model (in the example, each type of 200 original samples is automatically expanded to 2000 samples in a mode of automatic rotation, brightness adjustment and random cutting).

2. Key characteristic data automatic generation module

After the training of the key feature learning model is completed, the generated model (pb file) is automatically used in a key feature capture module to capture key features of all original images, which specifically comprises the following steps:

1) and (3) automatic generation: the model returns the coordinate axes of the key feature region relative to the upper left corner (x 1, y 1) and the lower right corner (x2, y2) of the original image, and a local key feature picture is automatically generated according to the coordinate axes.

2) Automatic labeling: the system automatically marks the original picture and the local key feature picture according to the identified key feature category, so as to realize the automatic marking of the training data.

3) Automatic data expansion: the system can automatically expand data of the classes with less data according to the quantity comparison condition of the generated various types of training data so as to achieve the following conditions that the various types of data 1: 1. Automatic data expansion is performed on an original picture in a random cutting mode.

3. Model automatic test evaluation module

The system stores the updated weights after training for all batches of the model (each batch refers to the fixed cycle times of all training data, and the cycle times can be set as 10 in this example), restores the weights of each batch to the model, evaluates the weights with the pre-imported test data, and records the accuracy corresponding to the weights of each batch. After all the batch weights are tested, the system automatically recommends the top-ranked (top ten in this example) weight.

4. Automatic iteration module of key feature learning model

And automatically adding the pictures judged to be wrong into a training set by the model in the process of normally grabbing the key features, automatically training on the basis of the model with the current optimal weight, comparing whether the accuracy rate of the trained model exceeds the accuracy rate of the current weight, and if so, automatically importing the latest model and continuously iterating the model.

In conclusion, the design of the invention introduces the concepts of target detection, deep learning, FasterRCNN, CNN, automatic labeling, automatic iteration and WideResNet, so that the key characteristic sorting of mass data becomes possible (theoretically, the data has no upper limit), and the time for sorting and labeling the data is greatly improved. Because the data contains the key characteristics defined by human (expert) and having definite significance, the convergence speed and the generalization capability of the model training are obviously improved. The invention fundamentally solves the problems of manpower requirement, wrong labeling, disordered classification, low efficiency and the like caused by the requirement on a large amount of training data in the deep learning model training at present. The training data are efficiently optimized, and the accuracy can be greatly improved in a vertical subdivision scene.

It is further noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. The term "comprising", without further limitation, means that the element so defined is not excluded from the group consisting of additional identical elements in the process, method, article, or apparatus that comprises the element.

The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present application. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the application. Thus, the present application is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims

1. An image classification model training method for automatically generating a training data set, comprising:

step 3, training key characteristic data;

2. The method for training an image classification model for automatically generating a training data set according to claim 1, wherein the step 1 specifically comprises:

3. The method for training an image classification model for automatically generating a training data set according to claim 1, wherein the step 2 specifically comprises:

4. The method as claimed in claim 1, wherein the step 3 of training the key feature data further comprises: and storing the updated weights after all batches are trained, restoring the weight of each batch to key feature learning, evaluating the weight by using pre-imported test data, recording the accuracy corresponding to the weight of each batch, and automatically recommending the weight which is ranked in the front after all batches are tested.

5. The method for training an image classification model to automatically generate a training data set according to claim 1, wherein in the step 4: and automatically adding the pictures judged to be wrong into a training set in the process of normally grabbing the key features, automatically training on the basis of the model with the current optimal weight, comparing whether the accuracy of the trained model exceeds the accuracy of the current weight, and if so, automatically importing the latest model and continuously iterating the model.

6. An image classification model training system that automatically generates a training data set, comprising:

7. The image classification model training system for automatically generating a training data set according to claim 6, wherein the key feature learning module is specifically configured to:

8. The image classification model training system for automatically generating a training data set according to claim 6, wherein the key feature data automatic generation module is specifically configured to:

9. The image classification model training system for automatically generating a training data set according to claim 6, wherein the model automatic training module is further configured to:

10. The image classification model training system for automatically generating a training data set according to claim 6, wherein the key feature learning model automatic iteration module is specifically configured to: and automatically adding the pictures judged to be wrong into a training set in the process of normally grabbing the key features, automatically training on the basis of the model with the current optimal weight, comparing whether the accuracy of the trained model exceeds the accuracy of the current weight, and if so, automatically importing the latest model and continuously iterating the model.