CN110046666A

CN110046666A - Mass picture mask method

Info

Publication number: CN110046666A
Application number: CN201910312598.0A
Authority: CN
Inventors: 何志权; 许琦; 何志海
Original assignee: Shenzhen Deep View Creative Technology Ltd
Current assignee: Shenzhen Deep View Creative Technology Ltd
Priority date: 2019-04-18
Filing date: 2019-04-18
Publication date: 2019-07-23
Anticipated expiration: 2039-04-18
Also published as: CN110046666B

Abstract

The present invention provides a kind of mass picture mask methods, comprising: step 1, establishes initial model according to the markup information to multiple pictures；Step 2, it is predicted using initial model to not marking picture；Step 3, Confidence Analysis is carried out to the prediction result in step 3, to pick out reliable prediction；Step 4, the prediction result of reliable prediction is replaced into markup information；Step 5, the initial model, 1 iteration of return step are updated using markup information obtained in step 4.The present invention can fast and effeciently mark mass picture, to solve the real bottleneck problem of labeled data in deep learning.

Description

Mass picture mask method

Technical field

The present invention relates to picture label technology field, in particular to a kind of mass picture mask method.

Background technique

Currently, deep learning has obtained increasingly extensive application in academia and industry.But deep learning model is deposited In the resource constraint problem of three aspects: the data volume of model training is big, if depending on network transmission, network bandwidth will become One bottleneck；A large amount of computing resource, deep learning model parameter is more, and calculating process is complicated, huge so as to cause calculation amount；It needs Want the model that a large amount of labeled data is complicated with training.

Labeled data is a cumbersome thing, is usually depicted point by point point by point with paintbrush alignment target, such as Fig. 1 institute Show, wherein (a) is original image, it is (b) picture after mark.As it can be seen that existing mode is time-consuming and laborious, and annotation process In, artificial factor and difference cause the effect of mark often undesirable.

The picture of mark magnanimity is even more to expend a large amount of manpower and material resources, and still, current one kind not yet is quickly and effectively marked The method for infusing mass picture.

Summary of the invention

The present invention provides a kind of mass picture mask methods, to solve at least one above-mentioned technical problem.

To solve the above problems, providing a kind of mass picture mask method as one aspect of the present invention, comprising: Step 1, initial model is established according to the markup information to multiple pictures；Step 2, it is carried out using initial model to picture is not marked Prediction；Step 3, Confidence Analysis is carried out to the prediction result in step 3, to pick out reliable prediction；Step 4, may be used Prediction result by prediction is replaced into markup information；Step 5, the introductory die is updated using markup information obtained in step 4 Type, 1 iteration of return step.

Preferably, the markup information in step 1 is obtained by way of manually marking.

Preferably, step 3 includes: the confidence level function that setting is used for assessment prediction reliability, which utilizes original graph As the output vector of the softmax of information and model layer assesses prediction result.

Preferably, step 3 includes: to judge that each point in prediction result calculates it using pixel in the field 4x4 around it Joint match degree between the output vector of the softmax of the information such as variance, the gradient of contrast and position model layer, such as Fruit is joint match according to preset confidence threshold value, then shows that prediction is reliable.

Preferably, the joint match is portrayed by neural network or support vector machines.

Preferably, updating the initial model using markup information obtained in step 4 includes: to set initial model as θ_t, The model that last time uses is θ '_t-1, then new model is θ=α θ '_t-1+(1-α)θ_t, wherein α is coefficent of exponential smoothing.

By adopting the above-described technical solution, the present invention can fast and effeciently mark mass picture, to solve deep learning The real bottleneck problem of middle labeled data marks a small amount of picture first, and one deep learning model of training, then utilizing should Model is predicted the remaining picture not marked, is assessed the result of prediction, accurate prediction is converted into mark, then The accuracy rate of re-optimization model, such loop iteration, model is higher and higher, and the accuracy rate of mark is also higher and higher.

Detailed description of the invention

Fig. 1 schematically shows the schematic diagram being labeled in the prior art to picture；

Fig. 2 schematically shows flow charts of the invention；

Fig. 3 schematically shows the schematic diagram that reliable prediction result is selected according to confidence level.

Specific embodiment

The embodiment of the present invention is described in detail below in conjunction with attached drawing, but the present invention can be defined by the claims Implement with the multitude of different ways of covering.

Assuming that there is an initial model, if it's not true, can taking human as a small amount of picture of mark, one introductory die of training Type.For convenience following explanation, we with defects detection as an example, as shown in Figure 1.

Step 1: predicting the data not marked with initial model.

Step 2: Confidence Analysis is carried out to the result of prediction.Because the model performance of step 1 is unsatisfactory at this time, prediction Result out alternates betwwen good and bad, and therefore, seeks to pick out reliable prediction to the purpose that prediction result does Confidence Analysis.

Confidence level functionFor assessment models forecasting reliability.The function utilizes original image information I and mould The output vector of the softmax layer of type assesses the curved portion in (b) of Fig. 1.It is specific as follows, to each curve On point, we calculate the variance of its contrast using pixel in the field 4x4 around it, the information such as gradient, and in this position, It is strong and weak that the softmax layer output of model represents the defect for predicting.Both information are if being identical matching, table Bright prediction is reliable.This mapping relations can pass through a simple neural network or support vector machines (support Vector machine) it portrays.

Step 3: crucial samples selection.Step 2 has done Confidence Analysis to all predictions, accordingly, so that it may select Reliable prediction.We can set a confidence threshold value T, when threshold value is greater than T, show that the prediction is reliable, retain, otherwise Give up, as shown in figure 3, the forecast confidence in the top box in (b) is low, is rejected, the reservation in following box.

Step 4: prediction result is converted to markup information.On very big probability, the prediction result with high confidence level Meet with artificial mark, can switch to for markup information.

Step 5: more new model.Using more markup informations, we can train a model θ_t, last time use Model be θ '_t-1, then new model is θ=α θ '_t-1+(1-α)θ_t, wherein α is coefficent of exponential smoothing.The model compares front Model have better estimated performance.

There is updated model, we return to step 1, the part not marked are predicted, meanwhile, it updates existing Prediction.The performance of such iteration, model can become better and better, and the quality of mark also steps up.

The foregoing is only a preferred embodiment of the present invention, is not intended to restrict the invention, for the skill of this field For art personnel, the invention may be variously modified and varied.All within the spirits and principles of the present invention, made any to repair Change, equivalent replacement, improvement etc., should all be included in the protection scope of the present invention.

Claims

1. a kind of mass picture mask method characterized by comprising

Step 1, initial model is established according to the markup information to multiple pictures；

Step 2, it is predicted using initial model to not marking picture；

Step 3, Confidence Analysis is carried out to the prediction result in step 3, to pick out reliable prediction；

Step 4, the prediction result of reliable prediction is replaced into markup information；

Step 5, the initial model, 1 iteration of return step are updated using markup information obtained in step 4.

2. mass picture mask method according to claim 1, which is characterized in that the markup information in step 1 passes through people The mode of work mark obtains.

3. mass picture mask method according to claims 1 and 2, which is characterized in that step 3 includes: to be arranged for commenting Estimate the confidence level function of predicting reliability, which utilizes the output vector pair of the softmax layer of original image information and model Prediction result is assessed.

4. mass picture mask method according to claim 3, which is characterized in that step 3 includes: to judge in prediction result Each point calculate the information such as the variance of its contrast, gradient and the position model using pixel in the field 4x4 around it Joint match degree between softmax layers of output vector, if being joint match according to preset confidence threshold value, Show that prediction is reliable.

5. mass picture mask method according to claim 4, which is characterized in that the joint match passes through neural network Or support vector machines is portrayed.

6. mass picture mask method according to claim 4, which is characterized in that believed using being marked obtained in step 4 Breath updates the initial model

If initial model is θ_t, the model that the last time uses is θ '_t-1, then new model is θ=α θ '_t-1+(1-α)θ_t, wherein α For coefficent of exponential smoothing.