CN109934803A

CN109934803A - A method of scoring for mobile terminal shooting low resolution picture based on deep learning

Info

Publication number: CN109934803A
Application number: CN201910144106.1A
Authority: CN
Inventors: 袁子意
Original assignee: Shanghai City Poetry Mdt Infotech Ltd
Current assignee: Shanghai City Poetry Mdt Infotech Ltd
Priority date: 2019-02-27
Filing date: 2019-02-27
Publication date: 2019-06-25

Abstract

Deep learning is based on the invention discloses one kind and shoots low resolution picture scoring method for mobile terminal, and this method, which specifically includes that, collects various picture and generate label using user's marking data, forms a training data set；It constructs lightweight depth and rolls up machine neural network；Using training dataset, with image processing techniques, part of data are processed into and meet mobile terminal shooting picture distribution, training deep neural network finally utilizes neural network model, shoots to obtain low resolution picture and give a mark for mobile end equipment.The present invention by processing training set picture distribution with adapt to mobile terminal shoot low resolution picture distribution and analog subscriber really give a mark be distributed provide scoring, appraisal result is more close to human vision.

Description

It is a kind of that the scoring of low resolution picture is shot for mobile terminal based on deep learning Method

Technical field

The method for mobile terminal shooting low resolution picture scoring based on deep learning that the present invention relates to a kind of, belongs to Technical field of image processing.

Background technique

Currently, picture AI auto-scoring more and more attention has been paid to.Determine that the quality of a picture there are many elements, such as: Saturation degree, contrast, tone, composition, these abstract elements are difficult to be found by layman, to get objective a, conjunction The score of reason.Traditional picture scoring method is based primarily upon machine learning frame, and manual extraction artificial design features finally use Classification or regression model provide score, although the method fast speed, the hand-designed feature designed according to priori knowledge is past Toward the poor bottleneck of accuracy can be encountered, it is extremely difficult to optimal result, is typically only capable to obtain the result of suboptimum.Deep learning can be with Characteristics of image is automatically extracted, simple feature is combined into more complicated advanced features, thus realize image classification, mesh The complex tasks such as mark detection.Currently, volume machine neural network (CNN) has been widely used.By a large amount of training data, machine mind is rolled up It can learn feature largely hiding in data out through network and get corresponding score in conjunction with these features.However deep learning Technology equally exists following problem: (1) training data count issue: often want to obtain an extensive effect it is preferable as a result, It needs to be trained in mass data, causes the training time long, hsrdware requirements are high, and computation complexity is high, while how a large amount of Obtaining the mass data with label also becomes one of difficult point；(2) briefly feature identification is selected as classification or regression problem Cross entropy is selected as loss function to train depth network, for there are successional problems between each feature, cannot be obtained Better effects；(3) deeper feature can not be extracted, the depth emotion information expressed such as picture.Currently, there are many bases for market In the picture scoring model of CNN, however wherein most model is directed to open source data set offer data set and is trained, these Data set enormous amount, but common ground is to be taken from Professional Photography website, it is higher so as to cause picture quality, it does not meet and actually answers With under scene, the case where the low quality picture that user is shot with mobile end equipment, so as to cause model to low quality picture point Cloth bad adaptability, so applicability and practicability are restricted, it is difficult to meet the needs of market.

Summary of the invention

The present invention be directed to the shortcomings of the prior art, provide a kind of low for mobile terminal shooting based on deep learning The method of resolution chart scoring, appraisal result more close to human vision, meet actual operation requirements.

To solve the above problems, the technical solution used in the present invention is as follows:

A method of scoring for mobile terminal shooting low resolution picture based on deep learning, this method includes as follows Step:

Step 1: data collection；

Step 2: according to picture quality, manual sort, the distribution of analog subscriber number carries out section distribution；

Step 3: the picture in training set being done into Gaussian Blur in different Gaussian kernel sizes, and is properly added Gauss respectively and makes an uproar Sound and salt-pepper noise；

Step 4: construction depth neural network；

Step 5: pretreatment training set picture is inputted in neural network and is trained；

Step 6: according to neural network model, predicted pictures score.

As an improvement of the above technical solution, in step 1, it is collected pictures, will be collected pictures point by sorting key word Preferably, in, poor three classes, using data of the part open source data set AVA in photonet, because of open source data set often picture matter Amount is higher than the quality that collects pictures manually, needs to be allowed to picture from tape label in appropriate adjustment data set to be distributed in crawl picture point Cloth substantially coincide.

As an improvement of the above technical solution, in step 2, artificial marking, fraction range are 0-10 points, are finally obtained Label is the number given a mark under each different score sections, and since human cost is limited, this stamp methods is only applicable to fraction number According to；It collects pictures for other, using artificial scoring method is simulated, fraction range is 0-10 points, and the label finally obtained is every The number given a mark under a difference score section, other, which collect pictures, is divided into artificial decision procedure by good, middle preference, in, difference partially in, Poor 5 class is simulated artificial marking mode, will be got well, middle preference, in, large deviations, poor 5 class is converted to the by stages 0-10, the specific steps are Good picture reciprocal fraction section is 8 to 10 points, middle 7 to 8 points of preference picture reciprocal fraction section, and it is 5-7 that middle picture, which corresponds to section, Point, the partially middle corresponding section of difference is 3 to 5 points, and it is 1 to 3 point that poor picture, which corresponds to section, and the quasi- side group of this interval division is known in certain priori Know.

As an improvement of the above technical solution, in step 3, picture in public data source is subjected to gamma correction, not Gaussian Blur is done with Gaussian kernel size, is suitably added Gaussian noise and salt-pepper noise, realizes data enhancing.

As an improvement of the above technical solution, in step 4, it is inputted entering data into deep learning neural network Before, it needs to make picture a large amount of pretreatment, is mostly that quality is higher due to obtaining picture, the higher picture of resolution ratio, and really answer With low, the lower picture of resolution ratio in the situation quality that be mostly user shot with mobile phone, training data need to be needed to do it is certain before place Reason is distributed the distribution of training data picture with the picture under real scene use as far as possible using the methods of Gaussian Blur close, Make model that can accomplish to adapt to low resolution picture, is trained in the picture input neural network after handling well, loss function It is prediction distribution in actual distribution mean square deviation；And test set is constructed to select the smallest model of loss function value for final mould Type.

As an improvement of the above technical solution, in step 5, test set construction step are as follows: (1) collect user and set with mobile terminal Standby shooting actual picture；(2) it is manually given a mark to test set data；(3) selection loss function value on test set is the smallest Model is final mask.

Compared with prior art, implementation result of the invention is as follows by the present invention:

The present invention adapts to the distribution of mobile terminal shooting low resolution picture by the distribution of processing training set picture and simulation is used Family, which really gives a mark to be distributed, provides scoring, and appraisal result more close to human vision, meets actual operation requirements.

Detailed description of the invention

Fig. 1 is the stream of the method that the scoring of low resolution picture is shot for mobile terminal of the present invention based on deep learning Journey structural block diagram.

Specific embodiment

Illustrate the contents of the present invention below in conjunction with specific embodiments.

The scoring of low resolution picture is shot for mobile terminal based on deep learning to be of the present invention as shown in Figure 1: The Structure and Process block diagram of method.

A kind of picture scoring method based on deep learning, method include the following steps

Step 1: data collection

Step 3: the picture in training set being subjected to gamma correction, different Gaussian kernel sizes do Gaussian Blur, and fit respectively When addition Gaussian noise and salt-pepper noise, data enhancing is realized；

Step 4: construction depth neural network；

Step 5；Training set picture is pre-processed, inputs in neural network and is trained；

Step 6 is according to neural network model, predicted pictures score.

In step 1: it is collected pictures by sorting key word, is divided into collecting pictures, middle preference, in, poor three classes make With data of the part open source data set AVA in photonet, however increase income data set often picture quality than collecting figure manually Tablet quality is high, it is therefore desirable to from tape label in appropriate adjustment data set, be allowed to picture and be distributed in the distribution that collects pictures substantially to coincide.

In step 2: artificial marking, for fraction range for 0-10 points, the label finally obtained is under each different score sections The number of marking, however since human cost is limited, this stamp methods is suitable for fraction data；Figure is collected for other Piece, the present invention is using artificial scoring method is simulated, and for fraction range for 0-10 points, the label finally obtained is each different score sections The number of lower marking, network are crawled picture and are divided into artificial decision procedure by good, middle preference, in, difference partially in, poor 5 class, simulate people Work marking mode, will get well, middle preference, in, large deviations, poor 5 class is converted to the by stages 0-10；Specific steps preferably divide by picture correspondence Number interval is 8 to 10 points, middle 7 to 8 points of preference picture reciprocal fraction section, and it is 5-7 points that middle picture, which corresponds to section, and difference is partially middle corresponding Section is 3 to 5 points, and it is 1 to 3 point that poor picture, which corresponds to section, and the quasi- side group of this interval division is in certain priori knowledge.

In step 3: picture in public data source is subjected to gamma correction, does Gaussian Blur in different Gaussian kernel sizes, And it is suitably added Gaussian noise and salt-pepper noise, to realize that data enhance.

In step 4: construction depth neural network, because final application of the invention is in mobile phone terminal, it is therefore desirable to lightweight Neural network, the present invention construct lightweight neural network, and regular volume machine core is split and is used, to reduce model parameter amount, contract Subtract model size.

In steps of 5: before entering data into the input of deep learning neural network, needing to make picture a large amount of pre- place Reason is mostly that quality is higher since the present invention obtains picture, the higher picture of resolution ratio, and true applicable cases are mostly user's mobile phone It is low in the quality of shooting, the lower picture of resolution ratio, therefore need to do certain pre-treatment to training data, using the side such as Gaussian Blur Method keeps the distribution of training data picture as close with the picture distribution under real scene use as possible, allow model accomplish to adapt to it is low Resolution chart；It is trained in picture input neural network after handling well, loss function is prediction distribution in reality point The mean square deviation of cloth；It is final mask, test set construction step that test set, which is constructed, to select the smallest model of loss function value are as follows: (1) it collects user and shoots actual picture with mobile end equipment；(2) it is manually given a mark to test set data；(3) selection is being tested The smallest model of loss function value is final mask on integrating

In step 6: will test in the neural network after picture input trains, obtain corresponding marking.

The foregoing is a detailed description of the present invention in conjunction with specific embodiments, and it cannot be said that the present invention is specifically real It applies and is only limitted to these explanations.For those skilled in the art to which the present invention belongs, before not departing from present inventive concept It puts, a number of simple deductions or replacements can also be made, all shall be regarded as belonging to the scope of protection of the invention.

Claims

1. the method for mobile terminal shooting low resolution picture scoring based on deep learning, it is characterised in that: this method packet Include following steps:

Step 1: data collection；

Step 2: according to picture quality, manual sort, the distribution of analog subscriber number carries out section distribution

Step 3: the picture in training set is done into Gaussian Blur in different Gaussian kernel sizes, and be properly added respectively Gaussian noise with Salt-pepper noise；

Step 4: construction depth neural network；

Step 6: according to neural network model, predicted pictures score.

2. the method for mobile terminal shooting low resolution picture scoring based on deep learning according to claim 1, Be characterized in that: in step 1, being collected pictures by sorting key word, will collect pictures be divided into, in, poor three classes, use part Data of the open source data set AVA in photonet, because often picture quality is higher than the quality that collects pictures manually for open source data set, It needs to be allowed to picture from tape label in appropriate adjustment data set to be distributed in crawl picture and be distributed substantially to coincide.

3. the method for mobile terminal shooting low resolution picture scoring based on deep learning according to claim 2, Be characterized in that: in step 2, artificial to give a mark, for fraction range for 0-10 points, the label finally obtained is under each different score sections The number of marking, since human cost is limited, this stamp methods is only applicable to fraction data；It collects pictures, adopts for other With artificial scoring method is simulated, for fraction range for 0-10 points, the label finally obtained is the people to give a mark under each different score sections Number, other, which collect pictures, is divided into artificial decision procedure by good, middle preference, in, difference partially in, poor 5 class simulates artificial marking mould Formula will be got well, middle preference, in, large deviations, poor 5 class is converted to the by stages 0-10, and preferably picture reciprocal fraction section is specific steps 8 to 10 points, middle 7 to 8 points of preference picture reciprocal fraction section, it is 5-7 points that middle picture, which corresponds to section, and the partially middle corresponding section of difference is 3 To 5 points, it is 1 to 3 point that poor picture, which corresponds to section, and the quasi- side group of this interval division is in certain priori knowledge.

4. the method for mobile terminal shooting low resolution picture scoring based on deep learning according to claim 3, It is characterized in that: in step 3, picture in public data source being subjected to gamma correction, does Gaussian mode in different Gaussian kernel sizes Paste is suitably added Gaussian noise and salt-pepper noise, realizes data enhancing.

5. the method for mobile terminal shooting low resolution picture scoring based on deep learning according to claim 4, It is characterized in that: in step 4, before entering data into the input of deep learning neural network, needing to make picture a large amount of pre- Processing is mostly that quality is higher due to obtaining picture, the higher picture of resolution ratio, and true applicable cases are mostly that user is shot with mobile phone Quality in low, the lower picture of resolution ratio, training data need to be needed to do certain pre-treatment, make to instruct using the methods of Gaussian Blur It is as close with the picture distribution under real scene use as possible to practice the distribution of data picture, makes model that can accomplish to adapt to low resolution figure Piece, the picture after handling well, which inputs in neural network, to be trained, and loss function is prediction distribution in actual distribution mean square deviation； And constructing test set to select the smallest model of loss function value is final mask.

6. the method for mobile terminal shooting low resolution picture scoring based on deep learning according to claim 5, It is characterized in that: in step 5, test set construction step are as follows: (1) collect user with mobile end equipment and shoot actual picture；(2) to survey Examination collection data are manually given a mark；(3) selecting the smallest model of loss function value on test set is final mask.