CN109934803A - A method of scoring for mobile terminal shooting low resolution picture based on deep learning - Google Patents

A method of scoring for mobile terminal shooting low resolution picture based on deep learning Download PDF

Info

Publication number
CN109934803A
CN109934803A CN201910144106.1A CN201910144106A CN109934803A CN 109934803 A CN109934803 A CN 109934803A CN 201910144106 A CN201910144106 A CN 201910144106A CN 109934803 A CN109934803 A CN 109934803A
Authority
CN
China
Prior art keywords
picture
data
low resolution
mobile terminal
deep learning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201910144106.1A
Other languages
Chinese (zh)
Inventor
袁子意
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai City Poetry Mdt Infotech Ltd
Original Assignee
Shanghai City Poetry Mdt Infotech Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai City Poetry Mdt Infotech Ltd filed Critical Shanghai City Poetry Mdt Infotech Ltd
Priority to CN201910144106.1A priority Critical patent/CN109934803A/en
Publication of CN109934803A publication Critical patent/CN109934803A/en
Withdrawn legal-status Critical Current

Links

Landscapes

  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

Deep learning is based on the invention discloses one kind and shoots low resolution picture scoring method for mobile terminal, and this method, which specifically includes that, collects various picture and generate label using user's marking data, forms a training data set;It constructs lightweight depth and rolls up machine neural network;Using training dataset, with image processing techniques, part of data are processed into and meet mobile terminal shooting picture distribution, training deep neural network finally utilizes neural network model, shoots to obtain low resolution picture and give a mark for mobile end equipment.The present invention by processing training set picture distribution with adapt to mobile terminal shoot low resolution picture distribution and analog subscriber really give a mark be distributed provide scoring, appraisal result is more close to human vision.

Description

It is a kind of that the scoring of low resolution picture is shot for mobile terminal based on deep learning Method
Technical field
The method for mobile terminal shooting low resolution picture scoring based on deep learning that the present invention relates to a kind of, belongs to Technical field of image processing.
Background technique
Currently, picture AI auto-scoring more and more attention has been paid to.Determine that the quality of a picture there are many elements, such as: Saturation degree, contrast, tone, composition, these abstract elements are difficult to be found by layman, to get objective a, conjunction The score of reason.Traditional picture scoring method is based primarily upon machine learning frame, and manual extraction artificial design features finally use Classification or regression model provide score, although the method fast speed, the hand-designed feature designed according to priori knowledge is past Toward the poor bottleneck of accuracy can be encountered, it is extremely difficult to optimal result, is typically only capable to obtain the result of suboptimum.Deep learning can be with Characteristics of image is automatically extracted, simple feature is combined into more complicated advanced features, thus realize image classification, mesh The complex tasks such as mark detection.Currently, volume machine neural network (CNN) has been widely used.By a large amount of training data, machine mind is rolled up It can learn feature largely hiding in data out through network and get corresponding score in conjunction with these features.However deep learning Technology equally exists following problem: (1) training data count issue: often want to obtain an extensive effect it is preferable as a result, It needs to be trained in mass data, causes the training time long, hsrdware requirements are high, and computation complexity is high, while how a large amount of Obtaining the mass data with label also becomes one of difficult point;(2) briefly feature identification is selected as classification or regression problem Cross entropy is selected as loss function to train depth network, for there are successional problems between each feature, cannot be obtained Better effects;(3) deeper feature can not be extracted, the depth emotion information expressed such as picture.Currently, there are many bases for market In the picture scoring model of CNN, however wherein most model is directed to open source data set offer data set and is trained, these Data set enormous amount, but common ground is to be taken from Professional Photography website, it is higher so as to cause picture quality, it does not meet and actually answers With under scene, the case where the low quality picture that user is shot with mobile end equipment, so as to cause model to low quality picture point Cloth bad adaptability, so applicability and practicability are restricted, it is difficult to meet the needs of market.
Summary of the invention
The present invention be directed to the shortcomings of the prior art, provide a kind of low for mobile terminal shooting based on deep learning The method of resolution chart scoring, appraisal result more close to human vision, meet actual operation requirements.
To solve the above problems, the technical solution used in the present invention is as follows:
A method of scoring for mobile terminal shooting low resolution picture based on deep learning, this method includes as follows Step:
Step 1: data collection;
Step 2: according to picture quality, manual sort, the distribution of analog subscriber number carries out section distribution;
Step 3: the picture in training set being done into Gaussian Blur in different Gaussian kernel sizes, and is properly added Gauss respectively and makes an uproar Sound and salt-pepper noise;
Step 4: construction depth neural network;
Step 5: pretreatment training set picture is inputted in neural network and is trained;
Step 6: according to neural network model, predicted pictures score.
As an improvement of the above technical solution, in step 1, it is collected pictures, will be collected pictures point by sorting key word Preferably, in, poor three classes, using data of the part open source data set AVA in photonet, because of open source data set often picture matter Amount is higher than the quality that collects pictures manually, needs to be allowed to picture from tape label in appropriate adjustment data set to be distributed in crawl picture point Cloth substantially coincide.
As an improvement of the above technical solution, in step 2, artificial marking, fraction range are 0-10 points, are finally obtained Label is the number given a mark under each different score sections, and since human cost is limited, this stamp methods is only applicable to fraction number According to;It collects pictures for other, using artificial scoring method is simulated, fraction range is 0-10 points, and the label finally obtained is every The number given a mark under a difference score section, other, which collect pictures, is divided into artificial decision procedure by good, middle preference, in, difference partially in, Poor 5 class is simulated artificial marking mode, will be got well, middle preference, in, large deviations, poor 5 class is converted to the by stages 0-10, the specific steps are Good picture reciprocal fraction section is 8 to 10 points, middle 7 to 8 points of preference picture reciprocal fraction section, and it is 5-7 that middle picture, which corresponds to section, Point, the partially middle corresponding section of difference is 3 to 5 points, and it is 1 to 3 point that poor picture, which corresponds to section, and the quasi- side group of this interval division is known in certain priori Know.
As an improvement of the above technical solution, in step 3, picture in public data source is subjected to gamma correction, not Gaussian Blur is done with Gaussian kernel size, is suitably added Gaussian noise and salt-pepper noise, realizes data enhancing.
As an improvement of the above technical solution, in step 4, it is inputted entering data into deep learning neural network Before, it needs to make picture a large amount of pretreatment, is mostly that quality is higher due to obtaining picture, the higher picture of resolution ratio, and really answer With low, the lower picture of resolution ratio in the situation quality that be mostly user shot with mobile phone, training data need to be needed to do it is certain before place Reason is distributed the distribution of training data picture with the picture under real scene use as far as possible using the methods of Gaussian Blur close, Make model that can accomplish to adapt to low resolution picture, is trained in the picture input neural network after handling well, loss function It is prediction distribution in actual distribution mean square deviation;And test set is constructed to select the smallest model of loss function value for final mould Type.
As an improvement of the above technical solution, in step 5, test set construction step are as follows: (1) collect user and set with mobile terminal Standby shooting actual picture;(2) it is manually given a mark to test set data;(3) selection loss function value on test set is the smallest Model is final mask.
Compared with prior art, implementation result of the invention is as follows by the present invention:
The present invention adapts to the distribution of mobile terminal shooting low resolution picture by the distribution of processing training set picture and simulation is used Family, which really gives a mark to be distributed, provides scoring, and appraisal result more close to human vision, meets actual operation requirements.
Detailed description of the invention
Fig. 1 is the stream of the method that the scoring of low resolution picture is shot for mobile terminal of the present invention based on deep learning Journey structural block diagram.
Specific embodiment
Illustrate the contents of the present invention below in conjunction with specific embodiments.
The scoring of low resolution picture is shot for mobile terminal based on deep learning to be of the present invention as shown in Figure 1: The Structure and Process block diagram of method.
A kind of picture scoring method based on deep learning, method include the following steps
Step 1: data collection
Step 2: according to picture quality, manual sort, the distribution of analog subscriber number carries out section distribution;
Step 3: the picture in training set being subjected to gamma correction, different Gaussian kernel sizes do Gaussian Blur, and fit respectively When addition Gaussian noise and salt-pepper noise, data enhancing is realized;
Step 4: construction depth neural network;
Step 5;Training set picture is pre-processed, inputs in neural network and is trained;
Step 6 is according to neural network model, predicted pictures score.
In step 1: it is collected pictures by sorting key word, is divided into collecting pictures, middle preference, in, poor three classes make With data of the part open source data set AVA in photonet, however increase income data set often picture quality than collecting figure manually Tablet quality is high, it is therefore desirable to from tape label in appropriate adjustment data set, be allowed to picture and be distributed in the distribution that collects pictures substantially to coincide.
In step 2: artificial marking, for fraction range for 0-10 points, the label finally obtained is under each different score sections The number of marking, however since human cost is limited, this stamp methods is suitable for fraction data;Figure is collected for other Piece, the present invention is using artificial scoring method is simulated, and for fraction range for 0-10 points, the label finally obtained is each different score sections The number of lower marking, network are crawled picture and are divided into artificial decision procedure by good, middle preference, in, difference partially in, poor 5 class, simulate people Work marking mode, will get well, middle preference, in, large deviations, poor 5 class is converted to the by stages 0-10;Specific steps preferably divide by picture correspondence Number interval is 8 to 10 points, middle 7 to 8 points of preference picture reciprocal fraction section, and it is 5-7 points that middle picture, which corresponds to section, and difference is partially middle corresponding Section is 3 to 5 points, and it is 1 to 3 point that poor picture, which corresponds to section, and the quasi- side group of this interval division is in certain priori knowledge.
In step 3: picture in public data source is subjected to gamma correction, does Gaussian Blur in different Gaussian kernel sizes, And it is suitably added Gaussian noise and salt-pepper noise, to realize that data enhance.
In step 4: construction depth neural network, because final application of the invention is in mobile phone terminal, it is therefore desirable to lightweight Neural network, the present invention construct lightweight neural network, and regular volume machine core is split and is used, to reduce model parameter amount, contract Subtract model size.
In steps of 5: before entering data into the input of deep learning neural network, needing to make picture a large amount of pre- place Reason is mostly that quality is higher since the present invention obtains picture, the higher picture of resolution ratio, and true applicable cases are mostly user's mobile phone It is low in the quality of shooting, the lower picture of resolution ratio, therefore need to do certain pre-treatment to training data, using the side such as Gaussian Blur Method keeps the distribution of training data picture as close with the picture distribution under real scene use as possible, allow model accomplish to adapt to it is low Resolution chart;It is trained in picture input neural network after handling well, loss function is prediction distribution in reality point The mean square deviation of cloth;It is final mask, test set construction step that test set, which is constructed, to select the smallest model of loss function value are as follows: (1) it collects user and shoots actual picture with mobile end equipment;(2) it is manually given a mark to test set data;(3) selection is being tested The smallest model of loss function value is final mask on integrating
In step 6: will test in the neural network after picture input trains, obtain corresponding marking.
The foregoing is a detailed description of the present invention in conjunction with specific embodiments, and it cannot be said that the present invention is specifically real It applies and is only limitted to these explanations.For those skilled in the art to which the present invention belongs, before not departing from present inventive concept It puts, a number of simple deductions or replacements can also be made, all shall be regarded as belonging to the scope of protection of the invention.

Claims (6)

1. the method for mobile terminal shooting low resolution picture scoring based on deep learning, it is characterised in that: this method packet Include following steps:
Step 1: data collection;
Step 2: according to picture quality, manual sort, the distribution of analog subscriber number carries out section distribution
Step 3: the picture in training set is done into Gaussian Blur in different Gaussian kernel sizes, and be properly added respectively Gaussian noise with Salt-pepper noise;
Step 4: construction depth neural network;
Step 5: pretreatment training set picture is inputted in neural network and is trained;
Step 6: according to neural network model, predicted pictures score.
2. the method for mobile terminal shooting low resolution picture scoring based on deep learning according to claim 1, Be characterized in that: in step 1, being collected pictures by sorting key word, will collect pictures be divided into, in, poor three classes, use part Data of the open source data set AVA in photonet, because often picture quality is higher than the quality that collects pictures manually for open source data set, It needs to be allowed to picture from tape label in appropriate adjustment data set to be distributed in crawl picture and be distributed substantially to coincide.
3. the method for mobile terminal shooting low resolution picture scoring based on deep learning according to claim 2, Be characterized in that: in step 2, artificial to give a mark, for fraction range for 0-10 points, the label finally obtained is under each different score sections The number of marking, since human cost is limited, this stamp methods is only applicable to fraction data;It collects pictures, adopts for other With artificial scoring method is simulated, for fraction range for 0-10 points, the label finally obtained is the people to give a mark under each different score sections Number, other, which collect pictures, is divided into artificial decision procedure by good, middle preference, in, difference partially in, poor 5 class simulates artificial marking mould Formula will be got well, middle preference, in, large deviations, poor 5 class is converted to the by stages 0-10, and preferably picture reciprocal fraction section is specific steps 8 to 10 points, middle 7 to 8 points of preference picture reciprocal fraction section, it is 5-7 points that middle picture, which corresponds to section, and the partially middle corresponding section of difference is 3 To 5 points, it is 1 to 3 point that poor picture, which corresponds to section, and the quasi- side group of this interval division is in certain priori knowledge.
4. the method for mobile terminal shooting low resolution picture scoring based on deep learning according to claim 3, It is characterized in that: in step 3, picture in public data source being subjected to gamma correction, does Gaussian mode in different Gaussian kernel sizes Paste is suitably added Gaussian noise and salt-pepper noise, realizes data enhancing.
5. the method for mobile terminal shooting low resolution picture scoring based on deep learning according to claim 4, It is characterized in that: in step 4, before entering data into the input of deep learning neural network, needing to make picture a large amount of pre- Processing is mostly that quality is higher due to obtaining picture, the higher picture of resolution ratio, and true applicable cases are mostly that user is shot with mobile phone Quality in low, the lower picture of resolution ratio, training data need to be needed to do certain pre-treatment, make to instruct using the methods of Gaussian Blur It is as close with the picture distribution under real scene use as possible to practice the distribution of data picture, makes model that can accomplish to adapt to low resolution figure Piece, the picture after handling well, which inputs in neural network, to be trained, and loss function is prediction distribution in actual distribution mean square deviation; And constructing test set to select the smallest model of loss function value is final mask.
6. the method for mobile terminal shooting low resolution picture scoring based on deep learning according to claim 5, It is characterized in that: in step 5, test set construction step are as follows: (1) collect user with mobile end equipment and shoot actual picture;(2) to survey Examination collection data are manually given a mark;(3) selecting the smallest model of loss function value on test set is final mask.
CN201910144106.1A 2019-02-27 2019-02-27 A method of scoring for mobile terminal shooting low resolution picture based on deep learning Withdrawn CN109934803A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910144106.1A CN109934803A (en) 2019-02-27 2019-02-27 A method of scoring for mobile terminal shooting low resolution picture based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910144106.1A CN109934803A (en) 2019-02-27 2019-02-27 A method of scoring for mobile terminal shooting low resolution picture based on deep learning

Publications (1)

Publication Number Publication Date
CN109934803A true CN109934803A (en) 2019-06-25

Family

ID=66985972

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910144106.1A Withdrawn CN109934803A (en) 2019-02-27 2019-02-27 A method of scoring for mobile terminal shooting low resolution picture based on deep learning

Country Status (1)

Country Link
CN (1) CN109934803A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111461249A (en) * 2020-04-09 2020-07-28 上海城诗信息科技有限公司 Photo scoring analysis method and system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105893916A (en) * 2014-12-11 2016-08-24 深圳市阿图姆科技有限公司 New method for detection of face pretreatment, feature extraction and dimensionality reduction description
CN106296690A (en) * 2016-08-10 2017-01-04 北京小米移动软件有限公司 The method for evaluating quality of picture material and device
CN108960087A (en) * 2018-06-20 2018-12-07 中国科学院重庆绿色智能技术研究院 A kind of quality of human face image appraisal procedure and system based on various dimensions evaluation criteria

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105893916A (en) * 2014-12-11 2016-08-24 深圳市阿图姆科技有限公司 New method for detection of face pretreatment, feature extraction and dimensionality reduction description
CN106296690A (en) * 2016-08-10 2017-01-04 北京小米移动软件有限公司 The method for evaluating quality of picture material and device
CN108960087A (en) * 2018-06-20 2018-12-07 中国科学院重庆绿色智能技术研究院 A kind of quality of human face image appraisal procedure and system based on various dimensions evaluation criteria

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111461249A (en) * 2020-04-09 2020-07-28 上海城诗信息科技有限公司 Photo scoring analysis method and system

Similar Documents

Publication Publication Date Title
CN108428227B (en) No-reference image quality evaluation method based on full convolution neural network
CN110136154A (en) Remote sensing images semantic segmentation method based on full convolutional network and Morphological scale-space
CN107092926A (en) Service robot object recognition algorithm based on deep learning
CN110675328A (en) Low-illumination image enhancement method and device based on condition generation countermeasure network
CN101819638B (en) Establishment method of pornographic detection model and pornographic detection method
CN106790019A (en) The encryption method for recognizing flux and device of feature based self study
CN109165682A (en) A kind of remote sensing images scene classification method merging depth characteristic and significant characteristics
CN109871845B (en) Certificate image extraction method and terminal equipment
CN106951473A (en) Towards the construction method of the deep vision question answering system of dysopia personage
CN104268590B (en) The blind image quality evaluating method returned based on complementary combination feature and multiphase
CN107092883A (en) Object identification method for tracing
CN109961434A (en) Non-reference picture quality appraisement method towards the decaying of level semanteme
Yusnita et al. Implementation of real-time static hand gesture recognition using artificial neural network
CN109559362B (en) Image subject face replacing method and device
CN109472193A (en) Method for detecting human face and device
CN106127108A (en) A kind of staff image region detection method based on convolutional neural networks
CN109829868A (en) A kind of lightweight deep learning model image defogging method, electronic equipment and medium
CN108460399A (en) A kind of child building block builds householder method and system
CN110427819A (en) The method and relevant device of PPT frame in a kind of identification image
CN110163567A (en) Classroom roll calling system based on multitask concatenated convolutional neural network
CN109919252A (en) The method for generating classifier using a small number of mark images
CN107862692A (en) A kind of ribbon mark of break defect inspection method based on convolutional neural networks
CN114724222B (en) AI digital human emotion analysis method based on multiple modes
CN109377494A (en) A kind of semantic segmentation method and apparatus for image
CN108509939A (en) A kind of birds recognition methods based on deep learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20190625