CN107844758A

CN107844758A - Intelligence pre- film examination method, computer equipment and readable storage medium storing program for executing

Info

Publication number: CN107844758A
Application number: CN201711002884.4A
Authority: CN
Inventors: 李甫
Original assignee: Wuxi Liangziyun Digital New Media Technology Co Ltd; Quantum Cloud Future (beijing) Mdt Infotech Ltd
Current assignee: Wuxi Liangziyun Digital New Media Technology Co Ltd; Quantum Cloud Future (beijing) Mdt Infotech Ltd
Priority date: 2017-10-24
Filing date: 2017-10-24
Publication date: 2018-03-27

Abstract

The invention discloses intelligence pre- film examination method, computer equipment and readable storage medium storing program for executing, it is related to technical field of video image processing, including：Obtain training pictures and checking pictures.Training picture in training pictures is pre-processed and marked, obtains training picture database.The average of training picture is calculated, and training picture is subtracted into average.The pre- review of a film by the censor training pattern caffenet models of intelligence are established under convolutional neural networks framework Caffe environment.It is trained using the training picture for subtracting average for caffenet models, the caffenet models after being trained.For the checking picture that checking picture is concentrated as the input picture of the caffenet models after training to being identified, recognition result is pre- review of a film by the censor result.The application employs image recognition sorting technique and technology based on deep learning, by there is the feature of supervision or unsupervised mode learning hierarchy to describe, so as to realize the pre- review of a film by the censor of intelligence.

Description

Intelligence pre- film examination method, computer equipment and readable storage medium storing program for executing

Technical field

The present invention relates to technical field of video image processing, and in particular to the pre- film examination method of intelligence, computer equipment and Readable storage medium storing program for executing.

Background technology

Along with the progress of computer science and technology, the research and development and application of artificial intelligence are more and more deeply extensive, and a lot Field and industry suffer from close combination, and progressively substitute the manually work in these fields or industry in future soon Make, turn into following main economic stimuli.

Work is identified in video display industry or video image field, any one video or characteristics of image at present, is all Completed by people.

Existing video image identification technology, it is necessary to largely train material and human cost, accuracy rate and discrimination it is low, And the quality of model personnel Feature Engineering work is to determine the key of effect quality.

Due to not using artificial intelligence-depth learning technology, existing video image identification technology is limited only to existing Set content is trained, lacks ability of self-teaching, so the image recognition work of cumbersome tera incognita complicated and changeable, nothing can not be adapted to Method solves many practical problems of reality scene.

The content of the invention

It is an object of the invention to provide intelligence pre- film examination method, computer equipment and readable storage medium storing program for executing, based on AI Technology, corresponding image recognition model is trained, solve some concrete application fields of video content in video display industry by this model The feature recognition work of scape is simultaneously classified, judged according to identification content results, the work such as early warning, and be applied to video display expose the false, Multiple reality scene fields such as film advertisement, movie and television contents Safety Examination.

To achieve the above object, technical scheme is as follows：

In a first aspect, the embodiment of the present application provides a kind of pre- film examination method of intelligence, including：

Obtain training pictures and checking pictures.

Training picture in training pictures is pre-processed and marked, obtains training picture database.

The average of training picture is calculated, and training picture is subtracted into average.

The pre- review of a film by the censor training pattern caffenet models of intelligence are established under convolutional neural networks framework Caffe environment.

It is trained using the training picture for subtracting average for the caffenet models, after being trained Caffenet models；Caffenet models after the training are carried out after input picture is received to the input picture Identification, exports the generic to input picture.

Input picture pair using the checking picture that the checking picture is concentrated as the caffenet models after the training It is identified, the generic for exporting the input picture is pre- review of a film by the censor result.

In one or more embodiments of first aspect, the training picture in training pictures is pre-processed, wrapped Include：

The filename for training picture, picture category, picture path are stored.

Adjusting training picture size is the uniform sizes of setting, and is converted to unified form.

In one or more embodiments of first aspect, the pre- review of a film by the censor training pattern caffenet models of intelligence are：

Low-level image feature extraction is carried out to input picture.

Feature coding is carried out to the low-level image feature extracted.

Space characteristics constraint is carried out to the low-level image feature after feature coding, obtains the description vectors of input picture.

The description vectors of image are classified using grader, obtain the recognition result of caffenet models.

In one or more embodiments of first aspect, low-level image feature extraction is carried out to the image of the pre- review of a film by the censor, including：

According to bottom of the local feature description in default compensation, yardstick extraction preliminary hearing picture as preliminary hearing picture Feature.

In one or more embodiments of first aspect, it is special that local feature description includes scale invariant feature conversion SIFT Sign, histograms of oriented gradients HOG features, local binary patterns LBP features.

In one or more embodiments of first aspect, feature coding is carried out to the low-level image feature extracted, including, adopt Encoded with vector quantization, be in sparse coding, local linear constraint coding or Fisher vector coding a kind of to being extracted Low-level image feature carries out feature coding；

In one or more embodiments of first aspect, space characteristics are carried out about to the low-level image feature after feature coding Beam, the description vectors of image are obtained, including：Using the method for pyramid characteristic matching to entering to the low-level image feature after feature coding Row space characteristics constrain, and obtain the description vectors of image.

In one or more embodiments of first aspect, grader is support vector machines or random forest.

Second aspect, the embodiment of the present application additionally provide a kind of computer equipment, including memory, processor and storage On a memory and the computer program that can run on a processor, the application first aspect is realized during computing device program The method that one or more embodiments provide.

The third aspect, the embodiment of the present application additionally provide a kind of computer-readable recording medium, are stored thereon with computer Program, the method that one or more embodiments of the application first aspect provide is realized when computer program is executed by processor.

The inventive method has the following advantages that：

The conventional image recognition technology based on machine learning, it is difficult to which the image for adapting to cumbersome tera incognita complicated and changeable is known Do not work, in this case, the application employs image recognition sorting technique and technology based on deep learning, by there is prison Superintend and direct or the feature of unsupervised mode learning hierarchy describes, so as to realize the pre- review of a film by the censor of intelligence.The advantages of the application is exactly profit Inputted with image pixel information, remain all information of input picture to the full extent, output is directly image recognition result.

Brief description of the drawings

Fig. 1 is the pre- review of a film by the censor method flow diagram of a kind of intelligence based on AI technologies that the embodiment of the present application provides；

Fig. 2 is the specific training step for the pre- review of a film by the censor training pattern caffenet models of intelligence that the embodiment of the present application provides；

Fig. 3 is the pre- film examination method of intelligence that embodiment is classified as with video display industry film.

Embodiment

Following examples are used to illustrate the present invention, but are not limited to the scope of the present invention.

Embodiment 1

Fig. 1 is refer to, Fig. 1 shows the pre- film examination method stream of a kind of intelligence based on AI technologies that the embodiment of the present application provides Cheng Tu, including：

S101, obtain training pictures and checking pictures；It can be in film wherein to train picture and checking picture Corresponding picture, training picture are used to be trained pre- review of a film by the censor training pattern, and checking picture term is instructed to the pre- review of a film by the censor trained Practice model to be verified.

S102, the training picture in training pictures is pre-processed and manually marked, obtain training picture database；

Wherein the training picture in training pictures is pre-processed, including：

The filename for training picture, picture category, picture path are stored；

Wherein artificial mark includes：

Artificial mark training picture generic.

S103, the average for calculating training picture, and training picture is subtracted into average；

S104, the pre- review of a film by the censor training pattern caffenet models of intelligence are established under convolutional neural networks framework Caffe environment；

S105, it is trained for caffenet models using the training picture for subtracting average, after being trained Caffenet models；Caffenet models after training can export the institute to input picture after input picture is received Belong to classification, as identify classification results.

S106, using verify picture concentrate checking picture as training after caffenet models input picture to progress Identification, recognition result is the generic of input picture, as pre- review of a film by the censor result.

It refer to Fig. 2, Fig. 2 shows the specific of pre- review of a film by the censor training pattern caffenet models of intelligence in the embodiment of the present application Training step, including

S201, the image to the pre- review of a film by the censor carry out low-level image feature extraction；Extracted largely according to fixed step size, yardstick from image Local feature description.Conventional local feature includes SIFT, and (Scale-Invariant Feature Transform, yardstick is not Become Feature Conversion), HOG (Histogram of Oriented Gradient, histograms of oriented gradients), LBP (Local Bianray Pattern, local binary patterns) etc., typically also prevent from losing excessive useful letter using various features description Breath.

S202, feature coding is carried out to the low-level image feature extracted；Bulk redundancy and noise are contained in low-level image feature, is The robustness (Robustness) of feature representation is improved, it is necessary to compiled using a kind of eigentransformation algorithm to low-level image feature Code, referred to as feature coding.Conventional feature coding include vector quantization coding, sparse coding, local linear constraint encode, Fisher vector coding etc..

S203, space characteristics constraint is carried out to the low-level image feature after feature coding, obtain the description vectors of image；Feature is compiled Space characteristics can typically be passed through after code to constrain, also referred to as feature converges.Feature convergence refers in a spatial dimension, to every One-dimensional characteristic takes maximum or average value, can obtain the feature representation of certain feature invariant shape.Pyramid characteristic matching is A kind of conventional feature assemblage method, this method propose, by image uniform piecemeal, feature convergence to be done in piecemeal.

S204, using grader the description vectors of image are classified, obtain the recognition result of the image of the pre- review of a film by the censor.

The latter image by preceding step can be described with the vector of a fixed dimension, next be exactly to pass through Grader is crossed to classify to image.Usually used grader include SVM (Support Vector Machine, support to Amount machine), random forest etc..

By above method flow, image recognition sorting technique and technology based on deep learning are employed, by there is prison Superintend and direct or the feature of unsupervised mode learning hierarchy describes, so as to realize the pre- review of a film by the censor of intelligence.Utilize the convolution of deep learning Neural network model carries out feature extraction and higher level of abstraction.Its advantages of is exactly to be inputted using image pixel information, at utmost On remain all information of input picture, model output is directly image recognition result.

Embodiment 2

The embodiment of the present application additionally provides a kind of computer equipment, including memory, processor and is stored in memory Computer program that is upper and can running on a processor, any possibility of the offer of embodiment 1 is realized during computing device program Method flow.

Embodiment 3

The embodiment of the present application additionally provides a kind of computer-readable recording medium, is stored thereon with computer program, calculates Machine program realizes any possible method flow that embodiment 1 provides when being executed by processor.

Embodiment 4

Fig. 3 is refer to, Fig. 3 is classified as embodiment with video display industry film, shows a kind of specific pre- review of a film by the censor side of intelligence Method, specifically comprise the following steps：

S1. actual demand business scenario is combined, prepares training pictures and checking pictures.

S2. pretreatment and artificial mark, the picture database trained are carried out to the training picture in training pictures. Image data pretreatment include：Preparing pictures inventory, adjustment picture size, structure picture database

S21. preparing pictures inventory, the contents such as picture file name, picture category, picture path are stored according to reference format In train.txt and val.txt files.

S22. picture size is adjusted to 256*256, unified adjustment picture size, in order to preferably analyze.

S23. picture database is built, in caffe environment, carries out picture training, it is necessary to there is picture database, and it is non-straight Input picture is connect, so train and val 2 groups of pictures are converted into lmbp lattice using create_imagenet.sh scripts Formula.

S3. image average is calculated using make_imagenet_mean.sh, after calculating image average, training speed can be improved Degree and precision.

S4. model is created, configuration file, training parameter is set.This is the most crucial place of this training pattern, Ye Shiben The core of secondary invention.The caffenet models that model is just carried with program, change solver.prototxt and train_ therein val.protxt。

S5. DSR, it is exactly finally training and test.If precision reaches reasonable demand, then it is assumed that training pattern Success.If you have questions, then training data and training parameter, re-optimization training pattern are readjusted.

S6. reality scene is applied.The training picture and checking picture that this time we prepare, are for movie and television contents film point Level prepares, can distinguish relate to it is yellow, relate to the content such as probably.Using this model, we just automatically identify the content in films and television programs, if Related bottom line is touched, and early warning is carried out to its content, graded reporting is generated, so as to realize video display industry film hierarchical application.

Although above with general explanation and specific embodiment, the present invention is described in detail, at this On the basis of invention, it can be made some modifications or improvements, this will be apparent to those skilled in the art.Therefore, These modifications or improvements without departing from theon the basis of the spirit of the present invention, belong to the scope of protection of present invention.

Claims

1. a kind of pre- film examination method of intelligence, it is characterised in that methods described includes：

Obtain training pictures and checking pictures；

Training picture in the training pictures is pre-processed and marks training picture generic, obtains training picture Database；

The average of the training picture is calculated, and the training picture is subtracted into average；

The pre- review of a film by the censor training pattern caffenet models of intelligence are established under convolutional neural networks framework Caffe environment；

It is trained using the training picture for subtracting average for the caffenet models, after being trained Caffenet models；Caffenet models after the training are carried out after input picture is received to the input picture Identification, exports the generic to input picture；

The checking picture that the checking picture is concentrated is as the input picture of the caffenet models after the training to carrying out Identification, the generic for exporting the input picture is pre- review of a film by the censor result.

2. the method as described in claim 1, it is characterised in that located in advance to the training picture in the training pictures Reason, including：

The filename of the training picture, picture category, picture path are stored；

Uniform sizes of the training picture size for setting are adjusted, and are converted to unified form.

3. the method as described in claim 1, it is characterised in that the pre- review of a film by the censor training pattern caffenet models of intelligence are：

Low-level image feature extraction is carried out to input picture；

Feature coding is carried out to the low-level image feature extracted；

Space characteristics constraint is carried out to the low-level image feature after feature coding, obtains the description vectors of the input picture；

The description vectors of described image are classified using grader, obtain the recognition result of the caffenet models.

4. method as claimed in claim 3, it is characterised in that the image to the pre- review of a film by the censor carries out low-level image feature and carried Take, including：

According to the local feature description in default compensation, the yardstick extraction preliminary hearing picture as the preliminary hearing picture Low-level image feature.

5. method as claimed in claim 4, it is characterised in that the local feature description changes including scale invariant feature SIFT feature, histograms of oriented gradients HOG features and local binary patterns LBP features.

6. method as claimed in claim 3, it is characterised in that the low-level image feature to being extracted carries out feature coding, bag Include, encoded using vector quantization, be in sparse coding, local linear constraint coding or Fisher vector coding a kind of to being carried The low-level image feature taken carries out feature coding.

7. method as claimed in claim 3, it is characterised in that the low-level image feature to after feature coding carries out space characteristics Constraint, the description vectors of described image are obtained, including：Using the method for pyramid characteristic matching to the bottom after feature coding Feature carries out space characteristics constraint, obtains the description vectors of described image.

8. method as claimed in claim 3, it is characterised in that the grader is support vector machines or random forest.

9. a kind of computer equipment, including memory, processor and storage can be run on a memory and on a processor Computer program, it is characterised in that the claim 1-8 any described sides are realized during the computing device described program Method.

10. a kind of computer-readable recording medium, is stored thereon with computer program, it is characterised in that the computer program The claim 1-8 any described methods are realized when being executed by processor.