CN108734184A - A kind of method and device that sensitive image is analyzed - Google Patents

A kind of method and device that sensitive image is analyzed Download PDF

Info

Publication number
CN108734184A
CN108734184A CN201710248908.8A CN201710248908A CN108734184A CN 108734184 A CN108734184 A CN 108734184A CN 201710248908 A CN201710248908 A CN 201710248908A CN 108734184 A CN108734184 A CN 108734184A
Authority
CN
China
Prior art keywords
sample
training
image
picture
kinds
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710248908.8A
Other languages
Chinese (zh)
Other versions
CN108734184B (en
Inventor
杨现
常江龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suning Commerce Group Co Ltd
Original Assignee
Suning Commerce Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suning Commerce Group Co Ltd filed Critical Suning Commerce Group Co Ltd
Priority to CN201710248908.8A priority Critical patent/CN108734184B/en
Publication of CN108734184A publication Critical patent/CN108734184A/en
Application granted granted Critical
Publication of CN108734184B publication Critical patent/CN108734184B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0641Shopping interfaces
    • G06Q30/0643Graphical representation of items or shoppers

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Image Analysis (AREA)

Abstract

The embodiment of the invention discloses a kind of method and devices analyzed sensitive image, are related to image identification technical field, can promote the automatization level for advertising pictures recognition detection, reduce manual examination and verification cost.The present invention includes:By being clustered to the sample image in the training sample set, further according to the sample image by cluster, by the corresponding all kinds of identification model of convolutional neural networks training, the identification model obtained later using training identifies corresponding all kinds of sensitization picture from picture library to be detected.The present invention is suitable for identifying the sensitization picture on line platform.

Description

A kind of method and device that sensitive image is analyzed
Technical field
The present invention relates to image identification technical field more particularly to a kind of method analyzed sensitive image and dresses It sets.
Background technology
With the construction of the development and the network platforms such as all kinds of online transaction platforms, on-line marketing platform of Internet technology, Major operator and size retail shop can all launch the Internet advertising of magnanimity in the network platform all the time.In order to which specification interconnects The publication behavior of net advertisement, protects the legitimate rights and interests of consumers, in the new Advertising Law promulgated in 2015, clear stipulaties internet Advertising campaign also has to comply with Advertising Law items regulation.
In current practical application, each network platform monitors the movable means of Internet advertising, mainly passes through detection Sensitive image differentiates and early warning may illegal advertisement.Existing nude picture detection method refers in particular to pornographic image, phase mostly The detection means answered and analysis method main development are from foundation《The Law of the P.R.C. on Administrative Penalties for Public Security》With《Criminal law》To obscene pornography Identification field is monitored, detection mode is mainly based on the sensitive organ of detection.Such as:Hand-designed has fixed color, shape With the characteristics of image of texture, and doubtful sensitive image is obtained according to the Image Feature Matching manually set.
But the accuracy of identification of existing way is relatively low, often by under normal circumstances birds meat products, underwear, sporting goods, The advertisements of the classifications commodity such as family planning articles, publicity image are reported by mistake into sensitive image, in the past mainly by being appealed by report side or The mode of monitoring personnel artificial treatment solves the problems, such as wrong report, this has been difficult to meet the magnanimity to being launched in the network platform at present The demand that Internet advertising is monitored in real time does not adapt to electric business platform especially and magnanimity demonstration image filtering is supervised The requirement of control, it is therefore desirable to the higher detection means of the degree of automation is developed, to control cost of labor.
Invention content
The embodiment of the present invention provides a kind of method analyzed sensitive image, can be promoted and advertising pictures are known The automatization level not detected reduces manual examination and verification cost.
In order to achieve the above objectives, the embodiment of the present invention adopts the following technical scheme that:
In a first aspect, the method that the embodiment of the present invention provides, including:
In extracted training sample set, according to the corresponding sensitive kind of each sample, in the training sample set Sample image clustered;
According to the sample image by cluster, pass through the corresponding all kinds of identification model of convolutional neural networks training;
The identification model obtained using training identifies corresponding all kinds of sensitization picture from picture library to be detected.
With reference to first aspect, described corresponding according to each sample in the first possible realization method of first aspect Sensitive kind clusters the sample image in the training sample set, including:
By preset neural network model, the sensitive features of each sample image are extracted from the training sample set, Wherein, the preset neural network model is trained by imagenet;
By preset clustering algorithm, the sample image that the similarity degree of sensitive features is met to test order is clustered to same One sample set closes.
The possible realization method of with reference to first aspect the first further includes in second of possible realization method:
In a sample set closes:
According at a distance from cluster centre, by the sample image in the subclass by closely sorting to remote, and sequence is chosen Preceding specified digit sample image as positive sample;
Utilize obtained positive sample training pattern grader;
By the trained model classifiers, the sample image in being closed to the sample set carries out classification meter It calculates, and the sample image by the score value being calculated less than pre-determined threshold is rejected.
The possible realization method of with reference to first aspect the first further includes in the third possible realization method:
Utilize extracted pre-training data set, the depth residual error network of the specified number of plies of training, the specified number of plies >=50;
The depth residual error network obtained by training corrects the sample set and closes.
With reference to first aspect, in the 4th kind of possible realization method of first aspect, further include:
All kinds of sensitization pictures is corresponded in the picture library to be detected to be identified, and after obtaining recognition result, Difficult example sample is extracted from the recognition result;
The parameter of all kinds of identification models is corresponded to according to the difficult example Sample Refreshment.
The 4th kind of possible realization method with reference to first aspect, it is described from institute in the 5th kind of possible realization method It states and extracts difficult example sample in recognition result, including:
Obtain the score value of each attribute in sensitization picture, wherein the score value of each attribute passes through institute in the sensitization picture Identification model is stated to be calculated;
According to the sequence of score value from large to small, sort to each attribute of acquired sensitization picture;
The additive value to sort in the score value of the attribute of preceding specified digit is obtained, when the additive value is more than preset confidence When spending threshold value, judgement is as the difficult example sample.
With reference to first aspect, in the 6th kind of possible realization method of first aspect, further include:
According to preset business rule, candidate image is acquired from electric business service platform, utilizes acquired candidate image more The new picture library;
And/or according to preset test order, the trained sample is extracted from the sample database pointed by the test order This set.
Second aspect, the device that the embodiment of the present invention provides, including:
Cluster module is used in extracted training sample set, according to the corresponding sensitive kind of each sample, to the instruction The sample image practiced in sample set is clustered;
Training module, for according to the sample image by cluster, passing through the corresponding all kinds of knowledge of convolutional neural networks training Other model;
Analysis module, the identification model for being obtained using training identify that correspondence is all kinds of from picture library to be detected Sensitization picture.
In conjunction with second aspect, in the first possible realization method of second aspect, the cluster module is specifically used for By preset neural network model, the sensitive features of each sample image are extracted from the training sample set;And by pre- If clustering algorithm, the sample image that the similarity degree of sensitive features is met to test order clusters to the same sample set It closes;Wherein, the preset neural network model is trained by imagenet.
Further include in second of possible realization method of second aspect in conjunction with second aspect:Filtering module is used for In a sample set closes:According at a distance from cluster centre, by the sample image in the subclass by closely to remote sequence, And choose sequence preceding specified digit sample image as positive sample;Obtained positive sample training pattern is recycled to classify Device;Later by the trained model classifiers, the sample image in being closed to the sample set carries out classified calculating, And the sample image by the score value being calculated less than pre-determined threshold is rejected;
Further include:Correction module, for utilizing extracted pre-training data set, the depth residual error net of the specified number of plies of training Network, the specified number of plies >=50;And the depth residual error network obtained by training, it corrects the sample set and closes.
Further include in the third possible realization method of second aspect in conjunction with second aspect:
Update module is identified for corresponding to all kinds of sensitization pictures in the picture library to be detected, and To after recognition result, the score value of each attribute in sensitization picture is obtained, wherein the score value of each attribute is logical in the sensitization picture The identification model is crossed to be calculated;And the sequence according to score value from large to small, to each attribute of acquired sensitization picture Sequence;
The additive value to sort in the score value of the attribute of preceding specified digit is obtained, when the additive value is more than preset confidence When spending threshold value, judgement is as the difficult example sample.;And the ginseng of all kinds of identification models is corresponded to according to the difficult example Sample Refreshment Number.
The method and device provided in an embodiment of the present invention that sensitive image is analyzed, by the training sample set Sample image in conjunction is clustered, all kinds of by convolutional neural networks training correspondence further according to the sample image by cluster Identification model corresponding all kinds of Sensitive Graphs are identified from picture library to be detected later using the obtained identification model of training Piece, to identify whether the picture that the trade company of electric business service platform is uploaded belongs to corresponding all kinds of sensitization picture.It realizes For trade company upload electric business service platform picture it is automatic detection, scanning, improve for advertising pictures recognition detection oneself Dynamicization is horizontal, reduces manual examination and verification cost.
Description of the drawings
It to describe the technical solutions in the embodiments of the present invention more clearly, below will be to needed in the embodiment Attached drawing is briefly described, it should be apparent that, drawings in the following description are only some embodiments of the invention, for ability For the those of ordinary skill of domain, without creative efforts, it can also be obtained according to these attached drawings other attached Figure.
Fig. 1 is a kind of possible system architecture schematic diagram provided in an embodiment of the present invention;
Fig. 2 is method flow schematic diagram provided in an embodiment of the present invention;
Fig. 3,4 be specific example provided in an embodiment of the present invention schematic diagram;
Fig. 5,6,7 are the structural schematic diagram of device provided in an embodiment of the present invention.
Specific implementation mode
To make those skilled in the art more fully understand technical scheme of the present invention, below in conjunction with the accompanying drawings and specific embodiment party Present invention is further described in detail for formula.Embodiments of the present invention are described in more detail below, the embodiment is shown Example is shown in the accompanying drawings, and in which the same or similar labels are throughly indicated same or similar element or has identical or class Like the element of function.It is exemplary below with reference to the embodiment of attached drawing description, is only used for explaining the present invention, and cannot It is construed to limitation of the present invention.Those skilled in the art of the present technique are appreciated that unless expressly stated, odd number shape used herein Formula " one ", "one", " described " and "the" may also comprise plural form.It is to be further understood that the specification of the present invention The middle wording " comprising " used refers to that there are the feature, integer, step, operation, element and/or component, but it is not excluded that Other one or more features of presence or addition, integer, step, operation, element, component and/or their group.It should be understood that When we say that an element is " connected " or " coupled " to another element, it can be directly connected or coupled to other elements, or There may also be intermediary elements.In addition, " connection " used herein or " coupling " may include being wirelessly connected or coupling.Here make Wording "and/or" includes any cell of one or more associated list items and all combines.The art Technical staff is appreciated that unless otherwise defined all terms (including technical terms and scientific terms) used herein have Meaning identical with the general understanding of the those of ordinary skill in fields of the present invention.It should also be understood that such as general Term, which should be understood that, those of defined in dictionary has a meaning that is consistent with the meaning in the context of the prior art, and Unless being defined as here, will not be explained with the meaning of idealization or too formal.
The embodiment of the present invention specifically may be implemented in a kind of system environments as shown in Figure 1, specifically include:Electric business takes Business platform, monitoring server and database;
Wherein it is possible to by monitoring server according to preset business rule, candidate image is acquired from electric business service platform, and Acquired candidate image is utilized to update the picture library in the database.It is taken in electric business specifically, picture library includes publication The advertising pictures being engaged on platform, or issued on the page that commodity details page, browse page etc. are used to show to consumer Picture.It can be taken from electric business by monitoring server real-time (such as according to certain update cycle, such as 10 minutes, 1 hour) All kinds of pictures are acquired as the candidate image on business platform, and import the picture library in the database.
Monitoring service implement body can be the server apparatus being individually made, such as:Rack, blade, tower or machine Cabinet type server apparatus can also use work station, mainframe computer etc. to have stronger computing capability hardware device;It can also The server cluster being made of multiple server apparatus.
Database is mainly used for storing picture library, can be specifically a kind of Redis databases or other kinds of distribution Formula database, relevant database etc., can be specifically include storage device data server and with data server phase Storage device even, or a kind of server set for database for being made of multiple data servers and storage server Group's system.
Electric business service platform can be specifically current on-line operation, include all kinds of service sub-systems, for online The plateform system of transaction, merchandise sales.On hardware view, electric business service platform is also specifically by a series of mutual foundation The server cluster of communication connection forms, electric business service platform specifically specific construction mode and used architecture standard, can With the common technology with reference to used in several large-scale net purchase platforms domestic at present, repeat no more in the present embodiment.
The embodiment of the present invention provides a kind of method analyzed sensitive image, as shown in Fig. 2, including:
S1, in extracted training sample set, according to the corresponding sensitive kind of each sample, to the training sample set In sample image clustered.
Include default in test order library wherein it is possible to also set up test order library in monitoring server in advance Test order, the test order specifically can by technical staff set and input monitoring server, such as:By technical staff Design some test templates, wherein a set of test template includes for test order, the correspondence set by specific application scene Training sample set and test needed for algorithm model, in order to which monitoring server is according to preset test order, from described The training sample set is extracted in sample database pointed by test order, it specifically can be according to current specific test environment certainly It is dynamic to transfer (or being operated by technical staff) test order.
In the present embodiment, sensitive kind can be understood as:For the picture of different traffic directions, represented commodity Either difference can pre-set difference to the type of article for commodity or article of difference in these pictures The sensitive kind of type, such as:For meat of poultris commodity, interior clothing commodity and family planning articles this 3 kinds of different traffic directions Commodity are sold the type of merchandize represented by the advertising pictures that the commodity of these commodity are launched and are different, sensitivity can be arranged Class1,2,3, and the advertising pictures of meat of poultris commodity are clustered to sensitive kind 1, cluster the advertising pictures of interior clothing commodity It is clustered to sensitive kind 3 to sensitive kind 2, by the advertising pictures of family planning articles.So that being taken from electric business by monitoring server Collected advertising pictures carry out classification differentiation according to sensitive kind in business platform, such as:Monitoring server is identified in Sensitive Graphs During filtering, the input of the identification model of sensitization picture be pending image url (Uniform Resoure Locator, Uniform resource locator), export the attributive classification result for pending image.
S2, the sample image clustered according to process pass through the corresponding all kinds of identification model of convolutional neural networks training.
Wherein, it can be understood as by the corresponding all kinds of identification model of convolutional neural networks training:Training is based on convolution god Currently used convolutional neural networks technology specifically may be used in identification disaggregated model through network, according to specific practical industry Business scene can respectively be built corresponding each come the identification model for building sensitization picture identification for different sensitive kinds The identification model of sensitive kind.
In the present embodiment, in the identification model of the corresponding each sensitive kind of specific training, for different types of sensitivity Type can configure different recognizers and for identification matched reference data, such as:It can be from electric business service platform Each service sub-system in extract corresponding professional knowledge rule, professional knowledge rule be used to indicate the traffic direction commodity or Some exclusive features of person's article, such as:The professional knowledge rule of meat of poultris commodity includes the type of animal limb, internal organ, And typical legend information, colouring information and the profile information of animal limb, internal organ are used for then for the commodity of sensitive kind 1 Identify that matched reference data may include the typical legend information and profile information of animal limb, internal organ, and in identification decision During, it is non-sensitive picture by the spectral discrimination for meeting these reference datas;For another example:The professional knowledge of interior clothing commodity Rule includes the shape, color and Facing material of common underwear (such as can simple process decision chart by color and luster and glossiness Which region is cloth in piece, which region is the skin of model), then it is matched for identification for the commodity of sensitive kind 2 Reference data may include shape, color, Facing material, and during identification decision, and will meet these reference datas And sensitive organ is wherein not present, and (traditional discriminant approach may be used in the judgement of sensitive organ, for example public security organ uses Obscene picture recognition means) spectral discrimination be non-sensitive picture
S3, the identification model obtained using training identify corresponding all kinds of sensitization picture from picture library to be detected.
Wherein, include the picture that is uploaded of trade company of electric business service platform in picture library to be detected, by monitoring service Device executes the flow of the present embodiment, to identify whether the picture that the trade company of electric business service platform is uploaded belongs to corresponding all kinds of Sensitization picture.
According to the regulation of new Advertising Law, monitoring and the pipe of the advertising pictures for reinforcing being issued on electric business service platform are needed Reason.But manually inspection processing is carried out by user feedback and visually at present, that there are efficiency is low, risk is big and heavy workload etc. Problem, therefore, it is necessary to the monitoring servers in through this embodiment to carry out automatic identification, in order to be able in time to electric business service The operator of platform and businessman send out warning.
It provides a kind of during filtering is identified for the sensitization picture under electric business business, identifies the calculation of filtering The training sample of method and the specific method of filtering.Different from the side of conventional method hand-designed color shapes textures characteristics of image Method, present invention employs convolutional neural networks, reduce the cost of labor of hand-designed feature.It is provided in an embodiment of the present invention right The method that sensitive image is analyzed, by being clustered to the sample image in the training sample set, further according to process The sample image of cluster, by the corresponding all kinds of identification model of convolutional neural networks training, the identification obtained later using training Model identifies corresponding all kinds of sensitization picture, to identify the institute of trade company of electric business service platform from picture library to be detected Whether the picture of upload belongs to corresponding all kinds of sensitization picture.Realize for trade company upload electric business service platform picture from Dynamic detection, scanning, improve the automatization level for advertising pictures recognition detection, reduce manual examination and verification cost.
Specifically, described carry out the sample image in the training sample set according to the corresponding sensitive kind of each sample Cluster, including:
By preset neural network model, the sensitive features of each sample image are extracted from the training sample set. Again by preset clustering algorithm, the sample image that the similarity degree of sensitive features is met to test order is clustered to the same sample This subclass.
Wherein, the preset neural network model is by imagenet (in a kind of computer vision system identification project For image recognition training database) be trained.Such as:The neural network model trained using imagenet, then profit The sensitive features of candidate image are extracted with the neural network model trained.To be had using preset clustering algorithm later similar A subset conjunction is merged into the extraction of sensitive features.
Further comprise the concrete mode be filtered, screen, arranged for being closed respectively for each sample set, It is combined into example with one of sample set, in a sample set closes:
According at a distance from cluster centre, by the sample image in the subclass by closely sorting to remote, and sequence is chosen Preceding specified digit sample image as positive sample.Utilizing obtained positive sample training pattern grader.Pass through later The trained model classifiers, the sample image in being closed to the sample set carries out classified calculating, and will calculate The score value arrived is rejected less than the sample image of pre-determined threshold.Such as:Candidate collection in each subclass, using apart from cluster centre For 100 nearest samples as positive sample, (for example the classification based on one class svm may be used in training pattern grader Device);Grader classified calculating is all used to every figure in subclass, according to classification results, by the lower non-class image of score It weeds out.
Optionally, one kind is also provided in the present embodiment for the corrected concrete mode of sample set, including:
Utilize extracted pre-training data set, the depth residual error network of the specified number of plies of training, the specified number of plies >=50. The depth residual error network obtained later by training corrects the sample set and closes.Such as:The public affairs held using ImageNet The depth residual error network that 1000 class Classification and Identification data sets used in match train 50 layers as pre-training data set is opened, it will The parameter for the model that pre-training obtains is finely adjusted using the sample set conjunction above with filtering, screening, arrangement processing, from And it avoids, due to the very few caused over-fitting of Sensitive Graphs training data, also avoiding feature extraction complicated in tional identification algorithm Step.
For example:As shown in Figure 4, the design for the specific cellular construction of depth residual error network, if depth network In certain hidden layer be H (x)-x → F (x), if it can be assumed that multiple non-linear layers combination can be similar to a complicated function, It so similarly assume that the residual error of hidden layer is similar to some complicated function.So hidden layer can be expressed as H (x) =F (x)+x.So a kind of completely new residual error structural unit is obtained, the output of residual unit is cascade by multiple convolutional layers It exports and is added between input element (to ensure that convolutional layer output is identical with input element dimension), after being activated using ReLU It arrives.This structure is cascaded up, depth residual error network has just been obtained.
Further, the present embodiment additionally provides a kind of mode advanced optimizing supervision recognition result, specifically includes:
All kinds of sensitization pictures is corresponded in the picture library to be detected to be identified, and after obtaining recognition result, Difficult example sample is extracted from the recognition result.And the parameter of all kinds of identification models is corresponded to according to the difficult example Sample Refreshment. To optimize convolutional neural networks parameter using difficult example, enhance the recognition capability of algorithm model.
Wherein, the difficult example sample of the extraction from the recognition result, including:
Obtain the score value of each attribute in sensitization picture, wherein the score value of each attribute passes through institute in the sensitization picture Identification model is stated to be calculated.
According to the sequence of score value from large to small, sort to each attribute of acquired sensitization picture.Wherein, sensitization picture Attribute can be understood as:Information associated with the image data of sensitization picture, such as:Title, source station address, the date, The information such as resolution ratio, size, tag along sort, these associated information are added usually as the attribute information of image data in image In data.
The additive value to sort in the score value of the attribute of preceding specified digit is obtained, when the additive value is more than preset confidence When spending threshold value, judgement is as the difficult example sample.Such as:The attribute of a sensitization picture includes title, source station address, day Phase, resolution ratio, size, tag along sort ... wait 10 attribute, calculate this 10 attribute by the identification model, obtain score value Maximum preceding 3 attribute is:Source station address (score value 0.4), title (score value 0.3) and tag along sort (score value 0.1), And confidence threshold value is 0.7, then the score value 0.8 of source station address+title+tag along sort is more than 0.7, judges that this is a sensitive The difficult example sample of picture.
Wherein it is possible to enhance algorithm recognition capability by Progressive CNN, with specific reference to the difficult example of detection as a result, It adds it in convolutional neural networks training data, enhances these effects of difficult example in the sample, to promote identification model For the recognition capability of these violation pictures being difficult to differentiate between.
Through this embodiment, the master map, details figure and solarization free hand drawing for uploading electric business service platform for trade company is realized to occur Obscene violation picture carry out automatic detection scanning, especially improve the intelligent level for advertising pictures management, reduce Manual examination and verification cost finally also reduces management platform risk.
And the different from the past simple dichotomy for being divided into Sensitive Graphs and non-sensitive figure, the present invention are directed to the possibility field of electric business If violation image is divided into Ganlei's (sensitive kind) by scape, increases the specific aim for special category picture recognition, improve simultaneously Recognition accuracy.Such as:In actual test, trade company's management platform in electric business service platform increases newly daily at present to be uploaded 2000000 images, it is huge by manual examination and verification cost, needed for 100 working hours man day.After the present embodiment, need daily artificial The picture number further verified is reduced within 500, reduces by 4000 times of cost of labor, while reducing artificial participation, is reduced The risk that maloperation is brought.
The embodiment of the present invention also provides a kind of device analyzed sensitive image as shown in Figure 5, and the device is specific It may operate on monitoring server as shown in Figure 1, which includes:
Cluster module is used in extracted training sample set, according to the corresponding sensitive kind of each sample, to the instruction The sample image practiced in sample set is clustered;
Training module, for according to the sample image by cluster, passing through the corresponding all kinds of knowledge of convolutional neural networks training Other model;
Analysis module, the identification model for being obtained using training identify that correspondence is all kinds of from picture library to be detected Sensitization picture.
Wherein, the cluster module is specifically used for by preset neural network model, from the training sample set Extract the sensitive features of each sample image;And by preset clustering algorithm, the similarity degree of sensitive features is met into test rule Sample image then is clustered to the same sample set and is closed;Wherein, the preset neural network model is instructed by imagenet Practice.
Further, as shown in fig. 6, further including:Filtering module, used in being closed in a sample set:According to cluster The distance at center by the sample image in the subclass by closely sorting to remote, and chooses the sample to sort in preceding specified digit Image is as positive sample;Recycle obtained positive sample training pattern grader;Pass through the trained model later Grader, the sample image in being closed to the sample set carries out classified calculating, and the score value being calculated is less than pre- gating The sample image of limit is rejected;
Further include:Correction module, for utilizing extracted pre-training data set, the depth residual error net of the specified number of plies of training Network, the specified number of plies >=50;And the depth residual error network obtained by training, it corrects the sample set and closes.
Further, all kinds of quick for being corresponded in the picture library to be detected as shown in fig. 7, update module Sense picture is identified, and after obtaining recognition result, obtains the score value of each attribute in sensitization picture, wherein the Sensitive Graphs The score value of each attribute is calculated by the identification model in piece;And the sequence according to score value from large to small, to being obtained Each attribute of the sensitization picture taken sorts;
The additive value to sort in the score value of the attribute of preceding specified digit is obtained, when the additive value is more than preset confidence When spending threshold value, judgement is as the difficult example sample.;And the ginseng of all kinds of identification models is corresponded to according to the difficult example Sample Refreshment Number.
The device provided in an embodiment of the present invention that sensitive image is analyzed, by the training sample set Sample image is clustered, and further according to the sample image by cluster, passes through the corresponding all kinds of identification of convolutional neural networks training Model, the identification model obtained later using training identify corresponding all kinds of sensitization picture from picture library to be detected, to Identify whether the picture that the trade company of electric business service platform is uploaded belongs to corresponding all kinds of sensitization picture.It realizes for trade company Automatic detection, the scanning for uploading the picture of electric business service platform, improve the automatization level for advertising pictures recognition detection, Reduce manual examination and verification cost.
Each embodiment in this specification is described in a progressive manner, identical similar portion between each embodiment Point just to refer each other, and each embodiment focuses on the differences from other embodiments.Especially for equipment reality For applying example, since it is substantially similar to the method embodiment, so describing fairly simple, related place is referring to embodiment of the method Part explanation.The above description is merely a specific embodiment, but protection scope of the present invention is not limited to This, any one skilled in the art in the technical scope disclosed by the present invention, the variation that can readily occur in or replaces It changes, should be covered by the protection scope of the present invention.Therefore, protection scope of the present invention should be with the protection model of claim Subject to enclosing.

Claims (11)

1. a kind of method analyzed sensitive image, which is characterized in that including:
In extracted training sample set, according to the corresponding sensitive kind of each sample, to the sample in the training sample set This image is clustered;
According to the sample image by cluster, pass through the corresponding all kinds of identification model of convolutional neural networks training;
The identification model obtained using training identifies corresponding all kinds of sensitization picture from picture library to be detected.
2. according to the method described in claim 1, it is characterized in that, it is described according to the corresponding sensitive kind of each sample to the instruction The sample image practiced in sample set is clustered, including:
By preset neural network model, the sensitive features of each sample image are extracted from the training sample set, wherein The preset neural network model is trained by imagenet;
By preset clustering algorithm, the sample image that the similarity degree of sensitive features is met to test order is clustered to same Sample set closes.
3. according to the method described in claim 2, it is characterized in that, further including:
In a sample set closes:
According at a distance from cluster centre, by the sample image in the subclass by closely sorting to remote, and sequence is chosen preceding The sample image of specified digit is as positive sample;
Utilize obtained positive sample training pattern grader;
By the trained model classifiers, the sample image in being closed to the sample set carries out classified calculating, and Sample image by the score value being calculated less than pre-determined threshold is rejected.
4. according to the method described in claim 2, it is characterized in that, further including:
Utilize extracted pre-training data set, the depth residual error network of the specified number of plies of training, the specified number of plies >=50;
The depth residual error network obtained by training corrects the sample set and closes.
5. according to the method described in claim 1, it is characterized in that, further including:
All kinds of sensitization pictures is corresponded in the picture library to be detected to be identified, and after obtaining recognition result, from institute It states and extracts difficult example sample in recognition result;
The parameter of all kinds of identification models is corresponded to according to the difficult example Sample Refreshment.
6. according to the method described in claim 5, it is characterized in that, described extract difficult example sample, packet from the recognition result It includes:
Obtain the score value of each attribute in sensitization picture, wherein the score value of each attribute passes through the knowledge in the sensitization picture Other model is calculated;According to the sequence of score value from large to small, sort to each attribute of acquired sensitization picture;
The additive value to sort in the score value of the attribute of preceding specified digit is obtained, when the additive value is more than preset confidence level threshold When value, judgement is as the difficult example sample.
7. according to the method described in claim 1, it is characterized in that, further including:
According to preset business rule, candidate image is acquired from electric business service platform, acquired candidate image is utilized to update institute State picture library;
And/or according to preset test order, the training sample set is extracted from the sample database pointed by the test order It closes.
8. a kind of device analyzed sensitive image, which is characterized in that including:
Cluster module is used in extracted training sample set, according to the corresponding sensitive kind of each sample, to the trained sample Sample image in this set is clustered;
Training module, for according to the sample image by cluster, passing through the corresponding all kinds of identification mould of convolutional neural networks training Type;
Analysis module, the identification model for being obtained using training identify corresponding all kinds of sensitivity from picture library to be detected Picture.
9. according to the method described in claim 8, it is characterized in that, the cluster module, is specifically used for passing through preset nerve Network model extracts the sensitive features of each sample image from the training sample set;It, will and by preset clustering algorithm The sample image that the similarity degree of sensitive features meets test order is clustered to the conjunction of the same sample set;Wherein, described default Neural network model trained by imagenet.
10. according to the method described in claim 8, it is characterized in that, further including:Filtering module, in a sample set In conjunction:According at a distance from cluster centre, by the sample image in the subclass by closely sorting to remote, and sequence is chosen preceding The sample image of specified digit is as positive sample;Recycle obtained positive sample training pattern grader;Pass through later by Trained model classifiers, the sample image in being closed to the sample set carries out classified calculating, and will be calculated Score value is rejected less than the sample image of pre-determined threshold;
Further include:Correction module, for utilizing extracted pre-training data set, training to specify the depth residual error network of the number of plies, The specified number of plies >=50;And the depth residual error network obtained by training, it corrects the sample set and closes.
11. according to the method described in claim 8, it is characterized in that, further including:
Update module is identified for corresponding to all kinds of sensitization pictures in the picture library to be detected, and is known After other result, the score value of each attribute in sensitization picture is obtained, wherein the score value of each attribute passes through institute in the sensitization picture Identification model is stated to be calculated;And the sequence according to score value from large to small, it sorts to each attribute of acquired sensitization picture; And the additive value to sort in the score value of the attribute of preceding specified digit is obtained, when the additive value is more than preset confidence threshold value When, judgement is as the difficult example sample.The parameter of all kinds of identification models is corresponded to further according to the difficult example Sample Refreshment.
CN201710248908.8A 2017-04-17 2017-04-17 Method and device for analyzing sensitive image Active CN108734184B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710248908.8A CN108734184B (en) 2017-04-17 2017-04-17 Method and device for analyzing sensitive image

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710248908.8A CN108734184B (en) 2017-04-17 2017-04-17 Method and device for analyzing sensitive image

Publications (2)

Publication Number Publication Date
CN108734184A true CN108734184A (en) 2018-11-02
CN108734184B CN108734184B (en) 2022-06-07

Family

ID=63923944

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710248908.8A Active CN108734184B (en) 2017-04-17 2017-04-17 Method and device for analyzing sensitive image

Country Status (1)

Country Link
CN (1) CN108734184B (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109829069A (en) * 2018-12-28 2019-05-31 广州华多网络科技有限公司 Image audit processing method, device, electronic equipment and storage medium
CN109831699A (en) * 2018-12-28 2019-05-31 广州华多网络科技有限公司 Image audit processing method, device, electronic equipment and storage medium
CN109919170A (en) * 2018-11-29 2019-06-21 阿里巴巴集团控股有限公司 Change evaluation method, apparatus, electronic equipment and computer readable storage medium
CN110110982A (en) * 2019-04-26 2019-08-09 特赞(上海)信息科技有限公司 The checking method and device of intention material
CN110210356A (en) * 2019-05-24 2019-09-06 厦门美柚信息科技有限公司 A kind of picture discrimination method, apparatus and system
CN110222846A (en) * 2019-05-13 2019-09-10 中国科学院计算技术研究所 A kind of the information safety protection method and information security system of Internet terminal
CN110456955A (en) * 2019-08-01 2019-11-15 腾讯科技(深圳)有限公司 Exposure dress ornament detection method, device, system, equipment and storage medium
CN111144399A (en) * 2018-11-06 2020-05-12 富士通株式会社 Apparatus and method for processing image
CN111311316A (en) * 2020-02-03 2020-06-19 支付宝(杭州)信息技术有限公司 Method and device for depicting merchant portrait, electronic equipment, verification method and system
CN111626778A (en) * 2020-05-25 2020-09-04 陶乐仪 Advertisement pushing system and method
CN111726648A (en) * 2020-06-28 2020-09-29 百度在线网络技术(北京)有限公司 Method, device and equipment for detecting image data and computer readable storage medium
CN114004645A (en) * 2021-10-29 2022-02-01 浙江省民营经济发展中心(浙江省广告监测中心) Fuse media advertisement wisdom monitoring platform and electronic equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060095202A1 (en) * 2004-11-01 2006-05-04 Hitachi, Ltd. Method of delivering difference map data
CN103679132A (en) * 2013-07-15 2014-03-26 北京工业大学 A sensitive image identification method and a system
CN106228185A (en) * 2016-07-20 2016-12-14 武汉盈力科技有限公司 A kind of general image classifying and identifying system based on neutral net and method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060095202A1 (en) * 2004-11-01 2006-05-04 Hitachi, Ltd. Method of delivering difference map data
CN103679132A (en) * 2013-07-15 2014-03-26 北京工业大学 A sensitive image identification method and a system
CN106228185A (en) * 2016-07-20 2016-12-14 武汉盈力科技有限公司 A kind of general image classifying and identifying system based on neutral net and method

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111144399B (en) * 2018-11-06 2024-03-05 富士通株式会社 Apparatus and method for processing image
CN111144399A (en) * 2018-11-06 2020-05-12 富士通株式会社 Apparatus and method for processing image
CN109919170A (en) * 2018-11-29 2019-06-21 阿里巴巴集团控股有限公司 Change evaluation method, apparatus, electronic equipment and computer readable storage medium
CN109919170B (en) * 2018-11-29 2023-12-05 创新先进技术有限公司 Change evaluation method, change evaluation device, electronic device and computer-readable storage medium
CN109831699A (en) * 2018-12-28 2019-05-31 广州华多网络科技有限公司 Image audit processing method, device, electronic equipment and storage medium
CN109829069A (en) * 2018-12-28 2019-05-31 广州华多网络科技有限公司 Image audit processing method, device, electronic equipment and storage medium
CN109831699B (en) * 2018-12-28 2021-07-20 广州华多网络科技有限公司 Image auditing processing method and device, electronic equipment and storage medium
CN110110982A (en) * 2019-04-26 2019-08-09 特赞(上海)信息科技有限公司 The checking method and device of intention material
CN110222846B (en) * 2019-05-13 2021-07-20 中国科学院计算技术研究所 Information security method and information security system for internet terminal
CN110222846A (en) * 2019-05-13 2019-09-10 中国科学院计算技术研究所 A kind of the information safety protection method and information security system of Internet terminal
CN110210356A (en) * 2019-05-24 2019-09-06 厦门美柚信息科技有限公司 A kind of picture discrimination method, apparatus and system
CN110456955B (en) * 2019-08-01 2022-03-29 腾讯科技(深圳)有限公司 Exposed clothing detection method, device, system, equipment and storage medium
CN110456955A (en) * 2019-08-01 2019-11-15 腾讯科技(深圳)有限公司 Exposure dress ornament detection method, device, system, equipment and storage medium
CN111311316B (en) * 2020-02-03 2023-05-23 支付宝(杭州)信息技术有限公司 Method and device for depicting merchant portrait, electronic equipment, verification method and system
CN111311316A (en) * 2020-02-03 2020-06-19 支付宝(杭州)信息技术有限公司 Method and device for depicting merchant portrait, electronic equipment, verification method and system
CN111626778A (en) * 2020-05-25 2020-09-04 陶乐仪 Advertisement pushing system and method
CN111726648A (en) * 2020-06-28 2020-09-29 百度在线网络技术(北京)有限公司 Method, device and equipment for detecting image data and computer readable storage medium
CN114004645A (en) * 2021-10-29 2022-02-01 浙江省民营经济发展中心(浙江省广告监测中心) Fuse media advertisement wisdom monitoring platform and electronic equipment

Also Published As

Publication number Publication date
CN108734184B (en) 2022-06-07

Similar Documents

Publication Publication Date Title
CN108734184A (en) A kind of method and device that sensitive image is analyzed
CN105303150B (en) Realize the method and system of image procossing
Narihira et al. Learning lightness from human judgement on relative reflectance
CN106408030B (en) SAR image classification method based on middle layer semantic attribute and convolutional neural networks
CN108171184A (en) Method for distinguishing is known based on Siamese networks again for pedestrian
Jing et al. Yarn-dyed fabric defect classification based on convolutional neural network
CN104866831B (en) The face recognition algorithms of characteristic weighing
CN104915673B (en) A kind of objective classification method and system of view-based access control model bag of words
CN103119607A (en) Optimization of human activity determination from video
CN109670546B (en) Commodity matching and quantity regression recognition algorithm based on preset template
CN108647702B (en) Large-scale food material image classification method based on transfer learning
CN110287856A (en) A kind of security personnel's behavior analysis system, method and device
CN110163722A (en) The big data analysis system and analysis method precisely sold for agricultural product
CN107622071B (en) Clothes image retrieval system and method under non-source-retrieval condition through indirect correlation feedback
CN109712324A (en) A kind of automatic vending machine image-recognizing method, good selling method and vending equipment
CN104573699A (en) Trypetid identification method based on medium field intensity magnetic resonance dissection imaging
Zhang et al. A review on cluster estimation methods and their application to neural spike data
Siddalingappa et al. Anomaly detection on medical images using autoencoder and convolutional neural network
CN107944373A (en) A kind of video anomaly detection method based on deep learning
Wei et al. (Retracted) Image analysis and pattern recognition method of three-dimensional process in physical education teaching based on big data
CN109214467B (en) Transformer operator dressing robust identification method considering classifier output sensitivity
CN104573745A (en) Fruit fly classification method based on magnetic resonance imaging
CN110414792A (en) Component centralized procurement management system and Related product based on BIM and big data
Zhang et al. Colour‐patterned fabric‐defect detection using unsupervised and memorial defect‐free features
Shermila et al. DEEPFIC: Food item classification with calorie calculation using dragonfly deep learning network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 210000, 1-5 story, Jinshan building, 8 Shanxi Road, Nanjing, Jiangsu.

Applicant after: SUNING.COM Co.,Ltd.

Address before: 210042 Suning Headquarters, No. 1 Suning Avenue, Xuanwu District, Nanjing City, Jiangsu Province

Applicant before: SUNING COMMERCE GROUP Co.,Ltd.

CB02 Change of applicant information
GR01 Patent grant
GR01 Patent grant