CN107886162A

CN107886162A - A kind of deformable convolution kernel method based on WGAN models

Info

Publication number: CN107886162A
Application number: CN201711123711.8A
Authority: CN
Inventors: 周智恒; 李立军; 胥静; 朱湘军; 李利苹; 汪壮雄
Original assignee: Guangzhou Gvs Intelligent Technology Co Ltd; Guangzhou Video-Star Intelligent Ltd By Share Ltd; South China University of Technology SCUT
Current assignee: Guangzhou Gvs Intelligent Technology Co Ltd; Guangzhou Video-Star Intelligent Ltd By Share Ltd; South China University of Technology SCUT
Priority date: 2017-11-14
Filing date: 2017-11-14
Publication date: 2018-04-06

Abstract

The invention discloses a kind of deformable convolution kernel method based on WGAN models, belong to deep learning field of neural networks, comprise the following steps：S1, construction are originally generated confrontation network model；S2, construction Wo Sesitan distances, the judging quota as confrontation network model；S3, initialization random noise, are inputted in maker；S4, carry out convolution using deformable convolution collecting image in WGAN models；S5, loss function that deformable convolution operation obtains input maker subsequently trained.The deformable convolution kernel method based on WGAN models that the present invention is built, change arbiter, maker receives the convolution mode after picture, arbiter, maker is allowed automatically to change the size of convolution kernel according to the situation of training, so as to adaptively learn to the feature of data images, the robustness of whole network training is improved.

Description

A kind of deformable convolution kernel method based on WGAN models

Technical field

The present invention relates to deep learning field of neural networks, and in particular to a kind of deformable convolution kernel based on WGAN models Method.

Background technology

Production confrontation network (Generative Adversarial Network, abbreviation GAN) is by Goodfellow In the deep learning framework that 2014 propose, it is based on the thought of " game theory ", construction maker (generator) and arbiter (discriminator) two kinds of models, the former generates image by the Uniform noise or gaussian random noise for inputting (0,1), after Person differentiates to the image of input, it is determined that being the image from data set or the image as caused by maker.

In traditional confrontation network model, do not have unified judgment criteria for maker generation picture quality, because This, it would be highly desirable to a kind of judging quota by the use of Wo Sesitan distances as generation confrontation network is proposed, so that the instruction of whole model White silk can learn the method for characteristics of image furthermore with deformable convolution, improve whole network toward progress is correctly oriented Training effectiveness.

The content of the invention

The invention aims to solve drawbacks described above of the prior art, there is provided a kind of based on the variable of WGAN models Shape convolution kernel method.

The purpose of the present invention can be reached by adopting the following technical scheme that：

A kind of deformable convolution kernel method based on WGAN models, the deformable convolution kernel method comprise the following steps：

S1, construction are originally generated confrontation network model, and generating image by maker inputs to arbiter progress network instruction Practice；

S2, construction Wo Sesitan distances, the judging quota as confrontation network model；

In the network model that the present invention relates to, the judge by the use of Wo Sesitan distances as generation confrontation network refers to Mark, so that the training of whole model past can be correctly oriented progress.

S3, initialization random noise, are inputted in maker；

S4, carry out convolution using deformable convolution collecting image in WGAN models；

In original generation confrontation network model, the shape of convolution kernel is generally square, and which has limited neutral net pair The free degree of characteristics of image study, and in the present invention, for this defect, the shape of convolution kernel is carried out using network training Adaptively change, so as to the feature of higher efficiency learning image into data set.

S5, loss function that deformable convolution operation obtains input maker subsequently trained.

Further, described step S2 is specific as follows：

Multiple convolution kernels are constructed, different convolution kernels, are represent during study, can be learnt to different images Feature.

Further, convolution, specific mistake are carried out using deformable convolution collecting image in WGAN in described step S4 Journey is as follows：

S41, the multiple different numerical value of construction but size identical convolution kernel；

S42, using the convolution kernel constructed, convolution is carried out to multiple images of maker generation respectively, it is more so as to obtain Open characteristic pattern.

Further, in described step S5, the loss function input maker that deformable convolution operation is obtained is carried out Follow-up training.Detailed process is as follows：

S51, the characteristic pattern after convolution in S4, input arbiter are differentiated；

S52, loss function that deformable convolution operation obtains input maker subsequently trained.

S53, the average of all loss functions is inputted and continues to be trained into maker.

The present invention is had the following advantages relative to prior art and effect：

Robustness：The present invention sets according to the operating process of deformable convolution and constructs multiple deformable convolution kernels, pass through The mode of convolution kernel size is dynamically changed in the training process, is applied and is being served as maker with sentencing with depth convolutional neural networks In the confrontation network model of other device, while the judging quota by the use of Wo Sesitan distances as generation confrontation network, so that whole The training of individual model past can be correctly oriented progress.

Brief description of the drawings

Fig. 1 is the training flow chart of the deformable convolution kernel method based on WGAN models disclosed in the present invention；

Fig. 2 is the schematic diagram for being transformed into deformable convolution kernel in the present invention to original convolution core.

Embodiment

To make the purpose, technical scheme and advantage of the embodiment of the present invention clearer, below in conjunction with the embodiment of the present invention In accompanying drawing, the technical scheme in the embodiment of the present invention is clearly and completely described, it is clear that described embodiment is Part of the embodiment of the present invention, rather than whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art The every other embodiment obtained under the premise of creative work is not made, belongs to the scope of protection of the invention.

Embodiment

Present embodiment discloses a kind of deformable convolution kernel method based on WGAN models, the following steps are specifically included：

Step S1, construction is originally generated confrontation network model, and maker is inputted to arbiter by generating image and carries out net Network training.

Step S2, Wo Sesitan distances are constructed, the judging quota as confrontation network model；

Different convolution kernels, it is embodied in difference, the difference of ranks number of matrix numerical value.

Multiple convolution kernels are constructed, during image is handled, different convolution kernels is meant in network training Different characteristic of the process learning to generation image.

In the model of tradition confrontation network, the convolution kernel used in arbiter and maker is all fixed size and numerical value Consistent, training effectiveness in this case is relatively low, and the characteristics of image scope learnt is relatively small.And at this In invention, using deformable convolution, the operation of " 0 " is interleave in being carried out to original convolution core, can be learned so as to increase convolution kernel The characteristic range practised, further increase the efficiency of whole network study.

In actual applications, it should which according to the complexity of data images feature, the number of convolution kernel is set.

Step S3, random noise is initialized, is inputted in maker.

Step S4, convolution is carried out in WGAN models using deformable convolution collecting image.

Specific method is as follows：

S42, the error by anti-pass in network training process, adaptive change is carried out to the shape of convolution kernel.

Step S5, the loss function input maker that deformable convolution operation obtains subsequently is trained.Detailed process It is as follows：

S51, by the characteristic pattern after convolution in step S4, input arbiter is differentiated；

The effect of loss function is to weigh the ability that arbiter is judged generation image.The value of loss function is smaller, explanation In current iteration, arbiter can have the generation image of preferable performance discrimination maker；Property that is on the contrary then illustrating arbiter Can be poor.

The expression formula of loss function is：

Wherein, D (x) represents differentiation of the arbiter to image, and pr represents the distribution of data images, and pg represents generation image Distribution, λ is hyper parameter,For gradient, E is the functional symbol for taking average.

In summary, present embodiment discloses a kind of deformable convolution kernel method based on WGAN models, compared to tradition Original confrontation network model, change arbiter receive picture after the mode learnt to characteristics of image.It is right in tradition In the model of anti-network, arbiter is all fixed size with the convolution kernel used in maker and numerical value is consistent, in this feelings Training effectiveness under condition is relatively low, and the characteristics of image scope learnt is relatively small.And in the present invention, utilization is variable Shape convolution, the effect learnt according to network in the training process to characteristics of image, the size of convolution kernel is dynamically changed, so as to increase Big convolution kernel can learn the adaptivity of scope, further increase the efficiency that whole network learns.

Above-described embodiment is the preferable embodiment of the present invention, but embodiments of the present invention are not by above-described embodiment Limitation, other any Spirit Essences without departing from the present invention with made under principle change, modification, replacement, combine, simplification, Equivalent substitute mode is should be, is included within protection scope of the present invention.

Claims

A kind of 1. deformable convolution kernel method based on WGAN models, it is characterised in that described deformable convolution kernel method bag Include the following steps：

S1, construction are originally generated confrontation network model, and generating image by maker inputs to arbiter progress network training；

S2, construction Wo Sesitan distances, the judging quota as confrontation network model；

S3, initialization random noise, are inputted in maker；

S4, carry out convolution using deformable convolution collecting image in WGAN models；

S5, loss function that deformable convolution operation obtains input maker subsequently trained.
2. a kind of deformable convolution kernel method based on WGAN models according to claim 1, it is characterised in that described Step S4 detailed processes are as follows：

S41, the multiple different numerical value of construction but size identical convolution kernel；

S42, the error by anti-pass in network training process, adaptive change is carried out to the shape of convolution kernel.
3. a kind of deformable convolution kernel method based on WGAN models according to claim 1, it is characterised in that described Step S5 detailed processes are as follows：

S51, the characteristics of image figure that will be obtained after deformable convolution, input in arbiter and are differentiated；

S52, loss function that deformable convolution operation obtains input maker subsequently trained；

S53, the average of all loss functions is inputted and continues to be trained into maker.
4. a kind of deformable convolution kernel method based on WGAN models according to claim 1, it is characterised in that described The expression formula of loss function is：

<mrow> <mi>L</mi> <mrow> <mo>(</mo> <mi>D</mi> <mo>)</mo> </mrow> <mo>=</mo> <mo>-</mo> <msub> <mi>E</mi> <mrow> <mi>x</mi> <mo>~</mo> <mi>p</mi> <mi>r</mi> </mrow> </msub> <mo>&lsqb;</mo> <mi>D</mi> <mrow> <mo>(</mo> <mi>x</mi> <mo>)</mo> </mrow> <mo>&rsqb;</mo> <mo>+</mo> <msub> <mi>E</mi> <mrow> <mi>x</mi> <mo>~</mo> <mi>p</mi> <mi>g</mi> </mrow> </msub> <mo>&lsqb;</mo> <mi>D</mi> <mrow> <mo>(</mo> <mi>x</mi> <mo>)</mo> </mrow> <mo>&rsqb;</mo> <mo>+</mo> <msub> <mi>&lambda;E</mi> <mrow> <mi>x</mi> <mo>~</mo> <mi>X</mi> </mrow> </msub> <msub> <mo>&dtri;</mo> <mi>x</mi> </msub> </mrow>

Wherein, D (x) represents differentiation of the arbiter to image, and pr represents the distribution of data images, and pg represents point of generation image Cloth, λ are hyper parameter,For gradient, E is the functional symbol for taking average.