CN110852424B - Processing method and device for a generative adversarial network - Google Patents

Processing method and device for a generative adversarial network

Info

Publication number
CN110852424B
CN110852424B CN201911122032.8A
Authority
CN
China
Prior art keywords
matrix
tensors
generation network
countermeasure generation
parameter
Prior art date
Legal status
Active
Application number
CN201911122032.8A
Other languages
Chinese (zh)
Other versions
CN110852424A (en)
Inventor
曾俊桦
赵启斌
周郭许
Current Assignee
Guangdong University of Technology
Original Assignee
Guangdong University of Technology
Priority date
Filing date
Publication date
Application filed by Guangdong University of Technology filed Critical Guangdong University of Technology
Priority to CN201911122032.8A priority Critical patent/CN110852424B/en
Publication of CN110852424A publication Critical patent/CN110852424A/en
Application granted granted Critical
Publication of CN110852424B publication Critical patent/CN110852424B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y04INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
    • Y04SSYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
    • Y04S10/00Systems supporting electrical power generation, transmission or distribution
    • Y04S10/50Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The application discloses a processing method and device for a generative adversarial network (GAN). The method comprises the following steps: converting a parameter matrix of a fully-connected layer in a preset GAN into a tensor to obtain a parameter tensor; performing tensor ring decomposition on the parameter tensor to obtain a plurality of kernel tensors; converting the kernel tensors into a matrix to obtain a kernel matrix; and replacing the parameter matrix of the fully-connected layer in the preset GAN with the kernel matrix to obtain a new GAN. This solves the technical problem that existing GANs have many parameters and a high computation cost, and therefore place high demands on hardware.

Description

Processing method and device for a generative adversarial network
Technical Field
The present disclosure relates to the field of deep learning technologies, and in particular to a method and an apparatus for processing a generative adversarial network.
Background
In recent years, convolutional neural networks have been widely applied in fields such as image classification, face recognition and object detection. As the number of layers of a convolutional neural network grows, its computational complexity and computation cost increase, and avoiding overfitting requires more training data when training a deep convolutional neural network. Collecting a large amount of training data, however, demands considerable manpower, material resources and time, and cannot be done quickly. To solve these problems, the prior art uses a generative adversarial network (Generative Adversarial Networks, GAN) to perform data augmentation.
The main task of a generative adversarial network belongs to unsupervised deep learning: automatically generating images, videos and the like that humans cannot distinguish from real ones. A GAN mainly comprises two parts, a generator and a discriminator. The generator produces realistic fake images or videos, while the discriminator judges whether a generated image or video is real. The generator and the discriminator play a game against each other: as the game proceeds, the data produced by the generator become increasingly realistic, the discriminator becomes increasingly capable of telling real from fake, and a balance is finally reached. However, the generator and the discriminator essentially consist of the convolutional layers and fully-connected layers of a deep convolutional neural network, so they suffer from many parameters and a large computation cost, which places high demands on hardware.
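As a concrete illustration of the adversarial game described above, the snippet below computes the standard non-saturating GAN losses for one batch of discriminator outputs. This is a generic sketch, not taken from the patent; the function name and sample values are invented for illustration.

```python
import numpy as np

def gan_losses(d_real, d_fake, eps=1e-12):
    """Non-saturating GAN losses for one batch.

    d_real / d_fake are discriminator probabilities in (0, 1) for real
    samples and for generator samples respectively; eps guards the log.
    """
    # Discriminator wants d_real -> 1 and d_fake -> 0.
    d_loss = -np.mean(np.log(d_real + eps) + np.log(1.0 - d_fake + eps))
    # Generator wants the discriminator to score its samples as real.
    g_loss = -np.mean(np.log(d_fake + eps))
    return d_loss, g_loss

d_loss, g_loss = gan_losses(np.array([0.9, 0.8]), np.array([0.2, 0.1]))
```

As training balances out, both players' losses settle near the equilibrium where the discriminator outputs 0.5 everywhere.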
Disclosure of Invention
The application provides a processing method and a processing apparatus for a generative adversarial network, which are used to solve the technical problem that existing GANs have many parameters and a high computation cost, and therefore place high demands on hardware.
In view of this, a first aspect of the present application provides a processing method for a generative adversarial network, comprising:
converting a parameter matrix of a fully-connected layer in a preset generative adversarial network into a tensor to obtain a parameter tensor;
performing tensor ring decomposition on the parameter tensor to obtain a plurality of kernel tensors;
converting the kernel tensors into a matrix to obtain a kernel matrix;
and replacing the parameter matrix of the fully-connected layer in the preset generative adversarial network with the kernel matrix to obtain a new generative adversarial network.
Preferably, converting the kernel tensors into a matrix to obtain a kernel matrix specifically comprises:
selecting one target kernel tensor from all the kernel tensors and performing a classical mode-2 unfolding on it to obtain a first target matrix;
merging the first target matrix with the non-target kernel tensors among all the kernel tensors to obtain a merged tensor;
performing a classical mode-2 unfolding on the merged tensor to obtain a second target matrix;
and performing a reshape operation on the second target matrix to obtain a kernel matrix of the same size as the parameter matrix of the fully-connected layer.
Preferably, merging the first target matrix with all the kernel tensors other than the target kernel tensor to obtain a merged tensor specifically comprises:
multiplying the first target matrix by the non-target kernel tensors among all the kernel tensors to obtain a merged tensor.
Preferably, converting the parameter matrix of the fully-connected layer in the preset generative adversarial network into a tensor to obtain a parameter tensor specifically comprises:
converting the parameter matrix of the fully-connected layer in the preset generative adversarial network into a tensor based on a bijection to obtain a parameter tensor.
Preferably, before converting the parameter matrix of the fully-connected layer in the preset generative adversarial network into a tensor to obtain a parameter tensor, the method further comprises:
acquiring the preset generative adversarial network.
A second aspect of the present application provides a processing apparatus for a generative adversarial network, comprising:
a first conversion module, configured to convert a parameter matrix of a fully-connected layer in a preset generative adversarial network into a tensor to obtain a parameter tensor;
a decomposition module, configured to perform tensor ring decomposition on the parameter tensor to obtain a plurality of kernel tensors;
a second conversion module, configured to convert the kernel tensors into a matrix to obtain a kernel matrix;
and a replacement module, configured to replace the parameter matrix of the fully-connected layer in the preset generative adversarial network with the kernel matrix to obtain a new generative adversarial network.
Preferably, the second conversion module specifically comprises:
a first unfolding sub-module, configured to select one target kernel tensor from all the kernel tensors and perform a classical mode-2 unfolding on it to obtain a first target matrix;
a merging sub-module, configured to merge the first target matrix with the non-target kernel tensors among all the kernel tensors to obtain a merged tensor;
a second unfolding sub-module, configured to perform a classical mode-2 unfolding on the merged tensor to obtain a second target matrix;
and a reshaping sub-module, configured to perform a reshape operation on the second target matrix to obtain a kernel matrix of the same size as the parameter matrix of the fully-connected layer.
Preferably, the merging sub-module is specifically configured to:
multiply the first target matrix by the non-target kernel tensors among all the kernel tensors to obtain a merged tensor.
Preferably, the first conversion module is specifically configured to:
convert the parameter matrix of the fully-connected layer in the preset generative adversarial network into a tensor based on a bijection to obtain a parameter tensor.
Preferably, the apparatus further comprises:
an acquisition module, configured to acquire the preset generative adversarial network.
From the above technical scheme, the application has the following advantages:
The application provides a processing method for a generative adversarial network, comprising: converting a parameter matrix of a fully-connected layer in a preset GAN into a tensor to obtain a parameter tensor; performing tensor ring decomposition on the parameter tensor to obtain a plurality of kernel tensors; converting the kernel tensors into a matrix to obtain a kernel matrix; and replacing the parameter matrix of the fully-connected layer in the preset GAN with the kernel matrix to obtain a new GAN. In this method, the parameter matrix of the fully-connected layer is raised to a higher-order parameter tensor; tensor ring decomposition of the parameter tensor yields a plurality of kernel tensors; and the kernel matrix obtained from the kernel tensors replaces the parameter matrix of the fully-connected layer, producing a new GAN. When the new GAN is trained or tested, the hardware only needs to store a small number of kernel tensors rather than the entire parameter matrix of the fully-connected layer, which greatly reduces the memory occupied, raises the running speed, and allows the new GAN to be embedded in low-end devices, thereby solving the technical problem that existing GANs have many parameters and a high computation cost and so place high demands on hardware. Moreover, since the kernel matrix replaces the parameter matrix of the fully-connected layer, the structure of the original fully-connected layer is unchanged and the performance of the network is preserved.
Drawings
FIG. 1 is a flow chart of one embodiment of a processing method for a generative adversarial network provided herein;
FIG. 2 is a flow chart of another embodiment of a processing method for a generative adversarial network provided herein;
FIG. 3 is a schematic structural diagram of an embodiment of a processing apparatus for a generative adversarial network provided herein.
Detailed Description
In order to make the present application solution better understood by those skilled in the art, the following description will clearly and completely describe the technical solution in the embodiments of the present application with reference to the accompanying drawings in the embodiments of the present application, and it is apparent that the described embodiments are only some embodiments of the present application, not all embodiments. All other embodiments, which can be made by one of ordinary skill in the art without undue burden from the present disclosure, are within the scope of the present disclosure.
For ease of understanding, referring to FIG. 1, an embodiment of the processing method for a generative adversarial network provided in the present application comprises:
Step 101: converting a parameter matrix of a fully-connected layer in a preset generative adversarial network into a tensor to obtain a parameter tensor.
Step 102: performing tensor ring decomposition on the parameter tensor to obtain a plurality of kernel tensors.
To solve the technical problem that existing GANs have many parameters and a high computation cost and so place high demands on hardware, this embodiment compresses the preset GAN by tensor ring decomposition. Most of a GAN's parameters lie in its fully-connected layers, and tensor ring decomposition is more effective on higher-order tensors; the parameter matrix of the fully-connected layer is therefore raised to a higher-order tensor, yielding the parameter tensor, and tensor ring decomposition of that parameter tensor yields a plurality of low-rank kernel tensors, whose number is preset.
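To make the storage saving of this step concrete, the arithmetic below compares a dense fully-connected weight matrix with its tensor-ring format. All sizes, the number of kernel tensors d, and the rank r are hypothetical, since the patent fixes none of them.

```python
# Illustrative sizes only: the patent does not fix layer dimensions,
# the number of kernel tensors d, or the tensor-ring ranks r_k.
dense_params = 1024 * 1024          # one 1024x1024 fully-connected weight matrix

# Factor rows and columns as 1024 = 4**5 and pair the k-th row factor with
# the k-th column factor: d = 5 modes of size m_k * n_k = 4 * 4 = 16,
# with a uniform rank r = 8.
d, mode_size, r = 5, 16, 8
tr_params = d * r * mode_size * r   # sum over k of r_k * (m_k * n_k) * r_{k+1}

compression_ratio = dense_params / tr_params
```

Under these assumed sizes the kernel tensors hold 5120 values against 1 048 576 dense weights, roughly a 205× reduction, which is the memory saving the embodiment relies on.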
Step 103: converting the kernel tensors into a matrix to obtain a kernel matrix.
Step 104: replacing the parameter matrix of the fully-connected layer in the preset generative adversarial network with the kernel matrix to obtain a new generative adversarial network.
In the prior art, a tensor-algebra method is used to compress a convolutional neural network: the operations of the network are treated as tensor operations, and a tensor-algebra operation replaces the operation of the fully-connected layer. This destroys the original fully-connected structure of the network and degrades its performance. To leave the structure of the fully-connected layer unchanged, this embodiment instead converts the obtained kernel tensors into a kernel matrix and uses that kernel matrix to replace the parameter matrix of the fully-connected layer in the preset GAN, so the original fully-connected structure is preserved and the performance of the network is guaranteed.
In the processing method provided by this embodiment, the parameter matrix of the fully-connected layer is raised to a higher-order parameter tensor; tensor ring decomposition of the parameter tensor yields a plurality of kernel tensors; and the kernel matrix obtained from the kernel tensors replaces the parameter matrix of the fully-connected layer, producing a new GAN. When the new GAN is trained or tested, the hardware only needs to store a small number of kernel tensors rather than the entire parameter matrix of the fully-connected layer, which greatly reduces the memory occupied, raises the running speed, and allows the new GAN to be embedded in low-end devices, solving the technical problem that existing GANs have many parameters and a high computation cost and so place high demands on hardware. Moreover, since the kernel matrix replaces the parameter matrix of the fully-connected layer, the structure of the original fully-connected layer is unchanged and the performance of the network is preserved.
For ease of understanding, referring to FIG. 2, another embodiment of the processing method for a generative adversarial network provided herein comprises:
Step 201: acquiring a preset generative adversarial network.
It should be noted that any of the various GANs disclosed in the prior art may be acquired as the preset GAN, or a GAN may be constructed from several convolutional layers and fully-connected layers to serve as the preset GAN; the present invention places no particular limitation on this.
Step 202: converting a parameter matrix of a fully-connected layer in the preset generative adversarial network into a tensor to obtain a parameter tensor.
To solve the technical problem that existing GANs have many parameters and a high computation cost and so place high demands on hardware, this embodiment compresses the preset GAN by tensor ring decomposition. Because most of a GAN's parameters lie in its fully-connected layers and tensor ring decomposition is more effective on higher-order tensors, the parameter matrix of the fully-connected layer is raised to a higher-order tensor, yielding the parameter tensor.
In this embodiment, a bijection is used to convert the parameter matrix of the fully-connected layer in the preset GAN into a tensor to obtain the parameter tensor. Suppose the parameter matrix of the fully-connected layer is W ∈ R^(m×n), and W(t, l) is its value at position (t, l), with t ∈ [1, m] and l ∈ [1, n]. Define bijections t ↦ (v_1(t), …, v_d(t)) and l ↦ (u_1(l), …, u_d(l)), which map the indices t and l each onto a vector whose number of elements equals d, the preset number of kernel tensors (so that m = m_1 × … × m_d and n = n_1 × … × n_d). Through these bijections, the element at any position of the parameter matrix is mapped to the element at a position of a tensor, so the parameter matrix is converted into a parameter tensor. For example, suppose W ∈ R^(8×8); the rows factor as 8 = 2×2×2 and the columns as 8 = 2×2×2, so the bijections convert the parameter matrix into a third-order tensor. Define the bijections for the indices t and l of a matrix element W(t, l) as follows: t = 1 corresponds to the vector (1, 1, 1); t = 2 to (1, 1, 2); t = 3 to (1, 2, 1); t = 4 to (1, 2, 2); t = 5 to (2, 1, 1); t = 6 to (2, 1, 2); t = 7 to (2, 2, 1); and t = 8 to (2, 2, 2). Similarly, l = 1 corresponds to (1, 1, 1); l = 2 to (1, 1, 2); l = 3 to (1, 2, 1); l = 4 to (1, 2, 2); l = 5 to (2, 1, 1); l = 6 to (2, 1, 2); l = 7 to (2, 2, 1); and l = 8 to (2, 2, 2). Thus, when the parameter matrix W takes the element at (2, 5), the two vectors (1, 1, 2) and (2, 1, 1) are obtained from this correspondence. That is,
W(t, l) = 𝒲((v_1(t), u_1(l)), …, (v_k(t), u_k(l)), …, (v_d(t), u_d(l))), so W can be converted into the parameter tensor 𝒲.
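In implementation terms, the bijection above is a reshape followed by an interleaving transpose. The sketch below (plain NumPy, not from the patent) builds the third-order tensor for the 8×8 example and checks the mapping for the element at one-based position (2, 5), i.e. zero-based (1, 4):

```python
import numpy as np

# Toy 8x8 weight matrix; values are arbitrary, used only to check the mapping.
W = np.arange(64, dtype=float).reshape(8, 8)

# Factor rows 8 = 2*2*2 and columns 8 = 2*2*2, then pair the k-th row factor
# with the k-th column factor, giving a third-order tensor of shape (4, 4, 4).
T = (W.reshape(2, 2, 2, 2, 2, 2)        # axes (t1, t2, t3, l1, l2, l3)
       .transpose(0, 3, 1, 4, 2, 5)     # axes (t1, l1, t2, l2, t3, l3)
       .reshape(4, 4, 4))

# One-based t = 2 maps to (1, 1, 2) and l = 5 maps to (2, 1, 1); zero-based
# that is (0, 0, 1) and (1, 0, 0), so the paired tensor index is
# (0*2+1, 0*2+0, 1*2+0) = (1, 0, 2).
assert T[1, 0, 2] == W[1, 4]
```

The inverse of this reshape/transpose pair recovers the matrix, which is what the later reshape-back step relies on.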
Step 203: performing tensor ring decomposition on the parameter tensor to obtain a plurality of kernel tensors.
It should be noted that the ranks of the tensor ring decomposition and the number of kernel tensors are preset, and the parameter tensor is decomposed into the preset number of kernel tensors. For example, if the number of kernel tensors is set to d and the decomposition ranks to (r_1, r_2, …, r_{d+1}) with r_{d+1} = r_1, the k-th kernel tensor can be expressed as Z_k ∈ R^(r_k × m_k n_k × r_{k+1}). Since tensor ring decomposition itself belongs to the prior art, a detailed description of the decomposition process is omitted here.
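A minimal sequential-SVD construction of such kernel tensors is sketched below. It is an illustrative simplification, not the patent's procedure: the boundary rank r_1 = r_{d+1} is fixed to 1 (so the ring degenerates to a tensor train), ranks are kept full so the decomposition is numerically exact, and the function names are invented.

```python
import numpy as np

def tr_svd(T, eps=1e-12):
    """Decompose T into d third-order kernel tensors by sequential SVDs.

    Simplified sketch: boundary rank fixed to 1, ranks chosen to keep the
    decomposition numerically exact rather than truncated.
    """
    dims, cores, r_prev = T.shape, [], 1
    C = T.reshape(dims[0], -1)
    for k in range(len(dims) - 1):
        U, s, Vt = np.linalg.svd(C, full_matrices=False)
        r = int(np.sum(s > eps))                       # numerical rank
        cores.append(U[:, :r].reshape(r_prev, dims[k], r))
        C = (s[:r, None] * Vt[:r]).reshape(r * dims[k + 1], -1)
        r_prev = r
    cores.append(C.reshape(r_prev, dims[-1], 1))
    return cores

def tr_to_full(cores):
    """Contract the kernel tensors back into the full tensor."""
    G = cores[0]
    for core in cores[1:]:
        G = np.einsum('aib,bjc->aijc', G, core)        # contract shared rank
        G = G.reshape(G.shape[0], -1, G.shape[3])
    # Close the ring; with boundary rank 1 this trace is over 1x1 axes.
    full = np.trace(G, axis1=0, axis2=2)
    return full.reshape([c.shape[1] for c in cores])

rng = np.random.default_rng(0)
T = rng.standard_normal((3, 4, 5))
cores = tr_svd(T)
assert np.allclose(tr_to_full(cores), T)
```

With truncated ranks the same routine yields the lossy, low-rank kernel tensors that give the compression.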
Step 204: selecting one target kernel tensor from all the kernel tensors and performing a classical mode-2 unfolding on it to obtain a first target matrix.
It should be noted that one target kernel tensor may be selected at random from all the kernel tensors for the classical mode-2 unfolding that yields the first target matrix. The classical mode-2 unfolding method belongs to the prior art, and a detailed description of the unfolding process is omitted here.
Step 205: merging the first target matrix with the non-target kernel tensors among all the kernel tensors to obtain a merged tensor.
It should be noted that the merged tensor may be obtained by multiplying the first target matrix by the non-target kernel tensors among all the kernel tensors.
Step 206: performing a classical mode-2 unfolding on the merged tensor to obtain a second target matrix.
Step 207: performing a reshape operation on the second target matrix to obtain a kernel matrix of the same size as the parameter matrix of the fully-connected layer.
It should be noted that the reshape operation is applied to the second target matrix according to the size of the parameter matrix of the fully-connected layer in the preset GAN, so that the resulting kernel matrix has the same size as that parameter matrix and can replace it.
Step 208: replacing the parameter matrix of the fully-connected layer in the preset generative adversarial network with the kernel matrix to obtain a new generative adversarial network.
It should be noted that the mathematical expression of the fully-connected layer is:
y = σ(Wx + b);
where W ∈ R^(m×n), x ∈ R^(n×1), b ∈ R^(m×1), and y ∈ R^(m×1).
Replacing the parameter matrix of the fully-connected layer in the preset GAN with the kernel matrix gives:
y = σ(∑ Tr(Z_1(v_1(t), u_1(l)), …, Z_k(v_k(t), u_k(l)), …, Z_d(v_d(t), u_d(l))) x + b);
where Tr{Z_1(v_1(t), u_1(l)), …, Z_k(v_k(t), u_k(l)), …, Z_d(v_d(t), u_d(l))} is the entry of the kernel matrix at position (t, l): each Z_k(v_k(t), u_k(l)) is the r_k × r_{k+1} matrix slice of the kernel tensor Z_k, and the trace of the product of the d slices closes the ring.
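The replaced layer can thus be evaluated by forming each weight entry as the trace of a product of kernel-tensor slices. The NumPy sketch below does this for a toy 4×4 layer; all shapes, ranks, the random values, and the choice of ReLU for σ are illustrative assumptions, not the patent's.

```python
import numpy as np

rng = np.random.default_rng(0)

# d = 2 kernel tensors for a 4x4 weight matrix (rows 4 = 2*2, cols 4 = 2*2);
# ranks (r1, r2) = (2, 3), with the ring closing back on r1.
r1, r2 = 2, 3
Z1 = rng.standard_normal((r1, 4, r2))   # paired mode size m_k * n_k = 4
Z2 = rng.standard_normal((r2, 4, r1))

def tr_entry(t, l):
    """W(t, l) = Tr(Z1(v1, u1) Z2(v2, u2)), zero-based factor indices."""
    v1, v2 = divmod(t, 2)
    u1, u2 = divmod(l, 2)
    return np.trace(Z1[:, v1 * 2 + u1, :] @ Z2[:, v2 * 2 + u2, :])

# Materialize the kernel matrix and run the fully-connected layer.
W = np.array([[tr_entry(t, l) for l in range(4)] for t in range(4)])
x = rng.standard_normal(4)
b = rng.standard_normal(4)
y = np.maximum(W @ x + b, 0.0)          # sigma taken as ReLU for this sketch
```

In practice only Z1 and Z2 need to be stored; W can be materialized (or entries computed on the fly) whenever the layer is applied.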
In the prior art, a tensor-algebra method is used to compress a convolutional neural network: its operations are treated as tensor operations, and a tensor-algebra operation replaces the operation of the fully-connected layer, which destroys the original fully-connected structure and leaves the network performance unsatisfactory. To leave the structure of the fully-connected layer unchanged, this embodiment converts the obtained kernel tensors so that the resulting kernel matrix has the same size as the original parameter matrix of the fully-connected layer; replacing the parameter matrix with the kernel matrix therefore does not change the structure of the original fully-connected layer, and the performance of the network is guaranteed.
The resulting new GAN can be used to generate images or videos to augment a data set. For example, noise and real data can be fed into the new GAN together to train its generator and discriminator until the network converges, yielding a trained new GAN. During training, the hardware only needs to store a few kernel tensors rather than the entire parameter matrix of the fully-connected layer, which greatly reduces the memory occupied, raises the running speed, and allows the new GAN to be embedded in low-end devices, solving the technical problem that existing GANs have many parameters and a high computation cost and so place high demands on hardware. Moreover, since the kernel matrix has the same size as the original parameter matrix of the fully-connected layer, replacing the latter with the former leaves the structure of the original fully-connected layer unchanged and preserves the performance of the network.
For ease of understanding, referring to FIG. 3, an embodiment of the processing apparatus for a generative adversarial network provided in the present application comprises:
a first conversion module 301, configured to convert a parameter matrix of a fully-connected layer in a preset generative adversarial network into a tensor to obtain a parameter tensor;
a decomposition module 302, configured to perform tensor ring decomposition on the parameter tensor to obtain a plurality of kernel tensors;
a second conversion module 303, configured to convert the kernel tensors into a matrix to obtain a kernel matrix;
and a replacement module 304, configured to replace the parameter matrix of the fully-connected layer in the preset generative adversarial network with the kernel matrix to obtain a new generative adversarial network.
Further, the second conversion module 303 specifically comprises:
a first unfolding sub-module 3031, configured to select one target kernel tensor from all the kernel tensors and perform a classical mode-2 unfolding on it to obtain a first target matrix;
a merging sub-module 3032, configured to merge the first target matrix with the non-target kernel tensors among all the kernel tensors to obtain a merged tensor;
a second unfolding sub-module 3033, configured to perform a classical mode-2 unfolding on the merged tensor to obtain a second target matrix;
and a reshaping sub-module 3034, configured to perform a reshape operation on the second target matrix to obtain a kernel matrix of the same size as the parameter matrix of the fully-connected layer.
Further, the merging sub-module 3032 is specifically configured to:
multiply the first target matrix by the non-target kernel tensors among all the kernel tensors to obtain a merged tensor.
Further, the first conversion module 301 is specifically configured to:
convert the parameter matrix of the fully-connected layer in the preset generative adversarial network into a tensor based on a bijection to obtain a parameter tensor.
Further, the apparatus further comprises:
an acquisition module 305, configured to acquire the preset generative adversarial network.
In the several embodiments provided in this application, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative, e.g., the division of the units is merely a logical function division, and there may be additional divisions when actually implemented, e.g., multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. Alternatively, the coupling or direct coupling or communication connection shown or discussed with each other may be an indirect coupling or communication connection via some interfaces, devices or units, which may be in electrical, mechanical or other form.
The units described as separate units may or may not be physically separate, and units shown as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional unit in each embodiment of the present application may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit. The integrated units may be implemented in hardware or in software functional units.
The integrated units, if implemented in the form of software functional units and sold or used as stand-alone products, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present application may be embodied in essence or a part contributing to the prior art or all or part of the technical solution in the form of a software product stored in a storage medium, including several instructions to execute all or part of the steps of the methods described in the embodiments of the present application by a computer device (which may be a personal computer, a server, or a network device, etc.). And the aforementioned storage medium includes: u disk, mobile hard disk, read-only memory (ROM), random access memory (Random Access Memory, RAM), magnetic disk or optical disk, etc.
The above embodiments are merely for illustrating the technical solution of the present application, and not for limiting the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical scheme described in the foregoing embodiments can be modified or some technical features thereof can be replaced by equivalents; such modifications and substitutions do not depart from the spirit and scope of the corresponding technical solutions.

Claims (8)

1. A method of processing a countermeasure generation network, comprising:
converting a parameter matrix of a full-connection layer in a preset countermeasure generation network into a tensor to obtain a parameter tensor;
performing tensor ring decomposition on the parameter tensor to obtain a plurality of core tensors;
converting the core tensors into a matrix to obtain a core matrix;
replacing the parameter matrix of the full-connection layer in the preset countermeasure generation network with the core matrix to obtain a new countermeasure generation network, wherein the full-connection layer of the new countermeasure generation network is:
y = σ(∑ Tr(Z_1(v_1(t), u_1(l)), …, Z_k(v_k(t), u_k(l)), …, Z_d(v_d(t), u_d(l))) x + b);
where Tr(Z_1(v_1(t), u_1(l)), …, Z_k(v_k(t), u_k(l)), …, Z_d(v_d(t), u_d(l))) is the core matrix, Z_k is the k-th core tensor, x is the input of the full-connection layer, b is the bias term of the full-connection layer, and y is the output of the full-connection layer;
inputting noise together with real image data or video data into the new countermeasure generation network to train a generator and a discriminator in the new countermeasure generation network until the new countermeasure generation network converges, to obtain a trained new countermeasure generation network, wherein during training the generator generates realistic images or videos and the discriminator discriminates whether a generated image or video is real; the trained new countermeasure generation network is used to generate images or videos so as to expand a data set;
the step of converting the core tensors into a matrix to obtain a core matrix specifically comprises:
selecting one target core tensor from all the core tensors and performing classical mode-2 unfolding on it to obtain a first target matrix;
combining the first target matrix with the non-target core tensors among all the core tensors to obtain a merged tensor;
performing classical mode-2 unfolding on the merged tensor to obtain a second target matrix;
and performing a reshape operation on the second target matrix to obtain a core matrix with the same size as the parameter matrix of the full-connection layer.
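For illustration only (not part of the claimed subject matter), the contraction of tensor-ring core tensors back into a core matrix of the same size as the original full-connection weight matrix can be sketched in NumPy. The mode sizes m_k, n_k and TR ranks below are assumptions chosen for the example, and the ring trace is written as a single einsum rather than the claim's step-by-step mode-2 unfoldings:

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumed factorization of an M x N weight matrix: M = 4*3, N = 2*5,
# represented by d = 2 tensor-ring core tensors Z_k of shape
# (r_k, m_k * n_k, r_{k+1}), the last rank wrapping back to r_1.
m, n = [4, 3], [2, 5]
r = [2, 3]                                # TR ranks (illustrative)
M, N = int(np.prod(m)), int(np.prod(n))

Z1 = rng.standard_normal((r[0], m[0] * n[0], r[1]))
Z2 = rng.standard_normal((r[1], m[1] * n[1], r[0]))

# T[i, j] = Tr(Z1[:, i, :] @ Z2[:, j, :]) -- the ring trace of the slices,
# with the fused index i = v_1 * n_1 + u_1 and j = v_2 * n_2 + u_2.
T = np.einsum('aib,bja->ij', Z1, Z2)

# "Reshape operation": split each fused mode index into its (v_k, u_k)
# pair, interleave the row and column parts, and flatten to M x N.
W = (T.reshape(m[0], n[0], m[1], n[1])
       .transpose(0, 2, 1, 3)            # (v1, v2, u1, u2)
       .reshape(M, N))
```

At these toy sizes the two cores hold 48 + 90 = 138 entries against 120 in the dense matrix, so compression only pays off at larger layer sizes; the point of the sketch is the index bookkeeping, not the ratio.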
2. The method of processing a countermeasure generation network according to claim 1, wherein the combining the first target matrix with all the core tensors except the target core tensor to obtain a merged tensor specifically comprises:
multiplying the first target matrix by the non-target core tensors among all the core tensors to obtain the merged tensor.
3. The method of processing a countermeasure generation network according to claim 1, wherein the step of converting the parameter matrix of the full-connection layer in the preset countermeasure generation network into a tensor to obtain a parameter tensor specifically comprises:
converting the parameter matrix of the full-connection layer in the preset countermeasure generation network into a tensor based on a bijection to obtain the parameter tensor.
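For illustration only (again outside the claims), the bijection-based tensorization of claim 3 amounts to index bookkeeping: each row index t of the M x N weight matrix is split into sub-indices (v_1, …, v_d), each column index l into (u_1, …, u_d), and each pair (v_k, u_k) is fused into the k-th mode of the parameter tensor. A NumPy sketch with assumed factor sizes:

```python
import numpy as np

# Assumed factor sizes: M = 4*3 rows, N = 2*5 columns, d = 2 modes.
m, n = [4, 3], [2, 5]
M, N = int(np.prod(m)), int(np.prod(n))
W = np.arange(M * N, dtype=float).reshape(M, N)

# Split the row/column indices, pair (v_k, u_k) per mode, and fuse each
# pair: the resulting order-d tensor has mode sizes m_k * n_k.
T = (W.reshape(m[0], m[1], n[0], n[1])   # (v1, v2, u1, u2)
       .transpose(0, 2, 1, 3)            # (v1, u1, v2, u2)
       .reshape(m[0] * n[0], m[1] * n[1]))

# The map is a bijection: reversing the reshapes recovers W exactly.
W_back = (T.reshape(m[0], n[0], m[1], n[1])
            .transpose(0, 2, 1, 3)
            .reshape(M, N))
```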
4. The method of processing a countermeasure generation network according to claim 1, wherein before the converting the parameter matrix of the full-connection layer in the preset countermeasure generation network into a tensor to obtain a parameter tensor, the method further comprises:
acquiring the preset countermeasure generation network.
5. A processing apparatus for a countermeasure generation network, comprising:
a first conversion module, configured to convert a parameter matrix of a full-connection layer in a preset countermeasure generation network into a tensor to obtain a parameter tensor;
a decomposition module, configured to perform tensor ring decomposition on the parameter tensor to obtain a plurality of core tensors;
a second conversion module, configured to convert the core tensors into a matrix to obtain a core matrix;
a replacing module, configured to replace the parameter matrix of the full-connection layer in the preset countermeasure generation network with the core matrix to obtain a new countermeasure generation network, wherein the full-connection layer of the new countermeasure generation network is:
y = σ(∑ Tr(Z_1(v_1(t), u_1(l)), …, Z_k(v_k(t), u_k(l)), …, Z_d(v_d(t), u_d(l))) x + b);
where Tr(Z_1(v_1(t), u_1(l)), …, Z_k(v_k(t), u_k(l)), …, Z_d(v_d(t), u_d(l))) is the core matrix, Z_k is the k-th core tensor, x is the input of the full-connection layer, b is the bias term of the full-connection layer, and y is the output of the full-connection layer;
a training module, configured to input noise together with real image data or video data into the new countermeasure generation network to train a generator and a discriminator in the new countermeasure generation network until the new countermeasure generation network converges, to obtain a trained new countermeasure generation network, wherein during training the generator generates realistic images or videos and the discriminator discriminates whether a generated image or video is real; the trained new countermeasure generation network is used to generate images or videos so as to expand a data set;
wherein the second conversion module specifically comprises:
a first unfolding sub-module, configured to select one target core tensor from all the core tensors and perform classical mode-2 unfolding on it to obtain a first target matrix;
a merging sub-module, configured to combine the first target matrix with the non-target core tensors among all the core tensors to obtain a merged tensor;
a second unfolding sub-module, configured to perform classical mode-2 unfolding on the merged tensor to obtain a second target matrix;
and a reshaping sub-module, configured to perform a reshape operation on the second target matrix to obtain a core matrix with the same size as the parameter matrix of the full-connection layer.
6. The processing apparatus for a countermeasure generation network according to claim 5, wherein the merging sub-module is specifically configured to:
multiply the first target matrix by the non-target core tensors among all the core tensors to obtain the merged tensor.
7. The processing apparatus for a countermeasure generation network according to claim 5, wherein the first conversion module is specifically configured to:
convert the parameter matrix of the full-connection layer in the preset countermeasure generation network into a tensor based on a bijection to obtain the parameter tensor.
8. The processing apparatus for a countermeasure generation network according to claim 5, further comprising:
an acquisition module, configured to acquire the preset countermeasure generation network.
CN201911122032.8A 2019-11-15 2019-11-15 Processing method and device for countermeasure generation network Active CN110852424B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911122032.8A CN110852424B (en) 2019-11-15 2019-11-15 Processing method and device for countermeasure generation network


Publications (2)

Publication Number Publication Date
CN110852424A CN110852424A (en) 2020-02-28
CN110852424B true CN110852424B (en) 2023-07-25

Family

ID=69601026

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911122032.8A Active CN110852424B (en) 2019-11-15 2019-11-15 Processing method and device for countermeasure generation network

Country Status (1)

Country Link
CN (1) CN110852424B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106127297A (en) * 2016-06-02 2016-11-16 中国科学院自动化研究所 The acceleration of degree of depth convolutional neural networks based on resolution of tensor and compression method
CN109344966A (en) * 2018-07-26 2019-02-15 广东工业大学 A kind of method of the full Connection Neural Network of efficient tensorization
WO2019146189A1 (en) * 2018-01-29 2019-08-01 日本電気株式会社 Neural network rank optimization device and optimization method

Family Cites Families (4)

Publication number Priority date Publication date Assignee Title
US10733505B2 (en) * 2016-11-10 2020-08-04 Google Llc Performing kernel striding in hardware
CN108197525B (en) * 2017-11-20 2020-08-11 中国科学院自动化研究所 Face image generation method and device
CN110263913A (en) * 2019-05-23 2019-09-20 深圳先进技术研究院 A kind of deep neural network compression method and relevant device
CN110390647A (en) * 2019-06-14 2019-10-29 平安科技(深圳)有限公司 The OCT image denoising method and device for generating network are fought based on annular




Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant