CN111291810A

CN111291810A - Information processing model generation method based on target attribute decoupling and related equipment

Info

Publication number: CN111291810A
Application number: CN202010080705.4A
Authority: CN
Inventors: 解为成; 温志威; 吴昊谦; 沈琳琳
Original assignee: Shenzhen University
Current assignee: Shenzhen University
Priority date: 2020-02-05
Filing date: 2020-02-05
Publication date: 2020-06-16
Anticipated expiration: 2040-02-05
Also published as: CN111291810B

Abstract

The invention provides an information processing model generation method based on target attribute decoupling and related equipment, which are characterized in that a feature graph output by a hidden layer is obtained, and the feature graph is encoded by utilizing Hash encoding to obtain coordinate values corresponding to each feature graph; clustering each feature graph according to the coordinate values to obtain feature graph groups, respectively calculating orthogonal loss and/or suppression loss corresponding to the feature graphs in each feature graph group, obtaining a model total loss value according to the calculated orthogonal loss and/or suppression loss, adjusting model parameters by using the model total loss value, and repeating the steps until the training is completed to obtain the generated information processing model. The method provided by the embodiment reduces attribute coupling by mining semantic attributes of the potential layer and constructing the orthogonal loss of the clustering group, and reduces cross coupling according to attributes by performing intersection suppression on the feature maps in the intersection region, so that attribute coupling between the feature maps is reduced, and the generalization capability of the network is improved.

Description

Information processing model generation method based on target attribute decoupling and related equipment

Technical Field

The invention relates to the technical field of neural networks, in particular to an information processing model generation method based on target attribute decoupling and related equipment.

Background

Since Goodfellow proposed generating a confrontational network (GAN), it has achieved impressive performance in various attribute editions, such as expression editions, style migration, hair color transformation, gender transformation, age transformation, and the like. However, semantic dependencies resulting from feature map interactions in the GAN's generated network may compromise the generalization capability of the generated network.

Feature maps are easily highly coupled to each other when GAN is learned from a large set of training data. However, highly coupled information learned from training data sets is difficult to apply to test data sets because the coupling between different identities and data sets varies widely. Meanwhile, the coupling information makes it difficult to edit one attribute independently because the coupling easily affects the generation of other attributes at the same time, resulting in generation of much noise at the time of attribute editing, affecting the generation effect.

To reduce the correlation between different profiles, various network structures and loss functions have been proposed. For example, Gram-Schmidt orthogonalization is adopted to reduce the interference between nodes so as to improve the network generalization capability, but for a network with a large number of nodes or feature maps, the decoupling according to the nodes or the feature maps is inefficient.

Therefore, the prior art is subject to further improvement.

Disclosure of Invention

In view of the defects in the prior art, the invention aims to provide a target attribute decoupling-based information processing model generation method and related equipment for a user, and overcome the defect that the generalization capability of a generated network is poor due to the fact that the generation effect is influenced by the coupling of feature maps when a network structure in the prior art is edited.

The technical scheme adopted by the invention for solving the technical problem is as follows:

in a first aspect, the present embodiment provides an information processing method based on target attribute decoupling, including the steps of:

inputting sample information to be processed into a preset generated confrontation network model, and acquiring a feature diagram output by a hidden layer of the generated confrontation network model;

encoding each feature map by utilizing Hash codes to obtain coordinate values corresponding to each feature map;

clustering each feature map according to the coordinate values to obtain at least two feature map groups;

respectively calculating orthogonal loss and/or suppression loss corresponding to the feature maps in each feature map group, and obtaining a model total loss value according to the calculated orthogonal loss and/or suppression loss;

and adjusting the model parameters of the generated countermeasure network model according to the total model loss value, and repeating the steps of inputting the to-be-processed sample information to a preset generated countermeasure network model to obtain the total model loss value according to the calculated orthogonal loss and/or suppression loss until the calculated total network loss value meets the preset condition, thereby obtaining the trained information processing model.

Optionally, the step of calculating the orthogonal loss and/or the suppression loss corresponding to the feature maps in each feature map group, and obtaining a total model loss value according to the calculated orthogonal loss and/or the calculated suppression loss includes:

calculating orthogonal loss among feature graphs in each feature group, and obtaining a model total loss value according to the orthogonal loss;

and/or calculating the inhibition loss corresponding to the feature maps at the intersection in each feature map group, and obtaining the total model loss value according to the inhibition loss.

Optionally, the step of encoding each feature map by using hash coding to obtain coordinate values corresponding to each feature map includes:

and carrying out hash coding on the average characteristic diagram obtained by averaging the areas of the characteristic diagrams to obtain the coordinate value of each characteristic diagram.

Optionally, the step of obtaining the coordinate values of each feature map by performing hash coding on the average feature map obtained by averaging the regions of each feature map includes:

and selecting the first N average characteristic graphs with the strongest response in the average characteristic graphs corresponding to the characteristic graphs for carrying out Hash coding to obtain coordinate values corresponding to the characteristic graphs, wherein N is a positive integer.

Optionally, the step of clustering each feature map according to the coordinate values to obtain at least two feature map groups includes:

and clustering each feature map by using a k-means clustering algorithm according to the coordinate values to obtain at least two feature map groups.

Optionally, the orthogonal loss function expression corresponding to the orthogonal loss is as follows:

where m is the number of feature map groups, g_i，g_jIs the vector vectorized by the average feature map of the ith and jth groups, and m, i and j are positive integers.

Optionally, the generating a confrontation network model includes generating a network module and discriminating the network module;

the generating of the total loss function of the confrontation network model comprises generating a loss function of the network module, judging the loss function of the network module, an orthogonal loss function and a restraining loss function.

Optionally, an expression of a total loss function corresponding to the model total loss value is as follows:

wherein E represents data distribution, G represents a generation network module, and G (x, c') is used for generation by the generation network moduleInput image x and target domain label c' conditioned image, G_supRepresenting a generating network module with feature suppression and c is an original domain label; l is_supThe method comprises the steps that a preset inhibition loss is obtained, and the influence deviation of used and unused characteristic inhibition on a characteristic diagram for counteracting the inhibited discard in the countermeasure network is represented; l is_recFor application to generating reconstruction losses in network modules, L_OriGAnd L_OriDRepresenting a loss function, λ, of the generating network module and the discriminating network module_sup、λ_recAnd λ_GOIs a hyperparameter, L_GAnd L_DRespectively, generating the objective functions of the network module and the discriminating network module D.

In a second aspect, the present embodiment also discloses an electronic device, which includes a processor, and a storage medium communicatively connected to the processor, wherein the storage medium is adapted to store a plurality of instructions; the processor is adapted to invoke instructions in the storage medium to perform the steps of implementing the information processing based on the target property decoupling.

In a third aspect, a computer readable storage medium stores one or more programs, which are executable by one or more processors, for implementing the steps of the target property decoupling based information processing model generation method.

The invention has the beneficial effects that the invention provides an information processing model generation method based on target attribute decoupling and related equipment, firstly, the characteristic graphs output by all hidden layers are obtained, and each characteristic graph is coded by utilizing Hash codes to obtain the coordinate value corresponding to each characteristic graph; clustering and grouping the characteristic graphs according to the coordinate values, respectively calculating orthogonal loss and/or suppression loss corresponding to the characteristic graphs in each characteristic graph group, obtaining a model total loss value according to the calculated orthogonal loss and/or suppression loss, adjusting model parameters by using the model total loss value, and repeating the steps until the training is completed to obtain the generated information processing model. The embodiment reduces attribute coupling between feature maps and improves generalization capability of a generated network by mining semantic attributes of a potential layer and constructing orthogonal loss of a cluster group to reduce attribute coupling, and by performing intersection suppression on feature maps in intersection areas of different groups to reduce cross coupling according to attributes.

Drawings

FIG. 1 is a flowchart illustrating steps of a method for generating an information processing model for a countermeasure network based on target attribute decoupling according to an embodiment of the present invention;

FIG. 2 is a schematic diagram illustrating a principle of hash encoding a feature map according to an embodiment of the present invention;

FIG. 3 is a schematic diagram of the principle of orthogonal suppression and intersection suppression in an embodiment of the present invention;

FIG. 4 is a schematic diagram illustrating a method for processing an intersection suppression loss pair feature map in a training process according to an embodiment of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention clearer and clearer, the present invention is further described in detail below with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

Various non-limiting embodiments of the present invention are described in detail below with reference to the accompanying drawings.

Since existing decoupling algorithms are inefficient for networks with a large number of nodes or signature graphs, decoupling by node or by signature graph is inefficient. Meanwhile, semantic attributes of an object are often encoded by using a stack of network hidden outputs, and it is difficult to capture information about the semantic attributes by the operation of a feature mapping mode. Semantic attributes are often mined and pruned, and their decoupling can enable independent editing of each attribute and better generalization capability on test datasets. Therefore, the method disclosed by the embodiment of the invention mainly aims at the decoupling of semantic attributes to obtain a better decoupling effect. The semantic attribute represents the semantics of which attribute is represented by the feature in the network, for example, a feature represents that the GAN network is controlled to generate the attribute corresponding to the hair, and a feature represents that the GAN network is controlled to generate the attribute corresponding to the gender of the person.

In order to explore semantic attributes based on feature maps, Klemmer adopts local correlation of feature neighborhoods in the condition generation process. Wu and He uniformly divide the profile into groups and perform group-by-group normalization to simulate regularization by attribute.

In order to reduce the correlation between different feature maps in the generated countermeasure network GAN, namely, attribute coupling between the feature maps, the embodiment of the invention introduces semantic decomposition in the generation of the countermeasure network GAN to reduce the attribute correlation. The semantic decomposition is to classify and process the features representing different semantics, and comprises two methods of clustering group feature orthogonality and intersection inhibition. In this embodiment, feature decomposition is performed on the feature map to obtain a feature map group, and then decoupling processing is performed on the feature maps in different clustered feature map groups.

Exemplary method

The embodiment provides an information processing model generation method based on target attribute decoupling, as shown in fig. 1, including the steps of:

and step S1, inputting the sample information to be processed into a preset generated confrontation network model, and acquiring the characteristic diagram output by the generated confrontation network hidden layer.

The method of the embodiment of the invention performs decoupling processing aiming at the characteristic diagram output by the hidden layer of the generated countermeasure network. The generation of a competing network comprises two "competing" networks: the generation network module is used for capturing data distribution, and the discrimination network module is used for estimating the probability that one sample comes from real data instead of the generated sample. The generation network module and the judgment network module compete with each other so as to learn the required model. The hidden layer is other layers except the input layer and the output layer, and the method is suitable for target attribute decoupling of all feature graphs output by the hidden layer. Therefore, in this step, the feature maps output by the hidden layers are first obtained, and after being decoupled, the feature maps are input to the next hidden layer or the output layer.

In step S2, each feature map is encoded by hash encoding, and coordinate values corresponding to the respective feature maps are obtained.

Hash coding is the encoding of features using a hash algorithm, also called hash algorithm, which generally satisfies the relationship: and f, inputting data with any length, and outputting a fixed-length data key after the data are processed by a hash algorithm. Therefore, in this step, hash codes are used to encode each feature map, and coordinate values corresponding to each feature map are obtained.

Specifically, the step of encoding each feature map by using hash coding to obtain coordinate values corresponding to each feature map includes:

Examples are: referring to fig. 2, assuming that the hidden layer has 256 feature maps, each feature map is first divided into several patch blocks according to the area average, such as: firstly, dividing a hidden layer feature map with the size of 32 x 32 into 8 x 8 block patches, namely each block patch comprises 4 x 4 pixels, extracting the strongest response of topN (N is 2 in an experiment) in the block patches, splicing the N strongest responses to obtain an average feature map, carrying out hash coding on the average feature map, and identifying the obtained coordinates as coordinate values corresponding to the feature map. I.e., the signature is encoded by 256 x 32 to 256 x (N x 2), (N is required to be 2 because there are two values (x, y) per coordinate), and then the hash-coded coordinate values corresponding to the 256 signatures are obtained.

And step S3, clustering each feature map according to the coordinate values to obtain at least two feature map groups.

In this step, each feature map is clustered according to the coordinate values corresponding to each feature map, that is, each feature map is divided into at least two feature map groups according to the coordinate values. The clustering algorithm used in the step is a k-means clustering algorithm, namely, the k-means clustering algorithm is used for clustering each feature map according to the coordinate values to obtain at least two feature map groups.

The K-means clustering algorithm is characterized in that K objects are randomly selected as initial clustering centers at first, then Euclidean distances between each object and each clustering center are calculated, and each object is allocated to the clustering center closest to the object, so that each feature map group is obtained.

Further, the step of obtaining the coordinate values of each feature map by performing hash coding on the average feature map obtained by averaging the regions of each feature map includes:

and selecting the first N average characteristic graphs with the strongest response in the average characteristic graphs corresponding to the characteristic graphs for carrying out Hash coding to obtain coordinate values corresponding to the characteristic graphs, wherein N is a positive integer. In this embodiment, N is 2, that is, the number of feature map groups is 2.

Specifically, after each feature map is quantized, the obtained coordinate values are:

wherein (x)_ij，y_ij) The patch coordinates (j is an arbitrary number from 1 to nc) in 8 × 8 patch of the jth maximum response from the large to the small in the plurality of average feature maps, i represents the ith feature map, and nc is set to 2. The average feature map refers to a feature map obtained by performing area average division on an original feature map output by a hidden layer, for example: the original feature map size of the hidden layer is 32 × 32, and directly taking the maximum response may be unstable and poor in robustness, so that the original feature map is averaged according to the region of 4 × 4, the size of the averaged feature map becomes 8 × 8, and then the averaged feature map is used for hash coding to obtain coordinate values corresponding to the original feature map.

And step S4, respectively calculating the orthogonal loss and/or the suppression loss corresponding to the feature maps in each feature map group, and obtaining a model total loss value according to the calculated orthogonal loss and/or the calculated suppression loss.

And the vectorized average feature maps are orthogonal pairwise to obtain an orthogonal result, and the vectorized average feature maps are data, so that pairwise orthogonality can be performed on the vectorized average feature maps to obtain orthogonal data.

In the step, two methods of clustering group feature orthogonality and intersection inhibition can be used, or one method is used for realizing the decoupling operation of the feature map.

Referring to fig. 3, a specific way of using the feature orthogonality of the cluster group is as follows:

and calculating the orthogonal loss among the characteristic graphs in each characteristic group, and obtaining the total loss value of the model according to the orthogonal loss.

Specifically, the orthogonal loss is added into a total loss function in the process of training the network, so that the characteristic diagram attribute coupling of the generated confrontation network model output after the training is finished is low.

Further, after clustering the feature maps, the method for calculating the orthogonal loss according to each feature map group obtained by clustering is to calculate the orthogonal loss by using a preset orthogonal loss function expression, specifically, the orthogonal loss function expression is as follows:

And after clustering and grouping the characteristic graphs, calculating orthogonal loss values of the characteristic graphs in the group according to the grouped characteristic graphs by using the orthogonal loss function.

Further, the way of using intersection suppression is:

and calculating the inhibition loss corresponding to the feature maps at the intersection in each feature map group, obtaining a model total loss value according to the inhibition loss, and adding the inhibition loss to the model total loss value, so that decoupling of the output feature maps can be realized in the model training process.

In the specific implementation process, when only the orthogonal loss is considered, only the orthogonal loss is added to the model total loss value, if only the suppression loss is considered, only the suppression loss is added to the model total loss value, and if the optimal effect is to be obtained, the orthogonal loss and the suppression loss are added to the model total loss value at the same time, so that the target attribute of the feature map is decoupled. Because the distance between groups is increased by the orthogonality after clustering, and the distance inside the clustering group is reduced by the inhibition after clustering, the two can complement each other by combining, and therefore, a better attribute decoupling effect can be obtained.

Further, the generation of the countermeasure network comprises a generation network module and a discrimination network module;

the loss function comprises a loss value of the generation network module, a loss value of the discrimination network module, an orthogonal loss and a suppression loss. Wherein, the feature map orthogonality is gradually restrained in the training process through orthogonality loss; feature suppression is the suppression of loss constraints through intersection in the training process: and the inhibition loss is obtained by comparing the final effect graph generated by the feature graph at the intersection of the discarded hidden layers and the feature graph at the intersection of the unreleased hidden layers in the network during training. Both losses are increasingly constrained during the training process.

In one embodiment, the total loss function is expressed as:

wherein E represents data distribution, G represents generation of a network module, G (x, c ') generates an image conditioned on an input image x and a target domain label c' for the network module, G_supRepresenting a generating network module with feature suppression, and c is the original domain label, L_supIs a preset rejection loss, characterizing the influence deviation of used and unused characteristic rejection on counteracting rejected characteristic graphs in the reactive network, Lrec is the reconstruction loss applied in the generation network module, L_OriGAnd L_OriDRepresenting a loss function, λ, of the generating network module and the discriminating network module_sup、λ_recAnd λ_GOIs a hyperparameter, L_GAnd L_DRespectively, generating the objective functions of the network module and the discriminating network module D.

Specifically, due to the feature map in each clustered feature map group, if the feature map is far away from the cluster center, the feature generation is inhibited, that is, the feature map at the intersection between the cluster groups is inhibited. As shown in fig. 3, feature maps at the intersection of two feature maps are discarded, the suppression manner adopted in this embodiment is to perform L1 distance loss constraint by discarding the feature map generated without discarding the feature map, and the smaller L1loss, the less feature maps at the intersection are generated, where L1 is a preset distance value away from the cluster center, and in a specific embodiment, a distance from the cluster center exceeding 1.5 times the average distance between each feature map in the group and the central feature map may be regarded as an abnormal distance.

Further, as shown in fig. 4, a method for processing an intersection suppression loss to a feature map in a training process is shown, where the suppression loss is obtained by comparing a final effect map generated by the feature map at the intersection of the discarded hidden layers in the network and the feature map at the intersection of the unrecessed hidden layers in the training process. The discarding mode is shown in fig. 4, and the features inside the circle (the radius is 1.5 times the average distance of each feature of the cluster group from the cluster center) are retained, and the feature outside the circle is considered as the feature map easily coupled at the intersection, so that the feature map is discarded. In the training process, the target attribute features in the two decoupled cluster groups of fig. 4 are slowly orthogonal to each other along with the training of the network, and the features at the intersection are less and less.

Step S5, adjusting the model parameters of the generated countermeasure network model according to the total model loss value, and repeating the steps from inputting the to-be-processed sample information to the preset generated countermeasure network model to obtaining the total model loss value according to the calculated orthogonal loss and/or suppression loss until the calculated total network loss value meets the preset condition, so as to obtain the trained information processing model.

The method comprises the following steps in each training process of generating the confrontation network model: inputting sample information to be processed into a preset generated confrontation network model, obtaining a feature diagram output by each hidden layer in the generated confrontation network model, respectively calculating orthogonal loss and suppression loss according to the output feature diagrams, respectively adding the orthogonal loss and the suppression loss into a model total loss value, correcting the model total loss value, and modifying a model parameter of the generated confrontation network model according to the corrected model total loss value.

Repeating the training process: inputting sample information to be processed into a generated confrontation network model, obtaining a characteristic diagram output from one or any selected target hidden layer, and repeatedly calculating orthogonal loss and suppression loss between the characteristic diagrams until a total network loss value of the generated confrontation network model corresponding to the modified model parameters meets a preset condition, thereby obtaining the trained information processing model.

The feature maps output by the hidden layers in the countermeasure network model are preset to be generated, and the feature maps are firstly grouped by using an effective clustering algorithm, wherein the algorithm is used for mining semantic attributes of the potential layers and constructing the orthogonal loss of the clustering groups so as to reduce attribute coupling. Meanwhile, the feature maps in the intersection areas of different groups are further suppressed to reduce cross coupling according to attributes, the target attributes of the feature maps obtained in the trained information processing model are decoupled, the feature maps of different target attributes are mutually orthogonal, the distribution of the feature maps of different attributes is obvious, and mutual intersection of the feature maps rarely occurs, so that the trained information processing model can have better generalization capability, and can be applied to most networks or data sets to obtain good information processing effect.

Exemplary device

The embodiment also discloses an electronic device, which comprises a processor and a storage medium in communication connection with the processor, wherein the storage medium is suitable for storing a plurality of instructions; the processor is adapted to invoke instructions in the storage medium to perform the steps of implementing the information handling model generation method based on target attribute decoupling.

The embodiment also discloses a computer readable storage medium, wherein the computer readable storage medium stores one or more programs, and the one or more programs can be executed by one or more processors to realize the steps of the information processing model generation method based on target attribute decoupling.

The invention provides an information processing model generation method based on target attribute decoupling and related equipment, which are characterized in that a feature graph generated and output by a hidden layer of a countermeasure network model is obtained, and the feature graph is encoded by utilizing Hash encoding to obtain coordinate values corresponding to each feature graph; clustering each feature graph according to the coordinate values to obtain feature graph groups, respectively calculating orthogonal loss and/or suppression loss corresponding to the feature graphs in each feature graph group, obtaining a model total loss value according to the calculated orthogonal loss and/or suppression loss, adjusting model parameters by using the model total loss value, and repeating the steps until the training is completed to obtain the generated information processing model. The method for generating the information processing model provided by this embodiment reduces attribute coupling by mining semantic attributes of the potential layer and constructing orthogonal loss of the cluster group, and reduces cross coupling by attributes by performing intersection suppression on the feature maps in the intersection region, thereby reducing attribute coupling between the feature maps and improving generalization capability of the generated information processing model.

It should be understood that equivalents and modifications of the technical solution and inventive concept thereof may occur to those skilled in the art, and all such modifications and alterations should fall within the scope of the appended claims.

Claims

1. An information processing model generation method based on target attribute decoupling is characterized by comprising the following steps:

2. The method according to claim 1, wherein the step of calculating the orthogonality loss and/or the suppression loss corresponding to the feature maps in each feature map group, and obtaining the total model loss value according to the calculated orthogonality loss and/or suppression loss comprises:

3. The method for generating an information processing model based on target attribute decoupling according to claim 1, wherein the step of encoding each feature map by using hash encoding to obtain coordinate values corresponding to each feature map comprises:

4. The method according to claim 3, wherein the step of obtaining the coordinate values of the feature maps by performing hash coding on the average feature map obtained by averaging the regions of the feature maps comprises:

5. The method for generating an information processing model based on target attribute decoupling according to claim 1, wherein the step of clustering each feature map according to the coordinate values to obtain at least two feature map groups comprises:

6. The method for generating an information processing model based on target attribute decoupling according to claim 2, wherein the orthogonal loss function expression corresponding to the orthogonal loss is as follows:

where m is the number of feature map groups, g_i，g_jIs the vector after the vectorization of the average feature map of the ith and jth groups, and m, i and j are positive integers.

7. The method for generating an information processing model based on target attribute decoupling according to claim 2, wherein the generating a confrontation network model comprises generating a network module and discriminating the network module;

8. The method for generating an information processing model based on target attribute decoupling according to claim 7, wherein an expression of a total loss function corresponding to the model total loss value is as follows:

wherein E represents data distribution, G represents a generation network module, G (x, c ') is used for generating an image with the condition of an input image x and a target domain label c', G_supRepresenting a generating network module with feature suppression and c is an original domain label; l is_supThe method comprises the steps that a preset inhibition loss is obtained, and the influence deviation of used and unused characteristic inhibition on a characteristic diagram for counteracting the inhibited discard in the countermeasure network is represented; l is_recFor application to generating reconstruction losses in network modules, L_OriGAnd L_OriDRepresenting a loss function, λ, of the generating network module and the discriminating network module_sup、λ_recAnd λ_GOIs a hyperparameter, L_GAnd L_DRespectively, generating the objective functions of the network module and the discriminating network module D.

9. An electronic device comprising a processor, a storage medium communicatively coupled to the processor, the storage medium adapted to store a plurality of instructions; the processor is adapted to call instructions in the storage medium to perform the steps of implementing the information processing model generation method based on target attribute decoupling of any of claims 1-8.

10. A computer readable storage medium, storing one or more programs, which are executable by one or more processors, to implement the steps of the information processing model generation method based on target attribute decoupling according to any one of claims 1 to 8.