CN113449865A - Optimization method for enhancing training artificial intelligence model

Info

Publication number: CN113449865A
Application number: CN202111001506.0A
Authority: CN
Inventors: 周晓辉; 袁博
Original assignee: Computing Chip Shenzhen Information Technology Co ltd
Current assignee: Computing Chip Shenzhen Information Technology Co ltd
Priority date: 2021-08-30
Filing date: 2021-08-30
Publication date: 2021-09-28
Anticipated expiration: 2041-08-30
Also published as: CN113449865B

Abstract

The invention provides an optimization method for the enhanced training of an artificial intelligence model, comprising the following steps in sequence: obtaining an original data set and a pre-trained model; generating two adversarial example sets using a three-level gradient optimization generation method and a transformation algorithm function; generating two mixed attack sample sets; and performing differential training until a recognition model whose defense performance meets the requirements is obtained. By combining the three-level gradient optimization method with the transformation algorithm function, two adversarial example sets with strong attack capability are generated, one serving as a reference and the other as an evolutionary enhancement set. Because the association between the pre-trained model and the adversarial examples is strengthened, the attack capability of the adversarial examples is improved, and a defense model with high defense performance can be obtained through differential training, which achieves the goal of enhanced training of the artificial intelligence model. The method is reasonably designed, yields a model with a high defense level efficiently, and is suitable for large-scale adoption.

Description

Optimization method for enhancing training artificial intelligence model

Technical Field

The invention belongs to the technical field of artificial intelligence, and particularly relates to an optimization method for enhancing training of an artificial intelligence model.

Background

Artificial intelligence refers to intelligence exhibited by machines built by humans, and generally denotes techniques that reproduce human intelligence through ordinary computer programs. Machine learning has been widely adopted in recent years, but it raises many security problems; for example, adversarial examples reduce the security of artificial intelligence. An adversarial example is data that can directly change or influence the recognition result of a machine learning model. It is obtained by using an algorithm to apply a small, carefully constructed perturbation to the original data, so that an otherwise normal machine learning model misrecognizes the input or fails to recognize it. The most direct defense against adversarial examples is to optimize the model with adversarial training, which improves the model's security and makes deep learning more resistant to adversarial interference. Since the existence of adversarial examples for deep learning network models was demonstrated, training in which adversarial examples are substituted into the training data has been widely used to ensure the robustness of neural network models against attack.

At present, many training stages generate adversarial examples with a single generation algorithm. The resulting adversarial examples are not aggressive enough against the artificial intelligence model, so attacks on the defended model still succeed with high probability. Moreover, the association between the pre-trained model and the attack model is low, which greatly widens the dispersion of the adversarial examples, increases wasted training effort, and leaves training efficiency in need of improvement.

Disclosure of Invention

To address the above technical problems, the invention provides an optimization method for the enhanced training of an artificial intelligence model that is reasonably designed and simple in structure, offers higher training intensity and efficiency, and helps obtain a model with higher defense capability.

To achieve the above object, the technical solution adopted by the invention is an optimization method for the enhanced training of an artificial intelligence model, comprising the following steps:

S1, based on the neural network model to be trained, extracting features from the samples of the acquired original data set, using the extracted features as the original data set, and training a recognition model that has not undergone adversarial training, namely the pre-trained model;

S2, generating adversarial example set I using a three-level gradient optimization generation method; taking the pre-trained model as a parameter for calculating the transformation probability in a transformation algorithm function, applying the transformation algorithm function to the adversarial examples produced by the iterations of the gradient optimization generation method, and thereby generating adversarial example set II;

S3, mixing adversarial example set I with the original data set to generate mixed attack sample set I, mixing adversarial example set I, adversarial example set II and the original data set to generate mixed attack sample set II, and retraining the recognition model under attack with each of the two mixed attack sample sets to obtain two attack-trained models;

S4, performing differential training on the two attack-trained models, with mixed attack sample set I used for the pre-attack and mixed attack sample set II used for the secondary attack. If the difference function indicating successful moving target defense is reached, the final defense model, namely the trained recognition model, is obtained. Otherwise, adversarial example set II is regenerated by adjusting the iteration step size of the gradient optimization generation algorithm and the transformation probability of the transformation algorithm in step S2, and steps S3-S4 are executed again.
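The control flow of steps S2 to S4 can be sketched as a retry loop. This is a minimal illustration under stated assumptions, not the patented implementation: the callables `make_set_one`, `make_set_two`, `train` and `defense_score`, the threshold test, and the way the step size and transformation probability are adjusted between rounds are all hypothetical, since the text only says that both quantities are adjusted.

```python
from typing import Callable, List, Optional, Sequence, Tuple

Sample = Tuple[Sequence[float], int]  # (features, label)


def enhance_training(
    original: List[Sample],
    make_set_one: Callable[[List[Sample], float], List[Sample]],
    make_set_two: Callable[[List[Sample], float, float], List[Sample]],
    train: Callable[[List[Sample]], object],
    defense_score: Callable[[object, object, List[Sample], List[Sample]], float],
    threshold: float,
    step_size: float = 0.1,
    transform_prob: float = 0.5,
    max_rounds: int = 10,
) -> Tuple[Optional[Tuple[object, object]], float, int]:
    """Repeat S2-S4 until the defense score reaches the threshold."""
    score = float("-inf")
    for round_index in range(max_rounds):
        # S2: reference set I and transformed set II.
        set_one = make_set_one(original, step_size)
        set_two = make_set_two(original, step_size, transform_prob)

        # S3: build the two mixed attack sample sets and retrain on each.
        mixed_one = original + set_one
        mixed_two = original + set_one + set_two
        model_one = train(mixed_one)
        model_two = train(mixed_two)

        # S4: compare the two attack-trained models.
        score = defense_score(model_one, model_two, mixed_one, mixed_two)
        if score >= threshold:
            return (model_one, model_two), score, round_index + 1

        # Hypothetical adjustment rule (an assumption, not from the text):
        # halve the step size and raise the transformation probability.
        step_size *= 0.5
        transform_prob = min(1.0, transform_prob + 0.1)
    return None, score, max_rounds
```

Passing the generators and the scoring function in as arguments keeps the loop independent of any particular attack or model library.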

Preferably, the gradient optimization generation method in S2 includes the following steps:

S2.1, an adversarial example [formula not reproduced] is obtained by maximizing the loss function [formula not reproduced]. The perturbation r is sought in the direction in which the loss function changes most rapidly with respect to x, i.e. [formula not reproduced], where sign denotes the sign function, [formula not reproduced] is the gradient of the loss function with respect to the input x, and [formula not reproduced];

S2.2, the perturbation is found iteratively, i.e. [formula not reproduced], where t is the current iteration number, the iteration step size is [formula not reproduced], T is the total number of iterations, and Proj projects each updated adversarial example back into the [formula not reproduced] neighborhood of x;

S2.3, the iteration in S2.2 is replaced by a momentum iteration, and the multi-step, momentum-based gradient iteration is expressed as [formula not reproduced], where [formula not reproduced] and u is the momentum decay factor.
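The formulas for S2.1 to S2.3 are not reproduced in this text. The surrounding definitions (a sign function applied to the input gradient, a projected iteration with step size tied to the total iteration count T, and a momentum decay factor u) match three widely used gradient attacks: the fast gradient sign method, its projected iterative variant, and the momentum iterative variant. Assuming those standard forms, the three levels would read approximately as follows. The symbols ε (perturbation bound), α (step size), L (loss), y (label) and g_t (accumulated gradient), the choice of the infinity norm, and the relation α = ε/T are assumptions of this sketch rather than notation confirmed by the source.

```latex
\begin{aligned}
\text{S2.1:}\quad & x' = x + r, \qquad
  r = \epsilon \cdot \operatorname{sign}\!\big(\nabla_x L(x, y)\big), \qquad
  \lVert r \rVert_\infty \le \epsilon \\[4pt]
\text{S2.2:}\quad & x_{t+1} = \operatorname{Proj}_{x,\epsilon}\!\Big(x_t + \alpha \cdot
  \operatorname{sign}\!\big(\nabla_x L(x_t, y)\big)\Big), \qquad
  \alpha = \epsilon / T \\[4pt]
\text{S2.3:}\quad & g_{t+1} = u \cdot g_t +
  \frac{\nabla_x L(x_t, y)}{\lVert \nabla_x L(x_t, y) \rVert_1}, \qquad
  x_{t+1} = \operatorname{Proj}_{x,\epsilon}\!\Big(x_t + \alpha \cdot
  \operatorname{sign}(g_{t+1})\Big)
\end{aligned}
```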

Preferably, the transformation algorithm function in S2 is [formula not reproduced], where p is the transformation probability and the random transformation function is [formula not reproduced], in which [formula not reproduced], [formula not reproduced] is a probability coefficient satisfying [formula not reproduced], and G(x) is a function that takes the pre-trained model as a parameter for calculating the transformation probability.

Preferably, the perturbation r in S2.1 satisfies the function [formula not reproduced], where D(r0) is a distance function representing the degree of perturbation, F is the trained classification model, [formula not reproduced] is the perturbed sample, f1 is the confidence output for the perturbed sample in its original category, f2 is the perturbation distance of the sample, and Z(r0) represents the distribution of all perturbed samples along these two directions over the whole perturbation range.

Preferably, Z(r0) is recast as a boundary-constrained multi-objective optimization problem in order to find the minimum perturbation, and the optimized function is: [formula not reproduced].

Preferably, the difference function for moving target defense in S4 is [formula not reproduced], where [formula not reproduced] is the surrogate model chosen by the attacker, [formula not reproduced] is the set of surrogate models, [formula not reproduced] is the system member model chosen by the defender through the game, N is the set of system member models, and [formula not reproduced] represents the attack on [formula not reproduced] by adversarial examples generated on [formula not reproduced]. The higher the DL value, the better the moving target defense effect.

Compared with the prior art, the invention has the advantages and positive effects that:

1. With the optimization method for the enhanced training of an artificial intelligence model, two adversarial example sets with strong attack capability can be generated by combining a three-level gradient optimization method with a transformation algorithm function, one set serving as a reference and the other as an evolutionary enhancement set. The association between the pre-trained model and the adversarial examples is strengthened, so the attack capability of the adversarial examples is improved and a defense model with high defense performance can be obtained through differential training, which achieves the goal of enhanced training of the artificial intelligence model. The method is reasonably designed and efficient, yields a model with a high defense level, and is suitable for large-scale adoption.

Detailed Description

In the following description, numerous specific details are set forth to provide a thorough understanding of the invention. The invention may, however, be practiced in ways other than those specifically described here, and it is therefore not limited to the specific embodiments disclosed below.

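The embodiment of the invention provides an optimization method for the enhanced training of an artificial intelligence model, comprising the following steps:

S1, based on the neural network model to be trained, extracting features from the samples of the acquired original data set, using the extracted features as the original data set, and training a recognition model that has not undergone adversarial training, namely the pre-trained model;

S2, generating adversarial example set I using a three-level gradient optimization generation method; taking the pre-trained model as a parameter for calculating the transformation probability in a transformation algorithm function, applying the transformation algorithm function to the adversarial examples produced by the iterations of the gradient optimization generation method, and thereby generating adversarial example set II;

S3, mixing adversarial example set I with the original data set to generate mixed attack sample set I, mixing adversarial example set I, adversarial example set II and the original data set to generate mixed attack sample set II, and retraining the recognition model under attack with each of the two mixed attack sample sets to obtain two attack-trained models;

S4, performing differential training on the two attack-trained models, with mixed attack sample set I used for the pre-attack and mixed attack sample set II used for the secondary attack. If the difference function indicating successful moving target defense is reached, the final defense model, namely the trained recognition model, is obtained. Otherwise, adversarial example set II is regenerated by adjusting the iteration step size of the gradient optimization generation algorithm and the transformation probability of the transformation algorithm in step S2, and steps S3-S4 are executed again until a model that defends successfully is obtained.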

By combining the three-level gradient optimization method with the transformation algorithm function, two adversarial example sets with strong attack capability can be generated, one serving as a reference and the other as an evolutionary enhancement set. The association between the pre-trained model and the adversarial examples is strengthened, so the attack capability of the adversarial examples is improved and a defense model with high defense performance can be obtained through differential training. This achieves the goal of enhanced training of the artificial intelligence model; the design is reasonable, and a model with a high defense level can be obtained.

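More specifically, the gradient optimization generation method in step S2 of the invention includes the following steps:

S2.1, an adversarial example [formula not reproduced] is obtained by maximizing the loss function [formula not reproduced]. The perturbation r is sought in the direction in which the loss function changes most rapidly with respect to x, i.e. [formula not reproduced], where sign denotes the sign function, [formula not reproduced] is the gradient of the loss function with respect to the input x, and [formula not reproduced];

S2.2, the perturbation is found iteratively, i.e. [formula not reproduced], where t is the current iteration number, the iteration step size is [formula not reproduced], T is the total number of iterations, and Proj projects each updated adversarial example back into the [formula not reproduced] neighborhood of x;

S2.3, the iteration in S2.2 is replaced by a momentum iteration, and the multi-step, momentum-based gradient iteration is expressed as [formula not reproduced], where [formula not reproduced] and u is the momentum decay factor.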
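To make the three levels concrete, the following is a small pure-Python sketch on a toy logistic-regression classifier. It assumes the standard single-step sign attack, its projected iterative form, and the momentum iterative form, since the source formulas are not reproduced here. The toy model, the function names, the infinity-norm neighborhood and the step size `eps / steps` are all assumptions chosen for illustration.

```python
import math
from typing import List


def logistic_loss(weights: List[float], bias: float, x: List[float], label: int) -> float:
    """Cross-entropy loss of a logistic model for one sample (label is 0 or 1)."""
    logit = sum(w * v for w, v in zip(weights, x)) + bias
    prob = 1.0 / (1.0 + math.exp(-logit))
    return -(label * math.log(prob) + (1 - label) * math.log(1.0 - prob))


def loss_gradient(weights: List[float], bias: float, x: List[float], label: int) -> List[float]:
    """Gradient of the loss with respect to the input x: (prob - label) * w."""
    logit = sum(w * v for w, v in zip(weights, x)) + bias
    prob = 1.0 / (1.0 + math.exp(-logit))
    return [(prob - label) * w for w in weights]


def sign(value: float) -> int:
    return (value > 0) - (value < 0)


def project(x_adv: List[float], x: List[float], eps: float) -> List[float]:
    """Clamp x_adv into the infinity-norm ball of radius eps around x."""
    return [min(max(a, o - eps), o + eps) for a, o in zip(x_adv, x)]


def single_step(weights, bias, x, label, eps):
    """Level 1: one step of size eps along the sign of the input gradient."""
    grad = loss_gradient(weights, bias, x, label)
    return [v + eps * sign(g) for v, g in zip(x, grad)]


def iterative(weights, bias, x, label, eps, steps):
    """Level 2: repeated small sign steps, each followed by projection."""
    alpha = eps / steps  # assumed step-size rule
    x_adv = list(x)
    for _ in range(steps):
        grad = loss_gradient(weights, bias, x_adv, label)
        stepped = [v + alpha * sign(g) for v, g in zip(x_adv, grad)]
        x_adv = project(stepped, x, eps)
    return x_adv


def momentum_iterative(weights, bias, x, label, eps, steps, u=1.0):
    """Level 3: accumulate L1-normalized gradients with decay factor u."""
    alpha = eps / steps
    x_adv = list(x)
    velocity = [0.0] * len(x)
    for _ in range(steps):
        grad = loss_gradient(weights, bias, x_adv, label)
        norm = sum(abs(g) for g in grad) or 1.0  # avoid division by zero
        velocity = [u * m + g / norm for m, g in zip(velocity, grad)]
        stepped = [v + alpha * sign(m) for v, m in zip(x_adv, velocity)]
        x_adv = project(stepped, x, eps)
    return x_adv
```

For example, `momentum_iterative([1.0, -2.0], 0.0, [0.5, 0.5], 1, eps=0.1, steps=4, u=0.9)` stays within 0.1 of the original input in every coordinate while raising the loss of the toy model.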

To improve the attack capability of the mixed attack sample set used for the secondary attack, the transformation algorithm function in S2 of the invention is [formula not reproduced], where p is the transformation probability and the random transformation function is [formula not reproduced], in which [formula not reproduced], [formula not reproduced] is a probability coefficient satisfying [formula not reproduced], and G(x) is a function that takes the pre-trained model as a parameter for calculating the transformation probability. Because the pre-trained model is thereby effectively associated with the adversarial examples, the dispersion of the adversarial examples is reduced, wasted training effort is reduced, training efficiency is improved, and the attack capability of the samples is further increased.
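The idea of a probabilistic transformation can be illustrated as follows. This is a sketch only: the source formula is not reproduced, so the code assumes the common pattern "apply a random transformation with probability p, otherwise leave the sample unchanged". The rescaling transform, the helper `probability_from_model`, and its linear mapping from a model confidence to p are hypothetical stand-ins for the unspecified random transformation function and for G(x).

```python
import random
from typing import Callable, List, Sequence


def random_rescale(x: Sequence[float], rng: random.Random,
                   low: float = 0.9, high: float = 1.1) -> List[float]:
    """Hypothetical random transformation: scale all features by one random factor."""
    factor = rng.uniform(low, high)
    return [factor * v for v in x]


def transform(x: Sequence[float], p: float, rng: random.Random,
              op: Callable[[Sequence[float], random.Random], List[float]] = random_rescale,
              ) -> List[float]:
    """Return op(x) with probability p, otherwise a copy of x."""
    if rng.random() < p:
        return op(x, rng)
    return list(x)


def probability_from_model(confidence: Callable[[Sequence[float]], float],
                           x: Sequence[float],
                           p_min: float = 0.1, p_max: float = 0.9) -> float:
    """Hypothetical stand-in for G(x): map a model confidence in [0, 1] to p.

    The linear mapping is an assumption made for illustration only.
    """
    c = min(max(confidence(x), 0.0), 1.0)
    return p_min + (p_max - p_min) * c
```

In use, each adversarial example produced during the gradient iterations would be passed through `transform` with a probability derived from the pre-trained model, which is how this sketch ties the transformation to that model.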

To enhance the robustness of the model and to find samples on which the model's confidence changes rapidly, the perturbation r in S2.1 of the invention satisfies the function: [formula not reproduced].

Further, because the number of samples in the set [formula not reproduced] is too large, it is not feasible to find every qualifying perturbed sample before adversarially training the model, so representative samples must be found within [formula not reproduced] for adversarial training. To find the minimum perturbation under which the output of the classifier changes substantially, the following function is optimized: [formula not reproduced].
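Choosing representative perturbed samples under two objectives can be sketched with a Pareto filter. The sketch assumes that each candidate is scored by f1 (the confidence the classifier still assigns to the original category) and f2 (the perturbation distance), and that both are to be minimized; the optimized function itself is not reproduced in this text, so the function names and the `confidence_limit` criterion are hypothetical.

```python
from typing import List, Optional, Tuple

# (f1: confidence in the original category, f2: perturbation distance)
Candidate = Tuple[float, float]


def dominates(a: Candidate, b: Candidate) -> bool:
    """True if a is at least as good as b on both objectives and better on one."""
    return a[0] <= b[0] and a[1] <= b[1] and (a[0] < b[0] or a[1] < b[1])


def pareto_front(candidates: List[Candidate]) -> List[Candidate]:
    """Keep only the candidates that no other candidate dominates."""
    return [c for c in candidates
            if not any(dominates(other, c) for other in candidates)]


def smallest_effective(candidates: List[Candidate],
                       confidence_limit: float) -> Optional[Candidate]:
    """Smallest perturbation whose remaining confidence is at most the limit."""
    feasible = [c for c in candidates if c[0] <= confidence_limit]
    return min(feasible, key=lambda c: c[1]) if feasible else None


candidates = [(0.9, 0.01), (0.4, 0.05), (0.5, 0.08), (0.1, 0.20), (0.1, 0.30)]
front = pareto_front(candidates)
best = smallest_effective(candidates, confidence_limit=0.5)
```

Here `front` keeps the three non-dominated trade-offs and `best` is `(0.4, 0.05)`, the smallest perturbation that pushes the confidence to 0.5 or below.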

To obtain a defense model with a moving target defense capability, the difference function for moving target defense in S4 of the invention is [formula not reproduced], where [formula not reproduced] is the surrogate model chosen by the attacker, [formula not reproduced] is the set of surrogate models, and [formula not reproduced] represents the attack on [formula not reproduced] by adversarial examples generated on [formula not reproduced]. The higher the DL value, the better the moving target defense effect.
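Because the DL formula is not reproduced here, the following only illustrates the kind of quantity such a score could be built from. It assumes that the attack term is a success rate of adversarial examples crafted on a surrogate model against a member model, and it uses a hypothetical spread-based score (the smallest gap, over surrogates, between the most and least vulnerable member). Neither assumption is confirmed by the source.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Sample = Tuple[Sequence[float], int]  # (features, true label)


def attack_success_rate(predict: Callable[[Sequence[float]], int],
                        adversarial: List[Sample]) -> float:
    """Fraction of adversarial examples that the member model misclassifies."""
    fooled = sum(1 for x, label in adversarial if predict(x) != label)
    return fooled / len(adversarial)


def differential_score(success: Dict[str, Dict[str, float]]) -> float:
    """Hypothetical spread-based score (an assumption, not the source formula).

    success[a][n] is the success rate of examples crafted on surrogate a
    against member model n. A larger value means that no single surrogate
    breaks all member models equally well.
    """
    gaps = []
    for rates_by_member in success.values():
        rates = list(rates_by_member.values())
        gaps.append(max(rates) - min(rates))
    return min(gaps)


table = {
    "surrogate_1": {"member_1": 0.9, "member_2": 0.2},
    "surrogate_2": {"member_1": 0.5, "member_2": 0.4},
}
score = differential_score(table)
```

With this table the per-surrogate gaps are about 0.7 and 0.1, so the score is about 0.1; member models that fail on exactly the same adversarial examples would score 0.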

The above is only a preferred embodiment of the invention and does not limit the invention to this form. Any person skilled in the art may use the technical content disclosed above to make changes or modifications and obtain equivalent embodiments. Any simple modification or equivalent change made to the above embodiment in accordance with the technical essence of the invention, without departing from its technical content, still falls within the scope of protection of the invention.

Claims

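1. An optimization method for the enhanced training of an artificial intelligence model, comprising step S1: based on the neural network model to be trained, extracting features from the samples of the acquired original data set, using the extracted features as the original data set, and training a recognition model that has not undergone adversarial training, namely the pre-trained model;

the method being characterized by further comprising the following steps:

S2, generating adversarial example set I using a three-level gradient optimization generation method; taking the pre-trained model as a parameter for calculating the transformation probability in a transformation algorithm function, applying the transformation algorithm function to the adversarial examples produced by the iterations of the gradient optimization generation method, and thereby generating adversarial example set II;

S3, mixing adversarial example set I with the original data set to generate mixed attack sample set I, mixing adversarial example set I, adversarial example set II and the original data set to generate mixed attack sample set II, and retraining the recognition model under attack with each of the two mixed attack sample sets to obtain two attack-trained models;

S4, performing differential training on the two attack-trained models, with mixed attack sample set I used for the pre-attack as a reference and mixed attack sample set II used for the secondary attack; if the secondary attack reaches the difference function indicating successful moving target defense, obtaining the final defense model, namely the trained recognition model; otherwise, regenerating adversarial example set II by adjusting the iteration step size of the gradient optimization generation algorithm and the transformation probability of the transformation algorithm in step S2, and executing steps S3-S4 again.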

2. The optimization method for the enhanced training of an artificial intelligence model according to claim 1, wherein the gradient optimization generation method in S2 comprises the following steps:

S2.1, obtaining an adversarial example [formula not reproduced] by maximizing the loss function [formula not reproduced], where [formula not reproduced], sign denotes the sign function, [formula not reproduced] is the gradient of the loss function with respect to the input x, and [formula not reproduced];

S2.2, finding the perturbation iteratively, i.e. [formula not reproduced], where t is the current iteration number, the iteration step size is [formula not reproduced], T is the total number of iterations, and Proj projects each updated adversarial example back into the [formula not reproduced] neighborhood of x;

S2.3, replacing the iteration in S2.2 with a momentum iteration, the multi-step, momentum-based gradient iteration being expressed as [formula not reproduced], where [formula not reproduced] and u is the momentum decay factor.

3. The optimization method for the enhanced training of an artificial intelligence model according to claim 2, wherein the transformation algorithm function in S2 is [formula not reproduced], where p is the transformation probability and the random transformation function is [formula not reproduced], in which [formula not reproduced], and [formula not reproduced] is a probability coefficient satisfying [formula not reproduced].

4. The optimization method for the enhanced training of an artificial intelligence model according to claim 3, wherein the perturbation r in S2.1 satisfies the function [formula not reproduced], where D(r0) is a distance function representing the degree of perturbation, F is the trained classification model, [formula not reproduced] is the perturbed sample, f1 is the confidence output for the perturbed sample in its original category, f2 is the perturbation distance of the sample, and Z(r0) represents the distribution of all perturbed samples along these two directions over the whole perturbation range.

5. The optimization method for the enhanced training of an artificial intelligence model according to claim 4, wherein Z(r0) is recast as a boundary-constrained multi-objective optimization problem in order to find the minimum perturbation, and the optimized function is: [formula not reproduced].

6. The optimization method for the enhanced training of an artificial intelligence model according to claim 5, wherein the difference function for moving target defense in S4 is [formula not reproduced], where [formula not reproduced] is the surrogate model chosen by the attacker, [formula not reproduced] is the set of surrogate models, and [formula not reproduced] represents the attack on [formula not reproduced] by adversarial examples generated on [formula not reproduced]; the higher the DL value, the better the moving target defense effect.