CN106953879A - Network defense strategy selection method based on a best-response dynamics evolutionary game model - Google Patents
Network defense strategy selection method based on a best-response dynamics evolutionary game model
- Publication number: CN106953879A
- Application number: CN201710335128.7A
- Authority
- CN
- China
- Prior art keywords
- defender
- strategy
- defence
- best response
- response dynamics
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L63/00—Network architectures or network communication protocols for network security
- H04L63/20—Network architectures or network communication protocols for network security for managing network security; network security policies in general
- H04L63/205—Network architectures or network communication protocols for network security for managing network security; network security policies in general involving negotiation or determination of the one or more network security mechanisms to be used, e.g. by negotiation between the client and the server or between peers or by selection according to the capabilities of the entities involved
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L63/00—Network architectures or network communication protocols for network security
- H04L63/14—Network architectures or network communication protocols for network security for detecting or protecting against malicious traffic
- H04L63/1441—Countermeasures against malicious traffic
Landscapes
- Engineering & Computer Science (AREA)
- Computer Security & Cryptography (AREA)
- Computer Hardware Design (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention belongs to the technical field of computer network security defense and specifically relates to a network defense strategy selection method based on a best-response dynamics evolutionary game model. The method comprises: under the bounded-rationality condition, using the best-response dynamics learning mechanism to build an attack-defense evolutionary game model based on best-response dynamics; studying the defense strategy selection problem among different defenders through the dynamic evolution process of defender strategy selection and the defense evolutionary equilibrium points; and, on the basis of the established best-response dynamics evolutionary game model, analyzing and solving the model through concrete examples and generalizing the evolutionary game model. The invention establishes a non-cooperative network attack-defense evolutionary game model under bounded rationality. By arranging the initial state of defender strategy selection and evolving continuously, best-response dynamics eventually drives the game system to a stable state, from which the optimal defense strategy is obtained. The proposed method is well suited to the defense strategy selection problem in network security and can provide guidance for network security research.
Description
Technical field
The invention belongs to the technical field of computer network security defense and specifically relates to a network defense strategy selection method based on a best-response dynamics evolutionary game model.
Background technology
In recent years, the rapid development of the Internet has brought enormous changes to people's social life; in particular, the "Internet+" strategy has pushed Internet development to a new climax. With this rapid development, cyberspace security problems have become increasingly prominent. The network security situation is severe: in the face of various domestic and overseas network attacks, how to strengthen network security defense has become an urgent problem of the current era, and new technologies are needed that can analyze and predict network attack-defense behavior and thereby implement active security defense. Since the state of network security is determined by the adversarial behavior of the attack and defense sides and its outcome, and since the target antagonism, strategy interdependence, and non-cooperative relations in network attack-defense confrontation are exactly the essential characteristics studied by game theory, game theory has been applied increasingly in network security research, mostly using classical game models to analyze network security behavior.
However, most existing research results are based on traditional game theory, which is built on the premise that participants are perfectly rational, an assumption that does not match reality. The resulting game models deviate considerably from practice, which reduces the accuracy and guiding value of the derived security defense strategy selection methods. To address this, some scholars have applied evolutionary game theory, premised on bounded rationality, to the analysis of network attack-defense. Evolutionary games better match the dynamic evolution of network attack-defense confrontation: the behavior pattern of the two sides is modeled as a gradual evolutionary process with a certain adaptive learning ability, typically solved and analyzed with replicator dynamics equations. But the replicator dynamics learning mechanism suffers from slow learning speed and low strategy selection efficiency.
Summary of the invention
Most existing research results are based on traditional game theory and its premise of perfectly rational participants, an assumption inconsistent with reality; and if existing evolutionary game analysis is applied directly to network attack-defense confrontation, the learning cycle is long and the learning efficiency low, which greatly reduces the applicability of the model and method. Against this background, the invention proposes a network defense strategy selection method based on a best-response dynamics evolutionary game model.
The technical scheme of the invention is a network defense strategy selection method based on a best-response dynamics evolutionary game model, comprising the following steps:
Step 1: under the bounded-rationality condition, use the best-response dynamics learning mechanism to build an attack-defense evolutionary game model based on best-response dynamics;
Step 2: study the defense strategy selection problem among different defenders through the dynamic evolution process of defender strategy selection and the defense evolutionary equilibrium points;
Step 3: on the basis of the established best-response dynamics evolutionary game model, analyze and solve the model through concrete examples, and generalize the evolutionary game model.
In the described network defense strategy selection method, in Step 1 the best-response dynamics evolutionary game model is represented by the four-tuple BRDEGM = (D, DS, P, U), where:
D = {d1, d2, …, dn} is the space of defense participants, where di denotes defender i; different defenders may choose different defense strategies;
DS = {DS1, DS2, …, DSm} is the defender strategy space; all defenders share this defense strategy set;
P = {p1, p2, …, pm} is the defender belief set, where pi is the probability that a defender chooses defense strategy DSi;
U = {U1, U2, …, Um} is the payoff function set, where Ui is the payoff a defender obtains by choosing defense strategy DSi.
In the described network defense strategy selection method, the best-response dynamics equation is Nt+1 = |{ i : pi(t) ≥ 1 }|, where Nt denotes the number of defenders among the n defenders who choose strategy DS1 in period t, pi(t) denotes the number of defender i's neighbours choosing DS1 in period t, and DS1 is any one defense strategy in the available strategy set.
In Step 2 of the described network defense strategy selection method, the dynamic evolution process of defense strategy selection is: in the network attack-defense confrontation there is a competitive relation between different defense strategies, and high-payoff defense strategies will eliminate lower-payoff ones.
In the described network defense strategy selection method, the payoff matrix under which the high-payoff defense strategy eliminates the lower-payoff one is:
          DS1                 DS2
DS1    (u1, u1)            (u2 - a, u2 + a)
DS2    (u2 + a, u2 - a)    (u2, u2)
where u1 and u2 are the payoffs of strategies DS1 and DS2 respectively, and a is the payoff difference term between u1 and u2, with u1 - u2 >> a.
In Step 3 of the described network defense strategy selection method, generalizing the evolutionary game model mainly means: when the defense side consists of an arbitrary number n of defenders, evolutionary game analysis is carried out on any two defense strategies DSi and DSj on the basis of the best-response dynamics evolutionary game model, assuming DSi is the dominant strategy relative to DSj and i ≠ j; as time passes, certain evolution results are finally obtained.
In the described network defense strategy selection method, one evolution result is: for a defense side with n defenders, when all defenders choose defense strategy DSi (or strategy DSj) in the initial game, then under the best-response dynamics learning mechanism the stable state finally reached by the strategy selection of all network defenders is that all defenders choose strategy DSi (or strategy DSj).
In the described network defense strategy selection method, another evolution result is: for a defense side with n defenders, when n is odd, as long as one defender chooses strategy DSi in the initial game, then through repeated adjustments of their own strategies over multiple periods under the best-response dynamics learning mechanism, the system finally converges to the stable state in which all defenders choose strategy DSi.
In the described network defense strategy selection method, another evolution result is: for a defense side with n defenders, when n is even, if in the initial game one defender chooses defense strategy DSi and all other defenders choose strategy DSj, then best-response dynamics cannot make all defenders converge to a stable state; as time evolves, each defender's strategy adjustment falls into periodic cyclic changes.
In the described network defense strategy selection method, another evolution result is: for a defense side with n defenders, as long as two adjacent defenders both choose strategy DSi in the initial game, then under the best-response dynamics learning mechanism the system eventually converges, as time evolves, to the stable state in which all defenders choose strategy DSi.
The beneficial effects of the invention are: the invention establishes a non-cooperative network attack-defense evolutionary game model under bounded rationality and analyzes and solves its equilibria. On this basis, from the defender's perspective, for the strategy learning and adjustment process among different defenders, a multi-stage repeated dynamic evolutionary game model among defenders is established using the best-response dynamics learning mechanism, and the defense strategy selection problem among different defenders is studied. On the basis of the established best-response dynamics evolutionary game model, the model is analyzed and solved through concrete examples and further generalized, improving its versatility. Both the parity of the number of defenders and the initial state of defender strategy selection affect the final evolution result of the whole game system. By arranging the initial state of defender strategy selection and evolving continuously, best-response dynamics eventually drives the game system to a stable state, from which the optimal defense strategy is obtained. This shows that the proposed method is well suited to the defense strategy selection problem in network security and can provide guidance for network security research.
Brief description of the drawings
Fig. 1 is a schematic block diagram of the steps of the method of the invention;
Fig. 2 is a schematic diagram of the network defender game tree;
Fig. 3 is a schematic diagram of best-response dynamics when one defender chooses DS1 in the initial game;
Fig. 4 is a schematic diagram of the best-response dynamics strategy adjustment process when two defenders choose DS1 in the initial game;
Fig. 5 is a schematic diagram of the best-response dynamics strategy adjustment process when three defenders choose DS1 in the initial game;
Fig. 6 is a schematic diagram of the simulation results when n is odd and only one defender chooses DS1 in the initial state;
Fig. 7 is a schematic diagram of the simulation results when n is odd and two adjacent defenders choose DS1 in the initial state.
Embodiment
Embodiment 1. With reference to Figs. 1-7, a network defense strategy selection method based on a best-response dynamics evolutionary game model comprises the following steps:
Step 1: under the bounded-rationality condition, use the best-response dynamics learning mechanism to build an attack-defense evolutionary game model based on best-response dynamics.
In Step 1, the best-response dynamics evolutionary game model is represented by the four-tuple BRDEGM = (D, DS, P, U):
D = {d1, d2, …, dn} is the space of defense participants, where di denotes defender i; different defenders may choose different defense strategies;
DS = {DS1, DS2, …, DSm} is the defender strategy space; all defenders share this defense strategy set;
P = {p1, p2, …, pm} is the defender belief set, where pi is the probability that a defender chooses defense strategy DSi;
U = {U1, U2, …, Um} is the payoff function set, where Ui is the payoff a defender obtains by choosing defense strategy DSi.
The best-response dynamics equation is Nt+1 = |{ i : pi(t) ≥ 1 }|, where Nt denotes the number of defenders among the n defenders who choose strategy DS1 in period t, pi(t) is the number of defender i's neighbours choosing DS1 in period t, and DS1 is any one defense strategy in the available strategy set.
Step 2: study the defense strategy selection problem among different defenders through the dynamic evolution process of defender strategy selection and the defense evolutionary equilibrium points.
In Step 2, the dynamic evolution process of defender strategy selection is: in the network attack-defense confrontation there is a competitive relation between different defense strategies, and high-payoff defense strategies eliminate lower-payoff ones. The payoff matrix is:
          DS1                 DS2
DS1    (u1, u1)            (u2 - a, u2 + a)
DS2    (u2 + a, u2 - a)    (u2, u2)
where u1 and u2 are the payoffs of strategies DS1 and DS2 respectively, and a is the payoff difference term, with u1 - u2 >> a.
Step 3: on the basis of the established best-response dynamics evolutionary game model, analyze and solve the model through concrete examples, and generalize the evolutionary game model.
Generalizing the evolutionary game model in Step 3 mainly means: when the defense side has an arbitrary number n of defenders, evolutionary game analysis is carried out on any two defense strategies DSi and DSj on the basis of the best-response dynamics evolutionary game model, assuming DSi is the dominant strategy relative to DSj and i ≠ j; as time passes, certain evolution results are finally obtained.
Further, one evolution result is: for a defense side with n defenders, when all defenders choose defense strategy DSi (or strategy DSj) in the initial game, then under the best-response dynamics learning mechanism the stable state finally reached by the strategy selection of all network defenders is that all defenders choose strategy DSi (or strategy DSj).
Further, when n is odd, as long as one defender chooses strategy DSi in the initial game, then through repeated adjustments of their own strategies over multiple periods under the best-response dynamics learning mechanism, the system finally converges to the stable state in which all defenders choose strategy DSi.
Further, when n is even, if in the initial game one defender chooses defense strategy DSi and all other defenders choose strategy DSj, then best-response dynamics cannot make all defenders converge to a stable state; as time evolves, each defender's strategy adjustment falls into periodic cyclic changes.
Further, as long as two adjacent defenders both choose strategy DSi in the initial game, then under the best-response dynamics learning mechanism the system eventually converges, as time evolves, to the stable state in which all defenders choose strategy DSi.
Embodiment 2. With reference to Figs. 1-7, a network defense strategy selection method based on a best-response dynamics evolutionary game model is used to analyze the network attack-defense evolutionary game process. Because the conventional replicator dynamics learning mechanism suffers from slow learning speed and low learning efficiency, the invention retains the ideas of evolutionary games but, under the bounded-rationality condition, uses the best-response dynamics learning mechanism to build an attack-defense evolutionary game model based on best-response dynamics, analyzes the dynamic evolution process of defender strategy selection and the defense evolutionary equilibrium points, and studies the defense strategy selection problem among different defenders. On the basis of the established best-response dynamics evolutionary game model, the model is analyzed and solved through concrete examples and further generalized, improving its versatility. The proposed method is well suited to the defense strategy selection problem in network security and can provide guidance for network security research.
In a network attack-defense game system, decision makers evolve continuously from the initial state through constant trial and error, imitation, and strategy adjustment, and finally reach some evolutionarily stable equilibrium; the strategy learning method and process of the decision makers directly affect the final evolutionarily stable state. For the defense side, assume that different defenders share the same defense strategy set. Because different defense strategies bring different payoffs to defenders, under the pull of payoff differences and the drive of the learning mechanism, low-payoff defenders constantly learn the strategies of high-payoff defenders. As time evolves, low-payoff strategies are eliminated by high-payoff ones. For real attack behavior, from the defense side's perspective and according to the cooperative relations among different defenders, the best-response dynamics learning mechanism is applied to the game among different defense strategies, establishing a multi-stage best-response dynamics repeated evolutionary game model. Driven by this "learn and improve" mechanism, the selection probabilities of the different defense strategies exhibit a dynamic evolution trend, from which a network security defense strategy selection method can finally be derived.
Best-response dynamics evolutionary game model
For the network defense side, based on evolutionary game theory and under the condition of bounded rationality, the best-response dynamics fast learning mechanism is used to build a best-response dynamics evolutionary game model among the different defenders.
Definition 1. The best-response dynamics evolutionary game model BRDEGM (Best-response Dynamics Evolutionary Game Model) can be expressed as the four-tuple BRDEGM = (D, DS, P, U), where:
1. D = {d1, d2, …, dn} is the space of defense participants. Here di denotes defender i; different defenders may choose different defense strategies.
2. DS = {DS1, DS2, …, DSm} is the defender strategy space. All defenders share this defense strategy set.
3. P = {p1, p2, …, pm} is the defender belief set. Here pi is the probability that a defender chooses defense strategy DSi.
4. U = {U1, U2, …, Um} is the payoff function set. Here Ui is the payoff a defender obtains by choosing defense strategy DSi.
In the network attack-defense confrontation there is a competitive relation between different defense strategies, and high-payoff defense strategies eliminate lower-payoff ones. For any two defenders d1 and d2, assume DS1 and DS2 are any two defense strategies in the available strategy set, where DS1 is the dominant strategy over DS2: strategy DS1 has a better protection effect than DS2 and brings a higher defense payoff, but the defense cost of DS1 is higher than that of DS2. Using the best-response dynamics fast learning mechanism, an evolutionary game model is established under bounded rationality. The game tree is shown in Fig. 2.
When the two players choose different strategies DS1 and DS2, the player choosing DS1 obtains the higher defense payoff but pays the extra cost, denoted a, so its payoff is u2 - a, while the player choosing DS2 free-rides and obtains the higher payoff u2 + a, where u1 - u2 >> a.
From the payoff matrix
          DS1                 DS2
DS1    (u1, u1)            (u2 - a, u2 + a)
DS2    (u2 + a, u2 - a)    (u2, u2)
it can be seen that the game has two pure-strategy Nash equilibria, (DS1, DS1) and (DS2, DS2), of which (DS1, DS1) is the Pareto-dominant equilibrium. But if the trust relations between defenders, or factors such as risk sensitivity, are also taken into account, then the equilibrium (DS2, DS2) becomes more likely to occur.
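Under illustrative numbers satisfying u1 - u2 >> a (here u1 = 10, u2 = 5, a = 1, chosen only for this sketch), the two pure-strategy Nash equilibria can be checked mechanically:

```python
u1, u2, a = 10, 5, 1  # illustrative values with u1 - u2 >> a

# payoff[row][col] = (row player's payoff, col player's payoff); 0 = DS1, 1 = DS2
payoff = [
    [(u1, u1),         (u2 - a, u2 + a)],
    [(u2 + a, u2 - a), (u2, u2)],
]

def is_pure_nash(r, c):
    """Neither player gains from a unilateral deviation."""
    row_ok = all(payoff[r][c][0] >= payoff[r2][c][0] for r2 in range(2))
    col_ok = all(payoff[r][c][1] >= payoff[r][c2][1] for c2 in range(2))
    return row_ok and col_ok

equilibria = [(r, c) for r in range(2) for c in range(2) if is_pure_nash(r, c)]
print(equilibria)  # [(0, 0), (1, 1)] -> (DS1, DS1) and (DS2, DS2)
```

Note that (0, 0), i.e. (DS1, DS1), yields payoff u1 = 10 to both players, confirming it as the Pareto-dominant equilibrium of the two.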
Based on the above game setting, assume that all defenders lie on a circle and that each defender plays a repeated game with its left and right neighbours, learning any strategy whose payoff is higher than that of its own strategy. Denote the payoffs obtained by the two sides as Π1 and Π2, and let pi(t) be the number of player i's neighbours choosing strategy DS1 in period t; the possible values of pi(t) are 0, 1, and 2. The payoff obtained by a player choosing DS1 is Π1 = u1 × pi(t) + (u2 - a) × [2 - pi(t)], and the payoff obtained by a player choosing DS2 is Π2 = (u2 + a) × pi(t) + u2 × [2 - pi(t)]. According to the best-response dynamics mechanism, when Π1 > Π2, i.e. when pi(t) > 2a/(u1 - u2), the player will choose strategy DS1 in the next game stage. From this the following best-response dynamics equation is obtained:
Nt+1 = |{ i : pi(t) ≥ 1 }|
where Nt denotes the number of defenders among the n defenders choosing strategy DS1 in period t. Through this game dynamics equation, the final stable state of network defense strategy selection will be a definite trend of defense strategy choices.
Analysis and solution of the evolutionary game model
Based on the best-response dynamics evolutionary game model established above, the strategy learning process among defenders is described and analyzed in detail. In the network attack-defense confrontation, defenders are boundedly rational but have a fast learning ability: they can analyze and summarize the payoffs of the previous stage and immediately make corresponding strategy adjustments, maximizing the defense payoff of the next stage. Over time, the strategy selection of all defenders reaches a stable state.
Below, reasoning from the particular to the general, the circle game is analyzed concretely using the best-response dynamics learning mechanism.
Assume the defense side has 5 defenders distributed at 5 different positions on a circle, as shown in Fig. 3. The defender at each position can choose either strategy DS1 or strategy DS2, so the game has 32 initial states: one in which all choose DS1, one in which all choose DS2, and 30 remaining states that contain both DS1 and DS2.
pi(t) is the number of player i's neighbours choosing DS1 in period t; its possible values are 0, 1, and 2. Correspondingly, the number of its neighbours choosing DS2 is 2 - pi(t), which likewise takes the values 0, 1, and 2. According to the best-response dynamics mechanism, when Π1 > Π2, i.e. when pi(t) > 2a/(u1 - u2), the player will choose strategy DS1 in the next game stage. Since u1 - u2 >> a, the threshold 2a/(u1 - u2) is less than 1. Because pi(t) only takes the values 0, 1, and 2, if at least one of player i's two neighbours chooses strategy DS1 in period t, then player i will choose strategy DS1 in period t + 1; if neither neighbour chooses DS1, then player i will choose strategy DS2 in period t + 1. It follows that when all 5 defenders choose strategy DS1 (resp. strategy DS2) at the start, the final stable state is that all defenders choose strategy DS1 (resp. strategy DS2).
If exactly 1 defender chooses strategy DS1 in the initial game while the other defenders use strategy DS2, then through repeated strategy adjustments over 4 periods the 5 defenders finally converge to the stable state in which all defenders use strategy DS1. As shown in Fig. 3, given the defenders' initial state (DS1, DS2, DS2, DS2, DS2), after 4 stages of evolution the defenders finally reach the stable state (DS1, DS1, DS1, DS1, DS1).
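The 4-stage convergence from the initial state (DS1, DS2, DS2, DS2, DS2) can be reproduced with a short ring simulation. This sketch uses the reduction derived above (valid under u1 - u2 >> a): a defender chooses DS1 in the next period exactly when at least one neighbour chose DS1.

```python
def step(state):
    """One best-response period on the ring: play DS1 (=1) iff a neighbour played DS1."""
    n = len(state)
    return [1 if state[(i - 1) % n] or state[(i + 1) % n] else 0 for i in range(n)]

state = [1, 0, 0, 0, 0]  # initial state (DS1, DS2, DS2, DS2, DS2)
for t in range(1, 5):
    state = step(state)
    print(t, state)
# period 4 reaches [1, 1, 1, 1, 1], matching the 4-stage evolution of Fig. 3
```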
The best-response dynamics adjustment processes in Fig. 3 also cover the initial game situations in which two non-adjacent defenders use DS1, three non-adjacent players use DS1, and four defenders use DS1; these need three, two, and one adjustment stages respectively to reach the stable state in which all use strategy DS1. The cases in which two adjacent defenders or three adjacent defenders use DS1 in the initial game are analyzed next. As shown in Fig. 4, when two adjacent defenders use DS1, the best-response dynamics strategy adjustment process needs only two stages to reach the stable state in which all defenders choose DS1. As shown in Fig. 5, when three adjacent defenders use DS1, the best-response dynamics strategy adjustment process needs only one stage to reach the stable state in which all defenders choose DS1.
From the above analysis, among the 32 possible initial game situations only one evolves to the stable state in which all defenders choose DS2; the remaining 31 all converge to the state in which all choose DS1. This shows that both "all defenders choose DS1" and "all defenders choose DS2" are stable states of the game process, but the probability of converging to DS1 is far greater than that of converging to DS2.
Examining these two evolutionarily stable states further: if, in the stable state in which all defenders choose DS1, a few defenders deviate from strategy DS1, best-response dynamics quickly brings the defenders' strategies back to the all-DS1 state; the stable state in which all defenders choose DS1 is therefore robust. By contrast, the stable state in which all defenders choose DS2 is not robust: once some defender deviates from DS2, best-response dynamics drives the state of the defenders further and further from that stable state, so the equilibrium is not truly stable. Over a long evolution, the defenders will ultimately choose strategy DS1.
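The asymmetric robustness of the two stable states can be checked with the same ring update rule (a sketch under the u1 - u2 >> a reduction, shown here for n = 5): a single deviation from all-DS1 is repaired in one period, while a single deviation from all-DS2 spreads until DS1 takes over.

```python
def step(state):
    """One best-response period on the ring: play DS1 (=1) iff a neighbour played DS1."""
    n = len(state)
    return [1 if state[(i - 1) % n] or state[(i + 1) % n] else 0 for i in range(n)]

# Perturb all-DS1: one defender deviates to DS2 -> repaired in a single period.
print(step([1, 1, 0, 1, 1]))  # [1, 1, 1, 1, 1]

# Perturb all-DS2: one defender deviates to DS1 -> the deviation spreads (n = 5 is odd).
s = [0, 0, 1, 0, 0]
for _ in range(4):
    s = step(s)
print(s)  # [1, 1, 1, 1, 1]
```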
Generalization of the evolutionary game model
Because in actual network defense the defense side consists of multiple defenders, the game model must be further generalized: when the defense side has an arbitrary number n of defenders, evolutionary game analysis is carried out on any two defense strategies DSi and DSj on the basis of the best-response dynamics evolutionary game model above (assuming DSi is the dominant strategy relative to DSj, and i ≠ j), and certain evolution results are finally obtained as time passes. By further analyzing and summarizing the evolutionary game analysis for the particular number of network defenders above, the following propositions can be obtained.
Proposition 1: For a defense side with n defenders, when all defenders choose defense strategy DSi (strategy DSj) in the initial game, then under the best-response dynamics learning mechanism the stable state finally reached by the strategy selection of all network defenders is that all defenders choose strategy DSi (strategy DSj).
Proposition 2: For a defense side with n defenders, when n is odd, as long as one defender chooses strategy DSi in the initial game, then through repeated adjustments of their own strategies over multiple periods under the best-response dynamics learning mechanism, the system finally converges to the stable state in which all defenders choose strategy DSi.
Proposition 3: For a defense side with n defenders, when n is even, if in the initial game one defender chooses defense strategy DSi and all other defenders choose strategy DSj, then best-response dynamics cannot make all defenders converge to a stable state; as time evolves, each defender's strategy adjustment falls into periodic cyclic changes.
Proposition 4: For a defense side with n defenders, as long as two adjacent defenders both choose strategy DSi in the initial game, then under the best-response dynamics learning mechanism the system eventually converges, as time evolves, to the stable state in which all defenders choose strategy DSi.
Proposition 5: For a defense side with n defenders, if the defenders' strategies in the initial game are arranged so that, through continuous dynamic evolution, the game system at a certain stage presents one of the situations in the preceding four propositions, then the same evolution process will occur in the later stage.
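Propositions 2 and 3 can be illustrated with the same ring update (a sketch; the "at least one DSi neighbour suffices" rule again assumes the dominant strategy satisfies u1 - u2 >> a, and the ring sizes 7 and 6 are arbitrary small examples): with odd n a single DSi seed eventually takes over, while with even n it falls into a period-2 cycle.

```python
def step(state):
    """One best-response period on the ring: play DS_i (=1) iff a neighbour played DS_i."""
    n = len(state)
    return [1 if state[(i - 1) % n] or state[(i + 1) % n] else 0 for i in range(n)]

def evolve(n, seeds, periods):
    state = [1 if i in seeds else 0 for i in range(n)]
    history = [state]
    for _ in range(periods):
        state = step(state)
        history.append(state)
    return history

odd = evolve(7, {0}, 20)   # Proposition 2: odd n, one DS_i seed
print(odd[-1])             # [1, 1, 1, 1, 1, 1, 1] -> all defenders converge to DS_i

even = evolve(6, {0}, 20)  # Proposition 3: even n, one DS_i seed
print(even[-1] == even[-3], all(even[-1]))  # True False -> period-2 cycle, no consensus
```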
Numerical simulation
Based on the network attack-defense evolutionary game established above, simulation experiments are carried out using system dynamics to verify the validity and reasonableness of the attack-defense evolutionary game model and the best-response dynamics evolutionary game model. Different initial defense states chosen by the defenders produce different evolution results of the whole game system. Concrete numerical simulations are carried out below for different initial defense states of the defenders, taking Proposition 2 and Proposition 4 as the simulation objects.
(1) When n is odd and the initial state is that only one defender chooses strategy DS1 while all other defenders choose strategy DS2, take n = 21; then the proportion of defenders choosing DS1 is 1/21 and the proportion choosing DS2 is 20/21. The game system among the defenders now has an incentive to adjust and change its own strategies. Through system simulation it is found that the proportion of defenders choosing DS1 increases linearly while the proportion choosing DS2 decreases linearly, and the final evolutionarily stable state is reached at the 10th simulation period, as shown in Fig. 6. The evolution result is one of the stable states of the system (all defenders choosing DS1 or all choosing DS2); here DS1 is the optimal defense strategy.
(2) When n is odd and the initial state is that two adjacent defenders simultaneously choose strategy DS1 while all the other defenders choose DS2, take n = 21; the proportion of defenders choosing DS1 is then 2/21 and the proportion choosing DS2 is 19/21. At this point the defenders in the game system have an incentive to adjust their own strategies. Through continuous evolution, the proportion of defenders choosing DS1 increases linearly while the proportion choosing DS2 decreases linearly, and the final evolutionarily stable state is reached at the 10th simulation step, as shown in Figure 7. This evolution result is one of the stable states of the system, and DS1 is then the optimal defence strategy.
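Both n = 21 initial states can be reproduced with a few lines of code. The update rule below is our assumed ring-neighbourhood reconstruction (the excerpt does not give the exact equation); under it, the DS1 proportion grows step by step from 1/21 (or 2/21) until the all-DS1 stable state is reached, matching the qualitative behaviour reported above.

```python
# Assumed ring-topology best-response update (a reconstruction, not the
# patent's own equation): each of the n defenders simultaneously
# best-responds to its two neighbours; 1 = DS1 (payoff u1 per matching
# neighbour), 0 = DS2 (payoff u2 per matching neighbour), u1 > u2.

def step(state, u1=2.0, u2=1.0):
    n = len(state)
    return [
        1 if u1 * (state[i - 1] + state[(i + 1) % n])
             > u2 * (2 - state[i - 1] - state[(i + 1) % n])
        else 0
        for i in range(n)
    ]

def ds1_proportions(state, max_steps=100):
    """Track the proportion of DS1 players until a fixed point is reached."""
    props = [sum(state) / len(state)]
    while len(props) <= max_steps:
        new = step(state)
        if new == state:         # stable state: no defender wants to adjust
            break
        state = new
        props.append(sum(state) / len(state))
    return props

n = 21
case1 = ds1_proportions([1] + [0] * (n - 1))     # case (1): one DS1 defender
case2 = ds1_proportions([1, 1] + [0] * (n - 2))  # case (2): two adjacent DS1
print(case1[0], case1[-1])   # starts at 1/21, ends with all defenders on DS1
print(case2[0], case2[-1])   # starts at 2/21, ends with all defenders on DS1
```

In both cases the trajectory rises essentially linearly to 1.0, which is the "linear increase" of the DS1 proportion described in the simulation results.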
The above simulation results show that both the parity of the number of defenders and the defenders' initial strategy choices affect the final evolution result of the whole game system. By arranging the initial strategy states of the defenders and letting the system evolve continuously, best response dynamics eventually drives the game system to some stable state. Comparing the experimental results with the reasoning of the model shows that the evolution results of the experimental system are consistent with the theoretical analysis, indicating that the evolutionary game model matches the evolution of real systems and demonstrating the practical usability of the model. It can be applied to real network attack-defence confrontation to analyse and predict the alliance defence of defenders concretely, providing strong support for the defenders' strategy selection.
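The parity effect summarized above can be checked directly. Under the same assumed ring-neighbourhood best-response update (again a reconstruction, since the excerpt omits the exact rule), a single DS1 seed drives an odd-sized system to the all-DS1 stable state, while the identical seed in an even-sized system never settles:

```python
# Parity check under an assumed ring best-response update:
# an odd-sized ring converges from a single DS1 seed, an even-sized
# ring keeps oscillating, as the simulation conclusions state.

def step(state, u1=2.0, u2=1.0):
    n = len(state)
    out = []
    for i in range(n):
        ones = state[i - 1] + state[(i + 1) % n]   # neighbours playing DS1
        out.append(1 if u1 * ones > u2 * (2 - ones) else 0)
    return out

def settles(state, max_steps=200):
    """Return True if the system reaches a fixed point within max_steps."""
    for _ in range(max_steps):
        new = step(state)
        if new == state:
            return True
        state = new
    return False

print(settles([1] + [0] * 20))   # n = 21 (odd):  True, converges
print(settles([1] + [0] * 19))   # n = 20 (even): False, trapped in a cycle
```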
Claims (10)
1. a kind of cyber-defence strategy choosing method of best response dynamics Evolutionary Game Model, it is characterised in that:Including following
Step:
Step 1:Based on bounded rationality condition, using best response dynamics study mechanism, attacking based on best response dynamics is built
Anti- Evolutionary Game Model;
Step 2:Dynamic Evolution and defence Evolutionary Equilibrium point are chosen using defender's strategy, to being defendd between different defenders
Tactful On The Choice is studied;
Step 3:On the basis of the best response dynamics Evolutionary Game Model of foundation, the model is divided by specific example
Analysis promotes Evolutionary Game Model with solving.
2. The network defence strategy selection method of a best response dynamics evolutionary game model according to claim 1, characterized in that: in step 1 the best response dynamics evolutionary game model is represented by a four-tuple BRDEGM = (D, DS, P, U), where:
D = {d1, d2, …, dn} is the defence participant space, in which di denotes defender i, and different defenders may choose different defence strategies;
DS = {DS1, DS2, …, DSm} is the defender strategy space, shared by all defenders;
P = {p1, p2, …, pm} is the defender belief set, in which pi denotes the probability that a defender chooses defence strategy DSi;
U = {U1, U2, …, Um} is the payoff function set, in which Ui denotes the payoff obtained by a defender choosing defence strategy DSi.
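For illustration, the four-tuple of claim 2 maps naturally onto a small data structure. The sketch below is ours, not the patent's; the field names and the n = 21 example values are assumptions for demonstration only.

```python
# Illustrative container for the four-tuple BRDEGM = (D, DS, P, U);
# field names are our own shorthand, not taken from the patent.
from dataclasses import dataclass
from typing import List

@dataclass
class BRDEGM:
    defenders: List[str]   # D:  defence participant space d1..dn
    strategies: List[str]  # DS: shared defence strategy space DS1..DSm
    beliefs: List[float]   # P:  p_i, probability of choosing DS_i
    payoffs: List[float]   # U:  U_i, payoff for choosing DS_i

    def __post_init__(self):
        # The belief set must be a probability distribution over DS.
        assert len(self.beliefs) == len(self.strategies) == len(self.payoffs)
        assert abs(sum(self.beliefs) - 1.0) < 1e-9

model = BRDEGM(
    defenders=[f"d{i}" for i in range(1, 22)],  # n = 21 as in the simulation
    strategies=["DS1", "DS2"],
    beliefs=[1 / 21, 20 / 21],                  # initial belief set of case (1)
    payoffs=[2.0, 1.0],                         # u1 > u2
)
print(len(model.defenders), model.strategies)
```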
3. The network defence strategy selection method of a best response dynamics evolutionary game model according to claim 1, characterized in that: the best response dynamics equation is expressed in terms of Nt, where Nt denotes the number of the n defenders that choose strategy DS1, and DS1 is any one defence strategy in the selectable strategy set.
4. The network defence strategy selection method of a best response dynamics evolutionary game model according to claim 1, characterized in that: the defender strategy selection dynamic evolution process in step 2 is: in the network attack-defence confrontation process there is a competitive relation between different defence strategies, and a high-payoff defence strategy eliminates lower-payoff strategies.
5. The network defence strategy selection method of a best response dynamics evolutionary game model according to claim 4, characterized in that: when the high-payoff defence strategy eliminates the lower-payoff strategy, the payoff matrix is:
wherein u1 and u2 are the payoffs of strategies DS1 and DS2 respectively, and a is the difference between u1 and u2.
6. The network defence strategy selection method of a best response dynamics evolutionary game model according to claim 1, characterized in that: generalizing the evolutionary game model in step 3 mainly means: for any n defenders, based on the best response dynamics evolutionary game model, carrying out an evolutionary game analysis for any two defence strategies DSi and DSj, assuming that DSi is the dominant strategy relative to DSj and that i ≠ j; over time, a certain evolution result is finally obtained.
7. The network defence strategy selection method of a best response dynamics evolutionary game model according to claim 6, characterized in that: the evolution result is: for a game with n defenders, when all defenders choose defence strategy DSi, or all choose strategy DSj, in the first round, then under the best response dynamics learning mechanism the stable state finally reached by the strategy selection of all network defenders is that all defenders choose strategy DSi, or all choose strategy DSj, respectively.
8. The network defence strategy selection method of a best response dynamics evolutionary game model according to claim 6, characterized in that: the evolution result is: for a game with n defenders, when n is odd, as long as one defender chooses strategy DSi in the first round, then through the repeated adjustment of each defender's own strategy over multiple periods under the best response dynamics learning mechanism, the system finally converges to the stable state in which all defenders choose strategy DSi.
9. The network defence strategy selection method of a best response dynamics evolutionary game model according to claim 6, characterized in that: the evolution result is: for a game with n defenders, when n is even, if in the first round one defender chooses defence strategy DSi while all the other defenders choose strategy DSj, then best response dynamics cannot make all the defenders converge to a stable state; as time evolves, each defender's strategy adjustment can only fall into a periodic cycle.
10. The network defence strategy selection method of a best response dynamics evolutionary game model according to claim 6, characterized in that: the evolution result is: for a game with n defenders, as long as two adjacent defenders simultaneously choose strategy DSi in the first round, then under the best response dynamics learning mechanism the system evolves over time and eventually converges to the stable state in which all defenders choose strategy DSi.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710335128.7A CN106953879A (en) | 2017-05-12 | 2017-05-12 | The cyber-defence strategy choosing method of best response dynamics Evolutionary Game Model |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106953879A true CN106953879A (en) | 2017-07-14 |
Family
ID=59479620
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710335128.7A Pending CN106953879A (en) | 2017-05-12 | 2017-05-12 | The cyber-defence strategy choosing method of best response dynamics Evolutionary Game Model |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106953879A (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107483486A (en) * | 2017-09-14 | 2017-12-15 | 中国人民解放军信息工程大学 | Cyber-defence strategy choosing method based on random evolution betting model |
CN107566387A (en) * | 2017-09-14 | 2018-01-09 | 中国人民解放军信息工程大学 | Cyber-defence action decision method based on attacking and defending evolutionary Game Analysis |
CN108182536A (en) * | 2017-12-28 | 2018-06-19 | 东北大学 | A kind of power distribution network CPS safety defense methods based on bounded rationality |
CN108541071A (en) * | 2018-04-10 | 2018-09-14 | 清华大学 | Wireless communication system multi-user resource distribution system based on the double-deck game |
CN108833402A (en) * | 2018-06-11 | 2018-11-16 | 中国人民解放军战略支援部队信息工程大学 | A kind of optimal defence policies choosing method of network based on game of bounded rationality theory and device |
CN110087194A (en) * | 2019-04-25 | 2019-08-02 | 东华大学 | Position data poisoning attacks prototype system in car networking based on game |
CN110381020A (en) * | 2019-06-13 | 2019-10-25 | 长沙理工大学 | A kind of IDS resource allocation method, device and computer readable storage medium |
CN111224966A (en) * | 2019-12-31 | 2020-06-02 | 中国人民解放军战略支援部队信息工程大学 | Optimal defense strategy selection method based on evolutionary network game |
CN111245857A (en) * | 2020-01-17 | 2020-06-05 | 安徽师范大学 | Channel network steady state evolution game method in block link environment |
CN112989357A (en) * | 2021-03-09 | 2021-06-18 | 中国人民解放军空军工程大学 | Multi-stage platform dynamic defense method based on signal game model |
CN113868932A (en) * | 2021-06-09 | 2021-12-31 | 南京大学 | Task allocation method based on complete information bidding game |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101808020A (en) * | 2010-04-19 | 2010-08-18 | 吉林大学 | Intrusion response decision-making method based on incomplete information dynamic game |
CN103152345A (en) * | 2013-03-07 | 2013-06-12 | 南京理工大学常熟研究院有限公司 | Network safety optimum attacking and defending decision method for attacking and defending game |
CN103384384A (en) * | 2013-07-19 | 2013-11-06 | 哈尔滨工程大学 | Recognition relay network trust management device and method based on dynamic evolution |
CN105682174A (en) * | 2016-01-15 | 2016-06-15 | 哈尔滨工业大学深圳研究生院 | Opportunity network evolution algorithm and device for promoting node cooperation |
US9471777B1 (en) * | 2012-02-24 | 2016-10-18 | Emc Corporation | Scheduling of defensive security actions in information processing systems |
CN106550373A (en) * | 2016-09-30 | 2017-03-29 | 天津大学 | Wireless sensor network data fusion degree of accuracy model based on evolutionary Game |
Cited By (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107566387B (en) * | 2017-09-14 | 2020-01-10 | 中国人民解放军信息工程大学 | Network defense action decision method based on attack and defense evolution game analysis |
CN107566387A (en) * | 2017-09-14 | 2018-01-09 | 中国人民解放军信息工程大学 | Cyber-defence action decision method based on attacking and defending evolutionary Game Analysis |
CN107483486A (en) * | 2017-09-14 | 2017-12-15 | 中国人民解放军信息工程大学 | Cyber-defence strategy choosing method based on random evolution betting model |
CN107483486B (en) * | 2017-09-14 | 2020-04-03 | 中国人民解放军信息工程大学 | Network defense strategy selection method based on random evolution game model |
CN108182536A (en) * | 2017-12-28 | 2018-06-19 | 东北大学 | A kind of power distribution network CPS safety defense methods based on bounded rationality |
CN108182536B (en) * | 2017-12-28 | 2021-11-16 | 东北大学 | CPS security defense method for power distribution network based on finiteness |
CN108541071A (en) * | 2018-04-10 | 2018-09-14 | 清华大学 | Wireless communication system multi-user resource distribution system based on the double-deck game |
CN108541071B (en) * | 2018-04-10 | 2019-03-01 | 清华大学 | Wireless communication system multi-user resource distribution system based on the double-deck game |
CN108833402A (en) * | 2018-06-11 | 2018-11-16 | 中国人民解放军战略支援部队信息工程大学 | A kind of optimal defence policies choosing method of network based on game of bounded rationality theory and device |
CN108833402B (en) * | 2018-06-11 | 2020-11-24 | 中国人民解放军战略支援部队信息工程大学 | Network optimal defense strategy selection method and device based on limited theory game theory |
CN110087194B (en) * | 2019-04-25 | 2021-05-11 | 东华大学 | Game-based position data poisoning attack prototype system in Internet of vehicles |
CN110087194A (en) * | 2019-04-25 | 2019-08-02 | 东华大学 | Position data poisoning attacks prototype system in car networking based on game |
CN110381020A (en) * | 2019-06-13 | 2019-10-25 | 长沙理工大学 | A kind of IDS resource allocation method, device and computer readable storage medium |
CN111224966A (en) * | 2019-12-31 | 2020-06-02 | 中国人民解放军战略支援部队信息工程大学 | Optimal defense strategy selection method based on evolutionary network game |
CN111224966B (en) * | 2019-12-31 | 2021-11-02 | 中国人民解放军战略支援部队信息工程大学 | Optimal defense strategy selection method based on evolutionary network game |
CN111245857A (en) * | 2020-01-17 | 2020-06-05 | 安徽师范大学 | Channel network steady state evolution game method in block link environment |
CN111245857B (en) * | 2020-01-17 | 2021-11-26 | 安徽师范大学 | Channel network steady state evolution game method in block link environment |
CN112989357A (en) * | 2021-03-09 | 2021-06-18 | 中国人民解放军空军工程大学 | Multi-stage platform dynamic defense method based on signal game model |
CN113868932A (en) * | 2021-06-09 | 2021-12-31 | 南京大学 | Task allocation method based on complete information bidding game |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20170714 |