CN109636699A

CN109636699A - A kind of unsupervised intellectualized battle deduction system based on deeply study

Info

Publication number: CN109636699A
Application number: CN201811313124.XA
Authority: CN
Inventors: 瞿崇晓; 高翔; 夏少杰; 郑寄平; 胡青阳; 李思佳
Original assignee: CETC 52 Research Institute
Current assignee: CETC 52 Research Institute
Priority date: 2018-11-06
Filing date: 2018-11-06
Publication date: 2019-04-16

Abstract

The invention discloses a kind of unsupervised intellectualized battle deduction systems based on deeply study, the intellectualized battle decision of the system is fed back by the training pattern in deeply learning platform, have coordination and mating capability between multiple agent, system independent learning ability and Continuous optimization ability is substantially improved；Fighting terminal supports manual operation and the operation of machine autonomous learning, it can be achieved that man-machine confrontation exports high-quality training sample data；It can provide the first visual angle and overall situation visual angle to show, user can directly carry out operation scenario simulation training by the first visual angle and global visual angle and discs is deduced；All environment configurations definition have generalization characteristic, and consider that the fighting capacity of battle both sides is balanced, and intellectualized algorithm is made to have Tactic selection space, to be adapted to a variety of deep learning algorithms and equipment model, and fight terminal quantity in system and support extension.

Description

A kind of unsupervised intellectualized battle deduction system based on deeply study

Technical field

The invention belongs to fight against deducing technical field more particularly to a kind of unsupervised intelligence work based on deeply study War deduction system.

Background technique

Intellectualized battle, which refers to, to be realized on the basis of information fighting with intellectual weapon and means with efficient commanderization, strike Precision, operation automation, the high-tech pattern of operation that behavior intelligence is purport.In essence, intellectualized battle is people Radiation and extension of the intelligence to information battlefield and weapon system.In terms of battle order and means, intellectualized battle includes intelligentized The Attack Defence of command control warfare and smart weapon.The outstanding feature of intellectualized battle is smart weapon group and intelligent operation The appearance of means.

To solve the above problems, devising the related operation deduction system based on expert system both at home and abroad, main thought is The decision under specific situation is exported using the rule base of expert system, while emulating the game under scene between ourselves and the enemy.At present The common operation deduction system based on expert system is based primarily upon experts database or traditional genetic algorithm etc. and carries out decision, passes through Experts database or the new algorithmic approach enriched constantly promote the level of intelligence of operation deduction system, but hoisting power is limited.

With the rise of artificial intelligence technology, GPU server construction deep learning platform can use, using deeply Learning algorithm by constantly trained and model iteration lifting system level of intelligence, and realizes independent learning ability and mostly intelligently Cooperation between body.

Intensified learning is a branch in artificial intelligence machine learning areas, is merely able to for controlling one in some environment Under uninfluenced intelligent body, by the interaction between environment, including perception and receive awards, and continuously improve its row For, and the purpose of intensified learning, exactly select a series of action to maximize following reward.Deeply learns nerve net The ability that network extracts complicated high dimensional data feature incorporates intensified learning, and data are transformed into low-dimensional feature, convenient at intensified learning Reason.It is well known that military decision-making is the most complicated, activity most challenged in military field, and thereby promotes military auxiliary and determine The generation and development of plan support technology.It is contemplated that being constantly progressive for deeply learning art will be to military intelligence auxiliary Decision domain generates deep and great influence.

Summary of the invention

The purpose of the present invention is to provide a kind of unsupervised intellectualized battle deduction systems based on deeply study, improve Algorithm, engine and the process of operation deduction system of the tradition based on experts database, improve the level of intelligence and learning ability of intelligent body, Significantly promote the level of intelligence of deduction system.

To achieve the above object, the technical solution used in the present invention are as follows:

A kind of unsupervised intellectualized battle deduction system based on deeply study, the nothing based on deeply study Supervising intellectualized battle deduction system includes that confrontation terminal, operation simulation engine, deeply learning platform and data management are flat Platform is provided with equipment model library and model of place library in the operation simulation engine, is provided with sample in the data management platform Database and training pattern library, in which:

The confrontation terminal, for connecting the input interface of the operation simulation engine；

The operation simulation engine, the operational environment for being inputted according to device end define information, from equipment model library Equipment model relevant to operational environment is imported, model of place relevant to operational environment is imported from model of place library, completes just Beginningization, the operational environment include combat mode, combat unit and operation scene；

And when combat mode is training mode, operation game between ourselves and the enemy is carried out based on timeslice and is deduced, it will be current Environmental information is sent to the deeply learning platform, executes single step according to the movement feedback of deeply learning platform output Decision or multistep decision, and update environmental information；

It is also used to judge currently whether meet the flat relationship of victory or defeat or termination condition according to predefined rule, if current meet victory When bearing flat relationship or termination condition, output, which is fought, deduces analysis report, and saves Campaign Process as sample data to described The sample database of data management platform completes training；Otherwise continue game deduction of fighting between ourselves and the enemy, until meeting victory Bear flat relationship or termination condition；

The deeply learning platform, for importing training pattern from the training pattern library of data management platform, and In conjunction with the environmental information that the operation simulation engine is sent, Xiang Zuozhan simulation engine output action feedback；It is also used to update operation The corresponding training pattern of unit, and operation game deduce after output training pattern to data management platform training pattern It is saved in library；

The data management platform, for saving the training pattern and the work that the deeply learning platform exports The sample data that simulation engine of fighting exports；It is also used to export training pattern to the deeply learning platform.

Preferably, the environmental information includes: quantity, location information, current state, mutual spacing between ourselves and the enemy.

Preferably, the operation that the single step decision includes combat unit acts；The multistep decision includes to fight Multiple movements that fight one battle after another of unit.

Preferably, the confrontation terminal is as device end, by the input interface of the operation simulation engine to institute It states operation simulation engine input operational environment and defines information.

Preferably, the combat mode of the operation simulation engine further includes simulation model, the simulation model includes people Machine confrontation and the confrontation of machine machine；

The confrontation terminal, also provides for three-dimensional visualization interface, and the operation is presented in visualization under simulation model The environmental information of simulation engine；Also provide for confrontation operation interface, in man-machine confrontation by confrontation operation interface artificially to The operation simulation engine input, which is fought, to be operated.

Preferably, the unsupervised intelligence based on deeply study is made when the combat mode is that machine machine is fought War deduction system, performs the following operations:

The operation simulation engine, for accessing the three-dimensional visualization interface of the confrontation terminal, and according to device end The operational environment of input defines information, equipment model relevant to operational environment is imported from equipment model library, from model of place library Model of place relevant to operational environment is imported, training pattern is imported from the training pattern library of data management platform, is completed just Beginningization；And operation game between ourselves and the enemy is carried out based on timeslice and is deduced, current environmental information is sent to the deeply Learning platform executes single step decision according to the movement feedback of deeply learning platform output, and updates ring according to single step decision Effect is presented in border information, the visualization for refreshing the confrontation terminal；

It is also used to judge currently whether meet the flat relationship of victory or defeat or termination condition according to predefined rule, if current meet victory When bearing flat relationship or termination condition, output, which is fought, deduces analysis report, and saves Campaign Process as sample data to described The sample database of data management platform completes emulation；Otherwise continue game deduction of fighting between ourselves and the enemy, until meeting victory Bear flat relationship or termination condition；

The deeply learning platform, for importing training pattern from the training pattern library of data management platform, and In conjunction with the environmental information that the operation simulation engine is sent, Xiang Zuozhan simulation engine output action feedback；

The data management platform, for saving the sample data of the operation simulation engine output；It is also used to described Deeply learning platform and the operation simulation engine export training pattern.

Preferably, the unsupervised intelligence based on deeply study is made when the combat mode is man-machine confrontation War deduction system, performs the following operations:

The operation simulation engine, for accessing the three-dimensional visualization interface and confrontation operation interface of the confrontation terminal, And information is defined according to the operational environment that device end inputs, assembling die relevant to operational environment is imported from equipment model library Type imports model of place relevant to operational environment from model of place library, imports from the training pattern library of data management platform Training pattern completes initialization；And operated according to the operation of confrontation terminal, operation game between ourselves and the enemy is carried out based on timeslice and is pushed away It drills, current environmental information is sent to the deeply learning platform, the movement exported according to deeply learning platform Feedback executes single step decision, while receiving the operation operation for fighting terminal input, and according to single step decision and operation of fighting Environmental information is updated, the effect of visualization of the confrontation terminal is refreshed；

A kind of unsupervised intellectualized battle deduction system based on deeply study proposed by the present invention, intellectualized battle decision It is fed back by the training pattern in deeply learning platform, has coordination and mating capability between multiple agent, substantially Lifting system independent learning ability and Continuous optimization ability；It fights terminal and supports manual operation and the operation of machine autonomous learning, it can Realize that man-machine confrontation exports high-quality training sample data；It can provide the first visual angle and global visual angle to show, user passes through the first view Angle and global visual angle can directly carry out operation scenario simulation training and discs is deduced；All environment configurations definition have generalization special Property, and consider that the fighting capacity of battle both sides is balanced, so that intellectualized algorithm is had Tactic selection space, is calculated to be adapted to a variety of deep learnings Method and equipment model, and fight terminal quantity in system and support extension.

Detailed description of the invention

Fig. 1 is a kind of embodiment framework of the unsupervised intellectualized battle deduction system of the invention based on deeply study Figure；

Fig. 2 is a kind of embodiment process of the unsupervised intellectualized battle deduction system of the invention based on deeply study Block diagram；

Fig. 3 is a kind of embodiment operational effect figure that terminal is fought when fighting scene the invention shows air battle；

Fig. 4 is that a kind of embodiment of confrontation terminal when fighting scene the invention shows air battle runs global situation map.

Specific embodiment

Technical solution of the present invention is described in further details with reference to the accompanying drawings and examples, following embodiment is not constituted Limitation of the invention.

As shown in Figure 1, being the present embodiment provides a kind of unsupervised intellectualized battle deduction system based on deeply study The display panel control united when carrying out deployed with devices using confrontation terminal as intellectualized battle deduction system, deployment emulation simulator are soft Part provides confrontation operation interface and three-dimensional visualization interface, and the quantity that terminal is fought in system supports extension.

Emulator using work station as intellectualized battle deduction system disposes operation simulation engine；It is serviced using GPU Training equipment of the device as intellectualized battle deduction system, deployment depth intensified learning platform.

Storage equipment using storage dish battle array as intellectualized battle deduction system disposes data management platform, and data pipe Sample database and training pattern library are equipped in platform, provide training sample data import and export, the function such as training pattern management Energy.

Each equipment in the present embodiment passes through exchange network from each other and interconnects, it should be noted that each equipment mutually it Between can using be wirelessly connected, electrical connection or mechanical connection.

As seen from the figure, it is additionally provided with equipment model library and model of place library in intellectualized battle deduction system, equips model library Equipment model is created to combat unit, and the functions such as equipment model management are provided.Combat unit includes operation coherent element, such as Aircraft, radar, guided missile etc..Model of place library models scene, provides the functions such as model of place management.

The equipment model library and model of place library can share a storage equipment with data management platform, can also be with It is independently arranged in storage dish battle array, provides equipment model information and model of place information, the present embodiment for operation simulation engine In, memory headroom is opened up in operation simulation engine, for equipment model library and model of place library to be arranged.

Wherein, the operation simulation engine of work station is designed as a Combat simulation platform unrelated with algorithm, realization side Formula is the software package run under Linux or windows platform, and the implementation of the present embodiment is to run under Linux platform Python software package.Operation simulation engine is provided based on decision making algorithms interfaces such as deep learnings, and passes through the decision making algorithm interface Connection is established with the deeply learning platform, the key property of operation simulation engine is to provide: operational environment definition, equipment The sample data importing and export, the importing of training pattern library, the access of more training platform decisions that model library is imported, fought, fight and advise It then defines, deduces the functions such as analysis report.

In intellectualized battle deduction, the parameter configuration of operational environment and the application of algorithm model are also closely connected, if The side that fights in belligerent Board Lot, aspect of performance has landslide, and Operation Target is easy to reach, then another party will be complete It wins victory completely without method, the intelligent decision means under the clearly demarcated operational environment of this victory or defeat, including deeply learning algorithm It is difficult have performance space.Therefore, it is necessary to comprehensively consider the construction feature of battlefield conditions and simulated environment in actual demand, choose Suitable operational environment, so that the decision of battle both sides has choice, to allow algorithm decision that can embody value.

In addition, backstage training equipment of the deeply learning platform as intellectualized battle deduction system on GPU server, The agent model that can use deeply learning algorithm training combat unit may also be combined with tactical training, confrontation examination etc. The common intensive training combat unit of external samples data including practical combat data (common for practical flight data), output Training pattern is to training pattern library.

Deeply learning platform is related to DQN, DDPG, the mainstreams nitrification enhancement such as A3C, PPO.These algorithms need to be into Row transformation applies to intellectualized battle and deduces scene, and the state and control variable of simulated environment are defined as including that will output and input, And the reward function under each state is calculated according to the parameter configuration of operational environment and victory or defeat condition, realize algorithm and simulated environment Docking.The nitrification enhancement used in the present embodiment is DQN.

The present embodiment carries out a large amount of training tests in deeply learning platform according to specific algorithm, verify various algorithms and Combined effect and assessment.Wherein the central principle of algorithm can be summarized as: use deep neural network approximate value function (Q function) Or strategic function changes its state, and rewarded, passes through reward by controlling movement and the environment interaction of Intelligent target body Recovery value/strategic function parameter, reaches training goal.

Based on the central principle of the algorithm, promote the core objective of intellectualized battle deduction system: realize algorithm training with Intelligent decision.For this reason, it may be necessary to which the state variable space and the control variable space to simulated environment screen, obtain being suitable for calculating Method input, output state space S={ s, s2 ..., sn } and control space C={ c1, c2 ..., cm }.

Certain state, the screening for controlling space need to consider many factors: firstly, state space can completely react battlefield Situation is lost without information, and is controlled space and be required to complete combat unit various actions movement, while in currently emulation frame It can be realized in frame stable based on the iterative control of timeslice；Secondly, under the premise of meeting above-mentioned condition, state space with The dimension in control space needs to simplify as far as possible, controls the calculation amount of algorithm training in the achievable dimension of computing capability；This Outside, the parameter configuration of operational environment can also screen space and have an impact, if environment setting is made that simplification in one aspect Processing, corresponding state can also carry out corresponding simplify with control parameter.

As shown in connection with fig. 2, the work of each equipment in intellectualized battle deduction system is further illustrated:

Under the deployed with devices basis of intellectualized battle deduction system, Armament modeling and operation scene modeling are carried out in advance, It saves respectively into equipment model library and model of place library, simulation engine to be fought is selected.Meanwhile rule of engagement modeling is carried out, Judge to fight as operation simulation engine and currently whether meets the foundation of the flat relationship of victory or defeat or termination condition.

Before deduction of fighting starts, this operational environment deduced of fighting first is defined, the operational environment includes operation mould Formula, combat unit and operation scene are used as device end using confrontation terminal in the present embodiment, are emulated and drawn by the operation The input interface held up defines information to operation simulation engine input operational environment.

Further, confrontation terminal also provides three-dimensional visualization interface, and the operation is presented in visualization under simulation model The environmental information of simulation engine.Confrontation operation interface is provided simultaneously, in man-machine confrontation by confrontation operation interface artificially to institute State the input operation operation of operation simulation engine.

It should be noted that operational environment define information be not limited only to using confrontation terminal input, can also by with work The device end or background server input operational environment of the input interface connection of war simulation engine define information.

The optional combat mode of the intellectualized battle deduction system of the present embodiment includes training mode and simulation model, and is emulated Mode includes man-machine confrontation and the confrontation of machine machine.

S1, when combat mode be training mode when, based on deeply study unsupervised intellectualized battle deduction system, hold The following operation of row:

S11, operation simulation engine define information according to the operational environment that device end inputs, from equipment model library import with The relevant equipment model of operational environment imports model of place relevant to operational environment from model of place library, completes initialization.Its In, operational environment includes combat mode, combat unit and operation scene；

Meanwhile deeply learning platform is led from the training pattern library of data management platform according to algorithm configuration information Enter training pattern, completes initialization.

S12, operation simulation engine are based on timeslice and carry out the deduction of operation game between ourselves and the enemy, and current environmental information is sent out It send to the deeply learning platform, single step decision or multistep is executed according to the movement feedback of deeply learning platform output Decision, and update environmental information.

The environmental information that S13, deeply learning platform combination operation simulation engine are sent, the output of Xiang Zuozhan simulation engine Movement feedback.

Whether S14, operation simulation engine currently meet the flat relationship of victory or defeat or termination condition according to predefined rule judgement, if When currently meeting the flat relationship of victory or defeat or termination condition, output, which is fought, deduces analysis report, and using Campaign Process as sample data It saves to the sample database of the data management platform, completes training；Otherwise continue game deduction of fighting between ourselves and the enemy, Until meeting the flat relationship of victory or defeat or termination condition.

The sample data that S15, deeply learning platform are deduced by largely operation game, and combination tactical training, External samples data including the practical flights data such as confrontation examination promote intelligent decision level, according to work after training Performance of the war unit in this operation game deduction updates the corresponding training pattern of combat unit, and deduces in operation game After export training pattern saved into the training pattern library of data management platform.

In above-mentioned training mode, data management platform is when deeply learning platform is initialized to its output training Model, and the training pattern and operation simulation engine that deeply learning platform exports are saved after operation game is deduced The sample data of output.

It should be noted that the training pattern saved in data management platform equally has guarantor in deeply learning platform It deposits, dual memory greatly reduces data loss rate.Therefore in training mode, training pattern can be led from training pattern library Enter, the training pattern being stored in deeply learning platform can also be called directly.

Wherein, environmental information includes: quantity, location information, current state between ourselves and the enemy, and in operational environment One combat unit is center combat unit, the operation relationship of remaining combat unit and the center combat unit, operation relationship Such as enemy and we's relationship, relative distance, degree of danger, whether in mutual operational state etc..

And in order to make training be more nearly true operation scene, what operation simulation engine was executed in training mode can be with It is to be also possible to (as be based on based on the multistep decision of multiple movements that fight one battle after another based on the single step decision of operation movement Tactics combinative movement executes multistep decision), to improve the flexibility of the execution of combat unit, the training of training pattern is imitated Fruit is more preferable, also increases operation difficulty, has to the level of intelligence of deeply learning platform and preferably promotes effect.

S2, when combat mode be machine machine fight when, based on deeply study unsupervised intellectualized battle deduction system, hold The following operation of row:

S21, operation simulation engine access the three-dimensional visualization interface of the confrontation terminal, and according to device end input Operational environment defines information, imports relevant to operational environment equipment model from equipment model library, imported from model of place library and The relevant model of place of operational environment imports training pattern from the training pattern library of data management platform, completes initialization；

Meanwhile deeply learning platform imports training pattern from the training pattern library of data management platform, completes just Beginningization.

S22, operation simulation engine are based on timeslice and carry out the deduction of operation game between ourselves and the enemy, and current environmental information is sent out It send to the deeply learning platform, single step decision, and root is executed according to the movement feedback of deeply learning platform output Environmental information is updated according to single step decision, effect is presented in the visualization for refreshing the confrontation terminal.

The environmental information that S23, deeply learning platform are sent in conjunction with the operation simulation engine, Xiang Zuozhan simulation engine Output action feedback.

Whether S24, operation simulation engine currently meet the flat relationship of victory or defeat or termination condition according to predefined rule judgement, if When currently meeting the flat relationship of victory or defeat or termination condition, output, which is fought, deduces analysis report, and using Campaign Process as sample data It saves to the sample database of the data management platform, completes emulation；Otherwise continue game deduction of fighting between ourselves and the enemy, Until meeting the flat relationship of victory or defeat or termination condition.

In above-mentioned machine machine confrontation, data management platform is instructed to operation simulation engine and the output of deeply learning platform Practice model, and saves the sample data of operation simulation engine output after operation game is deduced.

Machine machine fights the level of intelligence that can be used for detecting deeply learning platform, and by largely confrontation data to instruction Practice model and makes analysis and judgment.

S3, when combat mode be man-machine confrontation when, it is described based on deeply study unsupervised intellectualized battle deduce system System, performs the following operations:

The three-dimensional visualization interface and confrontation operation interface of S31, operation simulation engine access confrontation terminal, and according to equipment The operational environment of terminal input defines information, equipment model relevant to operational environment is imported from equipment model library, from scene mould Type library imports model of place relevant to operational environment, imports training pattern from the training pattern library of data management platform, complete At initialization；

S32, operation simulation engine are operated according to the operation of confrontation terminal, are fought between ourselves and the enemy game based on timeslice It deduces, current environmental information is sent to the deeply learning platform, according to the dynamic of deeply learning platform output Make feedback and execute single step decision, while receiving the operation operation for fighting terminal input, and according to single step decision and the behaviour that fights Make update environmental information, refreshes the effect of visualization of the confrontation terminal.

The environmental information that S33, deeply learning platform combination operation simulation engine are sent, the output of Xiang Zuozhan simulation engine Movement feedback.

Whether S34, operation simulation engine currently meet the flat relationship of victory or defeat or termination condition according to predefined rule judgement, if When currently meeting the flat relationship of victory or defeat or termination condition, output, which is fought, deduces analysis report, and using Campaign Process as sample data It saves to the sample database of the data management platform, completes emulation；Otherwise continue game deduction of fighting between ourselves and the enemy, Until meeting the flat relationship of victory or defeat or termination condition.

In above-mentioned man-machine confrontation, data management platform is instructed to operation simulation engine and the output of deeply learning platform Practice model, and saves the sample data of operation simulation engine output after operation game is deduced.It should be noted that imitative When true mode, training pattern imported into deeply learning platform and operation simulation engine simultaneously from training pattern library, in depth It cooperates jointly as intelligent decision basis with environmental information in intensified learning platform, as single with operation in operation simulation engine Member matching is spare.By above-mentioned deducing maneuvers process as it can be seen that operation simulation engine can export after each training or emulation Portion, which is fought, deduces analysis report, which deduces analysis report and mainly realize that being based on belligerent deduction process and result carries out winning rate The statistical analysis such as statistics, Armament Demonstration and tactics research, convenient for the understanding to this deducing maneuvers process, and to such operation Deduce the global observing of long-term operation situation.

Further, sample data of the output into data management platform after each training or emulation, can be again It imports in operation simulation engine, for playing back Campaign Process.In each deducing maneuvers process, combat unit is according to deeply The intelligent decision of learning platform executes single step decision or multistep decision, is needed during training by the row of agent (intelligent body) Dynamic selection is converted into several discrete values, as the action value taken in next step.By for air battle fights scene, using working as The parameters such as position, angle, steering angle, the distance of preceding both sides are inputted as neural network, are built deep neural network and are trained, The probability value that output acts for next step, each step all select the movement so that maximum probability.And (depth is strong to improved DQN Chemical learning method), if Prioritized Replay and Dueling Network are verified, it is final obtain it is reasonable as a result, And it is cured as training pattern.Aircraft is realized that space reaches, detected, defendd, attack four core competence by great amount of samples training Combinatorial Optimization.

Man-machine confrontation in intellectualized battle deduction system joined artificial governing factor, break training mode and machine machine pair Anti- the case where fully relying on algorithm, realizes that man-machine confrontation exports high-quality training sample data, not only to the training mould of combat unit Type has promotes effect well, and training pattern is made to be more nearly true warfare decision.

It is shown using confrontation terminal as front end when man-machine confrontation and operation module is fought so that air battle fights scene as an example Flight simulator is disposed in terminal, by keyboard input control aircraft object, mock battle unit carries out man-machine confrontation.Pass through keyboard It can control the operation such as flying speed, azimuth of aircraft.Before start, user can on map selecting object position, Initial velocity and the azimuth of object are set.Keyboard input data (speed, azimuth) is converted into latitude and longitude coordinates, drive control The data information of operation is uploaded to server-side, by rear end decision-making treatment after artificially controlling flare maneuver by object's position variation Afterwards, the data processing section of the run action feedback of our aircraft to client, our aircraft after data processing is passed through Latitude coordinate drives the location information of our aircraft to show variation, and operational effect is as shown in Figure 3.

Confrontation terminal disposes global situation visual software simultaneously, the displaying of aircraft running track will make battlefield between ourselves and the enemy In figure, based on the exploitation of the front ends web technologies such as HTML5, JavaScript and Css, the part GIS is integrated, the page is rendered, Cartographic information, object information and special efficacy are visualized, it is as shown in Figure 4 to run global situation situation.

Deeply study is applied to air battle and fought in typical operation scene by the present embodiment, and the operation for designing realization is deduced System can be used for man-machine confrontation test, realize tactical confrontation scenario simulation, carries out air combat situation sensing capability for pilot and mentions For training tool.Based on deeply learning training model construction pilot training and checking system, meets pilot's formation and make The culture demand that the level of training of fighting improves.It is exported by typical air battle scene hands-on data and self game, summarizes and refine Representative tactics strategy provides guidance instruction for subsequent practical air fighting, and provides for operational exertion handbook necessary Material and training support.The case where emulation data can assess main battle weaponry is deduced simultaneously, optimizes the strategy that uses of equipment, identification Develop the core-capacitor factor of equipment.

It should be noted that the intellectualized battle deduction system of the present embodiment can also extend to other fields of military operations, pass through Artificial intelligence obtains training pattern, and then the training pattern by having certain level of intelligence realizes the autonomous work of combat unit War.

In embodiment provided by the present invention, so it is easy to understand that disclosed related system can pass through others Mode is realized.For example, the division of the platform or module, only a kind of logical function partition, can have another in actual implementation Outer division mode, for example, multiple units or components may be combined or can be integrated into a system or some features can be with Ignore, or does not execute.Another point, the indirect coupling or communication connection of unit or assembly can be electrical property, mechanical or other shapes Formula.

The unit as illustrated by the separation member may or may not be physically separated, as display Component may or may not be physical unit, it can and it is in one place, or may be distributed over multiple networks On unit.It can some or all of the units may be selected to achieve the purpose of the solution of this embodiment according to the actual needs.

It, can also be in addition, the functional units in various embodiments of the present invention may be integrated into one processing unit It is that each unit physically exists alone, can also be integrated in one unit with two or more units.Above-mentioned integrated list Member both can take the form of hardware realization, can also realize in the form of software functional units.

Technical solution of the present invention substantially the part that contributes to existing technology or the technical solution in other words Completely or partially it can be embodied in the form of software products, which is stored in a storage medium, It is used including some instructions so that computer processor (processor) executes the whole of each embodiment the method for the present invention Or part steps.And storage medium above-mentioned includes: USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic or disk etc. are various can store journey The medium of sequence code.

The above embodiments are merely illustrative of the technical solutions of the present invention rather than is limited, without departing substantially from essence of the invention In the case where mind and its essence, those skilled in the art make various corresponding changes and change in accordance with the present invention Shape, but these corresponding changes and modifications all should fall within the scope of protection of the appended claims of the present invention.

Claims

1. a kind of unsupervised intellectualized battle deduction system based on deeply study, which is characterized in that described strong based on depth Chemistry practise unsupervised intellectualized battle deduction system include confrontation terminal, operation simulation engine, deeply learning platform and Data management platform is provided with equipment model library and model of place library, the data management platform in the operation simulation engine In be provided with sample database and training pattern library, in which:

The operation simulation engine, the operational environment for being inputted according to device end define information, import from equipment model library Equipment model relevant to operational environment imports model of place relevant to operational environment from model of place library, completes initialization, The operational environment includes combat mode, combat unit and operation scene；

And when combat mode is training mode, operation game between ourselves and the enemy is carried out based on timeslice and is deduced, by current environment Information is sent to the deeply learning platform, executes single step decision according to the movement feedback of deeply learning platform output Or multistep decision, and update environmental information；

It is also used to judge currently whether meet the flat relationship of victory or defeat or termination condition according to predefined rule, if it is flat currently to meet victory or defeat When relationship or termination condition, output, which is fought, deduces analysis report, and saves Campaign Process as sample data to the data The sample database of platform is managed, training is completed；Otherwise continue game deduction of fighting between ourselves and the enemy, until it is flat to meet victory or defeat Relationship or termination condition；

The deeply learning platform, for importing training pattern from the training pattern library of data management platform, and combines The environmental information that the operation simulation engine is sent, Xiang Zuozhan simulation engine output action feedback；It is also used to update combat unit Corresponding training pattern, and exported in training pattern library of the training pattern to data management platform after operation game is deduced It saves；

The data management platform, training pattern and described fight for saving the deeply learning platform output are imitated The sample data of true engine output；It is also used to export training pattern to the deeply learning platform.

2. the unsupervised intellectualized battle deduction system as described in claim 1 based on deeply study, which is characterized in that institute Stating environmental information includes: quantity, location information, current state, mutual spacing between ourselves and the enemy.

3. the unsupervised intellectualized battle deduction system as described in claim 1 based on deeply study, which is characterized in that institute The operation that single step decision includes combat unit is stated to act；The multistep decision include combat unit it is multiple fight one battle after another it is dynamic Make.

4. the unsupervised intellectualized battle deduction system as described in claim 1 based on deeply study, which is characterized in that institute Confrontation terminal is stated as device end, is inputted and is made to the operation simulation engine by the input interface of the operation simulation engine War Environment Definition information.

5. the unsupervised intellectualized battle deduction system as described in claim 1 based on deeply study, which is characterized in that institute The combat mode for stating operation simulation engine further includes simulation model, and the simulation model includes man-machine confrontation and the confrontation of machine machine；

The confrontation terminal, also provides for three-dimensional visualization interface, and the operation emulation is presented in visualization under simulation model The environmental information of engine；Confrontation operation interface is also provided for, in man-machine confrontation by confrontation operation interface artificially to described The input of operation simulation engine, which is fought, to be operated.

6. the unsupervised intellectualized battle deduction system as claimed in claim 5 based on deeply study, which is characterized in that institute State combat mode be machine machine confrontation when, it is described based on deeply study unsupervised intellectualized battle deduction system, execute it is as follows Operation:

The operation simulation engine for accessing the three-dimensional visualization interface of the confrontation terminal, and is inputted according to device end Operational environment define information, relevant to operational environment equipment model is imported from equipment model library, from the importing of model of place library Model of place relevant to operational environment imports training pattern from the training pattern library of data management platform, completes initialization； And operation game between ourselves and the enemy is carried out based on timeslice and is deduced, current environmental information is sent to the deeply and learns to put down Platform executes single step decision according to the movement feedback of deeply learning platform output, and updates environmental information according to single step decision, Effect is presented in the visualization for refreshing the confrontation terminal；

It is also used to judge currently whether meet the flat relationship of victory or defeat or termination condition according to predefined rule, if it is flat currently to meet victory or defeat When relationship or termination condition, output, which is fought, deduces analysis report, and saves Campaign Process as sample data to the data The sample database of platform is managed, emulation is completed；Otherwise continue game deduction of fighting between ourselves and the enemy, until it is flat to meet victory or defeat Relationship or termination condition；

The deeply learning platform, for importing training pattern from the training pattern library of data management platform, and combines The environmental information that the operation simulation engine is sent, Xiang Zuozhan simulation engine output action feedback；

The data management platform, for saving the sample data of the operation simulation engine output；It is also used to the depth Intensified learning platform and the operation simulation engine export training pattern.

7. the unsupervised intellectualized battle deduction system as claimed in claim 5 based on deeply study, which is characterized in that institute State combat mode be man-machine confrontation when, it is described based on deeply study unsupervised intellectualized battle deduction system, execute it is as follows Operation:

The operation simulation engine, for accessing the three-dimensional visualization interface and confrontation operation interface of the confrontation terminal, and root Information is defined according to the operational environment that device end inputs, imports equipment model relevant to operational environment from equipment model library, from Model of place library imports model of place relevant to operational environment, and training mould is imported from the training pattern library of data management platform Type completes initialization；And operated according to the operation of confrontation terminal, operation game between ourselves and the enemy is carried out based on timeslice and is deduced, it will Current environmental information is sent to the deeply learning platform, is held according to the movement feedback of deeply learning platform output Row single step decision, while the operation operation of the confrontation terminal input is received, and ring is updated according to single step decision and operation of fighting Border information refreshes the effect of visualization of the confrontation terminal；