Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with the embodiment of the present invention
In attached drawing, technical scheme in the embodiment of the invention is clearly and completely described, it is clear that described embodiment is
A part of the embodiment of the present invention, instead of all the embodiments.Based on the embodiments of the present invention, those of ordinary skill in the art
Every other embodiment obtained without creative efforts, shall fall within the protection scope of the present invention.
Fig. 1 is the train dynamics emulation mode flow chart provided in an embodiment of the present invention based on machine learning.Such as Fig. 1 institute
Show, which comprises
Step 101, basis preset tag extraction attribute data information, the attribute data information and the default label phase
It closes;
The default label refers to by emulating the parameter to be predicted.According to the default label from the data board number of train
It is described related to the default label to refer to the attribute number according to middle extraction attribute data information relevant to the default label
It is believed that the value of breath has an impact to the value of the default label.
The train real time information parameter of train head end and tail end vehicle-mounted recording when the data board data are train operation;It can
The data board data are stored entirely in local, so that simulated program calls the attribute data information needed;It can also be direct
Extract the attribute data information that simulated program needs from the data board data, such as described category relevant to the default label
Property data information.After extracting the attribute data information, by the attribute data information preservation of extraction in local or database
For simulated program calling.
Step 102 demarcates the default label according to the attribute data information;
The attribute data information is on the influential data of the value of default label tool, according to the kinetic simulation of train
Type or empirical equation obtain the relationship of the attribute data information Yu the default label, and then according to the attribute data information
The default label is demarcated, namely the default label is calculated according to the attribute data information.
Step 103, using the attribute data information and the default label as training set, be based on support vector regression
Algorithm exercises supervision study, obtains and obtains the regression curve of the default label by the attribute data information;
Support vector regression algorithm (SVM) is a kind of sample learning algorithm for having solid theoretical basis, it avoids complexity
Deductive procedure, realize efficiently from training sample to the inference mode of forecast sample, SVR, which is simplified, common to be returned, is pre-
Survey problem reduces the complexity of mass data processing, substantially increases training effectiveness, is better than genetic algorithm.
It is input with the attribute data information and the default label when the attribute data information is various dimensions,
It is output with the default label, is trained using support vector regression algorithm;Wherein, the attribute data as input
Information and the default label as output obtain the regression curve for training;The default label as input
For supervision machine learning training process, during fitting obtains the regression curve, if according to the regression curve
The gap of the predicted value of the obtained default label and the value of the actual default label exceeds preset threshold, then gives up phase
Answer sample.
When being analyzed using support vector regression, according to the magnanimity history number of the attribute data information and the default label
According to being trained and being fitted, obtains and the regression curve of the default label is obtained by the attribute data information.
Particularly, if in any period the attribute data information or default label, when the attribute number it is believed that
It is input with temporal information when breath or default label are single dimension, is defeated with the attribute data information or the default label
Out, it is trained using support vector regression, the available attribute data information or the default label change over time
Model.
Step 104, using the regression curve, the value of label is preset according to the attribute data information prediction, it is real
The dynamics simulation simulation of existing virtual train;
After acquisition obtains the regression curve of the default label by the attribute data information, for the new received category
Property data information, the attribute data information is substituted into the regression curve, the value for predicting the default label may be implemented, from
And realize the dynamics simulation simulation of virtual train.
Support vector regression is the process of a supervised learning, exact according to history data set repetition training formation one
Training pattern directly can obtain the pre- bidding of demand using the training pattern in order to attribute data information certain known to the later period
Label, the entire model realization virtual analog of train, reduce the actual test amount of train.
With the continuous accumulation of sample, the training set in machine learning can be updated, to realize to the attribute number
It is believed that the continuous self study of breath and the default label, improves the accuracy of regression curve.
The embodiment of the present invention is by the support vector regression algorithm in machine learning, to the attribute data information and mark of train
Label are trained, and realize simulation and test to virtual train on emulation platform;Can reduce train field monitoring monitoring when
Between and manpower consumption, and improve efficiency and accuracy rate in entire test process.
Further, based on the above embodiment, before the basis presets tag extraction attribute data information, the side
Method further include: the setting default label, the default label specifically include trailer system link delay time, braking system chain
Road delay time and tractive force;Determine that the attribute data information, the attribute data information specifically include ATO speed, column
Distance away from nearest station of vehicle level, train, ATP speed, ATP speed limit, ATP alarm, level change the time, the ATO response time,
ATO draws acceleration and ATO braking acceleration.
Default label is set, the default label is to emulate the value that finally predicted.The default label can be with people
It is specified, the default label as described in specified according to emulation purpose.Label is preset described in the embodiment of the present invention specifically includes traction
System link delay time, braking system link delay time and tractive force, it is possible to understand that, the trailer system link prolongs
When time, braking system link delay time and tractive force be train operation in trailer system link delay time, braking system
System link delay time and tractive force.
In addition, can also be by obtaining important label using temperature figure tool, to realize the dynamics simulation to train.Benefit
It can be set with temperature figure for the label of simulation process and reference is provided.It is measured by test of many times, such as benefit in the embodiment of the present invention
It can know that the strong correlation attribute in the train dynamics parameter by selection converges to three key parameters substantially with temperature figure:
Trailer system link delay time, braking link delay time and tractive force;So selecting three keys in subsequent step
Label of the parameter as machine learning.Meanwhile if convergence result shows other protrusions by temperature map analysis repeatedly
Key parameter then should modify label through tradeoff at this time.
Determine that the attribute data information, the attribute data information are related to the default label according to default label
, influential on the result of the default label attribute data information.In three labels in the embodiment of the present invention, lead
Draw system link delay time and time when braking link delay time can be changed by level opens with corresponding ATO speed
The time difference that beginning changes thoroughly deserves, and tractive force can be obtained by tractive force model.To each in train travelling process
The research of institute's stress is divided into tractive force, running resistance, three kinds of brake force under operating status;Wherein running resistance and brake force influence
Tractive force size in entire train travelling process.
On the basis of the above embodiments, the embodiment of the present invention by setting label and attribute number relevant to label it is believed that
Breath, the realization for further machine learning algorithm provide premise, improve the reliability of emulation.
Further, based on the above embodiment, it is described according to the attribute data information demarcate the default label it
Before, the method also includes: the attribute data information is pre-processed and analyzed, is specifically included: removing abnormal attribute number
It is believed that breath, the attribute data information is normalized and is obtained the incidence relation of the attribute data information.
It removes abnormal attribute data information and refers to the abnormal data removed in the attribute data information.Remove abnormal attribute
Data information is to guarantee that the data reliability in subsequent step, abnormal data will affect subsequent regression algorithm to recurrence mould
The deduction of type;Since the data scale between the different attribute data information is different, in order to guarantee attribute data described in multiclass
Information does regression analysis in same data acquisition system, needs to be normalized, and otherwise can lose the lesser attribute of scale-value
Feature;The incidence relation of the attribute data information is obtained, it can be to provide reference by machine learning fit regression curve.
The embodiment of the present invention improves the effective of sample data by the way that attribute data information is pre-processed and analyzed
Property, improve the reliability and accuracy of curve matching.
Further, based on the above embodiment, described that pretreatment is carried out to the attribute data information and analyzes specific packet
It includes: the abnormal attribute data information being judged using box traction substation, and removes the abnormal attribute data information;Then it utilizes
The attribute data information is normalized in min-max standardized algorithm or Z-score standardized algorithm;Finally utilize
Temperature figure carries out strong correlation analysis to the attribute data information.
The abnormal attribute data information is judged using box traction substation, and removes the abnormal attribute data information;Due to
Data scale between the different abnormal attribute data informations is different, in order to guarantee that multi-class data is done in same data acquisition system
Regression analysis needs to be normalized, and otherwise can lose the lesser attributive character of scale-value, be standardized using min-max
Or normalized is completed in Z-score standardization, and the abnormal attribute data information is converted to nondimensional 0~1 value, with
Facilitate and compares;Strong correlation analysis finally is carried out to the attribute data information using temperature figure, finds strong correlation attribute to carry out
Further data analysis and processing.
Simulation software can be write completion by Python, be transported under 7 system of Windows by 2.7 interpreter of Python
Row, the simulation software can realize the function that the attribute data information is pre-processed and analyzed.
The embodiment of the present invention is by pre-processing attribute data information using box traction substation, normalization algorithm and temperature figure
With analysis, the validity of sample data is improved, improves the reliability and accuracy of curve matching.
Fig. 2 is the train dynamics simulation platform structure schematic diagram provided in an embodiment of the present invention based on machine learning.Such as
Shown in Fig. 2, the emulation platform includes attribute data data obtaining module 10, default label demarcating module 20, machine learning mould
Block 30 and simulation and prediction module 40, in which:
Attribute data data obtaining module 10 is specifically used for according to default tag extraction attribute data information, the attribute number
It is believed that breath is related to the default label;
The default label refers to by emulating the parameter to be predicted.Attribute data data obtaining module 10 is according to described pre-
Bidding label extract attribute data information relevant with the default label from the data board data of train, described to preset with described
Label correlation refers to that the value of the attribute data information has an impact to the value of the default label.Attribute data data obtaining module
After 10 extract the attribute data information, by the attribute data information preservation of extraction in local or database for emulation journey
Sequence is called.
Default label demarcating module 20 is specifically used for demarcating the default label according to the attribute data information;
The attribute data information is on the influential data of the value of default label tool, according to the kinetic simulation of train
Type or empirical equation obtain the relationship of the attribute data information Yu the default label, and then default label demarcating module 20
The default label is demarcated according to the attribute data information, namely the pre- bidding is calculated according to the attribute data information
Label.
Machine learning module 30 is specifically used for using the attribute data information and the default label as training set, base
It exercises supervision study in support vector regression algorithm, it is bent to obtain the recurrence that the default label is obtained by the attribute data information
Line;
When the attribute data information is various dimensions, machine learning module 30 is with the attribute data information and described pre-
Bidding label are input, are output with the default label, are trained using support vector regression algorithm;Wherein, as input
The attribute data information and the default label as output, obtain the regression curve for training;As input
The default label be used to supervision machine learning training process, during fitting obtains the regression curve, if root
The gap of the value of the predicted value and actual default label of the default label obtained according to the regression curve is beyond pre-
If threshold value then gives up respective sample.
When machine learning module 30 is analyzed using support vector regression, according to the attribute data information and the pre- bidding
The mass historical data of label is trained and is fitted, and it is bent to obtain the recurrence for obtaining the default label by the attribute data information
Line.
Simulation and prediction module 40 is specifically used for utilizing the regression curve, pre- according to the attribute data information prediction
It is marked with the value of label, realizes the dynamics simulation simulation of virtual train;
After the acquisition of machine learning module 30 obtains the regression curve of the default label by the attribute data information, emulation
Prediction module 40 substitutes into the regression curve for the new received attribute data information, by the attribute data information, can
To realize the value for predicting the default label, to realize the dynamics simulation simulation of virtual train.
The embodiment of the present invention is by the support vector regression algorithm in machine learning, to the attribute data information and mark of train
Label are trained, and realize simulation and test to virtual train on emulation platform;Can reduce train field monitoring monitoring when
Between and manpower consumption, and improve efficiency and accuracy rate in entire test process.
Further, based on the above embodiment, the emulation platform further include: default label setting module is specifically used for
The default label is set, when the default label specifically includes trailer system link delay time, braking system link delay
Between and tractive force;Attribute data information determination module is specifically used for determining the attribute data information, the attribute number it is believed that
Breath specifically includes ATO speed, train level, train distance, ATP speed, ATP speed limit, ATP alarm, level away from nearest station
Change time, ATO response time, ATO traction acceleration and ATO braking acceleration.
Default label setting module is used to set default label, and the default label is emulated and finally to be predicted
Value.The default label can be taking human as specified, the default label as described in specified according to emulation purpose.In addition, utilization can also be passed through
Temperature figure tool obtains important label, to realize the dynamics simulation to train.
Attribute data information determination module determines the attribute data information, the attribute data information according to default label
It is the attribute data information relevant to the default label, influential on the result of the default label.The present invention is real
It applies in three labels in example, when trailer system link delay time and braking link delay time can be changed by level
Time starts thoroughly deserving for the time difference changed with corresponding ATO speed, and tractive force can be obtained by tractive force model.
On the basis of the above embodiments, the embodiment of the present invention by setting label and attribute number relevant to label it is believed that
Breath, the realization for further machine learning algorithm provide premise, improve the reliability of emulation.
Further, based on the above embodiment, the emulation platform further include: pretreatment and analysis module are specifically used for
The attribute data information is pre-processed and is analyzed, comprising: remove abnormal attribute data information, to the attribute number it is believed that
Breath is normalized and obtains the incidence relation of the attribute data information.
It removes abnormal attribute data information and refers to the abnormal data removed in the attribute data information.Remove abnormal attribute
Data information is to guarantee that the data reliability in subsequent step, abnormal data will affect subsequent regression algorithm to recurrence mould
The deduction of type;Since the data scale between the different attribute data information is different, in order to guarantee attribute data described in multiclass
Information does regression analysis in same data acquisition system, needs to be normalized, and otherwise can lose the lesser attribute of scale-value
Feature;The incidence relation of the attribute data information is obtained, it can be to provide reference by machine learning fit regression curve.
The embodiment of the present invention improves the effective of sample data by the way that attribute data information is pre-processed and analyzed
Property, improve the reliability and accuracy of curve matching.
Further, based on the above embodiment, the pretreatment is specifically used for analysis module: judging institute using box traction substation
Abnormal attribute data information is stated, and removes the abnormal attribute data information;Then min-max standardized algorithm or Z- are utilized
The attribute data information is normalized in score standardized algorithm;Finally using temperature figure to the attribute data
Information carries out strong correlation analysis.
Pretreatment judges the abnormal attribute data information, and the abnormal attribute number using box traction substation with analysis module
It is believed that breath;Since the data scale between the different abnormal attribute data informations is different, in order to guarantee multi-class data same
Regression analysis is done in data acquisition system, needs to be normalized, otherwise can lose the lesser attributive character of scale-value, is utilized
Normalized is completed in min-max standardization or Z-score standardization, and the abnormal attribute data information is converted to dimensionless
0~1 value, compared with facilitating;Strong correlation analysis finally is carried out to the attribute data information using temperature figure, finds strong phase
Attribute is closed to carry out further data analysis and processing.
The embodiment of the present invention is by pre-processing attribute data information using box traction substation, normalization algorithm and temperature figure
With analysis, the validity of sample data is improved, improves the reliability and accuracy of curve matching.
Equipment provided in an embodiment of the present invention is for the above method, and concrete function can refer to above method process, this
Place repeats no more.
Fig. 3 is the structural schematic diagram of electronic equipment provided in an embodiment of the present invention.As shown in figure 3, electronic equipment 1 includes place
Manage device 301, memory 302 and bus 303.Wherein, the processor 301 and the memory 302 are complete by the bus 303
At mutual communication;The processor 301 is used to call the program instruction in the memory 302, to execute above-mentioned each side
Method provided by method embodiment, for example, according to default tag extraction attribute data information, the attribute data information with
The default label is related;The default label is demarcated according to the attribute data information;Using the attribute data information and
The default label is exercised supervision study based on support vector regression algorithm as training set, obtain by the attribute number it is believed that
Breath obtains the regression curve of the default label;It is pre- according to the attribute data information prediction using the regression curve
It is marked with the value of label, realizes the dynamics simulation simulation of virtual train.
The embodiment of the present invention discloses a kind of computer program product, and the computer program product is non-transient including being stored in
Computer program on computer readable storage medium, the computer program include program instruction, when described program instructs quilt
When computer executes, computer is able to carry out method provided by above-mentioned each method embodiment, for example, according to default label
Attribute data information is extracted, the attribute data information is related to the default label;It is demarcated according to the attribute data information
The default label;Using the attribute data information and the default label as training set, calculated based on support vector regression
Method exercises supervision study, obtains and obtains the regression curve of the default label by the attribute data information;Utilize the recurrence
Curve presets the value of label according to the attribute data information prediction, realizes the dynamics simulation simulation of virtual train.
The embodiment of the present invention provides a kind of non-transient computer readable storage medium, the non-transient computer readable storage
Medium storing computer instruction, the computer instruction make the computer execute side provided by above-mentioned each method embodiment
Method, for example, according to default tag extraction attribute data information, the attribute data information is related to the default label;
The default label is demarcated according to the attribute data information;Using the attribute data information and the default label as instruction
Practice collection, exercised supervision study based on support vector regression algorithm, acquisition obtains the default label by the attribute data information
Regression curve;Using the regression curve, the value of label is preset according to the attribute data information prediction, is realized virtual
The dynamics simulation of train is simulated.
Those of ordinary skill in the art will appreciate that: realize that all or part of the steps of above method embodiment can pass through
The relevant hardware of program instruction is completed, and program above-mentioned can be stored in a computer readable storage medium, the program
When being executed, step including the steps of the foregoing method embodiments is executed;And storage medium above-mentioned includes: ROM, RAM, magnetic disk or light
The various media that can store program code such as disk.
The embodiments such as electronic equipment described above are only schematical, wherein it is described as illustrated by the separation member
Unit may or may not be physically separated, and component shown as a unit may or may not be object
Manage unit, it can it is in one place, or may be distributed over multiple network units.It can select according to the actual needs
Some or all of the modules therein is selected to achieve the purpose of the solution of this embodiment.Those of ordinary skill in the art are not paying wound
In the case where the labour for the property made, it can understand and implement.
Through the above description of the embodiments, those skilled in the art can be understood that each embodiment can
It realizes by means of software and necessary general hardware platform, naturally it is also possible to pass through hardware.Based on this understanding, on
Stating technical solution, substantially the part that contributes to existing technology can be embodied in the form of software products in other words, should
Computer software product may be stored in a computer readable storage medium, such as ROM/RAM, magnetic disk, CD, including several fingers
It enables and using so that an electronic equipment (can be personal computer, server or the network equipment etc.) executes each embodiment
Or method described in certain parts of embodiment.
Finally, it should be noted that the above embodiments are merely illustrative of the technical solutions of the present invention, rather than its limitations;Although
Present invention has been described in detail with reference to the aforementioned embodiments, those skilled in the art should understand that: it still may be used
To modify the technical solutions described in the foregoing embodiments or equivalent replacement of some of the technical features;
And these are modified or replaceed, technical solution of various embodiments of the present invention that it does not separate the essence of the corresponding technical solution spirit and
Range.