CN111913887B

CN111913887B - Software behavior prediction method based on beta distribution and Bayesian estimation

Info

Publication number: CN111913887B
Application number: CN202010836514.6A
Authority: CN
Inventors: 唐剑; 赵亮; 唐艺; 浦戈光
Original assignee: National Defense Technology Innovation Institute PLA Academy of Military Science
Current assignee: National Defense Technology Innovation Institute PLA Academy of Military Science
Priority date: 2020-08-19
Filing date: 2020-08-19
Publication date: 2022-11-11
Anticipated expiration: 2040-08-19
Also published as: CN111913887A

Abstract

The invention discloses a software behavior prediction method based on the beta distribution and Bayesian estimation. Given behavior training data, the method lets a single observation change the model substantially instead of converging the model to a fixed value, predicts the next binary behavior, and achieves good model training and behavior prediction results. The invention builds on model prediction techniques based on probability distributions, Bayesian estimation and hidden Markov models, and performs model training and prediction when the data volume is small and the model changes substantially. The tool uses a hidden Markov model to model and describe the situation, then selects the beta probability distribution and uses Bayesian theory to update the model from a single data item. By setting prerequisite prior data, the model can change substantially according to the actual situation and that prior data while its internal rule still follows the beta distribution. Given binary behavior training data, the invention can train a model that changes with a single data item and predict the next behavior, which broadens the range of objects to which model training software can be applied.

Description

Software behavior prediction method based on beta distribution and Bayesian estimation

Technical Field

The invention relates to the technical field of artificial intelligence, in particular to a software behavior prediction method based on beta distribution and Bayesian estimation.

Background

Currently, most model training software and algorithms rely on a large amount of data and, through repeated iteration, fitting and learning, eventually converge to a model with fixed probabilities; examples include neural network algorithms, clustering algorithms and the expectation-maximization algorithm. Mainstream model training software and algorithms therefore share a shortcoming: they cannot be used when the model keeps changing and does not tend to converge. Such a situation has two characteristics. First, the amount of data available while the model is stable and its probability is unchanged is small, and in the extreme case there may be only a single data item. Second, although the model has an inherent rule that can be followed as it changes, it can change substantially after a single data update. Therefore, without a large amount of data or a model with explicit convergence properties, existing model training methods cannot produce a converged, stable model.

Bayesian estimation typically proceeds as follows: the parameters to be estimated are regarded as random variables that follow a certain prior probability distribution, and, by observing samples, the prior probability density is converted into the posterior probability density through Bayes' rule. There are two basic approaches to probability density estimation. Parametric estimation assumes, from general knowledge of the problem, that the random variables follow a certain distribution, and estimates the parameters of that distribution from training data; maximum likelihood (ML) estimation and Bayesian estimation are examples. Nonparametric estimation estimates the probability density without assuming a model, using only the training data itself. Bayesian estimation is used in many fields; in positioning applications, for example, it yields a good estimate of the target to be located or tracked from a series of measurements.
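To illustrate how ML and Bayesian parameter estimates differ when data is scarce, the following sketch estimates the probability of a binary event from a handful of observations. The uniform Beta(1, 1) prior and the function names are illustrative assumptions and do not come from the source.

```python
def ml_estimate(observations):
    """Maximum likelihood estimate: the observed frequency of 1s."""
    return sum(observations) / len(observations)


def bayes_estimate(observations, a0=1.0, b0=1.0):
    """Posterior mean under an assumed Beta(a0, b0) prior.

    The beta prior is conjugate to binary observations, so the posterior
    is again a beta distribution with updated parameters.
    """
    successes = sum(observations)
    failures = len(observations) - successes
    a = a0 + successes
    b = b0 + failures
    return a / (a + b)


if __name__ == "__main__":
    single = [1]  # only one observation is available
    print(ml_estimate(single))     # jumps straight to the extreme value
    print(bayes_estimate(single))  # stays between the prior mean and the data
```

With a single observation the ML estimate commits fully to that observation, while the Bayesian estimate is tempered by the prior, which is the small-data behavior the background describes.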

In this project, a model prediction technique based on probability distributions, Bayesian estimation and hidden Markov models is used to perform model training and prediction when the data volume is small and the model changes substantially. A hidden Markov model is used to model and describe the situation; the beta probability distribution is then selected, and Bayesian theory is used to update the model from a single data item. By setting prerequisite prior data, the model can change substantially according to the actual situation and that prior data, while its rule still follows the beta distribution.

Disclosure of Invention

The invention provides a software behavior prediction method based on the beta distribution and Bayesian estimation, and implements a corresponding behavior prediction tool. The tool takes as input the behavior trajectory data of an experimental object and probability parameters set by experimenters. It first preprocesses the behavior parameters of the experimental object to obtain the object's behavior trajectory, and then uses that trajectory to train a beta distribution step by step. In a single training step on a single binary behavior of the experimental object, it first uses past behaviors as the prior and integrates to obtain the current probability; it then applies Bayesian theory, with the probability parameters set by the experimenters as a prior, to alter how the probability changes, and finally obtains the predicted probability of the next behavior.

The invention provides a software behavior prediction method based on beta distribution and Bayesian estimation, which comprises the following steps:

S1. Preprocess the software behavior data to obtain a software behavior trajectory.

S2. Train the beta distribution step by step using the software behavior trajectory, which can be decomposed into consecutive single binary behaviors. Specifically, a beta distribution training algorithm is constructed for a single binary behavior of the software, as follows:

First, a beta probability distribution is established. The beta distribution is a conjugate prior, so the posterior distribution after a single data update has the same form as the prior and differs only in its parameters. Consider two mutually exclusive events A and B: if A occurs then B does not, if B occurs then A does not, and at least one of A and B occurs. Assume that the quantity to be predicted is the probability that A occurs. Let s_k indicate whether A occurs at the k-th time (s_k = 1 when A occurs), let S_{k-1} denote the behaviors observed up to the (k-1)-th time, and let r_k be the probability that A occurs at the k-th time, where r_k follows a beta probability distribution. Then:

$$p(s_k = 1 \mid S_{k-1}) = \int p(s_k = 1 \mid r_k)\, p(r_k \mid S_{k-1})\, dr_k$$

$$p(s_k = 1 \mid S_{k-1}) = \int r_k\, p(r_k \mid S_{k-1})\, dr_k$$

Second, through training on the software behavior trajectory, let α be the probability that the probability of A occurring remains unchanged, and let p_0 be the probability distribution of r_k at initialization. The beta probability distribution used to predict a single software behavior is then:

$$p(r_k \mid S_{k-1}) = \alpha\, p(r_{k-1} \mid S_{k-1}) + (1 - \alpha)\, p_0(r_k)$$
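The training and prediction equations above can be sketched in Python under assumptions that the source does not state: p_0 is taken to be a Beta(a0, b0) density, s = 1 encodes event A, and the distribution of r_k is tracked exactly as a weighted mixture of beta components (each transition adds one p_0 component, and each observation updates every component by conjugacy). All function and parameter names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class BetaComponent:
    weight: float  # mixture weight of this component
    a: float       # beta parameter counting occurrences of A
    b: float       # beta parameter counting occurrences of B

    def mean(self) -> float:
        return self.a / (self.a + self.b)


def transition(components, alpha, a0, b0):
    """p(r_k | S_{k-1}) = alpha * p(r_{k-1} | S_{k-1}) + (1 - alpha) * p_0(r_k)."""
    mixed = [BetaComponent(alpha * c.weight, c.a, c.b) for c in components]
    mixed.append(BetaComponent(1.0 - alpha, a0, b0))
    return mixed


def predict(components):
    """p(s_k = 1 | S_{k-1}): the integral of r_k against the mixture density."""
    return sum(c.weight * c.mean() for c in components)


def observe(components, s):
    """Bayesian update on one binary behavior s (1 = A occurred, 0 = B occurred)."""
    updated = []
    for c in components:
        likelihood = c.mean() if s == 1 else 1.0 - c.mean()
        a, b = (c.a + 1, c.b) if s == 1 else (c.a, c.b + 1)
        updated.append(BetaComponent(c.weight * likelihood, a, b))
    total = sum(c.weight for c in updated)
    return [BetaComponent(c.weight / total, c.a, c.b) for c in updated]


def predict_trajectory(trajectory, alpha=0.9, a0=1.0, b0=1.0):
    """Return the one-step-ahead predictions and the prediction for the next behavior."""
    components = [BetaComponent(1.0, a0, b0)]  # start from p_0
    predictions = []
    for s in trajectory:
        components = transition(components, alpha, a0, b0)
        predictions.append(predict(components))
        components = observe(components, s)
    return predictions, predict(transition(components, alpha, a0, b0))


if __name__ == "__main__":
    history, next_probability = predict_trajectory([1, 1, 0, 1, 1], alpha=0.9)
    print(history, next_probability)
```

In this sketch a smaller α pulls every prediction back toward the mean of p_0, so the model keeps reacting to individual behaviors instead of settling on a fixed value; α = 1 reduces to an ordinary beta update.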

S3. Construct a prior value optimization model algorithm that takes past occurrences of the same single software behavior as a prior value, as follows:

Prior value information is added to the established beta probability distribution model. Let the probability that the prior value holds be p(experimenter), and let the prediction probability before the prior value information is added be p(original). According to Bayesian theory, the following formula holds:

where p(experimenter | original) is the probability that the prior information holds given the prediction probability, that is, the probability that the prior value p(original) holds in the corresponding beta distribution.
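The formula itself is not reproduced in this text. Assuming it is the standard statement of Bayes' rule applied to the three quantities named above (an assumption, since the original expression is unavailable here), it would read:

$$p(\text{original} \mid \text{experimenter}) = \frac{p(\text{experimenter} \mid \text{original})\; p(\text{original})}{p(\text{experimenter})}$$

Under that reading, the left-hand side is the prediction probability after the experimenter's prior value information has been taken into account.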

S4. Optionally optimize the predicted software behavior value using a memory decay optimization algorithm. The memory decay optimization proceeds as follows:

After the prior value information is added, an optional memory decay optimization is applied. The memory decay algorithm receives a behavior sequence actionList and, after memory decay processing, returns a list of beta distributions, betaMemoDecayList. The list has the same length as actionList, and each entry is the beta distribution in the probability prediction function corresponding to one behavior in actionList.

The one-time memory decay algorithm receives a behavior sequence actionList and returns a single beta distribution that has undergone memory decay. In essence, when training on each behavior in actionList, the later a behavior occurs, the higher its weight and the greater its influence on the model; the earlier it occurs, the lower its weight, reflecting the reduced influence of older historical information on the current model. The weighting scheme is selectable, with two options, discrete and linear: the former strengthens the influence of recent behaviors on the model, and the latter makes the influence of behaviors increase linearly from the oldest to the most recent.

The method provided by the invention has the following advantages:

The software behavior prediction method based on the beta distribution and Bayesian estimation updates the model from a single binary behavior, rather than producing a stable, fixed-probability model that can only be obtained by training on a large amount of data.

The method allows prior information to be added so that the predicted probability better fits the actual behavior prediction situation: the model is not confined to stable changes within a small range, but can change substantially based on its historical data and the prior information, enabling more accurate judgments.

With the method, a user can choose to add a memory decay model and further select the memory decay mode: discrete, linear or exponential.

Drawings

FIG. 1 is a flow chart of a method for behavior prediction;

FIG. 2 is a diagram of a beta distribution based behavior prediction architecture;

FIG. 3 is a functional block diagram of a behavior prediction tool.

Detailed Description

The invention is described in detail below with reference to the figures and examples.

The embodiment of the present invention provides a behavior prediction method based on model training with the beta distribution and Bayesian estimation. To help those skilled in the art understand the invention more clearly, it is described in detail below with reference to specific implementations and the accompanying drawings. All other results obtained on the basis of the invention without creative work fall within the scope of protection of the invention.

As shown in FIG. 2 and FIG. 3, the beta distribution based behavior prediction system takes as input the behavior trajectory data of an experimental object and probability parameters set by experimenters. It first preprocesses the behavior parameters of the experimental object to obtain the object's behavior trajectory, and then uses that trajectory to train a beta distribution step by step. In a single training step on a single binary behavior of the experimental object, past behaviors are first used as the prior and the current probability is obtained by integration; the probability parameters set by the experimenters are then introduced as a prior through Bayesian theory to alter how the probability changes; finally, the predicted probability of the next behavior is obtained.

As shown in FIG. 1, the behavior prediction method based on model training with the beta distribution and Bayesian estimation includes the following steps:

S1. Preprocess the software behavior data to obtain a software behavior trajectory.

S2. Train the beta distribution step by step using the software behavior trajectory, decomposing the trajectory into consecutive single binary behaviors. Specifically, a beta distribution training algorithm is constructed for a single binary behavior of the software, as follows:

$$p(s_k = 1 \mid S_{k-1}) = \int p(s_k = 1 \mid r_k)\, p(r_k \mid S_{k-1})\, dr_k$$

$$p(s_k = 1 \mid S_{k-1}) = \int r_k\, p(r_k \mid S_{k-1})\, dr_k$$

$$p(r_k \mid S_{k-1}) = \alpha\, p(r_{k-1} \mid S_{k-1}) + (1 - \alpha)\, p_0(r_k)$$
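As a worked illustration, suppose (this is an assumption, not stated in the source) that the previous posterior p(r_{k-1} | S_{k-1}) is a Beta(a, b) density and that p_0 is a Beta(a_0, b_0) density. Because the mean of a Beta(a, b) distribution is a / (a + b), substituting the third equation into the second gives a closed form for the prediction:

$$p(s_k = 1 \mid S_{k-1}) = \alpha\,\frac{a}{a + b} + (1 - \alpha)\,\frac{a_0}{a_0 + b_0}$$

For example, with a = 3, b = 1, a_0 = b_0 = 1 and α = 0.9, the predicted probability is 0.9 × 0.75 + 0.1 × 0.5 = 0.725.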

Prior value information is added to the established beta probability distribution model. Let the probability that the prior value holds be p(experimenter), and let the prediction probability before the prior value information is added be p(original). According to Bayesian theory, the following formula holds:

where p(experimenter | original) is the probability that the prior information holds given the prediction probability, that is, the probability that the prior value p(original) holds in the corresponding beta distribution.

After the prior value information is added, an optional memory decay optimization is applied. The memory decay algorithm (Memory Decay Algorithm) receives a behavior sequence actionList and, after memory decay processing, returns a list of beta distributions, betaMemoDecayList. The list has the same length as actionList, and each entry is the beta distribution in the probability prediction function corresponding to one behavior in actionList.

The memory decay algorithm is as follows:

The one-time memory decay algorithm receives a behavior sequence actionList and returns a single beta distribution that has undergone memory decay. In essence, when training on each behavior in actionList, the later a behavior occurs, the higher its weight and the greater its influence on the model; the earlier it occurs, the lower its weight, reflecting the reduced influence of older historical information on the current model. The weighting scheme is selectable. As shown in FIG. 1, there are two options, discrete and linear: the former strengthens the influence of recent behaviors on the model, and the latter makes the influence of behaviors increase linearly from the oldest to the most recent.
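One way to picture the weighting described above is the following Python sketch. It is not the algorithm of the invention: the exact form of the discrete weights, the use of fractional pseudo-counts to apply a weight to a behavior, the Beta(1, 1) starting point, the choice to build the list by applying the one-time decay to each prefix of the sequence, and all function names are assumptions made for illustration.

```python
def linear_weights(n):
    """Weights rising linearly from 1/n (oldest behavior) to 1 (newest)."""
    return [(i + 1) / n for i in range(n)]


def discrete_weights(n, recent=3, old_weight=0.5):
    """Step weights: the last `recent` behaviors count fully, older ones count less."""
    return [1.0 if i >= n - recent else old_weight for i in range(n)]


def one_time_memory_decay(action_list, weights_fn=linear_weights, a0=1.0, b0=1.0):
    """Return (a, b) of a single beta distribution trained on action_list.

    Each behavior (1 = event A, 0 = event B) adds its weight, rather than
    a full count of 1, to the matching beta parameter.
    """
    a, b = a0, b0
    for action, weight in zip(action_list, weights_fn(len(action_list))):
        if action == 1:
            a += weight
        else:
            b += weight
    return a, b


def memory_decay(action_list, weights_fn=linear_weights):
    """Return one decayed beta distribution per behavior in action_list."""
    return [one_time_memory_decay(action_list[:k + 1], weights_fn)
            for k in range(len(action_list))]


if __name__ == "__main__":
    actions = [0, 0, 1, 1]
    a, b = one_time_memory_decay(actions)
    print(a / (a + b))            # weighted mean leans toward the recent 1s
    print(memory_decay(actions))  # same length as the behavior sequence
```

In this sketch an unweighted update on the same sequence would give a mean of 0.5, whereas the linear weights shift the mean toward the two most recent behaviors.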

The discrete memory decay model algorithm is as follows:

Finally, it should be noted that the above description only illustrates embodiments of the present invention and is not intended to limit it. Although the present invention has been described in detail with reference to the foregoing embodiments, those skilled in the art can still modify the technical solutions of those embodiments or make equivalent substitutions for some of their technical features. Any modification, replacement or improvement made within the spirit and scope of the present invention falls within the scope of protection of the present invention.

Claims

1. A software behavior prediction method based on beta distribution and Bayesian estimation, characterized by comprising:

S1. preprocessing software behavior data to obtain a software behavior trajectory;

S2. training the beta distribution step by step using the software behavior trajectory, wherein the software behavior trajectory can be decomposed into consecutive single binary behaviors, and specifically a beta distribution training algorithm is constructed for the single binary behaviors of the software, namely:

first, establishing a beta probability distribution, and considering two mutually exclusive events A and B, that is, the occurrence of A means that B does not occur, the occurrence of B means that A does not occur, and at least one of A and B occurs; letting s_k indicate whether A occurs at the k-th time and r_k be the probability that A occurs at the k-th time, with r_k following the beta probability distribution, then:

$$p(s_k = 1 \mid S_{k-1}) = \int p(s_k = 1 \mid r_k)\, p(r_k \mid S_{k-1})\, dr_k$$

$$p(s_k = 1 \mid S_{k-1}) = \int r_k\, p(r_k \mid S_{k-1})\, dr_k$$

$$p(r_k \mid S_{k-1}) = \alpha\, p(r_{k-1} \mid S_{k-1}) + (1 - \alpha)\, p_0(r_k)$$

adding prior value information to the established beta probability distribution model; assuming that the probability that the prior value holds is p(experimenter) and that the prediction probability before the prior value information is added is p(original), the following formula holds according to Bayesian theory:

then the predicted value of the single binary behavior of the software can be obtained, and further a software behavior prediction sequence is obtained;

S4. optionally optimizing the predicted software behavior value using a memory decay optimization algorithm, wherein the memory decay optimization comprises: after the prior value information is added, receiving a historical sequence of single software behaviors and, after memory decay processing, returning a list of beta distributions, wherein the length of the list is the same as that of the historical sequence of software behaviors, and the list represents the beta distribution in the probability prediction function corresponding to each software behavior in the historical sequence.

2. The software behavior prediction method based on beta distribution and Bayesian estimation according to claim 1, characterized in that: weights are set for the historical sequence of single software behaviors in S4, wherein the farther a behavior is from the current time, the lower its weight, and the closer it is to the current time, the higher its weight.

3. The software behavior prediction method based on beta distribution and Bayesian estimation according to claim 2, characterized in that: the weights set for the historical sequence of single software behaviors in S4 have two options, discrete and linear.