CN114131600A - Method and system for generating robot source search scheme based on Gaussian mixture model - Google Patents
Method and system for generating robot source search scheme based on Gaussian mixture model Download PDFInfo
- Publication number
- CN114131600A CN114131600A CN202111457525.4A CN202111457525A CN114131600A CN 114131600 A CN114131600 A CN 114131600A CN 202111457525 A CN202111457525 A CN 202111457525A CN 114131600 A CN114131600 A CN 114131600A
- Authority
- CN
- China
- Prior art keywords
- gaussian
- source
- mixture model
- gaussian mixture
- robot
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B25—HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
- B25J—MANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
- B25J9/00—Programme-controlled manipulators
- B25J9/16—Programme controls
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B25—HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
- B25J—MANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
- B25J9/00—Programme-controlled manipulators
- B25J9/16—Programme controls
- B25J9/1679—Programme controls characterised by the tasks executed
Landscapes
- Engineering & Computer Science (AREA)
- Robotics (AREA)
- Mechanical Engineering (AREA)
- Feedback Control In General (AREA)
Abstract
The invention discloses a method and a system for generating a robot source search scheme based on a Gaussian mixture model, wherein the method for generating the robot source search scheme based on the Gaussian mixture model comprises the following steps: establishing a source searching process model and determining an estimation representation of a source item parameter; aiming at a source search process model, representing probability estimation of source item parameters by using a particle filtering method; reconstructing probability estimation of the particles by using a Gaussian mixture model; determining an action scheme for searching autonomous sources of the control robot according to parameters of Gaussian distribution in the Gaussian mixture model; and (5) carrying out numerical experiments and adjusting the optimal parameters of the Gaussian mixture model. According to the method, the Gaussian mixture model is used for fitting the parameters obtained by the distribution fitting of the particles in the particle filter to control the action of the robot and generate the source search action scheme, so that the source search efficiency is greatly improved, and the calculation cost of the source search is reduced.
Description
Technical Field
The invention relates to the field of robot control, in particular to a method and a system for generating a robot source searching scheme based on a Gaussian mixture model, which are suitable for various gas signal source searching scenes, such as harmful gas leakage in chemical industrial parks, ocean oil spill pollution events and the like.
Background
In nature, it is particularly important to find weak, intermittent or noisy sources of gaseous signal emissions. Animals are hunted and searched for spouses by tracking odor signals, human beings provide an emergency decision scheme for the events such as chemical plant leakage, ocean oil spill pollution and the like by utilizing a signal source searching algorithm, and the perception robot can search for the sources of gas and radiation by an autonomous searching method. For an autonomous searcher, an efficient autonomous search action scheme is of great help in performing search tasks.
Biologically inspired algorithms have investigated the way animals find sources of odors and perform path planning. Random search is a search mode ubiquitous in nature. For example, ants in deserts follow an archimedes spiral path while foraging. Predatory fish in open sea areas follow the Levy pattern when predating in areas with rare prey, which can be demonstrated by a large number of motion data sets. When a priori knowledge or sensory cues can be perceived, the searcher will use some search strategies to obtain more additional useful information. Some organisms (e.g. e.coli, lobster) tend to move in the direction of the concentration gradient, which is defined as the Chemotaxis strategy. In addition, many work locates odor sources by mimicking the behavior of moths and proposes the silk algorithm. The biologically inspired search algorithm has high computational efficiency but relies on significant concentration gradients and concentration boundaries. However, in a real-world environment, turbulence can break the stable plume into disconnected blobs, disrupting the process of source search.
Most of the previously mentioned bio-inspired search strategies can be considered reactive. In addition, methods based on the principles of information theory have also been developed, i.e. cognitive search strategies. Vergasola et al proposed the earliest cognitive search strategy, known as the Infotaxis algorithm, which uses a grid-based approach to maintain information state. Ristic refines the Infotaxi algorithm by using sparse sensing cues in the form of sporadic non-zero sensor measurements. Hutchinson et al propose another cognitive search algorithm, Entrotaxi, which designs a reward function based on maximum entropy sampling. These search strategies express the source search problem as a particle filter-based Partially Observable Markov Decision Process (POMDP), replacing the grid-based approach. The proposal of the cognitive search strategy greatly improves the source search performance under the turbulent flow condition. However, the huge computational burden of cognitive strategies results in lengthy search times, which limits their application in practical search tasks. Therefore, it is important and challenging to improve the efficiency of cognitive strategies.
In summary, in the conventional solution process, for the source search problem based on the cognitive search strategy, the source item estimation is represented by a particle filtering method, and then a reward function is calculated to control the action of the robot. However, the calculation of the reward function is too complex, so that the efficiency of source search is influenced, and the calculation cost is increased.
Disclosure of Invention
The technical problems to be solved by the invention are as follows: aiming at the problems in the prior art, the invention provides a method and a system for generating a robot source search scheme based on a Gaussian mixture model.
In order to solve the technical problems, the invention adopts the technical scheme that:
a method for generating a robot source search scheme based on a Gaussian mixture model comprises the following steps:
1) establishing a source searching process model and determining an estimation representation of a source item parameter;
2) aiming at a source search process model, representing probability estimation of source item parameters by using a particle filtering method;
3) reconstructing probability estimation of the particles by using a Gaussian mixture model;
4) determining an action scheme for searching autonomous sources of the control robot according to parameters of Gaussian distribution in the Gaussian mixture model;
5) and (5) carrying out numerical experiments and adjusting the optimal parameters of the Gaussian mixture model.
Optionally, the source search process model established in step 1) is a partial observable markov decision process modeled as a source search process, and a functional expression of the source search process model is obtained as follows:
in the above formula, R (R | θ)0) Is the gas concentration at an arbitrary position r ═ { x, y } position, θ0={r0Q represents the source term parameter of the source, r0={x0,y0The position of the source in the two-dimensional search domain Ω, Q the diffusion intensity, a the spherical sensor radius, V the mean wind speed, D the effective diffusion coefficient of the gas, λ the intermediate variable, τ the lifetime of the gas molecules.
Optionally, the determining of the estimated representation of the source item parameters in step 1) refers to an information state in a partial observable markov decision process, and a bayesian framework shown in the following formula is adopted to estimate and update the posterior probability density function PDF of any k-th step:
in the above formula, θkRepresenting the source term parameters of step k, D1:k={d1(r1),d2(r2),…,dk(rk) Is the accumulated information of the time step k, d1(r1),d2(r2),…,dk(rk) Are respectively at an arbitrary position r1~rkInformation obtained, P (theta)k-1|D1:k-1) As a posterior probability density function PDF, P (d) of step k-1k(rk)|θk) For the probability of induction measurement, P (d)k(rk)|D1:k-1) Is a weight of probability density, and P (d)k(rk)|D1:k-1) The formula of the calculation function is:
P(dk(rk)|D1:k-1)=∫P(θk-1|D1:k-1)P(dk(rk)|θk)dθk,(3)
probability of induction measurement P (d)k(rk)|θk) The formula of the calculation function is:
in the above formula, P (d | θ)0) Is the probability of contacting a gas molecule d times per unit time.
Optionally, the step 2) of representing the probability estimation of the source parameter by using the particle filtering method specifically means that the probability estimation of the source parameter is implemented by using a sequential monte carlo framework and the particle filtering method, and the specific steps include: generating PDF with a posterior probability density function by sampling any k-th stepN random samples of the weights, wherein,represents an estimate of the parameters of the source item,is the associated weight, and the sum of the weights of the N random samples is 1, thereby expressing the posterior probability density function PDF of any k-th step as:
sampling from the proposed distribution to approximate the a posteriori probability density function PDF of any k-th step using a sequential importance sampling method using a sequential monte carlo framework, whereby the update of the a posteriori probability density function PDF of any k-th step is converted to an update of the associated weights as shown by:
in the above formula, the first and second carbon atoms are,is the non-normalized weight of the particle,in order to be the probability of a state transition,can be obtained from the formula (4) below,an importance function in the importance sampling principle;
presume q (θ) assuming that the source term parameters remain unchanged during the source search processk)=P(θk-1|D1:k-1) Thereby obtaining an updated simplified expression of the correlation weights as shown in the following equation:
and for the updated simplified expression of the correlation weights, an effective sampling proportion ESS shown by the following formula is adopted to define the degradation degree of the particle filter adopted in the particle filtering method:
when effective sampling ratio ESS is lower than threshold NTAnd a resampling method is adopted to solve the problem of particle degradation.
Optionally, when reconstructing a probability estimation of a particle using a gaussian mixture model in step 3), a function expression that a posterior probability density function PDF for an arbitrary k-th step is approximated by a density of the gaussian mixture model is:
in the above formula, PG(theta) represents an approximate Gaussian mixture model probability density function, K represents the number of Gaussian distributions, and coefficient piiSatisfy sigma pii=1,N(θ|μi,Σi) Representing covariance arrays sigma with a gaussian distributioniGaussian distribution of (u)iIs mean value, ΣiIs a Gaussian distribution covariance array; assuming that the distribution of the sampled particles is approximately PG(θ), introducing a hidden variable γ, which is a binary random variable of K dimensions, wherein among K values of the hidden variable γ, only one specific element γ (K) is 1, and the other elements are 0, so as to determine which gaussian model hidden variable in the gaussian mixture model is sampled by the sample, and the corresponding log-likelihood function functional expression is:
in the above formula, L (pi)i,μi,Σi) As a maximum likelihood function, gammaiFor hiding the ith dimension, π, of the variable γkIs a Gaussian distribution coefficient, N (θ | μk,Σk) Representing covariance arrays sigma with a gaussian distributionk(ii) a gaussian distribution of; and finally, solving the estimation of the weighted log-likelihood function of the Gaussian mixture model by adopting a maximum expectation algorithm EM.
Optionally, the solving of the estimate of the weighted log-likelihood function of the gaussian mixture model using the maximum expectation algorithm EM includes:
s1) setting initial parameters including random sample number N, gaussian distribution covariance array ΣkDistribution mean μkAnd the number of iterations mloop(ii) a Setting an initial cycle variable k and a cycle variable i to be 1;
s2) judging that the loop variable k is equal to the iteration number mloopIf the result is not true, skipping to the next step; if not, then,
s3) traverse the cyclic variable i from 1 to the random number of samples N and calculate a reference probability η (i, k) for each component:
in the above formula, N (θ (i) |. mu.)k,Σk) Representing covariance arrays sigma with a gaussian distributionk(ii) a gaussian distribution of; k represents the number of Gaussian distributions,. pikAnd pijAre all Gaussian distribution coefficients;
s4) calculating a mean value μ from the reference probability η (i, k) of each component based on the following equationkGaussian distribution covariance array sigmakGaussian coefficient of distribution pikAnd the value of the log-likelihood function lk;
lk=lnL(πi,μi,Σi),(15)
In the above formula, γ (i) represents a local weight;
s5) judging condition lk-lk-1Whether epsilon is less than or equal to epsilon or not is true, if not, jumping to execute the step S2); otherwise, the parameter mean value mu of the finally obtained Gaussian mixture modelkGaussian distribution covariance array sigmakGaussian coefficient of distribution pikAnd outputting as a result.
Optionally, step 4) comprises: firstly, searching a region where the maximum Gaussian distribution is located, defining the average central position of particle samples owned by the maximum Gaussian distribution as the center of the region, and after determining the center, firstly controlling the robot to go to the center of the region for searching: and (3) searching the whole area by using the area center as a search starting point and applying a square search strategy, and terminating the square search mode when the collected information updates the Gaussian mixture model to generate a new maximum Gaussian distribution area, so that a searcher moves to the new area center to search.
Optionally, the adjusting of the optimal parameters of the gaussian mixture model in step 5) specifically means adjusting the optimal number K of gaussian distributions.
In addition, the invention also provides a device for generating the robot source searching scheme based on the Gaussian mixture model, which comprises a microprocessor and a memory which are connected with each other, wherein the microprocessor is programmed or configured to execute the steps of the method for generating the robot source searching scheme based on the Gaussian mixture model.
Furthermore, the present invention also provides a computer-readable storage medium having stored therein a computer program programmed or configured to execute the method for generating the gaussian mixture model-based robot source search scheme.
Compared with the prior art, the invention has the following advantages: modeling a source searching process based on a cognitive search strategy, and representing source item parameters as information states; converting the information state into probability estimation by using a particle filtering method; extracting spatial information of particle distribution, and expressing the particle filter estimation into a plurality of Gaussian distributions by using a Gaussian mixture model; determining a robot autonomous source searching action scheme based on a Gaussian mixture model according to the parameters of Gaussian distribution; the optimal Gaussian mixture model parameters are determined through Monte Carlo simulation experiments, the Gaussian mixture model is used for fitting the distribution of particles in the particle filter, the action of the robot is controlled according to the parameters obtained through fitting, and a source searching action scheme is generated.
Drawings
FIG. 1 is a schematic diagram of a basic flow of a method according to an embodiment of the present invention.
Fig. 2 is a schematic diagram of a maximum gaussian distribution region searching method in the embodiment of the present invention.
Fig. 3 is a schematic diagram of a robot autonomous source search scheme based on a gaussian mixture model in an embodiment of the present invention.
FIG. 4 is a diagram illustrating experimental results of Gaussian mixture model parameters in an embodiment of the present invention.
Detailed Description
The following will take the gas leakage source search problem as an example, and further describe the generation method and system of the robot source search scheme based on the gaussian mixture model in detail.
As shown in fig. 1, the method for generating the robot source search scheme based on the gaussian mixture model in this embodiment includes:
1) establishing a source searching process model and determining an estimation representation of a source item parameter;
2) aiming at a source search process model, representing probability estimation of source item parameters by using a particle filtering method;
3) reconstructing probability estimation of the particles by using a Gaussian mixture model;
4) determining an action scheme for searching autonomous sources of the control robot according to parameters of Gaussian distribution in the Gaussian mixture model;
5) and (5) carrying out numerical experiments and adjusting the optimal parameters of the Gaussian mixture model.
As known by cognitive search strategies, the source search process can be modeled as a partially observable Markov decision process, the source term estimation can be expressed as a posterior probability density function, and the position with the highest probability is the position of the estimated source. According to the cognitive search strategy, the source item estimation is expressed as an information state in a partial observable Markov decision process, and the subsequent utilization of the source item estimation is facilitated. In this embodiment, the source search process model established in step 1) refers to modeling a source search process as a partially observable markov decision process, and a function expression of the source search process model is obtained as follows:
in the above formula, R (R | θ)0) Is the gas concentration at an arbitrary position r ═ { x, y } position, θ0={r0Q represents the source term parameter of the source, r0={x0,y0The position of the source in the two-dimensional search domain Ω, Q the diffusion intensity, a the spherical sensor radius, V the mean wind speed, D the effective diffusion coefficient of the gas, λ the intermediate variable, τ the lifetime of the gas molecules.
In this embodiment, the determination of the estimation representation of the source item parameter in step 1) refers to an information state in a partial observable markov decision process, and a bayesian framework shown in the following formula is adopted to estimate and update the posterior probability density function PDF of any k-th step:
in the above formula, θkRepresenting the source term parameters of step k, D1:k={d1(r1),d2(r2),…,dk(rk) Is the time stepCumulative information of length k, d1(r1),d2(r2),…,dk(rk) Are respectively at an arbitrary position r1~rkInformation obtained, P (theta)k-1|D1:k-1) As a posterior probability density function PDF, P (d) of step k-1k(rk)|θk) For the probability of induction measurement, P (d)k(rk)|D1:k-1) Is a weight of probability density, and P (d)k(rk)|D1:k-1) The formula of the calculation function is:
P(dk(rk)|D1:k-1)=∫P(θk-1|D1:k-1)P(dk(rk)|θk)dθk,(3)
probability of induction measurement P (d)k(rk)|θk) The formula of the calculation function is:
in the above formula, P (d | θ)0) Is the probability of contacting a gas molecule d times per unit time. At the beginning of the source search, since there is no source item information of the history, it is assumed that the estimation range of the parameters is determined using a priori knowledge and the initial posterior probability density function PDF is represented using a uniform distribution.
Because the direct calculation of the source item estimation parameters in the source search process is too complex, the bayesian estimation of the source item parameters is realized by adopting a sequential monte carlo frame and a particle filtering method, degraded particles are screened by utilizing a resampling step, and newly sampled particles are added to obtain the particle filtering estimation of the source item parameters. Firstly, obtaining random samples through sampling, then updating a posterior probability density function according to a sequential importance sampling method, screening degraded particles after a resampling step, and adding new sampling particles to obtain particle filter estimation of source item parameters. In this embodiment, the probability estimation of the source parameter by using the particle filtering method in step 2) specifically refers to using a sequential monte carlo framework and the particle filtering methodThe probability estimation of the source item parameters is realized, and the specific steps comprise: generating PDF with a posterior probability density function by sampling any k-th stepN random samples of the weights, wherein,represents an estimate of the parameters of the source item,is the associated weight, and the sum of the weights of the N random samples is 1, i.e.:
the posterior probability density function PDF of any kth step is thus expressed as:
posterior probability density function PDFP (theta) from step kk|D1:k) Intermediate sampling is difficult and therefore sequential importance sampling methods are utilized. At each step, samples are taken from the proposed distribution to approximate P (θ)k|D1:k). The update of the PDF is then converted into an update of the associated weight. In this embodiment, a sequential monte carlo framework is adopted to sample the proposed distribution to approximate the a posteriori probability density function PDF of any k-th step using a sequential importance sampling method, so that the update of the a posteriori probability density function PDF of any k-th step is converted into an update of the correlation weight shown by the following formula:
in the above formula, the first and second carbon atoms are,is the non-normalized weight of the particle,in order to be the probability of a state transition,can be obtained from the formula (4) below,an importance function in the importance sampling principle;
presume q (θ) assuming that the source term parameters remain unchanged during the source search processk)=P(θk-1|D1:k-1) Thereby obtaining an updated simplified expression of the correlation weights as shown in the following equation:
and for the updated simplified expression of the correlation weights, an effective sampling proportion ESS shown by the following formula is adopted to define the degradation degree of the particle filter adopted in the particle filtering method:
when effective sampling ratio ESS is lower than threshold NTAnd a resampling method is adopted to solve the problem of particle degradation.
The particle filter estimation of the source item parameters actually converts source item information into a point estimation of particles, and on one hand, the point estimation can be used for carrying out calculation based on a cognitive search strategy and calculation of a reward function from the perspective of data information, and also can be used for extracting information from the perspective of space. The Gaussian mixture model is a model capable of representing the overall distribution of particles in space, and source item information is fully acquired by fitting point estimation into a probability model with a plurality of Gaussian distributions to obtain definite parameters of source item estimation. Any posterior probability density function PDF can be approximated by the density of the gaussian mixture model, so when reconstructing a probability estimation of a particle by using the gaussian mixture model in step 3) of this embodiment, the function expression of the posterior probability density function PDF of any k-th step approximated by the density of the gaussian mixture model is:
in the above formula, PG(theta) represents an approximate Gaussian mixture model probability density function, K represents the number of Gaussian distributions, and coefficient piiSatisfy sigma pii=1,N(θ|μi,Σi) Representing covariance arrays sigma with a gaussian distributioniGaussian distribution of (u)iIs mean value, ΣiIs a Gaussian distribution covariance array; assuming that the distribution of the sampled particles is approximately PG(θ), introducing a hidden variable γ, which is a binary random variable of K dimensions, wherein among K values of the hidden variable γ, only one specific element γ (K) is 1, and the other elements are 0, so as to determine which gaussian model hidden variable in the gaussian mixture model is sampled by the sample, and the corresponding log-likelihood function functional expression is:
in the above formula, L (pi)i,μi,Σi) As a maximum likelihood function, gammaiFor hiding the ith dimension, π, of the variable γkIs a Gaussian distribution coefficient, N (θ | μk,Σk) Representing covariance arrays sigma with a gaussian distributionk(ii) a gaussian distribution of; finally adopting maximum expectation algorithm EM to solve Gaussian mixtureEstimation of a weighted log-likelihood function of the model.
In this embodiment, the estimation of the weighted log-likelihood function of the gaussian mixture model by using the maximum expectation algorithm EM includes:
s1) setting initial parameters including random sample number N, gaussian distribution covariance array ΣkDistribution mean μkAnd the number of iterations mloop(ii) a Setting an initial cycle variable k and a cycle variable i to be 1;
s2) judging that the loop variable k is equal to the iteration number mloopIf the result is not true, skipping to the next step; if not, then,
s3) traverse the cyclic variable i from 1 to the random number of samples N and calculate a reference probability η (i, k) for each component:
in the above formula, N (θ (i) |. mu.)k,Σk) Representing covariance arrays sigma with a gaussian distributionk(ii) a gaussian distribution of; k represents the number of Gaussian distributions,. pikAnd pijAre all Gaussian distribution coefficients;
s4) calculating a mean value μ from the reference probability η (i, k) of each component based on the following equationkGaussian distribution covariance array sigmakGaussian coefficient of distribution pikAnd the value of the log-likelihood function lk;
lk=lnL(πi,μi,Σi),(15)
In the above formula, γ (i) represents a local weight;
s5) judging condition lk-lk-1Whether epsilon is less than or equal to epsilon or not is true, if not, jumping to execute the step S2); otherwise, the parameter mean value mu of the finally obtained Gaussian mixture modelkGaussian distribution covariance array sigmakGaussian coefficient of distribution pikAnd outputting as a result.
The probability distribution of the source term parameters is fitted to a plurality of gaussian distributions using a gaussian mixture model algorithm. However, it is necessary to consider how to use multiple gaussian distributed parameters to obtain an efficient search algorithm. After obtaining a plurality of probability models of gaussian distribution of the overall distribution, we can control the action of the robot according to the parameters of the mixed distribution. Each of the plurality of gaussian distributions has its own gaussian distribution parameter, and each gaussian distribution has its scaling factor in the overall distribution, and based on the parameters of the gaussian distributions and their scaling factors, we can determine that there is a higher probability that a source exists in a particular region. Exploring these areas requires the use of some commonly used search algorithms, while the robot updates the source item estimates in time based on the detected results, thus generating a new robot source search action scheme.
After obtaining multiple gaussian distributed parameters, we can use these parameters to determine a robot source search action scheme based on gaussian mixture distribution. In the gaussian mixture model, the gaussian distribution with the largest ratio of the dominant samples among all samples is defined as the largest effective gaussian distribution, i.e., the gaussian distribution with the largest ratio coefficient. We first search for the region where the maximum gaussian distribution is located, since the probability that this region is the region where the source location is located is the greatest. The average center position of the particle samples possessed by the maximum gaussian distribution is defined as the center of the region. After the center is determined, the robot is controlled to go to the center of the area to search. The center of the area is used as a search starting point, and a square search strategy is applied to search the whole area, as shown in fig. 2. When the collected information updates the Gaussian mixture model to generate a new maximum Gaussian distribution area, the square search mode is terminated, and a searcher goes to the center of the new area to search. Therefore, in this embodiment, step 4) includes: firstly, searching a region where the maximum Gaussian distribution is located, defining the average central position of particle samples owned by the maximum Gaussian distribution as the center of the region, and after determining the center, firstly controlling the robot to go to the center of the region for searching: and (3) searching the whole area by using the area center as a search starting point and applying a square search strategy, and terminating the square search mode when the collected information updates the Gaussian mixture model to generate a new maximum Gaussian distribution area, so that a searcher moves to the new area center to search.
Some parameters in the gaussian mixture model need to be preset in advance, and the number of the parameters such as gaussian distribution has an important influence on the generation of an action scheme, so that a monte carlo simulation experiment needs to be carried out to find the influence of the parameters of the gaussian mixture model on the robot source searching efficiency. In this embodiment, the adjusting of the optimal parameter of the gaussian mixture model in step 5) specifically means adjusting the optimal gaussian distribution number K. The initial parameter settings of the gaussian mixture model have an important influence on the result of the fitting. The number of gaussian distributions is particularly important, so that a large number of monte carlo experiments are performed to test parameters in the embodiment to select the optimal number K of gaussian distributions.
As shown in fig. 4, in this embodiment, 100 simulation experiments were performed under different gaussian distribution numbers K and source intensities Q, and the simulation parameters of the experiments are as follows: the value range Y of the coordinate Y is 10, the value range X of the coordinate X is 10, r0={6,6.5},r={0.5,0.5},Q=1,V=1,a=1τ=250,D=1,N=3000,mloop1000. When the distance between the position of the autonomous searching robot and the source position is less than a certain value, namely Dis<0.5, indicating that the robot successfully found the source, the search terminates. Similarly, if the source location is not searched after 500 steps of searching, the searching fails. Experiments evaluate the effectiveness of the robot autonomous source search action scheme based on the Gaussian mixture model under different conditions. It was found that the performance of the course of action under all conditions is best when the number of gaussian distributions K has a value of 3.
In summary, the method of the present embodiment can provide an autonomous source search action scheme for various gas source search problems, and the method of the present embodiment models a source search process based on a cognitive search strategy and re-represents source item parameters by using a particle filtering method. In order to improve the operation efficiency, a Gaussian mixture model is introduced to process the result of the particle filter estimation, and the point estimation of the particles is fitted into a plurality of Gaussian distributions. According to the parameters of the Gaussian distribution, the scheme of robot autonomous source searching action based on the Gaussian mixture model is designed. Compared with the existing autonomous source searching scheme, the method greatly improves the algorithm efficiency and the source searching success rate, and has the advantages of wide applicable scenes, high searching efficiency and success rate and strong robustness under various source searching conditions.
In addition, the present embodiment also provides a device for generating a robot source search solution based on a gaussian mixture model, which includes a microprocessor and a memory connected to each other, where the microprocessor is programmed or configured to execute the steps of the method for generating a robot source search solution based on a gaussian mixture model.
Furthermore, the present embodiment also provides a computer-readable storage medium, in which a computer program programmed or configured to execute the aforementioned generation method of the robot source search scheme based on the gaussian mixture model is stored.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-readable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein. The present application is directed to methods, apparatus (systems), and computer program products according to embodiments of the application, wherein the instructions that execute via the flowcharts and/or processor of the computer program product create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks. These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks. These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The above description is only a preferred embodiment of the present invention, and the protection scope of the present invention is not limited to the above embodiments, and all technical solutions belonging to the idea of the present invention belong to the protection scope of the present invention. It should be noted that modifications and embellishments within the scope of the invention may occur to those skilled in the art without departing from the principle of the invention, and are considered to be within the scope of the invention.
Claims (10)
1. A method for generating a robot source search scheme based on a Gaussian mixture model is characterized by comprising the following steps:
1) establishing a source searching process model and determining an estimation representation of a source item parameter;
2) aiming at a source search process model, representing probability estimation of source item parameters by using a particle filtering method;
3) reconstructing probability estimation of the particles by using a Gaussian mixture model;
4) determining an action scheme for searching autonomous sources of the control robot according to parameters of Gaussian distribution in the Gaussian mixture model;
5) and (5) carrying out numerical experiments and adjusting the optimal parameters of the Gaussian mixture model.
2. The method for generating the robot source search scheme based on the gaussian mixture model according to claim 1, wherein the source search process model established in step 1) is a partial observable markov decision process modeled as a source search process, and the functional expression of the source search process model is obtained as follows:
in the above formula, R (R | θ)0) Is the gas concentration at an arbitrary position r ═ { x, y } position, θ0={r0Q represents the source term parameter of the source, r0={x0,y0The position of the source in the two-dimensional search domain Ω, Q the diffusion intensity, a the spherical sensor radius, V the mean wind speed, D the effective diffusion coefficient of the gas, λ the intermediate variable, τ the lifetime of the gas molecules.
3. The method for generating the robot source search scheme based on the gaussian mixture model according to claim 2, wherein the estimation representation of the source item parameters determined in step 1) is an information state in a partial observable markov decision process, and a bayesian framework shown in the following formula is adopted to estimate and update the posterior probability density function PDF of any k step:
in the above formula, θkRepresenting the source term parameters of step k, D1:k={d1(r1),d2(r2),…,dk(rk) Is the accumulated information of the time step k, d1(r1),d2(r2),…,dk(rk) Are respectively at an arbitrary position r1~rkInformation obtained, P (theta)k-1|D1:k-1) As a posterior probability density function PDF, P (d) of step k-1k(rk)|θk) For the probability of induction measurement, P (d)k(rk)|D1:k-1) Is a weight of probability density, and P (d)k(rk)|D1:k-1) The formula of the calculation function is:
P(dk(rk)|D1:k-1)=∫P(θk-1|D1:k-1)P(dk(rk)|θk)dθk, (3)
probability of induction measurement P (d)k(rk)|θk) The formula of the calculation function is:
in the above formula, P (d | θ)0) Is the probability of contacting a gas molecule d times per unit time.
4. The method for generating the robot source search scheme based on the gaussian mixture model according to claim 3, wherein the step 2) of representing the probability estimation of the source parameter by using the particle filtering method specifically means that the probability estimation of the source parameter is realized by using a sequential monte carlo framework and the particle filtering method, and the specific steps include: generating PDF with a posterior probability density function by sampling any k-th stepN random samples of the weights, wherein,represents an estimate of the parameters of the source item,is the associated weight, and the sum of the weights of the N random samples is 1, thereby expressing the posterior probability density function PDF of any k-th step as:
sampling from the proposed distribution to approximate the a posteriori probability density function PDF of any k-th step using a sequential importance sampling method using a sequential monte carlo framework, whereby the update of the a posteriori probability density function PDF of any k-th step is converted to an update of the associated weights as shown by:
in the above formula, the first and second carbon atoms are,is the non-normalized weight of the particle,in order to be the probability of a state transition,can be obtained from the formula (4) below,an importance function in the importance sampling principle;
presume q (θ) assuming that the source term parameters remain unchanged during the source search processk)=P(θk-1|D1:k-1) Thereby obtaining an updated simplified expression of the correlation weights as shown in the following equation:
and for the updated simplified expression of the correlation weights, an effective sampling proportion ESS shown by the following formula is adopted to define the degradation degree of the particle filter adopted in the particle filtering method:
when effective sampling ratio ESS is lower than threshold NTAnd a resampling method is adopted to solve the problem of particle degradation.
5. The method for generating a robot source search solution based on gaussian mixture model according to claim 4, wherein when reconstructing probability estimation of particles using gaussian mixture model in step 3), the function expression of the posterior probability density function PDF for any k-th step approximated by the density of gaussian mixture model is:
in the above formula, PG(theta) represents an approximate Gaussian mixture model probability density function, K represents the number of Gaussian distributions, and coefficient piiSatisfy sigma pii=1,N(θ|μi,Σi) Representing covariance arrays sigma with a gaussian distributioniGaussian distribution of (u)iIs mean value, ΣiIs a Gaussian distribution covariance array; assuming that the distribution of the sampled particles is approximately PG(θ), introducing a hidden variable γ, which is a binary random variable of K dimensions, wherein among K values of the hidden variable γ, only one specific element γ (K) is 1, and the other elements are 0, so as to determine which gaussian model hidden variable in the gaussian mixture model is sampled by the sample, and the corresponding log-likelihood function functional expression is:
in the above formula, L (pi)i,μi,Σi) As a maximum likelihood function, gammaiFor hiding the ith dimension, π, of the variable γkIs a Gaussian distribution coefficient, N (θ | μk,Σk) Representing covariance arrays sigma with a gaussian distributionk(ii) a gaussian distribution of; and finally, solving the estimation of the weighted log-likelihood function of the Gaussian mixture model by adopting a maximum expectation algorithm EM.
6. The method for generating the Gaussian mixture model-based robot source search scheme according to claim 5, wherein the solving the estimation of the weighted log-likelihood function of the Gaussian mixture model by using the maximum expectation algorithm EM comprises:
s1) setting initial parameters including random sample number N, gaussian distribution covariance array ΣkDistribution mean μkAnd the number of iterations mloop(ii) a Setting an initial cycle variable k and a cycle variable i to be 1;
s2) judging that the loop variable k is equal to the iteration number mloopIf the result is not true, skipping to the next step; if not, then,
s3) traverse the cyclic variable i from 1 to the random number of samples N and calculate a reference probability η (i, k) for each component:
in the above formula, N (θ (i) |. mu.)k,Σk) Representing covariance arrays sigma with a gaussian distributionk(ii) a gaussian distribution of; k represents the number of Gaussian distributions,. pikAnd pijAre all Gaussian distribution coefficients;
s4) calculating a mean value μ from the reference probability η (i, k) of each component based on the following equationkGaussian distribution covariance array sigmakGaussian coefficient of distribution pikAnd the value of the log-likelihood function lk;
lk=lnL(πi,μi,Σi), (15)
In the above formula, γ (i) represents a local weight;
s5) judging condition lk-lk-1Whether epsilon is less than or equal to epsilon or not is true, if not, jumping to execute the step S2); otherwise, the parameter mean value mu of the finally obtained Gaussian mixture modelkGaussian distribution covariance array sigmakGaussian coefficient of distribution pikAnd outputting as a result.
7. The generation method of the robot source search scheme based on the Gaussian mixture model according to claim 6, wherein the step 4) comprises: firstly, searching a region where the maximum Gaussian distribution is located, defining the average central position of particle samples owned by the maximum Gaussian distribution as the center of the region, and after determining the center, firstly controlling the robot to go to the center of the region for searching: and (3) searching the whole area by using the area center as a search starting point and applying a square search strategy, and terminating the square search mode when the collected information updates the Gaussian mixture model to generate a new maximum Gaussian distribution area, so that a searcher moves to the new area center to search.
8. The method for generating a robot-source search solution based on gaussian mixture model as claimed in claim 7, wherein the adjusting of the optimal parameters of the gaussian mixture model in step 5) is to adjust the optimal number of gaussian distributions K.
9. An apparatus for generating a robot-derived search solution based on Gaussian mixture model, comprising a microprocessor and a memory connected to each other, wherein the microprocessor is programmed or configured to execute the steps of the method for generating a robot-derived search solution based on Gaussian mixture model according to any one of claims 1 to 8.
10. A computer-readable storage medium, wherein a computer program is stored in the computer-readable storage medium, the computer program being programmed or configured to perform the method for generating a gaussian mixture model based robot source search solution according to any one of claims 1 to 8.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111457525.4A CN114131600A (en) | 2021-12-01 | 2021-12-01 | Method and system for generating robot source search scheme based on Gaussian mixture model |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111457525.4A CN114131600A (en) | 2021-12-01 | 2021-12-01 | Method and system for generating robot source search scheme based on Gaussian mixture model |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114131600A true CN114131600A (en) | 2022-03-04 |
Family
ID=80386569
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111457525.4A Pending CN114131600A (en) | 2021-12-01 | 2021-12-01 | Method and system for generating robot source search scheme based on Gaussian mixture model |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114131600A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114326755A (en) * | 2022-03-15 | 2022-04-12 | 中国人民解放军国防科技大学 | Robot gas source searching method based on local path planning |
CN117148872A (en) * | 2023-11-01 | 2023-12-01 | 中国人民解放军国防科技大学 | Robot collaborative source searching method, device and equipment under multi-gas diffusion source scene |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103439688A (en) * | 2013-08-27 | 2013-12-11 | 大连理工大学 | Sound source positioning system and method used for distributed microphone arrays |
CN108108336A (en) * | 2016-11-24 | 2018-06-01 | 上海栖芯信息科技有限公司 | A kind of atmosphere pollution source tracing method towards industrial park |
US20190237066A1 (en) * | 2018-01-26 | 2019-08-01 | Sogang University Research Foundation | Source localization method by using steering vector estimation based on on-line complex gaussian mixture model |
CN110795841A (en) * | 2019-10-24 | 2020-02-14 | 北京交通大学 | Mathematical modeling method for uncertainty of intermittent energy output |
CN112488153A (en) * | 2020-11-06 | 2021-03-12 | 中国人民解放军国防科技大学 | Autonomous searching and positioning method for odor sources based on clustering tendency principle |
CN113505431A (en) * | 2021-06-07 | 2021-10-15 | 中国人民解放军国防科技大学 | ST-DQN-based target searching method, device, equipment and medium for marine unmanned aerial vehicle |
-
2021
- 2021-12-01 CN CN202111457525.4A patent/CN114131600A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103439688A (en) * | 2013-08-27 | 2013-12-11 | 大连理工大学 | Sound source positioning system and method used for distributed microphone arrays |
CN108108336A (en) * | 2016-11-24 | 2018-06-01 | 上海栖芯信息科技有限公司 | A kind of atmosphere pollution source tracing method towards industrial park |
US20190237066A1 (en) * | 2018-01-26 | 2019-08-01 | Sogang University Research Foundation | Source localization method by using steering vector estimation based on on-line complex gaussian mixture model |
CN110795841A (en) * | 2019-10-24 | 2020-02-14 | 北京交通大学 | Mathematical modeling method for uncertainty of intermittent energy output |
CN112488153A (en) * | 2020-11-06 | 2021-03-12 | 中国人民解放军国防科技大学 | Autonomous searching and positioning method for odor sources based on clustering tendency principle |
CN113505431A (en) * | 2021-06-07 | 2021-10-15 | 中国人民解放军国防科技大学 | ST-DQN-based target searching method, device, equipment and medium for marine unmanned aerial vehicle |
Non-Patent Citations (1)
Title |
---|
季雅泰: "A strategy for autonomous source searching using the Gaussian Mixture Model to fit the estimate of the source location", IEEE, pages 1817 - 1822 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114326755A (en) * | 2022-03-15 | 2022-04-12 | 中国人民解放军国防科技大学 | Robot gas source searching method based on local path planning |
CN117148872A (en) * | 2023-11-01 | 2023-12-01 | 中国人民解放军国防科技大学 | Robot collaborative source searching method, device and equipment under multi-gas diffusion source scene |
CN117148872B (en) * | 2023-11-01 | 2024-01-26 | 中国人民解放军国防科技大学 | Robot collaborative source searching method, device and equipment under multi-gas diffusion source scene |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Sussillo et al. | Lfads-latent factor analysis via dynamical systems | |
CN114131600A (en) | Method and system for generating robot source search scheme based on Gaussian mixture model | |
Xie et al. | Synthesizing dynamic patterns by spatial-temporal generative convnet | |
Rezende et al. | Stochastic backpropagation and approximate inference in deep generative models | |
CN107085942B (en) | Traffic flow prediction method, device and system based on wolf colony algorithm | |
Karul et al. | Case studies on the use of neural networks in eutrophication modeling | |
CN107146237B (en) | Target tracking method based on online state learning and estimation | |
Sauvage et al. | Optimal quantum control with poor statistics | |
Corucci et al. | Material properties affect evolutions ability to exploit morphological computation in growing soft-bodied creatures | |
CN113361777A (en) | Runoff prediction method and system based on VMD decomposition and IHHO optimization LSTM | |
CN112231964B (en) | Gas leakage source autonomous searching and positioning method based on deep reinforcement learning | |
CN111860787A (en) | Short-term prediction method and device for coupling directed graph structure flow data containing missing data | |
CN112163671A (en) | New energy scene generation method and system | |
CN109151332A (en) | Camera coding based on fitness function exposes optimal code word sequence search method | |
Jiang et al. | Sequential latent variable models for few-shot high-dimensional time-series forecasting | |
CN114154383A (en) | Multi-robot-source search scheme generation method and system based on cognitive search strategy | |
CN113139644B (en) | Information source navigation method and device based on deep Monte Carlo tree search | |
CN103473784B (en) | The blurred picture adaptive restoration method of learning based on higher dimensional space geological information | |
CN107808164B (en) | Texture image feature selection method based on firework algorithm | |
Arakaki et al. | Capturing the diversity of biological tuning curves using generative adversarial networks | |
Cyriac et al. | An overview of the applications of particle swarm in water resources optimization | |
Elamvazhuthi et al. | Controllability to equilibria of the 1-d Fokker-Planck equation with zero-flux boundary condition | |
Xie et al. | Synthesizing Dynamic Patterns by Spatial-Temporal Generative ConvNet | |
Majumder et al. | Two-step approach based multi-objective groundwater remediation in highly heterogeneous media using enhanced random vector functional link integrated with evolutionary marine predator algorithm | |
Alasti | An Efficient Machine Learning Algorithm For Spatial Tracking Of Correlated Signals In Wireless Sensor Field |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |