CN110653824A

CN110653824A - Method for characterizing and generalizing discrete trajectory of robot based on probability model

Info

Publication number: CN110653824A
Application number: CN201910973457.3A
Authority: CN
Inventors: 林立民
Original assignee: Tongji Institute Of Artificial Intelligence (suzhou) Co Ltd
Current assignee: Tongji Institute Of Artificial Intelligence (suzhou) Co Ltd
Priority date: 2019-07-26
Filing date: 2019-10-14
Publication date: 2020-01-07
Anticipated expiration: 2039-10-14
Also published as: CN110653824B

Abstract

The invention relates to a characterization and generalization method of discrete type track of robot based on probability model, which comprises the following steps: splitting the track into multiple sections, respectively teaching each section of track, acquiring a data source of discrete track characterization, and characterizing the discrete track: modeling the robot track based on a plurality of GMMs, extracting the correlation among a plurality of sections of tracks, representing the teaching track, and generalizing the track to output: and splicing the multiple sections of tracks through GMR to realize generalized output of the tracks, so that the output tracks have smoothness. The teaching process is simplified and the operability is strong; smooth splicing can be carried out on each track based on the time information; by learning the multitask constraint relation of the multiple mechanical arms, the multiple mechanical arms of the robot can cooperatively complete multitask.

Description

Method for characterizing and generalizing discrete trajectory of robot based on probability model

Technical Field

The invention relates to a method for characterizing and generalizing discrete type tracks of a robot based on a probability model.

Background

Teaching learning allows a robot to learn how a human performs a smart operation task in an unknown environment and to generate a robot trajectory meeting the requirements in a new environment and task goals. The trajectory generation strategy based on teaching learning can fully extract the characteristics of the teaching trajectory and generate the robot trajectory with certain generalization.

Discrete tracks are common in human life, such as Chinese character writing. But the teaching learning field has less research on feature extraction of discrete trajectories. The current research on the characterization and generalization of the discrete trajectory of the robot mainly has the following defects:

(1) and the teaching track is complicated and tedious. The acquisition of the discrete teaching data is basically obtained by continuous operation at present. When the learned track is complex, the teaching strategy is obviously complicated, a great amount of time and cost are needed in the teaching process, and a certain skill is also needed to continuously design the track.

(2) And lack of corresponding trajectory stitching strategies. Different from a continuous track, a discrete track is composed of multiple sections of tracks, certain space constraint and time constraint exist among the sections of tracks, corresponding strategies are lacked in the current researches on the characterization and generalization of the discrete track to splice the multiple discrete tracks, and the final output track can not have good smoothness while the related constraint of the original track is met.

Disclosure of Invention

The invention aims to provide a method for characterizing and generalizing discrete type tracks of a robot based on a probability model.

In order to achieve the purpose, the invention adopts the technical scheme that:

a method for characterizing and generalizing discrete trajectory of robot based on probability model includes:

(1) teaching of discrete trajectory: splitting the track into a plurality of sections, respectively teaching each section of track to obtain a data source of discrete track representation,

(2) and characterizing the discrete type track: modeling the robot track based on a plurality of Gaussian Mixture Models (GMMs), extracting the correlation among a plurality of sections of tracks, representing the teaching track,

(3) and outputting track generalization: and splicing the multiple tracks through Gaussian Mixture Regression (GMR) to realize generalized output of the tracks, so that the output tracks have smoothness.

Preferably, the data are clustered by using a k-means clustering algorithm (k-means), the class of the teaching data is divided, and the data of the Gaussian Mixture Model (GMM) are learned by using a maximum expectation algorithm (EM algorithm).

Further preferably, the learning of the data of the Gaussian Mixture Model (GMM) using the maximum expectation algorithm (EM algorithm) is performed cyclically using the estimation step (E-step) and the maximization step (M-step) until the parameters converge.

Further preferably, in the estimating step (E-step), it includes:

(1) classifying and dividing the sampling data,

(2) and the probability P (yt, gamma t | mu, Σ, pi) of generating a sample is determined for each class (K1, 2 … K),

(3) and solving a probability Q function generated by the sampling data.

Further preferably, the Q-function is maximized in a maximization step (M-step) to optimize parameters of the Gaussian Mixture Model (GMM).

Preferably, in (1): and acquiring a data source of discrete track representation by using a dragging teaching strategy.

Preferably, in (2): and (3) utilizing Matlab programming to realize a Gaussian Mixture Model (GMM) to characterize the teaching track.

Preferably, when modeling the robot trajectory by a Gaussian Mixture Model (GMM), a probability model is used to extract the correlation between the multiple segments of the trajectory.

Preferably, in (3): when the track is generalized and outputted by the Gaussian Mixture Regression (GMR), the generalized output is saved as a mat file as a desired track of the control system.

Preferably, the method further comprises (4) utilizing the generalized output trajectory as a desired trajectory for the control system, such that the robot trajectory tracks the desired trajectory

Further preferably, the robot performs the trajectory learning of the control system by multi-robot multi-task coordination.

Due to the application of the technical scheme, compared with the prior art, the invention has the following advantages and effects:

1. discretization and subsection teaching are carried out on the discrete track of the complex robot, so that the teaching process is simplified and the operability is strong;

2. representing multiple discrete tracks by adopting multiple GMMs, introducing time dimension information among the tracks, and performing smooth splicing on the tracks based on the time information when the GMRs can be finally used for track generalization output;

3. by learning the multitask constraint relation of the multiple mechanical arms, the multiple mechanical arms of the robot can cooperatively complete multitask.

Drawings

FIG. 1 is a schematic flow chart of the method of this embodiment;

FIG. 2 is a diagram illustrating GMM model parameter learning in this embodiment;

FIG. 3 is a graph of the trajectory teaching, GMM characterization, and GMR generalized output trajectory for the present embodiment;

FIG. 4 is a representation and generalized trajectory diagram of a trajectory "typing" continuous teaching strategy;

FIG. 5 is a representation and generalized trajectory diagram of a discrete teaching strategy for trajectory typing;

FIG. 6 is a diagram of the trajectory output of multi-robot multi-task coordination.

Detailed Description

The invention is further described below with reference to the accompanying drawings and embodiments:

as shown in the figure: a method for characterizing and generalizing discrete trajectory of robot based on probability model includes:

(1) teaching of discrete trajectory:

the source of the teaching data is obtained by using a strategy of dragging teaching, and the teaching data is firstly expressed: for two-dimensional teaching data, this is expressed herein as:

wherein, y_i,s，y_i,tThe spatial information and the time information of the teaching trajectory are respectively shown, and T shows the number of teaching points in the teaching trajectory.

(2) And characterizing discrete tracks and learning parameters:

for a multidimensional teaching variable y, the modeled GMM is:

wherein p (y) represents a probability density function, N (y, μ)_k，∑_k) Is expressed in μ_kIs mean value, Σ_kIs a gaussian probability density function of the covariance matrix.

Compared with the parameter estimation of a Gaussian model, the parameter estimation of the Gaussian mixture model is more complicated, and the main reason is that the existence of hidden variables cannot utilize a maximum likelihood estimation method to obtain the parameters of the model. For the teaching sample set Y ═ (Y)₁，y₂…y_T) By an implicit variable gamma_t，KCan be expanded into full data:

(y_t，γ_t，1，γ_t，2…γ_t，K)，t＝1，2...T (3)

if y_tFrom class 1 sampling, then there is γ_t，1＝1，γ_t，2＝0…γ_t，KIs represented by (y) 0_t，1，0，…0)。

The likelihood function for the complete data is:

the log-likelihood function for the full data is:

the Q function is defined as follows:

wherein, E (γ)_t，K|y_t，μⁱ，∑ⁱ，πⁱ) Is an estimate of γ:

the Q function is derived and its derivative is 0, which can be:

wherein

Respectively represent the (i +1) th iteration, the mean of the kth class, the covariance matrix, and the occupied weight.

(3) And outputting track generalization:

and after the teaching track is subjected to GMM representation coding, utilizing GMR to output the track.

Data point y for the teach path [ y ═ y^I，y^o]First, the distribution P (y) of the teaching data points is calculated using a probability model^I，y^o) Modeled as GMM, followed by computation of the condition variable (y) by GMR^o|y^I) Is desired E (y)^o|y^I) And covariance Cov (y)^o|y^I) Mixing E (y)^o|y^I) As a generalized output data point, in Cov (y)^o|y^I) And generating a motion track with smoothness under the constraint.

For a dataset of T D-dimensional teach data points, the GMM is modeled as follows:

wherein, pi_kIs the prior probability of the model, N (y, μ)_k，∑_k) Is measured in mu_kAs a mean value, by ∑_kIs a gaussian distribution of variance and has:

at a given y^IAnd the k-th Gaussian distribution, the condition variable (y)^o|y^IK) also follows a gaussian distribution, i.e.:

(y^o|y^I，k)～N(μ′_k，∑′_k) (13)

wherein, mu_k，∑_kRespectively as follows:

for the entire GMM, then there is (y)^o|y^I) Satisfies the following conditions:

wherein h is_kSatisfies the following conditions:

from this, the condition variable (y)^o|y^I) The mean μ and covariance ∑ of are:

(4) and designing a control system:

and designing a control system on a working space, and carrying out a multi-task collaborative track learning strategy by a plurality of mechanical arms of the robot. The teaching data set for GMM may be represented as y ═ y^I，y^o]Wherein y is^IAnd y^oRespectively a query vector and a vector to be encoded. In GMM-based robot trajectory characterization learning, the teaching data set is y ═ y^I _t，y^o _s]That is, the query vector is time information, and the vector to be encoded is spatial information of the track. For multi-robot multi-task collaborative track learning, a two-dimensional space vector (y) of a certain robot is used^I _s1，y^I _s2) As a query vector, the two-dimensional space vector (y) of the remaining mechanical arms^O _s1，y^O _s2，y^O _s3，y^O _s4…y^O _s2n) And as a vector to be coded, performing the characterization and learning of the track. For example, for the two-robot-arm two-task learning, it is necessary to construct a 4-dimensional (2 × 2) robot teaching dataset y ═ y^I _s1，y^I _s2，y^O _s1，y^O _s2) I.e. y^I＝(y^I _s1，y^I _s2)，y＝(y^O _s1，y^O _s2). During the track generalization output process, y^IFor query points, P (y) is estimated using GMR^o|y^I) For the rest two-dimensional space information y^OAnd outputting to realize the double-task cooperation of the double mechanical arms.

The above embodiments are merely illustrative of the technical ideas and features of the present invention, and the purpose thereof is to enable those skilled in the art to understand the contents of the present invention and implement the present invention, and not to limit the protection scope of the present invention. All equivalent changes and modifications made according to the spirit of the present invention should be covered within the protection scope of the present invention.

Claims

1. A characterization and generalization method of discrete trajectory of robot based on probability model is characterized in that: the method comprises the following steps:

(2) and characterizing the discrete type track: modeling robot tracks through a plurality of Gaussian Mixture Models (GMMs), extracting the correlation among a plurality of sections of tracks, representing teaching tracks,

(3) and outputting track generalization: and splicing the multiple tracks through Gaussian Mixture Regression (GMR) to realize generalized output of the tracks.

2. The probabilistic model-based characterization and generalization method for discrete trajectories of robots according to claim 1, wherein: and clustering the data by using a k-means clustering algorithm (k-means), dividing the class of the teaching data, and learning the data of the Gaussian Mixture Model (GMM) by using a maximum expectation algorithm (EM algorithm).

3. The probabilistic model-based characterization and generalization method for discrete trajectories of robots according to claim 2, wherein: when learning data of a Gaussian Mixture Model (GMM) by using a maximum expectation algorithm (EM algorithm), the estimation step (E-step) and the maximization step (M-step) are used for circulating until parameters are converged.

4. The probabilistic model-based characterization and generalization method for discrete trajectories of robots according to claim 3, wherein: in the estimating step (E-step), it comprises:

(1) classifying and dividing the sampling data,

(3) and solving a probability Q function generated by the sampling data.

5. The probabilistic model-based characterization and generalization method for discrete trajectories of robots according to claim 4, wherein: the Q function is maximized in a maximization step (M-step) to optimize the parameters of the Gaussian Mixture Model (GMM).

6. The probabilistic model-based characterization and generalization method for discrete trajectories of robots according to claim 1, wherein: in (1): and acquiring a data source of discrete track representation by using a dragging teaching strategy.

7. The probabilistic model-based characterization and generalization method for discrete trajectories of robots according to claim 1, wherein: in (2): and (3) utilizing Matlab programming to realize a Gaussian Mixture Model (GMM) to characterize the teaching track.

8. The probabilistic model-based characterization and generalization method for discrete trajectories of robots according to claim 1, wherein: when the robot track is modeled by a Gaussian Mixture Model (GMM), a probability model is used for extracting the correlation among multiple sections of tracks.

9. The probabilistic model-based characterization and generalization method for discrete trajectories of robots according to claim 1, wherein: in (3): when the track is generalized and outputted by the Gaussian Mixture Regression (GMR), the generalized output is saved as a mat file as a desired track of the control system.

10. The probabilistic model-based characterization and generalization method for discrete trajectories of robots according to claim 1, wherein: it also includes (4), control system design: and utilizing the track of the generalized output as a desired track of the control system, so that the robot track tracks the desired track.