CN108805974A

CN108805974A - A kind of intensive non-rigid motion structure algorithm based on Grassmann manifold

Info

Publication number: CN108805974A
Application number: CN201810520065.7A
Authority: CN
Inventors: 夏春秋
Original assignee: Shenzhen Vision Technology Co Ltd
Current assignee: Shenzhen Vision Technology Co Ltd
Priority date: 2018-05-28
Filing date: 2018-05-28
Publication date: 2018-11-13

Abstract

A kind of intensive non-rigid motion structure algorithm based on Grassmann manifold proposed in the present invention, its main contents includes Grassmann manifold, non-rigid motion structure formulation and three-dimensional reconstruction experiment and analysis, its process is that carrying out Jim Glassman formulation to trajectory range first indicates；Then, Jim Glassman formulation is carried out to shape space to indicate；Next, carrying out space-time formulation and carrying out bilinearity optimization；Finally, it carries out three-dimensional reconstruction experiment and carries out control variable analysis and motion time analysis.The present invention is based on Grassmann manifolds not to need any Prior Template when handling non-rigid motion structure, can handle with noisy nonlinear deformation and be capable of providing the final result based on basic data collection；For the present invention with better data scalability and with higher reconstruct accuracy, comprehensive performance is optimal.

Description

Grassmann manifold-based dense non-rigid motion structure algorithm

Technical Field

The invention relates to the field of three-dimensional reconstruction, in particular to a dense non-rigid motion structure algorithm based on Grassmann manifold.

Background

The three-dimensional reconstruction technology describes a real scene into a mathematical model conforming to the logical expression of a computer through the processes of depth data acquisition, preprocessing, point cloud registration and fusion, surface generation and the like, and relates to a plurality of subject systems including image processing, stereoscopic vision, mode identification and the like. In clinical medicine, a three-dimensional reconstruction technology can present a two-dimensional organ image in a three-dimensional form, and is beneficial to analysis and planning before an operation; in the cultural relic exhibition field, the three-dimensional reconstruction technology can be used for digitally processing precious cultural relics to build a network-based digital museum system, and can better solve the contradiction between rich exhibits and limited space and time; in addition, the three-dimensional reconstruction technology has important application in the fields of game development, industrial design, aerospace, navigation and the like. However, the existing three-dimensional reconstruction algorithm has the problems of low data expandability, low reconstruction accuracy and the like.

According to the dense non-rigid motion structure algorithm based on the Grassman manifold, Grassman formula expression is firstly carried out on a track space; then, performing Grassmann formulation expression on the shape space; then, performing space-time formulation and bilinear optimization; and finally, carrying out a three-dimensional reconstruction experiment and carrying out control variable analysis and runtime analysis. The invention does not need any prior template when processing the non-rigid motion structure based on the Grassmann manifold, can process the nonlinear deformation with noise and can provide the final result based on the basic data set; the invention has better data expandability, higher reconstruction accuracy and optimal comprehensive performance.

Disclosure of Invention

Aiming at the problems of low data expandability, low reconstruction accuracy and the like of the existing three-dimensional reconstruction algorithm, the invention aims to provide a dense non-rigid motion structure algorithm based on the Grassman manifold, which firstly carries out Grassman formula expression on a track space; then, performing Grassmann formulation expression on the shape space; then, performing space-time formulation and bilinear optimization; and finally, carrying out a three-dimensional reconstruction experiment and carrying out control variable analysis and runtime analysis.

In order to solve the above problems, the present invention provides a dense non-rigid motion structure algorithm based on grassmann manifold, which mainly comprises:

(one) Grassmann manifold;

(II) formulating a non-rigid motion structure;

and (III) three-dimensional reconstruction experiment and analysis.

Wherein said Grassmann manifold, generally denoted asByAll r dimensions of (1) are linear subspace composition, n is more than r; each point on the Grassmann manifold is represented by an n X r matrix (called X), where each column is made up of X tiles, denoted span (X) or abbreviated [ X ]]。

Wherein, the non-rigid motion structure formulation mainly comprises: a grassmann representation of trajectory space, a grassmann representation of shape space, a spatio-temporal formulation and a formulation operation.

Further, the grassmann manifold representation of the trajectory space is isomerously equivalent to a symmetric idempotent matrix, the grassmann manifold can be embedded in a symmetric matrix manifold, and self-expression can be defined in the embedding space:

wherein,representing a tensor constructed by mapping a symmetric matrix of all grassmann data points;representing a matrix of coefficients (K)_sTotal number of spatial groups); e_sAnd performing track group reconstruction error measurement on each manifold geometry.

Further, the grassmann representation of the shape space assumes that temporal deformation proceeds smoothly over time, and the deformed shape is based on local self-expression across frames:

whereinRepresenting a tensor constructed by mapping a symmetric matrix of all grassmann data points;representing a matrix of coefficients (K)_tTotal number of time groups); e_tTime group reconstruction error measurements are made for each manifold geometry.

Further, the space-time formulation combines the grassmann representation of the trajectory space and the grassmann representation of the shape space, and a combination formula (comprising the whole of the local space structure and the non-rigid shape) can be obtained by the constraint conditions and the reprojection error terms of the two:

wherein,Ψ_s＝ξ(C_s，S_s，σ_∈)，S_c＝ζ(Ψ_s，∑_s，V_S，N_S)，w for reprojection error constraint_sAnd R represents a three-dimensional reconstruction.

Further, the formula operates becauseIs a bilinear optimization variable, so the formula (3) is a non-convex problem and is difficult to solve; however, the non-convex problem can be efficiently solved using Augmented Lagrange Methods (ALMs); multiplying lagrange multiplierAnd arbitrary variables (J)_s，J_t) Substituting equation (3) with:

therein, Ψ_s＝ξ(C_s，S_s，σ_∈)，S_s＝ζ(Ψ_s，∑_s，V_S，N_S)， first, the formula ξ (·) calculates the Singular Value Decomposition (SVD) of the C matrix, i.e., C ═ U ·_c，∑_c，V_c]Then form a matrix A, i.e.Wherein sigma_∈Is based on the noise level and X ═ U_c(∑_c)^0.5Obtaining an empirical value; secondly, with A_ijThe grassmann samples are constructed from the S matrix, which reduces the task complexity from vector point index to linear subspace index.

Wherein, the three-dimensional reconstruction experiment and analysis, for the experiment, in order to carry out the quantitative evaluation of the three-dimensional reconstruction, the estimated shape of each frame isAnd the actual shape of the labelArranging up; mean root mean square three-dimensional reconstruction error formulaInitializing the segments by adopting a Kmeans + + algorithm; for the analysis, mainly control variable analysis and runtime analysis are included.

The analysis of the control variables is characterized in that the analysis of the control variables is mainly used for evaluating the importance of space-time limitation in formulation; the performance of the main observation formula is shown in four cases: (a) no space-time limitation; (b) only time limitations exist; (c) only limited in space; (d) with time and space limitations.

The runtime analysis is characterized in that the method is mainly used for comparing with other dense non-rigid motion structure reconstruction methods; the result shows that the expandability of the method in the data set exceeds 50000-70000 points, the method exceeds most methods and has higher reconstruction accuracy, and the comprehensive performance is optimal.

Drawings

FIG. 1 is a system framework diagram of a Grassmann manifold-based dense non-rigid motion structure algorithm of the present invention.

FIG. 2 is a Grassmann representation of a Grassmann manifold-based dense non-rigid motion structure algorithm of the present invention.

FIG. 3 is a three-dimensional reconstruction effect diagram of an image deblurring method based on the total variation of the weighted graph.

Detailed Description

It should be noted that the embodiments and features of the embodiments in the present application can be combined with each other without conflict, and the present invention is further described in detail with reference to the drawings and specific embodiments.

FIG. 1 is a system framework diagram of a Grassmann manifold-based dense non-rigid motion structure algorithm of the present invention. The method mainly comprises the following steps of Grassmann manifold, non-rigid motion structure formulation and three-dimensional reconstruction experiment and analysis.

Grassmann manifold, commonly denoted asByAll r dimensions of (1) are linear subspace composition, n is more than r; grid (C)Each point on the Raman manifold is represented by an n X r matrix (called X), where each column is made up of X tiles, denoted as span (X) or abbreviated as [ X ]]。

The non-rigid motion structure formulation mainly comprises the following steps: a grassmann representation of trajectory space, a grassmann representation of shape space, a spatio-temporal formulation and a formulation operation.

The space-time formulation is realized by combining a Grassmann representation method of a track space and a Grassmann representation method of a shape space, and a combined formula (comprising a local space structure and a non-rigid shape) can be obtained by constraint conditions of the two and a reprojection error term:

wherein,Ψ_s＝ξ(C_s，S_s，σ_∈)，S_s＝ζ(Ψ_s，∑_s，V_S，N_S)，the reprojection error constraint represents a three-dimensional reconstruction with Ws and R.

Wherein the formula operation is characterized in thatIs a bilinear optimization variable, so the formula (3) is a non-convex problem and is difficult to solve; however, the non-convex problem can be efficiently solved using Augmented Lagrange Methods (ALMs); multiplying lagrange multiplierAnd arbitrary variables (J)_s，J_t) Substituting equation (3) with:

therein, Ψ_s＝ξ(C_s，S_s，σ_∈)，S_s＝ζ(Ψ_s，∑_s，V_S，N_s)， first, the formula ξ (·) calculates the Singular Value Decomposition (SVD) of the C matrix, i.e., C ═ U ·_c，∑_c，V_c]Then form a matrix A, i.e.Wherein sigma_∈Is based on the noise level and X ═ U_c(∑_c)^0.5Obtaining an empirical value; second, the grassmann samples are constructed from the S matrix with Aij, which reduces the task complexity from vector point index to linear subspace index.

Three-dimensional reconstruction experiment and analysis, for the experiment, in order to carry out quantitative evaluation of three-dimensional reconstruction, the estimated shape of each frameAnd the actual shape of the labelArranging up; mean root mean square three-dimensional reconstruction error formula Initializing the segments by adopting a Kmeans + + algorithm; for the analysis, mainly control variable analysis and runtime analysis are included.

FIG. 2 is a Grassmann representation of a Grassmann manifold-based dense non-rigid kinematic structure algorithm. Including a grassmann representation of the trajectory space and a grassmann representation of the shape space.

The Grassmann representation method of the track space, the Grassmann manifold is equivalent to the symmetric power matrix isomerously, the Grassmann manifold can be embedded in the symmetric matrix manifold, and the self-expression can be defined in the embedding space:

The grassmann representation of shape space is characterized by the fact that, assuming that the temporal deformation proceeds smoothly over time, the deformation shape is based on local self-expression across frames:

wherein,representing a tensor constructed by mapping a symmetric matrix of all grassmann data points;representing a matrix of coefficients (K)_tTotal number of time groups); e_tTime group reconstruction error measurements are made for each manifold geometry.

FIG. 3 is a three-dimensional reconstruction effect diagram of an image deblurring method based on the total variation of the weighted graph. The three-dimensional reconstruction process comprises the steps of firstly carrying out Grassmann formulation expression on a track space; then, performing Grassmann formulation expression on the shape space; then, performing space-time formulation and bilinear optimization; and finally, carrying out a three-dimensional reconstruction experiment and carrying out control variable analysis and runtime analysis. The invention does not need any prior template when processing the non-rigid motion structure based on the Grassmann manifold, can process the nonlinear deformation with noise and can provide the final result based on the basic data set; the invention has better data expandability, higher reconstruction accuracy and optimal comprehensive performance.

It will be appreciated by persons skilled in the art that the invention is not limited to details of the foregoing embodiments and that the invention can be embodied in other specific forms without departing from the spirit or scope of the invention. In addition, various modifications and alterations of this invention may be made by those skilled in the art without departing from the spirit and scope of this invention, and such modifications and alterations should also be viewed as being within the scope of this invention. It is therefore intended that the following appended claims be interpreted as including preferred embodiments and all such alterations and modifications as fall within the scope of the invention.

Claims

1. A dense non-rigid motion structure algorithm based on a Grassmann manifold is characterized by mainly comprising a Grassmann manifold (I); formulating a non-rigid motion structure; and (3) three-dimensional reconstruction experiment and analysis.

2. Grassmann manifold (i) according to claim 1, wherein the grassmann manifold is generally expressed as grasmann manifoldByAll r dimensions of (1) are linear subspace composition, n is more than r; each point on the Grassmann manifold is represented by an n X r matrix (called X), where each column is made up of X tiles, denoted span (X) or abbreviated [ X ]]。

3. The non-rigid kinematic structure formulation (ii) according to claim 1, characterized by essentially comprising: a grassmann representation of trajectory space, a grassmann representation of shape space, a spatio-temporal formulation and a formulation operation.

4. The grassmann representation method of trajectory space according to claim 3, wherein the grassmann manifold is isomorphically equivalent to a symmetric idempotent matrix, the grassmann manifold can be embedded in a symmetric matrix manifold, and self-expression can be defined in the embedding space:

5. Grassmann representation of shape space according to claim 3, wherein the shape deformation is based on local self-expression across frames, assuming that the temporal deformation proceeds smoothly over time:

6. The spatio-temporal formulation according to claim 3, wherein the grassmann representation of the trajectory space and the grassmann representation of the shape space are combined, and a combination formula (comprising the local spatial structure and the non-rigid shape as a whole) is obtained by the constraint conditions of the two and the reprojection error term:

wherein,Ψ_s＝ξ(C_s，S_s，σ_∈)，S_s＝ζ(Ψ_s，∑_s，V_S，N_S)，w for reprojection error constraint_sAnd R represents a three-dimensional reconstruction.

7. Operation on the basis of the formula according to claim 3, characterised in thatIs a bilinear optimization variable, so the formula (3) is a non-convex problem and is difficult to solve; however, the non-convex problem can be efficiently solved using Augmented Lagrange Methods (ALMs); multiplying lagrange multiplierAnd arbitrary variables (J)_s，J_t) Substituting equation (3) with:

therein, Ψ_s＝ξ(C_s，S_s，σ_∈)，S_s＝ζ(Ψ_s，∑_s，V_S，N_S)， first, the formula ξ (·) calculates the Singular Value Decomposition (SVD) of the C matrix, i.e., C ═ U ·_c，∑_c，V_c]Then form a matrix A, i.e.Wherein sigma_∈Is based on the noise level and X ═ U_c(∑_c)^0.5Obtaining an empirical value; secondly, with A_ijConstructing the Grassmann sample from the S matrix reduces the task of indexing from vector points to linear subspace indexesComplexity.

8. Three-dimensional reconstruction experiment and analysis (iii) as claimed in claim 1, wherein, for the experiment, the estimated shape of each frame is evaluated for quantitative evaluation of the three-dimensional reconstructionAnd the actual shape of the labelArranging up; mean root mean square three-dimensional reconstruction error formulaInitializing the segments by adopting a Kmeans + + algorithm; for the analysis, mainly control variable analysis and runtime analysis are included.

9. The control variable analysis of claim 8, wherein the control variable analysis is primarily for the purpose of assessing the importance of spatio-temporal constraints in the formulation; the performance of the main observation formula is shown in four cases: (a) no space-time limitation; (b) only time limitations exist; (c) only limited in space; (d) with time and space limitations.

10. The runtime analysis of claim 8, mainly used for comparison with other dense non-rigid motion structure reconstruction methods; the result shows that the expandability of the method in the data set exceeds 50000-70000 points, the method exceeds most methods and has higher reconstruction accuracy, and the comprehensive performance is optimal.