CN113436335A

CN113436335A - Incremental multi-view three-dimensional reconstruction method

Info

Publication number: CN113436335A
Application number: CN202110681635.2A
Authority: CN
Inventors: 金晶
Original assignee: Zhaoyuan State Owned Assets Management Co ltd
Current assignee: Zhaoyuan State Owned Assets Management Co ltd
Priority date: 2021-06-18
Filing date: 2021-06-18
Publication date: 2021-09-24
Anticipated expiration: 2041-06-18
Also published as: CN113436335B

Abstract

The invention provides an incremental multi-view three-dimensional reconstruction method, which comprises the steps of firstly, analyzing human body appearance principal components oriented to three-dimensional grid parameterization, and converting human body shape parameters into principal component coefficients; then, a loss function is designed according to the human body three-dimensional shape prediction, three additional output units of thickness deformation, width deformation and height deformation are added, the average error of all vertexes of a human body shape grid is used as the loss function, and the predicted human body shape and the real shape are scaled to a fixed length; finally, constructing and training a network regression model, inputting the two-dimensional contour of the human body and the posture of the camera into an encoder, performing regression on principal component coefficients, width deformation, height deformation and thickness deformation factors, and obtaining a reconstructed three-dimensional shape of the human body by applying inverse principal component analysis; the invention simplifies the manual process and the registration process, simultaneously does not need to consider the sequence and the number of the multiple views in the reconstruction process, and has wide application range and simple and convenient use mode.

Description

Incremental multi-view three-dimensional reconstruction method

Technical Field

The invention belongs to the technical field of artificial intelligence, and particularly relates to an incremental multi-view three-dimensional reconstruction method.

Background

In the traditional three-dimensional modeling of the human body appearance, the size of different parts of the human body is measured, and a reconstruction result is obtained by adopting a geometric method to perform parameter fitting. With the development of artificial intelligence technology, the problem of human body three-dimensional modeling can be solved through image reconstruction based on a deep learning method. Convolutional neural networks for three-dimensional reconstruction typically have an encoder that maps images of the human body into feature vectors and then decodes the feature vectors into the desired three-dimensional model output according to the representation of the three-dimensional shape. Common representations include voxels, meshes, and point clouds.

Multi-view three-dimensional reconstruction can overcome the self-occlusion problem. A convolutional neural network based three-dimensional reconstruction for the case of a fixed number of views can be solved. The long-short term memory neural network framework needs to be used when the number of views changes, but if the order of inputting the views changes, the output is not consistent.

Disclosure of Invention

The invention provides an incremental multi-view three-dimensional reconstruction method for solving the problem of multi-view three-dimensional model reconstruction under the conditions of different image sequences and different image quantities, which is not influenced by the sequence of input images and can be popularized to any number of views.

The invention is realized by the following scheme:

an incremental multi-view three-dimensional reconstruction method comprises the following steps:

the method comprises the following steps:

step a: carrying out human body two-dimensional contour processing on the image, and designing a three-dimensional model into a standard grid structure; analyzing the human body appearance principal components oriented to the three-dimensional grid parameterization; converting the human body shape parameters into principal component coefficients;

step b: designing a loss function aiming at the human body three-dimensional shape prediction; taking the average error of all the vertexes of the human body shape grid as a loss function, and scaling the predicted human body shape and the real shape to a fixed length;

step c: and constructing and training a network regression model, inputting the two-dimensional outline of the human body and the camera posture into an encoder, and obtaining the reconstructed three-dimensional shape of the human body by applying inverse principal component analysis.

Further, in step a:

step a 1: making a two-dimensional outline shape data set of a human body image; more than 1000 pictures are taken of the human body at each angle, external parameters of a camera are marked on each image, and two-dimensional contour processing of the human body is carried out on the images based on the existing segmentation network;

step a 2: designing a standard grid of a human body three-dimensional model; the three-dimensional model of the human body is designed as a standard mesh with 6980 vertices whose coordinates are expressed as P ═ P_xi p_yi p_zi]^TWherein p is_xi、p_yiAnd p_ziEach vertex position in the human body thickness, width and height directions is respectively represented, i is 1, …, 6980;

step a 3: analyzing the principal component coefficient of the human body appearance change; applying a principal component analysis method to a human body two-dimensional contour data set to obtain a principal component coefficient C ═ C for describing human body shape change₀ c₁ c₂ c₃ c₄ c₅ c₆ c₇ c₈ c₉]^TWherein c is₀Represents height; c. C₁Representing the circumference of the chest; c. C₂Representing the hip circumference; c. C₃Representing the abdominal circumference; c. C₄Represents lateral compressive stretching; c. C₅Indicating that the belly is large but the whole is thin; c. C₆Showing that the belly is enlarged and other parts are thin and small; c. C₇Indicating the degree of longitudinal body compression; c. C₈Represents the horizontal width; c. C₉Indicating the shoulder width.

Further, in step b:

step b 1: aiming at the prediction of a human body three-dimensional model, the thickness deformation k is increased_tWidth of deformation k_wHigh degree of deformation k_hThe output unit of (1); the vertex coordinates after deformation are denoted as P' ═ k_tp_xi k_wp_yi k_hp_zi]^T；

Step b 2: taking the average error of all the vertexes of the human body shape mesh as a loss function;

the Loss function Loss of the human body three-dimensional shape prediction is set as the average error described by the Euclidean distance d (·,) of the 6980 vertexes caused by the change of the human body shape parameters,

step b 3: in the process of calculating the human body three-dimensional shape loss function, the predicted human body shape and the real shape are scaled to the fixed length of 170 cm.

Further, in step c:

step c 1: inputting the two-dimensional outline of the human body and the posture of the camera into an encoder, and calculating a one-dimensional feature vector; combining a plurality of feature vectors generated by multiple views into a single feature vector through a pooling layer;

step c 2: decoder for principal component coefficient C, thickness deformation k_tWidth of deformation k_wHigh degree of deformation k_hThe factors are regressed, and a linear activation function is applied to all output units;

step c 3: by applying inverse principal component analysis, the k is deformed by the thickness_tWidth of deformation k_wHigh degree of deformation k_hAnd carrying out horizontal and vertical scaling on the vertex, and scaling to the whole human body to obtain the reconstructed human body three-dimensional shape.

The invention has the beneficial effects

(1) The method of the invention aims at the human body appearance, draws the human body three-dimensional outline by analyzing the sampling shape in the principal component model;

(2) the method extracts the human body contour and the camera attitude data based on the depth network to infer the parameter characteristics and the appearance of the human body, can process any number of input views without being influenced by the input sequence, and realizes incremental dense grid reconstruction;

(3) the incremental reconstruction is carried out based on the multi-view three-dimensional reconstruction model, the manual process and the registration process are simplified, meanwhile, the sequence and the number of the multi-view are not required to be considered in the reconstruction process, the reconstruction is carried out by the method, the application range is wide, and the use mode is simple and convenient.

Drawings

FIG. 1 is a flow chart of the method of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments; all other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

As shown in fig. 1, an incremental multi-view three-dimensional reconstruction method:

the method comprises the following steps:

In step a:

In step b:

step b 1: to provide greater flexibility in human three-dimensional shape prediction, the thickness deformation k is increased for human three-dimensional model prediction_tWidth of deformation k_wHigh degree of deformation k_hThe output unit of (1); the vertex coordinates after deformation are denoted as P' ═ k_tp_xik_wp_yi k_hp_zi]^T；

In step c:

step c 1: inputting the two-dimensional outline of the human body and the posture of the camera into an encoder, and calculating a one-dimensional feature vector; the number of copies of the encoder is the same as that of the views, and then a plurality of feature vectors generated by the multiple views are combined into a single feature vector through a pooling layer;

step c 2: decoder pairs the principal component coefficient C, the thickness deformation k through two fully connected layers_tWidth of deformation k_wHigh degree of deformation k_hThe factors are regressed, and a linear activation function is applied to all output units;

The incremental multi-view three-dimensional reconstruction method provided by the invention is described in detail, the principle and the implementation mode of the invention are explained, and the description of the embodiment is only used for helping to understand the method and the core idea of the invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present invention.

Claims

1. An incremental multi-view three-dimensional reconstruction method is characterized in that:

the method comprises the following steps:

2. The method of claim 1, further comprising: in step a:

step a 2: designing a standard grid of a human body three-dimensional model; the three-dimensional model of the human body is designed as a standard mesh with 6980 vertices whose coordinates are expressed as P ═ P_xi p_yi p_zi]^TWherein p is_xi、p_yiAnd p_ziRespectively indicate the thickness and width of the human bodyEach vertex position in the degree and height directions, i ═ 1, …, 6980;

3. The method of claim 2, further comprising: in step b:

4. A method of lossing according to claim 3, characterised by: in step c: