CN110570503B - Method for acquiring normal vector, geometry and material of three-dimensional object based on neural network - Google Patents

Method for acquiring normal vector, geometry and material of three-dimensional object based on neural network

Info

Publication number: CN110570503B (application number CN201910829629.XA; other version: CN110570503A)
Authority: CN (China)
Legal status: Active
Inventors: 吴鸿智, 周昆, 康凯彰
Assignees: Hangzhou Faceunity Technology Co., Ltd.; Zhejiang University (ZJU)
Application filed by Hangzhou Faceunity Technology Co., Ltd. and Zhejiang University (ZJU); application granted; publication of CN110570503B.


Classifications

    • G — PHYSICS
    • G06 — COMPUTING; CALCULATING OR COUNTING
    • G06T — IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T 15/00 — 3D [Three Dimensional] image rendering
        • G06T 15/005 — General purpose rendering architectures
        • G06T 15/04 — Texture mapping
        • G06T 15/50 — Lighting effects

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Graphics (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)
  • Image Generation (AREA)

Abstract

The invention discloses a neural-network-based method for acquiring the normal vectors, geometry and material of a three-dimensional object. Building on the idea of actively projecting several specific patterns onto the object while simultaneously capturing photographs, the method computes the object's normal vectors from the captured photographs with a neural network. Further, the method uses the obtained normal vectors to optimize the geometric model of the object, and can acquire material characteristic information at the same time as the normal vectors; finally, combining the two yields a high-quality geometry and material acquisition result. The method requires only a small number of illumination patterns, obtains normal vectors with high accuracy, and is not limited to one specific acquisition device.

Description

Method for acquiring normal vector, geometry and material of three-dimensional object based on neural network
Technical Field
The invention relates to a neural-network-based method for acquiring the normal vectors of a three-dimensional object, together with a geometry-optimization method and a material-acquisition method based on it, and belongs to the fields of computer graphics and computer vision.
Background
Digitizing real objects has long been a problem in the fields of computer graphics and computer vision. Currently, a digitized real object can be represented by a three-dimensional mesh together with a six-dimensional spatially varying bidirectional reflectance distribution function (SVBRDF). With this representation, the appearance of the object can be rendered well under any viewing angle and illumination condition.
However, acquiring the reflectance information and geometric information of an object simultaneously, efficiently, and with high quality remains a great challenge. On the one hand, high quality requires obtaining as many observations as possible during measurement. On the other hand, in the actual acquisition process, it is desirable to keep the measurement time as short as possible. In addition, high-quality normal vectors can be used to refine a coarse three-dimensional mesh into a fine geometric model, but there is currently no method that efficiently acquires normal vector information with high quality; this problem, too, remains a great challenge.
Existing methods/systems can collect one kind of information when the other (geometry or material) is known. For example, scenes with less complex reflectance properties can be acquired using structured light (Daniel Scharstein and Richard Szeliski. 2003. High-accuracy stereo depth maps using structured light. In CVPR.) or structure-from-motion (Johannes Lutz Schönberger and Jan-Michael Frahm. 2016. Structure-from-Motion Revisited. In CVPR.) techniques. For another example, methods that acquire material properties may assume specific controlled illumination, such as a linear light source (Andrew Gardner, Chris Tchou, Tim Hawkins, and Paul Debevec. 2003. Linear light source reflectometry. ACM Trans. Graph. 22, 3 (2003), 749–758.), or may simplify the sampling difficulty by using the prior knowledge that the acquired object has a specific geometry. Although some prior work collects the geometry and material information of an object at the same time, these methods rely on strong simplifying assumptions. For example, they assume the object is lit by distant light (Borom Tunwattanapong, Graham Fyffe, Paul Graham, Jay Busch, Xueming Yu, Abhijeet Ghosh, and Paul Debevec. 2013. Acquiring Reflectance and Shape from Continuous Spherical Harmonic Illumination. ACM Trans. Graph. 32, 4, Article 109 (July 2013), 12 pages.), or that the material is isotropic (Rui Xia, Yue Dong, Pieter Peers, and Xin Tong. 2016. Recovering Shape and Spatially-Varying Surface Reflectance under Unknown Illumination. ACM Trans. Graph. 35, 6 (2016), 12 pages.). These simplifying assumptions limit the range of objects and scenes to which such methods apply.
Deep learning is a machine learning method that in recent years has gained wide application and great success in the fields of computer vision and computer graphics. Its core idea is to fit a network to an objective function by specifying a loss function and applying gradient descent.
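The "specify a loss function, fit by gradient descent" idea can be illustrated with a toy numpy sketch; the model, data, and learning rate here are all hypothetical and unrelated to the patent's network.

```python
import numpy as np

# Toy illustration of "specify a loss function, fit by gradient descent":
# fit a scalar linear model y = w * x by descending the mean squared loss.
rng = np.random.default_rng(0)
x = rng.random(64)
y = 3.0 * x                                # target mapping, true w = 3

w = 0.0                                    # parameter to be trained
lr = 0.5                                   # learning rate
for _ in range(200):
    grad = np.mean(2.0 * (w * x - y) * x)  # gradient of mean((w*x - y)^2)
    w -= lr * grad
```

After the loop, w has converged to the true coefficient; the patent's training works on the same principle, only with a deep network and the losses defined below.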
Disclosure of Invention
The invention aims to provide a method for acquiring normal vectors, geometry and materials of a three-dimensional object based on a neural network, aiming at the defects of the prior art.
The purpose of the invention is realized by the following technical scheme: a three-dimensional object normal vector obtaining method based on a neural network comprises the following steps:
(1) generating training data: acquiring parameters of acquisition equipment, including the distance and angle from a light source to an acquisition space origin, a characteristic curve of the light source, the distance and angle from a camera to the acquisition space origin, and internal and external parameters of the camera; these parameters are used to generate acquisition results that simulate actual cameras as training data.
(2) Training a neural network according to the training data obtained in the step (1), wherein the neural network is characterized as follows:
a. the input of the neural network is Lumitexel, which describes the reflection intensity of the sampling point to the incident light from each light source along a certain observation direction;
b. the output and regression target of the neural network is a normal vector corresponding to the input Lumitexel;
c. The first layer of the neural network is a linear fully-connected layer, whose parameter matrix is obtained through training by the following formula:

W_l = f_w(W_raw)

where W_raw is the parameter to be trained; W_l is the illumination matrix of size n × m, with n the sampling precision of a Lumitexel and m the number of illumination patterns; f_w is a mapping that transforms W_raw so that the generated illumination matrix corresponds to light intensities the acquisition device can actually emit;

d. The second and subsequent layers are a nonlinear mapping network, where the output vector of the last layer has length 3.

After training is finished, the illumination matrix of the first linear fully-connected layer is taken out.
(3) Have the acquisition device generate illumination patterns from the illumination matrix taken out in step (2) and illuminate the target three-dimensional object with them in sequence to obtain a group of photographs r_1, r_2, ..., r_m. Traverse the pixels and channels of the photographs; each time, take from r_1, r_2, ..., r_m the pixel values at row i, column j, channel k to form a vector a = (a_1, a_2, ..., a_m). Using a as the output vector of the first linear fully-connected layer of the neural network, compute the output vector of the last layer, which is the normal vector of the three-dimensional object surface at that pixel. Traversing all pixels yields a normal-vector feature map of the object surface.
Further, in the step (1), a specific method for generating an acquisition result simulating an actual camera is as follows: randomly selecting a sampling point in a collection space where a target three-dimensional object may appear, randomly sampling a material parameter of the point, and generating Lumitexel by using a rendering model.
Further, the rendering model adopts the GGX model, with the following generation formula:

f_r(ω_i; ω_o, n, t, p) = ρ_d/π + ρ_s · D_GGX(ω_h) F(ω_i, ω_h) G_GGX(ω_i, ω_o) / (4 (n·ω_i)(n·ω_o))

where f_r(ω_i; ω_o, n, t, p) is the bidirectional reflectance distribution function; ω_o is the outgoing direction; ω_i is the incident direction; n is the normal vector in the world coordinate system; t is the x-axis direction of the sampling point's local coordinate system expressed in the world coordinate system; p is the material parameter vector containing α_x, α_y, ρ_d, ρ_s, where α_x and α_y denote the roughness coefficients, ρ_d the diffuse reflectance, and ρ_s the specular reflectance; ω_h is the half vector; D_GGX is the microfacet distribution term, F the Fresnel term, and G_GGX the shadowing coefficient function.
Further, in step (2), the mapping f_w is chosen as the combination of a normalization function and a flip function. This requires m to be even; W_raw has size n × m/2 and can be viewed as the combination of m/2 column vectors:

W_raw = (w_1, w_2, ..., w_{m/2})

f_w first normalizes each column of W_raw to a unit vector, and then flips the negative entries of each column into a new vector; the formula is expressed as follows:

f_w(W_raw) = f_flip(f_normalize(W_raw))

W_normalized = f_normalize(W_raw) can likewise be regarded as m/2 column vectors:

W_normalized = (ŵ_1, ŵ_2, ..., ŵ_{m/2}), where ŵ_j = w_j / ||w_j||

f_flip(W_normalized) = (max(ŵ_1, 0), max(−ŵ_1, 0), ..., max(ŵ_{m/2}, 0), max(−ŵ_{m/2}, 0))

where max(·, 0) is taken elementwise, so every entry of the resulting n × m illumination matrix is non-negative.
further, in the step (2), a normalization layer may be further connected after the last layer of the neural network, and is used to normalize the output vector to be a unit vector.
The prediction of material characteristic information (such as diffuse reflection characteristic vector and specular reflection characteristic vector) can be added during the neural network training. The specific method comprises the following steps:
(1) for a sampling point, a cube with the side length d and the center at the origin of the acquisition space is adopted to surround the sampling point, and a plurality of points are uniformly sampled on each surface to serve as virtual light sources.
(2) Generating a material feature vector, with the following formula:

V(I, P) = F_near(x_l, x_p, ω_i, n_p, n_l) I(l) f_r(ω_i; ω_o, n_p, t, p)

where I represents the lighting information of each virtual light source l, including: the spatial position x_l of the virtual light source, its normal vector n_l, and its luminous intensity I(l); P contains the parameter information of the sampling point, including: the spatial position x_p of the sampling point, its normal vector n_p in the world coordinate system, the x-axis direction t of its local coordinate system expressed in the world coordinate system, and its material parameter vector p containing α_x, α_y, ρ_d, ρ_s, where α_x, α_y denote the roughness coefficients, ρ_d the diffuse reflectance, and ρ_s the specular reflectance; ω_i is the incident vector and ω_o the outgoing vector in the world coordinate system; f_r(ω_i; ω_o, n_p, t, p) is the bidirectional reflectance distribution function; F_near(x_l, x_p, ω_i, n_p, n_l) is the near-light factor:

F_near(x_l, x_p, ω_i, n_p, n_l) = max(ω_i · n_p, 0) max(−ω_i · n_l, 0) / ||x_l − x_p||²
(3) Setting the specular reflectance of the sampling point to 0 generates the diffuse reflection feature vector v_d.
(4) Setting the diffuse reflectance to 0 generates the specular reflection feature vector v_s.
(5) The output of the neural network is augmented with the vectors v'_d, v'_s and p', where v'_d has the same length as v_d, v'_s has the same length as v_s, and p' has length 3; the vectors v'_d, v'_s and p' are predictions of, respectively, the diffuse reflection feature vector v_d with the near-light factor removed, the specular reflection feature vector v_s with the near-light factor removed, and the spatial position x_p.
(6) The loss functions of the material feature part are expressed as follows:

Loss_diffuse = ||v'_d F_near(x_l, p', ω'_i, n', n_l) − v_d||

Loss_specular = ||v'_s F_near(x_l, p', ω'_i, n', n_l) − v_s||

When training the neural network, the diffuse reflection loss function Loss_diffuse and the specular reflection loss function Loss_specular are added.
In actual use, the material characteristic information can be obtained while the normal vector is obtained.
(7) And acquiring a three-dimensional object material map according to the material characteristic information.
Further, in the step (7), the process of obtaining the material map of the three-dimensional object according to the material characteristic information is as follows:
(7.1) Sample the target three-dimensional object from m angles with a camera, then use the sampled photographs as input to obtain a three-dimensional mesh; for each vertex, choose the sampling angle ω that minimizes n_i · ω_o among the angles from which the vertex is visible, where n_i is the normal vector of the three-dimensional mesh vertex and ω_o is the outgoing vector in the world coordinate system.
(7.2) Taking the three-dimensional mesh of the three-dimensional object as input, obtain the mapping relation map from the material map to the mesh.
(7.3) Map each valid pixel on the mapping relation map to a sampling angle ω; combine it with the material feature v_i of that point obtained by the neural network to fit the material parameters. Traverse the valid pixels on all mapping relation maps to obtain the final material map.
Further, in step (7.3), the L-BFGS-B method is used to fit the material parameters; the expression is as follows:

minimize( ||v_i − NN[f_r(ω_i; ω_o, n', t, p')]|| )

where p' and t are the fitted quantities: p' is the material parameter vector, t is the x-axis direction of the sampling point's local coordinate system in the world coordinate system, n' is the normal vector predicted by the neural network, and NN denotes the mapping of the neural network.
A method for optimizing a three-dimensional mesh using normal vectors comprises the following steps:
(1) Sample the target three-dimensional object from m angles with a camera, illuminating it at each angle with the illumination patterns obtained by training with the method above; using the sampled photographs as input, obtain an initial three-dimensional point cloud and a three-dimensional mesh, with an initial normal vector n_i at each vertex of the mesh.
(2) Re-mesh (remesh) the three-dimensional mesh obtained in step (1).
(3) For each vertex, choose the sampling angle ω that minimizes n_i · ω_o among the angles from which the vertex is visible; under that angle, obtain the normal vector prediction n'_i by the normal vector acquisition method above and use it as the vertex normal n_i.
(4) Change every vertex position P_i in the current mesh to a new position P'_i to obtain a new mesh; this process is optimized with a loss function L_normal on the normal vectors and a vertex position loss function L_position. Combining L_normal and L_position gives the joint loss function L_opt, which is optimized using least squares.
(5) Return to step (3) until L_opt converges, completing the optimization of the three-dimensional mesh; take this mesh as the final geometric model.
Further, in step (4):

Normal vector loss function L_normal: the polygon {e'_0, e'_1, ..., e'_k}, with k greater than 2, is used as an approximation of the tangent plane, where e'_j is the j-th edge of the polygon formed by P'_i and its adjacent vertices. The loss function L_normal is expressed as follows:

L_normal = (1/N) Σ_i Σ_j (n_i · e'_j)²

where N is the number of three-dimensional mesh vertices.

Vertex position loss function L_position:

L_position = (1/N) Σ_i ||E_i (P'_i − P_i)||²

where E_i = α n_i n_iᵀ + β (I − n_i n_iᵀ), I is the identity matrix, and α, β are preset parameters with α, β ∈ [0, +∞); by adjusting α and β, the joint loss function L_opt at the new positions P'_i is driven to its optimum.

Joint loss function L_opt:

L_opt = λ L_position + (1 − λ) L_normal

where λ ∈ [0, 1] is used to control the weight of the two loss functions.
The invention has the following beneficial effects: the invention provides an acquisition method that combines a neural network with the idea of actively projecting several specific patterns onto an object while simultaneously capturing photographs, computing the object's normal vectors from the captured photographs. The average error of the normal vectors obtained by the method is 3 degrees, considerably more accurate than the prior art. Further, the method uses the obtained normal vectors to optimize the model of the object, and can acquire material characteristic information at the same time as the normal vectors, finally combining them into a high-quality geometry and material acquisition result. Acquiring the material characteristic information does not require assuming that the spatial position is known. The three-dimensional object model and material obtained by the combination have extremely strong realism, exceeding the prior level. The method requires few illumination patterns, obtains normal vectors with high accuracy, and is not limited to one specific acquisition device. In addition, the way the method obtains the illumination matrix directly guarantees that the generated matrix is valid.
Drawings
FIG. 1 is a schematic perspective view of an acquisition apparatus in an embodiment;
fig. 2 (a) is a plan development view of the acquisition apparatus in the embodiment, where 64 × 64 is the number of light sources per plane; (b) is a side view;
FIG. 3 is a schematic flow chart of an embodiment;
FIG. 4 is a schematic diagram of a neural network used in an embodiment;
FIG. 5 is one of the resulting illumination patterns of the training in the example;
FIG. 6 is a typical Lumitexel used for training in the examples.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be further described in detail with reference to the accompanying drawings and specific embodiments.
The invention provides an acquisition method combined with a neural network on the idea of actively irradiating a plurality of specific patterns on an object, simultaneously acquiring photos and calculating the obtained photos to obtain the normal vector of the object. Further, the method uses the obtained normal vectors for optimizing the model of the object. The method can also acquire the material characteristic information while acquiring the normal vector. And finally, obtaining a high-quality geometric and material acquisition result in a combined manner. The following describes the specific implementation of the three parts in detail:
a three-dimensional object normal vector obtaining method based on a neural network comprises the following steps:
(1) generating training data: acquiring parameters of acquisition equipment, including the distance and angle from a light source to the origin of an acquisition space (a space for placing an acquisition object in the acquisition equipment), a characteristic curve of the light source, the distance and angle from a camera to the origin of the acquisition space, and internal and external parameters of the camera; generating an acquisition result simulating an actual camera by using the parameters as training data; the specific method for generating the acquisition result of the simulated actual camera is as follows:
randomly selecting a sampling point in a collection space where a target three-dimensional object may appear, randomly sampling a material parameter of the point, and generating Lumitexel by using a rendering model, wherein the Lumitexel describes the reflection light intensity of the sampling point to incident light from each light source along a certain observation direction; the rendering model may adopt a GGX model, but is not limited thereto, and the generation formula using the GGX model is as follows:
f_r(ω_i; ω_o, n, t, p) = ρ_d/π + ρ_s · D_GGX(ω_h) F(ω_i, ω_h) G_GGX(ω_i, ω_o) / (4 (n·ω_i)(n·ω_o))

where f_r(ω_i; ω_o, n, t, p) is the bidirectional reflectance distribution function; ω_o is the outgoing direction; ω_i is the incident direction; n is the normal vector in the world coordinate system; t is the x-axis direction of the sampling point's local coordinate system expressed in the world coordinate system; p is the material parameter vector containing α_x, α_y, ρ_d, ρ_s, where α_x and α_y denote the roughness coefficients, ρ_d the diffuse reflectance, and ρ_s the specular reflectance; ω_h is the half vector; D_GGX is the microfacet distribution term, F the Fresnel term, and G_GGX the shadowing coefficient function.
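A small Python sketch of the GGX model above, restricted to the isotropic special case α_x = α_y for brevity. The Schlick Fresnel with an assumed base reflectance f0 and this particular Smith form of G_GGX are our assumptions; the patent does not spell out these terms.

```python
import numpy as np

def ggx_brdf(w_i, w_o, n, alpha, rho_d, rho_s, f0=0.04):
    """Isotropic GGX sketch (alpha_x = alpha_y = alpha); f0 is an assumed Fresnel base."""
    w_h = w_i + w_o
    w_h = w_h / np.linalg.norm(w_h)                           # half vector
    n_wi, n_wo, n_wh = n @ w_i, n @ w_o, n @ w_h
    a2 = alpha * alpha
    D = a2 / (np.pi * ((n_wh ** 2) * (a2 - 1.0) + 1.0) ** 2)  # D_GGX distribution
    F = f0 + (1.0 - f0) * (1.0 - w_h @ w_i) ** 5              # Schlick Fresnel
    def G1(n_v):                                              # Smith shadowing
        return 2.0 * n_v / (n_v + np.sqrt(a2 + (1.0 - a2) * n_v ** 2))
    G = G1(n_wi) * G1(n_wo)
    return rho_d / np.pi + rho_s * D * F * G / (4.0 * n_wi * n_wo)

n = np.array([0.0, 0.0, 1.0])
w = np.array([0.0, 0.0, 1.0])                                 # normal incidence
val = ggx_brdf(w, w, n, alpha=0.3, rho_d=0.5, rho_s=0.4)
```

Evaluating many (ω_i, ω_o) pairs with a function like this, over randomly sampled material parameters, is how a synthetic Lumitexel would be assembled.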
(2) Training a neural network according to the training data obtained in the step (1), wherein the neural network is characterized as follows:
a. the input of the neural network is Lumitexel;
b. the output and regression target of the neural network is a normal vector corresponding to the input Lumitexel;
c. the first layer of the neural network comprises a linear fully-connected layer, and a parameter matrix of the linear fully-connected layer is obtained by training the following formula:
W_l = f_w(W_raw)
where W_raw is the parameter to be trained; W_l is the illumination matrix of size n × m, with n the sampling precision of a Lumitexel (i.e., the number of light sources of the acquisition device) and m the number of illumination patterns; f_w is a mapping that transforms W_raw so that the generated illumination matrix corresponds to light intensities the acquisition device can actually emit (i.e., has physical meaning). The size of W_raw and the value range of its elements depend on the f_w actually used. f_w may be chosen as the combination of a normalization function and a flip function, in which case m is an even number and W_raw has size n × m/2, viewed as the combination of m/2 column vectors:

W_raw = (w_1, w_2, ..., w_{m/2})

f_w first normalizes each column of W_raw to a unit vector, and then flips the negative entries of each column into a new vector; the formula is expressed as follows:

f_w(W_raw) = f_flip(f_normalize(W_raw))

W_normalized = f_normalize(W_raw) can likewise be regarded as m/2 column vectors:

W_normalized = (ŵ_1, ŵ_2, ..., ŵ_{m/2}), where ŵ_j = w_j / ||w_j||

f_flip(W_normalized) = (max(ŵ_1, 0), max(−ŵ_1, 0), ..., max(ŵ_{m/2}, 0), max(−ŵ_{m/2}, 0))

where max(·, 0) is taken elementwise, so every entry of the resulting n × m illumination matrix is non-negative.
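The mapping f_w can be sketched in numpy as follows. That f_flip splits each normalized column into its positive part and its flipped (negated) negative part is our reading of the text; it yields m non-negative columns, i.e. physically emittable patterns.

```python
import numpy as np

def f_w(W_raw):
    """f_w = f_flip o f_normalize: n x m/2 trainable matrix -> n x m non-negative patterns."""
    W_norm = W_raw / np.linalg.norm(W_raw, axis=0, keepdims=True)  # f_normalize
    cols = []
    for w in W_norm.T:                                             # f_flip, column by column
        cols.append(np.maximum(w, 0.0))                            # positive part
        cols.append(np.maximum(-w, 0.0))                           # flipped negative part
    return np.stack(cols, axis=1)

rng = np.random.default_rng(1)
W_raw = rng.standard_normal((8, 3))   # toy: n = 8 light sources, m/2 = 3 raw columns
W_l = f_w(W_raw)                      # illumination matrix, shape (8, 6)
```

Note that subtracting each negative pattern from its positive twin recovers the unit column, which is why a photographed pair can be recombined into one signed measurement.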
d. the second and subsequent layers are nonlinear mapping networks, wherein the output vector length of the last layer is 3, and preferably, the last layer can be followed by a normalization layer for normalizing the output vector to be a unit vector.
And after the training is finished, taking out the parameter matrix (namely the illumination matrix) of the first layer of linear full-connected layer.
(3) Have the acquisition device generate illumination patterns from the illumination matrix extracted in step (2) and illuminate the target three-dimensional object with them in sequence to obtain a group of photographs r_1, r_2, ..., r_m. Traverse the pixels and channels of the photographs; each time, take from r_1, r_2, ..., r_m the pixel values at row i, column j, channel k to form a vector a = (a_1, a_2, ..., a_m). Using a as the output vector of the first linear fully-connected layer of the neural network, compute the output vector of the last layer, which is the normal vector of the three-dimensional object surface at that pixel. Traversing all pixels yields a normal-vector feature map of the object surface.
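The per-pixel inference loop of step (3) can be sketched as follows. The shapes, the random stand-in photographs, and `nonlinear_layers` (a placeholder for the trained network minus its first linear layer) are all hypothetical.

```python
import numpy as np

m, H, W = 32, 4, 4                          # m patterns; H x W photographs (toy sizes)
rng = np.random.default_rng(0)
photos = rng.random((m, H, W, 3))           # r_1 .. r_m, 3 channels each

def nonlinear_layers(a):
    """Placeholder for the second and subsequent layers of the trained network."""
    v = np.tanh(a[:3])                      # stand-in nonlinear mapping to a 3-vector
    return v / np.linalg.norm(v)            # final normalization layer -> unit normal

normal_map = np.zeros((H, W, 3, 3))         # one unit normal per pixel, per channel
for i in range(H):
    for j in range(W):
        for k in range(3):
            a = photos[:, i, j, k]          # a = (a_1, ..., a_m) across the m photos
            normal_map[i, j, k] = nonlinear_layers(a)
```

The key point mirrored here is that the camera optically performs the first linear layer: the photographed pixel values a are taken directly as that layer's output.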
Secondly, a method for obtaining the material of the three-dimensional object based on the neural network, which comprises the following steps:
the prediction of material characteristic information (such as diffuse reflection characteristic vector and specular reflection characteristic vector) can be added during the neural network training. The specific method comprises the following steps:
(1) for a sampling point, a cube with the side length of d and the center at the origin of the acquisition space is adopted to surround the sampling point, a plurality of points are uniformly sampled on each surface to serve as virtual light sources, the virtual light sources are point light sources, and the number of the virtual light sources on the cube is called the sampling precision of the characteristic vector. The use of virtual light sources eliminates the undesirable effects of gaps that may exist between the light sources of the acquisition device.
(2) The formula for generating the material feature vector is as follows:
V(I, P) = F_near(x_l, x_p, ω_i, n_p, n_l) I(l) f_r(ω_i; ω_o, n_p, t, p)

where I represents the lighting information of each virtual light source l, including: the spatial position x_l of the virtual light source, its normal vector n_l, and its luminous intensity I(l); P contains the parameter information of the sampling point, including: the spatial position x_p of the sampling point, its normal vector n_p in the world coordinate system, the x-axis direction t of its local coordinate system expressed in the world coordinate system, and its material parameter vector p containing α_x, α_y, ρ_d, ρ_s, where α_x, α_y denote the roughness coefficients, ρ_d the diffuse reflectance, and ρ_s the specular reflectance; ω_i is the incident vector and ω_o the outgoing vector in the world coordinate system; f_r(ω_i; ω_o, n_p, t, p) is the bidirectional reflectance distribution function; F_near(x_l, x_p, ω_i, n_p, n_l) is the near-light factor:

F_near(x_l, x_p, ω_i, n_p, n_l) = max(ω_i · n_p, 0) max(−ω_i · n_l, 0) / ||x_l − x_p||²
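A small sketch of the near-light factor as reconstructed here: foreshortening cosine at the sample point, emission cosine at the light, and inverse-square falloff. The original formula appears only as an image, so this concrete form is an assumption.

```python
import numpy as np

def f_near(x_l, x_p, n_p, n_l):
    """Near-light factor sketch (assumed form): cos at point * cos at light / distance^2."""
    d = x_l - x_p
    r2 = d @ d                              # squared distance ||x_l - x_p||^2
    w_i = d / np.sqrt(r2)                   # incident direction, point -> light
    cos_p = max(w_i @ n_p, 0.0)             # foreshortening at the sample point
    cos_l = max(-w_i @ n_l, 0.0)            # emission cosine at the light
    return cos_p * cos_l / r2

x_p = np.zeros(3)
n_p = np.array([0.0, 0.0, 1.0])
x_l = np.array([0.0, 0.0, 2.0])             # light 2 units above the point
n_l = np.array([0.0, 0.0, -1.0])            # light facing straight down
val = f_near(x_l, x_p, n_p, n_l)            # both cosines are 1, distance^2 = 4
```

With the light directly overhead and facing the point, the factor reduces to the inverse-square term alone.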
(3) Using the above formula, generate the diffuse reflection feature vector v_d with the specular reflectance of the sampling point set to 0.
(4) Using the above formula, generate the specular reflection feature vector v_s with the diffuse reflectance set to 0.
(5) The output of the neural network is augmented with the vectors v'_d, v'_s and p', where v'_d has the same length as v_d, v'_s has the same length as v_s, and p' has length 3; the vectors v'_d, v'_s and p' are predictions of, respectively, the diffuse reflection feature vector v_d with the near-light factor removed, the specular reflection feature vector v_s with the near-light factor removed, and the spatial position x_p.
(6) The loss functions of the material feature part are expressed as follows:

Loss_diffuse = ||v'_d F_near(x_l, p', ω'_i, n', n_l) − v_d||

Loss_specular = ||v'_s F_near(x_l, p', ω'_i, n', n_l) − v_s||

When training the neural network, the diffuse reflection loss function Loss_diffuse and the specular reflection loss function Loss_specular are added.
In actual use, the material characteristic information can be obtained while the normal vector is obtained.
(7) Obtain a three-dimensional object material map according to the material characteristic information; the process is as follows:
(7.1) Sample the target three-dimensional object from m angles with a camera, use the sampled photographs as input, and obtain the three-dimensional mesh using the publicly available tool COLMAP. For each vertex, choose the sampling angle ω that minimizes n_i · ω_o among the angles from which the vertex is visible, where n_i is the normal vector of the three-dimensional mesh vertex and ω_o is the outgoing vector in the world coordinate system.
(7.2) Using the publicly available Iso-charts technique, take the three-dimensional mesh of the three-dimensional object as input and obtain the mapping relation map from the material map to the mesh (hereinafter the mapping relation map).
(7.3) Map each valid pixel on the mapping relation map to a sampling angle ω; combining it with the material feature v_i of that point obtained by the neural network (including the diffuse and specular reflection features), fit the material parameters using the L-BFGS-B method. The expression is as follows:

minimize( ||v_i − NN[f_r(ω_i; ω_o, n', t, p')]|| )

where p' and t are the fitted quantities: p' is the material parameter vector, t is the x-axis direction of the sampling point's local coordinate system in the world coordinate system, n' is the normal vector predicted by the neural network, and NN denotes the mapping of the neural network. Traverse the valid pixels on all mapping relation maps to obtain the final material map. The fitting method at this stage is not limited to the L-BFGS-B method.
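The bounded fit in step (7.3) can be sketched with SciPy's L-BFGS-B. The `render` function below is a made-up stand-in for NN[f_r(...)]; only the optimization pattern (bounded quasi-Newton minimization of a squared residual) reflects the text.

```python
import numpy as np
from scipy.optimize import minimize

def render(p):
    """Hypothetical differentiable stand-in for NN[f_r(...)], mapping parameters to a feature."""
    return np.array([p[0] + p[1], p[1] * p[2], p[2] + p[3], p[3] ** 2])

p_true = np.array([0.3, 0.5, 0.2, 0.8])
v_i = render(p_true)                         # "measured" material feature vector

def objective(p):
    return np.sum((v_i - render(p)) ** 2)    # squared residual ||v_i - render(p)||^2

res = minimize(objective, x0=np.full(4, 0.5), method="L-BFGS-B",
               bounds=[(0.0, 1.0)] * 4)      # parameters bounded like reflectances
```

The bounds keep the recovered parameters in a physically plausible range, which is also why the patent's choice of a box-constrained method such as L-BFGS-B is natural here.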
Thirdly, a method for optimizing a three-dimensional grid by using a normal vector, which comprises the following steps:
(1) Sample the target three-dimensional object from m angles with a camera, illuminating it at each angle with the illumination patterns obtained by the training of the first part; using the sampled photographs as input, an initial three-dimensional point cloud and a three-dimensional mesh can be obtained with the publicly available tool COLMAP, with an initial normal vector n_i at each vertex of the mesh.
(2) Perform re-meshing (remesh) on the three-dimensional mesh obtained in step (1); for example, Delaunay triangulation can be adopted.
(3) For each vertex, choose the visible sampling angle ω with the smallest n_i·ω_o; at this sampling angle, the normal vector prediction n′_i obtained by the neural-network-based three-dimensional object normal vector acquisition method is taken as the normal vector n_i of the point.
(4) Change every vertex position P_i in the current mesh to a new position P′_i to obtain a new mesh; this process is optimized with a loss function L_normal on the normal vectors and a vertex position loss function L_position.
The normal vector loss function L_normal exploits the property that the normal vector is perpendicular to the tangent plane, requiring that the tangent plane at P′_i in the new mesh be perpendicular to the corresponding normal vector n_i. A polygon {e′_0, e′_1, …, e′_k} is used as an approximation of the tangent plane, with k greater than 2, where e′_j is the j-th edge of the polygon formed by P′_i and its adjacent vertices. The loss function L_normal is expressed as follows:
L_normal = (1/n) · Σ_{i=1..n} Σ_{j=0..k} (n_i · e′_j)²
n is the number of three-dimensional mesh vertexes;
At the same time, the new position P′_i must be constrained to prevent singular points: the new position is given a large degree of freedom to move along the normal vector direction and a relatively small degree of freedom along the tangential direction, which yields the loss function L_position:
L_position = (1/n) · Σ_{i=1..n} ||A_i (P′_i − P_i)||²
where A_i = α n_i n_iᵀ + β (I − n_i n_iᵀ),
I is the identity matrix, and α, β are preset parameters with α, β ∈ [0, +∞); by adjusting α and β, the new position P′_i that optimizes the corresponding joint loss function L_opt is reached (preferred values are a relatively small α and a larger β). Combining L_normal and L_position gives the joint loss function L_opt, which is optimized using least squares:
L_opt = λ · L_position + (1 − λ) · L_normal
where λ ∈ [0, 1] is used to control the weight of the two loss functions (preferred values lie in [0.1, 0.3]).
(5) Return to step (3) until L_opt converges, which completes the optimization of the three-dimensional mesh; taking this mesh as the final geometric model and applying the optimized mesh in step (7) of the second part yields a high-quality material result.
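The two mesh losses of steps (4)–(5) can be sketched as follows, assuming (n, 3) vertex arrays and a per-vertex list of adjacent-vertex indices for the tangent-plane polygon (names and shapes are illustrative, not the patent's implementation):

```python
import numpy as np

# P, P_new : (n, 3) old/new vertex positions; normals : (n, 3) unit n_i;
# rings    : per-vertex lists of adjacent-vertex indices (the polygon).
def joint_loss(P, P_new, normals, rings, alpha=0.1, beta=1.0, lam=0.2):
    n = len(P)
    L_normal = 0.0
    for i, ring in enumerate(rings):
        edges = P_new[ring] - P_new[i]               # edges e'_j around vertex i
        L_normal += np.sum((edges @ normals[i]) ** 2)  # penalize non-perpendicularity
    L_position = 0.0
    for i in range(n):
        nn_t = np.outer(normals[i], normals[i])
        # small alpha: free movement along n_i; larger beta: stiff tangentially
        A = alpha * nn_t + beta * (np.eye(3) - nn_t)
        L_position += np.sum((A @ (P_new[i] - P[i])) ** 2)
    return lam * L_position / n + (1 - lam) * L_normal / n
```

Minimizing this over the new positions by least squares, re-predicting normals, and repeating until the value converges reproduces the loop of steps (3)–(5).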
A specific example of the acquisition device is given below. As shown in figs. 1 and 2, the device consists of 48 lamp panels, with a camera fixed on its upper portion for capturing images. A rotating platform driven by a stepper motor is mounted at the center of the device for placing the sampled object. Altogether 20480 LED lamp beads are densely arranged on the lamp panels; the beads are controlled by an FPGA, and their brightness and lighting time can be adjusted.
An example of an acquisition system applying the method of the present invention is given below. The system is divided into the following modules. Preparation module: comprises two parts; it provides the data set for network training, and, using the GGX BRDF model, takes a group of BRDF parameters, the spatial position of a point, and the camera position as input to obtain a Lumitexel. The network training part uses the TensorFlow open-source framework, describes the network as a computation graph, and trains with the Adam optimizer. The network structure is shown in fig. 4: each rectangle represents a layer of neurons, and the number in a rectangle is the number of neurons in that layer; the leftmost layer is the input layer, the rightmost is the output layer, and the layers are fully connected. Acquisition module: the device is shown in figs. 1 and 2, with the specific structure described above. Recovery module: loads the trained neural network and first computes the normal vector information and the material feature vectors; the coarse three-dimensional mesh obtained with the COLMAP software is then optimized using the method described above, the mapping relation map is computed from the optimized geometric model, and material parameter fitting is performed for each effective pixel on the mapping relation map.
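For the preparation module, an isotropic simplification of the GGX BRDF can be sketched as follows (the patent uses the full anisotropic form with roughness α_x, α_y; the Schlick Fresnel approximation and constant F0 here are assumptions made for brevity):

```python
import numpy as np

# Isotropic GGX sketch: w_i, w_o, n are unit vectors; alpha is the roughness,
# rho_d / rho_s the diffuse / specular albedos. Returns the BRDF value
# f_r = rho_d/pi + rho_s * D * F * G / (4 (n.w_i)(n.w_o)).
def f_r_ggx(w_i, w_o, n, alpha, rho_d, rho_s, F0=0.04):
    w_h = (w_i + w_o) / np.linalg.norm(w_i + w_o)        # half vector
    ndoth, ndoti, ndoto = n @ w_h, n @ w_i, n @ w_o
    D = alpha**2 / (np.pi * (ndoth**2 * (alpha**2 - 1) + 1) ** 2)  # GGX distribution
    k = alpha / 2                                        # Smith-Schlick masking (assumed)
    G = (ndoti / (ndoti * (1 - k) + k)) * (ndoto / (ndoto * (1 - k) + k))
    F = F0 + (1 - F0) * (1 - w_h @ w_i) ** 5             # Schlick Fresnel (assumed)
    return rho_d / np.pi + rho_s * D * F * G / (4 * ndoti * ndoto)

n = np.array([0.0, 0.0, 1.0])
w = np.array([0.0, 0.6, 0.8])                            # view == light direction
val = f_r_ggx(w, w, n, alpha=0.3, rho_d=0.5, rho_s=0.4)
```

Evaluating f_r for every virtual light direction of a sampling point yields one Lumitexel for the training set.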
Fig. 3 is a flowchart of this embodiment. First, training data are generated: 200 million Lumitexels are obtained by random sampling, with 80% used as the training set and the rest as the validation set. During network training, the parameters are initialized with the Xavier method, and the learning rate is 1e-4. The number of target shots is 32, so the size of the illumination matrix is (20480, 32). After training, the illumination matrix is taken out as the illumination patterns; each column specifies the light intensity of every lamp, and fig. 5 shows the illumination patterns obtained by training. The object is sampled at 24 angles; at each angle it is shot 32 times under the luminous intensities of the illumination patterns to obtain the acquisition results. For each point on the acquired pictures, the pixel values of the 32 pictures are combined into a vector. The subsequent processing steps are: 1. Load the network parameters after the second layer and feed in the acquired vectors; this recovers the spatial position, normal vector information, and diffuse reflection information at each position. 2. Perform geometric recovery with the COLMAP open-source software to obtain the point cloud and three-dimensional mesh. 3. Using the normal vector acquisition method, take the geometry and the predicted normal vectors as input and iteratively optimize the geometry. 4. Obtain the mapping relation map with Isochart. 5. For each effective pixel on the mapping relation map, find the corresponding real captured data and recover the diffuse and specular reflection feature vectors. Fitting module: fits the BRDF parameters pixel by pixel for each collected point using the L-BFGS-B method.
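The measurement step in item 1 above can be sketched in NumPy: the m pixel values captured under the m illumination patterns form the vector a, which by construction equals the first linear layer's output, so only the remaining layers need to be evaluated (the random weights below are placeholders for the trained parameters):

```python
import numpy as np

rng = np.random.default_rng(0)
n_lamps, m = 20480, 32
W_l = np.abs(rng.standard_normal((n_lamps, m)))   # illumination matrix (non-negative)
lumitexel = np.abs(rng.standard_normal(n_lamps))  # reflected intensity per lamp

# What the camera measures for one pixel across the m shots:
# a = W_l^T * lumitexel, i.e. the output of the first linear layer.
a = lumitexel @ W_l

# Remaining (trained) layers; random stand-ins here, 3 outputs = normal vector.
W2, W3 = rng.standard_normal((m, 64)), rng.standard_normal((64, 3))
h = np.maximum(a @ W2, 0.0)                       # nonlinear mapping network (ReLU assumed)
n_pred = (h @ W3) / np.linalg.norm(h @ W3)        # final normalization layer
```

In the real system, W2, W3 and any further layers are the trained parameters loaded "after the second layer of the network".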
The above description is only a preferred embodiment of the present invention, and the present invention is not limited to the above embodiment, and the present invention shall fall within the protection scope of the present invention as long as the technical effects of the present invention are achieved by the same means. Within the scope of protection of the present invention, various modifications and variations of the technical solution and/or embodiments thereof are possible.

Claims (10)

1. A three-dimensional object normal vector acquisition method based on a neural network is characterized by comprising the following steps:
(1) generating training data: acquiring parameters of acquisition equipment, including the distance and angle from a light source to an acquisition space origin, a characteristic curve of the light source, the distance and angle from a camera to the acquisition space origin, and internal and external parameters of the camera; generating an acquisition result simulating an actual camera by using the parameters as training data;
(2) training a neural network according to the training data obtained in the step (1), wherein the neural network is characterized as follows:
a. the input of the neural network is Lumitexel, which describes the reflection intensity of the sampling point to the incident light from each light source along a certain observation direction;
b. the output and regression target of the neural network is a normal vector corresponding to the input Lumitexel;
c. the first layer of the neural network comprises a linear fully-connected layer, and a parameter matrix of the linear fully-connected layer is obtained by training the following formula:
W_l = f_w(W_raw)
where W_raw is the parameter to be trained; W_l is the illumination matrix of size n × m, n being the sampling precision of the Lumitexel and m the number of illumination patterns; f_w is a mapping that transforms W_raw so that the generated illumination matrix corresponds to luminous intensities the acquisition device can actually produce;
d. the second layer and the later layers are nonlinear mapping networks, wherein the length of the output vector of the last layer is 3;
after training is finished, taking out the illumination matrix of the first layer of linear full-connection layer;
(3) The acquisition device generates illumination patterns according to the illumination matrix extracted in step (2) and illuminates the target three-dimensional object in sequence, obtaining a group of photos r_1, r_2, …, r_m; the pixels and channels of the photos are traversed in sequence, each time taking from r_1, r_2, …, r_m the pixel values of the i-th row, j-th column, and k-th channel to form a vector a = {a_1, a_2, …, a_m}; taking a as the output vector of the first linear fully-connected layer of the neural network, the output vector of the last layer is computed, and the obtained output vector is the normal vector of the three-dimensional object surface corresponding to that pixel; traversing all pixels yields the normal vector feature map of the three-dimensional object surface.
2. The method for obtaining the normal vector of the three-dimensional object based on the neural network as claimed in claim 1, wherein in the step (1), the specific method for generating the acquisition result simulating the actual camera is as follows: randomly selecting a sampling point in a collection space where a target three-dimensional object may appear, randomly sampling a material parameter of the point, and generating Lumitexel by using a rendering model.
3. The method for obtaining the normal vector of the three-dimensional object based on the neural network as claimed in claim 2, wherein the rendering model adopts a GGX model, and the generation formula is as follows:
f_r(ω_i; ω_o, n, t, p) = ρ_d/π + ρ_s · D_GGX(ω_h) · F(ω_i, ω_h) · G_GGX(ω_i, ω_o) / (4 (n·ω_i)(n·ω_o))
where f_r(ω_i; ω_o, n, t, p) is the bidirectional reflectance distribution function, ω_o is the outgoing direction, ω_i is the incident direction, n is the normal vector in the world coordinate system, t is the x-axis direction of the local coordinate system of the sampling point in the world coordinate system, and p is the material parameter vector comprising α_x, α_y, ρ_d, ρ_s, where α_x and α_y denote the roughness coefficients, ρ_d the diffuse reflectance, and ρ_s the specular reflectance; ω_h is the half vector, D_GGX is the microfacet distribution term, F is the Fresnel term, and G_GGX is the shadowing coefficient function.
4. The method for obtaining the normal vector of the three-dimensional object based on a neural network according to claim 1, wherein in step (2) the mapping f_w is chosen as the combination of a normalization function and a flip function; m is required to be an even number, and W_raw is of size n × (m/2). W_raw can be viewed as the combination of m/2 column vectors, expressed as follows:

W_raw = [w_1, w_2, …, w_{m/2}]

f_w first normalizes each column of W_raw to a unit vector, and then splits off the flipped negative part of each column to form a new vector; the formulas are expressed as follows:

f_w(W_raw) = f_flip(f_normalize(W_raw))

f_flip(W_normalized) = [w′_1⁺, w′_1⁻, w′_2⁺, w′_2⁻, …, w′_{m/2}⁺, w′_{m/2}⁻], where w′_j⁺ = max(w′_j, 0) and w′_j⁻ = max(−w′_j, 0)

W_normalized can be regarded as m/2 column vectors, represented as follows:

W_normalized = f_normalize(W_raw) = [w′_1, w′_2, …, w′_{m/2}], where w′_j = w_j / ||w_j||
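The mapping f_w = f_flip ∘ f_normalize of claim 4 (normalize each column, then split it into a positive part and a flipped negative part) can be sketched as follows; the column ordering of the resulting pairs is an assumption:

```python
import numpy as np

# W_raw has m/2 columns; each is normalized to a unit vector, then split into
# its positive part and the negation of its negative part, doubling the
# column count to m while keeping every entry non-negative.
def f_w(W_raw):
    W_norm = W_raw / np.linalg.norm(W_raw, axis=0, keepdims=True)
    cols = []
    for w in W_norm.T:
        cols.append(np.maximum(w, 0.0))    # w+ : positive part
        cols.append(np.maximum(-w, 0.0))   # w- : flipped negative part
    return np.stack(cols, axis=1)

W_raw = np.random.default_rng(1).standard_normal((20480, 16))
W_l = f_w(W_raw)                           # illumination matrix, shape (20480, 32)
```

All entries of the resulting illumination matrix are non-negative, i.e. physically realizable light intensities, and each original normalized column can be recovered as w⁺ − w⁻.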
5. the method according to claim 1, wherein in the step (2), a normalization layer is further connected after the last layer of the neural network, and is used for normalizing the output vector to be a unit vector.
6. A method for obtaining the material of a three-dimensional object based on a neural network, characterized in that prediction of material feature information is added when training the neural network according to claim 1, comprising the following steps:
(1) for a sampling point, a cube with the side length d and the center at the origin of an acquisition space is adopted to surround the sampling point, and a plurality of points are uniformly sampled on each surface to serve as virtual light sources;
(2) generating a material characteristic vector, wherein the formula is as follows:
V(I, P) = F_near(x_l, x_p, ω_i, n_p, n_l) · I(l) · f_r(ω_i; ω_o, n_p, t, p)
where I denotes the lighting information of each virtual light source l, including: the spatial position x_l of the virtual light source l, its normal vector n_l, and its luminous intensity I(l); P contains the parameter information of the sampling point, including: the spatial position x_p of the sampling point, its normal vector n_p in the world coordinate system, the x-axis direction t of its local coordinate system in the world coordinate system, and the material parameters p of the sampling point, comprising α_x, α_y, ρ_d, ρ_s, where α_x, α_y denote the roughness coefficients, ρ_d the diffuse reflectance, and ρ_s the specular reflectance; ω_i is the incident vector in the world coordinate system, and ω_o is the outgoing vector in the world coordinate system; f_r(ω_i; ω_o, n_p, t, p) is the bidirectional reflectance distribution function, and F_near(x_l, x_p, ω_i, n_p, n_l) is the near-light factor, with the formula as follows:
F_near(x_l, x_p, ω_i, n_p, n_l) = (n_p · ω_i)(n_l · (−ω_i)) / ||x_l − x_p||²
(3) Set the specular reflectance of the sampling point to 0 to generate the diffuse reflection feature vector v_d;
(4) Set the diffuse reflectance to 0 to generate the specular reflection feature vector v_s;
(5) The output of the neural network is augmented with the vectors v′_d, v′_s, p′, where v′_d has the same length as v_d, v′_s has the same length as v_s, and the length of p′ is 3; the vectors v′_d, v′_s, p′ are respectively the predictions of the diffuse reflection feature vector v_d with the near-light factor removed, the specular reflection feature vector v_s with the near-light factor removed, and the spatial position x_p;
(6) the loss function of the material characteristic part is expressed as follows:
Loss_diffuse = ||v′_d · F_near(x_l, p′, ω′_i, n′, n_l) − v_d||
Loss_specular = ||v′_s · F_near(x_l, p′, ω′_i, n′, n_l) − v_s||
when training the neural network, the diffuse reflection loss function Loss_diffuse and the specular reflection loss function Loss_specular are added;
In actual use, the material characteristic information can be obtained while the normal vector is obtained;
(7) Acquire the three-dimensional object material map according to the material feature information.
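Under the common point-light reconstruction of the near-light factor — receiver cosine times emitter cosine over squared distance, which is an assumption here since the patent's exact figure is not reproduced — F_near of claim 6 can be sketched as:

```python
import numpy as np

# x_l, n_l : position and normal of the virtual light source;
# x_p, n_p : position and normal of the sampling point.
# The incident direction omega_i is derived from the two positions.
def f_near(x_l, x_p, n_p, n_l):
    d = x_l - x_p
    r2 = d @ d                         # squared distance to the light
    w_i = d / np.sqrt(r2)              # unit incident direction at the point
    return max(n_p @ w_i, 0.0) * max(n_l @ (-w_i), 0.0) / r2
```

Dividing a measured feature vector by this factor is what the predictions v′_d and v′_s in step (5) correspond to.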
7. The method for obtaining the material of the three-dimensional object based on a neural network according to claim 6, wherein in step (7) the process of obtaining the three-dimensional object material map from the material feature information is as follows:
(7.1) Sample the target three-dimensional object from m angles with a camera, then take the sampled pictures as input to obtain the three-dimensional mesh; for each vertex, choose the visible sampling angle ω with the smallest n_i·ω_o, where n_i is the normal vector of the three-dimensional mesh vertex and ω_o is the outgoing vector in the world coordinate system;
(7.2) taking the three-dimensional grid of the three-dimensional object as input to obtain a mapping relation graph from the material mapping to the grid;
(7.3) Each effective pixel on the mapping relation map corresponds to a sampling angle ω; combining the material feature vector v_i of the point obtained by the neural network, fit the material parameters; traversing the effective pixels on all mapping relation maps yields the final material map.
8. The method for obtaining the material of the three-dimensional object based on the neural network as claimed in claim 7, wherein in the step (7.3), the L-BFGS-B method is used to fit the material parameters, and the expression is as follows:
minimize(||v_i − NN[f_r(ω_i; ω_o, n′, t, p′)]||)
where p′ and t are the fitted quantities: p′ is the material parameter vector and t is the x-axis direction of the local coordinate system of the sampling point in the world coordinate system; n′ is the normal vector predicted by the neural network, and NN denotes the mapping of the neural network.
9. A method for optimizing a three-dimensional mesh using normal vectors, comprising the steps of:
(1) Sample the target three-dimensional object from m angles with a camera, at each angle illuminating it with the illumination patterns obtained by training according to the method of claim 1, and take the sampled pictures as input to obtain an initial three-dimensional point cloud and three-dimensional mesh; each vertex of the three-dimensional mesh carries an initial normal vector n_i;
(2) Carrying out re-meshing (remesh) on the three-dimensional grid obtained in the step (1);
(3) For each vertex, choose the visible sampling angle ω with the smallest n_i·ω_o; at this sampling angle, the normal vector prediction n′_i is obtained by the method of claim 1 and taken as the normal vector n_i of the point;
(4) Change every vertex position P_i in the current mesh to a new position P′_i to obtain a new mesh; this process is optimized with a normal vector loss function L_normal and a vertex position loss function L_position; combining L_normal and L_position gives the joint loss function L_opt, which is optimized using least squares;
(5) Return to step (3) until L_opt converges, completing the optimization of the three-dimensional mesh, and take this mesh as the final geometric model.
10. The method of optimizing a three-dimensional mesh using normal vectors according to claim 9, wherein in the step (4),
the normal vector loss function L_normal: a polygon {e′_0, e′_1, …, e′_k} is used as an approximation of the tangent plane, with k greater than 2, where e′_j is the j-th edge of the polygon formed by P′_i and its adjacent vertices; the loss function L_normal is expressed as follows:
L_normal = (1/n) · Σ_{i=1..n} Σ_{j=0..k} (n_i · e′_j)²
n is the number of three-dimensional mesh vertexes;
the vertex position loss function L_position is expressed as follows:

L_position = (1/n) · Σ_{i=1..n} ||A_i (P′_i − P_i)||²
where A_i = α n_i n_iᵀ + β (I − n_i n_iᵀ),
I is the identity matrix, and α, β are preset parameters with α, β ∈ [0, +∞); by adjusting α and β, the new position P′_i that optimizes the corresponding joint loss function L_opt is reached;
the joint loss function L_opt:

L_opt = λ · L_position + (1 − λ) · L_normal
Where λ ∈ [0, 1] is used to control the weight of both loss functions.
CN201910829629.XA 2019-09-03 2019-09-03 Method for acquiring normal vector, geometry and material of three-dimensional object based on neural network Active CN110570503B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910829629.XA CN110570503B (en) 2019-09-03 2019-09-03 Method for acquiring normal vector, geometry and material of three-dimensional object based on neural network

Publications (2)

Publication Number Publication Date
CN110570503A CN110570503A (en) 2019-12-13
CN110570503B true CN110570503B (en) 2021-04-16

Family

ID=68777640


Country Status (1)

Country Link
CN (1) CN110570503B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112419334A (en) * 2020-11-18 2021-02-26 山东大学 Micro surface material reconstruction method and system based on deep learning
CN112488176B (en) * 2020-11-26 2023-11-21 江苏科技大学 Processing characteristic recognition method based on triangular mesh and neural network
CN112734832A (en) * 2021-01-22 2021-04-30 逆可网络科技有限公司 Method for measuring real size of on-line object in real time
CN115249215A (en) * 2021-04-28 2022-10-28 北京字跳网络技术有限公司 Image processing method, image processing device, electronic equipment and readable storage medium
WO2022256962A1 (en) * 2021-06-07 2022-12-15 浙江大学 Freestyle acquisition method for high-dimensional material
CN113362466B (en) * 2021-06-07 2022-06-21 浙江大学 Free type collection method for high-dimensional material
CN115375847B (en) * 2022-08-25 2023-08-29 北京百度网讯科技有限公司 Material recovery method, three-dimensional model generation method and model training method

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7319467B2 (en) * 2005-03-29 2008-01-15 Mitsubishi Electric Research Laboratories, Inc. Skin reflectance model for representing and rendering faces
CN105389843A (en) * 2015-12-09 2016-03-09 河海大学 Global illumination real-time rendering method based on radial basis function neural network fitting
CN107679477A (en) * 2017-09-27 2018-02-09 深圳市未来媒体技术研究院 Face depth and surface normal Forecasting Methodology based on empty convolutional neural networks
CN108985333A (en) * 2018-06-15 2018-12-11 浙江大学 A kind of material acquisition methods neural network based and system
CN109658489A (en) * 2018-12-17 2019-04-19 清华大学 A kind of stereoscopic grid data processing method neural network based and system
CN109993692A (en) * 2019-04-09 2019-07-09 大连海事大学 A kind of data fusion method of electronic chart and radar image based on deep learning

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102096941B (en) * 2011-01-30 2013-03-20 北京航空航天大学 Consistent lighting method under falsehood-reality fused environment
CN109784288B (en) * 2019-01-22 2023-01-06 天津师范大学 Pedestrian re-identification method based on discrimination perception fusion




Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant