CN112401369B

CN112401369B - Body parameter measurement method, system, device, chip and medium based on human body reconstruction

Info

Publication number: CN112401369B
Application number: CN202011320729.9A
Authority: CN
Inventors: 姚想; 张迎梁
Original assignee: Plex VR Digital Technology Shanghai Co Ltd
Current assignee: Plex VR Digital Technology Shanghai Co Ltd
Priority date: 2020-11-23
Filing date: 2020-11-23
Publication date: 2024-10-15
Anticipated expiration: 2040-11-23
Also published as: CN112401369A

Abstract

Body parameter measurement method, system, device, chip and medium based on human body reconstruction, comprising: collecting color images of different visual angles around a human body; extracting image features to obtain a human body model; non-rigid deformation is carried out on the human body model, and the human body model is consistent with the template model; and extracting vertex coordinates of the deformed template model, and calculating to obtain body parameters. The method for measuring various body circumference parameters such as the body waistline and the like by fitting the body model through the body template model can measure the length and circumference information of a plurality of parts of the whole body, and has high measurement speed and high precision.

Description

Body parameter measurement method, system, device, chip and medium based on human body reconstruction

Technical Field

The invention relates to the fields of computer vision and computer graphics, in particular to a body parameter measurement method, system, equipment, chip and medium based on human body reconstruction.

Background

Body parameter measurement is used as a primary link of the clothing customization industry, and measurement accuracy and convenience of measurement operation are important. The human body is used as a non-rigid body with extremely large deformation, so that the difficulty of measurement is increased. The traditional measurement method mainly uses a flexible tape for manual measurement, the measurement precision is extremely easy to be influenced by subjective factors of a measurer (such as difficulty in accurately finding out a measurement part, difficulty in controlling the tightness degree of the tape and the like), the measurement mode is very troublesome, and a measurer with rich experience is required to find out the measurement part of the body of the measurer, and then uses the flexible tape for measurement from position to position; in addition, the measurement by using the tape manually inevitably results in the situation that the measurer is in close contact with the limb of the measurer, which causes some discomfort. The method realizes quick and accurate non-contact body parameter measurement, can be also used in the fields of medical health management and the like, and has great commercial value. CN201910631825.6 patent application discloses a method and device for measuring human parameters by using multi-depth camera, which comprises the following steps: collecting a current object depth image; fusing the depth images into regular point clouds; fitting a parameterized human body template, and solving morphological parameters and attitude parameters of the parameterized human body template, so that the SMPL human body template deformed based on the parameters is fitted with the obtained point cloud as much as possible; and acquiring the human body parameters of the current object by a human body parameter acquisition mode defined by the standard parameterized human body template. In the three-dimensional human model reconstruction stage, a mode of carrying out data fusion reconstruction by using 16 depth cameras to acquire point clouds is adopted, so that the hardware cost is high, the setting is complex, and the application is not extensive; body parameter measurement phase: the method of directly extracting grid vertices on the SMPL template model at the girth of the girth and the like, then calculating Euclidean distances between adjacent points one by one, and summing the calculated Euclidean distances as the girth can bring great errors at the girth and the like with complex topological structures. The reason is that the directly extracted grid vertices are not on the same plane, so that the curves forming the chest circumference are not on the same plane, and the normal measuring body circumference is measured on the same plane, which can lead to the measuring tape bending up and down for a circle, and the measuring result is far larger than the actual result.

Disclosure of Invention

The invention aims to solve the existing problems and aims to provide a body parameter measurement method, system, equipment, chip and medium based on human body reconstruction.

In order to achieve the above object, the technical solution of the present invention provides a body parameter measurement method based on human body reconstruction, including:

Step one, a camera collects color images of a plurality of different visual angles around a human body;

Extracting image features through the position and the posture of the camera and the internal and external parameter matrixes, predicting depth information of the image features and fusing the depth information to obtain a human body model;

Thirdly, carrying out non-rigid deformation on the human body model to enable the human body model to be consistent with the template model;

and step four, extracting vertex coordinates of the deformed template model, and calculating to obtain body parameters.

Further, in the first step, the camera surrounds the human body and captures an image every 120 °.

In the first step, the background of the portrait in the color image is removed in real time, the internal parameters and the external parameter matrix of the camera are output at the same time, and the height information is input.

Further, semantic segmentation is performed by adopting a convolutional neural network, namely, a target detection algorithm is executed to extract the position of a person, and then saliency detection is performed to obtain a human body image with the background removed.

In the second step, the camera extracts image features from the color image in real time by means of synchronous positioning and mapping based on monocular vision and an inertial sensor, and the image features are used for estimating the position and the posture of the camera.

Further, in the second step, the neural network model is trained through the color images of a plurality of visual angles, the camera internal reference matrix, the camera pose matrix and the high-precision human body three-dimensional model corresponding to the training set, so that the algorithm model can return to the three-dimensional human body model from any multi-visual angle human body color image.

In the third step, the mass centers of the template model and the human body model are overlapped, and after the corresponding points are extracted, the iterative optimization of the nearest neighbor points is carried out, so that coarse alignment is realized.

Further, after coarse alignment, the shape difference of the template model and the human body model is optimized, and the template model which can be used for expressing the human body model after denoising and deformation is obtained.

Further, after the bone registration, performing skin operation, namely establishing an association relation between bone points and grid vertices, and deforming the grid surface to adapt to the current bone posture.

Further, in the fourth step, the body parameters can be obtained by measuring the length of the curve formed by the vertices. The present invention also provides a system for measuring a body parameter, comprising:

the acquisition module acquires color images of a plurality of different visual angles around a human body;

The modeling module extracts image features through the position and the posture of the camera and the internal and external parameter matrix, predicts the depth information of the image features and fuses the depth information to obtain a human body model;

the deformation module is used for carrying out non-rigid deformation on the human body model to enable the human body model to be consistent with the template model;

and the calculation module is used for extracting vertex coordinates of the deformed template model and calculating body parameters.

Further, the acquisition module adopts APPLEARKIT platforms.

Further, the acquisition module shoots an RGB image around a human body at intervals of 120 degrees, simultaneously carries out real-time portrait matting at the mobile phone end through a neural network image segmentation algorithm to remove the background, shoots and outputs the internal parameters and the external parameters of the current camera, and reminds a user to input height information.

Further, APPLE ARKIT of the acquisition module extracts image features in real time from images acquired by the video stream based on the synchronous positioning and mapping modes of the monocular vision and the inertial sensor, and the image features are used for estimating the position and the posture of the user.

Further, the modeling module directly regresses and predicts the high-precision three-dimensional human body model by inputting parameters of the background-removed color images, the camera internal reference matrix and the camera pose matrix of three different visual angles into a neural network algorithm model trained in advance.

Further, the neural network algorithm model of the reconstruction module is obtained by training 500 high-precision human body three-dimensional models acquired by hundreds of color camera arrays, the training core is that three-view color images, camera internal reference matrixes and camera pose matrixes are rendered by using OpenGL to obtain human body three-dimensional models of a training set, 5000 3D points are sampled from the surface of the human body three-dimensional models and the inner and outer vicinity of the surface of the human body three-dimensional models, the corresponding relation between image pixel points and the sampled 3D points is obtained through the camera internal reference matrixes and the camera pose matrixes, the training model enables the human body three-dimensional models to predict the probability of the 3D points in the human body three-dimensional models from the images, and then the three-dimensional human body models are reconstructed.

Further, the deformation module calculates the mass centers of the template model and the human body model, then performs a rigid transformation, coincides the mass centers to the same point, respectively extracts the limb end points and the vertex points of the template model and the human body model as corresponding points, and performs a nearest neighbor point iterative algorithm to perform the rigid transformation so as to realize coarse alignment.

Further, after coarse alignment, a local rotation transformation matrix is executed at the joint where the template model needs to be deformed, the local rotation transformation matrix on the motion chain is multiplied, so that a global transformation matrix of the joint point can be obtained, and the bone deformation registration can be realized by minimizing the Euclidean distance between the corresponding joint points of the two.

Further, after bone registration, the deformation module performs skin operation, and the bone points and mesh vertex transformation formula:

w is the weight W of each bone point and the grid vertexes around the bone point; t is a transformation matrix T of each skeleton point, v _p is a mesh vertex before transformation, and v' _p is a vertex of the mesh after transformation.

Further, after grid gesture alignment, non-rigid deformation registration is performed: each vertex on the template model is subjected to a rotational-translational transformation such that each vertex thereon is positioned closest to a corresponding point on the manikin.

Further, the calculation module obtains vertex coordinates through the topological structure of the template model, and calculates body parameters. The invention provides a device for measuring a body parameter, comprising:

A camera that captures color images of several different perspectives around a human body;

The camera is in communication connection with the server; the server comprises a processor, and a memory for storing executable instructions of the processor, which when executed performs any of the methods described above. Further, the camera is an apple iphone mobile phone with APPLE ARKIT platforms.

The present invention provides a chip comprising a processor for calling and running a computer program from a memory, such that a device on which the chip is mounted performs any of the methods described above.

The present invention provides a computer readable medium having stored thereon computer program instructions which, when processed and executed, implement a method as described in any of the above.

Compared with the prior art, the method for measuring various body circumference parameters such as the waistline of the body by fitting the human body model containing noise information such as clothes folds through the human body template model can measure the length and circumference information of a plurality of parts of the whole body, has high measurement speed and high precision, is full-automatic non-contact measurement, can be widely used for tailoring clothes, managing and monitoring medical health, and has wide market application prospect.

The three-view human body image based on ARKit acquisition sparsity is adopted in the invention to reconstruct a high-precision human body three-dimensional model, the input is simple, the data acquisition software is very friendly to users, the manual intervention is less, other data processing and three-dimensional model reconstruction are all completed by the full automation of algorithm codes, and the invention has a very large application market.

Drawings

FIG. 1 is an explanatory diagram of capturing color images;

FIG. 2 is an explanatory diagram of capturing color images;

FIG. 3 is a schematic illustration of a phantom reconstructed from a color image, including noise;

FIG. 4 is a schematic illustration of a stencil model (TEMPLATE MESH);

FIG. 5 is a schematic illustration of a template model after pose fitting;

FIG. 6 is a schematic diagram of a template model after shape fitting;

Fig. 7 is an explanatory diagram of measurement results.

Detailed Description

The invention will now be further described with reference to the accompanying drawings.

Referring to fig. 1 to 5, fig. 1 to 5 illustrate an embodiment of the present invention, which aims at measuring a body parameter, mainly comprising:

step one, collecting color images of a plurality of different visual angles around a human body by using a camera;

Wherein, the equipment that adopts includes: a camera for capturing color images of several different perspectives around a human body. The camera is preferably an apple iphone mobile phone with APPLE ARKIT platforms; ARKit is an AR development platform as proposed by apples, where developers can create augmented reality applications using iPhone or iPad with ARKit tools. In this embodiment, ARKit implements functions such as data acquisition.

The camera is in communication connection with the server; the server comprises a processor, and a memory for storing executable instructions of the processor, which processor, when run, performs the method described above.

First, a color image of a human body is acquired. And can develop a mobile phone app of the iphone end based on the requirement of the method, the app has the following main functions:

Referring to fig. 1, an RGB image can be photographed every 120 ° around a human body, and simultaneously real-time portrait matting is performed through APPLE ARKIT to remove the background, and the internal parameters and external parameters of the current camera are output while photographing the image, and then the user is reminded to input height information. And transmitting the data to a server of a remote cloud through an http communication protocol.

The method for shooting the image every 120 degrees comprises the following steps: the mobile phone can rotate by 120 degrees each time by reading the rolling, yaw and pitching angles of the mobile phone acquired by the sensor through the sensor of the iPhone, such as a triaxial gyroscope, namely reminding a user of acquiring images.

Referring to fig. 2, in order to remove the influence of the background on the three-dimensional reconstruction of the human body, matting is required. The method for removing the portrait matting background comprises the following steps: performing semantic segmentation by adopting a convolutional neural network; firstly, a target detection algorithm is executed to extract the position of a person, and then saliency detection is carried out to obtain a human body image with the background removed.

The method for acquiring and processing the camera internal and external parameter matrix comprises the following steps: APPLE ARKIT can extract image features in real time from images acquired from video streams based on the slam mode of monocular vision and inertial sensors for estimating self position and pose.

Since the definition of the camera coordinate system in APPLE ARKIT is: the y-axis direction is opposite to the gravity direction, the x-axis direction is horizontal to the right, the z-axis direction meets the right-hand rule, and the definition of a camera coordinate system in an opencv library used in the subsequent reconstruction is as follows: the horizontal right and vertical downward are respectively the x-axis and the y-axis, and the z-axis meets the rule of the right hand, so that the coordinate system transformation is firstly carried out, and the coordinate system is unified.

Next, referring to fig. 3, a three-dimensional human body is reconstructed. Training data is rendered first. When three images are shot, the positions of the camera pose determine the positions of the camera pose in a world coordinate system, so 500 grids in a training set are required to be converted to the public visible positions of the three camera poses through rigid transformation, and then a camera projection matrix is obtained according to the camera internal reference matrix and the camera pose acquired in the stage one; and using an OpenGL render module, according to the projection matrix, three-dimensional points can be projected onto the 2D image from the grid, and images with three viewing angles can be generated. In order to increase the diversity of data and improve the generalization capability of an algorithm in the subsequent three-dimensional human body reconstruction, the embodiment also performs data enhancement work, namely Gaussian disturbance noise is respectively added to the distance from a camera to a grid, the pitch angle, the turnover angle and the yaw angle of the camera; in addition, the size and the orientation of the grid are randomly disturbed to a certain extent, and the camera pose and the internal reference matrix corresponding to the human body image generated after the noise disturbance is added are recalculated. Finally, a training set consisting of 500 sets of training data was obtained. Each set of data comprises: the method comprises the steps of removing background human body color pictures under three visual angles, corresponding camera internal reference matrixes, relative pose between the cameras and 3D point coordinates of the inside and the outside of a grid obtained by up-sampling the grid.

And obtaining a rough three-dimensional human body model through visual hull algorithm by using the background-removing images of three different visual angles, the corresponding camera internal reference matrix and the camera pose matrix. To obtain a high-precision three-dimensional mannequin, the neural network is trained: firstly, extracting image features through a convolutional neural network Hourglass, then sampling 3D points from grids, and then establishing a corresponding relation between the 3D points and 2D points of an image according to a camera projection matrix obtained by a camera pose and an internal reference matrix. On the basis, the multi-layer perceptron MLP predicts the probability that the 3D point corresponding to the image pixel is inside or outside the grid, three probabilities can be obtained each time due to the three-view image, the three probability values need to be averaged, and then the L2 distance between the predicted value and the true value is iteratively optimized in a supervised mode. And judging whether the result learned by the algorithm is good or not according to the intersection ratio between the predicted result and the real result, and indicating that the learning result is better when the IOU is closer to 1. Finally, the algorithm can well predict which points belong to points on the surface of the human body, and the final three-dimensional human body model can be reconstructed through the marking cube algorithm by using the discrete points. Then, the human body model is deformed in a non-rigid manner so as to conform to the template model.

Referring to fig. 4 and 5, bone deformation registration is performed first. In order to deform the human body model to the framework of the template model to be consistent, the human body model and the template model are required to be roughly aligned, the method of roughly aligning is to calculate the respective mass centers firstly, then execute a rigid body transformation, coincide the mass centers of two grids to the same point, respectively extract five points which are the four limb end points and the vertex point on the human body model and the template model as corresponding points, execute a nearest neighbor point iterative algorithm, and execute the rigid body transformation to realize the rough alignment.

After the rough alignment is completed, skeletal joint point deformation is needed, and skeletal actions of the human body model are deformed to be the same as those of the template model. The human body joint can move randomly within a certain range, and cascade relations exist among all the joint points, namely the movement of a father node can influence the movement of the human body joint and the movement of a son node of the father node. In order to realize bone deformation registration, the 3D coordinates of each joint point of a template model are known, then a local rotation transformation matrix is executed at the joint needing deformation, the local rotation transformation matrix on a motion chain is multiplied, the global transformation matrix of the joint point can be obtained, and the bone deformation registration can be realized by minimizing Euclidean distance between corresponding joint points of a human body model and the template model.

After bone registration, a skin operation is required, namely, an association relation between bone points and grid vertices is established, and the grid indication is deformed to adapt to the current bone posture. Each skeletal point may be associated with a number of vertices, and each vertex may be weighted differently. When a bone point undergoes rotational translational transformation, its associated mesh vertex also undergoes a different degree of positional transformation. Deformation of the whole grid can be controlled by controlling the rotation vector of the skeleton points, and the transformation formula of the grid vertex is as follows:

firstly, obtaining 3D coordinates of skeleton points of a template model, and calculating the weight w of each skeleton point and the surrounding mesh vertexes through a thermal diffusion equation; the basic principle is that the smaller the Euclidean distance from a bone point, the larger the weight, and the weight range is between 0 and 1. In the skeleton deformation registration stage, a transformation matrix T of each skeleton point can be obtained, v _p is a grid vertex before transformation, v' _p is a vertex of the transformed grid, and the coordinate of each vertex on the grid after gesture transformation can be obtained through the formula, so that a new grid is obtained.

Referring to fig. 6, after the grid is aligned, the shape alignment can be achieved by non-rigid deformation registration. The method for registering the non-rigid deformation comprises the following steps: and carrying out rotary translation transformation on each vertex on the template model so that the position of each vertex on the template model is closest to the corresponding point on the human body model, and simultaneously, the whole template model is kept smooth locally. In order to reduce the calculation amount and improve the fitting speed, the embodiment adopts a method of deforming the picture (deformation graph), namely, firstly taking the geodesic distance (geodesic distance) as a measurement standard on the human body model, uniformly and equally sampling all vertexes at fixed geodesic distance coefficients, and only optimizing the key points after sampling when subsequently optimizing the errors of corresponding points of the human body model and the template model, wherein the related points controlled by the key points only need to be ensured to be smooth during global optimization.

Finally, referring to fig. 7, parameters of the body circumference are measured. Since the topology of the phantom reconstructed each time is always changed, but the topology of the template model is not changed, the IDs of the vertices of the parts to be measured such as the arm circumference and the waistline can be easily extracted.

Because the number of the vertexes forming the waistline and other parts is very dense, and the distribution of the vertexes is basically on the same circumference plane, the distance between two adjacent points can be directly calculated and summed, and the waistline and other parts can be approximately expressed by polygons formed by dense straight line segments, so that a high-precision measurement result can be obtained.

However, the same point-taking method brings great errors at chest circumference with complex topological structure. The reason is that the vertices directly extracted are not on the same circumferential plane, so that the curves forming the chest circumference are not on the same plane, which can lead to a circle of the measuring tape bending up and down, resulting in a measurement result far greater than the actual result. The method and the device have the advantages that interpolation processing is carried out on the parts with the complex topological structures, so that the points forming the chest circumference are distributed in one plane as much as possible, and the measurement accuracy of the parts is greatly improved.

Referring to fig. 2, the present invention further provides a model optimized deployment system, comprising: the acquisition module acquires color images of a plurality of different visual angles around a human body;

Preferably, the acquisition module employs a APPLEARKIT platform.

Preferably, the acquisition module shoots an RGB image around a human body at intervals of 120 degrees, simultaneously carries out real-time portrait matting to remove the background through APPLE ARKIT, shoots and outputs the internal parameters and the external parameters of the current camera at the same time, and reminds a user to input height information.

Preferably, APPLE ARKIT of the acquisition module extracts image features in real time from images acquired by the video stream based on the synchronous positioning and mapping modes of the monocular vision and the inertial sensor, and the image features are used for estimating the position and the posture of the user.

Preferably, the modeling module obtains a rough three-dimensional human body model through visual hall algorithm through three background-removed color images with different visual angles, a camera internal reference matrix and a camera pose matrix; and extracting image features through a convolutional neural network Hourglass, sampling 3D points from the grid, establishing a corresponding relation between the 3D points and the 2D points of the image, and obtaining 3D points belonging to the surface of the human body through predicting depth information so as to reconstruct a human body model.

Preferably, the deformation module calculates the mass centers of the template model and the human body model, then performs a rigid transformation, coincides the mass centers to the same point, respectively extracts the limb end points and the vertex points of the template model and the human body model as corresponding points, and performs a nearest neighbor point iterative algorithm to perform the rigid transformation so as to realize coarse alignment.

Preferably, after coarse alignment, a local rotation transformation matrix is executed at the joint where the template model needs to be deformed, the local rotation transformation matrix on the motion chain is multiplied, so that a global transformation matrix of the joint point can be obtained, and the bone deformation registration can be realized by minimizing the Euclidean distance between the corresponding joint points of the two.

Preferably, after the bone registration, the deformation module performs a skinning operation, and the bone points and mesh vertex transformation formula:

Preferably, after the grid gestures are aligned, the deformation module performs non-rigid deformation registration: each vertex on the template model is subjected to a rotational-translational transformation such that each vertex thereon is positioned closest to a corresponding point on the manikin.

Preferably, the calculation module obtains vertex coordinates through the topological structure of the template model, and calculates the body parameters. In some implementations, the memory referred to in this example stores the following elements, an upgrade package, an executable unit, or a data structure, or a subset thereof, or an extended set thereof: an operating system and application programs.

The operating system includes various system programs, such as a framework layer, a core library layer, a driving layer, and the like, and is used for realizing various basic services and processing hardware-based tasks. And the application programs comprise various application programs and are used for realizing various application services. The program for implementing the method of the embodiment of the invention can be contained in an application program.

In the embodiment of the present invention, the processor is configured to execute the above method steps by calling a program or an instruction stored in the memory, specifically, a program or an instruction stored in an application program.

The embodiment of the invention also provides a chip for executing the method. Specifically, the chip includes: and a processor for calling and running the computer program from the memory, so that the device on which the chip is mounted is used for executing the above method.

The present invention also provides a computer readable storage medium having a computer program stored thereon, which when executed by a processor implements the steps of the above-described method of the present invention.

For example, machine-readable storage media may include, but are not limited to, various known and unknown types of non-volatile memory.

Embodiments of the present invention also provide a computer program product comprising computer program instructions for causing a computer to perform the above method.

Those of skill in the art will appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, or combinations of software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the solution. Those skilled in the art may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.

In embodiments of the present application, the disclosed systems, electronic devices, and methods may be implemented in other ways. For example, the division of the units is only one logic function division, and other division manners are also possible in actual implementation. For example, multiple units or components may be combined or may be integrated into another system. In addition, the coupling between the individual units may be direct coupling or indirect coupling. In addition, each functional unit in the embodiments of the present application may be integrated in one processing unit, or may exist alone physically, or the like.

The functions, if implemented in the form of software functional units and sold or used as stand-alone products, may be stored on a machine-readable storage medium. Accordingly, aspects of the present application may be embodied in a software product, which may be stored on a machine-readable storage medium, which may include instructions for causing an electronic device to perform all or part of the processes of the aspects described in embodiments of the present application. The storage medium may include a ROM, a RAM, a removable disk, a hard disk, a magnetic disk, or an optical disk, etc. various media in which program codes can be stored.

It should be especially noted that, unlike the method of measuring parameters of multiple depth cameras and the device described in the foregoing description of the multiple depth cameras, the present embodiment directly optimizes parameters of controlling actions and shapes of the motion and shape parameters in the SMPL template model, so as to implement a method of fitting the motion and shapes of the motion and shape parameters in the SMPL template model to the depth camera array reconstruction. The method has the following two problems that the SMPL template model is poor in fitting the human body model:

1) The SMPL template model is not well matched with the Chinese posture;

2) The parameters of the SMPL template model for controlling the shape change of the human body are only 10, and the accurate change of the shape of the human body cannot be realized.

When the shape of the template model cannot be accurately fitted to the original human body model, larger errors are brought to the measurement of the body parameters, and the application is invalid.

The template model used in this embodiment is preferably manually trimmed by an animator using MAYA software on the basis of SMPL so that it better fits the average of the usual posture of chinese.

More importantly, the embodiment fits the motion in a bone-driven mode, but in the shape fitting part, non-rigid deformation registration is carried out on the template model, so that the local geometric detail surface of the template model can be optimized to be consistent with the original human model, the template model can express the shape of the human model more accurately, and the measurement accuracy is ensured.

In summary, the embodiment can fit a human body model containing noise information such as clothes folds through the human body template model, and then measure various body circumference parameters such as body waistline, and the like, so that the length and circumference information of a plurality of parts of the whole body can be measured, the measurement speed is high, the precision is high, the full-automatic non-contact measurement is realized, the method can be widely used for tailoring clothes, medical health management and monitoring, and the market application prospect is wide.

The data acquisition APP based on ARKit development adopted in the embodiment is used for acquiring sparse three-view human body images, and a high-precision human body three-dimensional model is reconstructed. The human body three-dimensional reconstruction method is simple in input, the data acquisition software is very friendly to users, the manual intervention is less, other data processing, three-dimensional model reconstruction and geometric texture mapping are all completed by the algorithm codes in a full-automatic mode, and the method has a large application market.

The embodiments of the present invention have been described above with reference to the accompanying drawings and examples, which are not to be construed as limiting the invention, but rather as modifications, variations or adaptations thereof may be made by those skilled in the art within the scope of the appended claims.

Claims

1. A body parameter measurement method based on human body reconstruction, characterized by comprising:

Firstly, a camera collects color images of a plurality of different visual angles around a human body, removes the background of the human image in the color images in real time, shoots and outputs an internal parameter matrix and an external parameter matrix of the camera at the same time, and inputs height information;

Extracting image features through the position and the posture of the camera, an internal parameter matrix and an external parameter matrix, predicting depth information of the image features, and fusing the depth information to obtain a human body model;

training a neural network algorithm model through a color image with a plurality of visual angles, a camera internal parameter matrix, a camera pose matrix and a high-precision human body three-dimensional model, so that the neural network algorithm model can reconstruct a human body model from the multi-visual angle color image;

Overlapping the mass centers of the template model and the human body model, extracting corresponding points of the limbs and the top of the head, and then performing iterative optimization on nearest neighbor points to realize coarse alignment; executing a local rotation transformation matrix at a joint of the template model to be deformed, multiplying the local rotation transformation matrix on the motion chain to obtain a global transformation matrix of the joint points, and realizing bone deformation registration by minimizing Euclidean distance between corresponding joint points of the human body model and the template model; performing skin operation after bone deformation registration, namely establishing an association relation between bone points and grid vertexes, and deforming the grid to adapt to the current bone posture; carrying out non-rigid deformation registration, namely carrying out rotation translation transformation on each vertex on the template model so that the position of each vertex on the template model is closest to a corresponding point on the human body model;

2. The body parameter measurement method based on human body reconstruction according to claim 1, wherein: in step one, the camera takes an image around the human body at every 120 °.

3. The body parameter measurement method based on human body reconstruction according to claim 1, wherein: the background of the portrait in the color image is removed by adopting a convolutional neural network to perform semantic segmentation, namely, a target detection algorithm is executed to extract the position of the person, and then saliency detection is performed to obtain the human body image with the background removed.

4. A body parameter measurement method based on human reconstruction according to claim 3, characterized in that: in the second step, the camera extracts image features from the color image in real time through a synchronous positioning mapping mode based on monocular vision and an inertial sensor, and the image features are used for estimating the position and the posture of the camera.

5. A system for measuring a body parameter, comprising:

the acquisition module acquires color images of a plurality of different visual angles around a human body; removing the background of the portrait in real time, shooting and outputting an internal parameter matrix and an external parameter matrix of the current camera at the same time, and reminding a user to input height information; extracting image features in real time for estimating the position and the posture of the user;

The modeling module extracts image features through the position and the gesture of the camera, an internal parameter matrix and an external parameter matrix, predicts the depth information of the image features and fuses the depth information to obtain a human body model; the method comprises the steps of inputting background-removed color images of three different visual angles, an internal parameter matrix of a camera and parameters of a pose matrix of the camera into a neural network algorithm model trained in advance, and directly regressively predicting a high-precision three-dimensional human body model;

Calculating the mass centers of the template model and the human body model, overlapping the mass centers to the same point, respectively extracting the limb end points and the vertex points of the template model and the human body model as corresponding points, and executing a nearest neighbor point iterative algorithm to perform rigid body transformation so as to realize coarse alignment; executing a local rotation transformation matrix at the joint of the template model needing to be deformed, and multiplying the local rotation transformation matrix on the motion chain to obtain a global transformation matrix of the joint points, wherein the bone deformation registration is realized by minimizing Euclidean distance between the corresponding joint points of the two; performing skin operation after bone deformation registration, namely establishing an association relation between bone points and grid vertexes, and deforming the grid to adapt to the current bone posture; carrying out non-rigid deformation registration, namely carrying out rotation translation transformation on each vertex on the template model so that the position of each vertex on the template model is closest to a corresponding point on the human body model;

6. An apparatus for measuring a body parameter, comprising:

The camera is in communication connection with the server; the server comprising a processor, and a memory for storing executable instructions of the processor, which processor, when run, performs the method of any of claims 1-4.

7. A chip comprising a processor for calling and running a computer program from a memory, such that a device on which the chip is mounted performs the method of any of claims 1-4.

8. A computer-readable medium, characterized by: on which computer program instructions are stored which, when processed and executed, implement the method of any of claims 1-4.