WO2023024036A1 - Method and apparatus for reconstructing three-dimensional model of person - Google Patents

Method and apparatus for reconstructing three-dimensional model of person Download PDF

Info

Publication number
WO2023024036A1
WO2023024036A1 PCT/CN2021/114840 CN2021114840W WO2023024036A1 WO 2023024036 A1 WO2023024036 A1 WO 2023024036A1 CN 2021114840 W CN2021114840 W CN 2021114840W WO 2023024036 A1 WO2023024036 A1 WO 2023024036A1
Authority
WO
WIPO (PCT)
Prior art keywords
model
grid
voxel
target person
image
Prior art date
Application number
PCT/CN2021/114840
Other languages
French (fr)
Chinese (zh)
Inventor
白蔚
李万琦
胡伟
于金波
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to PCT/CN2021/114840 priority Critical patent/WO2023024036A1/en
Priority to CN202180056556.0A priority patent/CN116157842A/en
Publication of WO2023024036A1 publication Critical patent/WO2023024036A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00Three dimensional [3D] modelling, e.g. data description of 3D objects

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Computer Graphics (AREA)
  • Geometry (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Processing Or Creating Images (AREA)

Abstract

Provided are a method and apparatus for reconstructing a three-dimensional model of a person, which relate to the technical field of media. A more complete three-dimensional model of a person having more abundant detailed information can be obtained, which not only extends the application scenarios of a three-dimensional model of a person, but also improves the quality of a reconstructed three-dimensional model of a person, thereby improving user experience. The method is applied to an electronic device, and comprises: an electronic device obtaining a first scale image and a second scale image of a target person, the first scale image comprising a first part of the target person, the second scale image comprising at least a second part of the target person, and the first part being part of the second part; then determining a first mesh three-dimensional model corresponding to the first scale image and determining a second mesh three-dimensional model corresponding to a second scale image; and fusing the first mesh three-dimensional model and the second mesh three-dimensional model so as to obtain a target three-dimensional model, the target three-dimensional model being used for displaying at least the second part of the target person.

Description

一种人物三维模型的重建方法及装置Method and device for reconstructing a three-dimensional model of a person 技术领域technical field
本申请实施例涉及媒体技术领域,尤其涉及一种人物三维模型的重建方法及装置。The embodiments of the present application relate to the field of media technologies, and in particular to a method and device for reconstructing a 3D model of a character.
背景技术Background technique
随着增强现实、虚拟现实技术的发展,以三维模型重建为核心的数字人物产品(即虚拟的3D人物)在娱乐、教育、金融、旅游等领域得到广泛的应用。With the development of augmented reality and virtual reality technology, digital character products (virtual 3D characters) centered on 3D model reconstruction have been widely used in entertainment, education, finance, tourism and other fields.
目前,人物三维模型重建可以包括对人物头部、脸部、上半身或全身进行三维模型重建。以对脸部三维模型重建和人体(即全身)进行三维模型重建为例,一种人物三维模型的重建方法是:对于脸部三维模型重建,将单张人脸图像输入至人脸体素回归网络,通过人脸体素回归网络的分析计算,得到人脸体素模型(人脸体素模型是三维模型);对于人体三维模型重建,将单张人体图像输入至人体体素回归网络,通过人体体素回归网络的分析计算,得到人体体素模型。Currently, the reconstruction of the 3D model of the character may include reconstruction of the 3D model of the head, face, upper body or whole body of the character. Taking the 3D model reconstruction of the face and the 3D model reconstruction of the human body (that is, the whole body) as an example, a method for reconstructing a 3D model of a person is: for the reconstruction of the 3D model of the face, a single face image is input into the face voxel regression network, through the analysis and calculation of the face voxel regression network, the face voxel model is obtained (the face voxel model is a three-dimensional model); for the reconstruction of the human body three-dimensional model, a single human body image is input into the human body voxel regression network, The analysis and calculation of the human body voxel regression network obtains the human body voxel model.
上述脸部三维模型仅能反映人物的局部信息,在很多应用场景中,可能还需要人物的更多的信息(例如上半身中除脸部之外的其他信息),因此单独的脸部三维模型的应用场景比较窄;上述人体三维模型可以反映人物的全局信息,但是对于一些细节信息的重建效果较差,例如人体三维模型中脸部区域的细节比较粗糙,也就是说,上述人体三维模型的重建质量较差,导致用户体验较差。The above 3D face model can only reflect the local information of the person. In many application scenarios, more information about the person may be needed (for example, information other than the face in the upper body), so the 3D model of the face alone The application scenarios are relatively narrow; the above-mentioned 3D human body model can reflect the overall information of the person, but the reconstruction effect of some detailed information is poor, for example, the details of the face area in the human 3D model are relatively rough, that is to say, the reconstruction of the above-mentioned 3D human body model Poor quality, resulting in a poor user experience.
发明内容Contents of the invention
本申请实施例提供一种人物三维模型的重建方法及装置,能够提高重建的人物三维模型的质量。Embodiments of the present application provide a method and device for reconstructing a three-dimensional model of a character, which can improve the quality of a reconstructed three-dimensional model of a character.
为达到上述目的,本申请实施例采用如下技术方案:In order to achieve the above purpose, the embodiment of the present application adopts the following technical solutions:
第一方面,本申请实施例提供一种人物三维模型的重建方法,应用于电子设备,该方法包括:电子设备获取目标人物的第一尺度图像和第二尺度图像,该第一尺度图像包括目标人物的第一部分,第二尺度图像包括目标人物的至少第二部分,该第一部分是该第二部分的一部分;然后电子设备确定第一尺度图像对应的第一网格三维模型,并且确定第二尺度图像对应的第二网格三维模型;以及电子设备对第一网格三维模型和第二网格三维模型进行融合处理,以得到目标三维模型,该目标三维模型用于显示目标人物的至少第二部分。In the first aspect, an embodiment of the present application provides a method for reconstructing a three-dimensional model of a person, which is applied to an electronic device. The method includes: the electronic device acquires a first scale image and a second scale image of a target person, and the first scale image includes the target The first part of the person, the second scale image includes at least a second part of the target person, the first part is a part of the second part; then the electronic device determines the first grid 3D model corresponding to the first scale image, and determines the second the second grid 3D model corresponding to the scale image; and the electronic device fuses the first grid 3D model and the second grid 3D model to obtain a target 3D model, and the target 3D model is used to display at least the first grid 3D model of the target person two parts.
通过本申请实施例提供的人物三维模型的重建方法,结合小尺度图像(例如第一尺度图像)的三维模型所体现的细节信息丰富、分辨率高的优点,以及大尺度图像(例如第二尺度图像)的三维模型能体现较大范围的人物特征(即整体性)的优点,将目标人物不同尺度的图像的三维模型进行融合,得到模型更加完整、且细节信息更加丰富的目标人物的三维模型,不仅延伸了人物三维模型的应用场景,而且提升了重建的人物三维模型的质量,从而提升用户体验。Through the method for reconstructing a 3D model of a character provided by the embodiment of the present application, the advantages of rich detailed information and high resolution embodied in the 3D model of a small-scale image (such as a first-scale image) and the advantages of a large-scale image (such as a second-scale image) The 3D model of the image) can reflect the advantages of a wide range of character characteristics (that is, integrity), and the 3D models of the images of different scales of the target person are fused to obtain a 3D model of the target person with a more complete model and richer detailed information. , which not only extends the application scenarios of the three-dimensional character model, but also improves the quality of the reconstructed three-dimensional model of the character, thereby improving user experience.
一种可能的实现方式中,上述确定第一尺度图像对应的第一网格三维模型,的方 法可以包括:基于第一体素回归网络,确定第一尺度图像对应的第一体素三维模型,并且将第一体素三维模型转换为第一网格三维模型。In a possible implementation manner, the method for determining the first grid 3D model corresponding to the first scale image may include: determining the first voxel 3D model corresponding to the first scale image based on the first voxel regression network, And convert the first voxel three-dimensional model into a first grid three-dimensional model.
本申请实施例中,将第一尺度图像输入至第一体素回归网络,能够得到第一尺度图像对应的第一体素三维模型,应理解,体素三维模型是体素回归网络的输出。In the embodiment of the present application, the first scale image is input to the first voxel regression network, and the first voxel three-dimensional model corresponding to the first scale image can be obtained. It should be understood that the voxel three-dimensional model is the output of the voxel regression network.
可选地,第一体素回归网络是卷积神经网络,该卷积神经网络可以为堆叠沙漏网络(stacked hourglass network),该第一体素回归网络是基于采集的多组二维图像和二维图像对应的标注了真实体素值的体素三维模型样本(训练数据集)对预设的堆叠沙漏网络训练得到的。应理解,第一尺度图像为目标人物不同部分的图像时,第一体素回归网络是基于对应的数据集训练得到,例如,第一尺度图像为目标人物的人脸图像,则第一体素回归网络是基于多组人脸图像和人脸图像对应的标注了真实体素值的人脸体素三维模型样本得到。Optionally, the first voxel regression network is a convolutional neural network, the convolutional neural network may be a stacked hourglass network, and the first voxel regression network is based on multiple groups of two-dimensional images collected and two The voxel 3D model sample (training data set) corresponding to the 3D image marked with the real voxel value is obtained by training the preset stacked hourglass network. It should be understood that when the first-scale image is an image of a different part of the target person, the first voxel regression network is trained based on the corresponding data set. For example, if the first-scale image is the face image of the target person, then the first voxel The regression network is obtained based on multiple groups of face images and face voxel 3D model samples corresponding to the face images with real voxel values marked.
综上,可以理解的是,当第一尺度图像为目标人物的人脸图像时,上述第一体素回归网络是用于预测人脸体素三维模型的体素回归网络。当第一尺度图像为目标人物的上半身图像时,上述第一体素回归网络是用于预测上半身体素三维模型的体素回归网络。To sum up, it can be understood that when the first scale image is the face image of the target person, the above-mentioned first voxel regression network is a voxel regression network for predicting the voxel three-dimensional model of the face. When the first scale image is the upper body image of the target person, the above-mentioned first voxel regression network is a voxel regression network for predicting the voxel three-dimensional model of the upper body.
可选地,本申请实施例中,电子设备将第一体素三维模型转换为第一网格三维模型的方法可以是:将提取的等值面上的三维顶点按照预设的规则连接成多边形(例如三角形),从而形成第一网格三维模型,该预设的规则可以是按照从左到右,从上到下的顺序依次将最近的三个三维顶点进行连接。Optionally, in the embodiment of the present application, the method for the electronic device to convert the first voxel 3D model into the first mesh 3D model may be: connect the extracted 3D vertices on the isosurface into polygons according to preset rules (such as a triangle), thereby forming the first grid three-dimensional model, the preset rule may be to connect the three nearest three-dimensional vertices in sequence from left to right and from top to bottom.
可选地,电子设备也可以采用立体渲染的方法将第一体素三维模型转换为第一网格三维模型。Optionally, the electronic device may also convert the first voxel 3D model into the first grid 3D model by using a stereo rendering method.
一种可能的实现方式中,上述确定第二尺度图像对应的第二网格三维模型的方法可以包括:基于第二体素回归网络,确定第二尺度图像对应的第二体素三维模型,并且将第二体素三维模型转换为第二网格三维模型。In a possible implementation manner, the method for determining the second grid 3D model corresponding to the second scale image may include: determining the second voxel 3D model corresponding to the second scale image based on the second voxel regression network, and The second voxel 3D model is converted to a second mesh 3D model.
本申请实施例中,将第二尺度图像输入至第二体素回归网络,能够得到第二尺度图像对应的第二体素三维模型。In the embodiment of the present application, the second scale image is input to the second voxel regression network, and the second voxel three-dimensional model corresponding to the second scale image can be obtained.
与第一体素回归网络的结构类似,第二体素回归网络也可以是卷积神经网络,该卷积神经网络为堆叠沙漏网络,该第二体素回归网络是基于采集的多组二维图像和二维图像对应的标注了真实体素值的体素三维模型样本(训练数据集)对预设的堆叠沙漏网络训练得到的。应理解,第二尺度图像为目标人物不同部分的图像时,第二体素回归网络是基于对应的数据集训练得到,例如,第二尺度图像为目标人物的上半身图像,则第二体素回归网络是基于多组上半身图像和上半身图像对应的标注了真实体素值的上半身体素三维模型样本得到。Similar to the structure of the first voxel regression network, the second voxel regression network can also be a convolutional neural network, the convolutional neural network is a stacked hourglass network, and the second voxel regression network is based on multiple sets of collected two-dimensional The voxel 3D model sample (training data set) corresponding to the image and the 2D image is obtained by training the preset stacked hourglass network. It should be understood that when the second-scale image is an image of a different part of the target person, the second voxel regression network is trained based on the corresponding data set. For example, if the second-scale image is the upper body image of the target person, then the second voxel regression network The network is obtained based on multiple sets of upper body images and upper body voxel 3D model samples corresponding to the upper body images with real voxel values marked.
综上,可以理解的是,当第二尺度图像为目标人物的上半身图像时,上述第二体素回归网络是用于预测上半身体素三维模型的体素回归网络。当第二尺度图像为目标人物的全身图像时,上述第二体素回归网络是用于预测人体体素三维模型的体素回归网络。To sum up, it can be understood that when the second-scale image is the upper body image of the target person, the above-mentioned second voxel regression network is a voxel regression network for predicting the upper body voxel three-dimensional model. When the second-scale image is a whole-body image of the target person, the above-mentioned second voxel regression network is a voxel regression network for predicting a voxel three-dimensional model of a human body.
需要说明的是,电子设备将第二体素三维模型转换为第二网格三维模型的方法与电子设备将第一体素三维模型转换为第一网格三维模型方法类似,因此,对于电子设 备将第二体素三维模型转换为第二网格三维模型的方法的详细描述可以参考上述电子设备将第一体素三维模型转换为第一网格三维模型的过程的描述,此处不再赘述。It should be noted that the method for the electronic device to convert the second voxel 3D model into the second grid 3D model is similar to the method for the electronic device to convert the first voxel 3D model into the first grid 3D model. Therefore, for the electronic device For a detailed description of the method for converting the second voxel 3D model into the second grid 3D model, please refer to the description of the process of converting the first voxel 3D model into the first grid 3D model by the above-mentioned electronic device, and details will not be repeated here .
一种可能的实现方式中,上述对第一网格三维模型和第二网格三维模型进行融合处理,以得到目标三维模型的具体过程包括:将第一网格三维模型转换至二维平面,得到第一平面展开图;并且将第二网格三维模型转换至二维平面,得到第二平面展开图,其中,该第一平面展开图中的第一图像区域对应第二平面展开图中的第二图像区域,该第一图像区域和第二图像区域对应目标人物的第一部分;然后对第一平面展开图进行裁剪,以获取第一图像区域,对第二平面展开图进行裁剪,以获取第二图像区域;并将第二平面展开图中的第二图像区域替换为第一图像区域,得到目标人物的目标平面展开图;最后,对目标平面展开图进行三维转换,得到目标三维模型。In a possible implementation manner, the specific process of merging the first grid 3D model and the second grid 3D model to obtain the target 3D model includes: converting the first grid 3D model to a 2D plane, obtaining a first plane expansion; and converting the second grid three-dimensional model to a two-dimensional plane to obtain a second plane expansion, wherein the first image area in the first plane expansion corresponds to the second plane expansion The second image area, the first image area and the second image area correspond to the first part of the target person; then the first plane expansion diagram is cropped to obtain the first image area, and the second plane expansion diagram is cropped to obtain the second image area; and replacing the second image area in the second plane expanded view with the first image area to obtain a target plane expanded view of the target person; finally, performing a three-dimensional transformation on the target plane expanded view to obtain a target three-dimensional model.
示例性的,第一网格三维模型是人脸的网格三维模型,第二网格三维模型为上半身的网格三维模型,则第一平面展开图中的第一图像区域可以为人脸对应的区域,则第二平面展开图中的第二图像区域也是人脸对应的区域,即第一图像区域和第二图像区域对应目标人物的第一部分(即人脸部分)。Exemplarily, the first grid 3D model is a grid 3D model of a human face, and the second grid 3D model is a grid 3D model of an upper body, then the first image area in the first plane expanded view may be a grid corresponding to a human face. area, the second image area in the second plane expanded view is also the area corresponding to the face, that is, the first image area and the second image area correspond to the first part (ie, the face part) of the target person.
本申请实施例中,电子设备可以采用二维参数化技术将第一网格三维模型和第二网格三维模型投影至二维平面,得到第一平面展开图和第二平面展开图。例如可以采用圆柱形投影,将一个圆柱面包围椭球体,并使之相切或相割,再根据某种条件将椭球面上的经纬网点投影到圆柱面上,然后,沿着圆柱面的一条母线切开,将其展成平面,得到第一平面展开图或第二平面展开图。In the embodiment of the present application, the electronic device may project the first grid three-dimensional model and the second grid three-dimensional model to a two-dimensional plane by using two-dimensional parameterization technology to obtain the first plane expansion diagram and the second plane expansion diagram. For example, a cylindrical projection can be used to surround a cylindrical surface and make it tangent or cut, and then project the latitude and longitude points on the ellipsoid surface onto the cylindrical surface according to certain conditions, and then, along a line of the cylindrical surface, The bus bar is cut open and developed into a plane to obtain the first plane expansion diagram or the second plane expansion diagram.
以第一图像区域为人脸部分为例,电子设备可以根据人脸特征点生成脸部范围的掩膜(也可以称为脸部区域框),然后根据脸部区域框从第一平面展开图和第二平面展开图裁剪出人脸部分。Taking the first image region as the face part as an example, the electronic device can generate a mask of the face range (also called a face region frame) according to the face feature points, and then expand the image from the first plane and The second plane expanded view crops out the face part.
应理解,上述电子设备从第一平面展开图中裁剪得到第一图像区域,并且电子设备对第二平面展开图进行裁剪,去掉第二平面展开图中的第二图像区域,电子设备分别对第一图像区域的边缘进行平滑,对第二平面展开图中去除第二图像区域之后的边缘也进行平滑之后,电子设备将第一图像区域的边缘与第二平面展开图中去除第二图像区域之后的边缘进行缝合,得到目标平面展开图。具体的,电子设备将第一图像区域的边缘和第二平面展开图中去除第二图像区域之后的边缘作为约束条件,计算这两个边缘的最接近的顶点,然后将两个边缘中的最接近的顶点连接起来,实现边缘缝合。It should be understood that the electronic device cuts out the first image area from the first expanded view, and the electronic device cuts the second expanded view to remove the second image area in the second expanded view. After smoothing the edge of an image area, and smoothing the edge after removing the second image area in the second plane expansion diagram, the electronic device compares the edge of the first image area with the second plane expansion diagram after removing the second image area The edge is stitched to obtain the target plane expansion diagram. Specifically, the electronic device takes the edge of the first image region and the edge after removing the second image region in the second plane expansion diagram as constraint conditions, calculates the closest vertex of the two edges, and then calculates the closest vertex of the two edges to Close vertices are joined to achieve edge stitching.
一种可能的实现方式中,在对第一网格三维模型和第二网格三维模型进行融合处理之前,本申请实施例提供的人物三维模型的重建方法还包括:对第一网格三维模型和/或第二网格三维模型进行网格平滑处理或网格简化处理中的至少一种。In a possible implementation manner, before performing fusion processing on the first grid 3D model and the second grid 3D model, the method for reconstructing the character 3D model provided in the embodiment of the present application further includes: first grid 3D model And/or the second mesh 3D model performs at least one of mesh smoothing or mesh simplification.
可选地,电子设备获得的第一网格三维模型和第二网络三维模型中,可能存在噪声网格(即与实际模型偏差较大的网格),为了得到更加准确的目标三维模型,电子设备可以采用相关的网格平滑算法对第一网格三维模型和/或第二网格三维模型进行网格平滑处理,以删除噪声网格。网格平滑算法可以为Taubin平滑算法、拉普拉斯(Laplacian)平滑算法、平均曲率(Curvature)平滑算法中的任一种,具体根据实际情况选择,本申请实施例不做限定。Optionally, in the first grid 3D model and the second network 3D model obtained by the electronic device, there may be noise grids (that is, grids with a large deviation from the actual model), in order to obtain a more accurate target 3D model, the electronic The device may perform grid smoothing processing on the first grid 3D model and/or the second grid 3D model by using a related grid smoothing algorithm, so as to delete noise grids. The mesh smoothing algorithm may be any one of Taubin smoothing algorithm, Laplacian smoothing algorithm, and average curvature (Curvature) smoothing algorithm, which is selected according to actual conditions, and is not limited in this embodiment of the present application.
可选地,电子设备获得的第一网格三维模型和第二网络三维模型中,网格的密度 可能较大,如此,进行网格融合处理的计算量较大,且耗时较久,为了降低网格三维模型融合过程中的计算量,电子设备可以采用网格简化算法对第一网格三维模型和/或第二网格三维模型进行网格简化处理。网格简化算法可以为边折叠(Edge Collapse)算法、基于度量的边折叠算法中的任一种,具体根据实际情况选择,本申请实施例不做限定。Optionally, in the first grid 3D model and the second network 3D model obtained by the electronic device, the density of the grids may be relatively large, so that the calculation of the grid fusion processing is relatively large and takes a long time. In order to To reduce the amount of calculation in the mesh 3D model fusion process, the electronic device may use a mesh simplification algorithm to perform mesh simplification processing on the first mesh 3D model and/or the second mesh 3D model. The mesh simplification algorithm can be any one of edge collapse (Edge Collapse) algorithm and metric-based edge collapse algorithm, which is selected according to the actual situation, and is not limited in the embodiment of the present application.
当需要对网格三维模型进行网格平滑处理和网格简化处理时,可以不限定这两种处理的顺序,例如,可以先对网格三维模型进行网格平滑处理,后对平滑之后的网格三维模型进行网格简化处理;也可以先对网格三维模型进行网格简化处理,后对简化之后的网格三维模型进行网格平滑处理。When it is necessary to perform mesh smoothing and mesh simplification on the mesh 3D model, the order of these two processes is not limited. For example, the mesh 3D model can be mesh smoothed first, and then the smoothed mesh The mesh simplification process can be performed on the grid 3D model; the grid simplification process can also be performed on the grid 3D model first, and then the grid smoothing process can be performed on the simplified grid 3D model.
在一种实现方式中,先对网格三维模型进行网格平滑处理,后对平滑之后的网格三维模型进行网格简化处理,这样能够更好地提升重建的三维模型的质量。In an implementation manner, the grid smoothing process is first performed on the grid three-dimensional model, and then the grid simplification process is performed on the smoothed grid three-dimensional model, which can better improve the quality of the reconstructed three-dimensional model.
一种可能的实现方式中,上述第一部分为目标人物的人脸,第二部分为目标人物的上半身;或者,第一部分为目标人物的上半身,第二部分为目标人物的全身。In a possible implementation, the first part is the face of the target person, and the second part is the upper body of the target person; or, the first part is the upper body of the target person, and the second part is the whole body of the target person.
第二方面,本申请实施例提供一种用于人物三维模型的重建装置,应用于电子设备,该装置包括:获取模块、确定模块和融合模块。其中,获取模块用于获取目标人物的第一尺度图像和第二尺度图像,该第一尺度图像包括目标人物的第一部分,该第二尺度图像包括目标人物的至少第二部分,第一部分是第二部分的一部分;确定模块于确定第一尺度图像对应的第一网格三维模型,并且确定第二尺度图像对应的第二网格三维模型;融合模块用于对第一网格三维模型和第二网格三维模型进行融合处理,以得到目标三维模型,该目标三维模型用于显示目标人物的至少第二部分。In a second aspect, the embodiment of the present application provides a reconstruction device for a 3D model of a character, which is applied to an electronic device, and the device includes: an acquisition module, a determination module, and a fusion module. Wherein, the obtaining module is used to obtain a first scale image of the target person and a second scale image, the first scale image includes a first part of the target person, the second scale image includes at least a second part of the target person, the first part is the first part A part of the second part; the determining module is used to determine the first grid 3D model corresponding to the first scale image, and determine the second grid 3D model corresponding to the second scale image; the fusion module is used for the first grid 3D model and the second grid 3D model The two mesh 3D models are fused to obtain a target 3D model, and the target 3D model is used to display at least the second part of the target person.
一种可能的实现方式中,上述确定模块具体用于基于第一体素回归网络,确定第一尺度图像对应的第一体素三维模型,并且将第一体素三维模型转换为第一网格三维模型;以及基于第二体素回归网络,确定第二尺度图像对应的第二体素三维模型,并且将第二体素三维模型转换为第二网格三维模型。In a possible implementation, the determination module is specifically configured to determine the first voxel three-dimensional model corresponding to the first scale image based on the first voxel regression network, and convert the first voxel three-dimensional model into the first grid a three-dimensional model; and based on the second voxel regression network, determine a second voxel three-dimensional model corresponding to the second-scale image, and convert the second voxel three-dimensional model into a second grid three-dimensional model.
一种可能的实现方式中,上述融合模块具体用于将第一网格三维模型转换至二维平面,得到第一平面展开图;并且将第二网格三维模型转换至二维平面,得到第二平面展开图;该第一平面展开图中的第一图像区域对应第二平面展开图中的第二图像区域,该第一图像区域和第二图像区域对应第一部分;以及对第一平面展开图进行裁剪,以获取第一图像区域,对第二平面展开图进行裁剪,以获取第二图像区域;并将第二平面展开图中的第二图像区域替换为第一图像区域,得到目标人物的目标平面展开图;再对目标平面展开图进行三维转换,得到目标三维模型。In a possible implementation, the above-mentioned fusion module is specifically used to convert the 3D model of the first grid to a 2D plane to obtain the unfolded view of the first plane; and convert the 3D model of the second grid to a 2D plane to obtain the second Two plane expansion diagrams; the first image area in the first plane expansion diagram corresponds to the second image area in the second plane expansion diagram, and the first image area and the second image area correspond to the first part; and the first plane expansion The image is cropped to obtain the first image area, and the second plane expanded view is cropped to obtain the second image area; and the second image area in the second plane expanded view is replaced with the first image area to obtain the target person The target plane expansion diagram; and then perform three-dimensional transformation on the target plane expansion diagram to obtain the target three-dimensional model.
一种可能的实现方式中,本申请实施例提供的用于人物三维模型的重建装置还包括处理模块;该处理模块用于对第一网格三维模型和/或第二网格三维模型进行网格平滑处理或网格简化处理中的至少一种。In a possible implementation manner, the apparatus for reconstructing a 3D model of a character provided in the embodiment of the present application further includes a processing module; at least one of mesh smoothing or mesh simplification.
一种可能的实现方式中,目标人物的第一部分为该目标人物的人脸,目标人物的第二部分为该目标人物的上半身;或者,目标人物的第一部分为该目标人物的上半身,目标人物的第二部分为该目标人物的全身。In a possible implementation, the first part of the target person is the face of the target person, and the second part of the target person is the upper body of the target person; or, the first part of the target person is the upper body of the target person, and the target person The second part of is the whole body of the target person.
第三方面,本申请实施例提供一种电子设备,包括存储器和与该存储器连接的至少一个处理器,该存储器用于存储指令,该指令被至少一个处理器读取后,执行第一 方面及其可能的实现方式中任意之一所述的方法。In a third aspect, an embodiment of the present application provides an electronic device, including a memory and at least one processor connected to the memory, the memory is used to store instructions, and after the instructions are read by at least one processor, the first aspect and the first aspect are executed. The method described in any one of its possible implementations.
第四方面,本申请实施例提供一种计算机可读存储介质,其上存储有计算机程序,该计算机程序被处理器执行以实现第一方面及其可能的实现方式中任意之一所述的方法。In a fourth aspect, an embodiment of the present application provides a computer-readable storage medium on which a computer program is stored, and the computer program is executed by a processor to implement the method described in any one of the first aspect and its possible implementations .
第五方面,本申请实施例提供一种计算机程序产品,该计算机程序产品包含指令,当计算机程序产品在计算机上运行时,执行第一方面及其可能的实现方式中任意之一所述的方法。In the fifth aspect, the embodiment of the present application provides a computer program product, the computer program product includes instructions, and when the computer program product is run on the computer, execute the method described in any one of the first aspect and its possible implementations .
第六方面,本申请实施例提供一种芯片,包括存储器和处理器。存储器用于存储计算机指令。处理器用于从存储器中调用并运行该计算机指令,以执行第一方面及其可能的实现方式中任意之一所述的方法。In a sixth aspect, the embodiment of the present application provides a chip, including a memory and a processor. Memory is used to store computer instructions. The processor is used to call and execute the computer instructions from the memory, so as to execute the method described in any one of the first aspect and possible implementations thereof.
应当理解的是,本申请实施例的第二方面至第六方面技术方案及对应的可能的实施方式所取得的有益效果可以参见上述对第一方面及其对应的可能的实施方式的技术效果,此处不再赘述。It should be understood that the beneficial effects achieved by the technical solutions of the second aspect to the sixth aspect of the embodiment of the present application and the corresponding possible implementation manners can refer to the above-mentioned technical effects on the first aspect and the corresponding possible implementation manners, I won't repeat them here.
附图说明Description of drawings
图1为本申请实施例提供的体素三维模型的相关示意图;FIG. 1 is a related schematic diagram of a voxel three-dimensional model provided in the embodiment of the present application;
图2为本申请实施例提供的一种手机的硬件示意图;FIG. 2 is a hardware schematic diagram of a mobile phone provided by an embodiment of the present application;
图3为本申请实施例提供的一种人物三维模型的重建方法示意图;FIG. 3 is a schematic diagram of a method for reconstructing a three-dimensional model of a character provided in an embodiment of the present application;
图4为本申请实施例提供的一种人脸特征点的示意图;FIG. 4 is a schematic diagram of a facial feature point provided by an embodiment of the present application;
图5为本申请实施例提供的一种人脸的网格三维模型的示意图;FIG. 5 is a schematic diagram of a three-dimensional grid model of a human face provided in an embodiment of the present application;
图6为本申请实施例提供的另一种人物三维模型的重建方法示意图;FIG. 6 is a schematic diagram of another method for reconstructing a three-dimensional model of a character provided in the embodiment of the present application;
图7为本申请实施例提供的另一种人物三维模型的重建方法示意图;FIG. 7 is a schematic diagram of another method for reconstructing a three-dimensional model of a character provided by the embodiment of the present application;
图8为本申请实施例提供的一种上半身对应的网格三维模型投影至二维平面之后的平面展开图的示意图;FIG. 8 is a schematic diagram of a plane expansion view after projecting a grid 3D model corresponding to the upper body to a 2D plane provided by the embodiment of the present application;
图9为本申请实施例提供的一种待缝合的平面展开图的效果示意图和待缝合的平面展开图转换至三维空间的效果示意图;Fig. 9 is a schematic diagram of the effect of a plane unfolded view to be stitched and a schematic diagram of the effect of converting the plane unfolded view to be stitched into a three-dimensional space provided by the embodiment of the present application;
图10为本申请实施例提供的又一种人物三维模型的重建方法示意图;FIG. 10 is a schematic diagram of another method for reconstructing a 3D model of a character provided in the embodiment of the present application;
图11为本申请实施例提供的一种目标人物的上半身三维模型的重建过程示意图;FIG. 11 is a schematic diagram of the reconstruction process of a 3D model of the upper body of a target person provided in the embodiment of the present application;
图12为本申请实施例提供的一种用于人物三维模型的重建装置的结构示意图;FIG. 12 is a schematic structural diagram of a reconstruction device for a three-dimensional model of a person provided by an embodiment of the present application;
图13为本申请实施例提供的另一种用于人物三维模型的重建装置的结构示意图。FIG. 13 is a schematic structural diagram of another apparatus for reconstructing a three-dimensional model of a person provided by an embodiment of the present application.
具体实施方式Detailed ways
本文中术语“和/或”,仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。The term "and/or" in this article is just an association relationship describing associated objects, which means that there can be three relationships, for example, A and/or B can mean: A exists alone, A and B exist simultaneously, and there exists alone B these three situations.
本申请实施例的说明书和权利要求书中的术语“第一”和“第二”等是用于区别不同的对象,而不是用于描述对象的特定顺序。例如,第一尺度图像和第二尺度图像等是用于区别不同的尺度图像,而不是用于描述尺度图像的特定顺序。The terms "first" and "second" in the description and claims of the embodiments of the present application are used to distinguish different objects, rather than to describe a specific order of objects. For example, the first scale image and the second scale image are used to distinguish different scale images, rather than describing a specific order of the scale images.
在本申请实施例中,“示例性的”或者“例如”等词用于表示作例子、例证或说明。本申请实施例中被描述为“示例性的”或者“例如”的任何实施例或设计方案不应被解释为比其它实施例或设计方案更优选或更具优势。确切而言,使用“示例性的” 或者“例如”等词旨在以具体方式呈现相关概念。In the embodiments of the present application, words such as "exemplary" or "for example" are used as examples, illustrations or illustrations. Any embodiment or design scheme described as "exemplary" or "for example" in the embodiments of the present application shall not be interpreted as being more preferred or more advantageous than other embodiments or design schemes. Rather, the use of words such as "exemplary" or "such as" is intended to present related concepts in a concrete manner.
在本申请实施例的描述中,除非另有说明,“多个”的含义是指两个或两个以上。例如,多个处理单元是指两个或两个以上的处理单元;多个系统是指两个或两个以上的系统。In the description of the embodiments of the present application, unless otherwise specified, "plurality" means two or more. For example, multiple processing units refer to two or more processing units; multiple systems refer to two or more systems.
人物三维模型重建可以用于数字人建模,数字人即数字化的虚拟的3D人物,目前,数字人建模包括对真实的人物进行扫描动捕、三维建模、语音合成等处理,从而得到真实人物的数字形象,进而基于增强现实(AR)或虚拟现实(VR)等技术将该数字形象呈现在终端平台上。The 3D model reconstruction of characters can be used for digital human modeling. Digital human is a digitized virtual 3D character. The digital image of the character, and then present the digital image on the terminal platform based on technologies such as augmented reality (AR) or virtual reality (VR).
目前,一种人物三维模型重建的方法是:基于模型变形的方法得到人物三维模型,例如,对于人脸三维模型重建,可以基于三维可变形模型(3D face morphable model,3DMM)进行人脸三维模型重建,具体的,将人脸图像与预设的平均人脸模型(3DMM)进行匹配,对平均人脸模型进行形变,得到人脸三维模型;又例如,对于人体三维模型重建,可以基于蒙皮多人线性(skinned multi-person linear model,SMPL)模型进行人体三维模型重建,具体的,也是将人体图像与预设的平均人体SMPL模型进行匹配,对平均人体SMPL模型进行形变,得到人体三维模型。At present, a method for reconstructing a 3D model of a person is to obtain a 3D model of a person based on a method of deformation. Reconstruction, specifically, match the face image with the preset average face model (3DMM), and deform the average face model to obtain a three-dimensional face model; The multi-person linear (skinned multi-person linear model, SMPL) model is used to reconstruct the three-dimensional model of the human body. Specifically, the human body image is matched with the preset average human body SMPL model, and the average human body SMPL model is deformed to obtain the three-dimensional human body model. .
另一种人物三维模型重建的方法是:基于体素的人物三维模型,利用体素回归网络预测人脸或人体的体素三维模型。本申请实施例提供的人物三维模型的重建方法是一种基于体素的三维重建方法,下面对基于体素的三维重建过程涉及的概念进行简单介绍。Another method for reconstructing a 3D model of a person is: based on a voxel-based 3D model of a person, a voxel regression network is used to predict a 3D voxel model of a human face or human body. The method for reconstructing a 3D model of a character provided in the embodiment of the present application is a voxel-based 3D reconstruction method. The concepts involved in the voxel-based 3D reconstruction process will be briefly introduced below.
体素:是体积元素(volume pixel)的简称,假设一个大的体积空间(volume)为图1中的(a)所示的立方体,该体积空间可涵盖待建立的三维模型,该立方体包括多个小的立方体,其中,一个小的立方体即为一个体素,应理解,每个体素对应空间中的一个点,每个体素对应一个体素值,该体素值用于指示该体素是否属于待建立的三维模型。Voxel: It is the abbreviation of volume pixel. It is assumed that a large volume space (volume) is a cube as shown in (a) in Figure 1. This volume space can cover the three-dimensional model to be established. The cube includes multiple A small cube, where a small cube is a voxel, it should be understood that each voxel corresponds to a point in the space, each voxel corresponds to a voxel value, and the voxel value is used to indicate whether the voxel is Belongs to the 3D model to be built.
基于体素可进行人物三维模型重建,例如人脸三维模型重建、上半身三维模型重建、人体三维模型重建等等。Based on voxels, 3D model reconstruction of characters can be performed, such as 3D model reconstruction of human face, 3D model reconstruction of upper body, 3D model reconstruction of human body, etc.
以人脸三维模型重建为例,将一幅人脸图像输入至体素回归网络,通过体素回归网络预测体积空间(例如图1中的(a)所示的立方体)中的每一个体素的体素值,即得到人脸图像的三维模型在体积空间中的表示,图1中的(b)是体积空间的一个截面所包含的所有的体素的体素值的示例,即人脸体素三维模型。Taking the reconstruction of the 3D face model as an example, a face image is input to the voxel regression network, and each voxel in the volume space (such as the cube shown in (a) in Figure 1) is predicted by the voxel regression network. The voxel value of the face image, that is, the representation of the three-dimensional model of the face image in the volume space, (b) in Figure 1 is an example of the voxel value of all the voxels contained in a section of the volume space, that is, the face Voxel 3D model.
可选地,预测得到体积空间中的每一个体素的体素值(即人脸体素三维模型)之后,可以采用立体渲染的方法或者提取预设阈值的多边形等值面,从而得到人脸网格三维模型。以提取预设阈值的多边形等值面为例,可以基于截断的带符号距离函数(truncated signed distance function,TSDF)算法实现等值面提取,例如,对于图1中的(b)的示例,预设阈值取为0.0,那么提取立体空间中的所有体素值等于0.0的体素,所有体素值等于0.0的体素所形成的多边形的表面(称为等值面),得到人脸网格三维模型,例如图1中(c)。Optionally, after the voxel value of each voxel in the volume space is predicted (that is, the voxel 3D model of the face), a stereoscopic rendering method or a polygonal isosurface with a preset threshold can be extracted to obtain a face Mesh 3D model. Taking the extraction of a polygonal isosurface with a preset threshold as an example, the isosurface can be extracted based on a truncated signed distance function (truncated signed distance function, TSDF) algorithm. For example, for the example of (b) in Figure 1, pre Set the threshold to be 0.0, then extract all the voxels whose voxel values are equal to 0.0 in the three-dimensional space, and the polygonal surface (called isosurface) formed by all the voxels whose voxel values are equal to 0.0, to obtain the face grid A three-dimensional model, such as (c) in Figure 1.
可以理解的是,用于人脸三维模型重建的体素回归网络是基于大量的已知的人脸图像和人脸图像对应的三维模型训练得到的。可选地,体素回归网络可以为不同类型 的神经网络,例如,体素回归网络为卷积神经网络,该卷积神经网络可以为堆叠沙漏网络。It can be understood that the voxel regression network used for reconstruction of the 3D face model is trained based on a large number of known face images and 3D models corresponding to the face images. Optionally, the voxel regression network can be different types of neural networks, for example, the voxel regression network is a convolutional neural network, and the convolutional neural network can be a stacked hourglass network.
目前,在人物三维模型重建的方案中,均仅支持单个尺度的模型重建,例如,仅支持脸部三维模型重建,或者仅支持人体三维模型重建,或者仅支持上半身三维模型重建。示例性的,对于仅支持脸部三维模型重建的方案,得到的人脸三维模型仅包含脸部的信息,不包含人体其他部分的信息,模型缺乏完整性,而目前市场上的多数应用需要更多的人物信息,人脸三维模型的应用场景比较窄,商业化前景比较局限。对于仅支持人体三维模型重建的方案,得到的人体三维模型包括完整的人体信息,但对于局部,例如脸部,其模型重建效果较差,无法体现脸部的一些细节信息,即局部细节比较粗糙。At present, the schemes for 3D model reconstruction of characters only support model reconstruction of a single scale, for example, only support reconstruction of a 3D model of a face, or only support reconstruction of a 3D model of a human body, or only support reconstruction of a 3D model of an upper body. Exemplarily, for a solution that only supports facial 3D model reconstruction, the obtained 3D face model only contains information about the face, and does not contain information about other parts of the human body. The model lacks integrity, and most applications currently on the market need to be updated There is a lot of character information, the application scenarios of the 3D face model are relatively narrow, and the commercialization prospect is relatively limited. For the scheme that only supports the reconstruction of the 3D human body model, the obtained 3D human body model includes complete human body information, but for parts, such as the face, the model reconstruction effect is poor, and some detailed information of the face cannot be reflected, that is, the local details are relatively rough .
针对背景技术中的单纯的人脸三维模型重建,脸部三维模型仅能反映人物的局部信息,应用场景比较窄的问题,以及单纯的人体三维模型虽然可以反映人物的全局信息,但是对于一些细节信息的重建效果较差的问题,本申请实施例提供一种人物三维模型的重建方法及装置,在该方法中,可以对第一尺度图像对应的第一网格三维模型和第二尺度图像对应的第二网格三维模型进行融合处理,以得到目标三维模型,其中,第一尺度图像包括目标人物的第一部分,第二尺度图像包括目标人物的至少第二部分,该第一部分是第二部分的一部分,该目标三维模型用于显示目标人物的至少第二部分。通过该方案,能够提高重建的人物三维模型的质量,从而提升用户体验。In view of the simple 3D model reconstruction of the face in the background technology, the 3D face model can only reflect the local information of the person, and the application scene is relatively narrow, and the simple 3D model of the human body can reflect the global information of the person, but some details For the problem of poor information reconstruction effect, the embodiment of the present application provides a method and device for reconstructing a 3D model of a person. In this method, the first grid 3D model corresponding to the first scale image can be corresponding to the second scale image. The second grid 3D model is fused to obtain the target 3D model, wherein the first scale image includes a first part of the target person, and the second scale image includes at least a second part of the target person, and the first part is the second part As part of the target person, the three-dimensional model of the target is used to display at least a second part of the target person. Through this solution, the quality of the reconstructed three-dimensional model of the character can be improved, thereby improving user experience.
本申请实施例提供的人物三维模型的重建方法可以应用于手机、平板电脑或个人计算机(Ultra-mobile Personal Computer,UMPC)等电子设备中。或者,还可以应用于其他桌面型设备、膝上型设备、手持型设备、可穿戴设备、智能家居设备和车载型设备等电子设备中,例如上网本、智能手表、智能相机、上网本、个人数字助理(Personal Digital Assistant,PDA)、便携式多媒体播放器(Portable Multimedia Player,PMP)、专用媒体播放器或AR(增强现实)/VR(虚拟现实)设备等。本申请实施例对电子设备的具体类型和结构等不作限定。The method for reconstructing a three-dimensional model of a character provided in the embodiment of the present application can be applied to electronic devices such as a mobile phone, a tablet computer, or a personal computer (Ultra-mobile Personal Computer, UMPC). Alternatively, it can also be used in other electronic devices such as desktop, laptop, handheld, wearable, smart home and vehicle-mounted devices, such as netbooks, smart watches, smart cameras, netbooks, personal digital assistants (Personal Digital Assistant, PDA), portable multimedia player (Portable Multimedia Player, PMP), dedicated media player or AR (augmented reality) / VR (virtual reality) equipment, etc. The embodiment of the present application does not limit the specific type and structure of the electronic device.
以电子设备为手机为例,图2为本申请实施例提供的一种手机200的硬件结构示意图,该手机200包括处理器210,外部存储器接口220,内部存储器221,通用串行总线(universal serial bus,USB)接口230,充电管理模块240,电源管理模块241,电池242,天线1,天线2,移动通信模块250,无线通信模块260,音频模块270,扬声器270A,受话器270B,麦克风270C,耳机接口270D,传感器模块280,按键290,马达291,指示器292,摄像头293,显示屏294,以及用户标识模块(subscriber identification module,SIM)卡接口295等。其中传感器模块280可以包括压力传感器280A,陀螺仪传感器280B,气压传感器280C,磁传感器280D,加速度传感器280E,距离传感器280F,接近光传感器280G,指纹传感器280H,温度传感器280J,触摸传感器280K,环境光传感器280L,骨传导传感器280M等。Taking the electronic device as a mobile phone as an example, FIG. 2 is a schematic diagram of the hardware structure of a mobile phone 200 provided in the embodiment of the present application. The mobile phone 200 includes a processor 210, an external memory interface 220, an internal memory 221, and a universal serial bus (universal serial bus). bus, USB) interface 230, charging management module 240, power management module 241, battery 242, antenna 1, antenna 2, mobile communication module 250, wireless communication module 260, audio module 270, speaker 270A, receiver 270B, microphone 270C, earphone Interface 270D, sensor module 280, button 290, motor 291, indicator 292, camera 293, display screen 294, and subscriber identification module (subscriber identification module, SIM) card interface 295, etc. The sensor module 280 may include a pressure sensor 280A, a gyro sensor 280B, an air pressure sensor 280C, a magnetic sensor 280D, an acceleration sensor 280E, a distance sensor 280F, a proximity light sensor 280G, a fingerprint sensor 280H, a temperature sensor 280J, a touch sensor 280K, and an ambient light sensor. Sensor 280L, bone conduction sensor 280M, etc.
可以理解的是,本申请实施例示意的结构并不构成对手机200的具体限定。在本申请另一些实施例中,手机200可以包括比图示更多或更少的部件,或者组合某些部件,或者拆分某些部件,或者不同的部件布置。图示的部件可以以硬件,软件或软件和硬件的组合实现。It can be understood that the structure shown in the embodiment of the present application does not constitute a specific limitation on the mobile phone 200 . In other embodiments of the present application, the mobile phone 200 may include more or fewer components than shown in the figure, or combine certain components, or separate certain components, or arrange different components. The illustrated components can be realized in hardware, software or a combination of software and hardware.
处理器210可以包括一个或多个处理单元,例如:处理器210可以包括应用处理器(application processor,AP),调制解调处理器,图形处理器(graphics processing unit,GPU),图像信号处理器(image signal processor,ISP),控制器,存储器,视频编解码器,数字信号处理器(digital signal processor,DSP),基带处理器,和/或神经网络处理器(neural-network processing unit,NPU)等。其中,不同的处理单元可以是独立的器件,也可以集成在一个或多个处理器中。The processor 210 may include one or more processing units, for example: the processor 210 may include an application processor (application processor, AP), a modem processor, a graphics processing unit (graphics processing unit, GPU), an image signal processor (image signal processor, ISP), controller, memory, video codec, digital signal processor (digital signal processor, DSP), baseband processor, and/or neural network processor (neural-network processing unit, NPU) wait. Wherein, different processing units may be independent devices, or may be integrated in one or more processors.
其中,控制器可以是手机200的神经中枢和指挥中心。控制器可以根据指令操作码和时序信号,产生操作控制信号,完成取指令和执行指令的控制。Wherein, the controller may be the nerve center and command center of the mobile phone 200 . The controller can generate an operation control signal according to the instruction opcode and timing signal, and complete the control of fetching and executing the instruction.
处理器210中还可以设置存储器,用于存储指令和数据。在一些实施例中,处理器210中的存储器为高速缓冲存储器。该存储器可以保存处理器210刚用过或循环使用的指令或数据。如果处理器210需要再次使用该指令或数据,可从存储器中直接调用。避免了重复存取,减少了处理器210的等待时间,因而提高了系统的效率。A memory may also be provided in the processor 210 for storing instructions and data. In some embodiments, the memory in processor 210 is a cache memory. The memory may hold instructions or data that the processor 210 has just used or recycled. If the processor 210 needs to use the instruction or data again, it can be directly recalled from the memory. Repeated access is avoided, and the waiting time of the processor 210 is reduced, thereby improving the efficiency of the system.
在一些实施例中,处理器210可以包括一个或多个接口。接口可以包括集成电路(inter-integrated circuit,I2C)接口,集成电路内置音频(inter-integrated circuit sound,I2S)接口,脉冲编码调制(pulse code modulation,PCM)接口,通用异步收发传输器(universal asynchronous receiver/transmitter,UART)接口,移动产业处理器接口(mobile industry processor interface,MIPI),通用输入输出(general-purpose input/output,GPIO)接口,用户标识模块(subscriber identity module,SIM)接口,和/或通用串行总线(universal serial bus,USB)接口等。In some embodiments, processor 210 may include one or more interfaces. The interface may include an integrated circuit (inter-integrated circuit, I2C) interface, an integrated circuit built-in audio (inter-integrated circuit sound, I2S) interface, a pulse code modulation (pulse code modulation, PCM) interface, a universal asynchronous transmitter (universal asynchronous receiver/transmitter, UART) interface, mobile industry processor interface (mobile industry processor interface, MIPI), general-purpose input and output (general-purpose input/output, GPIO) interface, subscriber identity module (subscriber identity module, SIM) interface, and /or universal serial bus (universal serial bus, USB) interface, etc.
I2C接口是一种双向同步串行总线,包括一根串行数据线(serial data line,SDA)和一根串行时钟线(derail clock line,SCL)。在一些实施例中,处理器210可以包含多组I2C总线。处理器210可以通过不同的I2C总线接口分别耦合触摸传感器280K,充电器,闪光灯,摄像头293等。例如:处理器210可以通过I2C接口耦合触摸传感器280K,使处理器210与触摸传感器280K通过I2C总线接口通信,实现手机200的触摸功能。The I2C interface is a bidirectional synchronous serial bus, including a serial data line (serial data line, SDA) and a serial clock line (derail clock line, SCL). In some embodiments, processor 210 may include multiple sets of I2C buses. The processor 210 can be respectively coupled to the touch sensor 280K, the charger, the flashlight, the camera 293 and so on through different I2C bus interfaces. For example, the processor 210 may be coupled to the touch sensor 280K through the I2C interface, so that the processor 210 and the touch sensor 280K communicate through the I2C bus interface to realize the touch function of the mobile phone 200 .
I2S接口可以用于音频通信。在一些实施例中,处理器210可以包含多组I2S总线。处理器210可以通过I2S总线与音频模块270耦合,实现处理器210与音频模块270之间的通信。在一些实施例中,音频模块270可以通过I2S接口向无线通信模块260传递音频信号,实现通过蓝牙耳机接听电话的功能。The I2S interface can be used for audio communication. In some embodiments, processor 210 may include multiple sets of I2S buses. The processor 210 may be coupled to the audio module 270 through an I2S bus to implement communication between the processor 210 and the audio module 270 . In some embodiments, the audio module 270 can transmit audio signals to the wireless communication module 260 through the I2S interface, so as to realize the function of answering calls through the Bluetooth headset.
PCM接口也可以用于音频通信,将模拟信号抽样,量化和编码。在一些实施例中,音频模块270与无线通信模块260可以通过PCM总线接口耦合。在一些实施例中,音频模块270也可以通过PCM接口向无线通信模块260传递音频信号,实现通过蓝牙耳机接听电话的功能。所述I2S接口和所述PCM接口都可以用于音频通信。The PCM interface can also be used for audio communication, sampling, quantizing and encoding the analog signal. In some embodiments, the audio module 270 and the wireless communication module 260 may be coupled through a PCM bus interface. In some embodiments, the audio module 270 can also transmit audio signals to the wireless communication module 260 through the PCM interface, so as to realize the function of answering calls through the Bluetooth headset. Both the I2S interface and the PCM interface can be used for audio communication.
UART接口是一种通用串行数据总线,用于异步通信。该总线可以为双向通信总线。它将要传输的数据在串行通信与并行通信之间转换。在一些实施例中,UART接口通常被用于连接处理器210与无线通信模块260。例如:处理器210通过UART接口与无线通信模块260中的蓝牙模块通信,实现蓝牙功能。在一些实施例中,音频模块270可以通过UART接口向无线通信模块260传递音频信号,实现通过蓝牙耳机播放音乐的功能。The UART interface is a universal serial data bus used for asynchronous communication. The bus can be a bidirectional communication bus. It converts the data to be transmitted between serial communication and parallel communication. In some embodiments, a UART interface is generally used to connect the processor 210 and the wireless communication module 260 . For example: the processor 210 communicates with the Bluetooth module in the wireless communication module 260 through the UART interface to realize the Bluetooth function. In some embodiments, the audio module 270 can transmit audio signals to the wireless communication module 260 through the UART interface, so as to realize the function of playing music through the Bluetooth headset.
MIPI接口可以被用于连接处理器210与显示屏294,摄像头293等外围器件。MIPI接口包括摄像头串行接口(camera serial interface,CSI),显示屏串行接口(display serial interface,DSI)等。在一些实施例中,处理器210和摄像头293通过CSI接口通信,实现手机200的拍摄功能。处理器210和显示屏294通过DSI接口通信,实现手机200的显示功能。The MIPI interface can be used to connect the processor 210 with the peripheral devices such as the display screen 294 and the camera 293 . MIPI interface includes camera serial interface (camera serial interface, CSI), display serial interface (display serial interface, DSI), etc. In some embodiments, the processor 210 communicates with the camera 293 through the CSI interface to realize the shooting function of the mobile phone 200 . The processor 210 communicates with the display screen 294 through the DSI interface to realize the display function of the mobile phone 200 .
GPIO接口可以通过软件配置。GPIO接口可以被配置为控制信号,也可被配置为数据信号。在一些实施例中,GPIO接口可以用于连接处理器210与摄像头293,显示屏294,无线通信模块260,音频模块270,传感器模块280等。GPIO接口还可以被配置为I2C接口,I2S接口,UART接口,MIPI接口等。The GPIO interface can be configured by software. The GPIO interface can be configured as a control signal or as a data signal. In some embodiments, the GPIO interface can be used to connect the processor 210 with the camera 293 , the display screen 294 , the wireless communication module 260 , the audio module 270 , the sensor module 280 and so on. The GPIO interface can also be configured as an I2C interface, I2S interface, UART interface, MIPI interface, etc.
USB接口230是符合USB标准规范的接口,具体可以是Mini USB接口,Micro USB接口,USB Type C接口等。USB接口230可以用于连接充电器为手机200充电,也可以用于手机200与外围设备之间传输数据。也可以用于连接耳机,通过耳机播放音频。该接口还可以用于连接其他电子设备,例如AR设备等。The USB interface 230 is an interface conforming to the USB standard specification, specifically, it may be a Mini USB interface, a Micro USB interface, a USB Type C interface, and the like. The USB interface 230 can be used to connect a charger to charge the mobile phone 200, and can also be used to transmit data between the mobile phone 200 and peripheral devices. It can also be used to connect headphones and play audio through them. This interface can also be used to connect other electronic devices, such as AR devices.
可以理解的是,本申请实施例示意的各模块间的接口连接关系,只是示意性说明,并不构成对手机200的结构限定。在本申请另一些实施例中,手机200也可以采用上述实施例中不同的接口连接方式,或多种接口连接方式的组合。It can be understood that the interface connection relationship between the modules shown in the embodiment of the present application is only a schematic illustration, and does not constitute a structural limitation of the mobile phone 200 . In other embodiments of the present application, the mobile phone 200 may also adopt different interface connection methods in the above embodiments, or a combination of multiple interface connection methods.
充电管理模块240用于从充电器接收充电输入。其中,充电器可以是无线充电器,也可以是有线充电器。在一些有线充电的实施例中,充电管理模块240可以通过USB接口230接收有线充电器的充电输入。在一些无线充电的实施例中,充电管理模块240可以通过手机200的无线充电线圈接收无线充电输入。充电管理模块240为电池242充电的同时,还可以通过电源管理模块241为电子设备供电。The charging management module 240 is configured to receive charging input from the charger. Wherein, the charger may be a wireless charger or a wired charger. In some embodiments of wired charging, the charging management module 240 can receive the charging input of the wired charger through the USB interface 230 . In some wireless charging embodiments, the charging management module 240 can receive wireless charging input through the wireless charging coil of the mobile phone 200 . While the charging management module 240 is charging the battery 242 , it can also supply power to the electronic device through the power management module 241 .
电源管理模块241用于连接电池242,充电管理模块240与处理器210。电源管理模块241接收电池242和/或充电管理模块240的输入,为处理器210,内部存储器221,外部存储器,显示屏294,摄像头293,和无线通信模块260等供电。电源管理模块241还可以用于监测电池容量,电池循环次数,电池健康状态(漏电,阻抗)等参数。在其他一些实施例中,电源管理模块241也可以设置于处理器210中。在另一些实施例中,电源管理模块241和充电管理模块240也可以设置于同一个器件中。The power management module 241 is used for connecting the battery 242 , the charging management module 240 and the processor 210 . The power management module 241 receives the input from the battery 242 and/or the charging management module 240 to provide power for the processor 210 , internal memory 221 , external memory, display screen 294 , camera 293 , and wireless communication module 260 . The power management module 241 can also be used to monitor parameters such as battery capacity, battery cycle times, and battery health status (leakage, impedance). In some other embodiments, the power management module 241 can also be set in the processor 210 . In some other embodiments, the power management module 241 and the charging management module 240 may also be set in the same device.
手机200的无线通信功能可以通过天线1,天线2,移动通信模块250,无线通信模块260,调制解调处理器以及基带处理器等实现。The wireless communication function of the mobile phone 200 can be realized by the antenna 1, the antenna 2, the mobile communication module 250, the wireless communication module 260, the modem processor and the baseband processor.
天线1和天线2用于发射和接收电磁波信号。手机200中的每个天线可用于覆盖单个或多个通信频带。不同的天线还可以复用,以提高天线的利用率。例如:可以将天线1复用为无线局域网的分集天线。在另外一些实施例中,天线可以和调谐开关结合使用。 Antenna 1 and Antenna 2 are used to transmit and receive electromagnetic wave signals. Each antenna in handset 200 can be used to cover single or multiple communication frequency bands. Different antennas can also be multiplexed to improve the utilization of the antennas. For example: Antenna 1 can be multiplexed as a diversity antenna of a wireless local area network. In other embodiments, the antenna may be used in conjunction with a tuning switch.
移动通信模块250可以提供应用在手机200上的包括2G/3G/4G/5G等无线通信的解决方案。移动通信模块250可以包括至少一个滤波器,开关,功率放大器,低噪声放大器(low noise amplifier,LNA)等。移动通信模块250可以由天线1接收电磁波,并对接收的电磁波进行滤波,放大等处理,传送至调制解调处理器进行解调。移动通信模块250还可以对经调制解调处理器调制后的信号放大,经天线1转为电磁波辐射出去。在一些实施例中,移动通信模块250的至少部分功能模块可以被设置于处理器210 中。在一些实施例中,移动通信模块250的至少部分功能模块可以与处理器210的至少部分模块被设置在同一个器件中。The mobile communication module 250 can provide wireless communication solutions including 2G/3G/4G/5G applied on the mobile phone 200 . The mobile communication module 250 may include at least one filter, switch, power amplifier, low noise amplifier (low noise amplifier, LNA) and the like. The mobile communication module 250 can receive electromagnetic waves through the antenna 1, filter and amplify the received electromagnetic waves, and send them to the modem processor for demodulation. The mobile communication module 250 can also amplify the signal modulated by the modem processor, convert it into electromagnetic wave and radiate it through the antenna 1 . In some embodiments, at least part of the functional modules of the mobile communication module 250 may be set in the processor 210 . In some embodiments, at least part of the functional modules of the mobile communication module 250 and at least part of the modules of the processor 210 may be set in the same device.
调制解调处理器可以包括调制器和解调器。其中,调制器用于将待发送的低频基带信号调制成中高频信号。解调器用于将接收的电磁波信号解调为低频基带信号。随后解调器将解调得到的低频基带信号传送至基带处理器处理。低频基带信号经基带处理器处理后,被传递给应用处理器。应用处理器通过音频设备(不限于扬声器270A,受话器270B等)输出声音信号,或通过显示屏294显示图像或视频。在一些实施例中,调制解调处理器可以是独立的器件。在另一些实施例中,调制解调处理器可以独立于处理器210,与移动通信模块250或其他功能模块设置在同一个器件中。A modem processor may include a modulator and a demodulator. Wherein, the modulator is used for modulating the low-frequency baseband signal to be transmitted into a medium-high frequency signal. The demodulator is used to demodulate the received electromagnetic wave signal into a low frequency baseband signal. Then the demodulator sends the demodulated low-frequency baseband signal to the baseband processor for processing. The low-frequency baseband signal is passed to the application processor after being processed by the baseband processor. The application processor outputs sound signals through audio equipment (not limited to speaker 270A, receiver 270B, etc.), or displays images or videos through display screen 294 . In some embodiments, the modem processor may be a stand-alone device. In some other embodiments, the modem processor may be independent of the processor 210, and be set in the same device as the mobile communication module 250 or other functional modules.
无线通信模块260可以提供应用在手机200上的包括无线局域网(wireless local area networks,WLAN)(如无线保真(wireless fidelity,Wi-Fi)网络),蓝牙(bluetooth,BT),全球导航卫星系统(global navigation satellite system,GNSS),调频(frequency modulation,FM),近距离无线通信技术(near field communication,NFC),红外技术(infrared,IR)等无线通信的解决方案。无线通信模块260可以是集成至少一个通信处理模块的一个或多个器件。无线通信模块260经由天线2接收电磁波,将电磁波信号调频以及滤波处理,将处理后的信号发送到处理器210。无线通信模块260还可以从处理器210接收待发送的信号,对其进行调频,放大,经天线2转为电磁波辐射出去。The wireless communication module 260 can provide wireless local area networks (wireless local area networks, WLAN) (such as wireless fidelity (Wireless Fidelity, Wi-Fi) network), bluetooth (bluetooth, BT), global navigation satellite system, etc. applied on the mobile phone 200 (global navigation satellite system, GNSS), frequency modulation (frequency modulation, FM), near field communication technology (near field communication, NFC), infrared technology (infrared, IR) and other wireless communication solutions. The wireless communication module 260 may be one or more devices integrating at least one communication processing module. The wireless communication module 260 receives electromagnetic waves via the antenna 2 , frequency-modulates and filters the electromagnetic wave signals, and sends the processed signals to the processor 210 . The wireless communication module 260 can also receive the signal to be sent from the processor 210 , frequency-modulate it, amplify it, and convert it into electromagnetic waves through the antenna 2 to radiate out.
在一些实施例中,手机200的天线1和移动通信模块250耦合,天线2和无线通信模块260耦合,使得手机200可以通过无线通信技术与网络以及其他设备通信。所述无线通信技术可以包括全球移动通讯系统(global system for mobile communications,GSM),通用分组无线服务(general packet radio service,GPRS),码分多址接入(code division multiple access,CDMA),宽带码分多址(wideband code division multiple access,WCDMA),时分码分多址(time-division code division multiple access,TD-SCDMA),长期演进(long term evolution,LTE),BT,GNSS,WLAN,NFC,FM,和/或IR技术等。所述GNSS可以包括全球卫星定位系统(global positioning system,GPS),全球导航卫星系统(global navigation satellite system,GLONASS),北斗卫星导航系统(beidou navigation satellite system,BDS),准天顶卫星系统(quasi-zenith satellite system,QZSS)和/或星基增强系统(satellite based augmentation systems,SBAS)。In some embodiments, the antenna 1 of the mobile phone 200 is coupled to the mobile communication module 250, and the antenna 2 is coupled to the wireless communication module 260, so that the mobile phone 200 can communicate with the network and other devices through wireless communication technology. The wireless communication technology may include global system for mobile communications (GSM), general packet radio service (general packet radio service, GPRS), code division multiple access (code division multiple access, CDMA), broadband Code division multiple access (wideband code division multiple access, WCDMA), time division code division multiple access (time-division code division multiple access, TD-SCDMA), long term evolution (long term evolution, LTE), BT, GNSS, WLAN, NFC , FM, and/or IR techniques, etc. The GNSS may include a global positioning system (global positioning system, GPS), a global navigation satellite system (global navigation satellite system, GLONASS), a Beidou navigation satellite system (beidou navigation satellite system, BDS), a quasi-zenith satellite system (quasi -zenith satellite system (QZSS) and/or satellite based augmentation systems (SBAS).
手机200通过GPU,显示屏294,以及应用处理器等实现显示功能。GPU为图像处理的微处理器,连接显示屏294和应用处理器。GPU用于执行数学和几何计算,用于图形渲染。处理器210可包括一个或多个GPU,其执行程序指令以生成或改变显示信息。The mobile phone 200 realizes the display function through the GPU, the display screen 294, and the application processor. The GPU is a microprocessor for image processing, and is connected to the display screen 294 and the application processor. GPUs are used to perform mathematical and geometric calculations for graphics rendering. Processor 210 may include one or more GPUs that execute program instructions to generate or change display information.
显示屏294用于显示图像,视频等。显示屏294包括显示面板。显示面板可以采用液晶显示屏(liquid crystal display,LCD),有机发光二极管(organic light-emitting diode,OLED),有源矩阵有机发光二极体或主动矩阵有机发光二极体(active-matrix organic light emitting diode的,AMOLED),柔性发光二极管(flex light-emitting diode,FLED),Miniled,MicroLed,Micro-oLed,量子点发光二极管(quantum dot light emitting diodes,QLED)等。在一些实施例中,手机200可以包括1个或N个显示屏294,N为大于1的正整数。The display screen 294 is used to display images, videos and the like. Display 294 includes a display panel. The display panel can be a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active matrix organic light emitting diode or an active matrix organic light emitting diode (active-matrix organic light emitting diode, AMOLED), flexible light-emitting diode (flex light-emitting diode, FLED), Miniled, MicroLed, Micro-oLed, quantum dot light emitting diodes (quantum dot light emitting diodes, QLED), etc. In some embodiments, the mobile phone 200 may include 1 or N display screens 294, where N is a positive integer greater than 1.
手机200可以通过ISP,摄像头293,视频编解码器,GPU,显示屏294以及应用处理器等实现拍摄功能。The mobile phone 200 can realize the shooting function through ISP, camera 293 , video codec, GPU, display screen 294 and application processor.
ISP用于处理摄像头293反馈的数据。例如,拍照时,打开快门,光线通过镜头被传递到摄像头感光元件上,光信号转换为电信号,摄像头感光元件将所述电信号传递给ISP处理,转化为肉眼可见的图像。ISP还可以对图像的噪点,亮度,肤色进行算法优化。ISP还可以对拍摄场景的曝光,色温等参数优化。在一些实施例中,ISP可以设置在摄像头293中。The ISP is used for processing the data fed back by the camera 293 . For example, when taking a picture, open the shutter, the light is transmitted to the photosensitive element of the camera through the lens, and the light signal is converted into an electrical signal, and the photosensitive element of the camera transmits the electrical signal to the ISP for processing, and converts it into an image visible to the naked eye. ISP can also perform algorithm optimization on image noise, brightness, and skin color. ISP can also optimize the exposure, color temperature and other parameters of the shooting scene. In some embodiments, the ISP may be located in the camera 293 .
摄像头293用于捕获静态图像或视频。物体通过镜头生成光学图像投射到感光元件。感光元件可以是电荷耦合器件(charge coupled device,CCD)或互补金属氧化物半导体(complementary metal-oxide-semiconductor,CMOS)光电晶体管。感光元件把光信号转换成电信号,之后将电信号传递给ISP转换成数字图像信号。ISP将数字图像信号输出到DSP加工处理。DSP将数字图像信号转换成标准的RGB,YUV等格式的图像信号。在一些实施例中,手机200可以包括1个或N个摄像头293,N为大于1的正整数。Camera 293 is used to capture still images or video. The object generates an optical image through the lens and projects it to the photosensitive element. The photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor. The photosensitive element converts the light signal into an electrical signal, and then transmits the electrical signal to the ISP to convert it into a digital image signal. The ISP outputs the digital image signal to the DSP for processing. DSP converts digital image signals into standard RGB, YUV and other image signals. In some embodiments, the mobile phone 200 may include 1 or N cameras 293, where N is a positive integer greater than 1.
数字信号处理器用于处理数字信号,除了可以处理数字图像信号,还可以处理其他数字信号(如音频信号等)。例如,当手机200在频点选择时,数字信号处理器用于对频点能量进行傅里叶变换等。A digital signal processor is used to process digital signals, in addition to processing digital image signals, it can also process other digital signals (such as audio signals, etc.). For example, when the mobile phone 200 selects a frequency point, the digital signal processor is used to perform Fourier transform on the energy of the frequency point.
视频编解码器用于对数字视频压缩或解压缩。手机200可以支持一种或多种视频编解码器。这样,手机200可以播放或录制多种编码格式的视频,例如:动态图像专家组(moving picture experts group,MPEG)1,MPEG2,MPEG3,MPEG4等。Video codecs are used to compress or decompress digital video. The handset 200 may support one or more video codecs. In this way, the mobile phone 200 can play or record videos in various encoding formats, for example: moving picture experts group (moving picture experts group, MPEG) 1, MPEG2, MPEG3, MPEG4 and so on.
NPU为神经网络(neural-network,NN)计算处理器,通过借鉴生物神经网络结构,例如借鉴人脑神经元之间传递模式,对输入信息快速处理,还可以不断的自学习。通过NPU可以实现手机200的智能认知等应用,例如:图像识别,人脸识别,语音识别,文本理解等。The NPU is a neural-network (NN) computing processor. By referring to the structure of biological neural networks, such as the transfer mode between neurons in the human brain, it can quickly process input information and continuously learn by itself. Applications such as intelligent cognition of the mobile phone 200 can be implemented through the NPU, such as image recognition, face recognition, speech recognition, text understanding, and the like.
外部存储器接口220可以用于连接外部存储卡,例如Micro SD卡,实现扩展手机200的存储能力。外部存储卡通过外部存储器接口220与处理器210通信,实现数据存储功能。例如将音乐,视频等文件保存在外部存储卡中。The external memory interface 220 can be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the mobile phone 200. The external memory card communicates with the processor 210 through the external memory interface 220 to implement a data storage function. Such as saving music, video and other files in the external memory card.
内部存储器221可以用于存储计算机可执行程序代码,所述可执行程序代码包括指令。处理器210通过运行存储在内部存储器221的指令,从而执行手机200的各种功能应用以及数据处理。内部存储器221可以包括存储程序区和存储数据区。其中,存储程序区可存储操作系统,至少一个功能所需的应用程序(比如声音播放功能,图像播放功能等)等。存储数据区可存储手机200使用过程中所创建的数据(比如音频数据,电话本等)等。此外,内部存储器221可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件,闪存器件,通用闪存存储器(universal flash storage,UFS)等。The internal memory 221 may be used to store computer-executable program codes including instructions. The processor 210 executes various functional applications and data processing of the mobile phone 200 by executing instructions stored in the internal memory 221 . The internal memory 221 may include an area for storing programs and an area for storing data. Wherein, the stored program area can store an operating system, at least one application program required by a function (such as a sound playing function, an image playing function, etc.) and the like. The storage data area can store data (such as audio data, phone book, etc.) created during the use of the mobile phone 200 . In addition, the internal memory 221 may include a high-speed random access memory, and may also include a non-volatile memory, such as at least one magnetic disk storage device, flash memory device, universal flash storage (universal flash storage, UFS) and the like.
手机200可以通过音频模块270,扬声器270A,受话器270B,麦克风270C,耳机接口270D,以及应用处理器等实现音频功能。例如音乐播放,录音等。The mobile phone 200 can realize the audio function through the audio module 270, the speaker 270A, the receiver 270B, the microphone 270C, the earphone interface 270D, and the application processor. Such as music playback, recording, etc.
音频模块270用于将数字音频信息转换成模拟音频信号输出,也用于将模拟音频输入转换为数字音频信号。音频模块270还可以用于对音频信号编码和解码。在一些 实施例中,音频模块270可以设置于处理器210中,或将音频模块270的部分功能模块设置于处理器210中。The audio module 270 is used to convert digital audio information into analog audio signal output, and is also used to convert analog audio input into digital audio signal. The audio module 270 may also be used to encode and decode audio signals. In some embodiments, the audio module 270 can be set in the processor 210, or some functional modules of the audio module 270 can be set in the processor 210.
扬声器270A,也称“喇叭”,用于将音频电信号转换为声音信号。手机200可以通过扬声器270A收听音乐,或收听免提通话。 Speaker 270A, also referred to as a "horn", is used to convert audio electrical signals into sound signals. Cell phone 200 can listen to music through speaker 270A, or listen to hands-free calls.
受话器270B,也称“听筒”,用于将音频电信号转换成声音信号。当手机200接听电话或语音信息时,可以通过将受话器270B靠近人耳接听语音。 Receiver 270B, also called "earpiece", is used to convert audio electrical signals into audio signals. When the mobile phone 200 receives a call or a voice message, the receiver 270B can be placed close to the human ear to receive the voice.
麦克风270C,也称“话筒”,“传声器”,用于将声音信号转换为电信号。当拨打电话或发送语音信息时,用户可以通过人嘴靠近麦克风270C发声,将声音信号输入到麦克风270C。手机200可以设置至少一个麦克风270C。在另一些实施例中,手机200可以设置两个麦克风270C,除了采集声音信号,还可以实现降噪功能。在另一些实施例中,手机200还可以设置三个,四个或更多麦克风270C,实现采集声音信号,降噪,还可以识别声音来源,实现定向录音功能等。The microphone 270C, also called "microphone" or "microphone", is used to convert sound signals into electrical signals. When making a call or sending a voice message, the user can make a sound by approaching the microphone 270C with a human mouth, and input the sound signal to the microphone 270C. The mobile phone 200 can be provided with at least one microphone 270C. In other embodiments, the mobile phone 200 can be provided with two microphones 270C, which can also implement a noise reduction function in addition to collecting sound signals. In some other embodiments, the mobile phone 200 can also be provided with three, four or more microphones 270C to realize sound signal collection, noise reduction, identify sound sources, realize directional recording functions, and the like.
耳机接口270D用于连接有线耳机。耳机接口270D可以是USB接口230,也可以是3.5mm的开放移动电子设备平台(open mobile terminal platform,OMTP)标准接口,美国蜂窝电信工业协会(cellular telecommunications industry association of the USA,CTIA)标准接口。The earphone interface 270D is used for connecting wired earphones. The earphone interface 270D can be a USB interface 230, or a 3.5mm open mobile terminal platform (OMTP) standard interface, or a cellular telecommunications industry association of the USA (CTIA) standard interface.
压力传感器280A用于感受压力信号,可以将压力信号转换成电信号,陀螺仪传感器280B可以用于确定手机200的运动姿态,气压传感器280C用于测量气压。磁传感器280D包括霍尔传感器,加速度传感器280E可检测手机200在各个方向上(一般为三轴)加速度的大小,接近光传感器280G可以包括例如发光二极管(LED)和光检测器,例如光电二极管。发光二极管可以是红外发光二极管,环境光传感器280L用于感知环境光亮度,指纹传感器280H用于采集指纹,温度传感器280J用于检测温度,触摸传感器280K,也称“触控面板”。触摸传感器280K可以设置于显示屏294,由触摸传感器280K与显示屏294组成触摸屏,也称“触控屏”。触摸传感器280K用于检测作用于其上或附近的触摸操作。骨传导传感器280M可以获取振动信号。在一些实施例中,骨传导传感器280M可以获取人体声部振动骨块的振动信号。The pressure sensor 280A is used to sense the pressure signal and can convert the pressure signal into an electrical signal. The gyro sensor 280B is used to determine the movement posture of the mobile phone 200 . The air pressure sensor 280C is used to measure the air pressure. The magnetic sensor 280D includes a Hall sensor, the acceleration sensor 280E can detect the acceleration of the mobile phone 200 in various directions (generally three axes), and the proximity light sensor 280G can include, for example, a light emitting diode (LED) and a photodetector, such as a photodiode. The light-emitting diodes can be infrared light-emitting diodes, the ambient light sensor 280L is used to sense the ambient light brightness, the fingerprint sensor 280H is used to collect fingerprints, the temperature sensor 280J is used to detect temperature, and the touch sensor 280K is also called "touch panel". The touch sensor 280K can be arranged on the display screen 294, and the touch sensor 280K and the display screen 294 form a touch screen, also called “touch screen”. The touch sensor 280K is used to detect a touch operation on or near it. The bone conduction sensor 280M can acquire vibration signals. In some embodiments, the bone conduction sensor 280M can acquire the vibration signal of the vibrating bone mass of the human voice.
按键290包括开机键,音量键等。按键290可以是机械按键。也可以是触摸式按键。手机200可以接收按键输入,产生与手机200的用户设置以及功能控制有关的键信号输入。The keys 290 include a power key, a volume key and the like. The key 290 may be a mechanical key. It can also be a touch button. The mobile phone 200 can receive key input and generate key signal input related to user settings and function control of the mobile phone 200 .
马达291可以产生振动提示。马达291可以用于来电振动提示,也可以用于触摸振动反馈。例如,作用于不同应用(例如拍照,音频播放等)的触摸操作,可以对应不同的振动反馈效果。作用于显示屏294不同区域的触摸操作,马达291也可对应不同的振动反馈效果。不同的应用场景(例如:时间提醒,接收信息,闹钟,游戏等)也可以对应不同的振动反馈效果。触摸振动反馈效果还可以支持自定义。The motor 291 can generate a vibrating prompt. The motor 291 can be used for incoming call vibration prompts, and can also be used for touch vibration feedback. For example, touch operations applied to different applications (such as taking pictures, playing audio, etc.) may correspond to different vibration feedback effects. The motor 291 can also correspond to different vibration feedback effects for touch operations acting on different areas of the display screen 294 . Different application scenarios (for example: time reminder, receiving information, alarm clock, games, etc.) can also correspond to different vibration feedback effects. The touch vibration feedback effect can also support customization.
指示器292可以是指示灯,可以用于指示充电状态,电量变化,也可以用于指示消息,未接来电,通知等。The indicator 292 can be an indicator light, which can be used to indicate the charging status, the change of the battery capacity, and also can be used to indicate messages, missed calls, notifications and so on.
SIM卡接口295用于连接SIM卡。SIM卡可以通过插入SIM卡接口295,或从SIM卡接口295拔出,实现和手机200的接触和分离。手机200可以支持1个或N个SIM卡接口,N为大于1的正整数。SIM卡接口295可以支持Nano SIM卡,Micro SIM卡, SIM卡等。同一个SIM卡接口295可以同时插入多张卡。所述多张卡的类型可以相同,也可以不同。SIM卡接口295也可以兼容不同类型的SIM卡。SIM卡接口295也可以兼容外部存储卡。手机200通过SIM卡和网络交互,实现通话以及数据通信等功能。在一些实施例中,手机200采用eSIM,即:嵌入式SIM卡。eSIM卡可以嵌在手机200中,不能和手机200分离。The SIM card interface 295 is used for connecting a SIM card. The SIM card can be inserted into the SIM card interface 295 or pulled out from the SIM card interface 295 to realize contact and separation with the mobile phone 200 . The mobile phone 200 can support 1 or N SIM card interfaces, where N is a positive integer greater than 1. SIM card interface 295 can support Nano SIM card, Micro SIM card, SIM card etc. Multiple cards can be inserted into the same SIM card interface 295 at the same time. The types of the multiple cards may be the same or different. The SIM card interface 295 is also compatible with different types of SIM cards. The SIM card interface 295 is also compatible with external memory cards. The mobile phone 200 interacts with the network through the SIM card to implement functions such as calling and data communication. In some embodiments, the mobile phone 200 adopts eSIM, that is, an embedded SIM card. The eSIM card can be embedded in the mobile phone 200 and cannot be separated from the mobile phone 200 .
可以理解的,本申请实施例中,电子设备(例如上述手机200)可以执行本申请实施例中的部分或全部步骤,这些步骤或操作仅是示例,电子设备还可以执行其它操作或者各种操作的变形。此外,各个步骤可以按照本申请实施例呈现的不同的顺序来执行,并且有可能并非要执行本申请实施例中的全部操作。本申请各实施例可以单独实施,也可以任意组合实施,本申请对此不作限定。It can be understood that in the embodiment of the present application, the electronic device (such as the above-mentioned mobile phone 200) can perform some or all of the steps in the embodiment of the present application, these steps or operations are only examples, and the electronic device can also perform other operations or various operations deformation. In addition, each step may be performed in a different order presented in the embodiment of the present application, and it may not be necessary to perform all operations in the embodiment of the present application. Each embodiment of the present application may be implemented independently or in any combination, which is not limited in the present application.
本申请实施例提供的人物三维模型的重建方法可以应用于具有如图2所示硬件结构的电子设备或者具有类似结构的电子设备。或者还可以应用于其他结构的电子设备中,本申请实施例对此不作限定。The method for reconstructing a three-dimensional model of a character provided in the embodiment of the present application may be applied to an electronic device having a hardware structure as shown in FIG. 2 or an electronic device having a similar structure. Or it may also be applied to electronic devices with other structures, which is not limited in this embodiment of the present application.
如图3所示,本申请实施例提供的人物三维模型的重建方法可以包括步骤301至步骤304。步骤301、电子设备获取目标人物的第一尺度图像和第二尺度图像。本申请实施例中,需要对目标人物进行三维模型重建,其中,第一尺度图像包括目标人物的第一部分,第二尺度图像包括目标人物的至少第二部分,第一部分是第二部分的一部分。可选地,本申请实施例提供的人物三维模型的重建方法可以用于人物不同尺度的三维模型重建,例如上半身的三维模型重建、全身三维模型重建等。As shown in FIG. 3 , the method for reconstructing a three-dimensional model of a character provided in this embodiment of the present application may include steps 301 to 304 . Step 301, the electronic device acquires a first scale image and a second scale image of a target person. In this embodiment of the present application, it is necessary to reconstruct a 3D model of the target person, wherein the first scale image includes a first part of the target person, the second scale image includes at least a second part of the target person, and the first part is a part of the second part. Optionally, the method for reconstructing a 3D model of a character provided in the embodiment of the present application may be used for 3D model reconstruction of a character at different scales, such as 3D model reconstruction of the upper body, 3D model reconstruction of the whole body, and the like.
在一种实现方式中,若进行人物上半身的三维模型重建,则上述目标人物的第一部分为目标人物的人脸,目标人物的第二部分为目标人物的上半身,也就是说,上述第一尺度图像为目标人物的人脸图像,第二尺度图像为目标人物的上半身图像,该人脸图像是上半身图像中的人脸部分。In one implementation, if the three-dimensional model of the upper body of the person is reconstructed, the first part of the target person is the face of the target person, and the second part of the target person is the upper body of the target person, that is to say, the first scale The image is a face image of the target person, and the second-scale image is an upper body image of the target person, and the face image is a face part in the upper body image.
本申请实施例中,电子设备采集到包含目标人物的图像之后,电子设备对该图像进行预处理以得到目标人物的人脸图像和目标人物的上半身图像。具体的,电子设备对采集到的图像进行人脸检测,根据人脸检测框和人脸特征点对图像进行裁剪、缩放等操作,得到目标人物的人脸图像,即第一尺度图像;并且对采集的图像进行上半身检测,根据上半身检测框和上半身特征点对图像进行裁剪、缩放的操作,得到目标人物的上半身图像,即第二尺度图像。In the embodiment of the present application, after the electronic device collects the image containing the target person, the electronic device preprocesses the image to obtain the face image of the target person and the upper body image of the target person. Specifically, the electronic device performs face detection on the collected images, and performs operations such as cropping and zooming on the images according to the face detection frame and face feature points to obtain the face image of the target person, that is, the first-scale image; and The collected image is detected for the upper body, and the image is cropped and scaled according to the upper body detection frame and the upper body feature points to obtain the upper body image of the target person, that is, the second scale image.
示例性的,人脸特征点可以包括如图4所示的68个特征点,当然人脸特征点也可以包括更少或更多的特征点,本申请实施例不做限定。上半身特征点可以包括头部、脖子、左肩、右肩、左手肘、右手肘等的特征点。Exemplarily, the face feature points may include 68 feature points as shown in FIG. 4 , of course, the face feature points may also include fewer or more feature points, which is not limited in this embodiment of the present application. The upper body feature points may include feature points of the head, neck, left shoulder, right shoulder, left elbow, right elbow, and the like.
在另一种实现方式中,若进行人物全身的三维模型重建,则上述目标人物的第一部分为目标人物的上半身,目标人物的第二部分为目标人物的全身,也就是说,上述第一尺度图像为目标人物的上半身图像,第二尺度图像为目标人物的全身图像。In another implementation, if the 3D model of the whole body of the person is reconstructed, the first part of the above-mentioned target person is the upper body of the target person, and the second part of the target person is the whole body of the target person, that is to say, the above-mentioned first scale The image is the upper body image of the target person, and the second scale image is the whole body image of the target person.
同理,电子设备采集到包含目标人物的图像之后,电子设备对该图像进行预处理以得到目标人物的上半身图像和目标人物的全身图像。具体的,电子设备对采集到的图像进行上半身检测,根据上半身检测框和上半身特征点对图像进行裁剪、缩放等操作,得到目标人物的上半身图像,即第一尺度图像;并且对采集的图像进行人体检测 (即全身检测),根据人体检测框和人体特征点对图像进行裁剪、缩放的操作,得到目标人物的全身图像,即第二尺度图像。Similarly, after the electronic device collects the image containing the target person, the electronic device preprocesses the image to obtain an upper body image of the target person and a whole body image of the target person. Specifically, the electronic device detects the upper body of the collected image, and performs operations such as cropping and zooming on the image according to the upper body detection frame and upper body feature points to obtain the upper body image of the target person, that is, the first-scale image; Human body detection (that is, whole-body detection) is to crop and zoom the image according to the human body detection frame and human body feature points, and obtain the whole-body image of the target person, that is, the second-scale image.
示例性的,上半身特征点可以包括头部、脖子、左肩、右肩、左手肘、右手肘等的特征点,人体特征点可以包括头部、脖子、左肩、右肩、左手肘、右手肘、左手、右手、左腰、右腰、左膝、右膝、左脚、右脚等特征点。Exemplarily, the feature points of the upper body may include feature points of the head, neck, left shoulder, right shoulder, left elbow, right elbow, etc., and the feature points of the human body may include the head, neck, left shoulder, right shoulder, left elbow, right elbow, Left hand, right hand, left waist, right waist, left knee, right knee, left foot, right foot and other feature points.
步骤302、电子设备确定第一尺度图像对应的第一网格三维模型。本申请实施例中,网格三维模型是三维模型的一种表示形式,示例性的,图5是一种人脸的网格三维模型的示例,结合图5可知,网格是由人脸模型中的三维顶点之间连接形成的多边形(比如三角形),应理解,模型对应的所有三维顶点的集合可以称为点云。In step 302, the electronic device determines a first grid three-dimensional model corresponding to the first scale image. In the embodiment of the present application, the grid 3D model is a representation of a 3D model. Exemplarily, FIG. 5 is an example of a grid 3D model of a human face. It can be seen from FIG. 5 that the grid is composed of a human face model A polygon (such as a triangle) formed by connections between three-dimensional vertices in the model, it should be understood that a collection of all three-dimensional vertices corresponding to the model may be called a point cloud.
结合图3,如图6所示,步骤302可以通过步骤3021至步骤3022实现。步骤3021、电子设备基于第一体素回归网络,确定第一尺度图像对应的第一体素三维模型。本申请实施例中,将第一尺度图像输入至第一体素回归网络,能够得到第一尺度图像对应的第一体素三维模型,应理解,体素三维模型是体素回归网络的输出。Referring to FIG. 3 , as shown in FIG. 6 , step 302 may be implemented through steps 3021 to 3022 . Step 3021, the electronic device determines a first voxel three-dimensional model corresponding to the first scale image based on the first voxel regression network. In the embodiment of the present application, the first scale image is input to the first voxel regression network, and the first voxel three-dimensional model corresponding to the first scale image can be obtained. It should be understood that the voxel three-dimensional model is the output of the voxel regression network.
可选地,第一体素回归网络是卷积神经网络,该卷积神经网络可以为堆叠沙漏网络,该第一体素回归网络是基于采集的多组二维图像和二维图像对应的标注了真实体素值的体素三维模型样本(训练数据集)对预设的堆叠沙漏网络训练得到的。应理解,第一尺度图像为目标人物不同部分的图像时,第一体素回归网络是基于对应的数据集训练得到,例如,第一尺度图像为目标人物的人脸图像,则第一体素回归网络是基于多组人脸图像和人脸图像对应的标注了真实体素值的人脸体素三维模型样本得到。Optionally, the first voxel regression network is a convolutional neural network, the convolutional neural network may be a stacked hourglass network, and the first voxel regression network is based on multiple sets of collected two-dimensional images and the corresponding annotations of the two-dimensional images The voxel 3D model sample (training data set) with the real voxel value is obtained by training the preset stacked hourglass network. It should be understood that when the first-scale image is an image of a different part of the target person, the first voxel regression network is trained based on the corresponding data set. For example, if the first-scale image is the face image of the target person, then the first voxel The regression network is obtained based on multiple groups of face images and face voxel 3D model samples corresponding to the face images with real voxel values marked.
综上,可以理解的是,当第一尺度图像为目标人物的人脸图像时,上述第一体素回归网络是用于预测人脸体素三维模型的体素回归网络。当第一尺度图像为目标人物的上半身图像时,上述第一体素回归网络是用于预测上半身体素三维模型的体素回归网络。To sum up, it can be understood that when the first scale image is the face image of the target person, the above-mentioned first voxel regression network is a voxel regression network for predicting the voxel three-dimensional model of the face. When the first scale image is the upper body image of the target person, the above-mentioned first voxel regression network is a voxel regression network for predicting the voxel three-dimensional model of the upper body.
步骤3022、电子设备将第一体素三维模型转换为第一网格三维模型。可选地,本申请实施例中,电子设备将第一体素三维模型转换为第一网格三维模型的方法可以是:将提取的等值面上的三维顶点按照预设的规则连接成多边形(例如三角形),从而形成第一网格三维模型,图5为网格三维模型的一种示例。该预设的规则可以是按照从左到右,从上到下的顺序依次将最近的三个三维顶点进行连接。可选地,电子设备也可以采用立体渲染的方法将第一体素三维模型转换为第一网格三维模型。In step 3022, the electronic device converts the first voxel three-dimensional model into a first grid three-dimensional model. Optionally, in the embodiment of the present application, the method for the electronic device to convert the first voxel 3D model into the first mesh 3D model may be: connect the extracted 3D vertices on the isosurface into polygons according to preset rules (such as a triangle), thereby forming a first three-dimensional grid model, and FIG. 5 is an example of a three-dimensional grid model. The preset rule may be to connect the three nearest three-dimensional vertices sequentially from left to right and from top to bottom. Optionally, the electronic device may also convert the first voxel 3D model into the first grid 3D model by using a stereo rendering method.
步骤303、电子设备确定第二尺度图像对应的第二网格三维模型。如图6所示,与上述步骤302类似,步骤303可以通过步骤3031至步骤3032实现。步骤3031、电子设备基于第二体素回归网络,确定第二尺度图像对应的第二体素三维模型。本申请实施例中,将第二尺度图像输入至第二体素回归网络,能够得到第二尺度图像对应的第二体素三维模型。 Step 303, the electronic device determines a second grid three-dimensional model corresponding to the second scale image. As shown in FIG. 6 , similar to the above step 302 , step 303 may be implemented through steps 3031 to 3032 . Step 3031, the electronic device determines a second voxel three-dimensional model corresponding to the second scale image based on the second voxel regression network. In the embodiment of the present application, the second scale image is input to the second voxel regression network, and the second voxel three-dimensional model corresponding to the second scale image can be obtained.
与第一体素回归网络的结构类似,第二体素回归网络也可以是卷积神经网络,该卷积神经网络为堆叠沙漏网络,该第二体素回归网络是基于采集的多组二维图像和二维图像对应的标注了真实体素值的体素三维模型样本(训练数据集)对预设的堆叠沙漏网络训练得到的。应理解,第二尺度图像为目标人物不同部分的图像时,第二体素回归网络是基于对应的数据集训练得到,例如,第二尺度图像为目标人物的上半身图 像,则第二体素回归网络是基于多组上半身图像和上半身图像对应的标注了真实体素值的上半身体素三维模型样本得到。Similar to the structure of the first voxel regression network, the second voxel regression network can also be a convolutional neural network, the convolutional neural network is a stacked hourglass network, and the second voxel regression network is based on multiple sets of collected two-dimensional The voxel 3D model sample (training data set) corresponding to the image and the 2D image is obtained by training the preset stacked hourglass network. It should be understood that when the second-scale image is an image of a different part of the target person, the second voxel regression network is trained based on the corresponding data set. For example, if the second-scale image is the upper body image of the target person, then the second voxel regression network The network is obtained based on multiple sets of upper body images and upper body voxel 3D model samples corresponding to the upper body images with real voxel values marked.
综上,可以理解的是,当第二尺度图像为目标人物的上半身图像时,上述第二体素回归网络是用于预测上半身体素三维模型的体素回归网络。当第二尺度图像为目标人物的全身图像时,上述第二体素回归网络是用于预测人体体素三维模型的体素回归网络。To sum up, it can be understood that when the second-scale image is the upper body image of the target person, the above-mentioned second voxel regression network is a voxel regression network for predicting the upper body voxel three-dimensional model. When the second-scale image is a whole-body image of the target person, the above-mentioned second voxel regression network is a voxel regression network for predicting a voxel three-dimensional model of a human body.
步骤3032、电子设备将第二体素三维模型转换为第二网格三维模型。需要说明的是,电子设备将第二体素三维模型转换为第二网格三维模型的方法与电子设备将第一体素三维模型转换为第一网格三维模型方法类似,因此,对于步骤3032的详细描述可以参考上述实施例中对于步骤3022的相关描述,此处不再赘述。In step 3032, the electronic device converts the second voxel three-dimensional model into a second grid three-dimensional model. It should be noted that the method for the electronic device to convert the second voxel 3D model into the second grid 3D model is similar to the method for the electronic device to convert the first voxel 3D model into the first grid 3D model. Therefore, for step 3032 For a detailed description, reference may be made to the relevant description of step 3022 in the above-mentioned embodiments, and details are not repeated here.
步骤304、电子设备对第一网格三维模型和第二网格三维模型进行融合处理,以得到目标三维模型。其中,目标三维模型用于显示目标人物的至少第二部分。In step 304, the electronic device fuses the first grid 3D model and the second grid 3D model to obtain a target 3D model. Wherein, the three-dimensional model of the target is used to display at least the second part of the target person.
本申请实施例中,若目标人物的第一部分为目标人物的人脸,目标人物的第二部分为目标人物的上半身,则目标三维模型为目标人物的上半身三维模型,即该目标三维模型用于显示目标人物的上半身。若目标人物的第一部分为目标人物的上半身,目标人物的第二部分为目标人物的全身,则目标三维模型为目标人物的人体三维模型,即该目标三维模型用于显示目标人物的全身。In the embodiment of the present application, if the first part of the target person is the face of the target person, and the second part of the target person is the upper body of the target person, then the target three-dimensional model is the upper body three-dimensional model of the target person, that is, the target three-dimensional model is used for Shows the upper body of the target person. If the first part of the target person is the upper body of the target person, and the second part of the target person is the whole body of the target person, then the target 3D model is a human body 3D model of the target person, that is, the target 3D model is used to display the target person's whole body.
可以理解的是,电子设备确定出的第一网格三维模型和第二网格三维模型可能是不对齐的,因此,在电子设备对第一网格三维模型和第二网格三维模型进行融合处理的过程中,电子设备需对第一网格三维模型和第二网格三维模型进行网格对齐处理。具体的,根据上述实施例中对于网格三维模型的介绍可知,网格对齐处理的过程可以理解为两个网格三维模型的点云配准的过程,通过网格对齐处理可以使得将第一网格三维模型中的所有网格与第二网格模型中对应的网格进行对齐,即第一网格三维模型的点云与第二网格三维模型的点云实现配准。It can be understood that the first grid 3D model and the second grid 3D model determined by the electronic device may not be aligned, therefore, the electronic device fuses the first grid 3D model and the second grid 3D model During the processing, the electronic device needs to perform grid alignment processing on the first grid 3D model and the second grid 3D model. Specifically, according to the introduction of the grid 3D model in the above embodiment, the process of grid alignment processing can be understood as the process of point cloud registration of two grid 3D models, and the grid alignment process can make the first All the grids in the grid 3D model are aligned with the corresponding grids in the second grid model, that is, the point cloud of the first grid 3D model is registered with the point cloud of the second grid 3D model.
可选地,本申请实施例中,电子设备可以先对第一网格三维模型和第二网格三维模型进行粗配准(coarse registration),然后再对第一网格三维模型和第二网格三维模型进行精配准(fine registration),其中,粗配准主要指的是在两幅点云之间的变换关系未知的情况下,计算两幅点云之间的仿射变换矩阵(仿射变换矩阵包括旋转矩阵和平移矩阵);精配准指的是在粗配准计算得到的仿射变换矩阵的基础上,计算更加精准的仿射变换矩阵。Optionally, in this embodiment of the present application, the electronic device may perform coarse registration on the first grid 3D model and the second grid 3D model, and then perform coarse registration on the first grid 3D model and the second grid 3D model. The fine registration is performed on the three-dimensional grid model. Among them, the coarse registration mainly refers to calculating the affine transformation matrix between the two point clouds when the transformation relationship between the two point clouds is unknown. The projection transformation matrix includes a rotation matrix and a translation matrix); fine registration refers to calculating a more accurate affine transformation matrix on the basis of the affine transformation matrix calculated by rough registration.
示例性,以目标人物的第一部分为目标人物的人脸,第二部分为目标人物的上半身为例,上述粗配准的过程中,电子设备可以使用人脸图像和上半身中的人脸图像的人脸特征点形成的点云计算仿射变换矩阵,例如图4所示的68个人脸特征点。粗配准之后,再将粗配准得到的仿射变换矩阵作为初始的仿射变换矩阵,计算更加精准的仿射变化矩阵,以对齐第一网格三维模型和第二网格三维模型。可选地,精配准算法可以为迭代最近点(iterative closest point,ICP)算法或各种变形的ICP算法,本申请实施例不做限定。Exemplarily, taking the first part of the target person as the face of the target person, and the second part as the upper body of the target person as an example, in the above rough registration process, the electronic device can use the combination of the face image and the face image in the upper body The point cloud calculation affine transformation matrix formed by the face feature points, for example, the 68 face feature points shown in Figure 4. After the rough registration, the affine transformation matrix obtained by the rough registration is used as the initial affine transformation matrix to calculate a more accurate affine transformation matrix to align the first grid 3D model and the second grid 3D model. Optionally, the fine registration algorithm may be an iterative closest point (ICP) algorithm or various variants of the ICP algorithm, which is not limited in this embodiment of the present application.
可选地,电子设备对第一网格三维模型和第二网格三维模型进行融合处理是将三维模型投影至二维平面,根据得到的平面展开图进行融合处理,融合处理完成之后再 将平面展开图投影至三维空间,得到目标三维模型。结合图6,如图7所示,步骤304可以通过步骤3041至步骤3045实现。Optionally, the fusion processing of the first grid 3D model and the second grid 3D model by the electronic device is to project the 3D model onto a 2D plane, perform fusion processing according to the obtained plane expansion diagram, and then merge the plane The expanded image is projected into a three-dimensional space to obtain the target three-dimensional model. Referring to FIG. 6 , as shown in FIG. 7 , step 304 may be implemented through steps 3041 to 3045 .
步骤3041、电子设备将第一网格三维模型转换至二维平面,得到第一平面展开图。步骤3042、电子设备将第二网格三维模型转换至二维平面,得到第二平面展开图。其中,第一平面展开图中的第一图像区域对应第二平面展开图中的第二图像区域,第一图像区域和第二图像区域对应目标人物的第一部分。例如,第一网格三维模型是人脸的网格三维模型,第二网格三维模型为上半身的网格三维模型,则第一平面展开图中的第一图像区域可以为人脸对应的区域,则第二平面展开图中的第二图像区域也是人脸对应的区域,即第一图像区域和第二图像区域对应目标人物的第一部分(即人脸部分)。 Step 3041. The electronic device converts the first grid 3D model to a 2D plane to obtain a first plane expansion diagram. In step 3042, the electronic device converts the 3D model of the second grid into a 2D plane to obtain a second plane expansion diagram. Wherein, the first image area in the first plan expansion view corresponds to the second image area in the second plan expansion view, and the first image area and the second image area correspond to the first part of the target person. For example, the first grid 3D model is a grid 3D model of a human face, and the second grid 3D model is a grid 3D model of an upper body, then the first image area in the first plane expanded view may be an area corresponding to a human face, Then the second image area in the second plane expanded view is also an area corresponding to the face, that is, the first image area and the second image area correspond to the first part (ie, the face part) of the target person.
本申请实施例中,电子设备可以采用二维参数化技术将第一网格三维模型和第二网格三维模型投影至二维平面,得到第一平面展开图和第二平面展开图。例如可以采用圆柱形投影,将一个圆柱面包围椭球体,并使之相切或相割,再根据某种条件将椭球面上的经纬网点投影到圆柱面上,然后,沿着圆柱面的一条母线切开,将其展成平面,得到第一平面展开图或第二平面展开图。例如,图8为上半身对应的网格三维模型投影至二维平面之后的平面展开图。In the embodiment of the present application, the electronic device may project the first grid three-dimensional model and the second grid three-dimensional model to a two-dimensional plane by using two-dimensional parameterization technology to obtain the first plane expansion diagram and the second plane expansion diagram. For example, a cylindrical projection can be used to surround a cylindrical surface and make it tangent or cut, and then project the latitude and longitude points on the ellipsoid surface onto the cylindrical surface according to certain conditions, and then, along a line of the cylindrical surface, The bus bar is cut open and developed into a plane to obtain the first plane expansion diagram or the second plane expansion diagram. For example, FIG. 8 is a plane expansion diagram after the grid 3D model corresponding to the upper body is projected onto a 2D plane.
步骤3043、电子设备对第一平面展开图进行裁剪,以获取第一图像区域,并且对第二平面展开图进行裁剪,以获取第二图像区域。本申请实施例中,以第一图像区域为人脸部分为例,电子设备根据人脸特征点生成脸部范围的掩膜(也可以称为脸部区域框),然后根据脸部区域框从第一平面展开图和第二平面展开图裁剪出人脸部分。 Step 3043, the electronic device crops the first plane expansion view to obtain the first image area, and crops the second plane expansion view to obtain the second image area. In the embodiment of the present application, taking the first image area as a face part as an example, the electronic device generates a mask of the face range (also called a face area frame) according to the face feature points, and then uses the face area frame from the first The first-plane expanded view and the second-planar expanded view are used to cut out the face part.
步骤3044、电子设备将第二平面展开图中的第二图像区域替换为第一平面展开图中的第一图像区域,得到目标人物的目标平面展开图。本申请实施例中,电子设备对第一图像区域的边缘与第二平面展开图中去除第二图像区域之后的边缘进行缝合,得到目标人物的目标平面展开图。应理解,上述电子设备从第一平面展开图中裁剪得到第一图像区域,并且电子设备对第二平面展开图进行裁剪,去掉第二平面展开图中的第二图像区域,电子设备分别对第一图像区域的边缘进行平滑,对第二平面展开图中去除第二图像区域之后的边缘也进行平滑之后,电子设备将第一图像区域的边缘与第二平面展开图中去除第二图像区域之后的边缘进行缝合,得到目标平面展开图。具体的,电子设备将第一图像区域的边缘和第二平面展开图中去除第二图像区域之后的边缘作为约束条件,计算这两个边缘的最接近的顶点,然后将两个边缘中的最接近的顶点连接起来,实现边缘缝合。示例性的,图9中的(a)是待缝合的平面展开图的效果示意图,图9中(b)是待缝合的平面展开图转换至三维空间的效果示意图。 Step 3044, the electronic device replaces the second image area in the second expanded plan view with the first image area in the first expanded plan view, to obtain the target expanded plan view of the target person. In the embodiment of the present application, the electronic device stitches the edge of the first image area and the edge after removing the second image area in the second plan view to obtain the target plan view of the target person. It should be understood that the electronic device cuts out the first image area from the first expanded view, and the electronic device cuts the second expanded view to remove the second image area in the second expanded view. After smoothing the edge of an image area, and smoothing the edge after removing the second image area in the second plane expansion diagram, the electronic device compares the edge of the first image area with the second plane expansion diagram after removing the second image area The edge is stitched to obtain the target plane expansion diagram. Specifically, the electronic device takes the edge of the first image region and the edge after removing the second image region in the second plane expansion diagram as constraint conditions, calculates the closest vertex of the two edges, and then calculates the closest vertex of the two edges to Close vertices are joined to achieve edge stitching. Exemplarily, (a) in FIG. 9 is a schematic diagram of the effect of the unfolded plan view to be stitched, and (b) in FIG. 9 is a schematic diagram of the effect of converting the unfolded plan view to be stitched into a three-dimensional space.
步骤3045、电子设备对目标二维平面展开图进行三维转换,得到目标三维模型。应理解,上述步骤3041或步骤3042中的将三维的信息映射至二维平面的过程的逆过程将目标二维平面展开图进行三维转换,得到目标三维模型,该目标三维模型为网格三维模型。In step 3045, the electronic device performs three-dimensional conversion on the two-dimensional plane expansion diagram of the target to obtain a three-dimensional model of the target. It should be understood that the inverse process of the process of mapping the three-dimensional information to the two-dimensional plane in the above step 3041 or step 3042 performs three-dimensional conversion on the target two-dimensional plane expansion diagram to obtain the target three-dimensional model, and the target three-dimensional model is a grid three-dimensional model .
可选地,结合图7,如图10所示,电子设备在对第一网格三维模型和第二网格三维模型进行融合处理(即步骤304)之前,本申请实施例提供的人物三维模型的重建方法还包括步骤305。步骤305、电子设备对第一网格三维模型和/或第二网格三维模 型进行网格平滑处理或网格简化处理中的至少一种。Optionally, referring to FIG. 7, as shown in FIG. 10, before the electronic device performs fusion processing on the first grid 3D model and the second grid 3D model (that is, step 304), the character 3D model provided by the embodiment of the present application The reconstruction method further includes step 305. Step 305, the electronic device performs at least one of mesh smoothing or mesh simplification on the first mesh 3D model and/or the second mesh 3D model.
本申请实施例中,在一种情况下,通过上述步骤302获得的第一网格三维模型和通过步骤303获得的第二网络三维模型中,可能存在噪声网格(即与实际模型偏差较大的网格),为了得到更加准确的目标三维模型,电子设备可以采用相关的网格平滑算法对第一网格三维模型和/或第二网格三维模型进行网格平滑处理,以删除噪声网格。示例性的,网格平滑算法可以为Taubin平滑算法、拉普拉斯(Laplacian)平滑算法、平均曲率(Curvature)平滑算法中的任一种,具体根据实际情况选择,本申请实施例不做限定。In the embodiment of the present application, in one case, in the first grid 3D model obtained through the above step 302 and the second network 3D model obtained through step 303, there may be noise grids (that is, a large deviation from the actual model grid), in order to obtain a more accurate target 3D model, the electronic device can use a related grid smoothing algorithm to perform grid smoothing on the first grid 3D model and/or the second grid 3D model to delete the noise network grid. Exemplarily, the grid smoothing algorithm can be any one of Taubin smoothing algorithm, Laplacian smoothing algorithm, and average curvature (Curvature) smoothing algorithm, which is selected according to the actual situation, and is not limited in the embodiment of the present application .
本申请实施例中,在另一种情况下,通过上述步骤302获得的第一网格三维模型和通过步骤303获得的第二网络三维模型中,网格的密度可能较大,如此,进行网格融合处理的计算量较大,且耗时较久,为了降低网格三维模型融合过程中的计算量,电子设备可以采用网格简化算法对第一网格三维模型和/或第二网格三维模型进行网格简化处理。示例性的,网格简化算法可以为边折叠(Edge Collapse)算法、基于度量的边折叠算法中的任一种,具体根据实际情况选择,本申请实施例不做限定。In the embodiment of the present application, in another case, in the first grid 3D model obtained through the above step 302 and the second network 3D model obtained through step 303, the density of the grids may be relatively high, so the network Grid fusion processing requires a large amount of calculation and takes a long time. In order to reduce the calculation amount in the process of grid 3D model fusion, the electronic device can use the grid simplification algorithm to process the first grid 3D model and/or the second grid The 3D model is subjected to mesh simplification. Exemplarily, the grid simplification algorithm may be any one of the edge collapse (Edge Collapse) algorithm and the metric-based edge collapse algorithm, which is selected according to the actual situation, and is not limited in the embodiment of the present application.
可选地,当需要对网格三维模型进行网格平滑处理和网格简化处理时,可以不限定这两种处理的顺序,例如,可以先对网格三维模型进行网格平滑处理,后对平滑之后的网格三维模型进行网格简化处理;也可以先对网格三维模型进行网格简化处理,后对简化之后的网格三维模型进行网格平滑处理。在一种实现方式中,先对网格三维模型进行网格平滑处理,后对平滑之后的网格三维模型进行网格简化处理,这样能够更好地提升重建的三维模型的质量。Optionally, when it is necessary to perform mesh smoothing and mesh simplification on the grid 3D model, the order of these two processes may not be limited. For example, the grid smoothing process may be performed on the grid 3D model first, and then The smoothed 3D mesh model is subjected to mesh simplification; the mesh simplification may also be performed on the mesh 3D model first, and then the mesh smoothing is performed on the simplified mesh 3D model. In an implementation manner, the grid smoothing process is first performed on the grid three-dimensional model, and then the grid simplification process is performed on the smoothed grid three-dimensional model, which can better improve the quality of the reconstructed three-dimensional model.
可选地,本申请实施例中,电子设备对第一图像区域的边缘与第二平面展开图中去除第二图像区域之后的边缘进行缝合,得到目标人物的目标平面展开图之后,电子设备在该目标平面展开图上进行纹理重计算,得到目标平面展开图中对应目标人物的第一部分(例如人脸区域)的网格三维模型的顶点坐标。具体的,电子设备计算第一网格三维模型与第二网格三维模型中被该第一网格三维模型所覆盖(或者称为替换)的区域之间的顶点映射关系,根据顶点映射关系,使用重心坐标进行插值,得到目标平面展开图中对应目标人物的第一部分(例如人脸区域)的网格三维模型的顶点坐标。Optionally, in the embodiment of the present application, the electronic device stitches the edge of the first image area with the edge after removing the second image area in the second plane expansion view, and after obtaining the target plane expansion view of the target person, the electronic device Texture recalculation is performed on the expanded target plane to obtain the vertex coordinates of the mesh three-dimensional model corresponding to the first part (for example, the face area) of the target person in the expanded target plane. Specifically, the electronic device calculates the vertex mapping relationship between the first grid 3D model and the area covered (or called replaced) by the first grid 3D model in the second grid 3D model. According to the vertex mapping relationship, The center of gravity coordinates are used for interpolation to obtain the vertex coordinates of the grid three-dimensional model corresponding to the first part (for example, the face area) of the target person in the target plane expanded view.
以第一尺度图像为目标人物的人脸图像,第二尺度图像为目标人物的上半身图像为例,根据目标人物的人脸图像和目标人物的上半身图像重建目标人物的上半身三维模型的过程可以参考图11示意的流程,首先,对采集的包含目标人物的图像进行预处理(即人脸检测、上半身检测、裁剪等)得到目标人物的人脸图像和目标人物的上半身图像;其次,将人脸图像输入至第一体素回归网络得到人脸体素三维模型,将上半身图像输入至第二体素回归网络得到上半身体素三维模型;再次,将人脸体素三维模型转换为人脸网格三维模型,将上半身体素三维模型转换为上半身网格三维模型;进而,对人脸网格三维模型和上半身网格三维模型进行网格平滑或网格简化中的至少一种处理;最后,将处理后的人脸网格模型和处理后的上半身网格三维模型进行网格融合处理,得到目标人物的上半身三维模型。Taking the first-scale image as the face image of the target person and the second-scale image as the upper body image of the target person as an example, the process of reconstructing the 3D model of the upper body of the target person based on the face image of the target person and the upper body image of the target person can be referred to The flow chart shown in Fig. 11, firstly, preprocessing (i.e., face detection, upper body detection, cropping, etc.) is performed on the collected image containing the target person to obtain the face image of the target person and the upper body image of the target person; secondly, the human face The image is input to the first voxel regression network to obtain a 3D face voxel model, and the upper body image is input to the second voxel regression network to obtain a 3D upper body body voxel model; again, the face voxel 3D model is converted into a face mesh 3D model Model, converting the upper body voxel 3D model into an upper body mesh 3D model; then, performing at least one of mesh smoothing or mesh simplification on the face mesh 3D model and upper body mesh 3D model; finally, processing The final face mesh model and the processed upper body mesh three-dimensional model are subjected to mesh fusion processing to obtain the upper body three-dimensional model of the target person.
综上,本申请实施例提供的人物三维模型的重建方法中,电子设备可以获取到目标人物的第一尺度图像和第二尺度图像之后,电子设备确定第一尺度图像对应的第一 网格三维模型,并且确定第二尺度图像对应的网格三维模型,然后,电子设备对第一尺度图像对应的第一网格三维模型和第二尺度图像对应的第二网格三维模型进行融合处理,以得到目标三维模型,该目标三维模型用于显示目标人物的至少第二部分。其中,第一尺度图像包括目标人物的第一部分,第二尺度图像包括目标人物的至少第二部分,该第一部分是第二部分的一部分。通过本申请实施例提供的人物三维模型的重建方法,结合小尺度图像(例如第一尺度图像)的三维模型所体现的细节信息丰富、分辨率高的优点,以及大尺度图像(例如第二尺度图像)的三维模型能体现较大范围的人物特征(即整体性)的优点,将目标人物不同尺度的图像的三维模型进行融合,得到模型更加完整、且细节信息更加丰富的目标人物的三维模型,不仅延伸了人物三维模型的应用场景,而且提升了重建的人物三维模型的质量,从而提升用户体验。To sum up, in the method for reconstructing a 3D model of a character provided by the embodiment of the present application, after the electronic device can obtain the first scale image and the second scale image of the target person, the electronic device determines the first grid 3D model corresponding to the first scale image. model, and determine the grid 3D model corresponding to the second-scale image, and then, the electronic device performs fusion processing on the first grid 3D model corresponding to the first-scale image and the second grid 3D model corresponding to the second-scale image, to A three-dimensional model of the target is obtained, and the three-dimensional model of the target is used to display at least a second portion of the target person. Wherein, the first scale image includes a first part of the target person, the second scale image includes at least a second part of the target person, and the first part is a part of the second part. Through the method for reconstructing a 3D model of a character provided by the embodiment of the present application, the advantages of rich detailed information and high resolution embodied in the 3D model of a small-scale image (such as a first-scale image) and the advantages of a large-scale image (such as a second-scale image) The 3D model of the image) can reflect the advantages of a wide range of character characteristics (that is, integrity), and the 3D models of the images of different scales of the target person are fused to obtain a 3D model of the target person with a more complete model and richer detailed information. , which not only extends the application scenarios of the three-dimensional character model, but also improves the quality of the reconstructed three-dimensional model of the character, thereby improving user experience.
相应地,本申请实施例提供一种用于人物三维模型的重建装置,该装置可以应用于电子设备,可以根据上述方法示例对该装置进行功能模块的划分,例如,可以对应各个功能划分各个功能模块,也可以将两个或两个以上的功能集成在一个处理模块中。上述集成的模块既可以采用硬件的形式实现,也可以采用软件功能模块的形式实现。需要说明的是,本发明实施例中对模块的划分是示意性的,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式。Correspondingly, an embodiment of the present application provides a device for reconstructing a 3D model of a character. The device can be applied to electronic equipment, and the functional modules of the device can be divided according to the above method example. For example, each function can be divided corresponding to each function module, or integrate two or more functions into one processing module. The above-mentioned integrated modules can be implemented in the form of hardware or in the form of software function modules. It should be noted that the division of modules in the embodiment of the present invention is schematic, and is only a logical function division, and there may be another division manner in actual implementation.
在采用对应各个功能划分各个功能模块的情况下,图12示出上述实施例中所涉及的用于人物三维模型的重建装置的一种可能的结构示意图。如图12所示,该装置包括获取模块1201、确定模块1202和融合模块1203。获取模块1201,用于获取目标人物的第一尺度图像和第二尺度图像,该第一尺度图像包括目标人物的第一部分,该第二尺度图像包括目标人物的至少第二部分,该第一部分是第二部分的一部分,例如执行上述方法实施例中的步骤301。确定模块1202,用于确定第一尺度图像对应的第一网格三维模型,并且确定第二尺度图像对应的第二网格三维模型,例如执行上述方法实施例中的步骤302和步骤303。融合模块1203,用于对第一网格三维模型和第二网格三维模型进行融合处理,以得到目标三维模型,该目标三维模型用于显示目标人物的至少第二部分,例如执行上述方法实施例中的步骤304。In the case of dividing each functional module corresponding to each function, FIG. 12 shows a possible structural schematic diagram of the apparatus for reconstructing a 3D model of a person involved in the above embodiment. As shown in FIG. 12 , the device includes an acquisition module 1201 , a determination module 1202 and a fusion module 1203 . An acquisition module 1201, configured to acquire a first scale image of the target person and a second scale image, the first scale image includes a first part of the target person, the second scale image includes at least a second part of the target person, the first part is A part of the second part, for example, executes step 301 in the above method embodiment. The determining module 1202 is configured to determine the first grid 3D model corresponding to the first-scale image, and determine the second grid 3D model corresponding to the second-scale image, for example, perform steps 302 and 303 in the above method embodiments. The fusion module 1203 is configured to perform fusion processing on the first grid 3D model and the second grid 3D model to obtain a target 3D model, and the target 3D model is used to display at least the second part of the target person, for example, performing the above method implementation Step 304 in the example.
可选地,上述确定模块1202具体用于基于第一体素回归网络,确定第一尺度图像对应的第一体素三维模型,并且将第一体素三维模型转换为第一网格三维模型;以及基于第二体素回归网络,确定第二尺度图像对应的第二体素三维模型,并且将第二体素三维模型转换为第二网格三维模型,例如执行上述方法实施例中的步骤3021至步骤3022,步骤3031至步骤3032。Optionally, the determination module 1202 is specifically configured to determine a first voxel 3D model corresponding to the first scale image based on the first voxel regression network, and convert the first voxel 3D model into a first grid 3D model; And based on the second voxel regression network, determine the second voxel three-dimensional model corresponding to the second scale image, and convert the second voxel three-dimensional model into the second grid three-dimensional model, for example, perform step 3021 in the above method embodiment Go to step 3022, step 3031 to step 3032.
可选地,上述融合模块1203具体用于将第一网格三维模型转换至二维平面,得到第一平面展开图;并且将第二网格三维模型转换至二维平面,得到第二平面展开图;该第一平面展开图中的第一图像区域对应第二平面展开图中的第二图像区域,第一图像区域和第二图像区域对应第一部分;以及对第一平面展开图进行裁剪,以获取第一图像区域,对第二平面展开图进行裁剪,以获取第二图像区域;并将第二平面展开图中的第二图像区域替换为第一图像区域,得到目标人物的目标平面展开图;再对该目标平面展开图进行三维转换,得到目标三维模型,例如执行上述方法实施例中的步骤3041至步骤3045。Optionally, the above fusion module 1203 is specifically used to convert the first grid 3D model to a 2D plane to obtain the first plane expansion; and convert the second grid 3D model to a 2D plane to obtain the second plane expansion Figure; the first image area in the first plane expanded view corresponds to the second image area in the second plane expanded view, and the first image area and the second image area correspond to the first part; and cutting the first plane expanded view, To obtain the first image area, crop the second plane expansion diagram to obtain the second image area; and replace the second image area in the second plane expansion diagram with the first image area to obtain the target plane expansion of the target person Fig. 3D transformation is performed on the target plane expanded view to obtain the target 3D model, for example, step 3041 to step 3045 in the above-mentioned method embodiment is performed.
可选地,本申请实施例提供的用于人物三维模型的重建装置还包括处理模块1204,该处理模块1204用于对第一网格三维模型和/或第二网格三维模型进行网格平滑处理或网格简化处理中的至少一种,例如执行上述方法实施例中的步骤305。Optionally, the apparatus for reconstructing a 3D model of a character provided in the embodiment of the present application further includes a processing module 1204, which is used to perform grid smoothing on the first grid 3D model and/or the second grid 3D model At least one of processing or mesh simplification processing, for example, execute step 305 in the above method embodiment.
上述用于人物三维模型的重建装置的各个模块还可以用于执行上述方法实施例中的其他动作,上述方法实施例涉及的各步骤的所有相关内容均可以援引到对应功能模块的功能描述,在此不再赘述。Each module of the above-mentioned reconstruction device for a three-dimensional model of a person can also be used to perform other actions in the above-mentioned method embodiment. All relevant content of each step involved in the above-mentioned method embodiment can be referred to the function description of the corresponding functional module. This will not be repeated here.
在采用集成的单元的情况下,图13示出了上述实施例中所涉及的用于人物三维模型的重建装置的另一种可能的结构示意图。如图13所示,本申请实施例提供的用于人物三维模型的重建装置可以包括:处理模块1301和通信模块1302。处理模块1301可以用于对该装置的动作进行控制管理,例如,处理模块1301可以用于支持该装置备执行上述方法实施例中的步骤301至步骤304、步骤305,和/或用于本文所描述的技术的其它过程。通信模块1302可以用于支持该装置与其他网络实体的通信。可选地,如图13所示,该用于人物三维模型的重建装置还可以包括存储模块1303,用于存储该装置的程序代码和数据。In the case of using integrated units, FIG. 13 shows another possible structural schematic diagram of the apparatus for reconstructing a 3D model of a person involved in the above embodiment. As shown in FIG. 13 , the apparatus for reconstructing a three-dimensional model of a character provided by the embodiment of the present application may include: a processing module 1301 and a communication module 1302 . The processing module 1301 can be used to control and manage the actions of the device. For example, the processing module 1301 can be used to support the device to execute steps 301 to 304 and 305 in the above method embodiments, and/or to implement the steps described herein. Other procedures of the described techniques. A communications module 1302 may be used to support communications of the device with other network entities. Optionally, as shown in FIG. 13 , the apparatus for reconstructing a three-dimensional model of a person may further include a storage module 1303 for storing program codes and data of the apparatus.
其中,处理模块1301可以是处理器或控制器(例如可以是上述如图2所示的处理器210),例如可以是中央处理器(central processing unit,CPU)、通用处理器、数字信号处理器(digital signal processor,DSP)、专用集成电路(application-specific integrated circuit,ASIC)、现场可编程门阵列(field programmable gate array,FPGA)或者其他可编程逻辑器件、晶体管逻辑器件、硬件部件或者其任意组合。其可以实现或执行结合本发明实施例公开内容所描述的各种示例性的逻辑方框、模块和电路。上述处理器也可以是实现计算功能的组合,例如包含一个或多个微处理器组合,DSP和微处理器的组合等等。通信模块1302可以是收发器、收发电路或通信接口等(例如可以是上述如图2所示的移动通信模块250或无线通信模块260)。存储模块1303可以是存储器(例如可以是上述如图2所示的内部存储器221)。Wherein, the processing module 1301 may be a processor or a controller (such as the above-mentioned processor 210 shown in FIG. 2 ), such as a central processing unit (central processing unit, CPU), a general purpose processor, a digital signal processor (digital signal processor, DSP), application-specific integrated circuit (ASIC), field programmable gate array (field programmable gate array, FPGA) or other programmable logic devices, transistor logic devices, hardware components or any of them combination. It can implement or execute various exemplary logical blocks, modules and circuits described in conjunction with the disclosure of the embodiments of the present invention. The above-mentioned processors may also be a combination of computing functions, for example, a combination of one or more microprocessors, a combination of DSP and a microprocessor, and so on. The communication module 1302 may be a transceiver, a transceiver circuit, or a communication interface (for example, it may be the mobile communication module 250 or the wireless communication module 260 shown in FIG. 2 ). The storage module 1303 may be a memory (for example, it may be the above-mentioned internal memory 221 shown in FIG. 2 ).
当处理模块1301为处理器,通信模块1302为收发器,存储模块1303为存储器时,处理器、收发器和存储器可以通过总线连接。总线可以是外设部件互连标准(peripheral component interconnect,PCI)总线或扩展工业标准结构(extended Industry standard architecture,EISA)总线等。总线可以分为地址总线、数据总线、控制总线等。When the processing module 1301 is a processor, the communication module 1302 is a transceiver, and the storage module 1303 is a memory, the processor, the transceiver, and the memory may be connected through a bus. The bus may be a peripheral component interconnect standard (peripheral component interconnect, PCI) bus or an extended industry standard architecture (extended Industry standard architecture, EISA) bus or the like. The bus can be divided into address bus, data bus, control bus and so on.
上述用于人物三维模型的重建装置包含的模块实现上述功能的更多细节请参考前面各个方法实施例中的描述,在这里不再重复。For more details about the modules contained in the above-mentioned reconstruction device for a three-dimensional model of a person to realize the above-mentioned functions, please refer to the descriptions in the previous method embodiments, and will not be repeated here.
本说明书中的各个实施例均采用递进的方式描述,各个实施例之间相同相似的部分互相参见即可,每个实施例重点说明的都是与其他实施例的不同之处。Each embodiment in this specification is described in a progressive manner, the same and similar parts of each embodiment can be referred to each other, and each embodiment focuses on the differences from other embodiments.
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件程序实现时,可以全部或部分地以计算机程序产品的形式实现。该计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行该计算机指令时,全部或部分地产生按照本申请实施例中的流程或功能。该计算机可以是通用计算机、专用计算机、计算机网络或者其他可编程装置。该计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,该计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如 同轴电缆、光纤、数字用户线(digital subscriber line,DSL))方式或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心传输。该计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包括一个或多个可用介质集成的服务器、数据中心等数据存储设备。该可用介质可以是磁性介质(例如,软盘、磁盘、磁带)、光介质(例如,数字视频光盘(digital video disc,DVD))、或者半导体介质(例如固态硬盘(solid state drives,SSD))等。In the above embodiments, all or part of them may be implemented by software, hardware, firmware or any combination thereof. When implemented using a software program, it may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer instructions are loaded and executed on the computer, all or part of the processes or functions according to the embodiments of the present application will be generated. The computer can be a general purpose computer, special purpose computer, computer network, or other programmable device. The computer instructions may be stored in or transmitted from one computer-readable storage medium to another computer-readable storage medium, for example, the computer instructions may be transferred from a website, computer, server, or data center by wire (such as coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (such as infrared, wireless, microwave, etc.) to another website site, computer, server or data center. The computer-readable storage medium may be any available medium that can be accessed by a computer, or a data storage device including a server, a data center, and the like integrated with one or more available media. The available medium may be a magnetic medium (for example, a floppy disk, a magnetic disk, a magnetic tape), an optical medium (for example, a digital video disc (digital video disc, DVD)), or a semiconductor medium (for example, a solid state drive (solid state drives, SSD)), etc. .
通过以上的实施方式的描述,所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,仅以上述各功能模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能模块完成,即将装置的内部结构划分成不同的功能模块,以完成以上描述的全部或者部分功能。上述描述的系统,装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Through the description of the above embodiments, those skilled in the art can clearly understand that for the convenience and brevity of the description, only the division of the above-mentioned functional modules is used as an example for illustration. In practical applications, the above-mentioned functions can be allocated according to needs It is completed by different functional modules, that is, the internal structure of the device is divided into different functional modules to complete all or part of the functions described above. For the specific working process of the above-described system, device, and unit, reference may be made to the corresponding process in the foregoing method embodiments, and details are not repeated here.
在本申请所提供的几个实施例中,应该理解到,所揭露的系统,装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed system, device and method can be implemented in other ways. For example, the device embodiments described above are only illustrative. For example, the division of the modules or units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components can be Incorporation may either be integrated into another system, or some features may be omitted, or not implemented. In another point, the mutual coupling or direct coupling or communication connection shown or discussed may be through some interfaces, and the indirect coupling or communication connection of devices or units may be in electrical, mechanical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in one place, or may be distributed to multiple network units. Part or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit. The above-mentioned integrated units can be implemented in the form of hardware or in the form of software functional units.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)或处理器执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:快闪存储器、移动硬盘、只读存储器、随机存取存储器、磁碟或者光盘等各种可以存储程序代码的介质。If the integrated unit is realized in the form of a software function unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application is essentially or part of the contribution to the prior art or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium , including several instructions to make a computer device (which may be a personal computer, a server, or a network device, etc.) or a processor execute all or part of the steps of the method described in each embodiment of the present application. The aforementioned storage medium includes: flash memory, mobile hard disk, read-only memory, random access memory, magnetic disk or optical disk, and other various media capable of storing program codes.
以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应以所述权利要求的保护范围为准。The above is only a specific embodiment of the present invention, but the scope of protection of the present invention is not limited thereto. Anyone skilled in the art can easily think of changes or substitutions within the technical scope disclosed in the present invention. Should be covered within the protection scope of the present invention. Therefore, the protection scope of the present invention should be determined by the protection scope of the claims.

Claims (13)

  1. 一种人物三维模型的重建方法,其特征在于,应用于电子设备,所述方法包括:A method for reconstructing a three-dimensional model of a person, characterized in that it is applied to electronic equipment, and the method includes:
    获取目标人物的第一尺度图像和第二尺度图像,所述第一尺度图像包括所述目标人物的第一部分,所述第二尺度图像包括所述目标人物的至少第二部分,所述第一部分是所述第二部分的一部分;Acquiring a first scale image and a second scale image of the target person, the first scale image includes a first part of the target person, the second scale image includes at least a second part of the target person, the first part is part of said second part;
    确定所述第一尺度图像对应的第一网格三维模型;determining a first grid 3D model corresponding to the first scale image;
    确定所述第二尺度图像对应的第二网格三维模型;determining a second grid 3D model corresponding to the second scale image;
    对所述第一网格三维模型和所述第二网格三维模型进行融合处理,以得到目标三维模型,所述目标三维模型用于显示所述目标人物的至少第二部分。Fusion processing is performed on the first grid 3D model and the second grid 3D model to obtain a target 3D model, and the target 3D model is used to display at least a second part of the target person.
  2. 根据权利要求1所述的方法,其特征在于,The method according to claim 1, characterized in that,
    所述确定所述第一尺度图像对应的第一网格三维模型,包括:The determining the first grid 3D model corresponding to the first scale image includes:
    基于第一体素回归网络,确定所述第一尺度图像对应的第一体素三维模型,并且将所述第一体素三维模型转换为所述第一网格三维模型;determining a first voxel three-dimensional model corresponding to the first scale image based on the first voxel regression network, and converting the first voxel three-dimensional model into the first grid three-dimensional model;
    所述确定所述第二尺度图像对应的第二网格三维模型,包括:The determining the second grid 3D model corresponding to the second scale image includes:
    基于第二体素回归网络,确定所述第二尺度图像对应的第二体素三维模型,并且将所述第二体素三维模型转换为所述第二网格三维模型。Based on the second voxel regression network, a second voxel three-dimensional model corresponding to the second scale image is determined, and the second voxel three-dimensional model is converted into the second grid three-dimensional model.
  3. 根据权利要求1或2所述的方法,其特征在于,所述对所述第一网格三维模型和所述第二网格三维模型进行融合处理,以得到目标三维模型,包括:The method according to claim 1 or 2, wherein the fusion processing of the first grid 3D model and the second grid 3D model to obtain a target 3D model includes:
    将所述第一网格三维模型转换至二维平面,得到第一平面展开图;converting the first grid three-dimensional model to a two-dimensional plane to obtain a first plane expansion diagram;
    将所述第二网格三维模型转换至二维平面,得到第二平面展开图;所述第一平面展开图中的第一图像区域对应所述第二平面展开图中的第二图像区域,所述第一图像区域和第二图像区域对应所述第一部分;converting the second grid three-dimensional model to a two-dimensional plane to obtain a second plane expansion; the first image area in the first plane expansion corresponds to the second image area in the second plane expansion, The first image area and the second image area correspond to the first portion;
    对所述第一平面展开图进行裁剪,以获取所述第一图像区域;clipping the first plane expanded view to obtain the first image area;
    对所述第二平面展开图进行裁剪,以获取所述第二图像区域;clipping the second plane expansion image to obtain the second image area;
    将所述第二平面展开图中的所述第二图像区域替换为所述第一图像区域,得到所述目标人物的目标平面展开图;replacing the second image area in the second plane expansion diagram with the first image area to obtain a target plane expansion diagram of the target person;
    对所述目标平面展开图进行三维转换,得到所述目标三维模型。Three-dimensional conversion is performed on the target plane expansion diagram to obtain the target three-dimensional model.
  4. 根据权利要求1至3任一项所述的方法,其特征在于,在对所述第一网格三维模型和所述第二网格三维模型进行融合处理之前,所述方法还包括:The method according to any one of claims 1 to 3, wherein before performing fusion processing on the first grid 3D model and the second grid 3D model, the method further comprises:
    对所述第一网格三维模型和/或所述第二网格三维模型进行网格平滑处理或网格简化处理中的至少一种。At least one of mesh smoothing or mesh simplification is performed on the first mesh 3D model and/or the second mesh 3D model.
  5. 根据权利要求1至4任一项所述的方法,其特征在于,The method according to any one of claims 1 to 4, characterized in that,
    所述第一部分为所述目标人物的人脸,所述第二部分为所述目标人物的上半身;或者,The first part is the face of the target person, and the second part is the upper body of the target person; or,
    所述第一部分为所述目标人物的上半身,所述第二部分为所述目标人物的全身。The first part is the upper body of the target person, and the second part is the whole body of the target person.
  6. 一种用于人物三维模型的重建装置,其特征在于,应用于电子设备,所述装置包括:获取模块、确定模块和融合模块;A reconstruction device for a three-dimensional model of a person, characterized in that it is applied to electronic equipment, and the device includes: an acquisition module, a determination module and a fusion module;
    所述获取模块,用于获取目标人物的第一尺度图像和第二尺度图像,所述第一尺度图像包括所述目标人物的第一部分,所述第二尺度图像包括所述目标人物的至少第 二部分,所述第一部分是所述第二部分的一部分;The acquiring module is configured to acquire a first scale image of the target person and a second scale image, the first scale image includes a first part of the target person, and the second scale image includes at least a second scale image of the target person two parts, said first part being part of said second part;
    所述确定模块,用于确定所述第一尺度图像对应的第一网格三维模型,并且确定所述第二尺度图像对应的第二网格三维模型;The determination module is configured to determine a first grid 3D model corresponding to the first scale image, and determine a second grid 3D model corresponding to the second scale image;
    所述融合模块,用于对所述第一网格三维模型和所述第二网格三维模型进行融合处理,以得到目标三维模型,所述目标三维模型用于显示所述目标人物的至少第二部分。The fusion module is configured to perform fusion processing on the first grid 3D model and the second grid 3D model to obtain a target 3D model, and the target 3D model is used to display at least the first 3D model of the target person two parts.
  7. 根据权利要求6所述的装置,其特征在于,The device according to claim 6, characterized in that,
    所述确定模块,具体用于基于第一体素回归网络,确定所述第一尺度图像对应的第一体素三维模型,并且将所述第一体素三维模型转换为所述第一网格三维模型;以及基于第二体素回归网络,确定所述第二尺度图像对应的第二体素三维模型,并且将所述第二体素三维模型转换为所述第二网格三维模型。The determining module is specifically configured to determine a first voxel three-dimensional model corresponding to the first scale image based on the first voxel regression network, and convert the first voxel three-dimensional model into the first grid a three-dimensional model; and based on a second voxel regression network, determining a second voxel three-dimensional model corresponding to the second-scale image, and converting the second voxel three-dimensional model into the second grid three-dimensional model.
  8. 根据权利要求6或7所述的装置,其特征在于,A device according to claim 6 or 7, characterized in that
    所述融合模块,具体用于将所述第一网格三维模型转换至二维平面,得到第一平面展开图;并且将所述第二网格三维模型转换至二维平面,得到第二平面展开图;所述第一平面展开图中的第一图像区域对应所述第二平面展开图中的第二图像区域,所述第一图像区域和第二图像区域对应所述第一部分;以及对所述第一平面展开图进行裁剪,以获取所述第一图像区域;对所述第二平面展开图进行裁剪,以获取所述第二图像区域;并将所述第二平面展开图中的所述第二图像区域替换为所述第一图像区域,得到所述目标人物的目标平面展开图;再对所述目标平面展开图进行三维转换,得到所述目标三维模型。The fusion module is specifically used to convert the first grid three-dimensional model to a two-dimensional plane to obtain a first plane expansion diagram; and convert the second grid three-dimensional model to a two-dimensional plane to obtain a second plane Expanded view; the first image area in the first plan expanded view corresponds to the second image area in the second plan expanded view, and the first image area and the second image area correspond to the first part; and clipping the first plane expansion view to obtain the first image area; cutting the second plane expansion view to obtain the second image area; and The second image area is replaced with the first image area to obtain a target plane expanded view of the target person; and then a three-dimensional transformation is performed on the target plane expanded view to obtain the target three-dimensional model.
  9. 根据权利要求6至8任一项所述的装置,其特征在于,所述装置还包括处理模块;The device according to any one of claims 6 to 8, wherein the device further comprises a processing module;
    所述处理模块,用于对所述第一网格三维模型和/或所述第二网格三维模型进行网格平滑处理或网格简化处理中的至少一种。The processing module is configured to perform at least one of mesh smoothing or mesh simplification on the first mesh 3D model and/or the second mesh 3D model.
  10. 根据权利要求6至9任一项所述的装置,其特征在于,The device according to any one of claims 6 to 9, characterized in that,
    所述第一部分为所述目标人物的人脸,所述第二部分为所述目标人物的上半身;或者,The first part is the face of the target person, and the second part is the upper body of the target person; or,
    所述第一部分为所述目标人物的上半身,所述第二部分为所述目标人物的全身。The first part is the upper body of the target person, and the second part is the whole body of the target person.
  11. 一种电子设备,其特征在于,包括存储器和与所述存储器连接的至少一个处理器,所述存储器用于存储指令,所述指令被至少一个处理器读取后,执行如权利要求1至5任一项所述的方法。An electronic device, characterized in that it includes a memory and at least one processor connected to the memory, the memory is used to store instructions, and after the instructions are read by at least one processor, the instructions according to claims 1 to 5 are executed. any one of the methods described.
  12. 一种计算机可读存储介质,其上存储有计算机程序,其特征在于,所述计算机程序被处理器执行以实现权利要求1至5任一项所述的方法。A computer-readable storage medium on which a computer program is stored, wherein the computer program is executed by a processor to implement the method according to any one of claims 1 to 5.
  13. 一种计算机程序产品,其特征在于,所述计算机程序产品包含指令,当所述计算机程序产品在计算机上运行时,执行权利要求1至5任一项所述的方法。A computer program product, characterized in that the computer program product includes instructions, and when the computer program product is run on a computer, the method according to any one of claims 1 to 5 is executed.
PCT/CN2021/114840 2021-08-26 2021-08-26 Method and apparatus for reconstructing three-dimensional model of person WO2023024036A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2021/114840 WO2023024036A1 (en) 2021-08-26 2021-08-26 Method and apparatus for reconstructing three-dimensional model of person
CN202180056556.0A CN116157842A (en) 2021-08-26 2021-08-26 Reconstruction method and device of character three-dimensional model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2021/114840 WO2023024036A1 (en) 2021-08-26 2021-08-26 Method and apparatus for reconstructing three-dimensional model of person

Publications (1)

Publication Number Publication Date
WO2023024036A1 true WO2023024036A1 (en) 2023-03-02

Family

ID=85322359

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/114840 WO2023024036A1 (en) 2021-08-26 2021-08-26 Method and apparatus for reconstructing three-dimensional model of person

Country Status (2)

Country Link
CN (1) CN116157842A (en)
WO (1) WO2023024036A1 (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080118021A1 (en) * 2006-11-22 2008-05-22 Sandeep Dutta Methods and systems for optimizing high resolution image reconstruction
US20100266181A1 (en) * 2008-04-30 2010-10-21 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Device and method for producing a ct reconstruction of an object comprising a high-resolution object region of interest
CN102063610A (en) * 2009-11-13 2011-05-18 鸿富锦精密工业(深圳)有限公司 Image identification system and method thereof
CN105487121A (en) * 2015-12-03 2016-04-13 长江大学 Method for constructing multi-scale digital rock core based on fusion of CT scanned image and electro-imaging image
CN112541483A (en) * 2020-12-25 2021-03-23 三峡大学 Dense face detection method combining YOLO and blocking-fusion strategy
CN112950769A (en) * 2021-03-31 2021-06-11 深圳市慧鲤科技有限公司 Three-dimensional human body reconstruction method, device, equipment and storage medium
CN113012282A (en) * 2021-03-31 2021-06-22 深圳市慧鲤科技有限公司 Three-dimensional human body reconstruction method, device, equipment and storage medium
CN113039581A (en) * 2018-09-14 2021-06-25 恩维医疗公司有限公司 Multi-scale image reconstruction of three-dimensional objects

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080118021A1 (en) * 2006-11-22 2008-05-22 Sandeep Dutta Methods and systems for optimizing high resolution image reconstruction
US20100266181A1 (en) * 2008-04-30 2010-10-21 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Device and method for producing a ct reconstruction of an object comprising a high-resolution object region of interest
CN102063610A (en) * 2009-11-13 2011-05-18 鸿富锦精密工业(深圳)有限公司 Image identification system and method thereof
CN105487121A (en) * 2015-12-03 2016-04-13 长江大学 Method for constructing multi-scale digital rock core based on fusion of CT scanned image and electro-imaging image
CN113039581A (en) * 2018-09-14 2021-06-25 恩维医疗公司有限公司 Multi-scale image reconstruction of three-dimensional objects
CN112541483A (en) * 2020-12-25 2021-03-23 三峡大学 Dense face detection method combining YOLO and blocking-fusion strategy
CN112950769A (en) * 2021-03-31 2021-06-11 深圳市慧鲤科技有限公司 Three-dimensional human body reconstruction method, device, equipment and storage medium
CN113012282A (en) * 2021-03-31 2021-06-22 深圳市慧鲤科技有限公司 Three-dimensional human body reconstruction method, device, equipment and storage medium

Also Published As

Publication number Publication date
CN116157842A (en) 2023-05-23

Similar Documents

Publication Publication Date Title
WO2021213120A1 (en) Screen projection method and apparatus, and electronic device
WO2021169394A1 (en) Depth-based human body image beautification method and electronic device
WO2022017261A1 (en) Image synthesis method and electronic device
US20240153209A1 (en) Object Reconstruction Method and Related Device
CN114140365B (en) Event frame-based feature point matching method and electronic equipment
CN113806456A (en) Mesh coding method and device
CN117078509B (en) Model training method, photo generation method and related equipment
CN113542580A (en) Method and device for removing light spots of glasses and electronic equipment
CN110138999B (en) Certificate scanning method and device for mobile terminal
CN110956571A (en) SLAM-based virtual-real fusion method and electronic equipment
WO2021057626A1 (en) Image processing method, apparatus, device, and computer storage medium
CN115147451A (en) Target tracking method and device thereof
CN112037157A (en) Data processing method and device, computer readable medium and electronic equipment
CN113536834A (en) Pouch detection method and device
CN115686182B (en) Processing method of augmented reality video and electronic equipment
CN114283195B (en) Method for generating dynamic image, electronic device and readable storage medium
WO2023024036A1 (en) Method and apparatus for reconstructing three-dimensional model of person
CN114697543B (en) Image reconstruction method, related device and system
CN111982293B (en) Body temperature measuring method and device, electronic equipment and storage medium
CN116797767A (en) Augmented reality scene sharing method and electronic device
CN114812381A (en) Electronic equipment positioning method and electronic equipment
CN111460942A (en) Proximity detection method and device, computer readable medium and terminal equipment
CN111626929B (en) Depth image generation method and device, computer readable medium and electronic equipment
CN114418837B (en) Dressing migration method and electronic equipment
CN116703741B (en) Image contrast generation method and device and electronic equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21954564

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE