CN114998554A - Three-dimensional cartoon face modeling method and device - Google Patents
Three-dimensional cartoon face modeling method and device Download PDFInfo
- Publication number
- CN114998554A CN114998554A CN202210483519.4A CN202210483519A CN114998554A CN 114998554 A CN114998554 A CN 114998554A CN 202210483519 A CN202210483519 A CN 202210483519A CN 114998554 A CN114998554 A CN 114998554A
- Authority
- CN
- China
- Prior art keywords
- face
- dimensional
- cartoon
- features
- feature
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T19/00—Manipulating 3D models or images for computer graphics
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—3D [Three Dimensional] animation
- G06T13/40—3D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/60—Type of objects
- G06V20/64—Three-dimensional objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/168—Feature extraction; Face representation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2219/00—Indexing scheme for manipulating 3D models or images for computer graphics
- G06T2219/20—Indexing scheme for editing of 3D models
- G06T2219/2024—Style variation
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Computer Graphics (AREA)
- Software Systems (AREA)
- Multimedia (AREA)
- Oral & Maxillofacial Surgery (AREA)
- Geometry (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Computer Hardware Design (AREA)
- General Engineering & Computer Science (AREA)
- Processing Or Creating Images (AREA)
Abstract
本申请公开了一种三维卡通人脸建模方法及装置,其中,方法包括:提取目标人物在二维图像中人脸的二维人脸特征;根据目标人物的三维深度图提取人脸的三维人脸特征;融合二维人脸特征、三维人脸特征和预设卡通风格特征,生成目标人脸的三维卡通人脸模型。由此,解决了相关技术中基于神经网络对真实人脸进行卡通形象建模,导致生成的卡通形象风格单一且辨识度较低的技术问题。
The present application discloses a three-dimensional cartoon face modeling method and device, wherein the method includes: extracting two-dimensional face features of a target person's face in a two-dimensional image; Face features; fuse 2D face features, 3D face features and preset cartoon style features to generate a 3D cartoon face model of the target face. As a result, the technical problem of the cartoon image modeling of the real face based on the neural network in the related art, resulting in a single style and low recognition degree of the generated cartoon image, is solved.
Description
技术领域technical field
本申请涉及计算机图形学与深度学习技术领域,特别涉及一种三维卡通人脸建模方法及装置。The present application relates to the technical field of computer graphics and deep learning, and in particular, to a method and device for modeling a three-dimensional cartoon face.
背景技术Background technique
三维重建在3D游戏,动画,电影等诸多领域中有着广泛的应用。作为其中的一个主要以及最具有辨识度的部分,进行三维人脸重建的需求越来越高。3D reconstruction has a wide range of applications in 3D games, animation, movies and many other fields. As one of the main and most recognizable parts, the demand for 3D face reconstruction is getting higher and higher.
得益于计算机与移动终端等设备的算力提升,利用深度学习对真实人脸特征的提取技术趋于成熟,相关技术可以基于神经网络对真实人脸进行卡通形象生成。Thanks to the improvement of computing power of computers and mobile terminals, the extraction technology of real face features using deep learning tends to mature, and related technologies can generate cartoon images of real faces based on neural networks.
然而,相关技术只能由专业技术人员手动进行卡通形象建模,难以推广普及,并且生成的三维卡通人脸模型风格相似,无法实现多元化,且不具备真实人脸辨识度,有待改善。However, the related technology can only be manually modeled by professional technicians, which is difficult to popularize, and the generated 3D cartoon face models are similar in style, unable to achieve diversification, and have no real face recognition, which needs improvement.
发明内容SUMMARY OF THE INVENTION
本申请提供一种三维卡通人脸建模方法及装置,以解决相关技术中基于神经网络对真实人脸进行卡通形象建模,导致生成的卡通形象风格单一且辨识度较低的技术问题。The present application provides a three-dimensional cartoon face modeling method and device to solve the technical problem in the related art that the cartoon image modeling of the real face based on the neural network results in a single style and low recognition degree of the generated cartoon image.
本申请第一方面实施例提供一种三维卡通人脸建模方法,包括以下步骤:提取目标人物在二维图像中人脸的二维人脸特征;根据所述目标人物的三维深度图提取所述人脸的三维人脸特征;以及融合所述二维人脸特征、所述三维人脸特征和预设卡通风格特征,生成所述目标人脸的三维卡通人脸模型。The embodiment of the first aspect of the present application provides a three-dimensional cartoon face modeling method, including the following steps: extracting two-dimensional face features of a target person's face in a two-dimensional image; The three-dimensional face feature of the face; and the three-dimensional cartoon face model of the target face is generated by fusing the two-dimensional face feature, the three-dimensional face feature and the preset cartoon style feature.
可选地,在本申请的一个实施例中,所述生成所述目标人脸的三维卡通人脸模型,包括:根据由所述二维人脸特征、所述三维人脸特征和预设卡通风格特征融合得到的融合特征生成所述目标人脸的初始三维卡通人脸模型;计算所述初始三维卡通人脸模型的呈现程度,并在所述呈现程度小于预设阈值时,基于预设标准对所述二维人脸特征、所述三维人脸特征和预设卡通风格特征中的至少一个特征线性加权,生成新的三维卡通人脸模型,迭代优化,直至所述新的三维卡通人脸模型的呈现程度大于或等于所述预设阈值,得到最终三维卡通人脸模型。Optionally, in an embodiment of the present application, the generating the 3D cartoon face model of the target face includes: The fusion feature obtained by fusion of style features generates an initial three-dimensional cartoon face model of the target face; calculates the degree of presentation of the initial three-dimensional cartoon face model, and when the degree of presentation is less than a preset threshold, based on a preset standard Linearly weighting at least one of the two-dimensional face feature, the three-dimensional face feature and the preset cartoon style feature to generate a new three-dimensional cartoon face model, and iteratively optimize until the new three-dimensional cartoon face The presentation degree of the model is greater than or equal to the preset threshold, and a final three-dimensional cartoon face model is obtained.
可选地,在本申请的一个实施例中,在生成所述新的三维卡通人脸模型之后,还包括:获取当前迭代轮下的至少一个特征的当前特征权重值;根据所述当前特征权重值和所述上一迭代轮下的新的三维卡通人脸模型的呈现程度得到所述当前迭代轮的新的三维卡通人脸模型的呈现程度。Optionally, in an embodiment of the present application, after generating the new three-dimensional cartoon face model, the method further includes: acquiring a current feature weight value of at least one feature under the current iteration round; according to the current feature weight value and the presentation degree of the new 3D cartoon face model in the previous iteration round to obtain the presentation degree of the new 3D cartoon face model in the current iteration round.
可选地,在本申请的一个实施例中,所述呈现程度的计算公式为:Optionally, in an embodiment of the present application, the calculation formula of the presentation degree is:
其中,K表示与人物辨识度强相关总特征数量,M表示与人物满意度指标强相关总特征数量,TV为第V轮迭代前原先特征在呈现到模型中的程度,a表示人物辨识度对最终特征呈现所占的比重,1-a表示人物满意度对最终特征呈现所占的比重,DV(k)为第V轮中各特征在辨识度中所占比重,SV(m)为第V轮中各特征在满意度中所占比重。Among them, K represents the total number of features strongly related to character recognition, M represents the total number of features strongly related to the character satisfaction index, T V is the degree to which the original features are presented in the model before the V-th iteration, and a represents the character recognition The proportion of the final feature presentation, 1-a represents the proportion of the character satisfaction to the final feature presentation, D V (k) is the proportion of each feature in the V round of recognition, S V (m) is the proportion of each feature in the satisfaction in the V round.
可选地,在本申请的一个实施例中,所述预设卡通风格特征包括至少一个迪士尼卡通风格特征、至少一个日本卡通风格特征和至少一个Meta卡通风格特征中的至少一个。Optionally, in an embodiment of the present application, the preset cartoon style features include at least one of at least one Disney cartoon style feature, at least one Japanese cartoon style feature, and at least one Meta cartoon style feature.
本申请第二方面实施例提供一种三维卡通人脸建模装置,包括:第一提取模块,用于提取目标人物在二维图像中人脸的二维人脸特征;第二提取模块,用于根据所述目标人物的三维深度图提取所述人脸的三维人脸特征;以及建模模块,用于融合所述二维人脸特征、所述三维人脸特征和预设卡通风格特征,生成所述目标人脸的三维卡通人脸模型。An embodiment of the second aspect of the present application provides a three-dimensional cartoon face modeling device, including: a first extraction module for extracting two-dimensional face features of a target person's face in a two-dimensional image; a second extraction module for using extracting the three-dimensional face feature of the face according to the three-dimensional depth map of the target person; and a modeling module for fusing the two-dimensional face feature, the three-dimensional face feature and the preset cartoon style feature, A three-dimensional cartoon face model of the target face is generated.
可选地,在本申请的一个实施例中,所述建模模块包括:融合单元,用于根据由所述二维人脸特征、所述三维人脸特征和预设卡通风格特征融合得到的融合特征生成所述目标人脸的初始三维卡通人脸模型;计算单元,用于计算所述初始三维卡通人脸模型的呈现程度,并在所述呈现程度小于预设阈值时,基于预设标准对所述二维人脸特征、所述三维人脸特征和预设卡通风格特征中的至少一个特征线性加权,生成新的三维卡通人脸模型,迭代优化,直至所述新的三维卡通人脸模型的呈现程度大于或等于所述预设阈值,得到最终三维卡通人脸模型。Optionally, in an embodiment of the present application, the modeling module includes: a fusion unit, which is configured to obtain according to the fusion of the two-dimensional face feature, the three-dimensional face feature and the preset cartoon style feature. Fusing the features to generate an initial three-dimensional cartoon face model of the target face; a computing unit for calculating the presentation degree of the initial three-dimensional cartoon face model, and when the presentation degree is less than a preset threshold, based on a preset standard Linearly weighting at least one of the two-dimensional face feature, the three-dimensional face feature and the preset cartoon style feature to generate a new three-dimensional cartoon face model, and iteratively optimize until the new three-dimensional cartoon face The presentation degree of the model is greater than or equal to the preset threshold, and a final three-dimensional cartoon face model is obtained.
可选地,在本申请的一个实施例中,所述建模模块进一步用于,获取当前迭代轮下的至少一个特征的当前特征权重值;根据所述当前特征权重值和所述上一迭代轮下的新的三维卡通人脸模型的呈现程度得到所述当前迭代轮的新的三维卡通人脸模型的呈现程度。Optionally, in an embodiment of the present application, the modeling module is further configured to obtain a current feature weight value of at least one feature under the current iteration round; according to the current feature weight value and the previous iteration The presentation degree of the new 3D cartoon face model under the round is obtained from the presentation degree of the new 3D cartoon face model of the current iteration round.
可选地,在本申请的一个实施例中,所述呈现程度的计算公式为:Optionally, in an embodiment of the present application, the calculation formula of the presentation degree is:
其中,K表示与人物辨识度强相关总特征数量,M表示与人物满意度指标强相关总特征数量,TV为第V轮迭代前原先特征在呈现到模型中的程度,a表示人物辨识度对最终特征呈现所占的比重,1-a表示人物满意度对最终特征呈现所占的比重,DV(k)为第V轮中各特征在辨识度中所占比重,SV(m)为第V轮中各特征在满意度中所占比重。Among them, K represents the total number of features strongly related to character recognition, M represents the total number of features strongly related to the character satisfaction index, T V is the degree to which the original features are presented in the model before the V-th iteration, and a represents the character recognition The proportion of the final feature presentation, 1-a represents the proportion of the character satisfaction to the final feature presentation, D V (k) is the proportion of each feature in the V round of recognition, S V (m) is the proportion of each feature in the satisfaction in the V round.
可选地,在本申请的一个实施例中,所述预设卡通风格特征包括至少一个迪士尼卡通风格特征、至少一个日本卡通风格特征和至少一个Meta卡通风格特征中的至少一个。Optionally, in an embodiment of the present application, the preset cartoon style features include at least one of at least one Disney cartoon style feature, at least one Japanese cartoon style feature, and at least one Meta cartoon style feature.
本申请第三方面实施例提供一种电子设备,包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行所述程序,以实现如上述实施例所述的三维卡通人脸建模方法。An embodiment of a third aspect of the present application provides an electronic device, including: a memory, a processor, and a computer program stored on the memory and executable on the processor, where the processor executes the program to achieve The three-dimensional cartoon face modeling method described in the above embodiment.
本申请第四方面实施例提供一种计算机可读存储介质,所述计算机可读存储介质存储计算机指令,所述计算机指令用于使所述计算机执行如上述实施例所述的三维卡通人脸建模方法。Embodiments of the fourth aspect of the present application provide a computer-readable storage medium, where the computer-readable storage medium stores computer instructions, and the computer instructions are used to cause the computer to execute the three-dimensional cartoon face reconstruction method described in the foregoing embodiments. model method.
本申请实施例可以利用三维深度图提取目标人物的三维人脸特征,并融合目标人物二维图像中的二维人脸特征及卡通风格特征,从而生成目标人脸的三维卡通人脸模型,使得生成的三维人脸模型具备更高的辨识度,且可以随着卡通风格的变化进行风格转变,灵活度更高,有效满足建模需求,提升使用体验。由此,解决了相关技术中基于神经网络对真实人脸进行卡通形象建模,导致生成的卡通形象风格单一且辨识度较低的技术问题。In the embodiment of the present application, the three-dimensional face feature of the target person can be extracted by using the three-dimensional depth map, and the two-dimensional face feature and cartoon style feature in the two-dimensional image of the target person can be combined to generate a three-dimensional cartoon face model of the target face, so that The generated 3D face model has a higher degree of recognition, and can change the style with the change of cartoon style, with higher flexibility, which can effectively meet the modeling needs and improve the user experience. As a result, the technical problem of the cartoon image modeling of the real face based on the neural network in the related art, resulting in a single style and low recognition degree of the generated cartoon image, is solved.
本申请附加的方面和优点将在下面的描述中部分给出,部分将从下面的描述中变得明显,或通过本申请的实践了解到。Additional aspects and advantages of the present application will be set forth, in part, in the following description, and in part will be apparent from the following description, or learned by practice of the present application.
附图说明Description of drawings
本申请上述的和/或附加的方面和优点从下面结合附图对实施例的描述中将变得明显和容易理解,其中:The above and/or additional aspects and advantages of the present application will become apparent and readily understood from the following description of embodiments taken in conjunction with the accompanying drawings, wherein:
图1为根据本申请实施例提供的一种三维卡通人脸建模方法的流程图;1 is a flowchart of a method for modeling a three-dimensional cartoon face according to an embodiment of the present application;
图2为根据本申请一个实施例的Meta卡通风格示意图;2 is a schematic diagram of a Meta cartoon style according to an embodiment of the present application;
图3为根据本申请一个实施例的一种三维卡通人脸建模方法的流程图;3 is a flowchart of a method for modeling a three-dimensional cartoon face according to an embodiment of the present application;
图4为根据本申请实施例提供的一种三维卡通人脸建模装置的结构示意图;4 is a schematic structural diagram of a three-dimensional cartoon face modeling apparatus provided according to an embodiment of the present application;
图5为根据本申请实施例提供的电子设备的结构示意图。FIG. 5 is a schematic structural diagram of an electronic device provided according to an embodiment of the present application.
具体实施方式Detailed ways
下面详细描述本申请的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,旨在用于解释本申请,而不能理解为对本申请的限制。The following describes in detail the embodiments of the present application, examples of which are illustrated in the accompanying drawings, wherein the same or similar reference numerals refer to the same or similar elements or elements having the same or similar functions throughout. The embodiments described below with reference to the accompanying drawings are exemplary, and are intended to be used to explain the present application, but should not be construed as a limitation to the present application.
下面参考附图描述本申请实施例的三维卡通人脸建模方法及装置。针对上述背景技术中心提到的相关技术中基于神经网络对真实人脸进行卡通形象建模,导致生成的卡通形象风格单一且辨识度较低的技术问题,本申请提供了一种三维卡通人脸建模方法,在该方法中,可以利用三维深度图提取目标人物的三维人脸特征,并融合目标人物二维图像中的二维人脸特征及卡通风格特征,从而生成目标人脸的三维卡通人脸模型,使得生成的三维人脸模型具备更高的辨识度,且可以随着卡通风格的变化进行风格转变,灵活度更高,有效满足建模需求,提升使用体验。由此,解决了相关技术中基于神经网络对真实人脸进行卡通形象建模,导致生成的卡通形象风格单一且辨识度较低的技术问题。The following describes the three-dimensional cartoon face modeling method and apparatus according to the embodiments of the present application with reference to the accompanying drawings. Aiming at the technical problem that the cartoon image modeling of the real human face based on the neural network in the related art mentioned by the above-mentioned background technology center leads to the single style of the generated cartoon image and the low recognition degree, the present application provides a three-dimensional cartoon human face. Modeling method, in this method, the three-dimensional face features of the target person can be extracted by using the three-dimensional depth map, and the two-dimensional face features and cartoon style features in the two-dimensional image of the target person can be fused to generate a three-dimensional cartoon of the target face. The face model makes the generated 3D face model more recognizable, and can change the style with the change of cartoon style, which is more flexible, effectively meets the modeling needs and improves the user experience. As a result, the technical problem of the cartoon image modeling of the real face based on the neural network in the related art, resulting in a single style and low recognition degree of the generated cartoon image, is solved.
具体而言,图1为本申请实施例所提供的一种三维卡通人脸建模方法的流程示意图。Specifically, FIG. 1 is a schematic flowchart of a three-dimensional cartoon face modeling method provided by an embodiment of the present application.
如图1所示,该三维卡通人脸建模方法包括以下步骤:As shown in Figure 1, the three-dimensional cartoon face modeling method includes the following steps:
在步骤S101中,提取目标人物在二维图像中人脸的二维人脸特征。In step S101, two-dimensional face features of the target person's face in the two-dimensional image are extracted.
可以理解的是,人脸特征可以为神经网络提取的向量形式,以及表示面部各器官样式的特征,代表着某一类人某个器官的特点,提取方法,举例而言,可以由基于深度学习模型训练学习所得,或对面部器官细节特征进行建模所得,具体地,以眉毛为例,眉毛按照眉形可以有长眉、短眉、粗眉、细眉、稀眉、八字眉、上挑眉、一字眉、新月眉、角眉、颦眉等等。It can be understood that the facial features can be in the form of vectors extracted by the neural network, as well as the features representing the styles of various facial organs, representing the characteristics of a certain type of person and an organ. The extraction method, for example, can be based on deep learning. It is obtained from model training and learning, or from modeling the detailed features of facial organs. Specifically, taking eyebrows as an example, eyebrows can have long eyebrows, short eyebrows, thick eyebrows, thin eyebrows, thin eyebrows, splayed eyebrows, and raised eyebrows according to the shape of the eyebrows. Eyebrows, straight eyebrows, crescent eyebrows, corner eyebrows, frown eyebrows, etc.
在实际执行过程中,本申请实施例可以利用神经网络或传统计算机图形学,提取目标人物在二维图像中人脸的二维人脸特征和传统图形学中的二维特征算子,可以从平面角度进行人脸特征的提取,有利于后续与三维人脸特征进行融合,从而生成更具辨识度的三维人脸卡通模型。In the actual execution process, the embodiment of the present application can use neural network or traditional computer graphics to extract the two-dimensional face features of the target person's face in the two-dimensional image and the two-dimensional feature operator in traditional graphics, which can be obtained from The extraction of face features from the plane angle is conducive to the subsequent fusion with 3D face features, thereby generating a more recognizable 3D face cartoon model.
在步骤S102中,根据目标人物的三维深度图提取人脸的三维人脸特征。In step S102, three-dimensional face features of the face are extracted according to the three-dimensional depth map of the target person.
可以理解的是,人脸是立体的,仅凭常规的二维人脸特征提取,难以呈现目标人物的全部人脸特征,尤其是二维图像可能会受限于光影效果,导致对人脸面部轮廓的识别度较差,三维特征指对三维深度图处理后得到的特征,提取方式与二维特征类似,如基于神经网络算法的Deep Leaning技术进行特征提取,以及基于传统图形学人脸建模表示技术进行特征提取,形式与二维略有不同。It can be understood that the face is three-dimensional, and it is difficult to present all the facial features of the target person only by conventional two-dimensional facial feature extraction. Especially, the two-dimensional image may be limited by light and shadow effects, resulting in The recognition degree of the contour is poor. The three-dimensional feature refers to the feature obtained after processing the three-dimensional depth map. The extraction method is similar to the two-dimensional feature. Representation techniques for feature extraction, the form is slightly different from 2D.
因此,本申请实施例利用传统计算机图形学,根据目标人物的三维深度图,提取目标人物的三维人脸特征,即传统图形学中的三维特征算子,可以有效解除二维图像人脸特征提取的局限性,便于后续生成更具辨识度的三维人脸卡通模型。Therefore, the embodiment of the present application uses traditional computer graphics to extract the three-dimensional face features of the target person according to the three-dimensional depth map of the target person, that is, the three-dimensional feature operator in traditional graphics, which can effectively remove the face feature extraction from the two-dimensional image. The limitation of , which is convenient for the subsequent generation of a more recognizable 3D face cartoon model.
在步骤S103中,融合二维人脸特征、三维人脸特征和预设卡通风格特征,生成目标人脸的三维卡通人脸模型。In step S103, the two-dimensional face feature, the three-dimensional face feature and the preset cartoon style feature are combined to generate a three-dimensional cartoon face model of the target face.
作为一种可能实现的方式,本申请实施例可以从二维人脸特征和三维人脸特征中分别提取出相应的关键特征,并将从二维人脸特征中提取的关键特征和从三维人脸特征中提取的对应的关键特征以及预设的卡通风格特征进行融合,进而获得更符合目标人物形象的目标人脸的特定风格的三维卡通人脸模型。As a possible way of implementation, the embodiment of the present application can extract corresponding key features from two-dimensional face features and three-dimensional face features, respectively, and combine the key features extracted from the two-dimensional face features and the three-dimensional face features. The corresponding key features extracted from the face features and the preset cartoon style features are fused to obtain a three-dimensional cartoon face model with a specific style of the target face that is more in line with the target character image.
其中,关键特征是指对于最终呈现的三维卡通人脸模型具有更高的人物辨识度以及人物满意度,同时与卡通风格保持一致的特征。三种特征,即二维人脸特征、三维人脸特征和预设卡通风格特征各自包含有不同的具体特征族,每个特征均代表了人脸面部器官的某个特点,具体融合方式是通过加权方法控制不同特点在最终人脸面部建模结果中的显示程度,使得某个人代表性的特征突出展示,相对弱化其它次要特征,具有满足辨识度指标的要求。Among them, the key feature refers to a feature that has higher character recognition and character satisfaction for the final presented 3D cartoon face model, and is consistent with the cartoon style. The three features, namely 2D facial features, 3D facial features and preset cartoon style features, each contain different specific feature families, each feature represents a certain feature of the facial organs of the human face, and the specific fusion method is through The weighting method controls the degree of display of different features in the final face modeling results, so that the representative features of a certain person are prominently displayed, and other secondary features are relatively weakened, which meets the requirements of the identification index.
本申请实施例通过融合二维人脸特征、三维人脸特征和预设卡通风格特征,使得生成的三维人脸模型具备更高的辨识度,且可以随着卡通风格的变化进行风格转变,灵活度更高,人物满意度更高。In the embodiment of the present application, by fusing two-dimensional face features, three-dimensional face features and preset cartoon style features, the generated three-dimensional face model has a higher degree of recognition, and can change the style according to the change of cartoon style, flexibly The higher the degree, the higher the character satisfaction.
可选地,在本申请的一个实施例中,生成目标人脸的三维卡通人脸模型,包括:根据由二维人脸特征、三维人脸特征和预设卡通风格特征融合得到的融合特征生成目标人脸的初始三维卡通人脸模型;计算初始三维卡通人脸模型的呈现程度,并在呈现程度小于预设阈值时,基于预设标准对二维人脸特征、三维人脸特征和预设卡通风格特征中的至少一个特征线性加权,生成新的三维卡通人脸模型,迭代优化,直至新的三维卡通人脸模型的呈现程度大于或等于预设阈值,得到最终三维卡通人脸模型。Optionally, in an embodiment of the present application, generating a 3D cartoon face model of the target face includes: generating according to fusion features obtained by fusing 2D face features, 3D face features and preset cartoon style features. The initial 3D cartoon face model of the target face; calculate the presentation degree of the initial 3D cartoon face model, and when the presentation degree is less than a preset threshold, analyze the 2D face features, 3D face features and presets based on preset criteria At least one feature in the cartoon style features is linearly weighted to generate a new three-dimensional cartoon face model, and iteratively optimizes until the rendering degree of the new three-dimensional cartoon face model is greater than or equal to a preset threshold, and the final three-dimensional cartoon face model is obtained.
在实际执行过程中,本申请实施例可以通过融合二维人脸特征、三维人脸特征和预设卡通风格特征,得到融合特征,从而生成目标人物的初始三维卡通人脸模型。In the actual execution process, the embodiment of the present application can obtain the fusion feature by fusing the two-dimensional face feature, the three-dimensional face feature and the preset cartoon style feature, thereby generating the initial three-dimensional cartoon face model of the target character.
进一步地,本申请实施例可以通过计算对初始三维卡通人脸模型进行优化,具体地,本申请实施例可以计算初始三维卡通人脸模型的呈现程度,当呈现程度小于预设阈值时,本申请实施例可以基于预设标准对二维人脸特征、三维人脸特征和预设卡通风格特征中的至少一个特征线性加权,进而生成新的三维卡通人脸模型,实现迭代优化,直至新的三维卡通人脸模型的呈现程度大于或等于预设阈值,得到最终的三维卡通人脸模型。Further, the embodiment of the present application can optimize the initial three-dimensional cartoon face model by calculation. Specifically, the embodiment of the present application can calculate the presentation degree of the initial three-dimensional cartoon face model. When the presentation degree is less than a preset threshold, the present application The embodiment can linearly weight at least one of the two-dimensional face features, the three-dimensional face features, and the preset cartoon style features based on preset criteria, and then generate a new three-dimensional cartoon face model, and realize iterative optimization until the new three-dimensional The rendering degree of the cartoon face model is greater than or equal to the preset threshold, and the final three-dimensional cartoon face model is obtained.
其中,预设标准可以为人物辨识度和人物满意度的标准,具体地,辨识度与满意度的高低是服务于目标群体的,前者衡量标准在于输入图像与输出模型在关键特征上的Loss大小,后者基于目标群体的调研,如问卷上对结果的打分高低,此外,当两者相互矛盾时以人物满意度为主。Among them, the preset standard can be the standard of character recognition and character satisfaction. Specifically, the level of recognition and satisfaction is to serve the target group, and the former is measured by the loss of the key features of the input image and the output model. , the latter is based on the research of the target group, such as the score of the results on the questionnaire, in addition, when the two contradict each other, the character satisfaction is the main factor.
需要注意的是,预设阈值可以由本领域技术人员根据实际情况进行相应设置;预设标准可以由本领域技术人员根据三维卡通人脸模型的应用场景及目标群众的调研结果进行相应设置,在此不做具体限制。It should be noted that the preset threshold can be set correspondingly by those skilled in the art according to the actual situation; the preset standard can be set correspondingly by those skilled in the art according to the application scene of the 3D cartoon face model and the research results of the target audience, which is not described here. make specific restrictions.
可选地,在本申请的一个实施例中,呈现程度的计算公式为:Optionally, in an embodiment of the present application, the calculation formula of the presentation degree is:
其中,K表示与人物辨识度强相关总特征数量,M表示与人物满意度指标强相关总特征数量,TV为第V轮迭代前原先特征在呈现到模型中的程度,a表示人物辨识度对最终特征呈现所占的比重,1-a表示人物满意度对最终特征呈现所占的比重,DV(k)为第V轮中各特征在辨识度中所占比重,SV(m)为第V轮中各特征在满意度中所占比重。Among them, K represents the total number of features strongly related to character recognition, M represents the total number of features strongly related to the character satisfaction index, T V is the degree to which the original features are presented in the model before the V-th iteration, and a represents the character recognition The proportion of the final feature presentation, 1-a represents the proportion of the character satisfaction to the final feature presentation, D V (k) is the proportion of each feature in the V round of recognition, S V (m) is the proportion of each feature in the satisfaction in the V round.
进一步地,括号中的项,即:Further, the items in parentheses, namely:
为第V轮中各面部特征所占有的权重的更新值,将各特征权重更新值作用于上一轮特征呈现度TV后即得到第V轮迭代后原先特征在呈现到模型中的程度tV。is the updated value of the weight occupied by each facial feature in the Vth round. After applying the updated value of each feature weight to the feature presentation degree T V of the previous round, the degree t of the original feature in the model after the V round of iteration is obtained. V.
可选地,在本申请的一个实施例中,在生成新的三维卡通人脸模型之后,还包括:获取当前迭代轮下的至少一个特征的当前特征权重值;根据当前特征权重值和上一迭代轮下的新的三维卡通人脸模型的呈现程度得到当前迭代轮的新的三维卡通人脸模型的呈现程度。Optionally, in an embodiment of the present application, after generating a new three-dimensional cartoon face model, the method further includes: acquiring a current feature weight value of at least one feature under the current iteration round; The presentation degree of the new 3D cartoon face model in the iteration round is obtained from the presentation degree of the new 3D cartoon face model in the current iteration round.
具体地,参照上述公式,本申请实施例可以按照各个关键特征在人物辨识度和人物满意度的标准下,第V轮迭代的线性加权,进而得到第V轮下的各特征权重值,进一步地,本申请实施例可以将这一变化值乘在上一轮的程度呈现值上,以得到下一轮的呈现值。Specifically, with reference to the above formula, in the embodiment of the present application, the linear weighting of the V-th round of iterations can be performed according to the criteria of character recognition and character satisfaction for each key feature, and then each feature weight value under the V-th round can be obtained, and further , in this embodiment of the present application, the change value may be multiplied by the degree presentation value of the previous round to obtain the presentation value of the next round.
可选地,在本申请的一个实施例中,预设卡通风格特征包括至少一个迪士尼卡通风格特征、至少一个日本卡通风格特征和至少一个Meta卡通风格特征中的至少一个。Optionally, in an embodiment of the present application, the preset cartoon style features include at least one of at least one Disney cartoon style feature, at least one Japanese cartoon style feature, and at least one Meta cartoon style feature.
在一些实施例中,预设卡通风格可以但不限于迪士尼卡通风格、日本卡通风格或Meta 卡通风格,举例而言,如图2所示,为Meta卡通风格,右侧为扎克伯格,左侧为根据扎克伯格的人脸面部特征生成的卡通形象。In some embodiments, the preset cartoon style may be, but is not limited to, Disney cartoon style, Japanese cartoon style or Meta cartoon style. For example, as shown in FIG. 2 , it is Meta cartoon style, Zuckerberg on the right side, and Zuckerberg on the left side. The side is a cartoon image generated based on Zuckerberg's facial features.
本申请实施例可以从上述风格中提取相应的风格特征作为预设的卡通风格特征,并与二维人脸特征和三维人脸特征进行融合,从而生成特定风格的三维人脸卡通模型。In this embodiment of the present application, corresponding style features can be extracted from the above styles as preset cartoon style features, and fused with 2D face features and 3D face features, thereby generating a 3D face cartoon model with a specific style.
需要注意的是,预设的卡通风格特征可以由本领域技术人员根据实际需求,如三维人脸卡通模型的应用场景或群体调研结果等,进行相应设置,在此不做具体限制。It should be noted that the preset cartoon style features can be set correspondingly by those skilled in the art according to actual needs, such as application scenarios of the 3D face cartoon model or group survey results, etc., which are not specifically limited here.
下面结合图3所示,以一个具体实施例对本申请实施例的三维卡通人脸建模方法的工作原理进行详细阐述。The working principle of the three-dimensional cartoon face modeling method according to the embodiment of the present application will be described in detail below with reference to FIG. 3 .
如图3所示,本申请实施例包括以下步骤:As shown in Figure 3, the embodiment of the present application includes the following steps:
步骤S301:提取目标人物的二维图像特征。可以理解的是,人脸特征可以为神经网络提取的向量形式,以及表示面部各器官样式的特征,代表着某一类人某个器官的特点,提取方法,举例而言,可以由基于深度学习模型训练学习所得,或对面部器官细节特征进行建模所得,具体地,以眉毛为例,眉毛按照眉形可以有长眉、短眉、粗眉、细眉、稀眉、八字眉、上挑眉、一字眉、新月眉、角眉、颦眉等等。Step S301: Extract the two-dimensional image features of the target person. It can be understood that the facial features can be in the form of vectors extracted by the neural network, as well as the features representing the styles of various facial organs, representing the characteristics of a certain type of person and an organ. The extraction method, for example, can be based on deep learning. It is obtained from model training and learning, or from modeling the detailed features of facial organs. Specifically, taking eyebrows as an example, eyebrows can have long eyebrows, short eyebrows, thick eyebrows, thin eyebrows, thin eyebrows, splayed eyebrows, and raised eyebrows according to the shape of the eyebrows. Eyebrows, straight eyebrows, crescent eyebrows, corner eyebrows, frown eyebrows, etc.
在实际执行过程中,本申请实施例可以利用神经网络或传统计算机图形学,提取目标人物在二维图像中人脸的二维人脸特征和传统图形学中的二维特征算子,可以从平面角度进行人脸特征的提取,有利于后续与三维人脸特征进行融合,从而生成更具辨识度的三维人脸卡通模型。In the actual execution process, the embodiment of the present application can use neural network or traditional computer graphics to extract the two-dimensional face features of the target person's face in the two-dimensional image and the two-dimensional feature operator in traditional graphics, which can be obtained from The extraction of face features from the plane angle is conducive to the subsequent fusion with 3D face features, thereby generating a more recognizable 3D face cartoon model.
步骤S302:提取目标人物的三维图像特征。可以理解的是,人脸是立体的,仅凭常规的二维人脸特征提取,难以呈现目标人物的全部人脸特征,尤其是二维图像可能会受限于光影效果,导致对人脸面部轮廓的识别度较差,三维特征指对三维深度图处理后得到的特征,提取方式与二维特征类似,如基于神经网络算法的Deep Leaning技术进行特征提取,以及基于传统图形学人脸建模表示技术进行特征提取,形式与二维略有不同。Step S302: Extract the three-dimensional image features of the target person. It can be understood that the face is three-dimensional, and it is difficult to present all the facial features of the target person only by conventional two-dimensional facial feature extraction. Especially, the two-dimensional image may be limited by light and shadow effects, resulting in The recognition degree of the contour is poor. The three-dimensional feature refers to the feature obtained after processing the three-dimensional depth map. The extraction method is similar to the two-dimensional feature. Representation techniques for feature extraction, the form is slightly different from 2D.
因此,本申请实施例利用传统计算机图形学,根据目标人物的三维深度图,提取目标人物的三维人脸特征,即传统图形学中的三维特征算子,可以有效解除二维图像人脸特征提取的局限性,便于后续生成更具辨识度的三维人脸卡通模型。Therefore, the embodiment of the present application uses traditional computer graphics to extract the three-dimensional face features of the target person according to the three-dimensional depth map of the target person, that is, the three-dimensional feature operator in traditional graphics, which can effectively remove the face feature extraction from the two-dimensional image. The limitation of , which is convenient for the subsequent generation of a more recognizable 3D face cartoon model.
步骤S303:提取卡通风格特征。在一些实施例中,卡通风格可以是迪士尼卡通风格、日本卡通风格或Meta卡通风格等,举例而言,如图2所示,为Meta卡通风格,右侧为扎克伯格,左侧为根据扎克伯格的人脸面部特征生成的卡通形象。本申请实施例可以从上述风格中提取相应的风格特征作为预设的卡通风格特征,并与二维人脸特征和三维人脸特征进行融合,从而生成特定风格的三维人脸卡通模型。Step S303: Extract cartoon style features. In some embodiments, the cartoon style may be Disney cartoon style, Japanese cartoon style or Meta cartoon style, etc. For example, as shown in FIG. 2 , it is Meta cartoon style, the right side is Zuckerberg, the left side is the A cartoon image of Zuckerberg's facial features. In this embodiment of the present application, corresponding style features can be extracted from the above styles as preset cartoon style features, and fused with 2D face features and 3D face features, thereby generating a 3D face cartoon model with a specific style.
需要注意的是,卡通风格可以由本领域技术人员根据实际需求,如三维人脸卡通模型的应用场景或群体调研结果等,进行相应设置,在此不做具体限制。It should be noted that the cartoon style can be set accordingly by those skilled in the art according to actual needs, such as the application scenarios of the three-dimensional face cartoon model or the results of group research, etc., which is not specifically limited here.
步骤S304:特征融合,生成初始三维人脸卡通模型。作为一种可能实现的方式,本申请实施例可以从二维人脸特征和三维人脸特征中分别提取出相应的关键特征,并将从二维人脸特征中提取的关键特征和从三维人脸特征中提取的对应的关键特征以及预设的卡通风格特征进行融合,进而获得更符合目标人物形象的目标人脸的特定风格的三维卡通人脸模型。Step S304: Feature fusion to generate an initial three-dimensional face cartoon model. As a possible way of implementation, the embodiment of the present application can extract corresponding key features from two-dimensional face features and three-dimensional face features, respectively, and combine the key features extracted from the two-dimensional face features and the three-dimensional face features. The corresponding key features extracted from the face features and the preset cartoon style features are fused to obtain a three-dimensional cartoon face model with a specific style of the target face that is more in line with the target character image.
其中,关键特征是指对于最终呈现的三维卡通人脸模型具有更高的人物辨识度以及人物满意度,同时与卡通风格保持一致的特征。三种特征,即二维人脸特征、三维人脸特征和预设卡通风格特征各自包含有不同的具体特征族,每个特征均代表了人脸面部器官的某个特点,具体融合方式是通过加权方法控制不同特点在最终人脸面部建模结果中的显示程度,使得某个人代表性的特征突出展示,相对弱化其它次要特征,具有满足辨识度指标的要求。Among them, the key feature refers to a feature that has higher character recognition and character satisfaction for the final presented 3D cartoon face model, and is consistent with the cartoon style. The three features, namely 2D facial features, 3D facial features and preset cartoon style features, each contain different specific feature families, each feature represents a certain feature of the facial organs of the human face, and the specific fusion method is through The weighting method controls the degree of display of different features in the final face modeling results, so that the representative features of a certain person are prominently displayed, and other secondary features are relatively weakened, which meets the requirements of the identification index.
本申请实施例通过融合二维人脸特征、三维人脸特征和预设卡通风格特征,使得生成的三维人脸模型具备更高的辨识度,且可以随着卡通风格的变化进行风格转变,灵活度更高,人物满意度更高。In the embodiment of the present application, by fusing two-dimensional face features, three-dimensional face features and preset cartoon style features, the generated three-dimensional face model has a higher degree of recognition, and can change the style according to the change of cartoon style, flexibly The higher the degree, the higher the character satisfaction.
步骤S305:对初始三维人脸卡通模型进行迭代优化,获得最终三维人脸卡通模型。进一步地,本申请实施例可以通过计算对初始三维卡通人脸模型进行优化,具体地,本申请实施例可以计算初始三维卡通人脸模型的呈现程度,当呈现程度小于预设阈值时,本申请实施例可以基于预设标准对二维人脸特征、三维人脸特征和预设卡通风格特征中的至少一个特征线性加权,进而生成新的三维卡通人脸模型,实现迭代优化,直至新的三维卡通人脸模型的呈现程度大于或等于预设阈值,得到最终的三维卡通人脸模型。Step S305: Perform iterative optimization on the initial three-dimensional face cartoon model to obtain a final three-dimensional face cartoon model. Further, the embodiment of the present application can optimize the initial three-dimensional cartoon face model by calculation. Specifically, the embodiment of the present application can calculate the presentation degree of the initial three-dimensional cartoon face model. When the presentation degree is less than a preset threshold, the present application The embodiment can linearly weight at least one of the two-dimensional face features, the three-dimensional face features, and the preset cartoon style features based on preset criteria, and then generate a new three-dimensional cartoon face model, and realize iterative optimization until the new three-dimensional The rendering degree of the cartoon face model is greater than or equal to the preset threshold, and the final three-dimensional cartoon face model is obtained.
其中,预设标准可以为人物辨识度和人物满意度的标准,具体地,辨识度与满意度的高低是服务于目标群体的,前者衡量标准在于输入图像与输出模型在关键特征上的Loss大小,后者基于目标群体的调研,如问卷上对结果的打分高低,此外,当两者相互矛盾时以人物满意度为主。Among them, the preset standard can be the standard of character recognition and character satisfaction. Specifically, the level of recognition and satisfaction is to serve the target group, and the former is measured by the loss of the key features of the input image and the output model. , the latter is based on the research of the target group, such as the score of the results on the questionnaire, in addition, when the two contradict each other, the character satisfaction is the main factor.
其中,呈现程度的计算公式为:The formula for calculating the degree of presentation is:
其中,K表示与人物辨识度强相关总特征数量,M表示与人物满意度指标强相关总特征数量,TV为第V轮迭代前原先特征在呈现到模型中的程度,a表示人物辨识度对最终特征呈现所占的比重,1-a表示人物满意度对最终特征呈现所占的比重,DV(k)为第V轮中各特征在辨识度中所占比重,SV(m)为第V轮中各特征在满意度中所占比重。Among them, K represents the total number of features strongly related to character recognition, M represents the total number of features strongly related to the character satisfaction index, T V is the degree to which the original features are presented in the model before the V-th iteration, and a represents the character recognition The proportion of the final feature presentation, 1-a represents the proportion of the character satisfaction to the final feature presentation, D V (k) is the proportion of each feature in the V round of recognition, S V (m) is the proportion of each feature in the satisfaction in the V round.
进一步地,括号中的项,即:Further, the items in parentheses, namely:
为第V轮中各面部特征所占有的权重的更新值,将各特征权重更新值作用于上一轮特征呈现度TV后即得到第V轮迭代后原先特征在呈现到模型中的程度tV。is the updated value of the weight occupied by each facial feature in the Vth round. After applying the updated value of each feature weight to the feature presentation degree T V of the previous round, the degree t of the original feature in the model after the V round of iteration is obtained. V.
进一步地,本申请实施例可以按照各个关键特征在人物辨识度和人物满意度的标准下,第V轮迭代的线性加权,进而得到第V轮下的各特征权重值,进一步地,本申请实施例可以将这一变化值乘在上一轮的程度呈现值上,以得到下一轮的呈现值。Further, in this embodiment of the present application, the linear weighting of the V-th round of iterations can be performed according to the criteria of character recognition degree and character satisfaction for each key feature, and then each feature weight value under the V-th round can be obtained. Further, the present application implements For example, this change value can be multiplied by the degree present value of the previous round to obtain the present value of the next round.
需要注意的是,预设阈值可以由本领域技术人员根据实际情况进行相应设置;预设标准可以由本领域技术人员根据三维卡通人脸模型的应用场景及目标群众的调研结果进行相应设置,在此不做具体限制。It should be noted that the preset threshold can be set correspondingly by those skilled in the art according to the actual situation; the preset standard can be set correspondingly by those skilled in the art according to the application scene of the 3D cartoon face model and the research results of the target audience, which is not described here. make specific restrictions.
根据本申请实施例提出的三维卡通人脸建模方法,可以利用三维深度图提取目标人物的三维人脸特征,并融合目标人物二维图像中的二维人脸特征及卡通风格特征,从而生成目标人脸的三维卡通人脸模型,使得生成的三维人脸模型具备更高的辨识度,且可以随着卡通风格的变化进行风格转变,灵活度更高,有效满足建模需求,提升使用体验。由此,解决了相关技术中基于神经网络对真实人脸进行卡通形象建模,导致生成的卡通形象风格单一且辨识度较低的技术问题。According to the 3D cartoon face modeling method proposed in this embodiment of the present application, the 3D depth map can be used to extract the 3D facial features of the target person, and the 2D facial features and cartoon style features in the 2D image of the target person can be fused to generate The 3D cartoon face model of the target face makes the generated 3D face model more recognizable, and can change the style with the change of cartoon style, which is more flexible, effectively meets the modeling needs and improves the user experience. . As a result, the technical problem of the cartoon image modeling of the real face based on the neural network in the related art, resulting in a single style and low recognition degree of the generated cartoon image, is solved.
其次参照附图描述根据本申请实施例提出的三维卡通人脸建模装置。Next, the three-dimensional cartoon face modeling apparatus proposed according to the embodiments of the present application will be described with reference to the accompanying drawings.
图4是本申请实施例的三维卡通人脸建模装置的方框示意图。FIG. 4 is a schematic block diagram of a three-dimensional cartoon face modeling apparatus according to an embodiment of the present application.
如图4所示,该三维卡通人脸建模装置10包括:第一提取模块100、第二提取模块200 和建模模块300。As shown in FIG. 4 , the three-dimensional cartoon
具体地,第一提取模块100,用于提取目标人物在二维图像中人脸的二维人脸特征。Specifically, the
第二提取模块200,用于根据目标人物的三维深度图提取人脸的三维人脸特征。The
建模模块300,用于融合二维人脸特征、三维人脸特征和预设卡通风格特征,生成目标人脸的三维卡通人脸模型。The
可选地,在本申请的一个实施例中,建模模块300包括:融合单元和计算单元。Optionally, in an embodiment of the present application, the
其中,融合单元,用于根据由二维人脸特征、三维人脸特征和预设卡通风格特征融合得到的融合特征生成目标人脸的初始三维卡通人脸模型。Wherein, the fusion unit is used to generate an initial 3D cartoon face model of the target face according to the fusion feature obtained by fusing the 2D face feature, the 3D face feature and the preset cartoon style feature.
计算单元,用于计算初始三维卡通人脸模型的呈现程度,并在呈现程度小于预设阈值时,基于预设标准对二维人脸特征、三维人脸特征和预设卡通风格特征中的至少一个特征线性加权,生成新的三维卡通人脸模型,迭代优化,直至新的三维卡通人脸模型的呈现程度大于或等于预设阈值,得到最终三维卡通人脸模型。The calculation unit is used to calculate the presentation degree of the initial three-dimensional cartoon face model, and when the presentation degree is less than the preset threshold, based on the preset standard, at least one of the two-dimensional face features, the three-dimensional face features and the preset cartoon style features is evaluated. A feature is linearly weighted to generate a new 3D cartoon face model, and iteratively optimize until the rendering degree of the new 3D cartoon face model is greater than or equal to a preset threshold, and the final 3D cartoon face model is obtained.
可选地,在本申请的一个实施例中,建模模块300进一步用于,获取当前迭代轮下的至少一个特征的当前特征权重值;根据当前特征权重值和上一迭代轮下的新的三维卡通人脸模型的呈现程度得到当前迭代轮的新的三维卡通人脸模型的呈现程度。Optionally, in an embodiment of the present application, the
可选地,在本申请的一个实施例中,呈现程度的计算公式为:Optionally, in an embodiment of the present application, the calculation formula of the presentation degree is:
其中,K表示与人物辨识度强相关总特征数量,M表示与人物满意度指标强相关总特征数量,TV为第V轮迭代前原先特征在呈现到模型中的程度,a表示人物辨识度对最终特征呈现所占的比重,1-a表示人物满意度对最终特征呈现所占的比重,DV(k)为第V轮中各特征在辨识度中所占比重,SV(m)为第V轮中各特征在满意度中所占比重。Among them, K represents the total number of features strongly related to character recognition, M represents the total number of features strongly related to the character satisfaction index, T V is the degree to which the original features are presented in the model before the V-th iteration, and a represents the character recognition The proportion of the final feature presentation, 1-a represents the proportion of the character satisfaction to the final feature presentation, D V (k) is the proportion of each feature in the V round of recognition, S V (m) is the proportion of each feature in the satisfaction in the V round.
可选地,在本申请的一个实施例中,预设卡通风格特征包括至少一个迪士尼卡通风格特征、至少一个日本卡通风格特征和至少一个Meta卡通风格特征中的至少一个。Optionally, in an embodiment of the present application, the preset cartoon style features include at least one of at least one Disney cartoon style feature, at least one Japanese cartoon style feature, and at least one Meta cartoon style feature.
需要说明的是,前述对三维卡通人脸建模方法实施例的解释说明也适用于该实施例的三维卡通人脸建模装置,此处不再赘述。It should be noted that, the foregoing explanations on the embodiment of the three-dimensional cartoon face modeling method are also applicable to the three-dimensional cartoon face modeling apparatus of this embodiment, which will not be repeated here.
根据本申请实施例提出的三维卡通人脸建模装置,可以利用三维深度图提取目标人物的三维人脸特征,并融合目标人物二维图像中的二维人脸特征及卡通风格特征,从而生成目标人脸的三维卡通人脸模型,使得生成的三维人脸模型具备更高的辨识度,且可以随着卡通风格的变化进行风格转变,灵活度更高,有效满足建模需求,提升使用体验。由此,解决了相关技术中基于神经网络对真实人脸进行卡通形象建模,导致生成的卡通形象风格单一且辨识度较低的技术问题。According to the 3D cartoon face modeling device proposed in the embodiment of the present application, the 3D face feature of the target person can be extracted by using the 3D depth map, and the 2D face feature and cartoon style feature in the 2D image of the target person can be fused to generate The 3D cartoon face model of the target face makes the generated 3D face model more recognizable, and can change the style with the change of cartoon style, which is more flexible, effectively meets the modeling needs and improves the user experience. . As a result, the technical problem of the cartoon image modeling of the real face based on the neural network in the related art, resulting in a single style and low recognition degree of the generated cartoon image, is solved.
图5为本申请实施例提供的电子设备的结构示意图。该电子设备可以包括:FIG. 5 is a schematic structural diagram of an electronic device provided by an embodiment of the present application. The electronic device may include:
存储器501、处理器502及存储在存储器501上并可在处理器502上运行的计算机程序。
处理器502执行程序时实现上述实施例中提供的三维卡通人脸建模方法。When the
进一步地,电子设备还包括:Further, the electronic device also includes:
通信接口503,用于存储器501和处理器502之间的通信。The
存储器501,用于存放可在处理器502上运行的计算机程序。The
存储器501可能包含高速RAM存储器,也可能还包括非易失性存储器(non-volatile memory),例如至少一个磁盘存储器。The
如果存储器501、处理器502和通信接口503独立实现,则通信接口503、存储器501和处理器502可以通过总线相互连接并完成相互间的通信。总线可以是工业标准体系结构(Industry Standard Architecture,简称为ISA)总线、外部设备互连(PeripheralComponent,简称为PCI)总线或扩展工业标准体系结构(Extended Industry StandardArchitecture,简称为EISA)总线等。总线可以分为地址总线、数据总线、控制总线等。为便于表示,图5中仅用一条粗线表示,但并不表示仅有一根总线或一种类型的总线。If the
可选地,在具体实现上,如果存储器501、处理器502及通信接口503,集成在一块芯片上实现,则存储器501、处理器502及通信接口503可以通过内部接口完成相互间的通信。Optionally, in terms of specific implementation, if the
处理器502可能是一个中央处理器(Central Processing Unit,简称为CPU),或者是特定集成电路(Application Specific Integrated Circuit,简称为ASIC),或者是被配置成实施本申请实施例的一个或多个集成电路。The
本实施例还提供一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现如上的三维卡通人脸建模方法。This embodiment also provides a computer-readable storage medium on which a computer program is stored, and when the program is executed by the processor, implements the above three-dimensional cartoon face modeling method.
在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本申请的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不必须针对的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任一个或N个实施例或示例中以合适的方式结合。此外,在不相互矛盾的情况下,本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。In the description of this specification, description with reference to the terms "one embodiment," "some embodiments," "example," "specific example," or "some examples", etc., mean specific features described in connection with the embodiment or example , structure, material or feature is included in at least one embodiment or example of the present application. In this specification, schematic representations of the above terms are not necessarily directed to the same embodiment or example. Furthermore, the particular features, structures, materials or characteristics described may be combined in any suitable manner in any one or N of the embodiments or examples. Furthermore, those skilled in the art may combine and combine the different embodiments or examples described in this specification, as well as the features of the different embodiments or examples, without conflicting each other.
此外,术语“第一”、“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括至少一个该特征。在本申请的描述中,“N个”的含义是至少两个,例如两个,三个等,除非另有明确具体的限定。In addition, the terms "first" and "second" are only used for descriptive purposes, and should not be construed as indicating or implying relative importance or implying the number of indicated technical features. Thus, a feature delimited with "first", "second" may expressly or implicitly include at least one of that feature. In the description of the present application, "N" means at least two, such as two, three, etc., unless otherwise expressly and specifically defined.
流程图中或在此以其他方式描述的任何过程或方法描述可以被理解为,表示包括一个或N个用于实现定制逻辑功能或过程的步骤的可执行指令的代码的模块、片段或部分,并且本申请的优选实施方式的范围包括另外的实现,其中可以不按所示出或讨论的顺序,包括根据所涉及的功能按基本同时的方式或按相反的顺序,来执行功能,这应被本申请的实施例所属技术领域的技术人员所理解。Any process or method description in the flowcharts or otherwise described herein may be understood to represent a module, segment or portion of code comprising one or N executable instructions for implementing custom logical functions or steps of the process, And the scope of the preferred embodiments of the present application includes alternative implementations in which the functions may be performed out of the order shown or discussed, including performing the functions in a substantially simultaneous manner or in the reverse order depending upon the functions involved, which should be Those skilled in the art to which the embodiments of the present application pertain will be understood.
在流程图中表示或在此以其他方式描述的逻辑和/或步骤,例如,可以被认为是用于实现逻辑功能的可执行指令的定序列表,可以具体实现在任何计算机可读介质中,以供指令执行系统、装置或设备(如基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设备取指令并执行指令的系统)使用,或结合这些指令执行系统、装置或设备而使用。就本说明书而言,"计算机可读介质"可以是任何可以包含、存储、通信、传播或传输程序以供指令执行系统、装置或设备或结合这些指令执行系统、装置或设备而使用的装置。计算机可读介质的更具体的示例(非穷尽性列表)包括以下:具有一个或N个布线的电连接部(电子装置),便携式计算机盘盒(磁装置),随机存取存储器(RAM),只读存储器(ROM),可擦除可编辑只读存储器(EPROM或闪速存储器),光纤装置,以及便携式光盘只读存储器(CDROM)。另外,计算机可读介质甚至可以是可在其上打印所述程序的纸或其他合适的介质,因为可以例如通过对纸或其他介质进行光学扫描,接着进行编辑、解译或必要时以其他合适方式进行处理来以电子方式获得所述程序,然后将其存储在计算机存储器中。The logic and/or steps represented in flowcharts or otherwise described herein, for example, may be considered an ordered listing of executable instructions for implementing the logical functions, may be embodied in any computer-readable medium, For use with, or in conjunction with, an instruction execution system, apparatus, or device (such as a computer-based system, a system including a processor, or other system that can fetch instructions from and execute instructions from an instruction execution system, apparatus, or apparatus) or equipment. For the purposes of this specification, a "computer-readable medium" can be any device that can contain, store, communicate, propagate, or transport the program for use by or in connection with an instruction execution system, apparatus, or apparatus. More specific examples (non-exhaustive list) of computer readable media include the following: electrical connections (electronic devices) with one or N wires, portable computer disk cartridges (magnetic devices), random access memory (RAM), Read Only Memory (ROM), Erasable Editable Read Only Memory (EPROM or Flash Memory), Fiber Optic Devices, and Portable Compact Disc Read Only Memory (CDROM). In addition, the computer readable medium may even be paper or other suitable medium on which the program may be printed, as the paper or other medium may be optically scanned, for example, followed by editing, interpretation, or other suitable medium as necessary process to obtain the program electronically and then store it in computer memory.
应当理解,本申请的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式中,N个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。如,如果用硬件来实现和在另一实施方式中一样,可用本领域公知的下列技术中的任一项或他们的组合来实现:具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路,具有合适的组合逻辑门电路的专用集成电路,可编程门阵列(PGA),现场可编程门阵列(FPGA)等。It should be understood that various parts of this application may be implemented in hardware, software, firmware, or a combination thereof. In the above-described embodiments, the N steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware as in another embodiment, it can be implemented by any one of the following techniques known in the art, or a combination thereof: discrete with logic gates for implementing logic functions on data signals Logic circuits, application specific integrated circuits with suitable combinational logic gates, Programmable Gate Arrays (PGA), Field Programmable Gate Arrays (FPGA), etc.
本技术领域的普通技术人员可以理解实现上述实施例方法携带的全部或部分步骤是可以通过程序来指令相关的硬件完成,所述的程序可以存储于一种计算机可读存储介质中,该程序在执行时,包括方法实施例的步骤之一或其组合。Those skilled in the art can understand that all or part of the steps carried by the methods of the above embodiments can be completed by instructing the relevant hardware through a program, and the program can be stored in a computer-readable storage medium, and the program can be stored in a computer-readable storage medium. When executed, one or a combination of the steps of the method embodiment is included.
此外,在本申请各个实施例中的各功能单元可以集成在一个处理模块中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个模块中。上述集成的模块既可以采用硬件的形式实现,也可以采用软件功能模块的形式实现。所述集成的模块如果以软件功能模块的形式实现并作为独立的产品销售或使用时,也可以存储在一个计算机可读取存储介质中。In addition, each functional unit in each embodiment of the present application may be integrated into one processing module, or each unit may exist physically alone, or two or more units may be integrated into one module. The above-mentioned integrated modules can be implemented in the form of hardware, and can also be implemented in the form of software function modules. If the integrated modules are implemented in the form of software functional modules and sold or used as independent products, they may also be stored in a computer-readable storage medium.
上述提到的存储介质可以是只读存储器,磁盘或光盘等。尽管上面已经示出和描述了本申请的实施例,可以理解的是,上述实施例是示例性的,不能理解为对本申请的限制,本领域的普通技术人员在本申请的范围内可以对上述实施例进行变化、修改、替换和变型。The above-mentioned storage medium may be a read-only memory, a magnetic disk or an optical disk, and the like. Although the embodiments of the present application have been shown and described above, it should be understood that the above embodiments are exemplary and should not be construed as limitations to the present application. Embodiments are subject to variations, modifications, substitutions and variations.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210483519.4A CN114998554B (en) | 2022-05-05 | 2022-05-05 | Three-dimensional cartoon face modeling method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210483519.4A CN114998554B (en) | 2022-05-05 | 2022-05-05 | Three-dimensional cartoon face modeling method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114998554A true CN114998554A (en) | 2022-09-02 |
CN114998554B CN114998554B (en) | 2024-08-20 |
Family
ID=83024474
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210483519.4A Active CN114998554B (en) | 2022-05-05 | 2022-05-05 | Three-dimensional cartoon face modeling method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114998554B (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108765273A (en) * | 2018-05-31 | 2018-11-06 | Oppo广东移动通信有限公司 | Virtual face-lifting method and device for taking pictures of faces |
CN111354079A (en) * | 2020-03-11 | 2020-06-30 | 腾讯科技(深圳)有限公司 | Three-dimensional face reconstruction network training and virtual face image generation method and device |
CN111402394A (en) * | 2020-02-13 | 2020-07-10 | 清华大学 | Three-dimensional exaggerated cartoon face generation method and device |
CN114299206A (en) * | 2021-12-31 | 2022-04-08 | 清华大学 | Three-dimensional cartoon face generation method and device, electronic equipment and storage medium |
CN114419253A (en) * | 2022-01-13 | 2022-04-29 | 广州虎牙科技有限公司 | A kind of cartoon face construction, live broadcast method and related device |
-
2022
- 2022-05-05 CN CN202210483519.4A patent/CN114998554B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108765273A (en) * | 2018-05-31 | 2018-11-06 | Oppo广东移动通信有限公司 | Virtual face-lifting method and device for taking pictures of faces |
CN111402394A (en) * | 2020-02-13 | 2020-07-10 | 清华大学 | Three-dimensional exaggerated cartoon face generation method and device |
CN111354079A (en) * | 2020-03-11 | 2020-06-30 | 腾讯科技(深圳)有限公司 | Three-dimensional face reconstruction network training and virtual face image generation method and device |
CN114299206A (en) * | 2021-12-31 | 2022-04-08 | 清华大学 | Three-dimensional cartoon face generation method and device, electronic equipment and storage medium |
CN114419253A (en) * | 2022-01-13 | 2022-04-29 | 广州虎牙科技有限公司 | A kind of cartoon face construction, live broadcast method and related device |
Non-Patent Citations (2)
Title |
---|
YI ZHENG等: "Cartoon Face Recognition: A Benchmark Dataset", HTTPS://ARXIV.ORG/ABS/1907.13394, 27 June 2020 (2020-06-27) * |
署光等: "基于稀疏形变模型的三维卡通人脸生成", 电子学报, 15 August 2010 (2010-08-15) * |
Also Published As
Publication number | Publication date |
---|---|
CN114998554B (en) | 2024-08-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111354079A (en) | Three-dimensional face reconstruction network training and virtual face image generation method and device | |
CN119963705A (en) | Method, computing device, and computer-readable storage medium for portrait animation | |
CN114092615B (en) | UV mapping on 3D objects using artificial intelligence | |
EP3923244A1 (en) | Animated image driving method and apparatus based on artificial intelligence | |
CN111047509B (en) | Image special effect processing method, device and terminal | |
US20140306962A1 (en) | Mesh skinning technique | |
CN111652123B (en) | Image processing and image synthesizing method, device and storage medium | |
JP2023029984A (en) | Method, device, electronic apparatus, and readable storage medium for generating virtual image | |
CN107578467B (en) | Three-dimensional modeling method and device for medical instrument | |
CN108463823A (en) | A reconstruction method, device and terminal of a user's hair model | |
CN111833236A (en) | Method and device for generating three-dimensional face model simulating user | |
CN114708374A (en) | Virtual image generation method, device, electronic device and storage medium | |
CN114494611A (en) | Intelligent three-dimensional reconstruction method, device, equipment and medium based on nerve basis function | |
CN113223128B (en) | Method and apparatus for generating image | |
CN118691744A (en) | Three-dimensional Gaussian radiation field training method, device, equipment, storage medium and program product | |
CN115690281B (en) | Character expression driving method and device, storage medium, electronic device | |
CN114998554B (en) | Three-dimensional cartoon face modeling method and device | |
CN116934948A (en) | Re-illuminable three-dimensional digital human construction method and device based on multi-view video | |
CN116894919A (en) | A digital human generation method, device, electronic device and readable storage medium | |
CN114373034A (en) | Image processing method, image processing apparatus, image processing device, storage medium, and computer program | |
CN114781642B (en) | A method and device for generating cross-media correspondence knowledge | |
CN115390663B (en) | Virtual man-machine interaction method, system, equipment and storage medium | |
CN116524106B (en) | Image labeling method, device, equipment, storage medium and program product | |
CN109919016B (en) | A method and device for generating facial expressions on objects without facial organs | |
US20240013500A1 (en) | Method and apparatus for generating expression model, device, and medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |