CN112200894B - Automatic digital human facial expression animation migration method based on deep learning framework - Google Patents


Info

Publication number
CN112200894B
CN112200894B
Authority
CN
China
Prior art keywords
model
controller
digital
animation
expression
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011413230.2A
Other languages
Chinese (zh)
Other versions
CN112200894A (en)
Inventor
赵锐
侯志迎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Yuanli Digital Technology Co ltd
Original Assignee
Jiangsu Yuanli Digital Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu Yuanli Digital Technology Co ltd filed Critical Jiangsu Yuanli Digital Technology Co ltd
Priority to CN202011413230.2A
Publication of CN112200894A
Application granted
Publication of CN112200894B
Legal status: Active (current)
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T 13/00 Animation
    • G06T 13/20 3D [Three Dimensional] animation
    • G06T 13/40 3D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 Computing arrangements based on biological models
    • G06N 3/02 Neural networks
    • G06N 3/04 Architecture, e.g. interconnection topology
    • G06N 3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 Computing arrangements based on biological models
    • G06N 3/02 Neural networks
    • G06N 3/08 Learning methods

Abstract

The invention provides an automatic digital human facial expression animation migration method based on a deep learning framework, which comprises the following steps. S1: generate training data by producing multiple frames of the same expressions on two different digital human models, extract the parameter values of the animation controller channels corresponding to the expression models, and generate a vector with the dimensionality of the animation controller parameters. S2: build the neural network model as a four-layer neural network. S3: train the network model to establish the relation between the controller parameter values of different digital humans under the same expression. S4: perform expression migration with the trained model: the controller parameter values of one digital human are input into the network model to obtain the controller parameter values corresponding to the same expression of the other digital human, and the generated controller parameter values are applied to that digital human model, thereby driving the mesh vertex positions in 3D space on the digital human model. By replacing manual operation with automatic migration, the invention greatly improves the production efficiency of virtual character animation.

Description

Automatic digital human facial expression animation migration method based on deep learning framework
Technical Field
The invention belongs to the technical field of animation production, and particularly relates to an automatic digital human facial expression animation migration method based on a deep learning framework.
Background
In the existing animation production process, virtual digital human facial expression animation is produced by an animator who adjusts the values of the facial model controllers and manually sets key frames. Because the animation of a virtual character depends entirely on manual operation, it is time-consuming, inefficient and costly in labor. When the same animation has to be made for a second, different virtual digital human character, the animator must repeat the same operations; no automated process exists to replace this work, which greatly hinders the production efficiency of high-precision facial expression animation.
Disclosure of Invention
The invention aims to provide an automatic digital human facial expression animation migration method based on a deep learning framework, which greatly improves the production efficiency of virtual character animation by replacing manual operation with automatic migration.
the invention provides the following technical scheme:
the digital human facial expression animation automatic migration method based on the deep learning framework comprises the following steps:
s1: generating training data, making the same expression models of multiple frames of two different digital persons, extracting the parameter values of animation controller channels corresponding to the expression models, and generating a vector of the parameter dimensions of the animation controller;
s2: building a model of the neural network, building four layers of neural networks, wherein input and output are controller parameter vectors, and the middle two layers of hidden layers are respectively provided with a plurality of neurons;
s3: training a network model, establishing a relation between controller parameter values under the same expression of different digital persons, taking the controller parameter value of one digital person as input, taking the controller parameter value of the other digital person as a label, establishing a regression task, adopting a mean square error as a cost function, and storing a model parameter with the lowest loss value in the training process;
s4: the method comprises the steps that a trained model is used for expression migration, a controller parameter value of one digital person is input into a network model to obtain a controller parameter value corresponding to the same expression of the other digital person, and the generated controller parameter value is applied to the digital person model, so that the network 3D space vertex position on the digital person model is driven;
preferably, the extraction process in S1 includes the following steps: extracting an animation controller channel attribute value on the model from each key frame, writing the animation controller channel attribute value into a text file, wherein each line of the text file represents one frame, each column represents one controller channel, and numerical values record the controller channel attribute value;
preferably, in S1, a key frame is selected to extract a parameter value of an animation controller channel corresponding to the expression model, where the key frame is a representative expression;
preferably, there are 500 neurons in each hidden layer in S2;
preferably, the network model in S4 is trained and then used as a tool in Maya software in real time;
the invention has the beneficial effects that: the invention learns the mapping relation between the channel parameters of the animation controller among different models by using the neural network, thereby realizing the effect of driving the animation of another model by using the animation of the known model, and the whole manufacturing process can be generated automatically without manual operation.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention. In the drawings:
Fig. 1 is a diagram of the network architecture used by the present invention.
Detailed Description
The automatic digital human facial expression animation migration method based on the deep learning framework comprises the following steps:
S1: generating training data: producing multiple frames of the same expressions on two different digital human models, extracting the parameter values of the animation controller channels corresponding to the expression models, and generating a vector with the dimensionality of the animation controller parameters;
S2: building the neural network model: building a four-layer neural network in which the input and output are controller parameter vectors and the two middle hidden layers each contain a plurality of neurons;
S3: training the network model: establishing the relation between the controller parameter values of different digital humans under the same expression, taking the controller parameter values of one digital human as input and those of the other digital human as labels, setting up a regression task with the mean squared error as the cost function, and saving the model parameters with the lowest loss value during training;
S4: performing expression migration with the trained model: inputting the controller parameter values of one digital human into the network model to obtain the controller parameter values corresponding to the same expression of the other digital human, and applying the generated controller parameter values to that digital human model, thereby driving the mesh vertex positions in 3D space on the digital human model;
the method comprises the following specific implementation steps:
step S1: training data generation
101: An artist produces multiple frames of the same expressions for two different digital humans, A and B;
102: A plug-in is written to extract the parameter values of the animation controller channels corresponding to the models; the parameter values lie between 0 and 1, and a vector with the dimensionality of the animation controller parameters is generated;
103: A number of key frames are selected (standard representative expressions such as laughing, smiling and crying);
A C++ plug-in is written for Maya; for each key frame, the plug-in extracts the animation controller channel attribute values on the model and writes them into a text file. Each line of the text file represents one frame, each column represents one controller channel, and the recorded values are the controller channel attribute values;
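To illustrate the text-file layout described above, the following Python sketch loads a pair of such files into paired training arrays. The file names and the use of NumPy are assumptions for illustration; the patent only specifies the one-frame-per-line, one-channel-per-column layout.

```python
import numpy as np

def load_controller_file(path):
    """Read one exported text file into an (n_frames, n_channels) float array."""
    rows = []
    with open(path, "r", encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:
                rows.append([float(v) for v in line.split()])
    return np.asarray(rows, dtype=np.float32)

# Paired samples: frame i of A and frame i of B show the same expression.
# "digital_human_A.txt" / "digital_human_B.txt" are hypothetical file names.
inputs = load_controller_file("digital_human_A.txt")   # e.g. shape (n_frames, ~127)
labels = load_controller_file("digital_human_B.txt")   # same number of frames as A
assert inputs.shape[0] == labels.shape[0]
```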
step S2: neural network construction
As shown in fig. 1, a four-layer neural network is established; the input and output are controller parameter vectors, and the two middle hidden layers have 500 neurons each. Because the controller channel parameters (about 127 channels) represent a small amount of data, an overly complex network would easily over-fit;
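A minimal sketch of this four-layer network is shown below. The patent does not name a deep learning framework or an activation function, so PyTorch and ReLU are assumptions; only the layer sizes (controller-vector input and output, two hidden layers of 500 neurons) follow the description and fig. 1.

```python
import torch.nn as nn

N_CHANNELS = 127  # approximate number of animation controller channels

class ControllerMappingNet(nn.Module):
    """Maps the controller vector of digital human A to that of digital human B."""
    def __init__(self, in_dim=N_CHANNELS, out_dim=N_CHANNELS, hidden=500):
        super().__init__()
        self.layers = nn.Sequential(
            nn.Linear(in_dim, hidden),   # input layer -> hidden layer 1
            nn.ReLU(),
            nn.Linear(hidden, hidden),   # hidden layer 1 -> hidden layer 2
            nn.ReLU(),
            nn.Linear(hidden, out_dim),  # hidden layer 2 -> output controller vector
        )

    def forward(self, x):
        return self.layers(x)
```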
step S3: training network
The aim is to establish the relation between the controller parameter values of digital human A and digital human B under the same expression: the controller parameter values of digital human A serve as input, the controller parameter values of digital human B serve as labels, a regression task is set up with the mean squared error as the cost function, and the model parameters with the lowest loss value are saved during training;
Specifically, the regression task is solved by the training process: assuming the parameter values needed for digital human B to achieve the same expression form a vector y, the network prediction is ŷ. Training compares ŷ with the reference value y and continuously adjusts the weights and biases of the network until the prediction error is minimized; the cost function (loss function) can be chosen as the mean squared error. When the error falls below a satisfactory threshold, training is stopped and the state of the network (i.e. the weights of each neuron) at that moment is recorded. The model holding these network weights is the trained network model and is used in the next step;
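Continuing the sketches above, a hedged example of this regression training loop is given below: the controller vectors of digital human A are the inputs, those of B are the labels, the mean squared error is the cost function, and the weights with the lowest loss are saved. The optimizer, learning rate and epoch count are illustrative assumptions, not values stated in the patent.

```python
import torch

# Reuses `inputs`, `labels` and `ControllerMappingNet` from the earlier sketches.
x = torch.from_numpy(inputs)   # controller vectors of digital human A
y = torch.from_numpy(labels)   # controller vectors of digital human B (ground truth)

model = ControllerMappingNet()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)  # optimizer choice is an assumption
loss_fn = torch.nn.MSELoss()                               # mean squared error cost function

best_loss = float("inf")
for epoch in range(2000):
    optimizer.zero_grad()
    y_hat = model(x)           # network prediction ŷ
    loss = loss_fn(y_hat, y)   # compare ŷ with the reference value y
    loss.backward()
    optimizer.step()
    if loss.item() < best_loss:    # keep the model parameters with the lowest loss
        best_loss = loss.item()
        torch.save(model.state_dict(), "best_controller_mapping.pt")
```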
step S4: expression migration using trained models
401: The controller parameter values of digital human A are input into the network model to obtain the controller parameter values corresponding to the same expression of digital human B;
402: A plug-in is written to apply the generated controller parameter values to model B, thereby driving the mesh vertex positions in 3D space on model B (see the sketch after this list);
403: After training, the network model can be used in real time as a tool within Maya software;
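A sketch of steps 401 and 402, reusing the helpers from the earlier sketches: the trained model predicts B's controller values from A's, and the predicted values are keyed onto B's controllers inside Maya. The controller channel names, the saved-weights file name and the use of Python (the patent describes a C++ plug-in) are assumptions; maya.cmds is only available inside a Maya Python session.

```python
import torch
import maya.cmds as cmds

model = ControllerMappingNet()
model.load_state_dict(torch.load("best_controller_mapping.pt"))
model.eval()

# Hypothetical ordered list of controller channels on digital human B, matching the
# column order used when the training data was exported (truncated for illustration).
b_channels = ["ctrl_brow_L.translateY", "ctrl_brow_R.translateY", "ctrl_jaw.rotateX"]

with torch.no_grad():
    a_values = torch.from_numpy(load_controller_file("digital_human_A.txt"))
    b_values = model(a_values)   # predicted controller values for digital human B

# Key the predicted values onto B's controllers, one key frame per input frame.
for frame, row in enumerate(b_values):
    for channel, value in zip(b_channels, row.tolist()):
        node, attr = channel.split(".")
        cmds.setKeyframe(node, attribute=attr, time=frame, value=value)
```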
according to the automatic migration method of the digital human facial expression animation based on the deep learning framework, the facial expressions between digital human models are automatically converted through deep learning without manual operation, so that the known animation of the model A is directly applied to the model B, and natural and vivid animation effects are generated;
although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that changes may be made in the embodiments and/or equivalents thereof without departing from the spirit and scope of the invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (4)

1. An automatic digital human facial expression animation migration method based on a deep learning framework, characterized by comprising the following steps:
S1: generating training data: producing multiple frames of the same expressions on two different digital human models, extracting the parameter values of the animation controller channels corresponding to the expression models, and generating a vector with the dimensionality of the animation controller parameters;
the extraction process in S1 comprises the following steps: extracting the animation controller channel attribute values on the model from each key frame and writing them into a text file, where each line of the text file represents one frame, each column represents one controller channel, and the recorded values are the controller channel attribute values;
S2: building the neural network model: building a four-layer neural network in which the input and output are controller parameter vectors and the two middle hidden layers each contain a plurality of neurons;
S3: training the network model: establishing the relation between the controller parameter values of different digital humans under the same expression, taking the controller parameter values of one digital human as input and those of the other digital human as labels, setting up a regression task with the mean squared error as the cost function, and saving the model parameters with the lowest loss value during training;
S4: performing expression migration with the trained model: inputting the controller parameter values of one digital human into the network model to obtain the controller parameter values corresponding to the same expression of the other digital human, and applying the generated controller parameter values to that digital human model, thereby driving the mesh vertex positions in 3D space on the digital human model.
2. The method according to claim 1, wherein in step S1 key frames are selected for extracting the parameter values of the animation controller channels corresponding to the expression models, the key frames being representative expressions.
3. The automatic digital human facial expression animation migration method based on a deep learning framework according to claim 1, wherein the hidden layers in S2 have 500 neurons each.
4. The automatic digital human facial expression animation migration method based on a deep learning framework according to claim 1, wherein after training, the network model in S4 can be used in real time as a tool in Maya software.
CN202011413230.2A 2020-12-07 2020-12-07 Automatic digital human facial expression animation migration method based on deep learning framework Active CN112200894B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011413230.2A CN112200894B (en) 2020-12-07 2020-12-07 Automatic digital human facial expression animation migration method based on deep learning framework

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011413230.2A CN112200894B (en) 2020-12-07 2020-12-07 Automatic digital human facial expression animation migration method based on deep learning framework

Publications (2)

Publication Number Publication Date
CN112200894A CN112200894A (en) 2021-01-08
CN112200894B (en) 2021-03-09

Family

ID=74033806

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011413230.2A Active CN112200894B (en) 2020-12-07 2020-12-07 Automatic digital human facial expression animation migration method based on deep learning framework

Country Status (1)

Country Link
CN (1) CN112200894B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112700524B (en) * 2021-03-25 2021-07-02 江苏原力数字科技股份有限公司 3D character facial expression animation real-time generation method based on deep learning
CN113724367A (en) * 2021-07-13 2021-11-30 北京理工大学 Robot expression driving method and device
CN113781616B (en) * 2021-11-08 2022-02-08 江苏原力数字科技股份有限公司 Facial animation binding acceleration method based on neural network
CN113763519B (en) * 2021-11-09 2022-02-08 江苏原力数字科技股份有限公司 Voice-driven 3D character facial expression method based on deep learning
CN114898020A (en) * 2022-05-26 2022-08-12 唯物(杭州)科技有限公司 3D character real-time face driving method and device, electronic equipment and storage medium
CN116485959A (en) * 2023-04-17 2023-07-25 北京优酷科技有限公司 Control method of animation model, and adding method and device of expression

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109376692A (en) * 2018-11-22 2019-02-22 河海大学常州校区 Migration convolution neural network method towards facial expression recognition
CN111652121A (en) * 2020-06-01 2020-09-11 腾讯科技(深圳)有限公司 Training method of expression migration model, and expression migration method and device

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109376692A (en) * 2018-11-22 2019-02-22 河海大学常州校区 Migration convolution neural network method towards facial expression recognition
CN111652121A (en) * 2020-06-01 2020-09-11 腾讯科技(深圳)有限公司 Training method of expression migration model, and expression migration method and device

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Deep Multi-task Learning for Facial Expression Recognition and Synthesis Based on Selective Feature Sharing; Rui Zhao et al.; arXiv; 2020-07-09; pp. 1-8 *
Facial expression transfer method based on deep learning (基于深度学习的人脸表情迁移方法); 刘剑 et al.; 《计算机科学》 (Computer Science); 2019-06-30; Vol. 46, No. 6A; pp. 250-253 *
Research on transfer convolutional neural networks for facial expression recognition (面向人脸表情识别的迁移卷积神经网络研究); 翟懿奎 et al.; 《信号处理》 (Journal of Signal Processing); 2018-06-30; Vol. 34, No. 6; pp. 729-738 *

Also Published As

Publication number Publication date
CN112200894A (en) 2021-01-08

Similar Documents

Publication Publication Date Title
CN112200894B (en) Automatic digital human facial expression animation migration method based on deep learning framework
CN112454390B (en) Humanoid robot facial expression simulation method based on deep reinforcement learning
CN110599573B (en) Method for realizing real-time human face interactive animation based on monocular camera
CN109543838B (en) Image increment learning method based on variational self-encoder
CN113469356A (en) Improved VGG16 network pig identity recognition method based on transfer learning
CN113269872A (en) Synthetic video generation method based on three-dimensional face reconstruction and video key frame optimization
CN107491726A (en) A kind of real-time expression recognition method based on multi-channel parallel convolutional neural networks
CN107357838A (en) Dialog strategy canbe used on line method based on multi-task learning
CN107979764A (en) Video caption generation method based on semantic segmentation and multilayer notice frame
CN106503654A (en) A kind of face emotion identification method based on the sparse autoencoder network of depth
CN108921284A (en) Interpersonal interactive body language automatic generation method and system based on deep learning
CN111598979B (en) Method, device and equipment for generating facial animation of virtual character and storage medium
CN107102727A (en) Dynamic gesture study and recognition methods based on ELM neutral nets
CN112651360B (en) Skeleton action recognition method under small sample
CN113781616B (en) Facial animation binding acceleration method based on neural network
KR20150075909A (en) Method and apparatus for editing 3D character motion
CN110992455B (en) Real-time expression capture system
CN112700524A (en) 3D character facial expression animation real-time generation method based on deep learning
CN108453737A (en) A kind of robot motion track acquisition system and method based on neural network
CN117095071A (en) Picture or video generation method, system and storage medium based on main body model
CN116957866A (en) Individualized teaching device of digital man teacher
CN113763519B (en) Voice-driven 3D character facial expression method based on deep learning
CN109859537A (en) A kind of picture-weaving in silk tutoring system and method, information data processing terminal
CN115972197A (en) Robot action demonstration learning method based on joint angle trajectory coding
CN106097373B (en) A kind of smiling face's synthetic method based on branch's formula sparse component analysis model

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant