CN116310015A - Computer system, method and medium - Google Patents

Computer system, method and medium

Info

Publication number
CN116310015A
Authority
CN
China
Prior art keywords
key frame
sequence
person
distance
character
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310246436.8A
Other languages
Chinese (zh)
Inventor
张刘灿
丁佳伟
黄鹏
于秋燕
张翔
戚欠礼
明小凯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Ruoxi Enterprise Management Co ltd
Original Assignee
Hangzhou Ruoxi Enterprise Management Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Ruoxi Enterprise Management Co ltd
Priority to CN202310246436.8A
Publication of CN116310015A
Legal status: Pending (Current)


Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06T - IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00 - Animation
    • G06T13/80 - 2D [Two Dimensional] animation, e.g. using sprites
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06V - IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 - Arrangements for image or video recognition or understanding
    • G06V10/70 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74 - Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/761 - Proximity, similarity or dissimilarity measures
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06V - IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 - Scenes; Scene-specific elements
    • G06V20/40 - Scenes; Scene-specific elements in video content
    • G06V20/46 - Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06V - IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 - Scenes; Scene-specific elements
    • G06V20/40 - Scenes; Scene-specific elements in video content
    • G06V20/48 - Matching video sequences
    • Y - GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 - TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D - CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00 - Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Processing Or Creating Images (AREA)

Abstract

The invention relates to the technical field of animation production and discloses a computer system comprising: a frame image extraction module for extracting frame images from a three-dimensional animation video; a person recognition module for recognizing persons from the frame images and acquiring person pose data; a key frame extraction module for screening a key frame sequence from the frame images; a pose sequence correction module that first extracts the key frame sequence corresponding to the first person and then marks its key frames; a reference sequence correction module for marking the key frames of the key frame sequence of a third person; and a key frame sequence synthesis module for extracting reference key frame sequences and synthesizing each reference key frame sequence with the key frame sequence of the first person. Based on image processing and data mining, the invention can map reference images to the key frames that must be hand-drawn during 2D animation production, providing hand-drawing staff with references of high matching degree.

Description

Computer system, method and medium
Technical Field
The invention relates to the technical field of animation production, and in particular to a computer system.
Background
In 2D animation production, edge processing and color processing of the images are performed through deep learning. If the style of the 2D animation is to be preserved, some frame images must be replaced with frame images generated from hand-drawn key frames. Hand-drawing staff therefore have to draw super-real character actions in between the real actions, which places high demands on their artistic creativity.
Disclosure of Invention
The invention provides a computer system which solves the technical problem in the related art that 2D animation production places high demands on the artistic creativity of hand-drawing staff.
The present invention provides a computer system comprising:
a frame image extraction module for extracting first frame images from a three-dimensional animation video and second frame images from a two-dimensional animation video; a person recognition module for recognizing persons from the frame images and acquiring person pose data, wherein the person information includes the person's gender and the person pose data include person pose parameters; a person matching module for selecting a first person extracted from the first frame images and matching it against second persons extracted from the second frame images, a second person whose information is identical to that of the first person being marked as a third person; and a key frame extraction module for screening a key frame sequence from the frame images;
a pose sequence correction module that first extracts the key frame sequence corresponding to the first person and then marks its key frames, wherein the mark value of a key frame to be redrawn is 0 and the mark value of every other key frame is 1;
a reference sequence correction module for extracting the key frame sequence corresponding to the third person and then marking the key frames of the third person's key frame sequence;
a key frame sequence synthesis module for calculating the reference distances between the key frame sequence of the first person and the key frame sequences of the second persons, extracting the key frame sequences of the N second persons with the smallest reference distance to the first person's key frame sequence as reference key frame sequences, and synthesizing each reference key frame sequence with the key frame sequence of the first person.
Further, the method for screening key frames from the frame images comprises the following steps:
Step 101, extracting the person pose parameters of a selected person from each frame image and generating a person pose sequence, wherein each unit of the person pose sequence contains the person pose parameters of the selected person in one frame image;
Step 102, traversing backwards from a sequence unit of the person pose sequence until the first similarity between the traversed unit and the unit where the traversal started falls below a set first similarity threshold, and marking the unit where the traversal terminated as a first sequence unit;
Step 103, iteratively executing step 102 until all sequence units of the person pose sequence have been traversed, wherein the first execution starts from the first unit of the sequence and each subsequent execution traverses backwards from the first sequence unit generated by the previous execution;
Step 104, extracting all generated first sequence units to form a first person pose sequence;
Step 105, extracting the frame images from which the person pose parameters of the first sequence units originate as key frames, and ordering the key frames by time point to generate the key frame sequence.
Further, the calculation formula of the first similarity is:

[first-similarity formula, available only as an image (BDA0004126090470000021) in the source]

where k is the number of limbs of the human skeletal model, θ_i1 is the angle between the ith limb of the person pose parameters of a sequence unit of one person pose sequence and the ith limb of the standard pose model, and θ_i2 is the corresponding angle for the sequence unit of the other person pose sequence.
Further, the method of marking the key frames of the key frame sequence of the third person comprises:
defining a two-dimensional random field, wherein the node set of the random field is denoted V and the set of edges between the nodes is denoted E;
the energy function of the random field is:

E(X) = Σ_{p∈V} θ_p(x_p) + Σ_{(p,q)∈E} θ_pq(x_p, x_q)

where X is the set of node labels, θ_p(x_p) is the potential function of node p, θ_pq(x_p, x_q) is the potential function of the edge between nodes p and q, x_p is the mark value of the key frame mapped by node p, and x_q is the mark value of the key frame mapped by node q. A mark value of 0 indicates that the person pose parameters corresponding to the key frame deviate strongly from a real person; a mark value of 1 indicates that they are close to a real person.
The edge potential is:

[edge-potential formula, available only as an image (BDA0004126090470000032) in the source]

where simila_{p,q} is the second similarity of the person pose parameters of the key frames mapped by x_p and x_q.
The node potential is:

[node-potential formula, available only as an image (BDA0004126090470000033) in the source]

where S_1 is a set motion entropy threshold and S(x_p) is the motion entropy of the key frame mapped by x_p, calculated as:

S(x_p) = Σ_{c∈K} cos θ_{c1,c2}

where K is the set of limbs of the human skeletal model and θ_{c1,c2} is the angle between the cth limb of the person pose parameters of the key frame mapped by x_p and the cth limb of the standard pose model.
The mark values of the nodes of the random field when the energy function is minimized are taken as the mark values of the key frame sequence.
Further, the calculation formula of the second similarity is:

[second-similarity formula, available only as an image (BDA0004126090470000034) in the source]

where k is the number of limbs of the human skeletal model, θ_ip is the angle between the ith limb of one set of person pose parameters and the ith limb of the standard pose model, and θ_iq is the corresponding angle for the other set of person pose parameters.
Further, the key frame sequence synthesis module comprises:
a distance matrix generation module for establishing an m×n distance matrix whose element in the ith row and jth column is denoted d(i, j);
the value of d(i, j) is d_ij, the second distance or the third distance between the ith key frame of key frame sequence Q and the jth key frame of key frame sequence C;
a first distance calculation module for filling the distance matrix with second distances, generating paths K from the element in row 1, column 1 to the element in row m, column n, summing the matrix elements along each path K as its path distance value, selecting the path K with the smallest path distance value as the shortest path, and taking the path distance value of the shortest path as the first distance dist_1;
a fourth distance calculation module for filling the distance matrix with third distances, generating paths K in the same way, selecting the path K with the smallest path distance value as the shortest path, and taking its path distance value as the fourth distance dist_4;
a reference distance calculation module that calculates the reference distance dist_c from the first distance and the fourth distance:

[reference-distance formula, available only as an image (BDA0004126090470000041) in the source]

where t is the larger of the numbers of key frames of the two key frame sequences, and k is the number of limbs of the human skeletal model;
and a synthesis module for mapping the key frame sequence of the first person to the key frames marked 0 of the reference key frame sequence, the third distance between two key frames establishing a mapping relationship being smaller than a set third distance threshold.
Further, the calculation formula of the second distance dist_2 is:

dist_2 = |Q_i - C_j|

where Q_i is the mark value of the ith key frame of key frame sequence Q and C_j is the mark value of the jth key frame of key frame sequence C.
Further, the calculation formula of the third distance is:

[third-distance formula, available only as an image (BDA0004126090470000051) in the source]

where k is the number of limbs of the human skeletal model, θ_ip is the angle between the ith limb of one set of person pose parameters and the ith limb of the standard pose model, and θ_iq is the corresponding angle for the other set of person pose parameters.
The invention provides a method for processing images by a computer, which uses the computer system to perform the following steps:
Step 201, extracting first frame images from a three-dimensional animation video and second frame images from a two-dimensional animation video;
Step 202, recognizing persons from the frame images and acquiring person pose data;
Step 203, screening key frame sequences from the frame images;
Step 204, extracting the key frame sequences corresponding to the first person and the third person, and then marking their key frames;
Step 205, calculating the reference distances between the key frame sequence of the first person and the key frame sequences of the second persons, and extracting the key frame sequences of the N second persons with the smallest reference distance to the first person's key frame sequence as reference key frame sequences;
Step 206, synthesizing each reference key frame sequence with the key frame sequence of the first person.
The invention also provides a computer storage medium for storing the computer system described above.
The invention has the beneficial effects that:
the invention can map the reference image of the key frame which needs to be hand painted in the 2D animation 2D process based on image processing and data mining, provides a reference with higher matching degree for the hand painted staff, is convenient for imitation, and reduces the requirement on the artistic creation capability of the hand painted staff.
Drawings
FIG. 1 is a block diagram of a computer system of the present invention;
FIG. 2 is a block diagram of a key frame sequence synthesis module of the present invention;
FIG. 3 is a flow chart of a method of screening a frame image for key frames in accordance with the present invention;
fig. 4 is a flow chart of a method of processing an image by a computer of the present invention.
In the figures: frame image extraction module 101; person recognition module 102; person matching module 103; key frame extraction module 104; pose sequence correction module 105; reference sequence correction module 106; key frame sequence synthesis module 107; distance matrix generation module 1071; first distance calculation module 1072; fourth distance calculation module 1073; reference distance calculation module 1074; synthesis module 1075.
Detailed Description
The subject matter described herein will now be discussed with reference to example embodiments. It is to be understood that these embodiments are merely discussed so that those skilled in the art may better understand and implement the subject matter described herein and that changes may be made in the function and arrangement of the elements discussed without departing from the scope of the disclosure herein. Various examples may omit, replace, or add various procedures or components as desired. In addition, features described with respect to some examples may be combined in other examples as well.
Example 1
As shown in figs. 1-3, a computer system comprises:
a frame image extraction module 101 for extracting a first frame image from a three-dimensional animated video and extracting a second frame image from a two-dimensional animated video;
a person recognition module 102 for recognizing a person from the frame image and acquiring person pose data;
the information of the person includes the sex of the person, and the person posture data includes person posture parameters;
the information of the current persona may also contain more content, such as age, persona type, etc., which facilitates more accurate matching, but the amount of persona data required is large enough, otherwise more content means more filtering items, and insufficient data amount may result in insufficient data amount of the persona available for matching.
A person matching module 103, configured to select a first person extracted from the first frame image, and then match a second person extracted from the second frame image, where the first person is the same as the second person in information, and the matched second person is marked as a third person;
a key frame extraction module 104, configured to screen and obtain a key frame sequence from a frame image;
the method for screening and obtaining the key frames from the frame images comprises the following steps:
step 101, extracting character pose parameters of a selected character (the selected character is a first character or a third character) from each frame image, generating a character pose sequence, wherein each unit of the character pose sequence comprises the character pose parameters of the selected character in one frame image;
step 102, traversing backwards from the sequence unit of the character gesture sequence until the first similarity between the traversed sequence unit and the sequence unit from which the traversing starts is smaller than a set first similarity threshold; marking the sequence unit of traversal termination as a first sequence unit;
step 103, iteratively executing step 102 until all sequence units of the character gesture sequence are traversed, wherein the first sequence unit of the character gesture sequence is traversed in the first execution, and the first sequence unit generated in the previous execution is traversed backwards in each subsequent execution;
104, extracting all the generated first sequence units to generate a first human gesture sequence;
step 105, extracting a frame image of a character attitude parameter source of the first sequence unit as a key frame, and sequencing the key frame according to a time point to generate a key frame sequence;
in this step, the key frame sequence may express a plurality of actions with larger intervals, the key frame sequence is intercepted by setting a time threshold by the time difference of the time points between the key frames of the key frame sequence, and the intercepted key frame sequence enters the gesture sequence correction module 105, the reference sequence correction module 106 and the key frame sequence synthesis module 107 for processing respectively.
The time difference can be set to be 3min, and the number of truncated key frame sequences can be reduced as the time difference increases;
the calculation formula of the first similarity is as follows:
Figure BDA0004126090470000081
wherein k is the number of limbs, theta of the human skeleton model i1 Ith limb and standard gesture of character gesture parameter of sequence unit of character gesture sequenceIncluded angle of ith limb in model; θ i2 An included angle between the ith limb of the character gesture parameter of the sequence unit of the gesture sequence of another character and the ith limb in the standard gesture model;
in one embodiment of the invention, the person pose parameters are generated from the image based on a person pose estimation algorithm such as the openPose algorithm.
The first similarity threshold may be adjusted according to the number of limbs of the skeletal model, and is generally inversely proportional to the number of limbs of the skeletal model.
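For illustration only, steps 101-105 can be sketched as follows. The first-similarity formula is published only as an image, so the cosine-based form used here is an assumption, as are all function and variable names:

    import math

    def first_similarity(pose_a, pose_b):
        # Assumed form of the first similarity: mean cosine of the per-limb
        # angle differences (the patent gives the formula only as an image).
        return sum(math.cos(a - b) for a, b in zip(pose_a, pose_b)) / len(pose_a)

    def screen_key_frames(pose_sequence, threshold):
        # pose_sequence: one unit per frame image, each a pair
        # (time_point, [theta_1, ..., theta_k]) for the selected person (step 101).
        key_units = []
        start = 0
        while start < len(pose_sequence):
            stop = None
            for j in range(start + 1, len(pose_sequence)):  # step 102: traverse toward later units
                if first_similarity(pose_sequence[start][1], pose_sequence[j][1]) < threshold:
                    stop = j  # traversal terminates: a first sequence unit
                    break
            if stop is None:  # step 103: all remaining units traversed
                break
            key_units.append(pose_sequence[stop])  # step 104
            start = stop
        # step 105: key frames ordered by time point
        return sorted(t for t, _ in key_units)

The returned time points identify the key frames; cutting the resulting sequence wherever two adjacent time points differ by more than the set time threshold (e.g. 3 min) then yields the sub-sequences handed to modules 105-107.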
A pose sequence correction module 105 first extracts the key frame sequence corresponding to the first person and then marks its key frames, wherein the mark value of a key frame to be redrawn is 0 and the mark value of every other key frame is 1.
A reference sequence correction module 106 extracts the key frame sequence corresponding to the third person and then marks the key frames of the third person's key frame sequence.
the method for marking the key frames of the key frame sequence of the third person comprises the following steps:
defining a two-dimensional random field, wherein a node set of the two-dimensional random field is denoted as V, and a set of edges between the nodes is denoted as E;
the energy function of the two-dimensional random field is:
Figure BDA0004126090470000082
wherein X is a node set, θ p (x p ) As a potential function of node p, θ pq (x p ,x q ) As a potential function of the edge between nodes p and q, x p The marker value, x, of the image node mapped for node p q A marker value of a key frame sequence mapped for the node q; a mark value of 0 indicates that the character gesture parameter corresponding to the key frame has larger deviation from the real character; a mark value of 1 indicates that the character gesture parameter corresponding to the key frame is close to the real character;
wherein,,
Figure BDA0004126090470000091
simila p,q represents x p And x q A second similarity of character pose parameters mapped by the mapped keyframes;
the calculation formula of the second similarity is as follows:
Figure BDA0004126090470000092
wherein k is the number of limbs, theta of the human skeleton model ip An included angle between the ith limb of a person posture parameter and the ith limb in the standard posture model; θ iq An included angle between the ith limb of the gesture parameter of another person and the ith limb in the standard gesture model;
wherein,,
Figure BDA0004126090470000093
S 1 for a set movement entropy threshold, S (x p ) Is x p The motion entropy of the mapped key frame is calculated as follows:
Figure BDA0004126090470000094
wherein K is the set of limbs of the human skeletal model, wherein θ c1,c2 Represents x p The included angle between the c-th limb of the character gesture parameters of the mapped key frame and the c-th limb in the standard gesture model;
the standard posture model is a limb angle parameter of the human body skeleton model in a natural standing state that the human body is naturally hung by both hands.
The marker value of each node of the two-dimensional random field is used as the marker value of the key frame sequence when the energy function is minimized.
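The patent publishes the node and edge potentials only as images and does not fix a minimisation algorithm, so the following is purely an illustrative sketch: for a single key-frame sequence the random field reduces to a chain, on which a binary-mark energy of the stated form can be minimised exactly by dynamic programming. The concrete potentials (a motion-entropy gate for the node term, a similarity-weighted disagreement penalty for the edge term) and all names are assumptions:

    import math

    def motion_entropy(pose):
        # S(x_p) = sum over limbs c in K of cos(theta_c), as defined above;
        # pose is the list of per-limb angles against the standard pose model.
        return sum(math.cos(theta) for theta in pose)

    def node_potential(label, pose, s1):
        # Assumed node term: a frame whose motion entropy exceeds the threshold
        # S_1 prefers mark 1 (close to a real person), otherwise mark 0; the
        # direction of this gate is a guess, the formula being image-only.
        preferred = 1 if motion_entropy(pose) > s1 else 0
        return 0.0 if label == preferred else 1.0

    def edge_potential(label_p, label_q, similarity):
        # Assumed edge term (Potts-style): neighbouring key frames with a high
        # second similarity are penalised for receiving different mark values.
        return similarity if label_p != label_q else 0.0

    def mark_key_frames(poses, pair_similarities, s1):
        # Exact minimisation of E(X) = sum_p theta_p(x_p) + sum_(p,q) theta_pq(x_p, x_q)
        # over marks {0, 1} along the key-frame chain (Viterbi-style DP);
        # pair_similarities[i] is the second similarity of frames i and i + 1.
        n = len(poses)
        INF = float("inf")
        cost = [[INF, INF] for _ in range(n)]
        back = [[0, 0] for _ in range(n)]
        for lab in (0, 1):
            cost[0][lab] = node_potential(lab, poses[0], s1)
        for i in range(1, n):
            for lab in (0, 1):
                best, best_prev = INF, 0
                for prev in (0, 1):
                    c = cost[i - 1][prev] + edge_potential(prev, lab, pair_similarities[i - 1])
                    if c < best:
                        best, best_prev = c, prev
                cost[i][lab] = best + node_potential(lab, poses[i], s1)
                back[i][lab] = best_prev
        marks = [0] * n
        marks[-1] = 0 if cost[-1][0] <= cost[-1][1] else 1
        for i in range(n - 1, 0, -1):
            marks[i - 1] = back[i][marks[i]]
        return marks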
A key frame sequence synthesis module 107 calculates the reference distances between the key frame sequence of the first person and the key frame sequences of the second persons, and extracts the key frame sequences of the N second persons with the smallest reference distance to the first person's key frame sequence as reference key frame sequences.
Each reference key frame sequence is synthesized with the key frame sequence of the first person.
The key frame sequence synthesis module 107 includes:
a distance matrix generation module 1071 that establishes an m×n distance matrix whose element in the ith row and jth column is denoted d(i, j);
the value of d(i, j) is d_ij, the second distance or the third distance between the ith key frame of key frame sequence Q and the jth key frame of key frame sequence C;
a first distance calculation module 1072 that fills the distance matrix with second distances, generates paths K from the element in row 1, column 1 to the element in row m, column n, sums the matrix elements along each path K as its path distance value, selects the path K with the smallest path distance value as the shortest path, and takes the path distance value of the shortest path as the first distance dist_1;
a fourth distance calculation module 1073 that fills the distance matrix with third distances, generates paths K in the same way, selects the path K with the smallest path distance value as the shortest path, and takes its path distance value as the fourth distance dist_4.
The calculation formula of the second distance dist_2 is:

dist_2 = |Q_i - C_j|

where Q_i is the mark value of the ith key frame of key frame sequence Q and C_j is the mark value of the jth key frame of key frame sequence C.
The calculation formula of the third distance is:

[third-distance formula, available only as an image (BDA0004126090470000111) in the source]

where k is the number of limbs of the human skeletal model, θ_ip is the angle between the ith limb of one set of person pose parameters and the ith limb of the standard pose model, and θ_iq is the corresponding angle for the other set of person pose parameters.
A reference distance calculation module 1074 calculates the reference distance dist_c from the first distance and the fourth distance:

[reference-distance formula, available only as an image (BDA0004126090470000112) in the source]

where t is the larger of the numbers of key frames of the two key frame sequences, and k is the number of limbs of the human skeletal model.
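The shortest-path construction used by the first and fourth distance calculation modules is the accumulated-cost recurrence familiar from dynamic time warping; a minimal sketch, assuming the path may step down, right, or diagonally (the allowed steps, like all names here, are assumptions):

    def path_distance(seq_q, seq_c, frame_dist):
        # Fill the m-by-n matrix d(i, j) = frame_dist(Q_i, C_j) and return the
        # smallest possible sum of matrix elements along a monotone path from
        # d(1, 1) to d(m, n). With the second distance as frame_dist this gives
        # dist_1; with the third distance it gives dist_4.
        INF = float("inf")
        m, n = len(seq_q), len(seq_c)
        acc = [[INF] * n for _ in range(m)]
        for i in range(m):
            for j in range(n):
                d = frame_dist(seq_q[i], seq_c[j])
                if i == 0 and j == 0:
                    acc[i][j] = d
                else:
                    acc[i][j] = d + min(
                        acc[i - 1][j] if i > 0 else INF,
                        acc[i][j - 1] if j > 0 else INF,
                        acc[i - 1][j - 1] if i > 0 and j > 0 else INF,
                    )
        return acc[m - 1][n - 1]

    def second_distance(frame_q, frame_c):
        # dist_2 = |Q_i - C_j| on the 0/1 mark values, as defined above.
        return abs(frame_q["mark"] - frame_c["mark"])

Here dist_1 would be path_distance(Q, C, second_distance), and dist_4 would be obtained the same way with a third-distance function built on the per-limb angles.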
The synthesis module 1075 maps the key frame sequence of the first person to the key frames marked 0 of the reference key frame sequence; the third distance between two key frames establishing a mapping relationship is smaller than a set third distance threshold.
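A sketch of this synthesis step; the patent only requires that the third distance of a mapped pair stay below the set threshold, so the nearest-neighbour pairing rule, the dictionary layout of a key frame, and the helper names are all assumptions:

    def synthesize(first_seq, ref_seq, third_distance, threshold):
        # Pair each key frame of the first person marked 0 (to be redrawn)
        # with the closest reference key frame that is also marked 0, keeping
        # the pair only if the third distance is below the threshold.
        mapping = {}
        for i, frame in enumerate(first_seq):
            if frame["mark"] != 0:
                continue
            candidates = [
                (third_distance(frame["pose"], ref["pose"]), j)
                for j, ref in enumerate(ref_seq)
                if ref["mark"] == 0
            ]
            if candidates:
                dist, j = min(candidates)
                if dist < threshold:
                    mapping[i] = j  # ref_seq[j] is the reference image to imitate
        return mapping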
As shown in fig. 4, this embodiment provides a method for processing images by a computer, which uses the computer system described above to perform the following steps:
Step 201, extracting first frame images from a three-dimensional animation video and second frame images from a two-dimensional animation video;
Step 202, recognizing persons from the frame images and acquiring person pose data;
Step 203, screening key frame sequences from the frame images;
Step 204, extracting the key frame sequences corresponding to the first person and the third person, and then marking their key frames;
Step 205, calculating the reference distances between the key frame sequence of the first person and the key frame sequences of the second persons, and extracting the key frame sequences of the N second persons with the smallest reference distance to the first person's key frame sequence as reference key frame sequences;
Step 206, synthesizing each reference key frame sequence with the key frame sequence of the first person.
The embodiment has been described above, but it is not limited to the specific implementation described, which is illustrative rather than restrictive; those of ordinary skill in the art, given the benefit of this disclosure, may derive many further forms that remain within the scope of this embodiment.

Claims (10)

1. A computer system, comprising:
a frame image extraction module for extracting first frame images from a three-dimensional animation video and second frame images from a two-dimensional animation video; a person recognition module for recognizing persons from the frame images and acquiring person pose data, wherein the person information includes the person's gender and the person pose data include person pose parameters; a person matching module for selecting a first person extracted from the first frame images and matching it against second persons extracted from the second frame images, a second person whose information is identical to that of the first person being marked as a third person; and a key frame extraction module for screening a key frame sequence from the frame images;
a pose sequence correction module that first extracts the key frame sequence corresponding to the first person and then marks its key frames, wherein the mark value of a key frame to be redrawn is 0 and the mark value of every other key frame is 1;
a reference sequence correction module for extracting the key frame sequence corresponding to the third person and then marking the key frames of the third person's key frame sequence;
and a key frame sequence synthesis module for calculating the reference distances between the key frame sequence of the first person and the key frame sequences of the second persons, extracting the key frame sequences of the N second persons with the smallest reference distance to the first person's key frame sequence as reference key frame sequences, and synthesizing each reference key frame sequence with the key frame sequence of the first person.
2. The computer system of claim 1, wherein the method for screening key frames from the frame images comprises:
step 101, extracting the person pose parameters of a selected person from each frame image and generating a person pose sequence, wherein each unit of the person pose sequence contains the person pose parameters of the selected person in one frame image;
step 102, traversing backwards from a sequence unit of the person pose sequence until the first similarity between the traversed unit and the unit where the traversal started falls below a set first similarity threshold, and marking the unit where the traversal terminated as a first sequence unit;
step 103, iteratively executing step 102 until all sequence units of the person pose sequence have been traversed, wherein the first execution starts from the first unit of the sequence and each subsequent execution traverses backwards from the first sequence unit generated by the previous execution;
step 104, extracting all generated first sequence units to form a first person pose sequence;
step 105, extracting the frame images from which the person pose parameters of the first sequence units originate as key frames, and ordering the key frames by time point to generate the key frame sequence.
3. The computer system of claim 2, wherein the calculation formula of the first similarity is:
[first-similarity formula, available only as an image (FDA0004126090460000021) in the source]
where k is the number of limbs of the human skeletal model, θ_i1 is the angle between the ith limb of the person pose parameters of a sequence unit of one person pose sequence and the ith limb of the standard pose model, and θ_i2 is the corresponding angle for the sequence unit of the other person pose sequence.
4. The computer system of claim 1, wherein the method of marking the key frames of the key frame sequence of the third person comprises:
defining a two-dimensional random field, wherein the node set of the random field is denoted V and the set of edges between the nodes is denoted E;
the energy function of the random field is:
E(X) = Σ_{p∈V} θ_p(x_p) + Σ_{(p,q)∈E} θ_pq(x_p, x_q)
where X is the set of node labels, θ_p(x_p) is the potential function of node p, θ_pq(x_p, x_q) is the potential function of the edge between nodes p and q, x_p is the mark value of the key frame mapped by node p, and x_q is the mark value of the key frame mapped by node q; a mark value of 0 indicates that the person pose parameters corresponding to the key frame deviate strongly from a real person, and a mark value of 1 indicates that they are close to a real person;
the edge potential is:
[edge-potential formula, available only as an image (FDA0004126090460000023) in the source]
where simila_{p,q} is the second similarity of the person pose parameters of the key frames mapped by x_p and x_q;
the node potential is:
[node-potential formula, available only as an image (FDA0004126090460000031) in the source]
where S_1 is a set motion entropy threshold and S(x_p) is the motion entropy of the key frame mapped by x_p, calculated as:
S(x_p) = Σ_{c∈K} cos θ_{c1,c2}
where K is the set of limbs of the human skeletal model and θ_{c1,c2} is the angle between the cth limb of the person pose parameters of the key frame mapped by x_p and the cth limb of the standard pose model;
the mark values of the nodes of the random field when the energy function is minimized are taken as the mark values of the key frame sequence.
5. The computer system of claim 4, wherein the calculation formula of the second similarity is:
[second-similarity formula, available only as an image (FDA0004126090460000032) in the source]
where k is the number of limbs of the human skeletal model, θ_ip is the angle between the ith limb of one set of person pose parameters and the ith limb of the standard pose model, and θ_iq is the corresponding angle for the other set of person pose parameters.
6. The computer system of claim 1, wherein the key frame sequence synthesis module comprises:
a distance matrix generation module for establishing an m×n distance matrix whose element in the ith row and jth column is denoted d(i, j);
the value of d(i, j) is d_ij, the second distance or the third distance between the ith key frame of key frame sequence Q and the jth key frame of key frame sequence C;
a first distance calculation module for filling the distance matrix with second distances, generating paths K from the element in row 1, column 1 to the element in row m, column n, summing the matrix elements along each path K as its path distance value, selecting the path K with the smallest path distance value as the shortest path, and taking the path distance value of the shortest path as the first distance dist_1;
a fourth distance calculation module for filling the distance matrix with third distances, generating paths K in the same way, selecting the path K with the smallest path distance value as the shortest path, and taking its path distance value as the fourth distance dist_4;
a reference distance calculation module that calculates the reference distance dist_c from the first distance and the fourth distance:
[reference-distance formula, available only as an image (FDA0004126090460000041) in the source]
where t is the larger of the numbers of key frames of the two key frame sequences, and k is the number of limbs of the human skeletal model;
and a synthesis module for mapping the key frame sequence of the first person to the key frames marked 0 of the reference key frame sequence, the third distance between two key frames establishing a mapping relationship being smaller than a set third distance threshold.
7. The computer system of claim 6, wherein the calculation formula of the second distance dist_2 is:
dist_2 = |Q_i - C_j|
where Q_i is the mark value of the ith key frame of key frame sequence Q and C_j is the mark value of the jth key frame of key frame sequence C.
8. The computer system of claim 6, wherein the calculation formula of the third distance is:
[third-distance formula, available only as an image (FDA0004126090460000042) in the source]
where k is the number of limbs of the human skeletal model, θ_ip is the angle between the ith limb of one set of person pose parameters and the ith limb of the standard pose model, and θ_iq is the corresponding angle for the other set of person pose parameters.
9. A method for processing images by a computer, characterized in that a computer system according to any one of claims 1-8 is used to perform the following steps:
step 201, extracting first frame images from a three-dimensional animation video and second frame images from a two-dimensional animation video;
step 202, recognizing persons from the frame images and acquiring person pose data;
step 203, screening key frame sequences from the frame images;
step 204, extracting the key frame sequences corresponding to the first person and the third person, and then marking their key frames;
step 205, calculating the reference distances between the key frame sequence of the first person and the key frame sequences of the second persons, and extracting the key frame sequences of the N second persons with the smallest reference distance to the first person's key frame sequence as reference key frame sequences;
step 206, synthesizing each reference key frame sequence with the key frame sequence of the first person.
10. A computer storage medium storing a computer system according to any one of claims 1-8.
CN202310246436.8A | Priority and filing date: 2023-03-15 | Title: Computer system, method and medium | Status: Pending | Publication: CN116310015A (en)

Priority Applications (1)

Application Number | Priority Date | Filing Date | Title
CN202310246436.8A | 2023-03-15 | 2023-03-15 | Computer system, method and medium (published as CN116310015A)

Applications Claiming Priority (1)

Application Number | Priority Date | Filing Date | Title
CN202310246436.8A | 2023-03-15 | 2023-03-15 | Computer system, method and medium (published as CN116310015A)

Publications (1)

Publication Number | Publication Date
CN116310015A (en) | 2023-06-23

Family

ID=86816305

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
CN202310246436.8A (published as CN116310015A) | Computer system, method and medium | 2023-03-15 | 2023-03-15

Country Status (1)

Country Link
CN (1) CN116310015A (en)


Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101958007A (en) * 2010-09-20 2011-01-26 南京大学 Three-dimensional animation posture modeling method by adopting sketch
CN102682302A (en) * 2012-03-12 2012-09-19 浙江工业大学 Human body posture identification method based on multi-characteristic fusion of key frame
CN110047096A (en) * 2019-04-28 2019-07-23 中南民族大学 A kind of multi-object tracking method and system based on depth conditions random field models
CN110602527A (en) * 2019-09-12 2019-12-20 北京小米移动软件有限公司 Video processing method, device and storage medium
CN115797851A (en) * 2023-02-09 2023-03-14 安徽米娱科技有限公司 Animation video processing method and system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
张晓翔 et al., "Multi-category video segmentation based on incremental inference over dynamic dense conditional random fields", Application Research of Computers, vol. 37, no. 12, 31 December 2020 (2020-12-31), pages 3781-3787 *

Similar Documents

Publication Publication Date Title
CN109408653B (en) Human body hairstyle generation method based on multi-feature retrieval and deformation
US20200320345A1 (en) System and method for visual recognition using synthetic training data
CN108027878B (en) Method for face alignment
Yang et al. Extraction of 2d motion trajectories and its application to hand gesture recognition
CN109086706B (en) Motion recognition method based on segmentation human body model applied to human-computer cooperation
Uddin et al. Human activity recognition using body joint‐angle features and hidden Markov model
EP0901667A2 (en) Principal component analysis of image/control-point location coupling for the automatic location of control points
CN108388882A (en) Based on the gesture identification method that the overall situation-part is multi-modal RGB-D
CN111680550B (en) Emotion information identification method and device, storage medium and computer equipment
CN106682585A (en) Dynamic gesture identifying method based on kinect 2
Seddik et al. Unsupervised facial expressions recognition and avatar reconstruction from Kinect
Auephanwiriyakul et al. Thai sign language translation using scale invariant feature transform and hidden markov models
Ari et al. Facial feature tracking and expression recognition for sign language
Yu et al. A video-based facial motion tracking and expression recognition system
Luqman An efficient two-stream network for isolated sign language recognition using accumulative video motion
Neverova Deep learning for human motion analysis
Agarwal et al. Synthesis of realistic facial expressions using expression map
Baumberger et al. 3d face reconstruction from video using 3d morphable model and silhouette
CN115008454A (en) Robot online hand-eye calibration method based on multi-frame pseudo label data enhancement
Huynh-The et al. Learning action images using deep convolutional neural networks for 3D action recognition
CN115797851B (en) Cartoon video processing method and system
CN110516638B (en) Sign language recognition method based on track and random forest
CN108537855B (en) Ceramic stained paper pattern generation method and device with consistent sketch
CN116310015A (en) Computer system, method and medium
US20240062441A1 (en) System and method for photorealistic image synthesis using unsupervised semantic feature disentanglement

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination