CN110310351A - A kind of 3 D human body skeleton cartoon automatic generation method based on sketch - Google Patents

A kind of 3 D human body skeleton cartoon automatic generation method based on sketch Download PDF

Info

Publication number
CN110310351A
CN110310351A CN201910597737.9A CN201910597737A CN110310351A CN 110310351 A CN110310351 A CN 110310351A CN 201910597737 A CN201910597737 A CN 201910597737A CN 110310351 A CN110310351 A CN 110310351A
Authority
CN
China
Prior art keywords
network
sketch
human body
data
skeleton cartoon
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910597737.9A
Other languages
Chinese (zh)
Other versions
CN110310351B (en
Inventor
马昊
李淑琴
丁濛
孟坤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Information Science and Technology University
Original Assignee
Beijing Information Science and Technology University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Information Science and Technology University filed Critical Beijing Information Science and Technology University
Priority to CN201910597737.9A priority Critical patent/CN110310351B/en
Publication of CN110310351A publication Critical patent/CN110310351A/en
Application granted granted Critical
Publication of CN110310351B publication Critical patent/CN110310351B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • G06T13/203D [Three Dimensional] animation
    • G06T13/403D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/44Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/20Movements or behaviour, e.g. gesture recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Multimedia (AREA)
  • Evolutionary Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Psychiatry (AREA)
  • Human Computer Interaction (AREA)
  • Social Psychology (AREA)
  • Processing Or Creating Images (AREA)

Abstract

The present invention relates to a kind of 3 D human body skeleton cartoon automatic generation method based on sketch, this system is to simplify the interactive mode of skeleton cartoon production as setting about a little, the head and the tail frame of animation to be generated is provided in a manner of the input of sketch image, realize that the sketch three-dimensional reconstruction of system and skeleton cartoon interpolation frame information are automatically synthesized two parts module respectively using tensorflow building neural network framework, it is final to realize the 3 D human body skeleton cartoon automatic generation method based on sketch.System front and back end code logic separation, flexibility with higher.The system can carry out pretreatment operation to user's input picture automatically, to meet each section network inputs format, can reduce the complexity in the interaction of whole system.

Description

A kind of 3 D human body skeleton cartoon automatic generation method based on sketch
Technical field
The present invention relates to Computer Animated Graph field more particularly to a kind of 3 D human body skeleton cartoons based on sketch certainly Dynamic generation method.
Background technique
In computer graphics, dimensional Modeling Technology provides many necessary methods and is used to object in the real world Body is converted into the mathematical expression form under three-dimensional system of coordinate, and is rendered by computer program, to realize virtual empty Between simulate real world effect.The current 3 d modeling softwares for having many maturations, such as CAD, Maya, 3DsMax etc., and The extensive use of every field has all been obtained in actual three-dimensional modeling.Although these 3 d modeling softwares are widely used for Production environment, but generally require the training by profession in the use of software and there is the learning curve of steeper. Traditional 3 d modeling software is emphasized to limit using input form of the cumbersome rules of interaction to user, builds to improve Mould precision, therefore, even for profession modeling personnel for modeling task also can be costly time overhead.
In order to solve this problem, the dimensional Modeling Technology based on sketch is provided using cartographical sketching side as input The effective ways of formula progress Geometric Modeling.Modeling technique based on sketch describe it is a kind of based on gesture for construction indicate (CSG) rapid modeling for simple threedimensional model may be implemented in the method modeled, this method.Three-dimensional modeling based on sketch Technology finally exports corresponding three-dimensional mould according to the sketch outline information of input mainly using two-dimentional sketch curve as input Type.Modeling technique based on sketch is mainly to be the user with painting ability but shortage 3 d modeling software use experience and mention Out and design.In recent years, the modeling technique based on sketch is used for three-dimensional modeling task more and more by people, and may be used To be widely used in some special dimensions, such as animal modeling, game role modeling, dress designing, scalp electroacupuncture.Pass through letter Single two-dimentional sketch outline stroke, user can be used sketch input interface and carry out the complex model pair with free geometric jacquard patterning unit surface The modeling of elephant, so as to reduce the time overhead of modeling period in a more effective manner.
For three-dimensional modeling, Computer Animated Graph is always the key points and difficulties studied.Computer animation is Existing in the form of animation sequence, need to summarize animated actions using storyboard on this basis, and in animation time A fixed scene is drawn in detail in the specific key frame of axis, it is desirable that can reflect in each key frame entire Multidate information of the animation in particular moment.In order to guarantee the continuity of animation sequence, needs to be added between a pair of of key frame and insert It is worth frame, and the number density of interpolated frame determines the quality of animation.For relative two dimensional animation, three-dimensional animation needs more rule It then limits, more there is challenge in the research of wherein 3 D human body skeleton cartoon.In 3 D human body skeleton cartoon, human body is benefit It is modeled with articulated chain body, wherein particular moment skeletal joint point can be reflected in the world in each fixed key frame Specific coordinate position in coordinate system.In complete three-dimensional skeleton cartoon manufacturing process, it is necessary first to skeleton structure Three-dimensional modeling is carried out to need to model personnel during modeling to simulate the positional relationship of real human body skeletal joint point The three-dimensional modeling and skeleton structure knowledge for grasping profession, need to put into biggish manpower and time cost;Followed by need With reference to artis during real human body skeleton motion coordinate position variation track so that it is determined that key frame and interpolated frame model And carry out necessary editing.Although entire manufacturing process can be realized by using animation soft, by In the rules of interaction for needing a large amount of professional domain knowledge and complexity, therefore, it has become the bottlenecks that limitation user uses.
Summary of the invention
In view of this, the application provides a kind of 3 D human body skeleton cartoon automatic generation method based on sketch.The system It is automatic by way of inputting any two human actions sketch image using sketch modeling technique and the solution of deep learning method 3 D human body skeleton cartoon is generated, to improve 3 D human body skeleton cartoon by the frequency of interaction and complexity of reduction system Producing efficiency.
The application is achieved by the following technical solution:
A kind of 3 D human body skeleton cartoon automatic generation method based on sketch, this method comprises the following steps
Step 1, it realizes the interaction with user, receives the human action sketch image file of user's input;
Step 2, background model is called according to the sketch image file;
Step 3, the missing in completion animation sequence is carried out according to the human action information in animation sequence in head and the tail frame to insert Value frame is automatically synthesized, and then realizes the generation of full animation sequence;
Step 4, by the generation data render of the full animation sequence to screen, visual 3 D human body bone is obtained Animation.
Further, in the step 2, background model is called according to the sketch image file, is specifically included:
Step 201, the human action sketch image inputted according to user carries out image pre-processing method, to be met The sketch image data of network inputs format;
Step 202, sketch image recognition web tab is formulated, is obtained for describing network to sketch recognition capability Export result;
Step 203, sketch image recognition network training is carried out, specific sketch recognition result is obtained according to input sketch image And realize the mapping for arriving three-dimensional space skeletal joint point coordinate information;
Step 204, the coordinate information of skeletal joint point in human body three-dimensional space is obtained.
Further, described image preprocess method in step 201, specifically includes:
The sketch image data that successively user is inputted using profile testing method, fill method, equal proportion Zoom method Image transformation is carried out, to obtain meeting the network inputs of the 3 D human body skeleton model reconstruction model based on sketch image.
Further, the sketch successively user inputted using profile testing method, fill method, equal proportion Zoom method Image data carries out image transformation, specifically includes:
Sketch image is inputted to user and carries out human body closed curve contour detecting, so that human body can be described by obtaining in image The main region part of movement;
According to the obtained body curve's profile of the contour detecting to closing section realize fill, with improve image for The descriptive power of human action;
Convert original image to the network inputs for meeting the 3 D human body skeleton model reconstruction model based on sketch image Format, and shield the Unnecessary detail information in original image.
Further, in step 202, the formulation sketch image recognition web tab, specifically includes:
Label is divided into three levels according to the relationship between human action, respectively action classification, movement pattern classification and Act frame category, three kinds of labels for motion images descriptive power from thick to thin, final action action frame class label be used to retouch State the action message of single frames in specific animation sequence.
Further, in step 203, the progress sketch image recognition network training, specifically includes:
The identification of action sketch image is carried out using convolutional neural networks layering according to the formulation of sketch recognition web tab With classification, comprising: training method, parameter adjustment and error function setting;
The training method uses tensorflow as deep learning tool, and gradually right using the mode of hierarchical classification Network is trained, and decomposes the model of more difficult training using the mode of Model Fusion in training and carry out weak typing The training of model, and every department pattern is merged, obtain final result.
For adjusting network parts parameter to be optimal effect, parameter includes: convolution kernel ruler for the parameter adjustment Very little, weight and deviation Initialize installation, convolution layer number, optimizer setting and learning rate Initialize installation;
Wherein the convolutional layer quantity determines the dimension and network query function amount of character representation, convolutional layer more multiple features indicate more Abstract, while calculation amount is also bigger, the fewer character representation of convolutional layer is smaller with hour operation quantity closer to initial data.
Further, in step 3, the human action information according in animation sequence in head and the tail frame complete dynamic Being automatically synthesized for the missing interpolated frame in sequence is drawn, is specifically included:
When given any two act frame data, using data-oriented as the head of one section of Complete three-dimensional skeleton animation Tail frame simultaneously automatically generates the interpolation frame data lacked between two frames, used method include skeleton cartoon feature extracting method and Interpolated frame automatic synthesis method.
Further, the skeleton cartoon feature extracting method passes through coding and decoding using convolution autoencoder network structure Operation undergoes data to regenerate process, and the input data of network is complete skeleton cartoon sequence data, network it is final Output is the regeneration data of animation sequence, and optimisation strategy when network training is minimized between initial data and regeneration data Variance distance, trained model can pass through coding and calculate the feature extraction for realizing original skeleton cartoon.
Further, the mode that the interpolated frame automatic synthesis method is combined using convolution feedforward network with interpolation arithmetic The gradually variation tendency between recovery action ultimately generates complete skeleton cartoon sequence, includes using nearest in network layer structure Interpolated layer, convolutional layer and the active coating of adjacent interpolation strategies.
Further, the interpolated layer using arest neighbors interpolation strategies, specifically includes: arest neighbors interpolation strategies can be to original Beginning data carry out the amplification in size, and can retain primary data information (pdi), by being staged through interpolated layer in network query function Calculate to meet final output data format size;
Data Jing Guo interpolated layer are abstracted and are reversed by the convolutional layer, realize the fitting to target output;
The active coating for increasing the non-linear of network, and reduces the relation of interdependence in network parameter, alleviates The over-fitting of network.
Compared with the prior art, the advantages of the present invention are as follows:
1) it proposes to combine using arest neighbors interpolation with convolution strategy on carrying out the design that interpolated frame is automatically synthesized model Network model, and according to the variance of the output of true animation sequence and network model output apart from step-up error function, so as to To improve model level by minimizing error amount in the network training stage.
2) sketch Three-dimension Reconstruction Model and 3 D human body skeleton cartoon model is automatically synthesized to encapsulate using layer architecture To specific functional module and realize complete interactive function.
Detailed description of the invention
The drawings described herein are used to provide a further understanding of the present invention, constitutes part of this application, this hair Bright illustrative embodiments and their description are used to explain the present invention, and are not constituted improper limitations of the present invention.
Fig. 1 is the composite structural diagram of automatic generation method of the invention;
Fig. 2 is the image list that three-dimensional skeletal joint dot position information rendering generates;
Fig. 3 is using two-dimentional sketch image recognition model schematic layered;
The trend chart of error and accuracy when Fig. 4 is trained;
Fig. 5 is the schematic diagram realizing three-dimensional skeleton model according to the Freehandhand-drawing action sketch of input and rebuilding;
Fig. 6 is that skeleton cartoon is automatically synthesized model structure schematic diagram;
Fig. 7 is the training Time Duration Error trend graph that animation feature extracts model;
Fig. 8 is that animation feature extracts model test results figure;
Fig. 9 is the error trend graph that interpolated frame is automatically synthesized network model t raining period;
Figure 10 is automatically generated the system sequence figure of method;
Figure 11 is automatically generated the operation result exemplary diagram of method.
Specific embodiment
Example embodiments are described in detail here, and the example is illustrated in the accompanying drawings.Following description is related to When attached drawing, unless otherwise indicated, the same numbers in different drawings indicate the same or similar elements.Following exemplary embodiment Described in embodiment do not represent all embodiments consistent with the application.On the contrary, they be only with it is such as appended The example of the consistent device and method of some aspects be described in detail in claims, the application.
It is only to be not intended to be limiting the application merely for for the purpose of describing particular embodiments in term used in this application. It is also intended in the application and the "an" of singular used in the attached claims, " described " and "the" including majority Form, unless the context clearly indicates other meaning.It is also understood that term "and/or" used herein refers to and wraps It may be combined containing one or more associated any or all of project listed.
Below in conjunction with attached drawing and example, the present invention is described in further detail.
In the manufacturing process of three-dimensional animation, compared with using traditional animation Software for producing, this mode of sketch drafting for Simpler easy for user, user is not required to the excessive professional art knowledge of to master and rules of interaction, uses 2 D animation This theory of production that input mode carries out three-dimensional animation can increase substantially entire production from the angle for simplifying input form The efficiency in period.On the other hand, with deep learning especially convolutional neural networks (Convolutional Neural Networks, CNN) Successful utilization and a large amount of online actions capture number of the model in computer graphics and computer vision According to the presence in library, the production of skeleton animation is also advanced towards more intelligent direction.By using deep learning algorithm The change in location relationship in motion capture data library between movement and artis is extracted and learns, and then automatic by computer program Key frame and interpolated frame are generated to the manual skeleton cartoon editing for replacing tradition complicated, further reduced interactive answer Miscellaneous degree, improves producing efficiency.
Fig. 1 shows automatic for the 3 D human body skeleton cartoon based on sketch described in 1 in accordance with an embodiment of the present disclosure The frame of generation method.
It is given birth to automatically refering to what is shown in Fig. 1, embodiment of the disclosure 1 provides a kind of 3 D human body skeleton cartoon based on sketch At method, it should be noted that step shown in the flowchart of the accompanying drawings can be in such as a group of computer-executable instructions It is executed in computer system, but in some cases, it can be to be different from step shown or described by sequence execution herein Suddenly.Refering to what is shown in Fig. 1, the 3 D human body skeleton cartoon automatic generation method based on sketch includes:
Head and the tail frame human action sketch image: the sketch image input by user that can describe human action is used respectively It is acted to describe first frame and the tail frame of animation sequence to be generated;
Image preprocessing: original input picture is located in advance using contour detecting, filling and equal proportion scaling respectively Reason, purpose meet subsequent network input size, and the crucial Pixel Information in enlarged drawing;
Type of action sorter network: the classification of action classification is carried out using the strategy of Model Fusion, wherein weak point of every part Class device is all made of two convolutional layers and two full articulamentums, and wherein the number of convolution kernel is respectively 32 and 64 in convolutional layer, For convolution kernel having a size of 3 × 3, full articulamentum interior joint number is 1024, and Model Fusion is using monolayer BP network as fusion device;
Movement pattern formal classification network: using the convolutional neural networks of single step mode, connected entirely by two layers of convolutional layer and two Layer composition is connect, wherein the number of convolution kernel is respectively 32 and 64 in convolutional layer, and convolution kernel is having a size of 3 × 3, full articulamentum interior joint Number is 1024, for identification the specific movement pattern form in certain action classification;
Crucial frame alignment network: using the convolutional neural networks of single step mode, by two layers of convolutional layer and two full articulamentum groups At;Wherein the number of convolution kernel is respectively 32 and 128 in convolutional layer, and convolution kernel is having a size of 3 × 3, full articulamentum interior joint number It is 2056, specifically acts frame information with identification;
3 D human body skeletal joint point coordinate information: point for finally entering sketch can be exported by crucial frame alignment network Class is as a result, can obtain 3 D human body bone from classification results to the mapping relations three-dimensional skeletal joint point coordinate according to known The coordinate information of bone artis;
3 D human body skeleton cartoon interpolated frame is automatically synthesized network: using arest neighbors interpolation before convolution strategy combines Present network structure, with complete movement tendency information is automatically synthesized from known the first two frames action message;
Skeleton cartoon feature extraction and Restoration model: this department pattern both can obtain animation according to existing animation sequence Characteristic information, on the contrary can also be according to the final complete animation sequence of animation feature Information recovering.It is automatically synthesized by interpolated frame The input of network can obtain one section of complete motion characteristic information, and it is dynamic that complete bone is obtained by Restoration model Draw sequence data.
Complete skeleton cartoon sequence data: network and skeleton cartoon feature extraction are automatically synthesized according to interpolated frame and restore mould The calculating of type ultimately generates complete skeleton cartoon sequence data;
Three-dimensional rendering: it is sat according to the skeleton artis indicated frame by frame in obtained skeleton cartoon sequence data in three-dimensional Absolute location information under mark system, uses OpenGL to carry out three-dimensional rendering as figure class libraries;
3 D human body skeleton cartoon: according to carrying out three-dimensional rendering frame by frame, and screen refresh frequency is controlled, finally obtains three-dimensional Skeleton animation is simultaneously output on screen.
Refering to what is shown in Fig. 2, fixed in order to provide type of action sorter network, movement pattern formal classification network and key frame Position network training when training dataset, use the three-dimensional animation sequence in human body motion capture data library as initial data simultaneously Parsing obtains the skeletal structure under three-dimensional space frame by frame, converts two-dimensional frames for three-dimensional skeletal structure using the mode of rendering and acts Image list and the mapping for establishing the corresponding location information to skeletal joint point each under three-dimensional system of coordinate.
Refering to what is shown in Fig. 3, giving the model structure of sketch sorter network, wherein action classification sorter network uses model The mode of fusion, each weak typing network is made of two layers of convolutional layer and two full articulamentums, and is made using softmax For classifier, use the intersection entropy function as shown in formula (1) as error function,
Loss (p, q)=- ∑j pj log qj (1)
Model Fusion is carried out according to the method as shown in formula (2) using monolayer BP network;
P '=∑i w i·p i+b (2)
Wherein piIndicate the classification results of i-th of Weak Classifier, the final classification of p ' expression strong classifier is as a result, wiIt indicates Weak Classifier i exports the weight of result, and b indicates offset.
Movement pattern formal classification network and crucial frame alignment sorter network are that use is complete by two layers of convolutional layer and two The convolutional neural networks composition of articulamentum composition, and use softmax as classifier, intersect using as shown in formula (1) Entropy function is as error function.
Refering to what is shown in Fig. 4, in order to prove sorter network can convergence and monitor network in training process convergence variation, The changing tendency of error and accuracy in three kinds of network training process is had recorded respectively.
Three kinds of sorter networks are tested by using test set data, obtain test result as shown in the table.
Refering to what is shown in Fig. 5, the validity in order to prove model, use OpenGL as tool storage room according to using input sketch It identifies that obtained skeletal joint point carries out three-dimensional rendering in the specific co-ordinate position information of three-dimensional space, obtains visual three-dimensional Skeleton model.
Refering to what is shown in Fig. 6, giving the structure that 3 D human body skeleton cartoon is automatically synthesized model.Entire model can be divided into Two parts, wherein first part's (left part) is action message feature extraction and recovery unit, and middle process is special in this section The raw motion data that sign is extracted can be indicated in hidden unit in the form of acting manifold.In addition, in order to may be implemented From the characteristic in movement manifold to the recovery of raw motion data, this unit in design using can carry out simultaneously The convolution autoencoder network of feature extraction and characteristic recovery bidirectional operation.Second part (right part) is that movement tendency information is extensive Multiple unit, this part are connected with the top of action message extraction module, and loss movement is gradually completing in entire calculating process The prediction and recovery of trend feature information, output result are mapped to hidden unit in a manner of acting manifold, this subnetwork makes The feedforward network structure combined with arest neighbors interpolation with convolution strategy.By the characteristic information predicting and restore finally by The action message recovery operation of a part of unit further restores complete action message, realizes the interpolation of animation sequence missing frame With completion.
Wherein the coding in convolution autoencoder network is calculated as shown in formula (3):
Wherein weightOffsetm =256 indicate the quantity of hidden unit, ω0Indicate 3 × 3 convolution kernel size, operationIndicate that convolution algorithm, Ψ indicate sampling The core maximum pond operation that sliding step is 2 having a size of 3 and in the first dimension, operation result can carry out input data to drop and adopt Sample simultaneously halves the length of the first dimension of input data, and Relu is finally used to increase the non-of network model as activation primitive Linearly.Input data X can be abstracted by complete encoding operation EC, obtain movement manifoldAnd it deposits Store up hidden unit.
Decoding in convolution autoencoder network is calculated as shown in formula (4):
In decoding calculates, the movement manifold H in hidden unit is as input, W0 TWith W1 TRespectively weight W0And W1Turn It sets, is carrying out operationWhen be actually to have carried out de-convolution operation.
Motion characteristic based on convolution autoencoder network structure extracts shown in the error function such as formula (5) of network:
Wherein, X indicates original bone cartoon section.
Refering to what is shown in Fig. 7, giving the error tendency that the animation feature based on autoencoder network extracts model.
Refering to what is shown in Fig. 8, giving the test result that the animation feature based on autoencoder network extracts model.
Shown in the feedforward network calculation formula such as formula (6) that arest neighbors interpolation is combined with convolution strategy:
Wherein function NN (x, s) indicate using arest neighbors interpolation method by the size of the first dimension of x be amplified to s (such asThen); WithRespectively indicate convolution meter in every layer The weight of calculation, wherein m1=64, m2=64, m3=128, ω1And ω2Indicate 3 × 3 convolution kernel size, ω3And ω4Expression 5 × 5 convolution kernel size, ω5Indicate 7 × 7 convolution kernel size;b01、b02、b03、b04And b05Respectively indicate each layer convolution algorithm Deviation.By operationIn order to increase the non-linear of network, every layer of calculated result is all used Relu carries out Nonlinear Processing as activation primitive.Shown in error function such as formula (7) when network training:
The formula is meant that by measuring the movement tendency information predicted by feedforward network to realistic operation trend The variance of distance is carrying out acting quality when manifold is predicted between characteristic to react feedforward network.
Network is automatically synthesized with the interpolated frame that convolution strategy combines using arest neighbors interpolation refering to what is shown in Fig. 9, giving The error tendency in model training period.
Refering to what is shown in Fig. 10, giving the system sequence of the 3 D human body skeleton cartoon automatic generation method based on sketch Figure.
With reference to shown in Figure 11, the result example of system operation is given, may be implemented from given two human action sketches Automatic conversion of the image to 3 D human body skeleton cartoon.
Those of ordinary skill in the art will appreciate that all or part of the steps in the above method can be instructed by program Related hardware is completed, and described program can store in computer readable storage medium, such as read-only memory, disk or CD Deng.Optionally, one or more integrated circuits also can be used to realize, accordingly in all or part of the steps of above-described embodiment Ground, each module/unit in above-described embodiment can take the form of hardware realization, can also use the shape of software function module Formula is realized.The present invention is not limited to the combinations of the hardware and software of any particular form.
It should be noted that the invention may also have other embodiments, without departing substantially from spirit of that invention and its essence In the case of, those skilled in the art can make various corresponding changes and modifications according to the present invention, but these are corresponding Change and modification all should fall within the scope of protection of the appended claims of the present invention.

Claims (10)

1. a kind of 3 D human body skeleton cartoon automatic generation method based on sketch, which is characterized in that this method includes following step Suddenly
Step 1, it realizes the interaction with user, receives the human action sketch image file of user's input;
Step 2, background model is called according to the sketch image file;
Step 3, the missing interpolated frame in completion animation sequence is carried out according to the human action information in animation sequence in head and the tail frame Be automatically synthesized, and then realize full animation sequence generation;
Step 4, by the generation data render of the full animation sequence to screen, it is dynamic to obtain visual 3 D human body bone It draws.
2. 3 D human body skeleton cartoon automatic generation method according to claim 1, which is characterized in that in the step 2 In, background model is called according to the sketch image file, is specifically included:
Step 201, the human action sketch image inputted according to user carries out image pre-processing method, to obtain meeting network The sketch image data of input format;
Step 202, sketch image recognition web tab is formulated, is obtained for describing output of the network to sketch recognition capability As a result;
Step 203, sketch image recognition network training is carried out, specific sketch recognition result and real is obtained according to input sketch image Now arrive the mapping of three-dimensional space skeletal joint point coordinate information;
Step 204, the coordinate information of skeletal joint point in human body three-dimensional space is obtained.
3. 3 D human body skeleton cartoon automatic generation method according to claim 2, which is characterized in that in step 201 Described image preprocess method, specifically includes:
Successively the sketch image data that user inputs is carried out using profile testing method, fill method, equal proportion Zoom method Image transformation, to obtain meeting the network inputs of the 3 D human body skeleton model reconstruction model based on sketch image.
4. 3 D human body skeleton cartoon automatic generation method according to claim 3, which is characterized in that successively use profile Detection method, fill method, equal proportion Zoom method carry out image transformation to the sketch image data that user inputs, specific to wrap It includes:
Sketch image is inputted to user and carries out human body closed curve contour detecting, so that human action can be described by obtaining in image Main region part;
Closing section is realized according to the contour detecting obtained body curve's profile and is filled, to improve image for human body The descriptive power of movement;
Convert original image to the network inputs format for meeting the 3 D human body skeleton model reconstruction model based on sketch image, And shield the Unnecessary detail information in original image.
5. 3 D human body skeleton cartoon automatic generation method according to claim 3, which is characterized in that in step 202, The formulation sketch image recognition web tab, specifically includes:
Label is divided into three levels, respectively action classification, movement pattern classification and movement according to the relationship between human action Frame category, three kinds of labels for motion images descriptive power from thick to thin, final action action frame class label be used to describe to have The action message of single frames in body animation sequence.
6. 3 D human body skeleton cartoon automatic generation method according to claim 2, which is characterized in that in step 203, The progress sketch image recognition network training, specifically includes:
It is layered the identification for carrying out action sketch image using convolutional neural networks and divides according to formulating for sketch recognition web tab Class, comprising: training method, parameter adjustment and error function setting;
The training method uses tensorflow as deep learning tool, and using the mode of hierarchical classification gradually to network It is trained, and the model of more difficult training is decomposed using the mode of Model Fusion in training and carries out weak typing model Training, and every department pattern is merged, obtains final result.
The parameter adjustment is optimal effect for adjusting network parts parameter, and parameter includes: convolution kernel size, power Weight and deviation Initialize installation, convolution layer number, optimizer setting and learning rate Initialize installation;
Wherein the convolutional layer quantity determines the dimension and network query function amount of character representation, and convolutional layer more multiple features indicate more abstract Change, while calculation amount is also bigger, the fewer character representation of convolutional layer is smaller with hour operation quantity closer to initial data.
7. 3 D human body skeleton cartoon automatic generation method according to claim 1, which is characterized in that in step 3, institute State the missing interpolated frame carried out according to the human action information in animation sequence in head and the tail frame in completion animation sequence from dynamic circuit connector At specifically including:
When given any two act frame data, using data-oriented as the head and the tail frame of one section of Complete three-dimensional skeleton animation And the interpolation frame data lacked between two frames are automatically generated, used method includes skeleton cartoon feature extracting method and interpolation Frame automatic synthesis method.
8. 3 D human body skeleton cartoon automatic generation method according to claim 7, which is characterized in that
The skeleton cartoon feature extracting method passes through coding and decoding operation experience one using convolution autoencoder network structure At process, the input data of network is complete skeleton cartoon sequence data for data reproduction, and the final output of network is animation sequence The regeneration data of column, optimisation strategy when network training are to minimize initial data and regenerate the variance distance between data, Trained model can calculate the feature extraction for realizing original skeleton cartoon by coding.
9. 3 D human body skeleton cartoon automatic generation method according to claim 7, which is characterized in that
The mode gradually recovery action that the interpolated frame automatic synthesis method is combined using convolution feedforward network with interpolation arithmetic Between variation tendency, ultimately generate complete skeleton cartoon sequence, include using arest neighbors interpolation strategies in network layer structure Interpolated layer, convolutional layer and active coating.
10. 3 D human body skeleton cartoon automatic generation method according to claim 7, which is characterized in that
The interpolated layer using arest neighbors interpolation strategies, specifically includes: arest neighbors interpolation strategies can carry out ruler to initial data Amplification on very little, and primary data information (pdi) can be retained, met in network query function by being staged through the calculating of interpolated layer Final output data format size;
Data Jing Guo interpolated layer are abstracted and are reversed by the convolutional layer, realize the fitting to target output;
The active coating for increasing the non-linear of network, and reduces the relation of interdependence in network parameter, alleviates network Over-fitting.
CN201910597737.9A 2019-07-04 2019-07-04 Sketch-based three-dimensional human skeleton animation automatic generation method Active CN110310351B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910597737.9A CN110310351B (en) 2019-07-04 2019-07-04 Sketch-based three-dimensional human skeleton animation automatic generation method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910597737.9A CN110310351B (en) 2019-07-04 2019-07-04 Sketch-based three-dimensional human skeleton animation automatic generation method

Publications (2)

Publication Number Publication Date
CN110310351A true CN110310351A (en) 2019-10-08
CN110310351B CN110310351B (en) 2023-07-21

Family

ID=68078118

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910597737.9A Active CN110310351B (en) 2019-07-04 2019-07-04 Sketch-based three-dimensional human skeleton animation automatic generation method

Country Status (1)

Country Link
CN (1) CN110310351B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111401272A (en) * 2020-03-19 2020-07-10 支付宝(杭州)信息技术有限公司 Face feature extraction method, device and equipment
CN111862276A (en) * 2020-07-02 2020-10-30 南京师范大学 Automatic skeleton animation production method based on formalized action description text
CN113112576A (en) * 2021-04-15 2021-07-13 华强方特(深圳)动漫有限公司 Method for automatically disassembling Maya large bone total weight to fine differentiated bone
CN113505751A (en) * 2021-07-29 2021-10-15 同济大学 Human skeleton action recognition method based on difference map convolutional neural network
CN114067091A (en) * 2022-01-17 2022-02-18 深圳慧拓无限科技有限公司 Multi-source data labeling method and system, electronic equipment and storage medium
CN114842155A (en) * 2022-07-04 2022-08-02 埃瑞巴蒂成都科技有限公司 High-precision automatic bone binding method
CN116704553A (en) * 2023-06-13 2023-09-05 长江大学 Human body characteristic identification auxiliary system based on computer vision technology

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101958007A (en) * 2010-09-20 2011-01-26 南京大学 Three-dimensional animation posture modeling method by adopting sketch
US20160232698A1 (en) * 2015-02-06 2016-08-11 Electronics And Telecommunications Research Institute Apparatus and method for generating animation
WO2017084204A1 (en) * 2015-11-19 2017-05-26 广州新节奏智能科技有限公司 Method and system for tracking human body skeleton point in two-dimensional video stream
CN107968962A (en) * 2017-12-12 2018-04-27 华中科技大学 A kind of video generation method of the non-conterminous image of two frames based on deep learning

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101958007A (en) * 2010-09-20 2011-01-26 南京大学 Three-dimensional animation posture modeling method by adopting sketch
US20160232698A1 (en) * 2015-02-06 2016-08-11 Electronics And Telecommunications Research Institute Apparatus and method for generating animation
WO2017084204A1 (en) * 2015-11-19 2017-05-26 广州新节奏智能科技有限公司 Method and system for tracking human body skeleton point in two-dimensional video stream
CN107968962A (en) * 2017-12-12 2018-04-27 华中科技大学 A kind of video generation method of the non-conterminous image of two frames based on deep learning

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
HUANG HAIBIN,KALOGERAKIS E,YUMER E,ET AL: "Shape synthesis from sketches via procedural models and convolutional networks", 《IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS》 *
SILLA C N,FREITAS A A: "A survey of hierarchical classification across different application domains", 《DATA MINING AND KNOWLEDGE DISCOVERY》 *
刘旭等: "辅助室内定位的关键人体姿态识别", 《科学技术与工程》 *
王俊岭等: "深层次特征学习的Adaboost大规模图像分类算法", 《电视技术》 *
王富平等: "多尺度微分模式相似性角点检测算法", 《光电工程》 *
马昊,李淑琴,丁濛,孟坤: "基于草图的三维建模技术综述", 《智能计算机与应用》 *

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111401272A (en) * 2020-03-19 2020-07-10 支付宝(杭州)信息技术有限公司 Face feature extraction method, device and equipment
CN111401272B (en) * 2020-03-19 2021-08-24 支付宝(杭州)信息技术有限公司 Face feature extraction method, device and equipment
CN111862276A (en) * 2020-07-02 2020-10-30 南京师范大学 Automatic skeleton animation production method based on formalized action description text
CN111862276B (en) * 2020-07-02 2023-12-05 南京师范大学 Automatic skeletal animation production method based on formalized action description text
CN113112576A (en) * 2021-04-15 2021-07-13 华强方特(深圳)动漫有限公司 Method for automatically disassembling Maya large bone total weight to fine differentiated bone
CN113505751B (en) * 2021-07-29 2022-10-25 同济大学 Human skeleton action recognition method based on difference map convolutional neural network
CN113505751A (en) * 2021-07-29 2021-10-15 同济大学 Human skeleton action recognition method based on difference map convolutional neural network
CN114067091B (en) * 2022-01-17 2022-08-16 深圳慧拓无限科技有限公司 Multi-source data labeling method and system, electronic equipment and storage medium
CN114067091A (en) * 2022-01-17 2022-02-18 深圳慧拓无限科技有限公司 Multi-source data labeling method and system, electronic equipment and storage medium
CN114842155A (en) * 2022-07-04 2022-08-02 埃瑞巴蒂成都科技有限公司 High-precision automatic bone binding method
CN114842155B (en) * 2022-07-04 2022-09-30 埃瑞巴蒂成都科技有限公司 High-precision automatic bone binding method
CN116704553A (en) * 2023-06-13 2023-09-05 长江大学 Human body characteristic identification auxiliary system based on computer vision technology
CN116704553B (en) * 2023-06-13 2024-01-26 长江大学 Human body characteristic identification auxiliary system based on computer vision technology

Also Published As

Publication number Publication date
CN110310351B (en) 2023-07-21

Similar Documents

Publication Publication Date Title
CN110310351A (en) A kind of 3 D human body skeleton cartoon automatic generation method based on sketch
Xiang et al. Deep learning for image inpainting: A survey
CN110599573B (en) Method for realizing real-time human face interactive animation based on monocular camera
CN111553968B (en) Method for reconstructing animation of three-dimensional human body
Zhang et al. Hair-GAN: Recovering 3D hair structure from a single image using generative adversarial networks
Shen et al. Deepsketchhair: Deep sketch-based 3d hair modeling
CN113255457A (en) Animation character facial expression generation method and system based on facial expression recognition
CN111524226A (en) Method for detecting key point and three-dimensional reconstruction of ironic portrait painting
CN116385606A (en) Speech signal driven personalized three-dimensional face animation generation method and application thereof
Wang et al. Computer-aided traditional art design based on artificial intelligence and human-computer interaction
Kobayashi et al. Motion capture dataset for practical use of AI-based motion editing and stylization
Hu et al. Pose-aware attention network for flexible motion retargeting by body part
CN114170353A (en) Multi-condition control dance generation method and system based on neural network
Unlu et al. Interactive sketching of mannequin poses
Zhou et al. Hierarchical learning recurrent neural networks for 3D motion synthesis
Chang et al. 3D hand reconstruction with both shape and appearance from an RGB image
Zhang et al. Hair-gans: Recovering 3d hair structure from a single image
Victor et al. Pose Metrics: a New Paradigm for Character Motion Edition
Yu et al. Application of Computer Graphics and Machine Learning in Computer Aided Design of Digital Sculpture
Tejera et al. Space-time editing of 3d video sequences
Tian et al. Augmented Reality Animation Image Information Extraction and Modeling Based on Generative Adversarial Network
Park et al. A feature‐based approach to facial expression cloning
Jiang et al. Animation scene generation based on deep learning of CAD data
Lu et al. Design and implementation of a virtual teacher teaching system algorithm based on facial expression recognition in the era of big data
Li et al. Pose-aware 3D talking face synthesis using geometry-guided audio-vertices attention

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant