CN110084845A - Deformation Prediction method, apparatus and computer readable storage medium - Google Patents

Deformation Prediction method, apparatus and computer readable storage medium Download PDF

Info

Publication number
CN110084845A
CN110084845A CN201910365833.0A CN201910365833A CN110084845A CN 110084845 A CN110084845 A CN 110084845A CN 201910365833 A CN201910365833 A CN 201910365833A CN 110084845 A CN110084845 A CN 110084845A
Authority
CN
China
Prior art keywords
network
prediction
deformation
deformation prediction
target object
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910365833.0A
Other languages
Chinese (zh)
Inventor
王智华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201910365833.0A priority Critical patent/CN110084845A/en
Publication of CN110084845A publication Critical patent/CN110084845A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/50Depth or shape recovery
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10028Range image; Depth image; 3D point clouds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]

Abstract

The invention discloses a kind of Deformation Prediction methods, by obtaining the depth image of target object and inputting three-dimensional reconstruction network, to obtain the threedimensional model that the three-dimensional reconstruction network is generated based on the depth image;Then the threedimensional model and the conditional vector generated based on target object characteristic are inputted into Deformation Prediction network, obtains the prediction model of deformation of Deformation Prediction network output;Task is executed based on the prediction model of deformation later.The invention also discloses a kind of Deformation Prediction device and computer readable storage mediums.This method inherits the extensive attribute for generating confrontation network, this means that network can in the case where the single depth views of given object reconstructed object entire three-dimensional appearance, and it is opposite with traditional finite element method, this method does not need largely to calculate cost, realize sufficiently fast Deformation Prediction speed, can be used for including robot grab article in real time including numerous real-time applications.

Description

Deformation Prediction method, apparatus and computer readable storage medium
Technical field
The present invention relates to field of artificial intelligence more particularly to a kind of Deformation Prediction method, apparatus and computer-readable Storage medium.
Background technique
Key areas of the deep learning as current machine learning suffers from weight in directions such as image recognition, speech analysis It applies.Deep learning is from neural network evolution, and deep learning is substantially the artificial neural network of deep layer, it is not one The isolated technology of item, but the synthesis of the multiple fields such as mathematics, statistical machine learning, computer science and artificial neural network. Up to the present, oneself warp of people is directed to all kinds of different problems such as classification, detection, identification and devises a variety of effective networks Structure.
Ambient enviroment is understood and is interacted, is one of the significant capability that robot needs to possess, and including machine The precondition that many application technologies including people's technology and augmented reality are achieved, especially in prediction target object In the real-time deformation situation under by external force.It is blocked, interferes however, being frequently run onto target object in practice The situations such as object and localized loss, difficulty increases when above situation all can cause to predict the deformation of target object in real time, Such as robot can only be observed when manipulating an object target object part and non-integral.In addition, current technology exists It is all often that all target objects are regarded as to static during handling machine people grabs object problem, it is non-deformable Hard objects.However in reality most of objects be all it is deformable, different journeys can be generated in the effect for encountering external force The deformation of degree.If can be inferred that object issuable deformation when being crawled, facilitate robot to being crawled object Preparatory control judgement is generated, to generate the control strategy of adaptation.
The depth image that traditional Deformation Prediction method gets robot directly inputs generation network, is generating network Two steps of prediction of reconstruction and three-dimensional stereo model deflection of the middle completion from partial-depth image to three-dimensional stereo model, Due to generating network more suitable for high-dimensional model treatment, there are also to be improved for this process.
Above content is only used to facilitate the understanding of the technical scheme, and is not represented and is recognized that above content is existing skill Art.
Summary of the invention
The main purpose of the present invention is to provide a kind of Deformation Prediction method, apparatus and computer readable storage medium, purports Depth image is directly handled solving tradition generation network, the technical problem for causing resulting prediction model of deformation effect bad.
To achieve the above object, the present invention provides a kind of Deformation Prediction method, is applied to Deformation Prediction system, the deformation Forecasting system includes three-dimensional reconstruction network and Deformation Prediction network, the Deformation Prediction method the following steps are included:
It obtains the depth image of target object and inputs three-dimensional reconstruction network, be based on institute to obtain the three-dimensional reconstruction network State the threedimensional model of depth image generation;
The threedimensional model and the conditional vector generated based on target object characteristic are inputted into Deformation Prediction network, become Shape predicts the prediction model of deformation of network output;
Task is executed based on the prediction model of deformation.
Preferably, it is described obtain target object depth image and input three-dimensional reconstruction network, to obtain the Three-dimensional Gravity The step of threedimensional model that establishing network is generated based on the depth image includes:
The 2.5D depth image is inputted three-dimensional reconstruction network by the 2.5D depth image for obtaining target object;
Three-dimensional reconstruction processing is carried out to the 2.5D depth image in three-dimensional reconstruction network, generates corresponding three-dimensional mould Type.
Preferably, the Deformation Prediction network includes generating network and differentiating network, described by the threedimensional model and base Deformation Prediction network is inputted in the conditional vector that target object characteristic generates, obtains the Deformation Prediction mould of Deformation Prediction network output The step of type includes:
First condition vector is generated based on target object characteristic, the threedimensional model and the first condition vector are inputted Network is generated, the prediction of distortion amount of target object is obtained;
The true strain amount for obtaining target object, using the prediction of distortion amount of the generation network output as differentiation network Input value determines whether the prediction of distortion amount for generating network output meets expectation in conjunction with the true strain amount;
When the prediction of distortion amount of the generation network output meets desired, determine that the prediction of distortion amount is Deformation Prediction Model.
Preferably, the differentiation network successively includes five warp laminations, wherein each warp lamination includes one and swashs Function living;
The generation network includes variation autocoder and decoder;
The variation autocoder successively includes five convolutional layers, a full articulamentum and two μ and σ layers, In, each warp lamination includes an activation primitive;
The decoder successively includes five warp laminations, wherein each warp lamination includes an activation primitive.
Preferably, described the step of generating first condition vector based on target object characteristic, includes:
Determine the material property of target object and the stress condition of target object;
First condition vector is generated based on the material property and stress condition.
Preferably, the true strain amount for obtaining target object makees the prediction of distortion amount of the generation network output Determine whether the prediction of distortion amount for generating network output meets the phase in conjunction with the true strain amount for the input value for differentiating network The step of prestige includes:
Obtain the true strain amount that physical modeler generates;
The prediction of distortion amount input of the generation network output is differentiated into network, in conjunction with the true strain amount and is based on mesh The second condition vector that object properties generate is marked, determines whether the prediction of distortion amount meets expectation.
Preferably, it is described obtain target object depth image and input three-dimensional reconstruction network, to obtain the Three-dimensional Gravity Before the step of threedimensional model that establishing network is generated based on the depth image, the method also includes:
Several real-world object deformation patterns are collected, corresponding image data base is established;
Deformation Prediction network is trained based on described image database, so that the prediction energy of the Deformation Prediction network Power gradually increases.
Preferably, described that Deformation Prediction network is trained based on described image database, so that the Deformation Prediction The step of predictive ability of network gradually increases include:
Several voxel grids and conditional vector are obtained based on image data base, if generating by physical modeler by described Several ground truths pair of dry voxel grid and conditional vector composition;
With several described ground truths to training Deformation Prediction network, the related ginseng of Deformation Prediction network is continued to optimize Number, so that the predictive ability of the Deformation Prediction network gradually increases.
In addition, to achieve the above object, the present invention also provides a kind of Deformation Prediction device, the Deformation Prediction device packet It includes: memory, processor and being stored in the Deformation Prediction program that can be run on the memory and on the processor, it is described Deformation Prediction program realizes the step of Deformation Prediction method as described in any one of the above embodiments when being executed by the processor.
In addition, to achieve the above object, it is described computer-readable the present invention also provides a kind of computer readable storage medium It is stored with Deformation Prediction program on storage medium, realizes when the Deformation Prediction program is executed by processor such as any of the above-described institute The step of Deformation Prediction method stated.
The present invention program, by obtaining the depth image of target object and inputting three-dimensional reconstruction network, to obtain described three Dimension rebuilds the threedimensional model that network is generated based on the depth image;Then by the threedimensional model and based on target object characteristic The conditional vector of generation inputs Deformation Prediction network, obtains the prediction model of deformation of Deformation Prediction network output;It is based on institute later It states prediction model of deformation and executes task;This method inherits the extensive attribute for generating confrontation network, it means that network can be The entire three-dimensional appearance of reconstructed object in the case where the single depth views of given object, and with traditional finite element method phase Instead, this method does not need largely to calculate cost, realizes sufficiently fast Deformation Prediction speed, can be used for including that robot is real-time Grab numerous real-time applications including article.
Detailed description of the invention
Fig. 1 is the structure of the affiliated terminal of Deformation Prediction device in the hardware running environment that the embodiment of the present invention is related to Schematic diagram;
Fig. 2 is the flow diagram of Deformation Prediction method first embodiment of the present invention;
Fig. 3 is that the connection of three-dimensional reconstruction network and Deformation Prediction network in Deformation Prediction method first embodiment of the present invention is closed It is schematic diagram;
Fig. 4 is to obtain the depth image of target object in Deformation Prediction method second embodiment of the present invention and input Three-dimensional Gravity Establishing network, refinement process the step of to obtain threedimensional model that the three-dimensional reconstruction network is generated based on the depth image are shown It is intended to;
Fig. 5 is in Deformation Prediction method 3rd embodiment of the present invention by the threedimensional model and based on the life of target object characteristic At conditional vector input Deformation Prediction network, obtain Deformation Prediction network output prediction model of deformation the step of refinement stream Journey schematic diagram;
Fig. 6 is generation network and differentiation schematic network structure in Deformation Prediction method 3rd embodiment of the present invention;
Fig. 7 is the flow diagram in Deformation Prediction method fourth embodiment of the present invention.
The embodiments will be further described with reference to the accompanying drawings for the realization, the function and the advantages of the object of the present invention.
Specific embodiment
It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, it is not intended to limit the present invention.
As shown in Figure 1, the affiliated terminal structure of device that Fig. 1 is the hardware running environment that the embodiment of the present invention is related to shows It is intended to.
The terminal of that embodiment of the invention can be PC, be also possible to smart phone, tablet computer, E-book reader, MP3 (Moving Picture Experts Group Audio Layer III, dynamic image expert's compression standard audio level 3) Player, MP4 (Moving Picture Experts Group Audio Layer IV, dynamic image expert's compression standard sound Frequency level 3) the packaged type terminal device having a display function such as player, portable computer.
As shown in Figure 1, the terminal may include: processor 1001, such as CPU, network interface 1004, user interface 1003, memory 1005, communication bus 1002.Wherein, communication bus 1002 is for realizing the connection communication between these components. User interface 1003 may include display screen (Display), input unit such as keyboard (Keyboard), optional user interface 1003 can also include standard wireline interface and wireless interface.Network interface 1004 optionally may include that the wired of standard connects Mouth, wireless interface (such as WI-FI interface).Memory 1005 can be high speed RAM memory, be also possible to stable memory (non-volatile memory), such as magnetic disk storage.Memory 1005 optionally can also be independently of aforementioned processor 1001 storage device.
Optionally, terminal can also include camera, RF (Radio Frequency, radio frequency) circuit, sensor, audio Circuit, WiFi module etc..Wherein, sensor such as optical sensor, motion sensor and other sensors.Specifically, light Sensor may include ambient light sensor and proximity sensor, wherein ambient light sensor can according to the light and shade of ambient light come The brightness of display screen is adjusted, proximity sensor can close display screen and/or backlight when mobile terminal is moved in one's ear.As One kind of motion sensor, gravity accelerometer can detect the size of (generally three axis) acceleration in all directions, quiet Size and the direction that can detect that gravity when only, the application that can be used to identify mobile terminal posture are (such as horizontal/vertical screen switching, related Game, magnetometer pose calibrating), Vibration identification correlation function (such as pedometer, tap) etc.;Certainly, mobile terminal can also match The other sensors such as gyroscope, barometer, hygrometer, thermometer, infrared sensor are set, details are not described herein.
It will be understood by those skilled in the art that the restriction of the not structure paired terminal of terminal structure shown in Fig. 1, can wrap It includes than illustrating more or fewer components, perhaps combines certain components or different component layouts.
As shown in Figure 1, as may include that operating system, network are logical in a kind of memory 1005 of computer storage medium Believe module, Subscriber Interface Module SIM and Deformation Prediction program.
In terminal shown in Fig. 1, network interface 1004 is mainly used for connecting background server, carries out with background server Data communication;User interface 1003 is mainly used for connecting client (user terminal), carries out data communication with client;And processor 1001 can be used for calling the Deformation Prediction program stored in memory 1005.
In the present embodiment, Deformation Prediction device includes: memory 1005, processor 1001 and is stored in the memory On 1005 and the Deformation Prediction program that can be run on the processor 1001, wherein processor 1001 calls memory 1005 When the Deformation Prediction program of middle storage, and execute following operation:
It obtains the depth image of target object and inputs three-dimensional reconstruction network, be based on institute to obtain the three-dimensional reconstruction network State the threedimensional model of depth image generation;
The threedimensional model and the conditional vector generated based on target object characteristic are inputted into Deformation Prediction network, become Shape predicts the prediction model of deformation of network output;
Task is executed based on the prediction model of deformation.
Further, processor 1001 can call the Deformation Prediction program stored in memory 1005, also execute following Operation:
The 2.5D depth image is inputted three-dimensional reconstruction network by the 2.5D depth image for obtaining target object;
Three-dimensional reconstruction processing is carried out to the 2.5D depth image in three-dimensional reconstruction network, generates corresponding three-dimensional mould Type.
Further, processor 1001 can call the Deformation Prediction program stored in memory 1005, also execute following Operation:
First condition vector is generated based on target object characteristic, the threedimensional model and the first condition vector are inputted Network is generated, the prediction of distortion amount of target object is obtained;
The true strain amount for obtaining target object, using the prediction of distortion amount of the generation network output as differentiation network Input value determines whether the prediction of distortion amount for generating network output meets expectation in conjunction with the true strain amount;
When the prediction of distortion amount of the generation network output meets desired, determine that the prediction of distortion amount is Deformation Prediction Model.
Further, processor 1001 can call the Deformation Prediction program stored in memory 1005, also execute following Operation:
The differentiation network successively includes five warp laminations, wherein each warp lamination includes an activation primitive;
The generation network includes variation autocoder and decoder;
The variation autocoder successively includes five convolutional layers, a full articulamentum and two μ and σ layers, In, each warp lamination includes an activation primitive;
The decoder successively includes five warp laminations, wherein each warp lamination includes an activation primitive.
Further, processor 1001 can call the Deformation Prediction program stored in memory 1005, also execute following Operation:
Determine the material property of target object and the stress condition of target object;
First condition vector is generated based on the material property and stress condition.
Further, processor 1001 can call the Deformation Prediction program stored in memory 1005, also execute following Operation:
Obtain the true strain amount that physical modeler generates;
The prediction of distortion amount input of the generation network output is differentiated into network, in conjunction with the true strain amount and is based on mesh The second condition vector that object properties generate is marked, determines whether the prediction of distortion amount meets expectation.
Further, processor 1001 can call the Deformation Prediction program stored in memory 1005, also execute following Operation:
Several real-world object deformation patterns are collected, corresponding image data base is established;
Deformation Prediction network is trained based on described image database, so that the prediction energy of the Deformation Prediction network Power gradually increases.
Further, processor 1001 can call the Deformation Prediction program stored in memory 1005, also execute following Operation:
Several voxel grids and conditional vector are obtained based on image data base, if generating by physical modeler by described Several ground truths pair of dry voxel grid and conditional vector composition;
With several described ground truths to training Deformation Prediction network, the related ginseng of Deformation Prediction network is continued to optimize Number, so that the predictive ability of the Deformation Prediction network gradually increases.
First embodiment of the invention provides a kind of Deformation Prediction method, is applied to Deformation Prediction system, the Deformation Prediction System includes three-dimensional reconstruction network and Deformation Prediction network, is Deformation Prediction method first embodiment of the present invention referring to Fig. 2, Fig. 2 Flow diagram, the Deformation Prediction method includes:
Step S10 obtains the depth image of target object and inputs three-dimensional reconstruction network, to obtain the Three-dimensional Gravity networking The threedimensional model that network is generated based on the depth image;
Depth image is also range image, refer to using distance (depth) value of each point in from image acquisition device to scene as The image of pixel value.Acquisition methods have: laser radar Depth Imaging method, computer stereo vision imaging, coordinate measuring machine method, not That the Schlieren method, Structure light method.The material property of the target object includes Young's modulus and Poisson's ratio, the target object by Power situation includes amount of force and position of action point.Conditional vector is to carry out discretization to multiple physical conditions, is obtained multiple Corresponding numerical value set, is indicated with the form of vector.For example, when robot needs to judge the deflection of target obstacle, Obtain the size of the power applied to it, it, can be by the way that these three features be compiled when the material of the active position of power and barrier Code is only hot vector (f, a, m), including each feature including f, a, m indicates the condition of discretization with binary form; Fixed number of bits each feature can also be arranged.In the present embodiment, it is shooting function by robot itself The depth image of target object is got, this image not only includes target object, further includes surrounding environment.Depth image It is converted by certain processing, such as coordinate, may be calculated point cloud data, while the point cloud data met certain condition can also Using inverse as depth image.
As shown in figure 3, the structure of three-dimensional reconstruction network and Deformation Prediction network is cascade form.The work of three-dimensional reconstruction network Be by 2.5D depth image by three-dimensional reconstruction generate have stereochemical structure threedimensional model, it is later that the threedimensional model is defeated Enter into Deformation Prediction network, so that Deformation Prediction network handles the threedimensional model, and then obtains prediction model of deformation.
In this embodiment, robot realization is helped to grab in such a way that three-dimensional reconstruction network and Deformation Prediction network are connected in series Prediction when taking object to object deflection.When predicting the deflection of target object, which is shot, is obtained One depth image, usually 2.5D depth image, obtains the 2.5D depth image of target object, by the 2.5D depth image Input three-dimensional reconstruction network;Three-dimensional reconstruction processing is carried out to the 2.5D depth image in three-dimensional reconstruction network, generates and corresponds to Threedimensional model.
The threedimensional model and the conditional vector generated based on target object characteristic are inputted Deformation Prediction net by step S20 Network obtains the prediction model of deformation of Deformation Prediction network output;
It interacts what robot and circumstances not known generated without constraint, can possess as the common-sense mankind's intuition pushes away Reason ability be it is necessary, can especially know target object generated reaction when by external force.To everyday objects Physical characteristic carry out modeling be autonomous robot basic premise.We have proposed a new generations to fight network, can Deformation of the target object under by external force is predicted from the threedimensional model of a target object.The network is based on generating confrontation net Network, and be trained on the different object sets generated by physics finite element model simulator.Network is generated mainly by certainly Dynamic encoder and decoder composition, for the ease of the internal structure of reconstructed voxel grid, the autocoder in the present invention is being passed There is great-jump-forward connection between the encoder and decoder of system.
Using certain way by target object characteristic encoding at conditional vector, above-mentioned target object characteristic includes target object Material property and target object stress condition.The material property of target object or stress condition are encoded into continuously Using conditional vector as the input of form, rather than according to the discontinuous classification of rank.
Step S30 executes task based on the prediction model of deformation.
When getting the prediction of distortion amount with generation network output, can executing for task includes: manipulation task, crawl Task, ground deformation assessment and other in the end-to-end academic environment of intuitive physical model predictive power to non-rigid object Influence task.
The Deformation Prediction method proposed in the present embodiment, by obtaining the depth image of target object and inputting three-dimensional reconstruction Network, to obtain the threedimensional model that the three-dimensional reconstruction network is generated based on the depth image;Then by the threedimensional model Deformation Prediction network is inputted with the conditional vector generated based on target object characteristic, the deformation for obtaining the output of Deformation Prediction network is pre- Survey model;Task is executed based on the prediction model of deformation later;This method inherits the extensive attribute for generating confrontation network, this Mean network can in the case where the single depth views of given object reconstructed object entire three-dimensional appearance, and with biography The finite element method of system realizes sufficiently fast Deformation Prediction speed, can use on the contrary, this method does not need largely to calculate cost In including robot grab article in real time including numerous real-time applications.
Based on first embodiment, the second embodiment of Deformation Prediction method of the present invention is proposed, referring to Fig. 4, step S10 packet It includes:
Step S11 obtains the 2.5D depth image of target object, and the 2.5D depth image is inputted Three-dimensional Gravity networking Network;
Step S12 carries out three-dimensional reconstruction processing to the 2.5D depth image in three-dimensional reconstruction network, generates corresponding Threedimensional model.
Obtain the 2.5D depth image of target object;Voxelization processing is carried out to the 2.5D depth image, is corresponded to The first voxel grid.Voxelization processing is the process for converting depth image to voxel grid, voxel grid namely one 3D point cloud.The definition of point cloud is from when beam of laser is irradiated to body surface, and the laser reflected can carry orientation, distance Etc. information.If laser beam is scanned according to certain track, the laser point information of reflection will be recorded in scanning, due to Scanning is extremely fine, then can obtain a large amount of laser point, thus can form laser point cloud.Point cloud format has * .las;* .pcd;* .txt etc..Depth image may be calculated point cloud data by coordinate conversion;Regular and necessary information point cloud number According to can be using inverse as depth image.
The Deformation Prediction method proposed in the present embodiment will be described by obtaining the 2.5D depth image of target object 2.5D depth image inputs three-dimensional reconstruction network;Then Three-dimensional Gravity is carried out to the 2.5D depth image in three-dimensional reconstruction network Processing is built, corresponding threedimensional model is generated;The present invention predicts what it was generated by obtaining the 2.5D depth map of target object Deformation, realizes the reconstruction of accurate threedimensional model.
Based on first embodiment, the 3rd embodiment of Deformation Prediction method of the present invention is proposed, referring to Fig. 5, Deformation Prediction net Network includes generating network and differentiating that network, step S20 include:
Step S21 generates first condition vector based on target object characteristic, by the threedimensional model and the first condition Vector input generates network, obtains the prediction of distortion amount of target object;
Step S22 obtains the true strain amount of target object, using the prediction of distortion amount of the generation network output as sentencing The input value of other network determines whether the prediction of distortion amount for generating network output meets expectation in conjunction with the true strain amount;
Step S23 determines that the prediction of distortion amount is when the prediction of distortion amount of the generation network output meets desired Prediction model of deformation.
Determine the material property of target object and the stress condition of target object;Based on the material property and stress Situation generates first condition vector, and the first condition vector is inputted and generates network.The material property of target object includes mesh The Young's modulus and Poisson's ratio of object materials are marked, the stress condition of target object includes the size and work of external force suffered by target object With point.
As shown in fig. 6, the generation network includes variation autocoder and decoder;The variation autocoder according to Secondary includes five convolutional layers, a full articulamentum and two μ and σ layers, wherein each warp lamination includes an activation Function;The decoder successively includes five warp laminations, wherein each warp lamination includes an activation primitive.
Voxel grid is inputted after generating network, initially enters and divide autocoder part partially, available one one-dimensional Vector is merged with based on first condition vector made of target object characteristic encoding, one after being merged it is one-dimensional to Amount, input decoder obtain the prediction of distortion amount generated by decoder processes.
As shown in fig. 6, differentiating that network successively includes five warp laminations, wherein each warp lamination includes one and swashs Function living.
In conjunction with the true strain amount and the second condition vector generated based on target object characteristic, determine that the prediction becomes Whether shape amount meets expectation.Second condition vector be modified so as to prediction of distortion amount with voxel grid form and true Real deflection is connected.Since the deflection of prediction target object is a high-dimensional problem, differentiate that network can export one A intensive vector differentiates the similarity as a result, namely between prediction of distortion amount and true strain amount to represent.According to this phase Judge to generate whether the prediction of distortion amount that network exports meets expectation like degree result.
The Deformation Prediction method proposed in the present embodiment will by generating first condition vector based on target object characteristic The threedimensional model and first condition vector input generate network, obtain the prediction of distortion amount of target object;Then it obtains The true strain amount of target object, using it is described generation network output prediction of distortion amount as differentiation network input value, in conjunction with The true strain amount, determines whether the prediction of distortion amount for generating network output meets expectation;It is defeated in the generation network later When prediction of distortion amount out meets desired, determine that the prediction of distortion amount is prediction model of deformation;The method of the present invention utilizes intuition Physical model predicts real-time deformation situation of three-dimension object when by external force, by the material property and stress condition of the object As conditional vector, it is added to depth variation autocoder and antagonistic training network, from the material property of target object The property of learning objective body enables the network in the method for the present invention to promote on a large scale, mitigates to a large amount of training datas Demand.
Based on first embodiment, propose the fourth embodiment of Deformation Prediction method of the present invention, reference Fig. 7, step S10 it Before, the method also includes:
Step S40 collects several real-world object deformation patterns, establishes corresponding image data base;
Step S50 is trained based on described image database to network is generated, so that the prediction energy for generating network Power gradually increases.
Several voxel grids and conditional vector are obtained based on image data base, if generating by physical modeler by described Several ground truths pair of dry voxel grid and conditional vector composition;With several described ground truths pair and differentiate network Common training generates network, continues to optimize the relevant parameter for generating network, so that the predictive ability for generating network gradually increases By force.
Physical modeler can simulate the rule of various object of which movement in real world, can be by preset picture database It is stored in the memory of itself, when being trained to generation network, several opposite true value is generated based on picture database Right, which forms to by three-dimensional point cloud and conditional vector, converts corresponding body by software for three-dimensional point cloud later Plain grid.By the ground truth of physical engine generation to generation network is input to, in the predictive information for obtaining generating network output When, by the predictive information and above-mentioned ground truth to as the input for differentiating network, predictive information be exactly by generation network over the ground Face true value is to the predicted value estimated according to certain rule.Later by between differentiation network query function predictive information and ground truth pair Difference, updated according to the difference differentiate network loss function in corresponding parameter.Meanwhile more according to above-mentioned loss function The relevant parameter of newly-generated network, so that the predictive ability for generating network gradually increases, that is, the predictive information exported is more next Closer to the ground truth pair of input, reach the degree for differentiating that network can not be differentiated.
The machine learning method of the intuition physics proposed in the present embodiment, by collecting several real-world object deformation patterns, Establish corresponding image data base;It is then based on described image database and is trained to network is generated, so that the generation net The predictive ability of network gradually increases;The present invention uses the structure system of cascade form, and conditional vector can be made as generation network A part with network inputs value is differentiated, realizes effective network training mode.
In addition, the embodiment of the present invention also proposes a kind of computer readable storage medium, the computer readable storage medium On be stored with Deformation Prediction program, following operation is realized when the Deformation Prediction program is executed by processor:
It obtains the depth image of target object and inputs three-dimensional reconstruction network, be based on institute to obtain the three-dimensional reconstruction network State the threedimensional model of depth image generation;
The threedimensional model and the conditional vector generated based on target object characteristic are inputted into Deformation Prediction network, become Shape predicts the prediction model of deformation of network output;
Task is executed based on the prediction model of deformation.
Further, following operation is also realized when the Deformation Prediction program is executed by processor:
The 2.5D depth image is inputted three-dimensional reconstruction network by the 2.5D depth image for obtaining target object;
Three-dimensional reconstruction processing is carried out to the 2.5D depth image in three-dimensional reconstruction network, generates corresponding three-dimensional mould Type.
Further, following operation is also realized when the Deformation Prediction program is executed by processor:
First condition vector is generated based on target object characteristic, the threedimensional model and the first condition vector are inputted Network is generated, the prediction of distortion amount of target object is obtained;
The true strain amount for obtaining target object, using the prediction of distortion amount of the generation network output as differentiation network Input value determines whether the prediction of distortion amount for generating network output meets expectation in conjunction with the true strain amount;
When the prediction of distortion amount of the generation network output meets desired, determine that the prediction of distortion amount is Deformation Prediction Model.
Further, following operation is also realized when the Deformation Prediction program is executed by processor:
The generation network includes variation autocoder and decoder;
The variation autocoder successively includes five convolutional layers, a full articulamentum and two μ and σ layers, In, each warp lamination includes an activation primitive;
The decoder successively includes five warp laminations, wherein each warp lamination includes an activation primitive.
Further, following operation is also realized when the Deformation Prediction program is executed by processor:
Determine the material property of target object and the stress condition of target object;
First condition vector is generated based on the material property and stress condition.
Further, following operation is also realized when the Deformation Prediction program is executed by processor:
Obtain the true strain amount that physical modeler generates;
The prediction of distortion amount input of the generation network output is differentiated into network, in conjunction with the true strain amount and is based on mesh The second condition vector that object properties generate is marked, determines whether the prediction of distortion amount meets expectation.
Further, following operation is also realized when the Deformation Prediction program is executed by processor:
Several real-world object deformation patterns are collected, corresponding image data base is established;
Deformation Prediction network is trained based on described image database, so that the prediction energy of the Deformation Prediction network Power gradually increases.
Further, following operation is also realized when the Deformation Prediction program is executed by processor:
Several voxel grids and conditional vector are obtained based on image data base, if generating by physical modeler by described Several ground truths pair of dry voxel grid and conditional vector composition;
With several described ground truths to training Deformation Prediction network, the related ginseng of Deformation Prediction network is continued to optimize Number, so that the predictive ability of the Deformation Prediction network gradually increases.
It should be noted that, in this document, the terms "include", "comprise" or its any other variant are intended to non-row His property includes, so that the process, method, article or the system that include a series of elements not only include those elements, and And further include other elements that are not explicitly listed, or further include for this process, method, article or system institute it is intrinsic Element.In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including being somebody's turn to do There is also other identical elements in the process, method of element, article or system.
The serial number of the above embodiments of the invention is only for description, does not represent the advantages or disadvantages of the embodiments.
Through the above description of the embodiments, those skilled in the art can be understood that above-described embodiment side Method can be realized by means of software and necessary general hardware platform, naturally it is also possible to by hardware, but in many cases The former is more preferably embodiment.Based on this understanding, technical solution of the present invention substantially in other words does the prior art The part contributed out can be embodied in the form of software products, which is stored in one as described above In storage medium (such as ROM/RAM, magnetic disk, CD), including some instructions are used so that terminal device (it can be mobile phone, Computer, server, air conditioner or network equipment etc.) execute method described in each embodiment of the present invention.
The above is only a preferred embodiment of the present invention, is not intended to limit the scope of the invention, all to utilize this hair Equivalent structure or equivalent flow shift made by bright specification and accompanying drawing content is applied directly or indirectly in other relevant skills Art field, is included within the scope of the present invention.

Claims (10)

1. a kind of Deformation Prediction method is applied to Deformation Prediction system, which is characterized in that the Deformation Prediction system includes three-dimensional Rebuild network and Deformation Prediction network, the Deformation Prediction method the following steps are included:
It obtains the depth image of target object and inputs three-dimensional reconstruction network, be based on the depth to obtain the three-dimensional reconstruction network Spend the threedimensional model that image generates;
The threedimensional model and the conditional vector generated based on target object characteristic are inputted into Deformation Prediction network, it is pre- to obtain deformation The prediction model of deformation of survey grid network output;
Task is executed based on the prediction model of deformation.
2. Deformation Prediction method as described in claim 1, which is characterized in that the depth image for obtaining target object is simultaneously defeated Enter three-dimensional reconstruction network, packet the step of to obtain threedimensional model that the three-dimensional reconstruction network is generated based on the depth image It includes:
The 2.5D depth image is inputted three-dimensional reconstruction network by the 2.5D depth image for obtaining target object;
Three-dimensional reconstruction processing is carried out to the 2.5D depth image in three-dimensional reconstruction network, generates corresponding threedimensional model.
3. Deformation Prediction method as described in claim 1, which is characterized in that the Deformation Prediction network include generate network and Differentiate network, it is described that the threedimensional model and the conditional vector generated based on target object characteristic are inputted into Deformation Prediction network, Obtain Deformation Prediction network output prediction model of deformation the step of include:
First condition vector is generated based on target object characteristic, the threedimensional model and first condition vector input are generated Network obtains the prediction of distortion amount of target object;
The true strain amount for obtaining target object, using the prediction of distortion amount of the generation network output as the input for differentiating network Value determines whether the prediction of distortion amount for generating network output meets expectation in conjunction with the true strain amount;
When the prediction of distortion amount of the generation network output meets desired, determine that the prediction of distortion amount is Deformation Prediction mould Type.
4. Deformation Prediction method as claimed in claim 3, which is characterized in that the differentiation network successively includes five deconvolution Layer, wherein each warp lamination includes an activation primitive;
The generation network includes variation autocoder and decoder;
The variation autocoder successively includes five convolutional layers, a full articulamentum and two μ and σ layers, wherein every A warp lamination all includes an activation primitive;
The decoder successively includes five warp laminations, wherein each warp lamination includes an activation primitive.
5. Deformation Prediction method as claimed in claim 3, which is characterized in that described to generate first based on target object characteristic The step of part vector includes:
Determine the material property of target object and the stress condition of target object;
First condition vector is generated based on the material property and stress condition.
6. Deformation Prediction method as claimed in claim 3, which is characterized in that the true strain amount for obtaining target object, Life is determined in conjunction with the true strain amount using the prediction of distortion amount of the generation network output as the input value for differentiating network Include: at whether the prediction of distortion amount that network exports meets the step of desired
Obtain the true strain amount that physical modeler generates;
The prediction of distortion amount input of the generation network output is differentiated into network, in conjunction with the true strain amount and is based on object The second condition vector that bulk properties generates, determines whether the prediction of distortion amount meets expectation.
7. Deformation Prediction method as described in claim 1, which is characterized in that the depth image for obtaining target object is simultaneously defeated Enter three-dimensional reconstruction network, the step of to obtain threedimensional model that the three-dimensional reconstruction network is generated based on the depth image it Before, the method also includes:
Several real-world object deformation patterns are collected, corresponding image data base is established;
Deformation Prediction network is trained based on described image database so that the predictive ability of the Deformation Prediction network by It is cumulative strong.
8. Deformation Prediction method as claimed in claim 7, which is characterized in that described pre- to deforming based on described image database Survey grid network is trained, so that the predictive ability of the Deformation Prediction network includes: the step of gradually increasing
Obtain several voxel grids and conditional vector based on image data base, by physical modeler generate by it is described several Several ground truths pair of voxel grid and conditional vector composition;
With several described ground truths to training Deformation Prediction network, the relevant parameter of Deformation Prediction network is continued to optimize, with Gradually increase the predictive ability of the Deformation Prediction network.
9. a kind of Deformation Prediction device, which is characterized in that the Deformation Prediction device includes: memory, processor and is stored in On the memory and the Deformation Prediction program that can run on the processor, the Deformation Prediction program is by the processor It realizes when execution such as the step of Deformation Prediction method described in any item of the claim 1 to 8.
10. a kind of computer readable storage medium, which is characterized in that it is pre- to be stored with deformation on the computer readable storage medium Ranging sequence realizes such as Deformation Prediction described in any item of the claim 1 to 8 when the Deformation Prediction program is executed by processor The step of method.
CN201910365833.0A 2019-04-30 2019-04-30 Deformation Prediction method, apparatus and computer readable storage medium Pending CN110084845A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910365833.0A CN110084845A (en) 2019-04-30 2019-04-30 Deformation Prediction method, apparatus and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910365833.0A CN110084845A (en) 2019-04-30 2019-04-30 Deformation Prediction method, apparatus and computer readable storage medium

Publications (1)

Publication Number Publication Date
CN110084845A true CN110084845A (en) 2019-08-02

Family

ID=67418455

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910365833.0A Pending CN110084845A (en) 2019-04-30 2019-04-30 Deformation Prediction method, apparatus and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN110084845A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110517352A (en) * 2019-08-27 2019-11-29 电子科技大学 A kind of three-dimensional rebuilding method of object, storage medium, terminal and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017120897A1 (en) * 2016-01-15 2017-07-20 武汉武大卓越科技有限责任公司 Object surface deformation feature extraction method based on line scanning three-dimensional point cloud
CN107220600A (en) * 2017-05-17 2017-09-29 清华大学深圳研究生院 A kind of Picture Generation Method and generation confrontation network based on deep learning
CN108021131A (en) * 2017-11-28 2018-05-11 王智华 A kind of robot vision recognition methods, device and computer-readable recording medium
CN108510573A (en) * 2018-04-03 2018-09-07 南京大学 A method of the multiple views human face three-dimensional model based on deep learning is rebuild
CN108564119A (en) * 2018-04-04 2018-09-21 华中科技大学 A kind of any attitude pedestrian Picture Generation Method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017120897A1 (en) * 2016-01-15 2017-07-20 武汉武大卓越科技有限责任公司 Object surface deformation feature extraction method based on line scanning three-dimensional point cloud
CN107220600A (en) * 2017-05-17 2017-09-29 清华大学深圳研究生院 A kind of Picture Generation Method and generation confrontation network based on deep learning
CN108021131A (en) * 2017-11-28 2018-05-11 王智华 A kind of robot vision recognition methods, device and computer-readable recording medium
CN108510573A (en) * 2018-04-03 2018-09-07 南京大学 A method of the multiple views human face three-dimensional model based on deep learning is rebuild
CN108564119A (en) * 2018-04-04 2018-09-21 华中科技大学 A kind of any attitude pedestrian Picture Generation Method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ZHIHUA WANG 等: "3D-PhysNet: Learning the Intuitive Physics of Non-Rigid Object Deformations", 《ARXIV:1805.00328V2》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110517352A (en) * 2019-08-27 2019-11-29 电子科技大学 A kind of three-dimensional rebuilding method of object, storage medium, terminal and system
CN110517352B (en) * 2019-08-27 2022-06-03 电子科技大学 Three-dimensional reconstruction method, storage medium, terminal and system of object

Similar Documents

Publication Publication Date Title
US11941719B2 (en) Learning robotic tasks using one or more neural networks
CN108021131B (en) Robot vision identification method and device and computer readable storage medium
CN112614213B (en) Facial expression determining method, expression parameter determining model, medium and equipment
IL297846B1 (en) Deep learning system for cuboid detection
Shi et al. RoboCraft: Learning to see, simulate, and shape elasto-plastic objects in 3D with graph networks
CN105051755A (en) Part and state detection for gesture recognition
CN104508709A (en) Animating objects using the human body
CN110503686A (en) Object pose estimation method and electronic equipment based on deep learning
CN112991503B (en) Model training method, device, equipment and medium based on skin weight
CN110135497A (en) Method, the method and device of Facial action unit intensity estimation of model training
US11138812B1 (en) Image processing for updating a model of an environment
CN112348285B (en) Crowd evacuation simulation method in dynamic environment based on deep reinforcement learning
CN108460152A (en) A kind of the space-filling curve method, apparatus and computer readable storage medium of data
CN115131849A (en) Image generation method and related device
CN114140841A (en) Point cloud data processing method, neural network training method and related equipment
CN108845833A (en) Intelligent shutdown method, apparatus and computer readable storage medium
CN110084845A (en) Deformation Prediction method, apparatus and computer readable storage medium
CN113066125A (en) Augmented reality method and related equipment thereof
CN116968024A (en) Method, computing device and medium for obtaining control strategy for generating shape closure grabbing pose
Kiatos et al. Grasping unknown objects by exploiting complementarity with robot hand geometry
CN110070542A (en) Machine learning method, device and the computer readable storage medium of intuition physics
CN113869189A (en) Human behavior recognition method, system, device and medium
CN112037315A (en) Method and device for generating local descriptor and method and device for generating model
CN115601513A (en) Model hyper-parameter selection method and related device
Kim et al. Automatic labeled LiDAR data generation and distance-based ensemble learning for human segmentation

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190802

RJ01 Rejection of invention patent application after publication