WO2022148379A1 - Image processing method and apparatus, electronic device, and readable storage medium - Google Patents

Image processing method and apparatus, electronic device, and readable storage medium Download PDF

Info

Publication number
WO2022148379A1
WO2022148379A1 PCT/CN2022/070336 CN2022070336W WO2022148379A1 WO 2022148379 A1 WO2022148379 A1 WO 2022148379A1 CN 2022070336 W CN2022070336 W CN 2022070336W WO 2022148379 A1 WO2022148379 A1 WO 2022148379A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
key feature
migrated
initial
target
Prior art date
Application number
PCT/CN2022/070336
Other languages
French (fr)
Chinese (zh)
Inventor
李益永
黄秋实
孙准
井雪
项伟
Original Assignee
百果园技术(新加坡)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 百果园技术(新加坡)有限公司 filed Critical 百果园技术(新加坡)有限公司
Publication of WO2022148379A1 publication Critical patent/WO2022148379A1/en

Links

Images

Classifications

    • G06T3/04
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/20Movements or behaviour, e.g. gesture recognition

Definitions

  • the present disclosure belongs to the technical field of image processing, and in particular, relates to an image processing method, apparatus, electronic device and readable storage medium.
  • Pose transfer means that after an image A is processed, the person P in the image A has the pose of the person H in the other image B, and a composite image C is obtained.
  • multiple images A, multiple images B and multiple images C are used as training samples to train an image transfer model, and then the new image A and image B are processed according to the image transfer model to obtain New composite image C.
  • the present disclosure provides an image processing method, which solves the problems of cumbersome migration process and incomplete migration to a certain extent.
  • a first aspect of the embodiments of the present disclosure provides an image processing method, the method includes:
  • the image to be migrated includes: a target object whose posture is to be converted;
  • the reference image includes: a reference object showing a reference posture;
  • a target composite image is determined according to the pose transfer matrix, the image to be transferred, and the initial image.
  • a second aspect of the embodiments of the present disclosure provides an image processing apparatus, the apparatus includes:
  • a first acquisition module configured to acquire an image to be migrated and a reference image;
  • the image to be migrated includes: a target object whose posture is to be converted;
  • the reference image includes: a reference object that presents a reference posture;
  • a second acquisition module configured to acquire the first key feature of the target object and the second key feature of the reference object
  • a first determination module configured to determine a posture transition matrix according to the first key feature and the second key feature
  • the third acquisition module is used to acquire the initial image
  • the second determining module is configured to determine a target composite image according to the posture transfer matrix, the image to be transferred and the initial image.
  • a third aspect of the embodiments of the present disclosure provides an electronic device, the electronic device includes a processor, a memory, and a program or instruction stored on the memory and executable on the processor, the program or instruction being executed by the The processor implements the steps of the method as described in the first aspect when executed.
  • a fourth aspect of the embodiments of the present disclosure provides a readable storage medium, where a program or an instruction is stored on the readable storage medium, and when the program or instruction is executed by a processor, the steps of the method according to the first aspect are implemented.
  • the image to be migrated includes: a target object whose posture is to be converted; the reference image includes: a reference object showing a reference posture; the target object is acquired The first key feature of the reference object and the second key feature of the reference object; determine a posture transfer matrix according to the first key feature and the second key feature; obtain an initial image; The migration image and the initial image determine a target composite image.
  • it is not necessary to acquire a large number of training samples to train the model to obtain the target composite image, which reduces the tediousness of image migration, and to acquire the initial image, the pose transfer matrix, the image to be migrated and the initial image are used to analyze the entire image.
  • the image to be migrated is migrated, so it can be ensured that the details of the image to be migrated are displayed in the target composite image, preventing the omission of details.
  • FIG. 1 is a flowchart of steps of an image processing method provided by an embodiment of the present disclosure
  • FIG. 2 is a schematic diagram of an image processing method provided by an embodiment of the present disclosure
  • FIG. 3 is a schematic diagram of another image processing method provided by an embodiment of the present disclosure.
  • FIG. 4 is a block diagram of an image processing apparatus provided by an embodiment of the present disclosure.
  • FIG. 5 is a structural block diagram of an electronic device provided by an embodiment of the present disclosure.
  • FIG. 1 a flowchart of steps of an image processing method provided by an embodiment of the present disclosure is shown, and the image processing method specifically includes the following steps:
  • Step 101 Obtain an image to be migrated and a reference image; the image to be migrated includes: a target object whose posture is to be converted; and the reference image includes: a reference object showing a reference posture.
  • the image to be migrated is an image of m 1 *n 1 *3, where m 1 is the width of the image to be migrated, n 1 is the height of the image to be migrated, and 3 means that the image to be migrated is an RGB image.
  • the reference image is an image of m 2 *n 2 *3, where m 2 is the width of the reference image, n 2 is the height of the reference image, and 3 means that the reference image is an RGB image.
  • the target object and the reference object usually refer to the human body object in the image; with reference to FIG. 2 , wherein, the image A is the image to be migrated, and the image B is the reference image; the image A to be migrated includes the target object P,
  • the reference image B includes: the reference object H.
  • the user can select the image to be migrated and the reference image from the image memory according to requirements, and can also capture and obtain at any time, which is not limited.
  • the user can select a video as a reference video, then use each frame of image in the reference video as a reference image, and then process the image to be migrated based on each frame of reference image.
  • Step 102 Acquire a first key feature of the target object and a second key feature of the reference object.
  • each pixel in the image to be migrated can be represented by a dimensional vector or coordinate
  • each pixel in the reference image can also be represented by a dimensional vector or coordinate.
  • an image has 10 rows*10 columns of pixels, then the coordinates of the pixel p in the 5th row and 5th column are represented as (5,5); the pixel p is represented by a one-dimensional vector as p(45).
  • the first key feature refers to the coordinates of multiple feature points that can mark the pose of the target object; for example, the first key feature may be the coordinates of each joint of the target object; each joint includes: shoulder joint, elbow Joints, radiocarpal joints, carpal metacarpal joints, hip joints, knee joints, ankle joints, etc.
  • the first key feature may also be the coordinates of the main parts of the human body, for example, the parts that characterize the posture of the head, including: eyes, nose tip, temples, and the tip of the chin; the parts that characterize the posture of the arms, including: shoulder joints, elbow joints and Carpal-metacarpal joint; parts that characterize hand posture, including: knuckles and fingertips of each finger; parts that characterize leg posture, including: hip joint, knee joint, ankle joint.
  • the parts that characterize the posture of the head including: eyes, nose tip, temples, and the tip of the chin
  • the parts that characterize the posture of the arms including: shoulder joints, elbow joints and Carpal-metacarpal joint
  • parts that characterize hand posture including: knuckles and fingertips of each finger
  • parts that characterize leg posture including: hip joint, knee joint, ankle joint.
  • the first key feature is a preset key feature in the target object; the second key feature is in one-to-one correspondence with the first key feature.
  • the second key feature can be obtained according to the first key feature, wherein the second key feature corresponds to the first key feature, for example, the first key feature includes: the shoulder joint of the target object, The coordinates of the elbow joint, radiocarpal joint, carpal metacarpal joint, hip joint, knee joint, and ankle joint in the image to be migrated, then the second key feature includes: the shoulder joint, elbow joint, radiocarpal joint, and carpal metacarpal joint of the reference object , the coordinates of the hip, knee, and ankle joints in the reference image.
  • the first key feature is set as the coordinates of each feature point of the human face, such as eyes, nose, eyebrows, ears, mouth, etc.
  • the user can select the part to be migrated as required. For example, when only the face is migrated, only the first key feature of the face is selected, and when only the body is migrated, only the first key feature of the body is selected. key features.
  • Step 103 Determine a posture transition matrix according to the first key feature and the second key feature.
  • the step 103 includes: determining the coordinate value of each of the first key features and the coordinate value of each of the second key features;
  • the attitude transfer matrix is determined according to the coordinate value of the first key feature and the coordinate value of the second key feature, and the attitude transfer matrix is used to convert the coordinate value of the first key feature into a The coordinate value of the second key feature corresponding to the first key feature.
  • the attitude transfer matrix refers to the attitude transfer matrix required for the coordinates of the first key feature to be transferred to the coordinates of the second key feature.
  • the first key feature includes: the coordinates of the temple (a, b), the shoulder joint (c, d), the coordinates of the temple (m, n) of the second key feature, and the shoulder joint (o, p);
  • the coordinates of the temple of the first key feature are stored as (m, n)
  • the coordinates of the shoulder joint of the first key feature are stored as (o, p)
  • the coordinates of the elbow joint of the first key feature are stored as (q, r ), and so on.
  • the attitude transition matrix W is obtained.
  • the attitude transition matrix W can also be obtained in this way.
  • the coordinates of each first key feature are Px
  • the coordinates of each second key feature are Py
  • each pixel point of the image to be transferred can be transferred using the attitude transfer matrix W.
  • Step 104 acquiring an initial image.
  • the initial image is an initial image that needs to be input in order to complete the subsequent steps of obtaining the target composite image by using a preset method in the embodiment of the present disclosure.
  • step 104 includes: inputting the pose transfer matrix and the image to be transferred into an initial network model to obtain an initial image.
  • the initial network model may be a model trained according to data samples, wherein the data samples include: a plurality of pose transfer matrix samples for converting image samples to be migrated into reference image samples, and a plurality of image samples to be migrated and multiple targets to synthesize image samples; use these data samples to train to obtain the initial network model; then input the attitude transfer matrix and the image to be migrated into the initial network model obtained by training, that is, to obtain the initial image, using this method to obtain the initial image, is
  • the target object in the image to be migrated adopts the initial composite image in the pose of the reference object, but the details of the initial composite image are still missing and cannot fully present all the features of the image to be migrated.
  • the details of the image to be migrated can be completed after proceeding with the next steps.
  • the initial image obtained in this way has some features of the image to be migrated, but it is not very clear, and all the pixels in the image to be migrated have not been migrated. Using the initial image obtained in this way as the basis for subsequent calculations can improve the quality of the image to be migrated.
  • step 104 includes: taking a preset image whose dimension vector is zero as the initial image.
  • the preset image can be stored in the memory, and is called when the image to be migrated is processed.
  • the dimensional vector corresponding to the initial image may also be assigned a value of zero, and then subsequent calculations are performed.
  • Step 105 Determine a target composite image according to the pose transfer matrix, the image to be transferred, and the initial image.
  • step 105 includes: obtaining an intermediate composite image according to a preset method, the pose transfer matrix, the image to be migrated, and the initial image; and using the intermediate composite image as a new The initial image is performed cyclically for a preset number of times according to the preset method, as well as the posture transition matrix, the image to be migrated and the initial image to obtain an intermediate composite image.
  • F(Z, Px, Py) is set to represent the target object in the image to be migrated from the gesture of the target object to the dimensional vector of the target composite image in the gesture of the reference object. Then it is required that when min ⁇ F[z, P x , P y ]-x ⁇ approaches 0, the details of the image to be migrated are displayed in the target composite image. Among them, x is the dimension vector of the image to be migrated. The following solution steps for min ⁇ F[z, P x , P y ]-x ⁇ approaching 0;
  • the preset mode described in the embodiment of the present disclosure is specifically as follows:
  • Z k+1 is the first dimension vector of the intermediate composite image
  • Z k is the second dimension vector in the initial image
  • W is the attitude transition matrix
  • r k r k-1 + ⁇ k-1 Apk-1
  • p k r k + ⁇ k-1 p k-1 ;
  • the model obtains the initial fourth dimension vector of the initial image.
  • the attitude transition matrix W is the above W[P x , P y ].
  • Z 1 x/WZ 0 is obtained, wherein Z 1 is the first time that the attitude transition matrix W, the image to be migrated x and the initial image Z 0 are input into the formula corresponding to the above preset method, and the dimensional vector of the intermediate composite image is obtained; Z 0 is the dimensional vector of the obtained initial image.
  • the dimensional vector of the above-mentioned image may be a one-dimensional vector, a two-dimensional vector, or a three-dimensional vector, which is not limited herein.
  • the preset number of times is greater than or equal to 2; when the preset number of times is 2, the final target composite image is Z 2 .
  • the intermediate composite image as the new initial image can be cyclically executed, and the posture according to the preset method and the posture can be executed cyclically for a preset number of times.
  • the transition matrix, the image to be migrated, and the initial image are used to obtain an intermediate composite image, until the target composite image is obtained to be a satisfactory image for the user.
  • the method includes: multiple frames of reference images; the reference images include a time sequence; after step 105, the method further includes: arranging the multiple frames of the target composite images according to the time sequence to obtain a target composite video .
  • the method further includes: inputting the target composite image into the completion model to obtain the final composite image.
  • the completion model is used to complete the missing parts in the target synthetic image. For example, when the image to be migrated input by the user is an image lacking a face or a part of limbs, the completion model completes these missing parts.
  • the completion model can be trained based on a large number of images as training samples; for example, a back photo (without face photo), a legless photo, an armless photo and a corresponding full body photo are used as training samples to train the completion model.
  • multiple frames of reference images with a time sequence form a reference video; the user can click to upload the reference video, the reference video includes: multiple frames of reference images, and the multiple frames of reference images have a corresponding time sequence; Steps 101 to 105 are performed in sequence for each frame of image in the video, and finally a multi-frame target composite image is obtained, and the multi-frame target composite image is arranged in a time series to obtain the final target composite video.
  • the method further includes: identifying each frame of images in the reference video, selecting an image including a human object as a reference image; taking an image not including a human object as a transition image; then finally synthesizing the multi-frame target image and the transition image
  • the images are arranged in time series to obtain the target composite video.
  • the reference video includes dancing actions or other actions, which are not limited herein.
  • the step 105 includes: extracting the target object in the image to be migrated; determining a composite object according to the pose transfer matrix, the target object and the initial image; The background of the image is combined with the composite object to obtain the target composite image.
  • the target object in the image to be migrated is migrated, and the background of the target object in the image to be migrated is not migrated.
  • the obtained human body The object is a composite object, and then the background of the reference image and the composite object are composited. Referring to FIG. 3 , after the target object corresponding to the migration image A is migrated, the background of the reference image B is used to obtain the target composite image C.
  • the entire image A to be migrated may also be migrated to obtain the target composite image C.
  • the image to be migrated includes: a target object whose posture is to be converted; the reference image includes: a reference object showing a reference posture; the target object is acquired The first key feature of the reference object and the second key feature of the reference object; determine a posture transfer matrix according to the first key feature and the second key feature; obtain an initial image; The migration image and the initial image determine a target composite image.
  • it is not necessary to acquire a large number of training samples to train the model to obtain the target composite image, which reduces the tediousness of image migration, and to acquire the initial image, the pose transfer matrix, the image to be migrated and the initial image are used to analyze the entire image.
  • the image to be migrated is migrated, so it can be ensured that the details of the image to be migrated are displayed in the target composite image, preventing the omission of details.
  • FIG. 4 is a block diagram of an image processing apparatus provided by an embodiment of the present disclosure. As shown in the figure, the apparatus may include:
  • a first acquisition module configured to acquire an image to be migrated and a reference image;
  • the image to be migrated includes: a target object whose posture is to be converted;
  • the reference image includes: a reference object that presents a reference posture;
  • a second acquisition module configured to acquire the first key feature of the target object and the second key feature of the reference object
  • a first determination module configured to determine a posture transition matrix according to the first key feature and the second key feature
  • the third acquisition module is used to acquire the initial image
  • the second determining module is configured to determine a target composite image according to the posture transfer matrix, the image to be transferred and the initial image.
  • the image processing apparatus provided by the embodiment of the present disclosure has functional modules corresponding to executing the image processing method, can execute the image processing method provided by the embodiment of the present disclosure, and can achieve the same beneficial effects.
  • an electronic device may include: a processor, a memory, and a computer program stored on the memory and executable on the processor, the When the processor executes the program, each process of the above image processing method embodiment is implemented, and the same technical effect can be achieved. To avoid repetition, details are not repeated here.
  • the electronic device may specifically include: a processor 301 , a storage device 302 , a display screen 303 with a touch function, an input device 304 , an output device 305 , and a communication device 306 .
  • the number of processors 301 in the electronic device may be one or more, and one processor 301 is taken as an example in FIG. 5 .
  • the processor 301 , the storage device 302 , the display screen 303 , the input device 304 , the output device 305 and the communication device 306 of the electronic device may be connected by a bus or in other ways.
  • a computer-readable storage medium is also provided, where instructions are stored in the computer-readable storage medium, when the computer-readable storage medium runs on a computer, the computer causes the computer to execute any one of the foregoing embodiments. the image processing method.

Abstract

The present disclosure provides an image processing method and apparatus. The image processing method comprises: acquiring an image to be migrated and a reference image, the image to be migrated comprising a target object of which the posture is to be converted; the reference image comprising a reference object presenting a reference posture; acquiring a first key feature of the target object and a second key feature of the reference object; determining a posture migration matrix according to the first key feature and the second key feature; acquiring an initial image; and determining a target synthetic image according to the posture migration matrix, the image to be migrated, and the initial image. In embodiments of the present disclosure, a large number of training sample training models do not need to be acquired to obtain the target synthetic image, so that the complexity of image migration is reduced; moreover, the initial image is acquired, and the whole image to be migrated is migrated according to the posture migration matrix, the image to be migrated, and the initial image, so that all the details of the image to be migrated can be ensured to be displayed in the target synthetic image, and details are prevented from being missed.

Description

图像处理方法、装置、电子设备和可读存储介质Image processing method, apparatus, electronic device and readable storage medium
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本公开要求在2021年01月05日提交中国专利局、申请号为202110009523.2、名称为“图像处理方法、装置、电子设备和可读存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本公开中。This disclosure claims the priority of a Chinese patent application with application number 202110009523.2 and titled "Image Processing Method, Apparatus, Electronic Device, and Readable Storage Medium" filed with the China Patent Office on January 5, 2021, the entire contents of which are hereby incorporated by reference Incorporated in this disclosure.
技术领域technical field
本公开属于图像处理技术领域,特别是涉及一种图像处理方法、装置、电子设备和可读存储介质。The present disclosure belongs to the technical field of image processing, and in particular, relates to an image processing method, apparatus, electronic device and readable storage medium.
背景技术Background technique
姿态迁移是指,一张图像A经过处理后,使图像A中的人物P具有另一张图像B的人物H的姿态,得到合成图像C。Pose transfer means that after an image A is processed, the person P in the image A has the pose of the person H in the other image B, and a composite image C is obtained.
目前,为了实现姿态迁移,采用多张图像A、多张图像B和多张图像C作为训练样本,训练一图像迁移模型,然后根据该图像迁移模型对新的图像A和图像B进行处理,得到新的合成图像C。At present, in order to achieve pose transfer, multiple images A, multiple images B and multiple images C are used as training samples to train an image transfer model, and then the new image A and image B are processed according to the image transfer model to obtain New composite image C.
上述姿态迁移方式,在训练图像迁移模型时,需要准备大量的训练样本,训练方式繁琐。并且,采用上述图像迁移模型进行图像迁移时,当两张图像中的人物的衣着体型相差较大时,合成图像C中的人物不能保持原图像A中的人物P的细节,不同合成图像C中的人物在不同视角和姿态下的形状相差较大,此外也可能会出现人物中只有部分人体得到了迁移,其他人体部分则需要再次进行处理才能达到迁移,导致迁移过程繁琐。In the above pose transfer method, when training the image transfer model, a large number of training samples need to be prepared, and the training method is cumbersome. Moreover, when the above-mentioned image migration model is used for image migration, when the clothing and body shapes of the characters in the two images are quite different, the characters in the composite image C cannot keep the details of the character P in the original image A. The shapes of the characters in different perspectives and postures are quite different. In addition, it may occur that only part of the human body of the characters has been migrated, and other parts of the human body need to be processed again to achieve the migration, resulting in a cumbersome migration process.
概述Overview
有鉴于此,本公开提供一种图像处理方法,在一定程度上解决了迁移过 程繁琐,并且迁移不全面的问题。In view of this, the present disclosure provides an image processing method, which solves the problems of cumbersome migration process and incomplete migration to a certain extent.
本公开实施例第一方面提供一种图像处理方法,所述方法包括:A first aspect of the embodiments of the present disclosure provides an image processing method, the method includes:
获取待迁移图像和参考图像;所述待迁移图像中包括:待转换姿态的目标对象;所述参考图像中包括:呈现参考姿态的参考对象;Acquiring an image to be migrated and a reference image; the image to be migrated includes: a target object whose posture is to be converted; the reference image includes: a reference object showing a reference posture;
获取所述目标对象的第一关键特征以及所述参考对象的第二关键特征;acquiring the first key feature of the target object and the second key feature of the reference object;
根据所述第一关键特征和所述第二关键特征,确定姿态迁移矩阵;Determine a posture transition matrix according to the first key feature and the second key feature;
获取初始图像;get the initial image;
根据所述姿态迁移矩阵、所述待迁移图像以及所述初始图像,确定目标合成图像。A target composite image is determined according to the pose transfer matrix, the image to be transferred, and the initial image.
本公开实施例第二方面提供一种图像处理装置,所述装置包括:A second aspect of the embodiments of the present disclosure provides an image processing apparatus, the apparatus includes:
第一获取模块,用于获取待迁移图像和参考图像;所述待迁移图像中包括:待转换姿态的目标对象;所述参考图像中包括:呈现参考姿态的参考对象;a first acquisition module, configured to acquire an image to be migrated and a reference image; the image to be migrated includes: a target object whose posture is to be converted; the reference image includes: a reference object that presents a reference posture;
第二获取模块,用于获取所述目标对象的第一关键特征以及所述参考对象的第二关键特征;a second acquisition module, configured to acquire the first key feature of the target object and the second key feature of the reference object;
第一确定模块,用于根据所述第一关键特征和所述第二关键特征,确定姿态迁移矩阵;a first determination module, configured to determine a posture transition matrix according to the first key feature and the second key feature;
第三获取模块,用于获取初始图像;The third acquisition module is used to acquire the initial image;
第二确定模块,用于根据所述姿态迁移矩阵、所述待迁移图像以及所述初始图像,确定目标合成图像。The second determining module is configured to determine a target composite image according to the posture transfer matrix, the image to be transferred and the initial image.
本公开实施例第三方面提供一种电子设备,该电子设备包括处理器、存储器及存储在所述存储器上并可在所述处理器上运行的程序或指令,所述程序或指令被所述处理器执行时实现如第一方面所述的方法的步骤。A third aspect of the embodiments of the present disclosure provides an electronic device, the electronic device includes a processor, a memory, and a program or instruction stored on the memory and executable on the processor, the program or instruction being executed by the The processor implements the steps of the method as described in the first aspect when executed.
本公开实施例第四方面提供一种可读存储介质,该可读存储介质上存储程序或指令,所述程序或指令被处理器执行时实现如第一方面所述的方法的步骤。A fourth aspect of the embodiments of the present disclosure provides a readable storage medium, where a program or an instruction is stored on the readable storage medium, and when the program or instruction is executed by a processor, the steps of the method according to the first aspect are implemented.
在本公开实施例中,通过获取待迁移图像和参考图像;所述待迁移图像中包括:待转换姿态的目标对象;所述参考图像中包括:呈现参考姿态的参考对象;获取所述目标对象的第一关键特征以及所述参考对象的第二关键特征;根据所述第一关键特征和所述第二关键特征,确定姿态迁移矩阵;获取初始图像;根据所述姿态迁移矩阵、所述待迁移图像以及所述初始图像,确定目标合成图像。在本公开实施例中,并不需要获取大量的训练样本训练模型来得到目标合成图像,降低了图像迁移的繁琐程度,并且获取初始图像,采用姿态迁移矩阵、待迁移图像和初始图像以对整个待迁移图像进行迁移,因此,能够保证待迁移图像的细节均在目标合成图像中展示,防止细节的遗漏。In the embodiment of the present disclosure, by acquiring an image to be migrated and a reference image; the image to be migrated includes: a target object whose posture is to be converted; the reference image includes: a reference object showing a reference posture; the target object is acquired The first key feature of the reference object and the second key feature of the reference object; determine a posture transfer matrix according to the first key feature and the second key feature; obtain an initial image; The migration image and the initial image determine a target composite image. In the embodiment of the present disclosure, it is not necessary to acquire a large number of training samples to train the model to obtain the target composite image, which reduces the tediousness of image migration, and to acquire the initial image, the pose transfer matrix, the image to be migrated and the initial image are used to analyze the entire image. The image to be migrated is migrated, so it can be ensured that the details of the image to be migrated are displayed in the target composite image, preventing the omission of details.
上述说明仅是本公开技术方案的概述,为了能够更清楚了解本公开的技术手段,而可依照说明书的内容予以实施,并且为了让本公开的上述和其它目的、特征和优点能够更明显易懂,以下特举本公开的具体实施方式。The above description is only an overview of the technical solutions of the present disclosure. In order to understand the technical means of the present disclosure more clearly, it can be implemented according to the contents of the description, and in order to make the above-mentioned and other purposes, features and advantages of the present disclosure more obvious and easy to understand , the following specific embodiments of the present disclosure are given.
附图说明Description of drawings
通过阅读下文优选实施方式的详细描述,各种其他的优点和益处对于本领域普通技术人员将变得清楚明了。附图仅用于示出优选实施方式的目的,而并不认为是对本公开的限制。而且在整个附图中,用相同的参考符号表示相同的部件。在附图中:Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are for purposes of illustrating preferred embodiments only and are not to be considered limiting of the present disclosure. Also, the same components are denoted by the same reference numerals throughout the drawings. In the attached image:
图1是本公开实施例提供的一种图像处理方法的步骤流程图;FIG. 1 is a flowchart of steps of an image processing method provided by an embodiment of the present disclosure;
图2是本公开实施例提供的一种图像处理方法的示意图;FIG. 2 is a schematic diagram of an image processing method provided by an embodiment of the present disclosure;
图3是本公开实施例提供的另一种图像处理方法的示意图;3 is a schematic diagram of another image processing method provided by an embodiment of the present disclosure;
图4是本公开实施例提供的一种图像处理装置的框图;4 is a block diagram of an image processing apparatus provided by an embodiment of the present disclosure;
图5是本公开实施例提供的一种电子设备的结构框图。FIG. 5 is a structural block diagram of an electronic device provided by an embodiment of the present disclosure.
具体实施例specific embodiment
下面将参照附图更详细地描述本公开的示例性实施例。虽然附图中显示 了本公开的示例性实施例,然而应当理解,可以以各种形式实现本公开而不应被这里阐述的实施例所限制。相反,提供这些实施例是为了能够更透彻地理解本公开,并且能够将本公开的范围完整的传达给本领域的技术人员。Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided so that the present disclosure will be more thoroughly understood, and will fully convey the scope of the present disclosure to those skilled in the art.
参照图1,示出本公开实施例提供的一种图像处理方法的步骤流程图,该图像处理方法具体包括如下步骤:Referring to FIG. 1, a flowchart of steps of an image processing method provided by an embodiment of the present disclosure is shown, and the image processing method specifically includes the following steps:
步骤101,获取待迁移图像和参考图像;所述待迁移图像中包括:待转换姿态的目标对象;所述参考图像中包括:呈现参考姿态的参考对象。Step 101: Obtain an image to be migrated and a reference image; the image to be migrated includes: a target object whose posture is to be converted; and the reference image includes: a reference object showing a reference posture.
其中,待迁移图像为m 1*n 1*3的图像,其中,m 1是待迁移图像的宽,n 1是待迁移图像的高,3是指待迁移图像为RGB图像。参考图像为m 2*n 2*3的图像,其中m 2是参考图像的宽,n 2是参考图像的高,3是指参考图像为RGB图像。 The image to be migrated is an image of m 1 *n 1 *3, where m 1 is the width of the image to be migrated, n 1 is the height of the image to be migrated, and 3 means that the image to be migrated is an RGB image. The reference image is an image of m 2 *n 2 *3, where m 2 is the width of the reference image, n 2 is the height of the reference image, and 3 means that the reference image is an RGB image.
在本公开实施例中,目标对象和参考对象通常是指图像中的人体对象;参照图2,其中,A图像为待迁移图像,B图像为参考图像;待迁移图像A中包括目标对象P,参考图像B中包括:参考对象H。In the embodiment of the present disclosure, the target object and the reference object usually refer to the human body object in the image; with reference to FIG. 2 , wherein, the image A is the image to be migrated, and the image B is the reference image; the image A to be migrated includes the target object P, The reference image B includes: the reference object H.
在本公开实施例中,待迁移图像和参考图像用户可以根据需求在图像存储器中选取,也可以随时拍摄获取,对此不加以限定。In the embodiment of the present disclosure, the user can select the image to be migrated and the reference image from the image memory according to requirements, and can also capture and obtain at any time, which is not limited.
此外,用户可以选取一段视频作为参考视频,则将参考视频中的各帧图像均作为参考图像,然后将待迁移图像基于每帧参考图像进行处理。In addition, the user can select a video as a reference video, then use each frame of image in the reference video as a reference image, and then process the image to be migrated based on each frame of reference image.
步骤102,获取所述目标对象的第一关键特征以及所述参考对象的第二关键特征。Step 102: Acquire a first key feature of the target object and a second key feature of the reference object.
在本公开实施例中,将待迁移图像用维向量表示,则待迁移图像的维向量为:x(i*m*n+j*m+k)=x(j,k,i);其中,3≥i≥1,n≥i≥1,m≥i≥1。将参考图像也用维向量表示,则参考图像的维向量为y(i*m*n+j*m+k)=y(j,k,i);其中,3≥i≥1,n≥i≥1,m≥i≥1。In the embodiment of the present disclosure, the image to be migrated is represented by a dimensional vector, then the dimensional vector of the image to be migrated is: x(i*m*n+j*m+k)=x(j, k, i); where , 3≥i≥1, n≥i≥1, m≥i≥1. The reference image is also represented by a dimensional vector, then the dimensional vector of the reference image is y(i*m*n+j*m+k)=y(j, k, i); among them, 3≥i≥1, n≥ i≥1, m≥i≥1.
具体的,待迁移图像中的每个像素点均可用维向量或坐标表示其位置,参考图像的每个像素点也可以用维向量或坐标表示其位置。例如:一个图像 有10行*10列的像素点,则第5行5列的像素点p的坐标表示为(5,5);该像素点p用一维向量表示为p(45)。Specifically, each pixel in the image to be migrated can be represented by a dimensional vector or coordinate, and each pixel in the reference image can also be represented by a dimensional vector or coordinate. For example: an image has 10 rows*10 columns of pixels, then the coordinates of the pixel p in the 5th row and 5th column are represented as (5,5); the pixel p is represented by a one-dimensional vector as p(45).
在本公开实施例中,第一关键特征是指能够标记目标对象姿态的多个特征点的坐标;例如,第一关键特征可以是目标对象的各个关节的坐标;各个关节包括:肩关节、肘关节、桡腕关节、腕掌关节、髋关节、膝关节、踝关节等。此外,第一关键特征也可以是人体主要部位的坐标,例如,表征头部姿态的部位,包括:眼睛、鼻尖、太阳穴和下巴尖;表征手臂的姿态的部位,包括:肩关节、肘关节和腕掌关节;表征手姿态的部位,包括:各个手指的指关节和指尖;表征腿部姿态的部位,包括:髋关节、膝关节、踝关节。In the embodiment of the present disclosure, the first key feature refers to the coordinates of multiple feature points that can mark the pose of the target object; for example, the first key feature may be the coordinates of each joint of the target object; each joint includes: shoulder joint, elbow Joints, radiocarpal joints, carpal metacarpal joints, hip joints, knee joints, ankle joints, etc. In addition, the first key feature may also be the coordinates of the main parts of the human body, for example, the parts that characterize the posture of the head, including: eyes, nose tip, temples, and the tip of the chin; the parts that characterize the posture of the arms, including: shoulder joints, elbow joints and Carpal-metacarpal joint; parts that characterize hand posture, including: knuckles and fingertips of each finger; parts that characterize leg posture, including: hip joint, knee joint, ankle joint.
在本公开实施例中,所述第一关键特征为所述目标对象中的预设关键特征;所述第二关键特征与所述第一关键特征一一对应。In the embodiment of the present disclosure, the first key feature is a preset key feature in the target object; the second key feature is in one-to-one correspondence with the first key feature.
具体的,当获取到第一关键特征,可根据第一关键特征获取第二关键特征,其中,第二关键特征与第一关键特征对应,例如,第一关键特征包括:目标对象的肩关节、肘关节、桡腕关节、腕掌关节、髋关节、膝关节、踝关节在待迁移图像中的坐标,则第二关键特征包括:参考对象的肩关节、肘关节、桡腕关节、腕掌关节、髋关节、膝关节、踝关节在参考图像中的坐标。Specifically, when the first key feature is obtained, the second key feature can be obtained according to the first key feature, wherein the second key feature corresponds to the first key feature, for example, the first key feature includes: the shoulder joint of the target object, The coordinates of the elbow joint, radiocarpal joint, carpal metacarpal joint, hip joint, knee joint, and ankle joint in the image to be migrated, then the second key feature includes: the shoulder joint, elbow joint, radiocarpal joint, and carpal metacarpal joint of the reference object , the coordinates of the hip, knee, and ankle joints in the reference image.
在本公开实施例中,如果目标图像仅包括脸部图像时,即只对面部姿态进行迁移时,则将第一关键特征设置为人脸部各个特征点的坐标,例如:眼睛、鼻子、眉毛、耳朵、嘴巴等。In this embodiment of the present disclosure, if the target image only includes facial images, that is, when only facial poses are migrated, the first key feature is set as the coordinates of each feature point of the human face, such as eyes, nose, eyebrows, ears, mouth, etc.
在本公开实施例中,用户可以根据需要选择迁移的部位,例如,只对面部进行迁移时,则只选取面部的第一关键特征,当只对躯体进行迁移时,则只选择躯体的第一关键特征。In the embodiment of the present disclosure, the user can select the part to be migrated as required. For example, when only the face is migrated, only the first key feature of the face is selected, and when only the body is migrated, only the first key feature of the body is selected. key features.
步骤103,根据所述第一关键特征和所述第二关键特征,确定姿态迁移矩阵。Step 103: Determine a posture transition matrix according to the first key feature and the second key feature.
在本公开实施例中,所述步骤103,包括:确定各个所述第一关键特征的坐标值,以及各个所述第二关键特征的坐标值;In this embodiment of the present disclosure, the step 103 includes: determining the coordinate value of each of the first key features and the coordinate value of each of the second key features;
根据所述第一关键特征的坐标值和所述第二关键特征的坐标值,确定所述姿态迁移矩阵,所述姿态迁移矩阵用于将所述第一关键特征的坐标值,转换为与所述第一关键特征对应的第二关键特征的坐标值。The attitude transfer matrix is determined according to the coordinate value of the first key feature and the coordinate value of the second key feature, and the attitude transfer matrix is used to convert the coordinate value of the first key feature into a The coordinate value of the second key feature corresponding to the first key feature.
在本公开实施例中,姿态迁移矩阵是指第一关键特征的坐标迁移到第二关键特征的坐标所需要的姿态迁移矩阵。例如,第一关键特征包括:太阳穴的坐标(a,b),肩关节(c,d)时,第二关键特征的太阳穴的坐标(m,n),肩关节(o,p);则将第一关键特征的太阳穴的坐标存储为(m,n),将第一关键特征的肩关节的坐标存储为(o,p),将第一关键特征的肘关节的坐标存储为(q,r),以此类推。其中,根据
Figure PCTCN2022070336-appb-000001
求出姿态迁移矩阵W,当第一关键特征包括多个(大于等于3个)时,也可以采用该种方式求得姿态迁移矩阵W。
In the embodiment of the present disclosure, the attitude transfer matrix refers to the attitude transfer matrix required for the coordinates of the first key feature to be transferred to the coordinates of the second key feature. For example, the first key feature includes: the coordinates of the temple (a, b), the shoulder joint (c, d), the coordinates of the temple (m, n) of the second key feature, and the shoulder joint (o, p); The coordinates of the temple of the first key feature are stored as (m, n), the coordinates of the shoulder joint of the first key feature are stored as (o, p), and the coordinates of the elbow joint of the first key feature are stored as (q, r ), and so on. Among them, according to
Figure PCTCN2022070336-appb-000001
The attitude transition matrix W is obtained. When the first key feature includes multiple (greater than or equal to three) features, the attitude transition matrix W can also be obtained in this way.
在本公开实施例中,各个第一关键特征的坐标为Px,各个第二关键特征的坐标为Py,则W=W[Px,Py]为由第一关键特征变换为第二关键特征所需要的姿态迁移矩阵。In the embodiment of the present disclosure, the coordinates of each first key feature are Px, and the coordinates of each second key feature are Py, then W=W[Px,Py] is required for the transformation from the first key feature to the second key feature The pose transfer matrix of .
其中,当确定姿态迁移矩阵W后,待迁移图像的各个像素点均可采用该姿态迁移矩阵W进行转移。Wherein, after the attitude transfer matrix W is determined, each pixel point of the image to be transferred can be transferred using the attitude transfer matrix W.
步骤104,获取初始图像。 Step 104, acquiring an initial image.
在本公开实施例中,初始图像是本公开实施例为完成后续采用预设方式得到目标合成图像的步骤,需要输入的一个初始图像。In the embodiment of the present disclosure, the initial image is an initial image that needs to be input in order to complete the subsequent steps of obtaining the target composite image by using a preset method in the embodiment of the present disclosure.
在本公开实施例中,步骤104包括:将所述姿态迁移矩阵和所述待迁移图像输入初始网络模型,得到初始图像。In this embodiment of the present disclosure, step 104 includes: inputting the pose transfer matrix and the image to be transferred into an initial network model to obtain an initial image.
在本公开实施例中,初始网络模型可以是根据数据样本训练得到的模型,其中,数据样本包括:多个待迁移图像样本转换至参考图像样本的姿态迁移矩阵样本,以及多个待迁移图像样本和多个目标合成图像样本;采用这些数据样本训练得到初始网络模型;然后将姿态迁移矩阵和所述待迁移图像输入训练得到的初始网络模型,即得到初始图像,采用该方式得到初始图像,是 待迁移图像中的目标对象采用参考对象的姿态下的初始合成图像,但是该初始合成图像的细节还有遗漏,不能完整呈现出待迁移图像的全部特征。在继续执行后续步骤后,即可对待迁移图像的细节补充完整。In this embodiment of the present disclosure, the initial network model may be a model trained according to data samples, wherein the data samples include: a plurality of pose transfer matrix samples for converting image samples to be migrated into reference image samples, and a plurality of image samples to be migrated and multiple targets to synthesize image samples; use these data samples to train to obtain the initial network model; then input the attitude transfer matrix and the image to be migrated into the initial network model obtained by training, that is, to obtain the initial image, using this method to obtain the initial image, is The target object in the image to be migrated adopts the initial composite image in the pose of the reference object, but the details of the initial composite image are still missing and cannot fully present all the features of the image to be migrated. The details of the image to be migrated can be completed after proceeding with the next steps.
此外,初始网络模型的工作原理也可以是Z 0=W*x;其中,Z 0是初始图像的维向量,W是姿态迁移矩阵,x是待迁移图像的维向量。这样得到的初始图像具有待迁移图像的一些特征,但是还不是很清晰,也并未将待迁移图像中所有的像素点进行了迁移。将该方式得到的初始图像作为后续计算的基础,能够提高待迁移图像迁移的质量。 In addition, the working principle of the initial network model can also be Z 0 =W*x; where Z 0 is the dimensional vector of the initial image, W is the pose transfer matrix, and x is the dimensional vector of the image to be migrated. The initial image obtained in this way has some features of the image to be migrated, but it is not very clear, and all the pixels in the image to be migrated have not been migrated. Using the initial image obtained in this way as the basis for subsequent calculations can improve the quality of the image to be migrated.
可选地,步骤104,包括:将维向量为零的预设图像作为所述初始图像。其中,该预设图像可以存储在存储器中,在对待迁移图像进行处理时,进行调用。Optionally, step 104 includes: taking a preset image whose dimension vector is zero as the initial image. Wherein, the preset image can be stored in the memory, and is called when the image to be migrated is processed.
在本公开实施例中,也可以给初始图像对应的维向量赋值为零,进而进行后续的计算。In this embodiment of the present disclosure, the dimensional vector corresponding to the initial image may also be assigned a value of zero, and then subsequent calculations are performed.
步骤105,根据所述姿态迁移矩阵、所述待迁移图像以及所述初始图像,确定目标合成图像。Step 105: Determine a target composite image according to the pose transfer matrix, the image to be transferred, and the initial image.
在本公开实施例中,步骤105,包括:根据预设方式,以及所述姿态迁移矩阵、所述待迁移图像和所述初始图像,得到中间合成图像;将所述中间合成图像作为新的所述初始图像,循环执行预设次数的所述根据预设方式,以及所述姿态迁移矩阵、所述待迁移图像和所述初始图像,得到中间合成图像的步骤。In this embodiment of the present disclosure, step 105 includes: obtaining an intermediate composite image according to a preset method, the pose transfer matrix, the image to be migrated, and the initial image; and using the intermediate composite image as a new The initial image is performed cyclically for a preset number of times according to the preset method, as well as the posture transition matrix, the image to be migrated and the initial image to obtain an intermediate composite image.
在本公开实施例中,设定F(Z,Px,Py)代表,待迁移图像中的目标对象由目标对象的姿态迁移成参考对象的姿态下的目标合成图像的维向量。则要求min∑‖F[z,P x,P y]-x‖趋近于0时,待迁移图像的细节均在目标合成图像中展现。其中,x为待迁移图像的维向量。以下对min∑‖F[z,P x,P y]-x‖在趋近0的时候求解步骤; In the embodiment of the present disclosure, F(Z, Px, Py) is set to represent the target object in the image to be migrated from the gesture of the target object to the dimensional vector of the target composite image in the gesture of the reference object. Then it is required that when min∑‖F[z, P x , P y ]-x‖ approaches 0, the details of the image to be migrated are displayed in the target composite image. Among them, x is the dimension vector of the image to be migrated. The following solution steps for min∑‖F[z, P x , P y ]-x‖ approaching 0;
1)对min∑‖F[z,P x,P y]-x‖进行优化得到
Figure PCTCN2022070336-appb-000002
Figure PCTCN2022070336-appb-000003
1) Optimize min∑‖F[z, P x , P y ]-x‖ to get
Figure PCTCN2022070336-appb-000002
Figure PCTCN2022070336-appb-000003
2)令A=(W[P x,P y]) TW[P x,P y],b=(W[P x,P y]) Tx;则对
Figure PCTCN2022070336-appb-000004
进行反问题模型化求解方程组AZ=b;
2) Let A=(W[P x ,P y ]) T W[P x ,P y ], b=(W[P x ,P y ]) T x ; then
Figure PCTCN2022070336-appb-000004
Carry out inverse problem modeling to solve equation system AZ=b;
3)设定求解精度e=0.0000001,则,r 0=b-AZ 0;p 0=r 0;如果r 0大于e,
Figure PCTCN2022070336-appb-000005
Figure PCTCN2022070336-appb-000006
r k=r k-1k-1Apk-1;p k=r kk-1p k-1
Figure PCTCN2022070336-appb-000007
其中,令A=(W) TW;P 0=r 0;r 0=b-AZ 0;b=W Tx;
3) Set the solution precision e=0.0000001, then, r 0 =b-AZ 0 ; p 0 =r 0 ; if r 0 is greater than e,
Figure PCTCN2022070336-appb-000005
Figure PCTCN2022070336-appb-000006
r k =r k-1k-1 Apk-1; p k =r kk-1 p k-1 ;
Figure PCTCN2022070336-appb-000007
Wherein, let A=(W) T W; P 0 =r 0 ; r 0 =b-AZ 0 ; b=W T x;
4)对上述公式进行整理得到,Z k+1=f(b,A,Z k),可见,目标合成图像Z k+1是依赖于姿态迁移矩阵W,待迁移图像x,和初始图像Z k的。 4) After sorting out the above formula, Z k+1 = f(b, A, Z k ), it can be seen that the target composite image Z k+1 depends on the pose transfer matrix W, the image to be transferred x, and the initial image Z k 's.
在本公开实施例中,根据上述步骤1)-步骤4)可知,本公开实施例中所述预设方式具体如下:In the embodiment of the present disclosure, according to the above steps 1)-step 4), the preset mode described in the embodiment of the present disclosure is specifically as follows:
Z k+1=Z kkP kZ k+1 =Z kk P k ;
其中,Z k+1为所述中间合成图像的第一维向量;Z k为所述初始图像中的第二维向量;其中W为所述姿态迁移矩阵;
Figure PCTCN2022070336-appb-000008
r k=r k-1k-1Apk-1;p k=r kk-1p k-1
Figure PCTCN2022070336-appb-000009
其中,令A=(W) TW;P 0=r 0;r 0=b-AZ 0;b=W Tx;其中,x为所述待迁移图像的第三维向量,Z 0为初始网络模型得到最初的所述初始图像的第四维向量。
Wherein, Z k+1 is the first dimension vector of the intermediate composite image; Z k is the second dimension vector in the initial image; wherein W is the attitude transition matrix;
Figure PCTCN2022070336-appb-000008
r k =r k-1k-1 Apk-1; p k =r kk-1 p k-1 ;
Figure PCTCN2022070336-appb-000009
Wherein, let A=(W) T W; P 0 =r 0 ; r 0 =b-AZ 0 ; b=W T x; where x is the third-dimensional vector of the image to be migrated, and Z 0 is the initial network The model obtains the initial fourth dimension vector of the initial image.
在本公开实施例中,预设方式是指采用上述公式Z k+1=Z kkP k得到目标合成图像。 In the embodiment of the present disclosure, the preset mode refers to obtaining the target composite image by adopting the above formula Z k+1 =Z kk P k .
在本公开实施例中,姿态迁移矩阵W即为上述W[P x,P y]。 In the embodiment of the present disclosure, the attitude transition matrix W is the above W[P x , P y ].
具体的,举例说明,将上述获取的初始图像作为Z 0;则第一次将姿态迁移矩阵W、待迁移图像x和初始图像Z 0输入上述预设方式对应的公式,得到:Z 1=Z 00P 0
Figure PCTCN2022070336-appb-000010
其中,r 0=b-Az 0;P 0=r 0;则α 0=1/A=1/(W) TW;P 0=b-Az 0=W Tx-(W) TW·Z 0;则Z 1=Z 0+(1/(W) TW)·(W Tx-(W) TW·Z 0)=x/W-Z 0。最终得到Z 1=x/W-Z 0,其中,Z 1是第一次将姿态迁移矩阵W、待迁 移图像x和初始图像Z 0输入上述预设方式对应的公式,得到中间合成图像的维向量;Z 0为获取到的初始图像的维向量。
Specifically, for example, the initial image obtained above is taken as Z 0 ; then, for the first time, the attitude transition matrix W, the image to be migrated x and the initial image Z 0 are input into the formula corresponding to the above-mentioned preset mode, to obtain: Z 1 =Z 00 P 0 ;
Figure PCTCN2022070336-appb-000010
Wherein, r 0 =b-Az 0 ; P 0 =r 0 ; then α 0 =1/A=1/(W) T W; P 0 =b-Az 0 =W T x-(W) T W· Z 0 ; then Z 1 =Z 0 +(1/(W) T W)·(W T x-(W) T W·Z 0 )=x/WZ 0 . Finally, Z 1 =x/WZ 0 is obtained, wherein Z 1 is the first time that the attitude transition matrix W, the image to be migrated x and the initial image Z 0 are input into the formula corresponding to the above preset method, and the dimensional vector of the intermediate composite image is obtained; Z 0 is the dimensional vector of the obtained initial image.
将上述得到的中间合成图像Z 1作为新的初始图像,第二次将移矩阵W、待迁移图像x和新的初始图像Z 1输入上述预设方式对应的公式,得到:Z 2=Z 11P 1;其中,
Figure PCTCN2022070336-appb-000011
r 1=r 00Ap 0;p 1=r 10p 0
Figure PCTCN2022070336-appb-000012
其中,令A=(W) TW;P 0=r 0;r 0=b-AZ 0;b=W Tx;得到Z 2
Taking the intermediate composite image Z 1 obtained above as a new initial image, input the shift matrix W, the image to be migrated x and the new initial image Z 1 into the formula corresponding to the above preset mode for the second time, and obtain: Z 2 =Z 11 P 1 ; where,
Figure PCTCN2022070336-appb-000011
r 1 =r 00 Ap 0 ; p 1 =r 10 p 0 ;
Figure PCTCN2022070336-appb-000012
Wherein, let A=(W) T W; P 0 =r 0 ; r 0 =b-AZ 0 ; b=W T x; Z 2 is obtained.
在本公开实施例中,上述图像的维向量可以是一维向量、二维向量或者三维向量,在此不加以限定。In the embodiment of the present disclosure, the dimensional vector of the above-mentioned image may be a one-dimensional vector, a two-dimensional vector, or a three-dimensional vector, which is not limited herein.
在本公开实施例中,预设次数大于等于2;当预设次数为2次时,则最终目标合成图像为Z 2。其中,当最终得到的目标合成图像Z 2细节不够清晰时,可以循环执行将所述中间合成图像作为新的所述初始图像,循环执行预设次数的所述根据预设方式,以及所述姿态迁移矩阵、所述待迁移图像和所述初始图像,得到中间合成图像,直到得到所述目标合成图像是用户满意的图像为止。 In the embodiment of the present disclosure, the preset number of times is greater than or equal to 2; when the preset number of times is 2, the final target composite image is Z 2 . Wherein, when the final obtained target composite image Z 2 is not clear enough in detail, the intermediate composite image as the new initial image can be cyclically executed, and the posture according to the preset method and the posture can be executed cyclically for a preset number of times. The transition matrix, the image to be migrated, and the initial image are used to obtain an intermediate composite image, until the target composite image is obtained to be a satisfactory image for the user.
在本公开实施例中,包括:多帧参考图像;所述参考图像包括时间序列;则步骤105之后,还包括:将多帧所述目标合成图像按照所述时间序列进行排列,得到目标合成视频。In the embodiment of the present disclosure, it includes: multiple frames of reference images; the reference images include a time sequence; after step 105, the method further includes: arranging the multiple frames of the target composite images according to the time sequence to obtain a target composite video .
在本公开实施例中,还包括,将目标合成图像输入补全模型,得到最终合成图像。其中,补全模型用于将目标合成图像中缺失的部分进行补全。例如,当用户输入的待迁移图像为缺少人脸或者缺少部分肢体的图像,补全模型将这些缺失的部分进行补全。In the embodiment of the present disclosure, the method further includes: inputting the target composite image into the completion model to obtain the final composite image. Among them, the completion model is used to complete the missing parts in the target synthetic image. For example, when the image to be migrated input by the user is an image lacking a face or a part of limbs, the completion model completes these missing parts.
具体的,补全模型可根据大量图像作为训练样本训练得到;例如,将背面照(无脸照),无腿照以及无胳膊照和对应的身体完整的照片作为训练样本,训练补全模型。Specifically, the completion model can be trained based on a large number of images as training samples; for example, a back photo (without face photo), a legless photo, an armless photo and a corresponding full body photo are used as training samples to train the completion model.
其中,多帧具有时间序列的参考图像组成参考视频;用户可以点击上传参考视频,参考视频包括:多帧参考图像,多帧参考图像具有对应的时间序 列;服务器或者电子设备将待迁移图像和参考视频中的每帧图像依次执行步骤101-105,最终得到多帧目标合成图像,多帧目标合成图像按照时间序列进行排列,即可得到最终的目标合成视频。Among them, multiple frames of reference images with a time sequence form a reference video; the user can click to upload the reference video, the reference video includes: multiple frames of reference images, and the multiple frames of reference images have a corresponding time sequence; Steps 101 to 105 are performed in sequence for each frame of image in the video, and finally a multi-frame target composite image is obtained, and the multi-frame target composite image is arranged in a time series to obtain the final target composite video.
在本公开实施例中,还包括:识别参考视频中的各帧图像,选取包括人体对象的图像作为参考图像;将不包含人体对象的图像作为过渡图像;则最终将多帧目标合成图像和过渡图像按照时间序列排列,得到目标合成视频。In the embodiment of the present disclosure, the method further includes: identifying each frame of images in the reference video, selecting an image including a human object as a reference image; taking an image not including a human object as a transition image; then finally synthesizing the multi-frame target image and the transition image The images are arranged in time series to obtain the target composite video.
其中,参考视频中包括跳舞动作或者其他动作,在此不加以限定。Wherein, the reference video includes dancing actions or other actions, which are not limited herein.
在本公开实施例中,所述步骤105,包括:提取所述待迁移图像中的目标对象;根据所述姿态迁移矩阵、所述目标对象以及所述初始图像,确定合成对象;将所述参考图像的背景与所述合成对象进行合成,得到所述目标合成图像。In this embodiment of the present disclosure, the step 105 includes: extracting the target object in the image to be migrated; determining a composite object according to the pose transfer matrix, the target object and the initial image; The background of the image is combined with the composite object to obtain the target composite image.
在本公开实施例中,只将待迁移图像中的目标对象进行迁移,而将待迁移图像中除目标对象的背景并不进行迁移,将待迁移图像中的目标对象进行迁移后,得到的人体对象为合成对象,然后将参考图像的背景与合成对象进行合成,参照图3,待迁移图像A对应的目标对象迁移后,采用了参考图像B的背景,得到所述目标合成图像C。In the embodiment of the present disclosure, only the target object in the image to be migrated is migrated, and the background of the target object in the image to be migrated is not migrated. After migrating the target object in the image to be migrated, the obtained human body The object is a composite object, and then the background of the reference image and the composite object are composited. Referring to FIG. 3 , after the target object corresponding to the migration image A is migrated, the background of the reference image B is used to obtain the target composite image C.
在本公开实施例中,参照图2,也可以对整个待迁移图像A进行迁移,得到目标合成图像C。In the embodiment of the present disclosure, referring to FIG. 2 , the entire image A to be migrated may also be migrated to obtain the target composite image C.
在本公开实施例中,通过获取待迁移图像和参考图像;所述待迁移图像中包括:待转换姿态的目标对象;所述参考图像中包括:呈现参考姿态的参考对象;获取所述目标对象的第一关键特征以及所述参考对象的第二关键特征;根据所述第一关键特征和所述第二关键特征,确定姿态迁移矩阵;获取初始图像;根据所述姿态迁移矩阵、所述待迁移图像以及所述初始图像,确定目标合成图像。在本公开实施例中,并不需要获取大量的训练样本训练模型来得到目标合成图像,降低了图像迁移的繁琐程度,并且获取初始图像,采用姿态迁移矩阵、待迁移图像和初始图像以对整个待迁移图像进行迁移, 因此,能够保证待迁移图像的细节均在目标合成图像中展示,防止细节的遗漏。In the embodiment of the present disclosure, by acquiring an image to be migrated and a reference image; the image to be migrated includes: a target object whose posture is to be converted; the reference image includes: a reference object showing a reference posture; the target object is acquired The first key feature of the reference object and the second key feature of the reference object; determine a posture transfer matrix according to the first key feature and the second key feature; obtain an initial image; The migration image and the initial image determine a target composite image. In the embodiment of the present disclosure, it is not necessary to acquire a large number of training samples to train the model to obtain the target composite image, which reduces the tediousness of image migration, and to acquire the initial image, the pose transfer matrix, the image to be migrated and the initial image are used to analyze the entire image. The image to be migrated is migrated, so it can be ensured that the details of the image to be migrated are displayed in the target composite image, preventing the omission of details.
图4是本公开实施例提供的一种图像处理装置的框图,如图所示,该装置可以包括:FIG. 4 is a block diagram of an image processing apparatus provided by an embodiment of the present disclosure. As shown in the figure, the apparatus may include:
第一获取模块,用于获取待迁移图像和参考图像;所述待迁移图像中包括:待转换姿态的目标对象;所述参考图像中包括:呈现参考姿态的参考对象;a first acquisition module, configured to acquire an image to be migrated and a reference image; the image to be migrated includes: a target object whose posture is to be converted; the reference image includes: a reference object that presents a reference posture;
第二获取模块,用于获取所述目标对象的第一关键特征以及所述参考对象的第二关键特征;a second acquisition module, configured to acquire the first key feature of the target object and the second key feature of the reference object;
第一确定模块,用于根据所述第一关键特征和所述第二关键特征,确定姿态迁移矩阵;a first determination module, configured to determine a posture transition matrix according to the first key feature and the second key feature;
第三获取模块,用于获取初始图像;The third acquisition module is used to acquire the initial image;
第二确定模块,用于根据所述姿态迁移矩阵、所述待迁移图像以及所述初始图像,确定目标合成图像。The second determining module is configured to determine a target composite image according to the posture transfer matrix, the image to be transferred and the initial image.
本公开实施例提供的图像处理装置具备执行图像处理方法相应的功能模块,可执行本公开实施例所提供的图像处理方法,且能达到相同的有益效果。The image processing apparatus provided by the embodiment of the present disclosure has functional modules corresponding to executing the image processing method, can execute the image processing method provided by the embodiment of the present disclosure, and can achieve the same beneficial effects.
在本公开提供的又一实施例中,还提供了一种电子设备,电子设备可以包括:处理器、存储器以及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行所述程序时实现上述图像处理方法实施例的各个过程,且能达到相同的技术效果,为避免重复,这里不再赘述。示例的,如图5所示,该电子设备具体可以包括:处理器301、存储装置302、具有触摸功能的显示屏303、输入装置304、输出装置305以及通信装置306。该电子设备中处理器301的数量可以是一个或者多个,图5中以一个处理器301为例。该电子设备的处理器301、存储装置302、显示屏303、输入装置304、输出装置305以及通信装置306可以通过总线或者其他方式连接。In yet another embodiment provided by the present disclosure, an electronic device is also provided. The electronic device may include: a processor, a memory, and a computer program stored on the memory and executable on the processor, the When the processor executes the program, each process of the above image processing method embodiment is implemented, and the same technical effect can be achieved. To avoid repetition, details are not repeated here. For example, as shown in FIG. 5 , the electronic device may specifically include: a processor 301 , a storage device 302 , a display screen 303 with a touch function, an input device 304 , an output device 305 , and a communication device 306 . The number of processors 301 in the electronic device may be one or more, and one processor 301 is taken as an example in FIG. 5 . The processor 301 , the storage device 302 , the display screen 303 , the input device 304 , the output device 305 and the communication device 306 of the electronic device may be connected by a bus or in other ways.
在本公开提供的又一实施例中,还提供了一种计算机可读存储介质,该 计算机可读存储介质中存储有指令,当其在计算机上运行时,使得计算机执行上述实施例中任一所述的图像处理方法。In yet another embodiment provided by the present disclosure, a computer-readable storage medium is also provided, where instructions are stored in the computer-readable storage medium, when the computer-readable storage medium runs on a computer, the computer causes the computer to execute any one of the foregoing embodiments. the image processing method.
在本公开提供的又一实施例中,还提供了一种包含指令的计算机程序产品,当其在计算机上运行时,使得计算机执行上述实施例中任一所述的图像处理方法。In yet another embodiment provided by the present disclosure, there is also provided a computer program product containing instructions, which when run on a computer, causes the computer to execute the image processing method described in any one of the foregoing embodiments.
需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。It should be noted that, in this document, relational terms such as first and second are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply any relationship between these entities or operations. any such actual relationship or sequence exists. Moreover, the terms "comprising", "comprising" or any other variation thereof are intended to encompass a non-exclusive inclusion such that a process, method, article or device that includes a list of elements includes not only those elements, but also includes not explicitly listed or other elements inherent to such a process, method, article or apparatus. Without further limitation, an element qualified by the phrase "comprising a..." does not preclude the presence of additional identical elements in a process, method, article or apparatus that includes the element.
本说明书中的各个实施例均采用相关的方式描述,各个实施例之间相同相似的部分互相参见即可,每个实施例重点说明的都是与其他实施例的不同之处。尤其,对于系统实施例而言,由于其基本相似于方法实施例,所以描述的比较简单,相关之处参见方法实施例的部分说明即可。Each embodiment in this specification is described in a related manner, and the same and similar parts between the various embodiments may be referred to each other, and each embodiment focuses on the differences from other embodiments. In particular, as for the system embodiments, since they are basically similar to the method embodiments, the description is relatively simple, and for related parts, please refer to the partial descriptions of the method embodiments.
以上所述仅为本公开的较佳实施例而已,并非用于限定本公开的保护范围。凡在本公开的精神和原则之内所作的任何修改、等同替换、改进等,均包含在本公开的保护范围内。The above descriptions are only preferred embodiments of the present disclosure, and are not intended to limit the protection scope of the present disclosure. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present disclosure are included in the protection scope of the present disclosure.

Claims (13)

  1. 一种图像处理方法,其特征在于,所述方法包括:An image processing method, characterized in that the method comprises:
    获取待迁移图像和参考图像;所述待迁移图像中包括:待转换姿态的目标对象;所述参考图像中包括:呈现参考姿态的参考对象;Acquiring an image to be migrated and a reference image; the image to be migrated includes: a target object whose posture is to be converted; the reference image includes: a reference object showing a reference posture;
    获取所述目标对象的第一关键特征以及所述参考对象的第二关键特征;acquiring the first key feature of the target object and the second key feature of the reference object;
    根据所述第一关键特征和所述第二关键特征,确定姿态迁移矩阵;Determine a posture transition matrix according to the first key feature and the second key feature;
    获取初始图像;get the initial image;
    根据所述姿态迁移矩阵、所述待迁移图像以及所述初始图像,确定目标合成图像。A target composite image is determined according to the pose transfer matrix, the image to be transferred, and the initial image.
  2. 根据权利要求1所述的方法,其特征在于,所述获取初始图像,包括:The method according to claim 1, wherein the acquiring an initial image comprises:
    将所述姿态迁移矩阵和所述待迁移图像输入初始网络模型,得到初始图像。Inputting the pose transfer matrix and the image to be transferred into an initial network model to obtain an initial image.
  3. 根据权利要求1所述的方法,其特征在于,所述获取初始图像,包括:The method according to claim 1, wherein the acquiring an initial image comprises:
    将维向量为零的预设图像作为所述初始图像。A preset image whose dimension vector is zero is used as the initial image.
  4. 根据权利要求1所述的方法,其特征在于,所述根据所述姿态迁移矩阵、所述待迁移图像以及所述初始图像,确定目标合成图像,包括:The method according to claim 1, wherein the determining the target composite image according to the pose transfer matrix, the image to be transferred and the initial image comprises:
    根据预设方式,以及所述姿态迁移矩阵、所述待迁移图像和所述初始图像,得到中间合成图像;Obtain an intermediate composite image according to a preset manner, and the pose transfer matrix, the image to be transferred, and the initial image;
    将所述中间合成图像作为新的所述初始图像,循环执行预设次数的所述根据预设方式,以及所述姿态迁移矩阵、所述待迁移图像和所述初始图像,得到中间合成图像的步骤,得到所述目标合成图像。Taking the intermediate composite image as the new initial image, and cyclically executing the preset method for a preset number of times, as well as the attitude transition matrix, the image to be migrated, and the initial image, to obtain the intermediate composite image. step to obtain the target composite image.
  5. 根据权利要求4所述的方法,其特征在于,所述预设方式如下:The method according to claim 4, wherein the preset mode is as follows:
    Z k+1=Z kkP kZ k+1 =Z kk P k ;
    其中,Z k+1为所述中间合成图像的第一维向量;Z k为所述初始图像中的第二维向量;其中W为所述姿态迁移矩阵;
    Figure PCTCN2022070336-appb-100001
    r k=r k-1k-1Apk-1;p k= r kk-1p k-1
    Figure PCTCN2022070336-appb-100002
    其中,令A=(W) TW;P 0=r 0;r 0=b-AZ 0;b=W Tx;其中,x为所述待迁移图像的第三维向量,Z 0为初始网络模型得到最初的所述初始图像的第四维向量。
    Wherein, Z k+1 is the first dimension vector of the intermediate composite image; Z k is the second dimension vector in the initial image; wherein W is the attitude transition matrix;
    Figure PCTCN2022070336-appb-100001
    r k =r k-1k-1 Apk-1; p k = r kk-1 p k-1 ;
    Figure PCTCN2022070336-appb-100002
    Wherein, let A=(W) T W; P 0 =r 0 ; r 0 =b-AZ 0 ; b=W T x; where x is the third-dimensional vector of the image to be migrated, and Z 0 is the initial network The model obtains the initial fourth dimension vector of the initial image.
  6. 根据权利要求1所述的方法,其特征在于,所述第一关键特征为所述目标对象中的预设关键特征;所述第二关键特征与所述第一关键特征一一对应。The method according to claim 1, wherein the first key feature is a preset key feature in the target object; the second key feature is in one-to-one correspondence with the first key feature.
  7. 根据权利要求6所述的方法,其特征在于,所述根据所述第一关键特征和所述第二关键特征,确定姿态迁移矩阵,包括:The method according to claim 6, wherein the determining a posture transition matrix according to the first key feature and the second key feature comprises:
    确定各个所述第一关键特征的坐标值,以及各个所述第二关键特征的坐标值;Determine the coordinate value of each of the first key features, and the coordinate value of each of the second key features;
    根据所述第一关键特征的坐标值和所述第二关键特征的坐标值,确定所述姿态迁移矩阵,所述姿态迁移矩阵用于将所述第一关键特征的坐标值,转换为与所述第一关键特征对应的第二关键特征的坐标值。The attitude transfer matrix is determined according to the coordinate value of the first key feature and the coordinate value of the second key feature, and the attitude transfer matrix is used to convert the coordinate value of the first key feature into a The coordinate value of the second key feature corresponding to the first key feature.
  8. 根据权利要求7所述的方法,其特征在于,包括:多帧参考图像;所述参考图像包括时间序列;The method according to claim 7, characterized in that, comprising: multiple frames of reference images; the reference images comprise time series;
    则所述根据所述姿态迁移矩阵、所述待迁移图像和所述初始图像,确定所述目标合成图像之后,还包括:Then, after determining the target composite image according to the posture transfer matrix, the image to be transferred and the initial image, the method further includes:
    将多帧所述目标合成图像按照所述时间序列进行排列,得到目标合成视频。Arrange multiple frames of the target composite images according to the time sequence to obtain a target composite video.
  9. 根据权利要求1所述的方法,其特征在于,所述根据所述姿态迁移矩阵、所述待迁移图像以及所述初始图像,确定目标合成图像,包括:The method according to claim 1, wherein the determining the target composite image according to the pose transfer matrix, the image to be transferred and the initial image comprises:
    提取所述待迁移图像中的目标对象;extracting the target object in the image to be migrated;
    根据所述姿态迁移矩阵、所述目标对象以及所述初始图像,确定合成对象;determining a synthetic object according to the pose transfer matrix, the target object and the initial image;
    将所述参考图像的背景与所述合成对象进行合成,得到所述目标合成图像。Synthesize the background of the reference image and the synthetic object to obtain the target synthetic image.
  10. 一种图像处理装置,其特征在于,所述装置包括:An image processing device, characterized in that the device comprises:
    第一获取模块,用于获取待迁移图像和参考图像;所述待迁移图像中包括:待转换姿态的目标对象;所述参考图像中包括:呈现参考姿态的参考对象;a first acquisition module, configured to acquire an image to be migrated and a reference image; the image to be migrated includes: a target object whose posture is to be converted; the reference image includes: a reference object that presents a reference posture;
    第二获取模块,用于获取所述目标对象的第一关键特征以及所述参考对象的第二关键特征;a second acquisition module, configured to acquire the first key feature of the target object and the second key feature of the reference object;
    第一确定模块,用于根据所述第一关键特征和所述第二关键特征,确定姿态迁移矩阵;a first determining module, configured to determine a posture transition matrix according to the first key feature and the second key feature;
    第三获取模块,用于获取初始图像;The third acquisition module is used to acquire the initial image;
    第二确定模块,用于根据所述姿态迁移矩阵、所述待迁移图像以及所述初始图像,确定目标合成图像。The second determining module is configured to determine a target composite image according to the posture transfer matrix, the image to be transferred and the initial image.
  11. 一种电子设备,其特征在于,所述电子设备包括处理器、存储器及存储在所述存储器上并可在所述处理器上运行的程序或指令,所述程序或指令被所述处理器执行时实现如权利要求1-9任意一项所述的方法的步骤。An electronic device, characterized in that the electronic device includes a processor, a memory, and a program or instruction stored on the memory and executable on the processor, and the program or instruction is executed by the processor while implementing the steps of the method according to any one of claims 1-9.
  12. 一种可读存储介质,其特征在于,所述可读存储介质上存储程序或指令,所述程序或指令被处理器执行时实现如权利要求1-9任意一项所述的方法的步骤。A readable storage medium, characterized in that a program or an instruction is stored on the readable storage medium, and when the program or instruction is executed by a processor, the steps of the method according to any one of claims 1-9 are implemented.
  13. 一种计算机程序产品,包括计算机可读代码,当所述计算机可读代码在计算处理设备上运行时,导致所述计算处理设备执行根据权利要求1-9任意一项所述的方法的步骤。A computer program product comprising computer readable code which, when run on a computing processing device, causes the computing processing device to perform the steps of the method according to any one of claims 1-9.
PCT/CN2022/070336 2021-01-05 2022-01-05 Image processing method and apparatus, electronic device, and readable storage medium WO2022148379A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110009523.2 2021-01-05
CN202110009523.2A CN112734632B (en) 2021-01-05 2021-01-05 Image processing method, device, electronic equipment and readable storage medium

Publications (1)

Publication Number Publication Date
WO2022148379A1 true WO2022148379A1 (en) 2022-07-14

Family

ID=75591231

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/070336 WO2022148379A1 (en) 2021-01-05 2022-01-05 Image processing method and apparatus, electronic device, and readable storage medium

Country Status (2)

Country Link
CN (1) CN112734632B (en)
WO (1) WO2022148379A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115423752A (en) * 2022-08-03 2022-12-02 荣耀终端有限公司 Image processing method, electronic device and readable storage medium

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112734632B (en) * 2021-01-05 2024-02-27 百果园技术(新加坡)有限公司 Image processing method, device, electronic equipment and readable storage medium
CN115713582B (en) * 2022-12-02 2023-10-27 北京百度网讯科技有限公司 Avatar generation method, device, electronic equipment and medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111027438A (en) * 2019-12-03 2020-04-17 Oppo广东移动通信有限公司 Human body posture migration method, mobile terminal and computer storage medium
US20200218883A1 (en) * 2017-12-25 2020-07-09 Beijing Sensetime Technology Development Co., Ltd. Face pose analysis method, electronic device, and storage medium
CN111626218A (en) * 2020-05-28 2020-09-04 腾讯科技(深圳)有限公司 Image generation method, device and equipment based on artificial intelligence and storage medium
CN111652798A (en) * 2020-05-26 2020-09-11 浙江大华技术股份有限公司 Human face pose migration method and computer storage medium
CN112734632A (en) * 2021-01-05 2021-04-30 百果园技术(新加坡)有限公司 Image processing method, image processing device, electronic equipment and readable storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109670474B (en) * 2018-12-28 2023-07-25 广东工业大学 Human body posture estimation method, device and equipment based on video
US11030772B2 (en) * 2019-06-03 2021-06-08 Microsoft Technology Licensing, Llc Pose synthesis
CN110705625A (en) * 2019-09-26 2020-01-17 北京奇艺世纪科技有限公司 Image processing method and device, electronic equipment and storage medium
CN112115783B (en) * 2020-08-12 2023-11-14 中国科学院大学 Depth knowledge migration-based face feature point detection method, device and equipment

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200218883A1 (en) * 2017-12-25 2020-07-09 Beijing Sensetime Technology Development Co., Ltd. Face pose analysis method, electronic device, and storage medium
CN111027438A (en) * 2019-12-03 2020-04-17 Oppo广东移动通信有限公司 Human body posture migration method, mobile terminal and computer storage medium
CN111652798A (en) * 2020-05-26 2020-09-11 浙江大华技术股份有限公司 Human face pose migration method and computer storage medium
CN111626218A (en) * 2020-05-28 2020-09-04 腾讯科技(深圳)有限公司 Image generation method, device and equipment based on artificial intelligence and storage medium
CN112734632A (en) * 2021-01-05 2021-04-30 百果园技术(新加坡)有限公司 Image processing method, image processing device, electronic equipment and readable storage medium

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115423752A (en) * 2022-08-03 2022-12-02 荣耀终端有限公司 Image processing method, electronic device and readable storage medium

Also Published As

Publication number Publication date
CN112734632B (en) 2024-02-27
CN112734632A (en) 2021-04-30

Similar Documents

Publication Publication Date Title
WO2022148379A1 (en) Image processing method and apparatus, electronic device, and readable storage medium
US11741629B2 (en) Controlling display of model derived from captured image
WO2021093453A1 (en) Method for generating 3d expression base, voice interactive method, apparatus and medium
JP7015152B2 (en) Processing equipment, methods and programs related to key point data
WO2021169806A1 (en) Image processing method and apparatus, computer device, and storage medium
CN111062326B (en) Self-supervision human body 3D gesture estimation network training method based on geometric driving
CN109410119A (en) Mask image distortion method and its system
WO2020147796A1 (en) Image processing method and apparatus, image device, and storage medium
CN114529639A (en) Method, device, equipment and storage medium for generating virtual image animation
CN108305321A (en) A kind of three-dimensional human hand 3D skeleton patterns real-time reconstruction method and apparatus based on binocular color imaging system
JP2017037424A (en) Learning device, recognition device, learning program and recognition program
WO2020147791A1 (en) Image processing method and device, image apparatus, and storage medium
CN113239892A (en) Monocular human body three-dimensional attitude estimation method based on data enhancement architecture
WO2022174574A1 (en) Sensor-based bare-hand data annotation method and system
Leng et al. Stable hand pose estimation under tremor via graph neural network
Lan et al. Data fusion-based real-time hand gesture recognition with Kinect V2
Shi et al. Fbi-pose: Towards bridging the gap between 2d images and 3d human poses using forward-or-backward information
Liang et al. Hand pose estimation by combining fingertip tracking and articulated ICP
WO2020147794A1 (en) Image processing method and apparatus, image device and storage medium
KR20220159988A (en) Facial mesh deformation with fine wrinkles
Jiang et al. Aviation medical simulation training based on interactive technology
US20200334904A1 (en) Articulated model registration apparatus and method
Taylor et al. Vrprop-net: Real-time interaction with virtual props
CN114973396B (en) Image processing method, image processing device, terminal equipment and computer readable storage medium
US11954943B2 (en) Method for generating synthetic data

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22736541

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 22736541

Country of ref document: EP

Kind code of ref document: A1