WO2021190321A1 - Image processing method and device - Google Patents

Image processing method and device Download PDF

Info

Publication number
WO2021190321A1
WO2021190321A1 PCT/CN2021/080280 CN2021080280W WO2021190321A1 WO 2021190321 A1 WO2021190321 A1 WO 2021190321A1 CN 2021080280 W CN2021080280 W CN 2021080280W WO 2021190321 A1 WO2021190321 A1 WO 2021190321A1
Authority
WO
WIPO (PCT)
Prior art keywords
human body
model
value
dimensional joint
result
Prior art date
Application number
PCT/CN2021/080280
Other languages
French (fr)
Chinese (zh)
Inventor
甄海洋
周维
Original Assignee
虹软科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 虹软科技股份有限公司 filed Critical 虹软科技股份有限公司
Priority to KR1020227037422A priority Critical patent/KR20220160066A/en
Priority to JP2022558577A priority patent/JP7448679B2/en
Publication of WO2021190321A1 publication Critical patent/WO2021190321A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • G06T13/203D [Three Dimensional] animation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00Three dimensional [3D] modelling, e.g. data description of 3D objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning

Definitions

  • two-dimensional joint points in the same state will correspond to different three-dimensional joint points before and after, and the recognition accuracy of three-dimensional joint points depends on two The recognition accuracy of the three-dimensional joint points results in low recognition accuracy of the three-dimensional joint points.
  • the first model adopts an hourglass network structure or a feature map pyramid FPN network structure.
  • Fig. 5 is a flowchart of an optional image processing method according to an embodiment of the present application.
  • the first model may be used to process the human body image to obtain the parameter values of the SMPL model; and then the two-dimensional joint points or the three-dimensional joint points may be obtained based on the parameter values.
  • the human body is detected on the original image to obtain the human body image, and then the human body image is processed by the trained first model to obtain the processing result of the human body image, thereby simultaneously achieving The purpose of human body detection, two-dimensional and three-dimensional joint point positioning and SMPL model establishment, and can further generate a human body model.
  • the purpose of slimming the human body can be achieved.
  • Processing to achieve image processing effects such as thin arms, thin legs, and thin waists.
  • the SMPL model discriminator processes the parameter values of the third result (that is, the SMPL model output by the preset model) to obtain a classification result of the parameter values of the third result, where the classification result is used to characterize the third result Whether the parameter value is the real value collected by the collecting device; based on the classification result and the target loss value, it is determined whether to stop training the preset model.
  • the D discriminator in the Generative Adversarial Network (GAN) can be used as the SMPL model discriminator.
  • the aforementioned transmission device is used to receive or send data via a network.
  • the above-mentioned specific examples of the network may include a wired network and a wireless network.
  • the transmission device includes a network adapter (Network Interface Controller, NIC), which can be connected to other network devices and routers via a network cable so as to communicate with the Internet or a local area network.
  • the transmission device is a radio frequency (RF) module, which is used to communicate with the Internet in a wireless manner.
  • RF radio frequency

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computational Linguistics (AREA)
  • Computing Systems (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Computer Graphics (AREA)
  • Geometry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Human Computer Interaction (AREA)
  • Image Analysis (AREA)
  • Processing Or Creating Images (AREA)
  • Image Generation (AREA)

Abstract

An image processing method and device. The method comprises: acquiring an original image (S102); performing human body detection on the original image to obtain a human body image (S104); performing processing on the human body image by using a trained first model to obtain a processing result of the human body image, wherein the processing result comprises: a two-dimensional articulation point, a three-dimensional articulation point, and a skinned multi-person linear (SMPL) model (S106); and generating a human body model according to the processing result of the human body image (S108). The method solves the technical problem of low recognition accuracy for positioning of the two-dimensional and three-dimensional articulation points and reconstruction of a human body parameter.

Description

图像处理方法和装置Image processing method and device
本申请要求于2020年03月27日提交中国专利局、优先权号为202010231605.7、发明名称为“图像处理方法和装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office with priority number 202010231605.7 and invention title "Image Processing Method and Apparatus" on March 27, 2020, the entire content of which is incorporated into this application by reference.
技术领域Technical field
本申请涉及计算机视觉技术领域,具体而言,涉及一种图像处理方法和装置。This application relates to the field of computer vision technology, and specifically to an image processing method and device.
背景技术Background technique
目前行业内相关人体技术包括人体检测、二维和三维关节点定位、分割等等。针对二维和三维关节点定位以及人体参数系数重建等部分,目前可以采用如下方案实现:1)首先对图像用深度学习的方案进行人体检测,检测完成之后把人体区域截取出来,再用深度学习网络去估计二维关节点,然后再用二维关节点去估计三维关节点、人体姿态和形状参数。但是,利用二维关节点去估计三维关节点,会存在动作的歧义性,例如,同一个状态的二维关节点会对应前后不同的三维关节点,并且三维关节点的识别准确度依赖于二维关节点的识别准确度,导致三维关节点的识别准确度较低。2)首先对图像用深度学习的方案进行人体检测,检测完成之后把人体区域截取出来,再用深度学习网络直接对三维关节点进行预测,将三维关节点变成三维体素网格,来推断每个关节的每个体素网格的可能性,从而进行训练和预测。但是,由于三维关节点的样本较难获得,大部分训练样本是实验室环境下采集到的,对户外场景的鲁棒性不高,而且采用体素网格进行预测,计算量大,实时性较低。3)先对人体进行检测,再对检测的图片进行人物分割或者是解析,然后再利用分割和解析的结果,通过优化的方法进行人体模型估计。但是,由于人体分割和解析的要求太高,结果的偏差会影响到人体重建的效果。At present, relevant human body technologies in the industry include human body detection, two-dimensional and three-dimensional joint point positioning, segmentation, and so on. For the two-dimensional and three-dimensional joint point positioning and the reconstruction of human body parameter coefficients, the following schemes can be used at present: 1) First, use the deep learning scheme for human body detection on the image. After the detection is completed, the human body area is cut out, and then deep learning is used. The network estimates the two-dimensional joint points, and then uses the two-dimensional joint points to estimate the three-dimensional joint points, the posture and shape parameters of the human body. However, using two-dimensional joint points to estimate three-dimensional joint points will cause ambiguity in actions. For example, two-dimensional joint points in the same state will correspond to different three-dimensional joint points before and after, and the recognition accuracy of three-dimensional joint points depends on two The recognition accuracy of the three-dimensional joint points results in low recognition accuracy of the three-dimensional joint points. 2) First, perform human body detection on the image with a deep learning scheme. After the detection is completed, the human body area is cut out, and then the deep learning network is used to directly predict the 3D joint points, and the 3D joint points are turned into a 3D voxel grid to infer Possibility of each voxel grid of each joint for training and prediction. However, because the samples of 3D joint points are difficult to obtain, most of the training samples are collected in a laboratory environment, which is not robust to outdoor scenes, and voxel grids are used for prediction, which requires a large amount of calculation and real-time performance. Lower. 3) The human body is detected first, and then the detected pictures are segmented or analyzed, and then the results of the segmentation and analysis are used to estimate the human body model through an optimized method. However, due to the high requirements of human body segmentation and analysis, the deviation of the results will affect the effect of human body reconstruction.
针对上述方案中的问题,目前尚未提出有效的解决方案。In view of the problems in the above-mentioned schemes, no effective solutions have yet been proposed.
发明内容Summary of the invention
本申请至少部分实施例提供了一种图像处理方法和装置,以至少解决相关技术中针对二维和三维关节点定位以及人体参数重建的识别准确度较低的技术问题。At least some embodiments of the present application provide an image processing method and device to at least solve the technical problem of low recognition accuracy for two-dimensional and three-dimensional joint point positioning and human body parameter reconstruction in related technologies.
根据本申请实施例的一个方面,提供了一种图像处理方法,包括:获取原始图像;对原始图像进行人体检测,得到人体图像;利用训练好的第一模型对人体图像进行处 理,得到人体图像的处理结果,其中,处理结果包括:二维关节点,三维关节点和蒙皮多人线性SMPL模型;根据人体图像的处理结果生成人体模型。According to one aspect of the embodiments of the present application, an image processing method is provided, which includes: obtaining an original image; performing human body detection on the original image to obtain a human body image; and using a trained first model to process the human body image to obtain the human body image The processing results include: two-dimensional joint points, three-dimensional joint points and a skinned multi-person linear SMPL model; the human body model is generated according to the processing result of the human body image.
可选地,该方法还包括:获取多组训练样本,其中,每组训练样本包含:人体图像,二维关节点的第一标记信息,三维关节点的第二标记信息,以及SMPL模型的参数值;利用多组训练样本对预设模型进行训练,并获取预设模型的目标损失值;在目标损失值小于预设值的情况下,停止对预设模型进行训练,并确定预设模型为第一模型;在目标损失值大于预设值的情况下,继续利用多组训练样本对预设模型进行训练,直至目标损失值小于预设值。Optionally, the method further includes: obtaining multiple sets of training samples, where each set of training samples includes: a human body image, first label information of two-dimensional joint points, second label information of three-dimensional joint points, and parameters of the SMPL model Value; use multiple sets of training samples to train the preset model and obtain the target loss value of the preset model; if the target loss value is less than the preset value, stop training the preset model and determine that the preset model is The first model; when the target loss value is greater than the preset value, continue to use multiple sets of training samples to train the preset model until the target loss value is less than the preset value.
可选地,利用多组训练样本对预设模型进行训练,并获取预设模型的目标损失值包括:将多组训练样本输入预设模型,并获取预设模型的输出结果,其中,输出结果包括:二维关节点的第一结果,三维关节点的第二结果和SMPL模型的第三结果;基于第一标记信息和第一结果,得到二维关节点的第一损失值;基于第二标记信息和第二结果,得到三维关节点的第二损失值;基于参数值和第三结果,得到SMPL模型的第三损失值;基于第一损失值、第二损失值和第三损失值,得到目标损失值。Optionally, using multiple sets of training samples to train the preset model and obtaining the target loss value of the preset model includes: inputting multiple sets of training samples into the preset model and obtaining the output result of the preset model, where the output result Including: the first result of the two-dimensional joint point, the second result of the three-dimensional joint point and the third result of the SMPL model; based on the first label information and the first result, the first loss value of the two-dimensional joint point is obtained; based on the second The label information and the second result are used to obtain the second loss value of the three-dimensional joint point; based on the parameter value and the third result, the third loss value of the SMPL model is obtained; based on the first loss value, the second loss value, and the third loss value, Get the target loss value.
可选地,SMPL模型的参数值是通过采集装置采集到的真实数据,或,通过对采集装置采集到的参数值进行调整得到的调整数据。Optionally, the parameter values of the SMPL model are real data collected by the collecting device, or adjusted data obtained by adjusting the parameter values collected by the collecting device.
可选地,基于参数值和第三结果,得到SMPL模型的第三损失值包括:在参数值是通过采集装置采集到的真实数值的情况下,基于参数值和第三结果,得到第三损失值;在参数值是通过对采集装置采集到的参数值进行调整得到的调整数值的情况下,基于参数值获得三维关节点,将三维关节点投射到二维平面上获得二维关节点,基于投射的二维关节点和第一标记信息,得到二维关节点的第四损失值,并将第四损失值确定为第三损失值。Optionally, obtaining the third loss value of the SMPL model based on the parameter value and the third result includes: in the case that the parameter value is a real value collected by the collecting device, obtaining the third loss based on the parameter value and the third result Value; in the case that the parameter value is the adjusted value obtained by adjusting the parameter value collected by the acquisition device, the three-dimensional joint point is obtained based on the parameter value, and the three-dimensional joint point is projected onto the two-dimensional plane to obtain the two-dimensional joint point, based on The projected two-dimensional joint points and the first label information are used to obtain the fourth loss value of the two-dimensional joint point, and the fourth loss value is determined as the third loss value.
可选地,该方法还包括:利用判别器对第三结果的参数值进行处理,得到第三结果的参数值的分类结果,其中,分类结果设置为表征第三结果的参数值是否是通过采集装置采集到的真实数值;基于分类结果和目标损失值,确定是否停止对预设模型进行训练。Optionally, the method further includes: using a discriminator to process the parameter value of the third result to obtain a classification result of the parameter value of the third result, wherein the classification result is set to characterize whether the parameter value of the third result is collected The real value collected by the device; based on the classification result and the target loss value, it is determined whether to stop training the preset model.
可选地,利用生成对抗网络对判别器进行训练。Optionally, the discriminator is trained using a generative confrontation network.
可选地,对原始图像进行人体检测,得到人体图像包括:利用训练好的第二模型对原始图像进行处理,得到人体在原始图像中的位置信息;基于位置信息对原始图像进行裁剪和归一化处理,得到人体图像。Optionally, performing human body detection on the original image to obtain the human body image includes: processing the original image by using the trained second model to obtain the position information of the human body in the original image; and cropping and normalizing the original image based on the position information Chemical processing to obtain a human body image.
可选地,第一模型采用沙漏型网络结构或特征图金字塔FPN网络结构。Optionally, the first model adopts an hourglass network structure or a feature map pyramid FPN network structure.
根据本申请实施例的另一方面,还提供了一种图像处理装置,包括:获取模块,设置为获取原始图像;检测模块,设置为对原始图像进行人体检测,得到人体图像;处理模块,设置为利用训练好的第一模型对人体图像进行处理,得到人体图像的处理结果,其中,处理结果包括:二维关节点,三维关节点和蒙皮多人线性SMPL模型;生成模块,设置为根据人体图像的处理结果生成人体模型。According to another aspect of the embodiments of the present application, an image processing device is also provided, including: an acquisition module configured to acquire an original image; a detection module configured to perform human body detection on the original image to obtain a human body image; a processing module configured to In order to use the trained first model to process the human body image to obtain the processing result of the human body image, the processing result includes: two-dimensional joint points, three-dimensional joint points and the skin multi-person linear SMPL model; the generation module is set to be based on The human body image is processed as a result to generate a human body model.
根据本申请实施例的另一方面,还提供了一种存储介质,存储介质包括存储的程序,其中,在程序运行时控制存储介质所在设备执行上述的图像处理方法。According to another aspect of the embodiments of the present application, a storage medium is also provided. The storage medium includes a stored program, wherein the device where the storage medium is located is controlled to execute the above-mentioned image processing method when the program is running.
根据本申请实施例的另一方面,还提供了一种处理器,处理器设置为运行程序,其中,程序运行时执行上述的图像处理方法。According to another aspect of the embodiments of the present application, a processor is also provided, the processor is configured to run a program, wherein the above-mentioned image processing method is executed when the program is running.
在本申请至少部分实施例中,在获取到原始图像之后,首先对原始图像进行人体检测,得到人体图像,然后利用训练好的第一模型对人体图像进行处理,得到人体图像的处理结果,从而同时实现人体检测、二维和三维关节点定位以及SMPL模型建立的目的,并可以进一步生成人体模型。容易注意到的是,由于利用一个模型即可同时得到二维关节点,三维关节点和SMPL模型,无需通过二维关节点估计三维关节点,从而达到了提高图像识别准确度的技术效果,进而解决了相关技术中针对二维和三维关节点定位以及人体参数重建的识别准确度较低技术问题。In at least some of the embodiments of the present application, after acquiring the original image, first perform human body detection on the original image to obtain a human body image, and then use the trained first model to process the human body image to obtain the processing result of the human body image. At the same time, the purpose of human body detection, 2D and 3D joint point positioning and SMPL model establishment can be realized, and the human body model can be further generated. It is easy to notice that since one model can obtain two-dimensional joint points, three-dimensional joint points and SMPL models at the same time, there is no need to estimate three-dimensional joint points through two-dimensional joint points, thereby achieving the technical effect of improving the accuracy of image recognition, and then It solves the technical problem of low recognition accuracy for two-dimensional and three-dimensional joint point positioning and human body parameter reconstruction in related technologies.
附图说明Description of the drawings
此处所说明的附图用来提供对本申请的进一步理解,构成本申请的一部分,本申请的示意性实施例及其说明用于解释本申请,并不构成对本申请的不当限定。在附图中:The drawings described here are used to provide a further understanding of the application and constitute a part of the application. The exemplary embodiments and descriptions of the application are used to explain the application, and do not constitute an improper limitation of the application. In the attached picture:
图1是根据本申请实施例的一种图像处理方法的流程图;Fig. 1 is a flowchart of an image processing method according to an embodiment of the present application;
图2是根据本申请实施例的一种可选的人体图像的示意图;Fig. 2 is a schematic diagram of an optional human body image according to an embodiment of the present application;
图3是根据本申请实施例的一种可选的人体模型的示意图;Fig. 3 is a schematic diagram of an optional human body model according to an embodiment of the present application;
图4a是根据本申请实施例的一种可选的平均形状的人体模型的示意图;Fig. 4a is a schematic diagram of an optional average-shaped human body model according to an embodiment of the present application;
图4b是根据本申请实施例的一种可选的加上形状参数后生成的人体模型的示意图;4b is a schematic diagram of an optional human body model generated after adding shape parameters according to an embodiment of the present application;
图4c是根据本申请实施例的一种可选的加上形状参数和姿态参数后生成的人体模型的示意图;Fig. 4c is a schematic diagram of an optional human body model generated after adding shape parameters and posture parameters according to an embodiment of the present application;
图4d是根据本申请实施例的一种可选的根据检测到的人体动作生成的人体模型 的示意图;Figure 4d is a schematic diagram of an optional human body model generated based on detected human actions according to an embodiment of the present application;
图5是根据本申请实施例的一种可选的图像处理方法的流程图;Fig. 5 is a flowchart of an optional image processing method according to an embodiment of the present application;
图6是根据本申请实施例的一种可选的GAN网络的示意图;以及Fig. 6 is a schematic diagram of an optional GAN network according to an embodiment of the present application; and
图7是根据本申请实施例的一种图像处理装置的示意图。Fig. 7 is a schematic diagram of an image processing device according to an embodiment of the present application.
具体实施方式Detailed ways
为了使本技术领域的人员更好地理解本申请方案,下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本申请一部分的实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都应当属于本申请保护的范围。In order to enable those skilled in the art to better understand the solutions of the application, the technical solutions in the embodiments of the application will be clearly and completely described below in conjunction with the drawings in the embodiments of the application. Obviously, the described embodiments are only These are a part of the embodiments of this application, but not all of the embodiments. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work should fall within the protection scope of this application.
需要说明的是,本申请的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的本申请的实施例能够以除了在这里图示或描述的那些以外的顺序实施。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,例如,包含了一系列步骤或单元的过程、方法、系统、产品或设备不必限于清楚地列出的那些步骤或单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或单元。It should be noted that the terms “first” and “second” in the specification and claims of the application and the above-mentioned drawings are used to distinguish similar objects, and are not necessarily used to describe a specific sequence or sequence. It should be understood that the data used in this way can be interchanged under appropriate circumstances, so that the embodiments of the present application described herein can be implemented in a sequence other than those illustrated or described herein. In addition, the terms "including" and "having" and any variations of them are intended to cover non-exclusive inclusions. For example, a process, method, system, product, or device that includes a series of steps or units is not necessarily limited to those clearly listed. Those steps or units may include other steps or units that are not clearly listed or are inherent to these processes, methods, products, or equipment.
实施例1Example 1
根据本申请实施例,提供了一种图像处理方法,需要说明的是,在附图的流程图示出的步骤可以在诸如一组计算机可执行指令的计算机系统中执行,并且,虽然在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤。According to an embodiment of the present application, an image processing method is provided. It should be noted that the steps shown in the flowchart of the accompanying drawings can be executed in a computer system such as a set of computer-executable instructions. The logical order is shown in, but in some cases, the steps shown or described can be performed in a different order than here.
图1是根据本申请实施例的一种图像处理方法的流程图,如图1所示,该方法包括如下步骤:Fig. 1 is a flowchart of an image processing method according to an embodiment of the present application. As shown in Fig. 1, the method includes the following steps:
步骤S102,获取原始图像;Step S102, obtaining an original image;
上述原始图像可以是从输入的视频流数据截取的图像,也可以是直接获取的图像,该原始图像中包含有人体。The foregoing original image may be an image intercepted from the input video stream data, or may be an image obtained directly, and the original image contains a human body.
步骤S104,对原始图像进行人体检测,得到人体图像;Step S104: Perform human body detection on the original image to obtain a human body image;
上述人体图像可以是原始图像中提取出的包含完整的人体区域的最小图像,如图 2所示。The above-mentioned human body image may be the smallest image that contains a complete human body region extracted from the original image, as shown in FIG. 2.
在一种可选的实施例中,可以采用深度学习模型,例如Faster RCNN(Faster Region Convolutional Neural Networks,快速区域卷积神经网络)、YOLO(You Only Look Once)和SSD(Single Shot Detector)等检测框架及其变形进行人体检测。本领域技术人员可知,在不同设备和应用场景下,可以选择不同的检测框架,以快速准确地实现人体检测,得到人体图像。In an alternative embodiment, a deep learning model can be used, such as Faster RCNN (Faster Region Convolutional Neural Networks, fast regional convolutional neural network), YOLO (You Only Look Once), and SSD (Single Shot Detector) detection The frame and its deformation are subject to human detection. Those skilled in the art can know that in different devices and application scenarios, different detection frameworks can be selected to quickly and accurately realize human body detection and obtain human body images.
可选地,对原始图像进行人体检测,得到人体图像包括:利用训练好的深度学习模型对原始图像进行处理,得到人体在原始图像中的位置信息;基于位置信息对原始图像进行裁剪和归一化处理,得到人体图像。其中,人体图像中的人体位置可以采用原始图像中包含完整的人体区域的最小包围矩形框表示,以二维坐标(left,top,bottom,right)的形式表达。Optionally, performing human body detection on the original image to obtain a human body image includes: using a trained deep learning model to process the original image to obtain the position information of the human body in the original image; and cropping and normalizing the original image based on the position information Chemical processing to obtain a human body image. Wherein, the position of the human body in the human body image can be represented by the smallest enclosing rectangular frame containing the complete human body area in the original image, and expressed in the form of two-dimensional coordinates (left, top, bottom, right).
步骤S106,利用训练好的第一模型对人体图像进行处理,得到人体图像的处理结果,其中,处理结果包括:二维关节点,三维关节点和SMPL(Skinned Multi-Person Linear,蒙皮多人线性)模型。Step S106: Use the trained first model to process the human body image to obtain the processing result of the human body image, where the processing result includes: two-dimensional joint points, three-dimensional joint points and SMPL (Skinned Multi-Person Linear). Linear) model.
可选地,上述第一模型可以采用沙漏型网络结构或FPN(Feature Pyramid Networks,特征图金字塔)网络结构。例如,输入为w*h图像时,输出的特征图可以是w*h或者w/4*h/4的图像。Optionally, the above-mentioned first model may adopt an hourglass network structure or an FPN (Feature Pyramid Networks) network structure. For example, when the input is a w*h image, the output feature map can be a w*h or w/4*h/4 image.
上述关节点可以是人体上每个关节的位置坐标,例如手腕、手肘等,如图2所示。The above-mentioned joint points may be the position coordinates of each joint on the human body, such as a wrist, an elbow, etc., as shown in FIG. 2.
二维关节点可以表示成热力图(Heat Map)形式,也可以表示成坐标向量形式。其中,对于热力图形式,可以将每个关节点表示成一个特征图,假设输入的人体图像为w*h的图像,那么输出的特征图为相同大小或者等比例缩放的图像,在关节点所在位置的特征图的值是1,其他位置的特征图的值为0。在一个示例中,当人体的二维关节点为16个时,则可以使用16张w*h或者w/2*h/2或者更小的特征图来表示人体的二维关节点。The two-dimensional joint points can be expressed in the form of a heat map, or in the form of a coordinate vector. Among them, for the heat map form, each joint point can be represented as a feature map. Assuming that the input human body image is a w*h image, then the output feature map is an image of the same size or scaled in equal proportions. The value of the feature map of the location is 1, and the value of the feature map of other locations is 0. In an example, when there are 16 two-dimensional joint points of the human body, 16 feature maps of w*h or w/2*h/2 or smaller can be used to represent the two-dimensional joint points of the human body.
三维关节点同样可以有热力图和坐标向量两种表达方式,其中,对于热力图形式,相比于二维关节点,三维关节点在三维空间上增加了z轴信息,将热力图扩散成一个长方体。Three-dimensional joint points can also be expressed in two ways: heat map and coordinate vector. Among them, for the form of heat map, compared with two-dimensional joint points, three-dimensional joint points add z-axis information to the three-dimensional space, spreading the heat map into one cuboid.
在一种可选的实施例中,可以先利用第一模型对人体图像进行处理,得到SMPL模型的参数值;再基于参数值得到二维关节点或三维关节点。In an optional embodiment, the first model may be used to process the human body image to obtain the parameter values of the SMPL model; and then the two-dimensional joint points or the three-dimensional joint points may be obtained based on the parameter values.
步骤S108,根据人体图像的处理结果生成人体模型。In step S108, a human body model is generated according to the processing result of the human body image.
如图3所示,SMPL模型可以包含形状(shape)参数和姿态(pose)参数,根据形 状参数和姿态参数生成的人体模型可以包含多个顶点和三维关节点,每个顶点和三维关节点为包含(x,y,z)坐标的三维向量。图4a至图4c示出了根据形状参数和姿态参数生成人体模型的过程,其中,图4a表示平均形状的人体模型,图4b表示在平均形状基础上加上形状参数后生成的人体模型,图4c表示在平均形状基础上加上形状参数和姿态参数后生成的人体模型。图4d表示在图4c生成的人体模型基础上,根据检测到的人体动作生成的人体模型。通过比较图4b和图4c可以看出,两者的差异不是非常大,因此,在一些应用中,可以仅根据形状参数生成人体模型实现人体建模。As shown in Figure 3, the SMPL model can include shape parameters and pose parameters. The human body model generated according to the shape parameters and pose parameters can include multiple vertices and three-dimensional joint points. Each vertex and three-dimensional joint point is A three-dimensional vector containing (x, y, z) coordinates. Figures 4a to 4c show the process of generating a human body model based on shape parameters and posture parameters. Figure 4a shows a human body model with an average shape, and Figure 4b shows a human body model generated after adding shape parameters to the average shape. 4c represents the human body model generated after adding shape parameters and posture parameters on the basis of the average shape. Fig. 4d shows a human body model generated based on the human body motion detected on the basis of the human body model generated in Fig. 4c. By comparing Figure 4b and Figure 4c, it can be seen that the difference between the two is not very large. Therefore, in some applications, a human body model can be generated only based on shape parameters to achieve human body modeling.
通过本申请上述实施例,在获取到原始图像之后,首先对原始图像进行人体检测,得到人体图像,然后利用训练好的第一模型对人体图像进行处理,得到人体图像的处理结果,从而同时实现人体检测、二维和三维关节点定位以及SMPL模型建立的目的,并可以进一步生成人体模型。容易注意到的是,由于利用一个模型即可同时得到二维关节点,三维关节点和SMPL模型,无需通过二维关节点估计三维关节点,从而达到了提高图像识别准确度的技术效果,进而解决了相关技术中针对二维和三维关节点定位以及人体参数重建的识别准确度较低的技术问题。Through the above-mentioned embodiments of the application, after the original image is obtained, the human body is detected on the original image to obtain the human body image, and then the human body image is processed by the trained first model to obtain the processing result of the human body image, thereby simultaneously achieving The purpose of human body detection, two-dimensional and three-dimensional joint point positioning and SMPL model establishment, and can further generate a human body model. It is easy to notice that since one model can obtain two-dimensional joint points, three-dimensional joint points and SMPL models at the same time, there is no need to estimate three-dimensional joint points through two-dimensional joint points, thereby achieving the technical effect of improving the accuracy of image recognition, and then It solves the technical problem of low recognition accuracy for two-dimensional and three-dimensional joint point positioning and human body parameter reconstruction in related technologies.
在第一个应用场景中,可以实时检测人体动作以驱动人体动画(AVATAR)模型,例如,基于二维关节点和三维关节点去捕获人体动作,使得人体动画模型跟随人体动作做出相应的动作,实现互动交互。In the first application scenario, the human body motion can be detected in real time to drive the human body animation (AVATAR) model, for example, based on 2D joint points and 3D joint points to capture human body motion, so that the human body animation model can follow the human body to make corresponding actions , To achieve interactive interaction.
在第二个应用场景中,可以根据处理结果中的二维关节点和三维关节点实现对人体瘦身等编辑目的,例如,对人体图像上的手臂、腿部、身体等相应位置的图像像素进行处理,实现瘦手臂、瘦腿、瘦腰等图像处理效果。In the second application scenario, according to the two-dimensional joint points and three-dimensional joint points in the processing result, the purpose of slimming the human body can be achieved. Processing to achieve image processing effects such as thin arms, thin legs, and thin waists.
可选地,在本申请上述实施例中,该图像处理方法还包括:获取多组训练样本,其中,每组训练样本中包含:人体图像,二维关节点的第一标记信息,三维关节点的第二标记信息,以及SMPL模型的参数值;利用多组训练样本对预设模型进行训练,并获取预设模型的目标损失值;在目标损失值小于预设值的情况下,停止对预设模型进行训练,并确定预设模型为第一模型;在目标损失值大于预设值的情况下,继续利用多组训练样本对预设模型进行训练,直至目标损失值小于预设值。目标损失值越小,则识别准确度越高,上述预设值可以预先根据图像识别准确度和效率的要求设定,通过该预设值可以确定模型是否训练完成。Optionally, in the foregoing embodiment of the present application, the image processing method further includes: acquiring multiple sets of training samples, wherein each set of training samples includes: a human body image, first label information of two-dimensional joint points, and three-dimensional joint points The second label information of the, and the parameter values of the SMPL model; use multiple sets of training samples to train the preset model, and obtain the target loss value of the preset model; if the target loss value is less than the preset value, stop pre-preparation Set the model for training, and determine that the preset model is the first model; when the target loss value is greater than the preset value, continue to use multiple sets of training samples to train the preset model until the target loss value is less than the preset value. The smaller the target loss value is, the higher the recognition accuracy is. The above-mentioned preset value can be set in advance according to the requirements of image recognition accuracy and efficiency, and it can be determined whether the training of the model is completed or not through the preset value.
可选地,利用多组训练样本对预设模型进行训练,并获取预设模型的目标损失值包括:将多组训练样本输入预设模型,并获取预设模型的输出结果,其中,输出结果包括:二维关节点的第一结果,三维关节点的第二结果和SMPL模型的第三结果;基于第一标记信息和第一结果,得到二维关节点的第一损失值;基于第二标记信息和第二结果,得到三维关节点的第二损失值;基于参数值和第三结果,得到SMPL模型的 第三损失值;基于第一损失值、第二损失值和第三损失值,得到目标损失值。Optionally, using multiple sets of training samples to train the preset model and obtaining the target loss value of the preset model includes: inputting multiple sets of training samples into the preset model and obtaining the output result of the preset model, where the output result Including: the first result of the two-dimensional joint point, the second result of the three-dimensional joint point and the third result of the SMPL model; based on the first label information and the first result, the first loss value of the two-dimensional joint point is obtained; based on the second The label information and the second result are used to obtain the second loss value of the three-dimensional joint point; based on the parameter value and the third result, the third loss value of the SMPL model is obtained; based on the first loss value, the second loss value, and the third loss value, Get the target loss value.
可选地,在本申请上述实施例中,该图像处理方法还包括对训练样本标注二维关节点的第一标记信息,三维关节点的第二标记信息。Optionally, in the foregoing embodiment of the present application, the image processing method further includes marking the training sample with first label information of the two-dimensional joint points and second label information of the three-dimensional joint points.
在一种可选的实施例中,对于二维关节点,第一损失值可以基于预测的热力图(即第一结果)和标记标签的热力图(即第一标记信息)得到,或者基于预测的坐标向量(即第一结果)和标记标签的坐标向量(即第一标记信息)得到,或者基于热力图和坐标向量的综合信息得到。In an alternative embodiment, for the two-dimensional joint points, the first loss value can be obtained based on the predicted heat map (ie, the first result) and the heat map of the label (ie, the first label information), or based on the prediction The coordinate vector (that is, the first result) and the coordinate vector of the tag label (that is, the first tag information) are obtained, or it is obtained based on the integrated information of the heat map and the coordinate vector.
对于三维关节点,同样地,第二损失值可以基于预测的热力图(即第二结果)和标记标签的热力图(即第二标记信息)得到,或者基于预测的坐标向量(即第二结果)和标记标签的坐标向量(即第二标记信息)得到,或者基于热力图和坐标向量的综合信息得到。For three-dimensional joint points, similarly, the second loss value can be obtained based on the predicted heat map (i.e., the second result) and the heat map of the tag label (i.e., the second tag information), or based on the predicted coordinate vector (i.e., the second result) ) And the coordinate vector of the mark label (ie, the second mark information), or based on the integrated information of the heat map and the coordinate vector.
其中,使用坐标向量方式相较于热力图方式,计算较为方便。Among them, the coordinate vector method is more convenient for calculation than the heat map method.
可选地,上述SMPL模型的参数值可以是通过采集装置采集到的真实数值,或,通过对采集装置采集到的参数值进行调整得到的调整数值。在一种可选的实施例中,可以通过真实数值和调整数值预测SMPL模型的参数值,其中,真实数据的权重较大,调整数值的权重较小。Optionally, the parameter value of the aforementioned SMPL model may be the actual value collected by the collecting device, or the adjusted value obtained by adjusting the parameter value collected by the collecting device. In an optional embodiment, the parameter values of the SMPL model can be predicted by the real value and the adjusted value, where the weight of the real data is larger, and the weight of the adjusted value is smaller.
上述的采集装置可以是实验室环境或户外环境中,多个固定位置上设置的摄像头或者传感器。The aforementioned collection device may be cameras or sensors installed in multiple fixed positions in a laboratory environment or an outdoor environment.
由于只有在实验室环境中采集的数据能够获得精准真实的SMPL模型的参数值,户外环境中采集的数据没有办法获取精准的SMPL模型的参数值。因此,在实际计算中,对于SMPL模型,可以基于参数值的类型,采用不同的方式计算第三损失值。可选地,在参数值是通过采集装置采集到的真实数值时,可以采用直接回归的方式计算第三损失值,即基于参数值和第三结果,得到第三损失值;在参数值是通过对采集装置采集到的参数值进行调整得到的调整数值时,可以根据SMPL模型的参数值获得三维关节点,将三维关节点投射到二维平面上获得二维关节点,基于投射的二维关节点和第一标记信息计算二维关节点的第四损失值,将该损失值作为第三损失值,并回传到SMPL模型的参数空间,以更新SMPL模型的参数值。Since only data collected in a laboratory environment can obtain accurate and true parameter values of the SMPL model, data collected in an outdoor environment cannot obtain accurate parameter values of the SMPL model. Therefore, in actual calculations, for the SMPL model, different methods can be used to calculate the third loss value based on the type of parameter value. Optionally, when the parameter value is the real value collected by the acquisition device, the third loss value can be calculated by direct regression, that is, the third loss value is obtained based on the parameter value and the third result; when the parameter value is passed When adjusting the adjusted values obtained by adjusting the parameter values collected by the acquisition device, the three-dimensional joint points can be obtained according to the parameter values of the SMPL model, and the three-dimensional joint points can be projected on the two-dimensional plane to obtain the two-dimensional joint points, based on the projected two-dimensional joints The points and the first mark information calculate the fourth loss value of the two-dimensional joint point, use the loss value as the third loss value, and return it to the parameter space of the SMPL model to update the parameter value of the SMPL model.
在训练过程中,目标损失值为第一损失值、第二损失值和第三损失值的综合,可以通过求取加权和的方式计算得到。In the training process, the target loss value is a combination of the first loss value, the second loss value, and the third loss value, which can be calculated by calculating the weighted sum.
在一种可选的实施例中,在模型训练过程中,二维关节点、三维关节点和SMPL模型的参数可以同时进行学习,整体进行回归,生成模型,另外,如图5所示,可以 采用SMPL模型判别器对SMPL模型的参数值进行判别,判断其参数值是通过网络随机生成的数值还是采集的真实数值,从而提高模型效果的真实性。可选地,SMPL模型判别器对第三结果(即预设模型输出的SMPL模型)的参数值进行处理,得到第三结果的参数值的分类结果,其中,分类结果用于表征第三结果的参数值是否是通过采集装置采集到的真实数值;基于该分类结果和目标损失值,确定是否停止对预设模型进行训练。其中,可以采用对抗生成网络(Generative Adversarial Network,GAN)中的D判别器作为SMPL模型判别器。In an optional embodiment, during the model training process, the two-dimensional joint points, the three-dimensional joint points, and the parameters of the SMPL model can be learned at the same time, and the overall regression can be performed to generate the model. In addition, as shown in Figure 5, you can The SMPL model discriminator is used to discriminate the parameter values of the SMPL model, and determine whether the parameter values are randomly generated values through the network or real values collected, thereby improving the authenticity of the model effect. Optionally, the SMPL model discriminator processes the parameter values of the third result (that is, the SMPL model output by the preset model) to obtain a classification result of the parameter values of the third result, where the classification result is used to characterize the third result Whether the parameter value is the real value collected by the collecting device; based on the classification result and the target loss value, it is determined whether to stop training the preset model. Among them, the D discriminator in the Generative Adversarial Network (GAN) can be used as the SMPL model discriminator.
在一种可选的实施例中,由于户外环境中采集的数据没有办法获取精准的SMPL模型的参数值,所以可能会生成不太正常的参数值,为解决该问题,在本申请实施例中增加GAN网络来对SMPL模型判别器(即D判别器)进行训练,如图6所示,GAN网络包括G生成器和D判别器,D判别器为一个二分类网络,接收来自G生成器随机生成的数值和采集的真实数值,并输出表示数据真实性的标签,例如,当接收到真实数值时,输出接近正标签(通常,正标签设置为1),当接收G生成器随机生成的数值时,输出接近负标签(通常,负标签设置为0),通过D判别器来描述随机生成的数值和真实数值的差异,然后根据该差异来更新G生成器随机生成的数值的权重,使得G生成器随机生成的数值更接近于真实数值,并提高D判别器分辨随机生成的数值和真实数值的能力。In an optional embodiment, because the data collected in the outdoor environment cannot obtain accurate parameter values of the SMPL model, abnormal parameter values may be generated. To solve this problem, in the embodiment of the present application Add GAN network to train SMPL model discriminator (ie D discriminator), as shown in Figure 6, GAN network includes G generator and D discriminator, D discriminator is a two-class network, receiving random from G generator The generated value and the collected real value, and output a label indicating the authenticity of the data, for example, when the real value is received, the output is close to the positive label (usually, the positive label is set to 1), when receiving the value randomly generated by the G generator When the output is close to the negative label (usually, the negative label is set to 0), the D discriminator is used to describe the difference between the randomly generated value and the real value, and then the weight of the value randomly generated by the G generator is updated according to the difference, so that G The value randomly generated by the generator is closer to the real value, and the ability of the D discriminator to distinguish between the randomly generated value and the real value is improved.
实施例2Example 2
根据本申请实施例,提供了一种图像处理装置,该装置可以执行上述实施例1中记载的图像处理方法,该实施例中的优选实施例和应用场景与上述实施例1相同,在此不做赘述。According to an embodiment of the present application, an image processing device is provided, which can execute the image processing method described in the above embodiment 1. The preferred embodiments and application scenarios in this embodiment are the same as those in the above embodiment 1. Do repeat.
图7是根据本申请实施例的一种图像处理装置的示意图,如图7所示,该装置包括:Fig. 7 is a schematic diagram of an image processing device according to an embodiment of the present application. As shown in Fig. 7, the device includes:
获取模块72,设置为获取原始图像;The obtaining module 72 is configured to obtain the original image;
检测模块74,设置为对原始图像进行人体检测,得到人体图像;The detection module 74 is configured to perform human body detection on the original image to obtain a human body image;
处理模块76,设置为利用训练好的第一模型对人体图像进行处理,得到人体图像的处理结果,其中,处理结果包括:二维关节点,三维关节点和SMPL模型的参数值;The processing module 76 is configured to process the human body image by using the trained first model to obtain a processing result of the human body image, where the processing result includes: two-dimensional joint points, three-dimensional joint points, and parameter values of the SMPL model;
生成模块78,设置为根据人体图像的处理结果生成人体模型。The generating module 78 is configured to generate a human body model according to the processing result of the human body image.
此处需要说明的是,上述获取模块72、检测模块74、处理模块76和生成模块78可以作为装置的一部分运行在计算机终端中,可以通过计算机终端中的处理器来执行上述模块实现的功能,计算机终端也可以是智能手机(如Android手机、iOS手机等)、 平板电脑、掌上电脑以及移动互联网设备(Mobile Internet Devices,MID)、PAD等终端设备。It should be noted here that the acquisition module 72, the detection module 74, the processing module 76, and the generation module 78 can be run in a computer terminal as a part of the device, and the functions implemented by the above modules can be executed by the processor in the computer terminal. The computer terminal can also be a smart phone (such as an Android phone, an iOS phone, etc.), a tablet computer, a palmtop computer, and a mobile Internet device (Mobile Internet Devices, MID), PAD, and other terminal devices.
可选地,在本申请上述实施例中,该装置还包括:获取模块还设置为获取多组训练样本,其中,每组训练样本中包含:人体图像,二维关节点的第一标记信息,三维关节点的第二标记信息,以及SMPL模型的参数值;训练模块,设置为利用多组训练样本对预设模型进行训练,并获取预设模型的目标损失值;停止训练模块,设置为在目标损失值小于预设值的情况下,停止对预设模型进行训练,并确定预设模型为第一模型;训练模块还设置为在目标损失值大于预设值的情况下,继续利用多组训练样本对预设模型进行训练,直至目标损失值小于预设值。Optionally, in the above-mentioned embodiment of the present application, the device further includes: the acquisition module is further configured to acquire multiple sets of training samples, wherein each set of training samples includes: a human body image, and first label information of two-dimensional joint points, The second label information of the three-dimensional joint points and the parameter values of the SMPL model; the training module is set to use multiple sets of training samples to train the preset model and obtain the target loss value of the preset model; stop the training module and set it to When the target loss value is less than the preset value, stop training the preset model and determine that the preset model is the first model; the training module is also set to continue to use multiple groups when the target loss value is greater than the preset value The training samples train the preset model until the target loss value is less than the preset value.
此处需要说明的是,上述获取模块、训练模块和停止训练模块可以作为装置的一部分运行在计算机终端中,可以通过计算机终端中的处理器来执行上述模块实现的功能,计算机终端也可以是智能手机(如Android手机、iOS手机等)、平板电脑、掌上电脑以及移动互联网设备、PAD等终端设备。It should be noted here that the aforementioned acquisition module, training module, and stop training module can be run in a computer terminal as part of the device, and the functions implemented by the aforementioned modules can be executed by the processor in the computer terminal, and the computer terminal can also be a smart device. Mobile phones (such as Android phones, iOS phones, etc.), tablet computers, handheld computers, mobile Internet devices, PAD and other terminal devices.
可选地,训练模块包括:获取单元,设置为将多组训练样本输入预设模型,并获取预设模型的输出结果,其中,输出结果包括:二维关节点的第一结果,三维关节点的第二结果和SMPL模型的第三结果;第一处理单元,设置为基于第一标记信息和第一结果,得到二维关节点的第一损失值;第二处理单元,设置为基于第二标记信息和第二结果,得到三维关节点的第二损失值;第三处理单元,设置为基于参数值和第三结果,得到SMPL模型的第三损失值;第四处理单元,设置为基于第一损失值、第二损失值和第三损失值,得到目标损失值。Optionally, the training module includes: an obtaining unit configured to input multiple sets of training samples into the preset model and obtain the output result of the preset model, where the output result includes: the first result of the two-dimensional joint point, the three-dimensional joint point The second result of the SMPL model and the third result of the SMPL model; the first processing unit is set to obtain the first loss value of the two-dimensional joint point based on the first mark information and the first result; the second processing unit is set to be based on the second The label information and the second result are used to obtain the second loss value of the three-dimensional joint point; the third processing unit is set to obtain the third loss value of the SMPL model based on the parameter value and the third result; the fourth processing unit is set to be based on the first A loss value, a second loss value, and a third loss value are used to obtain the target loss value.
此处需要说明的是,上述获取单元、第一处理单元、第二处理单元、第三处理单元和第四处理单元可以作为装置的一部分运行在计算机终端中,可以通过计算机终端中的处理器来执行上述模块实现的功能,计算机终端也可以是智能手机(如Android手机、iOS手机等)、平板电脑、掌上电脑以及移动互联网设备、PAD等终端设备。It should be noted here that the acquisition unit, the first processing unit, the second processing unit, the third processing unit, and the fourth processing unit can be run in the computer terminal as part of the device, and can be processed by the processor in the computer terminal. To perform the functions implemented by the above modules, the computer terminal may also be a smart phone (such as an Android phone, an iOS phone, etc.), a tablet computer, a palmtop computer, a mobile Internet device, a PAD, and other terminal devices.
可选地,第三处理单元还设置为在参数值是通过采集装置采集到的情况下,基于参数值和第三结果,得到第三损失值;在参数值是通过对采集装置采集到的参数值进行调整得到的情况下,基于参数值获得三维关节点,将三维关节点投射到二维平面上获得二维关节点,基于投射的二维关节点和第一标记信息,得到二维关节点的第四损失值,并将第四损失值确定为第三损失值。Optionally, the third processing unit is further configured to obtain a third loss value based on the parameter value and the third result when the parameter value is collected by the collecting device; when the parameter value is the parameter collected by the collecting device When the value is adjusted, the 3D joint points are obtained based on the parameter values, the 3D joint points are projected onto the 2D plane to obtain the 2D joint points, and the 2D joint points are obtained based on the projected 2D joint points and the first label information The fourth loss value of, and the fourth loss value is determined as the third loss value.
可选地,该装置还包括:处理模块还设置为利用判别器对第三结果的参数值进行处理,得到第三结果的参数值的分类结果,其中,分类结果用于表征第三结果的参数值是否是通过采集装置采集到的真实数值;停止训练模块还设置为基于分类结果和目 标损失值,确定是否停止对预设模型进行训练。Optionally, the device further includes: the processing module is further configured to use the discriminator to process the parameter value of the third result to obtain a classification result of the parameter value of the third result, wherein the classification result is used to characterize the parameter of the third result Whether the value is a real value collected by the acquisition device; the stop training module is also set to determine whether to stop training the preset model based on the classification result and the target loss value.
可选地,训练模块还设置为利用生成对抗网络对判别器进行训练。Optionally, the training module is further configured to train the discriminator by using a generative confrontation network.
可选地,检测模块包括:检测单元,设置为利用训练好的第二模型对原始图像进行处理,得到人体在原始图像中的位置信息;第五处理单元,设置为基于位置信息对原始图像进行裁剪和归一化处理,得到人体图像。Optionally, the detection module includes: a detection unit configured to process the original image using the trained second model to obtain the position information of the human body in the original image; the fifth processing unit is configured to perform processing on the original image based on the position information Crop and normalize processing to get the human body image.
此处需要说明的是,上述检测单元和第五处理单元可以作为装置的一部分运行在计算机终端中,可以通过计算机终端中的处理器来执行上述模块实现的功能,计算机终端也可以是智能手机(如Android手机、iOS手机等)、平板电脑、掌上电脑以及移动互联网设备、PAD等终端设备。It should be noted here that the above detection unit and the fifth processing unit can be run in a computer terminal as part of the device, and the functions implemented by the above modules can be executed by the processor in the computer terminal. The computer terminal can also be a smart phone ( Such as Android phones, iOS phones, etc.), tablet computers, handheld computers, mobile Internet devices, PAD and other terminal devices.
本申请实施例所提供的各个功能单元可以在移动终端、计算机终端或者类似的运算装置中运行,也可以作为存储介质的一部分进行存储。The functional units provided in the embodiments of the present application may run in a mobile terminal, a computer terminal or a similar computing device, or may be stored as a part of a storage medium.
由此,本申请的实施例可以提供一种计算机终端,该计算机终端可以是计算机终端群中的任意一个计算机终端设备。可选地,在本实施例中,上述计算机终端也可以替换为移动终端等终端设备。Therefore, the embodiments of the present application may provide a computer terminal, and the computer terminal may be any computer terminal device in a computer terminal group. Optionally, in this embodiment, the above-mentioned computer terminal may also be replaced with a terminal device such as a mobile terminal.
可选地,在本实施例中,上述计算机终端可以位于计算机网络的多个网络设备中的至少一个网络设备。Optionally, in this embodiment, the above-mentioned computer terminal may be located in at least one network device among a plurality of network devices in the computer network.
在本实施例中,上述计算机终端可以执行图像处理方法中以下步骤的程序代码:获取原始图像;对原始图像进行人体检测,得到人体图像;利用训练好的第一模型对人体图像进行处理,得到人体图像的处理结果,其中,处理结果包括:二维关节点,三维关节点和蒙皮多人线性SMPL模型;根据人体图像的处理结果生成人体模型。In this embodiment, the above-mentioned computer terminal can execute the program code of the following steps in the image processing method: obtain the original image; perform human body detection on the original image to obtain the human body image; use the trained first model to process the human body image to obtain The processing result of the human body image, where the processing result includes: two-dimensional joint points, three-dimensional joint points and a skinned multi-person linear SMPL model; the human body model is generated according to the processing result of the human body image.
可选地,该计算机终端可以包括:一个或多个处理器、存储器、以及传输装置。Optionally, the computer terminal may include: one or more processors, memories, and transmission devices.
其中,存储器可用于存储软件程序以及模块,如本申请实施例中的图像处理方法及装置对应的程序指令/模块,处理器通过运行存储在存储器内的软件程序以及模块,从而执行各种功能应用以及数据处理,即实现上述的图像处理方法。存储器可包括高速随机存储器,还可以包括非易失性存储器,如一个或者多个磁性存储装置、闪存、或者其他非易失性固态存储器。在一些实例中,存储器可进一步包括相对于处理器远程设置的存储器,这些远程存储器可以通过网络连接至终端。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。Among them, the memory can be used to store software programs and modules, such as program instructions/modules corresponding to the image processing method and device in the embodiments of the present application. The processor executes various functional applications by running the software programs and modules stored in the memory. And data processing, that is, to achieve the above-mentioned image processing method. The memory may include a high-speed random access memory, and may also include a non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory may further include a memory remotely provided with respect to the processor, and these remote memories may be connected to the terminal through a network. Examples of the aforementioned networks include, but are not limited to, the Internet, corporate intranets, local area networks, mobile communication networks, and combinations thereof.
上述的传输装置用于经由一个网络接收或者发送数据。上述的网络具体实例可包括有线网络及无线网络。在一个实例中,传输装置包括一个网络适配器(Network Interface Controller,NIC),其可通过网线与其他网络设备与路由器相连从而可与互联网或局域网进行通讯。在一个实例中,传输装置为射频(Radio Frequency,RF)模块,其用于通过无线方式与互联网进行通讯。The aforementioned transmission device is used to receive or send data via a network. The above-mentioned specific examples of the network may include a wired network and a wireless network. In one example, the transmission device includes a network adapter (Network Interface Controller, NIC), which can be connected to other network devices and routers via a network cable so as to communicate with the Internet or a local area network. In one example, the transmission device is a radio frequency (RF) module, which is used to communicate with the Internet in a wireless manner.
其中,具体地,存储器用于存储第一模型、蒙皮多人线性SMPL模型、处理结果、以及应用程序。Among them, specifically, the memory is used to store the first model, the skinned multi-person linear SMPL model, the processing result, and the application program.
处理器可以通过传输装置调用存储器存储的信息及应用程序,以执行上述方法实施例中的各个可选或优选实施例的方法步骤的程序代码。The processor may call the information and application programs stored in the memory through the transmission device to execute the program code of the method steps of each optional or preferred embodiment in the foregoing method embodiments.
本领域普通技术人员可以理解,计算机终端也可以是智能手机(如Android手机、iOS手机等)、平板电脑、掌上电脑以及移动互联网设备、PAD等终端设备。Those of ordinary skill in the art can understand that the computer terminal may also be a smart phone (such as an Android phone, an iOS phone, etc.), a tablet computer, a palmtop computer, a mobile Internet device, a PAD, and other terminal devices.
本领域普通技术人员可以理解上述实施例的各种方法中的全部或部分步骤是可以通过程序来指令终端设备相关的硬件来完成,该程序可以存储于一计算机可读存储介质中,存储介质可以包括:闪存盘、只读存储器(Read-Only Memory,ROM)、随机存取器(Random Access Memory,RAM)、磁盘或光盘等。Those of ordinary skill in the art can understand that all or part of the steps in the various methods of the above-mentioned embodiments can be completed by instructing the relevant hardware of the terminal device through a program. The program can be stored in a computer-readable storage medium, which can be Including: flash disk, read-only memory (Read-Only Memory, ROM), random access device (Random Access Memory, RAM), magnetic disk or optical disk, etc.
实施例3Example 3
根据本申请实施例,提供了一种计算机可读存储介质,计算机可读存储介质包括存储的程序,其中,在程序运行时控制计算机可读存储介质所在设备执行上述实施例1中的图像处理方法。According to an embodiment of the present application, a computer-readable storage medium is provided, and the computer-readable storage medium includes a stored program, wherein when the program is running, the device where the computer-readable storage medium is located is controlled to execute the image processing method in Embodiment 1 above. .
可选地,在本实施例中,上述计算机可读存储介质可以位于计算机网络中计算机终端群中的任意一个计算机终端中,或者位于移动终端群中的任意一个移动终端中。Optionally, in this embodiment, the above-mentioned computer-readable storage medium may be located in any computer terminal in a computer terminal group in a computer network, or located in any mobile terminal in a mobile terminal group.
可选地,在本实施例中,计算机可读存储介质被设置为存储用于执行以下步骤的程序代码:获取原始图像;对原始图像进行人体检测,得到人体图像;利用训练好的第一模型对人体图像进行处理,得到人体图像的处理结果,其中,处理结果包括:二维关节点,三维关节点和蒙皮多人线性SMPL模型;根据人体图像的处理结果生成人体模型。Optionally, in this embodiment, the computer-readable storage medium is configured to store program code for performing the following steps: obtain an original image; perform human body detection on the original image to obtain a human body image; use the trained first model The human body image is processed to obtain the processing result of the human body image, where the processing result includes: two-dimensional joint points, three-dimensional joint points and a skin multi-person linear SMPL model; the human body model is generated according to the processing result of the human body image.
可选地,在本实施例中,计算机可读存储介质还可以被设置为存储图像处理方法提供的各种优选地或可选的方法步骤的程序代码。Optionally, in this embodiment, the computer-readable storage medium may also be configured to store the program code of various preferred or optional method steps provided by the image processing method.
如上参照附图以示例的方式描述了根据本发明的图像处理方法及装置。但是,本 领域技术人员应当理解,对于上述本发明所提出的图像处理方法及装置,还可以在不脱离本发明内容的基础上做出各种改进。因此,本发明的保护范围应当由所附的权利要求书的内容确定。As above, the image processing method and device according to the present invention are described by way of example with reference to the accompanying drawings. However, those skilled in the art should understand that various improvements can be made to the image processing method and device proposed by the present invention without departing from the content of the present invention. Therefore, the protection scope of the present invention should be determined by the content of the appended claims.
实施例4Example 4
根据本申请实施例,提供了一种处理器,处理器设置为运行程序,其中,程序运行时执行上述实施例1中的图像处理方法。According to an embodiment of the present application, a processor is provided, and the processor is configured to run a program, wherein the image processing method in Embodiment 1 is executed when the program is running.
上述本申请实施例序号仅仅为了描述,不代表实施例的优劣。The serial numbers of the foregoing embodiments of the present application are for description only, and do not represent the superiority or inferiority of the embodiments.
在本申请的上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其他实施例的相关描述。In the above-mentioned embodiments of the present application, the description of each embodiment has its own focus. For a part that is not described in detail in an embodiment, reference may be made to related descriptions of other embodiments.
在本申请所提供的几个实施例中,应该理解到,所揭露的技术内容,可通过其它的方式实现。其中,以上所描述的装置实施例仅仅是示意性的,例如所述单元的划分,可以为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,单元或模块的间接耦合或通信连接,可以是电性或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed technical content can be implemented in other ways. The device embodiments described above are only illustrative. For example, the division of the units may be a logical function division, and there may be other divisions in actual implementation, for example, multiple units or components may be combined or may be Integrate into another system, or some features can be ignored or not implemented. In addition, the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, units or modules, and may be in electrical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, the functional units in the various embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit. The above-mentioned integrated unit can be implemented in the form of hardware or software functional unit.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可为个人计算机、服务器或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、移动硬盘、磁碟或者光盘等各种可以存储程序代码的介质。If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer readable storage medium. Based on this understanding, the technical solution of the present application essentially or the part that contributes to the existing technology or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium , Including several instructions to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the method described in each embodiment of the present application. The aforementioned storage media include: U disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), mobile hard disk, magnetic disk or optical disk and other media that can store program codes. .
以上所述仅是本申请的优选实施方式,应当指出,对于本技术领域的普通技术人 员来说,在不脱离本申请原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本申请的保护范围。The above are only the preferred embodiments of this application. It should be pointed out that for those of ordinary skill in the art, without departing from the principle of this application, several improvements and modifications can be made, and these improvements and modifications are also Should be regarded as the scope of protection of this application.
工业实用性Industrial applicability
如上所述,本申请至少部分实施例提供的图像处理方法和装置具有以下有益效果:由于利用一个模型即可同时得到二维关节点,三维关节点和SMPL模型,无需通过二维关节点估计三维关节点,从而达到了提高图像识别准确度的技术效果,进而解决了相关技术中针对二维和三维关节点定位以及人体参数重建的识别准确度较低技术问题。As described above, the image processing method and device provided by at least some of the embodiments of the present application have the following beneficial effects: Since one model can be used to obtain two-dimensional joint points, three-dimensional joint points and SMPL models at the same time, there is no need to estimate three-dimensional through two-dimensional joint points. The key points, so as to achieve the technical effect of improving the accuracy of image recognition, and then solve the technical problem of low recognition accuracy for two-dimensional and three-dimensional joint point positioning and human body parameter reconstruction in related technologies.

Claims (17)

  1. 一种图像处理方法,包括:An image processing method, including:
    获取原始图像;Get the original image;
    对所述原始图像进行人体检测,得到人体图像;Performing human body detection on the original image to obtain a human body image;
    利用训练好的第一模型对所述人体图像进行处理,得到所述人体图像的处理结果,其中,所述处理结果包括:二维关节点,三维关节点和蒙皮多人线性SMPL模型;Use the trained first model to process the human body image to obtain a processing result of the human body image, where the processing result includes: two-dimensional joint points, three-dimensional joint points, and a skinned multi-person linear SMPL model;
    根据所述人体图像的处理结果生成人体模型。A human body model is generated according to the processing result of the human body image.
  2. 根据权利要求1所述的方法,其中,所述方法还包括:The method according to claim 1, wherein the method further comprises:
    获取多组训练样本,其中,每组训练样本包含:人体图像,二维关节点的第一标记信息,三维关节点的第二标记信息,以及SMPL模型的参数值;Obtain multiple sets of training samples, where each set of training samples includes: a human body image, the first label information of the two-dimensional joint points, the second label information of the three-dimensional joint points, and the parameter values of the SMPL model;
    利用所述多组训练样本对预设模型进行训练,并获取所述预设模型的目标损失值;Training a preset model by using the multiple sets of training samples, and obtaining a target loss value of the preset model;
    在所述目标损失值小于预设值的情况下,停止对所述预设模型进行训练,并确定所述预设模型为所述第一模型;If the target loss value is less than a preset value, stop training the preset model, and determine that the preset model is the first model;
    在所述目标损失值大于所述预设值的情况下,继续利用所述多组训练样本对所述预设模型进行训练,直至所述目标损失值小于所述预设值。In a case where the target loss value is greater than the preset value, continue to use the multiple sets of training samples to train the preset model until the target loss value is less than the preset value.
  3. 根据权利要求2所述的方法,其中,利用所述多组训练样本对预设模型进行训练,并获取所述预设模型的目标损失值包括:The method according to claim 2, wherein training a preset model using the multiple sets of training samples and obtaining a target loss value of the preset model comprises:
    将所述多组训练样本输入所述预设模型,并获取所述预设模型的输出结果,其中,所述输出结果包括:所述二维关节点的第一结果,所述三维关节点的第二结果和SMPL模型的第三结果;The multiple sets of training samples are input to the preset model, and the output result of the preset model is obtained, where the output result includes: the first result of the two-dimensional joint point, and the first result of the three-dimensional joint point The second result and the third result of the SMPL model;
    基于所述第一标记信息和所述第一结果,得到所述二维关节点的第一损失值;Obtaining a first loss value of the two-dimensional joint point based on the first label information and the first result;
    基于所述第二标记信息和所述第二结果,得到所述三维关节点的第二损失值;Obtaining a second loss value of the three-dimensional joint point based on the second mark information and the second result;
    基于所述参数值和所述第三结果,得到所述SMPL模型的第三损失值;Obtaining a third loss value of the SMPL model based on the parameter value and the third result;
    基于所述第一损失值、所述第二损失值和所述第三损失值,得到所述目标损失值。Based on the first loss value, the second loss value, and the third loss value, the target loss value is obtained.
  4. 根据权利要求3所述的方法,其中,所述SMPL模型的参数值是通过采集装置采集到的真实数值,或,通过对所述采集装置采集到的参数值进行调整得到的调整 数值。The method according to claim 3, wherein the parameter value of the SMPL model is a real value collected by a collecting device, or an adjusted value obtained by adjusting the parameter value collected by the collecting device.
  5. 根据权利要求4所述的方法,其中,基于所述参数值和所述第三结果,得到所述SMPL模型的第三损失值包括:The method according to claim 4, wherein, based on the parameter value and the third result, obtaining the third loss value of the SMPL model comprises:
    在所述参数值是通过所述采集装置采集到的真实数值的情况下,基于所述参数值和所述第三结果,得到所述第三损失值;In the case that the parameter value is a real value collected by the collecting device, obtain the third loss value based on the parameter value and the third result;
    在所述参数值是通过对所述采集装置采集到的参数值进行调整得到的调整数值的情况下,基于所述参数值获得三维关节点,将所述三维关节点投射到二维平面上获得二维关节点,基于投射的二维关节点和所述第一标记信息,得到所述二维关节点的第四损失值,并将所述第四损失值确定为所述第三损失值。In the case where the parameter value is an adjustment value obtained by adjusting the parameter value collected by the acquisition device, a three-dimensional joint point is obtained based on the parameter value, and the three-dimensional joint point is projected onto a two-dimensional plane to obtain For a two-dimensional joint point, a fourth loss value of the two-dimensional joint point is obtained based on the projected two-dimensional joint point and the first label information, and the fourth loss value is determined as the third loss value.
  6. 根据权利要求3所述的方法,其中,所述方法还包括:The method according to claim 3, wherein the method further comprises:
    利用判别器对所述第三结果的参数值进行处理,得到所述第三结果的参数值的分类结果,其中,所述分类结果用于表征所述第三结果的参数值是否是通过采集装置采集到的真实数值;The parameter value of the third result is processed by a discriminator to obtain a classification result of the parameter value of the third result, wherein the classification result is used to characterize whether the parameter value of the third result passes through the acquisition device The actual value collected;
    基于所述分类结果和所述目标损失值,确定是否停止对所述预设模型进行训练。Based on the classification result and the target loss value, it is determined whether to stop training the preset model.
  7. 根据权利要求6所述的方法,其中,利用生成对抗网络对所述判别器进行训练。The method according to claim 6, wherein the discriminator is trained using a generative adversarial network.
  8. 根据权利要求1所述的方法,其中,对所述原始图像进行人体检测,得到人体图像包括:The method according to claim 1, wherein performing human body detection on the original image to obtain a human body image comprises:
    利用训练好的第二模型对所述原始图像进行处理,得到人体在所述原始图像中的位置信息;Processing the original image by using the trained second model to obtain position information of the human body in the original image;
    基于所述位置信息对所述原始图像进行裁剪和归一化处理,得到所述人体图像。The original image is cropped and normalized based on the position information to obtain the human body image.
  9. 根据权利要求1所述的方法,其中,所述第一模型采用沙漏型网络结构或特征图金字塔FPN网络结构。The method according to claim 1, wherein the first model adopts an hourglass-type network structure or a feature map pyramid FPN network structure.
  10. 根据权利要求1所述的方法,其中,利用训练好的第一模型对所述人体图像进行处理,得到所述人体图像的处理结果包括:The method according to claim 1, wherein processing the human body image by using the trained first model to obtain the processing result of the human body image comprises:
    利用所述第一模型对所述人体图像进行处理,得到所述SMPL模型;Processing the human body image by using the first model to obtain the SMPL model;
    基于所述SMPL模型得到所述二维关节点或所述三维关节点。Obtain the two-dimensional joint point or the three-dimensional joint point based on the SMPL model.
  11. 根据权利要求1所述的方法,其中,所述二维关节点采用热力图形式或坐标向量 形式进行表达,所述三维关节点采用热力图形式或坐标向量形式进行表达。The method according to claim 1, wherein the two-dimensional joint points are expressed in a heat map form or a coordinate vector form, and the three-dimensional joint points are expressed in a heat map form or a coordinate vector form.
  12. 根据权利要求1所述的方法,其中,在根据所述人体图像的处理结果生成人体模型之后,所述方法还包括:The method according to claim 1, wherein after generating a human body model according to the processing result of the human body image, the method further comprises:
    基于所述二维关节点和所述三维关节点捕获人体动作;Capturing human body movements based on the two-dimensional joint points and the three-dimensional joint points;
    基于所述人体动作驱动人体动画模型。The human body animation model is driven based on the human body motion.
  13. 根据权利要求1所述的方法,其中,在根据所述人体图像的处理结果生成人体模型之后,所述方法还包括:The method according to claim 1, wherein after generating a human body model according to the processing result of the human body image, the method further comprises:
    基于所述二维关节点和所述三维关节点,对所述人体图像上的目标位置的图像像素进行处理。Based on the two-dimensional joint points and the three-dimensional joint points, image pixels of the target position on the human body image are processed.
  14. 根据权利要求4所述的方法,其中,所述SMPL模型的参数值通过所述真实数值和所述调整数值预测得到,其中,所述真实数值的权重大于所述调整数值的权重。The method according to claim 4, wherein the parameter value of the SMPL model is obtained by predicting the real value and the adjusted value, wherein the weight of the real value is greater than the weight of the adjusted value.
  15. 一种图像处理装置,包括:An image processing device, including:
    获取模块,设置为获取原始图像;The acquisition module is set to acquire the original image;
    检测模块,设置为对所述原始图像进行人体检测,得到人体图像;The detection module is configured to perform human body detection on the original image to obtain a human body image;
    处理模块,设置为利用训练好的第一模型对所述人体图像进行处理,得到所述人体图像的处理结果,其中,所述处理结果包括:二维关节点,三维关节点和蒙皮多人线性SMPL模型;The processing module is configured to process the human body image by using the trained first model to obtain a processing result of the human body image, wherein the processing result includes: two-dimensional joint points, three-dimensional joint points and multiple skins Linear SMPL model;
    生成模块,设置为根据所述人体图像的处理结果生成人体模型。The generating module is configured to generate a human body model according to the processing result of the human body image.
  16. 一种计算机可读存储介质,所述计算机可读存储介质包括存储的程序,其中,在所述程序运行时控制所述计算机可读存储介质所在设备执行权利要求1至14中任意一项所述的图像处理方法。A computer-readable storage medium, the computer-readable storage medium includes a stored program, wherein, when the program is running, the device where the computer-readable storage medium is located is controlled to execute any one of claims 1 to 14 Image processing method.
  17. 一种处理器,所述处理器设置为运行程序,其中,所述程序运行时执行权利要求1至14中任意一项所述的图像处理方法。A processor configured to run a program, wherein the image processing method according to any one of claims 1 to 14 is executed when the program is running.
PCT/CN2021/080280 2020-03-27 2021-03-11 Image processing method and device WO2021190321A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
KR1020227037422A KR20220160066A (en) 2020-03-27 2021-03-11 Image processing method and apparatus
JP2022558577A JP7448679B2 (en) 2020-03-27 2021-03-11 Image processing method and device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010231605.7A CN113449570A (en) 2020-03-27 2020-03-27 Image processing method and device
CN202010231605.7 2020-03-27

Publications (1)

Publication Number Publication Date
WO2021190321A1 true WO2021190321A1 (en) 2021-09-30

Family

ID=77808126

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/080280 WO2021190321A1 (en) 2020-03-27 2021-03-11 Image processing method and device

Country Status (4)

Country Link
JP (1) JP7448679B2 (en)
KR (1) KR20220160066A (en)
CN (1) CN113449570A (en)
WO (1) WO2021190321A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114157526A (en) * 2021-12-23 2022-03-08 广州新华学院 Digital image recognition-based home security remote monitoring method and device
CN114299204A (en) * 2021-12-22 2022-04-08 深圳市海清视讯科技有限公司 Three-dimensional cartoon character model generation method and device
CN115482557A (en) * 2022-10-09 2022-12-16 中国电信股份有限公司 Human body image generation method, system, device and storage medium
CN115775300A (en) * 2022-12-23 2023-03-10 北京百度网讯科技有限公司 Reconstruction method of human body model, training method and device of human body reconstruction model
CN117351432A (en) * 2023-12-04 2024-01-05 环球数科集团有限公司 Training system for multi-target recognition model of scenic spot tourist
CN117745978A (en) * 2024-02-20 2024-03-22 四川大学华西医院 Simulation quality control method, equipment and medium based on human body three-dimensional reconstruction algorithm
CN115775300B (en) * 2022-12-23 2024-06-11 北京百度网讯科技有限公司 Human body model reconstruction method, human body model reconstruction training method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140204013A1 (en) * 2013-01-18 2014-07-24 Microsoft Corporation Part and state detection for gesture recognition
CN108345869A (en) * 2018-03-09 2018-07-31 南京理工大学 Driver's gesture recognition method based on depth image and virtual data
CN109615582A (en) * 2018-11-30 2019-04-12 北京工业大学 A kind of face image super-resolution reconstruction method generating confrontation network based on attribute description
CN110298916A (en) * 2019-06-21 2019-10-01 湖南大学 A kind of 3 D human body method for reconstructing based on synthesis depth data

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6373026B2 (en) 2014-03-20 2018-08-15 株式会社東芝 Image processing apparatus, image processing system, image processing method, and program
JP6912215B2 (en) 2017-02-09 2021-08-04 国立大学法人東海国立大学機構 Detection method and detection program to detect the posture of an object
CN108053469A (en) * 2017-12-26 2018-05-18 清华大学 Complicated dynamic scene human body three-dimensional method for reconstructing and device under various visual angles camera
JP2020030613A (en) 2018-08-22 2020-02-27 富士通株式会社 Information processing device, data calculating program and data calculating method
CN109285215B (en) * 2018-08-28 2021-01-08 腾讯科技(深圳)有限公司 Human body three-dimensional model reconstruction method and device and storage medium
CN109702741B (en) 2018-12-26 2020-12-18 中国科学院电子学研究所 Mechanical arm vision grasping system and method based on self-supervision learning neural network
CN109859296B (en) * 2019-02-01 2022-11-29 腾讯科技(深圳)有限公司 Training method of SMPL parameter prediction model, server and storage medium
CN110020633B (en) * 2019-04-12 2022-11-04 腾讯科技(深圳)有限公司 Training method of posture recognition model, image recognition method and device
CN110188598B (en) 2019-04-13 2022-07-05 大连理工大学 Real-time hand posture estimation method based on MobileNet-v2

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140204013A1 (en) * 2013-01-18 2014-07-24 Microsoft Corporation Part and state detection for gesture recognition
CN108345869A (en) * 2018-03-09 2018-07-31 南京理工大学 Driver's gesture recognition method based on depth image and virtual data
CN109615582A (en) * 2018-11-30 2019-04-12 北京工业大学 A kind of face image super-resolution reconstruction method generating confrontation network based on attribute description
CN110298916A (en) * 2019-06-21 2019-10-01 湖南大学 A kind of 3 D human body method for reconstructing based on synthesis depth data

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114299204A (en) * 2021-12-22 2022-04-08 深圳市海清视讯科技有限公司 Three-dimensional cartoon character model generation method and device
CN114157526A (en) * 2021-12-23 2022-03-08 广州新华学院 Digital image recognition-based home security remote monitoring method and device
CN114157526B (en) * 2021-12-23 2022-08-12 广州新华学院 Digital image recognition-based home security remote monitoring method and device
CN115482557A (en) * 2022-10-09 2022-12-16 中国电信股份有限公司 Human body image generation method, system, device and storage medium
CN115482557B (en) * 2022-10-09 2023-11-17 中国电信股份有限公司 Human body image generation method, system, equipment and storage medium
CN115775300A (en) * 2022-12-23 2023-03-10 北京百度网讯科技有限公司 Reconstruction method of human body model, training method and device of human body reconstruction model
CN115775300B (en) * 2022-12-23 2024-06-11 北京百度网讯科技有限公司 Human body model reconstruction method, human body model reconstruction training method and device
CN117351432A (en) * 2023-12-04 2024-01-05 环球数科集团有限公司 Training system for multi-target recognition model of scenic spot tourist
CN117351432B (en) * 2023-12-04 2024-02-23 环球数科集团有限公司 Training system for multi-target recognition model of scenic spot tourist
CN117745978A (en) * 2024-02-20 2024-03-22 四川大学华西医院 Simulation quality control method, equipment and medium based on human body three-dimensional reconstruction algorithm
CN117745978B (en) * 2024-02-20 2024-04-30 四川大学华西医院 Simulation quality control method, equipment and medium based on human body three-dimensional reconstruction algorithm

Also Published As

Publication number Publication date
CN113449570A (en) 2021-09-28
JP7448679B2 (en) 2024-03-12
JP2023519012A (en) 2023-05-09
KR20220160066A (en) 2022-12-05

Similar Documents

Publication Publication Date Title
WO2021190321A1 (en) Image processing method and device
CN110139115B (en) Method and device for controlling virtual image posture based on key points and electronic equipment
KR102319177B1 (en) Method and apparatus, equipment, and storage medium for determining object pose in an image
CN111598998B (en) Three-dimensional virtual model reconstruction method, three-dimensional virtual model reconstruction device, computer equipment and storage medium
WO2020228766A1 (en) Target tracking method and system based on real scene modeling and intelligent recognition, and medium
KR101791590B1 (en) Object pose recognition apparatus and method using the same
WO2023071964A1 (en) Data processing method and apparatus, and electronic device and computer-readable storage medium
JP7311640B2 (en) Behavior prediction method and device, gait recognition method and device, electronic device, and computer-readable storage medium
JP7292492B2 (en) Object tracking method and device, storage medium and computer program
CN109934847A (en) The method and apparatus of weak texture three-dimension object Attitude estimation
Bultmann et al. Real-time multi-view 3D human pose estimation using semantic feedback to smart edge sensors
Li et al. Lidarcap: Long-range marker-less 3d human motion capture with lidar point clouds
Liu et al. 4d human body capture from egocentric video via 3d scene grounding
JP2018013999A (en) Pose estimation device, method, and program
CN109992809A (en) A kind of construction method of buildings model, device and storage device
CN111899279A (en) Method and device for detecting motion speed of target object
CN115482556A (en) Method for key point detection model training and virtual character driving and corresponding device
Shen et al. Learning human mesh recovery in 3D scenes
CN104978583A (en) Person action recognition method and person action recognition device
CN117711066A (en) Three-dimensional human body posture estimation method, device, equipment and medium
CN112183431A (en) Real-time pedestrian number statistical method and device, camera and server
CN114548224A (en) 2D human body pose generation method and device for strong interaction human body motion
CN113761965B (en) Motion capture method, motion capture device, electronic equipment and storage medium
CN114463663A (en) Method and device for calculating height of person, electronic equipment and storage medium
CN104182990B (en) A kind of Realtime sequence images motion target area acquisition methods

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21776258

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2022558577

Country of ref document: JP

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 20227037422

Country of ref document: KR

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21776258

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 21776258

Country of ref document: EP

Kind code of ref document: A1