WO2022033513A1 - Target segmentation method and apparatus, and computer-readable storage medium and computer device - Google Patents

Target segmentation method and apparatus, and computer-readable storage medium and computer device Download PDF

Info

Publication number
WO2022033513A1
WO2022033513A1 PCT/CN2021/112044 CN2021112044W WO2022033513A1 WO 2022033513 A1 WO2022033513 A1 WO 2022033513A1 CN 2021112044 W CN2021112044 W CN 2021112044W WO 2022033513 A1 WO2022033513 A1 WO 2022033513A1
Authority
WO
WIPO (PCT)
Prior art keywords
target
image
segmentation
frame
target frame
Prior art date
Application number
PCT/CN2021/112044
Other languages
French (fr)
Chinese (zh)
Inventor
贾配洋
林晓帆
蔡锦霖
Original Assignee
影石创新科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 影石创新科技股份有限公司 filed Critical 影石创新科技股份有限公司
Publication of WO2022033513A1 publication Critical patent/WO2022033513A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/11Region-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/50Image enhancement or restoration by the use of more than one image, e.g. averaging, subtraction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/12Edge-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20212Image combination
    • G06T2207/20221Image fusion; Image merging

Abstract

The present application is applicable to the field of image processing. Provided are a target segmentation method and apparatus, and a computer-readable storage medium and a computer device. The method comprises: obtaining a first target frame in a target image by using a preset target acquisition model; enlarging the first target frame according to a first preset proportion, so as to obtain an enlarged first target frame; by using a preset target segmentation model, processing an image area corresponding to the enlarged first target frame, so as to obtain a mask image after target segmentation, and a second target frame located in the mask image after target segmentation; mapping the mask image after target segmentation to the size of a target image, so as to obtain a first mask image of the target image; and enlarging the second target frame according to a second preset proportion, so as to obtain an enlarged second target frame, and fusing an image corresponding to the enlarged second target frame with the first mask image, so as to obtain a second mask image of the target image. The present application facilitates correcting a wrongly segmented pixel outside a target instance, thereby improving the accuracy of target segmentation.

Description

目标分割方法、装置、计算机可读存储介质及计算机设备Object segmentation method, apparatus, computer-readable storage medium, and computer device 技术领域technical field
本申请属于图像处理领域,尤其涉及一种目标分割方法、装置、计算机可读存储介质及计算机设备。The present application belongs to the field of image processing, and in particular, relates to a target segmentation method, apparatus, computer-readable storage medium, and computer equipment.
背景技术Background technique
目标分割是指对图像中包含目标的部分进行目标分割检测,将图像中的目标和背景进行分离,得到目标分割后的掩膜图以供后续对目标进行处理。目标可以是人像、动物、车等任何目标。例如,当目标是人像时,后续对目标进行处理可以是对目标进行美颜、虚化等处理。Target segmentation refers to performing target segmentation and detection on the part of the image containing the target, separating the target and the background in the image, and obtaining a mask map after target segmentation for subsequent processing of the target. The target can be anything like a portrait, an animal, a car, etc. For example, when the target is a portrait, the subsequent processing of the target may be to perform processing such as beautifying and blurring the target.
然而,现有技术的目标分割方法通常是采用预设分割模型,根据扩大后的第一目标框获取目标分割后的掩膜图。采用这种方法的分割精度不高,无法准确分割出目标的边缘。However, the target segmentation method in the prior art usually adopts a preset segmentation model, and obtains a mask image after target segmentation according to the enlarged first target frame. The segmentation accuracy of this method is not high, and the edge of the target cannot be accurately segmented.
技术问题technical problem
本申请实施例的目的在于提供一种目标分割方法、装置、计算机可读存储介质、计算机设备及相机,旨在解决以上问题之一。The purpose of the embodiments of the present application is to provide a target segmentation method, apparatus, computer-readable storage medium, computer equipment, and camera, aiming to solve one of the above problems.
技术解决方案technical solutions
第一方面,本申请提供了一种目标分割方法,所述方法包括:In a first aspect, the present application provides a target segmentation method, the method comprising:
S101、采用预设的目标获取模型获得目标图像中的第一目标框,所述目标图像具有待分割的目标;S101, using a preset target acquisition model to obtain a first target frame in a target image, where the target image has a target to be segmented;
S102、按照第一预设比例扩大第一目标框,得到扩大后的第一目标框;S102, expanding the first target frame according to a first preset ratio to obtain an enlarged first target frame;
S103、采用预设的目标分割模型对扩大后的第一目标框对应的图像区域进行处理,获得目标分割后的掩膜图和位于所述目标分割后的掩膜图中的第二目标框;S103, using a preset target segmentation model to process the image area corresponding to the enlarged first target frame, to obtain a mask image after target segmentation and a second target frame located in the mask image after the target segmentation;
S104、将目标分割后的掩膜图映射到目标图像的尺寸,获得目标图像的第一掩膜图;按照第二预设比例扩大第二目标框,得到扩大后的第二目标框,将扩大后的第二目标框对应的图像与第一掩膜图进行融合获得目标图像的第二掩膜图。S104: Map the mask image after the target segmentation to the size of the target image to obtain a first mask image of the target image; expand the second target frame according to a second preset ratio to obtain an enlarged second target frame, and expand the second target frame. The image corresponding to the second target frame is fused with the first mask image to obtain a second mask image of the target image.
第二方面,本申请提供了一种目标分割装置,所述装置包括:In a second aspect, the present application provides a target segmentation device, the device comprising:
第一目标框获取模块,用于采用预设的目标获取模型获得目标图像中的第一目标框,所述目标图像具有待分割的目标;a first target frame acquisition module, configured to use a preset target acquisition model to obtain a first target frame in a target image, where the target image has a target to be segmented;
第一扩大模块,用于按照第一预设比例扩大第一目标框,得到扩大后的第一目标框;a first enlargement module, configured to enlarge the first target frame according to a first preset ratio to obtain an enlarged first target frame;
目标分割模块,用于采用预设的目标分割模型对扩大后的第一目标框对应的图像区域进行处理,获得目标分割后的掩膜图和位于所述目标分割后的掩膜图中的第二目标框;The target segmentation module is used to process the image area corresponding to the enlarged first target frame by using the preset target segmentation model, and obtain the mask image after target segmentation and the first target image in the mask image after the target segmentation. Two target boxes;
第二扩大模块,用于将目标分割后的掩膜图映射到目标图像的尺寸,获得目标图像的第一掩膜图;并按照第二预设比例扩大第二目标框,得到扩大后的第二目标框,将扩大后的第二目标框对应的图像与第一掩膜图进行融合获得目标图像的第二掩膜图。The second expansion module is used to map the mask image after the target segmentation to the size of the target image to obtain the first mask image of the target image; and expand the second target frame according to the second preset ratio to obtain the enlarged first mask image. The second target frame is obtained by fusing the image corresponding to the enlarged second target frame with the first mask image to obtain the second mask image of the target image.
第三方面,本申请提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时实现如所述的目标分割方法的步骤。In a third aspect, the present application provides a computer-readable storage medium, where the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, implements the steps of the object segmentation method as described above.
第四方面,本申请提供了一种计算机设备,包括:In a fourth aspect, the present application provides a computer device, comprising:
一个或多个处理器;one or more processors;
存储器;以及memory; and
一个或多个计算机程序,所述处理器和所述存储器通过总线连接,其中所述一个或多个计算机程序被存储在所述存储器中,并且被配置成由所述一个或多个处理器执行,所述处理器执行所述计算机程序时实现如所述的目标分割方法的步骤。one or more computer programs, the processor and the memory connected by a bus, wherein the one or more computer programs are stored in the memory and configured to be executed by the one or more processors , the processor implements the steps of the object segmentation method when executing the computer program.
第五方面,本申请提供了一种相机,包括:In a fifth aspect, the application provides a camera, including:
一个或多个处理器;one or more processors;
存储器;以及memory; and
一个或多个计算机程序,所述处理器和所述存储器通过总线连接,其中所述一个或多个计算机程序被存储在所述存储器中,并且被配置成由所述一个或多个处理器执行,所述处理器执行所述计算机程序时实现如所述的目标分割方法的步骤。one or more computer programs, the processor and the memory connected by a bus, wherein the one or more computer programs are stored in the memory and configured to be executed by the one or more processors , the processor implements the steps of the object segmentation method when executing the computer program.
有益效果beneficial effect
在本申请实施例中,由于采用预设的目标分割模型对扩大后的第一目标框对应的图像区域进行处理后获得目标分割后的掩膜图和位于所述目标分割后的掩膜图中的第二目标框;然后将目标分割后的掩膜图映射到目标图像的尺寸,并按照第二预设比例扩大第二目标框,得到扩大后的第二目标框,将扩大后的第二目标框对应的图像与第一掩膜图进行融合获得目标图像的第二掩膜图。因此有助于修正目标实例外的误分割像素,提高了目标分割的精度。In the embodiment of the present application, because the preset target segmentation model is used to process the image area corresponding to the enlarged first target frame, the mask map after target segmentation and the mask map located in the target segmentation are obtained after processing. The second target frame of The image corresponding to the target frame is fused with the first mask image to obtain a second mask image of the target image. Therefore, it is helpful to correct the wrongly segmented pixels outside the target instance and improve the accuracy of target segmentation.
附图说明Description of drawings
图1是本申请一实施例提供的目标分割方法的应用场景示意图。FIG. 1 is a schematic diagram of an application scenario of a target segmentation method provided by an embodiment of the present application.
图2是本申请一实施例提供的目标分割方法的流程图。FIG. 2 is a flowchart of a target segmentation method provided by an embodiment of the present application.
图3是目标图像为平面图像的示意图。FIG. 3 is a schematic diagram of a target image being a plane image.
图4是将第一目标框的每个边扩大第一预设比例,得到扩大后的第一目标框的示意图。FIG. 4 is a schematic diagram of enlarging each side of the first target frame by a first preset ratio to obtain an enlarged first target frame.
图5是目标分割后的掩膜图的示意图。FIG. 5 is a schematic diagram of a mask map after object segmentation.
图6是目标图像的第一掩膜图的示意图。FIG. 6 is a schematic diagram of a first mask map of a target image.
图7是扩大后的第二目标框的示意图。FIG. 7 is a schematic diagram of an enlarged second target frame.
图8是目标图像的第二掩膜图的示意图。FIG. 8 is a schematic diagram of a second mask map of the target image.
图9是目标图像的人物分割图的示意图。FIG. 9 is a schematic diagram of a person segmentation map of a target image.
图10是具有多个待分割的目标的目标图像的示意图。FIG. 10 is a schematic diagram of an object image with multiple objects to be segmented.
图11是目标图像的第三掩膜图的示意图。FIG. 11 is a schematic diagram of a third mask map of the target image.
图12是目标图像的人物分割图的示意图。FIG. 12 is a schematic diagram of a person segmentation map of a target image.
图13是本申请一实施例提供的目标分割装置示意图。FIG. 13 is a schematic diagram of a target segmentation apparatus provided by an embodiment of the present application.
图14是本申请一实施例提供的计算机设备的具体结构框图。FIG. 14 is a specific structural block diagram of a computer device provided by an embodiment of the present application.
图15是本申请一实施例提供的相机的具体结构框图。FIG. 15 is a specific structural block diagram of a camera provided by an embodiment of the present application.
本发明的实施方式Embodiments of the present invention
为了使本发明的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。In order to make the objectives, technical solutions and advantages of the present invention clearer, the present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention.
以下结合具体实施例对本发明的具体实现进行详细描述:The specific implementation of the present invention is described in detail below in conjunction with specific embodiments:
本申请一实施例提供的目标分割方法的应用场景可以是计算机设备或相机,计算机设备或相机执行本申请一实施例提供的目标分割方法获得目标图像的掩膜图。本申请一实施例提供的目标分割方法的应用场景也可以包括相连接的计算机设备100和相机200(如图1所示)。计算机设备100和相机200中可运行至少一个的应用程序。计算机设备100可以是服务器、台式计算机、移动终端等,移动终端包括手机、平板电脑、笔记本电脑、个人数字助理等。相机200可以是普通的相机或者全景相机等。普通的相机是指用于拍摄平面图像和平面视频的拍摄装置。计算机设备100或者是相机200执行本申请一实施例提供的目标分割方法获得目标图像的掩膜图。An application scenario of the target segmentation method provided by an embodiment of the present application may be a computer device or a camera, and the computer device or the camera executes the target segmentation method provided by an embodiment of the present application to obtain a mask image of a target image. An application scenario of the target segmentation method provided by an embodiment of the present application may also include a connected computer device 100 and a camera 200 (as shown in FIG. 1 ). At least one application program of the computer device 100 and the camera 200 may be executed. The computer device 100 may be a server, a desktop computer, a mobile terminal, and the like, and the mobile terminal includes a mobile phone, a tablet computer, a notebook computer, a personal digital assistant, and the like. The camera 200 may be an ordinary camera or a panoramic camera or the like. A common camera refers to a photographing device for taking flat images and flat videos. The computer device 100 or the camera 200 executes the target segmentation method provided by an embodiment of the present application to obtain a mask image of the target image.
请参阅图2,是本申请一实施例提供的目标分割方法的流程图,本实施例主要以该目标分割方法应用于计算机设备或相机为例来举例说明,本申请一实施例提供的目标分割方法包括以下步骤:Please refer to FIG. 2 , which is a flowchart of a target segmentation method provided by an embodiment of the present application. This embodiment mainly takes the application of the target segmentation method to a computer device or a camera as an example for illustration. The target segmentation method provided by an embodiment of the present application The method includes the following steps:
S101、采用预设的目标获取模型获得目标图像中的第一目标框,所述目标图像具有待分割的目标。S101. Use a preset target acquisition model to obtain a first target frame in a target image, where the target image has a target to be segmented.
在本申请一实施例中,所述目标图像可为平面图像(如图3所示)、全景图像或从全景图像中截取的待分割的目标对应的图像。In an embodiment of the present application, the target image may be a plane image (as shown in FIG. 3 ), a panoramic image, or an image corresponding to the target to be segmented captured from the panoramic image.
待分割的目标可以是人、动物、车等任何目标。待分割的目标对应的图像可包括待分割的目标的一部分,例如待分割的目标是人时,待分割的目标对应的图像可以是人脸、人上半身或者完整的人体等。The target to be segmented can be any target such as people, animals, cars, etc. The image corresponding to the target to be segmented may include a part of the target to be segmented. For example, when the target to be segmented is a person, the image corresponding to the target to be segmented may be a human face, a human upper body, or a complete human body.
预设的目标获取模型可以是目标检测模型或目标跟踪模型。目标检测模型可以为经典的机器学习方法检测模型或利用目标检测数据集进行学习的深度学习目标检测模型。目标跟踪模型包括线图模型(Stick Figures Model)、 二维轮廓模型 (2D Contours Model)和立体模型(Volumetric Model)等。The preset target acquisition model may be a target detection model or a target tracking model. The target detection model can be a classical machine learning method detection model or a deep learning target detection model that uses target detection datasets to learn. Object tracking models include line graph models (Stick Figures Model), 2D Contours Model) and three-dimensional model (Volumetric Model) and so on.
如果是针对视频中的目标进行实时分割的话,则预设的目标获取模型可以是目标跟踪模型,也可以是目标检测模型。If the target in the video is segmented in real time, the preset target acquisition model may be a target tracking model or a target detection model.
在本申请一实施例中,目标检测模型可以是利用目标检测数据集进行学习的深度学习目标检测模型。如果采用利用目标检测数据集进行学习的深度学习目标检测模型获得目标图像中的第一目标框,目标检测数据集包括全景图像时,对于全景图像、从全景图像中截取的待分割的目标对应的图像和目标存在稍微形变的平面图像,图像分割效果比经典的机器学习方法检测模型效果更好。因为目标检测数据集包括全景图像,全景图像中物体有形变,经过数据训练后,模型输出的结果对有形变的目标分割的效果比其他算法好,模型在形变图像上的泛化能力更强,能够兼容处理有稍微形变的图像,目标的分割精度较高。In an embodiment of the present application, the target detection model may be a deep learning target detection model that is learned by using a target detection data set. If the first target frame in the target image is obtained by using a deep learning target detection model that uses target detection data sets for learning, and when the target detection data set includes panoramic images, for the panoramic image and the target to be segmented intercepted from the panoramic image corresponding to The image and the target have a slightly deformed plane image, and the image segmentation effect is better than the detection model of the classical machine learning method. Because the target detection data set includes panoramic images, the objects in the panoramic images are deformed. After data training, the results of the model output are better than other algorithms for the segmentation of deformed targets, and the model has stronger generalization ability on deformed images. It can be compatible with slightly deformed images, and the segmentation accuracy of the target is higher.
在本申请一实施例中,所述利用目标检测数据集进行学习的深度学习目标检测模型是单阶段的目标检测模型或双阶段的目标检测模型,并在保证精度的情况下通过模型压缩等方法降低计算量,因此在模型推理速度上相对于没有通过模型压缩的方法更有优势。模型压缩是指对于深度学习网络结构和参数进行压缩,即采用更少的网络层数,更少的深度卷积神经网络的通道数量等达到相似或符合任务要求精度。In an embodiment of the present application, the deep learning target detection model that uses the target detection data set for learning is a single-stage target detection model or a two-stage target detection model, and the model compression method is used under the condition of ensuring accuracy. Reduce the amount of computation, so it has an advantage in the model inference speed compared to the method without model compression. Model compression refers to the compression of the deep learning network structure and parameters, that is, using fewer network layers and fewer channels of the deep convolutional neural network to achieve similar accuracy or meet the task requirements.
S102、按照第一预设比例扩大第一目标框,得到扩大后的第一目标框。S102: Enlarge the first target frame according to a first preset ratio to obtain an enlarged first target frame.
在本申请一实施例中,S102具体可以为:In an embodiment of the present application, S102 may specifically be:
将第一目标框的每个边扩大第一预设比例,得到扩大后的第一目标框(如图4所示)。第一预设比例的范围可为5%至30%之间,即扩大后的第一目标框的每个边是原第一目标框的每个边的105%至130%之间,当然也可以是其他比例。Each side of the first target frame is enlarged by a first preset ratio to obtain an enlarged first target frame (as shown in FIG. 4 ). The range of the first preset ratio can be between 5% and 30%, that is, each side of the enlarged first target frame is between 105% and 130% of each side of the original first target frame. Other ratios are possible.
由于扩边有可能超出图像的边界,因此所述将第一目标框的每条边扩大第一预设比例,得到扩大后的第一目标框之后,所述方法还可以包括:Since the edge expansion may exceed the boundary of the image, after the expansion of each edge of the first target frame by a first preset ratio to obtain the enlarged first target frame, the method may further include:
判断扩大后的第一目标框的所有边中是否有超出目标图像覆盖的范围的,如果有,则将超出目标图像覆盖的范围的边修改为扩大至与目标图像所对应的边缘一致的位置。因此能避免扩大后的第一目标框超出图像的边界。Determine whether all sides of the expanded first target frame are beyond the range covered by the target image, and if so, modify the side beyond the range covered by the target image to be expanded to a position consistent with the edge corresponding to the target image. Therefore, the expanded first target frame can be prevented from exceeding the boundary of the image.
S103、采用预设的目标分割模型对扩大后的第一目标框对应的图像区域进行处理,获得目标分割后的掩膜图和位于所述目标分割后的掩膜图中的第二目标框(如图5所示)。S103, using a preset target segmentation model to process the image area corresponding to the enlarged first target frame to obtain a mask image after target segmentation and a second target frame located in the mask image after the target segmentation ( as shown in Figure 5).
所述位于所述目标分割后的掩膜图中的第二目标框具体可以通过以下方式获得:识别目标分割后的掩膜图中的目标,并将目标的边界形成的矩形框作为第二目标框。The second target frame in the mask image after the target segmentation can be specifically obtained by: identifying the target in the mask image after the target segmentation, and using the rectangular frame formed by the boundary of the target as the second target frame.
所述预设的目标分割模型可为经典的分割机器学习算法,如YOLACT(You Only Look At CoefficienTs)等算法,还可为采用利用获得的目标分割数据集进行学习的深度学习目标分割模型,该模型经过学习训练可实现在端侧的部署,如相机、计算机设备等,可达到实时实例分割的效果。本申请实施例中的目标分割数据集可以包括平面图像和全景图像。全景图像具有更好的泛化能力,使目标的分割效果更好。且对于目标轮廓标注更加精准,每个目标实例标注polygon(多边形)建模实例由更多个像素点组成。The preset target segmentation model can be a classic segmentation machine learning algorithm, such as YOLACT (You Only Look At CoefficienTs) and other algorithms, and can also be a deep learning target segmentation model that uses the obtained target segmentation data set for learning. The effect of real-time instance segmentation. The target segmentation dataset in this embodiment of the present application may include plane images and panoramic images. Panoramic images have better generalization ability, which makes the segmentation of objects better. Moreover, the target contour annotation is more accurate, and each target instance labeling polygon (polygon) modeling instance consists of more pixels.
YOLACT(You Only Look At CoefficienTs)等算法为单阶段实例分割算法模型,无需对应区域的多尺度特征信息(多尺度特征信息在神经网络结构中从不同卷积层提取)。单阶段实例分割模型,速度更快,对于深度神经网络做了进一步模型压缩等,以进一步提高实例分割模型速度。Algorithms such as YOLACT (You Only Look At CoefficienTs) are single-stage instance segmentation algorithm models that do not require multi-scale feature information of the corresponding region (multi-scale feature information is extracted from different convolutional layers in the neural network structure). The single-stage instance segmentation model is faster, and further model compression is performed for the deep neural network to further improve the speed of the instance segmentation model.
所述采用利用获得的目标分割数据集进行学习的深度学习目标分割模型可以是单阶段目标分割模型或双阶段目标分割模型,并在保证精度的情况下通过模型压缩等方法降低计算量,因此在模型推理速度上相对于没有通过模型压缩的方法更有优势。模型压缩是指对于深度学习网络结构和参数进行压缩,即采用更少的网络层数,更少的深度卷积神经网络的通道数量等达到相似或符合任务要求精度。The deep learning target segmentation model that uses the obtained target segmentation data set for learning can be a single-stage target segmentation model or a two-stage target segmentation model, and the amount of calculation is reduced through model compression and other methods under the condition of ensuring accuracy. Compared with the method without model compression, the model inference speed is more advantageous. Model compression refers to the compression of the deep learning network structure and parameters, that is, using fewer network layers and fewer channels of the deep convolutional neural network to achieve similar accuracy or meet the task requirements.
对于目标分割模型,目前开源的数据集,对于人物的实例分割精度不够,无法准确分割出人物的边缘。比如coco(Common Objects in Context,上下文中的公共对象)数据集,由于标注比较粗糙,无法满足高精度人物分割需求。Supervisely数据集标注精度较高,但是数据量较小,包含图像场景有限,无法直接使用其进行训练并应用到高精度人物分割产品。本申请可以采用利用目标分割数据集进行学习的深度学习目标分割模型,本申请中采用的目标分割数据集相较于coco数据集,对于人体轮廓标注更加精准,基于此数据集训练后的目标分割模型对于人体边缘、随身携带物品(如手机、帽子、头盔、背包、头盔、球拍、雨伞等人体可能随身携带或手持的)等具有更好的分割效果,分割精度较高。For the target segmentation model, the current open source datasets have insufficient instance segmentation accuracy for characters, and cannot accurately segment the edges of characters. For example, the coco (Common Objects in Context, public objects in the context) dataset cannot meet the requirements of high-precision person segmentation due to the rough labeling. The Supervisely dataset has high labeling accuracy, but the amount of data is small and contains limited image scenes, so it cannot be directly used for training and applied to high-precision person segmentation products. This application can use a deep learning target segmentation model that uses the target segmentation data set for learning. Compared with the coco data set, the target segmentation data set used in this application is more accurate for human body contour labeling. The target segmentation after training based on this data set The model has better segmentation effect and higher segmentation accuracy for human body edges and portable items (such as mobile phones, hats, helmets, backpacks, helmets, rackets, umbrellas, etc., which may be carried or held by the human body).
S104、将目标分割后的掩膜图映射到目标图像的尺寸,获得目标图像的第一掩膜图;按照第二预设比例扩大第二目标框,得到扩大后的第二目标框,将扩大后的第二目标框对应的图像与第一掩膜图进行融合获得目标图像的第二掩膜图。S104: Map the mask image after the target segmentation to the size of the target image to obtain a first mask image of the target image; expand the second target frame according to a second preset ratio to obtain an enlarged second target frame, and expand the second target frame. The image corresponding to the second target frame is fused with the first mask image to obtain a second mask image of the target image.
在本申请一实施例中,所述将目标分割后的掩膜图映射到目标图像的尺寸,获得目标图像的第一掩膜图具体为:In an embodiment of the present application, the mask image obtained after the target segmentation is mapped to the size of the target image to obtain the first mask image of the target image is specifically:
扩大目标分割后的掩膜图的边,使扩大边后的目标分割后的掩膜图的尺寸与目标图像的尺寸相同,获得目标图像的第一掩膜图,如图6所示。Expand the edge of the mask image after the target segmentation, so that the size of the mask image after the expanded edge is the same as the size of the target image, and obtain the first mask image of the target image, as shown in Figure 6.
由于S102中,按照第一预设比例扩大第一目标框,得到扩大后的第一目标框时,可能增加更多背景像素,图像中可能包含超过一个人,可能存在其它人干扰或背景误分类。因此通过所述按照第二预设比例扩大第二目标框(如图7中的内框),得到扩大后的第二目标框(如图7所示)。Since the first target frame is enlarged according to the first preset ratio in S102, when the enlarged first target frame is obtained, more background pixels may be added, the image may contain more than one person, and there may be interference from other people or misclassification of the background . Therefore, by expanding the second target frame (as shown in the inner frame in FIG. 7 ) according to the second preset ratio, an enlarged second target frame (as shown in FIG. 7 ) is obtained.
第二预设比例的范围可为5%至30%之间,即扩大后的第二目标框的每个边是原第二目标框的每个边的105%至130%之间,当然也可以是其他比例。The range of the second preset ratio can be between 5% and 30%, that is, each side of the enlarged second target frame is between 105% and 130% of each side of the original second target frame. Other ratios are possible.
S104之后,所述方法还可以包括以下步骤:After S104, the method may further include the following steps:
将目标图像的第二掩膜图(如图8所示)与目标图像进行融合,获得目标图像的人物分割图(如图9所示)。The second mask image of the target image (as shown in Figure 8) is fused with the target image to obtain a person segmentation map of the target image (as shown in Figure 9).
在S101中,如果所述目标图像具有多个待分割的目标,则获得的第一目标框有多个。In S101, if the target image has multiple targets to be segmented, there are multiple first target frames obtained.
如图10所示,所述目标图像具有多个待分割的目标,则针对每个待分割的目标均执行S102至S104,得到多个所述目标图像的第二掩膜图,然后将多个所述目标图像的第二掩膜图进行融合得到目标图像的第三掩膜图,如图11所示,最后将目标图像的第三掩膜图与目标图像进行融合,获得目标图像的人物分割图,如图12所示。As shown in FIG. 10 , the target image has a plurality of targets to be segmented, then S102 to S104 are performed for each target to be segmented to obtain a plurality of second mask images of the target images, and then multiple The second mask of the target image is fused to obtain the third mask of the target image, as shown in Figure 11, and finally the third mask of the target image is fused with the target image to obtain the character segmentation of the target image Figure, as shown in Figure 12.
在本申请一实施例中,S101前,所述方法还可以包括以下步骤:In an embodiment of the present application, before S101, the method may further include the following steps:
对目标图像进行归一化得到归一化后的目标图像。归一化是降采样的一种方式,对目标图像进行归一化是指对目标图像进行降采样,以提高计算速度。The target image is normalized to obtain the normalized target image. Normalization is a way of down-sampling, and normalizing the target image refers to down-sampling the target image to improve the calculation speed.
则S101具体为:采用预设的目标获取模型获得归一化后的目标图像中的第一目标框,所述目标图像具有待分割的目标。S101 is specifically as follows: using a preset target acquisition model to obtain a first target frame in a normalized target image, where the target image has a target to be segmented.
在本申请一实施例中,在S102和S103之间,所述方法还可以包括以下步骤:In an embodiment of the present application, between S102 and S103, the method may further include the following steps:
对扩大后的第一目标框对应的图像区域进行归一化得到归一化后的图像。即对预设的目标分割模型的输入做归一化(减均值、除方差等),对预设的目标分割模型的输入做归一化是目前深度学习大部分模型的常规操作,即将图像的像素值从0-255归一化到如[-1,+1]之间的,以0为中心,可以加速模型收敛等。The image area corresponding to the enlarged first target frame is normalized to obtain a normalized image. That is to normalize the input of the preset target segmentation model (minus the mean, divide the variance, etc.), and normalize the input of the preset target segmentation model is the routine operation of most deep learning models at present, that is, the image The pixel value is normalized from 0-255 to such as [-1,+1], centered at 0, which can speed up model convergence, etc.
则S103具体为:采用预设的目标分割模型对归一化后的图像进行处理,获得目标分割后的掩膜图和位于所述目标分割后的掩膜图中的第二目标框。S103 is specifically as follows: using a preset target segmentation model to process the normalized image to obtain a mask image after target segmentation and a second target frame located in the mask image after target segmentation.
S103和S104之间,所述方法还可以包括以下步骤:Between S103 and S104, the method may further include the following steps:
对目标分割后的掩膜图进行上采样操作。Perform an upsampling operation on the mask image after target segmentation.
上采样是指将目标分割后的掩膜图放大,以得到和输入预设的目标分割模型的图像尺寸一致的目标分割后的掩膜图。Upsampling refers to enlarging the mask image after target segmentation to obtain a mask image after target segmentation that is consistent with the image size of the input preset target segmentation model.
请参阅图13,本申请一实施例提供的目标分割装置可以是运行于计算机设备或相机中的一个计算机程序或一段程序代码,例如该目标分割装置为一个应用软件;该目标分割装置可以用于执行本申请实施例提供的目标分割方法中的相应步骤。本申请一实施例提供的目标分割装置包括:Referring to FIG. 13 , the target segmentation device provided by an embodiment of the present application may be a computer program or a piece of program code running in a computer device or a camera, for example, the target segmentation device is an application software; the target segmentation device can be used for The corresponding steps in the target segmentation method provided by the embodiment of the present application are performed. The target segmentation device provided by an embodiment of the present application includes:
第一目标框获取模块11,用于采用预设的目标获取模型获得目标图像中的第一目标框,所述目标图像具有待分割的目标;The first target frame acquisition module 11 is used to obtain the first target frame in the target image by adopting a preset target acquisition model, and the target image has the target to be segmented;
第一扩大模块12,用于按照第一预设比例扩大第一目标框,得到扩大后的第一目标框;The first enlargement module 12 is used to enlarge the first target frame according to a first preset ratio to obtain the enlarged first target frame;
目标分割模块13,用于采用预设的目标分割模型对扩大后的第一目标框对应的图像区域进行处理,获得目标分割后的掩膜图和位于所述目标分割后的掩膜图中的第二目标框;The target segmentation module 13 is configured to use a preset target segmentation model to process the image area corresponding to the enlarged first target frame, and obtain a mask image after target segmentation and a mask image located in the mask image after the target segmentation. the second target frame;
第二扩大模块14,用于将目标分割后的掩膜图映射到目标图像的尺寸,获得目标图像的第一掩膜图;并按照第二预设比例扩大第二目标框,得到扩大后的第二目标框,将扩大后的第二目标框对应的图像与第一掩膜图进行融合获得目标图像的第二掩膜图。The second enlargement module 14 is used for mapping the mask image after the target segmentation to the size of the target image to obtain the first mask image of the target image; and expanding the second target frame according to the second preset ratio to obtain the enlarged For the second target frame, the image corresponding to the enlarged second target frame is fused with the first mask image to obtain a second mask image of the target image.
本申请一实施例提供的目标分割装置与本申请一实施例提供的目标分割方法属于同一构思,其具体实现过程详见说明书全文,此处不再赘述。The target segmentation device provided by an embodiment of the present application and the target segmentation method provided by an embodiment of the present application belong to the same concept, and the specific implementation process thereof is detailed in the full text of the specification, which will not be repeated here.
本申请一实施例还提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时实现如本申请一实施例提供的目标分割方法的步骤。An embodiment of the present application further provides a computer-readable storage medium, where the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, implements the target segmentation method provided by an embodiment of the present application. step.
图14示出了本申请一实施例提供的计算机设备的具体结构框图,该计算机设备可以是图1中所示的计算机设备,一种计算机设备100包括:一个或多个处理器101、存储器102、以及一个或多个计算机程序,其中所述处理器101和所述存储器102通过总线连接,所述一个或多个计算机程序被存储在所述存储器102中,并且被配置成由所述一个或多个处理器101执行,所述处理器101执行所述计算机程序时实现如本申请一实施例提供的目标分割方法的步骤。FIG. 14 shows a specific structural block diagram of a computer device provided by an embodiment of the present application. The computer device may be the computer device shown in FIG. 1 . A computer device 100 includes: one or more processors 101 and a memory 102 , and one or more computer programs, wherein the processor 101 and the memory 102 are connected by a bus, the one or more computer programs are stored in the memory 102 and are configured to be executed by the one or A plurality of processors 101 execute, and when the processors 101 execute the computer program, the steps of the target segmentation method provided by an embodiment of the present application are implemented.
计算机设备可以是台式计算机、移动终端等,移动终端包括手机、平板电脑、笔记本电脑、个人数字助理等。The computer equipment may be a desktop computer, a mobile terminal, etc., and the mobile terminal includes a mobile phone, a tablet computer, a notebook computer, a personal digital assistant, and the like.
图15示出了本申请一实施例提供的相机的具体结构框图,该相机可以是图1中所示的相机,一种相机200包括:一个或多个处理器201、存储器202、以及一个或多个计算机程序,其中所述处理器201和所述存储器202通过总线连接,所述一个或多个计算机程序被存储在所述存储器202中,并且被配置成由所述一个或多个处理器201执行,所述处理器201执行所述计算机程序时实现如本申请一实施例提供的目标分割方法的步骤。FIG. 15 shows a specific structural block diagram of a camera provided by an embodiment of the present application. The camera may be the camera shown in FIG. 1 . A camera 200 includes: one or more processors 201 , a memory 202 , and one or more A plurality of computer programs, wherein the processor 201 and the memory 202 are connected by a bus, the one or more computer programs are stored in the memory 202, and are configured to be executed by the one or more processors 201 is executed. When the processor 201 executes the computer program, the steps of the target segmentation method provided by an embodiment of the present application are implemented.
在本申请中,由于采用预设的目标分割模型对扩大后的第一目标框对应的图像区域进行处理后获得目标分割后的掩膜图和位于所述目标分割后的掩膜图中的第二目标框;然后将目标分割后的掩膜图映射到目标图像的尺寸,并按照第二预设比例扩大第二目标框,得到扩大后的第二目标框,将扩大后的第二目标框对应的图像与第一掩膜图进行融合获得目标图像的第二掩膜图。因此有助于修正目标实例外的误分割像素,即其它非实例目标的像素、其它类似目标像素等,提高了目标分割的精度。In the present application, because the preset target segmentation model is used to process the image area corresponding to the enlarged first target frame, the mask image after target segmentation and the first target image in the mask image after target segmentation are obtained. Second target frame; then map the mask image after target segmentation to the size of the target image, and expand the second target frame according to the second preset ratio to obtain the enlarged second target frame, and the enlarged second target frame The corresponding image is fused with the first mask image to obtain a second mask image of the target image. Therefore, it is helpful to correct the wrongly segmented pixels outside the target instance, that is, the pixels of other non-instance targets, other similar target pixels, etc., and improve the accuracy of target segmentation.
应该理解的是,本申请各实施例中的各个步骤并不是必然按照步骤标号指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,这些步骤可以以其它的顺序执行。而且,各实施例中至少一部分步骤可以包括多个子步骤或者多个阶段,这些子步骤或者阶段并不必然是在同一时刻执行完成,而是可以在不同的时刻执行,这些子步骤或者阶段的执行顺序也不必然是依次进行,而是可以与其它步骤或者其它步骤的子步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that, the steps in the embodiments of the present application are not necessarily executed sequentially in the order indicated by the step numbers. Unless explicitly stated herein, the execution of these steps is not strictly limited to the order, and these steps may be performed in other orders. Moreover, at least a part of the steps in each embodiment may include multiple sub-steps or multiple stages. These sub-steps or stages are not necessarily executed and completed at the same time, but may be executed at different times. The execution of these sub-steps or stages The sequence is also not necessarily sequential, but may be performed alternately or alternately with other steps or sub-steps of other steps or at least a portion of a phase.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成,所述的程序可存储于一非易失性计算机可读取存储介质中,该程序在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器(ROM)、可编程ROM(PROM)、电可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM (SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双数据率SDRAM(DDRSDRAM)、增强型SDRAM (ESDRAM)、同步链路(Synchlink)DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be implemented by instructing relevant hardware through a computer program, and the program can be stored in a non-volatile computer-readable storage medium , when the program is executed, it may include the flow of the above-mentioned method embodiments. Wherein, any reference to memory, storage, database or other medium used in the various embodiments provided in this application may include non-volatile and/or volatile memory. Nonvolatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory may include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in various forms such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Road (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
以上实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above embodiments can be combined arbitrarily. For the sake of brevity, all possible combinations of the technical features in the above embodiments are not described. However, as long as there is no contradiction in the combination of these technical features, all It is considered to be the range described in this specification.
以上实施例仅表达了本发明的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本发明构思的前提下,还可以做出若干变形和改进,这些都属于本发明的保护范围。 因此,本发明专利的保护范围应以所附权利要求为准。The above examples only represent several embodiments of the present invention, and the descriptions thereof are specific and detailed, but should not be construed as a limitation on the scope of the invention patent. It should be pointed out that for those of ordinary skill in the art, without departing from the concept of the present invention, several modifications and improvements can also be made, which all belong to the protection scope of the present invention. Therefore, the protection scope of the patent of the present invention should be subject to the appended claims.

Claims (10)

  1. 一种目标分割方法,其特征在于,所述方法包括:A target segmentation method, characterized in that the method comprises:
    S101、采用预设的目标获取模型获得目标图像中的第一目标框,所述目标图像具有待分割的目标;S101, using a preset target acquisition model to obtain a first target frame in a target image, where the target image has a target to be segmented;
    S102、按照第一预设比例扩大第一目标框,得到扩大后的第一目标框;S102, expanding the first target frame according to a first preset ratio to obtain an enlarged first target frame;
    S103、采用预设的目标分割模型对扩大后的第一目标框对应的图像区域进行处理,获得目标分割后的掩膜图和位于所述目标分割后的掩膜图中的第二目标框;S103, using a preset target segmentation model to process the image area corresponding to the enlarged first target frame, to obtain a mask image after target segmentation and a second target frame located in the mask image after the target segmentation;
    S104、将目标分割后的掩膜图映射到目标图像的尺寸,获得目标图像的第一掩膜图;按照第二预设比例扩大第二目标框,得到扩大后的第二目标框,将扩大后的第二目标框对应的图像与第一掩膜图进行融合获得目标图像的第二掩膜图。S104: Map the mask image after the target segmentation to the size of the target image to obtain a first mask image of the target image; expand the second target frame according to a second preset ratio to obtain an enlarged second target frame, and expand the second target frame. The image corresponding to the second target frame is fused with the first mask image to obtain a second mask image of the target image.
  2. 如权利要求1所述的目标分割方法,其特征在于,所述预设的目标获取模型是目标检测模型或目标跟踪模型。The target segmentation method according to claim 1, wherein the preset target acquisition model is a target detection model or a target tracking model.
  3. 如权利要求1所述的目标分割方法,其特征在于,所述S102具体为:The target segmentation method according to claim 1, wherein the S102 is specifically:
    将第一目标框的每个边扩大第一预设比例,得到扩大后的第一目标框;所述将第一目标框的每条边扩大第一预设比例,得到扩大后的第一目标框之后,所述方法还包括:Enlarging each side of the first target frame by a first preset ratio to obtain an enlarged first target frame; expanding each side of the first target frame by a first preset ratio to obtain an enlarged first target frame After the box, the method further includes:
    判断扩大后的第一目标框的所有边中是否有超出目标图像覆盖的范围的,如果有,则将超出目标图像覆盖的范围的边修改为扩大至与目标图像所对应的边缘一致的位置。Determine whether all sides of the expanded first target frame are beyond the range covered by the target image, and if so, modify the side beyond the range covered by the target image to be expanded to a position consistent with the edge corresponding to the target image.
  4. 如权利要求1所述的目标分割方法,其特征在于,所述S104之后,所述方法还包括:The target segmentation method according to claim 1, wherein after the S104, the method further comprises:
    将目标图像的第二掩膜图与目标图像进行融合,获得目标图像的人物分割图。The second mask image of the target image is fused with the target image to obtain a person segmentation map of the target image.
  5. 如权利要求1至4任一项所述的目标分割方法,其特征在于,在S101中,如果所述目标图像具有多个待分割的目标,则获得的第一目标框有多个;The target segmentation method according to any one of claims 1 to 4, wherein, in S101, if the target image has multiple targets to be segmented, there are multiple first target frames obtained;
    针对每个待分割的目标均执行S102至S104,得到多个所述目标图像的第二掩膜图,然后将多个所述目标图像的第二掩膜图进行融合得到目标图像的第三掩膜图,最后将目标图像的第三掩膜图与目标图像进行融合,获得目标图像的人物分割图。S102 to S104 are executed for each target to be segmented to obtain a plurality of second masks of the target images, and then the second masks of the plurality of target images are fused to obtain a third mask of the target image Finally, the third mask image of the target image is fused with the target image to obtain the person segmentation map of the target image.
  6. 如权利要求1所述的目标分割方法,其特征在于,所述S101之前,所述方法还包括:对目标图像进行归一化得到归一化后的目标图像;The target segmentation method according to claim 1, wherein before the step S101, the method further comprises: normalizing the target image to obtain a normalized target image;
    S101具体为:采用预设的目标获取模型获得归一化后的目标图像中的第一目标框;S101 is specifically: using a preset target acquisition model to obtain a first target frame in the normalized target image;
    在S102和S103之间,所述方法还包括:对扩大后的第一目标框对应的图像区域进行归一化得到归一化后的图像;Between S102 and S103, the method further includes: normalizing the image area corresponding to the enlarged first target frame to obtain a normalized image;
    S103具体为:采用预设的目标分割模型对归一化后的图像进行处理,获得目标分割后的掩膜图和位于所述目标分割后的掩膜图中的第二目标框;S103 is specifically: using a preset target segmentation model to process the normalized image to obtain a mask image after the target segmentation and a second target frame located in the mask image after the target segmentation;
    S103和S104之间,所述方法还包括:对目标分割后的掩膜图进行上采样操作。Between S103 and S104, the method further includes: performing an upsampling operation on the mask image after the target segmentation.
  7. 一种目标分割装置,其特征在于,所述装置包括:A target segmentation device, characterized in that the device comprises:
    第一目标框获取模块,用于采用预设的目标获取模型获得目标图像中的第一目标框,所述目标图像具有待分割的目标;a first target frame acquisition module, configured to use a preset target acquisition model to obtain a first target frame in a target image, where the target image has a target to be segmented;
    第一扩大模块,用于按照第一预设比例扩大第一目标框,得到扩大后的第一目标框;a first enlargement module, configured to enlarge the first target frame according to a first preset ratio to obtain an enlarged first target frame;
    目标分割模块,用于采用预设的目标分割模型对扩大后的第一目标框对应的图像区域进行处理,获得目标分割后的掩膜图和位于所述目标分割后的掩膜图中的第二目标框;The target segmentation module is used to process the image area corresponding to the enlarged first target frame by using the preset target segmentation model, and obtain the mask image after target segmentation and the first target image in the mask image after the target segmentation. Two target boxes;
    第二扩大模块,用于将目标分割后的掩膜图映射到目标图像的尺寸,获得目标图像的第一掩膜图;并按照第二预设比例扩大第二目标框,得到扩大后的第二目标框,将扩大后的第二目标框对应的图像与第一掩膜图进行融合获得目标图像的第二掩膜图。The second expansion module is used to map the mask image after the target segmentation to the size of the target image to obtain the first mask image of the target image; and expand the second target frame according to the second preset ratio to obtain the enlarged first mask image. The second target frame is obtained by fusing the image corresponding to the enlarged second target frame with the first mask image to obtain the second mask image of the target image.
  8. 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现如权利要求1至6任一项所述的目标分割方法的步骤。A computer-readable storage medium storing a computer program, characterized in that, when the computer program is executed by a processor, the target segmentation method according to any one of claims 1 to 6 is implemented. step.
  9. 一种计算机设备,包括:A computer device comprising:
    一个或多个处理器;one or more processors;
    存储器;以及memory; and
    一个或多个计算机程序,所述处理器和所述存储器通过总线连接,其中所述一个或多个计算机程序被存储在所述存储器中,并且被配置成由所述一个或多个处理器执行,其特征在于,所述处理器执行所述计算机程序时实现如权利要求1至6任一项所述的目标分割方法的步骤。one or more computer programs, the processor and the memory connected by a bus, wherein the one or more computer programs are stored in the memory and configured to be executed by the one or more processors , characterized in that, when the processor executes the computer program, the steps of the target segmentation method according to any one of claims 1 to 6 are implemented.
  10. 一种相机,包括:A camera comprising:
    一个或多个处理器;one or more processors;
    存储器;以及memory; and
    一个或多个计算机程序,所述处理器和所述存储器通过总线连接,其中所述一个或多个计算机程序被存储在所述存储器中,并且被配置成由所述一个或多个处理器执行,其特征在于,所述处理器执行所述计算机程序时实现如权利要求1至6任一项所述的目标分割方法的步骤。one or more computer programs, the processor and the memory connected by a bus, wherein the one or more computer programs are stored in the memory and configured to be executed by the one or more processors , characterized in that, when the processor executes the computer program, the steps of the target segmentation method according to any one of claims 1 to 6 are implemented.
PCT/CN2021/112044 2020-08-11 2021-08-11 Target segmentation method and apparatus, and computer-readable storage medium and computer device WO2022033513A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010801662.4A CN111968134B (en) 2020-08-11 2020-08-11 Target segmentation method, device, computer readable storage medium and computer equipment
CN202010801662.4 2020-08-11

Publications (1)

Publication Number Publication Date
WO2022033513A1 true WO2022033513A1 (en) 2022-02-17

Family

ID=73365963

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/112044 WO2022033513A1 (en) 2020-08-11 2021-08-11 Target segmentation method and apparatus, and computer-readable storage medium and computer device

Country Status (2)

Country Link
CN (1) CN111968134B (en)
WO (1) WO2022033513A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116452600A (en) * 2023-06-15 2023-07-18 上海蜜度信息技术有限公司 Instance segmentation method, system, model training method, medium and electronic equipment
CN117237397A (en) * 2023-07-13 2023-12-15 天翼爱音乐文化科技有限公司 Portrait segmentation method, system, equipment and storage medium based on feature fusion

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111968134B (en) * 2020-08-11 2023-11-28 影石创新科技股份有限公司 Target segmentation method, device, computer readable storage medium and computer equipment
CN113689460B (en) * 2021-09-02 2023-12-15 广州市奥威亚电子科技有限公司 Video target object tracking detection method, device, equipment and storage medium
CN116797500A (en) * 2022-03-14 2023-09-22 腾讯科技(深圳)有限公司 Image processing method, device, storage medium, electronic equipment and product

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190286940A1 (en) * 2018-03-13 2019-09-19 Fujitsu Limited Method and apparatus for training image model, and method and apparatus for category prediction
CN110650291A (en) * 2019-10-23 2020-01-03 Oppo广东移动通信有限公司 Target focus tracking method and device, electronic equipment and computer readable storage medium
CN111179287A (en) * 2020-01-03 2020-05-19 厦门美图之家科技有限公司 Portrait instance segmentation method, device, equipment and storage medium
CN111968134A (en) * 2020-08-11 2020-11-20 影石创新科技股份有限公司 Object segmentation method and device, computer readable storage medium and computer equipment

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5488526B2 (en) * 2011-04-28 2014-05-14 フリュー株式会社 Image processing apparatus and image processing method
CN111489320A (en) * 2019-01-29 2020-08-04 华为技术有限公司 Image processing method and device
CN111179282B (en) * 2019-12-27 2024-04-23 Oppo广东移动通信有限公司 Image processing method, image processing device, storage medium and electronic apparatus
CN111311528B (en) * 2020-01-22 2023-07-28 广州虎牙科技有限公司 Image fusion optimization method, device, equipment and medium
CN111429421B (en) * 2020-03-19 2021-08-27 推想医疗科技股份有限公司 Model generation method, medical image segmentation method, device, equipment and medium
CN111507994B (en) * 2020-04-24 2023-10-03 Oppo广东移动通信有限公司 Portrait extraction method, portrait extraction device and mobile terminal

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190286940A1 (en) * 2018-03-13 2019-09-19 Fujitsu Limited Method and apparatus for training image model, and method and apparatus for category prediction
CN110650291A (en) * 2019-10-23 2020-01-03 Oppo广东移动通信有限公司 Target focus tracking method and device, electronic equipment and computer readable storage medium
CN111179287A (en) * 2020-01-03 2020-05-19 厦门美图之家科技有限公司 Portrait instance segmentation method, device, equipment and storage medium
CN111968134A (en) * 2020-08-11 2020-11-20 影石创新科技股份有限公司 Object segmentation method and device, computer readable storage medium and computer equipment

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116452600A (en) * 2023-06-15 2023-07-18 上海蜜度信息技术有限公司 Instance segmentation method, system, model training method, medium and electronic equipment
CN116452600B (en) * 2023-06-15 2023-10-03 上海蜜度信息技术有限公司 Instance segmentation method, system, model training method, medium and electronic equipment
CN117237397A (en) * 2023-07-13 2023-12-15 天翼爱音乐文化科技有限公司 Portrait segmentation method, system, equipment and storage medium based on feature fusion

Also Published As

Publication number Publication date
CN111968134A (en) 2020-11-20
CN111968134B (en) 2023-11-28

Similar Documents

Publication Publication Date Title
WO2022033513A1 (en) Target segmentation method and apparatus, and computer-readable storage medium and computer device
US11915514B2 (en) Method and apparatus for detecting facial key points, computer device, and storage medium
CN110807448B (en) Human face key point data enhancement method
CN106778928B (en) Image processing method and device
CN110135406B (en) Image recognition method and device, computer equipment and storage medium
WO2018086607A1 (en) Target tracking method, electronic device, and storage medium
WO2022078041A1 (en) Occlusion detection model training method and facial image beautification method
WO2020052476A1 (en) Feature point locating method, storage medium and computer device
WO2021068325A1 (en) Facial action recognition model training method, facial action recognition method and apparatus, computer device, and storage medium
US10121055B1 (en) Method and system for facial landmark localization
CN110287836B (en) Image classification method and device, computer equipment and storage medium
WO2022179401A1 (en) Image processing method and apparatus, computer device, storage medium, and program product
WO2021051543A1 (en) Method for generating face rotation model, apparatus, computer device and storage medium
CN113469092B (en) Character recognition model generation method, device, computer equipment and storage medium
CN113435594B (en) Security detection model training method, device, equipment and storage medium
CN111860582B (en) Image classification model construction method and device, computer equipment and storage medium
CN115631112B (en) Building contour correction method and device based on deep learning
Kim et al. Robust facial landmark extraction scheme using multiple convolutional neural networks
WO2022116161A1 (en) Portrait cartooning method, robot, and storage medium
US9349038B2 (en) Method and apparatus for estimating position of head, computer readable storage medium thereof
WO2024011859A1 (en) Neural network-based face detection method and device
CN112613457B (en) Image acquisition mode detection method, device, computer equipment and storage medium
CN114841851A (en) Image generation method, image generation device, electronic equipment and storage medium
Agarwal et al. Unmasking the potential: evaluating image inpainting techniques for masked face reconstruction
CN112200774A (en) Image recognition apparatus

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21855564

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 14.06.2023)

122 Ep: pct application non-entry in european phase

Ref document number: 21855564

Country of ref document: EP

Kind code of ref document: A1