WO2022105622A1 - Image segmentation method and apparatus, readable medium, and electronic device - Google Patents

Image segmentation method and apparatus, readable medium, and electronic device Download PDF

Info

Publication number
WO2022105622A1
WO2022105622A1 PCT/CN2021/128958 CN2021128958W WO2022105622A1 WO 2022105622 A1 WO2022105622 A1 WO 2022105622A1 CN 2021128958 W CN2021128958 W CN 2021128958W WO 2022105622 A1 WO2022105622 A1 WO 2022105622A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
center point
feature information
segmented
pixel
Prior art date
Application number
PCT/CN2021/128958
Other languages
French (fr)
Chinese (zh)
Inventor
喻冬东
王长虎
Original Assignee
北京有竹居网络技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京有竹居网络技术有限公司 filed Critical 北京有竹居网络技术有限公司
Publication of WO2022105622A1 publication Critical patent/WO2022105622A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • G06V10/267Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion by performing operations on regions, e.g. growing, shrinking or watersheds

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)

Abstract

An image segmentation method and apparatus, a readable medium, and an electronic device. The method comprises: acquiring predicted object center point feature information of each of multiple pixels in an image to be segmented, wherein the predicted object center point feature information indicating a level of reliability that the pixel is the center point of an object; determining center point position information of an object in the image according to the predicted object center point feature information of each pixel; and performing image segmentation on the image according to the center point position information. Since the method takes into consideration central point position information of an object in an image to be segmented, a region where the object is located is emphasized and can be distinctly distinguished from a background region, such that the object in the foreground can be accurately separated when the image is subjected to image segmentation, thereby effectively reducing interference of the background region, and improving the accuracy of image segmentation and segmentation performance.

Description

图像分割方法、装置、可读介质及电子设备Image segmentation method, device, readable medium and electronic device
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本申请是以CN申请号为202011295274.X,申请日为2020年11月18日的申请为基础,并主张其优先权,该CN申请的公开内容在此作为整体引入本申请中。This application is based on the CN application number 202011295274.X and the filing date is Nov. 18, 2020, and claims its priority, and the disclosure content of this CN application is hereby incorporated into this application as a whole.
技术领域technical field
本公开涉及图像处理技术领域,具体地,涉及一种图像分割方法、装置、可读介质及电子设备。The present disclosure relates to the technical field of image processing, and in particular, to an image segmentation method, apparatus, readable medium, and electronic device.
背景技术Background technique
图像分割在图像处理技术领域具有重要应用。图像分割指的是把图像分割成若干个具有相似性质的区域,即将图像划分成若干互不相交的区域的过程。Image segmentation has important applications in the field of image processing technology. Image segmentation refers to the process of dividing an image into several regions with similar properties, that is, dividing the image into several disjoint regions.
图像中通常会存在较为显著的前景物体,图像分割可以将前景物体所在的区域与背景区域分割出来。There are usually more prominent foreground objects in the image, and image segmentation can segment the area where the foreground object is located from the background area.
发明内容SUMMARY OF THE INVENTION
提供该发明内容部分以便以简要的形式介绍构思,这些构思将在后面的具体实施方式部分被详细描述。该发明内容部分并不旨在标识要求保护的技术方案的关键特征或必要特征,也不旨在用于限制所要求的保护的技术方案的范围。This Summary is provided to introduce concepts in a simplified form that are described in detail in the Detailed Description section that follows. This summary section is not intended to identify key features or essential features of the claimed technical solution, nor is it intended to be used to limit the scope of the claimed technical solution.
第一方面,本公开提供一种图像分割方法,所述方法包括:获取待分割图像中多个像素点的各像素点的预测物体中心点特征信息,所述预测物体中心点特征信息用于表征所述像素点为物体中心点的可信度;根据各像素点的所述预测物体中心点特征信息,确定所述待分割图像中物体的中心点位置信息;根据所述中心点位置信息,对所述待分割图像进行图像分割。In a first aspect, the present disclosure provides an image segmentation method, the method comprising: acquiring predicted object center point feature information of each pixel of a plurality of pixels in an image to be segmented, where the predicted object center point feature information is used to represent The pixel point is the reliability of the center point of the object; according to the feature information of the predicted object center point of each pixel point, determine the center point position information of the object in the image to be segmented; The to-be-segmented image is subjected to image segmentation.
第二方面,本公开提供一种图像分割装置,所述装置包括:获取模块,用于获取待分割图像中多个像素点的各像素点的预测物体中心点特征信息,所述预测物体中心点特征信息用于表征所述像素点为物体中心点的可信度;确定模块,用于根据各像素点的所述预测物体中心点特征信息,确定所述待分割图像中物体的中心点位置信息;图像分割模块,用于根据所述中心点位置信息,对所述待分割图像进行图像分割。In a second aspect, the present disclosure provides an image segmentation device, the device comprising: an acquisition module for acquiring predicted object center point feature information of each pixel of a plurality of pixels in an image to be segmented, the predicted object center point The feature information is used to characterize the reliability that the pixel point is the center point of the object; the determination module is used to determine the center point position information of the object in the image to be segmented according to the feature information of the predicted object center point of each pixel point ; an image segmentation module, configured to perform image segmentation on the to-be-segmented image according to the center point position information.
第三方面,本公开提供一种非瞬时性计算机可读介质,其上存储有计算机程序,该程序被处理装置执行时实现本公开第一方面提供的所述方法的步骤。In a third aspect, the present disclosure provides a non-transitory computer-readable medium on which a computer program is stored, and when the program is executed by a processing apparatus, implements the steps of the method provided in the first aspect of the present disclosure.
第四方面,本公开提供一种电子设备,包括:存储装置,其上存储有计算机程序;处理装置,用于执行所述存储装置中的所述计算机程序,以实现本公开第一方面提供的所述方法的步骤。In a fourth aspect, the present disclosure provides an electronic device, including: a storage device on which a computer program is stored; and a processing device for executing the computer program in the storage device, so as to implement the computer program provided in the first aspect of the present disclosure. the steps of the method.
第五方面,本公开提供了一种计算机程序,包括:指令,所述指令当由处理器执行时使所述处理器执行如第一方面所述的图像分割方法。In a fifth aspect, the present disclosure provides a computer program comprising: instructions that, when executed by a processor, cause the processor to perform the image segmentation method of the first aspect.
第六方面,本公开提供了一种计算机程序产品,包括指令,所述指令当由处理器执行时使所述处理器执行如第一方面所述的图像分割方法。In a sixth aspect, the present disclosure provides a computer program product comprising instructions that, when executed by a processor, cause the processor to perform the image segmentation method of the first aspect.
本公开的其他特征和优点将在随后的具体实施方式部分予以详细说明。Other features and advantages of the present disclosure will be described in detail in the detailed description that follows.
附图说明Description of drawings
结合附图并参考以下具体实施方式,本公开各实施例的上述和其他特征、优点及方面将变得更加明显。贯穿附图中,相同或相似的附图标记表示相同或相似的元素。应当理解附图是示意性的,原件和元素不一定按照比例绘制。在附图中:The above and other features, advantages and aspects of various embodiments of the present disclosure will become more apparent when taken in conjunction with the accompanying drawings and with reference to the following detailed description. Throughout the drawings, the same or similar reference numbers refer to the same or similar elements. It should be understood that the drawings are schematic and that the originals and elements are not necessarily drawn to scale. In the attached image:
图1是根据一示例性实施例示出的一种图像分割方法的流程图。Fig. 1 is a flowchart of an image segmentation method according to an exemplary embodiment.
图2是根据一示例性实施例示出的一种确定待分割图像中物体的中心点位置信息的方法的流程图。Fig. 2 is a flow chart of a method for determining center point position information of an object in an image to be segmented according to an exemplary embodiment.
图3是根据另一示例性实施例示出的一种图像分割方法的流程图。Fig. 3 is a flowchart of an image segmentation method according to another exemplary embodiment.
图4是根据一示例性实施例示出的一种图像分割装置的框图。Fig. 4 is a block diagram of an image segmentation apparatus according to an exemplary embodiment.
图5是根据一示例性实施例示出的一种电子设备的结构示意图。Fig. 5 is a schematic structural diagram of an electronic device according to an exemplary embodiment.
具体实施方式Detailed ways
下面将参照附图更详细地描述本公开的实施例。虽然附图中显示了本公开的某些实施例,然而应当理解的是,本公开可以通过各种形式来实现,而且不应该被解释为限于这里阐述的实施例,相反提供这些实施例是为了更加透彻和完整地理解本公开。应当理解的是,本公开的附图及实施例仅用于示例性作用,并非用于限制本公开的保护范围。Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While certain embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein, but rather are provided for the purpose of A more thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the present disclosure are only for exemplary purposes, and are not intended to limit the protection scope of the present disclosure.
应当理解,本公开的方法实施方式中记载的各个步骤可以按照不同的顺序执行,和/或并行执行。此外,方法实施方式可以包括附加的步骤和/或省略执行示出的步骤。本公开的范围在此方面不受限制。It should be understood that the various steps described in the method embodiments of the present disclosure may be performed in different orders and/or in parallel. Furthermore, method embodiments may include additional steps and/or omit performing the illustrated steps. The scope of the present disclosure is not limited in this regard.
本文使用的术语“包括”及其变形是开放性包括,即“包括但不限于”。术语“基于”是“至少部分地基于”。术语“一个实施例”表示“至少一个实施例”;术语“另一实施例”表示“至少一个另外的实施例”;术语“一些实施例”表示“至少一些实施例”。其他术语的相关定义将在下文描述中给出。As used herein, the term "including" and variations thereof are open-ended inclusions, ie, "including but not limited to". The term "based on" is "based at least in part on." The term "one embodiment" means "at least one embodiment"; the term "another embodiment" means "at least one additional embodiment"; the term "some embodiments" means "at least some embodiments". Relevant definitions of other terms will be given in the description below.
需要注意,本公开中提及的“第一”、“第二”等概念仅用于对不同的装置、模块或单元进行区分,并非用于限定这些装置、模块或单元所执行的功能的顺序或者相互依存关系。It should be noted that concepts such as "first" and "second" mentioned in the present disclosure are only used to distinguish different devices, modules or units, and are not used to limit the order of functions performed by these devices, modules or units or interdependence.
需要注意,本公开中提及的“一个”、“多个”的修饰是示意性而非限制性的,本领域技术人员应当理解,除非在上下文另有明确指出,否则应该理解为“一个或多个”。It should be noted that the modifications of "a" and "a plurality" mentioned in the present disclosure are illustrative rather than restrictive, and those skilled in the art should understand that unless the context clearly indicates otherwise, they should be understood as "one or a plurality of". multiple".
本公开实施方式中的多个装置之间所交互的消息或者信息的名称仅用于说明性的目的,而并不是用于对这些消息或信息的范围进行限制。The names of messages or information exchanged between multiple devices in the embodiments of the present disclosure are only for illustrative purposes, and are not intended to limit the scope of these messages or information.
本公开的发明人发现,在相关技术中在对图像进行分割时,容易受到图像中背景区域的干扰,使得对图像中物体的分割效果不好,并且当存在物体被局部遮挡等情况时,也会使得图像分割结果不够准确。The inventor of the present disclosure found that in the related art, when an image is segmented, it is easily interfered by the background area in the image, so that the segmentation effect of the objects in the image is not good, and when there are situations such as objects being partially occluded, the It will make the image segmentation result inaccurate.
鉴于此,本公开的实施例提供一种图像分割方法,以减小图像中背景区域的干扰。In view of this, embodiments of the present disclosure provide an image segmentation method to reduce interference of background regions in an image.
图1是根据一示例性实施例示出的一种图像分割方法的流程图,该方法可应用于具有处理能力的电子设备中,如终端或服务器,如图1所示,该方法可包括步骤S101~S103。Fig. 1 is a flow chart of an image segmentation method according to an exemplary embodiment. The method can be applied to an electronic device with processing capability, such as a terminal or a server. As shown in Fig. 1 , the method may include step S101 ~ S103.
在步骤S101中,获取待分割图像中多个像素点的各像素点的预测物体中心点特征信息。In step S101, the feature information of the predicted object center point of each pixel point of a plurality of pixel points in the image to be divided is acquired.
待分割图像可以是预先存储的图像,或者实时采集的图像,也可以是视频中的图像帧,本公开不做具体限制。待分割图像中可能存在一个或多个较为显著的前景物体,物体可包括人体和物品等,例如,待分割图像中存在一辆或多辆车,或存在一个或多个人,对待分割图像进行图像分割,可以划分图像中前景物体所在区域和图像中的背景区域。The image to be segmented may be a pre-stored image, an image collected in real time, or an image frame in a video, which is not specifically limited in the present disclosure. There may be one or more prominent foreground objects in the image to be segmented. Objects may include human bodies and objects. For example, there are one or more vehicles or one or more people in the image to be segmented. Segmentation can divide the area where the foreground object in the image is located and the background area in the image.
其中,像素点的预测物体中心点特征信息可用于表征该像素点为物体中心点的可信度,即该像素点为物体中心点的可能性或概率。示例地,电子设备中可集成有物体中心点预测模块,该物体中心点预测模块可生成待分割图像对应的热图(heatmap),该热图与待分割图像的分辨率可以是相同的,热图中每个像素点的信息可用于表征待分割图像中对应像素点的预测物体中心点特征信息。其中,若像素点为物体中心点的可能性较高,则该像素点的预测物体中心点特征信息相对较高;若像素点为物体中心点的可能性较低,则该像素点的预测物体中心点特征信息相对较低。对于该预测物体中心点特征信息的具体表现形式,本公开不做限制,例如可通过0-1之间的数值进行表示。The feature information of the predicted object center point of the pixel point can be used to represent the reliability of the pixel point as the object center point, that is, the possibility or probability that the pixel point is the object center point. For example, an object center point prediction module may be integrated into the electronic device, and the object center point prediction module may generate a heatmap corresponding to the image to be segmented. The information of each pixel in the figure can be used to represent the feature information of the predicted object center point of the corresponding pixel in the image to be segmented. Among them, if the possibility of the pixel point being the center point of the object is high, the feature information of the predicted object center point of the pixel point is relatively high; The center point feature information is relatively low. The present disclosure does not limit the specific representation of the feature information of the predicted object center point, for example, it can be represented by a value between 0 and 1.
在步骤S102中,根据各像素点的预测物体中心点特征信息,确定待分割图像中物体的中心点位置信息。In step S102, according to the feature information of the predicted object center point of each pixel point, the center point position information of the object in the image to be segmented is determined.
其中,物体的中心点位置信息可以指的是物体中心点在待分割图像中的坐标信息。The position information of the center point of the object may refer to the coordinate information of the center point of the object in the image to be segmented.
值得说明的是,本公开中在对待分割图像进行图像分割时,无需预先指定特定的物体,也并非仅确定特定物体的中心点位置信息,在待分割图像中存在多个物体的情况下,根据各像素点的预测物体中心点特征信息,可同时确定出该多个物体各自的中心点位置信息,从而在进行图像分割时,可对该多个物体所在的区域均进行准确分割。It is worth noting that in the present disclosure, when performing image segmentation on the image to be segmented, there is no need to pre-designate a specific object, nor is it only necessary to determine the center point position information of a specific object. The feature information of the predicted object center point of each pixel point can simultaneously determine the respective center point position information of the multiple objects, so that when performing image segmentation, the regions where the multiple objects are located can be accurately segmented.
在步骤S103中,根据中心点位置信息,对待分割图像进行图像分割。In step S103, image segmentation is performed on the image to be segmented according to the center point position information.
考虑物体的中心点位置信息,可以使得物体所在的区域在待分割图像中更加突出,与背景区域之间的区别更加显著,从而在对待分割图像进行图像分割时,更容易把前景物体分割出来,有效减小背景区域的干扰。并且,即使物体被局部遮挡,而其中心点位置信息不会受到物体表面特征(例如颜色、尺寸等)的影响,另外,即使图像中有其他相似物体出现,该物体的中心点位置信息也不会受到其他相似物体的影响,因此,根据物体的中心点位置信息进行图像分割,可以把物体所在区域准确分割出来,提升图像分割的准确度和分割效果。Considering the position information of the center point of the object, the area where the object is located can be more prominent in the image to be segmented, and the difference between it and the background area is more significant, so it is easier to segment the foreground object when the image to be segmented is segmented. Effectively reduce the interference in the background area. Moreover, even if the object is partially occluded, the center point position information of the object will not be affected by the surface features of the object (such as color, size, etc.). In addition, even if other similar objects appear in the image, the center point position information of the object is not It will be affected by other similar objects. Therefore, image segmentation based on the center point position information of the object can accurately segment the area where the object is located, and improve the accuracy and segmentation effect of image segmentation.
通过上述技术方案,首先获取待分割图像中各像素点的预测物体中心点特征信息,根据各像素点的预测物体中心点特征信息,可确定待分割图像中物体的中心点位置信息,进而根据物体的中心点位置信息,对待分割图像进行图像分割。这样,考虑物体的中心点位置信息,可以使得物体所在的区域在待分割图像中更加突出,与背景区域之间的区别更加显著,从而在对待分割图像进行图像分割时,能够更准确地把前景物体分割出来,有效减小背景区域的干扰。并且,即使存在物体被局部遮挡或图像中有其他相似物体出现等情况,该物体的中心点位置信息并不会受到影响,因此,根据物体的中心点位置信息进行图像分割,可以把物体所在区域准确分割出来,提升图像分割的准确度和分割效果。Through the above technical solution, first obtain the feature information of the predicted object center point of each pixel in the image to be segmented, and according to the predicted object center point feature information of each pixel point, the center point position information of the object in the image to be segmented can be determined, and then according to the object center point feature information The position information of the center point is used to segment the image to be segmented. In this way, considering the position information of the center point of the object, the area where the object is located can be made more prominent in the image to be segmented, and the difference between it and the background area is more significant, so that the foreground can be more accurately segmented when the image to be segmented is segmented. The object is segmented to effectively reduce the interference of the background area. Moreover, even if the object is partially occluded or other similar objects appear in the image, the center point position information of the object will not be affected. Accurate segmentation can improve the accuracy and segmentation effect of image segmentation.
在一可选实施方式中,S102中根据各像素点的预测物体中心点特征信息,确定待分割图像中物体的中心点位置信息,可包括:将所述多个像素点的预测物体中心点特征信息中局部极大值对应的像素点的位置信息,确定为物体的中心点位置信息。其中,不论待分割图像是单独的图像,还是视频中的图像帧,均可采用该实施方式确定物体的中心点位置信息。In an optional embodiment, in S102, according to the predicted object center point feature information of each pixel point, determining the center point position information of the object in the image to be segmented may include: The position information of the pixel corresponding to the local maximum value in the information is determined as the position information of the center point of the object. Wherein, regardless of whether the image to be segmented is an individual image or an image frame in a video, this embodiment can be used to determine the center point position information of the object.
其中,局部极大值对应的像素点,可以指的是物体所在区域的像素点中预测物体中心点特征信息相对最大的像素点。如果待分割图像中存在多个物体,每一物体所在区域的像 素点中,均存在预测物体中心点特征信息相对较大的点,因此可确定出多个局部极大值点,不同的局部极大值点对应不同物体的中心点。如果待分割图像中存在一个物体,可确定出一个局部极大值点,该局部极大值点也即是预测物体中心点特征信息最大的像素点,可将该像素点的位置信息确定为待分割图像中物体的中心点位置信息。Among them, the pixel point corresponding to the local maximum value may refer to the pixel point with the largest predicted feature information of the center point of the object among the pixel points in the area where the object is located. If there are multiple objects in the to-be-segmented image, in the pixels of each object's area, there are points with relatively large feature information of the predicted object center point, so multiple local maximum points can be determined, and different local maximum points can be determined. Large value points correspond to the center points of different objects. If there is an object in the image to be segmented, a local maximum point can be determined, and the local maximum point is also the pixel with the largest feature information of the predicted object center point, and the position information of the pixel point can be determined as the point to be The position information of the center point of the object in the segmented image.
如此,像素点的预测物体中心点特征信息可用于表征该像素点为物体中心点的可能性,将多个像素点的预测物体中心点特征信息中局部极大值对应的像素点确定为物体的中心点,在待分割图像中存在多个物体的情况下,能够准确确定该多个物体各自的中心点位置信息。In this way, the feature information of the predicted object center point of a pixel point can be used to represent the possibility that the pixel point is the object center point, and the pixel point corresponding to the local maximum value in the predicted object center point feature information of multiple pixel points is determined as the object's center point. For the center point, when there are multiple objects in the image to be segmented, the position information of the respective center points of the multiple objects can be accurately determined.
在另一可选实施方式中,在待分割图像为视频中的图像帧的情况下,可以结合视频中的参考图像帧确定待分割图像中物体的中心点位置信息。图2是根据该实施方式示出的一种确定待分割图像中物体的中心点位置信息的方法的流程图,如图2所示,上述步骤S102可包括步骤S1021~S1023。In another optional implementation manner, when the image to be segmented is an image frame in a video, the position information of the center point of the object in the image to be segmented may be determined in combination with a reference image frame in the video. FIG. 2 is a flow chart of a method for determining center point position information of an object in an image to be segmented according to this embodiment. As shown in FIG. 2 , the above-mentioned step S102 may include steps S1021 to S1023 .
在步骤S1021中,根据参考图像帧中物体的中心点位置信息以及该物体的运动轨迹信息,确定待分割图像中该物体的预测中心点。In step S1021, the predicted center point of the object in the image to be segmented is determined according to the center point position information of the object in the reference image frame and the motion track information of the object.
其中,参考图像帧可以为视频中与待分割图像不同的图像帧。示例地,该参考图像帧可以是视频中的首帧图像帧,也可是视频中待分割图像的上一帧,本公开不做具体限制。The reference image frame may be an image frame in the video that is different from the image to be segmented. For example, the reference image frame may be the first image frame in the video, or may be the previous frame of the image to be divided in the video, which is not specifically limited in the present disclosure.
视频中的图像帧具有一定的连续性,物体的运动轨迹信息可包括物体的运动方向信息、移动速度、移动加速度等等,根据物体的运动轨迹信息,可确定在视频中从参考图像帧的时刻到待分割图像的时刻,物体移动的距离和方向。其中,参考图像帧中物体的中心点位置信息可以是预先确定出的,根据参考图像帧中物体的中心点位置信息以及物体的运动轨迹信息,可确定待分割图像中物体的预测中心点,该预测中心点即根据物体的运动轨迹初步确定出的可能的物体的中心点位置。The image frames in the video have a certain continuity, and the motion track information of the object can include the object's motion direction information, moving speed, moving acceleration, etc. According to the object's motion track information, the moment in the video from the reference image frame can be determined. The distance and direction of movement of the object at the moment when the image is to be segmented. Wherein, the center point position information of the object in the reference image frame may be predetermined, and the predicted center point of the object in the to-be-segmented image can be determined according to the center point position information of the object in the reference image frame and the motion track information of the object. The predicted center point is the possible center point position of the object that is initially determined according to the motion trajectory of the object.
在步骤S1022中,确定待分割图像中物体所在的区域中预测物体中心点特征信息最大的预设数量个像素点。In step S1022, a preset number of pixels with the largest feature information of the predicted object center point in the region where the object is located in the image to be segmented is determined.
待分割图像中该物体所在的区域存在多个像素点,可确定该多个像素点中预测物体中心点特征信息最大的预设数量个像素点,即预测物体中心点特征信息排名在前K位的像素点,该预设数量个像素点为物体中心点的可能性较高。其中K可表示预设数量,K大于或等于1,对其取值本公开不做具体限制。There are multiple pixels in the area where the object is located in the image to be segmented, and the preset number of pixels with the largest predicted object center point feature information among the multiple pixel points can be determined, that is, the predicted object center point feature information is ranked in the top K positions , the preset number of pixels is more likely to be the center point of the object. Wherein K may represent a preset number, and K is greater than or equal to 1, and its value is not specifically limited in the present disclosure.
在S1023中,将预设数量个像素点中与预测中心点之间的距离最近的像素点的位置信息,确定为中心点位置信息。In S1023, the position information of the pixel point with the closest distance to the prediction center point among the preset number of pixel points is determined as the center point position information.
示例地,可针对预设数量个像素点中每一像素点,计算该像素点与预测中心点之间的距离,与预测中心点之间的距离最近的像素点,不仅为物体中心点的可能性较高,且与根据物体运动轨迹确定出的预测中心点之间的距离最近,因此可将该像素点的位置信息作为待分割图像中物体的中心点位置信息。For example, for each pixel in a preset number of pixels, the distance between the pixel and the prediction center point can be calculated, and the pixel point with the closest distance to the prediction center point is not only the possibility of the object center point. The distance between the prediction center point determined according to the motion trajectory of the object is relatively high, so the position information of the pixel point can be used as the center point position information of the object in the image to be segmented.
值得说明的是,对于S1021和S1022的执行顺序,本公开不做具体限制,可以先执行S1022再执行S1021,或者二者可并行执行,图2仅为示例。It is worth noting that the present disclosure does not specifically limit the execution order of S1021 and S1022, S1022 may be executed first and then S1021 may be executed, or both may be executed in parallel, and FIG. 2 is only an example.
通过上述技术方案,在待分割图像为视频中的图像帧的情况下,可结合视频中的参考图像帧以及物体的运动轨迹信息,确定待分割图像中物体的中心点位置信息,结合考虑物体的运动轨迹信息,可以使得确定出的物体的中心点位置信息更为准确。Through the above technical solution, in the case where the image to be segmented is an image frame in a video, the reference image frame in the video and the motion trajectory information of the object can be combined to determine the center point information of the object in the image to be segmented. The motion track information can make the determined position information of the center point of the object more accurate.
图3是根据另一示例性实施例示出的一种图像分割方法的流程图,如图3所示,该方法可包括步骤S301~S304,其中上述步骤S103可包括S303和S304。Fig. 3 is a flowchart of an image segmentation method according to another exemplary embodiment. As shown in Fig. 3 , the method may include steps S301 to S304, wherein the above step S103 may include S303 and S304.
在步骤S301(101)中,获取待分割图像中多个像素点的各像素点的预测物体中心点特征信息。该步骤S301的实施方式可参照步骤S101。In step S301 (101), the feature information of the predicted object center point of each pixel point of a plurality of pixel points in the image to be divided is acquired. For the implementation of this step S301, reference may be made to step S101.
在步骤S302(102)中,根据各像素点的预测物体中心点特征信息,确定待分割图像中物体的中心点位置信息。该步骤中可采用本公开任一实施方式提供的确定物体的中心点位置信息的方式。In step S302 (102), according to the feature information of the predicted object center point of each pixel point, the center point position information of the object in the image to be segmented is determined. In this step, the method of determining the position information of the center point of the object provided by any embodiment of the present disclosure may be used.
在步骤S303中,根据中心点位置信息和预测物体中心点特征信息,确定待分割图像中多个像素点的各像素点的目标物体中心点特征信息。In step S303, according to the center point position information and the predicted object center point feature information, the target object center point feature information of each pixel point of the plurality of pixels in the image to be segmented is determined.
其中,中心点位置信息对应的像素点的目标物体中心点特征信息大于该像素点的预测物体中心点特征信息,中心点位置信息对应的像素点即物体的中心点,也即是进一步提升物体中心点的特征信息,例如可采用高斯算法增大物体中心点的特征信息。Among them, the center point feature information of the target object of the pixel corresponding to the center point position information is greater than the predicted object center point feature information of the pixel point, and the pixel point corresponding to the center point position information is the center point of the object, that is, to further improve the center point of the object The feature information of the point, for example, the Gaussian algorithm can be used to increase the feature information of the center point of the object.
在步骤S304中,根据目标物体中心点特征信息,对待分割图像进行图像分割。In step S304, image segmentation is performed on the image to be segmented according to the feature information of the center point of the target object.
由于中心点位置信息对应的像素点的目标物体中心点特征信息大于该像素点的预测物体中心点特征信息,使得物体中心点的特征信息更高,因此根据目标物体中心点特征信息进行图像分割,可以更准确地把前景物体分割出来,进一步提升图像分割的准确性和分割效果。Since the feature information of the target object center point of the pixel corresponding to the center point position information is greater than the predicted object center point feature information of the pixel, the feature information of the object center point is higher, so the image segmentation is performed according to the target object center point feature information, The foreground objects can be more accurately segmented, and the accuracy and segmentation effect of image segmentation can be further improved.
该步骤S304可进一步包括:获取待分割图像中多个像素点的各像素点的预设特征信息;根据待分割图像中多个像素点的各像素点的目标物体中心点特征信息和预设特征信息,对待分割图像进行图像分割。This step S304 may further include: acquiring preset feature information of each pixel of the multiple pixels in the image to be segmented; information to perform image segmentation on the image to be segmented.
预设特征信息可包括图像语义特征信息和图像边缘特征信息中的至少一者。其中,可 对待分割图像中每个像素点进行分类,确定每个像素点所属的语义标签,属于同一语义标签的像素点的图像语义特征信息可以相同。图像的边缘可以指的是图像中亮度变化或灰度变化最显著的部分,位于边缘部分的像素点的图像边缘特征信息相对较大。The preset feature information may include at least one of image semantic feature information and image edge feature information. Among them, each pixel in the image to be segmented can be classified to determine the semantic label to which each pixel belongs, and the image semantic feature information of pixels belonging to the same semantic label can be the same. The edge of an image may refer to the part of the image with the most significant change in brightness or gray level, and the image edge feature information of pixels located in the edge part is relatively large.
可选地,根据待分割图像中多个像素点的各像素点的目标物体中心点特征信息和预设特征信息,对待分割图像进行图像分割,可包括:根据待分割图像中多个像素点的各像素点的目标物体中心点特征信息和预设特征信息,确定该像素点的目标特征信息;根据所述多个像素点的各像素点的目标特征信息,对待分割图像进行图像分割。Optionally, performing image segmentation on the to-be-segmented image according to the target object center point feature information and preset feature information of each pixel of the plurality of pixels in the to-be-segmented image may include: The target feature information of the target object center point and the preset feature information of each pixel point are used to determine the target feature information of the pixel point; according to the target feature information of each pixel point of the plurality of pixel points, image segmentation is performed on the image to be segmented.
其中,在预设特征信息包括图像语义特征信息和图像边缘特征信息中的一者的情况下,可将待分割图像中像素点的目标物体中心点特征信息与该像素点的预设特征信息的乘积,作为该像素点的目标特征信息。在预设特征信息包括图像语义特征信息和图像边缘特征信息的情况下,可将待分割图像中像素点的目标物体中心点特征信息、该像素点的图像语义特征信息以及该像素点的图像边缘特征信息的乘积,作为该像素点的目标特征信息。Wherein, when the preset feature information includes one of image semantic feature information and image edge feature information, the feature information of the center point of the target object of the pixel point in the image to be segmented and the preset feature information of the pixel point can be compared. The product is used as the target feature information of the pixel. When the preset feature information includes image semantic feature information and image edge feature information, the feature information of the target object center point of the pixel in the image to be segmented, the image semantic feature information of the pixel, and the image edge of the pixel can be The product of feature information is used as the target feature information of the pixel.
本公开的实施例中,将像素点的目标物体中心点特征信息点乘到该像素点的预设特征信息,由于物体中心点及其附近像素点的目标物体中心点特征信息相对较高,不为物体中心点区域的像素点的目标物体中心点特征信息相对较低,因此将像素点的目标物体中心点特征信息与预设特征信息相乘,可使得物体中心点的目标特征信息更高。这样,根据目标特征信息进行图像分割时,可以使得物体所在区域在待分割图像中更加突出,可以更准确地把前景物体从待分割图像中分割出来。In the embodiment of the present disclosure, the feature information point of the target object center point of a pixel point is multiplied by the preset feature information of the pixel point. The target object center point feature information of the pixel points in the object center point area is relatively low, so multiplying the target object center point feature information of the pixel point with the preset feature information can make the target object feature information of the object center point higher. In this way, when the image is segmented according to the target feature information, the region where the object is located can be made more prominent in the image to be segmented, and the foreground object can be segmented from the image to be segmented more accurately.
示例地,根据所述多个像素点的各像素点的目标特征信息对待分割图像进行图像分割的示例性实施方式可以为:将待分割图像和所述多个像素点的各像素点的目标特征信息输入到图像分割模型中,以通过图像分割模型对待分割图像进行图像分割。Illustratively, an exemplary implementation of performing image segmentation on the image to be segmented according to the target feature information of each pixel of the plurality of pixels may be: dividing the image to be segmented and the target feature of each pixel of the plurality of pixels. The information is input into the image segmentation model to perform image segmentation on the image to be segmented by the image segmentation model.
图像分割模型可以是任一种网络模型,例如全卷积的网络模型。该图像分割模型可以是预先训练出的。The image segmentation model can be any network model, such as a fully convolutional network model. The image segmentation model may be pre-trained.
其中,像素点的目标特征信息是根据该像素点的目标物体中心点特征信息,以及图像语义特征信息和/或图像边缘特征信息得到的,图像分割模型在对待分割图像进行图像分割时,由于物体中心点的目标特征信息相对更高,因此物体所在的区域与背景区域之间的区别更加显著,根据各像素点的目标特征信息,可以更准确地把物体所在的区域分割出来,从而提升图像分割效果和图像分割准确度。Among them, the target feature information of a pixel is obtained according to the feature information of the center point of the target object of the pixel, and the image semantic feature information and/or the image edge feature information. When the image segmentation model performs image segmentation on the image to be segmented, due to the object The target feature information of the center point is relatively higher, so the difference between the area where the object is located and the background area is more significant. According to the target feature information of each pixel point, the area where the object is located can be more accurately segmented, thereby improving image segmentation. Effects and image segmentation accuracy.
基于同一发明构思,本公开还提供一种图像分割装置,图4是根据一示例性实施例示出的一种图像分割装置的框图,如图4所示,该装置400可包括:Based on the same inventive concept, the present disclosure also provides an image segmentation device. FIG. 4 is a block diagram of an image segmentation device according to an exemplary embodiment. As shown in FIG. 4 , the device 400 may include:
获取模块401,用于获取待分割图像中多个像素点的各像素点的预测物体中心点特征信息,所述预测物体中心点特征信息用于表征所述像素点为物体中心点的可信度;The obtaining module 401 is used to obtain the predicted object center point feature information of each pixel point of a plurality of pixel points in the image to be divided, the predicted object center point feature information is used to represent the reliability of the pixel point as the object center point ;
确定模块402,用于根据各像素点的所述预测物体中心点特征信息,确定所述待分割图像中物体的中心点位置信息;A determination module 402, configured to determine the center point position information of the object in the to-be-segmented image according to the feature information of the predicted object center point of each pixel;
图像分割模块403,用于根据所述中心点位置信息,对所述待分割图像进行图像分割。The image segmentation module 403 is configured to perform image segmentation on the to-be-segmented image according to the center point position information.
采用上述装置,首先获取待分割图像中多个像素点的各像素点的预测物体中心点特征信息,根据各像素点的预测物体中心点特征信息,可确定待分割图像中物体的中心点位置信息,进而根据物体的中心点位置信息,对待分割图像进行图像分割。这样,考虑物体的中心点位置信息,可以使得物体所在的区域在待分割图像中更加突出,与背景区域之间的区别更加显著,从而在对待分割图像进行图像分割时,能够更准确地把前景物体分割出来,有效减小背景区域的干扰。并且,即使存在物体被局部遮挡或图像中有其他相似物体出现等情况,该物体的中心点位置信息并不会受到影响,因此,根据物体的中心点位置信息进行图像分割,可以把物体所在区域准确分割出来,提升图像分割的准确度和分割效果。Using the above-mentioned device, first obtain the feature information of the predicted object center point of each pixel point in the image to be segmented, and then determine the center point position information of the object in the image to be segmented according to the feature information of the predicted object center point of each pixel point. , and then perform image segmentation on the image to be segmented according to the position information of the center point of the object. In this way, considering the position information of the center point of the object, the area where the object is located can be made more prominent in the image to be segmented, and the difference between it and the background area is more significant, so that the foreground can be more accurately segmented when the image to be segmented is segmented. The object is segmented to effectively reduce the interference of the background area. Moreover, even if the object is partially occluded or other similar objects appear in the image, the center point position information of the object will not be affected. Accurate segmentation can improve the accuracy and segmentation effect of image segmentation.
可选地,所述图像分割模块403,可包括:第一确定子模块,用于根据所述中心点位置信息和所述预测物体中心点特征信息,确定所述待分割图像中多个像素点的各像素点的目标物体中心点特征信息,其中,所述中心点位置信息对应的像素点的所述目标物体中心点特征信息大于所述像素点的所述预测物体中心点特征信息;第一图像分割子模块,用于根据所述目标物体中心点特征信息,对所述待分割图像进行图像分割。Optionally, the image segmentation module 403 may include: a first determination sub-module, configured to determine a plurality of pixel points in the image to be segmented according to the center point position information and the predicted object center point feature information. The feature information of the center point of the target object of each pixel point, wherein the feature information of the center point of the target object of the pixel point corresponding to the position information of the center point is greater than the feature information of the center point of the predicted object of the pixel point; first The image segmentation sub-module is configured to perform image segmentation on the to-be-segmented image according to the feature information of the center point of the target object.
可选地,所述第一图像分割子模块,可包括:获取子模块,用于获取所述待分割图像中多个像素点的各像素点的预设特征信息,所述预设特征信息包括图像语义特征信息和图像边缘特征信息中的至少一者;第二分割子模块,用于根据所述待分割图像中多个像素点的各像素点的所述目标物体中心点特征信息和所述预设特征信息,对所述待分割图像进行图像分割。Optionally, the first image segmentation sub-module may include: an acquisition sub-module for acquiring preset feature information of each pixel of a plurality of pixels in the to-be-segmented image, where the preset feature information includes: at least one of image semantic feature information and image edge feature information; a second segmentation sub-module, used for the target object center point feature information of each pixel point in the image to be segmented and the Preset feature information, and perform image segmentation on the to-be-segmented image.
可选地,所述第二分割子模块,可包括:第二确定子模块,用于根据所述待分割图像中多个像素点的各像素点的所述目标物体中心点特征信息和所述预设特征信息,确定所述像素点的目标特征信息,其中,在所述预设特征信息包括所述图像语义特征信息和所述图像边缘特征信息中的一者的情况下,将所述待分割图像中像素点的目标物体中心点特征信息与所述像素点的预设特征信息的乘积,作为所述像素点的所述目标特征信息,在所述预设特征信息包括所述图像语义特征信息和所述图像边缘特征信息的情况下,将所述待分割图像中像素点的目标物体中心点特征信息、所述像素点的图像语义特征信息以及所述像素 点的图像边缘特征信息的乘积,作为所述像素点的所述目标特征信息;第三分割子模块,用于根据多个像素点的各像素点的所述目标特征信息,对所述待分割图像进行图像分割。Optionally, the second segmentation sub-module may include: a second determination sub-module, which is configured to, according to the feature information of the center point of the target object and the Preset feature information to determine target feature information of the pixel, wherein, in the case that the preset feature information includes one of the image semantic feature information and the image edge feature information, the to-be-to-be-featured feature information is The product of the target object center point feature information of a pixel in the segmented image and the preset feature information of the pixel is taken as the target feature information of the pixel, where the preset feature information includes the image semantic feature. information and the image edge feature information, the product of the target object center point feature information of the pixel point in the to-be-segmented image, the image semantic feature information of the pixel point, and the image edge feature information of the pixel point , as the target feature information of the pixel point; a third segmentation sub-module, configured to perform image segmentation on the to-be-segmented image according to the target feature information of each pixel point of a plurality of pixel points.
可选地,所述第三分割子模块,可包括:输入子模块,用于将所述待分割图像和各像素点的所述目标特征信息输入到图像分割模型中,以通过所述图像分割模型对所述待分割图像进行图像分割。Optionally, the third segmentation sub-module may include: an input sub-module for inputting the to-be-segmented image and the target feature information of each pixel into an image segmentation model, so as to divide the image through the image segmentation. The model performs image segmentation on the image to be segmented.
可选地,所述确定模块402,可包括:第三确定子模块,用于将多个像素点的所述预测物体中心点特征信息中局部极大值对应的像素点的位置信息,确定为所述中心点位置信息。Optionally, the determining module 402 may include: a third determining sub-module, configured to determine the position information of the pixel corresponding to the local maximum value in the feature information of the predicted object center point of the plurality of pixel points as the location information of the center point.
可选地,所述待分割图像为视频中的图像帧;所述确定模块402,可包括:第四确定子模块,用于根据参考图像帧中所述物体的中心点位置信息以及所述物体的运动轨迹信息,确定所述待分割图像中所述物体的预测中心点,其中,所述参考图像帧为所述视频中与所述待分割图像不同的图像帧;第五确定子模块,用于确定所述待分割图像中所述物体所在的区域中所述预测物体中心点特征信息最大的预设数量个像素点;第六确定子模块,用于将所述预设数量个像素点中与所述预测中心点之间的距离最近的像素点的位置信息,确定为所述中心点位置信息。Optionally, the to-be-segmented image is an image frame in a video; the determining module 402 may include: a fourth determining sub-module, configured to determine the object according to the center point position information of the object in the reference image frame and the object The motion trajectory information of the to-be-segmented image determines the predicted center point of the object in the to-be-segmented image, wherein the reference image frame is an image frame in the video that is different from the to-be-segmented image; the fifth determination sub-module uses In determining the area where the object is located in the to-be-segmented image, the preset number of pixels with the largest feature information of the center point of the predicted object; the sixth determination sub-module is used to determine the number of pixels in the preset number of pixels. The position information of the pixel with the closest distance to the prediction center point is determined as the center point position information.
关于上述实施例中的装置,其中各个模块执行操作的具体方式已经在有关方法的实施例中进行了详细描述,此处将不做详细阐述说明。Regarding the apparatus in the above-mentioned embodiments, the specific manner in which each module performs operations has been described in detail in the embodiments of the related method, and will not be described in detail here.
下面参考图5,其示出了适于用来实现本公开实施例的电子设备500的结构示意图。本公开实施例中的终端设备可以包括但不限于诸如移动电话、笔记本电脑、数字广播接收器、PDA(个人数字助理)、PAD(平板电脑)、PMP(便携式多媒体播放器)、车载终端(例如车载导航终端)等等的移动终端以及诸如数字TV、台式计算机等等的固定终端。图5示出的电子设备仅仅是一个示例,不应对本公开实施例的功能和使用范围带来任何限制。Referring next to FIG. 5 , it shows a schematic structural diagram of an electronic device 500 suitable for implementing an embodiment of the present disclosure. Terminal devices in the embodiments of the present disclosure may include, but are not limited to, such as mobile phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablets), PMPs (portable multimedia players), vehicle-mounted terminals (eg, mobile terminals such as in-vehicle navigation terminals), etc., and stationary terminals such as digital TVs, desktop computers, and the like. The electronic device shown in FIG. 5 is only an example, and should not impose any limitation on the function and scope of use of the embodiments of the present disclosure.
如图5所示,电子设备500可以包括处理装置(例如中央处理器、图形处理器等)501,其可以根据存储在只读存储器(ROM)502中的程序或者从存储装置508加载到随机访问存储器(RAM)503中的程序而执行各种适当的动作和处理。在RAM 503中,还存储有电子设备500操作所需的各种程序和数据。处理装置501、ROM 502以及RAM 503通过总线504彼此相连。输入/输出(I/O)接口505也连接至总线504。As shown in FIG. 5 , an electronic device 500 may include a processing device (eg, a central processing unit, a graphics processor, etc.) 501 that may be loaded into random access according to a program stored in a read only memory (ROM) 502 or from a storage device 508 Various appropriate actions and processes are executed by the programs in the memory (RAM) 503 . In the RAM 503, various programs and data required for the operation of the electronic device 500 are also stored. The processing device 501, the ROM 502, and the RAM 503 are connected to each other through a bus 504. An input/output (I/O) interface 505 is also connected to bus 504 .
通常,以下装置可以连接至I/O接口505:包括例如触摸屏、触摸板、键盘、鼠标、摄像头、麦克风、加速度计、陀螺仪等的输入装置506;包括例如液晶显示器(LCD)、扬 声器、振动器等的输出装置507;包括例如磁带、硬盘等的存储装置508;以及通信装置509。通信装置509可以允许电子设备500与其他设备进行无线或有线通信以交换数据。虽然图5示出了具有各种装置的电子设备500,但是应理解的是,并不要求实施或具备所有示出的装置。可以替代地实施或具备更多或更少的装置。Typically, the following devices may be connected to the I/O interface 505: input devices 506 including, for example, a touch screen, touchpad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; including, for example, a liquid crystal display (LCD), speakers, vibration An output device 507 such as a computer; a storage device 508 including, for example, a magnetic tape, a hard disk, etc.; and a communication device 509 . Communication means 509 may allow electronic device 500 to communicate wirelessly or by wire with other devices to exchange data. While FIG. 5 shows electronic device 500 having various means, it should be understood that not all of the illustrated means are required to be implemented or provided. More or fewer devices may alternatively be implemented or provided.
特别地,根据本公开的实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。例如,本公开的实施例包括一种计算机程序产品,其包括承载在非暂态计算机可读介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的实施例中,该计算机程序可以通过通信装置509从网络上被下载和安装,或者从存储装置508被安装,或者从ROM 502被安装。在该计算机程序被处理装置501执行时,执行本公开实施例的方法中限定的上述功能。In particular, according to embodiments of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program carried on a non-transitory computer readable medium, the computer program containing program code for performing the method illustrated in the flowchart. In such an embodiment, the computer program may be downloaded and installed from the network via the communication device 509, or from the storage device 508, or from the ROM 502. When the computer program is executed by the processing apparatus 501, the above-mentioned functions defined in the methods of the embodiments of the present disclosure are executed.
需要说明的是,本公开上述的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本公开中,计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读信号介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:电线、光缆、RF(射频)等等,或者上述的任意合适的组合。It should be noted that the computer-readable medium mentioned above in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two. The computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above. More specific examples of computer readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable Programmable read only memory (EPROM or flash memory), fiber optics, portable compact disk read only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the foregoing. In this disclosure, a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device. In the present disclosure, however, a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave with computer-readable program code embodied thereon. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing. A computer-readable signal medium can also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device . Program code embodied on a computer readable medium may be transmitted using any suitable medium including, but not limited to, electrical wire, optical fiber cable, RF (radio frequency), etc., or any suitable combination of the foregoing.
在一些实施方式中,客户端、服务器可以利用诸如HTTP(HyperText Transfer Protocol,超文本传输协议)之类的任何当前已知或未来研发的网络协议进行通信,并且可以与任意形式或介质的数字数据通信(例如,通信网络)互连。通信网络的示例包括局域网(“LAN”),广域网(“WAN”),网际网(例如,互联网)以及端对端网络(例如,ad hoc端对端网络),以及任何当前已知或未来研发的网络。In some embodiments, the client and server can use any currently known or future developed network protocol such as HTTP (HyperText Transfer Protocol) to communicate, and can communicate with digital data in any form or medium Communication (eg, a communication network) interconnects. Examples of communication networks include local area networks ("LAN"), wide area networks ("WAN"), the Internet (eg, the Internet), and peer-to-peer networks (eg, ad hoc peer-to-peer networks), as well as any currently known or future development network of.
上述计算机可读介质可以是上述电子设备中所包含的;也可以是单独存在,而未装配入该电子设备中。The above-mentioned computer-readable medium may be included in the above-mentioned electronic device; or may exist alone without being assembled into the electronic device.
上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被该电子设备执行时,使得该电子设备:获取待分割图像中各像素点的预测物体中心点特征信息,所述预测物体中心点特征信息用于表征所述像素点为物体中心点的可信度;根据各像素点的所述预测物体中心点特征信息,确定所述待分割图像中物体的中心点位置信息;根据所述中心点位置信息,对所述待分割图像进行图像分割。The above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the electronic device, the electronic device: obtains the predicted object center point feature information of each pixel in the image to be segmented, and the described The feature information of the predicted object center point is used to represent the reliability of the pixel point as the object center point; according to the predicted object center point feature information of each pixel point, the center point position information of the object in the image to be segmented is determined; Image segmentation is performed on the to-be-segmented image according to the center point position information.
可以以一种或多种程序设计语言或其组合来编写用于执行本公开的操作的计算机程序代码,上述程序设计语言包括但不限于面向对象的程序设计语言—诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言——诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括局域网(LAN)或广域网(WAN)——连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。Computer program code for performing operations of the present disclosure may be written in one or more programming languages, including but not limited to object-oriented programming languages—such as Java, Smalltalk, C++, and This includes conventional procedural programming languages - such as the "C" language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (eg, using an Internet service provider to via Internet connection).
附图中的流程图和框图,图示了按照本公开各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,该模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more logical functions for implementing the specified functions executable instructions. It should also be noted that, in some alternative implementations, the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It is also noted that each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented in dedicated hardware-based systems that perform the specified functions or operations , or can be implemented in a combination of dedicated hardware and computer instructions.
描述于本公开实施例中所涉及到的模块可以通过软件的方式实现,也可以通过硬件的方式来实现。其中,模块的名称在某种情况下并不构成对该模块本身的限定,例如,获取模块还可以被描述为“中心点特征信息获取模块”。The modules involved in the embodiments of the present disclosure may be implemented in software or hardware. Wherein, the name of the module does not constitute a limitation of the module itself under certain circumstances, for example, the acquisition module may also be described as a "central point feature information acquisition module".
本文中以上描述的功能可以至少部分地由一个或多个硬件逻辑部件来执行。例如,非限制性地,可以使用的示范类型的硬件逻辑部件包括:现场可编程门阵列(FPGA)、专用集成电路(ASIC)、专用标准产品(ASSP)、片上系统(SOC)、复杂可编程逻辑设备(CPLD) 等等。The functions described herein above may be performed, at least in part, by one or more hardware logic components. For example, without limitation, exemplary types of hardware logic components that may be used include: Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), Systems on Chips (SOCs), Complex Programmable Logical Devices (CPLDs), etc.
在本公开的上下文中,机器可读介质可以是有形的介质,其可以包含或存储以供指令执行系统、装置或设备使用或与指令执行系统、装置或设备结合地使用的程序。机器可读介质可以是机器可读信号介质或机器可读储存介质。机器可读介质可以包括但不限于电子的、磁性的、光学的、电磁的、红外的、或半导体系统、装置或设备,或者上述内容的任何合适组合。机器可读存储介质的更具体示例会包括基于一个或多个线的电气连接、便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦除可编程只读存储器(EPROM或快闪存储器)、光纤、便捷式紧凑盘只读存储器(CD-ROM)、光学储存设备、磁储存设备、或上述内容的任何合适组合。In the context of the present disclosure, a machine-readable medium may be a tangible medium that may contain or store a program for use by or in connection with the instruction execution system, apparatus or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. Machine-readable media may include, but are not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, devices, or devices, or any suitable combination of the foregoing. More specific examples of machine-readable storage media would include one or more wire-based electrical connections, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), fiber optics, compact disk read only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination of the foregoing.
根据本公开的一个或多个实施例,示例1提供了一种图像分割方法,所述方法包括:获取待分割图像中多个像素点的各像素点的预测物体中心点特征信息,所述预测物体中心点特征信息用于表征所述像素点为物体中心点的可信度;根据各像素点的所述预测物体中心点特征信息,确定所述待分割图像中物体的中心点位置信息;根据所述中心点位置信息,对所述待分割图像进行图像分割。According to one or more embodiments of the present disclosure, Example 1 provides an image segmentation method, the method comprising: acquiring predicted object center point feature information of each pixel of a plurality of pixels in an image to be segmented, the prediction The feature information of the object center point is used to represent the reliability of the pixel point as the object center point; according to the predicted object center point feature information of each pixel point, the center point position information of the object in the to-be-segmented image is determined; The center point position information is used to perform image segmentation on the to-be-segmented image.
根据本公开的一个或多个实施例,示例2提供了示例1的方法,所述根据所述中心点位置信息,对所述待分割图像进行图像分割,包括:根据所述中心点位置信息和所述预测物体中心点特征信息,确定所述待分割图像中多个像素点的各像素点的目标物体中心点特征信息,其中,所述中心点位置信息对应的像素点的所述目标物体中心点特征信息大于所述像素点的所述预测物体中心点特征信息;根据所述目标物体中心点特征信息,对所述待分割图像进行图像分割。According to one or more embodiments of the present disclosure, Example 2 provides the method of Example 1, wherein performing image segmentation on the to-be-segmented image according to the center point position information includes: according to the center point position information and The predicted object center point feature information is to determine the target object center point feature information of each pixel of a plurality of pixels in the image to be segmented, wherein the target object center of the pixel corresponding to the center point position information The point feature information is greater than the feature information of the predicted object center point of the pixel point; image segmentation is performed on the to-be-segmented image according to the center point feature information of the target object.
根据本公开的一个或多个实施例,示例3提供了示例2的方法,所述根据所述目标物体中心点特征信息,对所述待分割图像进行图像分割,包括:获取所述待分割图像中多个像素点的各像素点的预设特征信息,所述预设特征信息包括图像语义特征信息和图像边缘特征信息中的至少一者;根据所述待分割图像中多个像素点的各像素点的所述目标物体中心点特征信息和所述预设特征信息,对所述待分割图像进行图像分割。According to one or more embodiments of the present disclosure, Example 3 provides the method of Example 2, wherein performing image segmentation on the image to be segmented according to the feature information of the center point of the target object includes: acquiring the image to be segmented The preset feature information of each pixel point of the multiple pixels in the image, the preset feature information includes at least one of image semantic feature information and image edge feature information; The target object center point feature information and the preset feature information of the pixel points are used to perform image segmentation on the to-be-segmented image.
根据本公开的一个或多个实施例,示例4提供了示例3的方法,所述根据所述待分割图像中多个像素点的各像素点的所述目标物体中心点特征信息和所述预设特征信息,对所述待分割图像进行图像分割,包括:根据所述待分割图像中多个像素点的各像素点的所述目标物体中心点特征信息和所述预设特征信息,确定所述像素点的目标特征信息,其中,在所述预设特征信息包括所述图像语义特征信息和所述图像边缘特征信息中的一者的情 况下,将所述待分割图像中像素点的目标物体中心点特征信息与所述像素点的预设特征信息的乘积,作为所述像素点的所述目标特征信息,在所述预设特征信息包括所述图像语义特征信息和所述图像边缘特征信息的情况下,将所述待分割图像中像素点的目标物体中心点特征信息、所述像素点的图像语义特征信息以及所述像素点的图像边缘特征信息的乘积,作为所述像素点的所述目标特征信息;根据多个像素点的各像素点的所述目标特征信息,对所述待分割图像进行图像分割。According to one or more embodiments of the present disclosure, Example 4 provides the method of Example 3, wherein the target object center point feature information of each pixel point of a plurality of pixel points in the to-be-segmented image and the prediction method are provided in Example 4. Assuming feature information, performing image segmentation on the to-be-segmented image includes: determining the target object center point feature information and the preset feature information of each pixel point of a plurality of pixel points in the to-be-segmented image. The target feature information of the pixel point, wherein, in the case where the preset feature information includes one of the image semantic feature information and the image edge feature information, the target of the pixel point in the image to be segmented The product of the feature information of the object center point and the preset feature information of the pixel point is used as the target feature information of the pixel point, and the preset feature information includes the image semantic feature information and the image edge feature. In the case of information, the product of the target object center point feature information of the pixel points in the image to be segmented, the image semantic feature information of the pixel points and the image edge feature information of the pixel points is used as the pixel point. the target feature information; image segmentation is performed on the to-be-segmented image according to the target feature information of each pixel point of a plurality of pixel points.
根据本公开的一个或多个实施例,示例5提供了示例4的方法,所述根据各像素点的所述目标特征信息,对所述待分割图像进行图像分割,包括:将所述待分割图像和各像素点的所述目标特征信息输入到图像分割模型中,以通过所述图像分割模型对所述待分割图像进行图像分割。According to one or more embodiments of the present disclosure, Example 5 provides the method of Example 4, wherein performing image segmentation on the to-be-segmented image according to the target feature information of each pixel includes: dividing the to-be-segmented image The image and the target feature information of each pixel point are input into the image segmentation model, so as to perform image segmentation on the to-be-segmented image through the image segmentation model.
根据本公开的一个或多个实施例,示例6提供了示例1的方法,所述根据多个像素点的各像素点的所述预测物体中心点特征信息,确定所述待分割图像中物体的中心点位置信息,包括:将所述多个像素点的所述预测物体中心点特征信息中局部极大值对应的像素点的位置信息,确定为所述中心点位置信息。According to one or more embodiments of the present disclosure, Example 6 provides the method of Example 1, wherein according to the feature information of the predicted object center point of each pixel point of the plurality of pixel points, determine the object in the image to be segmented. The center point position information includes: determining the position information of the pixel point corresponding to the local maximum value in the predicted object center point feature information of the plurality of pixel points as the center point position information.
根据本公开的一个或多个实施例,示例7提供了示例1的方法,所述待分割图像为视频中的图像帧;所述根据多个像素点的各像素点的所述预测物体中心点特征信息,确定所述待分割图像中物体的中心点位置信息,包括:根据参考图像帧中所述物体的中心点位置信息以及所述物体的运动轨迹信息,确定所述待分割图像中所述物体的预测中心点,其中,所述参考图像帧为所述视频中与所述待分割图像不同的图像帧;确定所述待分割图像中所述物体所在的区域中所述预测物体中心点特征信息最大的预设数量个像素点;将所述预设数量个像素点中与所述预测中心点之间的距离最近的像素点的位置信息,确定为所述中心点位置信息。According to one or more embodiments of the present disclosure, Example 7 provides the method of Example 1, wherein the image to be segmented is an image frame in a video; the predicted object center point according to each pixel point of a plurality of pixel points feature information, and determining the center point position information of the object in the image to be segmented includes: determining the center point location information of the object in the reference image frame and the motion track information of the object The predicted center point of the object, wherein the reference image frame is an image frame in the video that is different from the image to be segmented; determine the center point feature of the predicted object in the area where the object is located in the image to be segmented A preset number of pixels with the largest information; the position information of the pixel with the closest distance to the predicted center point among the preset number of pixels is determined as the center point position information.
根据本公开的一个或多个实施例,示例8提供一种图像分割装置,所述装置包括:获取模块,用于获取待分割图像中多个像素点的各像素点的预测物体中心点特征信息,所述预测物体中心点特征信息用于表征所述像素点为物体中心点的可信度;确定模块,用于根据各像素点的所述预测物体中心点特征信息,确定所述待分割图像中物体的中心点位置信息;图像分割模块,用于根据所述中心点位置信息,对所述待分割图像进行图像分割。According to one or more embodiments of the present disclosure, Example 8 provides an apparatus for image segmentation, the apparatus includes: an acquisition module configured to acquire feature information of predicted object center points of each pixel point of a plurality of pixel points in an image to be segmented , the feature information of the predicted object center point is used to represent the reliability of the pixel point as the object center point; the determination module is used to determine the image to be segmented according to the predicted object center point feature information of each pixel point The center point position information of the object in the middle; the image segmentation module is configured to perform image segmentation on the to-be-segmented image according to the center point position information.
根据本公开的一个或多个实施例,示例9提供一种非瞬时性计算机可读介质,其上存储有计算机程序,该程序被处理装置执行时实现示例1-7中任一示例所述方法的步骤。According to one or more embodiments of the present disclosure, Example 9 provides a non-transitory computer-readable medium having stored thereon a computer program that, when executed by a processing apparatus, implements the method described in any one of Examples 1-7 A step of.
根据本公开的一个或多个实施例,示例10提供一种电子设备,包括:存储装置,其上 存储有计算机程序;处理装置,用于执行所述存储装置中的所述计算机程序,以实现示例1-7中任一示例所述方法的步骤。According to one or more embodiments of the present disclosure, Example 10 provides an electronic device, including: a storage device on which a computer program is stored; and a processing device for executing the computer program in the storage device to achieve The steps of the method of any of Examples 1-7.
根据本公开的一些实施例,还提供了一种计算机程序,包括:指令,所述指令当由处理器执行时使所述处理器执行如前所述的图像分割方法。According to some embodiments of the present disclosure, there is also provided a computer program comprising: instructions which, when executed by a processor, cause the processor to perform the image segmentation method as previously described.
根据本公开的一些实施例,还提供了一种计算机程序产品,包括指令,所述指令当由处理器执行时使所述处理器执行如前所述的图像分割方法。According to some embodiments of the present disclosure, there is also provided a computer program product comprising instructions that, when executed by a processor, cause the processor to perform the image segmentation method as previously described.
以上描述仅为本公开的较佳实施例以及对所运用技术原理的说明。本领域技术人员应当理解,本公开中所涉及的公开范围,并不限于上述技术特征的特定组合而成的技术方案,同时也应涵盖在不脱离上述公开构思的情况下,由上述技术特征或其等同特征进行任意组合而形成的其它技术方案。例如上述特征与本公开中公开的(但不限于)具有类似功能的技术特征进行互相替换而形成的技术方案。The above description is merely a preferred embodiment of the present disclosure and an illustration of the technical principles employed. Those skilled in the art should understand that the scope of the disclosure involved in the present disclosure is not limited to the technical solutions formed by the specific combination of the above-mentioned technical features, and should also cover, without departing from the above-mentioned disclosed concept, the technical solutions formed by the above-mentioned technical features or Other technical solutions formed by any combination of its equivalent features. For example, a technical solution is formed by replacing the above features with the technical features disclosed in the present disclosure (but not limited to) with similar functions.
此外,虽然采用特定次序描绘了各操作,但是这不应当理解为要求这些操作以所示出的特定次序或以顺序次序执行来执行。在一定环境下,多任务和并行处理可能是有利的。同样地,虽然在上面论述中包含了若干具体实现细节,但是这些不应当被解释为对本公开的范围的限制。在单独的实施例的上下文中描述的某些特征还可以组合地实现在单个实施例中。相反地,在单个实施例的上下文中描述的各种特征也可以单独地或以任何合适的子组合的方式实现在多个实施例中。Additionally, although operations are depicted in a particular order, this should not be construed as requiring that the operations be performed in the particular order shown or in a sequential order. Under certain circumstances, multitasking and parallel processing may be advantageous. Likewise, although the above discussion contains several implementation-specific details, these should not be construed as limitations on the scope of the present disclosure. Certain features that are described in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination.
尽管已经采用特定于结构特征和/或方法逻辑动作的语言描述了本主题,但是应当理解所附权利要求书中所限定的主题未必局限于上面描述的特定特征或动作。相反,上面所描述的特定特征和动作仅仅是实现权利要求书的示例形式。关于上述实施例中的装置,其中各个模块执行操作的具体方式已经在有关该方法的实施例中进行了详细描述,此处将不做详细阐述说明。Although the subject matter has been described in language specific to structural features and/or logical acts of method, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are merely example forms of implementing the claims. Regarding the apparatus in the above-mentioned embodiment, the specific manner in which each module performs operations has been described in detail in the embodiment of the method, and will not be described in detail here.

Claims (13)

  1. 一种图像分割方法,包括:An image segmentation method, comprising:
    获取待分割图像中多个像素点的各像素点的预测物体中心点特征信息,所述预测物体中心点特征信息用于表征所述像素点为物体中心点的可信度;Obtaining predicted object center point feature information of each pixel point of a plurality of pixel points in the image to be segmented, where the predicted object center point feature information is used to characterize the reliability of the pixel point as an object center point;
    根据各像素点的所述预测物体中心点特征信息,确定所述待分割图像中物体的中心点位置信息;以及According to the feature information of the predicted object center point of each pixel point, determine the center point position information of the object in the to-be-segmented image; and
    根据所述中心点位置信息,对所述待分割图像进行图像分割。Image segmentation is performed on the to-be-segmented image according to the center point position information.
  2. 根据权利要求1所述的方法,其中,所述根据所述中心点位置信息,对所述待分割图像进行图像分割,包括:The method according to claim 1, wherein the performing image segmentation on the to-be-segmented image according to the center point position information comprises:
    根据所述中心点位置信息和所述预测物体中心点特征信息,确定所述待分割图像中所述多个像素点的各像素点的目标物体中心点特征信息,其中,所述中心点位置信息对应的像素点的所述目标物体中心点特征信息大于该像素点的所述预测物体中心点特征信息;以及According to the center point position information and the predicted object center point feature information, determine the target object center point feature information of each pixel point of the plurality of pixel points in the image to be segmented, wherein the center point position information The target object center point feature information of the corresponding pixel point is greater than the predicted object center point feature information of the pixel point; and
    根据所述目标物体中心点特征信息,对所述待分割图像进行图像分割。Image segmentation is performed on the to-be-segmented image according to the feature information of the center point of the target object.
  3. 根据权利要求2所述的方法,其中,所述根据所述目标物体中心点特征信息,对所述待分割图像进行图像分割,包括:The method according to claim 2, wherein the performing image segmentation on the to-be-segmented image according to the feature information of the center point of the target object comprises:
    获取所述待分割图像中所述多个像素点的各像素点的预设特征信息,所述预设特征信息包括图像语义特征信息和图像边缘特征信息中的至少一者;以及acquiring preset feature information of each pixel of the plurality of pixels in the image to be segmented, the preset feature information including at least one of image semantic feature information and image edge feature information; and
    根据所述待分割图像中所述多个像素点的各像素点的所述目标物体中心点特征信息和所述预设特征信息,对所述待分割图像进行图像分割。Image segmentation is performed on the to-be-segmented image according to the target object center point feature information and the preset feature information of each pixel of the plurality of pixels in the to-be-segmented image.
  4. 根据权利要求3所述的方法,其中,所述根据所述待分割图像中所述多个像素点的各像素点的所述目标物体中心点特征信息和所述预设特征信息,对所述待分割图像进行图像分割,包括:The method according to claim 3, wherein, according to the feature information of the target object center point and the preset feature information of each pixel point of the plurality of pixel points in the image to be segmented, the Image segmentation is performed on the image to be segmented, including:
    根据所述待分割图像中所述多个像素点的各像素点的所述目标物体中心点特征信息和所述预设特征信息,确定所述像素点的目标特征信息;以及According to the target object center point feature information and the preset feature information of each pixel point of the plurality of pixel points in the image to be divided, determine the target feature information of the pixel point; and
    根据所述多个像素点的各像素点的所述目标特征信息,对所述待分割图像进行图 像分割。Image segmentation is performed on the image to be segmented according to the target feature information of each pixel point of the plurality of pixel points.
  5. 根据权利要求4所述的方法,其中,The method of claim 4, wherein,
    在所述预设特征信息包括所述图像语义特征信息和所述图像边缘特征信息中的一者的情况下,将所述待分割图像中像素点的目标物体中心点特征信息与所述像素点的预设特征信息的乘积,作为所述像素点的所述目标特征信息;或者When the preset feature information includes one of the image semantic feature information and the image edge feature information, compare the target object center point feature information of the pixels in the to-be-segmented image with the pixel points The product of the preset feature information of , as the target feature information of the pixel point; or
    在所述预设特征信息包括所述图像语义特征信息和所述图像边缘特征信息的情况下,将所述待分割图像中像素点的目标物体中心点特征信息、所述像素点的图像语义特征信息以及所述像素点的图像边缘特征信息的乘积,作为所述像素点的所述目标特征信息。In the case that the preset feature information includes the image semantic feature information and the image edge feature information, the target object center point feature information of the pixels in the image to be segmented, the image semantic features of the pixels The product of the information and the image edge feature information of the pixel is taken as the target feature information of the pixel.
  6. 根据权利要求4所述的方法,其中,所述根据所述多个像素点的各像素点的所述目标特征信息,对所述待分割图像进行图像分割,包括:The method according to claim 4, wherein the performing image segmentation on the to-be-segmented image according to the target feature information of each pixel point of the plurality of pixel points comprises:
    将所述待分割图像和所述多个像素点的各像素点的所述目标特征信息输入到图像分割模型中,以通过所述图像分割模型对所述待分割图像进行图像分割。Inputting the image to be segmented and the target feature information of each pixel point of the plurality of pixel points into an image segmentation model, so as to perform image segmentation on the image to be segmented through the image segmentation model.
  7. 根据权利要求1所述的方法,其中,所述根据各像素点的所述预测物体中心点特征信息,确定所述待分割图像中物体的中心点位置信息,包括:The method according to claim 1, wherein the determining the center point position information of the object in the to-be-segmented image according to the feature information of the predicted object center point of each pixel point comprises:
    将所述多个像素点的所述预测物体中心点特征信息中局部极大值对应的像素点的位置信息,确定为所述中心点位置信息。The position information of the pixel point corresponding to the local maximum value in the feature information of the predicted object center point of the plurality of pixel points is determined as the center point position information.
  8. 根据权利要求1所述的方法,其中,所述待分割图像为视频中的图像帧;The method according to claim 1, wherein the to-be-segmented image is an image frame in a video;
    所述根据各像素点的所述预测物体中心点特征信息,确定所述待分割图像中物体的中心点位置信息,包括:The determining the center point position information of the object in the to-be-segmented image according to the feature information of the predicted object center point of each pixel, including:
    根据参考图像帧中所述物体的中心点位置信息以及所述物体的运动轨迹信息,确定所述待分割图像中所述物体的预测中心点,其中,所述参考图像帧为所述视频中与所述待分割图像不同的图像帧;Determine the predicted center point of the object in the to-be-segmented image according to the center point position information of the object and the motion track information of the object in the reference image frame, wherein the reference image frame is the same as that in the video. different image frames of the to-be-segmented images;
    确定所述待分割图像中所述物体所在的区域中所述预测物体中心点特征信息最大的预设数量个像素点;以及Determine the maximum preset number of pixels in the region where the object is located in the to-be-segmented image; and
    将所述预设数量个像素点中与所述预测中心点之间的距离最近的像素点的位置 信息,确定为所述中心点位置信息。The position information of the pixel point with the closest distance to the prediction center point among the preset number of pixel points is determined as the center point position information.
  9. 一种图像分割装置,包括:An image segmentation device, comprising:
    获取模块,用于获取待分割图像中多个像素点的各像素点的预测物体中心点特征信息,所述预测物体中心点特征信息用于表征所述像素点为物体中心点的可信度;an acquisition module, configured to acquire the predicted object center point feature information of each pixel of a plurality of pixels in the image to be segmented, where the predicted object center point feature information is used to characterize the reliability of the pixel as the object center point;
    确定模块,用于根据各像素点的所述预测物体中心点特征信息,确定所述待分割图像中物体的中心点位置信息;以及a determining module, configured to determine the center point position information of the object in the image to be segmented according to the feature information of the predicted object center point of each pixel; and
    图像分割模块,用于根据所述中心点位置信息,对所述待分割图像进行图像分割。An image segmentation module, configured to perform image segmentation on the to-be-segmented image according to the center point position information.
  10. 一种非瞬时性计算机可读介质,其上存储有计算机程序,该程序被处理装置执行时实现权利要求1-8中任一项所述方法的步骤。A non-transitory computer-readable medium having stored thereon a computer program which, when executed by a processing device, implements the steps of the method of any one of claims 1-8.
  11. 一种电子设备,包括:An electronic device comprising:
    存储装置,其上存储有计算机程序;a storage device on which a computer program is stored;
    处理装置,用于执行所述存储装置中的所述计算机程序,以实现权利要求1-8中任一项所述方法。A processing device, configured to execute the computer program in the storage device, so as to implement the method of any one of claims 1-8.
  12. 一种计算机程序,包括:A computer program comprising:
    指令,所述指令当由处理器执行时使所述处理器执行根据权利要求1-8中任一项所述的图像分割方法。Instructions which, when executed by a processor, cause the processor to perform the image segmentation method of any of claims 1-8.
  13. 一种计算机程序产品,包括指令,所述指令当由处理器执行时使所述处理器执行根据权利要求1-8中任一项所述的图像分割方法。A computer program product comprising instructions which, when executed by a processor, cause the processor to perform the image segmentation method of any of claims 1-8.
PCT/CN2021/128958 2020-11-18 2021-11-05 Image segmentation method and apparatus, readable medium, and electronic device WO2022105622A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011295274.X 2020-11-18
CN202011295274.XA CN112418232A (en) 2020-11-18 2020-11-18 Image segmentation method and device, readable medium and electronic equipment

Publications (1)

Publication Number Publication Date
WO2022105622A1 true WO2022105622A1 (en) 2022-05-27

Family

ID=74773075

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/128958 WO2022105622A1 (en) 2020-11-18 2021-11-05 Image segmentation method and apparatus, readable medium, and electronic device

Country Status (2)

Country Link
CN (1) CN112418232A (en)
WO (1) WO2022105622A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115908459A (en) * 2023-03-10 2023-04-04 中科慧远视觉技术(北京)有限公司 Image segmentation method and device, computer equipment and readable storage medium

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112418232A (en) * 2020-11-18 2021-02-26 北京有竹居网络技术有限公司 Image segmentation method and device, readable medium and electronic equipment

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105844641A (en) * 2016-03-24 2016-08-10 武汉工程大学 Adaptive threshold segmentation method in dynamic environment
CN106204538A (en) * 2016-06-28 2016-12-07 陕西师范大学 A kind of image partition method and system
CN108090908A (en) * 2017-12-07 2018-05-29 深圳云天励飞技术有限公司 Image partition method, device, terminal and storage medium
CN108509820A (en) * 2017-02-23 2018-09-07 百度在线网络技术(北京)有限公司 Method for obstacle segmentation and device, computer equipment and readable medium
CN110838125A (en) * 2019-11-08 2020-02-25 腾讯医疗健康(深圳)有限公司 Target detection method, device, equipment and storage medium of medical image
CN112418232A (en) * 2020-11-18 2021-02-26 北京有竹居网络技术有限公司 Image segmentation method and device, readable medium and electronic equipment

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110930419A (en) * 2020-02-13 2020-03-27 北京海天瑞声科技股份有限公司 Image segmentation method and device, electronic equipment and computer storage medium
CN111598902B (en) * 2020-05-20 2023-05-30 抖音视界有限公司 Image segmentation method, device, electronic equipment and computer readable medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105844641A (en) * 2016-03-24 2016-08-10 武汉工程大学 Adaptive threshold segmentation method in dynamic environment
CN106204538A (en) * 2016-06-28 2016-12-07 陕西师范大学 A kind of image partition method and system
CN108509820A (en) * 2017-02-23 2018-09-07 百度在线网络技术(北京)有限公司 Method for obstacle segmentation and device, computer equipment and readable medium
CN108090908A (en) * 2017-12-07 2018-05-29 深圳云天励飞技术有限公司 Image partition method, device, terminal and storage medium
CN110838125A (en) * 2019-11-08 2020-02-25 腾讯医疗健康(深圳)有限公司 Target detection method, device, equipment and storage medium of medical image
CN112418232A (en) * 2020-11-18 2021-02-26 北京有竹居网络技术有限公司 Image segmentation method and device, readable medium and electronic equipment

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115908459A (en) * 2023-03-10 2023-04-04 中科慧远视觉技术(北京)有限公司 Image segmentation method and device, computer equipment and readable storage medium

Also Published As

Publication number Publication date
CN112418232A (en) 2021-02-26

Similar Documents

Publication Publication Date Title
CN110413812B (en) Neural network model training method and device, electronic equipment and storage medium
WO2022105622A1 (en) Image segmentation method and apparatus, readable medium, and electronic device
CN111784712B (en) Image processing method, device, equipment and computer readable medium
CN111222509B (en) Target detection method and device and electronic equipment
WO2022116947A1 (en) Video cropping method and apparatus, storage medium and electronic device
WO2022028253A1 (en) Positioning model optimization method, positioning method, positioning device, and storage medium
CN110866524A (en) License plate detection method, device, equipment and storage medium
CN111311609B (en) Image segmentation method and device, electronic equipment and storage medium
CN110852242A (en) Watermark identification method, device, equipment and storage medium based on multi-scale network
WO2022052889A1 (en) Image recognition method and apparatus, electronic device, and computer-readable medium
CN111680754B (en) Image classification method, device, electronic equipment and computer readable storage medium
CN111737575B (en) Content distribution method, content distribution device, readable medium and electronic equipment
CN110348374B (en) Vehicle detection method and device, electronic equipment and storage medium
CN114004229A (en) Text recognition method and device, readable medium and electronic equipment
CN112488947A (en) Model training and image processing method, device, equipment and computer readable medium
CN112418233A (en) Image processing method, image processing device, readable medium and electronic equipment
CN114359673B (en) Small sample smoke detection method, device and equipment based on metric learning
CN110991312A (en) Method, apparatus, electronic device, and medium for generating detection information
CN112070163B (en) Image segmentation model training and image segmentation method, device and equipment
CN111797932B (en) Image classification method, apparatus, device and computer readable medium
CN113744259B (en) Forest fire smoke detection method and equipment based on gray value increasing number sequence
WO2022194157A1 (en) Target tracking method and apparatus, device and medium
CN113808050B (en) Denoising method, device and equipment for 3D point cloud and storage medium
CN111814807B (en) Method, apparatus, electronic device, and computer-readable medium for processing image
WO2023109385A1 (en) Icon click detection method and apparatus, device, and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21893769

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21893769

Country of ref document: EP

Kind code of ref document: A1