CN110570436A - Image segmentation method based on depth perception - Google Patents

Image segmentation method based on depth perception

Info

Publication number
CN110570436A
CN110570436A
Authority
CN
China
Prior art keywords
pixel
target
image
gaussian
depth
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910661047.5A
Other languages
Chinese (zh)
Inventor
卢书芳
丁雪峰
寿旭峰
朱翔
高飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University of Technology ZJUT
Original Assignee
Zhejiang University of Technology ZJUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University of Technology ZJUT filed Critical Zhejiang University of Technology ZJUT
Priority to CN201910661047.5A priority Critical patent/CN110570436A/en
Publication of CN110570436A publication Critical patent/CN110570436A/en
Pending legal-status Critical Current

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/12Edge-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/136Segmentation; Edge detection involving thresholding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/194Segmentation; Edge detection involving foreground-background segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10004Still image; Photographic image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10024Color image
    • G06T5/70

Abstract

The invention discloses an image segmentation method based on depth perception, which comprises the following steps: (1) acquiring a depth map and a color map of a target and preprocessing them; (2) extracting the minimum depth range containing the target from the depth map, and deleting the pixels of the corresponding color map that lie outside the depth threshold; (3) drawing two bounding boxes on the color image processed in step (2), where the first box encloses the minimum area containing the complete target and the second box marks the definite foreground area within it; all pixels outside the first box are taken as background pixels T_B, all pixels inside the second box as target pixels T_F, and the pixels between the two boxes as possible target pixels T_U; (4) estimating Gaussian mixture models of the target and the background from the pixels that possibly belong to the target, and performing iterative segmentation with the Gaussian mixture models. The invention achieves a better segmentation effect and higher segmentation accuracy.

Description

Image segmentation method based on depth perception
Technical Field
The invention belongs to the technical field of digital image processing, and particularly relates to an image segmentation method based on depth perception.
Background
Image segmentation is a classic problem in image processing; it underpins higher-level digital image processing tasks and has been widely studied. Common image segmentation methods include region growing, watershed, cluster segmentation, edge detection, threshold segmentation, and the like.
The energy-minimization framework based on Markov Random Fields (MRF) is a research hotspot in image segmentation; its strength is that it can be combined with various theories to solve for a global optimum. In 2004, Rother et al. proposed the GrabCut algorithm based on Graph Cut.
Chinese patent publication No. CN108596919A discloses an automatic image segmentation method based on a depth map, which fuses depth information and improves the segmentation accuracy of the algorithm while achieving automatic, saliency-driven GrabCut segmentation.
The max-flow/min-cut algorithm in the GrabCut algorithm suffers from sensitivity to local noise, poor target edge extraction, and similar defects. By introducing depth perception, the depth range of the image can be normalized to 0-255 using the depth map, which yields a better edge segmentation effect for scenes with complex target backgrounds and small edge gradients.
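A minimal sketch of this depth normalization, assuming OpenCV and a raw 16-bit depth frame; the file name and variable names are illustrative, not from the patent:

```python
import cv2
import numpy as np

# Load a raw depth frame (e.g. uint16 from an RGB-D sensor) unchanged.
depth_raw = cv2.imread("depth.png", cv2.IMREAD_UNCHANGED)

# Linearly rescale the depth range to 0-255 for thresholding and display.
depth8 = cv2.normalize(depth_raw, None, 0, 255, cv2.NORM_MINMAX).astype(np.uint8)
```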
Disclosure of Invention
In order to overcome the defects of the prior art, the invention provides an image segmentation method based on depth perception, which achieves a better segmentation effect and more accurate image edge segmentation.
An image segmentation method based on depth perception comprises the following steps:
(1) Acquiring a depth map and a color map of a target and preprocessing the depth map and the color map;
(2) Extracting the minimum depth range containing the target from the depth map, and deleting the pixels of the corresponding color map that lie outside the depth threshold;
(3) Drawing two bounding boxes on the color image processed in step (2), where the first box encloses the minimum area containing the complete target and the second box marks the definite foreground area within it; all pixels outside the first box are taken as background pixels T_B, all pixels inside the second box as target pixels T_F, and the pixels between the two boxes as possible target pixels T_U;
(4) Estimating Gaussian mixture models of the target and the background from the pixels that possibly belong to the target, and performing iterative segmentation with the Gaussian mixture models.
In this method, the depth map is thresholded around the target and the minimum depth range containing the complete target is extracted; two bounding boxes are then drawn, the first marking the minimum region containing the complete target and the second marking the definite foreground region, so that a clear target edge is obtained from a complex background and the amount of subsequent computation is reduced; iterative segmentation is finally carried out with the Gaussian mixture models. A minimal sketch of the depth-thresholding step is given below.
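The sketch assumes OpenCV/NumPy, a depth map already aligned to the color map and normalized to 0-255, and an illustrative helper name (mask_by_depth is not from the patent):

```python
import cv2

def mask_by_depth(color, depth, d_min, d_max):
    """Keep only the color pixels whose depth lies in [d_min, d_max]."""
    mask = cv2.inRange(depth, d_min, d_max)          # 255 inside the range, 0 outside
    return cv2.bitwise_and(color, color, mask=mask)  # delete out-of-range pixels
```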
In step (1), the preprocessing comprises aligning the depth map with the color map and adjusting contrast and brightness, for example as in the following sketch.
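A minimal sketch of the contrast/brightness adjustment, assuming OpenCV; the gain (1.2) and bias (10) are illustrative assumptions, not values from the patent:

```python
import cv2

color = cv2.imread("color.png")
# Linear point transform: out = 1.2 * in + 10, clipped to [0, 255].
adjusted = cv2.convertScaleAbs(color, alpha=1.2, beta=10)
```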
In step (3), for each pixel n in T_B, the label is initialized to α_n = 0; for each pixel n in T_F, α_n = 2; and for each pixel n in T_U, α_n = 1.
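A sketch of this trimap initialization, assuming NumPy; the image size and the two box coordinates are illustrative placeholders:

```python
import numpy as np

labels = np.zeros((480, 640), dtype=np.uint8)  # alpha_n = 0: background, T_B
labels[40:300, 50:350] = 1                     # alpha_n = 1: possible target, T_U
labels[80:260, 90:310] = 2                     # alpha_n = 2: definite target, T_F
```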
In step (4), the specific steps of the iterative segmentation with the Gaussian mixture models are as follows (a code sketch follows step (4-5)):
(4-1) Clustering the pixels belonging to the target and those belonging to the background into K classes each by the K-means algorithm, i.e., into the K Gaussian components of the respective GMM, so that every Gaussian component obtains a set of pixel samples;
(4-2) Assigning a Gaussian component of the GMM to each pixel;
The purpose of this step is to find, for each pixel, the Gaussian component with the highest probability and assign the pixel to it. If pixel n is a target pixel, the RGB value of pixel n is substituted into each Gaussian component of the target GMM and the most probable one is chosen, i.e., the k_n-th Gaussian component:
$k_n = \arg\min_{k_n} D_n(\alpha_n, k_n, \theta, z_n)$
where α_n is the pixel label, θ and k_n are GMM parameters, and z_n is a pixel of the RGB-space image.
(4-3) According to the Gaussian component to which each pixel belongs from step (4-1), the pixel sample set of each Gaussian component is obtained; the mean and covariance parameters are then estimated from the RGB values of the pixel samples, and the weight of a Gaussian component is determined by the ratio of the number of pixels assigned to it to the total number of pixels:
$D_n(\alpha_n, k_n, \theta, z_n) = -\log \pi(\alpha_n, k_n) + \tfrac{1}{2}\log\det\Sigma(\alpha_n, k_n) + \tfrac{1}{2}[z_n - \mu(\alpha_n, k_n)]^{\top}\Sigma(\alpha_n, k_n)^{-1}[z_n - \mu(\alpha_n, k_n)]$
where D_n(·) is the Gaussian-mixture data term, θ and k are GMM parameters (π the weights, μ the means, Σ the covariances), α is the label of pixel n, and z is a pixel of the RGB-space image;
(4-4) A graph is built from the Gibbs energy, the weights of the t-links and n-links are computed from the estimated means, covariances and weights, and the segmentation is then solved with the max-flow/min-cut algorithm:
$\alpha = \arg\min_{\alpha} E(\alpha, k, \theta, z), \quad E(\alpha, k, \theta, z) = U(\alpha, k, \theta, z) + V(\alpha, z)$
where α is the pixel label (α = 0 the background, α = 1 the possible region, α = 2 the target region) and θ, k are GMM parameters;
(4-5) Steps (4-2) to (4-4) are repeated in the iterative minimization process until convergence, yielding a segmentation template for separating the target from the background.
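A minimal sketch of steps (4-1) to (4-3), assuming NumPy and scikit-learn; in practice OpenCV's cv2.grabCut bundles this whole loop, and the helper names fit_gmm and assign_component are illustrative. K = 5 follows the usual GrabCut choice and is an assumption here:

```python
import numpy as np
from sklearn.cluster import KMeans

K = 5  # Gaussian components per GMM (assumed, as in standard GrabCut)

def fit_gmm(pixels):
    """Steps (4-1)/(4-3): K-means clustering, then per-component
    weight pi_k, mean mu_k and covariance Sigma_k from the samples."""
    comp = KMeans(n_clusters=K, n_init=10).fit_predict(pixels)
    gmm = []
    for k in range(K):
        sample = pixels[comp == k]
        gmm.append((len(sample) / len(pixels),             # weight
                    sample.mean(axis=0),                   # mean
                    np.cov(sample.T) + 1e-6 * np.eye(3)))  # regularized covariance
    return gmm

def assign_component(gmm, z):
    """Step (4-2): return the component index k_n minimizing D_n
    for the RGB pixel z."""
    costs = [-np.log(pi) + 0.5 * np.log(np.linalg.det(sigma))
             + 0.5 * (z - mu) @ np.linalg.inv(sigma) @ (z - mu)
             for pi, mu, sigma in gmm]
    return int(np.argmin(costs))
```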
In step (4), after the segmentation is finished, the method further comprises smoothing the segmented boundary with border matting. For pictures on which edge enhancement cannot be performed, the target edge segmented with this depth-perception method is smoother and clearer. The smoothing specifically comprises the following steps:
According to the boundary information of the image, a mask of the image is selected and edge detection is performed with the Canny operator; the image contour is parameterized from the obtained edge information, the pixels belonging to the same continuous contour are found, the pixel information of all contours is stored, and the Δ and σ of each contour pixel are obtained by minimizing the energy function:
$E = \sum_{n \in T_U} D_n(\alpha_n) + \sum_{t=1}^{T} V(\Delta_t, \sigma_t, \Delta_{t+1}, \sigma_{t+1})$
where D_n is the data term and V the smoothing term; α_n is the pixel label, and Δ_t, σ_t are the Δ and σ of a contour pixel.
The alpha values are computed with a Sigmoid function, and the alpha mask of the whole image is fused with the original image to obtain the border-matted image.
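A hedged sketch of such a Sigmoid alpha profile, assuming NumPy; the exact profile used in the patent may differ, and soft_alpha is an illustrative name:

```python
import numpy as np

def soft_alpha(dist, delta, sigma):
    """Soft 0-to-1 transition across the contour: dist is the signed
    distance of a pixel from the contour, delta shifts the transition
    centre, sigma controls its width (both from the energy
    minimization above)."""
    return 1.0 / (1.0 + np.exp(-(dist - delta) / np.maximum(sigma, 1e-6)))
```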
Compared with the prior art, the invention has the following beneficial effects:
By combining the depth map with the color map to segment the target, the invention effectively overcomes the inaccurate target edges and incomplete targets of color-only image segmentation. The method of the invention achieves a better segmentation effect and higher segmentation accuracy.
Drawings
FIG. 1 is a schematic flow chart of an image segmentation method based on depth perception according to the present invention;
FIG. 2 is a color image and a depth image of an image to be segmented according to an embodiment of the present invention, in which (a) is the color image and (b) is the depth image;
FIG. 3 is a color drawing of an embodiment of the present invention with areas outside the depth threshold removed;
FIG. 4 is a diagram illustrating a first time a region containing a complete target according to an embodiment of the present invention;
FIG. 5 illustrates a second outlined region of positive foreground in an embodiment of the present invention;
FIG. 6 is a diagram illustrating the effect of the final segmentation in the embodiment of the present invention;
FIG. 7 shows segmentation results for other images according to the embodiment of the present invention, wherein (a) is before segmentation and (b) is after segmentation.
Detailed Description
The invention will be described in further detail below with reference to the drawings and examples, which are intended to facilitate understanding of the invention without limiting it in any way.
As shown in FIG. 1, an image segmentation method based on depth perception comprises the following steps:
And S01, acquiring a depth map and a color map of the target and preprocessing the depth map and the color map.
Referring to FIG. 2, the depth map D1 and the color map C1 are aligned, and the contrast and brightness of the color map C1 are adjusted.
S02, extracting the minimum depth range containing the complete target from the depth map D1, and deleting the corresponding pixel points outside the depth threshold range in the color map C1 to obtain a map C2 and reduce the amount of subsequent computation, as shown in FIG. 3.
S03, drawing two bounding boxes on the color image C2 processed in S02 to mark different image areas.
(3-1) First, a box is drawn around an area containing the complete target, and all pixels outside the box are taken as background pixels T_B, as shown in FIG. 4.
(3-2) A second box is then drawn inside the first around the area that is definitely the target; all pixels inside the second box are taken as foreground pixels T_F, and the pixels between the two boxes as "possible target" pixels T_U, as shown in FIG. 5.
(3-3) For each pixel n in T_B, the label is initialized to α_n = 0, i.e., a background pixel; for each pixel n in T_F, α_n = 2; and for each pixel n in T_U, α_n = 1, i.e., a pixel that is "likely to be the target".
(3-4) From the pixels that possibly belong to the target (α_n = 1), together with the pixels that belong to the background (α_n = 0) and those that certainly belong to the target (α_n = 2), the GMMs of the target and the background are estimated.
S04, iterative energy-minimization segmentation.
(4-1) The pixels belonging to the target and those belonging to the background are each clustered into K classes by the K-means algorithm, i.e., into the K Gaussian components of the respective GMM (Gaussian mixture model), so that every Gaussian component obtains a set of pixel samples;
(4-2) A Gaussian component of the GMM is assigned to each pixel: if pixel n is a target pixel, the RGB value of pixel n is substituted into each Gaussian component of the target GMM and the most probable one is chosen, i.e., the k_n-th Gaussian component:
$k_n = \arg\min_{k_n} D_n(\alpha_n, k_n, \theta, z_n)$
where α_n is the pixel label, θ and k_n are GMM parameters, and z_n is a pixel of the RGB-space image.
(4-3) According to the Gaussian component to which each pixel belongs from step (4-1), the pixel sample set of each Gaussian component is obtained; the mean and covariance parameters are then estimated from the RGB values of the pixel samples, and the weight of a Gaussian component is determined by the ratio of the number of pixels assigned to it to the total number of pixels:
$D_n(\alpha_n, k_n, \theta, z_n) = -\log \pi(\alpha_n, k_n) + \tfrac{1}{2}\log\det\Sigma(\alpha_n, k_n) + \tfrac{1}{2}[z_n - \mu(\alpha_n, k_n)]^{\top}\Sigma(\alpha_n, k_n)^{-1}[z_n - \mu(\alpha_n, k_n)]$
where D_n(·) is the Gaussian-mixture data term, θ and k are GMM parameters (π the weights, μ the means, Σ the covariances), α is the label of pixel n, and z is a pixel of the RGB-space image.
(4-4) A graph is built from the Gibbs energy, the weights of the t-links and n-links are computed from the estimated means, covariances and weights, and the segmentation is then solved with the max-flow/min-cut algorithm:
$\alpha = \arg\min_{\alpha} E(\alpha, k, \theta, z), \quad E(\alpha, k, \theta, z) = U(\alpha, k, \theta, z) + V(\alpha, z)$
where α is the pixel label (α = 0 the background, α = 1 the possible region, α = 2 the target region) and θ, k are GMM parameters.
(4-5) repeating the steps (4-2) to (4-4) in the iterative minimization process until convergence.
(4-6) The segmented boundary is smoothed with border matting; the specific process is as follows:
According to the boundary information of the image, a mask of the image is selected and edge detection is performed with the Canny operator; the image contour is parameterized from the obtained edge information, the pixels belonging to the same continuous contour are found, the pixel information of all contours is stored, and the Δ and σ of each contour pixel are obtained by minimizing the energy function:
$E = \sum_{n \in T_U} D_n(\alpha_n) + \sum_{t=1}^{T} V(\Delta_t, \sigma_t, \Delta_{t+1}, \sigma_{t+1})$
where D_n is the data term and V the smoothing term;
The alpha values are computed with a Sigmoid function, and the alpha mask of the whole image is fused with the original image to obtain the border-matted image; the final segmentation result is shown in FIG. 6. An end-to-end sketch of the pipeline is given below.
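A hedged end-to-end sketch of the embodiment using OpenCV's built-in GrabCut, which implements the color-only core of steps S03-S04; the depth pre-filtering of S02 and the border matting of (4-6) are the patent's additions and are not part of cv2.grabCut. The file name and box coordinates are illustrative:

```python
import cv2
import numpy as np

color = cv2.imread("C2.png")                 # color map after depth filtering (S02)
rect = (50, 40, 300, 260)                    # first bounding box: (x, y, w, h)
mask = np.zeros(color.shape[:2], np.uint8)
bgd_model = np.zeros((1, 65), np.float64)    # GMM parameter buffer (background)
fgd_model = np.zeros((1, 65), np.float64)    # GMM parameter buffer (foreground)

# Five iterations of the assign/estimate/min-cut loop of steps (4-2)-(4-4).
cv2.grabCut(color, mask, rect, bgd_model, fgd_model, 5, cv2.GC_INIT_WITH_RECT)

# Keep definite and probable foreground as the segmentation template.
fg = np.where((mask == cv2.GC_FGD) | (mask == cv2.GC_PR_FGD), 1, 0).astype(np.uint8)
result = color * fg[:, :, np.newaxis]
```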
For pictures on which edge enhancement cannot be performed, the target edge segmented with this depth-perception method is smoother and clearer. FIG. 7 shows segmentation results for other images obtained with the segmentation method of the invention, wherein (a) is the picture before segmentation and (b) the picture after segmentation.
Compared with existing segmentation methods, the method of the invention achieves a better segmentation effect and higher segmentation accuracy.
The embodiments described above are intended to illustrate the technical solutions and advantages of the present invention, and it should be understood that the above-mentioned embodiments are only specific embodiments of the present invention, and are not intended to limit the present invention, and any modifications, additions and equivalents made within the scope of the principles of the present invention should be included in the scope of the present invention.

Claims (6)

1. An image segmentation method based on depth perception is characterized by comprising the following steps:
(1) Acquiring a depth map and a color map of a target and preprocessing the depth map and the color map;
(2) extracting the minimum depth range containing the target from the depth map, and deleting the pixels of the corresponding color map that lie outside the depth threshold;
(3) drawing two bounding boxes on the color image processed in step (2), wherein the first box encloses the minimum area containing the complete target and the second box marks the definite foreground area within it; all pixels outside the first box are taken as background pixels T_B, all pixels inside the second box as target pixels T_F, and the pixels between the two boxes as possible target pixels T_U;
(4) estimating Gaussian mixture models of the target and the background from the pixels that possibly belong to the target, and performing iterative segmentation with the Gaussian mixture models.
2. The depth-perception-based image segmentation method according to claim 1, wherein in step (1), the preprocessing comprises aligning the depth map with the color map and adjusting contrast and brightness.
3. The depth-perception-based image segmentation method according to claim 1, wherein in step (3), for each pixel n in T_B, the label is initialized to α_n = 0; for each pixel n in T_F, α_n = 2; and for each pixel n in T_U, α_n = 1.
4. The depth-perception-based image segmentation method according to claim 1, wherein in step (4), the iterative segmentation with the Gaussian mixture models specifically comprises the steps of:
(4-1) clustering the pixels belonging to the target and those belonging to the background into K classes each by the K-means algorithm, i.e., into the K Gaussian components of the respective GMM, so that every Gaussian component obtains a set of pixel samples;
(4-2) assigning a Gaussian component of the GMM to each pixel;
(4-3) obtaining the pixel sample set of each Gaussian component according to the Gaussian component to which each pixel belongs from step (4-1), estimating the mean and covariance parameters from the RGB values of the pixel samples, and determining the weight of a Gaussian component by the ratio of the number of pixels assigned to it to the total number of pixels:
$D_n(\alpha_n, k_n, \theta, z_n) = -\log \pi(\alpha_n, k_n) + \tfrac{1}{2}\log\det\Sigma(\alpha_n, k_n) + \tfrac{1}{2}[z_n - \mu(\alpha_n, k_n)]^{\top}\Sigma(\alpha_n, k_n)^{-1}[z_n - \mu(\alpha_n, k_n)]$
where D_n(·) is the Gaussian-mixture data term, θ and k are GMM parameters, α is the label of pixel n, and z is a pixel of the RGB-space image;
(4-4) building a graph from the Gibbs energy, computing the weights of the t-links and n-links from the estimated means, covariances and weights, and then solving the segmentation with the max-flow/min-cut algorithm:
$\alpha = \arg\min_{\alpha} E(\alpha, k, \theta, z), \quad E(\alpha, k, \theta, z) = U(\alpha, k, \theta, z) + V(\alpha, z)$
wherein α is the pixel label (α = 0 the background, α = 1 the possible region, α = 2 the target region) and θ, k are GMM parameters;
(4-5) repeating steps (4-2) to (4-4) in the iterative minimization process until convergence, yielding a segmentation template for separating the target from the background.
5. The depth-perception-based image segmentation method according to claim 4, wherein in step (4-2), a Gaussian component of the GMM is assigned to each pixel by finding the component with the highest probability for that pixel; if pixel n is a target pixel, the RGB value of pixel n is substituted into each Gaussian component of the target GMM and the most probable one is chosen, i.e., the k_n-th Gaussian component:
$k_n = \arg\min_{k_n} D_n(\alpha_n, k_n, \theta, z_n)$
where α_n is the pixel label, θ and k_n are GMM parameters, and z_n is a pixel of the RGB-space image.
6. The depth-perception-based image segmentation method according to claim 1, wherein in step (4), after the segmentation, the method further comprises smoothing the segmented boundary with border matting, the specific process being as follows:
according to the boundary information of the image, selecting a mask of the image and performing edge detection with the Canny operator; parameterizing the image contour from the obtained edge information, finding the pixels belonging to the same continuous contour, storing the pixel information of all contours, and obtaining the Δ and σ of each contour pixel by minimizing the energy function:
$E = \sum_{n \in T_U} D_n(\alpha_n) + \sum_{t=1}^{T} V(\Delta_t, \sigma_t, \Delta_{t+1}, \sigma_{t+1})$
where D_n is the data term, V the smoothing term, α_n the pixel label, and Δ_t, σ_t the Δ and σ of a contour pixel;
and computing the alpha values with a Sigmoid function and fusing the alpha mask of the whole image with the original image to obtain the border-matted image.
CN201910661047.5A 2019-07-22 2019-07-22 Image segmentation method based on depth perception Pending CN110570436A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910661047.5A CN110570436A (en) 2019-07-22 2019-07-22 Image segmentation method based on depth perception

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910661047.5A CN110570436A (en) 2019-07-22 2019-07-22 Image segmentation method based on depth perception

Publications (1)

Publication Number Publication Date
CN110570436A true CN110570436A (en) 2019-12-13

Family

ID=68773815

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910661047.5A Pending CN110570436A (en) 2019-07-22 2019-07-22 Image segmentation method based on depth perception

Country Status (1)

Country Link
CN (1) CN110570436A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105989604A (en) * 2016-02-18 2016-10-05 合肥工业大学 Target object three-dimensional color point cloud generation method based on KINECT
CN107463622A (en) * 2017-07-06 2017-12-12 西南交通大学 A kind of automatic Symbolic method for keeping landmark shape facility
CN108447068A (en) * 2017-12-22 2018-08-24 杭州美间科技有限公司 Ternary diagram automatic generation method and the foreground extracting method for utilizing the ternary diagram
CN108596919A (en) * 2018-04-24 2018-09-28 重庆邮电大学 A kind of Automatic image segmentation method based on depth map

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105989604A (en) * 2016-02-18 2016-10-05 合肥工业大学 Target object three-dimensional color point cloud generation method based on KINECT
CN107463622A (en) * 2017-07-06 2017-12-12 西南交通大学 A kind of automatic Symbolic method for keeping landmark shape facility
CN108447068A (en) * 2017-12-22 2018-08-24 杭州美间科技有限公司 Ternary diagram automatic generation method and the foreground extracting method for utilizing the ternary diagram
CN108596919A (en) * 2018-04-24 2018-09-28 重庆邮电大学 A kind of Automatic image segmentation method based on depth map

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
CARSTEN ROTHER et al.: ""GrabCut" — Interactive Foreground Extraction using Iterated Graph Cuts", 《ACM》 *
SHUFANG LU et al.: "Depth-aware image vectorization and editing", 《THE VISUAL COMPUTER》 *
WANG FEI et al.: "A Graph-Based Interactive Object Segmentation Algorithm" (一种基于图的交互式目标分割算法), 《计算机工程与应用》 (Computer Engineering and Applications) *

Similar Documents

Publication Publication Date Title
CN109961049B (en) Cigarette brand identification method under complex scene
CN107452010B (en) Automatic cutout algorithm and device
CN109859171B (en) Automatic floor defect detection method based on computer vision and deep learning
CN108537239B (en) Method for detecting image saliency target
EP2811423B1 (en) Method and apparatus for detecting target
CN111415363B (en) Image edge identification method
CN108447068B (en) Ternary diagram automatic generation method and foreground extraction method using ternary diagram
CN108846844B (en) Sea surface target detection method based on sea antenna
CN109975196B (en) Reticulocyte detection method and system
CN113298809B (en) Composite material ultrasonic image defect detection method based on deep learning and superpixel segmentation
CN111275727A (en) Cell segmentation method, cell segmentation apparatus, and computer-readable storage medium
CN108038458B (en) Method for automatically acquiring outdoor scene text in video based on characteristic abstract diagram
CN111868783B (en) Region merging image segmentation algorithm based on boundary extraction
CN108335268A (en) A method of the coloured image deblurring based on blind deconvolution
CN107704864B (en) Salient object detection method based on image object semantic detection
CN105913434B (en) A kind of positioning of leucocyte and iterative splitting approach
CN116433978A (en) Automatic generation and automatic labeling method and device for high-quality flaw image
CN116596899A (en) Method, device, terminal and medium for identifying circulating tumor cells based on fluorescence image
CN110570436A (en) Image segmentation method based on depth perception
CN112270683B (en) IHC digital preview image identification and organization foreground segmentation method and system
CN116563659A (en) Optical smoke detection method combining priori knowledge and feature classification
CN112581487A (en) Method for automatically extracting detection area and positioning kernel
CN115601358B (en) Tongue picture image segmentation method under natural light environment
CN112819841B (en) Face region segmentation method and device, computer equipment and storage medium
Futagami et al. Fast and robust building extraction based on HSV color analysis using color segmentation and GrabCut

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20191213