CN108090913B - Image semantic segmentation method based on object-level Gauss-Markov random field - Google Patents

Image semantic segmentation method based on object-level Gauss-Markov random field

Info

Publication number
CN108090913B
CN108090913B (application CN201711316006.XA)
Authority
CN
China
Prior art keywords
segmentation
level
field
region
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201711316006.XA
Other languages
Chinese (zh)
Other versions
CN108090913A (en)
Inventor
郑晨 (Zheng Chen)
姚鸿泰 (Yao Hongtai)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Qingchen Technology Co ltd
Original Assignee
Henan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Henan University filed Critical Henan University
Priority to CN201711316006.XA priority Critical patent/CN108090913B/en
Publication of CN108090913A publication Critical patent/CN108090913A/en
Application granted granted Critical
Publication of CN108090913B publication Critical patent/CN108090913B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical


Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T 7/00 Image analysis
    • G06T 7/10 Segmentation; Edge detection
    • G06T 7/143 Segmentation; Edge detection involving probabilistic approaches, e.g. Markov random field [MRF] modelling
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T 7/00 Image analysis
    • G06T 7/10 Segmentation; Edge detection
    • G06T 7/11 Region-based segmentation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T 2207/00 Indexing scheme for image analysis or image enhancement
    • G06T 2207/10 Image acquisition modality
    • G06T 2207/10024 Color image

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Probability & Statistics with Applications (AREA)
  • Software Systems (AREA)
  • Image Analysis (AREA)
  • Coloring Foods And Improving Nutritive Qualities (AREA)

Abstract

The invention provides an image semantic segmentation method based on an object-level Gauss-Markov random field, comprising the following steps: perform an initial over-segmentation of the pixel-level image to obtain an object-level image and a region adjacency graph, and define a neighborhood system, an observation feature field and a segmentation marker field on the region adjacency graph; according to the object-level segmentation marker field and the neighborhood system, perform Gauss-Markov modeling on the features of each region of the observation feature field and the features of its neighborhood, and construct an object-level linear regression equation for each region; perform probability modeling on the feature field and the marker field respectively, obtain the posterior distribution of the segmentation marker field according to the Bayes criterion, and obtain the final segmentation result according to the maximum a posteriori probability criterion. The method can be used in systems that perform batch semantic segmentation of images with complex semantics and high spatial resolution, and greatly improves working efficiency compared with manual segmentation.

Description

Image semantic segmentation method based on object-level Gauss-Markov random field
Technical Field
The invention relates to the technical field of image semantic segmentation, in particular to an image semantic segmentation method based on an object-level Gauss-Markov random field.
Background
Image semantic segmentation refers to grouping pixels in an image according to different semantics expressed in the image, and the process is performed autonomously by a machine.
With the continuous development of modern sensor manufacturing and imaging techniques, the spatial resolution of the processed images keeps increasing and the number of acquired images grows exponentially, so manual segmentation is far too inefficient. Conventional pixel-level segmentation methods cannot take spatial information over a wider range into account, and a large amount of information is therefore wasted. In recent years, object-level geographic analysis has become a hot technology for extracting image information; applied to image semantic segmentation, it can take wider-range spatial information into account, but it ignores the interaction between region features, so the segmentation accuracy still needs improvement. Therefore, there is a need for an image semantic segmentation method that both ensures the full utilization of spatial information and considers the interaction between region features.
Disclosure of Invention
Aiming at the technical problem that existing image semantic segmentation methods neither fully utilize spatial information nor consider the interaction between region features, the invention provides an image semantic segmentation method based on an object-level Gauss-Markov random field which both ensures the full utilization of spatial information and takes the interaction between region features into account.
In order to achieve the purpose, the technical scheme of the invention is realized as follows: an image semantic segmentation method based on an object-level Gauss-Markov random field is characterized by comprising the following steps of:
Step one: perform an initial over-segmentation of the read pixel-level image to obtain an object-level image consisting of over-segmented regions and the corresponding object-level region adjacency graph RAG, and define on the region adjacency graph RAG a neighborhood system $N^O$, an object-level observation feature field $Y^O$ and an object-level segmentation marker field $X^O$;
Step two: segmenting the marker field X according to the object levelOAnd neighbor system NOTo the observation characteristic field YOEach region r ofiThe features of (a) and the features of its neighborhood are modeled by Gauss-Markov, and the structure is constructed for each region riI 1, …, l;
Step three: perform probability modeling on the observation feature field $Y^O$ and the segmentation marker field $X^O$ respectively, obtain the posterior distribution of the segmentation marker field $X^O$ according to the Bayes criterion, update the segmentation iteratively by applying the maximum a posteriori probability criterion, and solve to obtain the final segmentation.
The specific implementation steps of the first step are as follows:
1) Define a position index set and a pixel-level observation feature set for the input high-spatial-resolution three-channel image I(R, G, B). Assuming the resolution of I(R, G, B) is m×n, this gives the position index set $S = \{s_{xy} \mid 1 \le x \le m,\ 1 \le y \le n\}$ and the pixel-level observation feature set $Y^P = \{y_s \mid s \in S\}$, where $y_s = (y_s^R, y_s^G, y_s^B)$ is the observation feature value of the pixel at position s, $y_s^R$, $y_s^G$, $y_s^B$ are the values of the R, G, B components of the image, m is the length of the image, n is the width of the image, and (x, y) are the position coordinates of the pixel in the image;
2) Perform over-segmentation of the pixel-level image with the mean-shift method under the set minimum area: over-segment the image I(R, G, B) into l regions of area at least $s_{\min}$, and assign each region a label, yielding the label matrix $L_s = \{l_s \mid s \in S\}$, where $l_s \in \{1, \dots, l\}$, $s \in S$; this gives the position index set of the object-level image $R = \{r_1, r_2, \dots, r_l\}$, where region $r_i = \{s \mid l_s = i\}$;
3) Obtain the object-level region adjacency graph G = (R, E) from the over-segmentation, where the position index set R is the set of object-level elements, each element representing an over-segmented region, and $E = \{e_{ij} \mid 1 \le i, j \le l\}$ represents the adjacency relation; the element $e_{ij}$ is the number of adjacent pixel pairs between region $r_i$ and region $r_j$, and $e_{ij} \ne 0$ if and only if the elements $r_i$ and $r_j$ are adjacent;
4) Define on the region adjacency graph G the object-level observation feature field $Y^O = \{y_i^O \mid 1 \le i \le l\}$ and the object-level segmentation marker field $X^O = \{X_i^O \mid 1 \le i \le l\}$, where $y_i^O$ is the observed feature of region $r_i$ and $|r_i|$ denotes the number of pixels inside region $r_i$; $X^O$ is a random field whose component $X_i^O$ is a random variable taking values $x_i^O \in K = \{1, \dots, k\}$, where K is the set of segmentation classes and k is the preset number of segmentation classes;
5) Define the object-level neighborhood system from the object-level region adjacency graph G = (R, E): $N^O = \{N_i^O \mid 1 \le i \le l\}$, where $N_i^O = \{r_j \mid e_{ij} \ne 0,\ j \ne i\}$.
the second step comprises the following specific steps:
1) In the region adjacency graph G = (R, E), the number of pixels contained in each over-segmented region can be obtained from the position index set R and taken as the area parameter of the object-level element, giving the area matrix $RS = \{RS_i \mid 1 \le i \le l\}$, where $RS_i = |r_i|$;
2) Let $x^O$ be a realization of the object-level segmentation marker field $X^O$; from $x^O$, obtain the feature mean and the feature covariance matrix of each class as follows:
(a) Given that the object-level segmentation marker field is realized as $x^O$, compute the segmentation class of every pixel of the original image, i.e. the pixel-level segmentation marker matrix $x^P = \{x_s \mid s \in S\}$, where $x_s = x_i^O$ for $s \in r_i$;
(b) Compute the feature mean $m = \{m_i \mid 1 \le i \le k\}$ and the feature covariance matrix $\Sigma = \{\Sigma_i \mid 1 \le i \le k\}$, where, writing $S_i = \{s \mid x_s = i\}$ for the pixels of class i,
$m_i = \frac{1}{|S_i|} \sum_{s \in S_i} y_s, \qquad \Sigma_i = \frac{1}{|S_i|} \sum_{s \in S_i} (y_s - m_i)(y_s - m_i)^{\mathrm T};$
3) For each object-level element $r_i$, given that its segmentation marker is realized as $x_i^O$, construct a linear regression equation that expresses the feature of $r_i$ in terms of the features of its neighboring regions $r_j \in N_i^O$, with fixed coefficients determined by the region areas and the adjacent boundary lengths, and a residual term $e_i \sim N(0, \Sigma_h)$ that is Gaussian white noise.
The concrete method of the third step is as follows:
1) Perform probability modeling on the object-level observation feature field $Y^O$: instead of directly modeling the joint probability of the observed features, jointly model the residual terms of the object-level linear regression equations constructed for each object-level element $r_i$, obtaining the likelihood function of the feature field
$P(y^O \mid x^O) = \prod_{i=1}^{l} N(e_i;\, 0, \Sigma_h),$
where $N(\cdot\,; 0, \Sigma_h)$ denotes the multivariate Gaussian density of the residual $e_i$;
2) Perform probability modeling on the object-level segmentation marker field $X^O$: by the Markov-Gibbs equivalence, the object-level segmentation marker field obeys a Gibbs distribution, so the prior distribution of the marker field is
$P(X^O = x^O) = \frac{1}{Z} \exp\big(-U(x^O)\big), \qquad Z = \sum_{x^O} \exp\big(-U(x^O)\big),$
$U(x^O) = \sum_{r_i \in R} \sum_{r_j \in N_i^O} V_2\big(x_i^O, x_j^O\big),$
where Z is the normalization constant, $U(x^O)$ is the energy of the realization $x^O$ of the marker field, K is the set of segmentation classes, and $V_2(\cdot,\cdot)$ is the pairwise clique potential function, given by the Potts model: $V_2(x_i^O, x_j^O) = -\beta$ if $x_i^O = x_j^O$ and $V_2(x_i^O, x_j^O) = \beta$ otherwise, with $\beta > 0$ the interaction parameter;
3) From the Bayes formula, the posterior distribution of the marker field is
$P\big(X^O = x^O \mid Y^O = y^O\big) \propto P\big(Y^O = y^O \mid X^O = x^O\big)\, P\big(X^O = x^O\big),$
so obtaining the optimal segmentation markers is converted into the problem of maximizing the posterior distribution of the segmentation marker field $X^O$, i.e.
$\hat{x}^O = \arg\max_{x^O} P\big(X^O = x^O \mid Y^O = y^O\big).$
The segmentation markers are then updated through loop iteration to obtain the final segmentation result.
The specific implementation process of the loop iteration is as follows:
1) First, run the pixel-level MRF method with the classical ICM algorithm to obtain the segmentation class of every pixel, i.e. the pixel-level segmentation field result $x^P = \{x_s \mid s \in S\}$; then obtain the initial realization of the object-level segmentation marker field, $x^{O(0)} = \{x_i^{O(0)} \mid 1 \le i \le l\}$, where $x_i^{O(0)} = \mathrm{mode}\{x_s \mid s \in r_i\}$ and mode is the mode (majority-vote) function;
2) From the realization $x^{O(t)}$ of the object-level segmentation marker field at step t, obtain the feature mean $m_i^{(t)}$ and the feature covariance $\Sigma_i^{(t)}$ corresponding to each class, computed from the regions currently assigned to that class (the formulas are the object-level analogues of those in step two, with $|r_i|$ the number of pixels inside region $r_i$);
3) Compute for each object-level element $r_i$ its object-level linear regression equation (as constructed in step two), obtaining the current residual term;
4) Compute the object-level feature-field probability and the marker-field probability respectively, and update the segmentation markers object by object: each $x_i^O$ is set to the class that maximizes the product of the likelihood of its regression residual and the Potts prior given the current labels of its neighbors.
the invention has the beneficial effects that: a semantic segmentation method for the formed RGB image with high spatial resolution is provided; the method can be used for semantic segmentation of batch processing of the RGB images with high spatial resolution, the segmentation efficiency is far higher than the traditional manual segmentation level, and the efficiency is higher than that of most of the existing object-oriented segmentation modes; the fixed value is directly given to the parameter to be estimated in the linear regression equation, so that the method is simple, convenient and quick to calculate and high in precision.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
FIG. 1 is a flow chart of the present invention.
FIG. 2 is a flow chart of initialization of the present invention.
FIG. 3 is an exemplary diagram of the initialization process of the present invention.
FIG. 4 is a flow chart of the linear regression equation construction of the present invention.
FIG. 5 is a flow chart of the joint modeling of the present invention.
FIG. 6 is a simulation diagram of the experiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be obtained by a person skilled in the art without inventive effort based on the embodiments of the present invention, are within the scope of the present invention.
As shown in fig. 1, an image semantic segmentation method based on an object-level Gauss-Markov random field includes the following steps:
Step one: perform an initial over-segmentation of the read pixel-level image to obtain an object-level image consisting of over-segmented regions and the corresponding object-level region adjacency graph RAG, and define on the region adjacency graph RAG a neighborhood system $N^O$, an object-level observation feature field $Y^O$ and an object-level segmentation marker field $X^O$.
In order to perform object-level image analysis and improve algorithm efficiency, an initial over-segmentation is required to obtain the region adjacency graph RAG. The RAG is derived from the spatial relationships between the over-segmented regions of the object-level image. The initialization over-segmentation uses a mean-shift method with a minimum-area factor: from the pixel-level image features, an object-level image representation is obtained using a mean-shift method governed by a minimum-area parameter (i.e., the number of pixels contained in an over-segmented region), and the object-level image features are then obtained. As shown in Fig. 2, the specific implementation steps are as follows:
1) Define a position index set and a pixel-level observation feature set for the input high-spatial-resolution three-channel image I(R, G, B). Assuming the resolution of I(R, G, B) is m×n, this gives the position index set $S = \{s_{xy} \mid 1 \le x \le m,\ 1 \le y \le n\}$ and the pixel-level observation feature set $Y^P = \{y_s \mid s \in S\}$, where $y_s = (y_s^R, y_s^G, y_s^B)$ is the observation feature value of the pixel at position s, $y_s^R$, $y_s^G$, $y_s^B$ are the values of the R, G, B components of the image, m is the length of the image, n is the width of the image, and (x, y) are the position coordinates of the pixel in the image.
2) Perform over-segmentation of the pixel-level image with the mean-shift method under the set minimum area: over-segment the image I(R, G, B) into l regions of area at least $s_{\min}$, and assign each region a label, yielding the label matrix $L_s = \{l_s \mid s \in S\}$, where $l_s \in \{1, \dots, l\}$, $s \in S$. This gives the position index set of the object-level image, $R = \{r_1, r_2, \dots, r_l\}$, where region $r_i = \{s \mid l_s = i\}$. As shown in Fig. 3(a), the processing result is rendered in grayscale, and the lines in the figure mark the segmentation boundaries.
3) Obtain the object-level region adjacency graph G = (R, E) from the over-segmentation, where the position index set R is the set of object-level elements, each element representing an over-segmented region. $E = \{e_{ij} \mid 1 \le i, j \le l\}$ represents the adjacency relation; the element $e_{ij}$ is the number of adjacent pixel pairs between region $r_i$ and region $r_j$, and $e_{ij} \ne 0$ if and only if the elements $r_i$ and $r_j$ are adjacent.
4) Define on the region adjacency graph G the object-level observation feature field $Y^O = \{y_i^O \mid 1 \le i \le l\}$ and the object-level segmentation marker field $X^O = \{X_i^O \mid 1 \le i \le l\}$, where $y_i^O$ is the observed feature of region $r_i$ and $|r_i|$ denotes the number of pixels inside region $r_i$. $X^O$ is a random field whose component $X_i^O$ is a random variable representing the class of the over-segmented region $r_i$, taking values $x_i^O \in K = \{1, \dots, k\}$, where K is the set of segmentation classes and k is the preset number of segmentation classes.
5) Define the object-level neighborhood system from the object-level region adjacency graph G = (R, E): $N^O = \{N_i^O \mid 1 \le i \le l\}$, where $N_i^O = \{r_j \mid e_{ij} \ne 0,\ j \ne i\}$.
fig. 3(b) is an enlarged view of a part of the rectangular frame in fig. 3(a), and the neighborhood labeling of each region in fig. 3(b) is as shown in fig. 3 (c).
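To make the initialization concrete, the following is a minimal NumPy sketch of how the object-level quantities of step one (region areas $RS_i$, mean region features $y_i^O$, adjacency counts $e_{ij}$ and the neighborhood system $N_i^O$) can be derived from an over-segmentation label matrix. It assumes the label matrix has already been produced by a mean-shift over-segmentation with a minimum-area constraint; the function name build_rag and the use of the region mean color as $y_i^O$ are illustrative assumptions, not taken from the patent text.

import numpy as np

def build_rag(labels, image):
    # labels: (m, n) integer array with region indices 1..l from the over-segmentation
    # image:  (m, n, 3) array holding the R, G, B observation features
    m, n = labels.shape
    l = int(labels.max())
    flat = labels.ravel()
    areas = np.bincount(flat, minlength=l + 1)[1:]                 # RS_i = |r_i|
    feats = np.zeros((l, 3))
    for c in range(3):                                             # mean R, G, B value per region
        feats[:, c] = np.bincount(flat, weights=image[..., c].ravel(),
                                  minlength=l + 1)[1:] / areas
    e = np.zeros((l, l), dtype=int)                                # adjacency counts e_ij
    for a, b in [(labels[:, :-1], labels[:, 1:]), (labels[:-1, :], labels[1:, :])]:
        diff = a != b
        for u, v in zip(a[diff], b[diff]):                         # count adjacent pixel pairs
            e[u - 1, v - 1] += 1
            e[v - 1, u - 1] += 1
    neighbours = [np.flatnonzero(e[i]) for i in range(l)]          # N_i^O (0-based indices)
    return areas, feats, e, neighbours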
Step two: according to the object-level segmentation marker field $X^O$ and the neighborhood system $N^O$, perform Gauss-Markov modeling on the features of each region $r_i$ of the observation feature field $Y^O$ and the features of its neighborhood, and construct an object-level linear regression equation for each region $r_i$, $i = 1, \dots, l$.
The object-level linear regression equation uses the area and the boundary length of the object-level elements as its parameters, and one such equation is constructed for each object-level element. As shown in Fig. 4, the specific steps are as follows (a hedged code sketch of this step is given after the list):
1) In the region adjacency graph G = (R, E), the number of pixels contained in each over-segmented region can be obtained from the position index set R and taken as the area parameter of the object-level element, giving the area matrix $RS = \{RS_i \mid 1 \le i \le l\}$, where $RS_i = |r_i|$.
2) Let $x^O$ be a realization of the object-level segmentation marker field $X^O$; from $x^O$, obtain the feature mean and the feature covariance matrix of each class as follows:
(a) Given that the object-level segmentation marker field is realized as $x^O$, compute the segmentation class of every pixel of the original image, i.e. the pixel-level segmentation marker matrix $x^P = \{x_s \mid s \in S\}$, where $x_s = x_i^O$ for $s \in r_i$;
(b) Compute the feature mean $m = \{m_i \mid 1 \le i \le k\}$ and the feature covariance matrix $\Sigma = \{\Sigma_i \mid 1 \le i \le k\}$, where, writing $S_i = \{s \mid x_s = i\}$ for the pixels of class i,
$m_i = \frac{1}{|S_i|} \sum_{s \in S_i} y_s, \qquad \Sigma_i = \frac{1}{|S_i|} \sum_{s \in S_i} (y_s - m_i)(y_s - m_i)^{\mathrm T}.$
3) For each object-level element $r_i$, given that its segmentation marker is realized as $x_i^O$, construct a linear regression equation that expresses the feature of $r_i$ in terms of the features of its neighboring regions $r_j \in N_i^O$, with fixed coefficients determined by the region areas and the adjacent boundary lengths, where, for convenience of calculation, the residual term $e_i \sim N(0, \Sigma_h)$ is assumed to be Gaussian white noise.
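The following Python sketch illustrates step two under stated assumptions: the class statistics are the per-class sample mean and covariance of the pixel features, and the fixed regression weights, which the patent derives from the region areas and shared boundary lengths $e_{ij}$, are approximated here by normalized shared-boundary lengths. The function names and the weighting choice are illustrative, not the patent's exact formula.

import numpy as np

def class_stats(pixel_labels, image, k):
    # Per-class feature mean m_i and covariance Sigma_i from the pixel-level labelling.
    feats = image.reshape(-1, 3).astype(float)
    labs = pixel_labels.ravel()
    means, covs = [], []
    for c in range(1, k + 1):
        sel = feats[labs == c]                      # assumes every class has at least two pixels
        means.append(sel.mean(axis=0))
        covs.append(np.cov(sel, rowvar=False))
    return np.array(means), np.array(covs)

def regression_residuals(region_feats, region_labels, neighbours, e, means):
    # Residual e_i of the object-level Gauss-Markov regression for every region:
    # the centred feature of r_i minus a weighted sum of its neighbours' centred features.
    region_labels = np.asarray(region_labels)
    res = np.zeros_like(region_feats, dtype=float)
    for i, nbrs in enumerate(neighbours):
        centred_i = region_feats[i] - means[region_labels[i] - 1]
        w = e[i, nbrs].astype(float)
        if w.sum() > 0:
            w /= w.sum()                            # illustrative fixed weights
        centred_nbrs = region_feats[nbrs] - means[region_labels[nbrs] - 1]
        res[i] = centred_i - w @ centred_nbrs
    return res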
Step three: perform probability modeling on the observation feature field $Y^O$ and the segmentation marker field $X^O$ respectively, obtain the posterior distribution of the segmentation marker field $X^O$ according to the Bayes criterion, update the segmentation iteratively by applying the maximum a posteriori probability criterion, and solve to obtain the final segmentation.
Probability modeling comprises constructing a multivariate normal distribution for the observation feature field $Y^O$ from the error terms of the object-level linear regression equations, and constructing a Gibbs distribution for the segmentation marker field $X^O$ using the Potts model. The final segmentation result is obtained by updating the iterative segmentation with samples from the Gibbs distribution and outputting the converged solution. As shown in Fig. 5, the specific operations are as follows (a small sketch of the probability models is given after this list):
1) Perform probability modeling on the object-level observation feature field $Y^O$: instead of directly modeling the joint probability of the observed features, jointly model the residual terms of the object-level linear regression equations constructed for each object-level element $r_i$, obtaining the likelihood function of the feature field
$P(y^O \mid x^O) = \prod_{i=1}^{l} N(e_i;\, 0, \Sigma_h),$
where $N(\cdot\,; 0, \Sigma_h)$ denotes the multivariate Gaussian density of the residual $e_i$.
2) Perform probability modeling on the object-level segmentation marker field $X^O$: the marker field has the Markov property, so by the Markov-Gibbs equivalence it obeys a Gibbs distribution, and the prior distribution of the marker field is
$P(X^O = x^O) = \frac{1}{Z} \exp\big(-U(x^O)\big), \qquad Z = \sum_{x^O} \exp\big(-U(x^O)\big),$
$U(x^O) = \sum_{r_i \in R} \sum_{r_j \in N_i^O} V_2\big(x_i^O, x_j^O\big),$
where Z is the normalization constant, $U(x^O)$ is the energy of the realization $x^O$ of the marker field, and $V_2(\cdot,\cdot)$ is the pairwise clique potential function, given by the Potts model: $V_2(x_i^O, x_j^O) = -\beta$ if $x_i^O = x_j^O$ and $V_2(x_i^O, x_j^O) = \beta$ otherwise, with $\beta > 0$ the interaction parameter.
3) From the Bayes formula, the posterior distribution of the marker field is
$P\big(X^O = x^O \mid Y^O = y^O\big) \propto P\big(Y^O = y^O \mid X^O = x^O\big)\, P\big(X^O = x^O\big),$
so obtaining the optimal segmentation markers is converted into the problem of maximizing the posterior distribution of the segmentation marker field $X^O$, i.e.
$\hat{x}^O = \arg\max_{x^O} P\big(X^O = x^O \mid Y^O = y^O\big).$
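A minimal sketch of these two probability models follows, under the conventions assumed above (residuals $e_i \sim N(0, \Sigma_h)$ and a Potts potential of $-\beta$ for equal neighbouring labels, $+\beta$ otherwise). The helper names are illustrative; in the MAP update the residual has to be recomputed for each candidate label, because the class mean used for centring changes with the label.

import numpy as np

def potts_energy(label_i, neighbour_labels, beta=0.5):
    # Sum of pairwise Potts potentials between region i and its neighbours.
    agree = (np.asarray(neighbour_labels) == label_i)
    return float(np.sum(np.where(agree, -beta, beta)))

def residual_neg_log_likelihood(residual, sigma_h):
    # -log p(e_i) for e_i ~ N(0, Sigma_h), dropping the constant (2*pi) term.
    inv = np.linalg.inv(sigma_h)
    return 0.5 * residual @ inv @ residual + 0.5 * np.log(np.linalg.det(sigma_h))

def local_posterior_energy(label_i, neighbour_labels, residual, sigma_h, beta=0.5):
    # Energy whose minimum over label_i realizes the per-region MAP update.
    return residual_neg_log_likelihood(residual, sigma_h) + potts_energy(label_i, neighbour_labels, beta)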
The segmentation markers are updated through loop iteration to obtain the final result. The specific loop iteration process is as follows (an illustrative sketch of the loop is given after the list):
1) First, run the pixel-level MRF (Markov random field) method with the classical ICM (iterated conditional modes) algorithm to obtain the class of every pixel, i.e. the pixel-level segmentation field result $x^P = \{x_s \mid s \in S\}$; then obtain the initial realization of the object-level segmentation marker field, $x^{O(0)} = \{x_i^{O(0)} \mid 1 \le i \le l\}$, where $x_i^{O(0)} = \mathrm{mode}\{x_s \mid s \in r_i\}$, i.e. for the over-segmented region $r_i$ the segmentation marker is the mode of the segmentation markers of its interior pixels.
2) From the realization $x^{O(t)}$ of the object-level segmentation marker field at step t, obtain the feature mean $m_i^{(t)}$ and the feature covariance $\Sigma_i^{(t)}$ corresponding to each class, computed from the regions currently assigned to that class (the formulas are the object-level analogues of those in step two, with $|r_i|$ the number of pixels inside region $r_i$).
3) Compute for each object-level element $r_i$ its object-level linear regression equation (as constructed in step two), obtaining the current residual term.
4) Compute the object-level feature-field probability and the marker-field probability respectively, and update the segmentation markers object by object: each $x_i^O$ is set to the class that maximizes the product of the likelihood of its regression residual and the Potts prior given the current labels of its neighbors.
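Putting the pieces together, the loop can be sketched as an ICM-style sweep over the regions. This is a hedged illustration that reuses the helper functions sketched above (regression_residuals, local_posterior_energy), keeps the residual covariance $\Sigma_h$ fixed, and recomputes the class means from the current object-level labelling as an area-weighted average; that weighting is one plausible reading of the update step, not the patent's exact formula.

import numpy as np

def initial_region_labels(pixel_labels, regions):
    # regions: list of 1-D arrays of flat pixel indices, one per region.
    # x_i^{O(0)} = mode of the pixel-level ICM labels inside region r_i.
    flat = pixel_labels.ravel()
    return np.array([np.bincount(flat[idx]).argmax() for idx in regions])

def class_means_from_regions(region_feats, areas, x, k):
    # Area-weighted feature mean per class from the current object-level labelling.
    means = np.zeros((k, region_feats.shape[1]))
    for c in range(1, k + 1):
        sel = (x == c)                              # assumes every class keeps at least one region
        means[c - 1] = np.average(region_feats[sel], axis=0, weights=areas[sel])
    return means

def icm_object_level(region_feats, areas, neighbours, e, x0, k, sigma_h, beta=0.5, max_iter=30):
    x = np.asarray(x0).copy()
    for _ in range(max_iter):
        means = class_means_from_regions(region_feats, areas, x, k)
        changed = False
        for i in range(len(x)):
            energies = []
            for c in range(1, k + 1):               # try every candidate class for region i
                trial = x.copy()
                trial[i] = c
                # residuals are recomputed for the trial label (simple but not the fastest way)
                res = regression_residuals(region_feats, trial, neighbours, e, means)[i]
                energies.append(local_posterior_energy(c, x[neighbours[i]], res, sigma_h, beta))
            best = int(np.argmin(energies)) + 1
            if best != x[i]:
                x[i] = best
                changed = True
        if not changed:                             # converged labelling
            break
    return x

In practice the inputs would come from the initialization sketch (areas, region features, e, neighbours) and from the pixel-level ICM labelling of step 1); $\Sigma_h$ and $\beta$ (0.5 in the experiments below) are set beforehand.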
the present invention is a platform that is operated such that core i3-4160@3.6GHz, RAM: 4G, 64-bit win10 system, 2015a version matlab, the color image of aerial image 1024_1 is as shown in fig. 6(a1) (the color image is grayed), real manual segmentation is as shown in fig. 6(a2), ICM method is used for image 1024_1, β is 0.5, the color image of the segmentation result is as shown in fig. 6(a3), GMRF method is used for image 1024_1, β is 0.5, the color image of the segmentation result is as shown in fig. 6(a4), mrmrmrmrmrf method is used for image 1024_1, wavelet decomposition is three-layer, β is 0.5, the color image of the segmentation result is as shown in fig. 6(a5), the image 1024_1 is as shown in fig. 6, and mr5 is as shown in fig. 256, r 5 is used for image 1024 b, r 5, the image is as shown in fig. 6(a 583) and the image is as shown in fig. 26, r 3, r 5, r 3, r 5, r 3, r 5, r 5, r, r 5, r 5, r 5, r 5, r g. 7, r, G5, r 5, g. 7, G5, G3, G5, G3, G7, G7, G6 a r, G7, G7, G7, G7, G.
Table 1. Kappa coefficients of the segmentation results (the table values are provided as an image in the original document).
Table 2. Overall accuracy (OA) of the segmentation results (the table values are provided as an image in the original document).
As can be seen from Fig. 6 and Tables 1-2, the segmentation accuracy of the present invention is the best. The aerial image contains abundant texture information, and sub-objects of the same class can differ greatly in spectral value, while sub-objects of different classes may have similar spectral values. For example, in the urban class, roofs and courtyards have different spectral values, while trees in the urban and forest classes have similar spectral values. For these reasons, the three pixel-based methods produce many fragmented misclassifications. Compared with the pixel-based methods, the object-based methods treat the over-segmented regions as basic units and therefore markedly improve the segmentation accuracy. The OMRF method models the feature field with the probability distribution of the object features, while the proposed OGMRF-RC method models it with the probability distribution of the residual terms of the object-level linear regression equations. The advantage of the OGMRF-RC method is that it reduces the influence of within-class spectral variation on the segmentation during the iterations. For example, in the upper half of Fig. 6(a7), the large bare land and forest are accurately assigned to the idle-land class, unlike in Fig. 6(a6), where the OMRF method assigns them to the house class.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims (4)

1. An image semantic segmentation method based on an object-level Gauss-Markov random field is characterized by comprising the following steps of:
Step one: performing an initial over-segmentation of the read pixel-level image to obtain an object-level image consisting of over-segmented regions and the corresponding object-level region adjacency graph RAG, and defining on the region adjacency graph RAG a neighborhood system $N^O$, an object-level observation feature field $Y^O$ and an object-level segmentation marker field $X^O$;
Step two: according to the object-level segmentation marker field $X^O$ and the neighborhood system $N^O$, performing Gauss-Markov modeling on the features of each region $r_i$ of the observation feature field $Y^O$ and the features of its neighborhood, and constructing an object-level linear regression equation for each region $r_i$, $i = 1, \dots, l$;
Step three: performing probability modeling on the observation feature field $Y^O$ and the segmentation marker field $X^O$ respectively, obtaining the posterior distribution of the segmentation marker field $X^O$ according to the Bayes criterion, updating the segmentation iteratively by applying the maximum a posteriori probability criterion, and solving to obtain the final segmentation;
the second step comprises the following specific steps:
1) in the region adjacency graph G = (R, E), the number of pixels contained in each over-segmented region can be obtained from the position index set R and taken as the area parameter of the object-level element, giving the area matrix $RS = \{RS_i \mid 1 \le i \le l\}$, where $RS_i = |r_i|$;
2) letting $x^O$ be a realization of the object-level segmentation marker field $X^O$, and obtaining from $x^O$ the feature mean and the feature covariance matrix of each class as follows:
(a) given that the object-level segmentation marker field is realized as $x^O$, computing the segmentation class of every pixel of the original image, i.e. the pixel-level segmentation marker matrix $x^P = \{x_s \mid s \in S\}$, where $x_s = x_i^O$ for $s \in r_i$;
(b) computing the feature mean $\mu = \{\mu_i \mid 1 \le i \le k\}$ and the feature covariance matrix $\Sigma = \{\Sigma_i \mid 1 \le i \le k\}$, where, writing $S_i = \{s \mid x_s = i\}$ for the pixels of class i,
$\mu_i = \frac{1}{|S_i|} \sum_{s \in S_i} y_s, \qquad \Sigma_i = \frac{1}{|S_i|} \sum_{s \in S_i} (y_s - \mu_i)(y_s - \mu_i)^{\mathrm T},$
and $y_s$ denotes the observation feature value of the pixel at position s;
3) for each object-level element $r_i$, given that its segmentation marker is realized as $x_i^O$, constructing a linear regression equation that expresses the feature of $r_i$ in terms of the features of its neighboring regions $r_j \in N_i^O$, with fixed coefficients determined by the region areas and the adjacent boundary lengths, where the residual term $e_i \sim N(0, \Sigma_h)$ is Gaussian white noise and the element $e_{ij}$ is the number of adjacent pixel pairs between region $r_i$ and region $r_j$.
2. The method for image semantic segmentation based on the object-level Gauss-Markov random field according to claim 1, wherein the first step is implemented by the following steps:
1) defining a position index set and a pixel-level observation feature set for the input high-spatial-resolution three-channel image I(R, G, B); assuming the resolution of I(R, G, B) is m×n, this gives the position index set $S = \{s_{xy} \mid 1 \le x \le m,\ 1 \le y \le n\}$ and the pixel-level observation feature set $Y^P = \{y_s \mid s \in S\}$, where $y_s = (y_s^R, y_s^G, y_s^B)$ is the observation feature value of the pixel at position s, $y_s^R$, $y_s^G$, $y_s^B$ are the values of the R, G, B components of the image, m is the length of the image, n is the width of the image, and (x, y) are the position coordinates of the pixel in the image;
2) performing over-segmentation of the pixel-level image with the mean-shift method under the set minimum area: over-segmenting the image I(R, G, B) into l regions of area at least $s_{\min}$, and assigning each region a label, yielding the label matrix $L_s = \{l_s \mid s \in S\}$, where $l_s \in \{1, \dots, l\}$, $s \in S$; this gives the position index set of the object-level image $R = \{r_1, r_2, \dots, r_l\}$, where region $r_i = \{s \mid l_s = i\}$;
3) obtaining the object-level region adjacency graph G = (R, E) from the over-segmentation, where the position index set R is the set of object-level elements, each element representing an over-segmented region, and $E = \{e_{ij} \mid 1 \le i, j \le l\}$ represents the adjacency relation; the element $e_{ij}$ is the number of adjacent pixel pairs between region $r_i$ and region $r_j$, and $e_{ij} \ne 0$ if and only if the elements $r_i$ and $r_j$ are adjacent;
4) defining on the region adjacency graph G the object-level observation feature field $Y^O = \{y_i^O \mid 1 \le i \le l\}$ and the object-level segmentation marker field $X^O = \{X_i^O \mid 1 \le i \le l\}$, where $y_i^O$ is the observed feature of region $r_i$ and $|r_i|$ denotes the number of pixels inside region $r_i$; $X^O$ is a random field whose component $X_i^O$ is a random variable taking values $x_i^O \in K = \{1, \dots, k\}$, where K is the set of segmentation classes and k is the preset number of segmentation classes;
5) defining the object-level neighborhood system from the object-level region adjacency graph G = (R, E): $N^O = \{N_i^O \mid 1 \le i \le l\}$, where $N_i^O = \{r_j \mid e_{ij} \ne 0,\ j \ne i\}$.
3. the image semantic segmentation method based on the object-level Gauss-Markov random field according to claim 1, wherein the specific method of the third step is as follows:
1) performing probability modeling on the object-level observation feature field $Y^O$: instead of directly modeling the joint probability of the observed features, jointly modeling the residual terms of the object-level linear regression equations constructed for each object-level element $r_i$, obtaining the likelihood function of the feature field
$P(y^O \mid x^O) = \prod_{i=1}^{l} N(e_i;\, 0, \Sigma_h),$
where $N(\cdot\,; 0, \Sigma_h)$ denotes the multivariate Gaussian density of the residual $e_i$, and $y_i^O$ denotes the observed feature of region $r_i$;
2) performing probability modeling on the object-level segmentation marker field $X^O$: by the Markov-Gibbs equivalence, the object-level segmentation marker field obeys a Gibbs distribution, so the prior distribution of the marker field is
$P(X^O = x^O) = \frac{1}{Z} \exp\big(-U(x^O)\big), \qquad Z = \sum_{x^O} \exp\big(-U(x^O)\big),$
$U(x^O) = \sum_{r_i \in R} \sum_{r_j \in N_i^O} V_2\big(x_i^O, x_j^O\big),$
where Z is the normalization constant, $U(x^O)$ is the energy of the realization $x^O$ of the marker field, K is the set of segmentation classes, and $V_2(\cdot,\cdot)$ is the pairwise clique potential function, given by the Potts model: $V_2(x_i^O, x_j^O) = -\beta$ if $x_i^O = x_j^O$ and $V_2(x_i^O, x_j^O) = \beta$ otherwise, with $\beta > 0$ the interaction parameter;
3) obtaining from the Bayes formula the posterior distribution of the marker field,
$P\big(X^O = x^O \mid Y^O = y^O\big) \propto P\big(Y^O = y^O \mid X^O = x^O\big)\, P\big(X^O = x^O\big),$
so that obtaining the optimal segmentation markers is converted into the problem of maximizing the posterior distribution of the segmentation marker field $X^O$, i.e.
$\hat{x}^O = \arg\max_{x^O} P\big(X^O = x^O \mid Y^O = y^O\big),$
and updating the segmentation markers through loop iteration to finally obtain the segmentation result.
4. The image semantic segmentation method based on the object-level Gauss-Markov random field according to claim 3, wherein the loop iteration is realized by the following specific process:
1) first, running the pixel-level MRF method with the classical ICM algorithm to obtain the segmentation class of every pixel, i.e. the pixel-level segmentation field result $x^P = \{x_s \mid s \in S\}$, and then obtaining the initial realization of the object-level segmentation marker field, $x^{O(0)} = \{x_i^{O(0)} \mid 1 \le i \le l\}$, where $x_i^{O(0)} = \mathrm{mode}\{x_s \mid s \in r_i\}$ and mode is the mode function;
2) from the realization $x^{O(t)}$ of the object-level segmentation marker field at step t, obtaining the feature mean $m_i^{(t)}$ and the feature covariance $\Sigma_i^{(t)}$ corresponding to each class, computed from the regions currently assigned to that class, where $|r_i|$ denotes the number of pixels inside region $r_i$;
3) computing separately for each object-level element $r_i$ its object-level linear regression equation (as constructed in step two), obtaining the current residual term;
4) computing the object-level feature-field probability and the marker-field probability respectively, and updating the segmentation markers object by object according to the maximum a posteriori rule.
CN201711316006.XA 2017-12-12 2017-12-12 Image semantic segmentation method based on object-level Gauss-Markov random field Active CN108090913B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711316006.XA CN108090913B (en) 2017-12-12 2017-12-12 Image semantic segmentation method based on object-level Gauss-Markov random field

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711316006.XA CN108090913B (en) 2017-12-12 2017-12-12 Image semantic segmentation method based on object-level Gauss-Markov random field

Publications (2)

Publication Number Publication Date
CN108090913A CN108090913A (en) 2018-05-29
CN108090913B (en) 2020-06-19

Family

ID=62173916

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711316006.XA Active CN108090913B (en) 2017-12-12 2017-12-12 Image semantic segmentation method based on object-level Gauss-Markov random field

Country Status (1)

Country Link
CN (1) CN108090913B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108830319B (en) * 2018-06-12 2022-09-16 北京合众思壮科技股份有限公司 Image classification method and device
CN109615637A (en) * 2019-01-16 2019-04-12 中国科学院地理科学与资源研究所 A kind of improved remote sensing image Hybrid Techniques
CN111210433B (en) * 2019-04-16 2023-03-03 河南大学 Markov field remote sensing image segmentation method based on anisotropic potential function
CN110136143A (en) * 2019-05-16 2019-08-16 河南大学 Geneva based on ADMM algorithm multiresolution remote sensing image segmentation method off field

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103295236B (en) * 2013-05-29 2015-12-23 湘潭大学 Markov multiple features random field models construction method and brain MR image cutting techniques thereof
CN106600611B (en) * 2016-12-23 2020-04-14 西安电子科技大学 SAR image segmentation method based on sparse triple Markov field
CN106951830B (en) * 2017-02-23 2020-12-18 北京联合大学 Image scene multi-object marking method based on prior condition constraint
CN107180434B (en) * 2017-05-23 2019-10-18 中国地质大学(武汉) Polarization SAR image segmentation method based on super-pixel and fractal net work evolution algorithmic

Also Published As

Publication number Publication date
CN108090913A (en) 2018-05-29

Similar Documents

Publication Publication Date Title
Wang et al. Superpixel segmentation: A benchmark
CN106981068B (en) A kind of interactive image segmentation method of joint pixel pait and super-pixel
CN108537239B (en) Method for detecting image saliency target
CN108052966B (en) Remote sensing image scene automatic extraction and classification method based on convolutional neural network
CN108090913B (en) Image semantic segmentation method based on object-level Gauss-Markov random field
CN113706487B (en) Multi-organ segmentation method based on self-supervision characteristic small sample learning
CN102013017B (en) Method for roughly sorting high-resolution remote sensing image scene
Montazer et al. An improved radial basis function neural network for object image retrieval
CN103400151B (en) The optical remote sensing image of integration and GIS autoregistration and Clean water withdraw method
Alidoost et al. A CNN-based approach for automatic building detection and recognition of roof types using a single aerial image
CN110163239B (en) Weak supervision image semantic segmentation method based on super-pixel and conditional random field
CN106340016A (en) DNA quantitative analysis method based on cell microscope image
CN103985112B (en) Image segmentation method based on improved multi-objective particle swarm optimization and clustering
CN103345744B (en) A kind of human body target part automatic analytic method based on many images
CN112784869B (en) Fine-grained image identification method based on attention perception and counterstudy
He et al. Interactive segmentation techniques: algorithms and performance evaluation
CN111695636A (en) Hyperspectral image classification method based on graph neural network
CN111950406A (en) Finger vein identification method, device and storage medium
CN110853070A (en) Underwater sea cucumber image segmentation method based on significance and Grabcut
Wang et al. Adaptive nonlocal random walks for image superpixel segmentation
Li et al. A fast level set algorithm for building roof recognition from high spatial resolution panchromatic images
Khan et al. A modified adaptive differential evolution algorithm for color image segmentation
Farid et al. DOST: A distributed object segmentation tool
CN116343048A (en) Accurate land block boundary extraction method and system for plain crop type complex region
CN113409335B (en) Image segmentation method based on strong and weak joint semi-supervised intuitive fuzzy clustering

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20240117

Address after: Building 43, Zone A, Energy Conservation and Environmental Protection Innovation Park, No. 199 Hongwu Avenue, Tangqiao Town, Zhangjiagang City, Jiangsu Province, 215000

Patentee after: Suzhou Qingchen Technology Co.,Ltd.

Address before: No.85, Minglun street, Shunhe District, Kaifeng City, Henan Province

Patentee before: Henan University

TR01 Transfer of patent right