CN103942779A - Image segmentation method based on combination of graph theory and semi-supervised learning - Google Patents
Image segmentation method based on combination of graph theory and semi-supervised learning Download PDFInfo
- Publication number
- CN103942779A CN103942779A CN201410118303.3A CN201410118303A CN103942779A CN 103942779 A CN103942779 A CN 103942779A CN 201410118303 A CN201410118303 A CN 201410118303A CN 103942779 A CN103942779 A CN 103942779A
- Authority
- CN
- China
- Prior art keywords
- image
- node
- matrix
- semi
- supervised learning
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Abstract
The invention discloses an image segmentation method based on a combination of a graph theory and semi-supervised learning. The image segmentation method includes the steps that an image is roughly segmented into a certain number of region blocks and mapped into a weighted graph; a correlation matrix is constructed by using a semi-supervised learning method; semantic segmentation is performed on the image through a normalization segmentation method. According to the image segmentation method, the image is segmented on the basis of the graph theory and semi-supervised learning, precision of image segmentation can be improved, and the image segmentation method not only can be beneficial to image segmentation and objective extraction, but also can be beneficial for promoting development of the fields of pattern recognition, computer vision, artificial intelligence and the like.
Description
Technical field
The present invention relates to technical field of image processing, particularly a kind of image partition method combining based on graph theory and semi-supervised learning.
Background technology
Image is cut apart with target and is extracted as an important branch in Image processing and compute machine vision field, is attracting numerous researchers' concern always.Meanwhile, image is cut apart and is extracted in the fields such as pattern-recognition, computer vision, artificial intelligence with target and also has a wide range of applications.Therefore, image is cut apart to the further investigation of extracting with target and not only contribute to image to cut apart the perfect solution of extracting with target, and contribute to promote the development in the fields such as pattern-recognition, computer vision, artificial intelligence.This is also the direct driving force that the present invention studies.
Traditional image partition method has: average drifting method, that is: and a kind of dividing method based on region, smooth region and the texture region of the method to image is insensitive, has good robustness; Normalization dividing method, that is: a kind of dividing method based on graph theory, the method is selected very sensitive to parameter, and operand is larger; K Mean Method, that is: a kind of dividing method based on cluster, the method is applicable to image less demanding to segmentation precision or that image brightness distribution is wider and cuts apart.And the present invention can solve problem above well.
Summary of the invention
The object of the invention is that the method for employing based on graph theory is to Image Segmentation Using, utilize average drifting method to realize the coarse segmentation of image, then the image block after cutting apart is considered as to the node in graph model, and sets up the association between adjacent block, thereby build the weighted graph based on image block.The method is done corresponding modify to internodal incidence matrix, enables to build better internodal association, has improved the precision that image is cut apart.
The present invention solves the technical scheme that its technical matters takes: the invention provides a kind of image partition method combining based on graph theory and semi-supervised learning, described method comprises the steps:
Step 1: image rough segmentation is slit into the region unit of some, and is mapped to weighted graph;
The method that the present invention proposes is that block-based image is cut apart, and therefore the structure to associated diagram is necessarily improved.
In image is processed, it is generally acknowledged from the neighbor in same region and there is similar color value, and the element of correlation model W taking color as feature
be expressed as:
In above formula, P
irepresent the position of pixel i, C
irepresent its color value, θ
pand θ
cfor constant.
Each image-region obtaining after coarse segmentation is considered as a node by the present invention, the set that now the V in weighted graph is all nodes, and E is the limit of connected node between two, its weight is:
w
ij=exp(-θ
c||c
i-c
j||) (2)
In above formula, C
irepresent the color average of cut zone, θ
crepresent the constant of control weight.Can find out from (2) formula: in the time of weight calculation, be greater than from the degree of association between different block pixels from the degree of association between pixel in same cut zone, therefore (2) formula has also implied the local edge of image simultaneously.
Step 2: utilize semi-supervised learning method construct relevance matrix;
In the present invention, the present invention will utilize semi-supervised learning method to calculate all internodal incidence matrix.Specifically, first the present invention is divided into all nodes mark node and does not mark node (wherein marking node occupies the minority), then utilizes semi-supervised learning calculate mark node and do not mark internodal relevance.In general unsupervised learning framework, the label information transmission based on iterative process need spend a large amount of time, and therefore the present invention adopts the method for directly carrying out label propagation to reduce time cost.
Node m and all internodal interconnection vectors are expressed as:
a
m=α(I-(1-α)P)
-1b
m (3)
Wherein, unit matrix I dimension is image coarse segmentation region number n, matrix P=D
-1w, diagonal matrix D=diag (d
1, d
2..., d
n) a certain element d
ifor:
In addition the initial state vector b in (3) formula,
m=[b
im]
n × 1the value of expressive notation node m is b
im=1, the value of all the other nodes is 0; Parameter alpha is in order to represent the original tag information of a certain node and the ratio of its reception adjacent node label information.
Said method minimizes the energy function on figure:
Formula (5) indicates to obtain desirable segmentation effect, in image cutting procedure, should follow smoothness constraint and matching constraint, and smoothness constraint wherein requires image to cut apart front and back, and the label information between adjacent node can not change too much; Matching constraint requires image to cut apart front and back, and the label information on a certain node can not change too much.
Formula (3) can estimate that internodal local smoothing method changes preferably, and therefore the incidence matrix in the present invention is:
P=D in above formula
-1w, matrix (I-(1-α) P) is the sparse matrix of positive definite.
Step 3: adopt normalization dividing method to carry out semantic segmentation to image.
Image segmentation problem is considered as a label distribution problem by the present invention: by the arbitrary label k ∈ in tally set K 1,2 ..., K} distributes to each the block node i in image.Split vector y
k=[y
ik]
n × 1the value that represents k cut zone is y
ik=1, the value of all the other nodes is 0.
Wherein, Y=[y
1, y
2..., y
n] be subdivision matrix and meet YY
t=I, the implication of diagonal matrix D is identical with formula (3).
The optimum solution of formula (7) is matrix D
1/2(I-(1-α) P) D
1/2the subspace that generates of minimum K eigenwert.
Beneficial effect:
1, graph theoretic approach of the present invention is all in theory still comparatively ripe in practice.
2, the present invention is mapped as pixel between image after weighted graph, utilizes the just energy image processing of graph theory knowledge.
3, the present invention can improve precision and the image that image cuts apart and cuts apart quality, contributes to image to cut apart the perfect solution of extracting with target
Determine, and contribute to promote the development in the fields such as pattern-recognition, computer vision, artificial intelligence.
Brief description of the drawings
Fig. 1 is method flow diagram of the present invention.
Fig. 2 is the schematic diagram of contrast before and after pre-service;
Identifier declaration: a, c, e are original image, and b, d, f are pretreated image.
Fig. 3 is the schematic diagram from Berkeley picture library.
Fig. 4 is the schematic diagram from MSRC picture library.
Fig. 5 is two groups of comparative examples figure of the Berkeley picture library segmentation result based on four kinds of distinct methods;
Caption, is from left to right followed successively by: former figure, the segmentation result schematic diagram based on average drifting method, the segmentation result schematic diagram based on method for normalizing, the segmentation result schematic diagram based on multiple dimensioned method for normalizing, the segmentation result schematic diagram based on the method for the invention.
Fig. 6 is two groups of comparative examples figure of the segmentation result of the MSRC picture library based on four kinds of distinct methods;
Caption, is from left to right followed successively by: former figure, the segmentation result schematic diagram based on average drifting method, the segmentation result schematic diagram based on method for normalizing, the segmentation result schematic diagram based on multiple dimensioned method for normalizing, the segmentation result schematic diagram based on the method for the invention.
Embodiment
Below in conjunction with Figure of description, the invention is described in further detail.
As shown in Figure 1, the invention provides a kind of image partition method combining based on graph theory and semi-supervised learning, the research method of the method based on graph theory is with weighted graph G=(V, E) carry out the relation between map image pixel, wherein node V presentation video pixel, the relation between weights E presentation video pixel.The method realization approach is first to utilize average drifting method to realize the coarse segmentation of image, then the image block after cutting apart is considered as to the node in graph model, and sets up the association between adjacent block, thereby build the weighted graph based on image block.
Method flow:
Step 1: image rough segmentation is slit into the region unit of some, and is mapped to weighted graph;
Because pixel is a discrete series of presentation video information, and along with the continuous increase of resolution, thereby cause making to become more and more difficult based on the optimal treatment of image pixel.Therefore, first the present invention adopts average drifting method to carry out pre-service to image, image is divided into some and has the fritter of locality, continuity.Fig. 2 provides the example contrasting before and after some pre-service.
As everyone knows, in the image based on graph theory is cut apart, image is cut apart the associated diagram that quality depends on structure to a great extent.Many researchers is all utilized as color now, and the local features such as edge are set up a suitable associated diagram, and all direct nodes using pixel as associated diagram of these methods.The method proposing due to the present invention is that block-based image is cut apart, and therefore the structure to associated diagram is necessarily improved.
In image is processed, it is generally acknowledged from the neighbor in same region and there is similar color value, and the element of correlation model W taking color as feature
be expressed as:
In above formula, P
irepresent the position of pixel i, C
irepresent its color value, θ
pand θ
cfor constant.
In the present invention, each image-region obtaining after coarse segmentation is considered as a node by the present invention, the set that now the V in weighted graph is all nodes, and E is the limit of connected node between two, its weight is:
w
ij=exp(-θ
c||c
i-c
j||) (2)
In above formula, C
irepresent the color average of cut zone, θ
crepresent the constant of control weight.Can find out from (2) formula: in the time of weight calculation, be greater than from the degree of association between different block pixels from the degree of association between pixel in same cut zone, therefore (2) formula has also implied the local edge of image simultaneously.
Step 2: utilize semi-supervised learning method construct relevance matrix;
Semi-supervised learning is the hot research problem in area of pattern recognition and machine learning field in recent years, is a kind of learning method that supervised learning combines with unsupervised learning.It mainly considers how to utilize a small amount of mark sample training and to a large amount of problems that sample is classified that do not mark, it marks cost to how to reduce, raising learning performance has very great practical significance.The crucial part of semi-supervised learning is consistance hypothesis: (1) adjacent node has identical label; (2) node in same structure has identical label.More than suppose so-called cluster hypothesis.
The present invention will utilize semi-supervised learning method to calculate all internodal incidence matrix.Specifically, first the present invention is divided into all nodes mark node and does not mark node (wherein marking node occupies the minority), then utilizes semi-supervised learning calculate mark node and do not mark internodal relevance.In general unsupervised learning framework, the label information transmission based on iterative process need spend a large amount of time, and therefore the present invention directly carries out label propagation to reduce time cost.
Node m and all internodal interconnection vectors are expressed as:
a
m=α(I-(1-α)P)
-1b
m (3)
Wherein, unit matrix I dimension is image coarse segmentation region number n, matrix P=D
-1w, diagonal matrix D=diag (d
1, d
2..., d
n) a certain element d
ifor:
In addition the initial state vector b in (3) formula,
m=[b
im]
n × 1the value of expressive notation node m is b
im=1, the value of all the other nodes is 0; Parameter alpha is in order to represent the original tag information of a certain node and the ratio of its reception adjacent node label information.
Above method essence is equivalent to minimize the energy function on figure:
Formula (5) indicates to obtain desirable segmentation effect, in image cutting procedure, should follow smoothness constraint and matching constraint, and smoothness constraint wherein requires image to cut apart front and back, and the label information between adjacent node can not change too much; Matching constraint requires image to cut apart front and back, and the label information on a certain node can not change too much.
Formula (3) can estimate that internodal local smoothing method changes preferably, and therefore, the incidence matrix of using in the present invention is:
P=D in above formula
-1w, matrix (I-(1-α) P) is the sparse matrix of positive definite.
Step 3: adopt normalization dividing method to carry out semantic segmentation to image.
In the present invention, image segmentation problem is considered as a label distribution problem by the present invention: by the arbitrary label k ∈ in tally set K 1,2 ..., K} distributes to each the block node i in image.Split vector y
k=[y
ik]
n × 1the value that represents k cut zone is y
ik=1, the value of all the other nodes is 0.
Wherein, Y=[y
1, y
2..., y
n] be subdivision matrix and meet YY
t=I, the implication of diagonal matrix D is identical with formula (3).
The optimum solution of formula (7) is matrix D
1/2(I-(1-α) P) D
1/2the subspace that generates of minimum K eigenwert.
For assessed for performance, the present invention selects two common image storehouses as lab diagram image set.Berkeley picture library.This picture library comprises 12,000 people's work blocks from 300 pictures, and wherein 200 pictures are for training, and another 100 pictures are for test.Fig. 3 provides some examples from Berkeley picture library.Microsoft Research Cambridge (MSRC) picture library.This picture library is containing 591 pictures from 29 classifications, and 216 pictures are wherein for training, and another 275 pictures are for test.Fig. 4 provides some examples from MSRC picture library.
In this experiment, the present invention and three kinds of classical image partition methods compare:
Dividing method based on average drifting: be by selecting suitable kernel function, realize the figure of cutting apart of image.
Based on normalized dividing method: be the association by maximizing each subgraph in graph model, realize the figure of cutting apart of image.
Based on multiple dimensioned normalized dividing method: be the statistical information of obtaining picture structure in the neighborhood by expanding gradually in scope, realize the figure of cutting apart of image.
Table 1: the contrast experiment on Berkeley picture library.
From four kinds of methods experiment comparative results of table 1 and know, various aspects of performance of the present invention is all better than other three kinds of classical ways, its reason comprises: (1) is considered cut apart image and do not cut the relevance between image simultaneously, contributes to better to reflect the correlativity between fast of each region in image; (2) improve the structural form of incidence matrix, contribute to realize more exactly the semantic segmentation of image.
Fig. 5, Fig. 6 provide respectively Berkeley picture library based on four kinds of distinct methods and the segmentation result exemplary plot of MSRC picture library, and the cut zone piece K that wherein the cut zone piece K on Berkeley picture library is taken as on 20, MSRC picture library is taken as 30.
Claims (4)
1. the image partition method combining based on graph theory and semi-supervised learning, is characterized in that, described method comprises the steps:
Step 1: image rough segmentation is slit into the region unit of some, and is mapped to weighted graph;
Above-mentioned steps 1 is that each image-region obtaining after coarse segmentation is considered as to a node, the set that now the V in weighted graph is all nodes, and E is the limit of connected node between two, its weight is:
w
ij=exp(-θ
c||c
i-c
j||) (2)
In above formula (2), C
irepresent the color average of cut zone, θ
cthe constant that represents control weight, in the time of weight calculation, is greater than from the degree of association between different block pixels from the degree of association between pixel in same cut zone;
Step 2: utilize semi-supervised learning method construct relevance matrix;
First above-mentioned steps 2 is divided into all nodes mark node and does not mark node, wherein marking node occupies the minority, then utilize semi-supervised learning calculate mark node and do not mark internodal relevance, adopt the method for directly carrying out label propagation to reduce time cost;
Node m and all internodal interconnection vectors are expressed as:
a
m=α(I-(1-α)P)
-1b
m (3)
Wherein, unit matrix I dimension is image coarse segmentation region number n, matrix P=D
-1w, diagonal matrix D=diag (d
1, d
2..., d
n) a certain element d
ifor:
Initial state vector b in above formula (3)
m=[b
im]
n × 1the value of expressive notation node m is b
im=1, the value of all the other nodes is 0; Parameter alpha is in order to represent the original tag information of a certain node and the ratio of its reception adjacent node label information;
Described method minimizes the energy function on figure:
Formula (5) indicates to obtain desirable segmentation effect, in image cutting procedure, should follow smoothness constraint and matching constraint, and smoothness constraint wherein requires image to cut apart front and back, and the label information between adjacent node can not change too much; Matching constraint requires image to cut apart front and back, and the label information on a certain node can not change too much;
Formula (3) can estimate that internodal local smoothing method changes preferably, and described incidence matrix is:
In above formula (6), P=D
-1w, matrix (I-(1-α) P) is the sparse matrix of positive definite;
Step 3: adopt normalization dividing method to carry out semantic segmentation to image;
Image segmentation problem is considered as a label distribution problem by above-mentioned steps 3: by the arbitrary label k ∈ in tally set K 1,2 ..., K} distributes to each the block node i in image, split vector y
k=[y
ik]
n × 1the value that represents k cut zone is y
ik=1, the value of all the other nodes is 0;
Wherein, Y=[y
1, y
2..., y
n] be subdivision matrix and meet YY
t=I, the implication of diagonal matrix D is identical with formula (3).
2. a kind of image partition method combining based on graph theory and semi-supervised learning according to claim 1, is characterized in that the local edge that the formula (2) of described method step 1 has comprised image.
3. a kind of image partition method combining based on graph theory and semi-supervised learning according to claim 1, is characterized in that, described method step 3 comprises: the optimum solution of above-mentioned formula (7) is matrix D
1/2(I-(1-α) P) D
1/2the subspace that generates of minimum K eigenwert.
4. according to the arbitrary described a kind of image partition method combining based on graph theory and semi-supervised learning of claim 1-3, it is characterized in that, described method is with weighted graph G=(V, E) carry out the relation between map image pixel, wherein node V presentation video pixel, the relation between weights E presentation video pixel.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410118303.3A CN103942779A (en) | 2014-03-27 | 2014-03-27 | Image segmentation method based on combination of graph theory and semi-supervised learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410118303.3A CN103942779A (en) | 2014-03-27 | 2014-03-27 | Image segmentation method based on combination of graph theory and semi-supervised learning |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103942779A true CN103942779A (en) | 2014-07-23 |
Family
ID=51190430
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410118303.3A Pending CN103942779A (en) | 2014-03-27 | 2014-03-27 | Image segmentation method based on combination of graph theory and semi-supervised learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103942779A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105608690A (en) * | 2015-12-05 | 2016-05-25 | 陕西师范大学 | Graph theory and semi supervised learning combination-based image segmentation method |
CN106204538A (en) * | 2016-06-28 | 2016-12-07 | 陕西师范大学 | A kind of image partition method and system |
CN107958216A (en) * | 2017-11-27 | 2018-04-24 | 沈阳航空航天大学 | Based on semi-supervised multi-modal deep learning sorting technique |
CN114240955A (en) * | 2021-12-22 | 2022-03-25 | 电子科技大学 | Semi-supervised cross-domain self-adaptive image segmentation method |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101493887A (en) * | 2009-03-06 | 2009-07-29 | 北京工业大学 | Eyebrow image segmentation method based on semi-supervision learning and Hash index |
US20130346346A1 (en) * | 2012-06-21 | 2013-12-26 | Microsoft Corporation | Semi-supervised random decision forests for machine learning |
CN103617623A (en) * | 2013-12-10 | 2014-03-05 | 江南大学 | Medical image segmentation method and system for fully-represented semi-supervised fast spectral clustering |
-
2014
- 2014-03-27 CN CN201410118303.3A patent/CN103942779A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101493887A (en) * | 2009-03-06 | 2009-07-29 | 北京工业大学 | Eyebrow image segmentation method based on semi-supervision learning and Hash index |
US20130346346A1 (en) * | 2012-06-21 | 2013-12-26 | Microsoft Corporation | Semi-supervised random decision forests for machine learning |
CN103617623A (en) * | 2013-12-10 | 2014-03-05 | 江南大学 | Medical image segmentation method and system for fully-represented semi-supervised fast spectral clustering |
Non-Patent Citations (1)
Title |
---|
SONGHAO ZHU ET AL.: "Graph Theory Based Image Segmentation", 《2013 6TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING》 * |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105608690A (en) * | 2015-12-05 | 2016-05-25 | 陕西师范大学 | Graph theory and semi supervised learning combination-based image segmentation method |
CN105608690B (en) * | 2015-12-05 | 2018-06-08 | 陕西师范大学 | A kind of image partition method being combined based on graph theory and semi-supervised learning |
CN106204538A (en) * | 2016-06-28 | 2016-12-07 | 陕西师范大学 | A kind of image partition method and system |
CN107958216A (en) * | 2017-11-27 | 2018-04-24 | 沈阳航空航天大学 | Based on semi-supervised multi-modal deep learning sorting technique |
CN114240955A (en) * | 2021-12-22 | 2022-03-25 | 电子科技大学 | Semi-supervised cross-domain self-adaptive image segmentation method |
CN114240955B (en) * | 2021-12-22 | 2023-04-07 | 电子科技大学 | Semi-supervised cross-domain self-adaptive image segmentation method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104599275B (en) | The RGB-D scene understanding methods of imparametrization based on probability graph model | |
CN103824050B (en) | A kind of face key independent positioning method returned based on cascade | |
CN102930539B (en) | Based on the method for tracking target of Dynamic Graph coupling | |
CN108875076B (en) | Rapid trademark image retrieval method based on Attention mechanism and convolutional neural network | |
CN103544697B (en) | A kind of image partition method based on hypergraph analysis of spectrum | |
CN104156433B (en) | Image retrieval method based on semantic mapping space construction | |
CN108846404B (en) | Image significance detection method and device based on related constraint graph sorting | |
CN105184772A (en) | Adaptive color image segmentation method based on super pixels | |
CN104732545B (en) | The texture image segmenting method with quick spectral clustering is propagated with reference to sparse neighbour | |
CN102982544B (en) | Many foreground object image interactive segmentation method | |
CN104462494A (en) | Remote sensing image retrieval method and system based on non-supervision characteristic learning | |
CN106504255A (en) | A kind of multi-Target Image joint dividing method based on multi-tag multi-instance learning | |
CN104392241A (en) | Mixed regression-based head pose estimation method | |
CN104751175B (en) | SAR image multiclass mark scene classification method based on Incremental support vector machine | |
CN103578107B (en) | A kind of interactive image segmentation method | |
CN112817755A (en) | Edge cloud cooperative deep learning target detection method based on target tracking acceleration | |
CN110210431A (en) | A kind of point cloud classifications method based on cloud semantic tagger and optimization | |
CN104992454A (en) | Regionalized automatic-cluster-change image segmentation method | |
CN103679719A (en) | Image segmentation method | |
CN103942779A (en) | Image segmentation method based on combination of graph theory and semi-supervised learning | |
CN107533760B (en) | Image segmentation method and device | |
CN102831161B (en) | For the semi-supervised sequence learning method based on manifold regularization of image retrieval | |
Gong et al. | Unsupervised RGB-T saliency detection by node classification distance and sparse constrained graph learning | |
Luqman et al. | Subgraph spotting through explicit graph embedding: An application to content spotting in graphic document images | |
CN114782714A (en) | Image matching method and device based on context information fusion |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20140723 |
|
RJ01 | Rejection of invention patent application after publication |