CN110443817A - A method of improving image segmentation precision - Google Patents
- Publication number
- CN110443817A (application CN201910535813.3A)
- Authority
- CN
- China
- Prior art keywords
- merge
- pixel
- point
- common
- image
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/12—Edge-based segmentation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/44—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20076—Probabilistic image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
Abstract
A method of improving image segmentation precision, comprising: step 1, setting up a conditional random field model; step 2, designing the parameters of the conditional random field; step 3, designing a connected-region search algorithm; step 4, accurately bounding the target with a minimum external quadrilateral. The invention refines image segmentation to improve its precision. Its most prominent feature is that, for the problem of coarse segmentation, conditional random field post-processing is introduced, which yields more refined object-edge segmentation, fills interior holes, and improves segmentation and localization. For segmented images lacking an obvious bounding frame, a minimum external quadrilateral algorithm searches the connected regions of the binarized result and fits a minimum frame, producing a complete bounding result. The invention can be widely applied to image localization and recognition, such as vehicle identification in logistics parks.
Description
Technical field
The present invention relates to a method for improving image segmentation precision.
Technical background
In recent years, with the rapid development of computer science and technology, computer-based image processing and image object detection have also advanced at an unprecedented pace. Deep learning, by learning from massive collections of digital images, extracts key target features and has surpassed human performance in target detection, bringing the industry one surprise after another. With the resurgence of neural networks, video-image methods based on convolutional neural networks have become the mainstream technology for image segmentation and recognition, using means such as template matching, edge-feature extraction, and histograms of gradients to identify images accurately. Although neural-network-based image feature detection can perform effective feature recognition on targets in complex scenes, and its effect is far better than that of traditional methods, shortcomings remain: (1) it is weakly robust to noise; (2) overfitting is mitigated by using Dropout and by improving the convolutional network model and its parameters, but precision drops slightly; (3) deformable convolutions and depthwise-separable convolution structures improve the generalization of the model and enhance its feature-extraction ability, but recognition performance on targets in complex scenes is still unsatisfactory; (4) a newer class of image segmentation methods, the end-to-end models, directly predicts per-pixel classification information and achieves pixel-level localization of the target object, but such models suffer from large parameter counts, low efficiency, and coarse segmentation. In short, traditional detection methods and video-image methods are cumbersome, their recognition precision is not high, their recognition is slow, and their segmentation is coarse.
Summary of the invention
In order to overcome the above-mentioned deficiencies of the prior art, the present invention provides a method of refining image segmentation. A conditional random field is used to model related information such as pixel spacing and color similarity, yielding more refined object-edge segmentation and filling interior holes; finally, the target is bounded with a minimum external quadrilateral. This solves the problem of measuring image feature boundaries and facilitates further extraction of information.
To achieve the above object, the invention adopts the following technical scheme:
A method of improving image segmentation precision, comprising the following steps:
Step 1, setting up the conditional random field model;
Traditional image segmentation methods use "shift-and-stitch" dense outputs or interpolation-based up-sampling, but the results these methods obtain are relatively coarse; even with traditional dilation and erosion post-processing, the pixel classification results are still inaccurate. To solve this problem, a conditional random field is applied as a post-processing step after pixel classification, so that the classification yields more accurate per-pixel probability values and the target pixels are precisely localized.
A conditional random field is a discriminative undirected graphical model. For an observation sequence x = {x_1, x_2, ..., x_n}, i.e. the given sequence of target pixel values, and a label sequence y = {y_1, y_2, ..., y_n}, i.e. the class labels, a conditional probability model P(y|x) is built. Let G = <V, E> be an undirected graph whose nodes correspond one-to-one with the labels y, let y_v denote the label variable associated with node v, and let n(v) denote the neighbors of node v. Every variable y_v satisfies the Markov property, i.e.

P(y_v | x, y_{V\{v}}) = P(y_v | x, y_{n(v)})    (1)

Then (y, x) constitutes a conditional random field. It is modeled by defining the conditional probability P(y|x) with potential functions over cliques, so that the clique potentials formed by each label variable y_i and its neighbor y_{i-1} are maximized. Choosing exponential potential functions, the objective function is defined as

P(y|x) = (1/Z) exp( Σ_{i,j} λ_j t_j(y_{i+1}, y_i, x, i) + Σ_{i,k} μ_k s_k(y_i, x, i) )    (2)

where t_j(y_{i+1}, y_i, x, i) is the transition feature function on two adjacent label positions, which captures the correlation between adjacent label variables and the influence of the observation sequence on them; s_k(y_i, x, i) is the state feature function of the observation sequence at label position i, which captures the influence of the observation sequence on the label variable; λ_j and μ_k are parameters; and Z is the normalization factor that makes the probabilities well defined.
Step 2, designing the parameters of the conditional random field;
For the above conditional random field, combined with the general model of image feature classification, the energy function used is

E(x) = Σ_i θ_i(x_i) + Σ_{i<j} θ_ij(x_i, x_j)    (3)

where θ_i(x_i) is the unary potential; x_i is the classification label of pixel i in the observation sequence, with class probability P(x_i), converted as θ_i(x_i) = -log P(x_i). The second term, the pairwise potential θ_ij(x_i, x_j), is expanded as

θ_ij(x_i, x_j) = μ(x_i, x_j) Σ_m w_m · k_m(f_i, f_j)    (4)

where μ(x_i, x_j) is the label contrast function, with μ(x_i, x_j) = 1 when x_i ≠ x_j and 0 otherwise, used to compare the labels of neighboring pixels; w_m · k_m(f_i, f_j) are Gaussian kernel feature functions, with w_m weighing the feature relationship between adjacent pixels. The concrete relationship function is

k(f_i, f_j) = w_1 exp( -|p_i - p_j|² / (2σ_α²) - |I_i - I_j|² / (2σ_β²) ) + w_2 exp( -|p_i - p_j|² / (2σ_γ²) )    (5)

where p_i and p_j are the coordinates of adjacent pixels; I_i and I_j are their color information; σ_α is the position factor; σ_β is the color-similarity factor; σ_γ is the additional position-scale factor; and w_1, w_2 are the weights of the linear combination. Through the mean-field approximation Q(x) = Π_i Q_i(x_i), Q(x) is iteratively updated to minimize the K-L divergence between P(x) and Q(x), giving the optimal solution of the model.
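The pairwise Gaussian kernel described above (an appearance term over position and color governed by σ_α and σ_β, plus a position-only smoothness term governed by σ_γ, combined with weights w_1 and w_2, and gated by the label contrast function μ) can be sketched in a few lines of Python. This is an illustrative sketch, not the patent's implementation; the function names and the default factor values are assumptions, since the patent states no concrete values:

```python
import math

def pairwise_kernel(p_i, p_j, I_i, I_j,
                    w1=1.0, w2=1.0,
                    sigma_alpha=60.0, sigma_beta=10.0, sigma_gamma=3.0):
    """Gaussian kernel of equation-style form: an appearance kernel over
    position and color plus a smoothness kernel over position only."""
    d_pos = sum((a - b) ** 2 for a, b in zip(p_i, p_j))   # |p_i - p_j|^2
    d_col = sum((a - b) ** 2 for a, b in zip(I_i, I_j))   # |I_i - I_j|^2
    appearance = w1 * math.exp(-d_pos / (2 * sigma_alpha ** 2)
                               - d_col / (2 * sigma_beta ** 2))
    smoothness = w2 * math.exp(-d_pos / (2 * sigma_gamma ** 2))
    return appearance + smoothness

def pairwise_potential(x_i, x_j, k_value):
    """theta_ij = mu(x_i, x_j) * k: the label contrast mu is 1 for
    differing labels and 0 otherwise, so identical labels cost nothing."""
    return k_value if x_i != x_j else 0.0
```

In a full dense-CRF inference this kernel would be evaluated (or approximated by filtering) between all pixel pairs during each mean-field update; the sketch only shows the per-pair potential itself.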
Step 3, designing the connected-region search algorithm;
There are many connected-region search methods for images, including pixel-labelling methods and line-segment labelling methods. Pixel-labelling methods divide further into region growing, sequential scanning, and recursive labelling; line-segment labelling is mainly run-length labelling. Pixel labelling is the most common, so the prediction result for each class of logistics vehicle is converted into a binary map and searched by connected-region labelling. Let the left, right, upper, and lower neighbors of pixel f(x, y) be f(x-1, y), f(x+1, y), f(x, y-1), and f(x, y+1) respectively; the connected-region label merge(x, y) is then scanned over this 4-neighborhood. When point f(x, y) is reached, the left and upper positions f(x-1, y) and f(x, y-1) have already been scanned, so the connectivity of f(x, y) can be determined by examining merge(x-1, y) and merge(x, y-1). The specific rules are:
1) Condition for connection with the left neighbor: when f(x, y) = f(x-1, y) and f(x, y) ≠ f(x, y-1), merge(x, y) = merge(x-1, y).
2) Condition for connection with the upper neighbor: when f(x, y) = f(x, y-1) and f(x, y) ≠ f(x-1, y), merge(x, y) = merge(x, y-1).
3) Condition for connection with both the left and upper neighbors: when f(x, y) = f(x-1, y) and f(x, y) = f(x, y-1), merge(x, y) = merge(x-1, y) = merge(x, y-1).
4) Condition for no connection with the left or upper neighbor: when f(x, y) ≠ f(x-1, y) and f(x, y) ≠ f(x, y-1), merge(x, y) = NewLabel, a new connection label.
A one-dimensional array common is set up, indexed by the value of the provisional connected-region label merge(x, y); the value common(merge(x, y)) represents the common connected-region label shared by the region of pixel f(x, y). The binary class map is scanned as follows:
1) When the current point satisfies f(x, y) ≠ f(x-1, y) and f(x, y) ≠ f(x, y-1), pixel f(x, y) belongs to a new connected region; a new entry is added to common, recording common(merge(x, y)) = merge(x, y).
2) When the current point satisfies f(x, y) = f(x, y-1) and f(x, y) = f(x-1, y), the provisional labels merge(x-1, y) and merge(x, y-1) must also be compared. If merge(x-1, y) = merge(x, y-1), then merge(x, y) = merge(x, y-1); if merge(x-1, y) ≠ merge(x, y-1), then for every i with common(i) = common(merge(x-1, y)), set common(i) = common(merge(x, y-1)).
3) When the current point satisfies f(x, y) = f(x, y-1) and f(x, y) ≠ f(x-1, y), it is connected with the upper neighbor; record merge(x, y) = merge(x, y-1).
4) When the current point satisfies f(x, y) = f(x-1, y) and f(x, y) ≠ f(x, y-1), it is connected with the left neighbor; record merge(x, y) = merge(x-1, y).
After the above steps, all connected regions are merged and the connected region of each class is obtained, so that pixel-level segmentation and localization of the target image can be performed.
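The provisional-label scan and the common-array merging described above can be sketched as a small two-pass labelling routine. This is an illustrative sketch, not the patent's implementation: it replaces the bulk relabelling of the common array with an equivalent label-chasing helper (`root`), and all function and variable names beyond `merge` and `common` are assumptions:

```python
def label_regions(img):
    """Two-pass 4-neighborhood connected-region labelling of a binary or
    per-class map: `merge` holds a provisional label per pixel, `common`
    maps each provisional label to the shared label of its region."""
    h, w = len(img), len(img[0])
    merge = [[0] * w for _ in range(h)]
    common = {}          # provisional label -> common region label
    next_label = 1

    def root(lab):       # chase common[] until a label maps to itself
        while common[lab] != lab:
            lab = common[lab]
        return lab

    for y in range(h):
        for x in range(w):
            left = x > 0 and img[y][x] == img[y][x - 1]
            up = y > 0 and img[y][x] == img[y - 1][x]
            if left and up:                    # connected to both neighbors
                merge[y][x] = merge[y][x - 1]
                a, b = root(merge[y][x - 1]), root(merge[y - 1][x])
                if a != b:                     # record label equivalence
                    common[max(a, b)] = min(a, b)
            elif left:                         # connected to left neighbor
                merge[y][x] = merge[y][x - 1]
            elif up:                           # connected to upper neighbor
                merge[y][x] = merge[y - 1][x]
            else:                              # new region: NewLabel
                merge[y][x] = next_label
                common[next_label] = next_label
                next_label += 1

    # second pass: replace each provisional label by its common label
    return [[root(merge[y][x]) for x in range(w)] for y in range(h)]
```

For example, a U-shaped region whose two arms receive different provisional labels during the scan is merged into one common label when the bottom of the U connects them.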
Step 4, accurately bounding the minimum external quadrilateral;
On the basis of the segmentation, the target is bounded using the minimum external quadrilateral method, which makes it easy to compute the width and height of the target in pixels. The calculation process of the minimum external quadrilateral is:
1) Convert each class of the step-3 segmentation into a binary image and find its approximate polygonal contour.
2) The polygonal contour consists of a series of points; among the discrete points, find the point with the largest y-coordinate and the smallest x-coordinate and denote it point A.
3) With A as origin, sweep a ray along the forward and reverse x-axis clockwise; the scanned point reached with the minimum rotation angle is denoted point B.
4) With B as origin, sweep the directed ray AB clockwise; the point reached with the minimum rotation angle is denoted point C.
5) Continue in the same way until A is reached again, obtaining the polygon P.
6) Taking each edge of P in turn as the supporting edge, compute the area of the enclosing rectangle for each rotation; the minimum area gives the minimum external quadrilateral, whose height and width are recorded.
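The bounding flow above amounts to: build the enclosing polygon, then rotate it onto each edge and keep the smallest axis-aligned box. The sketch below is a stand-in under stated assumptions: it uses a monotone-chain convex hull in place of the clockwise ray scan of steps 2) to 5) (both produce the enclosing polygon P), and all names are illustrative:

```python
import math

def convex_hull(points):
    """Monotone-chain convex hull; stands in for the clockwise ray scan
    that builds the enclosing polygon P from the contour points."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    def cross(o, a, b):
        return (a[0]-o[0])*(b[1]-o[1]) - (a[1]-o[1])*(b[0]-o[0])
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]

def min_area_rect(points):
    """Rotating method of step 6): align the hull with each edge in turn
    and take the smallest axis-aligned box; returns (area, width, height)."""
    hull = convex_hull(points)
    best = None
    n = len(hull)
    for i in range(n):
        (x0, y0), (x1, y1) = hull[i], hull[(i + 1) % n]
        theta = math.atan2(y1 - y0, x1 - x0)
        c, s = math.cos(-theta), math.sin(-theta)
        xs = [c * x - s * y for x, y in hull]   # rotate into edge frame
        ys = [s * x + c * y for x, y in hull]
        w, h = max(xs) - min(xs), max(ys) - min(ys)
        if best is None or w * h < best[0]:
            best = (w * h, w, h)
    return best
```

A classical result guarantees that the minimum-area enclosing rectangle has one side collinear with a hull edge, which is why checking only the hull edges suffices.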
The invention has the following advantages:
The invention refines image segmentation to improve its precision. Its most prominent feature is that, for the problem of coarse segmentation, conditional random field post-processing is introduced, which yields more refined object-edge segmentation, fills interior holes, and improves segmentation and localization; for segmented images lacking an obvious bounding frame, the minimum external quadrilateral algorithm searches the connected regions of the binarized result and fits a minimum frame, producing a complete bounding result. The invention can be widely applied to image localization and recognition, such as vehicle identification in logistics parks.
Detailed description of the invention
Fig. 1a to Fig. 1c are schematic diagrams of the defects of traditional image segmentation, where Fig. 1a is the original image, Fig. 1b is the label, and Fig. 1c is the prediction;
Fig. 2a to Fig. 2f compare results before and after using the conditional random field of the invention; Fig. 2a is the original image before using the conditional random field, Fig. 2b is the label, and Fig. 2c is the prediction; Fig. 2d is the original image after using the conditional random field, Fig. 2e is the label, and Fig. 2f is the prediction;
Fig. 3a and Fig. 3b show the minimum external quadrilateral positioning of the invention, where Fig. 3a is lateral positioning and Fig. 3b is frontal positioning.
Specific embodiment
In order to overcome the above-mentioned deficiencies of the prior art, the present invention provides a method of refining image segmentation. A conditional random field is used to model related information such as pixel spacing and color similarity, yielding more refined object-edge segmentation and filling interior holes; finally, the target is bounded with a minimum external quadrilateral. This solves the problem of measuring image feature boundaries and facilitates further extraction of information.
To achieve the above object, the invention adopts the following technical scheme:
A method of improving image segmentation precision, comprising the following steps:
Step 1, setting up the conditional random field model;
Traditional image segmentation methods use "shift-and-stitch" dense outputs or interpolation-based up-sampling, but the results these methods obtain are relatively coarse; even with traditional dilation and erosion post-processing, the pixel classification results are still inaccurate. To solve this problem, a conditional random field is applied as a post-processing step after pixel classification, so that the classification yields more accurate per-pixel probability values and the target pixels are precisely localized.
A conditional random field is a discriminative undirected graphical model. For an observation sequence x = {x_1, x_2, ..., x_n}, i.e. the given sequence of target pixel values, and a label sequence y = {y_1, y_2, ..., y_n}, i.e. the class labels, a conditional probability model P(y|x) is built. Let G = <V, E> be an undirected graph whose nodes correspond one-to-one with the labels y, let y_v denote the label variable associated with node v, and let n(v) denote the neighbors of node v. Every variable y_v satisfies the Markov property, i.e.

P(y_v | x, y_{V\{v}}) = P(y_v | x, y_{n(v)})    (1)

Then (y, x) constitutes a conditional random field. It is modeled by defining the conditional probability P(y|x) with potential functions over cliques, so that the clique potentials formed by each label variable y_i and its neighbor y_{i-1} are maximized. Choosing exponential potential functions, the objective function is defined as

P(y|x) = (1/Z) exp( Σ_{i,j} λ_j t_j(y_{i+1}, y_i, x, i) + Σ_{i,k} μ_k s_k(y_i, x, i) )    (2)

where t_j(y_{i+1}, y_i, x, i) is the transition feature function on two adjacent label positions, which captures the correlation between adjacent label variables and the influence of the observation sequence on them; s_k(y_i, x, i) is the state feature function of the observation sequence at label position i, which captures the influence of the observation sequence on the label variable; λ_j and μ_k are parameters; and Z is the normalization factor that makes the probabilities well defined.
Step 2, designing the parameters of the conditional random field;
For the above conditional random field, combined with the general model of image feature classification, the energy function used is

E(x) = Σ_i θ_i(x_i) + Σ_{i<j} θ_ij(x_i, x_j)    (3)

where θ_i(x_i) is the unary potential; x_i is the classification label of pixel i in the observation sequence, with class probability P(x_i), converted as θ_i(x_i) = -log P(x_i). The second term, the pairwise potential θ_ij(x_i, x_j), is expanded as

θ_ij(x_i, x_j) = μ(x_i, x_j) Σ_m w_m · k_m(f_i, f_j)    (4)

where μ(x_i, x_j) is the label contrast function, with μ(x_i, x_j) = 1 when x_i ≠ x_j and 0 otherwise, used to compare the labels of neighboring pixels; w_m · k_m(f_i, f_j) are Gaussian kernel feature functions, with w_m weighing the feature relationship between adjacent pixels. The concrete relationship function is

k(f_i, f_j) = w_1 exp( -|p_i - p_j|² / (2σ_α²) - |I_i - I_j|² / (2σ_β²) ) + w_2 exp( -|p_i - p_j|² / (2σ_γ²) )    (5)

where p_i and p_j are the coordinates of adjacent pixels; I_i and I_j are their color information; σ_α is the position factor; σ_β is the color-similarity factor; σ_γ is the additional position-scale factor; and w_1, w_2 are the weights of the linear combination. Through the mean-field approximation Q(x) = Π_i Q_i(x_i), Q(x) is iteratively updated to minimize the K-L divergence between P(x) and Q(x), giving the optimal solution of the model.
Step 3, designing the connected-region search algorithm;
There are many connected-region search methods for images, including pixel-labelling methods and line-segment labelling methods. Pixel-labelling methods divide further into region growing, sequential scanning, and recursive labelling; line-segment labelling is mainly run-length labelling. Pixel labelling is the most common, so the prediction result for each class of logistics vehicle is converted into a binary map and searched by connected-region labelling. Let the left, right, upper, and lower neighbors of pixel f(x, y) be f(x-1, y), f(x+1, y), f(x, y-1), and f(x, y+1) respectively; the connected-region label merge(x, y) is then scanned over this 4-neighborhood. When point f(x, y) is reached, the left and upper positions f(x-1, y) and f(x, y-1) have already been scanned, so the connectivity of f(x, y) can be determined by examining merge(x-1, y) and merge(x, y-1). The specific rules are:
1) Condition for connection with the left neighbor: when f(x, y) = f(x-1, y) and f(x, y) ≠ f(x, y-1), merge(x, y) = merge(x-1, y).
2) Condition for connection with the upper neighbor: when f(x, y) = f(x, y-1) and f(x, y) ≠ f(x-1, y), merge(x, y) = merge(x, y-1).
3) Condition for connection with both the left and upper neighbors: when f(x, y) = f(x-1, y) and f(x, y) = f(x, y-1), merge(x, y) = merge(x-1, y) = merge(x, y-1).
4) Condition for no connection with the left or upper neighbor: when f(x, y) ≠ f(x-1, y) and f(x, y) ≠ f(x, y-1), merge(x, y) = NewLabel, a new connection label.
A one-dimensional array common is set up, indexed by the value of the provisional connected-region label merge(x, y); the value common(merge(x, y)) represents the common connected-region label shared by the region of pixel f(x, y). The binary class map is scanned as follows:
1) When the current point satisfies f(x, y) ≠ f(x-1, y) and f(x, y) ≠ f(x, y-1), pixel f(x, y) belongs to a new connected region; a new entry is added to common, recording common(merge(x, y)) = merge(x, y).
2) When the current point satisfies f(x, y) = f(x, y-1) and f(x, y) = f(x-1, y), the provisional labels merge(x-1, y) and merge(x, y-1) must also be compared. If merge(x-1, y) = merge(x, y-1), then merge(x, y) = merge(x, y-1); if merge(x-1, y) ≠ merge(x, y-1), then for every i with common(i) = common(merge(x-1, y)), set common(i) = common(merge(x, y-1)).
3) When the current point satisfies f(x, y) = f(x, y-1) and f(x, y) ≠ f(x-1, y), it is connected with the upper neighbor; record merge(x, y) = merge(x, y-1).
4) When the current point satisfies f(x, y) = f(x-1, y) and f(x, y) ≠ f(x, y-1), it is connected with the left neighbor; record merge(x, y) = merge(x-1, y).
After the above steps, all connected regions are merged and the connected region of each class is obtained, so that pixel-level segmentation and localization of the target image can be performed.
Step 4, accurately bounding the minimum external quadrilateral;
On the basis of the segmentation, the target is bounded using the minimum external quadrilateral method, which makes it easy to compute the width and height of the target in pixels. The calculation process of the minimum external quadrilateral is:
1) Convert each class of the step-3 segmentation into a binary image and find its approximate polygonal contour.
2) The polygonal contour consists of a series of points; among the discrete points, find the point with the largest y-coordinate and the smallest x-coordinate and denote it point A.
3) With A as origin, sweep a ray along the forward and reverse x-axis clockwise; the scanned point reached with the minimum rotation angle is denoted point B.
4) With B as origin, sweep the directed ray AB clockwise; the point reached with the minimum rotation angle is denoted point C.
5) Continue in the same way until A is reached again, obtaining the polygon P.
6) Taking each edge of P in turn as the supporting edge, compute the area of the enclosing rectangle for each rotation; the minimum area gives the minimum external quadrilateral, whose height and width are recorded.
In order to verify the superiority of the invention, a network model was built with logistics-park vehicles as an example and a controlled experiment was carried out:
A lightweight network with a conditional random field was built: four types of logistics vehicles (cargo trucks, towed-goods vehicles, dump trucks, and tank trucks) were collected from a logistics park, divided into a training set of 8,000 images (2,000 per class) and a test set of 4,000 images (1,000 per class). The parameter configuration of the network architecture is shown in Table 1 below.
In Table 1: k is the convolution kernel size; s is the stride; p is the padding size; DW denotes the depthwise convolution groups, indicating the regular arrangement of depthwise convolution kernels; residual summation is used, which benefits gradient propagation in large networks; activation and batch normalization (BN) in each layer help accelerate the training of the network; ReLU, the rectified linear unit, is an activation function.
Table 1: Parameter design of the network architecture
After introducing the conditional random field optimization, the segmentation and localization effect is improved. The comparison between the segmentation results before and after using the conditional random field is shown in Fig. 2 of the accompanying drawings.
For segmented images lacking an obvious bounding frame, the minimum external quadrilateral algorithm searches the connected regions of the binarized result and fits a minimum frame. A complete bounding result is obtained, as shown in Fig. 3 of the accompanying drawings. Whatever the orientation of the logistics vehicle, the frame obtained after segmentation and localization is confined to a minimum rectangular box.
The invention has the following advantages:
The invention refines image segmentation to improve its precision. Its most prominent feature is that, for the problem of coarse segmentation, conditional random field post-processing is introduced, which yields more refined object-edge segmentation, fills interior holes, and improves segmentation and localization; for segmented images lacking an obvious bounding frame, the minimum external quadrilateral algorithm searches the connected regions of the binarized result and fits a minimum frame, producing a complete bounding result. The invention can be widely applied to image localization and recognition, such as vehicle identification in logistics parks.
The content described in the embodiments of this specification merely enumerates forms of realizing the inventive concept. The protection scope of the invention should not be construed as being limited to the specific forms stated in the embodiments; it also covers equivalent technical means that those skilled in the art can conceive according to the inventive concept.
Claims (1)
1. A method of improving image segmentation precision, comprising the following steps:
Step 1, setting up the conditional random field model;
a conditional random field is applied as a post-processing step after pixel classification, so that the classification yields more accurate per-pixel probability values and precise localization of the target pixel classes is achieved;
a conditional random field is a discriminative undirected graphical model; for an observation sequence x = {x_1, x_2, ..., x_n}, i.e. the given sequence of target pixel values, and a label sequence y = {y_1, y_2, ..., y_n}, i.e. the class labels, a conditional probability model P(y|x) is built; let G = <V, E> be an undirected graph whose nodes correspond one-to-one with the labels y, let y_v denote the label variable associated with node v, and let n(v) denote the neighbors of node v; every variable y_v satisfies the Markov property, i.e.
P(y_v | x, y_{V\{v}}) = P(y_v | x, y_{n(v)})    (1)
then (y, x) constitutes a conditional random field; it is modeled by defining the conditional probability P(y|x) with potential functions over cliques, so that the clique potentials formed by each label variable y_i and its neighbor y_{i-1} are maximized; choosing exponential potential functions, the objective function is defined as
P(y|x) = (1/Z) exp( Σ_{i,j} λ_j t_j(y_{i+1}, y_i, x, i) + Σ_{i,k} μ_k s_k(y_i, x, i) )    (2)
where t_j(y_{i+1}, y_i, x, i) is the transition feature function on two adjacent label positions, which captures the correlation between adjacent label variables and the influence of the observation sequence on them; s_k(y_i, x, i) is the state feature function of the observation sequence at label position i, which captures the influence of the observation sequence on the label variable; λ_j and μ_k are parameters; and Z is the normalization factor that makes the probabilities well defined;
Step 2, designing the parameters of the conditional random field;
for the above conditional random field, combined with the general model of image feature classification, the energy function used is
E(x) = Σ_i θ_i(x_i) + Σ_{i<j} θ_ij(x_i, x_j)    (3)
where θ_i(x_i) is the unary potential; x_i is the classification label of pixel i in the observation sequence, with class probability P(x_i), converted as θ_i(x_i) = -log P(x_i); the second term, the pairwise potential θ_ij(x_i, x_j), is expanded as
θ_ij(x_i, x_j) = μ(x_i, x_j) Σ_m w_m · k_m(f_i, f_j)    (4)
where μ(x_i, x_j) is the label contrast function, with μ(x_i, x_j) = 1 when x_i ≠ x_j and 0 otherwise, used to compare the labels of neighboring pixels; w_m · k_m(f_i, f_j) are Gaussian kernel feature functions, with w_m weighing the feature relationship between adjacent pixels; the concrete relationship function is
k(f_i, f_j) = w_1 exp( -|p_i - p_j|² / (2σ_α²) - |I_i - I_j|² / (2σ_β²) ) + w_2 exp( -|p_i - p_j|² / (2σ_γ²) )    (5)
where p_i and p_j are the coordinates of adjacent pixels; I_i and I_j are their color information; σ_α is the position factor; σ_β is the color-similarity factor; σ_γ is the additional position-scale factor; and w_1, w_2 are the weights of the linear combination; through the mean-field approximation Q(x) = Π_i Q_i(x_i), Q(x) is iteratively updated to minimize the K-L divergence between P(x) and Q(x), giving the optimal solution of the model;
Step 3 designs connection regional search algorithm;
Image connection regional search method is more, there is pixel point mark method, line segment labelling method etc.;Wherein pixel point mark method divides again
For region growth method, sequential scan method, recursion marking method;Line segment labelling method is mainly distance of swimming labelling method;And pixel point mark method
It is most common, binary map is converted by the prediction result of the logistics vehicles of each classification, is searched by connection region labeling;If
Coordinate is respectively f (x-1, y), f (x+1, y), f (x, y-1), f (x, y+1) up and down for its left and right pixel f (x, y), then connection area
Domain label merge (x, y) is scanned in 4 fields, and left, upper position f (x-1, y) and f (x, y-1) have been scanned when putting by f (x, y),
Therefore the connectivity of f (x, y) can be determined by judging merge (x-1, y) and merge (x, y-1), specific discriminate is
S1) show the Rule of judgment being connected with left collar domain: as f (x, y)=f (x-1, y) and f (x, y) ≠ f (x, y-1),
Merge (x, y)=merge (x-1, y);
S2) show the Rule of judgment being connected with upper field: as f (x, y)=f (x, y-1) and f (x, y) ≠ f (x-1, y),
Merge (x, y)=merge (x, y-1);
S3) show the Rule of judgment being connected with left, upper field: as f (x, y)=f (x-1, y) and f (x, y)=f (x, y-1)
When, merge (x, y)=merge (x-1, y)=merge (x, y-1);
S4) show the Rule of judgment with left, upper field not connection: as f (x, y) ≠ f (x-1, y) and f (x, y) ≠ f (x, y-1)
When, merge (x, y)=NewLabel new connection label;
A one-dimensional array common is set up, indexed by the provisional region label merge (x, y); its value gives the final common region label of each provisional label, i.e. pixel f (x, y) belongs to the common region labelled common (merge (x, y)). The binary class image is scanned as follows:
T1) When the current point satisfies f (x, y) ≠ f (x-1, y) and f (x, y) ≠ f (x, y-1), pixel f (x, y) belongs to a new connected region; a new entry is appended to common, recording common (merge (x, y))=merge (x, y);
T2) When the current point satisfies f (x, y)=f (x, y-1) and f (x, y)=f (x-1, y), the provisional region labels merge (x-1, y) and merge (x, y-1) must also be compared:
if merge (x-1, y)=merge (x, y-1), then merge (x, y)=merge (x, y-1);
if merge (x-1, y) ≠ merge (x, y-1), then for every i with common (i)=common (merge (x-1, y)),
set common (i)=common (merge (x, y-1)), recording that the two labels belong to the same region;
T3) When the current point satisfies f (x, y)=f (x, y-1) and f (x, y) ≠ f (x-1, y), the pixel is connected to the upper neighbour;
record merge (x, y)=merge (x, y-1);
T4) When the current point satisfies f (x, y)=f (x-1, y) and f (x, y) ≠ f (x, y-1), the pixel is connected to the left neighbour;
record merge (x, y)=merge (x-1, y).
After the above steps, all equivalent labels are merged to obtain the connected region of each class, and pixel-level
segmentation and localization of the target image can then be performed.
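The two-pass scan above (rules S1)-S4) for label assignment and T1)-T4) for equivalence recording) can be sketched as follows. This is a minimal sketch, not the patent's implementation: the function name and loop structure are assumptions, and label equivalences are resolved by chasing the common array to a representative label, an equivalent formulation of the bulk rewrite in T2).

```python
import numpy as np

def label_regions(f):
    """Two-pass 4-neighbourhood connected-region labelling (a sketch).

    f is a 2-D array of class values; indexing is f[y, x], i.e. f(x, y)
    in the text. Returns an array of merged region labels."""
    h, w = f.shape
    merge = np.zeros((h, w), dtype=int)   # provisional labels merge(x, y)
    common = [0]                          # common[i]: representative of label i
    next_label = 0

    def resolve(i):
        # chase the common chain to the representative region label
        while common[i] != i:
            i = common[i]
        return i

    for y in range(h):
        for x in range(w):
            left = x > 0 and f[y, x] == f[y, x - 1]
            up = y > 0 and f[y, x] == f[y - 1, x]
            if not left and not up:              # S4/T1: new region
                next_label += 1
                common.append(next_label)
                merge[y, x] = next_label
            elif left and not up:                # S1/T4: connected to the left
                merge[y, x] = merge[y, x - 1]
            elif up and not left:                # S2/T3: connected above
                merge[y, x] = merge[y - 1, x]
            else:                                # S3/T2: connected to both
                a, b = merge[y, x - 1], merge[y - 1, x]
                merge[y, x] = b
                ra, rb = resolve(a), resolve(b)
                if ra != rb:                     # record the equivalence
                    common[max(ra, rb)] = min(ra, rb)

    # second pass: replace every provisional label by its common label
    return np.vectorize(resolve)(merge)
```

For example, on the class map `[[1, 0, 1], [1, 1, 1]]` the two provisional labels of the U-shaped class-1 region are merged into one final label when the scan reaches the bottom-right pixel.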
Step 4: accurately delimit the minimum external quadrangle.
On the basis of the segmentation, the target is framed by the minimum-external-quadrangle method, which makes it convenient to compute the
pixel height and width of the target. The procedure for locating the minimum external quadrangle is:
1) Convert each class of the step-3 segmented image into a binary image and find its approximate polygonal contour;
2) The polygonal contour consists of a series of points; among these discrete points, find the one with the largest y-coordinate and the smallest x-coordinate, denoted A;
3) With A as the origin, sweep a ray from the positive x-axis direction clockwise; the point reached at the minimum rotation angle is denoted B;
4) With B as the origin, sweep the directed ray AB clockwise; the point reached at the minimum rotation angle is denoted C;
5) Continue in the same way until A is reached again, yielding the convex polygon P;
6) Taking each edge of P as a base edge, compute the enclosed area for each rotation by the rotation method and take the minimum area, which gives the minimum external quadrangle;
record the height and width of the minimum external quadrangle.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910535813.3A CN110443817B (en) | 2019-06-20 | 2019-06-20 | Method for improving image segmentation precision |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110443817A true CN110443817A (en) | 2019-11-12 |
CN110443817B CN110443817B (en) | 2021-02-02 |
Family
ID=68428313
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910535813.3A Active CN110443817B (en) | 2019-06-20 | 2019-06-20 | Method for improving image segmentation precision |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110443817B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113129323A (en) * | 2021-04-27 | 2021-07-16 | 西安微电子技术研究所 | Remote sensing ridge boundary detection method and system based on artificial intelligence, computer equipment and storage medium |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105321176A (en) * | 2015-09-30 | 2016-02-10 | 西安交通大学 | Image segmentation method based on hierarchical higher order conditional random field |
CN106997466A (en) * | 2017-04-12 | 2017-08-01 | 百度在线网络技术(北京)有限公司 | Method and apparatus for detecting road |
US20180181842A1 (en) * | 2016-12-22 | 2018-06-28 | TCL Research America Inc. | Method and device for quasi-gibbs structure sampling by deep permutation for person identity inference |
CN108876795A (en) * | 2018-06-07 | 2018-11-23 | 四川斐讯信息技术有限公司 | A kind of dividing method and system of objects in images |
CN109285162A (en) * | 2018-08-30 | 2019-01-29 | 杭州电子科技大学 | A kind of image, semantic dividing method based on regional area conditional random field models |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||