CN103714549A - Stereo image object segmentation method based on rapid local matching - Google Patents
Stereo image object segmentation method based on rapid local matching Download PDFInfo
- Publication number
- CN103714549A CN103714549A CN201310745606.3A CN201310745606A CN103714549A CN 103714549 A CN103714549 A CN 103714549A CN 201310745606 A CN201310745606 A CN 201310745606A CN 103714549 A CN103714549 A CN 103714549A
- Authority
- CN
- China
- Prior art keywords
- triangle
- parallax
- cut zone
- pixel
- image
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 47
- 230000011218 segmentation Effects 0.000 title claims abstract description 40
- 238000010168 coupling process Methods 0.000 claims description 16
- 238000005859 coupling reaction Methods 0.000 claims description 16
- 230000008878 coupling Effects 0.000 claims description 15
- 238000012795 verification Methods 0.000 claims description 10
- 238000004040 coloring Methods 0.000 claims description 9
- 230000003044 adaptive effect Effects 0.000 claims description 7
- 238000005070 sampling Methods 0.000 claims description 5
- 238000010276 construction Methods 0.000 claims description 4
- 230000001419 dependent effect Effects 0.000 claims description 3
- 238000001514 detection method Methods 0.000 claims description 3
- 239000004576 sand Substances 0.000 claims description 3
- 238000010200 validation analysis Methods 0.000 claims description 3
- 238000013461 design Methods 0.000 claims description 2
- 230000000694 effects Effects 0.000 abstract description 6
- 239000003086 colorant Substances 0.000 abstract 1
- 238000000638 solvent extraction Methods 0.000 description 7
- 238000011160 research Methods 0.000 description 4
- 239000007787 solid Substances 0.000 description 3
- 230000002159 abnormal effect Effects 0.000 description 2
- 230000032683 aging Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 238000003709 image segmentation Methods 0.000 description 1
- 238000012428 routine sampling Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Images
Landscapes
- Image Analysis (AREA)
Abstract
Provided is a stereo image object segmentation method based on rapid local stereo matching. The stereo image object segmentation method based on rapid local stereo matching comprises the steps of obtaining a reliable matching point on an image, and using Delaunay triangulation to carry out interpolation so as to obtain the parallax; building a graph, using a pixel as a vertex, and using connecting lines formed between the pixel and eight-site neighborhood pixels of the pixel as edges of the graph, wherein the weight of each edge is determined by the colors of the connected pixels and parallax information; determining whether the areas where every two connected pixels are located belong to the same segmentation area or not through a Kruskal minimum spanning tree strategy according to the weights of the edges, if yes, combining the areas, and if not, keeping the areas unchanged; judging whether the obtained segmentation area belongs to an object needing to be segmented out or not, taking out the segmentation area, and obtaining the final object. The stereo image object segmentation method based on rapid local stereo matching is rapid and effective, areas with the discontinuous parallax, such as edges of articles can be effectively processed, a plurality of objects can be rapidly segmented out, time efficiency is high, the segmentation effect is good, and the requirement for automatic and rapid object segmentation can be met.
Description
Technical field
The present invention relates to solid matching method, the Object Segmentation method in binocular stereo vision field, belong to computer vision field, main application is the parallax information of quick obtaining binocular image, thereby utilizing the colouring information of parallax information and picture itself to carry out Object Segmentation, is a kind of stereo-picture Object Segmentation method based on quick local matching.
Background technology
Stereoscopic vision makes people can obtain the depth information in object and scene, is that later stage application comprises that 3D rebuilds and the basis of content analysis.Stereo matching is one of gordian technique of stereoscopic vision, and it is by carrying out to two width or multiple image the depth information that pixel matching obtains pixel.
Coupling degree of accuracy and operation ageing be two central factors of Stereo matching, yet, early stage research show, these two factors are conflicting.General, according to matching strategy, solid matching method can be divided into two classes: overall and local Stereo matching.Overall situation solid matching method can produce high accuracy depth figure, yet the time of cost is longer.In contrast, the depth map that sectional perspective matching process produces does not have global approach precision high, but ageing height, can meet in real time or approach real-time utilization demand.So according to the demand of practice, sectional perspective matching process is worth us to carry out deep research.
Parallax interpolation applies in sectional perspective coupling widely, and this is mainly based on keeping changing continuously at local smoothing method regional disparity.Parallax interpolation is carried out in Delta Region to be also used in previous research.These Delta Regions, by using trigonometric ratio to form to some initial matching points, suppose that the parallax in Delta Region keeps changing continuously, and it is feasible that the parallax face that utilizes this Delta Region to form carries out interpolation.Yet if the summit that this triangle appears at the border of object or forms it comprises abnormal initial matching point, it is invalid utilizing this triangle to carry out the parallax that interpolation obtains.For these problems, the sectional perspective matching process in the present invention has provided the solution of oneself, and experimental result has also been confirmed the feasibility of this scheme.
Object Segmentation, image is cut apart, and is the study hotspot of computer vision always, and it is the basis of most of visual problems (as object identification).How to Image Segmentation Using, to be fast and effeciently emphasis and the difficult point of this research field always.The most time complexity of the good algorithm of segmentation effect is high at present, and needs man-machine interactively, and the object needs of cutting apart are manually specified, and the representative of this class algorithm is the partitioning algorithm cutting based on figure.In contrast, the partitioning algorithm that the average drifting of take is representative, time efficiency is high, and does not need artificial input, but its segmentation effect is much worse than greatly figure, cuts algorithm, easily over-segmentation, same target is very easily divided into a plurality of regions.
Summary of the invention
One aspect of the invention provides a kind of sectional perspective matching process based on extended triangular interpolation effectively, the method can effectively solve sectional perspective matching algorithm based on trigonometric ratio in some region, as cross over the triangle of object boundary or the triangle being formed by Exceptional point, the inaccurate problem of interpolation.
Technical scheme of the present invention is: the stereo-picture Object Segmentation method based on quick local matching, comprises the following steps:
1) ask for reliable matching point: the left image after input validation, right image carries out uniform sampling to left image, to sampled point, use adaptive weighting sectional perspective matching process to search optimal match point in right image, when the coupling cost of calculating pixel point, adopt the RGB colouring information of sobel textural characteristics and pixel, then matching result is carried out to later stage verification, finally remaining point is as reliable matching point;
2) reliable matching point is used to Delaunay trigonometric ratio: the reliable matching point that left image is tried to achieve is organized into triangle gridding, each triangle is carried out to reliability judgement, judgement according to being to check that whether the triangle that triangle closes on it coplanar, point in diabolo, according to leg-of-mutton reliability, utilize three triangles on triangle and limit common with it to carry out interpolation and ask parallax; The disparity map obtaining is carried out to later stage verification and obtain final disparity map;
3) to left picture construction figure: each pixel of left image is as the summit of figure, the limit using pixel and its eight neighbour's line as figure, the weights on limit determine by color and the parallax information of the pixel that is connected, parallax information is obtained by final disparity map;
4) utilizing the figure building to carry out image cuts apart: adopt Kruskal minimum spanning tree strategy, sorted from low to high according to the size of weights in limit, according to the weights on limit, determine whether the two pixel regions that are connected belong to same cut zone, the threshold value of judgement changes according to region adaptivity, handle successively all limits in figure, in picture, all pixels all incorporate corresponding region into, obtain cut zone;
5) extract interested object in cut zone: to the cut zone obtaining, according to parallax size, the compactness of cut zone and the ratio of the shared picture in its entirety of cut zone, judge whether to belong to the object that need to split, according to object, divide the cut zone of taking out each object, complete Object Segmentation.
Later stage verification adopts left and right consistency detection.
Described step 2), in, to each triangle, utilize its triangle closing on to carry out reliability judgement and be specially:
According to the angle theta of leg-of-mutton planar process vector, judge that whether triangle is coplanar:
two triangle f
1with f
2normal vector, if angle theta is less than the threshold tau setting in advance
f, two triangle f
1with f
2think and be positioned at same plane, coplanar; Otherwise be positioned at Different Plane;
The fiduciary level γ of triangle f (f) is calculated as:
T
rthe adjacent triangle number coplanar with triangle f, t
ntriangle total number adjacent with triangle f, τ
rit is the parameter that fiduciary level is adjusted.
Further, step 2), according to leg-of-mutton reliability, ask parallax to be specially:
If triangle f
preliably, its triangle closing on is all coplanar with it, and fiduciary level is 1, utilizes parallax that triangle determines to ask parallax in the face of the some p in it:
If triangle f
punreliable, to the pixel in it, adopt Bayesian model to ask parallax, with triangle f
pand the triangle that closes on common limit is as model priori, increases a parameterized level and smooth item as model priori simultaneously, Bayesian model is as follows:
P(D|I,F)∝P(I|D)P(F|D)P(D) (3)
Wherein D represents parallax, I representative image, and F represents triangle, P represents the probability in model;
According to Bayesian model, asking parallax is exactly the posterior probability problem of maximum value formula (3), and maximum value posterior probability is equivalent to the local energy function that minimal value is following:
E(p,f
p)=E
data(p,f
p)+λ
sE
smooth(p)+λ
fE
f(f
p) (5)
E (p, f
p) represent required triangle f
ptotal coupling cost of interior some p; E
data(p, f
p) be that postulated point p is at triangular facet f
pon coupling cost; E
smooth(p) it is level and smooth, for describing the relation between a p and consecutive point, E
f(f
p) be parallax face f
pinsecure penalty term, for describing f
pplane reliability, λ
sand λ
frespectively level and smooth and f
pthe control parameter of fiduciary level.
In step 3) during design of graphics, based on hypothesis: the pixel of the color similarity closing on should be positioned at same target, the two consistent pixels of depth information of closing on be positioned at same target, the weights on setting limit are determined jointly by color and the parallax of two pixels that are connected, if two pixels that in figure, limit is connected all comprise parallax information, by parallax and color according to the weights on the ratio-dependent limit of 7:3, otherwise, only according to colouring information, determine weights.
As optimal way, step 4), in, carry out for the first time after image cuts apart, carrying out post-processed: in conjunction with parallax information, the thinner cut zone that belongs to same parallax face of cutting apart of closing on is merged, for the fineness of cutting apart that judges whether to merge, according to image, cut apart Location of requirement here.
Step 5) in, need the judgment principle of the object that splits to be:
A) region of parallax maximum: the degree of depth is forward, object is close to camera lens, in the remarkable position of vision;
B) cut zone is compact: in cut zone, the ratio of number of pixels and cut zone place minimum rectangular area area is minimum;
C) ratio of the shared whole image-region of cut zone is moderate: Aspect Ratio meets predefined parameter:
According to mentioned above principle, take out cut zone, obtain cutting object;
Wherein, establish q
numto belong to pixel sum in cut zone, d
avgthe mean parallax of pixel in cut zone, w
minthe minimum abscissa value of cut zone, w
maxmaximum abscissa value, h
minthe minimum ordinate value of cut zone, h
maxbe maximum ordinate value, cut zone is determined by formula (18) minimum rectangle, and W and H are respectively the length of this minimum rectangle and wide, W
allwith H
allthe length of entire image and wide:
W=w
max-w
min,H=h
max-h
min (18)
To the C that is in conformity with the principle) cut zone, according to each cut zone complexity O, sort from big to small, take out and to come n object above, the complexity of cutting apart is:
Take out the minimum rectangle mask of cut zone, carry out binaryzation cavity and fill, then this mask is acted on to the object that former figure obtains splitting.
The present invention compared with prior art has the following advantages
The sectional perspective matching algorithm that the present invention proposes can effectively solve the existing sectional perspective matching algorithm based on trigonometric interpolation in the inaccurate problem of parallax discontinuity zone interpolation, and these regions comprise near object boundary and the Delta Region being comprised of abnormal initial matching point.Than the overall Stereo Matching Algorithm of present main flow, although the method disparity map lower and that obtain on matching precision is not dense, this does not affect the utilization of later stage on cut apart.The matching characteristic adopting due to the method is sobel Edge texture, so between object and object the abundant local parallax of textural characteristics mate more accurate, can effectively instruct Object Segmentation, and for the unconspicuous region of textural characteristics, although parallax information is inaccurate or disappearance, rely on colouring information also can successfully the pixel of same target be linked together.In addition, sharpest edges of sectional perspective matching process of the present invention are that time efficiency is high, have more practicality.
The Object Segmentation Algorithm based on color and the degree of depth that the present invention proposes can fast and effeciently be partitioned into a plurality of objects in image, compare with the global segmentation algorithm cutting based on figure of existing main flow, although segmentation effect is poor, but in cutting procedure, do not need man-machine interactively, cutting object be a plurality of and not background solely cut apart, and time efficiency is higher; Segmentation effect is compared with mean shift algorithm, and sliced time, efficiency was higher, and the non-over-segmentation of segmentation result.
Accompanying drawing explanation
Fig. 1 is implementing procedure of the present invention.
Fig. 2 is the implementing procedure of quick sectional perspective matching process of the present invention.
Fig. 3 is the implementing procedure of Object Segmentation of the present invention.
Fig. 4 is that the present invention carries out the model of interpolation to the point in unreliable triangle.
Embodiment
The present invention proposes a kind of stereo-picture Object Segmentation method based on quick local matching, comprise the following steps:
1) the left and right binocular stereo image after input validation, carries out routine sampling to left image, and the object of sampling is mainly that left and right image is all original input in order to guarantee the efficiency of algorithm, here left image is sampled, and right figure temporarily also has no relations.Then sampled point is used to the adaptive weighting sectional perspective matching process that classical matching precision is high, when the coupling cost of calculating pixel point, adopt the RGB colouring information of sobel textural characteristics and pixel, then matching result is used to later stage verification, as left and right consistency detection, obtain final match point reliably.
2) to step 1) the reliable matching point obtained uses Delaunay trigonometric ratio, and view picture image is covered by triangle gridding.To each triangle, utilize the triangle closing on to calculate his fiduciary level.If triangle is reliable, utilize parallax that this triangle determines to ask parallax in the face of the pixel interpolation in it; If unreliable, utilize this triangle and close on three altogether limit triangle pairs pixel interpolation in it ask parallax, the disparity map obtaining is carried out to later stage verification obtains final disparity map equally, this disparity map may be sparse.
3) to left picture construction figure, pixel is as the summit of figure, and pixel and its eight neighbour's line is as the limit of figure.The weights on limit are determined by color and the parallax of two pixels that are connected.
4) adopt Kruskal minimum spanning tree strategy, sorted from low to high according to the size of weights in limit, according to the weights on limit, determine whether the two pixel regions that are connected belong to same cut zone, and the threshold value of judgement changes according to region adaptivity.Handle successively all limits in figure, in picture, all pixels all incorporate corresponding region into.The easy over-segmentation of initial partitioning, in conjunction with parallax information, merges the thinner cut zone that belongs to same parallax face of cutting apart of closing in post-processed.
5) to the cut zone obtaining, according to parallax size, the compactness of cut zone and the scale of the shared picture in its entirety of cut zone, judge whether to belong to the object that need to obtain, take out cut zone, the object that obtains finally splitting.
If Fig. 1 is implementing procedure figure of the present invention, first to the left and right binocular stereo image after verification, utilize the method for existing adaptive weighting to ask reliable matching point.In order to guarantee the efficiency of algorithm, first left image is carried out to uniform sampling, the sample window that this method adopts is 3 * 3.For sampled point, by the method for classical adaptive weighting, in right image, search optimal match point.When assembling coupling cost according to local window in pixel self information and the definite weight of geometric distance carry out cost accumulative total and be added.Given pixel p, for the arbitrary pixel q in local window, the weight of q is by the color distortion c between p, q
pqwith Euclidean distance g
pqdetermine, in explanation of the present invention, the point in triangle is also pixel:
σ
cwith σ
gthat the gathering cost C ' (p, d) of each pixel is as follows for controlling the parameter of color and Euclidean distance:
Ω
pthe local window of pixel p, C (q, d) be some q (x, y) in left image with right image in some q
dthe coupling cost of (x-d, y), the textural characteristics sobel difference that coupling cost of the present invention is pixel and the RGB colouring information of pixel.Then adopt the strategy of winner-take-all (winner-take-all) to find the parallax that mates Least-cost.
D(p)=arg min
d C′(p,d) (8)
D (p) is the parallax of pixel p, be the pixel p (x in left image, y) the pixel p (x-D (p) and in right image, y) coupling, all sampled points in left image are asked for after parallax, used post-processed, as left and right consistency check, remove some points not too reliably, final remaining point is exactly the reliable matching point needing.Use Delaunay trigonometric ratio that these reliable points are organized into triangle, all pixels of left image are all covered by triangle gridding, as shown in Figure 5.
To each triangle, utilize the triangle closing on to calculate his fiduciary level.In most cases, pixel in same target, their parallax should be positioned at same plane, thereby the triangle that is positioned at same target also should have similar parallax plane, if triangle is crossed over the border of object or formed leg-of-mutton point is insecure, the parallax face that this triangle determines is not suitable for that the point in it is carried out to interpolation and asks parallax, and it is insecure.Thereby the present invention proposes to utilize its triangle closing on to judge each leg-of-mutton reliability, the triangle that is applicable to reliably interpolation should be to its triangle around on similar parallax face.Judging whether triangle is positioned at same plane can be according to the angle of their planar process vector:
two triangle f
1with f
2normal vector, if angle theta is less than a certain threshold tau arranging in advance
f, two triangle f
1with f
2approximate think be positioned at same plane; Otherwise be positioned at Different Plane.If more these the leg-of-mutton reliabilities of coplanar triangle are also just higher, otherwise reliability is lower.The fiduciary level γ of triangle f (f) is calculated as:
T
rthe adjacent triangle number coplanar with triangle f, t
nadjacent triangle total number, τ
rit is the parameter that fiduciary level is adjusted.
A) for reliable triangle γ (f
p)=1, its triangle closing on is all coplanar with it, and fiduciary level is 1, and the parallax plane of utilizing this triangle to determine is asked parallax to the point interpolation in it:
P
xwith p
ytriangle f
pthe coordinate of interior some p,
with
triangle f
pthe parallax plane determining;
B) for insecure triangle γ (f
p) < 1, utilize this triangle and be total to its interior pixel p interpolation of three triangle pairs of closing on limit and ask parallax.To the pixel in it, adopt Bayesian model to ask parallax, as Fig. 4, with triangle f
pand the triangle that closes on common limit is as model priori, increases a parameterized level and smooth item as model priori simultaneously, Bayesian model is as follows:
P(D|I,F)∝P(I|D)P(F|D)P(D) (3)
D is calculative parallax, and I is image itself, and F is the parallax face that triangle determines, is also triangle self, and P is the probability of Bayesian model.According to Bayesian model, asking parallax is exactly the problem of maximum value posterior probability, and maximum value posterior probability is equivalent to the local energy function that minimal value is following:
E(p,f
p)=E
data(p,f
p)+λ
sE
smooth(p)+λ
fE
f(f
p) (5)
E (p, f
p) be total coupling cost of a p; E
data(p, f
p) be that postulated point p is at triangle f
pcoupling cost on the parallax face determining; E
smooth(p) be level and smooth, as fruit dot p and adjacent pixel do not have corresponding penalty value, E at same parallax face
smooth(p) for describing the relation between a p and consecutive point, E
f(f
p) be parallax face f
pinsecure penalty term, for describing f
pplane reliability, λ
sand λ
frespectively level and smooth and f
pthe control parameter of fiduciary level.
In the present invention, the triangle of recording, parallax face and triangular facet three refer to same, and just, when mentioning this face from different perspectives, title is described upper some difference.For example, while describing from the angle of Delaunay trigonometric ratio, be called triangle, when the angle of parallax is described, be called parallax face, while describing from the angle of plane, be called triangular facet.
The disparity map obtaining is carried out to later stage verification obtains final disparity map equally, and this disparity map may be sparse.
Generally, suppose that in triangle, parallax is continually varying, for the point in triangle, adopt the definite parallax face of triangle to carry out interpolation and can determine parallax, as formula (4).Yet this simple interpolation is crossed over object boundaries to those or the triangle that is comprised of initial Exceptional point is inapplicable.According to most of real-life scenes, the parallax of putting in unreliable triangle is very likely on the reliable leg-of-mutton parallax face of its neighbour, so adopt the parallax face closing on effectively to carry out parallax interpolation to the point in unreliable triangle.The optimal number of closing on parallax face is that the detail by sampling density and image determines.In addition, time cost also needs to take into account, the computation complexity of method and the quantity correlation that closes on parallax face.Moreover, close on parallax face from reference point more away from, its impact on this point is less, so the present invention closes on 3 triangle f on common limit the most at last
n1, f
n2, f
n3as candidate's the parallax face that closes on, add the parallax face of intermediate triangle f, finally the candidate's parallax face for interpolation model forms by four:
In energy function (5), E
data(p, f
p) be that hypothesis pixel p is at parallax face f
pon coupling cost; E
smooth(p) be 4 neighbours level and smooth between matched pixel.Consider efficiency of algorithm, only adopt single treatment, the pixel that starts to have mated most may be fewer, but this can't cause relatively large deviation to result, because whole level and smooth is non-hard limit in energy function.
E
f(f
p) be plane f
punreliability penalty term, can determine f according to formula (2)
pfiduciary level.
Minimization energy function (5), finds best parallax face according to winner-take-all strategy
Find best parallax face, according to formula (4), calculate the parallax of a p.
The disparity map generating is used to post-processed, as left and right consistency check, obtain reliable sparse disparities figure.
Step 3) to left picture construction figure, as shown in Figure 3, pixel is as the summit of figure, and each pixel and its eight neighbour's line is as limit, and the weights on limit size determines jointly according to color and the depth information of two pixels that are connected:
If two pixels that are connected have reliable parallax value, color and parallax are according to the ratio-dependent weights of 3:7; Otherwise, only adopt colouring information as weights.
If p
r, p
g, p
bthe RGB triple channel color value of pixel p, q
r, p
g, p
bthe RGB triple channel color value of pixel p, color distortion W
cas formula (14)
P
d, q
dpixel p, the parallax value of q, parallax difference W
das formula (15)
W
d=|p
d-q
d| (15)
The weight w on limit is as formula (16)
P is two pixels that are connected with on one side with q, if p and q contain parallax, while determining weights, parallax difference accounts for 0.7, and color distortion accounts for 0.3; Otherwise get 0.5 of color distortion, determining weights, why get 0.5 here, is mainly in order to make color be unlikely to too strong to the impact of cutting apart in the situation that not considering parallax.
Built figure, adopted Kruskal ' s minimum spanning tree strategy to carry out image and cut apart, the complexity of cutting apart is O (mlogm), and concrete partitioning algorithm is as follows:
Input figure G=(V, E), output cut zone: S=(C
1..., C
r)
Opposite side E sorts according to non-order of falling, and after sequence, is π=(o
1..., o
m)
A. initial segmentation result is S
0, each pixel is a cut zone
B. to every limit q=1 ..., m repeating step c
C. S on previous segmentation result basis
q-1, cut apart S next time
q, get q bar limit o
q=(v
i, v
jif). according to S
q-1segmentation result, v
iand v
jin different cut zone, and the weight w (o on limit
q) be less than the threshold value of two cut zone, by v
iand v
jtwo cut zone at place merge.Adjust the region threshold after merging.Otherwise S
q=S
q-1.
D. when handling all limits, return
In cutting procedure, the threshold value in region is adaptive change, is mainly in order to guarantee that segmentation result neither can over-segmentation can not cut apart very coarsely yet.
T(C)=Int(C)+τ(C)τ(C)=k/|C| (17)
T (C) is the threshold value of cut zone, and Int (C) is the maximum weights of connected pixel in cut zone, and τ (C) is the value of adaptive change.K is the threshold value arranging in advance, and it is generally used for the size of controlling cut zone, in whole cutting procedure, fixes.| C| is the size of cut zone, i.e. the number of pixel, and it is automatically to change.
Handle successively all limits in figure, in picture, all pixels all incorporate corresponding region into, obtain cut zone.
Although set region threshold, initial partitioning is still than being easier to over-segmentation, as optimal way, in post-processed in conjunction with parallax information, the thinner cut zone that belongs to same parallax face of cutting apart of closing on is merged, for the fineness of cutting apart that judges whether to merge, according to image, cut apart Location of requirement here.When initial partitioning, can cut apart more like this, then by post-processed, adjust, reach suitable segmentation result.
Step 5), in, to the region obtaining after cutting apart, need to take out wherein suitable object.Q
numto belong to pixel sum in cut zone; d
avgit is the mean parallax of pixel in cut zone; w
minthe minimum abscissa value of cut zone, w
maxit is maximum abscissa value; h
minthe minimum ordinate value of cut zone, h
maxit is maximum ordinate value.The cut zone of taking out can be determined by formula (18) minimum rectangle, and W and H are respectively the length of this minimum rectangle and wide, W
allwith H
allthe length of entire image and wide.
W=w
max-w
min,H=h
max-h
min (18)
Taking out the principle of object needing is
A. parallax is maximum: the degree of depth is the most forward, and object is nearest from camera lens, in the remarkable position of vision.
B. cut zone is compact: in cut zone, the ratio of number of pixels and cut zone place minimum rectangular area area is minimum;
C. the ratio of the shared whole image-region of cut zone is more moderate: Aspect Ratio is suitable, meets the threshold value of setting in advance.
The cut zone of a, b of being in conformity with the principle is directly taken out just passable, if the ratio of the shared entire image of cut zone is proper, the c that is in conformity with the principle, both not too little also not too greatly and Aspect Ratio suitable,
and
, according to the descending sequence of the size of each region O, take out and come n object above, the size of concrete n can freely be specified, and n object is exactly n cut zone.The complexity of cutting apart is:
Take out the minimum rectangle mask of cut zone, carry out binaryzation cavity and fill, then this mask is acted on to the object that former figure obtains splitting.
The Stereo Matching Algorithm that the present invention proposes is effective fast, effectively the parallax discontinuity zone such as handled object edge; Partitioning algorithm can be partitioned into a plurality of objects rapidly, and time efficiency of the present invention is high, and segmentation effect is good, can meet the demand of fast automatic Object Segmentation.
Claims (7)
1. the stereo-picture Object Segmentation method based on quick local matching, is characterized in that comprising the following steps:
1) ask for reliable matching point: the left image after input validation, right image carries out uniform sampling to left image, to sampled point, use adaptive weighting sectional perspective matching process to search optimal match point in right image, when the coupling cost of calculating pixel point, adopt the RGB colouring information of sobel textural characteristics and pixel, then matching result is carried out to later stage verification, finally remaining point is as reliable matching point;
2) reliable matching point is used to Delaunay trigonometric ratio: the reliable matching point that left image is tried to achieve is organized into triangle gridding, each triangle is carried out to reliability judgement, judgement according to being to check that whether the triangle that triangle closes on it coplanar, point in diabolo, according to leg-of-mutton reliability, utilize three triangles on triangle and limit common with it to carry out interpolation and ask parallax; The disparity map obtaining is carried out to later stage verification and obtain final disparity map;
3) to left picture construction figure: each pixel of left image is as the summit of figure, the limit using pixel and its eight neighbour's line as figure, the weights on limit determine by color and the parallax information of the pixel that is connected, parallax information is obtained by final disparity map;
4) utilizing the figure building to carry out image cuts apart: adopt Kruskal minimum spanning tree strategy, sorted from low to high according to the size of weights in limit, according to the weights on limit, determine whether the two pixel regions that are connected belong to same cut zone, the threshold value of judgement changes according to region adaptivity, handle successively all limits in figure, in picture, all pixels all incorporate corresponding region into, obtain cut zone;
5) extract interested object in cut zone: to the cut zone obtaining, according to parallax size, the compactness of cut zone and the ratio of the shared picture in its entirety of cut zone, judge whether to belong to the object that need to split, according to object, divide the cut zone of taking out each object, complete Object Segmentation.
2. the stereo-picture Object Segmentation method based on quick local matching according to claim 1, is characterized in that described later stage verification adopts left and right consistency detection.
3. the stereo-picture Object Segmentation method based on quick local matching according to claim 1, is characterized in that described step 2) in, to each triangle, utilize its triangle closing on to carry out reliability judgement and be specially:
According to the angle theta of leg-of-mutton planar process vector, judge that whether triangle is coplanar:
two triangle f
1with f
2normal vector, if angle theta is less than the threshold tau setting in advance
f, two triangle f
1with f
2think and be positioned at same plane, coplanar; Otherwise be positioned at Different Plane;
The fiduciary level γ of triangle f (f) is calculated as:
T
rthe adjacent triangle number coplanar with triangle f, t
ntriangle total number adjacent with triangle f, τ
rit is the parameter that fiduciary level is adjusted.
4. the stereo-picture Object Segmentation method based on quick local matching according to claim 3, is characterized in that step 2) in, according to leg-of-mutton reliability, ask parallax to be specially:
If triangle f
preliably, its triangle closing on is all coplanar with it, and fiduciary level is 1, utilizes parallax that triangle determines to ask parallax in the face of the some p in it:
If triangle f
punreliable, to the pixel in it, adopt Bayesian model to ask parallax, with triangle f
pand the triangle that closes on common limit is as model priori, increases a parameterized level and smooth item as model priori simultaneously, Bayesian model is as follows:
P(D|I,F)∝P(I|D)P(F|D)P(D) (3)
Wherein D represents parallax, I representative image, and F represents triangle, P represents the probability in model;
According to Bayesian model, asking parallax is exactly the posterior probability problem of maximum value formula (3), and maximum value posterior probability is equivalent to the local energy function that minimal value is following:
E(p,f
p)=E
data(p,f
p)+λ
sE
smooth(p)+λ
fE
f(f
p) (5)
E (p, f
p) represent required triangle f
ptotal coupling cost of interior some p; E
data(p, f
p) be that postulated point p is at triangular facet f
pon coupling cost; E
smooth(p) it is level and smooth, for describing the relation between a p and consecutive point, E
f(f
p) be parallax face f
pinsecure penalty term, for describing f
pplane reliability, λ
sand λ
frespectively level and smooth and f
pthe control parameter of fiduciary level.
5. the stereo-picture Object Segmentation method based on quick local matching according to claim 1, it is characterized in that in step 3) during design of graphics, based on hypothesis: the pixel of the color similarity closing on should be positioned at same target, the two consistent pixels of depth information of closing on be positioned at same target, the weights on setting limit are determined jointly by color and the parallax of two pixels that are connected, if two pixels that in figure, limit is connected all comprise parallax information, by parallax and color according to the weights on the ratio-dependent limit of 7:3, otherwise, only according to colouring information, determine weights.
6. the stereo-picture Object Segmentation method based on quick local matching according to claim 1, it is characterized in that step 4) in, carry out for the first time after image cuts apart, carry out post-processed: in conjunction with parallax information, the thinner cut zone that belongs to same parallax face of cutting apart of closing on is merged, for the fineness of cutting apart that judges whether to merge, according to image, cut apart Location of requirement here.
7. the stereo-picture Object Segmentation method based on quick local matching according to claim 1, is characterized in that step 5) in need the judgment principle of the object that splits to be:
A) region of parallax maximum: the degree of depth is forward, object is close to camera lens, in the remarkable position of vision;
B) cut zone is compact: in cut zone, the ratio of number of pixels and cut zone place minimum rectangular area area is minimum;
C) ratio of the shared whole image-region of cut zone is moderate: Aspect Ratio meets predefined parameter:
According to mentioned above principle, take out cut zone, obtain cutting object;
Wherein, establish q
numto belong to pixel sum in cut zone, d
avgthe mean parallax of pixel in cut zone, w
minthe minimum abscissa value of cut zone, w
maxmaximum abscissa value, h
minthe minimum ordinate value of cut zone, h
maxbe maximum ordinate value, cut zone is determined by formula (18) minimum rectangle, and W and H are respectively the length of this minimum rectangle and wide, W
allwith H
allthe length of entire image and wide:
W=w
max-w
min,H=h
max-h
min (18)
To with the C that is in conformity with the principle) cut zone, according to each cut zone complexity O, sort from big to small, take out and to come n object above, the complexity of cutting apart is:
Take out the minimum rectangle mask of cut zone, carry out binaryzation cavity and fill, then this mask is acted on to the object that former figure obtains splitting.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310745606.3A CN103714549B (en) | 2013-12-30 | 2013-12-30 | Based on the stereo-picture object dividing method of quick local matching |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310745606.3A CN103714549B (en) | 2013-12-30 | 2013-12-30 | Based on the stereo-picture object dividing method of quick local matching |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103714549A true CN103714549A (en) | 2014-04-09 |
CN103714549B CN103714549B (en) | 2016-06-08 |
Family
ID=50407492
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310745606.3A Active CN103714549B (en) | 2013-12-30 | 2013-12-30 | Based on the stereo-picture object dividing method of quick local matching |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103714549B (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104166988A (en) * | 2014-07-10 | 2014-11-26 | 北京工业大学 | Sparse matching information fusion-based three-dimensional picture synchronization segmentation method |
CN106886988A (en) * | 2015-12-11 | 2017-06-23 | 中国科学院深圳先进技术研究院 | A kind of linear goal detection method and system based on unmanned aerial vehicle remote sensing |
CN105719316B (en) * | 2015-09-29 | 2018-08-03 | 温州大学 | A kind of stereo-picture dividing method based on interactive mode |
CN108492326A (en) * | 2018-01-31 | 2018-09-04 | 北京大学深圳研究生院 | The resolution ratio solid matching method gradually refined from low to high and system |
TWI658720B (en) * | 2016-12-30 | 2019-05-01 | 創研光電股份有限公司 | Method for using zoom image information to improve mismatch of original parallax image |
CN107341822B (en) * | 2017-06-06 | 2019-11-08 | 东北大学 | A kind of solid matching method based on the polymerization of minimum branch cost |
CN110956195A (en) * | 2019-10-11 | 2020-04-03 | 平安科技(深圳)有限公司 | Image matching method and device, computer equipment and storage medium |
CN111402152A (en) * | 2020-03-10 | 2020-07-10 | 北京迈格威科技有限公司 | Disparity map processing method and device, computer equipment and storage medium |
CN113012270A (en) * | 2021-03-24 | 2021-06-22 | 纵深视觉科技(南京)有限责任公司 | Stereoscopic display method and device, electronic equipment and storage medium |
CN113947625A (en) * | 2021-10-15 | 2022-01-18 | 中国矿业大学 | Binocular image parallax calculation optimization method for fine modeling of parallax surface |
CN117455969A (en) * | 2023-11-27 | 2024-01-26 | 南京邮电大学通达学院 | High-precision edge protection type local stereo matching parallax estimation method |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6873723B1 (en) * | 1999-06-30 | 2005-03-29 | Intel Corporation | Segmenting three-dimensional video images using stereo |
US20080025633A1 (en) * | 2006-07-25 | 2008-01-31 | Microsoft Corporation | Locally adapted hierarchical basis preconditioning |
CN103336946A (en) * | 2013-06-17 | 2013-10-02 | 浙江大学 | Binocular stereoscopic vision based clustered tomato identification method |
-
2013
- 2013-12-30 CN CN201310745606.3A patent/CN103714549B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6873723B1 (en) * | 1999-06-30 | 2005-03-29 | Intel Corporation | Segmenting three-dimensional video images using stereo |
US20080025633A1 (en) * | 2006-07-25 | 2008-01-31 | Microsoft Corporation | Locally adapted hierarchical basis preconditioning |
CN103336946A (en) * | 2013-06-17 | 2013-10-02 | 浙江大学 | Binocular stereoscopic vision based clustered tomato identification method |
Non-Patent Citations (3)
Title |
---|
KLAUS A ET AL: "Segment-Based Stereo Matching Using Belief Propagation and a Self-Adapting Dissimilarity Measure", 《2006 18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION》, 31 December 2006 (2006-12-31), pages 1 - 5 * |
LI HONG ET AL: "Segment-based Stereo Matching Using Graph Cuts", 《PROCESSING OF THE 2004 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION》, vol. 1, 31 December 2004 (2004-12-31), pages 1 - 8 * |
袁方 等: "一种基于三维Delaunay 三角化的曲面重建算法", 《计算机技术与发展》, vol. 21, no. 10, 10 October 2011 (2011-10-10), pages 14 - 18 * |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104166988B (en) * | 2014-07-10 | 2017-06-06 | 北京工业大学 | A kind of stereo sync dividing method for incorporating sparse match information |
CN104166988A (en) * | 2014-07-10 | 2014-11-26 | 北京工业大学 | Sparse matching information fusion-based three-dimensional picture synchronization segmentation method |
CN105719316B (en) * | 2015-09-29 | 2018-08-03 | 温州大学 | A kind of stereo-picture dividing method based on interactive mode |
CN106886988A (en) * | 2015-12-11 | 2017-06-23 | 中国科学院深圳先进技术研究院 | A kind of linear goal detection method and system based on unmanned aerial vehicle remote sensing |
CN106886988B (en) * | 2015-12-11 | 2020-07-24 | 中国科学院深圳先进技术研究院 | Linear target detection method and system based on unmanned aerial vehicle remote sensing |
TWI658720B (en) * | 2016-12-30 | 2019-05-01 | 創研光電股份有限公司 | Method for using zoom image information to improve mismatch of original parallax image |
CN107341822B (en) * | 2017-06-06 | 2019-11-08 | 东北大学 | A kind of solid matching method based on the polymerization of minimum branch cost |
CN108492326B (en) * | 2018-01-31 | 2021-11-23 | 北京大学深圳研究生院 | Stereo matching method and system with gradually refined resolution ratio from low to high |
CN108492326A (en) * | 2018-01-31 | 2018-09-04 | 北京大学深圳研究生院 | The resolution ratio solid matching method gradually refined from low to high and system |
CN110956195A (en) * | 2019-10-11 | 2020-04-03 | 平安科技(深圳)有限公司 | Image matching method and device, computer equipment and storage medium |
CN110956195B (en) * | 2019-10-11 | 2023-06-02 | 平安科技(深圳)有限公司 | Image matching method, device, computer equipment and storage medium |
CN111402152A (en) * | 2020-03-10 | 2020-07-10 | 北京迈格威科技有限公司 | Disparity map processing method and device, computer equipment and storage medium |
CN111402152B (en) * | 2020-03-10 | 2023-10-24 | 北京迈格威科技有限公司 | Processing method and device of disparity map, computer equipment and storage medium |
CN113012270A (en) * | 2021-03-24 | 2021-06-22 | 纵深视觉科技(南京)有限责任公司 | Stereoscopic display method and device, electronic equipment and storage medium |
CN113947625A (en) * | 2021-10-15 | 2022-01-18 | 中国矿业大学 | Binocular image parallax calculation optimization method for fine modeling of parallax surface |
CN113947625B (en) * | 2021-10-15 | 2024-06-21 | 中国矿业大学 | Binocular image parallax calculation optimization method for parallax surface fine modeling |
CN117455969A (en) * | 2023-11-27 | 2024-01-26 | 南京邮电大学通达学院 | High-precision edge protection type local stereo matching parallax estimation method |
Also Published As
Publication number | Publication date |
---|---|
CN103714549B (en) | 2016-06-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103714549B (en) | Based on the stereo-picture object dividing method of quick local matching | |
US20240346675A1 (en) | Systems and Methods for Hybrid Depth Regularization | |
CN102074014B (en) | Stereo matching method by utilizing graph theory-based image segmentation algorithm | |
CN106570468B (en) | A method of rebuilding LiDAR original point cloud contour of building line | |
Broggi et al. | The single frame stereo vision system for reliable obstacle detection used during the 2005 DARPA grand challenge on TerraMax | |
CN102324102B (en) | Method for automatically filling structure information and texture information of hole area of image scene | |
CN103413322B (en) | Keyframe extraction method of sequence video | |
CN104517317A (en) | Three-dimensional reconstruction method of vehicle-borne infrared images | |
CN104867133A (en) | Quick stepped stereo matching method | |
CN104820991A (en) | Multi-soft-constraint stereo matching method based on cost matrix | |
CN102609950B (en) | Two-dimensional video depth map generation process | |
CN104200453B (en) | Parallax image correcting method based on image segmentation and credibility | |
Pang et al. | Fast and robust multi-view 3d object recognition in point clouds | |
CN106709901A (en) | Simulation fog image generation method based on depth priori | |
CN106485737A (en) | Cloud data based on line feature and the autoregistration fusion method of optical image | |
CN103971366A (en) | Stereoscopic matching method based on double-weight aggregation | |
CN116258817A (en) | Automatic driving digital twin scene construction method and system based on multi-view three-dimensional reconstruction | |
CN103985128A (en) | Three-dimensional matching method based on color intercorrelation and self-adaptive supporting weight | |
CN104517095A (en) | Head division method based on depth image | |
CN104851089A (en) | Static scene foreground segmentation method and device based on three-dimensional light field | |
CN106408596A (en) | Edge-based local stereo matching method | |
CN102447917A (en) | Three-dimensional image matching method and equipment thereof | |
CN108010075A (en) | A kind of sectional perspective matching process based on multiple features combining | |
CN113989758A (en) | Anchor guide 3D target detection method and device for automatic driving | |
CN105225233B (en) | A kind of stereopsis dense Stereo Matching method and system based on the expansion of two classes |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |