CN112085653B

CN112085653B - Parallax image splicing method based on depth of field compensation

Info

Publication number: CN112085653B
Application number: CN202010789913.1A
Authority: CN
Inventors: 朱策
Original assignee: Sichuan Jiuzhou Electric Group Co Ltd
Current assignee: Sichuan Jiuzhou Electric Group Co Ltd
Priority date: 2020-08-07
Filing date: 2020-08-07
Publication date: 2022-09-16
Anticipated expiration: 2040-08-07
Also published as: CN112085653A

Abstract

The invention discloses a parallax image splicing method based on depth of field compensation, which comprises the following steps: s1: estimating the depth of field of a target scene, and segmenting the scene under different depth of field according to different depth of field of the image to be spliced by using a Mask R-CNN image segmentation algorithm; s2: respectively extracting feature points in scenes under different depths of field, and matching the scenes of different depths of field in the images to be spliced according to the feature points; s3: for the matched scenes with different depths of field, selecting N uniform anchor points from the segmented outlines of the different scenes, and dividing Delaunay triangles for the feature points matched with the scenes with different depths of field in combination with the step S2; s4: after the image to be spliced is subjected to scale normalization, calculating a parallax offset vector of the scene with the adjacent depth of field of the image to be spliced according to the matched characteristic points; s5: position compensation is performed based on the parallax offset vector acquired in step S4, and a transform relationship after compensation is acquired based on the feature points after compensation.

Description

Parallax image splicing method based on depth of field compensation

Technical Field

The invention relates to a parallax image splicing method, in particular to a parallax image splicing method based on depth of field compensation.

Background

The image stitching is a technology for obtaining an image with a larger field of view by using the correlation of the overlapping region of a plurality of images through a series of registration algorithms and post-processing algorithms, and mainly comprises a registration process and a combination process. The post-processing process in the combination stage aims at reducing structural distortion caused by inaccurate registration parameters and eliminating pixel distortion of the registered images. Namely, the registration stage obtains transformation parameters which enable the structural distortion of the transformed image to be minimum, and the combination stage eliminates the structural distortion and the pixel distortion to a certain extent.

However, due to the complexity of natural scenes (differences in depth of field leading to occlusion or non-occlusion relationships), the same scene taken at different angles or distances between cameras (base line) tends to have large differences in the images, which are referred to as parallax images.

The traditional image stitching algorithm is suitable for stitching simple scene images: in the scenes with continuously changing depth and the images strictly meeting specific acquisition conditions, most of the splicing methods treat the images as a single plane, and even though the images are divided by grids or other dividing modes, the depth relation among different scenes in the images is still not considered. Therefore, for the parallax images, the traditional stitching algorithm cannot achieve a good stitching effect, and in order to deal with the stitching of the parallax images and analyze the reason of parallax generation, the invention provides a method for stitching according to the scene depth.

Disclosure of Invention

The invention aims to solve the technical problem that the parallax image is difficult to splice or has poor splicing effect by using the conventional image splicing algorithm, and provides a parallax image splicing method based on depth of field compensation to solve the problem.

The invention is realized by the following technical scheme:

the parallax image splicing method based on depth compensation is characterized by comprising the following steps of: s1: estimating the depth of field of a target scene, and segmenting the scene under different depth of field according to different depth of field of the image to be spliced by using a Mask R-CNN image segmentation algorithm; s2: respectively extracting feature points in scenes under different depths of field, and matching the scenes of different depths of field in the images to be spliced according to the feature points; s3: for the matched scenes with different depths of field, selecting N uniform anchor points from the segmentation contours of the different scenes, and dividing Delaunay triangles for the feature points matched with the scenes with different depths of field in combination with the step S2; s4: after the image to be spliced is subjected to scale normalization, calculating a parallax offset vector of the scene with the adjacent depth of field of the image to be spliced according to the matched characteristic points; s5: performing position compensation according to the parallax offset vector acquired in step S4, and acquiring a transform relationship after compensation according to the feature points after compensation; s6: after the transformation relation is determined, sequentially splicing different depth-of-field scenes according to the depth sequence, calculating transformation weight for each Delaunay triangle of each scene, transforming each Delaunay triangle of the reference image and the target image according to the transformation weight, and establishing a lookup table; s7: and registering the whole image through a lookup table, and then carrying out post-processing on the registered image to obtain a final spliced image.

In the prior art, the existing image stitching method does not consider the depth occlusion relationship of scenes, and stitches the scenes at different depths of the image as the same plane, or when the scene is divided, divides the image into grids or adopts other division modes, the division is random, and the occlusion relationship of the scene is ignored, so that the stitching effect of the parallax image is poor. Therefore, according to the situation and the depth estimation result, the application document uses image segmentation algorithms such as Mask R-CNN and the like to finely segment the scenes under different depths of field, and finally splices the scenes under different depths of field.

Further, the step S2 is executed by

And matching scenes with different depths of field.

Where C denotes that the entire scene approximation can be divided into C depth planes, N (C) denotes that the C-th depth scene matches to N pairs of feature points,

an ith feature of a c-th depth scene representing the target image,

an ith feature representing a c-th depth scene of the reference image.

Further, the disparity offset vector in step S4 is obtained by using the following formula,

in the above formula, c represents an index of different depth scenes, for example, if three depth planes are estimated, then c is 0,1,2, and c is 0 to represent sky at infinity;

representing the ith feature matched on the scene with the index of c in the target image; n (c) scene matching with index cThe number of the feature points; parallel (c) represents the disparity offset vector of the c scene relative to the c-1 scene in the two disparity images.

Further, the position compensation in the step S5 adopts

Compensate and pass

Acquiring a transformation relation after compensation; in the above-mentioned formula,

representing the ith feature matched on the scene with the index of c in the target image;

representing the characteristic of the reference image after position compensation on the scene with the index of c; h _c Representing a transformation matrix of a scene with index c in the target image to a scene with index c in the reference image before position compensation; offset H _c Representing a transformation matrix of the scene with index c in the target image to the scene with index c in the reference image after the position compensation.

Further, the method for transforming each delaunay triangle in step S6 is:

wherein the content of the first and second substances,

representing the weight required for the jth Delaunay triangle transformation of the scene with index c between the parallax images; s _c Representing a similarity transformation of a scene with index c between parallax images;

respectively representing referencesThe ith vertex of the jth Delaunay triangle in the scene with the index of c in the image and the target image;

respectively representing the ith vertex of the jth Delaunay triangle in the scene with the index of c in the reference image and the target image after transformation.

Transform time weighting for each Delaunay triangle

Can be according to pi _j The distance between the center point of (a) and the center of the nearest 3 feature points.

Further, each delaunay triangle pi of each depth scene of the reference image and the target image in S6 _j After the vertex of the image is transformed, a lookup table is established, and in step S7, according to the lookup table, when a point in a scene at a certain depth in the image is located inside a triangle, a transformation parameter is directly obtained through the lookup table, and the point is transformed. And after the scenes of all the depths are transformed, obtaining a final spliced image through corresponding post-processing.

Compared with the prior art, the invention has the following advantages and beneficial effects:

1. according to the parallax image splicing method based on the depth of field compensation, the conventional image splicing method is suitable for simple scenes or images acquired under the condition of meeting strict acquisition conditions, but the robustness of the parallax images is low. The method specifically analyzes the specific situation, analyzes the generation reason of the parallax, starts from the aspect of depth of field, firstly carries out scene segmentation on the image according to the depth of field, and splices the segmented image, so that the existing method can be utilized, and the influence of the parallax can be eliminated to a certain extent.

Drawings

The accompanying drawings, which are included to provide a further understanding of the embodiments of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the principles of the invention. In the drawings:

fig. 1 is a schematic imaging diagram in the case of parallel optical axes.

FIG. 2 is a flow chart of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail below with reference to examples and accompanying drawings, and the exemplary embodiments and descriptions thereof are only used for explaining the present invention and are not meant to limit the present invention.

Examples

As shown in fig. 1 and 2, the method for splicing parallax images based on depth compensation of the present invention is characterized by comprising the following steps: s1: estimating the depth of field of a target scene, and segmenting the scene under different depth of field according to different depth of field of the image to be spliced by using Mask R-CNN and other image segmentation algorithms; s2: respectively extracting characteristic points from scenes under different depths of field, and matching the scenes of different depths of field in the images to be spliced according to the characteristic points; s3: for the matched scenes with different depths of field, selecting N uniform anchor points from the segmented outlines of the different scenes, and dividing Delaunay triangles by combining the matched feature points of the scenes with different depths of field in the step S2; s4: after the image to be spliced is subjected to scale normalization, calculating a parallax offset vector of the scene with the adjacent depth of field of the image to be spliced according to the matched characteristic points; s5: performing position compensation according to the parallax offset vector acquired in step S4, and acquiring a transform relationship after compensation according to the feature points after compensation; s6: after the transformation relation is determined, sequentially calculating transformation weights for each Delaunay triangle of different depth-of-field scenes according to the depth sequence, transforming each Delaunay triangle of the reference image and the target image according to the transformation weights, and establishing a lookup table; s7: and registering the whole image through a lookup table, and then carrying out post-processing on the registered image to obtain a final spliced image.

By the step S2

Matching scenes with different depths of field, wherein C represents that the whole scene can be approximately divided into C +1 depthsDegree plane, N (c) denotes that the c-th depth scene matches to N pairs of feature points,

the ith feature of the c-th depth scene representing the target image,

an ith feature representing a c-th depth scene of the reference image.

The disparity offset vector in step S4 is obtained using the following formula,

in the above formula, c represents an index of different depth scenes, for example, if three depth planes are estimated, then c is 0,1,2, and c is 0 representing sky at infinity;

representing the ith feature matched on the scene with the index of c in the target image; n (c) represents the number of feature points matched by the scene with the index of c; parallel (c) represents the disparity offset vector of the c scene relative to the c-1 scene in the two disparity images.

The position compensation in the step S5 adopts

Compensate and pass

And obtaining the transformation relation after compensation, wherein in the formula,

representing the characteristic of the reference image after position compensation on the scene with the index c; h _c Representing a transformation matrix of a scene with index c in the target image to a scene with index c in the reference image before position compensation; offset H _c Representing a transformation matrix of the scene with index c in the target image to the scene with index c in the reference image after the position compensation.

In the step S6, a gaussian weight, a student' S-t weight, a cauchy weight, and the like are selected according to conditions to calculate a transformation weight, each probability distribution has a different ending degree and a different application condition, and specifically, the selection can be performed by comparing test results. When calculating the weights, we only need to assign weights to the distances according to their positions on the normal curve.

Transform time weighting for each Delaunay triangle

Can be according to pi _j Is obtained from the distance of the center point of (a) to the center of the nearest 3 feature points, e.g. by calculating the weight using the Cauchy distribution

The following were used:

γ＝||mid(anchors ^(c) )-mid(kpts ^(c) )|| ²

wherein kpts ^(c) Feature points representing a scene with index c; anchors ^(c) Anchor points on the contour representing the scene with index c; min ₃ (a, b) represents the 3 elements in the b set closest to a; mid (-) denotes the center of the set of points; other weighting functions may be chosen, such as Gaussian weighting, Student's-t weighting, etc.

The check in the step S7The table is looked up according to each Delaunay triangle pi in S6 _j After the vertex is transformed, a lookup table is established, when a point in the image is positioned in the triangle, transformation parameters are directly obtained through the lookup table, and then the point is transformed. The look-up table is built as follows: because the transformation process is to transform the vertex of each Delaunay triangle first, the weighted transformation matrix is used, and a lookup table is established for the vertex of each triangle and the transformation matrix. Therefore, for any point in the image, the triangle is judged to be in, and then the transformation matrix of the triangle is directly used for transformation, so that the transformation matrix does not need to be obtained through weighting again, and the operation time is greatly reduced.

After the weights are calculated, the transformation method for each delaunay triangle in the scene with index c is as follows:

wherein the content of the first and second substances,

respectively representing the ith vertex of the jth Delaunay triangle in the scene with the index of c in the reference image and the target image;

The above-mentioned embodiments, objects, technical solutions and advantages of the present invention are further described in detail, it should be understood that the above-mentioned embodiments are only examples of the present invention, and are not intended to limit the scope of the present invention, and any modifications, equivalent substitutions, improvements and the like made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims

1. The parallax image splicing method based on depth compensation is characterized by comprising the following steps of:

s1: estimating the depth of field of a target scene, and segmenting the scene under different depth of field according to different depth of field of the image to be spliced by using a Mask R-CNN image segmentation algorithm;

s2: respectively extracting characteristic points of scenes under different depths of field, and matching the scenes of different depths of field in the images to be spliced according to the characteristic points

S3: for the matched scenes with different depths of field, selecting N uniform anchor points in the outline of the scene, and dividing Delaunay triangles for the feature points matched with the scenes with different depths of field in combination with the step S2;

s4: after the image to be spliced is subjected to scale normalization, calculating a parallax offset vector according to the matched characteristic points of the scene with the adjacent depth of field of the image to be spliced; the disparity offset vector is obtained using the following formula,

in the above formula, C represents an index of a scene with different depths, and three depth planes are estimated from the depths, where C is 0,1,2, and C is 0 representing the sky at infinity, and C represents a scene in which the whole image can be approximately divided into C depths;

representing the ith feature matched on the scene with the index of c in the target image; n (c) represents the number of feature points matched by the scene with the index of c; parallelx (c) represents a disparity offset vector of the c scene relative to the c-1 scene in the two disparity images;

s5: performing position compensation according to the parallax offset vector acquired in step S4, and acquiring a transform relationship after compensation according to the feature points after compensation; the position compensation adopts

C is 1, …, and is compensated by

Acquiring a transformation relation after compensation; wherein the content of the first and second substances,

an ith feature representing a c-th depth scene of the reference image;

representing the characteristic of the reference image after position compensation on the scene with the index of c; h _c Representing a transformation matrix of a scene with index c in the target image to a scene with index c in the reference image before position compensation; offset H _c Representing a transformation matrix of a scene with the index c in the target image to a scene with the index c in the reference image after the position compensation;

s6: after the transformation relation is determined, calculating transformation weight for the vertex of each Delaunay triangle of different depth-of-field scenes according to the depth sequence, transforming each Delaunay triangle of different depth-of-field scenes of the reference image and the target image according to the transformation weight, and establishing a lookup table; wherein, the transformation method of each Delaunay triangle is as follows:

wherein, the first and the second end of the pipe are connected with each other,

respectively representing the ith vertex of the jth Delaunay triangle in the scene with the index of c in the reference image and the target image after transformation;

s7: registering the whole image through a lookup table, and then performing post-processing on the registered image to obtain a final spliced image, wherein the method specifically comprises the following steps: the method comprises the steps that scenes are transformed according to a lookup table in a depth sequence, when a point in a certain depth scene is located inside a Delaunay triangle of the lookup table, transformation parameters are directly obtained through the lookup table, and the point is further transformed; and carrying out image fusion on the transformed images of the same depth scene of the reference image and the target image, and directly superposing the fused images of different depths to obtain a final spliced image.

2. The method for splicing depth-of-field compensation-based parallax images according to claim 1, wherein the camera imaging principle in step S1 predicts the depth of the scene in the image by knowing the baseline and the camera parameters under the condition that the optical axes are parallel, or predicts the depth of the scene in the image by using a monadepth 2 deep learning algorithm, and segments the scene under different depths of field according to the depth relationship.

3. The method according to claim 1The method for splicing depth-compensated parallax images, wherein the step S2 is performed by

And matching scenes with different depths of field.

4. The method for splicing disparity images based on depth-of-field compensation according to claim 1, wherein an anchor point is selected from the contour of the scene after the segmentation in step S3, and a delaunay triangle is divided between the anchor point and feature points matched to different scenes.

5. The method for depth-compensated parallax image stitching according to claim 1, wherein the transform weight for each Delaunay triangle is

Can be according to pi _j The distance between the center point of (a) and the center of the nearest 3 feature points is obtained, and the weights are calculated by adopting Cauchy distribution

Each Delaunay triangle π is calculated as follows _j After the weight is transformed, transforming the vertex of the Delaunay triangle according to the weight, and establishing a lookup table;

γ＝||mid(anchors ^(c) )-mid(kpts ^(c) )|| ²

wherein kpts ^(c) Feature points representing a scene with index c; anchors ^(c) An anchor point on the outline representing the scene with index c;

min ₃ (a, b) represents the 3 elements in the b set closest to a; mid (-) denotes the center of the point set.