CN112950787A - Target object three-dimensional point cloud generation method based on image sequence - Google Patents


Info

Publication number: CN112950787A; granted as CN112950787B
Application number: CN202110225992.8A
Authority: CN (China)
Prior art keywords: image, image sequence, kpm, gray, pixel
Legal status: Granted; Active
Inventors: 袁利毫, 昝英飞, 韩端锋, 祝海涛, 钟声华, 秦浩
Applicant and current assignee: Harbin Engineering University
Priority and filing date: 2021-03-01
Publication date: 2021-06-11 (CN112950787A); grant publication date: 2022-08-02 (CN112950787B)
Other languages: Chinese (zh)

Classifications

    • G06T 17/20: Three-dimensional [3D] modelling; finite element generation, e.g. wire-frame surface description, tessellation
    • G06F 17/16: Complex mathematical operations; matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • G06T 7/136: Image analysis; segmentation; edge detection involving thresholding
    • G06T 2207/10028: Image acquisition modality; range image; depth image; 3D point clouds

Abstract

The invention discloses a method for generating a three-dimensional point cloud of a target object from an image sequence. Starting from segmented images, the segmented target image sequence undergoes feature point detection, extraction and matching, and removal of mismatched pairs to obtain effective matching pairs; three-dimensional point cloud reconstruction is then performed on the effective matching pairs to obtain a target object three-dimensional point cloud model of higher quality and precision. The method effectively improves the efficiency of the three-dimensional modeling process, improves the quality of the three-dimensional point cloud model, and reduces the probability of processing errors at each stage. It is suitable for intelligent robot development and has important engineering value and theoretical guidance significance for the operating efficiency of intelligent robots, especially underwater robots.

Description

Target object three-dimensional point cloud generation method based on image sequence
Technical Field
The invention belongs to the technical field of computer vision simulation, and particularly relates to a method for three-dimensional reconstruction from a continuous sequence of captured images.
Background
With the progress of image processing technology and the development of three-dimensional sensors, demand from three-dimensional application scenarios such as intelligent mobile robots, autonomous driving, and AR has grown rapidly, and all of them require three-dimensional reconstruction technology. Unlike two-dimensional images, a three-dimensional model has a stronger sense of reality and can present more information; unlike manual drawing and modeling, three-dimensional scene reconstruction must reflect the original scene faster, better, and in real time. Three-dimensional modeling from an image sequence alone reduces economic cost, but makes it difficult to guarantee the quality, precision, and accuracy of the constructed model; moreover, reconstruction from multiple consecutive images must account for the error accumulated by observations from different angles, requiring a process of optimization or balancing. Therefore, to achieve fast, high-quality real-scene three-dimensional model reconstruction, research on image-sequence-based three-dimensional reconstruction methods has important engineering value and practical significance.
At present, there are many methods for improving the efficiency and precision of image-sequence three-dimensional reconstruction, which can be roughly divided into two types. One type segments the images into a target image and a background image, reconstructs each in three dimensions separately, and finally realizes the full three-dimensional model through point cloud splicing and fusion. The other type performs three-dimensional reconstruction directly on the scene images and then segments the resulting point cloud model to optimize it.
Disclosure of Invention
The invention aims to solve the problems of low quality and poor accuracy of the existing three-dimensional point cloud model, and provides a target object three-dimensional point cloud generation method based on an image sequence.
The purpose of the invention is realized as follows:
a target object three-dimensional point cloud generation method based on an image sequence comprises the following steps:
acquiring a source image sequence, and calculating a gray level image sequence of the source image sequence;
performing threshold segmentation on each image in the gray image sequence, determining an optimal segmentation threshold, segmenting the gray image sequence according to the optimal segmentation threshold, reserving pixels larger than the optimal segmentation threshold as target object pixels to obtain a target gray image sequence, and correspondingly reserving the target object pixels in the source image sequence to obtain the target image sequence;
detecting feature points of the target image sequence, extracting and storing them, and calculating the modulus and direction of each feature point;
forming effective quasi-matching pairs in the target image sequence, wherein the modulus values and directions of the two feature points in an effective quasi-matching pair are equal; recording KPM'_q as the q-th effective quasi-matching pair of the p-th image and the (p+1)-th image in the target image sequence; deleting the mismatched pairs from the effective quasi-matching pairs to obtain effective matching pairs;
calculating a fundamental matrix F_p between the p-th image and the (p+1)-th image from the obtained effective matching pairs; acquiring the internal parameter matrix K of the camera; computing the essential matrix E_p from the fundamental matrix F_p; performing singular value decomposition on E_p to obtain the camera extrinsic rotation matrix R_p and translation matrix T_p between the p-th image and the (p+1)-th image;
according to the rotation matrix R_p and translation matrix T_p, calculating the position X^p_q of the point cloud data of each effective matching pair between the p-th image and the (p+1)-th image, wherein X' = (x', y', 1) is the position, in the (p+1)-th image, of the feature point of the valid matching pair;
and calculating all the effective matching pairs to form a point cloud data set, transforming the obtained point cloud data set to the same coordinate system, and eliminating repeated data points to generate the three-dimensional point cloud of the target object.
The calculation method of the gray level image sequence of the source image sequence comprises the following steps:
performing gray value calculation on each pixel point of each image in the source image sequence:
Gray=0.299R+0.587G+0.114B (2)
wherein Gray is the gray value and R, G, B are the red, green, and blue channel values of the pixel, respectively;
the gray value of each pixel is stored in the corresponding pixel of a new image sequence, and the new image sequence is taken as the gray image sequence.
The determination method of the optimal segmentation threshold comprises the following steps:
(a1) counting, in the gray image sequence, the number of pixels at each gray level from 0 to 255, with μ(t) (t = 0, 1, 2, ..., 255) being the number of pixels with gray value t;
(a2) calculating the mean gray value μ of the image:
μ = Σ_{t=0}^{255} t·p[μ(t)]
wherein p[μ(t)] is the probability function of the number of pixels μ(t) with gray value t;
(a3) calculating the average gray level μ1 of the target object pixels C0 to be segmented in the image:
μ1 = Σ_{i=0}^{t} i·Pr(i|C0)
wherein Pr(i|C0) is the probability that pixels with gray value i account for the segmented target image pixels C0, and P_i is the probability that pixels with gray value i account for the total number of pixels;
(a4) calculating the proportion ω1 of the target image pixels C0 in the image:
ω1 = N1/Sum
wherein N1 is the number of pixels of the target image C0 and Sum is the total number of image pixels;
(a5) calculating the average gray level μ2 of the background image pixels C1 in the image:
μ2 = Σ_{i=t+1}^{255} i·Pr(i|C1)
wherein Pr(i|C1) is the probability that pixels with gray value i account for the background image pixels C1;
(a6) calculating the proportion ω2 of the background image pixels C1 in the image:
ω2 = N2/Sum
wherein N2 is the number of pixels of the background image C1;
(a7) calculating the optimal segmentation threshold t_best, which is the value of t that maximizes the between-class variance g(t):
g(t) = ω1^(α-2)·(ω1·μ - μ(t))² + ω2^(α-2)·(μ·(ω2 - 1) + μ(t))²
wherein α ∈ (0, 1) is adjusted according to each image's segmentation binarization result.
The detection method of the feature points comprises the following steps, applied to each image pixel value f(x, y) in the target gray image sequence:
(b1) the image pixel values f(x, y) are filtered with Gaussian filters G_σ under different Gaussian smoothing parameters σ:
g_i(x, y) = G_{σi}(x, y) * f(x, y), with G_σ(x, y) = (1/(2πσ²))·exp(-(x² + y²)/(2σ²))
(b2) the DoG response image DoGf of the image pixel values f(x, y) is calculated:
DoGf(x, y) = DoG * f(x, y) = g1(x, y) - g2(x, y) (10)
(b3) steps (b1) and (b2) are repeated to obtain DoGf1(x, y), DoGf2(x, y), DoGf3(x, y);
all values of DoGf2(x, y) are examined: if a value DoGf2(x, y) is the maximum or minimum among DoGf_i(x + n, y + m) (i = 1, 2, 3; n = -1, 0, 1; m = -1, 0, 1), the pixel is marked as a feature point, recorded as KP^m_k, the k-th feature point of the m-th image in the target image sequence, located at (x, y).
The modulus of a feature point is M, calculated as:
M(x, y) = √[(L(x+1, y) - L(x-1, y))² + (L(x, y+1) - L(x, y-1))²] (11)
wherein L is taken at the scale of the feature point KP^m_k:
L(x, y, σ) = G(x, y, σ) * I(x, y) (12)
where σ is an autonomously selected parameter held constant for all feature points.
The direction of a feature point is θ, calculated as:
θ(x, y) = arctan((L(x, y+1) - L(x, y-1)) / (L(x+1, y) - L(x-1, y))) (13).
The method for deleting the mismatched pairs comprises the following steps:
(c1) from the effective quasi-matching pairs KPM'_q, the first M non-collinear sample data are selected, with M ≥ 4, and the optimal homography matrix is calculated, recorded as model M:
s·(x', y', 1)ᵀ = H·(x, y, 1)ᵀ, H = [h11 h12 h13; h21 h22 h23; h31 h32 h33]
wherein (x, y) and (x', y') are the positions of the two feature points of an effective quasi-matching pair KPM'_q, s is a scale parameter, and h33 = 1 is usually taken to normalize the matrix;
(c2) based on model M, the projection error e of all data in the effective quasi-matching pairs KPM'_q is calculated:
e = √[(x' - (h11·x + h12·y + h13)/(h31·x + h32·y + h33))² + (y' - (h21·x + h22·y + h23)/(h31·x + h32·y + h33))²]
if e < 0.75, the data are retained; otherwise they are removed;
(c3) let KPM'_best be the optimal effective matching pair, with initial value KPM'_best = KPM'_1; if the current effective quasi-matching pair KPM'_q has more elements than KPM'_best, then KPM'_best = KPM'_q, and the iteration count kk is updated at the same time:
kk = log(1 - p)/log(1 - w^M)
wherein p is the confidence, taken as 0.995, and w = (number of elements of KPM'_best)/(number of elements of KPM'_q); for the initial iteration count kk0, the number of elements of KPM'_best is taken as 1;
(c4) if the number of iterations exceeds kk, the loop is exited and the effective matching pair KPM_q = KPM'_best is recorded; otherwise, the iteration count is incremented by 1 and step (c3) is repeated.
The internal parameter matrix of the camera is:
K = [fx s x0; 0 fy y0; 0 0 1]
wherein fx, fy are the focal lengths, (x0, y0) are the principal point coordinates, and s is the coordinate-axis skew parameter.
The fundamental matrix F_p between the p-th image and the (p+1)-th image is calculated from:
X'ᵀ·F_p·X = 0
wherein X = K⁻¹·(x, y, 1)ᵀ and X' = K⁻¹·(x', y', 1)ᵀ are the positions of the two feature points of an effective matching pair KPM_q, with the coordinate origin located at the image center.
Compared with the prior art, the invention has the following beneficial effects:
The invention provides a method for generating a three-dimensional point cloud of a target object from an image sequence. Taking image-sequence-based target point cloud generation as the research object, it analyzes in depth the combinations of the various stages of three-dimensional modeling and the strengths and weaknesses of the corresponding algorithms, and designs an image-segmentation-based method for generating a three-dimensional point cloud from a target object image sequence. The invention is suitable for intelligent robot development and has important engineering value and theoretical guidance significance for the operation of intelligent robots, especially the operating efficiency of underwater robots.
Drawings
FIG. 1 is a flow chart of a method for generating a three-dimensional point cloud of a target object based on an image sequence;
FIG. 2 is a flow chart of a target object thresholding method based on an image sequence;
FIG. 3 is a schematic diagram of a feature point extraction algorithm model.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings and specific embodiments.
As shown in fig. 1, the method for generating a three-dimensional point cloud of a target object based on an image sequence specifically includes the following steps:
1. A source image sequence is acquired from the shooting equipment and recorded as S1. The gray value Gray of each pixel of each image in S1 is calculated from its R, G, B three-channel values according to equation (1), and stored in the corresponding pixel of a new image sequence, recorded as the gray image sequence S2.
Gray=0.299R+0.587G+0.114B (1)
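By way of illustration, step 1 can be sketched in Python with NumPy (the function and variable names are illustrative, not from the patent):

```python
import numpy as np

def to_gray_sequence(source_images):
    """Convert the source image sequence S1 (a list of H x W x 3 RGB
    arrays) into the gray image sequence S2 using equation (1):
    Gray = 0.299*R + 0.587*G + 0.114*B."""
    weights = np.array([0.299, 0.587, 0.114])
    return [img.astype(np.float64) @ weights for img in source_images]
```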
2. Each image in the grayscale image sequence S2 is threshold-segmented to determine an optimal segmentation threshold.
The flow chart of the threshold segmentation is shown in fig. 2, and the specific steps are as follows:
the number of pixels corresponding to each gradation level is counted from 0 to 255, and μ (t) (t is 0,1,2, …,255) is the number of pixels corresponding to the gradation value t.
(a1) Counting the number of pixels corresponding to each gray scale from 0 to 255 in the gray image sequence, and taking mu (t) (t is 0,1,2, … and 255) as the number of pixels corresponding to the gray value t;
(a2) calculating a mean of gray values of an image
Figure BDA0002956179770000061
Figure BDA0002956179770000062
Wherein p [ mu (t) ] is a probability function of the number of pixels mu (t) with the gray value of t; (ii) a
(a3) Calculating the pixel C of the target image of the object to be segmented in the image0Average gray level mu of1
Figure BDA0002956179770000063
Where Pr (i | C0) indicates that the number of pixels having a gradation value i accounts for the pixel C of the divided object target image0Probability of (P)iIs the probability of the pixel with the gray value i occupying the total number of pixels.
(a4) Calculating the pixel C of the target image of the object to be segmented in the image0Is the ratio omega of the number of pixels1
Figure BDA0002956179770000064
Wherein N is1Is a target image C0The number of pixels, Sum is the total number of image pixels;
(a5) calculating the average gray level μ of the background image pixel C1 in the image2
Figure BDA0002956179770000065
Where Pr (i | C1) indicates that the number of pixels with a gray level value of i accounts for background image pixel C1Probability of (P)iIs the probability of the pixel with the gray value i occupying the total number of pixels.
(a6) Calculating a background image pixel C in an image1The ratio of the number of pixels:
Figure BDA0002956179770000066
wherein N is2As a background image C1The number of pixels;
(a7) calculating an optimal segmentation threshold tbestSaid optimal segmentation threshold tbestTo maximize the value of t for the inter-class variance g (t),
g(t)=ω1 α-21μ-μ(t))22 α-2(μ(ω2-1)+μ(t))2 (7)
wherein, the value of alpha belongs to (0,1) is adjusted according to each image segmentation binarization effect graph.
According to the optimal segmentation threshold t_best, the gray image sequence S2 is segmented: pixels smaller than t_best are background pixels C1 and pixels larger than t_best are target object pixels C0. The target object pixels C0 are retained to obtain the target gray image sequence SA_black, and the corresponding pixels in the source image sequence S1 are retained to obtain the segmented target image sequence SA.
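A sketch of the threshold search in steps (a1)-(a7) follows; it assumes the cumulative-mean reading of μ(t) in equation (7), and α is the per-image tuning constant (all names are illustrative):

```python
import numpy as np

def best_threshold(gray_img, alpha=0.5):
    """Scan t = 0..255 and return the t_best maximizing the weighted
    between-class variance g(t) of equation (7); alpha in (0, 1)."""
    hist = np.bincount(gray_img.astype(np.uint8).ravel(), minlength=256)
    p = hist / hist.sum()              # P_i: share of pixels with gray i
    levels = np.arange(256)
    mu = float(np.sum(levels * p))     # overall mean gray value
    best_t, best_g = 0, -np.inf
    for t in range(1, 255):
        w1 = float(p[:t + 1].sum())    # omega_1: target pixel proportion
        w2 = 1.0 - w1                  # omega_2: background proportion
        if w1 <= 0.0 or w2 <= 0.0:
            continue
        mu_t = float(np.sum(levels[:t + 1] * p[:t + 1]))  # cumulative mean
        g = (w1 ** (alpha - 2) * (w1 * mu - mu_t) ** 2
             + w2 ** (alpha - 2) * (mu * (w2 - 1) + mu_t) ** 2)
        if g > best_g:
            best_t, best_g = t, g
    return best_t

def segment_target(gray_img, t_best):
    """Keep pixels above t_best as target object pixels C0; the rest
    are background pixels C1, per step 2."""
    return gray_img > t_best
```

As α approaches 1, the expression reduces to the classical Otsu between-class variance, which is one sanity check on this reading of equation (7).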
3. Feature points of the target image sequence SA are detected, extracted, and stored as follows; a flowchart of the feature point extraction method is shown in fig. 3:
Each image pixel value f(x, y) in the target gray image sequence SA_black is processed as follows:
(b1) the image pixel values f(x, y) are filtered with Gaussian filters G_σ under different Gaussian smoothing parameters σ:
g_i(x, y) = G_{σi}(x, y) * f(x, y), with G_σ(x, y) = (1/(2πσ²))·exp(-(x² + y²)/(2σ²)) (8)
(b2) the DoG response image DoGf of the image pixel values f(x, y) is calculated:
DoGf(x, y) = DoG * f(x, y) = g1(x, y) - g2(x, y) (9)
(b3) steps (b1) and (b2) are repeated to obtain DoGf1(x, y), DoGf2(x, y), DoGf3(x, y);
all values of DoGf2(x, y) are examined: if a value DoGf2(x, y) is the maximum or minimum among DoGf_i(x + n, y + m) (i = 1, 2, 3; n = -1, 0, 1; m = -1, 0, 1), the pixel is marked as a feature point, recorded as KP^m_k, the k-th feature point of the m-th image in the target image sequence, located at (x, y).
The modulus M of a feature point KP^m_k is calculated from equation (10):
M(x, y) = √[(L(x+1, y) - L(x-1, y))² + (L(x, y+1) - L(x, y-1))²] (10)
wherein L is taken at the scale of the feature point KP^m_k:
L(x, y, σ) = G(x, y, σ) * I(x, y) (11)
where the parameter σ is selected autonomously and held constant for all feature points KP^m_k.
The direction θ of a feature point KP^m_k is calculated from equation (12):
θ(x, y) = arctan((L(x, y+1) - L(x, y-1)) / (L(x+1, y) - L(x-1, y))) (12)
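The following sketch illustrates steps (b1)-(b3) and the modulus/direction computations of equations (10) and (12), using SciPy's Gaussian filter; the choice of smoothing scales and the stacking of successive differences into DoGf1..DoGf3 are assumptions made for illustration:

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def detect_dog_features(f, sigmas=(1.0, 1.6, 2.56, 4.1)):
    """Steps (b1)-(b3): build DoGf1..DoGf3 as differences of Gaussian
    smoothings of f, then keep pixels where DoGf2 is the extremum of
    its 3x3x3 neighborhood (i = 1, 2, 3; n, m = -1, 0, 1)."""
    g = [gaussian_filter(f.astype(np.float64), s) for s in sigmas]
    dog = np.stack([g[i] - g[i + 1] for i in range(3)])  # DoGf1..DoGf3
    feats = []
    h, w = f.shape
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            cube = dog[:, y - 1:y + 2, x - 1:x + 2]
            v = dog[1, y, x]
            if v == cube.max() or v == cube.min():
                feats.append((x, y))
    return feats

def modulus_and_direction(L, x, y):
    """Equations (10) and (12): gradient magnitude M and direction theta
    at an interior pixel of the smoothed image L = G(sigma) * I;
    arctan2 avoids the division by zero latent in equation (12)."""
    dx = L[y, x + 1] - L[y, x - 1]
    dy = L[y + 1, x] - L[y - 1, x]
    return np.hypot(dx, dy), np.arctan2(dy, dx)
```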
4. The feature points KP^m_k in the target image sequence SA are matched pairwise between consecutive images: a pair of feature points KP whose modulus M and direction θ are equal forms an effective quasi-match, recorded as KPM'_q, the q-th (q = 1, 2, 3, ...) effective quasi-matching pair of the p-th image and the (p+1)-th image (p = 1, 2, 3, ...) in SA. The mismatched pairs are then deleted from the effective quasi-matching pairs, yielding the effective matching pairs.
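A direct reading of step 4, pairing feature points whose modulus and direction agree, might look like the sketch below; exact float equality is brittle in practice, so a small tolerance is assumed:

```python
def quasi_match(feats_p, feats_p1, tol=1e-6):
    """Form effective quasi-matching pairs KPM'_q between image p and
    image p+1: feature tuples (x, y, M, theta) are paired when their
    modulus M and direction theta agree within tol."""
    pairs = []
    for a in feats_p:
        for b in feats_p1:
            if abs(a[2] - b[2]) < tol and abs(a[3] - b[3]) < tol:
                pairs.append((a, b))
    return pairs
```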
The specific steps for deleting the mismatched pairs are as follows:
(c1) from the effective quasi-matching pairs KPM'_q, the first M non-collinear sample data are selected, with M ≥ 4, and the optimal homography matrix is calculated by equation (13), recorded as model M:
s·(x', y', 1)ᵀ = H·(x, y, 1)ᵀ, H = [h11 h12 h13; h21 h22 h23; h31 h32 h33] (13)
wherein (x, y) and (x', y') are the positions of the two feature points of an effective quasi-matching pair KPM'_q, s is a scale parameter, and h33 = 1 is usually taken to normalize the matrix;
(c2) based on model M, the projection error e of all data in the effective quasi-matching pairs KPM'_q is calculated:
e = √[(x' - (h11·x + h12·y + h13)/(h31·x + h32·y + h33))² + (y' - (h21·x + h22·y + h23)/(h31·x + h32·y + h33))²] (14)
if e < 0.75, the data are retained; otherwise they are removed;
(c3) let KPM'_best be the optimal effective matching pair, with initial value KPM'_best = KPM'_1; if the current effective quasi-matching pair KPM'_q has more elements than KPM'_best, then KPM'_best = KPM'_q, and the iteration count kk is updated at the same time:
kk = log(1 - p)/log(1 - w^M) (15)
wherein p is the confidence, taken as 0.995, and w = (number of elements of KPM'_best)/(number of elements of KPM'_q); for the initial iteration count kk0, the number of elements of KPM'_best is taken as 1;
(c4) if the number of iterations exceeds kk, the loop is exited and the effective matching pair KPM_q = KPM'_best is recorded; otherwise, the iteration count is incremented by 1 and step (c3) is repeated.
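Steps (c1)-(c4) amount to RANSAC filtering of the quasi-matches under a homography model; OpenCV's findHomography provides an equivalent off-the-shelf form. The 0.75 threshold and 0.995 confidence come from the text; the rest is an assumption:

```python
import numpy as np
import cv2

def filter_matches(quasi_pairs):
    """Delete mismatched pairs from the quasi-matches KPM'_q, keeping
    pairs whose projection error under the fitted homography is below
    the 0.75 threshold of step (c2)."""
    src = np.float32([(a[0], a[1]) for a, _ in quasi_pairs])
    dst = np.float32([(b[0], b[1]) for _, b in quasi_pairs])
    H, mask = cv2.findHomography(src, dst, cv2.RANSAC,
                                 ransacReprojThreshold=0.75,
                                 confidence=0.995)
    if H is None:
        return []
    return [pair for pair, ok in zip(quasi_pairs, mask.ravel()) if ok]
```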
5. The internal parameters of the camera are obtained from the camera's documentation or by a camera calibration method; the internal parameter matrix K is:
K = [fx s x0; 0 fy y0; 0 0 1]
wherein fx, fy are the focal lengths, (x0, y0) are the principal point coordinates (relative to the imaging plane), and s is the coordinate-axis skew parameter, ideally 0.
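For reference, building this internal parameter matrix from calibration values might look like the following (names illustrative):

```python
import numpy as np

def intrinsic_matrix(fx, fy, x0, y0, s=0.0):
    """Internal parameter matrix K: focal lengths fx, fy, principal
    point (x0, y0), and coordinate-axis skew s (ideally 0)."""
    return np.array([[fx,  s, x0],
                     [0., fy, y0],
                     [0., 0., 1.]])
```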
From the obtained effective matching pairs KPM_q, the fundamental matrix F_p between the p-th image and the (p+1)-th image is calculated by equation (16):
X'ᵀ·F_p·X = 0 (16)
wherein X = K⁻¹·(x, y, 1)ᵀ and X' = K⁻¹·(x', y', 1)ᵀ are the positions of the two feature points of an effective matching pair KPM_q, with the coordinate origin located at the image center.
From the obtained fundamental matrix F_p, the extrinsic rotation matrix R_p and translation matrix T_p of the camera between the p-th image and the (p+1)-th image are calculated by equation (18):
F_p = C⁻ᵀ·E_p·C⁻¹, E_p = T_p × R_p (18)
wherein C is the self-defined position of the camera center in the world coordinate system, often taken as the camera coordinate system of the p-th image, and E_p is the essential matrix of the camera between the p-th image and the (p+1)-th image; singular value decomposition (SVD) of the essential matrix E_p then yields the extrinsic rotation matrix R_p and translation matrix T_p of the camera between the p-th image and the (p+1)-th image.
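One way to realize this step with OpenCV, estimating E_p from the matched points and letting recoverPose perform the SVD-based decomposition internally (a sketch assuming pixel-coordinate inputs):

```python
import numpy as np
import cv2

def relative_pose(pts_p, pts_p1, K):
    """Estimate the essential matrix E_p from effective matching pairs
    and decompose it into the extrinsic rotation R_p and translation
    T_p between image p and image p+1."""
    pts1 = np.float32(pts_p)
    pts2 = np.float32(pts_p1)
    E, _ = cv2.findEssentialMat(pts1, pts2, K, method=cv2.RANSAC)
    _, R_p, T_p, _ = cv2.recoverPose(E, pts1, pts2, K)
    return R_p, T_p
```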
6. For each effective matching pair KPM_q between the p-th image and the (p+1)-th image, the position X^p_q of its point cloud data is calculated by formula (19) from the rotation matrix R_p and the translation matrix T_p. All effective matching pairs KPM_q are computed in this way and recorded as the point cloud data set X_p (p = 1, 2, 3, ...). The obtained point cloud data sets X_p are transformed into the same coordinate system, and repeated data points are eliminated to complete the generation of the target object three-dimensional point cloud.
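Formula (19) is not reproduced in the source text, so the sketch below substitutes standard linear triangulation (P1 = K[I|0], P2 = K[R_p|T_p]) as a stand-in, followed by the coordinate unification and duplicate removal of step 6; the rounding tolerance is an assumption:

```python
import numpy as np
import cv2

def triangulate_pairs(pts_p, pts_p1, K, R_p, T_p):
    """Back-project effective matching pairs into 3D points X_p using
    standard linear triangulation (stand-in for formula (19))."""
    P1 = K @ np.hstack([np.eye(3), np.zeros((3, 1))])
    P2 = K @ np.hstack([R_p, T_p.reshape(3, 1)])
    pts4 = cv2.triangulatePoints(P1, P2,
                                 np.float32(pts_p).T,
                                 np.float32(pts_p1).T)
    return (pts4[:3] / pts4[3]).T      # homogeneous -> Euclidean

def merge_point_clouds(clouds, decimals=4):
    """Stack per-pair clouds (already in one coordinate system) and
    eliminate repeated data points by rounding coordinates."""
    merged = np.vstack(clouds)
    return np.unique(np.round(merged, decimals=decimals), axis=0)
```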
In summary: the invention discloses a method for generating a three-dimensional point cloud of a target object from an image sequence. Starting from segmented images, the segmented target image sequence undergoes feature point detection, extraction and matching, and removal of mismatched pairs to obtain effective matching pairs; three-dimensional point cloud reconstruction is then performed on the effective matching pairs to obtain a target object three-dimensional point cloud model of higher quality and precision. The method effectively improves the efficiency of the three-dimensional modeling process, improves the quality of the three-dimensional point cloud model, and reduces the probability of processing errors at each stage. It is suitable for intelligent robot development and has important engineering value and theoretical guidance significance for the operating efficiency of intelligent robots, especially underwater robots.

Claims (9)

1. A target object three-dimensional point cloud generating method based on an image sequence is characterized by comprising the following steps:
acquiring a source image sequence, and calculating a gray level image sequence of the source image sequence;
performing threshold segmentation on each image in the gray image sequence, determining an optimal segmentation threshold, segmenting the gray image sequence according to the optimal segmentation threshold, reserving pixels larger than the optimal segmentation threshold as target object pixels to obtain a target gray image sequence, and correspondingly reserving the target object pixels in the source image sequence to obtain the target image sequence;
detecting feature points of the target image sequence, extracting and storing them, and calculating the modulus and direction of each feature point;
forming effective quasi-matching pairs in the target image sequence, wherein the modulus values and directions of the two feature points in an effective quasi-matching pair are equal; recording KPM'_q as the q-th effective quasi-matching pair of the p-th image and the (p+1)-th image in the target image sequence; deleting the mismatched pairs from the effective quasi-matching pairs to obtain effective matching pairs;
calculating a fundamental matrix F_p between the p-th image and the (p+1)-th image from the obtained effective matching pairs; acquiring the internal parameter matrix K of the camera; computing the essential matrix E_p from the fundamental matrix F_p; performing singular value decomposition on E_p to obtain the camera extrinsic rotation matrix R_p and translation matrix T_p between the p-th image and the (p+1)-th image;
according to the rotation matrix R_p and translation matrix T_p, calculating the position X^p_q of the point cloud data of each effective matching pair between the p-th image and the (p+1)-th image, wherein X' = (x', y', 1) is the position, in the (p+1)-th image, of the feature point of the valid matching pair;
and calculating all the effective matching pairs to form a point cloud data set, transforming the obtained point cloud data set to the same coordinate system, and eliminating repeated data points to generate the three-dimensional point cloud of the target object.
2. The method for generating a three-dimensional point cloud of a target object based on an image sequence according to claim 1, wherein the calculation method of the gray-scale image sequence of the source image sequence is as follows:
performing gray value calculation on each pixel point of each image in the source image sequence:
Gray=0.299R+0.587G+0.114B (2)
wherein Gray is the gray value and R, G, B are the red, green, and blue channel values of the pixel, respectively;
the gray value of each pixel is stored in the corresponding pixel of a new image sequence, and the new image sequence is taken as the gray image sequence.
3. The method for generating a three-dimensional point cloud of a target object based on an image sequence according to claim 1 or 2, wherein the optimal segmentation threshold is determined by:
(a1) counting, in the gray image sequence, the number of pixels at each gray level from 0 to 255, with μ(t) (t = 0, 1, 2, ..., 255) being the number of pixels with gray value t;
(a2) calculating the mean gray value μ of the image:
μ = Σ_{t=0}^{255} t·p[μ(t)]
wherein p[μ(t)] is the probability function of the number of pixels μ(t) with gray value t;
(a3) calculating the average gray level μ1 of the target object pixels C0 to be segmented in the image:
μ1 = Σ_{i=0}^{t} i·Pr(i|C0)
wherein Pr(i|C0) is the probability that pixels with gray value i account for the segmented target image pixels C0, and P_i is the probability that pixels with gray value i account for the total number of pixels;
(a4) calculating the proportion ω1 of the target image pixels C0 in the image:
ω1 = N1/Sum
wherein N1 is the number of pixels of the target image C0 and Sum is the total number of image pixels;
(a5) calculating the average gray level μ2 of the background image pixels C1 in the image:
μ2 = Σ_{i=t+1}^{255} i·Pr(i|C1)
wherein Pr(i|C1) is the probability that pixels with gray value i account for the background image pixels C1;
(a6) calculating the proportion ω2 of the background image pixels C1 in the image:
ω2 = N2/Sum
wherein N2 is the number of pixels of the background image C1;
(a7) calculating the optimal segmentation threshold t_best, which is the value of t that maximizes the between-class variance g(t):
g(t) = ω1^(α-2)·(ω1·μ - μ(t))² + ω2^(α-2)·(μ·(ω2 - 1) + μ(t))²
wherein α ∈ (0, 1) is adjusted according to each image's segmentation binarization result.
4. The method for generating a three-dimensional point cloud of a target object based on an image sequence according to claim 1 or 2, wherein the method for detecting the feature points comprises:
each image pixel value f(x, y) in the target grayscale image sequence is processed as follows:
(b1) the image pixel values f(x, y) are filtered with Gaussian filters G_σ under different Gaussian smoothing parameters σ:
g_i(x, y) = G_{σi}(x, y) * f(x, y), with G_σ(x, y) = (1/(2πσ²))·exp(-(x² + y²)/(2σ²))
(b2) the DoG response image DoGf of the image pixel values f(x, y) is calculated:
DoGf(x, y) = DoG * f(x, y) = g1(x, y) - g2(x, y) (10)
(b3) steps (b1) and (b2) are repeated to obtain DoGf1(x, y), DoGf2(x, y), DoGf3(x, y);
all values of DoGf2(x, y) are examined: if a value DoGf2(x, y) is the maximum or minimum among DoGf_i(x + n, y + m) (i = 1, 2, 3; n = -1, 0, 1; m = -1, 0, 1), the pixel is marked as a feature point, recorded as KP^m_k, the k-th feature point of the m-th image in the target image sequence, located at (x, y).
5. The method for generating a three-dimensional point cloud of a target object based on an image sequence according to claim 4, wherein the modulus of a feature point is M, calculated as:
M(x, y) = √[(L(x+1, y) - L(x-1, y))² + (L(x, y+1) - L(x, y-1))²] (11)
wherein L is taken at the scale of the feature point KP^m_k:
L(x, y, σ) = G(x, y, σ) * I(x, y) (12)
where σ is an autonomously selected parameter held constant for all feature points.
6. The method for generating a three-dimensional point cloud of a target object based on an image sequence according to claim 5, wherein the direction of the feature point is θ, and the calculation method comprises:
θ(x, y) = arctan((L(x, y+1) - L(x, y-1)) / (L(x+1, y) - L(x-1, y))) (13).
7. The method for generating a three-dimensional point cloud of a target object based on an image sequence according to claim 1 or 2, wherein the mismatched pairs are deleted as follows:
(c1) from the effective quasi-matching pairs KPM'_q, the first M non-collinear sample data are selected, with M ≥ 4, and the optimal homography matrix is calculated, recorded as model M:
s·(x', y', 1)ᵀ = H·(x, y, 1)ᵀ, H = [h11 h12 h13; h21 h22 h23; h31 h32 h33]
wherein (x, y) and (x', y') are the positions of the two feature points of an effective quasi-matching pair KPM'_q, s is a scale parameter, and h33 = 1 is usually taken to normalize the matrix;
(c2) based on model M, the projection error e of all data in the effective quasi-matching pairs KPM'_q is calculated:
e = √[(x' - (h11·x + h12·y + h13)/(h31·x + h32·y + h33))² + (y' - (h21·x + h22·y + h23)/(h31·x + h32·y + h33))²]
if e < 0.75, the data are retained; otherwise they are removed;
(c3) let KPM'_best be the optimal effective matching pair, with initial value KPM'_best = KPM'_1; if the current effective quasi-matching pair KPM'_q has more elements than KPM'_best, then KPM'_best = KPM'_q, and the iteration count kk is updated at the same time:
kk = log(1 - p)/log(1 - w^M)
wherein p is the confidence, taken as 0.995, and w = (number of elements of KPM'_best)/(number of elements of KPM'_q); for the initial iteration count kk0, the number of elements of KPM'_best is taken as 1;
(c4) if the number of iterations exceeds kk, the loop is exited and the effective matching pair KPM_q = KPM'_best is recorded; otherwise, the iteration count is incremented by 1 and step (c3) is repeated.
8. The method of generating a three-dimensional point cloud of a target object based on an image sequence according to claim 1 or 2, wherein the internal parameter matrix of the camera is:
K = [fx s x0; 0 fy y0; 0 0 1]
wherein fx, fy are the focal lengths, (x0, y0) are the principal point coordinates, and s is the coordinate-axis skew parameter.
9. The method for generating a three-dimensional point cloud of a target object based on an image sequence according to claim 1 or 2, wherein the fundamental matrix F_p between the p-th image and the (p+1)-th image is calculated from:
X'ᵀ·F_p·X = 0
wherein X = K⁻¹·(x, y, 1)ᵀ and X' = K⁻¹·(x', y', 1)ᵀ are the positions of the two feature points of an effective matching pair KPM_q, with the coordinate origin located at the image center.
CN202110225992.8A 2021-03-01 2021-03-01 Target object three-dimensional point cloud generation method based on image sequence Active CN112950787B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110225992.8A CN112950787B (en) 2021-03-01 2021-03-01 Target object three-dimensional point cloud generation method based on image sequence


Publications (2)

Publication Number Publication Date
CN112950787A 2021-06-11
CN112950787B (en) 2022-08-02

Family

ID=76246962

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110225992.8A Active CN112950787B (en) 2021-03-01 2021-03-01 Target object three-dimensional point cloud generation method based on image sequence

Country Status (1)

Country Link
CN (1) CN112950787B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116071520A (en) * 2023-03-31 2023-05-05 湖南省水务规划设计院有限公司 Digital twin water affair simulation test method


Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106157373A (en) * 2016-07-27 2016-11-23 中测高科(北京)测绘工程技术有限责任公司 A kind of construction three-dimensional model building method and system
CN110517348A (en) * 2019-07-23 2019-11-29 西安电子科技大学 Target object three-dimensional point cloud method for reconstructing based on display foreground segmentation

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
ZHU HAITAO et al.: "A feature-based Section Curve Reconstructing Strategy", Key Engineering Materials *
ZHU Binglin et al.: "Quantitative three-dimensional study of field plant growth dynamics based on machine vision", Transactions of the Chinese Society for Agricultural Machinery *
WANG Hui et al.: "Simulation research on flower color image matching in urban garden construction", Computer Simulation *
WANG Huan et al.: "Research on a 3D point cloud registration method using Kinect depth information", Computer Engineering and Applications *


Also Published As

Publication number Publication date
CN112950787B (en) 2022-08-02


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
EE01 Entry into force of recordation of patent licensing contract
    Application publication date: 20210611
    Assignee: Enshobo (Guangzhou) Technology Co.,Ltd.
    Assignor: HARBIN ENGINEERING University
    Contract record no.: X2023980048395
    Denomination of invention: A 3D point cloud generation method for target objects based on image sequences
    Granted publication date: 20220802
    License type: Common License
    Record date: 20231127