CN110044374B - Image feature-based monocular vision mileage measurement method and odometer - Google Patents
- Publication number: CN110044374B
- Application number: CN201810044762.XA
- Authority: CN (China)
- Prior art keywords: image, camera, coordinate system, point
- Legal status: Active
Classifications
- G—PHYSICS
- G01—MEASURING; TESTING
- G01C—MEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
- G01C22/00—Measuring distance traversed on the ground by vehicles, persons, animals or other moving solid bodies, e.g. using odometers, using pedometers
Abstract
The invention provides an image-feature-based monocular vision mileage measurement method and odometer. The method comprises the following steps: (1) calibrating the camera; (2) computing 2D feature points for two adjacent frames taken along the direction of travel; (3) matching the 2D feature points to find the corresponding feature points in the two frames; (4) computing the 3D coordinates of the corresponding feature points and estimating the camera pose from the 3D and 2D coordinates to obtain the relative displacement of the camera; (5) performing the same operations on subsequent frames and finally accumulating all displacements to obtain the mileage. Compared with binocular-vision methods, measuring mileage with monocular vision requires simpler equipment and reduces cost; compared with methods based on SIFT or Harris corners, the method computes image features faster, offers rotation and scale invariance, and supports real-time processing.
Description
Technical Field
The invention relates to the technical field of image processing, in particular to an image-feature-based monocular vision mileage measurement method and odometer.
Background
During subway operation, apparent defects of the tunnel structure, which is mainly built of concrete, such as water leakage, cracks, spalling and deformation of the tunnel cross-section, are unavoidable, and their long-term development causes irreversible harm to tunnel safety. Maintaining the tunnel structure during subway operation is therefore a necessary means of ensuring the tunnel's long-term safe operation. The positioning of the sensors during inspection directly affects the validity of the collected detection data. At present, the positions of the sensors used to detect most subway tunnel defects are set in advance, so for different tunnel cross-section environments the validity of the data cannot be improved, nor the difficulty of software analysis reduced, by adjusting the sensor positions. In recent years, the rapid development of computer technology, automatic control theory, embedded development, chip design and sensor technology has made automated tunnel-defect detection possible: scene images or image sequences are extracted in real time from a moving inspection vehicle and processed, effective features of the inspected target are extracted, real-time pose information of the spatial target is acquired, and support is provided for locating subsequent tunnel-defect images by position and mileage. However, owing to its technical limitations, it is very difficult for a monocular camera to obtain three-dimensional coordinate information of a moving object. Monocular vision has three ways of generating depth. One is perspective geometry, in which the reference vanishing point is formed by the displacement of the target; this requires limiting conditions such as a fixed camera, a fixed background and a constant speed of the subject, under which the faster the target moves, the closer it is to the camera.
Another works through focal length, measuring the blurring of the same scene shot at different focal lengths; this method does not work well over the whole image, but the values it produces are relatively accurate. Binocular vision relies on the parallax effect, which is the main cause of our ability to form a three-dimensional impression. At present, monocular vision also generates three-dimensional depth information mainly by finding a reference object and exploiting a parallax effect. Monocular pose estimation is a three-dimensional scene-structure problem: the triangular geometric relation between corresponding feature points must be formed by inter-frame movement. Once the triangle geometry is established, the pose and the three-dimensional coordinates of the feature points are solved simultaneously, which is a classic three-dimensional scene-structure problem, so no chicken-and-egg problem arises. Many solutions to the scene-structure problem exist; in the simplest, the essential matrix is estimated and then decomposed to obtain the rotation R and displacement T of the camera. In binocular stereo vision, the three-dimensional coordinates of the feature points can be triangulated directly because the baseline is fixed and known; the inter-frame motion then reduces to fitting motion parameters between two sets of three-dimensional points. The disadvantage of the binocular approach is that the baseline is fixed and, owing to the size limits of the carrier, generally not very wide, so the accuracy of the triangulated reconstruction is generally not very high.
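The essential-matrix route mentioned above (estimate E between two frames, then decompose it into R and T) can be sketched with NumPy. The rotation angle and translation below are arbitrary illustrative values, and E is built directly from them rather than estimated from matched points:

```python
import numpy as np

def skew(v):
    # cross-product matrix: skew(v) @ x == np.cross(v, x)
    return np.array([[0, -v[2], v[1]],
                     [v[2], 0, -v[0]],
                     [-v[1], v[0], 0]])

# illustrative ground-truth inter-frame motion (rotation about z, unit translation)
th = 0.3
R_true = np.array([[np.cos(th), -np.sin(th), 0.0],
                   [np.sin(th),  np.cos(th), 0.0],
                   [0.0, 0.0, 1.0]])
t_true = np.array([1.0, 0.2, 0.1])
t_true /= np.linalg.norm(t_true)

E = skew(t_true) @ R_true  # essential matrix E = [t]x R

# decompose E: R is one of U W V^T, U W^T V^T; t is +/- the last column of U
U, S, Vt = np.linalg.svd(E)
W = np.array([[0.0, -1.0, 0.0], [1.0, 0.0, 0.0], [0.0, 0.0, 1.0]])
candidates = []
for Rc in (U @ W @ Vt, U @ W.T @ Vt):
    if np.linalg.det(Rc) < 0:  # keep proper rotations (det = +1)
        Rc = -Rc
    candidates.append(Rc)
t_est = U[:, 2]

err_R = min(np.linalg.norm(Rc - R_true) for Rc in candidates)
err_t = abs(abs(t_est @ t_true) - 1.0)  # translation direction matches up to sign
```

The usual four-fold ambiguity (two rotations, two translation signs) is visible here; in practice the physically valid solution is chosen by requiring the triangulated points to lie in front of both cameras.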
Therefore, there is a need for a monocular-vision mileage calculation method that, compared with methods based on SIFT or Harris corners, computes image features faster, offers rotation and scale invariance, and can process images in real time.
Disclosure of Invention
The technical problem the invention aims to solve is to provide a monocular vision mileage measurement method with simple equipment and low cost; compared with methods based on SIFT or Harris corners, the image-feature-based monocular vision mileage measurement method computes image features faster, offers rotation and scale invariance, and supports real-time processing.
To solve the above technical problem, the invention adopts the following technical scheme: an image-feature-based monocular vision mileage measurement method, specifically comprising the following steps:
(1) Calibrating a camera to obtain internal and external parameters of the camera;
(2) Calculating 2D characteristic points of two adjacent frames of images in the forward direction;
(3) Matching the 2D feature points to find out corresponding feature points in the two frames of images;
(4) Calculating the 3D coordinates of the corresponding characteristic points in the two frames of images, and calculating the camera pose according to the 3D coordinates and the 2D coordinates of the corresponding characteristic points to obtain the relative displacement of the camera;
(5) Repeating steps (2) to (4) in turn for each subsequent frame, calculating the displacement of the camera relative to the previous frame when each frame is shot, and finally accumulating all displacements to obtain the mileage.
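The five steps above can be sketched as a loop that accumulates per-frame displacement. Here `frame_displacement` is a hypothetical stand-in for steps (2) to (4), returning a fixed displacement so that the accumulation of step (5) is visible:

```python
import numpy as np

def frame_displacement(prev_frame, curr_frame):
    # Hypothetical stand-in for steps (2)-(4): feature extraction, matching
    # and pose estimation; here it simply returns a fixed forward motion.
    return np.array([0.0, 0.0, 0.5])  # metres along the direction of travel

frames = [f"frame_{i}" for i in range(5)]  # placeholder frame sequence
mileage = 0.0
for prev, curr in zip(frames, frames[1:]):  # step (5): accumulate displacements
    mileage += np.linalg.norm(frame_displacement(prev, curr))
```

With five frames there are four frame pairs, so the accumulated mileage is four times the per-pair displacement norm.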
With this technical scheme, the mileage is measured by monocular vision; compared with a binocular-vision method, the equipment is simple and the cost low. Compared with methods based on SIFT or Harris corners, image features are computed faster, with rotation and scale invariance and real-time processing.
The invention is further improved in that the step (1) comprises the following steps:
1-1, obtaining a conversion relation among an image coordinate system, a camera coordinate system and a world coordinate system according to a pinhole imaging model;
1-2, shooting a plurality of checkerboard calibration plates under different visual angles, extracting angular points on images of the calibration plates, and obtaining pixel coordinates and physical coordinates of the angular points according to the checkerboard size so as to obtain a homography matrix H of all images of the calibration plates;
1-3, solving internal and external parameters;
and 1-4, solving the minimized projection error through a Levenberg-Marquardt algorithm, and optimizing the internal and external parameters of the camera.
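Step 1-4 minimizes the reprojection error with the Levenberg-Marquardt algorithm. A minimal single-parameter sketch (refining only a focal length f against synthetic observations; the point coordinates and intrinsics are arbitrary illustrative values, not from the patent) shows the damped update and the accept/reject rule:

```python
import numpy as np

# synthetic scene and pixel observations generated at the true focal length
X = np.array([[0.2, 0.1, 2.0], [0.4, -0.3, 2.5], [-0.1, 0.2, 3.0], [0.3, 0.3, 2.2]])
f_true, u0, v0 = 800.0, 320.0, 240.0

def project(f):
    u = f * X[:, 0] / X[:, 2] + u0
    v = f * X[:, 1] / X[:, 2] + v0
    return np.stack([u, v], axis=1)

obs = project(f_true)

def residual(f):
    return (project(f) - obs).ravel()

f, lam = 500.0, 1e-3  # poor initial guess, LM damping factor
for _ in range(50):
    r = residual(f)
    # analytic Jacobian of the residual w.r.t. f: du/df = X/Z, dv/df = Y/Z
    J = np.stack([X[:, 0] / X[:, 2], X[:, 1] / X[:, 2]], axis=1).ravel()
    step = -(J @ r) / (J @ J + lam)  # damped Gauss-Newton step
    if np.sum(residual(f + step) ** 2) < np.sum(r ** 2):
        f, lam = f + step, lam * 0.5  # accept: trust the quadratic model more
    else:
        lam *= 2.0                    # reject: increase damping
```

A real calibration refines all intrinsic and extrinsic parameters jointly; the damping logic is the same, only the parameter vector and Jacobian grow.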
Preferably, the 2D feature points computed in step (2) are the ORB feature points of the two adjacent frames, obtained as follows: construct an image pyramid; extract key points on each layer with the FAST algorithm; select point pairs around each key point according to the BRIEF scheme and generate a descriptor by comparing pixel values; adjust the descriptor by the angle between the key point and its grey-level centroid so that the descriptor is rotation invariant; this finally yields the ORB descriptor.
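The steered-BRIEF part of the ORB description above can be roughly sketched as follows (random sampling pairs, orientation from the grey-level centroid; the 31x31 window and 256 pairs follow common ORB defaults but are otherwise illustrative, and the pyramid and FAST detection steps are omitted):

```python
import numpy as np

rng = np.random.default_rng(0)
# 256 sampling-point pairs of (dy, dx) offsets inside a 31x31 window, as in BRIEF
pairs = rng.integers(-15, 16, size=(256, 2, 2))

def centroid_angle(patch):
    # orientation of the patch from its grey-level centroid (the "steering" angle)
    ys, xs = np.mgrid[-15:16, -15:16]
    m10 = float((xs * patch).sum())
    m01 = float((ys * patch).sum())
    return np.arctan2(m01, m10)

def steered_brief(image, keypoint, pairs):
    y, x = keypoint
    patch = image[y - 15:y + 16, x - 15:x + 16].astype(float)
    theta = centroid_angle(patch)
    c, s = np.cos(theta), np.sin(theta)
    rot = np.array([[c, -s], [s, c]])
    desc = np.zeros(256, dtype=np.uint8)
    for i, (p, q) in enumerate(pairs):
        # rotate each offset pair by the patch orientation, then compare pixels
        pr = np.clip((rot @ p).round().astype(int), -15, 15)
        qr = np.clip((rot @ q).round().astype(int), -15, 15)
        desc[i] = image[y + pr[0], x + pr[1]] < image[y + qr[0], x + qr[1]]
    return desc

image = rng.integers(0, 256, size=(64, 64))
desc = steered_brief(image, (32, 32), pairs)
```

Rotating the sampling pattern by the centroid angle, rather than rotating the image, is what gives the binary descriptor its approximate rotation invariance.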
The 2D feature point matching in the step (3) specifically comprises the following steps:
3-1. Build a k-d tree for the feature point set of the image: select the dimension k with the largest variance in the data set; choose the feature point whose value in dimension k is the median m as the splitting node; points whose value in dimension k is less than m go to the left subspace and points whose value is greater than m go to the right subspace; repeat this operation on each subspace until no further split is possible, yielding the k-d tree;
3-2. Search for feature matches with the BBF (best-bin-first) algorithm: perform a binary search from the root of the k-d tree, ordering the nodes on the query path by their distance to the query point; during backtracking, start from the tree node with the highest priority, and when all nodes have been examined or the running-time limit is exceeded, take the closest point found as the nearest-neighbour matching feature point.
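Steps 3-1 and 3-2 can be sketched as follows. This builds the max-variance/median-split k-d tree of step 3-1 and, for brevity, uses an exact backtracking search rather than the time-bounded BBF priority queue, checking the answer against brute force:

```python
import numpy as np

def build_kdtree(points):
    # points: (n, d) array of feature descriptors/vectors
    if len(points) == 0:
        return None
    k = int(np.argmax(points.var(axis=0)))  # dimension with the largest variance
    order = np.argsort(points[:, k])
    mid = len(points) // 2                  # the median element splits the set
    return {
        "point": points[order[mid]],
        "axis": k,
        "left": build_kdtree(points[order[:mid]]),
        "right": build_kdtree(points[order[mid + 1:]]),
    }

def nearest(node, q, best=None):
    # exact depth-first search; BBF would instead pop nodes from a bounded
    # priority queue ordered by distance to the splitting planes
    if node is None:
        return best
    d = np.linalg.norm(node["point"] - q)
    if best is None or d < best[0]:
        best = (d, node["point"])
    diff = q[node["axis"]] - node["point"][node["axis"]]
    near, far = (node["left"], node["right"]) if diff < 0 else (node["right"], node["left"])
    best = nearest(near, q, best)
    if abs(diff) < best[0]:  # the search hypersphere crosses the split plane
        best = nearest(far, q, best)
    return best

rng = np.random.default_rng(1)
pts = rng.normal(size=(200, 8))   # 200 synthetic 8-dimensional descriptors
tree = build_kdtree(pts)
q = rng.normal(size=8)
d_tree, p_tree = nearest(tree, q)
d_brute = np.linalg.norm(pts - q, axis=1).min()
```

BBF trades the exactness of this backtracking for speed by capping how many leaves are examined, which is why the patent text speaks of a running-time limit and of "the best result found so far".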
The step (4) specifically comprises the following steps:
4-1. Using the pixel size and the physical size of the image, establish a coordinate system with the top-left corner of the image as the origin and the imaged area as a plane, and obtain the 3D coordinates of the feature points;
4-2. From the camera intrinsics, the 3D coordinates of the feature points in the previous frame and the 2D coordinates of the corresponding feature points in the next frame, compute the pose of the camera when it shoots the next frame via the coordinate transformation relation, and hence the displacement of the camera between the two shooting positions.
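A simplified sketch of step 4-2: with the intrinsics and the rotation assumed known, each 3D-2D correspondence gives two equations linear in the translation t, which least squares then recovers (all numeric values are synthetic illustrations; a full solver would also estimate the rotation, e.g. by PnP):

```python
import numpy as np

# illustrative intrinsics
fx = fy = 800.0
cx, cy = 320.0, 240.0
A = np.array([[fx, 0.0, cx], [0.0, fy, cy], [0.0, 0.0, 1.0]])

# synthetic 3D points and a known relative motion (rotation about z + translation)
rng = np.random.default_rng(2)
X = rng.uniform(-1.0, 1.0, size=(20, 3))
X[:, 2] += 5.0  # keep the points in front of the camera
th = 0.1
R = np.array([[np.cos(th), -np.sin(th), 0.0],
              [np.sin(th),  np.cos(th), 0.0],
              [0.0, 0.0, 1.0]])
t_true = np.array([0.3, -0.1, 0.5])

# project the points into the "next" frame to get their 2D pixel coordinates
P = (R @ X.T).T + t_true
uv = (A @ P.T).T
uv = uv[:, :2] / uv[:, 2:3]

# with R known, each correspondence gives two equations linear in t:
#   fx*tx - (u-cx)*tz = (u-cx)*Yz - fx*Yx,  fy*ty - (v-cy)*tz = (v-cy)*Yz - fy*Yy
Y = (R @ X.T).T
M, rhs = [], []
for (u, v_), y in zip(uv, Y):
    M.append([fx, 0.0, -(u - cx)]); rhs.append((u - cx) * y[2] - fx * y[0])
    M.append([0.0, fy, -(v_ - cy)]); rhs.append((v_ - cy) * y[2] - fy * y[1])
t_est, *_ = np.linalg.lstsq(np.array(M), np.array(rhs), rcond=None)
displacement = np.linalg.norm(t_est)  # camera displacement between the two frames
```

The norm of the recovered translation is exactly the per-frame displacement that step (5) of the method accumulates into the mileage.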
The conversion relation among the image coordinate system, the camera coordinate system and the world coordinate system in the step 1-1 is specifically as follows:
(1.11) In the world coordinate system a point has coordinates $[X_w, Y_w, Z_w]^T$; in the camera coordinate system its coordinates are $[X_c, Y_c, Z_c]^T$. The two are related by a rotation and a translation, where $R$ is the rotation matrix and $T$ is the displacement between the two coordinate origins:

$[X_c, Y_c, Z_c]^T = R\,[X_w, Y_w, Z_w]^T + T$.  (1)

(1.12) After the point is imaged by the camera, its coordinates in the image physical-size coordinate system are $[x, y]$; by similar triangles, with $f$ the focal length of the camera,

$x = f X_c / Z_c$, $y = f Y_c / Z_c$.  (2)

(1.13) The point's coordinates in the image pixel coordinate system are $[u, v]$, where $(u_0, v_0)$ is the image centre in pixels, $d_x$ is the physical size of a pixel along the x-axis and $d_y$ along the y-axis; the correspondence is

$u = x/d_x + u_0$, $v = y/d_y + v_0$,  (3)

i.e., in matrix form,

$[u, v, 1]^T = \begin{bmatrix} 1/d_x & 0 & u_0 \\ 0 & 1/d_y & v_0 \\ 0 & 0 & 1 \end{bmatrix} [x, y, 1]^T$.  (4)

(1.15) Adding the skewness parameter $C$ and combining (1), (2) and (4), the projection finally becomes

$Z_c\,[u, v, 1]^T = A\,[R \mid T]\,[X_w, Y_w, Z_w, 1]^T$, with $A = \begin{bmatrix} f/d_x & C & u_0 \\ 0 & f/d_y & v_0 \\ 0 & 0 & 1 \end{bmatrix}$.  (5)

(1.16) Since the checkerboard is a plane, set $Z_w = 0$. With $r_1, r_2, r_3$ the column vectors of $R$ and $t$ the translation column vector, equation (5) can be written

$Z_c\,[u, v, 1]^T = A\,[r_1\ r_2\ t]\,[X_w, Y_w, 1]^T$.  (6)
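Equation (6) can be checked numerically: projecting a point on the $Z_w = 0$ plane through the full model $A(RX + t)$ and through $H = A[r_1\ r_2\ t]$ must give the same pixel (the intrinsics and pose below are arbitrary illustrative values):

```python
import numpy as np

A = np.array([[800.0, 0.0, 320.0], [0.0, 800.0, 240.0], [0.0, 0.0, 1.0]])
th = 0.2  # illustrative rotation about the y-axis
R = np.array([[np.cos(th), 0.0, np.sin(th)],
              [0.0, 1.0, 0.0],
              [-np.sin(th), 0.0, np.cos(th)]])
t = np.array([0.1, 0.2, 2.0])

H = A @ np.column_stack([R[:, 0], R[:, 1], t])  # plane-induced homography

Xw, Yw = 0.5, -0.3  # a checkerboard corner on the Zw = 0 plane
p_full = A @ (R @ np.array([Xw, Yw, 0.0]) + t)  # full projection model
p_homo = H @ np.array([Xw, Yw, 1.0])            # homography of equation (6)
u_full = p_full[:2] / p_full[2]
u_homo = p_homo[:2] / p_homo[2]
```

This is why a single planar target determines one homography per view: the third column of R drops out when $Z_w = 0$.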
To solve the homography matrix H, shoot several checkerboard calibration boards at different viewing angles and extract the corner points on the calibration-board images. Since the checkerboard size is known, the pixel coordinates and physical coordinates of the corner points can be obtained; by the least-squares method, the homography matrix $H = [h_1\ h_2\ h_3]$ of every calibration-board image can be found. According to equation (6), with $\lambda$ a constant,

$[h_1\ h_2\ h_3] = \lambda A\,[r_1\ r_2\ t]$.  (7)

Let $\alpha$, $\beta$, $\gamma$ be the rotation angles about the x-, y- and z-axes respectively; the first two columns of the rotation matrix $R$ are then

$r_1 = [\cos\gamma\cos\beta + \sin\gamma\sin\alpha\sin\beta,\ -\sin\gamma\cos\beta + \cos\gamma\sin\alpha\sin\beta,\ \cos\alpha\sin\beta]^T$,
$r_2 = [\sin\gamma\cos\alpha,\ \cos\gamma\cos\alpha,\ -\sin\alpha]^T$,

which gives $\|r_1\|^2 = (\cos\gamma\cos\beta + \sin\gamma\sin\alpha\sin\beta)^2 + (-\sin\gamma\cos\beta + \cos\gamma\sin\alpha\sin\beta)^2 + (\cos\alpha\sin\beta)^2 = 1$ and $\|r_2\|^2 = (\sin\gamma\cos\alpha)^2 + (\cos\gamma\cos\alpha)^2 + (-\sin\alpha)^2 = 1$, so $\|r_1\| = \|r_2\| = 1$.  (8)

Calculating $r_1 \cdot r_2 = (\cos\gamma\cos\beta + \sin\gamma\sin\alpha\sin\beta)\sin\gamma\cos\alpha + (-\sin\gamma\cos\beta + \cos\gamma\sin\alpha\sin\beta)\cos\gamma\cos\alpha + \cos\alpha\sin\beta\,(-\sin\alpha) = 0$.  (9)

Since $h_1 = \lambda A r_1$ and $h_2 = \lambda A r_2$, equations (7), (8) and (9) give

$h_1^T A^{-T} A^{-1} h_2 = 0$,  (10)

$h_1^T A^{-T} A^{-1} h_1 = h_2^T A^{-T} A^{-1} h_2$.  (11)
Establish a system of equations from (10) and (11) and substitute the homography matrices of all the views obtained above to solve for the intrinsic matrix A. Let

$B = A^{-T} A^{-1}$, $b = [B_{11}, B_{12}, B_{22}, B_{13}, B_{23}, B_{33}]^T$,

and, with $h_i = [h_{i1}, h_{i2}, h_{i3}]^T$,

$v_{ij} = [h_{i1}h_{j1},\ h_{i1}h_{j2} + h_{i2}h_{j1},\ h_{i2}h_{j2},\ h_{i3}h_{j1} + h_{i1}h_{j3},\ h_{i3}h_{j2} + h_{i2}h_{j3},\ h_{i3}h_{j3}]^T$,

so that $h_i^T B h_j = v_{ij}^T b$. Equations (10) and (11) can therefore be written

$\begin{bmatrix} v_{12}^T \\ (v_{11} - v_{22})^T \end{bmatrix} b = 0$.

Substituting the values of all the homography matrices, solve for b, then recover every element of the intrinsic matrix A and the external parameters.
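The two constraints (10) and (11), rewritten through b and $v_{ij}$, can be verified on one synthetic view: with $H = A[r_1\ r_2\ t]$ built from known intrinsics and pose (all values below are illustrative), both $v_{12}^T b$ and $(v_{11} - v_{22})^T b$ vanish:

```python
import numpy as np
from math import cos, sin

# illustrative intrinsics (with skew) and one calibration-board pose
A = np.array([[900.0, 0.4, 310.0],
              [0.0, 880.0, 250.0],
              [0.0, 0.0, 1.0]])
alpha, beta, gamma = 0.2, -0.1, 0.3  # rotations about the x-, y-, z-axes
Rz = np.array([[cos(gamma), -sin(gamma), 0], [sin(gamma), cos(gamma), 0], [0, 0, 1]])
Ry = np.array([[cos(beta), 0, sin(beta)], [0, 1, 0], [-sin(beta), 0, cos(beta)]])
Rx = np.array([[1, 0, 0], [0, cos(alpha), -sin(alpha)], [0, sin(alpha), cos(alpha)]])
R = Rz @ Ry @ Rx
t = np.array([0.2, -0.3, 3.0])

H = A @ np.column_stack([R[:, 0], R[:, 1], t])  # equation (7) with lambda = 1

B = np.linalg.inv(A).T @ np.linalg.inv(A)
b = np.array([B[0, 0], B[0, 1], B[1, 1], B[0, 2], B[1, 2], B[2, 2]])

def v(i, j, H):
    # the v_ij vector built from columns h_i, h_j of H, as in the text
    hi, hj = H[:, i], H[:, j]
    return np.array([hi[0]*hj[0],
                     hi[0]*hj[1] + hi[1]*hj[0],
                     hi[1]*hj[1],
                     hi[2]*hj[0] + hi[0]*hj[2],
                     hi[2]*hj[1] + hi[1]*hj[2],
                     hi[2]*hj[2]])

c1 = v(0, 1, H) @ b                  # constraint (10): h1^T B h2 = 0
c2 = (v(0, 0, H) - v(1, 1, H)) @ b   # constraint (11): h1^T B h1 = h2^T B h2
```

Each view contributes these two equations in the six unknowns of b, which is why at least three views of the checkerboard are needed to determine the intrinsics.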
The invention also provides an image-feature-based monocular vision odometer, which uses the above image-feature-based monocular vision mileage measurement method to calculate the mileage.
Compared with the prior art, the invention has the following beneficial effects: compared with a binocular-vision method, the equipment is simple and the cost low; compared with methods based on SIFT or Harris corners, image features are computed faster, with rotation and scale invariance, and real-time processing is possible.
Drawings
FIG. 1 is a diagram of the pinhole imaging model used by the image-feature-based monocular vision mileage measurement method of the present invention;
FIG. 2 is a schematic diagram of two adjacent frames shot along the direction of travel in the image-feature-based monocular vision mileage measurement method of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be described clearly and completely below with reference to the drawings of the embodiments.
Example 1: the image feature-based monocular vision mileage measuring method is applied to vehicle-mounted tunnel detection equipment, and specifically comprises the following steps:
(1) Firstly, calibrating a camera to obtain parameters of the camera;
(2) Sequentially calculating 2D characteristic points of front and rear frames in the advancing direction of the vehicle;
(3) Matching the 2D characteristic points to find out corresponding characteristic points;
(4) Calculating the 3D coordinates of the feature points, and computing the pose from the 3D and 2D coordinates of the feature points to obtain the relative displacement;
(5) Sequentially adopting the same method for the subsequent measured frames, calculating the displacement of the camera relative to the previous frame when shooting each frame, and finally accumulating all the displacements to obtain the mileage;
as shown in fig. 1, the calibrating the camera by using the checkerboard calibration algorithm in step (1), and the acquiring the camera internal reference specifically includes the following steps:
(1.1) according to the pinhole imaging model, the transformation relation under an image coordinate system, a camera coordinate system and a world coordinate system is as follows:
(1.11) In the world coordinate system a point has coordinates $[X_w, Y_w, Z_w]^T$; in the camera coordinate system its coordinates are $[X_c, Y_c, Z_c]^T$. The two are related by a rotation and a translation, where $R$ is the rotation matrix and $T$ is the displacement between the two coordinate origins:

$[X_c, Y_c, Z_c]^T = R\,[X_w, Y_w, Z_w]^T + T$.  (1)

(1.12) After the point is imaged by the camera, its coordinates in the image physical-size coordinate system are $[x, y]$; by similar triangles, with $f$ the focal length of the camera,

$x = f X_c / Z_c$, $y = f Y_c / Z_c$.  (2)

(1.13) The point's coordinates in the image pixel coordinate system are $[u, v]$, where $(u_0, v_0)$ is the image centre in pixels, $d_x$ is the physical size of a pixel along the x-axis and $d_y$ along the y-axis; the correspondence is

$u = x/d_x + u_0$, $v = y/d_y + v_0$,  (3)

i.e., in matrix form,

$[u, v, 1]^T = \begin{bmatrix} 1/d_x & 0 & u_0 \\ 0 & 1/d_y & v_0 \\ 0 & 0 & 1 \end{bmatrix} [x, y, 1]^T$.  (4)

(1.15) Adding the skewness parameter $C$ and combining (1), (2) and (4), the projection finally becomes

$Z_c\,[u, v, 1]^T = A\,[R \mid T]\,[X_w, Y_w, Z_w, 1]^T$, with $A = \begin{bmatrix} f/d_x & C & u_0 \\ 0 & f/d_y & v_0 \\ 0 & 0 & 1 \end{bmatrix}$.  (5)

(1.16) Since the checkerboard calibration plate is a plane, set $Z_w = 0$. With $r_1, r_2, r_3$ the column vectors of $R$ and $t$ the translation column vector, equation (5) can be written

$Z_c\,[u, v, 1]^T = A\,[r_1\ r_2\ t]\,[X_w, Y_w, 1]^T$.  (6)
(1.2) Solving the homography matrix H: shoot several checkerboard calibration boards at different viewing angles and extract the corner points on the calibration-board images. Since the checkerboard size is known, the pixel coordinates and physical coordinates of the corner points can be obtained; by the least-squares method, the homography matrix H of every calibration-board image can be found.

(1.3) Write $H = [h_1\ h_2\ h_3]$. According to equation (6), with $\lambda$ a constant,

$[h_1\ h_2\ h_3] = \lambda A\,[r_1\ r_2\ t]$.  (7)

Let $\alpha$, $\beta$, $\gamma$ be the rotation angles about the x-, y- and z-axes respectively; the first two columns of the rotation matrix $R$ are then

$r_1 = [\cos\gamma\cos\beta + \sin\gamma\sin\alpha\sin\beta,\ -\sin\gamma\cos\beta + \cos\gamma\sin\alpha\sin\beta,\ \cos\alpha\sin\beta]^T$,
$r_2 = [\sin\gamma\cos\alpha,\ \cos\gamma\cos\alpha,\ -\sin\alpha]^T$,

which gives $\|r_1\|^2 = (\cos\gamma\cos\beta + \sin\gamma\sin\alpha\sin\beta)^2 + (-\sin\gamma\cos\beta + \cos\gamma\sin\alpha\sin\beta)^2 + (\cos\alpha\sin\beta)^2 = 1$ and $\|r_2\|^2 = (\sin\gamma\cos\alpha)^2 + (\cos\gamma\cos\alpha)^2 + (-\sin\alpha)^2 = 1$, so $\|r_1\| = \|r_2\| = 1$.  (8)

Calculating $r_1 \cdot r_2 = (\cos\gamma\cos\beta + \sin\gamma\sin\alpha\sin\beta)\sin\gamma\cos\alpha + (-\sin\gamma\cos\beta + \cos\gamma\sin\alpha\sin\beta)\cos\gamma\cos\alpha + \cos\alpha\sin\beta\,(-\sin\alpha) = 0$.  (9)

Since $h_1 = \lambda A r_1$ and $h_2 = \lambda A r_2$, equations (7), (8) and (9) give

$h_1^T A^{-T} A^{-1} h_2 = 0$,  (10)

$h_1^T A^{-T} A^{-1} h_1 = h_2^T A^{-T} A^{-1} h_2$.  (11)
(1.4) Solving the internal and external parameters: establish a system of equations from (10) and (11) and substitute the homography matrices of all the views obtained in step (1.2) to solve for the intrinsic matrix A. Let

$B = A^{-T} A^{-1}$, $b = [B_{11}, B_{12}, B_{22}, B_{13}, B_{23}, B_{33}]^T$,

and, with $h_i = [h_{i1}, h_{i2}, h_{i3}]^T$,

$v_{ij} = [h_{i1}h_{j1},\ h_{i1}h_{j2} + h_{i2}h_{j1},\ h_{i2}h_{j2},\ h_{i3}h_{j1} + h_{i1}h_{j3},\ h_{i3}h_{j2} + h_{i2}h_{j3},\ h_{i3}h_{j3}]^T$,

so that $h_i^T B h_j = v_{ij}^T b$. Equations (10) and (11) can therefore be written

$\begin{bmatrix} v_{12}^T \\ (v_{11} - v_{22})^T \end{bmatrix} b = 0$.

Substituting the values of all the homography matrices, solve for b, then recover every element of the intrinsic matrix A and the external parameters;
(1.5) Minimize the projection error with the Levenberg-Marquardt algorithm to optimize the internal and external parameters of the camera. In step (2), pixels whose values differ strongly from those of the surrounding area are extracted as key points by the FAST algorithm; point pairs are selected around the key points according to the BRIEF scheme, and descriptors are generated by comparing pixel values.
Example 2: the method for measuring the mileage based on the monocular vision of the image characteristics specifically comprises the following steps:
1) Calibrating the camera according to a checkerboard calibration algorithm to obtain camera internal parameters;
2) Calculating image characteristics of the front frame and the rear frame: firstly, constructing an image pyramid, and extracting pixel points with larger difference values with pixel points in surrounding areas on each layer as key points; selecting point pairs around the key points, and generating descriptors by comparing pixel values; adjusting the descriptor according to an included angle between the key point and the gray scale centroid, so that the descriptor has rotation invariance; finally, obtaining a descriptor of the image characteristics;
3) Matching the feature points of the two frames to obtain corresponding feature points. Build a k-d tree for the feature point set of the image: select the dimension k with the largest variance in the data set; choose the feature point whose value in dimension k is the median m as the splitting node; points whose value in dimension k is less than m go to the left subspace and points whose value is greater than m go to the right subspace; repeat this on each subspace until no further split is possible, yielding the k-d tree. Then search for feature matches with the BBF algorithm: perform a binary search from the root of the k-d tree, ordering the nodes on the query path by their distance to the query point; during backtracking, start from the tree node with the highest priority, and when all nodes have been examined or the running-time limit is exceeded, take the best result found so far as the nearest-neighbour matching feature point;
4) Calculating the 3D coordinates of the matched feature points: according to the pixel size and the physical size of an image, establishing a coordinate system by taking the upper left corner of the image as a coordinate origin and taking the area shot by the image as a plane, and obtaining a 3D coordinate of a characteristic point;
5) Calculating the moving distance of the camera relative to the previous frame when shooting the next frame according to the 3D coordinates and the 2D coordinates of the feature points;
6) And sequentially calculating the poses of the cameras under all the frames to obtain the displacement and obtain the mileage.
As shown in fig. 1, calibrating the camera with the checkerboard calibration algorithm in step 1) and acquiring the camera intrinsics specifically comprises the following steps:
(1.1) according to the pinhole imaging model, the transformation relation under an image coordinate system, a camera coordinate system and a world coordinate system is as follows:
(1.11) In the world coordinate system a point has coordinates $[X_w, Y_w, Z_w]^T$; in the camera coordinate system its coordinates are $[X_c, Y_c, Z_c]^T$. The two are related by a rotation and a translation, where $R$ is the rotation matrix and $T$ is the displacement between the two coordinate origins:

$[X_c, Y_c, Z_c]^T = R\,[X_w, Y_w, Z_w]^T + T$.  (1)

(1.12) After the point is imaged by the camera, its coordinates in the image physical-size coordinate system are $[x, y]$; by similar triangles, with $f$ the focal length of the camera,

$x = f X_c / Z_c$, $y = f Y_c / Z_c$.  (2)

(1.13) The point's coordinates in the image pixel coordinate system are $[u, v]$, where $(u_0, v_0)$ is the image centre in pixels, $d_x$ is the physical size of a pixel along the x-axis and $d_y$ along the y-axis; the correspondence is

$u = x/d_x + u_0$, $v = y/d_y + v_0$,  (3)

i.e., in matrix form,

$[u, v, 1]^T = \begin{bmatrix} 1/d_x & 0 & u_0 \\ 0 & 1/d_y & v_0 \\ 0 & 0 & 1 \end{bmatrix} [x, y, 1]^T$.  (4)

(1.15) Adding the skewness parameter $C$ and combining (1), (2) and (4), the projection finally becomes

$Z_c\,[u, v, 1]^T = A\,[R \mid T]\,[X_w, Y_w, Z_w, 1]^T$, with $A = \begin{bmatrix} f/d_x & C & u_0 \\ 0 & f/d_y & v_0 \\ 0 & 0 & 1 \end{bmatrix}$.  (5)

(1.16) Since the checkerboard calibration plate is a plane, set $Z_w = 0$. With $r_1, r_2, r_3$ the column vectors of $R$ and $t$ the translation column vector, equation (5) can be written

$Z_c\,[u, v, 1]^T = A\,[r_1\ r_2\ t]\,[X_w, Y_w, 1]^T$.  (6)
(1.2) Solving the homography matrix H: shoot several checkerboard calibration boards at different viewing angles and extract the corner points on the calibration-board images. Since the checkerboard size is known, the pixel coordinates and physical coordinates of the corner points can be obtained; by the least-squares method, the homography matrix H of every calibration-board image can be found.

(1.3) Write $H = [h_1\ h_2\ h_3]$. According to equation (6), with $\lambda$ a constant,

$[h_1\ h_2\ h_3] = \lambda A\,[r_1\ r_2\ t]$.  (7)

Let $\alpha$, $\beta$, $\gamma$ be the rotation angles about the x-, y- and z-axes respectively; the first two columns of the rotation matrix $R$ are then

$r_1 = [\cos\gamma\cos\beta + \sin\gamma\sin\alpha\sin\beta,\ -\sin\gamma\cos\beta + \cos\gamma\sin\alpha\sin\beta,\ \cos\alpha\sin\beta]^T$,
$r_2 = [\sin\gamma\cos\alpha,\ \cos\gamma\cos\alpha,\ -\sin\alpha]^T$,

which gives $\|r_1\|^2 = (\cos\gamma\cos\beta + \sin\gamma\sin\alpha\sin\beta)^2 + (-\sin\gamma\cos\beta + \cos\gamma\sin\alpha\sin\beta)^2 + (\cos\alpha\sin\beta)^2 = 1$ and $\|r_2\|^2 = (\sin\gamma\cos\alpha)^2 + (\cos\gamma\cos\alpha)^2 + (-\sin\alpha)^2 = 1$, so $\|r_1\| = \|r_2\| = 1$.  (8)

Calculating $r_1 \cdot r_2 = (\cos\gamma\cos\beta + \sin\gamma\sin\alpha\sin\beta)\sin\gamma\cos\alpha + (-\sin\gamma\cos\beta + \cos\gamma\sin\alpha\sin\beta)\cos\gamma\cos\alpha + \cos\alpha\sin\beta\,(-\sin\alpha) = 0$.  (9)

Since $h_1 = \lambda A r_1$ and $h_2 = \lambda A r_2$, equations (7), (8) and (9) give

$h_1^T A^{-T} A^{-1} h_2 = 0$,  (10)

$h_1^T A^{-T} A^{-1} h_1 = h_2^T A^{-T} A^{-1} h_2$.  (11)
(1.4) Solving the internal and external parameters: establish a system of equations from (10) and (11) and substitute the homography matrices of all the views obtained in step (1.2) to solve for the intrinsic matrix A. Let

$B = A^{-T} A^{-1}$, $b = [B_{11}, B_{12}, B_{22}, B_{13}, B_{23}, B_{33}]^T$,

and, with $h_i = [h_{i1}, h_{i2}, h_{i3}]^T$,

$v_{ij} = [h_{i1}h_{j1},\ h_{i1}h_{j2} + h_{i2}h_{j1},\ h_{i2}h_{j2},\ h_{i3}h_{j1} + h_{i1}h_{j3},\ h_{i3}h_{j2} + h_{i2}h_{j3},\ h_{i3}h_{j3}]^T$,

so that $h_i^T B h_j = v_{ij}^T b$. Equations (10) and (11) can therefore be written

$\begin{bmatrix} v_{12}^T \\ (v_{11} - v_{22})^T \end{bmatrix} b = 0$.

Substituting the values of all the homography matrices, solve for b, then recover every element of the intrinsic matrix A and the external parameters;
(1.5) Minimize the projection error with the Levenberg-Marquardt algorithm to optimize the internal and external parameters of the camera. In step 2), pixels whose values differ strongly from those of the surrounding area are extracted as key points by the FAST algorithm; point pairs are selected around the key points according to the BRIEF scheme, and descriptors are generated by comparing pixel values. Step 5) specifically comprises: using the camera intrinsics obtained in step 1), the 3D coordinates of the feature points obtained in step 4) and the 2D pixel coordinates of the feature points in the next frame, the pose of the camera when shooting the next frame can be obtained, and its displacement component is the displacement of the camera between the shooting positions of the two frames. Step 6) specifically comprises: repeating steps 2) to 5) in turn for the images shot along the direction of travel and accumulating the camera's displacement components between successive frames to obtain the mileage.
Example 3: the method for measuring the mileage based on the monocular vision of the image characteristics specifically comprises the following steps:
1) Calibrating the camera according to a checkerboard calibration algorithm to obtain camera internal parameters;
2) Calculating image characteristics for the front frame and the rear frame: firstly, constructing an image pyramid, and extracting pixel points with larger difference values with pixel points in surrounding areas on each layer as key points; selecting point pairs around the key points, and generating descriptors by comparing pixel values; adjusting the descriptor according to an included angle between the key point and the gray scale centroid, so that the descriptor has rotation invariance; finally, obtaining a descriptor of the image characteristics;
3) Matching the feature points on the front frame and the rear frame to obtain corresponding feature points: establishing a k-d tree for a feature point set on an image: selecting a dimension k having a maximum variance in the dataset; then selecting a characteristic point with a median value m on a k dimension as a splitting node; dividing the value of the dimension k smaller than m to obtain a left subspace, and dividing the value of the dimension k larger than m to a right subspace; respectively carrying out the operations on the left subspace and the right subspace until the left subspace and the right subspace can not be divided, and obtaining a k-d tree; and (3) performing feature matching search by using a bbf search algorithm: starting from a root node of the k-d tree, performing binary search, and sequencing nodes on the query path according to respective distances from the query points; when backtracking is carried out, starting from a tree node with a high priority, and when all nodes are checked or exceed the running time limit, taking the best result found at present as a nearest neighbor matching feature point;
4) Calculating the 3D coordinates of the matched feature points: according to the pixel size and the physical size of an image, establishing a coordinate system by taking the upper left corner of the image as a coordinate origin and taking the area shot by the image as a plane, and obtaining a 3D coordinate of a characteristic point;
5) Calculating the moving distance of the camera relative to the previous frame when shooting the next frame according to the 3D coordinates and the 2D coordinates of the feature points;
6) And sequentially calculating the poses of the cameras under all the frames to obtain the displacement and obtain the mileage.
As shown in fig. 1, the calibrating the camera by using the checkerboard calibration algorithm in step 1), and the acquiring the camera internal reference specifically includes the following steps:
(1.1) According to the pinhole imaging model, the transformation relations among the image coordinate system, the camera coordinate system and the world coordinate system are as follows:
(1.11) A point with coordinates $[X_w, Y_w, Z_w]^T$ in the world coordinate system has coordinates $[X_c, Y_c, Z_c]^T$ in the camera coordinate system; the two are related by a rotation and a translation, where $R$ is the rotation matrix and $T$ is the displacement between the two coordinate origins:
$$[X_c, Y_c, Z_c]^T = R\,[X_w, Y_w, Z_w]^T + T \qquad (1)$$
(1.12) After the point is imaged by the camera, its coordinates in the image physical-size coordinate system are $[x, y]$; by similar triangles, with $f$ the focal length of the camera,
$$x = f X_c / Z_c, \qquad y = f Y_c / Z_c \qquad (2)$$
(1.13) Let the point be $[u, v]$ in the image pixel-size coordinate system; the relation between the pixel-size and physical-size coordinate systems is
$$u = x / d_x + u_0, \qquad v = y / d_y + v_0 \qquad (3)$$
where $(u_0, v_0)$ is the image pixel center, $d_x$ is the physical size of a pixel along the x axis and $d_y$ that along the y axis. Combining (1)–(3),
$$Z_c\,[u, v, 1]^T = \begin{bmatrix} f/d_x & 0 & u_0 \\ 0 & f/d_y & v_0 \\ 0 & 0 & 1 \end{bmatrix} [R \mid T]\,[X_w, Y_w, Z_w, 1]^T \qquad (4)$$
(1.15) Adding the skewness parameter $C$ finally gives
$$Z_c\,[u, v, 1]^T = \begin{bmatrix} f/d_x & C & u_0 \\ 0 & f/d_y & v_0 \\ 0 & 0 & 1 \end{bmatrix} [R \mid T]\,[X_w, Y_w, Z_w, 1]^T \qquad (5)$$
(1.16) Since the checkerboard calibration plate is planar, set $Z_w = 0$. Let $A$ denote the camera intrinsic matrix, $r_1, r_2, r_3$ the column vectors of $R$, and $t$ the translation column vector; then equation (5) can be written as
$$Z_c\,[u, v, 1]^T = A\,[r_1\ r_2\ t]\,[X_w, Y_w, 1]^T \qquad (6)$$
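As a numerical illustration of the projection chain in equations (1)–(6), the following numpy sketch projects one world point to pixel coordinates; the intrinsic values, rotation and translation are made-up assumptions, not values from the patent:

```python
import numpy as np

# Assumed intrinsics: fx = f/dx, fy = f/dy, zero skew, principal point (u0, v0)
A = np.array([[800.0,   0.0, 320.0],
              [  0.0, 800.0, 240.0],
              [  0.0,   0.0,   1.0]])
R = np.eye(3)                       # camera axes aligned with world axes
T = np.array([0.1, -0.2, 0.0])      # offset between the coordinate origins

def project(Pw):
    Pc = R @ Pw + T                 # eq. (1): world -> camera
    uvw = A @ Pc                    # eqs. (2)-(5): camera -> homogeneous pixel
    return uvw[:2] / uvw[2]         # divide by Z_c to get (u, v)

uv = project(np.array([0.0, 0.0, 2.0]))
```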
(1.2) Solving the homography matrix H: shoot the checkerboard calibration plate from several different viewing angles and extract the corner points on each calibration-plate image; since the checkerboard size is known, the pixel coordinates and physical coordinates of the corner points are obtained, and the homography matrix H of each calibration image is solved by the least-squares method;
(1.3) Write the homography matrix as $H = [h_1\ h_2\ h_3]$. According to formula (6), with $\lambda$ a constant,
$$[h_1\ h_2\ h_3] = \lambda A\,[r_1\ r_2\ t] \qquad (7)$$
Let $\alpha$, $\beta$ and $\gamma$ be the rotation angles about the x, y and z axes respectively, and form the rotation matrix from them. Expanding its first two columns gives
$$\|r_1\|^2 = (\cos\gamma\cos\beta + \sin\gamma\sin\alpha\sin\beta)^2 + (-\sin\gamma\cos\beta + \cos\gamma\sin\alpha\sin\beta)^2 + (\cos\alpha\sin\beta)^2 = 1$$
and
$$\|r_2\|^2 = (\sin\gamma\cos\alpha)^2 + (\cos\gamma\cos\alpha)^2 + (-\sin\alpha)^2 = 1,$$
so
$$\|r_1\| = \|r_2\| = 1 \qquad (8)$$
Calculating the inner product,
$$r_1 \cdot r_2 = (\cos\gamma\cos\beta + \sin\gamma\sin\alpha\sin\beta)(\sin\gamma\cos\alpha) + (-\sin\gamma\cos\beta + \cos\gamma\sin\alpha\sin\beta)(\cos\gamma\cos\alpha) + (\cos\alpha\sin\beta)(-\sin\alpha) = 0 \qquad (9)$$
From equations (7), (8) and (9) it follows that
$$h_1^T A^{-T} A^{-1} h_2 = 0 \qquad (10)$$
$$h_1^T A^{-T} A^{-1} h_1 = h_2^T A^{-T} A^{-1} h_2 \qquad (11)$$
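The unit-norm and orthogonality constraints (8) and (9) hold for the first two columns of any rotation matrix, whatever angles it is built from; a small numpy check (assuming one common Rz·Ry·Rx composition convention):

```python
import numpy as np

def rotation(alpha, beta, gamma):
    """Rotation composed from angles about the x, y and z axes."""
    ca, sa = np.cos(alpha), np.sin(alpha)
    cb, sb = np.cos(beta), np.sin(beta)
    cg, sg = np.cos(gamma), np.sin(gamma)
    Rx = np.array([[1, 0, 0], [0, ca, -sa], [0, sa, ca]])
    Ry = np.array([[cb, 0, sb], [0, 1, 0], [-sb, 0, cb]])
    Rz = np.array([[cg, -sg, 0], [sg, cg, 0], [0, 0, 1]])
    return Rz @ Ry @ Rx

rng = np.random.default_rng(1)
R = rotation(*rng.uniform(-np.pi, np.pi, 3))
r1, r2 = R[:, 0], R[:, 1]
# constraints (8) and (9): columns are unit-length and mutually orthogonal
norm_r1, norm_r2, dot12 = np.linalg.norm(r1), np.linalg.norm(r2), float(r1 @ r2)
```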
(1.4) Solving the internal and external parameters: establish a system of equations from formulas (10) and (11) and substitute the homography matrices of all calibration images obtained in step (1.2) to solve for the intrinsic matrix $A$.
Let $B = A^{-T} A^{-1}$, a symmetric matrix, and stack its distinct elements as $b = [B_{11}, B_{12}, B_{22}, B_{13}, B_{23}, B_{33}]^T$. Writing $h_i = [h_{i1}, h_{i2}, h_{i3}]^T$, one has $h_i^T B h_j = v_{ij}^T\,b$, where
$$v_{ij} = [h_{i1}h_{j1},\ h_{i1}h_{j2} + h_{i2}h_{j1},\ h_{i2}h_{j2},\ h_{i3}h_{j1} + h_{i1}h_{j3},\ h_{i3}h_{j2} + h_{i2}h_{j3},\ h_{i3}h_{j3}]^T.$$
Equations (10) and (11) can therefore be written as
$$\begin{bmatrix} v_{12}^T \\ (v_{11} - v_{22})^T \end{bmatrix} b = 0.$$
Substituting the values of all the homography matrices, solve for $b$, and from it recover each element of the intrinsic matrix $A$ and then the extrinsic parameters;
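The stacked homogeneous system $Vb = 0$ can be solved as the null vector of $V$ via SVD. The sketch below does this on noise-free synthetic homographies built from a made-up intrinsic matrix (all numbers are illustrative assumptions; this is a Zhang-style toy, not the patent's exact procedure) and checks that the recovered $B$ matches $A^{-T}A^{-1}$ up to scale:

```python
import numpy as np

def rotation(a, b, g):
    ca, sa, cb, sb, cg, sg = np.cos(a), np.sin(a), np.cos(b), np.sin(b), np.cos(g), np.sin(g)
    Rx = np.array([[1, 0, 0], [0, ca, -sa], [0, sa, ca]])
    Ry = np.array([[cb, 0, sb], [0, 1, 0], [-sb, 0, cb]])
    Rz = np.array([[cg, -sg, 0], [sg, cg, 0], [0, 0, 1]])
    return Rz @ Ry @ Rx

def v_ij(H, i, j):
    """Row vector with h_i^T B h_j = v_ij . b for b = [B11,B12,B22,B13,B23,B33]."""
    hi, hj = H[:, i], H[:, j]
    return np.array([hi[0]*hj[0], hi[0]*hj[1] + hi[1]*hj[0], hi[1]*hj[1],
                     hi[2]*hj[0] + hi[0]*hj[2], hi[2]*hj[1] + hi[1]*hj[2], hi[2]*hj[2]])

# Assumed intrinsics in arbitrary (normalized) units for good conditioning
A = np.array([[2.0, 0.1, 0.5],
              [0.0, 1.8, 0.4],
              [0.0, 0.0, 1.0]])
rng = np.random.default_rng(7)
rows = []
for _ in range(5):                               # five synthetic calibration views
    R = rotation(*rng.uniform(-0.5, 0.5, 3))
    t = np.append(rng.uniform(-1.0, 1.0, 2), 4.0)
    H = A @ np.column_stack([R[:, 0], R[:, 1], t])   # eq. (7), lambda = 1
    rows.append(v_ij(H, 0, 1))                   # constraint (10)
    rows.append(v_ij(H, 0, 0) - v_ij(H, 1, 1))   # constraint (11)
V = np.array(rows)
b = np.linalg.svd(V)[2][-1]                      # null vector of V
B_est = np.array([[b[0], b[1], b[3]],
                  [b[1], b[2], b[4]],
                  [b[3], b[4], b[5]]])
B_true = np.linalg.inv(A).T @ np.linalg.inv(A)
B_est = B_est / B_est[2, 2]                      # fix the homogeneous scale and sign
B_true = B_true / B_true[2, 2]
```

In practice the intrinsics are then extracted from $B$ in closed form and refined, as the text's step (1.5) describes.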
(1.5) Optimize the camera's internal and external parameters by minimizing the reprojection error with the Levenberg–Marquardt algorithm.
In step 2), key points are pixels whose values differ strongly from those of the surrounding region, extracted with the FAST algorithm; point pairs around each key point are selected according to the BRIEF scheme, and the descriptor is generated by comparing their pixel values.
Step 5) specifically comprises: using the camera intrinsics obtained in step 1), the 3D coordinates of the feature points obtained in step 4) and the 2D pixel coordinates of the same feature points in the next frame, the pose of the camera when shooting the next frame is solved; its displacement component is the displacement of the camera between the positions from which the two frames were shot.
Step 6) specifically comprises: repeating the operations of steps 2) to 5) in turn on the images shot along the advancing direction, and accumulating the camera's displacement between successive frames to obtain the mileage.
The monocular visual odometer based on image features is used for tunnel detection. In step (3), the most significant feature detection result on the target is taken as the initial condition of feature matching: the surface code of the target visible in the field of view is determined and used as the initial state from which feature matching starts.
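Step 6) reduces to accumulating the lengths of the per-frame displacement components; the displacement values below are hypothetical stand-ins for the translations recovered from each consecutive image pair:

```python
import numpy as np

# Hypothetical per-frame camera displacements (metres) from steps 2)-5)
displacements = np.array([[0.50,  0.01, 0.00],
                          [0.49, -0.02, 0.01],
                          [0.51,  0.00, 0.00]])
# Mileage = sum of the norms of the per-frame displacement vectors
mileage = float(np.sum(np.linalg.norm(displacements, axis=1)))
```

Note that mileage measured this way is path length, which is always at least the norm of the net displacement.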
Example 4
A monocular visual odometer based on image features is mounted on a vehicle-borne detection platform for tunnel detection; its onboard program uses the method of embodiments 1 to 3.
The following table shows the pose six parameters of the monocular vision measuring camera:
TABLE 1 pose six parameters of monocular vision measuring camera
The above description is only exemplary of the present invention and should not be taken as limiting the invention, as any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.
Claims (5)
1. A monocular vision mileage measurement method based on image features is characterized by specifically comprising the following steps:
(1) Calibrating a camera to obtain internal and external parameters of the camera;
(2) Calculating 2D characteristic points of two adjacent frames of images in the front and back direction along the advancing direction;
(3) Matching the 2D feature points to find out corresponding feature points in the two frames of images;
(4) Calculating the 3D coordinates of the corresponding feature points in the two frames of images, and calculating the camera pose according to the 3D coordinates and the 2D coordinates of the corresponding feature points to obtain the relative displacement of the camera;
(5) Repeating the steps (1) to (4) for subsequent frames in sequence, calculating the displacement of the camera relative to the previous frame when shooting each frame, and finally accumulating all the displacements to obtain the mileage;
the step (1) comprises the following steps:
1-1, obtaining a conversion relation among an image coordinate system, a camera coordinate system and a world coordinate system according to a pinhole imaging model;
1-2, shooting a plurality of checkerboard calibration plates under different visual angles, extracting angular points on images of the calibration plates, and obtaining pixel coordinates and physical coordinates of the angular points according to the checkerboard size so as to obtain a homography matrix H of all images of the calibration plates;
1-3, solving internal and external parameters;
1-4, solving a minimized projection error through a Levenberg-Marquardt algorithm, and optimizing internal and external parameters of the camera;
the 2D feature points, that is, orb feature points in step (2), specifically calculating orb feature points of two frames of images before and after the first frame of image, includes the following steps: constructing an image pyramid, extracting key points from each layer according to a fast algorithm, selecting point pairs around the key points according to a brief algorithm, generating a descriptor by comparing pixel values, adjusting the descriptor according to an included angle between the key points and a gray scale centroid, enabling the descriptor to have rotation invariance, and finally obtaining an orb descriptor;
the 2D feature point matching in the step (3) specifically comprises the following steps:
3-1, building a k-d tree for the feature point set of an image: select the dimension k with the maximum variance in the data set; choose the feature point whose value on dimension k is the median m as the splitting node; points whose value on dimension k is smaller than m go to the left subspace and points whose value is larger than m go to the right subspace; repeat the operation on the left and right subspaces until they can no longer be divided, yielding the k-d tree;
3-2, searching for matches with the BBF search algorithm: perform a binary search starting from the root node of the k-d tree, ordering the nodes on the query path by their distances from the query point; when backtracking, start from the tree node with the highest priority, and when all nodes have been checked or the running-time limit is exceeded, take the point with the shortest distance as the nearest-neighbor matching feature point;
the monocular vision odometer of the image characteristics is used for tunnel detection; in the step (3), the most significant feature detection result on the target is used as an initial condition of feature matching, the visible surface code of the target in the visual field is determined, and the feature matching is started by using the visible surface code as an initial state;
the step (4) specifically comprises the following steps:
4-1, establishing a coordinate system by taking the upper left corner of the image as a coordinate origin and taking the area shot by the image as a plane according to the pixel size and the physical size of the image to obtain a 3D coordinate of the characteristic point;
and 4-2, solving the pose of the camera when shooting the next frame by utilizing a coordinate conversion relation according to camera internal parameters, the 3D coordinates of the feature points in the previous frame image and the 2D coordinates of the feature points in the next frame image, and further solving the displacement of the camera between the positions of shooting the previous frame and the next frame.
2. The method for measuring mileage by monocular vision based on image features as claimed in claim 1, wherein the transformation relations among the image coordinate system, the camera coordinate system and the world coordinate system in step 1-1 are specifically:
(1.11) a point with coordinates $[X_w, Y_w, Z_w]^T$ in the world coordinate system has coordinates $[X_c, Y_c, Z_c]^T$ in the camera coordinate system, related by a rotation and a translation, where $R$ is the rotation matrix and $T$ is the displacement between the two coordinate origins:
$$[X_c, Y_c, Z_c]^T = R\,[X_w, Y_w, Z_w]^T + T \qquad (1)$$
(1.12) after the point is imaged by the camera, its coordinates in the image physical-size coordinate system are $[x, y]$; by similar triangles, with $f$ the focal length of the camera,
$$x = f X_c / Z_c, \qquad y = f Y_c / Z_c \qquad (2)$$
(1.13) let the point be $[u, v]$ in the image pixel-size coordinate system; then
$$u = x / d_x + u_0, \qquad v = y / d_y + v_0 \qquad (3)$$
where $(u_0, v_0)$ is the image pixel center, $d_x$ is the physical size of a pixel along the x axis and $d_y$ that along the y axis; combining (1)–(3),
$$Z_c\,[u, v, 1]^T = \begin{bmatrix} f/d_x & 0 & u_0 \\ 0 & f/d_y & v_0 \\ 0 & 0 & 1 \end{bmatrix} [R \mid T]\,[X_w, Y_w, Z_w, 1]^T \qquad (4)$$
(1.15) adding the skewness parameter $C$ finally gives
$$Z_c\,[u, v, 1]^T = \begin{bmatrix} f/d_x & C & u_0 \\ 0 & f/d_y & v_0 \\ 0 & 0 & 1 \end{bmatrix} [R \mid T]\,[X_w, Y_w, Z_w, 1]^T \qquad (5)$$
(1.16) since the checkerboard calibration plate is planar, set $Z_w = 0$; let $A$ denote the camera intrinsic matrix, $r_1, r_2, r_3$ the column vectors of $R$ and $t$ the translation column vector; then equation (5) is written as
$$Z_c\,[u, v, 1]^T = A\,[r_1\ r_2\ t]\,[X_w, Y_w, 1]^T \qquad (6)$$
3. The method for measuring mileage by monocular vision based on image features as claimed in claim 2, wherein the homography matrix is $H = [h_1\ h_2\ h_3]$; according to equation (6), with $\lambda$ a constant,
$$[h_1\ h_2\ h_3] = \lambda A\,[r_1\ r_2\ t] \qquad (7);$$
let $\alpha$, $\beta$ and $\gamma$ be the rotation angles about the x, y and z axes respectively, forming the rotation matrix, whose first two columns satisfy
$$\|r_1\|^2 = (\cos\gamma\cos\beta + \sin\gamma\sin\alpha\sin\beta)^2 + (-\sin\gamma\cos\beta + \cos\gamma\sin\alpha\sin\beta)^2 + (\cos\alpha\sin\beta)^2 = 1$$
and
$$\|r_2\|^2 = (\sin\gamma\cos\alpha)^2 + (\cos\gamma\cos\alpha)^2 + (-\sin\alpha)^2 = 1,$$
so
$$\|r_1\| = \|r_2\| = 1 \qquad (8);$$
calculating the inner product,
$$r_1 \cdot r_2 = (\cos\gamma\cos\beta + \sin\gamma\sin\alpha\sin\beta)(\sin\gamma\cos\alpha) + (-\sin\gamma\cos\beta + \cos\gamma\sin\alpha\sin\beta)(\cos\gamma\cos\alpha) + (\cos\alpha\sin\beta)(-\sin\alpha) = 0 \qquad (9);$$
from equations (7), (8) and (9) it follows that
$$h_1^T A^{-T} A^{-1} h_2 = 0 \qquad (10)$$
$$h_1^T A^{-T} A^{-1} h_1 = h_2^T A^{-T} A^{-1} h_2 \qquad (11).$$
4. The method for measuring mileage by monocular vision based on image features as claimed in claim 3, wherein the specific process of solving the internal and external camera parameters is: establish a system of equations from formulas (10) and (11), substitute the homography matrices obtained in step (1-2), and solve to obtain the intrinsic matrix $A$;
let $B = A^{-T} A^{-1}$ and $b = [B_{11}, B_{12}, B_{22}, B_{13}, B_{23}, B_{33}]^T$; with $h_i = [h_{i1}, h_{i2}, h_{i3}]^T$ one has $h_i^T B h_j = v_{ij}^T\,b$, where
$$v_{ij} = [h_{i1}h_{j1},\ h_{i1}h_{j2} + h_{i2}h_{j1},\ h_{i2}h_{j2},\ h_{i3}h_{j1} + h_{i1}h_{j3},\ h_{i3}h_{j2} + h_{i2}h_{j3},\ h_{i3}h_{j3}]^T;$$
equations (10) and (11) can therefore be written as
$$\begin{bmatrix} v_{12}^T \\ (v_{11} - v_{22})^T \end{bmatrix} b = 0;$$
substitute the values of all the homography matrices, solve for $b$, and then solve for each element of the intrinsic matrix $A$ and the extrinsic parameters.
5. A monocular visual odometer based on image characteristics, characterized by: the monocular vision odometer adopts the image characteristic-based monocular vision mileage measuring method of any one of claims 1 to 4 to calculate mileage.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810044762.XA CN110044374B (en) | 2018-01-17 | 2018-01-17 | Image feature-based monocular vision mileage measurement method and odometer |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810044762.XA CN110044374B (en) | 2018-01-17 | 2018-01-17 | Image feature-based monocular vision mileage measurement method and odometer |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110044374A CN110044374A (en) | 2019-07-23 |
CN110044374B true CN110044374B (en) | 2022-12-09 |
Family
ID=67273048
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810044762.XA Active CN110044374B (en) | 2018-01-17 | 2018-01-17 | Image feature-based monocular vision mileage measurement method and odometer |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110044374B (en) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112449176B (en) * | 2019-09-03 | 2023-05-26 | 浙江舜宇智能光学技术有限公司 | Test method and test system of lifting type camera device |
CN110687929B (en) * | 2019-10-10 | 2022-08-12 | 辽宁科技大学 | Aircraft three-dimensional space target searching system based on monocular vision and motor imagery |
CN110929567B (en) * | 2019-10-17 | 2022-09-27 | 北京全路通信信号研究设计院集团有限公司 | Monocular camera monitoring scene-based target position and speed measuring method and system |
CN111990314A (en) * | 2020-08-25 | 2020-11-27 | 中国水产科学研究院渔业机械仪器研究所 | System and method for quantitative observation of fish behaviors |
CN112066876B (en) * | 2020-08-27 | 2021-07-02 | 武汉大学 | Method for rapidly measuring object size by using mobile phone |
CN111922510B (en) * | 2020-09-24 | 2021-10-01 | 武汉华工激光工程有限责任公司 | Laser visual processing method and system |
CN112798812B (en) * | 2020-12-30 | 2023-09-26 | 中山联合汽车技术有限公司 | Target speed measuring method based on monocular vision |
CN114764005A (en) * | 2021-03-11 | 2022-07-19 | 深圳市科卫泰实业发展有限公司 | Monocular vision odometer method for unmanned aerial vehicle |
CN113223163A (en) * | 2021-04-28 | 2021-08-06 | Oppo广东移动通信有限公司 | Point cloud map construction method and device, equipment and storage medium |
CN113223007A (en) * | 2021-06-28 | 2021-08-06 | 浙江华睿科技股份有限公司 | Visual odometer implementation method and device and electronic equipment |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102663767A (en) * | 2012-05-08 | 2012-09-12 | 北京信息科技大学 | Method for calibrating and optimizing camera parameters of vision measuring system |
CN105354273A (en) * | 2015-10-29 | 2016-02-24 | 浙江高速信息工程技术有限公司 | Method for fast retrieving high-similarity image of highway fee evasion vehicle |
CN106920259A (en) * | 2017-02-28 | 2017-07-04 | 武汉工程大学 | A kind of localization method and system |
CN106952299A (en) * | 2017-03-14 | 2017-07-14 | 大连理工大学 | A kind of 3 d light fields Implementation Technology suitable for Intelligent mobile equipment |
CN107580175A (en) * | 2017-07-26 | 2018-01-12 | 济南中维世纪科技有限公司 | A kind of method of single-lens panoramic mosaic |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103984037B (en) * | 2014-04-30 | 2017-07-28 | 深圳市墨克瑞光电子研究院 | The mobile robot obstacle detection method and device of view-based access control model |
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102663767A (en) * | 2012-05-08 | 2012-09-12 | 北京信息科技大学 | Method for calibrating and optimizing camera parameters of vision measuring system |
CN105354273A (en) * | 2015-10-29 | 2016-02-24 | 浙江高速信息工程技术有限公司 | Method for fast retrieving high-similarity image of highway fee evasion vehicle |
CN106920259A (en) * | 2017-02-28 | 2017-07-04 | 武汉工程大学 | A kind of localization method and system |
CN106952299A (en) * | 2017-03-14 | 2017-07-14 | 大连理工大学 | A kind of 3 d light fields Implementation Technology suitable for Intelligent mobile equipment |
CN107580175A (en) * | 2017-07-26 | 2018-01-12 | 济南中维世纪科技有限公司 | A kind of method of single-lens panoramic mosaic |
Non-Patent Citations (1)
Title |
---|
Research and Implementation of 3D Information Reconstruction Based on Monocular Vision; Gao Cheng; China Excellent Master's Theses (Electronic Journal), Information Science and Technology; 20120315; main text pp. 5-41 *
Also Published As
Publication number | Publication date |
---|---|
CN110044374A (en) | 2019-07-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110044374B (en) | Image feature-based monocular vision mileage measurement method and odometer | |
Chen et al. | High-accuracy multi-camera reconstruction enhanced by adaptive point cloud correction algorithm | |
CN106408609B (en) | A kind of parallel institution end movement position and posture detection method based on binocular vision | |
CN103714571B (en) | A kind of based on photogrammetric single camera three-dimensional rebuilding method | |
JP5618569B2 (en) | Position and orientation estimation apparatus and method | |
JP5832341B2 (en) | Movie processing apparatus, movie processing method, and movie processing program | |
CN105021124B (en) | A kind of planar part three-dimensional position and normal vector computational methods based on depth map | |
CN107588721A (en) | The measuring method and system of a kind of more sizes of part based on binocular vision | |
Tamas et al. | Targetless calibration of a lidar-perspective camera pair | |
CN109579695B (en) | Part measuring method based on heterogeneous stereoscopic vision | |
US20140111507A1 (en) | 3-dimensional shape reconstruction device using depth image and color image and the method | |
CN105043350A (en) | Binocular vision measuring method | |
CN104424630A (en) | Three-dimension reconstruction method and device, and mobile terminal | |
CN105043250B (en) | A kind of double-visual angle data alignment method based on 1 common indicium points | |
Ahmed et al. | Pothole 3D reconstruction with a novel imaging system and structure from motion techniques | |
CN112184811B (en) | Monocular space structured light system structure calibration method and device | |
CN102788572A (en) | Method, device and system for measuring attitude of engineering machinery lifting hook | |
Zhang et al. | Relative orientation based on multi-features | |
CN104167001B (en) | Large-visual-field camera calibration method based on orthogonal compensation | |
CN116188558B (en) | Stereo photogrammetry method based on binocular vision | |
JP6410231B2 (en) | Alignment apparatus, alignment method, and computer program for alignment | |
CN116563377A (en) | Mars rock measurement method based on hemispherical projection model | |
CN102881040A (en) | Three-dimensional reconstruction method for mobile photographing of digital camera | |
CN114092564B (en) | External parameter calibration method, system, terminal and medium for non-overlapping vision multi-camera system | |
CN109493378B (en) | Verticality detection method based on combination of monocular vision and binocular vision |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20200703 Address after: No.5014-112, 5 / F, No.36, Haidian Street, Haidian District, Beijing 100080 Applicant after: Kuanyan (Beijing) Technology Development Co.,Ltd. Address before: Huaxi Securities Building, No. 9 Yuhuatai East Road, Yuhua District of Nanjing City, Jiangsu province 210012 Room 203 Applicant before: NANJING HUOYANHOU INFORMATION TECHNOLOGY Co.,Ltd. |
|
GR01 | Patent grant | ||
GR01 | Patent grant |