CN111862200B - Unmanned aerial vehicle positioning method in coal shed - Google Patents

Unmanned aerial vehicle positioning method in coal shed

Info

Publication number
CN111862200B
CN111862200B (application number CN202010607883.8A; publication CN111862200A)
Authority
CN
China
Prior art keywords: map, frame, points, pose, steps
Prior art date
Legal status: Active (the legal status is an assumption and is not a legal conclusion)
Application number
CN202010607883.8A
Other languages
Chinese (zh)
Other versions
CN111862200A (en)
Inventor
董延超
王浩天
宁少淳
冀玲玲
岳继光
何斌
沈润杰
Current Assignee: Tongji University
Original Assignee
Tongji University
Priority date
Filing date
Publication date
Application filed by Tongji University
Priority to CN202010607883.8A
Publication of CN111862200A
Application granted
Publication of CN111862200B
Legal status: Active

Classifications

    • G06T 7/70: Image analysis; determining position or orientation of objects or cameras
    • G01C 11/04: Photogrammetry or videogrammetry; interpretation of pictures
    • G01C 21/00: Navigation; navigational instruments not provided for in groups G01C1/00 - G01C19/00
    • G06F 16/29: Information retrieval; geographical information databases
    • G06T 11/003: 2D image generation; reconstruction from projections, e.g. tomography
    • G06T 3/604: Geometric image transformations; rotation of whole images or parts thereof using coordinate rotation digital computer [CORDIC] devices
    • Y02T 10/40: Climate change mitigation technologies related to transportation; engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Remote Sensing (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Databases & Information Systems (AREA)
  • Multimedia (AREA)
  • Automation & Control Theory (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to a method for positioning an unmanned aerial vehicle in a coal shed, which comprises the following steps. Step 1: obtain a prior visual tag map. Step 2: initialize the SLAM system. Step 3: solve the camera pose while updating the map, then remove redundant key frames and redundant map points from the map. Step 4: optimize the camera pose solved in step 3 to obtain the final camera attitude system, which comprises the camera pose, the visual tag poses and the spatial positions of the map points. Compared with the prior art, the invention has the advantage of high positioning accuracy.

Description

Unmanned aerial vehicle positioning method in coal shed
Technical Field
The invention relates to the technical field of unmanned aerial vehicle positioning in coal sheds, and in particular to a method for positioning an unmanned aerial vehicle in a coal shed based on a prior visual tag map and natural feature points.
Background
Coal dominates the energy structure of China, and thermal power generation is mainly coal-fired. In recent years, universities and research institutions at home and abroad have introduced different types of stockpile volume measurement systems, which are widely used for stockpile inventory in resource enterprises such as electric power, steel and metallurgy. Laser coal-inventory systems based on traveling cranes or bucket-wheel machines suffer from complex equipment wiring, long measurement cycles and difficult maintenance. Portable laser coal-inventory systems are comparatively simple and convenient to operate, and their performance is greatly improved over fixed instruments, but the whole measurement process requires an operator, and the selection of measurement points directly affects the accuracy of the results. With the development of unmanned aerial vehicle technology, coal-inventory systems for thermal power plants based on unmanned aerial vehicle platforms have also developed rapidly. Rotary-wing aircraft fly slowly, take off and land vertically, execute flight actions accurately, are flexible to operate and simple in structure, and have therefore been applied to coal-inventory work in some thermal power plants.
For example, Chinese patent CN209274918U discloses an unmanned aerial vehicle coal-inventory system comprising an unmanned aerial vehicle, a remote control station and a ground station, together with an RTK positioning module and a water sample collection device carried on the unmanned aerial vehicle, where the RTK positioning module is used to obtain the position of the unmanned aerial vehicle; the system can autonomously collect water samples and obtain position information. However, the positioning accuracy of the positioning module in this system is low, and it cannot provide stable and safe navigation information for a coal-inventory unmanned aerial vehicle system when GNSS signals are weak or absent.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and to provide a method with high positioning accuracy for positioning an unmanned aerial vehicle in a coal shed.
The aim of the invention can be achieved by the following technical scheme:
a method for positioning an unmanned aerial vehicle in a coal shed comprises the following steps:
step 1: obtaining a priori visual tag map;
step 2: initializing the SLAM system;
step 3: solving the pose of the camera, updating the map at the same time, and then removing redundant key frames and redundant map points in the map;
step 4: optimizing the camera pose solved in step 3 to obtain a final camera attitude system, wherein the camera attitude system comprises the camera pose, the visual tag poses and the spatial positions of the map points.
Preferably, the prior visual tag map in the step 1 is specifically:
the prior visual tag map arranged in the coal shed comprises a plurality of two-dimensional codes, and each two-dimensional code has independent ID and pose information.
Preferably, the step 2 specifically includes:
step 2-1: solving the relative pose between the first and second image frames by using the relative pose solving sub-method based on the prior visual tags, and judging whether the SLAM system solved the relative pose successfully; if so, executing step 2-3, otherwise executing step 2-2;
step 2-2: solving the relative pose between the first and second image frames by using the relative pose solving sub-method based on feature points, and then executing step 2-3;
step 2-3: triangularizing the feature points;
step 2-4: constructing the feature points obtained by triangulation as map points, adding the observation attributes between the key frames and the map points, calculating the optimal descriptors, updating the observation directions and distance ranges, and inserting the new map points into the map, thereby finishing the update of the initial map.
More preferably, the relative pose solving sub-method based on the prior visual tags is specifically:

compute the relative pose γ_0 between the visual tag and the first frame f_0, and the relative pose γ_1 between the visual tag and the second frame f_1; the relative pose T between the two frames is then:

T = γ_0^{-1} γ_1
More preferably, the relative pose solving sub-method based on feature points is specifically:

first, the essential matrix model and the homography matrix model are computed.

The essential matrix model is computed as follows: a pair of feature points p_1 and p_2 in the first frame f_0 and the second frame f_1 of the image are the projections of one spatial point P in the two pictures, so the pair (p_1, p_2) determines the point P. In the coordinate system attached to the first frame f_0, the spatial position of P is:

P = [X, Y, Z]^T

Under the camera projection model, the pixel coordinates of p_1 and p_2 are respectively:

x_0 = K P
x_1 = K (R P + t)

where K is the intrinsic matrix of the camera. The epipolar constraint gives:

x_1^T K^{-T} t^ R K^{-1} x_0 = 0

Let the essential matrix be E = t^ R; then E is solved from the pixel positions of the points p_1 and p_2.

The homography matrix model is computed as follows: the homography matrix H is solved with the DLT algorithm.
Then, the scores of the essential matrix model and the homography matrix model are computed as:

S_M = Σ_i [ ρ_M( d_cr^2(x_c^i, x_r^i, M) ) + ρ_M( d_rc^2(x_c^i, x_r^i, M) ) ],  M ∈ {E, H}

where ρ_M is a robust scoring function, d_cr^2(x_c^i, x_r^i, M) is the reprojection error of the matching point pair (x_c^i, x_r^i) in the current frame under the essential matrix model or the homography matrix model, and d_rc^2(x_c^i, x_r^i, M) is the reprojection error of the matching point pair in the reference frame under the essential matrix model or the homography matrix model.

The decision index R_H is computed as:

R_H = S_H / (S_H + S_E)

If R_H > 0.45, the homography matrix H is selected to recover the pose (R, t); otherwise, the essential matrix E is selected to recover (R, t).
More preferably, step 2-3 is specifically:

the position of the spatial point P in the normalized plane of the initial (first) frame f_0 is:

p_r = [u_r, v_r, 1]^T

and its position in the normalized plane of the current (second) frame f_1 is:

p_c = [u_c, v_c, 1]^T

The positional relations are:

z_r p_r = T_rw P_w
z_c p_c = T_cw P_w

where T_rw is the identity transform of the initial frame and T_cw is the relative pose change between the second frame f_1 and the first frame f_0. Since the cross product of a three-dimensional vector with itself is zero:

p_r × z_r p_r = p_r × T_rw P_w = 0
p_c × z_c p_c = p_c × T_cw P_w = 0

Rearranging gives the homogeneous linear system:

[ p_r^ T_rw ]
[ p_c^ T_cw ] P_w = 0

which is finally solved for the spatial coordinates P_w of the feature point.
More preferably, the camera pose in step 3 is solved with a first pose solving sub-method or a second pose solving sub-method.

The first pose solving sub-method is specifically:

a constant-velocity model is adopted, and the initial camera pose of the current frame is obtained from the pose and velocity of the previous image frame, assuming that the pose change of the camera from time k-1 to time k is the same as that from time k-2 to time k-1, i.e.:

T_{k,k-1} = T_{k-1,k-2}

from which:

T_k T_{k-1}^{-1} = T_{k-1} T_{k-2}^{-1}

Applying the inverse transform recursively gives:

T_k = T_{k-1} T_{k-2}^{-1} T_{k-1}

so the motion velocity of the camera is:

V = T_{k-1} T_{k-2}^{-1}

The translation vector t from the current frame to the previous frame is computed; the three-dimensional points corresponding to the feature points of the previous frame are reprojected into the current frame; feature points are searched for at the reprojected positions; and the pose of the current frame is optimized by minimizing the reprojection error.
The second pose solving sub-method is specifically:

the map points and visual tags matched between the reference key frame and the current frame are obtained. If more than two visual tags are matched, the initial pose of the current frame is obtained from the world pose T_wm of each visual tag and the relative pose T_cm between the local coordinate system of the visual tag and the current frame coordinate system:

T_cw = T_cm T_wm^{-1}

The pose of the current frame is then optimized by minimizing the reprojection error, computed as:

e_i = x_i − π_m(R p_i + t)
(R, t)* = argmin_{R,t} Σ_i ‖ x_i − π_m(R p_i + t) ‖^2

where (R, t) is the transformation of the visual tag corner points from the visual tag local coordinate system to the camera coordinate system; x_i is the pixel coordinate of a visual tag corner point in the image; p_i is the coordinate of the corner point in the visual tag local coordinate system; and π_m is the camera projection model.
Preferably, the map is updated in step 3 as follows:

for the current frame, if at least one new visual tag that does not exist in the original map is observed, or the number of feature points tracked in the current frame is more than 50, or the repetition rate between the feature points tracked in the current frame and those of the reference key frame is less than 90%, then the current frame is added to the map as a key frame, and the map points obtained by triangulating the current frame are also added to the map.
Preferably, the method for removing redundant key frames in the map is specifically:

if more than 90% of the map points and all of the visual tags observed in a key frame can also be observed by three other key frames in the map, that key frame is judged redundant and removed from the map;

the method for removing redundant map points is:

each map point is projected into its co-visible key frames, and the feature points in the grid cell containing the projected position are extracted; if the Hamming distance between the feature point descriptors is less than 30, the current map point is replaced by the corresponding map point of the co-visible key frame.
Preferably, step 4 is specifically:

firstly, key frames having more than 50 co-visible map points or more than 2 co-visible visual tags, together with the corresponding map points and visual tags, are retrieved from the map; then the camera poses having co-visibility relations in the tracking process, the visual tag poses and the spatial positions of the map points are optimized by using the L-M method, minimizing:

E = Σ_i w_1 ‖ x_i^p − π_m(T_cw p_i^p) ‖^2 + Σ_i w_2 ‖ x_i^m − π_m(R p_i^m + t) ‖^2

where w_1 and w_2 respectively represent the weights of the map points and the visual tags in the optimization; p_i^p represents a map point in space; x_i^p represents the pixel point corresponding to that map point; p_i^m represents the coordinates of a visual tag corner point in the visual tag local coordinate system; x_i^m represents the pixel point corresponding to that corner point; π_m represents the projection equation of the camera; and T_cw represents the pose of the current frame.

The weights are calculated with w_1 = 1, while w_2 is computed from the quantities m and n, where m is the number of map points observed in the current frame and n is the number of prior visual tags observed in the current frame.
Compared with the prior art, the invention has the following advantages:

higher positioning accuracy: in the method for positioning an unmanned aerial vehicle in a coal shed, the true pose of the unmanned aerial vehicle is determined through the prior visual tag map; since the dimensions and poses of the prior visual tags are known, the pose of the unmanned aerial vehicle determined through the prior visual tags is more accurate, and so is the positioning of the unmanned aerial vehicle.
Drawings
FIG. 1 is a schematic flow chart of a method for positioning an unmanned aerial vehicle in a coal shed;
FIG. 2 is a layout of a priori visual tags in an embodiment of the present invention;
FIG. 3 is a schematic diagram of a SLAM map according to an embodiment of the present invention;
fig. 4 is a schematic diagram of path planning of the unmanned aerial vehicle according to an embodiment of the present invention;
FIG. 5 is a comparison of the X-axis trajectories obtained with the positioning method of the invention and with GPS-based positioning in an embodiment of the invention;
FIG. 6 is a comparison of the Y-axis trajectories obtained with the positioning method of the invention and with GPS-based positioning in an embodiment of the invention;
FIG. 7 is a comparison of the Z-axis trajectories obtained with the positioning method of the invention and with GPS-based positioning in an embodiment of the invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are some, but not all embodiments of the invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the present invention without making any inventive effort, shall fall within the scope of the present invention.
The method for positioning the unmanned aerial vehicle in the coal shed comprises the following steps:
step 1: obtaining a priori visual tag map;
As with humans reading a map, a robot describes and recognizes its environment chiefly by means of a map, describing the current environment information through an environment map. As shown in fig. 2, a map comprising a plurality of two-dimensional codes is arranged in the coal shed; each two-dimensional code has an independent ID and pose information, and this visual tag map serves as prior knowledge for unmanned aerial vehicle navigation;
step 2: initializing the SLAM system, specifically:
step 2-1: solving the relative pose between the first and second image frames by using the relative pose solving sub-method based on the prior visual tags, and judging whether the SLAM system solved the relative pose successfully; if so, executing step 2-3, otherwise executing step 2-2;
step 2-2: solving the relative pose between the first and second image frames by using the relative pose solving sub-method based on feature points, and then executing step 2-3;
The relative pose solving sub-method based on the prior visual tags is specifically:

compute the relative pose γ_0 between the visual tag and the first frame f_0, and the relative pose γ_1 between the visual tag and the second frame f_1; the relative pose T between the two frames is then:

T = γ_0^{-1} γ_1
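To make this concrete, the following is a minimal sketch in Python/NumPy (not part of the patent; the function names and the 4×4 homogeneous-matrix convention are assumptions) of the computation T = γ_0^{-1} γ_1:

```python
import numpy as np

def inv_se3(T):
    """Invert a 4x4 rigid-body transform using R^T and -R^T t
    instead of a general matrix inverse."""
    R, t = T[:3, :3], T[:3, 3]
    Ti = np.eye(4)
    Ti[:3, :3] = R.T
    Ti[:3, 3] = -R.T @ t
    return Ti

def relative_pose_from_tag(gamma0, gamma1):
    """Frame-to-frame pose implied by one tag seen in both frames.

    gamma0, gamma1: 4x4 relative poses between the tag and frames f0, f1,
    combined following the patent's convention T = gamma0^{-1} gamma1.
    """
    return inv_se3(gamma0) @ gamma1
```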
The relative pose solving sub-method based on feature points is specifically:

first, the essential matrix model and the homography matrix model are computed.

The essential matrix model is computed as follows: a pair of feature points p_1 and p_2 in the first frame f_0 and the second frame f_1 of the image are the projections of one spatial point P in the two pictures, so the pair (p_1, p_2) determines the point P. In the coordinate system attached to the first frame f_0, the spatial position of P is:

P = [X, Y, Z]^T

Under the camera projection model, the pixel coordinates of p_1 and p_2 are respectively:

x_0 = K P
x_1 = K (R P + t)

where K is the intrinsic matrix of the camera. The epipolar constraint gives:

x_1^T K^{-T} t^ R K^{-1} x_0 = 0

Let the essential matrix be E = t^ R; then E is solved from the pixel positions of the points p_1 and p_2.

The homography matrix model is computed as follows: the homography matrix H is solved with the DLT algorithm.
Then, the scores of the essential matrix model and the homography matrix model are computed as:

S_M = Σ_i [ ρ_M( d_cr^2(x_c^i, x_r^i, M) ) + ρ_M( d_rc^2(x_c^i, x_r^i, M) ) ],  M ∈ {E, H}

where ρ_M is a robust scoring function, d_cr^2(x_c^i, x_r^i, M) is the reprojection error of the matching point pair (x_c^i, x_r^i) in the current frame under the essential matrix model or the homography matrix model, and d_rc^2(x_c^i, x_r^i, M) is the reprojection error of the matching point pair in the reference frame under the essential matrix model or the homography matrix model.

The decision index R_H is computed as:

R_H = S_H / (S_H + S_E)

If R_H > 0.45, the homography matrix H is selected to recover the pose (R, t); otherwise, the essential matrix E is selected to recover (R, t).
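As a hedged illustration of this model-selection step, the sketch below scores both models and applies the R_H > 0.45 rule. The per-pair squared reprojection errors are assumed precomputed, and the robust scoring thresholds (5.99 for H, 3.84 for E, reward Γ = 5.99) are the values used in ORB-SLAM and are an assumption here, since the patent only fixes the decision rule:

```python
import numpy as np

def model_score(d2_cr, d2_rc, tau, gamma=5.99):
    """S_M: a residual contributes (gamma - d^2) only if its squared
    error passes the chi-square gate tau; outliers contribute nothing."""
    rho = lambda d2: np.where(d2 < tau, gamma - d2, 0.0)
    return float(np.sum(rho(d2_cr) + rho(d2_rc)))

def select_model(d2_cr_H, d2_rc_H, d2_cr_E, d2_rc_E):
    S_H = model_score(d2_cr_H, d2_rc_H, tau=5.99)
    S_E = model_score(d2_cr_E, d2_rc_E, tau=3.84)
    R_H = S_H / (S_H + S_E)
    return ("H" if R_H > 0.45 else "E"), R_H
```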
step 2-3: triangulate the feature points, specifically as follows:

the position of the spatial point P in the normalized plane of the initial (first) frame f_0 is:

p_r = [u_r, v_r, 1]^T

and its position in the normalized plane of the current (second) frame f_1 is:

p_c = [u_c, v_c, 1]^T

The positional relations are:

z_r p_r = T_rw P_w
z_c p_c = T_cw P_w

where T_rw is the identity transform of the initial frame and T_cw is the relative pose change between the second frame f_1 and the first frame f_0. Since the cross product of a three-dimensional vector with itself is zero:

p_r × z_r p_r = p_r × T_rw P_w = 0
p_c × z_c p_c = p_c × T_cw P_w = 0

Rearranging gives the homogeneous linear system:

[ p_r^ T_rw ]
[ p_c^ T_cw ] P_w = 0

which is finally solved for the spatial coordinates P_w of the feature point.
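A minimal sketch of this linear triangulation, assuming 3×4 world-to-frame transforms and homogeneous normalized-plane points, and using an SVD null-space solve (one common way to handle such homogeneous systems):

```python
import numpy as np

def skew(v):
    """3x3 matrix with skew(v) @ u == np.cross(v, u)."""
    return np.array([[0.0, -v[2], v[1]],
                     [v[2], 0.0, -v[0]],
                     [-v[1], v[0], 0.0]])

def triangulate(p_r, p_c, T_rw, T_cw):
    """Solve [p_r^ T_rw; p_c^ T_cw] P_w = 0 for the 3D point P_w.

    p_r, p_c: [u, v, 1] points on the two normalized planes
    T_rw, T_cw: 3x4 transforms of the reference and current frames
    """
    A = np.vstack([skew(p_r) @ T_rw, skew(p_c) @ T_cw])
    _, _, Vt = np.linalg.svd(A)
    P_h = Vt[-1]              # null-space direction of A
    return P_h[:3] / P_h[3]   # de-homogenize
```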
Step 2-4: constructing the feature points obtained by triangulation as map points, adding the observation attributes between the key frames and the map points, calculating the optimal descriptors, updating the observation directions and distance ranges, and inserting the new map points into the map, thereby finishing the update of the initial map.
Step 3: solving the pose of the camera, updating the map at the same time, and then removing redundant key frames and redundant map points in the map;
the pose is solved in two ways, namely, the pose is solved by using a first pose solving sub-method or a second pose solving sub-method.
The first pose solving sub-method is specifically:

a constant-velocity model is adopted, and the initial camera pose of the current frame is obtained from the pose and velocity of the previous image frame, assuming that the pose change of the camera from time k-1 to time k is the same as that from time k-2 to time k-1, i.e.:

T_{k,k-1} = T_{k-1,k-2}

from which:

T_k T_{k-1}^{-1} = T_{k-1} T_{k-2}^{-1}

Applying the inverse transform recursively gives:

T_k = T_{k-1} T_{k-2}^{-1} T_{k-1}

so the motion velocity of the camera is:

V = T_{k-1} T_{k-2}^{-1}

The translation vector t from the current frame to the previous frame is computed; the three-dimensional points corresponding to the feature points of the previous frame are reprojected into the current frame; feature points are searched for at the reprojected positions; and the pose of the current frame is optimized by minimizing the reprojection error.
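A sketch of the constant-velocity prediction, under the assumption that poses are 4×4 world-to-camera transforms (the function name is hypothetical):

```python
import numpy as np

def predict_pose_constant_velocity(T_k1, T_k2):
    """Predict the camera pose at time k from poses at k-1 and k-2.

    The frame-to-frame motion V = T_{k-1} T_{k-2}^{-1} is assumed
    constant, so the predicted pose is T_k = V T_{k-1}.
    """
    V = T_k1 @ np.linalg.inv(T_k2)
    return V @ T_k1
```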
The second pose solving sub-method is specifically:

the map points and visual tags matched between the reference key frame and the current frame are obtained. If more than two visual tags are matched, the initial pose of the current frame is obtained from the world pose T_wm of each visual tag and the relative pose T_cm between the local coordinate system of the visual tag and the current frame coordinate system:

T_cw = T_cm T_wm^{-1}

The pose of the current frame is then optimized by minimizing the reprojection error, computed as:

e_i = x_i − π_m(R p_i + t)
(R, t)* = argmin_{R,t} Σ_i ‖ x_i − π_m(R p_i + t) ‖^2

where (R, t) is the transformation of the visual tag corner points from the visual tag local coordinate system to the camera coordinate system; x_i is the pixel coordinate of a visual tag corner point in the image; p_i is the coordinate of the corner point in the visual tag local coordinate system; and π_m is the camera projection model.
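The tag-based initialization and the corner reprojection error can be sketched as follows. This is a non-authoritative illustration: T_wm maps tag coordinates to the world, T_cm maps tag coordinates to the camera, and `project` stands in for the camera projection model π_m:

```python
import numpy as np

def tag_initial_pose(T_cm, T_wm):
    """Initial world-to-camera pose of the current frame from one
    matched tag: T_cw = T_cm @ inv(T_wm)."""
    return T_cm @ np.linalg.inv(T_wm)

def tag_reprojection_error(R, t, corners_tag, corners_px, project):
    """Sum of squared reprojection errors over the tag corners,
    e_i = x_i - project(R p_i + t)."""
    return sum(float(np.sum((x_i - project(R @ p_i + t)) ** 2))
               for p_i, x_i in zip(corners_tag, corners_px))
```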
The map is updated as follows. For the current frame, if:
(1) at least one new visual tag that does not exist in the original map is observed, or the number of feature points tracked in the current frame is more than 50; or
(2) the repetition rate between the feature points tracked in the current frame and those of the reference key frame is less than 90%;
that is, if either of the two conditions is satisfied, then the current frame is added to the map as a key frame, and the map points obtained by triangulating the current frame are also added to the map.
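This key-frame test reduces to a small predicate. The sketch below mirrors the alternatives above, with thresholds taken directly from the text and hypothetical argument names:

```python
def should_insert_keyframe(n_new_tags, n_tracked_points, repetition_rate):
    """True if the current frame should become a key frame.

    n_new_tags: visual tags seen now but absent from the map
    n_tracked_points: feature points tracked in the current frame
    repetition_rate: fraction (0..1) of tracked points shared with
    the reference key frame
    """
    return (n_new_tags >= 1
            or n_tracked_points > 50
            or repetition_rate < 0.90)
```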
Redundant key frames are removed from the map as follows:

if more than 90% of the map points and all of the visual tags observed in a key frame can also be observed by three other key frames in the map, that key frame is judged redundant and removed from the map;

and redundant map points are removed as follows:

each map point is projected into its co-visible key frames, and the feature points in the grid cell containing the projected position are extracted; if the Hamming distance between the feature point descriptors is less than 30, the current map point is replaced by the corresponding map point of the co-visible key frame.
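The descriptor test for map-point fusion can be sketched as below, assuming 256-bit binary descriptors (e.g. ORB) packed into uint8 arrays; the patent itself specifies only the Hamming-distance threshold of 30:

```python
import numpy as np

def find_replacement(desc_current, candidate_descs, max_hamming=30):
    """Index of the co-visible key frame's map point that should
    replace the current one, or None if no descriptor is close enough."""
    best_i, best_d = None, max_hamming
    for i, desc in enumerate(candidate_descs):
        d = int(np.unpackbits(desc_current ^ desc).sum())  # Hamming distance
        if d < best_d:
            best_i, best_d = i, d
    return best_i
```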
Step 4: optimizing the camera pose solved in step 3 to obtain the final camera attitude system, wherein the final camera attitude system comprises the camera pose, the visual tag poses and the spatial positions of the map points, specifically as follows:

firstly, key frames having more than 50 co-visible map points or more than 2 co-visible visual tags, together with the corresponding map points and visual tags, are retrieved from the map; then the camera attitude system with co-visibility relations in the tracking process, namely the camera poses, the visual tag poses and the spatial positions of the map points, is optimized by using the L-M method, minimizing:

E = Σ_i w_1 ‖ x_i^p − π_m(T_cw p_i^p) ‖^2 + Σ_i w_2 ‖ x_i^m − π_m(R p_i^m + t) ‖^2

where w_1 and w_2 respectively represent the weights of the map points and the visual tags in the optimization; p_i^p represents a map point in space; x_i^p represents the pixel point corresponding to that map point; p_i^m represents the coordinates of a visual tag corner point in the visual tag local coordinate system; x_i^m represents the pixel point corresponding to that corner point; π_m represents the projection equation of the camera; and T_cw represents the pose of the current frame.

The weights are calculated with w_1 = 1, while w_2 is computed from the quantities m and n, where m is the number of map points observed in the current frame and n is the number of prior visual tags observed in the current frame.
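For illustration, the weighted cost of one frame can be evaluated as below and handed to an L-M optimizer such as scipy.optimize.least_squares. This is only a sketch: w_2 is passed in as a parameter, because the patent defines it only in terms of m and n, and `project` stands in for π_m:

```python
import numpy as np

def joint_cost(T_cw, R, t, pts_world, pts_px, corners_tag, corners_px,
               project, w2, w1=1.0):
    """E = sum w1*||x_p - pi(T_cw p_p)||^2 + sum w2*||x_m - pi(R p_m + t)||^2."""
    cost = 0.0
    for P_i, x_i in zip(pts_world, pts_px):        # map-point term
        P_c = (T_cw @ np.append(P_i, 1.0))[:3]     # world point into camera frame
        cost += w1 * float(np.sum((x_i - project(P_c)) ** 2))
    for p_i, x_i in zip(corners_tag, corners_px):  # tag-corner term
        cost += w2 * float(np.sum((x_i - project(R @ p_i + t)) ** 2))
    return cost
```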
Finally, the optimized camera attitude system, namely the camera pose, the visual tag pose and the map point space position, is obtained, so that the accurate positioning of the unmanned aerial vehicle in the coal shed is completed.
Meanwhile, while the above steps are carried out, the SLAM map is maintained and updated in real time. Storing the map means storing each element in the map together with the relations between the elements; the data needed during tracking are precisely the objects to be stored. The map mainly comprises key frames, natural-feature map points, two-dimensional codes, BoW vectors, the co-visibility graph, the spanning tree, and so on. The tracking process uses the three tracking models as well as local map tracking: local map tracking needs elements such as 3D map points and co-visibility relations; the reference frame model needs the BoW vectors of the key frames; and relocalization needs the BoW vectors, 3D points, two-dimensional codes, and so on. These basic elements must therefore be stored.
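A hypothetical container for these stored elements might look as follows (the field names are illustrative, not the patent's):

```python
from dataclasses import dataclass, field

@dataclass
class SlamMap:
    """Elements the text says must be stored, plus their relations."""
    keyframes: dict = field(default_factory=dict)     # id -> (pose, BoW vector)
    map_points: dict = field(default_factory=dict)    # id -> (3D position, descriptor)
    tags: dict = field(default_factory=dict)          # tag ID -> world pose
    covisibility: dict = field(default_factory=dict)  # keyframe id -> set of neighbor ids
    spanning_tree: dict = field(default_factory=dict) # keyframe id -> parent keyframe id
```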
In this embodiment, as shown in fig. 3, the SLAM map constructed by the unmanned aerial vehicle comprises visual tags, three-dimensional map points and key frames, and new elements are continuously added to the map as the unmanned aerial vehicle moves. The map can also be stored: a map saved during an earlier flight in the same scene can serve as prior information for later flights. The visual tag elements in the map impose a strong constraint on the positioning of the unmanned aerial vehicle; when a corresponding visual tag is detected in an image, the pose of the unmanned aerial vehicle and the pose of the visual tag are optimized simultaneously. The path planned by the unmanned aerial vehicle according to the prior visual tags is shown in fig. 4; the world coordinate system can be constructed from the visual tags and path planning carried out in it. The trajectory determined by the proposed positioning method was also compared with that of a GPS-based positioning method in an outdoor low-texture grass scene, as shown in figs. 5, 6 and 7. As can be seen from the figures, the trajectory output by the proposed method is essentially consistent with the GPS trajectory, and the unmanned aerial vehicle can localize itself with high accuracy in real time during autonomous flight: the average distance errors on the x, y and z axes are 0.072 m, 0.107 m and 0.206 m, and the scale is consistent with the real world. Even in poorly textured scenes where conventional visual SLAM systems fail, such as coal sheds, grassland and factories, the method still achieves high-accuracy positioning.
While the invention has been described with reference to certain preferred embodiments, it will be understood by those skilled in the art that various changes and equivalent substitutions may be made without departing from the scope of the invention. Therefore, the protection scope of the invention is subject to the protection scope of the claims.

Claims (7)

1. An unmanned aerial vehicle positioning method in a coal shed, characterized by comprising the following steps:
step 1: obtaining a priori visual tag map;
step 2: initializing the SLAM system;
step 3: solving the pose of the camera, updating the map at the same time, and then removing redundant key frames and redundant map points in the map;
step 4: optimizing the camera pose solved in step 3 to obtain a final camera attitude system, wherein the camera attitude system comprises the camera pose, the visual tag poses and the spatial positions of the map points;

the step 4 is specifically as follows:

firstly, key frames having more than 50 co-visible map points or more than 2 co-visible visual tags, together with the corresponding map points and visual tags, are retrieved from the map; then the camera poses having co-visibility relations in the tracking process, the visual tag poses and the spatial positions of the map points are optimized by using the L-M method, minimizing:

E = Σ_i w_1 ‖ x_i^p − π_m(T_cw p_i^p) ‖^2 + Σ_i w_2 ‖ x_i^m − π_m(R p_i^m + t) ‖^2

wherein w_1 and w_2 respectively represent the weights of the map points and the visual tags in the optimization; p_i^p represents a map point in space; x_i^p represents the pixel point corresponding to the map point; p_i^m represents the coordinates of a visual tag corner point in the visual tag local coordinate system; x_i^m represents the pixel point corresponding to the visual tag corner point; π_m represents the projection equation of the camera; T_wm represents the world pose of a visual tag; and T_cw represents the initial pose of the current frame;

the weights w_1 and w_2 are calculated with w_1 = 1, while w_2 is computed from the quantities m and n, wherein m is the number of map points observed in the current frame and n is the number of prior visual tags observed in the current frame;
the step 2 specifically comprises the following steps:
step 2-1: solving the relative pose between the first and second image frames by using the relative pose solving sub-method based on the prior visual tags, and judging whether the SLAM system solved the relative pose successfully; if so, executing step 2-3, otherwise executing step 2-2;
step 2-2: solving the relative pose between the first and second image frames by using the relative pose solving sub-method based on feature points, and then executing step 2-3;
step 2-3: triangulating the feature points;
step 2-4: constructing the feature points obtained by triangulation as map points, adding the observation attributes between the key frames and the map points, calculating the optimal descriptors, updating the observation directions and distance ranges, and inserting the new map points into the map, thereby finishing the update of the initial map.
2. The method for positioning the unmanned aerial vehicle in the coal shed according to claim 1, wherein the prior visual tag map in step 1 is specifically:
the prior visual tag map arranged in the coal shed comprises a plurality of two-dimensional codes, and each two-dimensional code has independent ID and pose information.
3. The method for positioning the unmanned aerial vehicle in the coal shed according to claim 1, wherein the relative pose solving sub-method based on the prior visual tags is specifically:

compute the relative pose γ_0 between the visual tag and the first frame f_0, and the relative pose γ_1 between the visual tag and the second frame f_1; the relative pose T between the two frames is then: T = γ_0^{-1} γ_1.
4. The method for positioning the unmanned aerial vehicle in the coal shed according to claim 1, wherein the relative pose solving sub-method based on feature points is specifically:

first, the essential matrix model and the homography matrix model are computed;

the essential matrix model is computed as follows: a pair of feature points p_1 and p_2 in the first frame f_0 and the second frame f_1 of the image are the projections of one spatial point P in the two pictures, so the pair (p_1, p_2) determines the point P; in the coordinate system attached to the first frame f_0, the spatial position of P is:

P = [X, Y, Z]^T

under the camera projection model, the pixel coordinates of p_1 and p_2 are respectively:

x_0 = K P
x_1 = K (R P + t)

wherein K is the intrinsic matrix of the camera; the epipolar constraint gives:

x_1^T K^{-T} t^ R K^{-1} x_0 = 0

let the essential matrix be E = t^ R; then E is solved from the pixel positions of the points p_1 and p_2;

the homography matrix model is computed as follows: the homography matrix H is solved with the DLT algorithm;

then, the scores of the essential matrix model and the homography matrix model are computed as:

S_M = Σ_i [ ρ_M( d_cr^2(x_c^i, x_r^i, M) ) + ρ_M( d_rc^2(x_c^i, x_r^i, M) ) ],  M ∈ {E, H}

wherein ρ_M is a robust scoring function, d_cr^2(x_c^i, x_r^i, M) is the reprojection error of the matching point pair (x_c^i, x_r^i) in the current frame under the essential matrix model or the homography matrix model, and d_rc^2(x_c^i, x_r^i, M) is the reprojection error of the matching point pair in the reference frame under the essential matrix model or the homography matrix model;

the decision index R_H is computed as:

R_H = S_H / (S_H + S_E)

if R_H > 0.45, the homography matrix H is selected to recover the pose (R, t); otherwise, the essential matrix E is selected to recover (R, t).
5. The method for positioning the unmanned aerial vehicle in the coal shed according to claim 1, wherein step 2-3 is specifically:

the position of the spatial point P in the normalized plane of the initial (first) frame f_0 is:

p_r = [u_r, v_r, 1]^T

and its position in the normalized plane of the current (second) frame f_1 is:

p_c = [u_c, v_c, 1]^T

the positional relations are:

z_r p_r = T_rw P_w
z_c p_c = T_cw P_w

wherein T_rw is the identity transform of the initial frame and T_cw is the relative pose change between the second frame f_1 and the first frame f_0; since the cross product of a three-dimensional vector with itself is zero:

p_r × z_r p_r = p_r × T_rw P_w = 0
p_c × z_c p_c = p_c × T_cw P_w = 0

rearranging gives the homogeneous linear system:

[ p_r^ T_rw ]
[ p_c^ T_cw ] P_w = 0

which is finally solved for the spatial coordinates P_w of the feature point.
6. The method for positioning the unmanned aerial vehicle in the coal shed according to claim 1, wherein the map is updated in step 3 as follows:

for the current frame, if at least one new visual tag that does not exist in the original map is observed, or the number of feature points tracked in the current frame is more than 50, or the repetition rate between the feature points tracked in the current frame and those of the reference key frame is less than 90%, then the current frame is added to the map as a key frame, and the map points obtained by triangulating the current frame are also added to the map.
7. The method for positioning the unmanned aerial vehicle in the coal shed according to claim 1, wherein redundant key frames are removed from the map as follows:

if more than 90% of the map points and all of the visual tags observed in a key frame can also be observed by three other key frames in the map, that key frame is judged redundant and removed from the map;

and redundant map points are removed as follows:

each map point is projected into its co-visible key frames, and the feature points in the grid cell containing the projected position are extracted; if the Hamming distance between the feature point descriptors is less than 30, the current map point is replaced by the corresponding map point of the co-visible key frame.
CN202010607883.8A 2020-06-30 2020-06-30 Unmanned aerial vehicle positioning method in coal shed Active CN111862200B (en)

Priority Applications (1)

Application Number: CN202010607883.8A
Priority Date: 2020-06-30
Filing Date: 2020-06-30
Title: Unmanned aerial vehicle positioning method in coal shed

Applications Claiming Priority (1)

Application Number: CN202010607883.8A
Priority Date: 2020-06-30
Filing Date: 2020-06-30
Title: Unmanned aerial vehicle positioning method in coal shed

Publications (2)

Publication Number Publication Date
CN111862200A CN111862200A (en) 2020-10-30
CN111862200B true CN111862200B (en) 2023-04-28

Family

ID=72989892

Family Applications (1)

Application Number: CN202010607883.8A (granted as CN111862200B, Active)
Priority Date: 2020-06-30
Filing Date: 2020-06-30

Country Status (1)

Country Link
CN (1) CN111862200B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112945240B (en) * 2021-03-16 2022-06-07 北京三快在线科技有限公司 Method, device and equipment for determining positions of feature points and readable storage medium
CN113345017B (en) * 2021-05-11 2022-09-20 香港理工大学深圳研究院 Method for assisting visual SLAM by using mark
CN115170847B (en) * 2022-09-05 2022-12-13 煤炭科学研究总院有限公司 Coal mine underground monocular vision positioning method and device based on prior map constraint

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102359788A (en) * 2011-09-09 2012-02-22 华中科技大学 Series image target recursive identification method based on platform inertia attitude parameter
CN107830854A (en) * 2017-11-06 2018-03-23 深圳精智机器有限公司 Vision positioning method based on sparse cloud of ORB and Quick Response Code
CN109345588A (en) * 2018-09-20 2019-02-15 浙江工业大学 A kind of six-degree-of-freedom posture estimation method based on Tag
CN109509230A (en) * 2018-11-13 2019-03-22 武汉大学 A kind of SLAM method applied to more camera lens combined type panorama cameras
CN109544636A (en) * 2018-10-10 2019-03-29 广州大学 A kind of quick monocular vision odometer navigation locating method of fusion feature point method and direct method
WO2019169540A1 (en) * 2018-03-06 2019-09-12 斯坦德机器人(深圳)有限公司 Method for tightly-coupling visual slam, terminal and computer readable storage medium
CN110533587A (en) * 2019-07-03 2019-12-03 浙江工业大学 A kind of SLAM method of view-based access control model prior information and map recovery
CN111060115A (en) * 2019-11-29 2020-04-24 中国科学院计算技术研究所 Visual SLAM method and system based on image edge features

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102359788A (en) * 2011-09-09 2012-02-22 华中科技大学 Series image target recursive identification method based on platform inertia attitude parameter
CN107830854A (en) * 2017-11-06 2018-03-23 深圳精智机器有限公司 Vision positioning method based on sparse cloud of ORB and Quick Response Code
WO2019169540A1 (en) * 2018-03-06 2019-09-12 斯坦德机器人(深圳)有限公司 Method for tightly-coupling visual slam, terminal and computer readable storage medium
CN109345588A (en) * 2018-09-20 2019-02-15 浙江工业大学 A kind of six-degree-of-freedom posture estimation method based on Tag
CN109544636A (en) * 2018-10-10 2019-03-29 广州大学 A kind of quick monocular vision odometer navigation locating method of fusion feature point method and direct method
CN109509230A (en) * 2018-11-13 2019-03-22 武汉大学 A kind of SLAM method applied to more camera lens combined type panorama cameras
CN110533587A (en) * 2019-07-03 2019-12-03 浙江工业大学 A kind of SLAM method of view-based access control model prior information and map recovery
CN111060115A (en) * 2019-11-29 2020-04-24 中国科学院计算技术研究所 Visual SLAM method and system based on image edge features

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
Yanchao Dong, Senbo Wang, Jiguang Yue, Ce Chen, Shibo He, Haoti. A Novel Texture-Less Object Oriented Visual SLAM System. IEEE Transactions on Intelligent Transportation Systems, 2019-11-14 (full text). *
Raúl Mur-Artal, Juan D. Tardós. ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras. IEEE Transactions on Robotics, 2017. *
Raúl Mur-Artal, J. M. M. Montiel, Juan D. Tardós. ORB-SLAM: A Versatile and Accurate Monocular SLAM System. IEEE Transactions on Robotics, 2015. *
Senbo Wang, Jiguang Yue, Yanchao Dong, Runjie Shen, Xinyu Zhang. Real-time Omnidirectional Visual SLAM with Semi-Dense Mapping. 2018 IEEE Intelligent Vehicles Symposium (IV), 2018-10-22 (full text). *
Zou Qian (邹谦). Research on Navigation Methods for Mobile Robots Based on Graph-Optimization SLAM (基于图优化SLAM的移动机器人导航方法研究). China Master's Theses Full-text Database, Information Science and Technology, 2018-02-15 (full text). *
Gao Xiang, Zhang Tao (高翔, 张涛). 14 Lectures on Visual SLAM (视觉SLAM十四讲). 2019. *

Also Published As

Publication number Publication date
CN111862200A (en) 2020-10-30

Similar Documents

Publication Publication Date Title
CN112347840B (en) Vision sensor laser radar integrated unmanned aerial vehicle positioning and image building device and method
CN111862200B (en) Unmanned aerial vehicle positioning method in coal shed
CN105843223B (en) A kind of mobile robot three-dimensional based on space bag of words builds figure and barrier-avoiding method
CN109186606B (en) Robot composition and navigation method based on SLAM and image information
CN107741234A (en) The offline map structuring and localization method of a kind of view-based access control model
CN102426019B (en) Unmanned aerial vehicle scene matching auxiliary navigation method and system
Li et al. Efficient laser-based 3D SLAM for coal mine rescue robots
CN102967305B (en) Multi-rotor unmanned aerial vehicle pose acquisition method based on markers in shape of large and small square
CN109579843A (en) Multirobot co-located and fusion under a kind of vacant lot multi-angle of view build drawing method
CN110361027A (en) Robot path planning method based on single line laser radar Yu binocular camera data fusion
CN111178148B (en) Ground target geographic coordinate positioning method based on unmanned aerial vehicle vision system
CN110930495A (en) Multi-unmanned aerial vehicle cooperation-based ICP point cloud map fusion method, system, device and storage medium
CN105865449A (en) Laser and vision-based hybrid location method for mobile robot
CN111664843A (en) SLAM-based intelligent storage checking method
CN108267121A (en) The vision navigation method and system of more equipment under a kind of variable scene
CN110827353B (en) Robot positioning method based on monocular camera assistance
CN112833892B (en) Semantic mapping method based on track alignment
CN114332360A (en) Collaborative three-dimensional mapping method and system
CN111812978B (en) Cooperative SLAM method and system for multiple unmanned aerial vehicles
Liu A robust and efficient lidar-inertial-visual fused simultaneous localization and mapping system with loop closure
Zhang et al. ST-VIO: Visual-inertial odometry combined with image segmentation and tracking
CN116295412A (en) Depth camera-based indoor mobile robot dense map building and autonomous navigation integrated method
CN113674355A (en) Target identification and positioning method based on camera and laser radar
CN107741233A (en) A kind of construction method of the outdoor map of three-dimensional
CN117406771B (en) Efficient autonomous exploration method, system and equipment based on four-rotor unmanned aerial vehicle

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant