CN112613372B - Outdoor environment visual inertia SLAM method and device

Info

Publication number: CN112613372B
Application number: CN202011489168.5A
Authority: CN (China)
Prior art keywords: image, sky, slam, gradient, optimized
Legal status: Active (application granted)
Other versions: CN112613372A (Chinese)
Inventors: 陶文宇, 戴志强, 朱祥维, 李芳, 陈伟翔, 陈彦莛
Assignee (original and current): Sun Yat-sen University
Application filed by Sun Yat-sen University


Classifications

    • G06V20/10: Scenes; scene-specific elements; terrestrial scenes
    • G01C21/165: Navigation by dead reckoning aboard the navigated object, integrating acceleration or speed (inertial navigation), combined with non-inertial navigation instruments
    • G06F18/24: Pattern recognition; classification techniques
    • G06V10/267: Image preprocessing; segmentation of patterns in the image field by performing operations on regions (e.g. growing, shrinking or watersheds)
    • G06V10/44: Local feature extraction by analysis of parts of the pattern (edges, contours, corners); connectivity analysis
Abstract

The invention discloses an outdoor environment visual inertia SLAM method and device, wherein the method comprises the following steps: extracting image gradient information from an original image to obtain a gradient image, and performing threshold segmentation on the gradient image to obtain an image to be optimized; calculating boundary data to be integrated from the image to be optimized; integrating the boundary data to be integrated with a polynomial fitting algorithm to obtain final boundary data; and dividing the original image according to the final boundary data to obtain a sky area image, performing SLAM initialization on the inertial navigation data and the non-sky area image to obtain a SLAM framework, and performing SLAM mapping and positioning according to the SLAM framework. By correcting the boundary line with a polynomial fitting algorithm, the embodiment of the invention effectively reduces the computational load and runtime, obtains an accurate sky-region segmentation quickly, and thereby effectively improves the accuracy of positioning and trajectory map construction.

Description

Outdoor environment visual inertia SLAM method and device
Technical Field
The invention relates to the technical field of image processing, in particular to an outdoor environment visual inertia SLAM method and device.
Background
SLAM (Simultaneous Localization and Mapping) is a technique for building a model of the environment and estimating the sensor's own motion at the same time, using one or more onboard sensors and no prior information about the environment. As an important method for localization and environment modeling, it is widely applied in fields such as autonomous driving. A basic visual SLAM pipeline is structurally simple, but suffers from inaccurate positioning and trajectory drift in many scenes. Combining an inertial measurement unit with a camera into a visual-inertial unit is an increasingly popular framework. Existing outdoor environment visual inertia SLAM methods adopt mature SLAM frameworks such as VINS-Mono and ORB-SLAM3, and can obtain satisfactory experimental results outdoors. However, these methods waste a large amount of computing resources on the uninformative sky area, so the efficiency of positioning and trajectory mapping is poor.
Disclosure of Invention
The invention provides an outdoor environment visual inertia SLAM method and device to solve the technical problem that existing outdoor visual inertia SLAM methods waste a large amount of computing resources on the invalid sky area, which makes positioning and trajectory mapping inefficient.
A first embodiment of the present invention provides an outdoor environment visual inertial SLAM method, including:
obtaining a gradient image by extracting image gradient information of an original image, and performing threshold segmentation processing on the gradient image to obtain an image to be optimized;
defining a sky boundary function according to the parameters of the image to be optimized, and calculating the sky boundary function according to a gradient optimization energy function to obtain boundary data to be integrated;
adopting a polynomial fitting algorithm to integrate the boundary data to be integrated to obtain final boundary data;
and dividing the original image according to the final boundary data to obtain a sky area image, performing SLAM initialization on the inertial navigation data and the non-sky area image to obtain a SLAM framework, and performing SLAM mapping and positioning according to the SLAM framework.
Further, the step of obtaining a gradient image by extracting image gradient information of an original image, and the step of performing threshold segmentation processing on the gradient image to obtain an image to be optimized specifically includes:
calculating gradient information of the original image based on image space domain convolution by adopting a Sobel operator, and drawing a gradient image according to the gradient information;
and dividing the sky area and the non-sky area of the gradient image to obtain an image to be optimized.
Further, the parameters of the image to be optimized include, but are not limited to, the width of the image and the height of the image; defining a sky boundary function according to the parameters of the image to be optimized, and calculating the sky boundary function according to a gradient optimization energy function to obtain boundary data to be integrated, wherein the boundary data to be integrated is specifically as follows:
respectively calculating a covariance matrix of a sky area and a covariance matrix of a non-sky area in the image to be optimized according to the number of pixels of the sky area and the number of pixels of the non-sky area in the image to be optimized, and defining a gradient optimization energy function according to the covariance matrix of the sky area and the covariance matrix of the non-sky area;
defining a sky boundary function according to the width of the image and the height of the image;
and calculating the sky boundary function according to the gradient optimization energy function to obtain boundary data to be integrated.
Further, the step of performing SLAM initialization on the inertial navigation data and the segmented image to obtain a SLAM framework, and performing SLAM mapping and positioning according to the SLAM framework, is specifically:
initializing the segmented image and the inertial navigation data with visual-inertial odometry, back-end optimization and loop-closure detection to obtain a SLAM framework, and performing SLAM mapping and positioning according to the SLAM framework.
A second embodiment of the present invention provides an outdoor environment visual inertial SLAM apparatus, comprising:
the threshold segmentation module is used for obtaining a gradient image by extracting image gradient information of an original image, and performing threshold segmentation processing on the gradient image to obtain an image to be optimized;
the computing module is used for defining a sky boundary function according to the parameters of the image to be optimized, and computing the sky boundary function according to the gradient optimization energy function to obtain boundary data to be integrated;
the integration module is used for integrating the boundary data to be integrated by adopting a polynomial fitting algorithm to obtain final boundary data;
and the composition positioning module is used for dividing the original image according to the final boundary data to obtain a sky area image, performing SLAM initialization on the inertial navigation data and the non-sky area image to obtain a SLAM framework, and performing SLAM mapping and positioning according to the SLAM framework.
Further, the threshold segmentation module is specifically configured to:
calculating gradient information of the original image based on image space domain convolution by adopting a Sobel operator, and drawing a gradient image according to the gradient information;
and dividing the sky area and the non-sky area of the gradient image to obtain an image to be optimized.
Further, the parameters of the image to be optimized include, but are not limited to, the width of the image and the height of the image; the computing module is specifically configured to:
respectively calculating a covariance matrix of a sky area and a covariance matrix of a non-sky area in the image to be optimized according to the number of pixels of the sky area and the number of pixels of the non-sky area in the image to be optimized, and defining a gradient optimization energy function according to the covariance matrix of the sky area and the covariance matrix of the non-sky area;
defining a sky boundary function according to the width of the image and the height of the image;
and calculating the sky boundary function according to the gradient optimization energy function to obtain boundary data to be integrated.
Further, the composition positioning module is specifically configured to:
initializing the segmented image and the inertial navigation data with visual-inertial odometry, back-end optimization and loop-closure detection to obtain a SLAM framework, and performing SLAM mapping and positioning according to the SLAM framework.
According to the embodiment of the invention, the boundary line is corrected with a polynomial fitting algorithm, which avoids spending a large amount of computing resources and time and yields a fast, accurate sky-region segmentation. After the sky area is segmented, the sky-removed segmented images and the inertial navigation data are used to initialize a SLAM framework, making the framework more reliable; in outdoor positioning and trajectory mapping this effectively avoids inaccurate feature matching and improves the accuracy of positioning and trajectory mapping.
Drawings
FIG. 1 is a schematic flow chart of an outdoor environment visual inertial SLAM method according to an embodiment of the present invention;
fig. 2 is a schematic structural diagram of an outdoor environment visual inertial SLAM device according to an embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present application will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are only some, but not all, of the embodiments of the present application. All other embodiments, which can be made by one of ordinary skill in the art without undue burden from the present disclosure, are within the scope of the present disclosure.
In the description of the present application, it should be understood that the terms "first," "second," and the like are used for descriptive purposes only and are not to be construed as indicating or implying a relative importance or an implicit indication of the number of technical features being indicated. Thus, a feature defining "a first" or "a second" may explicitly or implicitly include one or more such feature. In the description of the present application, unless otherwise indicated, the meaning of "a plurality" is two or more.
In the description of the present application, it should be noted that, unless explicitly specified and limited otherwise, the terms "mounted," "connected," and "connected" are to be construed broadly, and may be either fixedly connected, detachably connected, or integrally connected, for example; can be mechanically or electrically connected; can be directly connected or indirectly connected through an intermediate medium, and can be communication between two elements. The specific meaning of the terms in this application will be understood by those of ordinary skill in the art in a specific context.
Referring to fig. 1, a first embodiment of the present invention provides an outdoor environment visual inertial SLAM method, including:
s1, obtaining a gradient image by extracting image gradient information of an original image, and performing threshold segmentation processing on the gradient image to obtain an image to be optimized;
in the embodiment of the invention, the gradient image has gradient information, and the gradient information is used as one of the original information of the image and can be used for clearly representing the trend of the gray level change of the image, thereby providing important information for the next image processing. For example, in an original image captured by a SLAM monocular camera in an outdoor scene, there is a clear visual distinction between a sky area and a non-sky area, and based on this, a gradient image with gradient information can accurately reflect the distinction between the sky area and the non-sky area in most cases.
S2, defining a sky boundary function according to parameters of the image to be optimized, and calculating the sky boundary function according to the gradient optimization energy function to obtain boundary data to be integrated;
s3, integrating the boundary data to be integrated by adopting a polynomial fitting algorithm to obtain final boundary data;
s4, dividing the original image according to the final boundary data to obtain a sky area image, carrying out SLAM initialization on the inertial navigation data and the non-sky area image to obtain an SLAM frame, and carrying out SLAM composition positioning according to the SLAM frame.
According to the embodiment of the invention, the boundary line is corrected with a polynomial fitting algorithm, which avoids spending a large amount of computing resources and time and yields a fast, accurate sky-region segmentation. After the sky area is segmented, the sky-removed segmented images and the inertial navigation data are used to initialize a SLAM framework, making the framework more reliable; in outdoor positioning and trajectory mapping this effectively avoids inaccurate feature matching and improves the accuracy of positioning and trajectory mapping.
As a specific implementation manner of the embodiment of the invention, the gradient image is obtained by extracting the image gradient information of the original image, and the image to be optimized is obtained by performing threshold segmentation processing on the gradient image, which is specifically as follows:
calculating gradient information of an original image based on image space domain convolution by adopting a Sobel operator, and drawing a gradient image according to the gradient information;
and dividing the sky area and the non-sky area of the gradient image to obtain an image to be optimized.
In a specific embodiment, the gradient image is processed with a threshold segmentation method: exploiting the pronounced gray-level difference between the sky area and areas of urban buildings and trees, a threshold is set and the gradient image is classified at the pixel level, so that fragmented misclassified patches are removed from the extracted sky area.
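The gradient-extraction and thresholding steps described above can be sketched in plain NumPy. The Sobel kernels are standard; the synthetic image, threshold value, and function names are illustrative assumptions, not taken from the patent:

```python
import numpy as np

def sobel_gradient(gray):
    """Gradient magnitude via Sobel spatial-domain convolution (valid region only)."""
    kx = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=float)
    ky = kx.T
    h, w = gray.shape
    gx = np.zeros((h - 2, w - 2))
    gy = np.zeros((h - 2, w - 2))
    for i in range(h - 2):
        for j in range(w - 2):
            patch = gray[i:i + 3, j:j + 3]
            gx[i, j] = np.sum(kx * patch)
            gy[i, j] = np.sum(ky * patch)
    return np.hypot(gx, gy)

def threshold_segment(grad, thresh):
    """Binary mask: low-gradient pixels are sky candidates (True)."""
    return grad < thresh

# Synthetic frame: flat "sky" on top, textured "ground" below.
rng = np.random.default_rng(0)
img = np.vstack([np.full((10, 20), 200.0),
                 rng.uniform(0, 255, (10, 20))])
grad = sobel_gradient(img)
mask = threshold_segment(grad, 50.0)
print(mask[:3].mean())  # top rows (flat sky interior) are all True -> 1.0
```

In practice the convolution loop would be replaced by a vectorized or library implementation; the sketch only illustrates the data flow from gradient image to the binary image to be optimized.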
As a specific implementation of the embodiment of the present invention, parameters of the image to be optimized include, but are not limited to, the width of the image and the height of the image; defining a sky boundary function according to parameters of an image to be optimized, and calculating the sky boundary function according to a gradient optimization energy function to obtain boundary data to be integrated, wherein the boundary data to be integrated is specifically as follows:
respectively calculating a covariance matrix of a sky area and a covariance matrix of a non-sky area in the image to be optimized according to the number of pixels of the sky area and the number of pixels of the non-sky area in the image to be optimized, and defining a gradient optimization energy function according to the covariance matrix of the sky area and the covariance matrix of the non-sky area;
defining a sky boundary function according to the width of the image and the height of the image;
and calculating the sky boundary function according to the gradient optimization energy function to obtain boundary data to be integrated.
In an embodiment of the present invention, the expression of the gradient optimization energy function is as follows:

J = 1 / ( γ·|Σ_s| + |Σ_g| + γ·λ_s² + λ_g² )

wherein Σ_s and Σ_g denote the covariance matrices of the sky region and the non-sky region computed from the RGB values, γ is a parameter weighting sky-region uniformity, λ_s and λ_g are the largest eigenvalues corresponding to the two matrices, and |·| denotes the determinant. Σ_s and Σ_g are defined as follows:

Σ_s = (1/N_s) Σ_{(x,y)∈sky} (I(x,y) − μ_s)(I(x,y) − μ_s)^T
Σ_g = (1/N_g) Σ_{(x,y)∈ground} (I(x,y) − μ_g)(I(x,y) − μ_g)^T

where I(x,y) is the RGB vector of pixel (x,y), μ_s and μ_g are the mean RGB vectors of the two regions, and N_s and N_g are the numbers of pixels in the sky region and the non-sky region, respectively.
In the embodiment of the invention, the gradient optimization energy function can effectively optimize the segmentation result between the sky area and the non-sky area.
In a specific embodiment, a sky boundary function boundary (x) is defined:
1≤border(x)≤H(1≤x≤W)
where W and H represent the width and height of the gradient image, respectively. The sky and non-sky regions can be calculated using the following formula:
sky={(x,y)|1≤x≤W,1≤y≤border(x)}
ground={(x,y)|1≤x≤W,border(x)≤y≤H}
and calculating an optimal value of the sky area and an optimal value of the non-sky area according to the gradient optimization energy function to obtain boundary data to be integrated.
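The search over candidate borders can be sketched by evaluating the energy of each candidate split. The exact energy form used here (determinants and largest eigenvalues of the two RGB covariance matrices, with a uniformity weight γ) is an assumption consistent with the description, and all names and the synthetic image are illustrative:

```python
import numpy as np

def region_cov(pixels):
    """Covariance matrix of an (N, 3) array of RGB pixels."""
    mu = pixels.mean(axis=0)
    d = pixels - mu
    return d.T @ d / len(pixels)

def energy(rgb, border, gamma=2.0):
    """Energy of a candidate sky border; border[x] is the first non-sky row in column x.
    Larger values mean both regions are internally more homogeneous (form assumed)."""
    h, w, _ = rgb.shape
    rows = np.arange(h)[:, None]
    sky_mask = rows < np.asarray(border)[None, :]   # rows above the border = sky
    cs = region_cov(rgb[sky_mask])
    cg = region_cov(rgb[~sky_mask])
    ls = np.linalg.eigvalsh(cs)[-1]                 # largest eigenvalue, sky
    lg = np.linalg.eigvalsh(cg)[-1]                 # largest eigenvalue, ground
    return 1.0 / (gamma * abs(np.linalg.det(cs)) + abs(np.linalg.det(cg))
                  + gamma * ls**2 + lg**2 + 1e-12)

# Synthetic frame: bright homogeneous sky over darker, noisier ground.
rng = np.random.default_rng(1)
img = np.vstack([180.0 + rng.normal(0, 1, (8, 12, 3)),
                 60.0 + rng.normal(0, 5, (8, 12, 3))])
good = energy(img, np.full(12, 8))   # border at the true sky/ground split
bad = energy(img, np.full(12, 4))    # border too high: "ground" mixes in sky
print(good > bad)
```

A border that mixes the two populations inflates a region covariance (between-group variance), so the correct split scores highest; this is the behavior the gradient optimization energy function exploits.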
After the boundary data to be integrated is obtained through calculation, a polynomial fitting method is introduced to further correct the boundary line of the sky area.
Specifically, given data points p_i(x_i, y_i), i = 1, 2, …, m, the approximating curve y = f(x) is required to minimize the total deviation, where the deviation of the curve at point p_i is δ_i = f(x_i) − y_i.
The general form of the polynomial is:
y = p_0·x^n + p_1·x^(n−1) + p_2·x^(n−2) + … + p_n
The difference between the fitting function and the true values is measured by the loss:
loss = Σ_{i=1}^{m} (f(x_i) − y_i)²
It will be appreciated that the process of polynomial fitting is the process of finding the coefficients that minimize this loss.
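The least-squares correction of the boundary line can be sketched with NumPy's polynomial fitting. The border shape, noise level, and degree are illustrative assumptions:

```python
import numpy as np

# Noisy per-column sky-border estimates (synthetic): the true border is a
# gentle quadratic, perturbed by column-wise segmentation noise.
w = 120
x = np.arange(w)
true_border = 30 + 0.002 * (x - 60) ** 2
noisy = true_border + np.random.default_rng(2).normal(0, 3, w)

# Least-squares polynomial fit (degree chosen here only for illustration).
coeffs = np.polyfit(x, noisy, deg=2)
fitted = np.polyval(coeffs, x)

rms_before = np.sqrt(np.mean((noisy - true_border) ** 2))
rms_after = np.sqrt(np.mean((fitted - true_border) ** 2))
print(rms_after < rms_before)  # fitting suppresses the column-wise noise
```

Because the fit is solved in closed form over a handful of coefficients, this correction step is cheap relative to re-optimizing the boundary pixel by pixel, which matches the patent's claim of reduced computation.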
As a specific implementation of the embodiment of the present invention, the step of performing SLAM initialization on the inertial navigation data and the segmented image to obtain a SLAM framework, and performing SLAM mapping and positioning according to the SLAM framework, is specifically:
initializing the segmented image and the inertial navigation data with visual-inertial odometry, back-end optimization and loop-closure detection to obtain a SLAM framework, and performing SLAM mapping and positioning according to the SLAM framework.
In an embodiment of the invention, a visual-inertial odometer is employed, and the visual part and the inertial navigation part both start from the initialization stage. The visual part extracts feature corner points from the image with a feature-point method; the inertial part uses pre-integration to reduce the amount of computation; the back-end optimization part uses BA (Bundle Adjustment) to jointly refine the camera poses and the spatial positions of the feature points; the loop-closure detection part uses a bag-of-words scheme in which each element of the dictionary is regarded as a set of adjacent feature points, improving both the success rate and the speed of image comparison. By performing SLAM mapping and positioning with the constructed SLAM framework, the embodiment of the invention effectively reduces wasted computing resources and improves the accuracy of positioning and trajectory mapping.
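How the segmented sky area saves front-end work can be sketched as a mask applied before keypoint candidates are selected. The corner-response map and threshold below are stand-ins for whatever feature-point method the front end uses; this is an illustration, not the patent's implementation:

```python
import numpy as np

def sky_mask(h, w, border):
    """Boolean mask (True = sky) built from the fitted border, one value per column."""
    rows = np.arange(h)[:, None]
    return rows < np.asarray(border)[None, :]

def masked_candidates(score, mask, thresh):
    """Keypoint candidates: high-response pixels that lie outside the sky mask."""
    return np.argwhere((score > thresh) & ~mask)

h, w = 40, 60
border = np.full(w, 15)                      # fitted sky border (illustrative)
rng = np.random.default_rng(3)
score = rng.uniform(0, 1, (h, w))            # stand-in corner-response map
pts = masked_candidates(score, sky_mask(h, w, border), 0.9)
print((pts[:, 0] >= 15).all())               # no candidate falls in the sky rows
```

Since sky pixels carry almost no stable features, excluding them up front both removes a source of bad feature matches and shrinks the pixel set every later stage (tracking, BA, bag-of-words description) has to touch.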
The embodiment of the invention has the following beneficial effects:
According to the embodiment of the invention, the boundary line is corrected with a polynomial fitting algorithm, which avoids spending a large amount of computing resources and time and yields a fast, accurate sky-region segmentation. After the sky area is segmented, the sky-removed segmented images and the inertial navigation data are used to initialize a SLAM framework, making the framework more reliable; in outdoor positioning and trajectory mapping this effectively avoids inaccurate feature matching and improves the accuracy of positioning and trajectory mapping.
Referring to fig. 2, a second embodiment of the present invention provides an outdoor environment visual inertial SLAM device, comprising:
the threshold segmentation module 10 is used for obtaining a gradient image by extracting image gradient information of an original image, and performing threshold segmentation processing on the gradient image to obtain an image to be optimized;
in the embodiment of the invention, the gradient image has gradient information, and the gradient information is used as one of the original information of the image and can be used for clearly representing the trend of the gray level change of the image, thereby providing important information for the next image processing. For example, in an original image captured by a SLAM monocular camera in an outdoor scene, there is a clear visual distinction between a sky area and a non-sky area, and based on this, a gradient image with gradient information can accurately reflect the distinction between the sky area and the non-sky area in most cases.
The computing module 20 is configured to define a sky boundary function according to parameters of the image to be optimized, and calculate the sky boundary function according to the gradient optimization energy function to obtain boundary data to be integrated;
the integration module 30 is configured to integrate the boundary data to be integrated by using a polynomial fitting algorithm to obtain final boundary data;
the composition positioning module 40 is configured to divide the original image according to the final boundary data to obtain a sky area image, initialize the inertial navigation data and the division to obtain a SLAM frame, and perform SLAM composition positioning according to the SLAM frame.
According to the embodiment of the invention, the boundary line is corrected with a polynomial fitting algorithm, which avoids spending a large amount of computing resources and time and yields a fast, accurate sky-region segmentation. After the sky area is segmented, the sky-removed segmented images and the inertial navigation data are used to initialize a SLAM framework, making the framework more reliable; in outdoor positioning and trajectory mapping this effectively avoids inaccurate feature matching and improves the accuracy of positioning and trajectory mapping.
As a specific implementation of the embodiment of the present invention, the threshold segmentation module 10 is specifically configured to:
calculating gradient information of an original image based on image space domain convolution by adopting a Sobel operator, and drawing a gradient image according to the gradient information;
and dividing the sky area and the non-sky area of the gradient image to obtain an image to be optimized.
In a specific embodiment, the gradient image is processed with a threshold segmentation method: exploiting the pronounced gray-level difference between the sky area and areas of urban buildings and trees, a threshold is set and the gradient image is classified at the pixel level, so that fragmented misclassified patches are removed from the extracted sky area.
As a specific implementation of the embodiment of the present invention, parameters of the image to be optimized include, but are not limited to, the width of the image and the height of the image; the calculation module 20 is specifically configured to:
respectively calculating a covariance matrix of a sky area and a covariance matrix of a non-sky area in the image to be optimized according to the number of pixels of the sky area and the number of pixels of the non-sky area in the image to be optimized, and defining a gradient optimization energy function according to the covariance matrix of the sky area and the covariance matrix of the non-sky area;
defining a sky boundary function according to the width of the image and the height of the image;
and calculating the sky boundary function according to the gradient optimization energy function to obtain boundary data to be integrated.
In an embodiment of the present invention, the expression of the gradient optimization energy function is as follows:

J = 1 / ( γ·|Σ_s| + |Σ_g| + γ·λ_s² + λ_g² )

wherein Σ_s and Σ_g denote the covariance matrices of the sky region and the non-sky region computed from the RGB values, γ is a parameter weighting sky-region uniformity, λ_s and λ_g are the largest eigenvalues corresponding to the two matrices, and |·| denotes the determinant. Σ_s and Σ_g are defined as follows:

Σ_s = (1/N_s) Σ_{(x,y)∈sky} (I(x,y) − μ_s)(I(x,y) − μ_s)^T
Σ_g = (1/N_g) Σ_{(x,y)∈ground} (I(x,y) − μ_g)(I(x,y) − μ_g)^T

where I(x,y) is the RGB vector of pixel (x,y), μ_s and μ_g are the mean RGB vectors of the two regions, and N_s and N_g are the numbers of pixels in the sky region and the non-sky region, respectively.
In the embodiment of the invention, the gradient optimization energy function can effectively optimize the segmentation result between the sky area and the non-sky area.
In a specific embodiment, a sky boundary function boundary (x) is defined:
1≤border(x)≤H(1≤x≤W)
where W and H represent the width and height of the gradient image, respectively. The sky and non-sky regions can be calculated using the following formula:
sky={(x,y)|1≤x≤W,1≤y≤border(x)}
ground={(x,y)|1≤x≤W,border(x)≤y≤H}
and calculating an optimal value of the sky area and an optimal value of the non-sky area according to the gradient optimization energy function to obtain boundary data to be integrated.
After the boundary data to be integrated is obtained through calculation, a polynomial fitting method is introduced to further correct the boundary line of the sky area.
Specifically, given data points p_i(x_i, y_i), i = 1, 2, …, m, the approximating curve y = f(x) is required to minimize the total deviation, where the deviation of the curve at point p_i is δ_i = f(x_i) − y_i.
The general form of the polynomial is:
y = p_0·x^n + p_1·x^(n−1) + p_2·x^(n−2) + … + p_n
The difference between the fitting function and the true values is measured by the loss:
loss = Σ_{i=1}^{m} (f(x_i) − y_i)²
It will be appreciated that the process of polynomial fitting is the process of finding the coefficients that minimize this loss.
As a specific implementation of the embodiment of the present invention, the composition positioning module 40 is specifically configured to:
initialize the segmented image and the inertial navigation data with visual-inertial odometry, back-end optimization and loop-closure detection to obtain a SLAM framework, and perform SLAM mapping and positioning according to the SLAM framework.
In an embodiment of the invention, a visual-inertial odometer is employed, and the visual part and the inertial navigation part both start from the initialization stage. The visual part extracts feature corner points from the image with a feature-point method; the inertial part uses pre-integration to reduce the amount of computation; the back-end optimization part uses BA (Bundle Adjustment) to jointly refine the camera poses and the spatial positions of the feature points; the loop-closure detection part uses a bag-of-words scheme in which each element of the dictionary is regarded as a set of adjacent feature points, improving both the success rate and the speed of image comparison. By performing SLAM mapping and positioning with the constructed SLAM framework, the embodiment of the invention effectively reduces wasted computing resources and improves the accuracy of positioning and trajectory mapping.
The embodiment of the invention has the following beneficial effects:
According to the embodiment of the invention, the boundary line is corrected with a polynomial fitting algorithm, which avoids spending a large amount of computing resources and time and yields a fast, accurate sky-region segmentation. After the sky area is segmented, the sky-removed segmented images and the inertial navigation data are used to initialize a SLAM framework, making the framework more reliable; in outdoor positioning and trajectory mapping this effectively avoids inaccurate feature matching and improves the accuracy of positioning and trajectory mapping.
The foregoing is a preferred embodiment of the present invention, and it should be noted that modifications and adaptations may be made by those skilled in the art without departing from the principles of the present invention; such modifications and adaptations are also to be regarded as falling within the scope of protection of the present invention.

Claims (6)

1. An outdoor environment visual inertial SLAM method, comprising:
obtaining a gradient image by extracting image gradient information of an original image, and performing threshold segmentation processing on the gradient image to obtain an image to be optimized;
defining a sky boundary function according to the parameters of the image to be optimized, and calculating the sky boundary function according to a gradient optimization energy function to obtain boundary data to be integrated; the parameters of the image to be optimized comprise the width of the image and the height of the image; the defining a sky boundary function according to the parameters of the image to be optimized and calculating the sky boundary function according to a gradient optimization energy function to obtain boundary data to be integrated specifically comprises: respectively calculating a covariance matrix of the sky area and a covariance matrix of the non-sky area in the image to be optimized according to the number of pixels of the sky area and the number of pixels of the non-sky area in the image to be optimized, and defining a gradient optimization energy function according to the covariance matrix of the sky area and the covariance matrix of the non-sky area; defining a sky boundary function according to the width of the image and the height of the image; and calculating the sky boundary function according to the gradient optimization energy function to obtain the boundary data to be integrated;
the expression of the gradient optimization energy function is as follows:
wherein, sigma S Sum sigma g Covariance matrices representing sky regions and non-sky regions, respectively, represented by RGB values, γ being a parameter of sky region uniformity,and->Corresponding to the two matrices, |·| represents the corresponding determinant, Σ S Sum sigma g The definition is as follows:
N s and N g The number of pixels representing the sky region and the non-sky region, respectively;
adopting a polynomial fitting algorithm to integrate the boundary data to be integrated to obtain final boundary data;
and dividing the original image according to the final boundary data to obtain a sky area image, carrying out SLAM initialization on the inertial navigation data and the sky area image to obtain a SLAM framework, and carrying out SLAM mapping and positioning according to the SLAM framework.
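As an illustrative aside (not part of the claims), the gradient optimization energy criterion recited in claim 1 can be evaluated numerically as below, assuming it takes the usual form built from the two region covariance determinants and their largest eigenvalues; the function name region_energy, the weight gamma and the synthetic pixel data are all invented for illustration:

```python
# Energy of a candidate sky/ground split: uniform regions have small
# covariance determinants and eigenvalues, so a well-placed border
# maximizes J = 1 / (gamma*|Sigma_s| + |Sigma_g| + gamma*lam_s + lam_g).
import numpy as np

def region_energy(sky_pixels, ground_pixels, gamma=2.0):
    sigma_s = np.cov(np.asarray(sky_pixels, dtype=float).T)
    sigma_g = np.cov(np.asarray(ground_pixels, dtype=float).T)
    lam_s = np.linalg.eigvalsh(sigma_s)[-1]   # largest eigenvalue
    lam_g = np.linalg.eigvalsh(sigma_g)[-1]
    return 1.0 / (gamma * np.linalg.det(sigma_s) + np.linalg.det(sigma_g)
                  + gamma * lam_s + lam_g)

# Synthetic RGB samples: a bright, uniform sky and a darker, textured ground.
rng = np.random.default_rng(0)
sky = rng.normal([200, 210, 230], 2.0, size=(200, 3))
ground = rng.normal([80, 90, 60], 8.0, size=(200, 3))
mixed = np.vstack([sky[:100], ground[:100]])  # a badly placed border mixes both
assert region_energy(sky, ground) > region_energy(mixed, ground)
```

A mixed "sky" region has a large covariance (its pixels form two clusters), which drives the energy down; the optimizer therefore prefers the border that keeps each region homogeneous.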
2. The outdoor environment visual inertia SLAM method of claim 1, wherein the obtaining a gradient image by extracting image gradient information of an original image and performing threshold segmentation processing on the gradient image to obtain an image to be optimized specifically comprises:
calculating gradient information of the original image based on image spatial-domain convolution by adopting a Sobel operator, and generating a gradient image according to the gradient information;
and dividing the sky area and the non-sky area of the gradient image to obtain an image to be optimized.
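The Sobel gradient step recited in claim 2 can be sketched in plain numpy as follows (an illustrative implementation only; the name sobel_magnitude and the toy image are invented, and border pixels are simply left at zero):

```python
# Spatial-domain convolution with the Sobel kernels, then gradient magnitude.
import numpy as np

SOBEL_X = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=float)
SOBEL_Y = SOBEL_X.T

def sobel_magnitude(img):
    """Gradient magnitude of a 2-D grayscale image (borders left at 0)."""
    h, w = img.shape
    gx = np.zeros((h, w))
    gy = np.zeros((h, w))
    for i in range(1, h - 1):
        for j in range(1, w - 1):
            patch = img[i - 1:i + 2, j - 1:j + 2]
            gx[i, j] = np.sum(patch * SOBEL_X)
            gy[i, j] = np.sum(patch * SOBEL_Y)
    return np.hypot(gx, gy)

# A vertical step edge, like a building silhouette against bright sky:
# only the columns adjacent to the step respond.
img = np.zeros((5, 6))
img[:, 3:] = 10.0
mag = sobel_magnitude(img)
assert mag[2, 2] > 0 and mag[2, 3] > 0   # edge columns respond
assert mag[2, 1] == 0                    # flat region: zero gradient
```

Thresholding this magnitude map is what separates the low-gradient sky candidate region from the high-gradient ground region in the step above.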
3. The outdoor environment visual inertial SLAM method of claim 1, wherein the performing SLAM initialization on the inertial navigation data and the sky area image to obtain a SLAM framework and performing SLAM mapping and positioning according to the SLAM framework comprises the following steps:
initializing the sky area image and the inertial navigation data, applying visual-inertial odometry, back-end optimization and loop detection processing to obtain a SLAM framework, and carrying out SLAM mapping and positioning according to the SLAM framework.
4. An outdoor environment visual inertial SLAM device, comprising:
the threshold segmentation module is used for obtaining a gradient image by extracting image gradient information of an original image, and performing threshold segmentation processing on the gradient image to obtain an image to be optimized;
the computing module is used for defining a sky boundary function according to the parameters of the image to be optimized, and computing the sky boundary function according to the gradient optimization energy function to obtain boundary data to be integrated; the parameters of the image to be optimized comprise the width of the image and the height of the image; the computing module is specifically configured to: respectively calculating a covariance matrix of a sky area and a covariance matrix of a non-sky area in the image to be optimized according to the number of pixels of the sky area and the number of pixels of the non-sky area in the image to be optimized, and defining a gradient optimization energy function according to the covariance matrix of the sky area and the covariance matrix of the non-sky area; defining a sky boundary function according to the width of the image and the height of the image; calculating the sky boundary function according to the gradient optimization energy function to obtain boundary data to be integrated;
the expression of the gradient optimization energy function is as follows:
wherein, sigma S Sum sigma g Covariance matrices representing sky regions and non-sky regions, respectively, represented by RGB values, γ being a parameter of sky region uniformity,and->Corresponding to the two matrices, |·| represents the corresponding determinant, Σ S Sum sigma g The definition is as follows:
N s and N g The number of pixels representing the sky region and the non-sky region, respectively;
the integration module is used for integrating the boundary data to be integrated by adopting a polynomial fitting algorithm to obtain final boundary data;
and the composition positioning module is used for dividing the original image according to the final boundary data to obtain a sky area image, carrying out SLAM initialization on the inertial navigation data and the sky area image to obtain an SLAM framework, and carrying out SLAM composition positioning according to the SLAM framework.
5. The outdoor environment visual inertial SLAM device of claim 4, wherein the threshold segmentation module is specifically configured to:
calculating gradient information of the original image based on image space domain convolution by adopting a Sobel operator, and drawing a gradient image according to the gradient information;
and dividing the sky area and the non-sky area of the gradient image to obtain an image to be optimized.
6. The outdoor environment visual inertial SLAM device of claim 4, wherein the composition positioning module is specifically configured to:
initialize the sky area image and the inertial navigation data, apply visual-inertial odometry, back-end optimization and loop detection processing to obtain a SLAM framework, and carry out SLAM mapping and positioning according to the SLAM framework.
CN202011489168.5A 2020-12-16 2020-12-16 Outdoor environment visual inertia SLAM method and device Active CN112613372B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011489168.5A CN112613372B (en) 2020-12-16 2020-12-16 Outdoor environment visual inertia SLAM method and device


Publications (2)

Publication Number Publication Date
CN112613372A CN112613372A (en) 2021-04-06
CN112613372B true CN112613372B (en) 2024-04-02

Family

ID=75239960

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011489168.5A Active CN112613372B (en) 2020-12-16 2020-12-16 Outdoor environment visual inertia SLAM method and device

Country Status (1)

Country Link
CN (1) CN112613372B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113177971A (en) * 2021-05-07 2021-07-27 中德(珠海)人工智能研究院有限公司 Visual tracking method and device, computer equipment and storage medium

Citations (3)

Publication number Priority date Publication date Assignee Title
CN105869174A (en) * 2016-04-20 2016-08-17 中国航空工业集团公司洛阳电光设备研究所 Sky scene image segmentation method
CN110097584A (en) * 2019-03-18 2019-08-06 国网浙江省电力有限公司信息通信分公司 The method for registering images of combining target detection and semantic segmentation
CN111210434A (en) * 2019-12-19 2020-05-29 上海艾麒信息科技有限公司 Image replacement method and system based on sky identification

Family Cites Families (2)

Publication number Priority date Publication date Assignee Title
DE602007012270D1 (en) * 2007-11-16 2011-03-10 Honda Res Inst Europe Gmbh Method and apparatus for continuous object-background segmentation in images from dynamic visual scenes
US8311285B2 (en) * 2009-06-30 2012-11-13 Mitsubishi Electric Research Laboratories, Inc. Method and system for localizing in urban environments from omni-direction skyline images

Patent Citations (3)

Publication number Priority date Publication date Assignee Title
CN105869174A (en) * 2016-04-20 2016-08-17 中国航空工业集团公司洛阳电光设备研究所 Sky scene image segmentation method
CN110097584A (en) * 2019-03-18 2019-08-06 国网浙江省电力有限公司信息通信分公司 The method for registering images of combining target detection and semantic segmentation
CN111210434A (en) * 2019-12-19 2020-05-29 上海艾麒信息科技有限公司 Image replacement method and system based on sky identification

Non-Patent Citations (1)

Title
Research on Wear Particle Image Segmentation Method Based on GrabCut; Wang Lianjun; Wang Jingqiu; Machine Building & Automation, No. 02, pp. 133-136 *

Also Published As

Publication number Publication date
CN112613372A (en) 2021-04-06

Similar Documents

Publication Publication Date Title
CN115439424B (en) Intelligent detection method for aerial video images of unmanned aerial vehicle
CN108229475B (en) Vehicle tracking method, system, computer device and readable storage medium
CN109376641B (en) Moving vehicle detection method based on unmanned aerial vehicle aerial video
US11367195B2 (en) Image segmentation method, image segmentation apparatus, image segmentation device
CN110930411B (en) Human body segmentation method and system based on depth camera
CN109214254B (en) Method and device for determining displacement of robot
CN111738033B (en) Vehicle driving information determination method and device based on plane segmentation and vehicle-mounted terminal
CN111046843A (en) Monocular distance measurement method under intelligent driving environment
CN110245600B (en) Unmanned aerial vehicle road detection method for self-adaptive initial quick stroke width
CN110570435A (en) method and device for carrying out damage segmentation on vehicle damage image
CN103841298A (en) Video image stabilization method based on color constant and geometry invariant features
CN105606123A (en) Method for automatic correction of digital ground elevation model for low-altitude aerial photogrammetry
CN108021857B (en) Building detection method based on unmanned aerial vehicle aerial image sequence depth recovery
US9232128B2 (en) Image capture position and image capture direction estimation device, image capture device, image capture position and image capture direction estimation method and program
CN114419165B (en) Camera external parameter correction method, camera external parameter correction device, electronic equipment and storage medium
CN112613372B (en) Outdoor environment visual inertia SLAM method and device
CN102324043B (en) Image matching method based on DCT (Discrete Cosine Transformation) through feature description operator and optimization space quantization
CN113096016A (en) Low-altitude aerial image splicing method and system
CN111488762A (en) Lane-level positioning method and device and positioning equipment
CN114998629A (en) Satellite map and aerial image template matching method and unmanned aerial vehicle positioning method
CN116188535A (en) Video tracking method, device, equipment and storage medium based on optical flow estimation
CN115984592A (en) Point-line fusion feature matching method based on SuperPoint + SuperGlue
Zhang et al. An enhanced multi-view vertical line locus matching algorithm of object space ground primitives based on positioning consistency for aerial and space images
CN112330660B (en) Sperm tail detection method and system based on neural network
CN111860084B (en) Image feature matching and positioning method and device and positioning system

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant