CN110060202B - Monocular SLAM algorithm initialization method and system - Google Patents

Monocular SLAM algorithm initialization method and system

Info

Publication number
CN110060202B
CN110060202B (application CN201910319610.0A)
Authority
CN
China
Prior art keywords
pixel points
image
slam algorithm
monocular
overlapping area
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910319610.0A
Other languages
Chinese (zh)
Other versions
CN110060202A (en)
Inventor
杨文龙
P·尼古拉斯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ecarx Hubei Tech Co Ltd
Original Assignee
Hubei Ecarx Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hubei Ecarx Technology Co Ltd filed Critical Hubei Ecarx Technology Co Ltd
Priority to CN201910319610.0A priority Critical patent/CN110060202B/en
Publication of CN110060202A publication Critical patent/CN110060202A/en
Application granted granted Critical
Publication of CN110060202B publication Critical patent/CN110060202B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical


Classifications

    • G06T3/047
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • G06T7/73Determining position or orientation of objects or cameras using feature-based methods
    • G06T7/75Determining position or orientation of objects or cameras using feature-based methods involving models

Abstract

The invention provides an initialization method and system for a monocular SLAM algorithm. The method comprises: selecting pixel points in an image overlapping area from the acquired images; converting the selected pixel points into pixel points corresponding to a binocular SLAM algorithm; calculating, based on the binocular SLAM algorithm, the three-dimensional coordinates of the converted pixel points in the image overlapping area; converting those three-dimensional coordinates into coordinate information in a camera coordinate system; and initializing the monocular SLAM algorithm according to that coordinate information. The embodiments of the invention thus use the binocular SLAM algorithm to assist the monocular SLAM algorithm in initializing, so that the monocular system corresponding to the monocular SLAM algorithm initializes faster and more reliably, and the stability and calculation precision of the monocular SLAM algorithm are improved.

Description

Monocular SLAM algorithm initialization method and system
Technical Field
The invention relates to the technical field of automobiles, in particular to an initialization method and system of a monocular SLAM algorithm.
Background
In existing automatic parking assist (APA) solutions, 4 low-cost fisheye cameras are generally fitted to implement an around view monitor (AVM) function or to provide the visual display associated with the APA. In the prior art, while the APA searches for a parking space it generally needs to detect the parking space and static obstacles such as traffic cones, wheel stops, poles and fence rails, and even some less common obstacles such as bicycles and chairs. Detecting such static obstacles with the machine-learning methods commonly used today requires knowing the obstacle types in advance and a large amount of training data, so the universality and flexibility of parking-space search are poor.
In addition, the latest research focus is to use point clouds for static obstacle detection, but this generally requires a good global shutter camera or a binocular camera, and it cannot be realized well with the fisheye cameras already fitted in large numbers to cars. Existing visual SLAM (Simultaneous Localization and Mapping) point cloud generation based on a monocular fisheye camera mainly suffers from poor stability, low precision, and difficulty in obtaining the real scale of static obstacles. The poor stability mainly shows up as frequent initialization failures during point cloud generation, large deviations and heavy noise in the generated point cloud, and large scale deviations in the point cloud that can be generated. The precision and real-scale problems mainly show up in the point cloud generated by the monocular SLAM algorithm having no real scale, the scale being largely random and changing with the scene, the keyframe interval, and so on.
Disclosure of Invention
In view of the above, the present invention has been made to provide an initialization method and system for a monocular SLAM algorithm that overcome, or at least partially solve, the above-mentioned problems.
According to an aspect of the present invention, there is provided a method for initializing a monocular SLAM algorithm, including:
selecting pixel points in an image overlapping region from the collected images, wherein the image overlapping region is a content overlapping region of two images collected through different angles;
converting the selected pixel points into pixel points corresponding to a binocular SLAM algorithm, and calculating three-dimensional coordinates of the converted pixel points in an image overlapping area based on the binocular SLAM algorithm;
converting the three-dimensional coordinates of the pixel points into coordinate information of a camera coordinate system;
and initializing the monocular SLAM algorithm according to the coordinate information of the camera coordinate system.
Optionally, selecting a pixel point in an image overlapping region from the collected image includes:
acquiring a collected image, and inputting the acquired image into a preset camera model;
and after the image is processed by the camera model, selecting pixel points in an image overlapping area from the processed image according to a monocular SLAM algorithm.
Optionally, converting the selected pixel point into a pixel point corresponding to a binocular SLAM algorithm, including:
and converting the selected pixel points into pixel points corresponding to a binocular SLAM algorithm based on an image coordinate system.
Optionally, calculating the three-dimensional coordinates of the converted pixel points in the image overlapping area based on the binocular SLAM algorithm includes:
acquiring a base line distance between cameras corresponding to pre-measured acquired images;
and calculating the three-dimensional coordinates of the converted pixel points in the image overlapping area based on the binocular SLAM algorithm and according to the baseline distance between the cameras.
Optionally, after initializing the monocular SLAM algorithm according to the coordinate information of the camera coordinate system, the method further includes:
selecting pixel points in an image overlapping area from the collected images according to an initialized monocular SLAM algorithm, wherein the image overlapping area is a content overlapping area of two images collected from different angles;
converting the selected pixel points into pixel points corresponding to a binocular SLAM algorithm, and calculating three-dimensional coordinates of the converted pixel points in an image overlapping area based on the binocular SLAM algorithm;
converting the three-dimensional coordinates of the pixel points into coordinate information of a camera coordinate system, and calculating the pose change of a camera for collecting images from an angle based on the coordinate information of the camera coordinate system by using the initialized monocular SLAM algorithm;
calculating three-dimensional coordinates of pixel points in an image non-overlapping area according to the pose change, wherein the image non-overlapping area is an area where the contents of two images acquired through different angles do not overlap;
and generating a point cloud according to the three-dimensional coordinates of the pixel points in the non-overlapping area and the overlapping area of the image.
Optionally, calculating three-dimensional coordinates of pixel points in the non-overlapping region of the image according to the pose change includes:
calculating the displacement distance between any two different moments of a camera which collects images from an angle through an initialized monocular SLAM algorithm according to the pose change;
and calculating the three-dimensional coordinates of the pixel points in the non-overlapping area of the image according to the displacement distance by using the initialized monocular SLAM algorithm.
Optionally, the pose change of the camera comprises: a rotation matrix and/or a translation matrix of the camera.
According to another aspect of the present invention, there is also provided an initialization system of the monocular SLAM algorithm, comprising a monocular subsystem, a binocular subsystem, and a conversion subsystem, wherein,
the monocular subsystem selects pixel points in an image overlapping area from the collected images, wherein the image overlapping area is a content overlapping area of two images collected through different angles;
the conversion subsystem converts the selected pixel points into pixel points which are suitable for the binocular subsystem and correspond to a binocular SLAM algorithm;
the binocular subsystem calculates the three-dimensional coordinates of the converted pixel points in the image overlapping area based on the binocular SLAM algorithm;
the conversion subsystem is used for converting the three-dimensional coordinates of the pixel points into coordinate information of a camera coordinate system suitable for the monocular subsystem;
and the monocular subsystem initializes the monocular SLAM algorithm according to the coordinate information of the camera coordinate system.
According to yet another aspect of the present invention, there is also provided a computer storage medium having stored thereon computer program code which, when run on a computing device, causes the computing device to execute the method of initializing the monocular SLAM algorithm of any of the embodiments above.
In the embodiment of the invention, pixel points in the image overlapping area are selected from the acquired images and converted into pixel points corresponding to a binocular SLAM algorithm; the three-dimensional coordinates of the converted pixel points in the image overlapping area are calculated based on the binocular SLAM algorithm and converted into coordinate information in a camera coordinate system, and the monocular SLAM algorithm can then be initialized according to that coordinate information. In this way, the embodiment of the invention obtains comparatively accurate and stable coordinate information for the pixel points in the image overlapping area from the binocular SLAM algorithm and uses it to assist the monocular SLAM algorithm in initializing, so that the monocular system corresponding to the monocular SLAM algorithm initializes faster and more reliably, and the stability and calculation precision of the monocular SLAM algorithm are improved.
The foregoing description is only an overview of the technical solutions of the present invention, and the embodiments of the present invention are described below in order to make the technical means of the present invention more clearly understood and to make the above and other objects, features, and advantages of the present invention more clearly understandable.
The above and other objects, advantages and features of the present invention will become more apparent to those skilled in the art from the following detailed description of specific embodiments thereof, taken in conjunction with the accompanying drawings.
Drawings
Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for purposes of illustrating the preferred embodiments and are not to be construed as limiting the invention. Also, like reference numerals are used to refer to like parts throughout the drawings. In the drawings:
FIG. 1 shows a flow diagram of a method for initializing a monocular SLAM algorithm according to one embodiment of the present invention;
FIG. 2 shows a schematic view of a capture field of view of a fisheye camera on a vehicle, in accordance with one embodiment of the invention;
fig. 3 is a schematic structural diagram illustrating an initialization system of the monocular SLAM algorithm according to an embodiment of the present invention.
Detailed Description
Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
In order to solve the above technical problem, an embodiment of the present invention provides an initialization method for a monocular SLAM algorithm. Fig. 1 is a flowchart illustrating an initialization method of a monocular SLAM algorithm according to one embodiment of the present invention. Referring to fig. 1, the method includes at least steps S102 to S108.
Step S102, selecting pixel points in an image overlapping area from the collected images, wherein the image overlapping area is a content overlapping area of two images collected through different angles.
In this step, the pixel points in the image overlapping area serve as the activated points. When pixel points are selected in the image overlapping area, uniform point selection is preferably performed and the density of the selected points is made slightly higher, so as to meet the requirements of initializing the monocular SLAM algorithm.
And step S104, converting the selected pixel points into pixel points corresponding to a binocular SLAM algorithm, and calculating the three-dimensional coordinates of the converted pixel points in the image overlapping area based on the binocular SLAM algorithm.
In this step, the three-dimensional coordinates of the converted pixel points refer to three-dimensional coordinate information based on a world coordinate system.
And step S106, converting the three-dimensional coordinates of the pixel points into coordinate information of a camera coordinate system.
And step S108, initializing the monocular SLAM algorithm according to the coordinate information of the camera coordinate system.
In the embodiment of the invention, pixel points in the image overlapping area are selected from the acquired images and converted into pixel points corresponding to the binocular SLAM algorithm; their three-dimensional coordinates in the image overlapping area are calculated based on the binocular SLAM algorithm and converted into coordinate information in the camera coordinate system, and the monocular SLAM algorithm is then initialized according to that coordinate information. The binocular SLAM algorithm thus yields comparatively accurate and stable coordinate information for the pixel points in the image overlapping area, and this coordinate information assists the monocular SLAM algorithm in initializing, so that the monocular system corresponding to the monocular SLAM algorithm initializes faster and more reliably, and the stability and calculation precision of the monocular SLAM algorithm are improved.
Existing visual SLAM algorithms require rectified (undistorted) images, whose field of view (FOV) is small, so the overlapping area between two adjacent cameras is not large enough. A binocular visual SLAM algorithm, by contrast, can well solve the stability and scale-accuracy problems encountered by a monocular algorithm. Therefore, in the embodiment of the invention, at least two cameras are used as the two eyes of a binocular system, providing a sufficiently large overlapping field of view for the binocular visual SLAM algorithm; the at least two cameras may be located at arbitrary positions and angles on the vehicle.
Referring to step S102 above, in an embodiment of the present invention, the image overlapping area may be a content overlapping area in two images respectively captured by any two cameras installed at different positions of the vehicle. When selecting the pixel points in the image overlapping region from the collected images, the pixel points of any object in the image overlapping region can be selected from one or more frames of images collected by one of the cameras during initial work. Of course, the device for acquiring an image in the embodiment of the present invention is not limited to a camera, and may also be other image acquisition devices, which is not specifically limited in the embodiment of the present invention.
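The patent does not spell out how the overlapping area is determined. As a rough illustration only, the sketch below marks, for one camera, the pixels whose viewing rays would also fall inside the other camera's field of view; the equidistant fisheye model, the distant-scene simplification, and all parameter names are assumptions made here, not part of the patent.

```python
import numpy as np

def approx_overlap_mask(h, w, K_fish, R_a_to_b, fov_b_deg=180.0):
    """Coarse overlap mask for camera A: True where A's viewing ray would also
    fall inside camera B's field of view. Assumes an equidistant fisheye model
    for A and treats scene points as distant (the baseline is ignored)."""
    mask = np.zeros((h, w), dtype=bool)
    half_fov = np.deg2rad(fov_b_deg) / 2.0
    fx, fy, cx, cy = K_fish[0, 0], K_fish[1, 1], K_fish[0, 2], K_fish[1, 2]
    for v in range(h):
        for u in range(w):
            x, y = (u - cx) / fx, (v - cy) / fy
            theta = np.hypot(x, y)                  # equidistant model: theta = r / f
            phi = np.arctan2(y, x)
            ray_a = np.array([np.sin(theta) * np.cos(phi),
                              np.sin(theta) * np.sin(phi),
                              np.cos(theta)])       # unit ray in camera A's frame
            ray_b = R_a_to_b @ ray_a                # same ray expressed in camera B's frame
            angle_to_b_axis = np.arccos(np.clip(ray_b[2], -1.0, 1.0))
            mask[v, u] = angle_to_b_axis < half_fov
    return mask
```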
To balance the low cost of the cameras against a wide viewing angle, the cameras arranged at different positions and angles of the vehicle may be fisheye cameras, which capture images over a wide angular range.
For example, referring to fig. 2, the acquisition field of view of each fisheye camera mounted on the automobile is represented by a semicircle; the automobile in the embodiment shown in fig. 2 carries 4 fisheye cameras, located at the front, rear, left and right of the vehicle body. Assuming the two adjacent fisheye cameras are a front fisheye camera and a side fisheye camera, then when a plurality of pixel points of any object are selected from the image corresponding to the side fisheye camera, uniform point selection needs to be performed in the image overlapping area and the image non-overlapping area respectively, with a slightly higher density of selected points in the image overlapping area, as in the sketch below.
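A minimal sketch of this uniform, denser point selection, assuming an overlap mask such as the one approximated above; the grid strides and the function name are illustrative choices, not values given in the patent.

```python
import numpy as np

def select_points(image_gray, overlap_mask, stride_overlap=4, stride_other=8):
    """Pick candidate pixels on a regular grid, with a denser grid inside the
    image overlapping area (smaller stride) than outside it.
    image_gray   : HxW grayscale image
    overlap_mask : HxW boolean mask, True inside the image overlapping area
    Returns an (N, 2) array of (row, col) pixel coordinates."""
    h, w = image_gray.shape
    points = []
    for r in range(0, h, stride_other):             # sparser grid outside the overlap
        for c in range(0, w, stride_other):
            if not overlap_mask[r, c]:
                points.append((r, c))
    for r in range(0, h, stride_overlap):            # denser grid inside the overlap
        for c in range(0, w, stride_overlap):
            if overlap_mask[r, c]:
                points.append((r, c))
    return np.array(points)
```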
Referring to step S102, in an embodiment of the present invention, in order to use the image acquired by the camera directly when selecting the pixel points in the image overlapping area, the acquired image may be input into a preset camera model; after the camera model has processed the image, the pixel points in the image overlapping area are selected from the processed image according to the monocular SLAM algorithm.
In an embodiment of the invention, the images acquired by the fisheye camera were processed with both a monocular pinhole camera model and a fisheye camera model (omnidirectional camera model). Comparing the effective pixel points in the two processed images shows that the fisheye camera model gives a larger coverage of activated pixels and a greater number of them. The fisheye camera model is therefore preferred in the embodiment of the invention: distortion correction of the fisheye image acquired by the fisheye camera is not required (that is, the fisheye image does not need to be converted into a planar image), the content of the fisheye image can be applied directly to the visual SLAM algorithm, and the consistency and quality of the final point cloud are improved. The fisheye camera model also enlarges the field of view and improves matching precision.
If the fisheye camera model is selected as the preset camera model, then in step S102 the image acquired by the fisheye camera is used directly and is not rectified into a non-fisheye image, which would reduce its field of view. In the embodiment of the invention, the two images acquired by the two fisheye cameras are obtained first and each is input into the fisheye camera model; after the model has processed them, the image corresponding to either fisheye camera is selected from the processed images, and a plurality of pixel points of any object are selected from that image.
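For illustration, the sketch below projects a 3D point with the equidistant fisheye model (r = f * theta), which keeps pixels valid even far off the optical axis. The patent only speaks of "a fisheye (omnidirectional) camera model", so this particular model and the parameter names are assumptions.

```python
import numpy as np

def project_equidistant(point_cam, fx, fy, cx, cy):
    """Project a 3D point given in camera coordinates with an equidistant
    fisheye model, r = f * theta. Unlike a pinhole model, theta may approach
    or exceed 90 degrees, which is what preserves the wide fisheye FOV."""
    X, Y, Z = point_cam
    theta = np.arctan2(np.hypot(X, Y), Z)    # angle between the ray and the optical axis
    phi = np.arctan2(Y, X)                   # azimuth around the optical axis
    u = fx * theta * np.cos(phi) + cx
    v = fy * theta * np.sin(phi) + cy
    return u, v
```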
In general, two adjacent fisheye cameras mounted on a vehicle are far apart and oriented differently, so the angles at which they acquire images differ. For example, a front fisheye camera and an adjacent side fisheye camera of a car are far apart and point respectively to the front and to the left/right side. When selecting the image corresponding to one of the fisheye cameras, which camera's image is the more appropriate choice has to be decided according to the actual application scene.
For example, in an automatic parking scene, the driver can see the scene in front of the vehicle directly but cannot see the scene to the side of the vehicle. The field of view of the side fisheye camera can therefore be selected as the target area, that is, the image corresponding to the side fisheye camera is selected, which makes it easier to detect static obstacles beside the vehicle while looking for a parking space.
Referring to step S104, in an embodiment of the present invention, in the process of converting the pixel point located in the image overlapping area in the selected pixel point into the pixel point corresponding to the binocular SLAM algorithm, the pixel point may be converted by combining the image coordinate system, and a specific conversion process may be implemented in the prior art, which is not described herein again.
In another embodiment of the present invention, in the process of calculating the three-dimensional coordinates of the converted pixel points in the image overlapping area based on the binocular SLAM algorithm, the pre-measured baseline distance between the cameras corresponding to the acquired images may be obtained, and then the binocular SLAM algorithm is used to calculate the three-dimensional coordinates of the converted pixel points in the image overlapping area according to the baseline distance between the cameras. The three-dimensional coordinate information in this embodiment further includes depth information of the pixel points.
In this embodiment, if the fisheye cameras installed in front of and on the side of the vehicle are used to form the binocular, the base line distance between the fisheye cameras in front of and on the side of the vehicle may be measured in advance, and then the binocular SLAM algorithm is used to calculate the three-dimensional coordinates of the converted pixel points in the image overlapping area according to the base line distance between the two fisheye cameras.
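As a simple illustration of how the baseline enters the computation, the sketch below recovers a 3D point from one matched pixel pair in a rectified stereo view with the classic relation Z = f * B / disparity. The binocular SLAM algorithm of the embodiment works on the converted pixel points and is not limited to this formula, so treat this only as an assumption-laden example.

```python
import numpy as np

def stereo_point_from_baseline(u_left, u_right, v, fx, fy, cx, cy, baseline):
    """Triangulate one matched pixel pair from a rectified stereo view.
    baseline is the pre-measured distance between the two cameras."""
    disparity = u_left - u_right
    if disparity <= 0:
        return None                        # no valid depth for this match
    Z = fx * baseline / disparity          # depth from the classic stereo relation
    X = (u_left - cx) * Z / fx
    Y = (v - cy) * Z / fy
    return np.array([X, Y, Z])             # 3D coordinates in the left-camera frame
```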
After initializing the monocular SLAM algorithm, the binocular SLAM algorithm can be used for assisting the initialized monocular SLAM algorithm to acquire mature points in the acquired image more quickly and accurately in real time, and point clouds of the object are generated according to three-dimensional coordinates of pixel points in a non-overlapping area and an overlapping area of the image. Therefore, the embodiment of the invention also provides a binocular point cloud generating method which mainly comprises the following steps 1 to 5.
Step 1, selecting pixel points in an image overlapping area from an acquired image according to an initialized monocular SLAM algorithm.
In this step, the image overlap region is a content overlap region of two images acquired through different angles. In addition, because the camera installed on the vehicle can collect images in real time, the method can select pixel points in the image overlapping area from the images collected in real time.
In this step, the pixel points selected from the acquired image are mature points: stable pixel points obtained by further screening after the initial point selection in the image, for which three-dimensional coordinate information can subsequently be calculated.
Step 2, converting the selected pixel points into pixel points corresponding to a binocular SLAM algorithm, and calculating the three-dimensional coordinates of the converted pixel points in the image overlapping area based on the binocular SLAM algorithm.
In this step, when the selected pixel points are converted into pixel points corresponding to the binocular SLAM algorithm, the conversion may also be performed based on the image coordinate system.
Step 3, converting the three-dimensional coordinates of the pixel points into coordinate information of a camera coordinate system, and calculating, with the initialized monocular SLAM algorithm and based on that coordinate information, the pose change of the camera that acquires images from one of the angles (one way to compute this pose change is sketched after step 5).
Step 4, calculating the three-dimensional coordinates of pixel points in the image non-overlapping area according to the pose change, wherein the image non-overlapping area is the area where the contents of two images acquired from different angles do not overlap.
Step 5, generating a point cloud according to the three-dimensional coordinates of the pixel points in the non-overlapping area and the overlapping area of the image.
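One common way to realize the pose-change computation of step 3 is to solve a Perspective-n-Point (PnP) problem between the metric 3D points obtained from the binocular step at one moment and their pixel observations at a later moment. The patent leaves the solver to the initialized monocular SLAM algorithm, so the OpenCV-based sketch below is only an illustrative assumption.

```python
import numpy as np
import cv2

def pose_change_from_points(pts3d_cam_t0, pts2d_t1, K):
    """Estimate the camera pose change between two moments.
    pts3d_cam_t0 : (N, 3) metric 3D points in the camera frame at time t0
    pts2d_t1     : (N, 2) pixel observations of the same points at time t1
    K            : (3, 3) intrinsic matrix of the (virtual pinhole) view
    Returns the rotation matrix R and translation vector t of the pose change."""
    ok, rvec, tvec = cv2.solvePnP(
        np.asarray(pts3d_cam_t0, dtype=np.float64),
        np.asarray(pts2d_t1, dtype=np.float64),
        np.asarray(K, dtype=np.float64), None)
    if not ok:
        raise RuntimeError("PnP pose estimation failed")
    R, _ = cv2.Rodrigues(rvec)      # rotation matrix of the pose change
    return R, tvec.reshape(3)       # translation; its norm is the displacement distance
```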
In the embodiment of the invention, the cameras arranged at different positions and angles of the vehicle form a binocular system. After the binocular SLAM algorithm of this system has calculated the three-dimensional coordinates (i.e., world-coordinate-system coordinates) of the pixel points in the image overlapping area, these coordinates can be converted into coordinate information in the camera coordinate system of one camera according to the standard transformation between coordinate systems (a sketch follows). Based on the converted camera-coordinate information, the monocular SLAM algorithm can then calculate the three-dimensional coordinates of the pixel points in the image non-overlapping area. The scheme of this embodiment therefore performs scale calculation from the pixel-point information in the image overlapping area of the acquired images and carries that scale over to the pixel points of the image non-overlapping area. Because the binocular SLAM algorithm needs no initialization, obtains more accurate scale information of an object, and is more stable, it can assist the monocular SLAM algorithm in acquiring mature points in the acquired images more quickly and accurately, thereby improving the stability of point cloud generation and the accuracy with which the real scale of the object is obtained.
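A minimal sketch of the world-to-camera conversion referred to above, assuming the extrinsics R, t of the chosen camera are known (X_c = R * X_w + t); the function and parameter names are illustrative.

```python
import numpy as np

def world_to_camera(points_world, R_wc, t_wc):
    """Convert 3D points from the world coordinate system into one camera's
    coordinate system: X_c = R_wc * X_w + t_wc.
    points_world : (N, 3) array of world coordinates
    R_wc, t_wc   : extrinsics of the chosen camera (world -> camera)."""
    pts = np.asarray(points_world, dtype=np.float64)
    return (R_wc @ pts.T).T + np.asarray(t_wc, dtype=np.float64).reshape(1, 3)
```

The converted points are what the monocular subsystem would consume for initialization and for the pose-change computation above.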
Referring to step 1, in an embodiment of the present invention the selection of pixel points in the image overlapping area may follow the above embodiment: after the image acquired by the camera is obtained, it is input into the preset camera model; once the camera model has processed the image, the pixel points in the image overlapping area are selected from the processed image according to the initialized monocular SLAM algorithm.
Referring to step 4 above, in an embodiment of the present invention, the three-dimensional coordinates of pixel points in the image non-overlapping area are calculated from the pose change of the camera as follows: first, the displacement distance of the camera between any two different moments is calculated by the initialized monocular SLAM algorithm according to the pose change of the camera; then the three-dimensional coordinates of the pixel points in the non-overlapping area of the image are calculated from the displacement distance by the initialized monocular SLAM algorithm.
In this embodiment, the camera with the changed pose refers to a camera corresponding to one image of the selected pixel point. Therefore, the camera corresponding to the displacement distance calculated between any two different moments is the camera corresponding to the image of the selected pixel point. For example, two cameras at any angle are respectively installed in front of and at the side of the vehicle, and in the above embodiment, pixel points of any object in the image overlapping area are selected from the images collected by the cameras at the side.
In this embodiment, the displacement distance of a camera between any two different time instants can also be understood as the baseline distance of the camera between any two different time instants.
In an embodiment of the invention, the pose change of the camera may include a rotation matrix and/or a translation matrix of the camera.
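Tying the last few paragraphs together, the sketch below takes the pose change (R, t) of the camera between two moments, uses the norm of t as the displacement (baseline) distance, and triangulates pixels of the image non-overlapping area observed at both moments. cv2.triangulatePoints and all parameter names are used here only as an illustrative assumption about how such a step could be realized.

```python
import numpy as np
import cv2

def triangulate_nonoverlap(R, t, K, pix_t0, pix_t1):
    """Triangulate pixels seen only by one camera, using that camera's own
    motion between two moments as the baseline.
    R, t   : pose change of the camera from time t0 to time t1
    K      : (3, 3) intrinsic matrix of the (virtual pinhole) view
    pix_t0 : (N, 2) pixel coordinates of the points at time t0
    pix_t1 : (N, 2) pixel coordinates of the same points at time t1
    Returns (N, 3) points in the t0 camera frame and the displacement distance."""
    t = np.asarray(t, dtype=np.float64).reshape(3, 1)
    displacement = float(np.linalg.norm(t))             # baseline between the two moments
    K = np.asarray(K, dtype=np.float64)
    P0 = K @ np.hstack([np.eye(3), np.zeros((3, 1))])   # camera at time t0
    P1 = K @ np.hstack([R, t])                           # camera at time t1
    pts4d = cv2.triangulatePoints(P0, P1,
                                  np.asarray(pix_t0, dtype=np.float64).T,
                                  np.asarray(pix_t1, dtype=np.float64).T)
    pts3d = (pts4d[:3] / pts4d[3]).T                     # dehomogenize to (N, 3)
    return pts3d, displacement
```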
Based on the same inventive concept, the embodiment of the invention also provides an initialization system of the monocular SLAM algorithm. Fig. 3 is a schematic structural diagram illustrating an initialization system of the monocular SLAM algorithm according to an embodiment of the present invention. Referring to fig. 3, the initialization system 300 for the monocular SLAM algorithm includes a monocular subsystem 320, a transformation subsystem 330, and a binocular subsystem 340.
Now, the functions of the components or devices of the initialization system 300 of the monocular SLAM algorithm and the connection relationship between the components will be described:
the monocular subsystem 320 selects pixel points in an image overlapping region from the acquired images, wherein the image overlapping region is a content overlapping region of two images acquired through different angles;
the conversion subsystem 330 is coupled with the monocular subsystem 320 and converts the selected pixel points into pixel points which are suitable for the binocular subsystem 340 and correspond to the binocular SLAM algorithm;
the binocular subsystem 340 is coupled with the conversion subsystem 330 and used for calculating the three-dimensional coordinates of the converted pixel points in the image overlapping area based on a binocular SLAM algorithm;
a conversion subsystem 330 for converting the three-dimensional coordinates of the pixel points into coordinate information of a camera coordinate system suitable for the monocular subsystem 320;
the monocular subsystem 320 initializes the monocular SLAM algorithm according to the coordinate information of the camera coordinate system.
In an embodiment of the present invention, the monocular subsystem 320 may further obtain a collected image, input the obtained image into a preset camera model, and select a pixel point in an image overlapping region according to a monocular SLAM algorithm from the processed image after the image is processed by the camera model.
In an embodiment of the present invention, when the conversion subsystem 330 converts the selected pixel point into a pixel point corresponding to the binocular SLAM algorithm, the selected pixel point may be converted into a pixel point corresponding to the binocular SLAM algorithm based on the image coordinate system.
In an embodiment of the present invention, the binocular subsystem 340 may first obtain a pre-measured baseline distance between cameras corresponding to the acquired images, and then calculate three-dimensional coordinates of the converted pixel points in the image overlapping area according to the baseline distance between the cameras based on a binocular SLAM algorithm.
In an embodiment of the present invention, after the monocular subsystem 320 initializes the monocular SLAM algorithm according to the coordinate information of the camera coordinate system, the monocular subsystem 320, the binocular subsystem 340, and the conversion subsystem 330 may further carry out a process of generating an object point cloud, as follows:
and the monocular subsystem 320 selects pixel points in the image overlapping region from the acquired image according to the initialized monocular SLAM algorithm.
The image overlapping area is a content overlapping area of two images acquired by cameras at different angles. Moreover, since the camera mounted on the vehicle can capture images in real time, the monocular subsystem 320 may select pixel points within the image overlap region from the images captured in real time.
The conversion subsystem 330 converts the selected pixel points into pixel points which are suitable for the binocular subsystem 340 and correspond to the binocular SLAM algorithm.
And the binocular subsystem 340 calculates the three-dimensional coordinates of the converted pixel points in the image overlapping area based on a binocular SLAM algorithm.
The conversion subsystem 330 converts the three-dimensional coordinates of the pixel points into coordinate information of a camera coordinate system suitable for the monocular subsystem 320.
The monocular subsystem 320 calculates the pose change of the camera acquiring the image from an angle based on the coordinate information of the camera coordinate system by using a monocular SLAM algorithm, and calculates the three-dimensional coordinates of any object pixel point in the non-overlapping area of the image according to the pose change. The image non-overlapping area is a region where the contents of two images acquired through different angles do not overlap.
And the monocular subsystem 320 generates point clouds according to the three-dimensional coordinates of the pixel points in the non-overlapping area and the overlapping area of the images.
In an embodiment of the present invention, when the monocular subsystem 320 calculates the three-dimensional coordinates of the pixel points in the non-overlapping region of the image according to the pose change, the displacement distance between any two different times of the camera that collects the image from an angle may be calculated according to the pose change by the initialized monocular SLAM algorithm. And then, calculating the three-dimensional coordinates of the pixel points in the non-overlapping area of the image according to the displacement distance by using the initialized monocular SLAM algorithm.
In an embodiment of the invention, the pose change of the camera comprises a rotation matrix and/or a translation matrix of the camera.
An embodiment of the present invention further provides a computer storage medium, where a computer program code is stored, and when the computer program code runs on a computing device, the computer storage medium causes the computing device to execute the initialization method of the monocular SLAM algorithm in any of the above embodiments.
According to any one or a combination of the above preferred embodiments, the following advantages can be achieved by the embodiments of the present invention:
In the embodiment of the invention, pixel points in the image overlapping area are selected from the acquired images and converted into pixel points corresponding to the binocular SLAM algorithm; their three-dimensional coordinates in the image overlapping area are calculated based on the binocular SLAM algorithm, converted into coordinate information in the camera coordinate system, and used to initialize the monocular SLAM algorithm. The binocular SLAM algorithm thereby assists the monocular SLAM algorithm in initializing, so that the monocular system corresponding to the monocular SLAM algorithm initializes faster and more reliably, and the stability and calculation precision of the monocular SLAM algorithm are improved.
It is clear to those skilled in the art that the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and for the sake of brevity, further description is omitted here.
In addition, the functional units in the embodiments of the present invention may be physically independent of each other, two or more functional units may be integrated together, or all the functional units may be integrated in one processing unit. The integrated functional units may be implemented in the form of hardware, or in the form of software or firmware.
Those of ordinary skill in the art will understand that the integrated functional units, if implemented in software and sold or used as a stand-alone product, may be stored in a computer-readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computing device (e.g., a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. The aforementioned storage medium includes various media capable of storing program code, such as a USB flash drive, a removable hard disk, read-only memory (ROM), random access memory (RAM), and magnetic or optical disks.
Alternatively, all or part of the steps of implementing the foregoing method embodiments may be implemented by hardware (such as a computing device, e.g., a personal computer, a server, or a network device) associated with program instructions, which may be stored in a computer-readable storage medium, and when the program instructions are executed by a processor of the computing device, the computing device executes all or part of the steps of the method according to the embodiments of the present invention.
Finally, it should be noted that: the above embodiments are only used to illustrate the technical solution of the present invention, and not to limit the same; while the invention has been described in detail and with reference to the foregoing embodiments, it will be understood by those skilled in the art that: the technical solutions described in the foregoing embodiments can be modified or some or all of the technical features can be equivalently replaced within the spirit and principle of the present invention; such modifications or substitutions do not depart from the scope of the present invention.

Claims (9)

1. A monocular SLAM algorithm initialization method comprises the following steps:
selecting pixel points in an image overlapping region from the collected images, wherein the image overlapping region is a content overlapping region of two images collected through different angles;
converting the selected pixel points into pixel points corresponding to a binocular SLAM algorithm, and calculating three-dimensional coordinates of the converted pixel points in an image overlapping area based on the binocular SLAM algorithm;
converting the three-dimensional coordinates of the pixel points into coordinate information of a camera coordinate system;
and initializing the monocular SLAM algorithm according to the coordinate information of the camera coordinate system.
2. The method of claim 1, wherein selecting pixel points within an image overlap region from the captured image comprises:
acquiring a collected image, and inputting the acquired image into a preset camera model;
and after the image is processed by the camera model, selecting pixel points in an image overlapping area from the processed image according to a monocular SLAM algorithm.
3. The method of claim 1 or 2, wherein converting the selected pixel points to pixel points corresponding to a binocular SLAM algorithm comprises:
and converting the selected pixel points into pixel points corresponding to a binocular SLAM algorithm based on an image coordinate system.
4. The method of claim 1 or 2, wherein calculating three-dimensional coordinates of the transformed pixel points in the image overlap region based on the binocular SLAM algorithm comprises:
acquiring a base line distance between cameras corresponding to pre-measured acquired images;
and calculating the three-dimensional coordinates of the converted pixel points in the image overlapping area based on the binocular SLAM algorithm and according to the baseline distance between the cameras.
5. The method of claim 1, wherein after initializing the monocular SLAM algorithm according to the coordinate information of the camera coordinate system, further comprising:
selecting pixel points in an image overlapping area from the collected images according to an initialized monocular SLAM algorithm, wherein the image overlapping area is a content overlapping area of two images collected from different angles;
converting the selected pixel points into pixel points corresponding to a binocular SLAM algorithm, and calculating three-dimensional coordinates of the converted pixel points in an image overlapping area based on the binocular SLAM algorithm;
converting the three-dimensional coordinates of the pixel points into coordinate information of a camera coordinate system, and calculating the pose change of a camera for collecting images from an angle based on the coordinate information of the camera coordinate system by using the initialized monocular SLAM algorithm;
calculating three-dimensional coordinates of pixel points in an image non-overlapping area according to the pose change, wherein the image non-overlapping area is an area where the contents of two images acquired through different angles do not overlap;
and generating a point cloud according to the three-dimensional coordinates of the pixel points in the non-overlapping area and the overlapping area of the image.
6. The method of claim 5, wherein calculating three-dimensional coordinates of pixel points within a non-overlapping region of the image from the pose changes comprises:
calculating the displacement distance between any two different moments of a camera which collects images from an angle through an initialized monocular SLAM algorithm according to the pose change;
and calculating the three-dimensional coordinates of the pixel points in the non-overlapping area of the image according to the displacement distance by using the initialized monocular SLAM algorithm.
7. The method of claim 5 or 6,
the pose change of the camera comprises: a rotation matrix and/or a translation matrix of the camera.
8. An initialization system of monocular SLAM algorithm comprises a monocular subsystem, a binocular subsystem and a conversion subsystem, wherein,
the monocular subsystem selects pixel points in an image overlapping area from the collected images, wherein the image overlapping area is a content overlapping area of two images collected through different angles;
the conversion subsystem converts the selected pixel points into pixel points which are suitable for the binocular subsystem and correspond to a binocular SLAM algorithm;
the binocular subsystem calculates the three-dimensional coordinates of the converted pixel points in the image overlapping area based on the binocular SLAM algorithm;
the conversion subsystem is used for converting the three-dimensional coordinates of the pixel points into coordinate information of a camera coordinate system suitable for the monocular subsystem;
and the monocular subsystem initializes the monocular SLAM algorithm according to the coordinate information of the camera coordinate system.
9. A computer storage medium having computer program code stored thereon which, when run on a computing device, causes the computing device to perform the method of initializing the monocular SLAM algorithm of any one of claims 1-7.
CN201910319610.0A 2019-04-19 2019-04-19 Monocular SLAM algorithm initialization method and system Active CN110060202B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910319610.0A CN110060202B (en) 2019-04-19 2019-04-19 Monocular SLAM algorithm initialization method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910319610.0A CN110060202B (en) 2019-04-19 2019-04-19 Monocular SLAM algorithm initialization method and system

Publications (2)

Publication Number Publication Date
CN110060202A CN110060202A (en) 2019-07-26
CN110060202B true CN110060202B (en) 2021-06-08

Family

ID=67319832

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910319610.0A Active CN110060202B (en) 2019-04-19 2019-04-19 Monocular SLAM algorithm initialization method and system

Country Status (1)

Country Link
CN (1) CN110060202B (en)

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10678244B2 (en) 2017-03-23 2020-06-09 Tesla, Inc. Data synthesis for autonomous control systems
US11409692B2 (en) 2017-07-24 2022-08-09 Tesla, Inc. Vector computational unit
US10671349B2 (en) 2017-07-24 2020-06-02 Tesla, Inc. Accelerated mathematical engine
US11157441B2 (en) 2017-07-24 2021-10-26 Tesla, Inc. Computational array microprocessor system using non-consecutive data formatting
US11893393B2 (en) 2017-07-24 2024-02-06 Tesla, Inc. Computational array microprocessor system with hardware arbiter managing memory requests
US11561791B2 (en) 2018-02-01 2023-01-24 Tesla, Inc. Vector computational unit receiving data elements in parallel from a last row of a computational array
US11215999B2 (en) 2018-06-20 2022-01-04 Tesla, Inc. Data pipeline and deep learning system for autonomous driving
US11361457B2 (en) 2018-07-20 2022-06-14 Tesla, Inc. Annotation cross-labeling for autonomous control systems
US11636333B2 (en) 2018-07-26 2023-04-25 Tesla, Inc. Optimizing neural network structures for embedded systems
US11562231B2 (en) 2018-09-03 2023-01-24 Tesla, Inc. Neural networks for embedded devices
SG11202103493QA (en) 2018-10-11 2021-05-28 Tesla Inc Systems and methods for training machine models with augmented data
US11196678B2 (en) 2018-10-25 2021-12-07 Tesla, Inc. QOS manager for system on a chip communications
US11816585B2 (en) 2018-12-03 2023-11-14 Tesla, Inc. Machine learning models operating at different frequencies for autonomous vehicles
US11537811B2 (en) 2018-12-04 2022-12-27 Tesla, Inc. Enhanced object detection for autonomous vehicles based on field view
US11610117B2 (en) 2018-12-27 2023-03-21 Tesla, Inc. System and method for adapting a neural network model on a hardware platform
US10997461B2 (en) 2019-02-01 2021-05-04 Tesla, Inc. Generating ground truth for machine learning from time series elements
US11567514B2 (en) 2019-02-11 2023-01-31 Tesla, Inc. Autonomous and user controlled vehicle summon to a target
US10956755B2 (en) 2019-02-19 2021-03-23 Tesla, Inc. Estimating object properties using visual image data
CN113661513A (en) * 2019-12-27 2021-11-16 深圳市大疆创新科技有限公司 Image processing method, image processing device, image processing system and storage medium
CN111311684B (en) * 2020-04-01 2021-02-05 亮风台(上海)信息科技有限公司 Method and equipment for initializing SLAM
CN112348868A (en) * 2020-11-06 2021-02-09 养哇(南京)科技有限公司 Method and system for recovering monocular SLAM scale through detection and calibration
CN112380963A (en) * 2020-11-11 2021-02-19 东软睿驰汽车技术(沈阳)有限公司 Depth information determination method and device based on panoramic all-round looking system
CN112067007B (en) * 2020-11-12 2021-01-29 湖北亿咖通科技有限公司 Map generation method, computer storage medium, and electronic device
CN113763252B (en) * 2021-09-16 2022-12-09 中国电子科技集团公司第五十四研究所 Geodetic coordinate system and SLAM coordinate system conversion method for unmanned aerial vehicle
CN117434570B (en) * 2023-12-20 2024-02-27 绘见科技(深圳)有限公司 Visual measurement method, measurement device and storage medium for coordinates

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016021252A1 (en) * 2014-08-05 2016-02-11 ソニー株式会社 Information processing device, information processing method, and image display system
CN106570913A (en) * 2016-11-04 2017-04-19 上海玄彩美科网络科技有限公司 Feature based monocular SLAM (Simultaneous Localization and Mapping) quick initialization method
CN106595639A (en) * 2016-12-27 2017-04-26 纳恩博(北京)科技有限公司 Positioning system and positioning method and device thereof and robot
CN107657640A (en) * 2017-09-30 2018-02-02 南京大典科技有限公司 Intelligent patrol inspection management method based on ORB SLAM
CN109493378A (en) * 2018-10-29 2019-03-19 宁波研新工业科技有限公司 A kind of measuring for verticality method combined based on monocular vision with binocular vision

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8930023B2 (en) * 2009-11-06 2015-01-06 Irobot Corporation Localization by learning of wave-signal distributions

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016021252A1 (en) * 2014-08-05 2016-02-11 ソニー株式会社 Information processing device, information processing method, and image display system
CN106570913A (en) * 2016-11-04 2017-04-19 上海玄彩美科网络科技有限公司 Feature based monocular SLAM (Simultaneous Localization and Mapping) quick initialization method
CN106595639A (en) * 2016-12-27 2017-04-26 纳恩博(北京)科技有限公司 Positioning system and positioning method and device thereof and robot
CN107657640A (en) * 2017-09-30 2018-02-02 南京大典科技有限公司 Intelligent patrol inspection management method based on ORB SLAM
CN109493378A (en) * 2018-10-29 2019-03-19 宁波研新工业科技有限公司 A kind of measuring for verticality method combined based on monocular vision with binocular vision

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Fast initialization for feature-based monocular SLAM; A. Shaobo et al.; 2017 IEEE International Conference on Image Processing (ICIP); 2017-12-31; pp. 2119-2123 *
室内环境下结合里程计的双目视觉SLAM研究 (Research on binocular vision SLAM combined with odometry in indoor environments); 王晓华 et al.; 西安理工大学学报 (Journal of Xi'an University of Technology); 2009-04-30; Vol. 25, No. 4; pp. 466-471 *

Also Published As

Publication number Publication date
CN110060202A (en) 2019-07-26

Similar Documents

Publication Publication Date Title
CN110060202B (en) Monocular SLAM algorithm initialization method and system
KR102275310B1 (en) Mtehod of detecting obstacle around vehicle
JP7444605B2 (en) How to calculate the location of the tow hitch
KR101842811B1 (en) Driver assistance system for displaying surroundings of a vehicle
JP6675448B2 (en) Vehicle position detecting method and device
JP5739584B2 (en) 3D image synthesizing apparatus and method for visualizing vehicle periphery
CN103778649A (en) Imaging surface modeling for camera modeling and virtual view synthesis
DK2239706T3 (en) A method for real-time camera and obtaining visual information of three-dimensional scenes
KR20090103165A (en) Monocular Motion Stereo-Based Free Parking Space Detection Apparatus and Method
KR20150008056A (en) Single-camera distance estimation
CN107590444B (en) Method and device for detecting static obstacle and storage medium
CN110084851B (en) Binocular point cloud generation method and system
CN103411536A (en) Auxiliary driving obstacle detection method based on binocular stereoscopic vision
JP7107931B2 (en) Method and apparatus for estimating range of moving objects
US9892519B2 (en) Method for detecting an object in an environmental region of a motor vehicle, driver assistance system and motor vehicle
US20160288711A1 (en) Distance and direction estimation of a target point from a vehicle using monocular video camera
GB2574697A (en) Method, system and device of obtaining 3D-information of objects
JP2023505891A (en) Methods for measuring environmental topography
EP2432660B1 (en) Method and device for extending a visibility area
CN107209930A (en) Look around image stability method and device
JP4696925B2 (en) Image processing device
KR101071061B1 (en) Apparatus and method for driving assistance using feature of vehicle, and microprocessor and recording medium used thereto
CN112270311B (en) Near-target rapid detection method and system based on vehicle-mounted looking-around back projection
GB2540527A (en) Perspective shifting parking camera system
JP2021086258A (en) Attitude estimation apparatus and attitude estimation method

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20220323

Address after: 430051 No. b1336, chuanggu startup area, taizihu cultural Digital Creative Industry Park, No. 18, Shenlong Avenue, Wuhan Economic and Technological Development Zone, Wuhan, Hubei Province

Patentee after: Yikatong (Hubei) Technology Co.,Ltd.

Address before: No.c101, chuanggu start up area, taizihu cultural Digital Industrial Park, No.18 Shenlong Avenue, Wuhan Economic Development Zone, Hubei Province

Patentee before: HUBEI ECARX TECHNOLOGY Co.,Ltd.

TR01 Transfer of patent right