CN111862120B - Monocular SLAM scale recovery method - Google Patents


Info

Publication number
CN111862120B
CN111862120B (application CN202010711561.8A)
Authority
CN
China
Prior art keywords
calibration box
wall
monocular
scale
slam
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010711561.8A
Other languages
Chinese (zh)
Other versions
CN111862120A (en)
Inventor
郁树梅 (Yu Shumei)
郭文康 (Guo Wenkang)
孙荣川 (Sun Rongchuan)
干旻峰 (Gan Minfeng)
孙立宁 (Sun Lining)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou University
First Affiliated Hospital of Suzhou University
Original Assignee
Suzhou University
First Affiliated Hospital of Suzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou University, First Affiliated Hospital of Suzhou University
Priority to CN202010711561.8A
Publication of CN111862120A
Application granted
Publication of CN111862120B
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G — PHYSICS
    • G06 — COMPUTING; CALCULATING OR COUNTING
    • G06T — IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T 7/00 — Image analysis
    • G06T 7/10 — Segmentation; Edge detection
    • G06T 7/11 — Region-based segmentation
    • G06T 7/13 — Edge detection
    • G06T 7/80 — Analysis of captured images to determine intrinsic or extrinsic camera parameters, i.e. camera calibration
    • Y — GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 — TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02T — CLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T 10/00 — Road transport of goods or passengers
    • Y02T 10/10 — Internal combustion engine [ICE] based vehicles
    • Y02T 10/40 — Engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Length Measuring Devices By Optical Means (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a monocular SLAM scale recovery method comprising the following steps: manufacturing a calibration box through which a rectangular through hole is formed; placing the calibration box in the field of view of a monocular camera, which models the target object after passing through the through hole; acquiring images of the inner wall of the calibration box and segmenting the upper and lower inner-wall regions in the first few key frames generated by SLAM; obtaining the three-dimensional map point coordinates of the upper and lower inner-wall images from the correspondence between feature points and three-dimensional map points in SLAM; fitting the upper and lower map point coordinates under the constraint that the two surfaces are planes, and calculating the distance between the upper and lower inner-wall surfaces in the three-dimensional map; calculating a scale factor; and recovering the scale of the entire three-dimensional map with the scale factor F. The method automatically recovers the metric scale of monocular SLAM in the human intestinal environment, with good effect and high accuracy.

Description

Monocular SLAM scale recovery method
Technical Field
The invention relates to the technical field of surgical robot navigation, in particular to a method for recovering monocular SLAM scale.
Background
A single image from a monocular camera cannot reveal the actual distance from an object to the camera. Monocular visual SLAM recovers that distance by triangulating matched feature points between adjacent key frames, but the result is a pure number without units: the scale is ambiguous, so only the structure of the object is recovered, not its real-world size.
Currently, scale is mainly recovered in large-scene environments by adding other sensors, such as an IMU, a GPS or a laser radar. The scale information provided by such an additional sensor can recover the absolute scale with high accuracy, but sensor size makes these methods unusable in the intestinal environment. One laparoscope-based method for reconstructing the three-dimensional structure of an abdominal scene places a graduated ruler in the abdominal cavity and performs scale recovery manually, which is inconvenient in actual operation; other schemes for reconstructing endoscopic scenes ultimately recover no scale information at all.
Disclosure of Invention
The invention aims to provide a monocular SLAM scale recovery method that resolves the scale ambiguity of monocular SLAM in the human intestinal environment.
To solve this technical problem, the invention provides a monocular SLAM scale recovery method comprising the following steps:
S1, manufacturing a calibration box, wherein a rectangular through hole is formed through the calibration box, and the distance between the upper and lower surfaces of the wall of the through hole is d_i′;
S2, placing the calibration box in the field of view of the monocular camera, and modeling the target object after the monocular camera passes through the through hole of the calibration box;
S3, the monocular camera acquiring images of the inner wall of the calibration box, and segmenting into regions the upper and lower inner-wall images in the first few key frames generated by SLAM;
S4, obtaining the three-dimensional map point coordinates of the upper and lower inner-wall images respectively from the correspondence between feature points and three-dimensional map points in SLAM;
S5, fitting the three-dimensional map point coordinates of the upper and lower inner-wall surfaces under the constraint that the two surfaces are planes, and calculating the distance d_i between the upper and lower inner-wall surfaces in the three-dimensional map;
S6, calculating a scale factor F from d_i′ and d_i;
S7, recovering the scale of the entire three-dimensional map using the scale factor F.
Preferably, the region segmentation in S3 takes the four ridge lines of the inner wall of the calibration box as boundaries.
Preferably, taking the four ridge lines of the inner wall of the calibration box as boundaries specifically includes:
detecting edge information in the first few key frames;
obtaining the straight-line edges in the edge information;
and connecting the straight lines lying on the same ridge line end to end and merging them.
Preferably, the edge information of the first few key frames is detected with a Canny, Roberts, Prewitt or Sobel operator.
Preferably, obtaining the straight-line edges in the edge information specifically includes:
obtaining the straight-line edges with a Hough line detection algorithm or a Freeman line detection algorithm, while setting a threshold to filter out short straight-line edges.
Preferably, in S3, the key frame images are the first five frames.
Preferably, S6 specifically includes: F = d_i′ / d_i.
Preferably, S7 specifically includes multiplying the coordinates of all three-dimensional map points constructed by the monocular SLAM by the scale factor F.
The invention has the beneficial effects that:
the method utilizes the auxiliary of the calibration box to automatically recover the scale of the monocular SLAM in the human intestinal environment, has good effect and high precision, is suitable for a narrow environment, and can solve the problem that the scale of the monocular SLAM is difficult to recover in the narrow human body cavity environment.
Drawings
FIG. 1 is a flow chart of the present invention;
FIG. 2 is an external structure of the calibration box;
FIG. 3 is an internal view of a calibration box;
FIG. 4 is a simulation experiment scenario;
FIG. 5 is a contour extracted from a key frame of SLAM by a canny operator;
FIG. 6 is a straight line segment detected by Hough;
FIG. 7 shows the upper and lower surfaces of the calibration box obtained by region division;
FIG. 8 is the SLAM sparse feature point map after scale recovery.
Detailed Description
The present invention is further described below with reference to the accompanying drawings and specific embodiments, which are not intended to be limiting, so that those skilled in the art can better understand and practice the invention.
In ORB-SLAM2, map initialization sets the median depth of the map points generated from the first frame to a unit size of 1. To recover the scale, the physical size (with units) of some object in the real scene and the corresponding size (without units) of that object in the map must be known in advance; from these a scale factor is computed, which then restores the scale information of every object in the scene.
Referring to FIG. 1, the invention discloses a method for monocular SLAM scale recovery, which comprises the following steps:
S1, manufacturing a calibration box, wherein a rectangular through hole is formed through the calibration box, and the distance between the upper and lower surfaces of the wall of the through hole is d_i′.
As shown in fig. 2 and fig. 3, the inner wall of the calibration box carries visual features; the box is a cuboid, square at the front and back and open at both ends, with external dimensions of 250 mm by 51 mm and internal dimensions of 250 mm by 42 mm.
S2, placing the calibration box in the field of view of the monocular camera; the camera models the target object after passing through the through hole of the calibration box. In this embodiment, the calibration box may be placed at the front end of the intestinal inlet, and the monocular camera enters the intestinal tract for modeling after passing through the calibration box. The monocular camera may be an endoscope.
S3, the monocular camera acquires images of the inner wall of the calibration box, and the upper and lower inner-wall images in the first few key frames generated by SLAM are segmented into regions. The first five key frames are preferred: with fewer than five, too few map points are generated and the error grows; beyond five, the endoscope is already capturing the intestinal scene, which hinders the region segmentation.
The region segmentation in S3 takes the four ridge lines of the inner wall of the calibration box as boundaries and specifically includes the following steps:
detecting the edge information of the first few key frames with a Canny, Roberts, Prewitt or Sobel operator;
obtaining the straight-line edges in the edge information with a Hough or Freeman line detection algorithm, while setting a threshold to filter out short straight-line edges;
and, according to the coordinate ranges of the four ridge lines, connecting the straight lines detected on each ridge line end to end and merging them.
As shown in fig. 4, in an experimental scenario, a calibration box is placed at the front end of the intestinal inlet, and an endoscope passes through the calibration box and then enters the intestinal tract to perform intestinal modeling.
Fig. 6 shows the straight line segments detected by the Hough transform. A length threshold is set during detection, and shorter line segments are removed.
Fig. 7 shows the upper and lower surfaces of the calibration box obtained by region segmentation; the blue lines are the segmentation boundaries. The upper and lower planes are separated, according to their coordinate ranges, by the straight line segments detected by the Hough transform.
S4, obtaining the three-dimensional map point coordinates of the upper and lower inner-wall images respectively from the index correspondence between feature points and three-dimensional map points in SLAM.
S5, fitting the three-dimensional map point coordinates of the upper and lower inner-wall surfaces under the constraint that the two surfaces are planes, and calculating the distance d_i between the upper and lower inner-wall surfaces in the three-dimensional map.
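One way to realize the fitting in S5 is sketched below; the patent does not give a formula, so this is an assumption: the constraint is taken to mean that the two inner-wall surfaces are parallel planes (as in a rigid box), enforced by estimating a single shared normal from both point sets, with d_i being the centroid gap along that normal. The function name is illustrative.

```python
import numpy as np

def parallel_plane_distance(pts_top, pts_bottom):
    """Fit two parallel planes to the top/bottom map-point sets and return
    the distance between them (in unscaled map units).

    The parallelism constraint is enforced with one shared normal: remove
    each cloud's centroid, stack the centered points, and take the right
    singular vector of the smallest singular value as the common normal."""
    p1 = np.asarray(pts_top, dtype=float)
    p2 = np.asarray(pts_bottom, dtype=float)
    c1, c2 = p1.mean(axis=0), p2.mean(axis=0)
    stacked = np.vstack([p1 - c1, p2 - c2])
    _, _, vt = np.linalg.svd(stacked, full_matrices=False)
    n = vt[-1]  # common plane normal (least-variance direction)
    return abs(float(np.dot(n, c1 - c2)))  # centroid gap along the normal
```

Fitting one normal to both clouds is more robust than fitting each plane independently, since the noisy map points of each surface alone may yield slightly non-parallel planes with an ill-defined distance.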
S6, calculating the scale factor F from d_i′ and d_i: F = d_i′ / d_i.
S7, recovering the scale of the entire three-dimensional map using the scale factor F: the coordinates of all three-dimensional map points constructed by the monocular SLAM are multiplied by F.
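Steps S6 and S7 amount to one division and one point-wise multiplication; a minimal sketch (function and parameter names are illustrative):

```python
import numpy as np

def recover_scale(map_points, d_real, d_map):
    """Apply the scale factor F = d_i' / d_i to every 3-D map point.

    d_real: known physical distance between the inner top and bottom
            surfaces of the calibration box (with units, e.g. mm);
    d_map:  the same distance measured in the unscaled SLAM map."""
    F = d_real / d_map
    return F * np.asarray(map_points, dtype=float)
```

For the box of this embodiment, d_real would be the 42 mm inner height; every map coordinate is then expressed in millimetres.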
Fig. 8 shows the SLAM sparse feature point map after scale recovery; the unit is cm.
The above-described embodiments are merely preferred embodiments for fully explaining the present invention, and the scope of the present invention is not limited thereto. Equivalent substitutions and modifications will occur to those skilled in the art based on the present invention, and are intended to be within the scope of the present invention. The protection scope of the invention is subject to the claims.

Claims (9)

1. A monocular SLAM scale recovery method, comprising the following steps:
S1, manufacturing a calibration box, wherein a rectangular through hole is formed through the calibration box, and the distance between the upper and lower surfaces of the wall of the through hole is d_i′;
S2, placing the calibration box in the field of view of a monocular camera, the calibration box being placed at the front end of an intestinal inlet, and modeling the intestinal tract after the monocular camera passes through the through hole of the calibration box;
S3, the monocular camera acquiring images of the inner wall of the calibration box, and segmenting into regions the upper and lower inner-wall images in the first few key frames generated by SLAM;
S4, obtaining the three-dimensional map point coordinates of the upper and lower inner-wall images respectively from the correspondence between feature points and three-dimensional map points in SLAM;
S5, fitting the three-dimensional map point coordinates of the upper and lower inner-wall surfaces under the constraint that the two surfaces are planes, and calculating the distance d_i between the upper and lower inner-wall surfaces in the three-dimensional map;
S6, calculating a scale factor F from d_i′ and d_i;
S7, recovering the scale of the entire three-dimensional map using the scale factor F.
2. The monocular SLAM scale recovery method of claim 1, wherein the region segmentation in S3 takes the four ridge lines of the inner wall of the calibration box as boundaries.
3. The monocular SLAM scale recovery method of claim 2, wherein taking the four ridge lines of the inner wall of the calibration box as boundaries specifically includes:
detecting edge information in the first few key frames;
obtaining the straight-line edges in the edge information;
and connecting the straight lines lying on the same ridge line end to end and merging them.
4. The monocular SLAM scale recovery method of claim 3, wherein the edge information of the first few key frames is detected with a Canny, Roberts, Prewitt or Sobel operator.
5. The monocular SLAM scale recovery method of claim 3, wherein obtaining the straight-line edges in the edge information specifically includes:
obtaining the straight-line edges in the edge information with a Hough line detection algorithm or a Freeman line detection algorithm, while setting a threshold to filter out short straight-line edges.
6. The monocular SLAM scale recovery method of claim 1, wherein in S3 the key frame images are the first five frames.
7. The monocular SLAM scale recovery method of claim 1, wherein S6 specifically includes: F = d_i′ / d_i.
8. The monocular SLAM scale recovery method of claim 1, wherein S7 specifically includes multiplying the coordinates of all three-dimensional map points constructed by the monocular SLAM by the scale factor F.
9. A processor for running a program, wherein the program when run performs the method of any one of claims 1 to 8.
CN202010711561.8A 2020-07-22 2020-07-22 Monocular SLAM scale recovery method Active CN111862120B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010711561.8A CN111862120B (en) 2020-07-22 2020-07-22 Monocular SLAM scale recovery method


Publications (2)

Publication Number Publication Date
CN111862120A 2020-10-30
CN111862120B 2023-07-11

Family

ID=72949208

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010711561.8A Active CN111862120B (en) 2020-07-22 2020-07-22 Monocular SLAM scale recovery method

Country Status (1)

Country Link
CN (1) CN111862120B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112348869A (en) * 2020-11-17 2021-02-09 的卢技术有限公司 Method for recovering monocular SLAM scale through detection and calibration
CN113779012B (en) * 2021-09-16 2023-03-07 中国电子科技集团公司第五十四研究所 Monocular vision SLAM scale recovery method for unmanned aerial vehicle

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106803270A (en) * 2017-01-13 2017-06-06 西北工业大学深圳研究院 Unmanned aerial vehicle platform is based on many key frames collaboration ground target localization method of monocular SLAM
CN106920279A (en) * 2017-03-07 2017-07-04 百度在线网络技术(北京)有限公司 Three-dimensional map construction method and device
CN108090954A (en) * 2017-12-15 2018-05-29 南方医科大学南方医院 Abdominal cavity environmental map based on characteristics of image rebuilds the method with laparoscope positioning
CN110533587A (en) * 2019-07-03 2019-12-03 浙江工业大学 A kind of SLAM method of view-based access control model prior information and map recovery
CN111028280A (en) * 2019-12-09 2020-04-17 西安交通大学 # -shaped structured light camera system and method for performing scaled three-dimensional reconstruction of target
CN111141295A (en) * 2019-12-20 2020-05-12 南京航空航天大学 Automatic map recovery method based on monocular ORB-SLAM
CN111145238A (en) * 2019-12-12 2020-05-12 中国科学院深圳先进技术研究院 Three-dimensional reconstruction method and device of monocular endoscope image and terminal equipment
CN111402429A (en) * 2020-06-08 2020-07-10 成都索贝数码科技股份有限公司 Scale reduction and three-dimensional reconstruction method, system, storage medium and equipment


Also Published As

Publication number Publication date
CN111862120A (en) 2020-10-30

Similar Documents

Publication Publication Date Title
CN108717712B (en) Visual inertial navigation SLAM method based on ground plane hypothesis
CN110264567B (en) Real-time three-dimensional modeling method based on mark points
Zhou et al. Seamless fusion of LiDAR and aerial imagery for building extraction
CN109472828B (en) Positioning method, positioning device, electronic equipment and computer readable storage medium
Tamas et al. Targetless calibration of a lidar-perspective camera pair
CN107016705A (en) Ground level estimation in computer vision system
CN111862120B (en) Monocular SLAM scale recovery method
CN110675455B (en) Natural scene-based self-calibration method and system for vehicle body looking-around camera
CN111260715B (en) Depth map processing method, small obstacle detection method and system
JP4568845B2 (en) Change area recognition device
CN111798505A (en) Monocular vision-based dense point cloud reconstruction method and system for triangularized measurement depth
CN112179373A (en) Measuring method of visual odometer and visual odometer
CN109493378B (en) Verticality detection method based on combination of monocular vision and binocular vision
Dhiman et al. A multi-frame stereo vision-based road profiling technique for distress analysis
Bhowmick et al. Mobiscan3D: A low cost framework for real time dense 3D reconstruction on mobile devices
Paudel et al. 2D-3D camera fusion for visual odometry in outdoor environments
CN109902695B (en) Line feature correction and purification method for image pair linear feature matching
Siddique et al. 3d object localization using 2d estimates for computer vision applications
Kochi et al. 3D modeling of architecture by edge-matching and integrating the point clouds of laser scanner and those of digital camera
Douret et al. A multi-cameras 3d volumetric method for outdoor scenes: a road traffic monitoring application
CN115880643A (en) Social distance monitoring method and device based on target detection algorithm
JP3966419B2 (en) Change area recognition apparatus and change recognition system
CN114494582A (en) Three-dimensional model dynamic updating method based on visual perception
Förstner et al. Automatic orientation and recognition in highly structured scenes
Yang et al. Road detection by RANSAC on randomly sampled patches with slanted plane prior

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant