CN112381886B - Three-dimensional scene reconstruction method based on multiple cameras, storage medium and electronic equipment - Google Patents

Three-dimensional scene reconstruction method based on multiple cameras, storage medium and electronic equipment

Info

Publication number
CN112381886B
CN112381886B
Authority
CN
China
Prior art keywords
cameras
point cloud
camera
point
dimensional scene
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011281518.9A
Other languages
Chinese (zh)
Other versions
CN112381886A (en)
Inventor
邱又海
黄智洲
徐倩茹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Zhouming Technology Co Ltd
Original Assignee
Shenzhen Zhouming Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Zhouming Technology Co Ltd filed Critical Shenzhen Zhouming Technology Co Ltd
Priority to CN202011281518.9A priority Critical patent/CN112381886B/en
Publication of CN112381886A publication Critical patent/CN112381886A/en
Application granted granted Critical
Publication of CN112381886B publication Critical patent/CN112381886B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/80Analysis of captured images to determine intrinsic or extrinsic camera parameters, i.e. camera calibration
    • G06T7/85Stereo camera calibration
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00Three dimensional [3D] modelling, e.g. data description of 3D objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/30Determination of transform parameters for the alignment of images, i.e. image registration
    • G06T7/33Determination of transform parameters for the alignment of images, i.e. image registration using feature-based methods
    • G06T7/344Determination of transform parameters for the alignment of images, i.e. image registration using feature-based methods involving models
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • G06T7/73Determining position or orientation of objects or cameras using feature-based methods
    • G06T7/75Determining position or orientation of objects or cameras using feature-based methods involving models
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10024Color image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10028Range image; Depth image; 3D point clouds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30204Marker
    • G06T2207/30208Marker matrix
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30244Camera pose

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computer Graphics (AREA)
  • Geometry (AREA)
  • Software Systems (AREA)
  • Length Measuring Devices By Optical Means (AREA)
  • Image Processing (AREA)

Abstract

The invention discloses a multi-camera-based three-dimensional scene reconstruction method, a storage medium and an electronic device. The method comprises the following steps: acquiring a preliminary position relationship calibration matrix among a plurality of cameras; acquiring color images and depth images respectively shot by the plurality of cameras of a scene to be reconstructed in which a plurality of checkerboards are placed, and acquiring a point cloud calibration matrix among the plurality of cameras according to the color images and depth images of the plurality of cameras and the preliminary position relationship calibration matrix among them; and splicing the point clouds of the cameras according to the point cloud calibration matrix among the cameras to obtain the reconstructed three-dimensional scene. According to the invention, with the cameras fixed, the point clouds of the plurality of cameras are registered and spliced by a machine vision method to obtain the reconstructed three-dimensional scene, so that three-dimensional scene reconstruction is achieved better and faster while saving time and labor and remaining broadly applicable.

Description

Three-dimensional scene reconstruction method based on multiple cameras, storage medium and electronic equipment
Technical Field
The present invention relates to the field of image processing technologies, and in particular, to a three-dimensional scene reconstruction method based on multiple cameras, a storage medium, and an electronic device.
Background
In augmented reality applications, in order for the user to obtain an immersive experience, the real scene must be reconstructed. In the prior art, an indoor three-dimensional scene reconstruction method based on a double-layer registration strategy uses the SURF (Speeded-Up Robust Features) algorithm to extract and match feature points, maps the matched points to three-dimensional coordinates, and performs registration with the RANSAC (Random Sample Consensus) and ICP (Iterative Closest Point) algorithms.
Disclosure of Invention
The technical problem to be solved by the invention is to provide a multi-camera-based three-dimensional scene reconstruction method, a storage medium and an electronic device that can rapidly achieve three-dimensional scene reconstruction.
In order to solve the technical problems, the invention adopts the following technical scheme:
a three-dimensional scene reconstruction method based on multiple cameras comprises the following steps:
acquiring a preliminary position relationship calibration matrix among a plurality of cameras;
acquiring color images and depth images which are respectively shot by a plurality of cameras on a scene to be reconstructed, wherein the scene to be reconstructed is provided with a plurality of checkerboards, and acquiring a point cloud calibration matrix among the plurality of cameras according to the color images and the depth images of the plurality of cameras and a preliminary position relationship calibration matrix among the plurality of cameras;
and splicing the point clouds of the cameras according to the point cloud calibration matrixes among the cameras to obtain a reconstructed three-dimensional scene.
In order to solve the technical problems, the invention adopts another technical scheme that:
a computer readable storage medium having stored thereon a computer program which when executed by a processor implements the multi-camera based three-dimensional scene reconstruction method described above.
In order to solve the technical problems, the invention adopts another technical scheme that:
an electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the multi-camera based three-dimensional scene reconstruction method described above when executing the computer program.
The invention has the beneficial effects that: a preliminary position relationship calibration matrix among a plurality of cameras is obtained first; a plurality of checkerboards are then placed in the three-dimensional scene to be reconstructed, and a point cloud calibration matrix among the plurality of cameras is obtained according to the color images and point clouds of the plurality of checkerboards seen by the different cameras; finally the point clouds of the plurality of cameras are spliced according to the point cloud calibration matrix among them to obtain the reconstructed three-dimensional scene. That is, with the cameras fixed, the point clouds of the plurality of cameras are registered and spliced by a machine vision method to obtain the reconstructed three-dimensional scene, so that three-dimensional scene reconstruction is achieved better and faster while saving time and labor and remaining broadly applicable.
Drawings
Fig. 1 is a flow chart of a three-dimensional scene reconstruction method based on multiple cameras according to an embodiment of the invention;
FIG. 2 is a schematic flow chart of generating a calibration matrix of a preliminary positional relationship according to an embodiment of the present invention;
FIG. 3 is a schematic flow chart of point cloud registration and stitching according to an embodiment of the present invention;
FIG. 4 is a schematic diagram of a checkerboard according to an embodiment of the present invention;
FIG. 5 is a schematic diagram of four checkerboards according to an embodiment of the present invention;
FIG. 6 is a schematic diagram of the effect of masking the found checkerboard according to an embodiment of the present invention;
FIG. 7 is a schematic view illustrating the effect of a region of interest of a second color image according to an embodiment of the present invention;
FIG. 8 is a schematic view of the effect of a region of interest of a point cloud according to an embodiment of the present invention;
fig. 9 is a schematic diagram of an effect of performing preliminary registration on a point cloud according to an embodiment of the present invention;
FIG. 10 is a schematic diagram of calculation for generating a calibration matrix of a preliminary positional relationship according to an embodiment of the present invention;
FIG. 11 is a perspective view of a viewing section according to an embodiment of the present invention;
FIG. 12 is a schematic view of the XZ plane of a viewing section according to an embodiment of the present invention;
FIG. 13 is a schematic diagram illustrating the positions of a point cloud and a plane of interception according to an embodiment of the present invention;
fig. 14 is a schematic view of a point cloud of a camera according to an embodiment of the present invention;
fig. 15 is a schematic view of the point cloud of a certain camera after its overlapping point cloud area is removed according to an embodiment of the present invention;
fig. 16 is a schematic view of an effect of two spliced camera point clouds according to an embodiment of the present invention;
fig. 17 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Description of the reference numerals:
1. an electronic device; 2. a processor; 3. a memory.
Detailed Description
In order to describe the technical contents, the achieved objects and effects of the present invention in detail, the following description will be made with reference to the embodiments in conjunction with the accompanying drawings.
Referring to fig. 1 to 16, the multi-camera based three-dimensional scene reconstruction method includes the steps of:
acquiring a preliminary position relationship calibration matrix among a plurality of cameras;
acquiring color images and depth images which are respectively shot by a plurality of cameras on a scene to be reconstructed, wherein the scene to be reconstructed is provided with a plurality of checkerboards, and acquiring a point cloud calibration matrix among the plurality of cameras according to the color images and the depth images of the plurality of cameras and a preliminary position relationship calibration matrix among the plurality of cameras;
and splicing the point clouds of the cameras according to the point cloud calibration matrixes among the cameras to obtain a reconstructed three-dimensional scene.
From the above description, the beneficial effects of the invention are as follows: a preliminary position relationship calibration matrix among a plurality of cameras is obtained in advance; a plurality of checkerboards are then placed in the three-dimensional scene to be reconstructed, and a point cloud calibration matrix among the plurality of cameras is obtained according to the color images and point clouds of the plurality of checkerboards seen by the different cameras; finally the point clouds of the plurality of cameras are spliced according to the point cloud calibration matrix among them to obtain the reconstructed three-dimensional scene. That is, with the cameras fixed, the point clouds of the plurality of cameras are registered and spliced by a machine vision method to obtain the reconstructed three-dimensional scene.
Further, the obtaining the calibration matrix of the preliminary positional relationship among the plurality of cameras specifically includes the following steps:
acquiring a first color image shot by each camera of a checkerboard, and obtaining the corner point coordinates of the checkerboard in each first color image;
pairing the corner point coordinates of the first color images of different cameras according to the epipolar geometry constraint principle to obtain the corner point pairing relationship among the plurality of cameras;
and obtaining the preliminary position relationship calibration matrix among the plurality of cameras according to the corner point pairing relationship among the plurality of cameras.
From the above description, an existing checkerboard detection algorithm is adopted to identify the checkerboard, and the corner point coordinates of the identified checkerboard are paired according to the epipolar geometry constraint principle, thereby obtaining the pairing relationship of the different cameras with respect to the same object, and hence the positional relationship of the color images of the different cameras, namely the preliminary position relationship calibration matrix.
Further, the obtaining color images and depth images of the plurality of cameras respectively shot for the scene to be reconstructed, where the plurality of checkerboards are placed, and obtaining a point cloud calibration matrix between the plurality of cameras according to the color images and the depth images of the plurality of cameras and the preliminary position relationship calibration matrix between the plurality of cameras specifically includes the following steps:
acquiring second color images and depth images which are respectively shot by a plurality of cameras on a scene to be reconstructed, wherein the scene to be reconstructed is provided with a plurality of checkerboards;
searching each checkerboard in each second color image, and obtaining the center point coordinate of each checkerboard in each second color image to obtain the interested area of the second color image of each camera;
obtaining a region of interest of a point cloud of each camera according to a preset position relation matrix between the second color image and the depth image in each camera and the region of interest of the second color image;
performing preliminary registration on the point clouds of all cameras according to the preliminary position relation calibration matrix among the cameras and the interested area of the point clouds of each camera to obtain the point cloud preliminary calibration matrix of all the cameras;
and performing iterative operation on the point cloud preliminary calibration matrixes of all cameras to obtain the point cloud final calibration matrixes of all cameras.
From the above description, for a camera capable of acquiring both a color image and a depth image, the positional relationship matrix between the color image and the depth image is preset. After the region of interest of the color image is obtained through the plurality of checkerboards, the region of interest of the point cloud of each camera can be obtained according to this positional relationship matrix; preliminary registration of the point clouds is achieved using the preliminary position relationship calibration matrix, and fine registration is then performed by iterative operation, yielding a more accurate point cloud final calibration matrix and laying the foundation for the reconstruction accuracy of the subsequent three-dimensional scene.
Further, the searching each checkerboard in each second color image, and obtaining the center point coordinates of each checkerboard in each second color image, and obtaining the interested area of the second color image of each camera specifically includes the following steps:
searching the checkerboard in each second color image, and obtaining the corner coordinates of the searched checkerboard to obtain the center point coordinates of the searched checkerboard;
masking the searched checkerboard, and then continuing to search other checkerboards in the second color image and obtain the central point coordinates of the other checkerboards until the central point coordinates of each checkerboard in each second color image are obtained;
performing polar angle sorting on the center point coordinates of all the checkerboards in each second color image, and then connecting the center point coordinates of the checkerboards in sequence according to the polar angle sorting result to obtain a closed area, namely the region of interest of the second color image of each camera.
As can be seen from the above description, when recognizing a plurality of checkerboards, each checkerboard is masked after it has been recognized and its center point coordinates obtained, so that the detector no longer recognizes that checkerboard and moves on to the others; recognition and positioning of all the checkerboards is thus completed and the region of interest of the camera's color image is obtained.
Further, the number of the cameras is two, the iterative operation is performed on the initial calibration matrix of the point clouds of all the cameras, and the final calibration matrix of the point clouds of all the cameras is obtained, which comprises the following steps:
the point cloud A set and the point cloud B set of the two cameras are taken as a set P and a set Q respectively, and the matching relationship M between the set P and the set Q is obtained as:
M = match(P, Q) = {(p, q) : p ∈ P, q ∈ Q}
the error between the set P and the set Q is defined as:
error(P, Q) = Σ_((p,q)∈M) w(p,q)·d(p,q)
where w(p, q) represents the weight of a corresponding point pair in the set P and the set Q, and d(p, q) represents the distance between the corresponding points in the set P and the set Q;
performing iterative operation on the point cloud preliminary calibration matrix through an iterative calculation formula to obtain the point cloud final calibration matrix of all cameras, the iterative calculation formula being:
T_(i+1) = ΔT_i · T_i, with T_0 = T_init
where i and i+1 represent the number of iterations, ΔT_i is the iterative integration of transforms estimated at iteration i, and T_init is the point cloud preliminary calibration matrix.
It can be seen from the above description that, for the calibration matrix, one of the two camera point clouds is converted to the coordinate system of the other point cloud, and then the defined error value is continuously reduced through iterative operation until the optimal final calibration matrix of the point clouds is obtained, that is, the fine registration of the point clouds of the two cameras is completed.
Further, the splicing of the point clouds of the cameras according to the point cloud calibration matrix between the cameras, and the obtaining of the reconstructed three-dimensional scene specifically includes the following steps:
calculating overlapping point cloud areas among a plurality of cameras according to point cloud calibration matrixes among the plurality of cameras, and removing the overlapping point cloud areas of one camera;
and splicing the point clouds of all cameras after the overlapping point cloud areas are removed according to the point cloud calibration matrixes among the cameras, so as to obtain a reconstructed three-dimensional scene.
As can be seen from the above description, since the point cloud data of the plurality of cameras has an overlapping region, in the embodiment, when the point cloud splicing is performed, the overlapping region is removed first, and then the two point clouds are spliced, so that the operation amount is reduced, the splicing speed is improved, and the reconstruction of the three-dimensional scene is better and faster realized.
Further, the calculating the overlapping point cloud areas between the cameras according to the point cloud calibration matrix between the cameras, and removing the overlapping point cloud area of one of the cameras specifically includes the following steps:
the visual field of the camera is equivalent to a visual cone, and objects in the visual field of the camera are cut by using a near plane and a far plane to form a visual section, wherein the visual section comprises a left face, a right face, a top face, a bottom face, a near plane and a far plane;
the camera is located at the origin of the world coordinate system, the line of sight of the camera points along the positive direction of the Z axis, and the view matrix of the camera is set as an identity matrix; a vertex is expressed in homogeneous coordinates as v = (x y z w=1)^T, and the 4×4 projection matrix S is expressed as S = (s_kl);
transforming the vertex by the projection matrix S gives v' = (x' y' z' w')^T = S·v, i.e. each component is a dot product v·row_i, where row_i = (s_i1 s_i2 s_i3 s_i4) represents the i-th row of data of the projection matrix S;
if the vertex v' is located within the viewing section, the following first inequality should be satisfied:
-w' < x' < w'
-w' < y' < w'
-w' < z' < w'
obtaining an inequality corresponding to each intercepting plane according to a first inequality of the viewing section and the position relation of six intercepting planes in the viewing section, so as to calculate a plane equation coefficient of each intercepting plane, and obtain a plane equation expression of each intercepting plane;
judging the position relation between the point cloud of the camera and each intercepting plane of other cameras one by one according to the plane equation expression of each intercepting plane corresponding to each camera, and taking the point cloud on the inner side of all intercepting planes in any one camera as an overlapped point cloud area of the camera;
and removing the overlapping point cloud area of any one of the cameras comprising the overlapping point cloud area.
From the above description, by computing the plane equation expressions of the six intercepting planes of the view frustum and then judging the positional relationship between the point clouds of the other cameras and these six planes, it is determined whether a point of another camera's point cloud lies inside this camera's view frustum; if so, that point is considered overlapping, so the overlapping point cloud area is identified quickly and accurately.
Further, the plurality of checkerboards is four checkerboards.
From the foregoing, it can be seen that a preferred embodiment is provided.
Another embodiment of the present invention provides a computer readable storage medium having stored thereon a computer program which, when executed by a processor, implements the multi-camera based three-dimensional scene reconstruction method described above.
Referring to fig. 17, another embodiment of the present invention provides an electronic device, including a memory, a processor, and a computer program stored in the memory and capable of running on the processor, where the processor implements the multi-camera-based three-dimensional scene reconstruction method when executing the computer program.
With respect to the specific implementation procedure and corresponding effects of the three-dimensional scene reconstruction method contained in the computer program in the above two embodiments, reference may be made to the relevant description of the multi-camera-based three-dimensional scene reconstruction method in the previous embodiments.
The following embodiments address arranging a plurality of cameras within a scene to reconstruct the scene in three dimensions, for example when an AR (Augmented Reality) system needs to reconstruct a real scene in three dimensions. The details are as follows:
referring to fig. 1 to 16, a first embodiment of the present invention is as follows:
the three-dimensional scene reconstruction method based on multiple cameras, as shown in fig. 1, comprises the following steps:
S1, acquiring a preliminary position relationship calibration matrix among a plurality of cameras;
in this embodiment, as shown in fig. 2, step S1 specifically includes the following steps:
S11, acquiring first color images shot by each camera of a checkerboard, and obtaining the corner point coordinates of the checkerboard in each first color image;
In this embodiment, as shown in fig. 4, a plurality of first color images containing the checkerboard may be acquired simultaneously in different poses; for example, in this embodiment the number of first color images is about 100. The corner point coordinates of the checkerboard are then found with an existing checkerboard detection algorithm.
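As an illustration, a minimal sketch of this corner detection step using OpenCV (the use of OpenCV and the 9×6 pattern size are assumptions for illustration, not taken from the patent):

```python
import cv2

def find_checkerboard_corners(color_image, pattern_size=(9, 6)):
    """Detect the inner-corner coordinates of a checkerboard in a color image."""
    gray = cv2.cvtColor(color_image, cv2.COLOR_BGR2GRAY)
    found, corners = cv2.findChessboardCorners(gray, pattern_size)
    if not found:
        return None
    # Refine the corner locations to sub-pixel accuracy
    criteria = (cv2.TERM_CRITERIA_EPS + cv2.TERM_CRITERIA_MAX_ITER, 30, 1e-3)
    corners = cv2.cornerSubPix(gray, corners, (11, 11), (-1, -1), criteria)
    return corners.reshape(-1, 2)
```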
S12, matching each corner coordinate of the first color images of different cameras according to the epipolar geometry constraint principle to obtain a corner matching relationship among a plurality of cameras;
In the present embodiment, as shown in FIG. 10, I_1 and I_2 are the image planes of the color images of the two cameras, whose camera centers are O_1 and O_2. P_1 is the projection point on I_1 of a corner point P on the checkerboard, and likewise P_2 is the projection point of the same corner point on I_2. The plane defined by the three points O_1, O_2 and P is the epipolar plane. The line O_1O_2 intersects the image planes I_1 and I_2 at e_1 and e_2 respectively; e_1 and e_2 are called the epipoles, and O_1O_2 is called the baseline. The intersection lines l_1 and l_2 of the epipolar plane with the two image planes I_1 and I_2 are the epipolar lines. Since the projections of the same corner point on the checkerboard trace back to the same point P, it can be judged whether the corner points identified by the respective cameras are the same corner point, giving the corner point pairing relationship among the cameras.
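A minimal sketch of pairing corners across the two views by this epipolar constraint, assuming the fundamental matrix F between the two cameras is already known (F, the corner arrays and the pixel threshold are illustrative assumptions):

```python
import numpy as np

def pair_corners(corners1, corners2, F, max_dist=2.0):
    """Pair corners across two views via the epipolar constraint x2^T F x1 = 0.

    corners1, corners2: (N, 2) pixel coordinates from the two cameras.
    F: 3x3 fundamental matrix mapping view-1 points to epipolar lines in view 2.
    """
    pairs = []
    for i, (x1, y1) in enumerate(corners1):
        a, b, c = F @ np.array([x1, y1, 1.0])   # epipolar line ax + by + c = 0 in view 2
        norm = np.hypot(a, b)
        # Distance from each view-2 corner to the epipolar line
        dists = np.abs(a * corners2[:, 0] + b * corners2[:, 1] + c) / norm
        j = int(np.argmin(dists))
        if dists[j] < max_dist:
            pairs.append((i, j))
    return pairs
```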
S13, obtaining a preliminary position relation calibration matrix among the cameras according to the corner pairing relation among the cameras.
In this embodiment, two cameras are selected to collect images of a scene under different viewing angles, and in other equivalent embodiments, three or four cameras may be selected to collect images.
S2, acquiring color images and depth images which are respectively shot by a plurality of cameras on a scene to be reconstructed, wherein the scene to be reconstructed is provided with a plurality of checkerboards, and acquiring a point cloud calibration matrix among the plurality of cameras according to the color images and the depth images of the plurality of cameras and a preliminary position relationship calibration matrix among the plurality of cameras;
and S3, splicing the point clouds of the cameras according to the point cloud calibration matrix among the cameras to obtain a reconstructed three-dimensional scene.
Referring to fig. 1 to 16, a second embodiment of the present invention is as follows:
based on the first embodiment, as shown in fig. 3, the step S2 specifically includes the following steps:
S21, acquiring second color images and depth images which are respectively shot by a plurality of cameras of a scene to be reconstructed, wherein the scene to be reconstructed is provided with a plurality of checkerboards;
in this embodiment, as shown in fig. 5, four checkerboards are disposed in the scene to be reconstructed. Thus, two cameras need to capture a color image comprising four checkerboards and a depth image, wherein the depth image corresponds to a point cloud. In other equivalent embodiments, two, three or five or more checkerboards may be selected.
S22, finding each checkerboard in each second color image, and obtaining the center point coordinate of each checkerboard in each second color image to obtain the interested area of the second color image of each camera;
the step S22 specifically includes the following steps:
S221, searching the checkerboard in each second color image, and obtaining the corner point coordinates of the searched checkerboard to obtain the center point coordinates of the searched checkerboard;
In this embodiment, the checkerboards are searched with the existing checkerboard detection algorithm; first one checkerboard is identified, for example the lower-right checkerboard in fig. 5 in this embodiment, and the center point coordinate of that checkerboard is obtained.
S222, masking the searched checkerboard, and then continuing to search other checkerboards in the second color image and obtain the center point coordinates of the other checkerboards until the center point coordinates of each checkerboard in each second color image are obtained;
the mask is a binary image composed of 0 and 1, and as shown in fig. 6, the lower right checkerboard is masked, i.e. the lower right checkerboard is masked by using pixel value 0 instead. Thus, the coordinates of the center points of the four checkerboards can be obtained.
S223, polar angle ordering is carried out on the central point coordinates of all the chessboards in each second color image, and then the central point coordinates of the chessboards are sequentially connected according to the polar angle ordering result, so that a closed area is obtained, namely the interested area of the second color image of each camera.
Polar angle sorting takes a fixed point in the plane, called the pole, and a ray drawn from it, called the polar axis, chooses a positive direction for length and angle, and then orders the points of the plane by the angle each point subtends at the pole. Corresponding to this embodiment, as shown in fig. 7, the center point coordinates of the four checkerboards are connected in this order to obtain a rectangle, namely the region of interest.
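A minimal sketch of the polar angle sorting; taking the centroid of the center points as the pole is an assumption for illustration:

```python
import numpy as np

def sort_by_polar_angle(points):
    """Order 2D points counter-clockwise around their centroid (the pole)."""
    pole = points.mean(axis=0)
    angles = np.arctan2(points[:, 1] - pole[1], points[:, 0] - pole[0])
    return points[np.argsort(angles)]

# Connecting the sorted centers in sequence yields the closed region of interest, e.g.:
# roi_polygon = sort_by_polar_angle(find_all_checkerboard_centers(color_image))
```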
S23, obtaining the interested region of the point cloud of each camera according to a preset position relation matrix between the second color image and the depth image in each camera and the interested region of the second color image;
the RGBD cameras capable of acquiring the color image and the depth image store a preset positional relationship matrix between the color image and the depth image in advance, so that a region of interest of a point cloud of each camera is obtained, as shown in fig. 8.
S24, performing preliminary registration on the point clouds of all cameras according to the preliminary position relation calibration matrix among the cameras and the interested areas of the point clouds of each camera to obtain a point cloud preliminary calibration matrix of all cameras;
the effect obtained after the point clouds of the two cameras are initially registered is shown in fig. 9.
And S25, performing iterative operation on the point cloud preliminary calibration matrixes of all cameras to obtain a point cloud final calibration matrix of all cameras.
In this embodiment, the iterative operation uses the iterative closest point algorithm, whose mathematical form is:
T* = argmin_T error(T(P), Q)
where P denotes the point set of point cloud A and Q the point set of point cloud B; fine registration converts point cloud A into the coordinate system of point cloud B so that error(T(P), Q) is minimal. Thus, step S25 specifically includes the steps of:
the point cloud A set and the point cloud B set of the two cameras are taken as a set P and a set Q respectively, and the matching relationship M between the set P and the set Q is obtained as:
M = match(P, Q) = {(p, q) : p ∈ P, q ∈ Q}
the error between the set P and the set Q is defined as:
error(P, Q) = Σ_((p,q)∈M) w(p,q)·d(p,q)
where w(p, q) represents the weight of a corresponding point pair in the set P and the set Q, and d(p, q) represents the distance between the corresponding points in the set P and the set Q;
the point cloud preliminary calibration matrix is refined by iterative operation through the iterative calculation formula
T_(i+1) = ΔT_i · T_i, with T_0 = T_init
where i and i+1 represent the number of iterations, ΔT_i is the iterative integration of transforms estimated at iteration i, and T_init is the point cloud preliminary calibration matrix; iteration yields the point cloud final calibration matrix of all cameras.
Thereby, a fine registration of the point clouds of the two cameras is completed.
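A minimal sketch of this coarse-to-fine refinement using the Open3D library (the library choice is an assumption; the patent does not name one), with the point cloud preliminary calibration matrix as the initial transform:

```python
import open3d as o3d

def refine_calibration(cloud_a, cloud_b, T_init, max_dist=0.02, iters=50):
    """Refine the preliminary calibration matrix by point-to-point ICP."""
    pa = o3d.geometry.PointCloud(o3d.utility.Vector3dVector(cloud_a))
    pb = o3d.geometry.PointCloud(o3d.utility.Vector3dVector(cloud_b))
    result = o3d.pipelines.registration.registration_icp(
        pa, pb, max_dist, T_init,
        o3d.pipelines.registration.TransformationEstimationPointToPoint(),
        o3d.pipelines.registration.ICPConvergenceCriteria(max_iteration=iters))
    return result.transformation   # the point cloud final calibration matrix
```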
Referring to fig. 1 to 16, a third embodiment of the present invention is as follows:
based on the first embodiment, the step S3 specifically includes the following steps:
S31, calculating the overlapping point cloud area between the plurality of cameras according to the point cloud calibration matrix between the plurality of cameras, and removing the overlapping point cloud area of one of the cameras;
in this embodiment, the step S31 specifically includes the following steps:
S311, the visual field of the camera is equivalent to a view cone, and objects in the visual field of the camera are cut by using a near plane and a far plane to form a viewing section, wherein the viewing section comprises a left face, a right face, a top face, a bottom face, a near plane and a far plane;
Fig. 11 shows a viewing section comprising the above six intercepting planes, and fig. 12 shows the XZ-plane cross section of the viewing section.
S312, the camera is located at the origin of the world coordinate system, the line of sight of the camera points along the positive direction of the Z axis, and the view matrix of the camera is set as an identity matrix; a vertex is expressed in homogeneous coordinates as v = (x y z w=1)^T, and the 4×4 projection matrix S is expressed as S = (s_kl);
S313, transforming the vertex by the projection matrix S gives v' = (x' y' z' w')^T = S·v, i.e. each component is a dot product v·row_i, where row_i = (s_i1 s_i2 s_i3 s_i4) represents the i-th row of data of the projection matrix S; after this transformation the vertex of the world coordinate system is converted into homogeneous clip coordinates;
S314, if the vertex v' is located within the viewing section, the following first inequality should be satisfied:
-w' < x' < w'
-w' < y' < w'
-w' < z' < w'
S315, obtaining the inequality corresponding to each intercepting plane according to the first inequality of the viewing section and the positional relationship of the six intercepting planes in the viewing section, thereby calculating the plane equation coefficients of each intercepting plane and obtaining the plane equation expression of each intercepting plane;
for example, assume that x' satisfies the inequality:
-w' < x'
From the above transformation this is:
-(v·row_4) < (v·row_1)
that is:
0 < (v·row_4) + (v·row_1)
which finally gives:
0 < v·(row_4 + row_1)
The plane equation of the left intercepting plane can therefore be expressed as:
x(s_41 + s_11) + y(s_42 + s_12) + z(s_43 + s_13) + w(s_44 + s_14) = 0
Since w = 1, this becomes:
x(s_41 + s_11) + y(s_42 + s_12) + z(s_43 + s_13) + s_44 + s_14 = 0
The standard plane equation is:
ax + by + cz + d = 0
so the plane equation coefficients of the left intercepting plane are:
a = s_41 + s_11, b = s_42 + s_12, c = s_43 + s_13, d = s_44 + s_14
Repeating the above steps with the inequality corresponding to each of the other intercepting planes gives the plane equation coefficients of every intercepting plane, as shown in Table 1:
TABLE 1 Plane equation coefficients of the intercepting planes
Left: a = s_41 + s_11, b = s_42 + s_12, c = s_43 + s_13, d = s_44 + s_14
Right: a = s_41 - s_11, b = s_42 - s_12, c = s_43 - s_13, d = s_44 - s_14
Bottom: a = s_41 + s_21, b = s_42 + s_22, c = s_43 + s_23, d = s_44 + s_24
Top: a = s_41 - s_21, b = s_42 - s_22, c = s_43 - s_23, d = s_44 - s_24
Near: a = s_41 + s_31, b = s_42 + s_32, c = s_43 + s_33, d = s_44 + s_34
Far: a = s_41 - s_31, b = s_42 - s_32, c = s_43 - s_33, d = s_44 - s_34
From this, the plane equation expressions of the six intercepting planes are obtained.
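A minimal sketch of extracting the six plane coefficients from a 4×4 projection matrix by these row combinations (a sketch of the convention derived above, not code from the patent):

```python
import numpy as np

def frustum_planes(S):
    """Return the six intercepting planes of projection matrix S as rows (a, b, c, d)."""
    planes = np.array([
        S[3] + S[0],   # left:   row4 + row1
        S[3] - S[0],   # right:  row4 - row1
        S[3] + S[1],   # bottom: row4 + row2
        S[3] - S[1],   # top:    row4 - row2
        S[3] + S[2],   # near:   row4 + row3
        S[3] - S[2],   # far:    row4 - row3
    ])
    # Normalize so (a, b, c) is a unit normal and d is a true distance
    return planes / np.linalg.norm(planes[:, :3], axis=1, keepdims=True)
```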
S316, judging the position relation between the point clouds of the cameras and each intercepting plane of other cameras one by one according to the plane equation expression of each intercepting plane corresponding to each camera, and taking the point clouds inside all intercepting planes in any one camera as overlapping point cloud areas of the cameras;
Viewing the viewing section in cross section, the relationship between an intercepting plane and a point is as shown in fig. 13. A plane can be defined by a point P on it together with its normal vector n (taken here to point toward the interior of the viewing section): any point P' on the plane satisfies
n·(P' - P) = 0
Then, for any given point Q:
n·(Q - P) < 0: the point Q is outside the clipping plane;
n·(Q - P) = 0: the point Q is on the clipping plane;
n·(Q - P) > 0: the point Q is inside the clipping plane;
thereby, an overlapping point cloud area of the two cameras is obtained.
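A minimal sketch of this inside-test over all six intercepting planes, paired with the frustum_planes helper above; with the planes extracted there, points inside the viewing section give non-negative values of ax + by + cz + d, matching the inward-pointing normal convention used above:

```python
import numpy as np

def overlapping_points(points, planes):
    """Mark the points lying inside all six intercepting planes of the other camera.

    points: (N, 3) point cloud already transformed into the other camera's frame
            by the point cloud calibration matrix.
    planes: (6, 4) plane coefficients from frustum_planes().
    """
    homog = np.hstack([points, np.ones((len(points), 1))])
    signed = homog @ planes.T                  # signed distance to each plane
    return np.all(signed >= 0, axis=1)         # inside every plane -> overlap region

# Removing the overlap from one camera before splicing, e.g.:
# cloud_kept = cloud[~overlapping_points(cloud_in_other_frame, planes)]
```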
And S317, removing the overlapping point cloud area of any one of the cameras comprising the overlapping point cloud area.
That is, fig. 14 is a point cloud of one of the cameras, and fig. 15 is obtained after removing the overlapping point cloud area.
And S32, splicing the point clouds of all cameras after the overlapping point cloud areas are removed according to the point cloud calibration matrix among the cameras, and obtaining the reconstructed three-dimensional scene.
The fourth embodiment of the invention is as follows:
another embodiment of the present invention provides a computer readable storage medium having stored thereon a computer program which, when executed by a processor, implements the multi-camera-based three-dimensional scene reconstruction method of any one of the above embodiments one to three.
Referring to fig. 17, a fifth embodiment of the present invention is as follows:
an electronic device 1 comprises a memory 3, a processor 2 and a computer program stored on the memory 3 and executable on the processor 2, the processor 2 implementing the multi-camera based three-dimensional scene reconstruction method of any of the above embodiments one to three when executing the computer program.
In summary, the multi-camera-based three-dimensional scene reconstruction method, storage medium and electronic device provided by the invention adopt an existing checkerboard detection algorithm to identify the checkerboards and pair the corner point coordinates of the identified checkerboards according to the epipolar geometry constraint principle, obtaining the preliminary position relationship calibration matrix among the plurality of cameras. After the regions of interest of the color images are obtained through the plurality of checkerboards, the region of interest of the point cloud of each camera is obtained according to the positional relationship matrix between the color image and the depth image; preliminary registration of the point clouds is achieved with the preliminary position relationship calibration matrix, and fine registration is then performed by the iterative closest point algorithm, yielding a more accurate point cloud final calibration matrix. Finally, according to the point cloud calibration matrix among the cameras, the overlapping point cloud areas are removed first and the point clouds of the cameras are then spliced to obtain the reconstructed three-dimensional scene. That is, with the cameras fixed, the point clouds of the plurality of cameras are registered and spliced by a machine vision method to obtain the reconstructed three-dimensional scene, so that three-dimensional scene reconstruction is achieved better and faster while saving time and labor and remaining broadly applicable.
The foregoing description is only illustrative of the present invention and is not intended to limit the scope of the invention, and all equivalent changes made by the specification and drawings of the present invention, or direct or indirect application in the relevant art, are included in the scope of the present invention.

Claims (9)

1. The three-dimensional scene reconstruction method based on the multiple cameras is characterized by comprising the following steps of:
acquiring a preliminary position relationship calibration matrix among a plurality of cameras;
acquiring color images and depth images which are respectively shot by a plurality of cameras on a scene to be reconstructed, wherein the scene to be reconstructed is provided with a plurality of checkerboards, and acquiring a point cloud calibration matrix among the plurality of cameras according to the color images and the depth images of the plurality of cameras and a preliminary position relationship calibration matrix among the plurality of cameras;
splicing the point clouds of the cameras according to the point cloud calibration matrixes among the cameras to obtain a reconstructed three-dimensional scene;
the method for obtaining the point cloud calibration matrix between the cameras comprises the following steps of:
acquiring second color images and depth images which are respectively shot by a plurality of cameras on a scene to be reconstructed, wherein the scene to be reconstructed is provided with a plurality of checkerboards;
searching each checkerboard in each second color image, and obtaining the center point coordinate of each checkerboard in each second color image to obtain the interested area of the second color image of each camera;
obtaining a region of interest of a point cloud of each camera according to a preset position relation matrix between the second color image and the depth image in each camera and the region of interest of the second color image;
performing preliminary registration on the point clouds of all cameras according to the preliminary position relation calibration matrix among the cameras and the interested area of the point clouds of each camera to obtain the point cloud preliminary calibration matrix of all the cameras;
and performing iterative operation on the point cloud preliminary calibration matrixes of all cameras to obtain the point cloud final calibration matrixes of all cameras.
2. The multi-camera based three-dimensional scene reconstruction method according to claim 1, wherein the obtaining the preliminary positional relationship calibration matrix between the plurality of cameras specifically comprises the steps of:
acquiring a first color image shot by each camera of a checkerboard, and obtaining the corner point coordinates of the checkerboard in each first color image;
pairing the corner point coordinates of the first color images of different cameras according to the epipolar geometry constraint principle to obtain the corner point pairing relationship among the plurality of cameras;
and obtaining the preliminary position relationship calibration matrix among the plurality of cameras according to the corner point pairing relationship among the plurality of cameras.
3. The multi-camera based three-dimensional scene reconstruction method according to claim 1, wherein said finding each checkerboard in each of said second color images and obtaining the center point coordinates of each checkerboard in each of said second color images to obtain the region of interest of the second color image of each of said cameras specifically comprises the steps of:
searching the checkerboard in each second color image, and obtaining the corner coordinates of the searched checkerboard to obtain the center point coordinates of the searched checkerboard;
masking the searched checkerboard, and then continuing to search other checkerboards in the second color image and obtain the central point coordinates of the other checkerboards until the central point coordinates of each checkerboard in each second color image are obtained;
performing polar angle sorting on the center point coordinates of all the checkerboards in each second color image, and then connecting the center point coordinates of the checkerboards in sequence according to the polar angle sorting result to obtain a closed area, namely the region of interest of the second color image of each camera.
4. The three-dimensional scene reconstruction method based on multiple cameras as set forth in claim 1, wherein the number of cameras is two, the iterative operation is performed on the point cloud preliminary calibration matrices of all the cameras, and obtaining the point cloud final calibration matrices of all the cameras includes the following steps:
the point cloud A set and the point cloud B set of the two cameras are taken as a set P and a set Q respectively, and the matching relationship M between the set P and the set Q is obtained as:
M = match(P, Q) = {(p, q) : p ∈ P, q ∈ Q}
the error between the set P and the set Q is defined as:
error(P, Q) = Σ_((p,q)∈M) w(p,q)·d(p,q)
where w(p, q) represents the weight of a corresponding point pair in the set P and the set Q, and d(p, q) represents the distance between the corresponding points in the set P and the set Q;
performing iterative operation on the point cloud preliminary calibration matrix through an iterative calculation formula to obtain the point cloud final calibration matrix of all cameras, the iterative calculation formula being:
T_(i+1) = ΔT_i · T_i, with T_0 = T_init
where i and i+1 represent the number of iterations, ΔT_i is the iterative integration of transforms estimated at iteration i, and T_init is the point cloud preliminary calibration matrix.
5. The multi-camera-based three-dimensional scene reconstruction method according to claim 1, wherein the stitching the point clouds of the cameras according to the point cloud calibration matrix between the cameras, to obtain the reconstructed three-dimensional scene specifically comprises the following steps:
calculating overlapping point cloud areas among a plurality of cameras according to point cloud calibration matrixes among the plurality of cameras, and removing the overlapping point cloud areas of one camera;
and splicing the point clouds of all cameras after the overlapping point cloud areas are removed according to the point cloud calibration matrixes among the cameras, so as to obtain a reconstructed three-dimensional scene.
6. The method for reconstructing a three-dimensional scene based on multiple cameras according to claim 5, wherein calculating overlapping point cloud areas between multiple cameras according to a point cloud calibration matrix between the multiple cameras, and removing the overlapping point cloud areas of one of the cameras specifically comprises the following steps:
the visual field of the camera is equivalent to a visual cone, and objects in the visual field of the camera are cut by using a near plane and a far plane to form a visual section, wherein the visual section comprises a left face, a right face, a top face, a bottom face, a near plane and a far plane;
the camera is located at the origin of the world coordinate system, the line of sight of the camera points along the positive direction of the Z axis, and the view matrix of the camera is set as an identity matrix; a vertex is expressed in homogeneous coordinates as v = (x y z w=1)^T, and the 4×4 projection matrix S is expressed as S = (s_kl);
transforming the vertex by the projection matrix S gives v' = (x' y' z' w')^T = S·v, i.e. each component is a dot product v·row_i, where row_i = (s_i1 s_i2 s_i3 s_i4) represents the i-th row of data of the projection matrix S;
if the vertex v' is located within the viewing section, the following first inequality should be satisfied:
-w' < x' < w'
-w' < y' < w'
-w' < z' < w'
obtaining an inequality corresponding to each intercepting plane according to a first inequality of the viewing section and the position relation of six intercepting planes in the viewing section, so as to calculate a plane equation coefficient of each intercepting plane, and obtain a plane equation expression of each intercepting plane;
judging the position relation between the point cloud of the camera and each intercepting plane of other cameras one by one according to the plane equation expression of each intercepting plane corresponding to each camera, and taking the point cloud on the inner side of all intercepting planes in any one camera as an overlapped point cloud area of the camera;
and removing the overlapping point cloud area of any one of the cameras comprising the overlapping point cloud area.
7. The multi-camera based three-dimensional scene reconstruction method according to any of claims 1 to 6, wherein the plurality of checkerboards is four checkerboards.
8. A computer-readable storage medium having stored thereon a computer program, characterized by: the computer program, when executed by a processor, implements the three-dimensional scene reconstruction method as claimed in any one of claims 1 to 7.
9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the three-dimensional scene reconstruction method according to any of claims 1-7 when the computer program is executed by the processor.
CN202011281518.9A 2020-11-16 2020-11-16 Three-dimensional scene reconstruction method based on multiple cameras, storage medium and electronic equipment Active CN112381886B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011281518.9A CN112381886B (en) 2020-11-16 2020-11-16 Three-dimensional scene reconstruction method based on multiple cameras, storage medium and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011281518.9A CN112381886B (en) 2020-11-16 2020-11-16 Three-dimensional scene reconstruction method based on multiple cameras, storage medium and electronic equipment

Publications (2)

Publication Number Publication Date
CN112381886A CN112381886A (en) 2021-02-19
CN112381886B true CN112381886B (en) 2024-04-09

Family

ID=74584818

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011281518.9A Active CN112381886B (en) 2020-11-16 2020-11-16 Three-dimensional scene reconstruction method based on multiple cameras, storage medium and electronic equipment

Country Status (1)

Country Link
CN (1) CN112381886B (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112927353B (en) * 2021-02-25 2023-05-19 电子科技大学 Three-dimensional scene reconstruction method, storage medium and terminal based on two-dimensional target detection and model alignment
CN113052917B (en) * 2021-03-29 2024-05-07 黑芝麻智能科技(上海)有限公司 Method for acquiring image coordinates of invisible position of camera, calibration method and system
CN113012238B (en) * 2021-04-09 2024-04-16 南京星顿医疗科技有限公司 Method for quick calibration and data fusion of multi-depth camera
CN113516694A (en) * 2021-04-13 2021-10-19 同济大学 Multi-Kinect-based dynamic visual control method for remote medical scene
CN113362445B (en) * 2021-05-25 2023-05-05 上海奥视达智能科技有限公司 Method and device for reconstructing object based on point cloud data
CN113504385B (en) * 2021-06-30 2023-07-14 安徽爱观视觉科技有限公司 Speed measuring method and device for plural cameras
CN114554176A (en) * 2022-01-24 2022-05-27 北京有竹居网络技术有限公司 Depth camera
CN117830439B (en) * 2024-03-05 2024-06-14 南昌虚拟现实研究院股份有限公司 Multi-camera system pose calibration method and device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109064404A (en) * 2018-08-10 2018-12-21 西安电子科技大学 It is a kind of based on polyphaser calibration panorama mosaic method, panoramic mosaic system
CN109242913A (en) * 2018-09-07 2019-01-18 百度在线网络技术(北京)有限公司 Scaling method, device, equipment and the medium of collector relative parameter
CN110223377A (en) * 2019-05-28 2019-09-10 上海工程技术大学 One kind being based on stereo visual system high accuracy three-dimensional method for reconstructing
CN111127625A (en) * 2019-10-08 2020-05-08 新拓三维技术(深圳)有限公司 Foot scanning method, system and device
CN111150175A (en) * 2019-12-05 2020-05-15 新拓三维技术(深圳)有限公司 Method, device and system for three-dimensional scanning of feet


Also Published As

Publication number Publication date
CN112381886A (en) 2021-02-19

Similar Documents

Publication Publication Date Title
CN112381886B (en) Three-dimensional scene reconstruction method based on multiple cameras, storage medium and electronic equipment
CN111783820B (en) Image labeling method and device
CN106503671B (en) The method and apparatus for determining human face posture
US20200005486A1 (en) Device pose estimation using 3d line clouds
CN110276317B (en) Object size detection method, object size detection device and mobile terminal
EP3428875A1 (en) Methods and apparatuses for panoramic image processing
US20110293142A1 (en) Method for recognizing objects in a set of images recorded by one or more cameras
CN114119864A (en) Positioning method and device based on three-dimensional reconstruction and point cloud matching
CN114863037A (en) Single-mobile-phone-based human body three-dimensional modeling data acquisition and reconstruction method and system
Khoshelham et al. Generation and weighting of 3D point correspondences for improved registration of RGB-D data
CN108028904B (en) Method and system for light field augmented reality/virtual reality on mobile devices
CN112085802A (en) Method for acquiring three-dimensional finger vein image based on binocular camera
CN109613974B (en) AR home experience method in large scene
CN113537047A (en) Obstacle detection method, obstacle detection device, vehicle and storage medium
CN107818596B (en) Scene parameter determination method and device and electronic equipment
Ventura et al. Structure and motion in urban environments using upright panoramas
CN117372647A (en) Rapid construction method and system of three-dimensional model for building
CN115953471A (en) Indoor scene multi-scale vector image retrieval and positioning method, system and medium
Deepu et al. 3D Reconstruction from Single 2D Image
CN107644394A (en) A kind of processing method and processing device of 3D rendering
CN109314774A (en) System and method for three-dimensional imaging
KR20160049639A (en) Stereoscopic image registration method based on a partial linear method
CN109341530B (en) Object point positioning method and system in binocular stereo vision
Liu et al. Improved template matching based stereo vision sparse 3D reconstruction algorithm
CN111210476A (en) Method and device for simultaneously positioning and establishing image

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant