CN112258635A

CN112258635A - Three-dimensional reconstruction method and device based on improved binocular matching SAD algorithm

Info

Publication number: CN112258635A
Application number: CN202011154642.9A
Authority: CN
Inventors: 刘学君; 常梦洁; 晏涌; 沙芸; 孔祥旻; 李柏灿; 龚泓博; 苏鹏; 黎杨
Original assignee: Beijing Institute of Petrochemical Technology
Current assignee: Beijing Institute of Petrochemical Technology
Priority date: 2020-10-26
Filing date: 2020-10-26
Publication date: 2021-01-22
Anticipated expiration: 2040-10-26
Also published as: CN112258635B

Abstract

The invention relates to the technical field of image processing, in particular to a three-dimensional reconstruction method and a three-dimensional reconstruction device based on an improved binocular matching SAD algorithm. The three-dimensional reconstruction method based on the improved binocular matching SAD algorithm comprises the following steps: acquiring an original picture; the original pictures comprise corresponding first pictures and second pictures; obtaining an initial disparity map through an improved SAD algorithm based on the original picture; the improved SAD algorithm is biased to the matching precision of central pixels of image blocks of two pictures in the process of generating an initial disparity map; and filling the holes of the initial disparity map to optimize the initial disparity map to obtain a final disparity map.

Description

Three-dimensional reconstruction method and device based on improved binocular matching SAD algorithm

Technical Field

The invention relates to the technical field of image processing, in particular to a three-dimensional reconstruction method and a three-dimensional reconstruction device based on an improved binocular matching SAD algorithm.

Background

The continuous increase of the production and use amount of dangerous chemicals puts higher requirements on the supervision of the storage safety of the dangerous chemicals. The three-dimensional reconstruction is carried out on the hazardous chemical substance warehouse, the environmental information around the hazardous chemical substance can be accurately acquired in real time, the visualization of the hazardous chemical substance warehouse environment is realized, and the real-time performance and the safety of monitoring and early warning are ensured. The binocular vision technology carries out three-dimensional reconstruction on the target based on the parallax principle, compared with the mode that a TOF camera, structured light, a laser radar and the like acquire scene depth information, the binocular vision technology is not limited by the material of the surface of an object, can acquire uniform and dense scene three-dimensional information, has low requirements on hardware and cost, and has a certain application prospect in the aspect of safety monitoring in a hazardous chemical library.

The effect and precision of three-dimensional reconstruction are directly determined by a matching algorithm, and the key step is to search homonymous points in two images, namely to perform stereo matching. The Sum of Absolute Differences (SAD) algorithm is a classic block matching algorithm, is used as a typical algorithm of a real-time system, is low in complexity, high in speed and good in real-time performance, and an obtained dense disparity map can be used for dense three-dimensional reconstruction. However, when a traditional SAD algorithm is applied to a complex scene, a certain error is generated by directly summing the absolute values of the differences between corresponding numerical values of two pixel blocks, the accuracy of the obtained result is insufficient, and a parallax image has a void due to the change of parallax in a matching window, so that the reconstructed three-dimensional model is affected.

Disclosure of Invention

In view of this, the present application provides a three-dimensional reconstruction method and apparatus based on an improved binocular matching SAD algorithm to solve the problems in the related art.

The invention adopts the following technical scheme:

in a first aspect, an embodiment of the present invention provides a three-dimensional reconstruction method based on an improved binocular matching SAD algorithm, including:

acquiring an original picture; the original pictures comprise corresponding first pictures and second pictures;

obtaining an initial disparity map through an improved SAD algorithm based on the original picture; the improved SAD algorithm is biased to the matching precision of central pixels of image blocks of two pictures in the process of generating an initial disparity map;

and filling the holes of the initial disparity map to optimize the initial disparity map to obtain a final disparity map.

Optionally, the improved SAD algorithm is to introduce a weighting coefficient of two-dimensional gaussian distribution with a zero mean value on the basis of the original SAD algorithm, so that when matching is performed, the matching precision of the central pixel of the image block is emphasized.

Optionally, the obtaining an initial disparity map based on the original picture by using a modified SAD algorithm includes:

and solving the template image block in the first picture and the search image block in the second picture, and measuring whether the two image blocks are matched or not based on the sum of the absolute values of the pixel value differences introduced with the two-dimensional Gaussian weighting coefficients so as to obtain an initial disparity map.

Optionally, the filling the hole of the initial disparity map by using a hole filling method includes:

determining holes existing in the initial disparity map;

and aiming at the pixel points with the holes in the disparity map, selecting a window by using texture information measured by GLCM (global Gamut-nearest neighbor) to fill the holes.

Optionally, the selecting a window by using texture information measured by GLCM to fill the hole includes:

measuring areas with different sizes of textures around the cavity by introducing a gray level co-occurrence matrix to obtain Euclidean distances between adjacent areas;

selecting a region with the minimum corresponding Euclidean distance as a target region;

and (4) solving the parallax average value of the target area for filling the hole.

Optionally, the introducing a gray level co-occurrence matrix to measure different sizes of regions of the texture around the cavity to obtain the euclidean distance includes:

sequentially selecting areas with different sizes;

selecting an angular second moment, entropy, contrast and correlation to form a four-dimensional feature vector to comprehensively describe the texture attribute of the region in the image in the selected region;

and determining Euclidean distances between adjacent regions based on the four-dimensional feature vectors.

Optionally, the adjacent areas are: and arranging the areas with different sizes according to the sizes of the areas, wherein the areas with the adjacent size relationship are adjacent areas.

In a second aspect, the present application provides a three-dimensional reconstruction apparatus based on an improved binocular matching SAD algorithm, including:

the acquisition module is used for acquiring an original picture; the original pictures comprise corresponding first pictures and second pictures;

the initial disparity map generation module is used for obtaining an initial disparity map through an improved SAD algorithm based on the original picture; the improved SAD algorithm is biased to the matching precision of central pixels of image blocks of two pictures in the process of generating an initial disparity map;

and the filling module is used for filling the holes of the initial disparity map so as to optimize the initial disparity map.

In a third aspect, the present application provides a three-dimensional reconstruction device based on an improved binocular matching SAD algorithm, including:

a processor, and a memory coupled to the processor;

the memory is configured to store a computer program for performing at least the method for three-dimensional reconstruction based on the improved binocular matching SAD algorithm according to the first aspect of the present application;

the processor is used for calling and executing the computer program in the memory.

In a fourth aspect, the present application provides a storage medium storing a computer program that, when executed by a processor, implements a three-dimensional reconstruction method based on an improved binocular matching SAD algorithm according to the first aspect of the present application.

According to the technical scheme, the first picture and the second picture are shot through the preset camera device; after the original picture is obtained; obtaining an initial disparity map through an improved SAD algorithm based on the original picture; the improved SAD algorithm is biased to the matching precision of central pixels of image blocks of two pictures in the process of generating an initial disparity map; it should be noted that, based on the flatness of the existing camera device, the matching accuracy of the central pixels of the image blocks of the two pictures is emphasized, which is in accordance with objective practice, and the overall matching accuracy can be effectively improved; filling holes of the initial disparity map so as to optimize the initial disparity map. By the arrangement, the improved SAD algorithm is used in the algorithm, a better matching result is kept in the parallax smoothing process, the texture information is used for filling the holes in the algorithm, the hole area can be well reduced, the result is superior to that of the traditional SAD algorithm, and the algorithm has better practicability.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.

Fig. 1 is a flowchart of a three-dimensional reconstruction method based on an improved binocular matching SAD algorithm according to an embodiment of the present invention;

FIG. 2 is a schematic diagram of a binocular vision model provided by an embodiment of the present invention;

FIG. 3 is a schematic diagram of a gray level co-occurrence matrix generation process according to an embodiment of the present invention;

fig. 4 is a flowchart of a three-dimensional reconstruction method based on an improved binocular matching SAD algorithm according to an embodiment of the present invention

Fig. 5 is a schematic structural diagram of a three-dimensional reconstruction device based on an improved binocular matching SAD algorithm according to an embodiment of the present invention.

Fig. 6 is a schematic structural diagram of a three-dimensional reconstruction device based on an improved binocular matching SAD algorithm according to an embodiment of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the technical solutions of the present invention will be described in detail below. It is to be understood that the described embodiments are merely exemplary of the invention, and not restrictive of the full scope of the invention. All other embodiments, which can be derived by a person skilled in the art from the examples given herein without any inventive step, are within the scope of the present invention.

First, an application scenario of the embodiment of the invention is explained, and the continuous increase of the production and use amount of dangerous chemicals puts higher requirements on the supervision of the storage safety of the dangerous chemicals. The three-dimensional reconstruction is carried out on the hazardous chemical substance warehouse, the environmental information around the hazardous chemical substance can be accurately acquired in real time, the visualization of the hazardous chemical substance warehouse environment is realized, and the real-time performance and the safety of monitoring and early warning are ensured. The binocular vision technology carries out three-dimensional reconstruction on the target based on the parallax principle, compared with the mode that a TOF camera, structured light, a laser radar and the like acquire scene depth information, the binocular vision technology is not limited by the material of the surface of an object, can acquire uniform and dense scene three-dimensional information, has low requirements on hardware and cost, and has certain application prospect in the aspect of safety monitoring in a hazardous chemical library.

Examples

Fig. 1 is a flowchart of a three-dimensional reconstruction method based on an improved binocular matching SAD algorithm according to an embodiment of the present invention, which may be performed by the three-dimensional reconstruction apparatus based on the improved binocular matching SAD algorithm according to the embodiment of the present invention. Referring to fig. 1, the method may specifically include the following steps:

s101, acquiring an original picture; the original pictures comprise corresponding first pictures and second pictures;

the binocular vision technology is used for three-dimensional reconstruction of a target based on the parallax principle, and compared with the mode of acquiring scene depth information by a TOF camera, structured light, a laser radar and the like, the binocular vision technology is not limited by the material of the surface of an object, can acquire uniform and dense scene three-dimensional information, has low requirements on hardware and cost, and has certain application prospect in the aspect of safety monitoring in a hazardous chemical library.

The binocular stereoscopic vision principle is as follows: the binocular stereo vision imaging model is generally two cameras arranged in parallel, as shown in FIG. 2, two-phase camera optical center O_lAnd O_rThe distance between the cameras is a base line distance b, and the coordinates of a point p in the space at the left camera and the right camera are respectively p_l(x_l,y_l) And p_r(x_r,y_r)The parallax d is defined as the difference in the position of a point in the two imaging planes: d ═ x_l-x_r。

From the binocular vision model, Delta pO_lO_r∽△pp_lp_rAccording to the principle of triangle similarity, the following results are obtained:

wherein Z_CThe depth of point b is shown, f is the focal length of the camera, and the parallax value d and the depth value Z can be obtained by the formula (1)_CInversely proportional, as the disparity value d is smaller, the depth value Z_CThe larger the distance the point is from the camera. Conversely, it indicates that the closer the point is to the camera. In practical situations, two cameras cannot be strictly arranged side by side, and y needs to be corrected through an external polar line_l＝y_rTherefore, the searching dimension is reduced from two dimensions to one dimension, the matching complexity is reduced, and the matching efficiency is improved. In step S101, the acquired picture is a picture taken by an image pickup device corresponding to positions of two eyes in the binocular vision model as shown in fig. 2.

S102, obtaining an initial disparity map through an improved SAD algorithm based on the original picture; the improved SAD algorithm is biased to the matching precision of central pixels of image blocks of two pictures in the process of generating an initial disparity map;

note that SAD is a block matching algorithm based on gray scale in image stereo matching. The principle of the method is to calculate the sum of absolute values of pixel value differences in a template image block and a search image block, and further measure whether the two image blocks are matched, and the definition is as follows:

wherein d is the parallax value, P_L(i, j) represents the gray level of the pixel in the template image block, P_R(i, j) represents the gray scale value of the pixel within the search image block. The smaller the SAD matrix value calculated using equation (2), the more similar the two image blocks are.

For a complex scene, matching image blocks by using the similarity of pixel point difference values in two pixel blocks, wherein once similar pixel values appear in pixel points of continuous image blocks in the scene, errors are inevitably generated in matching. In order to improve the matching precision of two image blocks, aiming at the SAD target function, a weighting coefficient omega (i, j) with zero mean value and two-dimensional Gaussian distribution is introduced:

where σ 2 is a variance, σ 2 is 0.5n, and n is a block radius.

The two-dimensional Gaussian distribution is popularized from one-dimensional normal distribution to two-dimensional distribution. As shown in fig. 2, the projections of the two-dimensional gaussian distribution with zero mean on the XOZ plane and the YOZ plane are both a standard normal distribution, and the projection on the XOY plane is an ellipse.

Combining equation (3) and equation (2) results in an improved SAD algorithm, as shown in equation (5):

the improved SAD after Gaussian distribution is introduced is more focused on the matching precision of the central pixel of the image block, and the method accords with objective practice and can effectively improve the overall matching precision. The Gaussian weights introduced by the algorithm can keep a better matching result in the parallax smoothing process.

S103, filling the holes of the initial disparity map to optimize the initial disparity map.

Specifically, after the weighted SAD algorithm is used to obtain an initial disparity map, it is found that when the disparity of the pixels in the matching block changes, a hole appears in the disparity map. In order to fill the cavity, a Gray-level Co-occurrence Matrix (GLCM) is introduced into the scheme provided by the application to measure the texture area around the cavity, and then the appropriate window size is selected to obtain the parallax average value to fill the cavity.

Wherein filling the holes of the initial disparity map fills the holes, including:

determining holes existing in the initial disparity map;

The method comprises the following steps: "selecting a window for hole filling using texture information measured by GLCM" includes:

measuring areas with different sizes of textures around the cavity by introducing a gray level co-occurrence matrix to obtain Euclidean distances between adjacent areas; wherein the adjacent regions are: and arranging the areas with different sizes according to the sizes of the areas, wherein the areas with the adjacent size relationship are adjacent areas.

Specifically, areas with different sizes are sequentially selected; selecting an angular second moment, entropy, contrast and correlation to form a four-dimensional feature vector to comprehensively describe the texture attribute of the region in the image in the selected region;

Step S103 is further described below with reference to fig. 3 and 4:

GLCM describes the texture of an image by studying the spatially dependent properties of the gray levels. GLCM is denoted by G (i, j) (i, j ═ 0,1,2, … L-1), where L denotes the gray level of the image, i and j denote the gray level of the pixels, d denotes the pitch of the two pixels, and n denotes the window size. θ is the generation direction of GLCM, and is typically taken at 0 °, 45 °, 90 ° and 135 °. Fig. 3 shows the process of generating a gray matrix with θ equal to 0 ° and d equal to 1: specifically, among the 14 texture features based on the GLCM, an Angular Second Moment (ASM), an Entropy (Entropy, ENT), a Contrast (CON), and a Correlation (COR) are selected to form a four-dimensional feature vector to comprehensively describe texture attributes of a region in an image.

As shown in equation (6), Euclidean Distance (ED) is used to describe the similarity of two adjacent texture region feature vectors (ASM1, ENT1, CON1, COR1) and (ASM2, ENT2, CON2, COR 2):

the larger the ED of the texture feature vectors of two adjacent regions is, the larger the difference of the texture attributes of the two regions is, the higher the probability that the two regions do not belong to the same object is, and the disparity mean value cannot be directly obtained and given to the position of the cavity; if the ED of the texture feature vectors of two adjacent regions is closer to 0, it indicates that the probability that the two regions belong to the same object is higher, and the average parallax can be assigned to the hole position. The self-adaptive adjustment of the parallax filling window is carried out according to the image texture information, so that the parallax filling can be correctly and effectively carried out, and a good parallax image can be obtained. The specific steps are shown in fig. 4:

s401, solving an initial disparity map by using a weighted SAD algorithm;

s402, inspecting holes and screening;

namely: and finding the holes in the initial disparity map.

S403, determining the position of the hole;

it should be noted that a plurality of holes may exist in one initial disparity map, and in the scheme provided in the present application, each hole needs to be filled one by one, so the position of the hole that needs to be filled at this time needs to be determined before each filling.

S404, solving ASM, ENT, CON and COR by taking the cavity as a center and different radiuses as windows;

s405, solving ED (edge distribution) by the radius of adjacent windows;

s406, averaging the minimum ED window to give a cavity;

it should be noted that, for one ED corresponds to two adjacent windows, in practical application, the larger the ED of the texture feature vectors of two adjacent regions is, the larger the difference of texture attributes of the two regions is, the higher the probability that the two regions do not belong to the same object is, and the disparity mean value cannot be directly obtained and given to the hole position; if the ED of the texture feature vectors of two adjacent regions is closer to 0, it indicates that the probability that the two regions belong to the same object is higher, and the average parallax can be assigned to the hole position. Therefore, the probability that the two adjacent windows and the hole corresponding to the minimum ED belong to the same object is high, and therefore, the hole portion can be filled based on any one of the image texture information of the two adjacent windows corresponding to the minimum ED. The self-adaptive adjustment of the parallax filling window is carried out according to the image texture information, so that the parallax filling can be correctly and effectively carried out, and a good parallax image can be obtained.

S407, judging whether filling is finished;

further, if yes, go straight to step S408; if not, go to step S403. In this way, after the filling of one cavity is completed, the filling of another cavity can be performed until the filling of the cavity is completed.

And S408, obtaining a final disparity map.

In conclusion; the scheme provided by the application is as follows: weighting is carried out on the SAD matching algorithm by introducing a two-dimensional Gaussian distribution coefficient, and after an initial disparity map is obtained, a window is selected by using texture information measured by GLCM (global Gaussian mixture model) to fill the hole aiming at the pixel points with the hole in the disparity map. Compared with the SAD algorithm, the disparity map obtained by the algorithm is more accurate, and preparation can be made for three-dimensional reconstruction of a later scene.

Fig. 5 is a schematic structural diagram of a three-dimensional reconstruction device based on an improved binocular matching SAD algorithm according to an embodiment of the present invention; referring to fig. 5, the present application provides a three-dimensional reconstruction apparatus based on an improved binocular matching SAD algorithm, including:

an obtaining module 501, configured to obtain an original picture; the original pictures comprise corresponding first pictures and second pictures;

an initial disparity map generation module 502, configured to obtain an initial disparity map through an improved SAD algorithm based on the original picture; the improved SAD algorithm is biased to the matching precision of central pixels of image blocks of two pictures in the process of generating an initial disparity map;

a filling module 503, configured to fill the holes of the initial disparity map to optimize the initial disparity map.

Fig. 6 is a schematic structural diagram of a three-dimensional reconstruction based on an improved binocular matching SAD algorithm according to an embodiment of the present invention. Referring to fig. 6, the present application provides a three-dimensional reconstruction apparatus based on an improved binocular matching SAD algorithm, including:

a processor 601, and a memory 602 connected to the processor 601;

the memory 602 is used for storing a computer program for performing at least the three-dimensional reconstruction method based on the improved binocular matching SAD algorithm according to any embodiment of the present application;

The present application further provides a storage medium storing a computer program which, when executed by a processor, implements a method for three-dimensional reconstruction based on an improved binocular matching SAD algorithm according to any of the embodiments of the present application.

It is understood that the same or similar parts in the above embodiments may be mutually referred to, and the same or similar parts in other embodiments may be referred to for the content which is not described in detail in some embodiments.

It should be noted that the terms "first," "second," and the like in the description of the present invention are used for descriptive purposes only and are not to be construed as indicating or implying relative importance. Further, in the description of the present invention, the meaning of "a plurality" means at least two unless otherwise specified.

Any process or method descriptions in flow charts or otherwise described herein may be understood as representing modules, segments, or portions of code which include one or more executable instructions for implementing specific logical functions or steps of the process, and alternate implementations are included within the scope of the preferred embodiment of the present invention in which functions may be executed out of order from that shown or discussed, including substantially concurrently or in reverse order, depending on the functionality involved, as would be understood by those reasonably skilled in the art of the present invention.

It should be understood that portions of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, the various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, any one or combination of the following techniques, which are known in the art, may be used: a discrete logic circuit having a logic gate circuit for implementing a logic function on a data signal, an application specific integrated circuit having an appropriate combinational logic gate circuit, a Programmable Gate Array (PGA), a Field Programmable Gate Array (FPGA), or the like.

It will be understood by those skilled in the art that all or part of the steps carried by the method for implementing the above embodiments may be implemented by hardware related to instructions of a program, which may be stored in a computer readable storage medium, and when the program is executed, the program includes one or a combination of the steps of the method embodiments.

In addition, functional units in the embodiments of the present invention may be integrated into one processing module, or each unit may exist alone physically, or two or more units are integrated into one module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode. The integrated module, if implemented in the form of a software functional module and sold or used as a stand-alone product, may also be stored in a computer readable storage medium.

The storage medium mentioned above may be a read-only memory, a magnetic or optical disk, etc.

In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.

Although embodiments of the present invention have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present invention, and that variations, modifications, substitutions and alterations can be made to the above embodiments by those of ordinary skill in the art within the scope of the present invention.

Claims

1. A three-dimensional reconstruction method based on an improved binocular matching SAD algorithm is characterized by comprising the following steps:

2. The three-dimensional reconstruction method based on the improved binocular matching SAD algorithm as claimed in claim 1, wherein the improved SAD algorithm is based on the original SAD algorithm, and introduces a weighting coefficient of two-dimensional Gaussian distribution with a zero mean value, so that when matching is performed, the matching precision of the central pixel of the image block is emphasized.

3. The method for three-dimensional reconstruction based on the improved binocular matching SAD algorithm according to claim 2, wherein the obtaining of the initial disparity map through the improved SAD algorithm based on the original picture comprises:

4. The improved binocular matching SAD algorithm based three-dimensional reconstruction method according to claim 2, wherein the filling of the holes of the initial disparity map fills the holes, comprising:

determining holes existing in the initial disparity map;

5. The method for three-dimensional reconstruction based on the improved binocular matching SAD algorithm according to claim 4, wherein the selecting of the window for hole filling using the texture information of GLCM metric comprises:

6. The three-dimensional reconstruction method based on the improved binocular matching SAD algorithm of claim 5, wherein the step of introducing the gray level co-occurrence matrix to measure the regions with different sizes of the textures around the holes to obtain Euclidean distances comprises the following steps:

sequentially selecting areas with different sizes;

7. The improved binocular matching SAD algorithm-based three-dimensional reconstruction method according to claim 5, wherein the adjacent areas are: and arranging the areas with different sizes according to the sizes of the areas, wherein the areas with the adjacent size relationship are adjacent areas.

8. A three-dimensional reconstruction device based on an improved binocular matching SAD algorithm is characterized by comprising:

9. A three-dimensional reconstruction device based on an improved binocular matching SAD algorithm is characterized by comprising:

a processor, and a memory coupled to the processor;

the memory is used for storing a computer program at least for executing the improved binocular matching SAD algorithm based three-dimensional reconstruction method of any one of claims 1 to 7;

10. A storage medium, characterized in that the storage medium stores a computer program which, when executed by a processor, implements the three-dimensional reconstruction method based on the modified binocular matching SAD algorithm according to any one of claims 1 to 7.