Disclosure of Invention
The invention aims to provide a method for realizing three-dimensional reconstruction of a workpiece based on binocular stereo vision.
The technical scheme for realizing the purpose of the invention is as follows: a method for realizing three-dimensional reconstruction of a workpiece based on binocular stereo vision comprises the following steps:
step 1: constructing a workpiece image acquisition system, wherein the workpiece image acquisition system comprises a workpiece three-dimensional rotating device and a binocular camera hardware measurement system;
step 2: each time the workpiece three-dimensional rotating device rotates by one angle increment from the initial position, the binocular camera hardware system collects a frame of workpiece image and the inclination angle of the workpiece three-dimensional rotating device is measured;
and step 3: carrying out gray processing, ROI region selection and adaptive median filtering on the collected image to obtain a binary image, and carrying out contour extraction on the binary image by using the Canny edge extraction algorithm;
and 4, step 4: extracting feature points from the left and right contour maps by adopting an SIFT algorithm, and performing stereo matching;
step 5, converting the pixel coordinates of the characteristic points into coordinates under a world coordinate system according to the calibration result obtained in the step 1 and the distance measured by the laser radar;
and 6, performing curve fitting on the coordinates of the obtained image characteristic points in a world coordinate system to obtain a workpiece contour map.
Preferably, the binocular camera hardware measurement system comprises two cameras and a laser radar, the centers of the two cameras and the workpiece rotating device are located on the same horizontal line, and the laser radar is located between the two cameras.
Preferably, an inclination sensor is arranged on the workpiece three-dimensional rotating device and used for measuring a rotating angle.
Preferably, the specific steps of extracting the feature points of the left and right contour maps by adopting the SIFT algorithm are as follows:
searching image positions over all scales, and identifying interest points that are invariant to scale and rotation through a Gaussian differential function;
at the position of each interest point, the position and scale of the feature point are determined by a fitting model.
Preferably, the specific method for determining the positions and scales of the feature points through the fitting model is as follows:
performing curve fitting by using the Taylor expansion of the DoG function in the scale space, wherein the Taylor expansion of the DoG function in the scale space is:

D(X) = D(X_0) + (∂D/∂X)^T (X − X_0) + (1/2)(X − X_0)^T (∂²D/∂X²)(X − X_0)

wherein D(X) is the Gaussian difference operator, X = (x, y, σ)^T represents the pixel coordinates at a given scale, σ is the scale factor, x and y are the coordinates of any pixel point in the image pixel coordinate system, and X_0 = (x_0, y_0, σ_0)^T is the sample point about which the expansion is taken;
and (3) taking the derivative of the Taylor expansion and setting it equal to zero to obtain the offset of the extreme point:

X̂ = −(∂²D/∂X²)^(−1) · (∂D/∂X)

the value of the DoG function at the corresponding extreme point is:

D(X̂) = D(X_0) + (1/2)(∂D/∂X)^T · X̂
preferably, the conversion relationship between the pixel coordinates and the world coordinates of the feature points is specifically:

Z_C · [u, v, 1]^T = [[1/dx, 0, u_0], [0, 1/dy, v_0], [0, 0, 1]] · [[f, 0, 0, 0], [0, f, 0, 0], [0, 0, 1, 0]] · [[R, T], [0^T, 1]] · [X_W, Y_W, Z_W, 1]^T

wherein (u, v) are the coordinates of the feature point in the pixel coordinate system, dx and dy are the sizes of a single pixel in the x and y directions in the physical coordinate system, (u_0, v_0) are the pixel coordinates of the principal point, f represents the focal length of the camera, R represents the third-order rotation matrix, T represents the translation column vector, Z_C is the depth of the point in the camera coordinate system, and (X_W, Y_W, Z_W) indicates the position of the point in the world coordinate system.
Compared with the prior art, the invention has the following remarkable advantages:
(1) the method is simple to operate, fast in processing, has low requirements on the environment, and is suitable for workpiece measurement in different environments;
(2) the method solves the problem that shadow areas are generated on the images due to insufficient illumination in the industrial environment, and the images of the workpieces at different angles are obtained by rotating the workpieces, so that the influence of the shadow areas on the subsequent three-dimensional reconstruction work is effectively avoided;
(3) the distance between the workpiece and the camera can be measured by adopting the laser radar, and the complete three-dimensional coordinates of the workpiece can be obtained by combining image coordinate conversion;
(4) the invention adopts the ADXL345 inclination sensor to measure the rotation angle of the workpiece, and feature matching is carried out between the left and right images at multiple angles;
(5) the invention directly transplants OpenCV to an ARM development board, calls the relevant core algorithms of the computer vision library, and carries out a series of image preprocessing steps on the acquired workpiece image, thereby selecting and identifying the ROI region.
The present invention is described in further detail below with reference to the attached drawing figures.
Detailed Description
As shown in fig. 1 to 3, a method for realizing three-dimensional reconstruction of a workpiece based on binocular stereo vision includes:
step 1: constructing a workpiece image acquisition system, wherein the workpiece image acquisition system comprises a workpiece three-dimensional rotating device and a binocular camera hardware measurement system, and an inclination sensor is arranged on the workpiece three-dimensional rotating device and used for measuring a rotating angle; the binocular camera hardware measurement system comprises two cameras and a laser radar, the centers of the two cameras and the workpiece rotating device are located on the same horizontal line, and the laser radar is located between the two cameras; the distance between the binocular camera hardware measurement system and the workpiece three-dimensional rotating device is measured, and the two cameras can move to achieve measurement of different distances.
Calibrating the left camera and the right camera to obtain internal parameters and relative attitude parameters of the two cameras; measuring the Z coordinate of the characteristic point of the workpiece by using the laser radar;
in some embodiments, the binocular camera hardware measurement system and the workpiece three-dimensional rotating device center are located on the same horizontal line, and the calculation amount of space conversion can be reduced.
When the workpiece image is acquired in an industrial environment, a large shadow area is generated due to insufficient illumination. Therefore, the influence of the shadow area needs to be reduced; the invention adopts the method of rotating the workpiece to acquire workpiece images at different angles, so as to reduce the influence of the shadow area.
Step 2: starting from an initial position, when the workpiece three-dimensional rotating device rotates by an angle, a binocular camera hardware system collects a frame of image, and an inclination angle is measured by an inclination sensor;
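As an illustrative sketch (not part of the claimed method), the inclination angle can be derived from the three axis readings of an ADXL345-class accelerometer; the function name and units below are assumptions:

```python
import math

def tilt_angle_deg(ax, ay, az):
    """Tilt of the device relative to gravity, from raw accelerometer
    components (any consistent unit, e.g. g or m/s^2)."""
    # Angle between the x axis and the horizontal plane:
    # theta = atan2(ax, sqrt(ay^2 + az^2)).
    return math.degrees(math.atan2(ax, math.sqrt(ay * ay + az * az)))

# A sensor lying flat (gravity entirely on z) reads zero tilt;
# rotated 90 degrees so gravity lies along x, it reads 90 degrees.
print(tilt_angle_deg(0.0, 0.0, 1.0))
print(tilt_angle_deg(1.0, 0.0, 0.0))
```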
the video Capture in OpenCV is used to open the camera, which is used to process the video file or the video stream of the camera, and can control the opening and closing of the camera, and the video stream can be read into the hardware platform and stored in the matrix frame by using the cap > frame, so as to process each frame image in the video.
And step 3: analyzing and processing the acquired image, wherein the analyzing and processing comprises gray processing, ROI area selection and self-adaptive median filtering to obtain a binary image, and extracting the contour of the binary image by using a canny edge extraction algorithm;
Because the video collected by the camera is in color, it is converted to a gray-level image during processing; the three components R, G, B of a gray-level image in RGB format are equal to each other and to the gray value. In OpenCV, the function declaration that converts the RGB color space to grayscale is cvCvtColor(const CvArr* src, CvArr* dst, int code), i.e. converting the original image src to dst, with code representing the color space conversion parameter; this function performs the gray-scale conversion on each frame of color image. The specific call is cvtColor(frame, edges, CV_BGR2GRAY), where frame is the original image and edges is the grayscale image.
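As a minimal sketch of what the BGR-to-gray conversion computes per pixel, the standard luminance weighting can be written directly (the function name is an assumption; OpenCV applies this per pixel over the whole matrix):

```python
def bgr_to_gray(b, g, r):
    # Standard luminance weights used by the CV_BGR2GRAY conversion:
    # gray = 0.299*R + 0.587*G + 0.114*B, rounded to an integer.
    return round(0.114 * b + 0.587 * g + 0.299 * r)

print(bgr_to_gray(255, 255, 255))  # 255 (white stays white)
print(bgr_to_gray(0, 0, 0))        # 0 (black stays black)
```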
Image denoising is a common step in image preprocessing; commonly used denoising algorithms include adaptive median filtering, Gaussian filtering and the like. Image noise mainly comes from the image acquisition and transmission process, and common types include additive noise, multiplicative noise, quantization noise, salt-and-pepper noise and the like. Adaptive median filtering is particularly suitable for salt-and-pepper noise, which appears as abrupt white or black spots; therefore, the present invention employs adaptive median filtering to eliminate noise.
Canny edge detection is carried out on the binary image, the edge of the image is detected, and a workpiece edge contour map is obtained;
and 4, extracting feature points from the left and right contour maps by adopting an SIFT algorithm, and performing stereo matching.
The SIFT algorithm is a feature description method used in the field of image processing; it detects key points in an image and describes them with a local feature descriptor. The SIFT algorithm is mainly divided into scale-space extreme value detection, key point positioning and key point feature description.
And (3) detection of extreme values in the scale space: image locations at all scales are searched, and potential scale- and rotation-invariant interest points are identified by Gaussian differential functions. The scale-space image is described as:

L(x, y, σ) = G(x, y, σ) * I(x, y),  where G(x, y, σ) = 1/(2πσ²) · exp(−(x² + y²)/(2σ²))

in the formula, L(x, y, σ) represents the image in the scale space, I(x, y) is the input image, G(x, y, σ) is a two-dimensional Gaussian kernel function whose scale can be changed, (x, y) are the coordinates of a pixel point, and σ is the scale factor.
Key point positioning: at the location of each interest point, the position and scale are determined by fitting a fine model. In some embodiments, curve fitting is performed using the Taylor expansion of the DoG function in scale space;
the Taylor expansion of the DoG function in scale space is:

D(X) = D(X_0) + (∂D/∂X)^T (X − X_0) + (1/2)(X − X_0)^T (∂²D/∂X²)(X − X_0)

wherein D(X) is the Gaussian difference operator, X = (x, y, σ)^T represents the pixel coordinates at a given scale, σ is the scale factor, x and y are the coordinates of any pixel point in the image pixel coordinate system, and X_0 = (x_0, y_0, σ_0)^T is the sample point about which the expansion is taken;
and (3) taking the derivative and setting it equal to zero gives the offset of the extreme point:

X̂ = −(∂²D/∂X²)^(−1) · (∂D/∂X)

the value of the DoG function at the corresponding extreme point is:

D(X̂) = D(X_0) + (1/2)(∂D/∂X)^T · X̂
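The sub-pixel offset computation described above can be sketched in pure Python by solving the 3x3 system H·X̂ = −∇D with Cramer's rule (the test quadratic and function names are illustrative assumptions):

```python
def solve3(h, g):
    # Solve H x = -g by Cramer's rule (3x3), giving the sub-pixel offset
    # x_hat = -H^{-1} grad(D) used in key-point refinement.
    def det(m):
        return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
                - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
                + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))
    d = det(h)
    b = [-v for v in g]
    sol = []
    for k in range(3):
        m = [row[:] for row in h]
        for j in range(3):
            m[j][k] = b[j]
        sol.append(det(m) / d)
    return sol

# For D(X) = -(x-0.3)^2 - (y+0.2)^2 - (s-0.1)^2 sampled at the origin,
# the gradient is (0.6, -0.4, 0.2) and the Hessian is -2*I; the recovered
# offset is the true extremum (0.3, -0.2, 0.1).
grad = [0.6, -0.4, 0.2]
hess = [[-2, 0, 0], [0, -2, 0], [0, 0, -2]]
print(solve3(hess, grad))
```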
and matching the characteristic points obtained from the left image and the right image.
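The left-right matching step can be sketched with nearest-neighbour matching under Lowe's ratio test, a common acceptance criterion for SIFT descriptors; the toy two-dimensional descriptors and the ratio threshold below are illustrative assumptions, not the claimed matcher:

```python
def match_ratio(desc_left, desc_right, ratio=0.8):
    # Accept a match only if the best Euclidean distance is well below
    # the second-best (Lowe's ratio test), rejecting ambiguous matches.
    matches = []
    for i, d1 in enumerate(desc_left):
        dists = sorted((sum((a - b) ** 2 for a, b in zip(d1, d2)) ** 0.5, j)
                       for j, d2 in enumerate(desc_right))
        if len(dists) > 1 and dists[0][0] < ratio * dists[1][0]:
            matches.append((i, dists[0][1]))
    return matches

left = [[1.0, 0.0], [0.0, 1.0]]
right = [[0.0, 1.0], [1.0, 0.1], [10.0, 10.0]]
print(match_ratio(left, right))  # [(0, 1), (1, 0)]
```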
Step 5, converting the coordinates of the characteristic points under the image pixel coordinate system into the coordinates under the world coordinate system according to the calibration result obtained in the step 1 and the distance measured by the laser radar;
step 5.1: converting the pixel coordinates of the image feature points into image physical coordinates;
for the feature point p, its coordinates are (u, v) in the pixel coordinate system and (x, y) in the physical coordinate system. Given that the dimensions of a single pixel in the x and y directions in the physical coordinate system are dx and dy respectively, the following equations hold:

u = x/dx + u_0,  v = y/dy + v_0

arranged in homogeneous transformation-matrix form:

[u, v, 1]^T = [[1/dx, 0, u_0], [0, 1/dy, v_0], [0, 0, 1]] · [x, y, 1]^T

in the formula, (u_0, v_0) represents the pixel coordinates of the origin of the image physical coordinate system.
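The pixel-to-physical conversion of step 5.1 is a simple affine relation; a minimal sketch (function name and the pixel-pitch/principal-point test values are assumptions):

```python
def pixel_to_physical(u, v, dx, dy, u0, v0):
    # Invert u = x/dx + u0 and v = y/dy + v0.
    return ((u - u0) * dx, (v - v0) * dy)

# Example: 4.8 um (0.0048 mm) square pixels, principal point at (320, 240).
# Pixel (420, 140) maps to roughly x = 0.48 mm, y = -0.48 mm.
print(pixel_to_physical(420, 140, 0.0048, 0.0048, 320, 240))
```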
Step 5.2: converting the physical coordinates of the image feature points into camera coordinates.
The camera coordinate system is a three-dimensional space coordinate system established with the optical center of the camera lens as origin, with the Z axis perpendicular to the image physical coordinate plane. According to the similar-triangle principle, the conversion matrix is:

Z_C · [x, y, 1]^T = [[f, 0, 0, 0], [0, f, 0, 0], [0, 0, 1, 0]] · [X_C, Y_C, Z_C, 1]^T

in the formula, (X_C, Y_C, Z_C) are the coordinates in the camera coordinate system, and f is the focal length of the camera.
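The similar-triangle projection of step 5.2 reduces to two divisions; a minimal sketch (function name and the metre/millimetre test values are assumptions):

```python
def project_to_image_plane(Xc, Yc, Zc, f):
    # Pinhole projection by similar triangles: x = f*Xc/Zc, y = f*Yc/Zc.
    return (f * Xc / Zc, f * Yc / Zc)

# A point 1 m in front of an 8 mm (0.008 m) lens, offset 0.1 m right
# and 0.05 m down, lands near (0.0008, -0.0004) m on the image plane.
print(project_to_image_plane(0.1, -0.05, 1.0, 0.008))
```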
Step 5.3, converting the camera coordinates of the image feature points into world coordinates;
finally, the conversion relation between the world coordinate system and the pixel coordinate system is obtained as:

Z_C · [u, v, 1]^T = [[1/dx, 0, u_0], [0, 1/dy, v_0], [0, 0, 1]] · [[f, 0, 0, 0], [0, f, 0, 0], [0, 0, 1, 0]] · [[R, T], [0^T, 1]] · [X_W, Y_W, Z_W, 1]^T

where f denotes the focal length of the camera, R denotes the third-order rotation matrix, T denotes the translation column vector, and (X_W, Y_W, Z_W) are the world coordinate system coordinates.
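Chaining the three conversions the other way round gives the back-projection used in step 5: with the lidar-measured depth supplying Z_C, a pixel can be lifted to world coordinates. The sketch below assumes an orthonormal R (so R^{-1} = R^T) and uses illustrative millimetre values; the function name and test numbers are not from the disclosure:

```python
def pixel_to_world(u, v, Zc, f, dx, dy, u0, v0, R, T):
    # Pixel -> physical -> camera (using the lidar depth Zc) -> world,
    # inverting X_cam = R * X_world + T.
    x = (u - u0) * dx
    y = (v - v0) * dy
    Xc = [x * Zc / f, y * Zc / f, Zc]
    # For a rotation matrix, R^{-1} = R^T.
    d = [Xc[i] - T[i] for i in range(3)]
    return [sum(R[j][i] * d[j] for j in range(3)) for i in range(3)]

# Identity extrinsics: the world frame coincides with the camera frame.
# f = 8 mm, 0.0048 mm pixels, principal point (320, 240), depth 2000 mm:
# pixel (420, 240) back-projects to approximately [120, 0, 2000] mm.
R = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
T = [0, 0, 0]
print(pixel_to_world(420, 240, 2000.0, 8.0, 0.0048, 0.0048, 320, 240, R, T))
```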
And 6, performing curve fitting on the world coordinates of the obtained image feature points to obtain a workpiece contour map.
The invention realizes three-dimensional reconstruction of the workpiece by multi-angle fusion: the workpiece is rotated by a certain angle, the angle data are recorded, and the left and right cameras each collect one frame of image. Then image preprocessing, image feature matching and other image algorithms are applied to each frame to extract the image feature points and acquire their pixel coordinates, after which coordinate conversion is performed on the pixel coordinates of the feature points to acquire the actual physical coordinates of the workpiece feature points. The method is simple and convenient to operate, can effectively realize three-dimensional reconstruction of a small workpiece, and effectively reduces the influence of image shadow areas.