CN110766797B - Three-dimensional map repairing method based on GAN - Google Patents
- Publication number
- CN110766797B (application CN201911045035.6A)
- Authority
- CN
- China
- Prior art keywords
- color
- color image
- image
- repaired
- pseudo
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
- G06T17/05—Geographic models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/90—Dynamic range modification of images or parts thereof
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- General Physics & Mathematics (AREA)
- Geometry (AREA)
- General Health & Medical Sciences (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Computational Linguistics (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Remote Sensing (AREA)
- Computer Graphics (AREA)
- Image Analysis (AREA)
- Image Processing (AREA)
Abstract
The invention relates to the technical field of deep learning, and in particular to a three-dimensional map repairing method based on GAN. First, the input original color image to be repaired and its corresponding pseudo-color disparity map to be repaired are scaled and normalized to size H × W. Second, the color image and the pseudo-color disparity map to be repaired are input into a generation model. Third, the generation model completes the repair of both, yielding a repaired color image and a repaired pseudo-color disparity map. In this GAN-based three-dimensional map repairing method, both the generation model and the discrimination model are convolutional neural networks; since convolutional neural networks extract image features well, the method solves the problem that traditional methods cannot process texture-less or weakly textured images.
Description
Technical Field
The invention relates to the technical field of deep learning, in particular to a three-dimensional map repairing method based on GAN.
Background
The high-precision map is one of the core technologies of autonomous driving and directly determines how intelligent an unmanned vehicle can be. Repair problems can be classified into color image repair, depth image repair, and three-dimensional repair. Color image repair and depth image repair are currently active research directions in computer vision, but repairing a single color image or a single depth image alone cannot support accurate high-precision three-dimensional map construction; to construct a map with high accuracy, the repair must be three-dimensional. According to what is repaired, three-dimensional repair divides into direct repair of the three-dimensional point cloud and collaborative repair of a color image and a depth image (a disparity map or pseudo-color disparity map). A three-dimensional point cloud can express space directly, but it is defined on an irregular non-Euclidean domain and has properties such as disorder and rotation, so high-precision point-cloud repair is difficult to achieve. The other approach to three-dimensional repair is collaborative repair of the color image and the depth image: since a dynamic obstacle appears in both the color image and the depth image, both must be repaired simultaneously during three-dimensional repair. At present there is little research on such collaborative repair, and because of the data differences between color and depth images, existing work repairs them with traditional methods.
However, traditional methods struggle with texture-less or weakly textured regions when processing color and depth images. Convolutional neural networks extract image features well, but to date there has been no deep-learning-based research on the collaborative repair of a color image and a depth image (disparity map or pseudo-color disparity map).
Disclosure of Invention
To overcome the above defects in the prior art, the invention provides a GAN-based three-dimensional map repairing method whose implementation is simple and convenient.
To solve the technical problem, the invention adopts the following scheme: a three-dimensional map repairing method based on GAN. First, the original color image to be repaired and its corresponding pseudo-color disparity map to be repaired are scaled to size H × W and normalized so that pixel values lie in (-1, 1). Second, the color image and the pseudo-color disparity map to be repaired are input into the generation model. Third, the generation model completes the repair of both, yielding a repaired color image and a repaired pseudo-color disparity map.
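The preprocessing step above (scale to H × W, normalize pixel values into (-1, 1)) can be sketched as follows; the function name `preprocess` and the nearest-neighbor resize are illustrative choices, not taken from the patent, which does not specify a resampling method.

```python
import numpy as np

def preprocess(image: np.ndarray, H: int, W: int) -> np.ndarray:
    """Scale an image to H x W (nearest-neighbor, for brevity) and
    normalize 8-bit values into (-1, 1), as the method describes."""
    h, w = image.shape[:2]
    rows = np.arange(H) * h // H          # source row for each output row
    cols = np.arange(W) * w // W          # source column for each output column
    resized = image[rows][:, cols]
    # Map [0, 255] -> [-1, 1]; the tanh-style range common in GAN generators.
    return resized.astype(np.float32) / 127.5 - 1.0
```

Both the color image and the pseudo-color disparity map would pass through the same routine before entering the generation model.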
Further, the color image to be repaired and the pseudo-color disparity map to be repaired are fed into the generation model simultaneously, and the generation model repairs both.
Further, the method comprises a discrimination model, which judges the quality of the generation model's output; if the discrimination result shows the output is not yet good, the generation model continues training.
Further, the generation model comprises a feature-sharing module, a color-image feature module and a pseudo-color-disparity-map feature module. The feature-sharing module comprises 4 neural-network layers: the first is an input layer and the rest are convolutional layers; the data dimension of the first layer is H × W × 6, of the second and third layers (H/2) × (W/2) × 256, and of the fourth layer (H/4) × (W/4) × 512. The color-image feature module and the pseudo-color-disparity-map feature module each comprise 4 convolutional layers; the data dimension of the first layer is (H/4) × (W/4) × 512, of the second and third layers (H/2) × (W/2) × 256, and of the fourth layer H × W × 3.
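The layer dimensions listed above can be traced with a small helper; the function name and the (height, width, channels) tuple layout are conventions of this sketch, and the six input channels are assumed to be the stacked 3-channel color image and 3-channel pseudo-color disparity map, as the joint input suggests.

```python
def generator_shapes(H, W):
    """Trace the data dimensions of the generation model described above:
    a 4-layer feature-sharing trunk followed by two 4-layer branches
    (color-image and pseudo-color-disparity-map feature modules)."""
    shared = [
        (H, W, 6),              # input layer: color (3ch) + pseudo-color disparity (3ch)
        (H // 2, W // 2, 256),  # conv, spatial halving
        (H // 2, W // 2, 256),  # conv
        (H // 4, W // 4, 512),  # conv, spatial halving
    ]
    branch = [                  # identical dims for both feature modules
        (H // 4, W // 4, 512),
        (H // 2, W // 2, 256),  # back toward input resolution
        (H // 2, W // 2, 256),
        (H, W, 3),              # repaired 3-channel output
    ]
    return shared, branch
```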
Further, the discrimination model comprises a feature-sharing module, a color-image feature module and a pseudo-color-disparity-map feature module. The feature-sharing module comprises 3 neural-network layers: the first is an input layer and the rest are convolutional layers; the data dimension of the first layer is H × W × 6, of the second layer (H/2) × (W/2) × 256, and of the third layer (H/4) × (W/4) × 512. The color-image feature module and the pseudo-color-disparity-map feature module each comprise 3 convolutional layers; the data dimension of the first and second layers is (H/4) × (W/4) × 512, and of the third layer 1 × 1.
In the invention, the activation function of both the generation model and the discrimination model is ReLU, the batch size is 64, the convolution kernel size is 3 × 3, the stride is 2, and the optimizer is Adam.
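With kernel size 3 × 3 and stride 2, the halving of spatial dimensions between layers follows the standard convolution output-size formula; the padding value below is an assumption, since the description does not state it.

```python
def conv_out(n, k=3, s=2, p=1):
    """Output size of a convolution over n input positions with kernel k,
    stride s and padding p; p=1 gives the clean halving seen in the
    layer dimensions above (padding itself is not specified in the text)."""
    return (n + 2 * p - k) // s + 1
```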
Further, the training process of the discrimination model comprises the following steps:

S1, a real color image and its corresponding real pseudo-color disparity map, together with a color image generated by the generation model and its corresponding generated pseudo-color disparity map, are input into the discrimination model D. The discriminator loss function L_dis is the expectation of the L1 distance between the real color image and the discrimination result of the color image to be repaired, or between the real pseudo-color disparity map and the discrimination result of the pseudo-color disparity map to be repaired, where: x is the real color image or real pseudo-color disparity map; G(z) is the generator's repair result for the color image or pseudo-color disparity map to be repaired; and D(x) is the discrimination result for the real color image or real pseudo-color disparity map.

S2, the discrimination model is trained.

S3, the authenticity of the discrimination model's outputs, i.e. the color-image result and the pseudo-color-disparity-map result, is judged through a sigmoid function.
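The sigmoid-based authenticity check in S3 can be sketched as follows; the 0.5 decision threshold is a conventional choice, not stated in the description.

```python
import math

def sigmoid(x):
    """Squash a raw discriminator output into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))

def judge(logit, threshold=0.5):
    """Call an input 'real' when the squashed score exceeds the
    threshold; applied separately to the color-image result and the
    pseudo-color-disparity-map result."""
    return sigmoid(logit) > threshold
```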
Further, the training process of the generation model comprises the following steps:

S1, the color image to be repaired and the corresponding pseudo-color disparity map to be repaired are input into the generation model. The generator loss function L_gen is the expectation of the L1 distance between the real color image and the discrimination result of the color image to be repaired, or between the real pseudo-color disparity map and the discrimination result of the pseudo-color disparity map to be repaired, where: x is the real color image or real pseudo-color disparity map; G(z) is the generator's repair result for the color image or pseudo-color disparity map to be repaired; and D(x) is the discrimination result for the real color image or real pseudo-color disparity map.

S2, the generation model completes the repair of the color image and the pseudo-color disparity map to be repaired, yielding a repaired color image and a repaired pseudo-color disparity map.
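The "expectation of the L1 distance" used for the generator loss reduces, over a batch of pixels, to a mean absolute difference; this pure-Python sketch illustrates the computation (real training code would operate on tensors).

```python
def l1_loss(real, generated):
    """Mean absolute (L1) distance between real and repaired pixel
    values -- the expectation the description uses for L_gen. Inputs
    are flat sequences of pixel values of equal length."""
    assert len(real) == len(generated)
    return sum(abs(a - b) for a, b in zip(real, generated)) / len(real)
```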
Further, during training, the generation model and the discrimination model are trained iteratively, alternating between the two.
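The iterative training of the two models can be sketched as an alternating loop; the one-discriminator-step-then-one-generator-step schedule is a common convention and an assumption here, since the description does not fix the alternation ratio.

```python
def train_gan(generator_step, discriminator_step, epochs):
    """Alternate one discriminator update and one generator update per
    iteration; the two step callables return their current losses."""
    history = []
    for _ in range(epochs):
        d_loss = discriminator_step()
        g_loss = generator_step()
        history.append((d_loss, g_loss))
    return history
```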
Compared with the prior art, the beneficial effects are:
1. In the GAN-based three-dimensional map repairing method, both the generation model and the discrimination model are convolutional neural networks. Since convolutional neural networks extract image features well, this solves the problem that traditional methods cannot process texture-less or weakly textured images;
2. The GAN-based three-dimensional map repairing method is an end-to-end learning method, and its implementation is simple and convenient.
Drawings
FIG. 1 is a schematic flow diagram of the process of the present invention.
FIG. 2 is a schematic flow chart of the training method of the present invention.
Detailed Description
The drawings are for illustration purposes only and are not to be construed as limiting the invention; for the purpose of better illustrating the embodiments, certain features of the drawings may be omitted, enlarged or reduced, and do not represent the size of an actual product; it will be understood by those skilled in the art that certain well-known structures in the drawings and descriptions thereof may be omitted. The positional relationships depicted in the drawings are for illustrative purposes only and are not to be construed as limiting the invention.
Example 1:
As shown in fig. 1 and 2, the GAN-based three-dimensional map repairing method comprises the following 3 parts:
1) collecting and processing training data and test data;
step 1, calibrating a binocular camera;
step 3, using data collected in scenes without obstacles as ground truth and data collected in scenes with obstacles as inputs to be repaired, calculate the disparity of the color image through an optical-flow algorithm and from it obtain the pseudo-color disparity map;
step 5, perform panoptic segmentation of the color image with the Panoptic FPN neural-network architecture and extract the obstacle region as the region to be repaired;
step 6, set 70% of the data (color images and corresponding pseudo-color disparity maps) as training data, 20% as test data and 10% as validation data.
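The 70/20/10 split in step 6 can be sketched as follows; shuffling with a fixed seed is an added assumption for reproducibility, not stated in the text.

```python
import random

def split_dataset(samples, seed=0):
    """Split (color image, pseudo-color disparity map) pairs into
    70% training, 20% test and 10% validation data."""
    rng = random.Random(seed)
    samples = samples[:]          # avoid mutating the caller's list
    rng.shuffle(samples)
    n = len(samples)
    n_train = int(n * 0.7)
    n_test = int(n * 0.2)
    return (samples[:n_train],
            samples[n_train:n_train + n_test],
            samples[n_train + n_test:])
```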
2) Generating model G and discriminating model D for training GAN
(1) The training steps of the discriminant model D are as follows:
step 1, input a real color image and its corresponding real pseudo-color disparity map, together with a color image generated by the generation model G and its corresponding generated pseudo-color disparity map, into the discrimination model D;
step 3, judge the authenticity of the output of the discrimination model D through the sigmoid function.
(2) The training steps for generating the model are as follows:
step 1, inputting a color image to be repaired and a corresponding pseudo color disparity map to be repaired into a generation model G;
3) generating model G based on GAN to realize three-dimensional map repair
step 1, scale and normalize the input original color image to be repaired and its corresponding pseudo-color disparity map to be repaired, so that their size is H × W and pixel values lie in (-1, 1);
step 3, the generation model G completes the repair of the color image and the pseudo-color disparity map to be repaired, obtaining the repaired color image and the repaired pseudo-color disparity map.
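The inference flow of part 3 can be tied together in a short sketch; the `generator` callable and its two-in, two-out signature are illustrative, since the patent does not define a concrete API, and spatial resizing is omitted for brevity.

```python
def normalize(img):
    """Map 8-bit pixel values into (-1, 1), matching step 1 of part 3.
    img is a nested list of pixel values."""
    return [[v / 127.5 - 1.0 for v in row] for row in img]

def repair_3d_map(color_img, disparity_img, generator):
    """Normalize the color image and pseudo-color disparity map, pass
    them jointly to the trained generation model G, and return the
    repaired pair (steps 1-3 of part 3)."""
    color = normalize(color_img)
    disparity = normalize(disparity_img)
    return generator(color, disparity)
```

With a trained model, `generator` would be the network's forward pass; here any callable with the same interface can stand in for testing the plumbing.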
It should be understood that the above-described embodiments are merely examples for clearly illustrating the invention and do not limit its embodiments. Other variations and modifications will be apparent to persons skilled in the art in light of the above description; it is neither necessary nor possible to enumerate all embodiments exhaustively. Any modification, equivalent replacement or improvement made within the spirit and principle of the invention shall fall within the protection scope of the claims.
Claims (6)
1. A three-dimensional map repairing method based on GAN, characterized in that: first, the original color image to be repaired and its corresponding pseudo-color disparity map to be repaired in the input three-dimensional map are scaled and normalized to size H × W; second, the color image and the pseudo-color disparity map to be repaired are fed into a generation model simultaneously, and the generation model repairs both simultaneously; third, the generation model completes the repair, obtaining a repaired color image and a repaired pseudo-color disparity map;
the training process of the generative model comprises the following steps:
s21, inputting the color image to be repaired and the corresponding pseudo color disparity map to be repaired into a generation model, wherein a generator loss function L of the color image or the pseudo color disparity map is used for generating a loss function Lgen,LgenThe expectation of the distance between the true color image and the color image discrimination result L1 to be repaired or the expectation of the distance between the true-false color image and the false color parallax image discrimination result L1 to be repaired is as follows:
wherein x is the real color image or real pseudo-color disparity map; G(z) is the generator's prediction for the color image or pseudo-color disparity map to be repaired; and D(x) is the discrimination result for the real color image or real pseudo-color disparity map;
S22, the generation model completes the repair of the color image and the pseudo-color disparity map to be repaired, obtaining the repaired color image and the repaired pseudo-color disparity map.
2. The GAN-based three-dimensional map repairing method of claim 1, further comprising a discrimination model for judging the quality of the generation model's output, wherein if the discrimination result shows the output is not good, the generation model continues training.
3. The GAN-based three-dimensional map repairing method according to claim 2, wherein the generation model comprises a feature-sharing module, a color-image feature module and a pseudo-color-disparity-map feature module; the feature-sharing module comprises 4 neural-network layers, the first being an input layer and the rest convolutional layers; the data dimension of the first layer is H × W × 6, of the second and third layers (H/2) × (W/2) × 256, and of the fourth layer (H/4) × (W/4) × 512; the color-image feature module and the pseudo-color-disparity-map feature module each comprise 4 convolutional layers, the data dimension of the first layer being (H/4) × (W/4) × 512, of the second and third layers (H/2) × (W/2) × 256, and of the fourth layer H × W × 3.
4. The GAN-based three-dimensional map repairing method according to claim 2, wherein the discrimination model comprises a feature-sharing module, a color-image feature module and a pseudo-color-disparity-map feature module; the feature-sharing module comprises 3 neural-network layers, the first being an input layer and the rest convolutional layers; the data dimension of the first layer is H × W × 6, of the second layer (H/2) × (W/2) × 256, and of the third layer (H/4) × (W/4) × 512; the color-image feature module and the pseudo-color-disparity-map feature module each comprise 3 convolutional layers, the data dimension of the first and second layers being (H/4) × (W/4) × 512, and of the third layer 1 × 1.
5. The GAN-based three-dimensional map repairing method according to any one of claims 2 to 4, wherein the training process of the discrimination model comprises the following steps:
s11, inputting the true color image and the true and false color parallax map corresponding to the true color image into a discrimination model D, wherein: discriminator loss function LdisThe expectation of the distance between the true color image and the color image discrimination result L1 to be repaired or the expectation of the distance between the true-false color image and the false color parallax image discrimination result L1 to be repaired is as follows:
in the formula: x is the real color image or real pseudo-color disparity map; G(z) is the generator's prediction for the color image or pseudo-color disparity map to be repaired; and D(x) is the discrimination result for the real color image or real pseudo-color disparity map;
s12, training a discrimination model;
S13, the authenticity of the discrimination model's outputs, i.e. the color-image result and the pseudo-color-disparity-map result, is judged through a sigmoid function.
6. The GAN-based three-dimensional map repairing method according to claim 5, wherein the generation model and the discrimination model are trained iteratively during the training process.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911045035.6A CN110766797B (en) | 2019-10-30 | 2019-10-30 | Three-dimensional map repairing method based on GAN |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110766797A CN110766797A (en) | 2020-02-07 |
CN110766797B true CN110766797B (en) | 2021-08-13 |
Family
ID=69333046
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911045035.6A Active CN110766797B (en) | 2019-10-30 | 2019-10-30 | Three-dimensional map repairing method based on GAN |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110766797B (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106600583A (en) * | 2016-12-07 | 2017-04-26 | 西安电子科技大学 | Disparity map acquiring method based on end-to-end neural network |
CN109191402A (en) * | 2018-09-03 | 2019-01-11 | 武汉大学 | The image repair method and system of neural network are generated based on confrontation |
CN109801230A (en) * | 2018-12-21 | 2019-05-24 | 河海大学 | A kind of image repair method based on new encoder structure |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107818554B (en) * | 2016-09-12 | 2023-04-21 | 索尼公司 | Information processing apparatus and information processing method |
Non-Patent Citations (1)
Title |
---|
Research on Visual Three-Dimensional Reconstruction Based on Deep Learning; Liu Liqiang; China Master's Theses Database; 2019-07-30 (No. 7); full text *
Also Published As
Publication number | Publication date |
---|---|
CN110766797A (en) | 2020-02-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Uittenbogaard et al. | Privacy protection in street-view panoramas using depth and multi-view imagery | |
Vaudrey et al. | Differences between stereo and motion behaviour on synthetic and real-world stereo sequences | |
CN112085728B (en) | Submarine pipeline and leakage point detection method | |
CN110321877B (en) | Three-eye rearview mirror and three-eye vision safe driving method and system | |
CN113159043B (en) | Feature point matching method and system based on semantic information | |
CN109509156B (en) | Image defogging processing method based on generation countermeasure model | |
CN113052066B (en) | Multi-mode fusion method based on multi-view and image segmentation in three-dimensional target detection | |
CN112734915A (en) | Multi-view stereoscopic vision three-dimensional scene reconstruction method based on deep learning | |
CN117036641A (en) | Road scene three-dimensional reconstruction and defect detection method based on binocular vision | |
Zhang et al. | RangeLVDet: Boosting 3D object detection in LiDAR with range image and RGB image | |
CN113538569A (en) | Weak texture object pose estimation method and system | |
CN110688976A (en) | Store comparison method based on image identification | |
CN111861935B (en) | Rain removing method based on image restoration technology | |
CN110766797B (en) | Three-dimensional map repairing method based on GAN | |
CN116229426B (en) | Unmanned parking space detection method based on panoramic all-around image | |
Nouduri et al. | Deep realistic novel view generation for city-scale aerial images | |
CN112001954A (en) | Polar curve constraint-based underwater PCA-SIFT image matching method | |
Huang et al. | FFNet: A simple image dedusting network with feature fusion | |
CN115294371B (en) | Complementary feature reliable description and matching method based on deep learning | |
TWI805282B (en) | Methods and apparatuses of depth estimation from focus information | |
CN111583131B (en) | Defogging method based on binocular image | |
CN115330935A (en) | Three-dimensional reconstruction method and system based on deep learning | |
CN113870162A (en) | Low-light image enhancement method integrating illumination and reflection | |
CN111951319A (en) | Image stereo matching method | |
Babu et al. | Development and performance evaluation of enhanced image dehazing method using deep learning networks |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||