CN116912091A - Binocular fisheye image stitching method and system

Publication number: CN116912091A
Application number: CN202310855895.6A
Authority: CN (China)
Original language: Chinese (zh)
Legal status: Pending
Inventors: 董安明, 金占杰, 禹继国, 臧传浩, 主洪磊, 宋守良, 高斌, 韩玉冰, 李素芳, 张丽
Assignee (original and current): Qilu University of Technology
Priority: CN202310855895.6A

Classifications

    • G06T3/4038 Image mosaicing, e.g. composing plane images from plane sub-images
    • G06N3/0464 Convolutional networks [CNN, ConvNet]
    • G06N3/048 Activation functions
    • G06N3/08 Learning methods
    • G06T7/66 Analysis of geometric attributes of image moments or centre of gravity
    • G06T7/80 Analysis of captured images to determine intrinsic or extrinsic camera parameters, i.e. camera calibration
    • G06T2200/32 Indexing scheme for image data processing or generation involving image mosaicing
    • G06T2207/20081 Training; Learning
    • G06T2207/20084 Artificial neural networks [ANN]
    • Y02T10/40 Engine management systems


Abstract

The invention discloses a binocular fisheye image stitching method and system, belonging to the technical field of image processing and aiming to solve the technical problems of long stitching time and ghosting (double images) in panoramic image stitching from binocular fisheye images. The method comprises the following steps: simultaneously acquiring two fisheye images through a binocular fisheye lens; obtaining the camera parameters through a camera calibration method and performing distortion correction on each fisheye image based on those parameters; performing coarse image alignment and image reconstruction on the corrected fisheye images based on an image stitching framework; and rectangling the stitched image by warping it from a predicted initial grid to a predefined target grid, obtaining a rectangular image.

Description

Binocular fisheye image stitching method and system
Technical Field
The invention relates to the technical field of image processing, and in particular to a binocular fisheye image stitching method and system.
Background
With the development of science and technology, panoramic vision has become a highly active technical concept. It is widely applied in the field of virtual reality and touches many aspects of daily life, and its applications keep broadening, including national defense, rescue, security monitoring, and medical treatment. The prerequisite for achieving omnidirectional vision is acquiring a panoramic image that contains all surrounding scene information. Such images are generally captured with expensive professional equipment and obtained through complex post-processing, which greatly hinders their use and spread among the general public. Acquiring panoramic pictures more conveniently is therefore a hot topic in the field of panoramic vision.
A panoramic image is a live-action picture that offers the user an ultra-wide field of view. Generating a panoramic image based on image stitching mainly takes one of the following two forms:
(1) Purpose-built multi-lens equipment: such equipment is generally expensive, often unaffordable for the general public, and unsuitable for popularization;
(2) Collecting multiple images through a single lens and stitching them: this approach is complex to operate, inefficient, of low applicability in practical engineering, and only suitable for static scenes.
A fisheye lens has a much wider viewing angle than an ordinary camera. To simplify the camera rig needed to capture a panorama covering all directions, a binocular camera simultaneously captures two fisheye images facing opposite directions that together cover the panorama, and a series of image processing steps stitches them into a panoramic image. Obtaining a good panoramic image requires correcting and stitching the binocular fisheye images, which is a key difficulty affecting the practical application of fisheye lenses.
Current fisheye correction methods fall mainly into projection-transformation-model-based methods and calibration-based methods. The former, based on spherical perspective projection, optimize an objective function by fitting a polynomial, estimate the correction model parameters, and derive the corrected image; they are computationally complex and poor in real-time performance. Calibration-based correction methods mainly calibrate the intrinsic and extrinsic parameters of the fisheye camera with external equipment and correct fisheye distortion through the coordinate conversion between real-world coordinates and fisheye imaging-plane coordinates.
How to shorten the stitching time and remove the ghosting that occur in panoramic image stitching from binocular fisheye images is a technical problem to be solved.
Disclosure of Invention
In view of the above defects, the technical task of the invention is to provide a binocular fisheye image stitching method and system that solve the technical problems of long stitching time and ghosting in panoramic image stitching from binocular fisheye images.
In a first aspect, the invention provides a binocular fisheye image stitching method comprising the following steps:
Image acquisition: simultaneously acquiring two fisheye images through a binocular fisheye lens;
Image correction: obtaining the camera parameters through a camera calibration method, and performing distortion correction on each fisheye image based on those parameters to obtain corrected fisheye images;
Image stitching: constructing an image stitching framework based on an unsupervised coarse alignment method and an unsupervised image reconstruction method, and performing coarse image alignment and image reconstruction on the corrected fisheye images based on the framework to obtain a stitched image;
Image rectangling: predicting an initial grid based on a fully convolutional network and a residual progressive regression strategy, predefining a target grid with a rigid shape, and rectangling the stitched image by warping it from the predicted initial grid to the predefined target grid to obtain a rectangular image.
Preferably, the camera parameters are obtained through a camera calibration method comprising the following steps:
selecting a point as the origin of a world coordinate system, and constructing the world coordinate system;
for each fisheye image, obtaining the three-dimensional coordinates of each pixel point in the world coordinate system through an image detection algorithm;
converting the three-dimensional coordinates of each pixel point into pixel coordinates through the camera coordinate system and the image-plane coordinate system, after rigid transformation and projection;
for the pixel coordinates of each pixel point, determining the position of the image center and converting to the final pixel coordinate system through translation.
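The chain above is the standard pinhole projection. As an illustration only (not code from the patent, with purely invented intrinsic and extrinsic values), the world-to-pixel conversion can be sketched for a single ideal, distortion-free point:

```python
import numpy as np

# World -> camera (rigid transform) -> image plane (projection) -> pixel
# coordinates (translation to the principal point), for one ideal point.
# All numeric values are illustrative assumptions, not from the patent.

fx, fy = 800.0, 800.0                    # focal lengths in pixels
cx, cy = 640.0, 360.0                    # principal point (image centre)
K = np.array([[fx, 0.0, cx],
              [0.0, fy, cy],
              [0.0, 0.0, 1.0]])          # intrinsic matrix

R = np.eye(3)                            # world-to-camera rotation
t = np.array([[0.0], [0.0], [0.5]])      # world-to-camera translation

X_world = np.array([[0.2], [0.1], [2.0]])    # 3-D point in the world frame
X_cam = R @ X_world + t                      # rigid transformation
uvw = K @ X_cam                              # perspective projection
u, v = uvw[0, 0] / uvw[2, 0], uvw[1, 0] / uvw[2, 0]
print(f"pixel coordinates: ({u:.1f}, {v:.1f})")
```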
Preferably, coarse image alignment is performed on the corrected fisheye images based on the image stitching framework, comprising the following steps:
constraining an unsupervised homography network based on ablation, and introducing a stitching-domain transformation layer for reducing image size;
for each corrected fisheye image, extracting its features through a convolutional network, taking the extracted features as input, estimating the image homography through the unsupervised homography network, and obtaining a warped image by warping the corrected image and performing coarse alignment;
putting the warped image into the stitching-domain transformation layer to reduce redundant black pixels in the image, obtaining the final warped image.
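As a hedged sketch of the warping step alone: in the patent the 3×3 homography is predicted by the unsupervised network, so a hand-written matrix stands in for that prediction below, and the file names are placeholders:

```python
import cv2
import numpy as np

# Coarse-alignment warp sketch. The 3x3 homography is a hand-written
# stand-in for the matrix estimated by the unsupervised homography network;
# the file names are placeholders.

target = cv2.imread("corrected_right.png")       # a corrected fisheye image
H_pred = np.array([[1.02, 0.01, 35.0],
                   [0.00, 1.01,  4.0],
                   [0.00, 0.00,  1.0]])

h, w = target.shape[:2]
# Warp into a canvas wide enough to hold both views; the stitching-domain
# transformation layer then crops away the redundant black pixels.
warped = cv2.warpPerspective(target, H_pred, (2 * w, h))
cv2.imwrite("warped_target.png", warped)
```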
Preferably, when image reconstruction is performed on the corrected fisheye images based on the image stitching framework, feature-to-pixel ghosting is eliminated through an unsupervised reconstruction network; the reconstruction network comprises a low-resolution deformation branch and a high-resolution refinement branch, the deformation rules of image stitching being learned through the low-resolution deformation branch and the resolution of the stitched image being improved through the high-resolution refinement branch;
learning the deformation rules of image stitching through the low-resolution deformation branch comprises the following operations:
downsampling the warped images to reduce their resolution, obtaining downsampled images;
inputting the downsampled images into an encoder-decoder structure consisting of pooling layers and deconvolution layers;
deforming the downsampled images through the skip connections of the encoder-decoder structure, obtaining a content mask and a seam mask of the downsampled images, the content mask being used to constrain the features of the reconstructed image to stay close to the warped images, the seam mask being used to constrain the edge of the overlap region to be natural and continuous, the overlap region being the region of overlap between the two fisheye images;
the high-resolution refinement branch consists of convolution layers and residual blocks, each residual block consisting in sequence of a convolution layer, a ReLU activation function, a convolution layer, a summation, and a ReLU activation function;
calculating the content loss and seam loss of the warped images under the high-resolution refinement branch, and refining the stitched image by combining the low-resolution content mask and seam mask of the warped images, obtaining the stitched image of the two warped fisheye images.
Preferably, the stitched image is rectangled by warping it from the predicted initial grid to the predefined target grid, comprising the following steps:
taking the stitched image and its corresponding mask as input, and extracting high-level semantic features from the input through a feature extractor to obtain a feature map, the feature extractor consisting of stacked convolution-pooling blocks;
predicting the horizontal and vertical motion of each vertex in the feature map with respect to a regular grid through a grid motion regressor, the grid motion regressor having a fully convolutional structure;
constructing two regressors with the same structure based on a residual progressive regression strategy, namely a first regressor and a second regressor, the first regressor progressively predicting the primary grid motion and the second regressor progressively predicting the residual grid motion;
predicting the primary grid motion through the first regressor and performing grid deformation on the feature map, predicting the residual grid motion through the second regressor and performing grid deformation on the warped feature map, and maintaining the shape of the deformed grid based on intra-grid and inter-grid constraints during grid deformation, obtaining a rectangular feature map.
In a second aspect, the invention provides a binocular fisheye image stitching system that performs image stitching by a binocular fisheye image stitching method according to any implementation of the first aspect, the system comprising:
an image acquisition module for simultaneously acquiring two fisheye images through a binocular fisheye lens;
an image correction module for obtaining the camera parameters through a camera calibration method and performing distortion correction on each fisheye image based on those parameters to obtain corrected fisheye images;
an image stitching module for constructing an image stitching framework based on an unsupervised coarse alignment method and an unsupervised image reconstruction method, and performing coarse image alignment and image reconstruction on the corrected fisheye images based on the framework to obtain a stitched image;
and an image rectangling module for predicting an initial grid based on a fully convolutional network and a residual progressive regression strategy, predefining a target grid with a rigid shape, and rectangling the stitched image by warping it from the predicted initial grid to the predefined target grid to obtain a rectangular image.
Preferably, the image correction module is configured to obtain the camera parameters through a camera calibration method by performing the following:
selecting a point as the origin of a world coordinate system, and constructing the world coordinate system;
for each fisheye image, obtaining the three-dimensional coordinates of each pixel point in the world coordinate system through an image detection algorithm;
converting the three-dimensional coordinates of each pixel point into pixel coordinates through the camera coordinate system and the image-plane coordinate system, after rigid transformation and projection;
for the pixel coordinates of each pixel point, determining the position of the image center and converting to the final pixel coordinate system through translation.
Preferably, the image stitching module is configured to perform coarse image alignment of the corrected fisheye images as follows:
constraining an unsupervised homography network based on ablation, and introducing a stitching-domain transformation layer for reducing image size;
for each corrected fisheye image, extracting its features through a convolutional network, taking the extracted features as input, estimating the image homography through the unsupervised homography network, and obtaining a warped image by warping the corrected image and performing coarse alignment;
putting the warped image into the stitching-domain transformation layer to reduce redundant black pixels in the image, obtaining the final warped image.
Preferably, when performing image reconstruction on the corrected fisheye images, the image stitching module is configured to eliminate feature-to-pixel ghosting through an unsupervised reconstruction network; the reconstruction network comprises a low-resolution deformation branch and a high-resolution refinement branch, the low-resolution deformation branch being used to learn the deformation rules of image stitching and the high-resolution refinement branch being used to improve the resolution of the stitched image;
the image stitching module is configured to learn the deformation rules of image stitching through the low-resolution deformation branch by performing the following:
downsampling the warped images to reduce their resolution, obtaining downsampled images;
inputting the downsampled images into an encoder-decoder structure consisting of pooling layers and deconvolution layers;
deforming the downsampled images through the skip connections of the encoder-decoder structure, obtaining a content mask and a seam mask of the downsampled images, the content mask being used to constrain the features of the reconstructed image to stay close to the warped images, the seam mask being used to constrain the edge of the overlap region to be natural and continuous, the overlap region being the region of overlap between the two fisheye images;
the high-resolution refinement branch consists of convolution layers and residual blocks, each residual block consisting in sequence of a convolution layer, a ReLU activation function, a convolution layer, a summation, and a ReLU activation function;
the image stitching module is configured to calculate the content loss and seam loss of the warped images under the high-resolution refinement branch, and to refine the stitched image by combining the low-resolution content mask and seam mask of the warped images, obtaining the stitched image of the two warped fisheye images.
Preferably, the image rectangling module is configured to perform the following:
taking the stitched image and its corresponding mask as input, and extracting high-level semantic features from the input through a feature extractor to obtain a feature map, the feature extractor consisting of stacked convolution-pooling blocks;
predicting the horizontal and vertical motion of each vertex in the feature map with respect to a regular grid through a grid motion regressor, the grid motion regressor having a fully convolutional structure;
constructing two regressors with the same structure based on a residual progressive regression strategy, namely a first regressor and a second regressor, the first regressor progressively predicting the primary grid motion and the second regressor progressively predicting the residual grid motion;
predicting the primary grid motion through the first regressor and performing grid deformation on the feature map, predicting the residual grid motion through the second regressor and performing grid deformation on the warped feature map, and maintaining the shape of the deformed grid based on intra-grid and inter-grid constraints during grid deformation, obtaining a rectangular feature map.
The binocular fisheye image stitching method of the invention has the following advantages: for the collected fisheye images, the camera parameters are obtained through a camera calibration method and the fisheye images are corrected with these parameters; the corrected images undergo coarse image alignment and image reconstruction based on an unsupervised coarse alignment method and an unsupervised image reconstruction method to obtain a stitched image; and the stitched image is rectangled, yielding a stitched image with a good stitching effect and a highly rectangular boundary, suitable for user viewing and for subsequent image processing.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are needed in the embodiments or the description of the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and that other drawings can be obtained according to these drawings without inventive effort for a person skilled in the art.
The invention is further described below with reference to the accompanying drawings.
fig. 1 is a flow chart of the binocular fisheye image stitching method of embodiment 1;
fig. 2 is a block diagram of the relationships among the four coordinate systems in the binocular fisheye image stitching method of embodiment 1;
fig. 3 is a table showing the conversion relationships of the four coordinate systems in the binocular fisheye image stitching method of embodiment 1;
fig. 4 is a functional block diagram of image stitching in the binocular fisheye image stitching method of embodiment 1;
fig. 5 is a functional block diagram of image rectangling in the binocular fisheye image stitching method of embodiment 1;
fig. 6 is a schematic diagram of the intra-grid constraint in the binocular fisheye image stitching method of embodiment 1;
fig. 7 is a schematic diagram of the inter-grid constraint in the binocular fisheye image stitching method of embodiment 1.
Detailed Description
The invention will be further described with reference to the accompanying drawings and specific examples, so that those skilled in the art can better understand the invention and implement it, but the examples are not meant to limit the invention, and the technical features of the embodiments of the invention and the examples can be combined with each other without conflict.
The embodiments of the invention provide a binocular fisheye image stitching method and system for solving the technical problems of long stitching time and ghosting in panoramic image stitching.
Example 1:
The invention discloses a binocular fisheye image stitching method comprising the following steps:
S100, image acquisition: simultaneously acquiring two fisheye images through a binocular fisheye lens;
S200, image correction: obtaining the camera parameters through a camera calibration method, and performing distortion correction on each fisheye image based on those parameters to obtain corrected fisheye images;
S300, image stitching: constructing an image stitching framework based on an unsupervised coarse alignment method and an unsupervised image reconstruction method, and performing coarse image alignment and image reconstruction on the corrected fisheye images based on the framework to obtain a stitched image;
S400, image rectangling: predicting an initial grid based on a fully convolutional network and a residual progressive regression strategy, predefining a target grid with a rigid shape, and rectangling the stitched image by warping it from the predicted initial grid to the predefined target grid to obtain a rectangular image.
In step S200 of this embodiment, the camera parameters are obtained through a camera calibration method comprising the following steps:
(1) selecting a point as the origin of a world coordinate system, and constructing the world coordinate system;
(2) for each fisheye image, obtaining the three-dimensional coordinates of each pixel point in the world coordinate system through an image detection algorithm;
(3) converting the three-dimensional coordinates of each pixel point into pixel coordinates through the camera coordinate system and the image-plane coordinate system, after rigid transformation and projection;
(4) for the pixel coordinates of each pixel point, determining the position of the image center and converting to the final pixel coordinate system through translation.
The four coordinate systems involved in this embodiment are shown in fig. 2, and their conversion relationships are shown in fig. 3. The intrinsic (internal reference) matrix in fig. 3 simplifies to

$$K = \begin{bmatrix} f_x & 0 & c_x \\ 0 & f_y & c_y \\ 0 & 0 & 1 \end{bmatrix}$$

where $f_x, f_y$ are the focal lengths in pixels and $(c_x, c_y)$ is the principal point. The overall conversion through the intrinsic matrix of fig. 3, i.e. the single-point undistorted camera imaging model, is then

$$s \begin{bmatrix} u \\ v \\ 1 \end{bmatrix} = K \begin{bmatrix} R & t \end{bmatrix} \begin{bmatrix} X_w \\ Y_w \\ Z_w \\ 1 \end{bmatrix}$$

where $(X_w, Y_w, Z_w)$ is a point in world coordinates, $R$ and $t$ are the extrinsic rotation and translation, and $s$ is a scale factor. The camera parameters, including the intrinsic matrix, the extrinsic matrix, and related information, are obtained through the camera calibration method, and the distortion of the fisheye images is corrected with these parameters.
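As a hedged illustration of this calibration step (not code from the patent), OpenCV's fisheye module can recover the intrinsic matrix K and the distortion coefficients D from a few checkerboard views and then undistort an image; the checkerboard size, file names, and calibration flags below are assumptions:

```python
import cv2
import numpy as np

# Sketch: calibrate a fisheye camera from checkerboard views, then undistort.
# Board size, image paths and flags are illustrative assumptions.

board = (9, 6)                       # inner corners of an assumed checkerboard
objp = np.zeros((1, board[0] * board[1], 3), np.float32)
objp[0, :, :2] = np.mgrid[0:board[0], 0:board[1]].T.reshape(-1, 2)

obj_pts, img_pts = [], []
for name in ["calib_00.png", "calib_01.png"]:     # placeholder paths
    gray = cv2.imread(name, cv2.IMREAD_GRAYSCALE)
    ok, corners = cv2.findChessboardCorners(gray, board)
    if ok:
        obj_pts.append(objp)
        img_pts.append(corners.reshape(1, -1, 2))

K = np.zeros((3, 3))
D = np.zeros((4, 1))
cv2.fisheye.calibrate(obj_pts, img_pts, gray.shape[::-1], K, D,
                      flags=cv2.fisheye.CALIB_RECOMPUTE_EXTRINSIC)

# Distortion correction with the calibrated parameters.
img = cv2.imread("fisheye_left.png")              # placeholder path
map1, map2 = cv2.fisheye.initUndistortRectifyMap(
    K, D, np.eye(3), K, img.shape[1::-1], cv2.CV_16SC2)
undistorted = cv2.remap(img, map1, map2, cv2.INTER_LINEAR)
```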
Step S300 stitches the two corrected fisheye images to obtain a stitched image. In this step an unsupervised deep image stitching framework is constructed, and two stages are executed through it: unsupervised coarse image alignment and unsupervised image reconstruction.
In the first stage, coarse image alignment is performed on the corrected fisheye images through an unsupervised coarse image alignment method, comprising the following steps:
(1) constraining an unsupervised homography network based on ablation, and introducing a stitching-domain transformation layer for reducing image size;
(2) for each corrected fisheye image, extracting its features through a convolutional network, taking the extracted features as input, estimating the image homography through the unsupervised homography network, and obtaining a warped image by warping the corrected image and performing coarse alignment;
(3) putting the warped image into the stitching-domain transformation layer to reduce redundant black pixels in the image, obtaining the final warped image.
The coarse image alignment of this embodiment takes the complete images as input, ensuring that all overlapping areas are included in the input. The warped target image is forced to be close to the reference image without refilling the invalid pixels of the warped image; instead, the content at the positions of the target image's invalid pixels is warped in the reference image. At the same time, a transformation layer is introduced to warp the input images in the stitching-domain space, so that they become roughly aligned with each other in the proposed stitching-domain transformation layer.
To solve the problem of wasted space, the stitching-domain transformation layer is used. The stitching domain is defined as the minimum bounding rectangle of the stitched image, which saves the most space while preserving the integrity of the image content. First, the coordinates of the 4 vertices of the warped target image are calculated by formula (1):

$$\big(x_k^{A},\, y_k^{A}\big) = \big(x_k + \Delta x_k,\; y_k + \Delta y_k\big), \qquad k = 1, \dots, 4 \tag{1}$$

where $(x_k^{A}, y_k^{A})$ and $(x_k, y_k)$ are the $k$-th vertex coordinates of the warped target image $A$ and of the target image respectively, and $\Delta x_k, \Delta y_k$ are the offsets of the $k$-th vertex. Then the size of the warped result is found by formula (2):

$$W^{*} = \max\big(\max_k x_k^{A},\, \max_k x_k^{M}\big) - \min\big(\min_k x_k^{A},\, \min_k x_k^{M}\big), \qquad H^{*} = \max\big(\max_k y_k^{A},\, \max_k y_k^{M}\big) - \min\big(\min_k y_k^{A},\, \min_k y_k^{M}\big) \tag{2}$$

where $(x_k^{M}, y_k^{M})$ are the vertex coordinates of the reference image $M$, which have the same values as the vertex coordinates of the unwarped target image. The stitching domain is thus set to size $H^{*} \times W^{*}$. In this way the input images are transformed into the stitching-domain space, effectively reducing the space occupied by the feature maps in the subsequent reconstruction network.
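A minimal NumPy sketch of formulas (1) and (2), with invented vertex offsets standing in for the network's predictions:

```python
import numpy as np

# Formula (1): move each target-image corner by its predicted offset.
# Formula (2): the stitching domain H* x W* is the bounding box over the
# warped-target corners and the reference corners. Offsets are illustrative.

H, W = 512, 512
corners = np.array([[0, 0], [W, 0], [0, H], [W, H]], dtype=np.float32)
offsets = np.array([[-20, -10], [35, -5], [-15, 25], [30, 18]],
                   dtype=np.float32)            # stand-in for the predicted
                                                # per-vertex (dx_k, dy_k)

warped = corners + offsets                      # formula (1)

# Reference corners coincide with the unwarped target corners.
all_pts = np.vstack([warped, corners])
W_star = all_pts[:, 0].max() - all_pts[:, 0].min()   # formula (2)
H_star = all_pts[:, 1].max() - all_pts[:, 1].min()
print(f"stitching domain: {H_star:.0f} x {W_star:.0f}")
```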
In the second stage, a ghosting effect often remains in the stitched result, since a homography matrix cannot account for all the arrangements of different depth levels. Pixel-level misalignment can, however, be eliminated to some extent at the feature level, so the warped images are used to reconstruct the stitched image from features to pixels. When image reconstruction is performed on the corrected fisheye images, feature-to-pixel ghosting is eliminated through an unsupervised reconstruction network; the reconstruction network comprises a low-resolution deformation branch and a high-resolution refinement branch, the deformation rules of image stitching being learned through the low-resolution deformation branch and the resolution of the stitched image being improved through the high-resolution refinement branch.
Learning the deformation rules of image stitching through the low-resolution deformation branch comprises the following operations:
(1) downsampling the warped images to reduce their resolution, obtaining downsampled images;
(2) inputting the downsampled images into an encoder-decoder structure consisting of pooling layers and deconvolution layers;
(3) deforming the downsampled images through the skip connections of the encoder-decoder structure, obtaining a content mask and a seam mask of the downsampled images, the content mask being used to constrain the features of the reconstructed image to stay close to the warped images, the seam mask being used to constrain the edge of the overlap region to be natural and continuous, the overlap region being the region of overlap between the two fisheye images.
As a specific implementation, the warped images are first downsampled to a low resolution defined as 256×256, and the stitched image is then reconstructed using an encoder-decoder network consisting of 3 pooling layers and 3 deconvolution layers.
The high-resolution branch is designed to refine the stitched image. This branch consists entirely of convolution layers; specifically, it is composed of three independent convolution layers and eight residual blocks, each residual block consisting in sequence of a convolution layer, a ReLU activation function, a convolution layer, a summation, and a ReLU activation function. The number of filters in every layer is set to 64, except for the last layer, whose filter number is set to 3. The specific stitching flow is shown in fig. 4.
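A minimal PyTorch sketch of such a refinement branch; the 3×3 kernel size and the exact placement of the three standalone convolutions are assumptions, since the text only fixes the block structure and filter counts:

```python
import torch
import torch.nn as nn

# Refinement-branch sketch: residual blocks of the described form
# conv-ReLU-conv-add-ReLU, 64 filters everywhere except the 3-channel
# output layer. Kernel sizes and layer placement are assumptions.

class ResBlock(nn.Module):
    def __init__(self, ch=64):
        super().__init__()
        self.conv1 = nn.Conv2d(ch, ch, 3, padding=1)
        self.conv2 = nn.Conv2d(ch, ch, 3, padding=1)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        y = self.conv2(self.relu(self.conv1(x)))
        return self.relu(x + y)          # summation, then ReLU

class RefineBranch(nn.Module):
    def __init__(self):
        super().__init__()
        self.head = nn.Conv2d(3, 64, 3, padding=1)
        self.body = nn.Sequential(*[ResBlock(64) for _ in range(8)])
        self.mid = nn.Conv2d(64, 64, 3, padding=1)
        self.tail = nn.Conv2d(64, 3, 3, padding=1)   # last layer: 3 filters

    def forward(self, x):
        return self.tail(self.mid(self.body(self.head(x))))

out = RefineBranch()(torch.randn(1, 3, 512, 512))    # full-resolution input
```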
During image reconstruction, the content loss and seam loss of the warped images are calculated under the high-resolution refinement branch, and the stitched image is refined by combining the low-resolution content mask and seam mask of the warped images, obtaining the stitched image of the two warped fisheye images.
Step S400 rectangles the stitched image, comprising the following steps:
(1) taking the stitched image and its corresponding mask as input, and extracting high-level semantic features from the input through a feature extractor to obtain a feature map, the feature extractor consisting of stacked convolution-pooling blocks;
(2) predicting the horizontal and vertical motion of each vertex in the feature map with respect to a regular grid through a grid motion regressor, the grid motion regressor having a fully convolutional structure;
(3) constructing two regressors with the same structure based on a residual progressive regression strategy, namely a first regressor and a second regressor, the first regressor progressively predicting the primary grid motion and the second regressor progressively predicting the residual grid motion;
(4) predicting the primary grid motion through the first regressor and performing grid deformation on the feature map, predicting the residual grid motion through the second regressor and performing grid deformation on the warped feature map, and maintaining the shape of the deformed grid based on intra-grid and inter-grid constraints during grid deformation, obtaining a rectangular feature map.
As a specific implementation of the rectangling, the stitched image and its mask are used as inputs, and a predefined feature extractor built from stacked simple convolution blocks extracts high-level semantic features from the input. After feature extraction, an adaptive pooling layer fixes the resolution of the feature map. Subsequently, a fully convolutional structure is designed as the grid motion regressor to predict the horizontal and vertical motion of each vertex with respect to a regular grid. Assuming the grid resolution is U×V, the size of the output volume is (U+1)×(V+1)×2. Accurate grid motion is estimated progressively through the residual progressive regression strategy, taking the warped result as a renewed input to the network. The warped image is not used as the direct input of the second regressor, which also consists of a fully convolutional structure, because this would double the computational complexity; instead, warping the intermediate feature map improves performance with only a slight increase in computation. Then the initial grid motion and the residual grid motion are predicted by two regressors with the same structure. Although they share the same structure, they are assigned different tasks because their input features differ. Finally, a rigid target grid is predefined, so that only an initial grid needs to be estimated to form the grid deformation, helping to form a compact single-stage solution.
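A minimal PyTorch sketch of the fully convolutional mesh-motion regressor and the two-regressor residual scheme; the channel widths are assumptions, and the warping of the intermediate feature map between the two passes is only indicated by a comment:

```python
import torch
import torch.nn as nn

# Mesh-motion regressor sketch: for a U x V grid it outputs a
# (U+1) x (V+1) x 2 volume of per-vertex horizontal/vertical motions.
# Channel widths are assumptions; the text fixes only the output shape
# and the two-regressor residual scheme.

U, V = 8, 6

class MeshRegressor(nn.Module):
    def __init__(self, in_ch=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, 128, 3, padding=1), nn.ReLU(inplace=True),
            nn.AdaptiveAvgPool2d((U + 1, V + 1)),   # fix spatial size
            nn.Conv2d(128, 2, 1))                   # 2 = (dx, dy) per vertex

    def forward(self, feat):
        return self.net(feat).permute(0, 2, 3, 1)   # (B, U+1, V+1, 2)

feat = torch.randn(1, 256, 32, 32)                  # extracted feature map
primary = MeshRegressor()(feat)                     # primary grid motion
# Here the intermediate feature map would be warped by `primary`; a second,
# identically structured regressor then predicts the residual motion.
residual = MeshRegressor()(feat)                    # stand-in: unwarped input
total_motion = primary + residual
```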
To prevent content distortion in the rectangular image, the predicted grid should not be exaggeratedly deformed. Therefore, intra-grid constraints and inter-grid constraints are designed to maintain the shape of the deformed grid. The intra-grid constraint is imposed on the size and orientation of the grid edges within the grid: as shown in fig. 6, the horizontal projection $\langle \vec{e}, \vec{i} \rangle$ of each horizontal edge $\vec{e}$ is encouraged to be greater than a threshold (taken here as $W/U$, assuming the stitched image has resolution $H \times W$ and the grid resolution is $U \times V$), and this constraint is described using the penalty $P$ of formula (3):

$$P(\vec{e}) = \max\big(0,\; W/U - \langle \vec{e}, \vec{i} \rangle\big) \tag{3}$$

where $\vec{i}$ is the horizontal unit vector pointing right; the vertical edges in each grid are constrained analogously in the vertical projection direction with threshold $H/V$. At the same time, an inter-grid constraint is employed to encourage neighboring grids to transform consistently: as shown in fig. 7, two consecutive deformed grid edges $\vec{e}_1$ and $\vec{e}_2$ are encouraged to be collinear.
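A minimal PyTorch sketch of the two constraints; the mesh layout convention and the equal weighting of the two terms are assumptions:

```python
import torch

# Intra-grid term (formula (3)): push the horizontal projection <e, i> of
# every horizontal mesh edge above the threshold W/U with a hinge penalty.
# Inter-grid term: push consecutive edges toward collinearity.

def intra_grid_loss(vertices, W, U):
    # vertices: (B, U+1, V+1, 2) deformed mesh vertex positions; dim 1 is
    # taken as the horizontal direction by convention here.
    e_hor = vertices[:, 1:, :, :] - vertices[:, :-1, :, :]  # horizontal edges
    proj = e_hor[..., 0]                     # <e, i> with i = (1, 0)
    return torch.relu(W / U - proj).mean()   # penalty P of formula (3);
                                             # vertical edges: analogous

def inter_grid_loss(vertices):
    # Penalise 1 - cos(angle) between two consecutive deformed edges.
    e = vertices[:, 1:, :, :] - vertices[:, :-1, :, :]
    e1, e2 = e[:, :-1, :, :], e[:, 1:, :, :]
    cos = torch.cosine_similarity(e1, e2, dim=-1)
    return (1.0 - cos).mean()

mesh = torch.randn(1, 9, 7, 2) * 10          # toy deformed mesh (U=8, V=6)
loss = intra_grid_loss(mesh, W=512, U=8) + inter_grid_loss(mesh)
```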
Example 2:
The invention discloses a binocular fisheye image stitching system comprising an image acquisition module, an image correction module, an image stitching module, and an image rectangling module; the system performs the method disclosed in embodiment 1 to stitch fisheye images.
The image acquisition module is used to simultaneously acquire two fisheye images through the binocular fisheye lens.
The image correction module is used to obtain the camera parameters through a camera calibration method and to perform distortion correction on each fisheye image based on those parameters, obtaining corrected fisheye images.
In this embodiment, the image correction module is configured to obtain the camera parameters by performing the following:
(1) selecting a point as the origin of a world coordinate system, and constructing the world coordinate system;
(2) for each fisheye image, obtaining the three-dimensional coordinates of each pixel point in the world coordinate system through an image detection algorithm;
(3) converting the three-dimensional coordinates of each pixel point into pixel coordinates through the camera coordinate system and the image-plane coordinate system, after rigid transformation and projection;
(4) for the pixel coordinates of each pixel point, determining the position of the image center and converting to the final pixel coordinate system through translation.
In this embodiment, the intrinsic (internal reference) matrix in the four-coordinate-system conversion relationship simplifies, as in embodiment 1, to

$$K = \begin{bmatrix} f_x & 0 & c_x \\ 0 & f_y & c_y \\ 0 & 0 & 1 \end{bmatrix}$$

and the overall conversion through the intrinsic matrix, i.e. the single-point undistorted camera imaging model, is

$$s \begin{bmatrix} u \\ v \\ 1 \end{bmatrix} = K \begin{bmatrix} R & t \end{bmatrix} \begin{bmatrix} X_w \\ Y_w \\ Z_w \\ 1 \end{bmatrix}$$

The camera parameters comprise the intrinsic matrix, the extrinsic matrix, and related information, and the image correction module is used to correct the distortion of the fisheye images with these parameters.
The image stitching module is used to construct an image stitching framework based on an unsupervised coarse alignment method and an unsupervised image reconstruction method, and to perform coarse image alignment and image reconstruction on the corrected fisheye images based on the framework, obtaining a stitched image.
The image stitching module of this embodiment is used to stitch the two corrected fisheye images into a stitched image. The module constructs an unsupervised deep image stitching framework and executes two stages through it: unsupervised coarse image alignment and unsupervised image reconstruction.
In the first stage, the module is used to perform coarse image alignment on the corrected fisheye images through an unsupervised coarse image alignment method, executing the following steps:
(1) constraining an unsupervised homography network based on ablation, and introducing a stitching-domain transformation layer for reducing image size;
(2) for each corrected fisheye image, extracting its features through a convolutional network, taking the extracted features as input, estimating the image homography through the unsupervised homography network, and obtaining a warped image by warping the corrected image and performing coarse alignment;
(3) putting the warped image into the stitching-domain transformation layer to reduce redundant black pixels in the image, obtaining the final warped image.
The coarse image alignment of this embodiment takes the complete images as input, ensuring that all overlapping areas are included in the input. The warped target image is forced to be close to the reference image without refilling the invalid pixels of the warped image; instead, the content at the positions of the target image's invalid pixels is warped in the reference image. At the same time, a transformation layer is introduced to warp the input images in the stitching-domain space, so that they become roughly aligned with each other in the proposed stitching-domain transformation layer.
To solve the problem of wasted space, the stitching-domain transformation layer is used. The stitching domain is defined as the minimum bounding rectangle of the stitched image, which saves the most space while preserving the integrity of the image content. First, the coordinates of the 4 vertices of the warped target image are calculated by formula (1):

$$\big(x_k^{A},\, y_k^{A}\big) = \big(x_k + \Delta x_k,\; y_k + \Delta y_k\big), \qquad k = 1, \dots, 4 \tag{1}$$

where $(x_k^{A}, y_k^{A})$ and $(x_k, y_k)$ are the $k$-th vertex coordinates of the warped target image $A$ and of the target image respectively, and $\Delta x_k, \Delta y_k$ are the offsets of the $k$-th vertex. Then the size of the warped result is found by formula (2):

$$W^{*} = \max\big(\max_k x_k^{A},\, \max_k x_k^{M}\big) - \min\big(\min_k x_k^{A},\, \min_k x_k^{M}\big), \qquad H^{*} = \max\big(\max_k y_k^{A},\, \max_k y_k^{M}\big) - \min\big(\min_k y_k^{A},\, \min_k y_k^{M}\big) \tag{2}$$

where $(x_k^{M}, y_k^{M})$ are the vertex coordinates of the reference image $M$, which have the same values as the vertex coordinates of the unwarped target image. The stitching domain is thus set to size $H^{*} \times W^{*}$. In this way the input images are transformed into the stitching-domain space, effectively reducing the space occupied by the feature maps in the subsequent reconstruction network.
In the second stage, a ghosting effect often remains in the stitched result, since a homography matrix cannot account for all the arrangements of different depth levels. Pixel-level misalignment can, however, be eliminated to some extent at the feature level, so the warped images are used to reconstruct the stitched image from features to pixels. When image reconstruction is performed on the corrected fisheye images, the image stitching module is used to eliminate feature-to-pixel ghosting through an unsupervised reconstruction network; the reconstruction network comprises a low-resolution deformation branch and a high-resolution refinement branch, the deformation rules of image stitching being learned through the low-resolution deformation branch and the resolution of the stitched image being improved through the high-resolution refinement branch.
The image stitching module is used to learn the deformation rules of image stitching through the low-resolution deformation branch by performing the following:
(1) downsampling the warped images to reduce their resolution, obtaining downsampled images;
(2) inputting the downsampled images into an encoder-decoder structure consisting of pooling layers and deconvolution layers;
(3) deforming the downsampled images through the skip connections of the encoder-decoder structure, obtaining a content mask and a seam mask of the downsampled images, the content mask being used to constrain the features of the reconstructed image to stay close to the warped images, the seam mask being used to constrain the edge of the overlap region to be natural and continuous, the overlap region being the region of overlap between the two fisheye images.
As a specific implementation, the warped images are first downsampled to a low resolution defined as 256×256, and the stitched image is then reconstructed using an encoder-decoder network consisting of 3 pooling layers and 3 deconvolution layers.
The high-resolution branch is designed to refine the stitched image. This branch consists entirely of convolution layers; specifically, it is composed of three independent convolution layers and eight residual blocks, each residual block consisting in sequence of a convolution layer, a ReLU activation function, a convolution layer, a summation, and a ReLU activation function. The number of filters in every layer is set to 64, except for the last layer, whose filter number is set to 3.
When reconstructing the image, the image stitching module is used to calculate the content loss and seam loss of the warped images under the high-resolution refinement branch, and to refine the stitched image by combining the low-resolution content mask and seam mask of the warped images, obtaining the stitched image of the two warped fisheye images.
The image rectangling module is used to predict an initial grid based on a fully convolutional network and a residual progressive regression strategy, to predefine a target grid with a rigid shape, and to rectangle the stitched image by warping it from the predicted initial grid to the predefined target grid, obtaining a rectangular image.
In this embodiment, the image rectangling module is configured to rectangle the stitched image as follows:
(1) taking the stitched image and its corresponding mask as input, and extracting high-level semantic features from the input through a feature extractor to obtain a feature map, the feature extractor consisting of stacked convolution-pooling blocks;
(2) predicting the horizontal and vertical motion of each vertex in the feature map with respect to a regular grid through a grid motion regressor, the grid motion regressor having a fully convolutional structure;
(3) constructing two regressors with the same structure based on a residual progressive regression strategy, namely a first regressor and a second regressor, the first regressor progressively predicting the primary grid motion and the second regressor progressively predicting the residual grid motion;
(4) predicting the primary grid motion through the first regressor and performing grid deformation on the feature map, predicting the residual grid motion through the second regressor and performing grid deformation on the warped feature map, and maintaining the shape of the deformed grid based on intra-grid and inter-grid constraints during grid deformation, obtaining a rectangular feature map.
As a specific implementation, the image rectangling module is configured to perform the following: the stitched image and its mask are taken as input, and a predefined feature extractor built from stacked simple convolution blocks extracts high-level semantic features from the input. After feature extraction, an adaptive pooling layer fixes the resolution of the feature map. Subsequently, a fully convolutional structure is designed as the grid motion regressor to predict the horizontal and vertical motion of each vertex with respect to a regular grid. Assuming the grid resolution is U×V, the size of the output volume is (U+1)×(V+1)×2. Accurate grid motion is estimated progressively through the residual progressive regression strategy, taking the warped result as a renewed input to the network. The warped image is not used as the direct input of the second regressor, which also consists of a fully convolutional structure, because this would double the computational complexity; instead, warping the intermediate feature map improves performance with only a slight increase in computation. Then the initial grid motion and the residual grid motion are predicted by two regressors with the same structure. Although they share the same structure, they are assigned different tasks because their input features differ. Finally, a rigid target grid is predefined, so that only an initial grid needs to be estimated to form the grid deformation, helping to form a compact single-stage solution.
To prevent content distortion in the rectangular image, the predicted grid should not be exaggeratedly deformed. Therefore, intra-grid constraints and inter-grid constraints are designed to maintain the shape of the deformed grid. The intra-grid constraint is imposed on the size and orientation of the grid edges within the grid: as shown in fig. 6, the horizontal projection $\langle \vec{e}, \vec{i} \rangle$ of each horizontal edge $\vec{e}$ is encouraged to be greater than a threshold (taken here as $W/U$, assuming the stitched image has resolution $H \times W$), and this constraint is described using the penalty $P$ of formula (3):

$$P(\vec{e}) = \max\big(0,\; W/U - \langle \vec{e}, \vec{i} \rangle\big) \tag{3}$$

where $\vec{i}$ is the horizontal unit vector pointing right; the vertical edges in each grid are constrained analogously in the vertical projection direction with threshold $H/V$. At the same time, an inter-grid constraint is employed to encourage neighboring grids to transform consistently: as shown in fig. 7, two consecutive deformed grid edges $\vec{e}_1$ and $\vec{e}_2$ are encouraged to be collinear.
While the invention has been illustrated and described in detail in the drawings and the foregoing preferred embodiments, the invention is not limited to the disclosed embodiments; those skilled in the art can derive many further embodiments by combining the means of the various embodiments described above, and these likewise fall within the scope of the invention.

Claims (10)

1. A binocular fisheye image stitching method, characterized by comprising the following steps:
image acquisition: simultaneously acquiring two fisheye images through a binocular fisheye lens;
image correction: obtaining the camera parameters through a camera calibration method, and performing distortion correction on each fisheye image based on those parameters to obtain corrected fisheye images;
image stitching: constructing an image stitching framework based on an unsupervised coarse alignment method and an unsupervised image reconstruction method, and performing coarse image alignment and image reconstruction on the corrected fisheye images based on the framework to obtain a stitched image;
image rectangling: predicting an initial grid based on a fully convolutional network and a residual progressive regression strategy, predefining a target grid with a rigid shape, and rectangling the stitched image by warping it from the predicted initial grid to the predefined target grid to obtain a rectangular image.
2. The binocular fisheye image stitching method according to claim 1, wherein the camera parameters are obtained through a camera calibration method comprising the following steps:
selecting a point as the origin of a world coordinate system, and constructing the world coordinate system;
for each fisheye image, obtaining the three-dimensional coordinates of each pixel point in the world coordinate system through an image detection algorithm;
converting the three-dimensional coordinates of each pixel point into pixel coordinates through the camera coordinate system and the image-plane coordinate system, after rigid transformation and projection;
for the pixel coordinates of each pixel point, determining the position of the image center and converting to the final pixel coordinate system through translation.
3. The binocular fisheye image stitching method according to claim 1, wherein the coarse image alignment of the corrected fisheye images based on the image stitching framework comprises the following steps:
constraining an unsupervised homography network based on ablation, and introducing a stitching-domain transformation layer for reducing image size;
for each corrected fisheye image, extracting its features through a convolutional network, taking the extracted features as input, estimating the image homography through the unsupervised homography network, and obtaining a warped image by warping the corrected image and performing coarse alignment;
putting the warped image into the stitching-domain transformation layer to reduce redundant black pixels in the image, obtaining the final warped image.
4. The binocular fisheye image stitching method according to claim 3, wherein, when image reconstruction is performed on the corrected fisheye images based on the image stitching framework, feature-to-pixel ghosting is eliminated through an unsupervised reconstruction network; the reconstruction network comprises a low-resolution deformation branch and a high-resolution refinement branch, the deformation rules of image stitching being learned through the low-resolution deformation branch and the resolution of the stitched image being improved through the high-resolution refinement branch;
learning the deformation rules of image stitching through the low-resolution deformation branch comprises the following operations:
downsampling the warped images to reduce their resolution, obtaining downsampled images;
inputting the downsampled images into an encoder-decoder structure consisting of pooling layers and deconvolution layers;
deforming the downsampled images through the skip connections of the encoder-decoder structure, obtaining a content mask and a seam mask of the downsampled images, the content mask being used to constrain the features of the reconstructed image to stay close to the warped images, the seam mask being used to constrain the edge of the overlap region to be natural and continuous, the overlap region being the region of overlap between the two fisheye images;
the high-resolution refinement branch consists of convolution layers and residual blocks, each residual block consisting in sequence of a convolution layer, a ReLU activation function, a convolution layer, a summation, and a ReLU activation function;
calculating the content loss and seam loss of the warped images under the high-resolution refinement branch, and refining the stitched image by combining the low-resolution content mask and seam mask of the warped images, obtaining the stitched image of the two warped fisheye images.
5. The binocular fisheye image stitching method according to claim 4, wherein the stitched image is rectangled by warping it from the predicted initial grid to the predefined target grid, comprising the following steps:
taking the stitched image and its corresponding mask as input, and extracting high-level semantic features from the input through a feature extractor to obtain a feature map, the feature extractor consisting of stacked convolution-pooling blocks;
predicting the horizontal and vertical motion of each vertex in the feature map with respect to a regular grid through a grid motion regressor, the grid motion regressor having a fully convolutional structure;
constructing two regressors with the same structure based on a residual progressive regression strategy, namely a first regressor and a second regressor, the first regressor progressively predicting the primary grid motion and the second regressor progressively predicting the residual grid motion;
predicting the primary grid motion through the first regressor and performing grid deformation on the feature map, predicting the residual grid motion through the second regressor and performing grid deformation on the warped feature map, and maintaining the shape of the deformed grid based on intra-grid and inter-grid constraints during grid deformation, obtaining a rectangular feature map.
6. A binocular fisheye image stitching system, wherein image stitching is performed by the binocular fisheye image stitching method as claimed in any one of claims 1 to 5, the system comprising:
an image acquisition module, used for simultaneously acquiring two fisheye images through binocular fisheye lenses;
an image correction module, used for obtaining the parameters of the camera through a camera calibration method, and performing distortion correction on each fisheye image based on the parameters of the camera to obtain corrected fisheye images;
an image stitching module, used for constructing an image stitching framework based on an unsupervised coarse alignment method and an unsupervised image reconstruction method, and performing coarse image alignment and image reconstruction on the corrected fisheye images based on the image stitching framework to obtain a stitched image;
and an image squaring module, used for predicting an initial grid based on a fully convolutional network and a residual progressive regression strategy, predefining a target grid with a rigid shape, and squaring the stitched image by warping it from the predicted initial grid to the predefined target grid, to obtain a rectangular image.
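For orientation, the sketch below composes the four claimed modules end to end; every interface name here (capture_pair, undistort, align_and_reconstruct, to_rectangle) is a hypothetical placeholder rather than an API defined by the patent.

    import numpy as np

    def stitch_binocular_fisheye(camera, corrector, stitcher, squarer) -> np.ndarray:
        left, right = camera.capture_pair()                     # image acquisition module
        left = corrector.undistort(left)                        # image correction module
        right = corrector.undistort(right)
        stitched = stitcher.align_and_reconstruct(left, right)  # image stitching module
        return squarer.to_rectangle(stitched)                   # image squaring module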
7. The binocular fisheye image stitching system of claim 6, wherein the image correction module is configured to perform the following to obtain the parameters of the camera through a camera calibration method:
selecting a point as the origin of a world coordinate system, and constructing the world coordinate system;
for the fisheye image, obtaining the three-dimensional coordinates of each pixel point of the fisheye image in the world coordinate system through an image detection algorithm;
converting the three-dimensional coordinates of each pixel point into pixel coordinates through rigid transformation into the camera coordinate system followed by projection onto the image plane coordinate system;
and for the pixel coordinates of each pixel point, determining the position of the image center and converting it, through translation, into the final pixel coordinate system.
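One plausible realization of this calibration-and-correction step is OpenCV's fisheye camera model, sketched below. The patent does not name a library; the checkerboard size, file paths, and calibration flags are assumptions.

    import glob
    import cv2
    import numpy as np

    CHECKERBOARD = (6, 9)  # inner corners of the calibration board (assumed)
    objp = np.zeros((1, CHECKERBOARD[0] * CHECKERBOARD[1], 3), np.float32)
    objp[0, :, :2] = np.mgrid[0:CHECKERBOARD[0], 0:CHECKERBOARD[1]].T.reshape(-1, 2)

    objpoints, imgpoints = [], []
    for path in glob.glob("calib/*.jpg"):  # hypothetical calibration shots
        gray = cv2.cvtColor(cv2.imread(path), cv2.COLOR_BGR2GRAY)
        found, corners = cv2.findChessboardCorners(gray, CHECKERBOARD)
        if found:
            objpoints.append(objp)                       # world coordinates
            imgpoints.append(corners.reshape(1, -1, 2))  # pixel coordinates

    K, D = np.zeros((3, 3)), np.zeros((4, 1))
    rms, K, D, rvecs, tvecs = cv2.fisheye.calibrate(
        objpoints, imgpoints, gray.shape[::-1], K, D,
        flags=cv2.fisheye.CALIB_RECOMPUTE_EXTRINSIC | cv2.fisheye.CALIB_FIX_SKEW)

    # Distortion correction with the recovered intrinsics K and distortion D
    img = cv2.imread("fisheye_left.jpg")  # hypothetical input frame
    map1, map2 = cv2.fisheye.initUndistortRectifyMap(
        K, D, np.eye(3), K, img.shape[1::-1], cv2.CV_16SC2)
    corrected = cv2.remap(img, map1, map2, interpolation=cv2.INTER_LINEAR)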
8. The binocular fisheye image stitching system of claim 6, wherein the image stitching module is configured to perform coarse image alignment of the corrected fisheye images as follows:
constraining an unsupervised homography network on the basis of ablation, and introducing a stitching-domain transformation layer for reducing the image size;
for each corrected fisheye image, extracting its features through a convolutional network, taking the extracted features as input, estimating the homography of the image through the unsupervised homography network, and warping the corrected image for coarse alignment to obtain a warped image;
and feeding the warped image into the stitching-domain transformation layer to reduce the redundant black pixels in the image, obtaining the final warped image.
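The sketch below illustrates the warp-then-trim idea with OpenCV: a homography is recovered from four predicted corner offsets (stand-ins for the homography network's output), the image is warped for coarse alignment, and the redundant black border is cropped, which is the role the claim assigns to the stitching-domain transformation layer. The input image and offsets are placeholder assumptions.

    import cv2
    import numpy as np

    def warp_and_trim(img: np.ndarray, offsets: np.ndarray) -> np.ndarray:
        h, w = img.shape[:2]
        src = np.float32([[0, 0], [w, 0], [w, h], [0, h]])
        dst = src + offsets.astype(np.float32)     # predicted corner motions
        H = cv2.getPerspectiveTransform(src, dst)  # homography from 4 points

        # Warp onto a canvas large enough to hold the displaced corners
        canvas = (int(np.ceil(dst[:, 0].max())), int(np.ceil(dst[:, 1].max())))
        warped = cv2.warpPerspective(img, H, canvas)

        # Trim: crop to the bounding box of valid pixels, removing the
        # redundant black region introduced by the warp
        mask = cv2.warpPerspective(np.full((h, w), 255, np.uint8), H, canvas)
        ys, xs = np.nonzero(mask)
        return warped[ys.min():ys.max() + 1, xs.min():xs.max() + 1]

    img = cv2.imread("corrected_right.jpg")  # hypothetical corrected image
    offsets = np.float32([[5, 3], [-4, 6], [2, -5], [6, 4]])  # assumed values
    aligned = warp_and_trim(img, offsets)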
9. The binocular fisheye image stitching system of claim 8, wherein, when image reconstruction is performed on the corrected fisheye images, the image stitching module is used for eliminating ghosting from the feature level to the pixel level through an unsupervised reconstruction network, the reconstruction network comprising a low-resolution deformation branch network and a high-resolution refinement branch network, the low-resolution deformation branch network being used for learning the deformation rules of image stitching, and the high-resolution refinement branch network being used for improving the resolution of the stitched image;
the image stitching module is used for learning the deformation rules of image stitching through the low-resolution deformation branch network by executing the following operations:
downsampling the warped image to reduce its resolution, obtaining a downsampled image;
inputting the downsampled image into an encoder-decoder structure consisting of pooling layers and deconvolution layers;
deforming the downsampled image through skip connections in the encoder-decoder structure to obtain a content mask and a seam mask of the downsampled image, wherein the content mask constrains the features of the reconstructed image to stay close to the warped image, the seam mask constrains the edges of the overlapping region to be natural and continuous, and the overlapping region is the region of overlap between the two fisheye images;
the high-resolution refinement branch network consists of convolution layers and residual blocks, wherein each residual block consists, in sequence, of a convolution layer, a ReLU activation function, a convolution layer, a summation with the block input, and a ReLU activation function;
the image stitching module is used for calculating the content loss and the seam loss of the warped image under the high-resolution refinement branch network, and refining the stitched image in combination with the low-resolution content mask and seam mask of the warped image, to obtain the stitched image of the two warped fisheye images.
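The patent does not spell out the loss formulas, so the following is only one plausible mask-weighted formulation of the content and seam losses, with L1 terms and the seam weight taken as assumptions.

    import torch
    import torch.nn.functional as F

    def reconstruction_loss(recon, warp1, warp2, content_mask, seam_mask,
                            lambda_seam: float = 2.0):
        """Content term: keep the reconstruction close to each warped input
        where its content mask is active. Seam term: enforce continuity of
        the reconstruction with both inputs along the seam region."""
        content = F.l1_loss(recon * content_mask, warp1 * content_mask) + \
                  F.l1_loss(recon * (1 - content_mask), warp2 * (1 - content_mask))
        seam = F.l1_loss(recon * seam_mask, warp1 * seam_mask) + \
               F.l1_loss(recon * seam_mask, warp2 * seam_mask)
        return content + lambda_seam * seam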
10. The binocular fisheye image stitching system of claim 9, wherein the image squaring module is configured to perform the following:
taking the stitched image and its corresponding mask as input, and extracting high-level semantic features from the input through a feature extractor to obtain a feature map, wherein the feature extractor consists of stacked convolution-pooling blocks;
predicting the horizontal and vertical motion of each vertex of a regular grid over the feature map through a grid motion regressor, wherein the grid motion regressor has a fully convolutional structure;
constructing two structurally identical regressors based on a residual regression strategy, namely a first regressor and a second regressor, wherein the first regressor progressively predicts the primary grid motion and the second regressor progressively predicts the residual grid motion;
and predicting the primary grid motion through the first regressor and performing grid deformation on the feature map, predicting the residual grid motion through the second regressor and performing grid deformation on the warped feature map, while maintaining the shape of the deformed grid based on intra-grid and inter-grid constraints during deformation, to obtain a rectangular feature map.
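A hedged sketch of the intra-grid and inter-grid shape-preserving terms follows, in the spirit of mesh-based rectangling: the intra-grid term keeps each cell's edges axis-aligned, and the inter-grid term keeps consecutive edges along a grid line collinear. The exact definitions and weights in the patent are unspecified, so these are assumptions.

    import torch

    def grid_constraints(verts: torch.Tensor):
        """verts: (B, H + 1, W + 1, 2) deformed grid vertex coordinates."""
        horiz = verts[:, :, 1:] - verts[:, :, :-1]  # edges along x
        vert = verts[:, 1:, :] - verts[:, :-1, :]   # edges along y
        # intra-grid: horizontal edges should have ~zero y-component,
        # vertical edges ~zero x-component
        intra = horiz[..., 1].abs().mean() + vert[..., 0].abs().mean()
        # inter-grid: consecutive edges on a grid line should be collinear,
        # i.e. second differences of the vertices vanish
        inter = (horiz[:, :, 1:] - horiz[:, :, :-1]).abs().mean() + \
                (vert[:, 1:, :] - vert[:, :-1, :]).abs().mean()
        return intra, inter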
CN202310855895.6A 2023-07-13 2023-07-13 Binocular fisheye image stitching method and system Pending CN116912091A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310855895.6A CN116912091A (en) 2023-07-13 2023-07-13 Binocular fisheye image stitching method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310855895.6A CN116912091A (en) 2023-07-13 2023-07-13 Binocular fisheye image stitching method and system

Publications (1)

Publication Number Publication Date
CN116912091A 2023-10-20

Family

ID=88359653

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310855895.6A Pending CN116912091A (en) 2023-07-13 2023-07-13 Binocular fisheye image stitching method and system

Country Status (1)

Country Link
CN (1) CN116912091A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117291809A (en) * 2023-11-27 2023-12-26 山东大学 Integrated circuit image stitching method and system based on deep learning
CN117291809B (en) * 2023-11-27 2024-03-15 山东大学 Integrated circuit image stitching method and system based on deep learning

Similar Documents

Publication Publication Date Title
CN112767289B (en) Image fusion method, device, medium and electronic equipment
JP4620607B2 (en) Image processing device
DE102018120304A1 (en) Method and system for image distortion correction for images taken by using a wide-angle lens
CN110827200A (en) Image super-resolution reconstruction method, image super-resolution reconstruction device and mobile terminal
CN107451952B (en) Splicing and fusing method, equipment and system for panoramic video
CN101710932A (en) Image stitching method and device
CN116912091A (en) Binocular fisheye image stitching method and system
CN107895349B (en) Endoscope video deblurring method based on synthesis
JP2009290827A (en) Image processing apparatus, and image processing method
Sheng et al. Depth-aware motion deblurring using loopy belief propagation
CN111553841B (en) Real-time video splicing method based on optimal suture line updating
An et al. Single-shot high dynamic range imaging via deep convolutional neural network
Liu et al. A single frame and multi-frame joint network for 360-degree panorama video super-resolution
CN111866523B (en) Panoramic video synthesis method and device, electronic equipment and computer storage medium
Dai et al. Image super-resolution via residual block attention networks
CN115115516B (en) Real world video super-resolution construction method based on Raw domain
CN116612015A (en) Model training method, image mole pattern removing method and device and electronic equipment
Song et al. Real-scene reflection removal with raw-rgb image pairs
CN114581355A (en) Method, terminal and electronic device for reconstructing HDR image
Vien et al. Bidirectional motion estimation with cyclic cost volume for high dynamic range imaging
CN116403226A (en) Unconstrained fold document image correction method, system, equipment and storage medium
Xiong et al. Hierarchical fusion for practical ghost-free high dynamic range imaging
CN116596815A (en) Image stitching method based on multi-stage alignment network
Shen et al. Over-exposure image correction with automatic texture synthesis
CN115035013A (en) Image processing method, image processing apparatus, terminal, and readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination