CN115631094A - Unmanned aerial vehicle real-time image splicing method based on spherical correction

Unmanned aerial vehicle real-time image splicing method based on spherical correction

Info

Publication number
CN115631094A
CN115631094A (application CN202211400858.8A)
Authority
CN
China
Prior art keywords: image, pic, gpic, frame, feature
Prior art date
Legal status
Pending
Application number
CN202211400858.8A
Other languages
Chinese (zh)
Inventor
曾国奇
牛子凡
范峥
郑丽丽
Current Assignee
Beihang University
Original Assignee
Beihang University
Priority date
Filing date
Publication date
Application filed by Beihang University
Priority to CN202211400858.8A
Publication of CN115631094A


Classifications

    • G06T3/4038 Image mosaicing, e.g. composing plane images from plane sub-images
    • G06T5/50 Image enhancement or restoration using two or more images, e.g. averaging or subtraction
    • G06T5/73 Deblurring; Sharpening
    • G06T5/80 Geometric correction
    • G06T7/30 Determination of transform parameters for the alignment of images, i.e. image registration
    • G06V10/462 Salient features, e.g. scale invariant feature transforms [SIFT]
    • G06V10/74 Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/761 Proximity, similarity or dissimilarity measures
    • G06V10/7715 Feature extraction, e.g. by transforming the feature space
    • G06V10/82 Image or video recognition or understanding using neural networks
    • G06T2200/32 Indexing scheme involving image mosaicing
    • G06T2207/10016 Video; Image sequence
    • G06T2207/20084 Artificial neural networks [ANN]
    • G06T2207/20221 Image fusion; Image merging


Abstract

The invention discloses a real-time UAV image stitching method based on spherical correction, which comprises the following steps: (1) to filter out low-quality images, real-time frames are screened and cleaned according to blur level and image content; (2) to describe each image, SIFT feature descriptors are created for it; (3) to match features between images, the best matching features are selected by brute-force traversal, and the feature matches are screened with the RANSAC algorithm; (4) to eliminate perspective-transformation error, a homography matrix is obtained from the feature matches, a spherical correction model under the geometric transformation parameters between the homography matrix and the images is constructed, and a correction sphere is calculated; (5) spherical projection is applied to the input image, and feature matching is performed again on the transformed image, eliminating the perspective-transformation error in the stitching process.

Description

Unmanned aerial vehicle real-time image splicing method based on spherical correction
Technical Field
The invention relates to a method for processing and stitching UAV images at a ground station, and in particular to a real-time UAV image stitching method based on spherical correction.
Background
As small unmanned aerial vehicle (UAV) technology has matured, drones have been widely applied in fields such as surveying and mapping, surveillance, and the military; in industrial inspection, natural-disaster monitoring, and urban security in particular, the UAV image transmission function plays a crucial role. In some situations a UAV must be used to observe the situation in a region through transmitted imagery, and directly viewing the UAV video stream is unsuitable for overall observation and analysis, so image stitching is needed to generate a picture of the whole situation.
A UAV image transmission system generally consists of an unmanned aerial vehicle (UAV), wireless communication equipment, and a ground control station (GCS), as shown in fig. 1. The UAV carries sensors for different applications, such as an image sensor that captures images of the ground detection area; the image information it acquires is received by the GCS through the wireless communication equipment. The GCS applies image enhancement, image stitching, and other processing to the received images, and finally displays the real scene of the detection area on its display equipment.
Traditional UAV image stitching stitches the video data shot during flight offline in the GCS after the UAV has completed its flight mission. Offline stitching currently achieves good results, but it is generally slow and lacks timeliness: stitching only begins after all images have been analyzed, so a situation cannot be observed through stitched imagery online and in real time, and offline stitching therefore cannot be applied to scenarios with strict timeliness requirements. From the perspective of current UAV imagery, there are two main kinds of stitching input: aerial photographs taken by a digital aerial camera, and video sequence images (including visible-light and infrared video). Image stitching is the process of combining a group of overlapping images into a seamless, high-definition, wide-field image through automatic computer registration, geometric correction, image dodging, and other processing, as shown in fig. 2.
With the maturation of 5G communication, the image transmission capability of UAVs has been greatly enhanced: the peak rate of the wireless communication link can reach 10-20 Gbit/s and the air-interface latency is as low as 1 ms, so a UAV can stream stable, high-quality video (i.e., an HTTP data stream) back to a communication base station in real time. Real-time UAV image stitching based on video streams has therefore become an important direction of development. It also faces many challenges: streaming degrades image data quality; the stability of the communication link affects stitching stability and can directly determine whether stitching succeeds; and the UAV's flight state is closely tied to stitching quality. How to maintain both the stability and the timeliness of the stitching algorithm when stitching a video stream in the GCS over 5G communication is thus the technical problem to be solved.
Disclosure of Invention
The invention addresses three technical problems that arise when stitching UAV video-stream images in the UAV ground control station (GCS) over 5G communication: first, high-precision stitching is time-consuming; second, fast, low-cost stitching yields poor image quality; third, the stability of video-stream stitching is poor. The invention therefore provides a real-time UAV image stitching method based on spherical correction. The method optimizes a stitching algorithm based on feature matching and homography transformation through a spherical transformation, so that high-precision UAV image stitching can be completed at low cost, while the time consumed remains low even at high stitching precision. It is a processing method that stitches UAV images directly and in real time on the video stream information (i.e., the HTTP data stream) at the UAV ground control station.
Working on the network video stream transmitted by the UAV in real time, the invention: (1) screens and cleans real-time frames according to blur level and image content to filter out low-quality images; (2) creates SIFT feature descriptors for each image; (3) selects the best matching features by brute-force traversal and screens the feature matches with the RANSAC algorithm; (4) obtains a homography matrix from the feature matches, constructs a spherical correction model under the geometric transformation parameters between the homography matrix and the images, and calculates a correction sphere, eliminating perspective-transformation error; (5) applies spherical projection to the input image and performs feature matching again on the transformed image, eliminating the perspective-transformation error in the stitching process.
The real-time UAV image stitching method based on spherical correction of the invention comprises the following steps:
step one, selecting the first frame image as the reference image;
step two, taking the current image frame after the first frame image as the image to be registered;
step three, blur filtering;
step 31, convolution;
step 32, negative-feedback control by the blur filtering decision;
step 33, judging whether the image frame is the last image frame;
step four, feature extraction based on the SIFT algorithm;
step five, a threshold-defined nearest neighbor distance ratio matching strategy;
step 51, calculating the Euclidean distances between the feature sets of two adjacent image frames;
step 52, calculating the nearest neighbor distance ratio;
step 53, judging image frame feature matching;
step six, a random sample consensus algorithm;
step seven, calculating the radius of the correction sphere;
step 71, calculating the geometric transformation parameters between images;
step 72, calculating the corrected sphere radius;
step eight, spherical projection;
step nine, feature matching;
step 91, extracting the feature sets of two adjacent image frames after the spherical transformation;
step 92, the threshold-defined nearest neighbor distance ratio matching strategy;
step 93, the random sample consensus algorithm;
step ten, homography transformation and weighted-average processing.
The real-time UAV image stitching method based on spherical correction has the following advantages:
(1) High stitching stability: the spherical transformation eliminates the accumulation of homography perspective error, so error accumulation does not occur even over a large stitching range.
(2) Convenient and efficient stitching: images are stitched online while the UAV network video stream is being pulled; there is no need to wait until the flight mission ends to stitch the captured data, nor are the UAV's flight parameters required; stitching runs online in real time as soon as the video stream is connected.
(3) High tolerance to network conditions: multiple data-cleaning stages, tailored to the transmission characteristics of network video streams, filter out low-quality images caused by fluctuations in communication quality.
Drawings
Fig. 1 is a structural diagram of the UAV image transmission system.
Fig. 2 is a flow diagram of a conventional image stitching technique.
Fig. 3 is a flow chart of the real-time UAV image stitching method based on spherical correction.
Fig. 4 is a schematic diagram of the homography-transformation stitching model in the method of the invention.
Fig. 5 is a schematic diagram of the spherical correction model in the method of the invention.
Fig. 6 is a schematic diagram of the spherical projection in the method of the invention.
Fig. 7A is an image mosaic produced by homography transformation under low distortion.
Fig. 7B is an image mosaic after spherical correction with the method of the invention.
Fig. 8A is an image mosaic produced by homography transformation under high distortion.
Fig. 8B is an image mosaic after spherical correction with the method of the invention.
Fig. 9A shows the reprojection error of homography-transformation image stitching.
Fig. 9B shows the reprojection error after spherical correction in the method of the invention.
Detailed Description
The present invention is described in further detail below with reference to the drawings and an embodiment.
Referring to fig. 1, the unmanned aerial vehicle (UAV) uses an image sensor (e.g., a camera) to acquire video data of the detection area. The UAV pushes an RTMP data stream to the communication base station and a cloud server in real time using the RTMP protocol, and the UAV ground control station (GCS) receives the HTTP data stream forwarded by the cloud server.
In the invention, software in the GCS pulls the HTTP data stream with a Python program. Each image frame obtained by the Python program is denoted pic. The image processor in the GCS stitches the successive image frames pic. The invention is an improved method for the image pre-processing and image registration stages of fig. 2.
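As an illustration, a minimal sketch of pulling an HTTP video stream frame by frame in Python with the opencv library follows; the stream URL is a hypothetical placeholder, and this is not the patent's own code:

import cv2

STREAM_URL = "http://cloud-server.example/live/uav"  # hypothetical forwarded HTTP stream

def pull_frames(url):
    cap = cv2.VideoCapture(url)      # open the HTTP data stream
    while True:
        ok, pic = cap.read()         # pic plays the role of one image frame pic_i
        if not ok:                   # stream ended or communication link dropped
            break
        yield pic
    cap.release()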
In the invention, an HTTP data stream is denoted PIC, with PIC = {pic_1, pic_2, …, pic_{i-1}, pic_i, pic_{i+1}, …, pic_η}, in which:
pic_1 denotes the 1st image frame in the data stream;
pic_2 denotes the 2nd image frame in the data stream;
pic_i denotes the i-th image frame in the data stream;
pic_{i-1} denotes the image frame immediately before pic_i in the data stream, referred to as the previous image frame;
pic_{i+1} denotes the image frame immediately after pic_i in the data stream, referred to as the next image frame;
pic_η denotes the last image frame in the data stream.
For convenience of explanation, pic_i is also called the current image frame. The subscript i is the identification number of an image frame in the data stream, and the subscript η is the total number of image frames in the data stream.
In the invention, a pixel position of the current image frame pic_i is written pic_i(x, y), where x is the pixel abscissa and y is the pixel ordinate. Likewise, a pixel position of the previous image frame pic_{i-1} is written pic_{i-1}(x, y); of the next image frame pic_{i+1}, pic_{i+1}(x, y); and of the image frame pic_η, pic_η(x, y).
Blur filtering condition FS
For the current image frame pic_i, the pixel values are convolved with the Laplacian operator, and the resulting pixel-Laplacian convolution sum is denoted S_i. For the previous image frame pic_{i-1}, the corresponding pixel-Laplacian convolution sum is denoted S_{i-1}.
In the invention, for the convolution of image pixels with the Laplacian operator, reference is made to the improved single-image deblurring algorithm with a hyper-Laplacian constraint published in Journal of Chinese Computer Systems (小型微型计算机系统), Vol. 39, No. 5, 2018.
In the invention, the change rate between two adjacent image frames is calculated as
ΔS_i = |S_i - S_{i-1}| / S_{i-1}.
In the invention, the blur filtering condition is denoted FS, with FS = 0.4. The value 0.4 is optimal: with it, blur filtering identifies blurred images sensitively while keeping image stitching stable and not easily disturbed by changes in image content.
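For illustration, a minimal Python sketch of this blur check follows; since the original formula is rendered only as an image in the published text, the relative-change form given above is the assumption used here:

import cv2
import numpy as np

FS = 0.4  # blur filtering condition FS

def laplacian_sum(pic):
    gray = cv2.cvtColor(pic, cv2.COLOR_BGR2GRAY)
    # pixel-Laplacian convolution sum S_i of the frame
    return float(np.abs(cv2.Laplacian(gray, cv2.CV_64F)).sum())

def keep_frame(s_curr, s_prev):
    # change rate between adjacent frames; retain the frame when it stays within FS
    return abs(s_curr - s_prev) / s_prev <= FS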
The flow of the UAV image stitching technique of the invention is shown in fig. 3. The image processor in the UAV ground control station (GCS) stitches images one by one, from the first frame to the last, in the order of the image frames in the HTTP data stream, producing a panoramic UAV image. The specific steps are as follows:
Step one, selecting the first frame image as the reference image;
In the invention, the image processor first reads the first frame image in the HTTP data stream PIC = {pic_1, pic_2, …, pic_{i-1}, pic_i, pic_{i+1}, …, pic_η}, i.e., the 1st image frame pic_1, and takes pic_1 as the reference image; then step two is executed.
In the invention, taking the 1st image frame pic_1 as the reference image determines the start position of the image stitching. The start position may be the upper-left corner of the panorama, or any other position in the panorama.
In the invention, the HTTP data stream without the first frame image is denoted as the image set to be registered PIC_pending, with PIC_pending = {pic_2, …, pic_{i-1}, pic_i, pic_{i+1}, …, pic_η}.
Step two, taking the current image frame after the first frame image as the image to be registered;
In the invention, the current image frame pic_i is read from the image set to be registered PIC_pending = {pic_2, …, pic_{i-1}, pic_i, pic_{i+1}, …, pic_η} and taken as the image to be registered; then step three is executed.
Step three, blur filtering;
In the invention, to filter out low-quality images, real-time frames are screened and cleaned according to blur level and image content. This eliminates invalid images, reduces the time consumed by stitching, and is one of the means by which high-precision UAV image stitching is completed at low cost.
Step 31, convolution processing;
for the current image frame pic i Convolution processing of pixel points and Laplace operators is carried out to obtain pic i The pixel point of (a) is-Laplace-convolution sum, and is recorded as
Figure BDA0003934888710000061
Step 32, negative feedback control of fuzzy filtering judgment;
to pic i Performing a fuzzy filtering calculation, i.e.
Figure BDA0003934888710000062
Then judging by adopting a fuzzy filtering condition FS
Figure BDA0003934888710000063
Whether filtration is required;
Figure BDA0003934888710000064
representing pic of a previous image frame i-1 And performing convolution processing on the pixel points and the Laplace operator to obtain a Laplace convolution sum of the pixel points.
If it is
Figure BDA0003934888710000065
The current image frame pic is retained i And executing the step four;
if it is
Figure BDA0003934888710000066
Discarding current image frame pic i And selecting relay pic i The subsequent picture frame, i.e. pic i+1 And then executing the step two.
Step 33, judging whether the image frame is the last image frame;
repeating steps 31-32 until PIC is completed To be treated The last frame of image in (1), i.e. pic η
For the last frame image pic η Convolution processing of pixel points and Laplace operators is carried out to obtain pic η The pixel point of (a) is-Laplace-convolution sum, and is recorded as
Figure BDA0003934888710000067
To pic η Performing fuzzy filtering calculation
Figure BDA0003934888710000068
Then judging by adopting a fuzzy filtering condition FS
Figure BDA0003934888710000069
Whether filtration is required;
Figure BDA00039348887100000610
representing pic of images for the eta-1 th frame η-1 And performing convolution processing on the pixel points and the Laplace operator to obtain a Laplace convolution sum of the pixel points.
If it is
Figure BDA00039348887100000611
The last frame image pic is retained η And executing the step four;
if it is
Figure BDA00039348887100000612
Discarding the last frame image pic η And (4) finishing the image splicing task.
In the present invention, PIC To be treated ={pic 2 ,…,pic i-1 ,pic i ,pic i+1 ,…,pic η The coarse-mosaic image obtained after the fuzzy filtering treatment is denoted as PIC Coarse And is and
Figure BDA00039348887100000613
Figure BDA00039348887100000614
representing image frames pic 2 And (5) carrying out fuzzy filtering processing on the image frames.
Figure BDA00039348887100000615
Representing image frames pic i And (5) carrying out fuzzy filtering processing on the image frames.
Figure BDA0003934888710000071
Representing image frames pic i-1 And (5) carrying out fuzzy filtering processing on the image frames.
Figure BDA0003934888710000072
Representing image frames pic i+1 And (5) carrying out fuzzy filtering processing on the image frames.
Figure BDA0003934888710000073
Representing image frames pic η And (5) carrying out fuzzy filtering processing on the image frames.
In the present invention, fuzzy filtering conditions are utilized
Figure BDA0003934888710000074
The negative feedback control of image splicing is carried out by adding a fuzzy filtering condition FS, filtering out fuzzy images generated due to camera shake carried by an unmanned aerial vehicle platform, network flow quality reduction and other reasons, and reducing error introduction from the source, so that the method can adapt to more complicated,A bad network situation. In addition, whether the current frame image is clear or not is judged by calculating the change of the convolution values of the image pixel points and the Laplacian operator in the fuzzy filtering process, the clear current frame image is reserved, the fuzzy current frame image is abandoned, and the generation of image splicing errors is effectively prevented.
Step four, extracting features based on SIFT algorithm;
in the present invention, for PIC Coarse Performing SIFT feature extraction on each image frame in the image. SIFT feature extraction reference is made to "Automatic Panoramic Image Stitching using investigational Features" published on International Journal of Computer, vol.74, 2007, author, matthew Brown, david G.Lowe.
Image frames using opencv library of Python software
Figure BDA0003934888710000075
Conversion to a grey scale map, denoted gpic i
Creation of attributes belonging to gpic using the SIFT feature creation function in the opencv library i Set of feature points of (1), as
Figure BDA0003934888710000076
The described
Figure BDA0003934888710000077
Simply referred to as the feature set of the current frame image.
Similarly, the image frame
Figure BDA0003934888710000078
Gray scale of (1), denoted as gpic 2 (ii) a Belonging to the genus gpic 2 Set of feature points of (2), denoted as
Figure BDA0003934888710000079
The described
Figure BDA00039348887100000710
Simply referred to as the feature set of the frame 2 image.
Similarly, the image frame
Figure BDA00039348887100000711
Gray scale of (1), denoted as gpic i-1 (ii) a Belonging to the general term gpic i-1 Set of feature points of (1), as
Figure BDA00039348887100000712
The above-mentioned
Figure BDA00039348887100000713
Simply referred to as the feature set of the previous frame image.
Similarly, image frames
Figure BDA00039348887100000714
Gray scale of (1), denoted as gpic i+1 (ii) a Belonging to the genus gpic i+1 Set of feature points of (1), as
Figure BDA00039348887100000715
The above-mentioned
Figure BDA00039348887100000716
Simply referred to as the feature set of the next frame image.
Similarly, the image frame
Figure BDA00039348887100000717
Gray scale of (2), denoted as gpic η (ii) a Belonging to the general term gpic η Set of feature points of (1), as
Figure BDA00039348887100000718
The above-mentioned
Figure BDA00039348887100000719
Simply referred to as the feature set of the last frame image.
In the invention, PIC is carried out on the video stream according to SIFT algorithm Coarse The image frames in the image frame are subjected to feature extraction, and the obtained frame image-gray level-feature set is recorded as
Figure BDA00039348887100000720
And is
Figure BDA00039348887100000721
Then step five is performed.
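A minimal sketch of this step in Python with opencv (assuming opencv-python 4.4 or later, where SIFT_create is available):

import cv2

sift = cv2.SIFT_create()  # the SIFT feature-creation function of the opencv library

def extract_features(fpic):
    gpic = cv2.cvtColor(fpic, cv2.COLOR_BGR2GRAY)               # gray image gpic_i
    keypoints, descriptors = sift.detectAndCompute(gpic, None)  # feature set F_i
    return keypoints, descriptors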
Step five, a nearest neighbor distance ratio matching strategy defined by a threshold value;
in the invention, the Euclidean distance is used as the similarity measurement of the feature points, but a lot of error matching can be introduced by directly calculating the nearest matching feature points, so that the feature matching is carried out by using a nearest distance ratio strategy limited by a threshold value in a novel unmanned aerial vehicle aerial image fast splicing algorithm disclosed in 'computer simulation' at No. 5, volume 39, no. 5 of 2022.
Step 51, calculating Euclidean distances of feature sets of two adjacent image frames;
in the invention, feature sets of two adjacent image frames are used for matching, and the nearest Euclidean distance and the next nearest Euclidean distance of the two adjacent image frames are obtained by traversing feature points;
feature set for previous frame image
Figure BDA0003934888710000081
And feature set of current frame image
Figure BDA0003934888710000082
Carry out matching on
Figure BDA0003934888710000083
And
Figure BDA0003934888710000084
traversing all the feature points, and calculating the nearest Euclidean distance between the feature points during traversal
Figure BDA0003934888710000085
To the next nearest Oldham's distance
Figure BDA0003934888710000086
Step 52, calculating a nearest neighbor distance ratio;
calculating the nearest Euclidean distance
Figure BDA0003934888710000087
To the next nearest Euclidean distance
Figure BDA0003934888710000088
Is recorded as the distance ratio of
Figure BDA0003934888710000089
And is
Figure BDA00039348887100000810
Step 53, judging image frame-feature matching;
when ratio of
Figure BDA00039348887100000811
Less than ratio threshold TT Threshold value Time of flight
Figure BDA00039348887100000812
I.e. the features are considered to match. The matching set after completing the feature matching is recorded as the feature matching of two adjacent image frames
Figure BDA00039348887100000813
When ratio of
Figure BDA00039348887100000814
Greater than or equal to the ratio threshold TT Threshold value Time of flight
Figure BDA00039348887100000815
I.e. feature set ending the previous frame image
Figure BDA00039348887100000816
And feature set of current frame image
Figure BDA00039348887100000817
Is performed.
In the present invention, the ratio threshold is expressed asTT Threshold value And TT Threshold value =0.4. When the ratio threshold is set to 0.4, the feature matching condition can be judged more accurately.
In the same way, can obtain
Figure BDA00039348887100000818
Performing feature set matching on two adjacent image frames to obtain feature matching sets of the two adjacent image frames
Figure BDA00039348887100000819
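A minimal sketch of the threshold-defined ratio test in Python with opencv; the brute-force traversal is delegated to cv2.BFMatcher with the Euclidean (L2) norm:

import cv2

TT = 0.4  # ratio threshold TT

def match_features(desc_prev, desc_curr):
    bf = cv2.BFMatcher(cv2.NORM_L2)                 # brute-force Euclidean matcher
    pairs = bf.knnMatch(desc_prev, desc_curr, k=2)  # nearest and next-nearest neighbours
    matches = []
    for pair in pairs:
        if len(pair) == 2:
            m, n = pair                             # d1_i = m.distance, d2_i = n.distance
            if m.distance / n.distance < TT:        # nearest neighbour distance ratio
                matches.append(m)
    return matches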
Step six, a random sample consistency algorithm;
matching sets in the step five according to a random sample consistency algorithm
Figure BDA0003934888710000091
Screening, eliminating bad matches and obtaining effective matching set
Figure BDA0003934888710000092
And homography matrix
Figure BDA00039348887100000911
The model
Figure BDA0003934888710000093
A three row three column matrix.
In the invention, the random sample consistency algorithm refers to a random sample consistency algorithm in a new unmanned aerial vehicle aerial image fast splicing algorithm which is disclosed on 'computer simulation' in No. 5 of No. 39 of No. 2022 month.
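A minimal sketch of the RANSAC screening in Python with opencv; the reprojection tolerance of 5.0 pixels is an assumed value, not one given in the patent:

import cv2
import numpy as np

def ransac_homography(kp_prev, kp_curr, matches):
    src = np.float32([kp_curr[m.trainIdx].pt for m in matches]).reshape(-1, 1, 2)
    dst = np.float32([kp_prev[m.queryIdx].pt for m in matches]).reshape(-1, 1, 2)
    # homography H_i mapping the current frame onto the previous one, with inlier mask
    H, mask = cv2.findHomography(src, dst, cv2.RANSAC, 5.0)
    effective = [m for m, ok in zip(matches, mask.ravel()) if ok]
    return H, effective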
Step seven, calculating the radius of the correction sphere;
step 71, calculating geometric transformation parameters between images;
in the present invention, the homography transform matrix
Figure BDA0003934888710000094
By translation H of an image sensor (e.g. camera) on the drone Translation Zoom H Zoom Rotation H x rotation ,H y rotation ,H z rotation Miscut H x miscut ,H y miscut Is obtained, therefore, can
Figure BDA0003934888710000095
Expressed as the product of translation-rotation-miscut, i.e. H Translation of ·H x rotation ·H y rotation ·H z rotation ·H Zooming ·H x miscut ·H y miscut
Translation of splice location
Figure BDA0003934888710000096
Amount of scaling of splice location
Figure BDA0003934888710000097
Amount of rotation of splice position about X-axis
Figure BDA0003934888710000098
Amount of rotation of splice position about Y-axis
Figure BDA0003934888710000099
Amount of rotation of splice position about Z-axis
Figure BDA00039348887100000910
Miscut of splice position about X-axis
Figure BDA0003934888710000101
Miscut of splice position about Y-axis
Figure BDA0003934888710000102
X is the pixel value of the image translated in the X-axis direction.
Y is the pixel value of the image shifted in the Y-axis direction.
W is the scale value at which the image is scaled in the x-axis direction.
V is the scale value at which the image is scaled in the y-axis direction.
α, β, γ are rotation angles of the image in x, y, z axis directions, respectively.
φ、
Figure BDA0003934888710000103
The angle of the image is the miscut angle in the x and y directions.
According to Newton method
Figure BDA0003934888710000104
Carrying out iterative solution to obtain an image gpic i Transformation to image gpic i-1 Geometric transformation parameters of
Figure BDA0003934888710000105
And is
Figure BDA0003934888710000106
Thus, it is possible to provide
Figure BDA0003934888710000107
The 9 values of the three rows and three columns and the 9 geometric parameters of step 71 form a set of equations
Figure BDA0003934888710000108
Using Newton method to iteratively solve the equation set to obtain
Figure BDA0003934888710000109
Figure BDA00039348887100001010
The value of (c).
Figure BDA00039348887100001011
As an image gpic i Transformation to image gpic i-1 Pixel values translated in the x-axis direction.
Figure BDA00039348887100001012
As an image gpic i Transformation to image gpic i-1 Pixel values translated in the y-axis direction.
Figure BDA00039348887100001013
As an image gpic i Transformation to image gpic i-1 Scaled in the x-axis direction.
Figure BDA00039348887100001014
As an image gpic i Transformation to image gpic i-1 The scaled scale value in the y-axis direction.
Figure BDA00039348887100001015
As an image gpic i Transformation to image gpic i-1 The angle of rotation in the x-axis direction.
Figure BDA00039348887100001016
As an image gpic i Transformation to image gpic i-1 The angle of rotation in the y-axis direction.
Figure BDA00039348887100001017
As an image gpic i Transformation to image gpic i-1 The angle of rotation in the z-axis direction.
Figure BDA00039348887100001018
As an image gpic i Transformation to image gpic i-1 Miscut angle in the x-axis direction.
Figure BDA00039348887100001019
As an image gpic i Transformation to image gpic i-1 Miscut angle in the y-axis direction.
In the invention, newton's method is referred to the iterative solution method-Newton method of the nonlinear equation set in chapter 4, section 2 of ' numerical analysis ' of 9 months, 4 th edition, yanqingjin, of Beijing university of aerospace, press, 2012.
Similarly, the geometric transformation parameters between the images are calculated for MDD to obtain HMDD, and
Figure BDA0003934888710000111
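A sketch of recovering the nine parameters from H_i by iterative root finding in Python. The factor matrices are the standard textbook forms assumed above (the patent's own matrices are rendered only as images), and the Newton iteration of the cited text is replaced here by scipy's least-squares solver as a stand-in:

import numpy as np
from scipy.optimize import least_squares

def compose(theta):
    X, Y, W, V, a, b, g, phi, psi = theta
    T   = np.array([[1, 0, X], [0, 1, Y], [0, 0, 1]], float)
    Rx  = np.array([[1, 0, 0], [0, np.cos(a), -np.sin(a)], [0, np.sin(a), np.cos(a)]])
    Ry  = np.array([[np.cos(b), 0, np.sin(b)], [0, 1, 0], [-np.sin(b), 0, np.cos(b)]])
    Rz  = np.array([[np.cos(g), -np.sin(g), 0], [np.sin(g), np.cos(g), 0], [0, 0, 1]])
    S   = np.diag([W, V, 1.0])
    Shx = np.array([[1, np.tan(phi), 0], [0, 1, 0], [0, 0, 1]])
    Shy = np.array([[1, 0, 0], [np.tan(psi), 1, 0], [0, 0, 1]])
    return T @ Rx @ Ry @ Rz @ S @ Shx @ Shy

def solve_parameters(H):
    # 9 matrix entries vs 9 parameters; both sides normalized since H is defined up to scale
    residual = lambda th: (compose(th) / compose(th)[2, 2] - H / H[2, 2]).ravel()
    theta0 = np.array([0, 0, 1, 1, 0, 0, 0, 0, 0], float)  # identity-like initial guess
    return least_squares(residual, theta0).x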
Step 72, calculating the corrected sphere radius;
The invention proposes a spherical correction algorithm to relieve the error-accumulation problem of homography transformation. The three-dimensional mosaic model is shown in fig. 4: the mosaic images do not lie in the same plane, so perspective error accumulates under homography transformation. Projecting the mosaic model vertically along the negative z axis gives the top-view two-dimensional diagram of fig. 5. The invention proposes a spherical correction model and introduces a stitching rule: the line on which the first image frame gpic_1 lies is taken as the reference line, denoted L_base; every other image in the subsequent stitching undergoes a spherical projection transformation, the image of the i-th frame gpic_i after the spherical projection transformation is denoted cpic_i, and the sphere radius is denoted r_i, so that the right end point of each transformed image always remains on L_base during registration.
Under this rule, let the included angle between gpic_i and gpic_{i-1} be α_i and their relative displacement be X_i pixels, both solved in step 71. The sphere radius r_i of cpic_i then satisfies a transcendental equation determined by X_i and α_i, and this transcendental equation is solved by an iterative method to calculate the corrected sphere radius r_i.
Similarly, the corrected sphere radius is calculated from every element of HMDD, yielding the radius set RR = {r_2, …, r_{i-1}, r_i, r_{i+1}, …, r_η}.
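Since the transcendental equation itself is not reproduced in the published text, a sketch of the iterative solution can only assume it is available as a callable f(r) built from X_i and α_i, whose root is the corrected sphere radius:

from scipy.optimize import newton

def corrected_radius(f, r0):
    # f: callable f(r) encoding the patent's transcendental equation in r_i (assumed)
    # r0: initial guess for the radius; Newton/secant iteration finds the root
    return newton(f, r0)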
Step eight, spherical projection
In the invention, spherical projection is adopted to carry out spherical projection transformation on the input image, and the transformed image is used for carrying out feature matching again, thus eliminating the transmission transformation error in the splicing process.
For gpic i Is carried out to
Figure BDA0003934888710000118
Is a spherical projection of radius, and will gpic i Projective transformation to cpic i . As shown in FIG. 6, assume that the image gpic i Located at a radius of
Figure BDA0003934888710000121
On a sphere, this time gpic i At any point P 2 Pixel coordinate value of (gx) i ,gy i ,gz i ) The molecular weight distribution of the polymer in (0,
Figure BDA0003934888710000122
) A light source point P is arranged, and a projection point P is obtained by projecting the point P to a plane with z =0 3 I.e. cpic i The pixel coordinate value of any one point of (2) is expressed as (cx) i ,cy i 0), let the projection scale factor be tk i Then, there are:
Figure BDA0003934888710000123
the gpic can be finally obtained i Cpic obtained by spherical projection i The pixel coordinate value of any one point is (tk) i ·gx i ,tk i ·gy i ) Wherein the projection scale factor
Figure BDA0003934888710000124
Similarly, RR pairs PIC are used Coarse Performing spherical projection calculation to obtain image CPIC subjected to spherical correction Correction of In which
Figure BDA0003934888710000125
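A minimal sketch of the point mapping in Python, using the stereographic-style scale factor tk_i = r_i / (r_i - gz_i) derived above:

def project_point(gx, gy, gz, r):
    # projection from the light source (0, 0, r) onto the plane z = 0
    tk = r / (r - gz)            # projection scale factor tk_i
    return tk * gx, tk * gy      # (cx_i, cy_i) of the point on cpic_i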
Step nine, feature matching;
in the invention, in order to eliminate the transmission transformation error, a homography matrix is obtained through a characteristic matching relation, a spherical correction model under geometric transformation parameters is constructed according to the geometric transformation parameters between the homography matrix and the image, and a correction ball is calculated.
Step 91, extracting feature sets of two adjacent image frames after spherical transformation;
for the spherical correction image cpic obtained in the step eight i And the previous frame spherical correction image cpic i-1 Extracting the characteristics (by adopting the method of the step four), and obtaining the target substance which belongs to cpic i Feature set of
Figure BDA0003934888710000126
And belong to cpic i-1 Feature set of
Figure BDA0003934888710000127
In the invention, the image frame set CPIC is corrected to the sphere according to the SIFT algorithm Correction of The obtained correction frame image-gray-feature set is recorded as
Figure BDA0003934888710000128
And is
Figure BDA0003934888710000129
Step 92, threshold-defined nearest neighbor distance ratio matching strategy;
to pair
Figure BDA00039348887100001210
And
Figure BDA00039348887100001211
and (5) performing feature matching (by adopting the method of the step five), if the features are matched, finishing the matching set after the features are matched, and recording the matching set as two adjacent correction image frames for feature matching
Figure BDA00039348887100001212
If the features are not matched, ending the feature set of the previous frame image
Figure BDA00039348887100001213
And feature set of current correction frame image
Figure BDA00039348887100001214
Is performed.
In the same way, can obtain
Figure BDA0003934888710000131
Performing feature set matching on two adjacent image frames to obtain two adjacent spherical correction image frames-feature matching sets
Figure BDA0003934888710000132
Step 93, a random sample consistency algorithm;
to pair
Figure BDA0003934888710000133
Optimizing a random sample consistency algorithm (adopting the method of the step six) and generating a homography model to obtainEfficient correction matching
Figure BDA0003934888710000134
And homography model
Figure BDA0003934888710000135
In the same way, pair
Figure BDA0003934888710000136
Repeating the step six to obtain an effective correction matching set
Figure BDA0003934888710000137
Wherein
Figure BDA0003934888710000138
And a homography matrix set CMDD in which
Figure BDA0003934888710000139
The model
Figure BDA00039348887100001310
A three row three column matrix.
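Reusing the sketches from steps four to six, the re-matching of two corrected frames can be outlined as:

def rematch(cpic_prev, cpic_curr):
    # steps four to six re-run on the spherically corrected frames
    kp1, d1 = extract_features(cpic_prev)
    kp2, d2 = extract_features(cpic_curr)
    matches = match_features(d1, d2)
    CH, effective = ransac_homography(kp1, kp2, matches)
    return CH, effective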
Step ten, homography transformation and weighted average processing are carried out;
the above-mentioned
Figure BDA00039348887100001311
As cpic i The homography matrix of the transmission transformation of (1), the cpic i Transmission transformation to base image RES and the pair cpic is completed i And (4) splicing.
In a similar way, the
Figure BDA00039348887100001312
As pic i+1 The homography matrix of the transmission transformation of (c), pic i+1 Transmission is transformed to the base image RES to complete pic alignment i+1 Splicing.
In a similar way, the
Figure BDA00039348887100001313
As pic η Homography matrix of transmission transformation of (c), and (c) η Transmission is transformed to the base image RES to complete pic alignment η And (4) splicing.
In the present invention, for CPIC Correction of All corrected images in (1) are homography transformed using CMDD and the final base image RES will contain CPIC Correction of All of the elements in (a). And then, fusing the spliced images RES by adopting a weighted average algorithm to complete the splicing of the HTTP data streams.
In the invention, the weighted average algorithm refers to a new unmanned aerial vehicle aerial image fast splicing algorithm which is disclosed on computer simulation at the No. 5 of volume 39 of No. 5 of No. 2022.
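A minimal sketch of the final warp and fusion in Python with opencv; the equal 0.5/0.5 weights stand in for the weighted-average scheme of the cited paper, whose exact weights are not given here:

import cv2
import numpy as np

def blend_into(res, cpic, CH):
    h, w = res.shape[:2]
    warped = cv2.warpPerspective(cpic, CH, (w, h))  # perspective transform onto RES
    out = res.copy()
    overlap = (res > 0) & (warped > 0)              # fuse the overlap by weighted average
    out[overlap] = (0.5 * res[overlap] + 0.5 * warped[overlap]).astype(res.dtype)
    new_only = (res == 0) & (warped > 0)
    out[new_only] = warped[new_only]                # copy newly covered pixels directly
    return out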
Example 1
To illustrate the effect of the method, a DJI Mavic 2 Pro was flown in the Qinhuai district of Nanjing, Jiangsu Province, China (longitude 118.813482° E, latitude 32.029366° N), from west to east at an altitude of 45 meters (24.7 meters above ground level) at a speed of 10 km/h, shooting the ground orthographically, and a segment of video stream was captured in a steady state. The stitching results for this video stream are shown in figs. 7A and 7B. Without the spherical correction algorithm of the invention (fig. 7A), the homography transformations produced during stitching visibly deform the images, and the accumulation effect at the end of the mosaic is more obvious. With the spherical correction algorithm, the stitching result is shown in fig. 7B, and the reprojection errors are shown in figs. 9A and 9B. It can be seen that stitching precision improves after spherical correction: the low-distortion stitching field is expanded nearly three times, and the homography error after two hundred consecutive stitches is still lower than the error at the 70th stitch without the correction algorithm, while the time consumed remains low despite the marked gain in precision, so the method suits real-time scenarios. When the homography error is large, the accumulation effect of the homography transformation is magnified, as shown in fig. 8A, whereas after spherical correction the homography transformation error is greatly mitigated, as shown in fig. 8B. The SIFT algorithm is more accurate than the alternatives, but without the optimization of the invention its matching accuracy drops markedly as the number of stitches grows; after the optimization, subsequent matching accuracy improves significantly, especially in the later stage of stitching, which strongly relieves the homography error-accumulation problem.
The invention provides a real-time UAV image stitching method based on spherical correction, aiming at the technical problem of improving both the real-time stitching response speed and the image precision of UAV video-stream images under 5G communication; the time consumed remains low even at high stitching precision.

Claims (4)

1. A real-time UAV image stitching method based on spherical correction, wherein the UAV pushes an RTMP data stream to a communication base station and a cloud server in real time using the RTMP protocol, and the UAV ground control station receives the HTTP data stream forwarded by the cloud server, characterized in that: an image processor in the UAV ground control station stitches images one by one, from the first frame to the last, in the order of the image frames in the HTTP data stream, producing a panoramic UAV image, specifically comprising the following steps:
step one, selecting the first frame image as the reference image;
the image processor first reads the first frame image in the HTTP data stream PIC = {pic_1, pic_2, …, pic_{i-1}, pic_i, pic_{i+1}, …, pic_η}, i.e., the 1st image frame pic_1, and takes pic_1 as the reference image; then step two is executed;
taking the 1st image frame pic_1 as the reference image determines the start position of the image stitching;
the HTTP data stream without the first frame image is denoted as the image set to be registered PIC_pending, with PIC_pending = {pic_2, …, pic_{i-1}, pic_i, pic_{i+1}, …, pic_η};
step two, taking the current image frame after the first frame image as the image to be registered;
the current image frame pic_i is read from the image set to be registered PIC_pending = {pic_2, …, pic_{i-1}, pic_i, pic_{i+1}, …, pic_η} and taken as the image to be registered; then step three is executed;
step three, blur filtering;
step 31, convolution;
the pixels of the current image frame pic_i are convolved with the Laplacian operator to obtain the pixel-Laplacian convolution sum of pic_i, denoted S_i;
step 32, negative-feedback control by the blur filtering decision;
the blur filtering calculation is performed for pic_i, i.e., ΔS_i = |S_i - S_{i-1}| / S_{i-1}, where S_{i-1} is the pixel-Laplacian convolution sum obtained by convolving the pixels of the previous image frame pic_{i-1} with the Laplacian operator; the blur filtering condition FS then decides whether filtering is required:
if ΔS_i ≤ FS, the current image frame pic_i is retained and step four is executed;
if ΔS_i > FS, the current image frame pic_i is discarded, the image frame following pic_i, i.e., pic_{i+1}, is selected, and step two is executed;
step 33, judging whether the image frame is the last image frame;
steps 31-32 are repeated until the last frame image in PIC_pending, i.e., pic_η, is reached;
the pixels of the last frame image pic_η are convolved with the Laplacian operator to obtain the pixel-Laplacian convolution sum of pic_η, denoted S_η; the blur filtering calculation ΔS_η = |S_η - S_{η-1}| / S_{η-1} is performed for pic_η, where S_{η-1} is the pixel-Laplacian convolution sum of the (η-1)-th frame image pic_{η-1}; the blur filtering condition FS then decides whether filtering is required:
if ΔS_η ≤ FS, the last frame image pic_η is retained and step four is executed;
if ΔS_η > FS, the last frame image pic_η is discarded and the image stitching is finished;
the coarse mosaic image set obtained after blur filtering of PIC_pending = {pic_2, …, pic_{i-1}, pic_i, pic_{i+1}, …, pic_η} is denoted PIC_coarse, with PIC_coarse = {fpic_2, …, fpic_{i-1}, fpic_i, fpic_{i+1}, …, fpic_η}, where fpic_2, fpic_{i-1}, fpic_i, fpic_{i+1}, and fpic_η denote the image frames pic_2, pic_{i-1}, pic_i, pic_{i+1}, and pic_η after blur filtering;
step four, feature extraction based on the SIFT algorithm;
using the opencv library of Python, the image frame fpic_i is converted to a gray image, denoted gpic_i; using the SIFT feature-creation function of the opencv library, the set of feature points belonging to gpic_i is created and denoted F_i, called the feature set of the current frame image for short;
similarly, the gray image of fpic_2 is denoted gpic_2 and its feature point set F_2 is called the feature set of the 2nd frame image; the gray image of fpic_{i-1} is denoted gpic_{i-1} and its feature point set F_{i-1} is called the feature set of the previous frame image; the gray image of fpic_{i+1} is denoted gpic_{i+1} and its feature point set F_{i+1} is called the feature set of the next frame image; the gray image of fpic_η is denoted gpic_η and its feature point set F_η is called the feature set of the last frame image;
feature extraction is performed on each image frame of PIC_coarse according to the SIFT algorithm, and the resulting frame image-gray-feature set is denoted F, with F = {F_2, …, F_{i-1}, F_i, F_{i+1}, …, F_η}; then step five is executed;
step five, a threshold-defined nearest neighbor distance ratio matching strategy;
step 51, calculating the Euclidean distances between the feature sets of two adjacent image frames;
the feature set F_{i-1} of the previous frame image is matched against the feature set F_i of the current frame image: all feature points of F_{i-1} and F_i are traversed, and during the traversal the nearest Euclidean distance d1_i and the next-nearest Euclidean distance d2_i between feature points are calculated;
step 52, calculating the nearest neighbor distance ratio;
the ratio of the nearest Euclidean distance d1_i to the next-nearest Euclidean distance d2_i is calculated and denoted ratio_i, with ratio_i = d1_i / d2_i;
step 53, judging image frame feature matching;
when ratio_i is less than the ratio threshold TT, the features are considered matched; the match set obtained after feature matching of the two adjacent image frames is denoted M_i;
when ratio_i is greater than or equal to the ratio threshold TT, the feature matching between the feature set F_{i-1} of the previous frame image and the feature set F_i of the current frame image is ended;
in the same way, feature-set matching is performed for every pair of adjacent image frames in F, yielding the adjacent-frame feature match sets, denoted collectively as M;
Step six, a random sample consistency algorithm;
matching sets in the step five according to a random sample consistency algorithm
Figure FDA0003934888700000042
Screening, eliminating bad matches and obtaining effective matching set
Figure FDA0003934888700000043
And homography matrix
Figure FDA0003934888700000044
The model
Figure FDA0003934888700000045
A three-row three-column matrix is formed;
step seven, calculating the radius of a correction sphere;
step 71, calculating geometric transformation parameters between images;
homography transformation matrix
Figure FDA0003934888700000046
By translation H of an image sensor (e.g. camera) on the drone Translation Zoom H Zooming Rotation H x rotation ,H y rotation ,H z rotation Miscut H x type miscut ,H y type miscut Is obtained, thus can
Figure FDA0003934888700000047
Expressed as the product of translation-rotation-miscut, i.e. H Translation ·H x rotation ·H y rotation ·H z rotation ·H Zoom ·H x miscut ·H y miscut
where H_translation(x, y) is the translation matrix of the splice position, H_scaling(w, v) is the scaling matrix of the splice position, H_x-rotation(α), H_y-rotation(β) and H_z-rotation(γ) are the rotation matrices of the splice position about the x-, y- and z-axes, and H_x-shear(φ) and H_y-shear(ψ) are the shear matrices of the splice position about the x- and y-axes;
x is the pixel value by which the image is translated in the x-axis direction;
y is the pixel value by which the image is translated in the y-axis direction;
w is the scaling value of the image in the x-axis direction;
v is the scaling value of the image in the y-axis direction;
α, β and γ are the rotation angles of the image about the x-, y- and z-axes, respectively;
φ and ψ are the shear angles of the image in the x- and y-axis directions, respectively;
according to the Newton method, the equation H_i = H_translation · H_x-rotation · H_y-rotation · H_z-rotation · H_scaling · H_x-shear · H_y-shear is solved iteratively to obtain the geometric transformation parameters (x_i, y_i, w_i, v_i, α_i, β_i, γ_i, φ_i, ψ_i) by which image gpic_i is transformed to image gpic_{i-1}; the 9 entries of the three-row, three-column matrix H_i and the 9 geometric parameters of step 71 thus form a system of equations, which is solved iteratively by the Newton method to obtain the values of x_i, y_i, w_i, v_i, α_i, β_i, γ_i, φ_i and ψ_i;
x_i is the pixel value by which image gpic_i is translated in the x-axis direction when transformed to image gpic_{i-1};
y_i is the pixel value by which image gpic_i is translated in the y-axis direction when transformed to image gpic_{i-1};
w_i is the scaling value of image gpic_i in the x-axis direction when transformed to image gpic_{i-1};
v_i is the scaling value of image gpic_i in the y-axis direction when transformed to image gpic_{i-1};
α_i, β_i and γ_i are the rotation angles of image gpic_i about the x-, y- and z-axes, respectively, when transformed to image gpic_{i-1};
φ_i and ψ_i are the shear angles of image gpic_i in the x- and y-axis directions, respectively, when transformed to image gpic_{i-1};
in the same way, the geometric transformation parameters between images are calculated for every homography matrix in MDD, obtaining the parameter set HMDD;
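The elementary matrices of step 71 are published only as images, so the sketch below assumes their standard textbook forms, and uses a Newton-type least-squares iteration from SciPy in place of the claim's unspecified Newton implementation:

```python
# Sketch of step 71 under assumed standard forms of the elementary matrices.
import numpy as np
from scipy.optimize import least_squares

def compose(p):
    """H_translation . H_x-rot . H_y-rot . H_z-rot . H_scaling . H_x-shear . H_y-shear."""
    x, y, w, v, a, b, g, phi, psi = p
    T   = np.array([[1, 0, x], [0, 1, y], [0, 0, 1.0]])
    Rx  = np.array([[1, 0, 0], [0, np.cos(a), -np.sin(a)], [0, np.sin(a), np.cos(a)]])
    Ry  = np.array([[np.cos(b), 0, np.sin(b)], [0, 1, 0], [-np.sin(b), 0, np.cos(b)]])
    Rz  = np.array([[np.cos(g), -np.sin(g), 0], [np.sin(g), np.cos(g), 0], [0, 0, 1.0]])
    S   = np.array([[w, 0, 0], [0, v, 0], [0, 0, 1.0]])
    Shx = np.array([[1, np.tan(phi), 0], [0, 1, 0], [0, 0, 1.0]])
    Shy = np.array([[1, 0, 0], [np.tan(psi), 1, 0], [0, 0, 1.0]])
    return T @ Rx @ Ry @ Rz @ S @ Shx @ Shy

def decompose(H):
    """Recover (x, y, w, v, alpha, beta, gamma, phi, psi) from a 3x3 homography."""
    Hn = H / H[2, 2]  # homographies are scale-equivalent; compare normalized entries
    def residual(p):
        M = compose(p)
        return (M / M[2, 2] - Hn).ravel()  # 9 equations in the 9 unknowns
    p0 = np.zeros(9); p0[2] = p0[3] = 1.0  # start near the identity transform
    return least_squares(residual, p0).x
```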
step 72, calculating the corrected sphere radius;
let the included angle between gpic_i and gpic_{i-1} be α_i, and let the relative displacement of gpic_i and gpic_{i-1} in the y direction be X_i pixels; the sphere radius r_i of cpic_i can then be solved from the transcendental equation relating X_i, α_i and r_i, wherein X_i and α_i are determined from the geometric transformation parameters solved in step 71; the transcendental equation is solved by an iterative method to calculate the corrected sphere radius r_i;
in the same way, the corrected sphere radii RR are calculated using HMDD;
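The claim's transcendental equation for r_i is likewise published only as an image, so the sketch below shows only the iterative solution step: a generic bisection solver into which the claim's own relation between X_i, α_i and r_i would be plugged as f:

```python
# Generic iterative solver for a transcendental equation f(r) = 0 in the
# corrected sphere radius r; `f` stands in for the claim's own relation.
from typing import Callable

def solve_radius(f: Callable[[float], float], lo: float, hi: float,
                 tol: float = 1e-6, max_iter: int = 200) -> float:
    """Bisection: assumes f(lo) and f(hi) bracket a sign change."""
    flo = f(lo)
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        fmid = f(mid)
        if abs(fmid) < tol:
            return mid
        if (flo < 0) == (fmid < 0):
            lo, flo = mid, fmid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```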
step eight, spherical projection;
a spherical projection with radius r_i is performed on gpic_i, and gpic_i is projectively transformed to cpic_i; image gpic_i is located on a sphere of radius r_i, so the pixel coordinate value of any point P_2 of gpic_i is (gx_i, gy_i, gz_i); a light source point P is set at a given position, and the point P_2 is projected through P onto the plane z = 0 to obtain the projection point P_3, i.e. the pixel coordinate value of any point of cpic_i, expressed as (cx_i, cy_i, 0); let the projection scale factor be tk_i; then cx_i = tk_i · gx_i and cy_i = tk_i · gy_i;
finally, the cpic_i obtained by spherical projection of gpic_i has pixel coordinate values (tk_i · gx_i, tk_i · gy_i) at any point, where the projection scale factor tk_i is determined by the positions of the light source point P and the point P_2;
in the same way, spherical projection is calculated for PIC_coarse using RR, obtaining the spherically corrected image set CPIC_corrected;
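A sketch of the projection arithmetic: the claim fixes the light source P in a figure that is not recoverable here, so this assumes P = (0, 0, p) on the z-axis, which reproduces the claimed result that a sphere point (gx_i, gy_i, gz_i) maps to (tk_i·gx_i, tk_i·gy_i) with tk_i = p / (p − gz_i):

```python
# Sketch of step eight under the assumption P = (0, 0, p) on the z-axis.
import numpy as np

def spherical_project(points_xyz: np.ndarray, p: float) -> np.ndarray:
    """points_xyz: (N, 3) pixel coordinates on the sphere of radius r_i."""
    gz = points_xyz[:, 2]
    tk = p / (p - gz)                       # projection scale factor per point
    return points_xyz[:, :2] * tk[:, None]  # (N, 2) coordinates of cpic_i
```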
step nine, feature matching;
step 91, extracting the feature sets of two adjacent spherically transformed image frames;
according to the SIFT algorithm, feature extraction is performed on the corrected spherical image frame set CPIC_corrected, and the obtained corrected frame image-gray-feature sets are recorded as CD^cpic, with CD^cpic = {CD_1^cpic, CD_2^cpic, ..., CD_n^cpic};
step 92, nearest neighbor distance ratio matching strategy defined by a threshold;
feature matching is performed on CD_{i-1}^cpic and CD_i^cpic; if the features match, the matching set obtained after the feature matching is completed is recorded as the feature matching set CM_i of the two adjacent corrected image frames; if the features do not match, the feature matching between the feature set CD_{i-1}^cpic of the previous corrected frame image and the feature set CD_i^cpic of the current corrected frame image is ended;
in the same way, feature set matching can be performed on every two adjacent image frames of CD^cpic, obtaining the feature matching sets CM_i of the two adjacent spherically corrected image frames;
step 93, random sample consensus algorithm;
a random sample consensus algorithm transformation is performed on CM_i and a homography model is generated, obtaining the effective corrected matching set and the homography model CH_i; in the same way, random sample consensus optimization is performed on all the matching sets CM_i, obtaining the effective corrected matching sets and the set of homography matrices CMDD, where each model CH_i in CMDD is a three-row, three-column matrix;
step ten, homography transformation and weighted average processing;
the CH_i is used as the homography matrix of the perspective transformation of cpic_i, cpic_i is perspective-transformed onto the base image RES, and the stitching of cpic_i is completed;
all corrected images in CPIC_corrected are homography-transformed using CMDD, and the final base image RES contains all elements of CPIC_corrected; the stitched image RES is then fused by a weighted average algorithm, completing the stitching of the HTTP data stream.
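A minimal sketch of this warp-and-fuse step, assuming OpenCV; the fixed canvas size and the equal-weight average over overlapping pixels are illustrative choices, not the patent's exact fusion weights:

```python
# Minimal sketch: perspective-warp a corrected frame into the base mosaic with
# its homography, then fuse the overlap region by a weighted average.
import cv2
import numpy as np

def stitch(base: np.ndarray, frame: np.ndarray, H: np.ndarray) -> np.ndarray:
    h, w = base.shape[:2]
    warped = cv2.warpPerspective(frame, H, (w, h))
    overlap = (warped.sum(axis=2) > 0) & (base.sum(axis=2) > 0)
    return np.where(overlap[..., None],
                    (0.5 * base + 0.5 * warped).astype(base.dtype),  # blend overlap
                    np.maximum(base, warped))                        # keep lone content
```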
2. The unmanned aerial vehicle real-time image stitching method based on spherical correction according to claim 1, characterized in that: the starting position may be the upper left corner of the panorama, or may be any position point of the panorama.
3. The unmanned aerial vehicle real-time image stitching method based on spherical correction according to claim 1, characterized in that: the ratio threshold is recorded as TT_threshold, and TT_threshold = 0.4; when the ratio threshold is set to 0.4, the feature matching condition can be judged more accurately.
4. The unmanned aerial vehicle real-time image stitching method based on spherical correction according to claim 1, characterized in that: software in the drone ground control station uses Python to pull the HTTP data stream.
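A minimal sketch of the stream pull on the ground-control-station side, assuming OpenCV's FFmpeg-backed VideoCapture; the URL is a placeholder, not from the patent:

```python
# Minimal sketch: pull an HTTP video stream frame by frame with OpenCV.
import cv2

cap = cv2.VideoCapture("http://example.com/uav/stream")  # hypothetical endpoint
frames = []
while cap.isOpened():
    ok, frame = cap.read()
    if not ok:
        break
    frames.append(frame)  # each frame feeds the stitching pipeline of claim 1
cap.release()
```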
CN202211400858.8A 2022-11-09 2022-11-09 Unmanned aerial vehicle real-time image splicing method based on spherical correction Pending CN115631094A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211400858.8A CN115631094A (en) 2022-11-09 2022-11-09 Unmanned aerial vehicle real-time image splicing method based on spherical correction

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211400858.8A CN115631094A (en) 2022-11-09 2022-11-09 Unmanned aerial vehicle real-time image splicing method based on spherical correction

Publications (1)

Publication Number Publication Date
CN115631094A true CN115631094A (en) 2023-01-20

Family

ID=84909131

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211400858.8A Pending CN115631094A (en) 2022-11-09 2022-11-09 Unmanned aerial vehicle real-time image splicing method based on spherical correction

Country Status (1)

Country Link
CN (1) CN115631094A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116188275A (en) * 2023-04-28 2023-05-30 杭州未名信科科技有限公司 Single-tower crane panoramic image stitching method and system
CN116188275B (en) * 2023-04-28 2023-10-20 杭州未名信科科技有限公司 Single-tower crane panoramic image stitching method and system
CN117670667A (en) * 2023-11-08 2024-03-08 广州成至智能机器科技有限公司 Unmanned aerial vehicle real-time infrared image panorama stitching method
CN117670667B (en) * 2023-11-08 2024-05-28 广州成至智能机器科技有限公司 Unmanned aerial vehicle real-time infrared image panorama stitching method

Similar Documents

Publication Publication Date Title
CN110211043B (en) Registration method based on grid optimization for panoramic image stitching
CN115631094A (en) Unmanned aerial vehicle real-time image splicing method based on spherical correction
CN108564617B (en) Three-dimensional reconstruction method and device for multi-view camera, VR camera and panoramic camera
US8280194B2 (en) Reduced hardware implementation for a two-picture depth map algorithm
WO2021098081A1 (en) Trajectory feature alignment-based multispectral stereo camera self-calibration algorithm
CN111784585B (en) Image splicing method and device, electronic equipment and computer readable storage medium
CN107767339B (en) Binocular stereo image splicing method
CN109801215A (en) The infrared super-resolution imaging method of network is generated based on confrontation
CN110956661A (en) Method for calculating dynamic pose of visible light and infrared camera based on bidirectional homography matrix
CN113160053B (en) Pose information-based underwater video image restoration and splicing method
CN111445537B (en) Calibration method and system of camera
CN111553841B (en) Real-time video splicing method based on optimal suture line updating
CN109472752B (en) Multi-exposure fusion system based on aerial images
CN113221665A (en) Video fusion algorithm based on dynamic optimal suture line and improved gradual-in and gradual-out method
CN111798373A (en) Rapid unmanned aerial vehicle image stitching method based on local plane hypothesis and six-degree-of-freedom pose optimization
CN111899164A (en) Image splicing method for multi-focal-zone scene
CN112862683A (en) Adjacent image splicing method based on elastic registration and grid optimization
CN110223233B (en) Unmanned aerial vehicle aerial photography image building method based on image splicing
CN114998773A (en) Characteristic mismatching elimination method and system suitable for aerial image of unmanned aerial vehicle system
CN108109118B (en) Aerial image geometric correction method without control points
CN114331835A (en) Panoramic image splicing method and device based on optimal mapping matrix
Bergmann et al. Gravity alignment for single panorama depth inference
CN116012517B (en) Regularized image rendering method and regularized image rendering device
CN115456870A (en) Multi-image splicing method based on external parameter estimation
CN115660995A (en) Camera orthodontic method and system using linear patterns

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination