CN115278054B - Method for recovering high frame rate global shutter video from rolling shutter image - Google Patents

Method for recovering high frame rate global shutter video from rolling shutter image

Info

Publication number
CN115278054B
CN115278054B (application number CN202210726715.XA)
Authority
CN
China
Prior art keywords
rolling shutter
flow
image
shutter
scan line
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210726715.XA
Other languages
Chinese (zh)
Other versions
CN115278054A (en)
Inventor
戴玉超 (Dai Yuchao)
樊斌 (Fan Bin)
张志远 (Zhang Zhiyuan)
郭相 (Guo Xiang)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Northwestern Polytechnical University
Original Assignee
Northwestern Polytechnical University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Northwestern Polytechnical University filed Critical Northwestern Polytechnical University
Priority to CN202210726715.XA priority Critical patent/CN115278054B/en
Publication of CN115278054A publication Critical patent/CN115278054A/en
Application granted granted Critical
Publication of CN115278054B publication Critical patent/CN115278054B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/20 Analysis of motion
    • G06T7/246 Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/20 Analysis of motion
    • G06T7/269 Analysis of motion using gradient-based methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/10 Image acquisition modality
    • G06T2207/10016 Video; Image sequence
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20081 Training; Learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20084 Artificial neural networks [ANN]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Closed-Circuit Television Systems (AREA)

Abstract

The invention relates to a method for recovering high-frame-rate global shutter video from rolling shutter images. To overcome the shortcomings of the prior art, an intrinsic geometric model for rolling shutter image correction is proposed: first, a bidirectional rolling shutter de-distortion flow for removing rolling shutter distortion is modeled under a uniform-motion assumption; next, a geometric relation between the optical flow between successive frames and the de-distortion flow of any scan line is established through a simple scaling operation; finally, a mechanism for converting between the de-distortion flows of different scan lines is established. Working from two consecutive rolling shutter frames captured by a CMOS camera, the method organically fuses this geometric model into a deep learning network so as to invert the rolling shutter imaging mechanism, simultaneously removing rolling shutter artifacts and achieving temporal super-resolution. It is well suited to practical applications that rely on CMOS cameras, such as mobile-phone photography, unmanned aerial vehicle imaging, and computational photography.

Description

Method for recovering high frame rate global shutter video from rolling shutter image
Technical Field
The invention belongs to the technical field of image and video synthesis, relates to a method for recovering high-frame-rate global shutter video from rolling shutter images, and in particular to an efficient high-frame-rate global shutter video recovery method.
Background
Image sensors can generally be divided into two broad types according to their structure: CCD (Charge-Coupled Device) image sensors and CMOS (Complementary Metal-Oxide-Semiconductor) image sensors. Since the 1990s, CMOS image sensing technology has attracted attention and substantial research and development resources. In recent years, CMOS image sensors have taken an absolutely dominant position in the market and have largely replaced CCD image sensors, and camera devices using CMOS image sensors have become increasingly widespread in fields such as autonomous driving, unmanned aerial vehicles, virtual reality, augmented reality, and computational photography. The main advantages of CMOS cameras can be summarized in three respects: 1) in terms of cost, CMOS image sensor chips generally use a standard process flow suitable for mass production, and their unit cost in volume production is far lower than that of CCDs; 2) in terms of size, a CMOS sensor can integrate the image acquisition unit and the signal processing unit on the same substrate, greatly reducing its volume and making it very suitable for mobile devices and various miniaturized equipment; 3) in terms of power consumption, CMOS sensors also retain the advantages of low power consumption and low heat generation compared with CCDs.
Most CMOS cameras use a rolling shutter mechanism, whereas CCD cameras use a global shutter. A global shutter camera exposes all pixels of the sensor at the same instant, collecting light in one shot to obtain a frame of global shutter image. A rolling shutter CMOS camera instead captures an image by line-by-line exposure: the sensor scans the scene row by row, so pixels in different rows have different exposure times, a fixed delay exists between two adjacent rows, and the exposure of all rows is completed within a very short time to obtain a rolling shutter image. In practice, most consumer cameras operate in rolling shutter mode, and because of this progressive-exposure imaging mechanism, when the camera moves while shooting a scene, or when objects in the scene move, the resulting rolling shutter image exhibits rolling shutter effects such as tilt, shake, and distortion (the so-called "jelly" effect). This not only seriously degrades imaging quality in photography, but also invalidates most three-dimensional vision algorithms designed for the global shutter camera model. In recent years, many studies have pointed out that eliminating the rolling shutter effect is extremely important in practical applications.
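The row-wise timing described above can be made concrete with a small sketch. The helper below (the function name and the 16 ms readout window are illustrative assumptions, not values from the patent) assigns each scan line its own exposure start time, whereas a global shutter sensor would give every row the same timestamp.

import numpy as np

def row_exposure_start_times(num_rows, frame_start, inter_row_delay):
    # Rolling shutter: row r starts exposing inter_row_delay seconds after row r - 1.
    # A global shutter sensor would instead assign every row the same timestamp frame_start.
    return frame_start + np.arange(num_rows) * inter_row_delay

# Illustrative numbers only: a 480-line frame whose readout spans about 16 ms.
t = row_exposure_start_times(num_rows=480, frame_start=0.0, inter_row_delay=16e-3 / 480)
print(t[0], t[239], t[479])  # top, middle and bottom rows are captured at different instants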
Over more than a decade of development, researchers have achieved many very effective results in eliminating the rolling shutter effect. In particular, deep neural networks have recently been applied to the rolling shutter image correction task and have achieved better de-distortion results than traditional model-based methods. In fact, when a person observes two consecutive rolling shutter images, they can automatically infer the sequence of virtual global shutter images hidden at any instant, i.e., a high-frame-rate global shutter video. However, existing deep learning methods can only restore the global shutter image corresponding to one specific instant from the input rolling shutter images, and lack this strong human-like inference capability. Therefore, inverting the rolling shutter imaging mechanism to recover high-frame-rate, high-quality global shutter video is of great significance for practical applications such as scene understanding, computational photography, video entertainment, and video compression and editing.
Disclosure of Invention
Technical problem to be solved
In order to overcome the deficiencies of the prior art, the present invention proposes a method for recovering a high-frame-rate global shutter video from two consecutive rolling shutter images.
Technical solution
A method of recovering high frame rate global shutter video from a rolling shutter image, characterized by the steps of:
step 1: input two consecutive rolling shutter images with a resolution of h × w × 3 into the optical flow estimation network PWC-Net, and output the dense optical flow f between the two rolling shutter images, where the resolution of f is h × w × 2;
step 2: input the two rolling shutter images together with the dense optical flow f between them into an encoder-decoder UNet network, and output the correlation factor map c between the dense optical flow f and the dense de-distortion flow u_m of the middle scan line, where the resolution of u_m is h × w × 2 and the resolution of c is h × w × 1;
step 3: using the correlation factor value c(x) at pixel x of the k-th scan line of the rolling shutter image, explicitly compute the de-distortion flow u_m(x) of the middle scan line at pixel x:
u_m(x) = c(x) × f(x)
where c(x) denotes the correlation factor value at pixel x in c, and f(x) denotes the optical flow value at pixel x in f;
step 4: using the scan-line conversion relation between de-distortion flows, explicitly propagate the de-distortion flow u_m(x) of the middle scan line to the de-distortion flow u_s(x) of any scan line s ∈ [1, h];
step 5: repeat step 3 and step 4 for all h × w pixels x in the image to obtain the dense de-distortion flow u_s of any scan line s ∈ [1, h], where the resolution of u_s is h × w × 2;
step 6: according to the dense de-distortion flow u_s of scan line s ∈ [1, h], transform the first rolling shutter image using a forward warping technique to recover the global shutter image corresponding to scan line s;
step 7: following step 6, sequentially recover the global shutter image sequence corresponding to the consecutive scan lines s = 1, 2, …, h, and finally output the high-frame-rate global shutter video.
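The core of steps 3 to 5 is a per-pixel scaling of the inter-frame optical flow. A minimal PyTorch-style sketch is given below, assuming that f and c have already been produced by the networks of steps 1 and 2; the row-dependent propagation factor (s − k)/(s_m − k) is only an assumed instantiation of the scan-line conversion relation under the constant-velocity model, not the patent's exact formula.

import torch

def undistortion_flow_for_scanline(f, c, s):
    # f : (2, h, w) dense optical flow between the two rolling shutter frames (step 1)
    # c : (1, h, w) correlation factor map predicted by the UNet (step 2)
    # s : target scan line index in [1, h]
    _, h, w = f.shape
    # Step 3: de-distortion flow of the middle scan line, u_m(x) = c(x) * f(x).
    u_m = c * f
    # Step 4: propagate u_m to scan line s with a row-dependent scale.
    # ASSUMPTION: the displacement grows linearly with the time offset between a pixel's
    # own scan line k and the target scan line, giving scale = (s - k) / (s_m - k) with
    # s_m the middle scan line; the patent's exact equation may differ.
    k = torch.arange(1, h + 1, dtype=f.dtype, device=f.device).view(1, h, 1)
    s_m = (h + 1) / 2.0
    scale = (s - k) / (s_m - k + 1e-6)
    # Pixels on the middle scan line already have u_m close to zero, so the large scale near
    # k = s_m is mostly harmless in this sketch; a robust implementation would treat that row explicitly.
    # Step 5: broadcasting applies the scaling to all h * w pixels at once.
    return scale * u_m

Calling this function for s = 1, 2, …, h yields the h dense de-distortion flows that step 6 then turns into global shutter frames by forward warping.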
Advantageous effects
The invention provides a method for recovering high-frame-rate global shutter video from rolling shutter images that overcomes the shortcomings of the prior art by establishing an intrinsic geometric model of the rolling shutter image correction problem: a rolling shutter de-distortion flow for removing rolling shutter distortion is modeled under a uniform-motion assumption; a geometric relation between the optical flow between successive frames and the de-distortion flow of any scan line is then established through a simple scaling operation; and a mechanism for converting between the de-distortion flows of different scan lines is further established. Starting from two consecutive rolling shutter frames captured by a CMOS camera, the method organically fuses this geometric model into a deep learning network, thereby recovering high-frame-rate global shutter video from rolling shutter images.
Drawings
Fig. 1 is a flow chart of a method of the present invention for recovering high frame rate global shutter video from rolling shutter images.
Fig. 2 is a schematic diagram of a rolling shutter camera and global shutter camera exposure mechanism.
Fig. 3 is the encoder-decoder (UNet) network architecture used to estimate the correlation factor.
Fig. 4 shows two consecutive raw rolling shutter frames acquired by a CMOS camera.
Fig. 5 is the global shutter video image sequence extracted from the rolling shutter images of fig. 4 using the method of the present invention.
Detailed Description
The invention will now be further described with reference to the examples and figures.
The technical solution of the invention is a method of recovering high-frame-rate global shutter video from rolling shutter images. For rolling shutter images captured by a CMOS camera, it achieves the recovery of high-frame-rate global shutter video from two consecutive rolling shutter images by establishing an intrinsic geometric model of the rolling shutter image correction problem.
The invention will be further described with reference to the drawings and experimental test results.
Take the pixel x = (x, y)^T (with corresponding depth Z) as an example. Assuming that the camera undergoes constant motion (v, ω) during imaging, the optical flow f = [f_u, f_v]^T at pixel x can be expressed as:
where γ is the camera readout ratio, satisfying 0 < γ ≤ 1, f is the camera focal length, and h is the number of horizontal scan lines of the image; the remaining term represents the underlying rolling shutter spatio-temporal geometric model, which depends on the camera motion, the camera parameters, the scene depth, and the scan line position. Eliminating the vertical optical flow component f_v in (1), one can further obtain:
then, in order to obtain a de-distortion flow u= [ u ] for converting pixels x of a kth scanning line of the rolling shutter image into a global shutter image corresponding to an s-th scanning line u ,u v ] T The invention builds its model as follows:
next, based on equations (3) and (4), a geometric correlation method between the de-distortion flow and the optical flow can be established by a simple scaling operation:
where c.epsilon. (-1, 1) is the correlation factor, i.e
Since the analytic solution of c involves the complex rolling shutter spatio-temporal geometric model, the present invention estimates it implicitly by designing an efficient deep neural network. Finally, based on equation (4), the mechanism of mutual conversion between the two rolling shutter de-distortion flows corresponding to scan line s_1 and scan line s_2 is as follows:
Since equation (7) involves only simple matrix operations, efficient propagation is ensured. Moreover, it can be seen from equation (7) that the de-distortion flow has a strong row-dependent character: both its magnitude and its direction are closely tied to the target scan line to be corrected.
In summary, equations (5) and (7) reveal the intrinsic geometric model of the rolling shutter image correction problem: equation (5) establishes, through a simple scaling operation, the geometric relation between the inter-frame optical flow and the de-distortion flow of any scan line, while equation (7) establishes the mechanism for converting between the de-distortion flows of different scan lines. These geometric relations provide the theoretical support for extracting high-frame-rate global shutter video from two consecutive rolling shutter frames. As shown in fig. 1, the technical solution of the invention mainly comprises the following steps:
step 1: input two consecutive rolling shutter images with a resolution of h × w × 3 into the optical flow estimation network PWC-Net, and output the dense optical flow f between the two rolling shutter images, where the resolution of f is h × w × 2;
step 2: input the two rolling shutter images together with the dense optical flow f between them into an encoder-decoder UNet network, and output the correlation factor map c between the dense optical flow f and the dense de-distortion flow u_m of the middle scan line, where the resolution of u_m is h × w × 2 and the resolution of c is h × w × 1;
step 3: using the correlation factor value c(x) at pixel x of the k-th scan line of the rolling shutter image, explicitly compute the de-distortion flow u_m(x) of the middle scan line at pixel x:
u_m(x) = c(x) × f(x)
where c(x) denotes the correlation factor value at pixel x in c, and f(x) denotes the optical flow value at pixel x in f;
step 4: using the scan-line conversion relation between de-distortion flows, explicitly propagate the de-distortion flow u_m(x) of the middle scan line to the de-distortion flow u_s(x) of any scan line s ∈ [1, h];
step 5: repeat step 3 and step 4 for all h × w pixels x in the image to obtain the dense de-distortion flow u_s of any scan line s ∈ [1, h], where the resolution of u_s is h × w × 2;
step 6: according to the dense de-distortion flow u_s of scan line s ∈ [1, h], transform the first rolling shutter image using a forward warping technique to recover the global shutter image corresponding to scan line s;
step 7: following step 6, sequentially recover the global shutter image sequence corresponding to the consecutive scan lines s = 1, 2, …, h, and finally output the high-frame-rate global shutter video.
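Step 6 relies on forward warping (splatting): each pixel of the first rolling shutter image is pushed along its de-distortion flow u_s into the target global shutter frame. A minimal sketch follows, using a simple nearest-neighbour splat for illustration; the function name is hypothetical, and the hole filling and occlusion handling that a practical implementation would need are omitted.

import torch

def forward_warp(img, flow):
    # img  : (3, h, w) first rolling shutter image
    # flow : (2, h, w) dense de-distortion flow u_s; channel 0 is horizontal, channel 1 vertical
    # Returns a (3, h, w) estimate of the global shutter image of scan line s.
    _, h, w = img.shape
    ys, xs = torch.meshgrid(torch.arange(h), torch.arange(w), indexing="ij")
    xt = (xs + flow[0]).round().long()   # target column of each source pixel
    yt = (ys + flow[1]).round().long()   # target row of each source pixel
    valid = (xt >= 0) & (xt < w) & (yt >= 0) & (yt < h)
    out = torch.zeros_like(img)          # pixels that receive no splat remain holes
    out[:, yt[valid], xt[valid]] = img[:, ys[valid], xs[valid]]
    return out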
The effectiveness of the invention can be further illustrated by the following experiments:
(1) Network model training
Experiments were carried out on an NVIDIA GeForce RTX 2080 Ti GPU using PyTorch, with network optimization performed by the Adam optimizer. A mature optical flow estimation network (PWC-Net) is adopted to estimate the optical flow between the two input frames, while a simple encoder-decoder (UNet) structure is used to estimate the correlation factors; the UNet architecture is shown in fig. 3. Only the global shutter image corresponding to the middle scan line is used for supervision during training, while at test time the global shutter image corresponding to any scan line can be propagated efficiently, i.e., rolling shutter inversion is achieved.
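A hedged sketch of one training step consistent with the description above is given below. Here pwc_net, unet and forward_warp_batch are hypothetical placeholders for the PWC-Net of step 1, the UNet of fig. 3, and a batched forward-warping routine; the L1 reconstruction loss against the middle-scan-line global shutter ground truth and the learning rate are assumed choices, not necessarily the patent's exact configuration.

import torch

def train_step(pwc_net, unet, optimizer, rs1, rs2, gs_mid):
    # rs1, rs2 : (B, 3, h, w) consecutive rolling shutter frames
    # gs_mid   : (B, 3, h, w) ground-truth global shutter image of the middle scan line
    f = pwc_net(rs1, rs2)                        # step 1: dense inter-frame optical flow, (B, 2, h, w)
    c = unet(torch.cat([rs1, rs2, f], dim=1))    # step 2: correlation factor map, (B, 1, h, w)
    u_m = c * f                                  # step 3: de-distortion flow of the middle scan line
    gs_pred = forward_warp_batch(rs1, u_m)       # step 6 applied to the middle scan line only
    loss = (gs_pred - gs_mid).abs().mean()       # assumed L1 supervision on the middle scan line
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

# Adam over both networks, as stated above (the learning rate is an assumed value):
# optimizer = torch.optim.Adam(list(pwc_net.parameters()) + list(unet.parameters()), lr=1e-4)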
(2) Simulation experiment result analysis
Fig. 4 shows two groups of input consecutive two-frame raw rolling shutter images. The rolling shutter images exhibit obvious rolling shutter distortions such as image tilt, shake, and warping. Using the global shutter video recovery method provided by the present invention, the extracted 6-frame global shutter video image sequences are shown from top to bottom in fig. 5. Tests show that, for two consecutive rolling shutter frames with an input resolution of 640 × 480, the method can recover a 960-frame global shutter video image sequence (480 scan lines per frame × 2 frames) within 1.8 seconds. These results fully verify the effectiveness and efficiency of the method and show that it can meet practical application requirements.

Claims (1)

1. A method of recovering high frame rate global shutter video from a rolling shutter image, characterized by the steps of:
step 1: input two consecutive rolling shutter images with a resolution of h × w × 3 into the optical flow estimation network PWC-Net, and output the dense optical flow f between the two rolling shutter images, where the resolution of f is h × w × 2;
step 2: input the two rolling shutter images together with the dense optical flow f between them into an encoder-decoder UNet network, and output the correlation factor map c between the dense optical flow f and the dense de-distortion flow u_m of the middle scan line, where the resolution of u_m is h × w × 2 and the resolution of c is h × w × 1;
step 3: using the correlation factor value c(x) at pixel x of the k-th scan line of the rolling shutter image, explicitly compute the de-distortion flow u_m(x) of the middle scan line at pixel x:
u_m(x) = c(x) × f(x)
where c(x) denotes the correlation factor value at pixel x in c, and f(x) denotes the optical flow value at pixel x in f;
step 4: using the scan-line conversion relation between de-distortion flows, explicitly propagate the de-distortion flow u_m(x) of the middle scan line to the de-distortion flow u_s(x) of any scan line s ∈ [1, h];
step 5: repeat step 3 and step 4 for all h × w pixels x in the image to obtain the dense de-distortion flow u_s of any scan line s ∈ [1, h], where the resolution of u_s is h × w × 2;
step 6: according to the dense de-distortion flow u_s of scan line s ∈ [1, h], transform the first rolling shutter image using a forward warping technique to recover the global shutter image corresponding to scan line s;
step 7: following step 6, sequentially recover the global shutter image sequence corresponding to the consecutive scan lines s = 1, 2, …, h, and finally output the high-frame-rate global shutter video.
CN202210726715.XA 2022-06-23 2022-06-23 Method for recovering high frame rate global shutter video from rolling shutter image Active CN115278054B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210726715.XA CN115278054B (en) 2022-06-23 2022-06-23 Method for recovering high frame rate global shutter video from rolling shutter image

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210726715.XA CN115278054B (en) 2022-06-23 2022-06-23 Method for recovering high frame rate global shutter video from rolling shutter image

Publications (2)

Publication Number Publication Date
CN115278054A (en) 2022-11-01
CN115278054B (en) 2023-08-08

Family

ID=83762747

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210726715.XA Active CN115278054B (en) 2022-06-23 2022-06-23 Method for recovering high frame rate global shutter video from rolling shutter image

Country Status (1)

Country Link
CN (1) CN115278054B (en)

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006211426A (en) * 2005-01-28 2006-08-10 Kyocera Corp Image sensing device and its image generating method
CN104660898A (en) * 2013-11-25 2015-05-27 华为终端有限公司 Global shutter processing method and device
CN112995529A (en) * 2019-12-17 2021-06-18 华为技术有限公司 Imaging method and device based on optical flow prediction
CN114245007A (en) * 2021-12-06 2022-03-25 西北工业大学 High frame rate video synthesis method, device, equipment and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Rolling-Shutter-stereo-aware motion estimation and image correction; Yuchao Dai et al.; ScienceDirect; Vol. 213; full text *

Also Published As

Publication number Publication date
CN115278054A (en) 2022-11-01

Legal Events

Code Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant