CN114926339B - Light field multi-view image super-resolution reconstruction method and system based on deep learning


Info

Publication number
CN114926339B
CN114926339B (application CN202210599012.5A)
Authority
CN
China
Prior art keywords
lens surface
resolution reconstruction
light field
resolution
deep learning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210599012.5A
Other languages
Chinese (zh)
Other versions
CN114926339A (en)
Inventor
袁潮 (Yuan Chao)
(name withheld on request)
温建伟 (Wen Jianwei)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Zhuohe Technology Co Ltd
Original Assignee
Beijing Zhuohe Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Zhuohe Technology Co Ltd filed Critical Beijing Zhuohe Technology Co Ltd
Priority to CN202210599012.5A
Publication of CN114926339A
Application granted
Publication of CN114926339B
Legal status: Active

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06T: IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00: Geometric image transformations in the plane of the image
    • G06T3/40: Scaling of whole images or parts thereof, e.g. expanding or contracting
    • G06T3/4053: Scaling based on super-resolution, i.e. the output image resolution being higher than the sensor resolution
    • G06N: COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00: Computing arrangements based on biological models
    • G06N3/02: Neural networks
    • G06N3/04: Architecture, e.g. interconnection topology
    • G06N3/045: Combinations of networks
    • G06N3/08: Learning methods
    • G06N3/084: Backpropagation, e.g. using gradient descent
    • G06T11/00: 2D [Two Dimensional] image generation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Studio Devices (AREA)

Abstract

The invention provides a light field multi-view image super-resolution reconstruction method and system based on deep learning, belonging to the technical field of light field reconstruction. The method acquires, for a target object shot by a light field camera under different view angles, a first imaging focus and first imaging data on a first lens surface, a second imaging focus and second imaging data on a second lens surface, and the current focal length data of the two lens surfaces, and obtains reconstruction results corresponding to the different view angles, the reconstruction results comprising a first resolution reconstruction result and a second resolution reconstruction result. When the first resolution reconstruction result matches the second resolution reconstruction result, super-resolution reconstruction is started; otherwise, parameter adjustment is initiated. With this scheme, super-resolution reconstruction of light field image data can be realized based on difference matching across different view angles, and the imaging parameters are adjusted adaptively when the matching conditions for reconstruction are not met, thereby ensuring the super-resolution reconstruction effect.

Description

Light field multi-view image super-resolution reconstruction method and system based on deep learning
Technical Field
The invention belongs to the technical field of light field reconstruction, and particularly relates to a light field multi-view image super-resolution reconstruction method and system based on deep learning.
Background
Because a light field camera records the direction of light rays, it can obtain refocused images at any position by geometric ray tracing. Compared with traditional photography, which requires a cumbersome focusing process, light field digital refocusing is a revolutionary change.
Although a light field camera can simultaneously record the position and angle information of light rays in space, and has notable advantages in digital refocusing, all-in-focus image acquisition and depth estimation, recording this multi-dimensional information lowers the spatial resolution of the light field image. To fully exploit the advantages of the light field camera while overcoming the limited image resolution, research on light field super-resolution reconstruction technology is of great significance.
In the prior art, light field super-resolution reconstruction is generally implemented by acquiring image data from a single view angle and fusing different resolutions. However, this approach considers neither the imaging differences of a light field camera for the same target object (region) under different view angles, nor the parameter adjustment of the lens array inside the light field camera itself, so registration errors or blurred image display may occur after super-resolution fusion.
Disclosure of Invention
In order to solve the technical problems, the invention provides a light field multi-view image super-resolution reconstruction method and a light field multi-view image super-resolution reconstruction system based on deep learning.
In the technical scheme, a first imaging focus and first imaging data of a target object on a first lens surface, a second imaging focus and second imaging data on a second lens surface, and the current focal length data of the first lens surface and the second lens surface are acquired for the target object shot by a light field camera under different view angles, and reconstruction results corresponding to the different view angles are obtained, the reconstruction results comprising a first resolution reconstruction result and a second resolution reconstruction result; when the first resolution reconstruction result matches the second resolution reconstruction result, super-resolution reconstruction is started; otherwise, parameter adjustment is initiated.
According to the technical scheme, super-resolution reconstruction of the light field image data can be realized based on difference matching across different view angles, and the imaging parameters are adjusted adaptively when the matching conditions for reconstruction are not met, so that the super-resolution reconstruction effect is ensured.
Specifically, in a first aspect of the present invention, a light field multi-view image super-resolution reconstruction method based on deep learning is provided. The method is implemented based on a light field camera comprising a first lens face and a second lens face;
at least one of the first lens surface and the second lens surface is a flexible optical lens, and the focal length of the flexible optical lens can be adjusted within a design range.
In the performing step, the method comprises:
S1: acquiring a first imaging focus F₁¹ of a target object shot by the light field camera on the first lens surface under a first view angle, and a second imaging focus F₂¹ on the second lens surface;
S2: inputting the current focal length J₁ of the first lens surface, the current focal length J₂ of the second lens surface, and the relative distance |F₂¹ - F₁¹| between the first imaging focus F₁¹ and the second imaging focus F₂¹ on the second lens surface as deep learning parameters to a deep learning model, the deep learning model outputting a first resolution reconstruction result of the target object;
S3: acquiring a first imaging focus F₁² of the target object shot by the light field camera on the first lens surface under a second view angle, and a second imaging focus F₂² on the second lens surface;
S4: inputting the current focal length J₁ of the first lens surface, the current focal length J₂ of the second lens surface, and the relative distance |F₂² - F₁²| between the first imaging focus F₁² and the second imaging focus F₂² on the second lens surface as deep learning parameters to the deep learning model, the deep learning model outputting a second resolution reconstruction result of the target object;
S5: judging whether the first resolution reconstruction result is matched with the second resolution reconstruction result, and if so, entering step S6;
otherwise, adjusting imaging parameters and returning to step S1;
the imaging parameters comprise one of the current focal length of the first lens surface, the current focal length of the second lens surface, the relative distance between the first lens surface and the second lens surface, or any combination thereof;
S6: carrying out image fusion on the first resolution reconstruction result and the second resolution reconstruction result to obtain a super-resolution reconstruction result of the target object.
As a further improvement, the deep learning model is a back propagation neural network model, and the neural network model comprises an input layer, an intermediate layer and an output layer;
the input layer comprises a first node, a second node and a third node;
the first node inputs a current focal length J of the first lens surface 1 (ii) a The second nodal point inputs a current focal length J of the second lens face 2
The third node inputs first imaging data of a target object shot by the light field camera on the first lens surface and second imaging data of the target object shot by the light field camera on the second lens surface;
the middle layer comprises a first fusion node and a second fusion node;
the first fusion node fuses a first imaging focus F on the first lens surface 1 1 And a second imaging focus on the second lens face
Figure BDA0003668860760000034
And obtaining the relative distance
Figure BDA0003668860760000033
The second fusion node fuses the first imaging data and the second imaging data and obtains target data to be reconstructed;
the output layer is connected with the middle layer and the input layer, the output of the middle layer is used as the input of the output layer, and the output result of the output layer is fed back and output to the input layer.
The first perspective is different from the second perspective;
in a specific selection, the first viewing angle or the second viewing angle is selected from one of the following viewing angles:
parallel to a first coordinate axis of the cartesian coordinate system, parallel to a second coordinate axis of the cartesian coordinate system, parallel to a third coordinate axis of the cartesian coordinate system.
The step S5 of judging whether the first resolution reconstruction result and the second resolution reconstruction result match includes:
judging whether the difference value of the image resolutions of the first resolution reconstruction result and the second resolution reconstruction result in the same plane of a Cartesian coordinate system is within a preset range;
if so, the first resolution reconstruction result matches the second resolution reconstruction result.
More specifically, three difference values of the image resolutions of the first resolution reconstruction result and the second resolution reconstruction result in the same three planes of a Cartesian coordinate system are obtained;
if at least two of the three difference values are within the preset range, the first resolution reconstruction result matches the second resolution reconstruction result.
In a second aspect of the present invention, to implement the method in the first aspect, a light field multi-view image super-resolution reconstruction system based on deep learning is provided, the system is connected to a light field camera, the light field camera includes a first lens surface and a second lens surface, a focal length of at least one lens surface is adjustable, and a relative distance between the first lens surface and the second lens surface is adjustable;
in a specific structure, the system further comprises a visual angle setting unit, a deep learning unit, a matching judgment unit, a parameter adjusting unit and a super-resolution reconstruction unit.
More specifically, the functions of the above respective implementation units are implemented as follows:
the visual angle setting unit is used for setting a shooting visual angle of the light field camera, and the shooting visual angle comprises a first visual angle and a second visual angle;
the deep learning unit is used for acquiring a first imaging focus, first imaging data, a second imaging focus, second imaging data and current focal length data of the first lens surface and the second lens surface of a target object shot by the light field camera under different view angles, and acquiring reconstruction results corresponding to the different view angles, wherein the reconstruction results comprise a first resolution reconstruction result and a second resolution reconstruction result;
the matching judgment unit is used for judging whether the first resolution reconstruction result is matched with the second resolution reconstruction result;
the parameter adjusting unit is used for adjusting imaging parameters of the light field camera;
the super-resolution reconstruction unit carries out image fusion on the first resolution reconstruction result and the second resolution reconstruction result to obtain a super-resolution reconstruction result of the target object;
the imaging parameters include one of a current focal length of the first lens surface, a current focal length of the second lens surface, a relative distance of the first lens surface and the second lens surface, or any combination thereof.
More specifically, the matching judgment unit is configured to judge whether the first resolution reconstruction result and the second resolution reconstruction result match, and specifically includes:
acquiring three difference values of image resolutions of three same planes of the first resolution reconstruction result and the second resolution reconstruction result in a Cartesian coordinate system;
and if at least two difference values of the three difference values are within a preset range, matching the first resolution reconstruction result with the second resolution reconstruction result.
The super-resolution reconstruction unit and the parameter adjusting unit are communicated with the matching judging unit;
when the judgment result of the matching judgment unit is matching, starting the super-resolution reconstruction unit; otherwise, starting the parameter adjusting unit.
More specifically, the deep learning unit comprises a deep learning model; the deep learning model is a back propagation neural network model.
In a specific structure, the neural network model comprises an input layer, an intermediate layer and an output layer;
the input layer comprises a first node, a second node and a third node;
the first nodal point inputs a current focal length J of the first lens face 1 (ii) a The second node inputs a current focal length J of the second lens surface 2
The third node inputs first imaging data of a target object shot by the light field camera on the first lens surface and second imaging data of the target object shot by the light field camera on the second lens surface;
the middle layer comprises a first fusion node and a second fusion node;
the first fusion node fuses a first imaging focus F on the first lens surface 1 1 And a second imaging focus on the second lens surface
Figure BDA0003668860760000061
And obtaining the relative distance
Figure BDA0003668860760000062
The second fusion node fuses the first imaging data and the second imaging data and obtains target data to be reconstructed;
the output layer is connected with the middle layer and the input layer, the output of the middle layer is used as the input of the output layer, and the output result of the output layer is fed back and output to the input layer.
According to the technical scheme, super-resolution reconstruction of the light field image data can be realized based on difference matching across different view angles, and the imaging parameters are adjusted adaptively when the matching conditions for reconstruction are not met, so that the super-resolution reconstruction effect is ensured; meanwhile, because a back-propagation neural network deep learning model is adopted, the output result can act back on the input layer, providing a feedback signal for the subsequent secondary view angle adjustment, which further ensures the accuracy and efficiency of the whole reconstruction process and forms closed-loop self-learning feedback.
Further advantages of the invention will be apparent in the detailed description section in conjunction with the drawings attached hereto.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings needed in the embodiments are briefly described below. The drawings in the following description show only some embodiments of the present invention; other drawings can be obtained from them by those skilled in the art without creative effort.
FIG. 1 is a schematic flow chart illustrating steps of a light field multi-view image super-resolution reconstruction method based on deep learning according to an embodiment of the present invention;
FIG. 2 is a block diagram of a deep learning model used in various embodiments of the present invention;
FIG. 3 is a schematic diagram of the relative positions of different lenses of a light field camera as used by various embodiments of the present invention;
FIG. 4 is a schematic diagram illustrating a determination of whether reconstruction results at different viewing angles are matched according to various embodiments of the present invention;
fig. 5 is an architecture diagram of a light field multi-view image super-resolution reconstruction system based on deep learning according to an embodiment of the present invention.
Detailed Description
The invention is further described with reference to the following drawings and detailed description.
Fig. 1 shows a flow of steps of a light field multi-view image super-resolution reconstruction method based on deep learning according to an embodiment of the present invention, which includes a loop determination process of steps S1-S6.
The method of fig. 1 is implemented based on a light field camera comprising a first lens face and a second lens face.
As a specific example, the various embodiments of the present invention are directed to a light field camera that includes a microlens array, although other optical sensor arrangements are also possible. In general, a light field camera consists of three parts: a main lens, a microlens array, and an image sensor.
In the present invention, adjustment of the main lens is not considered; therefore, the subsequent embodiments and diagrams only show the microlens and the corresponding focusing lens (also called a refocusing lens).
The first lens surface and the second lens surface can therefore be understood as a microlens and a corresponding focusing lens, i.e. the first lens surface is a microlens and the second lens surface is a focusing lens, or the second lens surface is a microlens and the first lens surface is a focusing lens.
On this basis, the steps S1-S6 are specifically realized as follows:
S1: acquiring a first imaging focus F₁¹ of a target object shot by the light field camera on the first lens surface under a first view angle, and a second imaging focus F₂¹ on the second lens surface;
S2: inputting the current focal length J₁ of the first lens surface, the current focal length J₂ of the second lens surface, and the relative distance |F₂¹ - F₁¹| between the first imaging focus F₁¹ and the second imaging focus F₂¹ on the second lens surface as deep learning parameters to a deep learning model, the deep learning model outputting a first resolution reconstruction result of the target object;
S3: acquiring a first imaging focus F₁² of the target object shot by the light field camera on the first lens surface under a second view angle, and a second imaging focus F₂² on the second lens surface;
S4: inputting the current focal length J₁ of the first lens surface, the current focal length J₂ of the second lens surface, and the relative distance |F₂² - F₁²| between the first imaging focus F₁² and the second imaging focus F₂² on the second lens surface as deep learning parameters to the deep learning model, the deep learning model outputting a second resolution reconstruction result of the target object;
S5: judging whether the first resolution reconstruction result is matched with the second resolution reconstruction result, and if so, entering step S6;
otherwise, adjusting imaging parameters and returning to step S1;
S6: carrying out image fusion on the first resolution reconstruction result and the second resolution reconstruction result to obtain a super-resolution reconstruction result of the target object.
In step S5, the imaging parameter includes one of a current focal length of the first lens surface, a current focal length of the second lens surface, a relative distance between the first lens surface and the second lens surface, or any combination thereof.
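For concreteness, the S1-S6 loop can be sketched as follows. This is a minimal illustrative sketch, not the patent's implementation: `camera`, `model`, `matches`, `fuse` and `adjust` are hypothetical stand-ins supplied by the caller, since the patent does not prescribe concrete interfaces.

```python
def super_resolve(camera, model, matches, fuse, adjust, max_iters=10):
    """S1-S6 loop: reconstruct at two view angles, then fuse or adjust.

    Assumed interfaces (hypothetical):
      camera.foci(view) -> (F1, F2) imaging foci on the two lens surfaces;
      camera.j1 / camera.j2 -> current focal lengths J1, J2;
      model(J1, J2, distance) -> a resolution reconstruction result;
      matches / fuse / adjust implement step S5, step S6 and the
      imaging-parameter adjustment respectively.
    """
    for _ in range(max_iters):
        # S1 / S3: imaging foci at the first and second view angles
        f1_a, f2_a = camera.foci(view="first")
        f1_b, f2_b = camera.foci(view="second")
        # S2 / S4: model input is (J1, J2, |F2 - F1|) per view angle
        result_a = model(camera.j1, camera.j2, abs(f2_a - f1_a))
        result_b = model(camera.j1, camera.j2, abs(f2_b - f1_b))
        if matches(result_a, result_b):      # S5: resolution match test
            return fuse(result_a, result_b)  # S6: image fusion
        adjust(camera)  # otherwise adapt focal lengths / lens spacing
    raise RuntimeError("reconstructions did not match within iteration budget")
```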
FIG. 2 is a block diagram of a deep learning model used in various embodiments of the present invention.
In fig. 2, the deep learning model is a back-propagation neural network model, which includes an input layer, an intermediate layer and an output layer;
the input layer comprises a first node, a second node and a third node;
the first node inputs a current focal length J of the first lens surface 1 (ii) a The second node inputs a current focal length J of the second lens surface 2
The third node inputs first imaging data of a target object shot by the light field camera on the first lens surface and second imaging data of the target object shot by the light field camera on the second lens surface;
the middle layer comprises a first fusion node and a second fusion node;
the first fusion node fuses a first imaging focus F on the first lens surface 1 1 And a second imaging focus on the second lens face
Figure BDA0003668860760000091
And obtaining the relative distance
Figure BDA0003668860760000092
The second fusion node fuses the first imaging data and the second imaging data and obtains target data to be reconstructed;
the output layer is connected with the middle layer and the input layer, the output of the middle layer is used as the input of the output layer, and the output result of the output layer is fed back and output to the input layer.
For the correlation principles by which deep learning realizes light field reconstruction, reference may be made to the various light field deep learning reconstruction models introduced in the prior art, for example:
[1] Yi Yupeng. Research on light field image super-resolution reconstruction algorithms based on deep learning [D]. Taiyuan University of Science and Technology, 2021.
[2] Ni Lixia. Research on key technologies of accurate three-dimensional light field reproduction [D]. Zhejiang University, 2021.
It can be seen that the embodiments of the invention realize super-resolution reconstruction of light field image data based on difference matching across different view angles, and adaptively adjust the imaging parameters when the matching conditions for reconstruction are not met, thereby ensuring the super-resolution reconstruction effect; meanwhile, because a back-propagation neural network deep learning model is adopted, the output result can act back on the input layer, providing a feedback signal for the subsequent secondary view angle adjustment, which further ensures the accuracy and efficiency of the whole reconstruction process and forms closed-loop self-learning feedback.
More specifically, FIG. 3 illustrates the relative positions of the different lenses of a light field camera as used by various embodiments of the present invention.
FIG. 3 shows the first imaging focus F₁¹ on the first lens surface, the second imaging focus F₂¹ on the second lens surface, the current focal length J₁ of the first lens surface, the current focal length J₂ of the second lens surface, and the relative distance |F₂¹ - F₁¹| between the first imaging focus F₁¹ and the second imaging focus F₂¹.
In various embodiments of the present invention, regardless of the lens array employed, the relative distance |F₂¹ - F₁¹| between the first imaging focus F₁¹ and the second imaging focus F₂¹ can be fine-tuned within the design range.
On this basis, as a further preference, a flexible optical lens can be adopted for some of the lenses, the focal length of the flexible optical lens being adjustable within a design range.
For example, at least one of the first lens surface and the second lens surface is a flexible optical lens, the focal length of which is adjustable within a design range.
Or the first lens surface and the second lens surface are both flexible optical lenses, and the focal length of the flexible optical lenses can be adjusted within a design range.
How to manufacture or implement a flexible optical lens whose focal length can be fine-tuned within a design range belongs to the prior art; see, for example:
[3] Liu Penghui, Li Shiyao, Wang Wen, Wen Xuyang, Wu Chao, Zhou Xiong, et al. Preparation and performance study of flexible liquid crystal microlens arrays [J]. Photonic Science, 2021, 50(03): 86-93.
[4] Wang Jong, Li Lei. Optical imaging systems based on adaptive lenses [J]. Optoelectronics, 2020, 40(03): 155-163.
The present invention therefore does not expand on this point; the documents cited above are incorporated as part of the embodiments of the present invention.
As an empirical alternative of the invention, during the adjustment process the relative distance |F₂¹ - F₁¹| between the first imaging focus F₁¹ and the second imaging focus F₂¹ on the second lens surface, together with the current focal length J₁ of the first lens surface and the current focal length J₂ of the second lens surface, satisfies the following constraint:
[constraint formula: rendered only as an image in the source; it relates |F₂¹ - F₁¹| to J₁ and J₂ through an adjustment coefficient K]
wherein K is an adjustment coefficient;
when the first lens surface is a micro lens and the second lens surface is a focusing lens, K > 1;
when the second lens surface is a micro lens and the first lens surface is a focusing lens, 0 < K < 1.
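Since the constraint formula itself survives only as an image in the source, only the K-range rule just stated can be encoded; the sketch below does exactly that, with `first_is_microlens` as a hypothetical flag.

```python
def k_in_valid_range(k: float, first_is_microlens: bool) -> bool:
    """K > 1 when the first lens surface is the microlens and the second
    is the focusing lens; 0 < K < 1 when the roles are reversed."""
    return k > 1.0 if first_is_microlens else 0.0 < k < 1.0
```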
Reference is next made to fig. 4. Fig. 4 is a schematic diagram illustrating determination of whether reconstruction results at different viewing angles are matched in various embodiments of the present invention.
Specifically, the step S5 of determining whether the first resolution reconstruction result and the second resolution reconstruction result match includes:
judging whether the difference value of the image resolution of the same plane of the first resolution reconstruction result and the second resolution reconstruction result in a Cartesian coordinate system is within a preset range or not;
if yes, the first resolution reconstruction result is matched with the second resolution reconstruction result.
In various embodiments of the present invention, the first viewing angle or the second viewing angle is selected from one of the following viewing angles:
parallel to a first axis of the cartesian coordinate system, parallel to a second axis of the cartesian coordinate system and parallel to a third axis of the cartesian coordinate system.
It is understood that the first coordinate axis, the second coordinate axis, and the third coordinate axis are each one of the X-Y-Z axes.
See, as an example, fig. 4.
Assume that the first viewing angle is parallel to a first coordinate axis of the Cartesian coordinate system, e.g. parallel to the X-axis, and that the second viewing angle is parallel to a second coordinate axis, e.g. parallel to the Y-axis.
The first resolution reconstruction result yields image resolutions in three planes of the Cartesian coordinate system: an image resolution 1 in the XY plane, an image resolution 2 in the XZ plane, and an image resolution 3 in the YZ plane.
The second resolution reconstruction result likewise yields image resolutions in three planes: an image resolution 11 in the XY plane, an image resolution 12 in the XZ plane, and an image resolution 13 in the YZ plane.
At this time, at least two of the image resolutions 1, 2 and 3 should be the same as, or only slightly different from, the corresponding image resolutions 11, 12 and 13 (the difference value being within a preset range), so as to ensure that no error or blur is produced in the subsequent fusion reconstruction process.
Thus, in particular implementations, the first viewing angle is different from the second viewing angle.
More specifically, three difference values of image resolutions of three planes of the first resolution reconstruction result and the second resolution reconstruction result in a cartesian coordinate system are obtained;
and if at least two difference values of the three difference values are within a preset range, matching the first resolution reconstruction result with the second resolution reconstruction result.
The preset range can be set based on the size of the reconstruction resolution, and the higher the requirement of the reconstruction resolution is, the smaller the preset range is.
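A compact sketch of this match test follows; the (XY, XZ, YZ) tuple layout and the default tolerance are assumptions of the sketch, since the patent leaves the preset range to the reconstruction requirement.

```python
def reconstructions_match(res_a, res_b, preset_range=1.0):
    """res_a, res_b: (xy, xz, yz) image resolutions of the two results.

    Matching requires at least two of the three per-plane difference
    values to lie within the preset range.
    """
    within = sum(abs(a - b) <= preset_range for a, b in zip(res_a, res_b))
    return within >= 2

# Example: XY and YZ agree exactly, XZ differs by 2 -> 2 of 3 within range.
print(reconstructions_match((512, 512, 256), (512, 514, 256)))  # True
```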
To implement the above method, referring to fig. 5, fig. 5 is an architecture diagram of a light field multi-view image super-resolution reconstruction system based on deep learning according to an embodiment of the present invention.
In fig. 5, a light field multi-view image super-resolution reconstruction system based on deep learning is shown, the system is connected with a light field camera, the light field camera comprises a first lens surface and a second lens surface, the focal length of at least one lens surface can be adjusted, and the relative distance between the first lens surface and the second lens surface can be adjusted;
fig. 5 shows that the system further comprises:
a viewing angle setting unit for setting a photographing viewing angle of the light field camera, the photographing viewing angle including a first viewing angle and a second viewing angle;
the deep learning unit is used for acquiring, for a target object shot by the light field camera under different view angles, the first imaging focus and first imaging data of the target object on the first lens surface, the second imaging focus and second imaging data on the second lens surface, and the current focal length data of the first lens surface and the second lens surface, and for obtaining reconstruction results corresponding to the different view angles, wherein the reconstruction results comprise a first resolution reconstruction result and a second resolution reconstruction result;
a matching judgment unit, configured to judge whether the first resolution reconstruction result and the second resolution reconstruction result are matched;
a parameter adjusting unit for adjusting imaging parameters of the light field camera;
a super-resolution reconstruction unit which performs image fusion on the first resolution reconstruction result and the second resolution reconstruction result to obtain a super-resolution reconstruction result of the target object;
the imaging parameters include one of a current focal length of the first lens surface, a current focal length of the second lens surface, a relative distance of the first lens surface and the second lens surface, or any combination thereof.
Preferably, the first lens surface and the second lens surface are both flexible optical lenses.
Preferably, the matching judgment unit is configured to judge whether the first resolution reconstruction result and the second resolution reconstruction result match, and specifically includes:
acquiring three difference values of image resolutions of three same planes of the first resolution reconstruction result and the second resolution reconstruction result under a Cartesian coordinate system;
and if at least two difference values of the three difference values are within a preset range, matching the first resolution reconstruction result with the second resolution reconstruction result.
The super-resolution reconstruction unit and the parameter adjusting unit are communicated with the matching judgment unit;
when the judgment result of the matching judgment unit is matching, starting the super-resolution reconstruction unit; otherwise, starting the parameter adjusting unit.
The deep learning unit comprises a deep learning model;
the deep learning model is a back propagation neural network model.
Specific deep learning model structure can be seen in fig. 2, and will not be described again.
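To make the unit decomposition concrete, a structural sketch of the Fig. 5 system is given below; all unit interfaces (`set_views`, `reconstruct`, `matches`, `adjust`, `fuse`) are hypothetical names used only for illustration.

```python
class LightFieldSRSystem:
    """Wires the five units of Fig. 5; the match result gates whether the
    super-resolution unit or the parameter-adjusting unit runs."""

    def __init__(self, view_unit, dl_unit, match_unit, adjust_unit, sr_unit):
        self.view_unit = view_unit      # sets the first and second view angles
        self.dl_unit = dl_unit          # yields the two reconstruction results
        self.match_unit = match_unit    # judges whether the results match
        self.adjust_unit = adjust_unit  # tunes focal lengths / lens spacing
        self.sr_unit = sr_unit          # fuses matched results

    def step(self, camera):
        views = self.view_unit.set_views(camera)
        r1, r2 = self.dl_unit.reconstruct(camera, views)
        if self.match_unit.matches(r1, r2):
            return self.sr_unit.fuse(r1, r2)   # start SR reconstruction
        self.adjust_unit.adjust(camera)        # otherwise adjust parameters
        return None                            # caller repeats after adjusting
```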
According to the above technical scheme, super-resolution reconstruction of light field image data can be realized based on difference matching across different view angles, and the imaging parameters are adjusted adaptively when the matching conditions for reconstruction are not met, ensuring the super-resolution reconstruction effect; meanwhile, because a back-propagation neural network deep learning model is adopted, the output result can act back on the input layer, providing a feedback signal for the subsequent secondary view angle adjustment, which further ensures the accuracy and efficiency of the whole reconstruction process and forms closed-loop self-learning feedback.
This section describes a number of different embodiments. It should be noted that different embodiments of the present invention can respectively solve one or more technical problems mentioned in the background art and achieve corresponding technical effects, and different combinations of the embodiments can solve all the mentioned technical problems and achieve all the technical effects;
however, it is not required that every single embodiment of the present invention solve all the technical problems or achieve all the improvements. The solution to a certain problem or the improved corresponding embodiment of a single technical effect may both constitute an independent technical solution of the present invention.
Although the present invention has been described in detail with reference to the above embodiments, it should be understood by those skilled in the art that: modifications and equivalents may be made to the embodiments of the invention without departing from the spirit and scope of the invention, which is to be covered by the claims.
The present invention is not limited to the specific module structure described in the prior art. The prior art mentioned in the background section can be used as part of the invention to understand the meaning of some technical features or parameters. The scope of the present invention is defined by the claims.

Claims (7)

1. A light field multi-view image super-resolution reconstruction method based on deep learning is realized based on a light field camera, wherein the light field camera comprises a first lens surface and a second lens surface;
characterized in that the method comprises the following steps:
S1: acquiring a first imaging focus F₁¹ of a target object shot by the light field camera on the first lens surface under a first view angle, and a second imaging focus F₂¹ on the second lens surface;
S2: inputting the current focal length J₁ of the first lens surface, the current focal length J₂ of the second lens surface, and the relative distance |F₂¹ - F₁¹| between the first imaging focus F₁¹ and the second imaging focus F₂¹ on the second lens surface as deep learning parameters into a deep learning model, the deep learning model outputting a first resolution reconstruction result of the target object;
S3: acquiring a first imaging focus F₁² of the target object shot by the light field camera on the first lens surface under a second view angle, and a second imaging focus F₂² on the second lens surface;
S4: inputting the current focal length J₁ of the first lens surface, the current focal length J₂ of the second lens surface, and the relative distance |F₂² - F₁²| between the first imaging focus F₁² and the second imaging focus F₂² on the second lens surface as deep learning parameters into the deep learning model, the deep learning model outputting a second resolution reconstruction result of the target object;
S5: judging whether the first resolution reconstruction result is matched with the second resolution reconstruction result, and if so, entering step S6;
otherwise, adjusting imaging parameters and returning to step S1;
the imaging parameters comprise one of the current focal length of the first lens surface, the current focal length of the second lens surface, the relative distance between the first lens surface and the second lens surface, or any combination thereof;
S6: performing image fusion on the first resolution reconstruction result and the second resolution reconstruction result to obtain a super-resolution reconstruction result of the target object;
the deep learning model is a back-propagation neural network model, the neural network model comprising an input layer, a middle layer and an output layer; the output layer is connected with the middle layer and the input layer, the output of the middle layer serves as the input of the output layer, and the output result of the output layer is fed back to the input layer;
the step S5 of judging whether the first resolution reconstruction result and the second resolution reconstruction result match includes: acquiring three difference values of the image resolutions of the first resolution reconstruction result and the second resolution reconstruction result in the same three planes of a Cartesian coordinate system; and if at least two of the three difference values are within a preset range, the first resolution reconstruction result matches the second resolution reconstruction result.
2. The light field multi-view image super-resolution reconstruction method based on deep learning of claim 1, wherein: the input layer comprises a first node, a second node and a third node;
the first node inputs a current focal length J of the first lens surface 1 (ii) a The second node inputs a current focal length J of the second lens surface 2
The third node inputs first imaging data of a target object shot by the light field camera on the first lens surface and second imaging data of the target object shot by the light field camera on the second lens surface;
the middle layer comprises a first fusion node and a second fusion node;
the first fusion node fuses a first imaging focus F on the first lens face 1 1 And a second imaging focus F on the second lens face 2 1 And obtaining the relative distance | F 2 1 -F 1 1 |;
And the second fusion node fuses the first imaging data and the second imaging data and obtains target data to be reconstructed.
3. The light field multi-view image super-resolution reconstruction method based on deep learning of claim 1, wherein: at least one of the first lens surface and the second lens surface is a flexible optical lens, and the focal length of the flexible optical lens can be adjusted within a design range.
4. The light field multi-view image super-resolution reconstruction method based on deep learning of claim 1, wherein: the first perspective is different from the second perspective;
the first viewing angle or the second viewing angle is selected from one of the following viewing angles:
parallel to a first coordinate axis of the cartesian coordinate system, parallel to a second coordinate axis of the cartesian coordinate system, parallel to a third coordinate axis of the cartesian coordinate system.
5. A light field multi-view image super-resolution reconstruction system based on deep learning is connected with a light field camera, wherein the light field camera comprises a first lens surface and a second lens surface, the focal length of at least one lens surface is adjustable, and the relative distance between the first lens surface and the second lens surface is adjustable;
characterized in that the system further comprises:
a viewing angle setting unit for setting a photographing viewing angle of the light field camera, the photographing viewing angle including a first viewing angle and a second viewing angle;
the deep learning unit is used for acquiring, for a target object shot by the light field camera under different view angles, the first imaging focus and first imaging data of the target object on the first lens surface, the second imaging focus and second imaging data on the second lens surface, and the current focal length data of the first lens surface and the second lens surface, and for obtaining reconstruction results corresponding to the different view angles, wherein the reconstruction results comprise a first resolution reconstruction result and a second resolution reconstruction result;
a matching judgment unit, configured to judge whether the first resolution reconstruction result and the second resolution reconstruction result are matched;
a parameter adjusting unit for adjusting imaging parameters of the light field camera;
a super-resolution reconstruction unit which performs image fusion on the first resolution reconstruction result and the second resolution reconstruction result to obtain a super-resolution reconstruction result of the target object;
the imaging parameters comprise one of the current focal length of the first lens surface, the current focal length of the second lens surface, the relative distance between the first lens surface and the second lens surface, or any combination thereof;
the deep learning unit comprises a deep learning model; the deep learning model is a back-propagation neural network model comprising an input layer, a middle layer and an output layer; the output layer is connected with the middle layer and the input layer, the output of the middle layer serves as the input of the output layer, and the output result of the output layer is fed back to the input layer; the matching judgment unit is configured to judge whether the first resolution reconstruction result and the second resolution reconstruction result match, specifically: acquiring three difference values of the image resolutions of the first resolution reconstruction result and the second resolution reconstruction result in the same three planes of a Cartesian coordinate system; and if at least two of the three difference values are within a preset range, the first resolution reconstruction result matches the second resolution reconstruction result.
6. The deep learning-based light field multi-view image super-resolution reconstruction system according to claim 5, wherein: the first lens surface and the second lens surface are both flexible optical lenses.
7. The light field multi-view image super-resolution reconstruction system based on deep learning of claim 5, wherein: the super-resolution reconstruction unit and the parameter adjusting unit are communicated with the matching judging unit;
when the judgment result of the matching judgment unit is matching, starting the super-resolution reconstruction unit; otherwise, starting the parameter adjusting unit.
CN202210599012.5A 2022-05-30 2022-05-30 Light field multi-view image super-resolution reconstruction method and system based on deep learning Active CN114926339B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210599012.5A CN114926339B (en) 2022-05-30 2022-05-30 Light field multi-view image super-resolution reconstruction method and system based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210599012.5A CN114926339B (en) 2022-05-30 2022-05-30 Light field multi-view image super-resolution reconstruction method and system based on deep learning

Publications (2)

Publication Number Publication Date
CN114926339A CN114926339A (en) 2022-08-19
CN114926339B (en) 2023-02-03

Family

ID=82812593

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210599012.5A Active CN114926339B (en) 2022-05-30 2022-05-30 Light field multi-view image super-resolution reconstruction method and system based on deep learning

Country Status (1)

Country Link
CN (1) CN114926339B (en)


Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108280814B (en) * 2018-02-08 2021-08-31 重庆邮电大学 Light field image angle super-resolution reconstruction method based on perception loss
CN113870433A (en) * 2021-09-09 2021-12-31 华中科技大学 Light field super-resolution three-dimensional reconstruction method and system

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112750076A (en) * 2020-04-13 2021-05-04 奕目(上海)科技有限公司 Light field multi-view image super-resolution reconstruction method based on deep learning
WO2022016350A1 (en) * 2020-07-21 2022-01-27 Oppo广东移动通信有限公司 Light field image processing method, light field image encoder and decoder, and storage medium
CN112070675A (en) * 2020-09-07 2020-12-11 武汉工程大学 Regularization light field super-resolution method based on graph and light field microscopic device
CN112102173A (en) * 2020-10-26 2020-12-18 安庆师范大学 Optical field image angle super-resolution reconstruction method
CN112365400A (en) * 2020-10-27 2021-02-12 深圳大学 Rapid super-resolution reconstruction method for light field angle
CN113256772A (en) * 2021-05-10 2021-08-13 华中科技大学 Double-angle light field high-resolution reconstruction system and method based on visual angle conversion
CN113850902A (en) * 2021-09-13 2021-12-28 清华大学深圳国际研究生院 Light field three-dimensional reconstruction method based on light field microscope system

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Meng Nan et al. High-dimensional dense residual convolutional neural network for light field reconstruction. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 43(3): 873-886. *
Wang Xinya et al. MPIN: a macro-pixel integration network for light field super-resolution. Frontiers of Information Technology & Electronic Engineering, 2021, 22(10): 1299-1310. *
Ni Lixia. Research on key technologies of accurate three-dimensional light field reproduction. China Doctoral Dissertations Full-text Database, Information Science and Technology, 2022, (1): I138-171. *
Deng Wu et al. Light field super-resolution reconstruction fusing global and local view angles. Application Research of Computers, 2018, (5). *

Also Published As

Publication number Publication date
CN114926339A (en) 2022-08-19


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant