CN115187454A

CN115187454A - Multi-view image super-resolution reconstruction method and device based on meta-imaging

Info

Publication number: CN115187454A
Application number: CN202210605176.4A
Authority: CN
Inventors: 郭钰铎; 蔡娅雯; 郭泽群
Original assignee: Yuantong Beijing Technology Co ltd
Current assignee: Yuantong Beijing Technology Co ltd
Priority date: 2022-05-30
Filing date: 2022-05-30
Publication date: 2022-10-14
Also published as: WO2023231138A1

Abstract

The application relates to the technical field of image reconstruction, in particular to a multi-view image super-resolution reconstruction method, a device and equipment based on meta-imaging, wherein the method comprises the following steps: collecting a multi-view image of an object to be reconstructed; screening a multi-view target image with a resolution lower than a preset resolution from the multi-view images; inputting the multi-view target image into a pre-trained image reconstruction model to obtain a multi-view reconstruction image of an object to be reconstructed, wherein the pre-trained image reconstruction model is obtained by training a plurality of images lower than a first preset resolution threshold and images correspondingly higher than a second preset resolution threshold, recovering a higher multi-view resolution image under a smaller scanning frequency by utilizing the redundant sampling characteristic among the multi-view images, and finally realizing the reconstruction of a low-resolution multi-view image by utilizing a neural network model. Therefore, the problems that the scanning times are too many, the accuracy of the recovered multi-view resolution image is low and the like in the traditional scanning light field image reconstruction are solved.

Description

Multi-view image super-resolution reconstruction method and device based on meta-imaging

Technical Field

The present application relates to the field of image reconstruction technologies, and in particular, to a multi-view image super-resolution reconstruction method and apparatus based on meta-imaging.

Background

At present, any video camera equipment is affected by physical conditions of an imaging system, shooting environment and the like, and the actually obtained images are poor in quality and low in resolution.

In the related art, the resolution of the image can be improved by reducing the pixel size, increasing the chip scale, changing the arrangement mode of the detecting elements and reconstructing the super-resolution image. The process level is improved, the size of the detecting element is reduced, the density of the detecting element of a CCD (Charge Coupled Device) camera is increased, although the purposes of reducing the size of a pixel and improving the resolution of an image can be achieved, the illumination obtained by each pixel is reduced, correspondingly, the exposure noise is more obvious, and the image quality is deteriorated.

In the related art, it is difficult to recover a high multi-view resolution image, and a high-resolution two-dimensional or three-dimensional image cannot be reconstructed, which is urgently needed to be solved.

Disclosure of Invention

The application provides a multi-view image super-resolution reconstruction method and device based on meta-imaging, electronic equipment and a storage medium, and aims to solve the problems that scanning times are too many, accuracy of recovered multi-view resolution images is low and the like when a traditional light field super-resolution works.

The embodiment of the first aspect of the application provides a multi-view image super-resolution reconstruction method based on meta-imaging, which includes the following steps: collecting a multi-view image of an object to be reconstructed; screening a multi-view target image with a resolution lower than a preset resolution from the multi-view images; and inputting the multi-view target image into a pre-trained image reconstruction model to obtain a multi-view reconstruction image of the object to be reconstructed, wherein the pre-trained image reconstruction model is obtained by training a plurality of images lower than a first preset resolution threshold value and images correspondingly higher than a second preset resolution threshold value.

Optionally, in an embodiment of the present application, before inputting the multi-view target image into a pre-trained image reconstruction model, the method further includes: acquiring a plurality of first multi-view training images, which are lower than the first preset resolution threshold, of a training reconstruction object and a plurality of second multi-view training images, which correspond to the first multi-view training images and are higher than the second preset resolution threshold, wherein the second preset resolution threshold is greater than the first preset resolution threshold; training a neural network model by taking the first multi-view training image as input and the second multi-view training image as output; and when the neural network model meets the training termination condition, generating the pre-trained image reconstruction model.

Optionally, in an embodiment of the present application, the training termination condition includes: and the resolution difference value between the output image and the input image of the neural network is greater than the preset resolution difference value or the iteration times meet the preset times.

Optionally, in an embodiment of the present application, before generating the pre-trained image reconstruction model, further including: acquiring a target resolution difference value of an input image and an output image in the image reconstruction model; and matching the difference value of the first preset resolution threshold value and the second preset resolution threshold value according to the target resolution difference value.

Optionally, in an embodiment of the present application, after obtaining the multi-view reconstructed image of the object to be reconstructed, the method further includes: and re-reconstructing the multi-view reconstructed image by using an image reconstruction algorithm to obtain a two-dimensional or three-dimensional reconstructed image of the object to be reconstructed.

The embodiment of the second aspect of the present application provides a multi-view image super-resolution reconstruction apparatus based on meta-imaging, including: the first acquisition module is used for acquiring a multi-view image of an object to be reconstructed; the screening module is used for screening multi-view target images with the resolution lower than a preset resolution from the multi-view images; the first reconstruction module is used for inputting the multi-view target image into a pre-trained image reconstruction model to obtain a multi-view reconstruction image of the object to be reconstructed, wherein the pre-trained image reconstruction model is obtained by training a plurality of images lower than a first preset resolution threshold value and images correspondingly higher than a second preset resolution threshold value.

Optionally, in an embodiment of the present application, the method further includes: a second collecting module, configured to collect, before the multi-view target image is input into a pre-trained image reconstruction model, a plurality of first multi-view training images lower than the first preset resolution threshold for training a reconstruction object and a plurality of second multi-view training images higher than the second preset resolution threshold corresponding to the first multi-view training images, where the second preset resolution threshold is greater than the first preset resolution threshold; the training module is used for training a neural network model by taking the first multi-view training image as input and the second multi-view training image as output; and the generating module is used for generating the pre-trained image reconstruction model when the neural network model meets the training termination condition.

Optionally, in an embodiment of the present application, the method further includes: the acquisition module is used for acquiring a target resolution difference value of an input image and an output image in the image reconstruction model before the pre-trained image reconstruction model is generated; and the matching module is used for matching the difference value between the first preset resolution threshold value and the second preset resolution threshold value according to the target resolution difference value.

Optionally, in an embodiment of the present application, the method further includes: and the second reconstruction module is used for re-reconstructing the multi-view reconstructed image by using an image reconstruction algorithm after the multi-view reconstructed image of the object to be reconstructed is obtained, so as to obtain a two-dimensional or three-dimensional reconstructed image of the object to be reconstructed.

An embodiment of a third aspect of the present application provides an electronic device, including: a memory, a processor and a computer program stored on the memory and executable on the processor, the processor executing the program to perform the method for super-resolution reconstruction of multi-view images based on meta-imaging as described in the above embodiments.

A fourth aspect of the present application provides a computer-readable storage medium, on which a computer program is stored, where the program is executed by a processor to execute the method for super-resolution reconstruction of multi-view images based on meta-imaging as described in the foregoing embodiments.

Therefore, the embodiment of the application has the following beneficial effects:

the method comprises the steps of acquiring a multi-view image of an object to be reconstructed; screening a multi-view target image with a resolution lower than a preset resolution from the multi-view images; inputting the multi-view target image into a pre-trained image reconstruction model to obtain a multi-view reconstruction image of an object to be reconstructed, wherein the pre-trained image reconstruction model is obtained by training a plurality of images lower than a first preset resolution threshold and images correspondingly higher than a second preset resolution threshold, recovering a higher multi-view resolution image under a smaller scanning frequency by utilizing the redundant sampling characteristic among the multi-view images, and finally realizing the reconstruction of a low-resolution multi-view image by utilizing a neural network model. Therefore, the problems that the scanning times are too many, the accuracy of the recovered multi-view resolution image is low and the like in the traditional light field super-resolution work are solved.

Additional aspects and advantages of the present application will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the present application.

Drawings

The foregoing and/or additional aspects and advantages of the present application will become apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:

fig. 1 is a flowchart of a method for super-resolution reconstruction of a multi-view image based on meta-imaging according to an embodiment of the present application;

FIG. 2 is a schematic diagram illustrating an implementation logic of a training and prediction process for an image reconstruction model according to an embodiment of the present application;

fig. 3 is an exemplary diagram of a multi-view image super-resolution reconstruction apparatus based on meta-imaging according to an embodiment of the present application;

fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present application.

Description of reference numerals: a first acquisition module-100, a screening module-200, a first reconstruction module-300, a memory-401, a processor-402, and a communication interface-403.

Detailed Description

Reference will now be made in detail to embodiments of the present application, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the drawings are exemplary and intended to be used for explaining the present application and should not be construed as limiting the present application.

The following describes a multi-view image super-resolution reconstruction method, an apparatus, an electronic device, and a storage medium based on meta-imaging according to embodiments of the present application with reference to the drawings. In order to solve the problems mentioned in the background art, the present application provides a multi-view image super-resolution reconstruction method based on meta-imaging, in which method, the embodiments of the present application collect multi-view images of an object to be reconstructed; screening a multi-view target image with a resolution lower than a preset resolution from the multi-view images; inputting the multi-view target image into a pre-trained image reconstruction model to obtain a multi-view reconstruction image of an object to be reconstructed, wherein the pre-trained image reconstruction model is obtained by training a plurality of images lower than a first preset resolution threshold and images correspondingly higher than a second preset resolution threshold, recovering a higher multi-view resolution image under a smaller scanning frequency by utilizing the redundant sampling characteristic among the multi-view images, and finally realizing the reconstruction of a low-resolution multi-view image by utilizing a neural network model. Therefore, the problems that the scanning times are too many, the accuracy of the recovered multi-view resolution image is low and the like in the traditional light field super-resolution work are solved

Specifically, fig. 1 is a flowchart of a multi-view image super-resolution reconstruction method based on meta-imaging according to an embodiment of the present disclosure.

As shown in fig. 1, the method for super-resolution reconstruction of multi-view images based on meta-imaging comprises the following steps:

in step S101, multi-view images of an object to be reconstructed are acquired.

In order to reconstruct an object to be reconstructed, the embodiment of the application can acquire multi-view images of the object to be reconstructed, and acquire a plurality of multi-view images by acquiring the object to be reconstructed in multiple angles and multiple postures. There are various ways to capture the image, such as capturing by a camera or an imaging system with a lens, and the like, and the method is not particularly limited.

In step S102, a multi-view target image lower than a preset resolution is screened from the multi-view images.

It can be appreciated that the embodiments of the present application can improve the reconstruction of a low-resolution multi-view image into a high-resolution reconstructed image. The embodiment of the application can screen the multi-view images after acquiring the multi-view images of the object to be reconstructed, and screen out a plurality of multi-view images with resolution lower than preset resolution for reconstruction.

Specifically, in the embodiment of the present application, in order to improve the effect of super-resolution reconstruction of a multi-view image, a plurality of images with relatively low resolution as much as possible may be input. In particular, these multiple relatively low resolution images are characterized by: the multi-view image is generated by at least 2 or more scans.

In step S103, the multi-view target image is input into a pre-trained image reconstruction model to obtain a multi-view reconstructed image of the object to be reconstructed, where the pre-trained image reconstruction model is obtained by training a plurality of images lower than a first preset resolution threshold and an image correspondingly higher than a second preset resolution threshold.

And inputting the multi-view target image which is subjected to screening and is lower than the preset resolution in the multi-view image as an input image into a pre-trained image fusion model to obtain a multi-view reconstructed image. The multi-view image super-resolution reconstruction of the multi-view target image is realized through the neural network model.

It will be appreciated by those skilled in the art that in embodiments of the present application, the image reconstruction model described above may be implemented using a convolutional neural network based on a 3d net and attention mechanism. Specifically, a multi-view image with relatively low resolution is input as a three-dimensional image stack, characteristics of each view and different views are learned by three-dimensional convolution, and a true value is finally output as a super-resolution multi-view image. Wherein the attention mechanism module is arranged in a cascade of down-sampling on the left side of the 3d net and deconvolution subtraction on the right side. In a specific implementation process, a technician may also adopt other super-resolution image reconstruction strategies based on deep learning, which is not specifically limited herein.

Optionally, in an embodiment of the present application, before inputting the multi-view target image into the pre-trained image reconstruction model, the method further includes: collecting a plurality of first multi-view training images which are lower than a first preset resolution threshold value and used for training a reconstructed object and a plurality of second multi-view training images which correspond to the first multi-view training images and are higher than a second preset resolution threshold value, wherein the second preset resolution threshold value is larger than the first preset resolution threshold value; training a neural network model by taking the first multi-view training image as input and the second multi-view training image as output; and when the neural network model meets the training termination condition, generating a pre-trained image reconstruction model.

Specifically, a supervised training mode is adopted in the training process of the image reconstruction model, and all input images of the model training have corresponding result labels. It should be noted that the input data for model training is a plurality of multi-view images with relatively low resolution. In particular, these multiple relatively low-resolution multi-view images are characterized by: the multi-view image is generated at least by 2 or more scans.

The output data of the training, i.e. the labels, should be relatively high resolution multi-view images. The characteristics of the multi-view high-resolution images are as follows: the high-resolution multi-view image should have a one-to-one correspondence with the plurality of input low-resolution multi-view images. During training, the resolution factor of the improvement is determined by training data and can be 2 times, 3 times, 4 times, 5 times and the like.

Optionally, in an embodiment of the present application, the training termination condition includes: the resolution difference value between the output image and the input image of the neural network is larger than the preset resolution difference value or the iteration times meet the preset times.

And inputting the low-resolution multi-view image training data into the model to obtain a high-resolution multi-view reconstructed image. In the training process, a person skilled in the art can decide whether to terminate the training of the model according to actual conditions, such as the difference or multiple of the resolution between the output image and the input image. For example, when the output image resolution is 5 times the input image resolution, training of the model is terminated. In addition, the skilled person can also give appropriate weight to the difference or multiple of the average resolution between the output image and the input image as a hyperparameter to combine with some loss functions, and continuously update each parameter through back propagation, so that after the loss function value reaches a certain range, the model training is finished.

Optionally, in an embodiment of the present application, before generating the pre-trained image reconstruction model, the method further includes: acquiring a target resolution difference value of an input image and an output image in an image reconstruction model; and matching the difference value of the first preset resolution threshold value and the second preset resolution threshold value according to the target resolution difference value.

It can be understood that, during training, a plurality of first multi-view training images lower than the first preset resolution threshold are input, and a plurality of second multi-view training images higher than the second preset resolution threshold corresponding to the first multi-view training images are output. The embodiment of the application can adjust the size of the first preset resolution threshold value and the second preset resolution threshold value during model training according to the input domain output requirement of the image reconstruction model, and as a specific adjusting mode, the embodiment of the application can adjust the difference value of the first preset resolution threshold value and the second preset resolution threshold value according to the difference value of the input resolution and the inattentive resolution.

It should be noted that after the multi-view reconstructed image of the object to be reconstructed is obtained by inputting the multiple multi-view images into the pre-trained image reconstruction model, the embodiment of the application may further reconstruct the multi-view reconstructed image by using an image reconstruction algorithm to obtain a two-dimensional or three-dimensional reconstructed image of the object to be reconstructed. For example, if the resolution of the multi-view image output after the light field is collected is increased, if the resolution of each view needs to be increased by 3 times (three times for horizontal and vertical), the number of scanning times theoretically needs to reach 3 × 3 times. It can be understood that each pixel is densely sampled 3 x 3 times, divided into 3 x 3 sub-pixels. In the embodiment of the present application, only 2 or 3 scans need to be used to achieve the resolution of 3 × 3 scans, and a high-resolution two-dimensional or three-dimensional image is finally reconstructed, where the scans may be performed in an image plane scanning manner, which includes but is not limited to displacement through a microlens array, scanning by a photosensor, or vibration and displacement of a light path itself. The logic for executing the above-mentioned training and prediction process of the image reconstruction model is shown in fig. 2.

According to the multi-view image super-resolution reconstruction method based on the meta-imaging, which is provided by the embodiment of the application, multi-view images of an object to be reconstructed are collected; screening a multi-view target image with a resolution lower than a preset resolution from the multi-view images; and inputting the multi-view target image into a pre-trained image reconstruction model to obtain a multi-view reconstruction image of the object to be reconstructed, wherein the pre-trained image reconstruction model is obtained by training a plurality of images lower than a first preset resolution threshold and images correspondingly higher than a second preset resolution threshold. Therefore, the high multi-view resolution image can be recovered under a small scanning frequency, and the high-resolution two-dimensional or three-dimensional image can be finally reconstructed.

The multi-view image super-resolution reconstruction device based on meta-imaging proposed according to the embodiments of the present application is described next with reference to the accompanying drawings.

Fig. 3 is a block diagram of a multi-view image super-resolution reconstruction apparatus based on meta-imaging according to an embodiment of the present application.

As shown in fig. 3, the multi-view image super-resolution reconstruction apparatus 10 based on meta-imaging includes: a first acquisition module 100, a screening module 200, and a first reconstruction module 300.

The first acquisition module is used for acquiring a multi-view image of an object to be reconstructed. And the screening module is used for screening the multi-view target image with the resolution ratio lower than the preset resolution ratio from the multi-view images. The first reconstruction module is used for inputting the multi-view target images into a pre-trained image reconstruction model to obtain multi-view reconstructed images of the object to be reconstructed, wherein the pre-trained image reconstruction model is obtained by training a plurality of images lower than a first preset resolution threshold and images correspondingly higher than a second preset resolution threshold.

Optionally, in an embodiment of the present application, the method further includes: the second acquisition module is used for acquiring a plurality of first multi-view training images lower than a first preset resolution threshold value of a training reconstruction object and a plurality of second multi-view training images higher than a second preset resolution threshold value corresponding to the first multi-view training images before inputting the multi-view target images into a pre-trained image reconstruction model, wherein the second preset resolution threshold value is larger than the first preset resolution threshold value; the training module is used for training the neural network model by taking the first multi-view training image as input and the second multi-view training image as output; and the generating module is used for generating a pre-trained image reconstruction model when the neural network model meets the training termination condition.

Optionally, in an embodiment of the present application, the method further includes: the acquisition module is used for acquiring a target resolution difference value of an input image and an output image in the image reconstruction model before generating a pre-trained image reconstruction model; and the matching module is used for matching the difference value of the first preset resolution threshold value and the second preset resolution threshold value according to the target resolution difference value.

It should be noted that the foregoing explanation on the embodiment of the multi-view image super-resolution reconstruction method based on meta-imaging is also applicable to the multi-view image super-resolution reconstruction apparatus based on meta-imaging in this embodiment, and details are not repeated here.

According to the multi-view image super-resolution reconstruction device based on the meta-imaging, which is provided by the embodiment of the application, multi-view images of an object to be reconstructed are collected; screening a multi-view target image with a resolution lower than a preset resolution from the multi-view images; and inputting the multi-view target image into a pre-trained image reconstruction model to obtain a multi-view reconstruction image of the object to be reconstructed, wherein the pre-trained image reconstruction model is obtained by training a plurality of images lower than a first preset resolution threshold and images correspondingly higher than a second preset resolution threshold. And recovering a higher multi-view resolution image under fewer scanning times by utilizing the redundant sampling characteristic among the multi-view images, and finally realizing the reconstruction of the low-resolution multi-view image by utilizing a neural network model.

Fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present application. The electronic device may include:

memory 401, processor 402, and computer programs stored on memory 401 and executable on processor 402.

The processor 402 executes the program to implement the multi-view image super-resolution reconstruction method based on meta-imaging provided in the above-described embodiments.

Further, the electronic device further includes:

a communication interface 403 for communication between the memory 401 and the processor 402.

A memory 401 for storing computer programs executable on the processor 402.

Memory 401 may comprise high-speed RAM memory, and may also include non-volatile memory (non-volatile memory), such as at least one disk memory.

If the memory 401, the processor 402 and the communication interface 403 are implemented independently, the communication interface 403, the memory 401 and the processor 402 may be connected to each other through a bus and perform communication with each other. The bus may be an Industry Standard Architecture (ISA) bus, a Peripheral Component Interconnect (PCI) bus, an Extended ISA (EISA) bus, or the like. The bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one thick line is shown in FIG. 4, but this does not indicate only one bus or one type of bus.

Optionally, in a specific implementation, if the memory 401, the processor 402, and the communication interface 403 are integrated on a chip, the memory 401, the processor 402, and the communication interface 403 may complete mutual communication through an internal interface.

The processor 402 may be a Central Processing Unit (CPU), an Application Specific Integrated Circuit (ASIC), or one or more Integrated circuits configured to implement embodiments of the present Application.

The present embodiment also provides a computer-readable storage medium, on which a computer program is stored, wherein the program is executed by a processor to implement the above meta-imaging based multi-view image super-resolution reconstruction method.

In the description herein, reference to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., means that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the application. In this specification, the schematic representations of the terms used above are not necessarily intended to refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or N embodiments or examples. Moreover, various embodiments or examples and features of various embodiments or examples described in this specification can be combined and combined by one skilled in the art without being mutually inconsistent.

Furthermore, the terms "first", "second" and "first" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include at least one such feature. In the description of the present application, "N" means at least two, e.g., two, three, etc., unless explicitly defined otherwise.

Any process or method descriptions in flow charts or otherwise described herein may be understood as representing modules, segments, or portions of code which include one or more N executable instructions for implementing steps of a custom logic function or process, and alternate implementations are included within the scope of the preferred embodiment of the present application in which functions may be executed out of order from that shown or discussed, including substantially concurrently or in reverse order, depending on the functionality involved, as would be understood by those reasonably skilled in the art of implementing the embodiments of the present application.

It should be understood that portions of the present application may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, the N steps or methods may be implemented in software or firmware stored in a memory and executed by a suitable instruction execution system. If implemented in hardware, as in another embodiment, any one or combination of the following techniques, which are well known in the art, may be used: a discrete logic circuit having a logic gate circuit for implementing a logic function on a data signal, an application specific integrated circuit having an appropriate combinational logic gate circuit, a Programmable Gate Array (PGA), a Field Programmable Gate Array (FPGA), or the like.

It will be understood by those skilled in the art that all or part of the steps carried out in the method of implementing the above embodiments may be implemented by hardware related to instructions of a program, which may be stored in a computer readable storage medium, and the program, when executed, includes one or a combination of the steps of the method embodiments.

Claims

1. A multi-view image super-resolution reconstruction method based on meta-imaging is characterized by comprising the following steps:

collecting a multi-view image of an object to be reconstructed;

screening a multi-view target image with a resolution lower than a preset resolution from the multi-view images;

and inputting the multi-view target image into a pre-trained image reconstruction model to obtain a multi-view reconstruction image of the object to be reconstructed, wherein the pre-trained image reconstruction model is obtained by training a plurality of images lower than a first preset resolution threshold value and images correspondingly higher than a second preset resolution threshold value.

2. The method of claim 1, wherein before inputting the multi-view target image into a pre-trained image reconstruction model, further comprising:

collecting a plurality of first multi-view training images of a training reconstruction object, wherein the first multi-view training images are lower than the first preset resolution threshold value, and a plurality of second multi-view training images which correspond to the first multi-view training images and are higher than the second preset resolution threshold value, wherein the second preset resolution threshold value is larger than the first preset resolution threshold value;

training a neural network model by taking the first multi-view training image as input and the second multi-view training image as output;

and when the neural network model meets the training termination condition, generating the pre-trained image reconstruction model.

3. The method of claim 2, wherein the training termination condition comprises:

and the resolution difference value between the output image and the input image of the neural network is greater than the preset resolution difference value or the iteration times meet the preset times.

4. The method of claim 2, further comprising, prior to generating the pre-trained image reconstruction model:

acquiring a target resolution difference value of an input image and an output image in the image reconstruction model;

and matching the difference value of the first preset resolution threshold value and the second preset resolution threshold value according to the target resolution difference value.

5. The method according to any one of claims 1-4, further comprising, after obtaining the multi-view reconstructed image of the object to be reconstructed:

and re-reconstructing the multi-view reconstructed image by using an image reconstruction algorithm to obtain a two-dimensional or three-dimensional reconstructed image of the object to be reconstructed.

6. A multi-view image super-resolution reconstruction device based on meta-imaging is characterized by comprising:

the first acquisition module is used for acquiring a multi-view image of an object to be reconstructed;

the screening module is used for screening multi-view target images with the resolution lower than a preset resolution from the multi-view images;

the first reconstruction module is used for inputting the multi-view target image into a pre-trained image reconstruction model to obtain a multi-view reconstruction image of the object to be reconstructed, wherein the pre-trained image reconstruction model is obtained by training a plurality of images lower than a first preset resolution threshold value and images correspondingly higher than a second preset resolution threshold value.

7. The apparatus of claim 6, further comprising:

a second collecting module, configured to collect, before the multi-view target image is input into a pre-trained image reconstruction model, a plurality of first multi-view training images lower than the first preset resolution threshold for training a reconstruction object and a plurality of second multi-view training images higher than the second preset resolution threshold corresponding to the first multi-view training images, where the second preset resolution threshold is greater than the first preset resolution threshold;

the training module is used for training a neural network model by taking the first multi-view training image as input and the second multi-view training image as output;

and the generating module is used for generating the pre-trained image reconstruction model when the neural network model meets the training termination condition.

8. The apparatus of claim 7, wherein the training termination condition comprises:

9. The apparatus of claim 7, further comprising:

the acquisition module is used for acquiring a target resolution difference value of an input image and an output image in the image reconstruction model before the pre-trained image reconstruction model is generated;

and the matching module is used for matching the difference value of the first preset resolution threshold value and the second preset resolution threshold value according to the target resolution difference value.

10. The apparatus according to any one of claims 6-9, further comprising, after obtaining the multi-view reconstructed image of the object to be reconstructed:

and the second reconstruction module is used for reconstructing the multi-view reconstructed image by using an image reconstruction algorithm to obtain a two-dimensional or three-dimensional reconstructed image of the object to be reconstructed.

11. An electronic device, comprising: memory, a processor and a computer program stored on the memory and executable on the processor, the processor executing the program to implement the meta-imaging based multi-view image super-resolution reconstruction method according to any one of claims 1 to 5.

12. A computer-readable storage medium, on which a computer program is stored, which program is executable by a processor for implementing a method for super-resolution reconstruction of multi-view images based on meta-imaging as claimed in any one of claims 1 to 5.