CN114387395A

CN114387395A - A fast generation method of hologram based on phase-dual resolution network

Info

Publication number: CN114387395A
Application number: CN202210029238.1A
Authority: CN
Inventors: 王帅; 于挺; 田子建
Original assignee: China University of Mining and Technology Beijing CUMTB
Current assignee: China University of Mining and Technology Beijing CUMTB
Priority date: 2022-01-11
Filing date: 2022-01-11
Publication date: 2022-04-22

Abstract

The invention discloses a method for rapidly generating a hologram based on a phase-double-resolution network. The method comprises the following steps: without making a real pure-phase hologram mask, utilizing the differentiability of an angular spectrum method, and using a natural image to realize convolution Unsupervised training of neural networks; convolutional layers learn feature maps in the same space instead of across distances; use atrous convolution and group convolution to reduce network model computation and GPU memory usage; use MS‑SSIM loss and the MSE loss are combined as a consistency loss function to generate reconstruction maps that are more in line with the human visual system. This application calculates and generates a hologram with a resolution of 1080P in only 57 milliseconds, and the peak signal-to-noise ratio between the optimal numerical reconstruction image and the target intensity image reaches 31.17dB.

Description

A fast generation method of hologram based on phase-dual resolution network

技术领域technical field

本发明涉及计算生成全息领域和计算机视觉领域，特别是涉及一种基于相位-双分辨率网络的全息图快速生成方法。The invention relates to the field of computationally generated holography and computer vision, in particular to a method for rapidly generating a hologram based on a phase-dual resolution network.

背景技术Background technique

全息显示被认为是一种先进的显示技术，逐渐应用于三维场景重建、虚拟现实和增强显示系统中，使无需眼镜的三维显示模式成为可能。在实现实时全息显示的同时，如何保证全息显示影像的高保真度仍然是一个难题。相位型的空间光调制器光学效率较高，重构过程中无共轭图像的干扰，纯相位全息图已成为计算生成全息图的主要编码方法。因此，开发一种能够实时计算生成高质量相位型全息图的算法，对于全息显示技术的发展具有重要意义。Holographic display is considered to be an advanced display technology, and is gradually applied in 3D scene reconstruction, virtual reality and augmented display systems, enabling a 3D display mode without glasses. While realizing the real-time holographic display, how to ensure the high fidelity of the holographic display image is still a difficult problem. The phase-type spatial light modulator has high optical efficiency, and there is no interference of conjugate images in the reconstruction process. Pure-phase holograms have become the main coding method for computationally generated holograms. Therefore, it is of great significance for the development of holographic display technology to develop an algorithm that can calculate and generate high-quality phase holograms in real time.

半个多世纪以来，人们推出了多种计算生成全息图的算法，主要包括迭代优化算法和非迭代优化算法。1972年提出的Gercheberg-Saxton算法是迭代优化计算生成全息图的典型代表。随后也有一些基于Gercheberg-Saxton迭代优化算法的改进算法，如，在Gercheberg-Saxton 迭代优化算法中添加随机噪声优化生成全息图；双向误差扩散算法生成全息图；通过梯度下降或Wirtinger导数求解非凸优化问题间接计算生成全息图等。由于迭代优化算法需要花费较长计算时间生成全息图，不适用于大规模、高分辨率全息图的实时计算，因此，有研究学者提出非迭代优化算法，如，双相位幅值编码和一步相位提取算法等。虽然非迭代优化算法计算速度得到大幅度提升，能够满足实时计算，但全息图的重建图像包含较多的散斑噪声。因此，非迭代优化算法无法保证计算生成全息图的质量。For more than half a century, a variety of algorithms for computationally generating holograms have been introduced, mainly including iterative optimization algorithms and non-iterative optimization algorithms. The Gercheberg-Saxton algorithm proposed in 1972 is a typical representative of iterative optimization to generate holograms. Then there are some improved algorithms based on the Gercheberg-Saxton iterative optimization algorithm, such as adding random noise to the Gercheberg-Saxton iterative optimization algorithm to generate holograms; bidirectional error diffusion algorithm to generate holograms; gradient descent or Wirtinger derivatives to solve non-convex optimization The problem is indirect computationally generating holograms, etc. Since the iterative optimization algorithm takes a long time to generate holograms, it is not suitable for the real-time calculation of large-scale and high-resolution holograms. Therefore, some researchers have proposed non-iterative optimization algorithms, such as two-phase amplitude encoding and one-step phase encoding. Extraction algorithms, etc. Although the calculation speed of the non-iterative optimization algorithm has been greatly improved and can meet the real-time calculation, the reconstructed image of the hologram contains more speckle noise. Therefore, non-iterative optimization algorithms cannot guarantee the quality of computationally generated holograms.

近年来，随着算法研究的不断深入，深度学习技术逐渐被引入光学领域，深度学习相关算法的出现也为计算生成全息图提供了新方法。卷积神经网络作为广义近似函数，可以学习输入和输出之间的映射。2021年，Shi等人引入了大规模菲涅耳全息数据集，训练卷积神经网络生成逼真的三维全息图，并且能够变换焦距。也有研究人员利用菲涅尔方法生成大量的全息图作为数据集，训练对抗生成网络。这类方法所使用的训练集均是通过传统的迭代优化算法计算生成的全息图。因此，网络模型的性能直接受限于训练集的质量，间接受限于传统迭代优化算法。如何打破这些限制成为深度学习应用于计算生成全息图的一个关键挑战。更重要的是，卷积层作用于输入的空间维度，最适合于建模和计算输入与输出之间的空间关系映射。卷积神经网络将在物平面中定义的图像跨域映射到在全息图平面中定义的纯相位全息图掩模，存在着空间对应关系不能得到保留，无法发挥卷积神经网络强大的特征映射能力的问题。In recent years, with the deepening of algorithm research, deep learning technology has been gradually introduced into the field of optics, and the emergence of deep learning-related algorithms has also provided a new method for computationally generating holograms. Convolutional neural networks act as generalized approximation functions that learn the mapping between input and output. In 2021, Shi et al. introduced a large-scale Fresnel holography dataset to train a convolutional neural network to generate realistic 3D holograms with the ability to change the focal length. Some researchers also use the Fresnel method to generate a large number of holograms as a dataset to train a confrontational generative network. The training sets used in these methods are all holograms generated by traditional iterative optimization algorithms. Therefore, the performance of the network model is directly limited by the quality of the training set, and indirectly limited by the traditional iterative optimization algorithm. How to break these limitations becomes a key challenge for the application of deep learning to computationally generated holograms. More importantly, convolutional layers act on the spatial dimension of the input and are best suited for modeling and computing the spatial relationship mapping between input and output. The convolutional neural network maps the image defined in the object plane to the pure phase hologram mask defined in the hologram plane. There is a spatial correspondence that cannot be preserved, and the powerful feature mapping capability of the convolutional neural network cannot be exerted. The problem.

发明内容SUMMARY OF THE INVENTION

为了解决上述背景技术中的问题，本申请提供一种基于相位-双分辨率网络的全息图快速生成方法，主要技术方案如下：In order to solve the above-mentioned problems in the background technology, the present application provides a method for rapidly generating a hologram based on a phase-dual resolution network. The main technical solutions are as follows:

构建相位-双分辨率网络模型f_net1和f_net2；Build phase-dual resolution network models f _net1 and f _net2 ;

经过训练的相位-双分辨率网络f_net1，根据输入的目标强度图I计算出初始相位φ₀；The trained phase-dual-resolution network f _net1 calculates the initial phase φ ₀ according to the input target intensity map I;

根据初始相位φ₀与目标强度图I，计算出复值波场U_z；根据角谱法，计算复值波场U_z在自由空间传播-z后得到的复值波场U₀；Calculate the complex-valued wave field U _z according to the initial phase φ ₀ and the target intensity map I; calculate the complex-valued wave field U ₀ obtained after the complex-valued wave field U _z propagates in free space -z according to the angular spectrum method;

经过训练的相位-双分辨率网络f_net2，根据输入的复值波场U₀计算出纯相位的全息图。The trained phase-dual-resolution network _fnet2 computes a phase-only hologram from the input complex-valued wavefield U ₀ .

优选地，所述构建相位-双分辨率网络模型包括：Preferably, the building a phase-dual resolution network model includes:

相位-双分辨率网络主要分为编码器和解码器两部分；The phase-dual resolution network is mainly divided into two parts: encoder and decoder;

在相位-双分辨率网络的编码器中，前两层网络使用组卷积进行特征提取；编码器中剩余的网络层均使用空洞卷积进行特征提取，其中空洞卷积中的空洞因子，遵循卷积中两个非零值之间的最大距离计算公式：In the encoder of the phase-dual resolution network, the first two layers of the network use group convolution for feature extraction; the remaining network layers in the encoder use atrous convolution for feature extraction, where the atrous factor in the atrous convolution follows the The formula for calculating the maximum distance between two non-zero values in convolution:

M_i＝max[M_i+1-2r_i,M_i+1-2(M_i+1-r_i),r_i],M_n＝r_n；M _i =max[M _i+ 1-2r _i ,M _i+ 1-2(M _i+1 -r _i ),r _i ],M _n =r _n ;

在解码器中，通过sub-pixel卷积对编码器提取的特征进行上采样，直至与输入图像分辨率相同；编码器的第三层网络输出使用跳跃连接至解码器的第一层进行特征拼接；将1×1 卷积作为解码器的最后一层，使得输出为单通道图像；In the decoder, the features extracted by the encoder are upsampled by sub-pixel convolution until it is the same resolution as the input image; the third layer network output of the encoder is connected to the first layer of the decoder for feature stitching using skip connections ;Use 1×1 convolution as the last layer of the decoder, so that the output is a single-channel image;

将单通道图像的灰度值归一化至[-π,π]。Normalize the grayscale values of a single-channel image to [-π,π].

优选地，所述经过训练的相位-双分辨率网络的训练过程包括：Preferably, the training process of the trained phase-dual resolution network includes:

已构建的相位-双分辨率网络f_net2计算生成纯相位的全息图，根据角谱法计算出全息图的重建图

其中重建距离为z；The constructed phase-dual-resolution network f _net2 is calculated to generate a pure phase hologram, and the reconstructed image of the hologram is calculated according to the angular spectrum method

where the reconstruction distance is z;

计算输入的目标强度图I和角谱法计算的重建图

之间的MS-SSIM损失和MSE损失；Calculate the input target intensity map I and the reconstructed map calculated by the angular spectrum method

between MS-SSIM loss and MSE loss;

将MS-SSIM损失和MSE损失按照一定的权重比例进行求和作为整体的一致性损失函数，实现相位-双分辨率网络模型f_net1和f_net2的同时训练。The MS-SSIM loss and the MSE loss are summed according to a certain weight ratio as the overall consistency loss function to realize the simultaneous training of the phase-dual-resolution network models f _net1 and f _net2 .

优选地，所述复值波场U_z的计算公式包括：Preferably, the calculation formula of the complex-valued wave field U _z includes:

优选地，所述MS-SSIM损失和MSE损失按照一定的权重比例进行求和作为整体损失函数包括：Preferably, the MS-SSIM loss and the MSE loss are summed according to a certain weight ratio as an overall loss function including:

其中，I表示目标强度图，

表示重建图，

用于计算I和

之间的多尺度结构相似性，p表示像素，用于逐像素计算误差，依据经验设置α＝0.84。where I represents the target intensity map,

represents the reconstructed graph,

used to calculate I and

The multi-scale structural similarity between , p represents the pixel, and is used to calculate the error pixel by pixel, and α = 0.84 is set empirically.

附图说明Description of drawings

图1示出了本申请的基于相位-双分辨率网络生成纯相位全息图的整体结构图；Fig. 1 shows the overall structure diagram of the pure phase hologram generated based on the phase-dual resolution network of the present application;

图2示出了本申请的相位-双分辨率网络结构图；Fig. 2 shows the phase-dual resolution network structure diagram of the present application;

图3示出了本申请搭建的全息显示结构示意图和仿真实物图FIG. 3 shows a schematic diagram of the holographic display structure and a simulated physical diagram of the holographic display constructed in the present application.

具体实施方式Detailed ways

下面对照附图并结合优选的实施方式对本发明进行清楚、完整地描述。The present invention will be clearly and completely described below with reference to the accompanying drawings and in conjunction with the preferred embodiments.

全息显示技术逐渐应用于三维场景重建、虚拟现实和增强显示系统中，使无需眼镜的三维显示模式成为可能。在全息显示技术中，全息图作为信息的载体，包含了物体的深度和强度信息，因此计算生成全息图是全息显示中关键一步。目前计算生成全息图的方法主要包括迭代优化算法和非迭代优化算法。然而两种算法多年来一直在全息图的优化质量和计算时间之间进行权衡。随着深度学习技术逐渐引入光学研究领域，并取得了诸多令人满意的成果。发明一种能够满足实时地计算生成高质量全息图的神经网络算法，对于实时全息显示技术的发展具有重要意义。Holographic display technology is gradually applied in 3D scene reconstruction, virtual reality and augmented display systems, making 3D display mode without glasses possible. In the holographic display technology, the hologram, as the carrier of information, contains the depth and intensity information of the object, so the computationally generated hologram is a key step in the holographic display. The current methods for generating holograms by computation mainly include iterative optimization algorithms and non-iterative optimization algorithms. However, both algorithms have traded off optimal quality of holograms and computation time for many years. With the gradual introduction of deep learning technology into the field of optics research, many satisfactory results have been achieved. The invention of a neural network algorithm capable of generating high-quality holograms by real-time calculation is of great significance to the development of real-time holographic display technology.

为此，本申请提供了一种基于相位-双分辨率网络的全息图快速生成方法。首先引入光场在自由空间中传播的角谱法，其计算公式如下：To this end, the present application provides a method for fast generation of holograms based on a phase-dual resolution network. First, the angular spectrum method of light field propagation in free space is introduced, and its calculation formula is as follows:

u_z(x₁,y₁)＝f_AS{φ(x,y)}＝IFFT{FFT{u_src(x,y)·e^iφ(x,y)}·H(f_x,f_y)} (1)u _z (x ₁ , y ₁ )=f _AS {φ(x,y)}=IFFT{FFT{u _src (x,y)·e ^iφ(x,y) }·H(f _x ,f _y ) } (1)

其中u_src表示相干光源产生的复数波场，u_z表示衍射距离z处的复值波场，f(x,y)表示加载到空间光调制器中的纯相位全息图，FFT和IFFT分别是傅里叶变换和逆傅里叶变换算子，H(f_x, f_y)是角谱变换函数。H(f_x,f_y)的表达式表示为：where u _src represents the complex wave field generated by the coherent light source, u _z represents the complex valued wave field at the diffraction distance z, f(x, y) represents the pure phase hologram loaded into the spatial light modulator, FFT and IFFT are respectively Fourier transform and inverse Fourier transform operators, H(f _x , f _y ) is the angular spectral transform function. The expression of H(f _x , f _y ) is expressed as:

其中f_x和f_y是空间频率；λ是光的波长，z是衍射距离，H(f_x,f_y)是角光谱法的传递函数。由式(1)可知，u_src(x,y)是一个与内容无关的相干光源，因此角光谱法的可导性仅与H(f_x,f_y) 有关。根据式(2)，角光谱法的传递函数与波长、传播距离、像素间距和空间分辨率有关。当光学系统固定时，四个参数是固定的，传递函数在卷积神经网络的反向传播中可以看作是一个常数。因此，角谱分析法的计算公式满足梯度传播。根据神经网络的通用近似定理，利用卷积神经网络正向传播产生纯相位全息图，并用角谱法计算全息图的数值重建。这个过程可以用以下形式表示：where f _x and f _y are the spatial frequencies; λ is the wavelength of light, z is the diffraction distance, and H(f _x , f _y ) is the transfer function of angular spectroscopy. It can be known from equation (1) that u _src (x, y) is a content-independent coherent light source, so the derivability of angular spectroscopy is only related to H(f _x , f _y ). According to equation (2), the transfer function of angular spectroscopy is related to wavelength, propagation distance, pixel pitch and spatial resolution. When the optical system is fixed, the four parameters are fixed, and the transfer function can be regarded as a constant in the backpropagation of the convolutional neural network. Therefore, the calculation formula of the angular spectrum analysis method satisfies the gradient propagation. According to the general approximation theorem of neural network, the pure phase hologram is generated by forward propagation of convolutional neural network, and the numerical reconstruction of the hologram is calculated by the angular spectrum method. This process can be expressed in the following form:

其中I表示目标振幅，f_net表示卷积神经网络的近似函数，φ_holo表示纯相位全息图，

表示再现图像。通过损失函数和角谱法计算梯度的过程可以表示为：where I represents the target amplitude, _fnet represents the approximate function of the convolutional neural network, φ _holo represents the pure phase hologram,

Indicates the reproduced image. The process of calculating the gradient through the loss function and the angle spectrum method can be expressed as:

其中w_k和b_k表示卷积神经网络的第k层可训练参数，φ_holo表示全息图。where w _k and b _k represent the kth layer trainable parameters of the convolutional neural network, and φ _holo represents the hologram.

图1示出了基于相位-双分辨率网络的全息图快速生成方法的整体结构。输入为任意一张强度图，经过训练的相位-双分辨率网络f_net1将其映射为一张相位图；将该相位图和输入的目标强度图通过式(5)计算为复值波场U_z；根据角谱法使复值波场在自由空间传播-z距离后得到复值波场U₀；由于相位-双分辨率网络f_net2不能够进行复数运算，因此根据欧拉公式将复值波场U₀分解为实部和虚部，并进行矩阵拼接操作；经过矩阵拼接的特征矩阵输入相位-双分辨率网络f_net2计算得到全息图。Figure 1 shows the overall structure of a fast generation method for holograms based on a phase-dual resolution network. The input is any intensity map, and the trained phase-dual-resolution network f _net1 maps it to a phase map; the phase map and the input target intensity map are calculated as a complex-valued wave field U by formula (5). _z ; According to the angular spectrum method, the complex-valued wavefield U ₀ is obtained after the complex-valued wavefield propagates in the free space -z distance; since the phase-dual resolution network f _net2 cannot perform complex number operations, the complex-valued wavefield is calculated according to Euler's formula. The wave field U ₀ is decomposed into real and imaginary parts, and the matrix splicing operation is performed; the eigenmatrix after matrix splicing is input into the phase-dual resolution network f _{net2 to} calculate the hologram.

上述过程为使用经过训练的相位-双分辨率网络f_net1和f_net2的生成全息图的过程。由于高质量的全息掩模较难获取，训练过程是通过无全息图掩模的卷积神经网络训练来完成。在训练阶段将相位-双分辨率网络f_net2计算得到的全息图，通过角谱法计算其重建图

此时式 (3)可进一步表示为：The above process is the process of generating holograms using the trained phase-dual resolution networks f _net1 and f _net2 . Since high-quality holographic masks are difficult to obtain, the training process is done by convolutional neural network training without holographic masks. In the training phase, the hologram calculated by the phase-double-resolution network f _net2 is calculated by the angular spectrum method.

At this time, formula (3) can be further expressed as:

式中，

表示再现图像，φ_holo表示纯相位全息图，f_AS表示角谱法，I表示目标振幅。分别计算输入的目标强度图I和角谱法计算的重建图

之间的MS-SSIM损失和MSE损失，并将其按照一定的比例求和作为整体的一致性损失函数，实现相位-双分辨率网络模型f_net1和f_net2的同时训练。一致性损失函数可表示为：In the formula,

represents the reproduced image, φ _holo represents the pure phase hologram, f _AS represents the angular spectrum method, and I represents the target amplitude. Calculate the input target intensity map I and the reconstructed map calculated by the angular spectrum method respectively

The MS-SSIM loss and the MSE loss between them are summed according to a certain ratio as the overall consistency loss function to realize the simultaneous training of the phase-dual-resolution network models f _net1 and f _net2 . The consistency loss function can be expressed as:

其中I表示目标振幅，

表示重建图，

用于计算I和

之间的多尺度结构相似性，p表示像素，用于逐像素计算误差，依据经验设置α＝0.84。where I is the target amplitude,

represents the reconstructed graph,

used to calculate I and

图2示出了相位-双分辨率网络结构图。其中CBP表示卷积模块，包含卷积层、批归一化层和PReLU激活函数。RB表示残差模块，RBB表示残差瓶颈模块。DAPPM代表深度聚合金字塔池化模块。相位-双分辨率网络主要由编码器和解码器组成，其中编码器是由高分辨率分支和低分辨率分支组成。Figure 2 shows a phase-dual resolution network structure diagram. where CBP represents the convolution module, which includes a convolution layer, a batch normalization layer, and a PReLU activation function. RB stands for residual block, and RBB stands for residual bottleneck block. DAPPM stands for Deep Aggregation Pyramid Pooling Module. The phase-dual resolution network is mainly composed of an encoder and a decoder, where the encoder is composed of a high-resolution branch and a low-resolution branch.

编码器的高分辨率分支可以保留较多的高频特征，而低分辨率分支可以提取目标振幅的高级抽象特征。编码器的前两层网络使用组卷积进行特征提取。编码器中剩余的网络层均使用空洞卷积进行特征提取，其中空洞卷积中的空洞因子，遵循卷积中两个非零值之间的最大距离计算公式：The high-resolution branch of the encoder can retain more high-frequency features, while the low-resolution branch can extract high-level abstract features of the target amplitude. The first two layers of the encoder network use group convolutions for feature extraction. The remaining network layers in the encoder use atrous convolution for feature extraction, where the atrous factor in atrous convolution follows the formula for calculating the maximum distance between two non-zero values in the convolution:

M_i＝max[M_i+1-2r_i,M_i+1-2(M_i+1-r_i),r_i],M_n＝r_n (8)M _i =max[M _i+1 -2r _i ,M _i+1 -2(M _i+1 -r _i ), _r _i ],M _n =rn (8)

其中i表示第i层卷积，M_i表示第i层卷积输出中非零值之间的最大距离。r_i表示卷积第i层的空洞因子。通过使用组卷积可以减少相位-双分辨率网络生成高分辨率全息图时的计算量和GPU内存占用，使用空洞卷积可以扩大卷积操作的感受野增强特征提取能力。where i represents the i-th layer convolution, and M _i represents the maximum distance between non-zero values in the i-th layer convolution output. r _i represents the dilation factor of the i-th layer of the convolution. The use of group convolution can reduce the amount of computation and GPU memory usage when generating high-resolution holograms by the phase-dual resolution network, and the use of atrous convolution can expand the receptive field of convolution operations to enhance feature extraction capabilities.

在解码器中，通过sub-pixel卷积对编码器提取的特征进行上采样，直至与输入图像分辨率相同。编码器的第三层网络输出使用跳跃连接至解码器的第一层进行特征拼接，增强特征解码能力。将1×1卷积作为解码器的最后一层，使得输出为单通道图像。将单通道图像的灰度值归一化至[-π,π]，使得网络的最终输出为相位图。In the decoder, the features extracted by the encoder are upsampled by sub-pixel convolutions until the same resolution as the input image. The encoder's third-layer network output uses skip connections to the decoder's first layer for feature concatenation, enhancing feature decoding capabilities. A 1×1 convolution is used as the last layer of the decoder so that the output is a single-channel image. The grayscale values of the single-channel images are normalized to [-π,π] so that the final output of the network is a phase map.

图3示出了本申请搭建的全息显示结构示意图和仿真实物图。空间光调制器的型号为 Holoeye Photonics AG，像素间距为8μm，激光波长为532nm，传播距离为40cm。FIG. 3 shows a schematic diagram of a holographic display structure and a simulated physical diagram of the holographic display constructed in the present application. The model of the spatial light modulator is Holoeye Photonics AG, the pixel pitch is 8 μm, the laser wavelength is 532 nm, and the propagation distance is 40 cm.

Claims

1. a kind of hologram fast generation method based on phase-double resolution network, it is characterized in that:

(1) Build phase-dual resolution network models f _net1 and f _net2 ;

(2) The trained phase-dual resolution network f _net1 calculates the initial phase φ ₀ according to the input target intensity map I;

(3) Calculate the complex-valued wave field U _z according to the initial phase φ ₀ and the target intensity map I; according to the angular spectrum method, calculate the complex-valued wave field U ₀ obtained after the complex-valued wave field U _z propagates in free space -z ;

(4) The trained phase-dual-resolution network f _net2 calculates a phase-only hologram according to the input complex-valued wave field U ₀ .

2. The method according to claim 1, wherein building a phase-dual resolution network model comprises:

(1) In the encoder of the phase-dual resolution network, the first two layers of the network use group convolution for feature extraction; the remaining network layers in the encoder use atrous convolution for feature extraction, and the holes in atrous convolution are used for feature extraction. factor, following the formula for calculating the maximum distance between two non-zero values in the convolution:

M _i =max[M _i+ 1-2r _i ,M _i+ 1-2(M _i+1 -r _i ),r _i ],M _n =r _n ;

(2) In the decoder, the features extracted by the encoder are up-sampled by sub-pixel convolution until the resolution is the same as the input image; the third layer network output of the encoder is connected to the first layer of the decoder using skips Perform feature stitching; use 1×1 convolution as the last layer of the decoder, so that the output is a single-channel image;

(3) Normalize the gray value of the single-channel image to [-π,π].

3. The method according to claim 1, wherein the training of the trained phase-dual resolution network comprises:

(1) The constructed phase-dual resolution network f _net2 calculates and generates a pure phase hologram, and calculates the reconstructed image of the hologram according to the angular spectrum method

where the reconstruction distance is z;

(2) Calculate the input target intensity map I and the reconstruction map calculated by the angular spectrum method respectively

between MS-SSIM loss and MSE loss;

(3) The MS-SSIM loss and the MSE loss are summed according to a certain weight ratio as the overall consistency loss function to realize the simultaneous training of the phase-dual-resolution network models f _net1 and f _net2 .

4. The method according to claim 1, wherein, for the complex-valued wave field U _z , the calculation formula comprises:

5. The method according to claim 3, wherein the overall consistency loss function, the calculation formula comprises:

where I is the target amplitude,

represents the reconstructed amplitude,

used to calculate I and