CN114494498B

CN114494498B - Metal artifact removing method based on double-domain Fourier neural network

Info

Publication number: CN114494498B
Application number: CN202210108825.XA
Authority: CN
Inventors: 单洪明; 李子龙; 张军平
Original assignee: Fudan University
Current assignee: Fudan University
Priority date: 2022-01-28
Filing date: 2022-01-28
Publication date: 2023-04-18
Anticipated expiration: 2042-01-28
Also published as: CN114494498A

Abstract

The invention belongs to the technical field of medical image analysis, and particularly relates to a metal artifact removing method based on a double-domain Fourier neural network. The invention removes metal artifacts from the two aspects of local and global by fast Fourier convolution in the chord image domain and the CT image domain respectively. By fully utilizing the global receptive field provided by Fourier convolution, the method can repair the chord graph damaged by metal on the chord graph by utilizing remote information, thereby removing metal artifacts on the original data. Meanwhile, the method eliminates the global inconsistency caused by secondary artifacts and artifact removal through a U-Net comprising fast Fourier convolution jump connection in the image domain. A multi-window loss function is used in training the network to optimize model performance under clinically significant window widths. The method has good effect of removing artifacts caused by metals with different sizes and shapes. The method improves the efficiency of removing the metal artifacts in the CT image, thereby reducing the interference of the metal artifacts on clinical diagnosis.

Description

Metal artifact removing method based on double-domain Fourier neural network

Technical Field

The invention belongs to the technical field of medical image analysis, and particularly relates to a CT metal artifact removing method.

Background

Computed Tomography (CT) is one of the primary modalities widely used for clinical diagnosis and screening. When there are high density objects such as metal implants and dental fillings, the raw data of the CT scan, i.e. the chord graph, is corrupted by physical factors such as beam hardening and scattering. The resulting reconstructed images exhibit severe radial artifacts, greatly limiting subsequent diagnosis. How to effectively reduce metal artifacts remains challenging. With the development of deep neural networks, the problem is also more and more focused.

To address this problem, there has been a related effort to remove Metal Artifacts (MAR) in the chord and image domains, respectively. However, in a single domain the method cannot efficiently recover the real organization from the corrupted data. This is because, on the one hand, metal artifacts on the reconstructed image are present globally, and artifacts are difficult to remove in the image domain when the metal becomes large. On the other hand, although only a part of the area in the chord graph is damaged, the restored original data cannot ensure the ideal image quality after reconstruction, because the small error discontinuity in the chord graph easily causes secondary artifacts in the image domain.

The currently popular MAR approach is to combine the advantages of two domains through a backward propagating Radon layer [1,2], significantly improving MAR performance. For example, [1] use two U-nets to enhance the chord graph and reconstruct the image to achieve promising results. However, convolutional layers lack a sufficiently large reception field, resulting in limited information to be captured, which hinders further development of the network. Therefore, at present, many efforts are made to improve the performance of the network by considering the joining and utilizing other information, such as image priority [3], adaptive scale [4], metal mask project [5], and so on.

The performance of the existing MAR method is unsatisfactory due to the following aspects. Firstly, in the chord graph, the existing method does not fully utilize the global context to perform interpolation, but only can continuously utilize surrounding pixels to perform recovery, and the recovery precision of the chord graph is greatly limited. Secondly, for those CT images that are heavily contaminated by artifacts, the locally repaired deep neural network may further introduce secondary artifacts; while some work [4] explored a non-local approach to eliminate global artifacts in the image domain, it may not be useful when the CT image is severely damaged. Third, most MAR methods measure image quality using a full window, ignoring contrast information for some clinically important CT windows. This leads to a situation where the network is overly smooth, making the restored image difficult to apply clinically.

Unlike prior methods, the present invention incorporates a fourier convolution into a neural network. The global receptive field provided by the Fourier convolution [6] is utilized to better remove artifacts in both the chord map and image domains. More specifically, in order to accurately recover a damaged area from information of an undamaged area in a chord graph domain, the invention provides a novel chord graph recovery network based on fast Fourier convolution, which can perform global interpolation by using a receptive field in a chord graph range. Meanwhile, the invention uses a local network to directly recover in the image domain. The above method removes metal artifacts locally and globally, respectively. In order to further remove existing secondary artifacts and inconsistency between different methods and simultaneously utilize the advantages of the methods, the invention further provides a U-Net based on the fast Fourier jump connection to further remove the artifacts in both local and global aspects and improve the image quality. In addition, because of the large numerical range of CT images, to further improve their clinical value, the present invention introduces multi-window losses to constrain the edges, pixel losses, and perceptual losses in the various CT windows.

Disclosure of Invention

The invention aims to provide a metal artifact removing method based on a dual-domain Fourier neural network aiming at the characteristics of metal artifacts on the whole situation and the local situation, so that the metal artifacts can be respectively processed on the local situation and the whole situation, and the image quality is improved.

The metal artifact removing method based on the double-domain Fourier neural network provided by the invention explores the receptive fields of a chord graph range and an image range of an MAR by utilizing fast Fourier convolution; in the chord map domain, the invention provides a chord map Fourier repair network, and a damaged area is repaired by using an undamaged area of a full chord map in a receptive field of the full chord map by means of fast Fourier convolution. Meanwhile, in the image domain, a Fourier U-Net is used for restoring the image from local to global through the context information of the image domain; in addition, the present invention introduces multi-window penalties to help the network perceive accurate feedback in different dynamic ranges to get better results in clinical applications. Experimental results demonstrate that the proposed method is superior to the most advanced metal artifact removal methods.

The invention provides a metal artifact removing method based on a double-domain Fourier neural network, which comprises the following specific steps:

(1) In the chord graph domain, a neural network based on fast Fourier convolution is utilized to realize global information interpolation so as to repair the chord graph damaged by metal.

Setting human body two-dimensional CT sliceThe attenuation coefficient distribution of (a) is: x = μ (i, j), where (i, j) represents a two-dimensional coordinate. The chord graph S in the ideal case can be used by projection

Determine, i.e.>

Analogously, by defining the back-projection process >>

An image X can be reconstructed from the chord graph S, i.e. < >>

Here->

Is/>

The inverse of (c). When the metal implant X _m When present, the chords are damaged along the metal traces. Suppose the size of the chord graph is H _s ×W _s In which H is _s And W _s The number of detectors and the projection angle, respectively. Then the invention uses->

To represent a binary metal trace, where region M =1 corresponds to a damaged region of the metal in the chord graph. Thus, the chord graph S is influenced by the metal _mc And the metal-free chord graph S can be expressed as:

S _mc ⊙(1-M)＝S⊙(1-M) (1)

wherein the symbol |, indicates element-by-element multiplication. The problem of metal artifact removal in the chord graph is to recover the damaged area marked with metal tracks using the information of the area not affected by the metal. For the metal artifact problem of the chord graph, a fast Fourier convolution-based chord graph recovery network (FS-Net) is used, and the FS-Net can utilize a global receptive field on the chord graph to accurately recover the chord graph damaged by metal. The network firstly uses two down-sampling dimensionalities reduction, then uses six chord graph repair modules containing fast Fourier convolution to remove metal artifacts, and finally obtains a recovered chord graph through two up-sampling layers. In each of the chord graph repair modules, the input tensor is first split into two branches, called local branch and global branch, respectively. For local and global branches, three convolution modules are first used to capture the multiscale information. Meanwhile, for global branches, the chord graph is recovered in the fourier domain using a fast fourier convolution block. The output of FS-Net is written as:

S _r ＝FS-Net(S _mc ,M) (2)

wherein S is _r Is the chord graph after repair. Finally, the restored chord graph is reconstructed through a Radon layer capable of back propagation, and the CT image X recovered from the angle of the chord graph _s ：

X _s ＝RL(S _r ⊙M+S _mc ⊙(1-M)) (3)

The invention uses the L1 loss function to measure the difference of the chord map domain, and uses the smooth-L1 loss function to measure the difference of the image domain, so as to avoid overfitting and gradient explosion. Thus, the penalty for optimizing FS-Net is defined as follows:

wherein S is _gt And X _gt Respectively, metal-free, true chord and image.

(2) In the image domain, a local artifact removal network IU-Net is used for removing metal artifacts in the image domain. FR-Net is then replaced by U-Net containing Fourier jump connection to better remove secondary artifacts and full image inconsistency, which is as follows:

firstly, a U-Net is used to primarily remove metal artifacts in an image domain, and the network is written as:

X _u ＝LU-Net(RL(S _mc )) (5)

for this network, L1 loss is used as its loss function

Setting CT image-X with artifact removed from local and global angle by different method _s And X _u . A fourier recovery network (FR-Net) uses information from the two coarsely recovered images to obtain accurate CT images after artifact removal. The proposed FR-Net uses the U-Net architecture, but uses local and global information for reconstruction to further remove artifacts, the module key being the fourier-based jump connection. The input tensor from the encoder is first split into two parts, one for the local branch and one for the global branch. For local branching, the image is processed locally using convolution layers, batch Normalization (BN) and ReLU activation functions. For global branching, the invention applies fast fourier convolution to globally eliminate secondary artifacts and global level inconsistencies in the fourier domain using the same architecture. Where the input data is fed into a common convolution layer and a fast fourier convolution, respectively, to remove artifacts locally and globally. The network can be written as:

X _r ＝FR-Net(X _s ,X _u )+X _u (7)

(3) Training by using a loss function with multiple window widths so as to improve the accuracy of the key window width in clinic;

training is usually performed using a large window exceeding [ -1000,2000] HU, where HU stands for Hounsfield units. Such large CT window widths result in networks that are not focused on some clinically important windows. In view of the clinical application, radiologists often use different CT windows for different clinical tasks. The effect of metal artifact removal at different window widths is therefore of paramount importance.

To address this problem, the present invention proposes a multi-window penalty to force the network to focus on those clinically important windows. Specifically, the invention uses three common windows, and the value ranges of the three windows are respectively as follows: the full range window [ -1000,2000] HU; the lung window [ -1000,200] HU and the soft tissue window [ -200,300] HU.

Order to

Three CT windows are represented, with w =1,2,3. The invention uses three important image quality indexes to ensure that the structure and the details of the structure artifact-removed picture are correct, and the structure artifact-removed picture respectively comprises pixel level loss, edge loss and perception loss.

(3.1) pixel level loss: the invention uses an L1 loss function to measure the pixel level difference under three CT windows:

(3.2) edge loss: edge information is crucial for clinical diagnostic measurement boundary information. The invention therefore uses a Sobel filter to extract gradient information for comparison. This process is performed under the CT window selected by the present invention. The process is defined as

(3.3) loss of perception

To ensure that the output of the network has a texture similar to the metal-free image, the present invention utilizes perceptual loss to extract high-level features for comparison. The present invention uses a pre-trained VGG-16 network to form the feature extractor phi. More specifically, the present invention concatenates images from three windows to form an RGB-like three channel image, since the VGG is pre-trained using natural images. The perceptual loss is defined as:

wherein

Representing the Frobenius norm. The multi-window loss function can be written as：

/>

In the invention, the model comprises three neural networks in total, wherein the neural networks comprise a Fourier chord graph network used for recovering the area polluted by the metal artifact in a chord graph domain through Fourier convolution, and a local network used for removing the local artifact in an image domain. And a network using fourier-hop connected U-Net in conjunction with two coarse recoveries further removes artifacts. Where the perceived loss is calculated using layers 2, 4 and 7 of VGG-16.

Wherein, regarding the chord graph repairing network FS-Net: firstly, two down-sampling layers are used, and then six Fourier residual modules are used for removing artifacts; in each Fourier residual block, firstly, an input vector X epsilon R is input according to channels ^B×C×H×W Split into two local and global tensors X = { X ^l ,X ^g To extract local and global features, respectively. Wherein

Is a local tensor, is based on>

Is the global tensor. B represents mini-batch size, C represents input channel number, H and W represent length and width of input vector, alpha ₁ And alpha ₂ To adjust the hyper-parameter, C = alpha is satisfied ₁ +α ₂ . Likewise, define the output vector as Y = { Y ^l ,Y ^g Denotes extracted local and global features, respectively. Then three convolution modules f are used _ll ，f _gl ，f _lg And a fast Fourier convolution module f _g To remove artifacts on different local scales and globally.

Y ^l ＝f _ll (X ^l )+f _gl (X ^g ) (12)

Y ^g ＝f _lg (X ^l )+f _g (X ^g ) (13)

Finally, two up-sampling layers corresponding to the down-sampling layers are used to make the input and output sizes consistent.

Regarding the local artifact removal network LU-Net in the image domain, a standard U-Net is used, which comprises four down-sampling layers and four up-sampling layers.

Regarding the Fourier repair network FR-Net of the image domain, three down-sampling layers and three up-sampling layers are used, i.e., the same structure as LU-Net. Except that one fourier block is used for the hop connection. Specifically, the method comprises a common convolution module comprising a convolution layer and a Fourier convolution module comprising fast Fourier transform, convolution, batch normalization, RELU activation function and fast inverse Fourier transform respectively.

According to the method, the Fourier convolution is applied to the metal artifact removal model, namely the metal is removed from the chord graph and the image domain to form the artifact through the global receptive field provided by the Fourier convolution, and the defect that the damaged part cannot be repaired on the chord graph by using remote information and the full-graph artifact cannot be modeled due to the fact that the traditional network lacks the receptive field is overcome. The model repairs the damaged part on the chord graph mainly by combining local information and global information on the chord graph. This solves the problem that the conventional network cannot utilize remote information. In the image domain, the invention first uses a neural network to locally remove artifacts. And then, the image with local artifact removal on the image domain and the reconstructed image after global repair on the chord graph are used, and the Fourier U-Net provided by the invention is combined with the advantages of two rough artifact removal graphs to repair in the aspects of local and global. According to the CT metal artifact removal method based on Fourier convolution, the image quality and the accuracy of metal artifact removal are greatly improved, and the problem that a CT image is difficult to diagnose due to metal artifacts in clinic can be solved.

Drawings

FIG. 1 is a model overall framework of the present invention.

Fig. 2 is a difference between the conventional deep learning-based method and the method on the chord graph.

Fig. 3 is an effect diagram of different metal artifact methods, wherein the sequence numbers respectively represent: (a) images of metal damage (input); (b) LI 7; (c) NMAR [8]; (d) CNNMAR [2]; (e) DuDoNet [1]; (f) DANnet [4]; (g) FD-MARs (method of the invention); (h) a view without damage by metal. The display window range is [ -200,300] HU. The yellow shade is the shape and position of the metal.

Detailed Description

The present invention is further described below by using a specific simulation example, and shows the effect of removing metal artifacts on metal artifact data, and the comparison with other methods, including image quality and quantization indexes.

The present invention uses the widely used DeepLeision dataset [9] in the metal artifact task to model metal data. 360,000 cases and 2,000 cases were used for training and testing, respectively. All CT images were resized to 208 x 208. A total of one hundred different metal implant data were used, with 90 metal shapes for training and 10 additional metal shapes for testing. The sizes of the 10 metal implants tested were: 9,13,27,28,30,59,108,220,223,515.

The invention uses the procedure of [5] to generate chord maps and CT images damaged by metals. The present invention uses fan beam CT to simulate forward and back projection. The distance from the X-ray source to the center of rotation was set to 59.5 cm, and a total of 320 projection angles were evenly distributed between 360 degrees. The number of detectors is set to 321. Thus, the size of the chord graph is 321 × 320, and each pixel of the chord graph represents the intensity received by the detector at its corresponding angle.

To speed up training, each subnetwork in all experiments was pre-trained for 20 rounds using training data. The hyper-parameter settings for all experiments were as follows: the learning rate is 0.0005, each 15 epochs are attenuated by 0.5, and the total epoch number is 100; where the mini-batch value is set to 4. The optimizer is Adam with a parameter of (. Beta.) ₁ ,β ₂ ) = (0.5,0.999). All experiments were carried out in a PyTorch framework and were performed using an NVIDIA GTX 2080Ti GPU.

In the experiment, three indexes of peak signal-to-noise ratio (PSNR) root-mean-square error (RMSE) and Structural Similarity (SSIM) are adopted to measure the experimental effect, and the PSNR is defined in the following way:

RMSE is defined as follows:

SSIM is defined as follows:

PSNR and RMSE represent the pixel-level match of the algorithm between the artifact removal result and the normal dose CT, while SSIM represents the structural similarity between the two.

Experimental example 1: comparison of different metal artifact removal methods

TABLE 1 comparison of different metal artifact removal methods

/>

The black bold in table 1 indicates the optimal results, and it can be seen from table 1 that the present invention has significant advantages over other methods, especially in the case of relatively large metal artifacts. The invention further compares the model of the invention with the following most advanced methods: LI [7], NMAR [8], CNNMAR [2], duDoNet [1] and methods of use in the present invention. LI and NMAR are widely used benchmark methods in MAR. CNNMAR is an image domain based deep learning method. DuDoNet is a two-domain method using two U-nets. DANNET is a two-domain network with adaptive scaling for obtaining additional information from damaged areas.

Table 1 shows the quantitative comparison. The values are the average values under three window widths, which are: [ -1000,2000] HU, [ -200,300] HU, [ -1000, -200] HU. Wherein the left side of the table is smaller metal and the right side of the table is larger metal, and the sizes of the smaller metal and the larger metal are increased from left to right. First, the two-domain based approach performs significantly better than the single-domain based approach. Although previous methods have worked well on small metals, the present invention further frees the potential for metal artifact removal networks by fourier convolution. By analyzing the PSNR and RMSE, the method used by the present invention greatly improves the pixel-level accuracy. This shows that the method of the invention is more accurate in restoring details, a feature that is critical for medical diagnosis.

Experimental example 2: contrast of different metal artifact removal methods under soft group window

TABLE 2 comparison of different metal artifact removal methods under soft tissue windows

/>

Soft tissue Windows are a commonly used display window in medicine, defined as [ -200,300] HU. Table 2 shows the performance of the invention under the soft tissue window. It is clear that the process of the invention has very significant advantages over other processes. Wherein the lift is particularly great, especially when the metal volume is large. The method used by the invention can effectively improve the clinical image quality.

Reference documents:

[1]Lin W A,Liao H,Peng C,et al.DuDoNet:Dual Domain Network for CT Metal Artifact Reduction[C]//2019IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).IEEE,2020.

[2]Zhang Y,Yu H.Convolutional Neural Network Based Metal Artifact Reduction in X-ray Computed Tomography[J].IEEE Transactions on Medical Imaging,2018:1-1.

[3]Yu L,Zhang Z,X Li,et al.Deep Sinogram Completion with Image Prior for Metal Artifact Reduction in CT Images[J].IEEE Transactions on Medical Imaging,2020.

[4]Wang T,Xia W,Y Huang,et al.DAN-Net:Dual-Domain Adaptive-Scaling Non-local Network for CT Metal Artifact Reduction[J].2021.

[5]Lyu Y,Lin W A,Lu J,et al.DuDoNet++:Encoding mask projection to reduce CT metal artifacts[J].2020.

[6]Suvorov R,Logacheva E,Mashikhin A,et al.Resolution-robust Large Mask Inpainting with Fourier Convolutions[J].2021.

[7]Kalender W A,Hebel R,Ebersberger J.Reduction of CT artifacts caused by metallic implants.[J].Radiology,1987,164(2):576-577.

[8]Meyer E,Raupach R,Lell M,et al.Normalized metal artifact reduction(NMAR)in computed tomography[J].Medical Physics,2010,37(10).

[9]KeY,Wang X,Le L,et al.DeepLesion:automated mining of large-scale lesion annotations and universal lesion detection with deep learning[J].Journal of Medical Imaging,2018,5(3):1-。

Claims

1. a metal artifact removing method based on a double-domain Fourier neural network is characterized in that in a chord graph domain, a chord graph Fourier repairing network is adopted, a damaged area is repaired by utilizing an undamaged area of a full chord graph in the receptive field of the full chord graph by means of fast Fourier convolution; meanwhile, in an image domain, a Fourier U-Net is used for restoring an image from a local state to a global state through context information of the image domain; in addition, multi-window losses are introduced to help the network perceive accurate feedback in different dynamic ranges to get better results in clinical applications; the method comprises the following specific steps:

(1) In a chord graph field, utilizing a neural network based on fast Fourier convolution to realize global information interpolation so as to repair a chord graph damaged by metal; the specific process is as follows:

setting human body two-dimensional CT slicesThe attenuation coefficient distribution is: x = μ (i, j), where (i, j) represents a two-dimensional coordinate; by projection for setting up a string drawing S

Determine, i.e.>

Analogously by defining a back-projection process>

Reconstructing an image X from the chord graph S, i.e.

Is here->

Is->

The inverse operation of (1); when the metal implant X _m When present, the chords are damaged along the metal traces; suppose the size of the chord graph is H _s ×W _s In which H is _s And W _s The number of detectors and the projection angle, respectively; then the present use->

To represent a binary metal trace, where M =1 corresponds to a damaged area of metal in the chord graph; thus, the chord graph S is influenced by the metal _mc And the metal-free chord graph S is expressed as:

S _mc ⊙(1-M)＝S⊙(1-M) (1)

wherein the symbol [ ] indicates element-by-element multiplication; the problem of removing the metal artifacts in the chord graph is to recover the damaged area marked by the metal track by using the information of the area which is not influenced by the metal; for the metal artifact problem of the chord graph, a chord graph recovery network FS-Net based on fast Fourier convolution is used, and the chord graph damaged by metal is accurately recovered on the chord graph by using a global receptive field; the chord graph recovery network firstly uses two down-sampling dimensionalities reduction, then uses six chord graph repair modules containing fast Fourier convolution to remove metal artifacts, and finally obtains a recovered chord graph through two up-sampling layers; in each chord graph restoration module, the input tensor is firstly divided into two branches, namely a local branch and a global branch; for local and global branches, three convolution modules are used to capture multi-scale information first; meanwhile, for the global branch, a fast Fourier convolution block is used for recovering a chord graph in a Fourier domain; the output of FS-Net is written as:

S _r ＝FS-Net(S _mc ,M) (2)

wherein S is _r Is the chord graph after repair; finally, the restored chord graph is reconstructed through a Radon layer capable of back propagation, and the CT image X recovered from the angle of the chord graph _s ：

X _s ＝RL(S _r ⊙M+S _mc ⊙(1-M)) (3)

Measuring the difference of the chord map domain by using an L1 loss function, and measuring the difference of the image domain by using a smooth-L1 loss function so as to avoid overfitting and gradient explosion; thus, the loss of optimized FS-Net is defined as follows:

wherein S is _gt And X _gt A metal-free, real chord and image, respectively;

(2) In an image domain, firstly, removing metal artifacts in the image domain by using a local artifact removal network IU-Net; and then replacing the traditional U-Net by using U-Net comprising Fourier jump connection FR-Net to better remove secondary artifacts and full-image inconsistency, and comprising the following specific steps:

first, a U-Net is used to remove the metal artifacts primarily in the image domain, and the Net is written as:

X _u ＝LU-Net(RL(S _mc )) (5)

for this network, L1 loss is used as its loss function

Setting CT image-X with artifact removed from local and global angle by different method _s And X _u (ii) a Using information from the two roughly recovered images to obtain a deghosted accurate CT image using a fourier recovery network FR-Net; the FR-Net uses a U-Net framework, but uses local and global information to carry out reconstruction so as to further remove the artifact, and the network module is mainly connected through a jump based on Fourier; the input tensor from the encoder is first divided into two parts, one for the local branch and one for the global branch; for local branches, images are processed locally using convolutional layers, batch Normalization (BN) and ReLU activation functions; for global branches, applying fast fourier convolution to eliminate secondary artifacts and global-level inconsistencies in the fourier domain; wherein the input data is fed into a normal convolutional layer and a fast fourier convolution, respectively, to remove artifacts locally and globally; the network writes as:

X _r ＝FR-Net(X _s ,X _u )+X _u (7)

training using a large window exceeding [ -1000,2000] HU, where HU stands for Hounsfield units; the metal artifact removal effect under different window widths is important, and therefore, a multi-window loss is adopted to force the network to pay attention to the clinically important windows; specifically, three commonly used windows are used, and the value ranges of the three windows are respectively: the full range window [ -1000,2000] HU; the pulmonary window [ -1000,200] hu and the soft tissue window [ -200,300] hu;

order to

Represents three CT windows, w =1,2,3; using three important image quality indicatorsThe structure and the details of the structure artifact removing picture are ensured to be correct, namely pixel level loss, edge loss and perception loss;

(3.1) pixel level loss: the L1 loss function is used to measure the pixel level difference for three CT windows:

(3.2) edge loss: the edge information is important for measuring boundary information in clinical diagnosis, and therefore a Sobel filter is used for extracting gradient information for comparison; the process is performed under a selected CT window, which is defined as:

(3.3) loss of perception

To ensure that the output of the network has a texture similar to the metal-free image, high-level features are extracted with perceptual loss for comparison; i.e., using a pre-trained VGG-16 network to form the feature extractor phi, specifically, connecting the images from three windows to form a three-channel RGB-like image, since VGG is pre-trained using natural images, the perceptual loss is defined as:

wherein

Represents the Frobenius norm; the multi-window loss function is then written as:

2. the method for removing the metal artifact based on the dual-domain Fourier neural network as claimed in claim 1, wherein the specific network structure used is as follows:

with respect to the chord graph repair network FS-Net: firstly, two down-sampling layers are used, and then six Fourier residual modules are used for removing artifacts; in each Fourier residual block, firstly, an input vector X epsilon R is input according to channels ^B×C×H×W Split into two local and global tensors X = { X ^l ,X ^g -to extract local and global features, respectively; wherein

In the form of a local tensor is used,

is a global tensor; b represents mini-batch size, C represents input channel number, H and W represent length and width of input vector, alpha ₁ And alpha ₂ To adjust the hyper-parameter, C = alpha is satisfied ₁ +α ₂ (ii) a Likewise, define the output vector as Y = Y ^l ,Y ^g Represents the extracted local and global features, respectively; then using three convolution modules f _ll ，f _gl ，f _lg And a fast Fourier convolution module f _g To remove artifacts on different local scales and globally: />

Y ^l ＝ f _ll (X ^l )+f _gl (X ^g ) (12)

Y ^g ＝ f _lg (X ^l )+f _g (X ^g ) (13)

Finally, two up-sampling layers corresponding to the down-sampling layers are used to make the input and output sizes consistent;

regarding local artifact removal network LU-Net of an image domain, a standard U-Net is used, and comprises four layers of down-sampling layers and four layers of up-sampling layers;

regarding the Fourier repair network FR-Net of the image domain, three layers of down-sampling layers and three layers of up-sampling layers are used, namely the structure is the same as that of the LU-Net; except that one fourier block is used for the hop connection; specifically, the system comprises a convolution module and a fast Fourier convolution module, wherein the convolution module comprises a convolution layer; the fast fourier convolution module includes fast fourier transform, convolution, batch normalization, RELU activation function, and inverse fast fourier transform.