CN111161370A

CN111161370A - Human body multi-core DWI joint reconstruction method based on AI

Info

Publication number: CN111161370A
Application number: CN201911400857.1A
Authority: CN
Inventors: 周欣; 段曹辉; 邓鹤; 娄昕; 孙献平; 叶朝辉
Original assignee: Wuhan Institute of Physics and Mathematics of CAS
Current assignee: Institute of Precision Measurement Science and Technology Innovation of CAS
Priority date: 2019-12-30
Filing date: 2019-12-30
Publication date: 2020-05-15
Anticipated expiration: 2039-12-30
Also published as: CN111161370B

Abstract

The invention discloses an AI-based human body multi-core DWI joint reconstruction method. The human body multi-core DWI image training set is established; the human body multi-core DWI joint reconstruction model is established; the loss function of the human body multi-core DWI joint reconstruction model is defined; the gradient descent algorithm is used to train the human body multi-core DWI joint reconstruction model; input new undersampled DWI images to the trained human body multi-core DWI joint reconstruction model, and through the forward propagation of the model, the final reconstructed images containing different b values can be obtained. The present invention can obtain high-quality reconstructed images under high acceleration times, and the reconstruction speed is fast.

Description

Human body multi-core DWI joint reconstruction method based on AI

Technical Field

The invention relates to the technical field of multi-nuclear Magnetic Resonance Imaging (MRI), Artificial Intelligence (AI), deep learning, undersampled reconstruction and the like, in particular to a method for realizing the multi-nuclear Magnetic Resonance Imaging (MRI), the Artificial Intelligence (AI), the deep learning and the undersampled reconstructionA human body multi-core DWI joint reconstruction method based on AI, suitable for accelerating human body multi-core (such as¹²⁹Xe、³He, etc.) the imaging speed of DWI, or more data can be obtained in the same time.

Background

Multinuclear MRI can provide abundant physiological and pathological information, such as hyperpolarized gas (c:¹²⁹Xe、³he) pulmonary MRI can provide high resolution structural and functional images of the lungs. In particular, hyperpolarized gas pulmonary DWI can sensitively assess structural and functional changes associated with pulmonary disease. In combination with the gas diffusion theoretical model, the multi-b-value DWI can non-invasively and quantitatively obtain lung morphological parameters of the alveolar level, such as the alveolar lung airway inner diameter (R), the airway outer diameter (R), the alveolar pulmonary depth (h), the mean linear intercept (L), and the mean linear intercept (L)_m) Surface-to-volume ratio (SVR). However, multiple b-value DWIs require longer acquisition times. For example, acquiring a set of low resolution DWI data (4 slices, 5 b-values, resolution 64 × 64) requires a breath-hold time of approximately 18s, and acquiring a set of 3D whole lung DWI data (10-15mm slice thickness) requires more than 1 min. Although studies have been made to acquire multiple b-value DWI data using a multi-breath approach, multiple breaths can result in differences in lung volume, longer acquisition times, and higher gas costs. Therefore, the DWI imaging speed needs to be accelerated, and a single breath-holding multi-b-value DWI imaging method needs to be developed.

Compressed sensing-based MRI (CS-MRI) speeds up imaging by undersampling k-space without the need for additional hardware and sequences. Chan et al applied CS-MRI to 3D multi-b-value DWI, enabling single breath-hold whole lung morphological parameter measurements [ Chanet al.MagnReson Med,2017,77: 1916-.]. Abascal et al undersampled DWI data in the spatial and diffusion directions and combined with a priori knowledge reconstruction of signal attenuation, obtain an acceleration multiple of 7 to 10 times, and significantly shorten the Imaging time of multi-b-value DWI [ Abascalat al. IEEE Trans Med Imaging,2018,37: 547-.]Westcott et al further applied the method to high resolution hyperpolarization³He multiple b value DWI [ Westcottet almaging,2019,49:1713-1722]. However, there are some limitations to the CS-MRI technique. The nonlinear reconstruction algorithm of CS-MRI relates to iterative computation, and needs longer reconstruction time, for example, in the research of Westcott et al, 2-3 min is needed for reconstructing a layer of DWI image, and the requirement of clinical real-time reconstruction is difficult to meet. In addition, the selection of the hyperparameter of CS-MRI is difficult, and the improper hyperparameter can cause the over-smooth reconstruction result or the residual undersampling artifact.

More recently, AI has been applied in the field of MRI undersampling reconstruction. AI-based MRI reconstruction uses a deep Convolutional Neural Network (CNN) to extract abstract feature representations, learning the nonlinear mapping relationship between undersampled images and fully sampled images through a large amount of training data. Compared with CS-MRI, the AI-based MRI reconstruction has more remarkable advantages in the aspects of reconstruction speed, image quality, acceleration multiple and the like. However, because the hyperpolarized DWI image has the characteristics of low signal-to-noise ratio, insufficient training set and the like, the application of AI to the hyperpolarized DWI reconstruction field has not been studied at present.

Compared with other MRI imaging modalities (T1, T2, etc.), DWI images are multi-channel data composed of different b-value images, and have not only spatial sparsity but also low rank in the direction of diffusion gradient. Wang et al propose a combined denoising CNN model, which improves the denoising effect of DWI images by cascading high-level features of different b-value images [ Wanget al.JMagnReson Imaging,2019,50: 1937-. Xiang et al propose a multi-modal fusion method that reconstructs the undersampled T2 weighted image [ Xianget al. ieee Trans Biomed Eng,2018,66:2105-2114] using the complementary information of the T1 weighted image. Similarly, if the data redundancy in the hyperpolarized multi-b value DWI space and the diffusion direction is fully utilized, the reconstruction quality of the DWI image is further improved.

Based on the analysis, the invention provides a human body multi-core DWI joint reconstruction method based on AI. The method utilizes a CNN model to learn the nonlinear mapping relation between an undersampled image and a fully sampled image, and simultaneously, data redundancy in DWI space and diffusion gradient directions is mined through combined reconstruction, so that the reconstruction effect is improved. Compared with CS-MRI, the method has better image reconstruction effect and faster image reconstruction speed under high acceleration multiple (more than or equal to 4 times).

Disclosure of Invention

The invention aims to provide a human multi-core DWI joint reconstruction method based on AI aiming at the defects and shortcomings of the prior art.

In order to achieve the purpose, the invention adopts the following technical scheme:

a human body multi-core DWI joint reconstruction method based on AI comprises the following steps:

step 1, establishing a human body multi-kernel DWI image training set, wherein the human body multi-kernel DWI image training set comprises an undersampled DWI image y and a full-sampling DWI image x.

Step 1.1, acquiring a fully sampled DWI image x with a diffusion sensitivity factor b value of 0 from a magnetic resonance imager_b。

And 1.2, generating a full sampling DWI image x. Using DWI images x with a diffusion sensitivity factor b value of 0_bDWI signal diffusion model, DWI image x of each b value_b. DWI image x of individual b-values_bAre combined into a fully sampled DWI image x.

The DWI signal diffusion model is:

wherein b is a diffusion sensitive factor, D_L、D_TRespectively longitudinal diffusion coefficient and transverse diffusion coefficient, phi is error function, x₀DWI images with a diffusion sensitivity factor b value of 0.

And 1.3, establishing a human body multi-core DWI image training set. And generating an undersampling matrix U, and retrospectively undersampling the fully sampled DWI image x by using the undersampling matrix U to obtain an undersampled DWI image y. And the undersampled DWI image y and the fully sampled DWI image x form a human multi-kernel DWI image training set.

And 2, establishing a human body multi-core DWI combined reconstruction model. The human body multi-core DWI combined reconstruction model is represented as G (·, theta), the input of the model is represented, theta is a model parameter, and the output of the human body multi-core DWI combined reconstruction model is a final weightImage construction

The human multi-core DWI combined reconstruction model is a CNN model.

The human multi-core DWI combined reconstruction model comprises a residual dense module (RDB) and a Data Consistency (DC) layer, wherein the residual dense module comprises three parts which are respectively a feature extraction layer, a dense module and a reconstruction layer containing residual connection,

the feature extraction layer extracts features from the model input to generate a first feature map and inputs the first feature map to the dense module. And the dense module further extracts the features of the first feature map to obtain a second feature map, and inputs the second feature map to a reconstruction layer containing residual connection. The reconstruction layer containing residual connection synthesizes the second characteristic graph into a residual image, and then the residual image is processed by using the residual connection to obtain a primary reconstruction image x_c. Will preliminarily reconstruct the image x_cObtaining a final reconstructed image by an input data consistency layer

Will preliminarily reconstruct the image x_cInput data consistency layer obtaining reconstructed image

The method comprises the following steps:

the data consistency layer will preliminarily reconstruct the image x_cK-space data k substituted into the following formula to obtain data consistency_DCK-space data k for data consistency_DCPerforming inverse Fourier transform to obtain final reconstructed image

Wherein k is_c＝Fx_c，k₀Fy, F is the fourier transform, j is the k-space coordinate, k_DC(j) K-space data k for data consistency at j_DCThe value of (b), Ω represents the set of k-space coordinates sampled in the undersampled DWI image y.

And 3, defining a loss function L (theta) of the human body multi-core DWI combined reconstruction model G (·, theta).

L(θ)＝E[||x-G(y,θ)||_l2]+ηE[||Ψ(x)-Ψ(G(y,θ))||_l2]

Wherein,

representing the desired operation, y is the undersampled DWI image and G (y, θ) is the final reconstructed image

||·||_l2Denotes the L2 norm, Ψ denotes an estimation function of the Apparent Diffusion Coefficient (ADC), and η is a weighting factor of the apparent diffusion coefficient loss.

The first part of the above equation is the pixel level loss between the fully sampled image and the reconstructed image, and the second part is the apparent diffusion coefficient loss estimated for the fully sampled image and the reconstructed image. The apparent diffusion coefficient extracted from the DWI image has important physiological significance, so that the apparent diffusion coefficient loss is added into the loss function, and the estimation accuracy of the apparent diffusion coefficient of the reconstructed image is improved.

And 4, training a human body multi-core DWI combined reconstruction model. Training a human body multi-core DWI combined reconstruction model by adopting a gradient descent algorithm, and searching a model parameter theta which enables a loss function L (theta) to be minimum, wherein the model parameter theta which enables the loss function L (theta) to be minimum is

And 5, performing combined reconstruction on the multi-core DWI image of the target human body. To give

Inputting a new undersampled DWI image y, and obtaining a final reconstructed image containing different b values through forward propagation of the model

Compared with the prior art, the invention has the following advantages:

under the condition of high acceleration multiple (more than or equal to 4 times), the method can remove undersampling artifacts, recover detailed information of DWI images and improve the imaging speed of human multi-core DWI; the reconstruction speed is high, only the forward propagation of the CNN model is needed, and the reconstruction time reaches ms magnitude; parameters do not need to be adjusted, and the method is more convenient and fast in practical application; the structural similarity of different b-value images is jointly reconstructed and mined, and the reconstruction effect is improved; a data consistency layer is added into the CNN model to ensure the consistency of the final reconstructed image and the undersampled data; and apparent diffusion coefficient loss is added into the loss function, so that the accuracy of the estimation of the apparent diffusion coefficient is improved.

Drawings

FIG. 1 is a flow chart of the present invention;

FIG. 2A is a set of fully sampled hyperpolarizations¹²⁹A Xe pulmonary DWI image, a fully sampled DWI image containing 5 b values;

FIG. 2B is an undersampled DWI image under quadruple undersampling;

FIG. 2C is the final reconstructed image of the conventional CS-MRI method under four times undersampling;

fig. 2D is a final reconstructed image obtained by using the method of embodiment 1 of the present invention under four times undersampling.

Detailed Description

The present invention will be described in further detail with reference to examples for the purpose of facilitating understanding and practice of the invention by those of ordinary skill in the art, and it is to be understood that the present invention has been described in the illustrative embodiments and is not to be construed as limited thereto.

Example 1:

step 1, constructing a human multi-kernel DWI image training set. In the embodiment, the multi-core DWI of the human body is hyperpolarized¹²⁹Xe lung DWI, human multi-nuclear DWI image training set is hyperpolarized¹²⁹Xe lung DWI image training set.

Step 1.1, obtaining fully sampled hyperpolarization from a magnetic resonance imager¹²⁹Xe pulmonary ventilation images. Hyperpolarization of a full sample collected from 105 volunteers¹²⁹Xe pulmonary ventilation images. Fully sampled hyperpolarization¹²⁹Xe pulmonary ventilation images were acquired using a 3D bSSFP sequence with a sampling matrix size of 96X 84, layer thickness of 8mm, and number of layers of 24. Selecting the image with signal-to-noise ratio greater than 6.6 to obtain 1404 total sampled hyperpolarized images¹²⁹Xe pulmonary ventilation images. Hyperpolarization of full samples¹²⁹The Xe pulmonary ventilation images were preprocessed and the image size was transformed to 64 x 64. Hyperpolarization of full samples after image size conversion¹²⁹Xe pulmonary ventilation image as DWI image x with diffusion sensitive factor b value of 0_b。

And 1.2, generating a full sampling DWI image x. Using DWI images x with a diffusion sensitivity factor b value of 0_bAnd a DWI signal diffusion model for generating a DWI image x with a diffusion sensitivity factor b value different from 0_b. In hyperpolarization¹²⁹In Xe lung DWI, the DWI signal diffusion model is a cylinder model (Sukstanski AL et AL. magnetic Resonance in Medicine,2012,67:856-,

wherein x is₀A DWI image with a b-value of 0, b being a diffusion sensitive factor, in this embodiment b-values include 0, 10, 20, 30, 40s/cm²。D_L、D_TRespectively longitudinal diffusion coefficient and transverse diffusion coefficient, phi is an error function. D₀＝0.14cm²S is of¹²⁹The diffusion coefficient of Xe in a gas mixture. Δ is a diffusion time, and in the present embodiment Δ is 5 ms. R and R are random parameters, F_LAnd F_TAre all empirical expressions, F_LAnd F_THas been derived from Sukstanskii (Sukstanskit al. magnetic response in Medicine,2012,67: 856-. Randomly generating an R value within a range of R values corresponding to the real lung, and an R value within a range of R values corresponding to the real lung: the range of R values for the real lung is (360 + -60) μm, and the range of R values for the real lung is (160 + -30) μm. Using the equation (1), b is 10, 20, 30, 40s/cm²DWI image x of_b. Finally, DWI image x of each b value_bA DWI image composed of a set of multiple channels, as a fully sampled DWI image x: x ═ x₀,x₁₀,…,x₄₀]. The size of the fully sampled DWI image x is 64 × 64 × 5.

And 1.3, establishing a human body multi-core DWI image training set. An undersampled matrix U is generated at a sampling rate of 1/4 and an undersampled DWI image y is obtained by retrospectively undersampling the fully sampled DWI image x with the undersampled matrix U, as shown in fig. 1. Similarly, y ═ y₀,y₁₀,…,y₄₀]. And the undersampled DWI image y and the fully sampled DWI image x form a human multi-kernel DWI image training set.

And 2, establishing a human body multi-core DWI combined reconstruction model. The human multi-core DWI joint reconstruction model is represented as G (·, theta), representing model input, and theta is a model parameter. Since the undersampled DWI image y is a complex-valued image, the real part and the imaginary part of the undersampled DWI image y are respectively taken as different channels in the embodiment, and thus the size of the model input of the human multi-kernel DWI joint reconstruction model is 64 × 64 × 10. The human multi-core DWI combined reconstruction model comprises a residual dense module and a data consistency layer. The undersampled DWI image y shares the characteristics in the residual dense module, so that the data redundancy of the DWI image on the space and diffusion gradient method can be fully mined, and the reconstruction effect is improved. The residual dense module comprises three parts, which are respectively specialThe system comprises a sign extraction layer, a dense module and a reconstruction layer containing residual connection. The feature extraction layer extracts features from the undersampled DWI image y using a 3 × 3 convolution to generate a first feature map and inputs the first feature map to the dense module. The dense module further extracts the features of the first feature map to obtain a second feature map, inputs the second feature map into a reconstruction layer containing residual connection, and fully utilizes the hierarchical features of all convolution layers to avoid the problems of information loss and gradient disappearance between convolution layers. The reconstruction layer containing residual error connection synthesizes the second characteristic graph into a residual error image by using convolution of 1 multiplied by 1, and then the residual error image is processed by using the residual error connection to obtain a primary reconstruction image x_c. Will preliminarily reconstruct the image x_cInput data consistency layer obtaining reconstructed image

F^HIs an inverse fourier transform. The specific operations of the data consistency layer comprise: the data consistency layer will preliminarily reconstruct the image x_cSubstitution into equation (3) to obtain k-space data k of data consistency_DCK-space data k for data consistency_DCPerforming inverse Fourier transform to obtain final reconstructed image

In a similar manner to that described above,

the human multi-core DWI combined reconstruction model can be built by using a deep learning tool kit TensorFlow in a computer application software Python 3.6 environment.

Wherein k is_c＝Fx_c，k₀Fy, F is the fourier transform, j is the k-space coordinate, k_DC(j) K-space data for data consistency at jk_DCThe value of (b), Ω represents the set of k-space coordinates sampled in the undersampled DWI image y.

And 3, defining a loss function. The loss function L (theta) of the human multi-kernel DWI joint reconstruction model G (·, theta) includes pixel-level loss and apparent diffusion coefficient loss:

L(θ)＝E[||x-G(y,θ)||_l2]+ηE[||Ψ(x)-Ψ(G(y,θ))||_l2]formula (4)

Wherein

||·||_l2Expressing the norm of L2, Ψ represents the estimation function of the apparent diffusion coefficient, and η ═ 0.001 is the weighting factor of the apparent diffusion coefficient loss.

And 4, training a human body multi-core DWI combined reconstruction model. The Adam algorithm [ Kingma, et al. arXivpreprint,2014, arXiv:1412.6980 was used.]Training a human body multi-core DWI combined reconstruction model, and searching for a model parameter theta which enables a loss function L (theta) to be minimum, wherein the model parameter theta which enables the loss function L (theta) to be minimum is

Namely, the following conditions are satisfied:

the learning rate of the Adam algorithm is 0.0002, the first order momentum is set to 0.9, and the second order momentum is set to 0.999. After training is finished, the model parameter theta of the human multi-core DWI combined reconstruction model is fixed

Can be used to reconstruct new hyperpolarizations¹²⁹Xe pulmonary DWI images.

Inputting a new undersampled DWI image y (shown in FIG. 2B), and performing forward propagation on the model to obtain a final reconstructed image containing different B values

In a similar manner to that described above,

fig. 2A is a full sampling image, which includes 5 b values (b is 0, 10, 20, 30, 40 s/cm)²) Hyperpolarization of¹²⁹Xe pulmonary DWI images. Fig. 2B is an undersampled DWI image y at a sampling rate of 1/4, which has lost most of the structural and detail information and contains severe undersampling artifacts. Although the conventional CS-MRI method can recover part of the structural information, fig. 2C contains a part of the artifact and a significant smoothing effect. As shown in FIG. 2D, the method of the present invention successfully removes the undersampling artifacts and accurately recovers the hyperpolarized DWI image structure and detail information. In addition, the method only needs the forward propagation of the CNN model, and the reconstruction speed is high.

The specific embodiments described herein are merely illustrative of the invention. The AI method in the present invention is not limited to CNN, and may include Recurrent Neural Network (RNN) and the like. The multi-core DWI in the present invention is not limited to the embodiments¹²⁹Xe, may also be³He、¹⁹F, etc., the present invention is also applicable to conventional ones¹Under-sampled reconstruction of the HDWI. The CNN model is not limited to RDB, and can be a residual error network, U-Net and the like. The CNN model training method is not limited to Adam, and also comprises an optimization algorithm commonly used in deep learning such as RMSProp.

The specific embodiments described herein are merely illustrative of the spirit of the invention. Various modifications or additions may be made to the described embodiments or alternatives may be employed by those skilled in the art without departing from the spirit or ambit of the invention as defined in the appended claims.

Claims

1. a human body multi-core DWI joint reconstruction method based on AI, is characterized in that, comprises the following steps:

Step 1. Establish a human body multi-core DWI image training set. The human body multi-core DWI image training set includes an undersampled DWI image y and a fully sampled DWI image x,

Step 2, establish a human body multi-core DWI joint reconstruction model, the human body multi-core DWI joint reconstruction model is expressed as G(·,θ), · represents the model input, θ is the model parameter,

Step 3: Define the loss function L(θ) of the human multi-core DWI joint reconstruction model G(·,θ)

L(θ)=E[||xG(y,θ)|| _l2 ]+ηE[||Ψ(x)-Ψ(G(y,θ))|| _l2 ]

in,

represents the desired operation, y is the undersampled DWI image, ||·|| _l2 represents the L2 norm, Ψ represents the estimation function of the apparent diffusion coefficient, η is the weight coefficient of the apparent diffusion coefficient loss,

Step 4: Use the gradient descent algorithm to train the multi-core DWI joint reconstruction model of the human body, and find the model parameter θ that minimizes the loss function L(θ), and the model parameter θ that minimizes the loss function L(θ) is:

Step 5, give

Input a new undersampled DWI image y, and through the forward propagation of the model, the final reconstructed image containing different b values can be obtained

2. a kind of AI-based human body multi-core DWI joint reconstruction method according to claim 1, is characterized in that, described step 1 comprises the following steps:

Step 1.1. Obtain a fully sampled DWI image x _b with a diffusion sensitivity factor b value of 0 from a magnetic resonance imager,

Step 1.2, generate a fully sampled DWI image x, use the DWI image x _b with the diffusion sensitivity factor b value of 0 and the DWI signal diffusion model to obtain the DWI image x _{b of each b value, and the DWI image x b} _of each b value is combined into a full sample DWI image x,

Step 1.3: Establish a training set of human multi-core DWI images, generate an undersampling matrix U, and perform retrospective undersampling on the fully sampled DWI image x using the undersampling matrix U to obtain an undersampling DWI image y. The undersampled DWI image y and the fully sampled DWI image x constitute the training set of human multi-core DWI images.

3. a kind of AI-based human body multi-core DWI joint reconstruction method according to claim 2, is characterized in that, in described step 1.2, DWI signal diffusion model is:

Among them, b is the diffusion sensitivity factor, _DL and _DT are the longitudinal diffusion coefficient and the transverse diffusion coefficient, respectively, Φ is the error function, and x ₀ is the DWI image with the diffusion sensitivity factor b value of 0.

4. a kind of AI-based human body multi-core DWI joint reconstruction method according to claim 1, is characterized in that, described human body multi-core DWI joint reconstruction model comprises residual error dense module and data consistency layer, and the residual error dense module comprises Three parts, namely feature extraction layer, dense module, reconstruction layer containing residual connection,

The feature extraction layer extracts features from the model input to generate the first feature map and inputs the first feature map to the dense module. The dense module performs further feature extraction on the first feature map to obtain the second feature map, and inputs the second feature map to the dense module. The reconstruction layer containing residual connections, the reconstruction layer containing residual connections synthesizes the second feature map into a residual image, and then uses the residual connection to process the residual image to obtain a preliminary reconstructed image x _c , and the preliminary reconstructed image x _c is Input data consistency layer to obtain final reconstructed image

5. a kind of AI-based human body multi-core DWI joint reconstruction method according to claim 4, is characterized in that, initial reconstruction image x _c is input data consistency layer to obtain final reconstruction image

Include the following steps:

The data consistency layer substitutes the preliminary reconstructed image x _c into the following formula to obtain the data-consistent k-space data k _DC , and performs inverse Fourier transform on the data-consistent k-space data k _DC to obtain the final reconstructed image

Among them, k _c =Fx _c , k ₀ =Fy, F is the Fourier transform, j is the k-space coordinate, k _DC (j) is the value of the k-space data k _DC of the data consistency at j, Ω represents the undersampling DWI The set of k-space coordinates sampled in image y.