CN112734814B - Three-dimensional craniofacial cone beam CT image registration method - Google Patents


Publication number
CN112734814B
CN112734814B (application CN201911029712.5A)
Authority
CN
China
Prior art keywords
image
registration
phi
deformation field
images
Prior art date
Legal status
Active
Application number
CN201911029712.5A
Other languages
Chinese (zh)
Other versions
CN112734814A (en)
Inventor
裴玉茹 (Pei Yuru)
张云庚 (Zhang Yungeng)
郭玉珂 (Guo Yuke)
查红彬 (Zha Hongbin)
许天民 (Xu Tianmin)
马赓宇 (Ma Gengyu)
Current Assignee
Peking University
Original Assignee
Peking University
Priority date
Filing date
Publication date
Application filed by Peking University
Priority to CN201911029712.5A
Publication of CN112734814A
Application granted
Publication of CN112734814B
Status: Active

Classifications

    • G06T 7/30 — Determination of transform parameters for the alignment of images, i.e. image registration
    • G06T 7/33 — Image registration using feature-based methods
    • G06F 18/22 — Pattern recognition; matching criteria, e.g. proximity measures
    • G06N 3/045 — Neural networks; combinations of networks
    • G06N 3/08 — Neural networks; learning methods
    • G06N 3/084 — Backpropagation, e.g. using gradient descent
    • G06T 2207/10081 — Image acquisition modality: computed x-ray tomography [CT]
    • G06T 2207/20081 — Special algorithmic details: training; learning
    • G06T 2207/20084 — Special algorithmic details: artificial neural networks [ANN]

Abstract

The invention discloses a three-dimensional craniofacial cone beam CT image registration method. It establishes a registration full convolutional neural network based on unsupervised learning, effectively realizes image registration of three-dimensional craniofacial CBCT by solving dense, consistent voxel-level correspondences between paired CBCT images, satisfies the requirement of inverse consistency, achieves efficient online registration of three-dimensional craniofacial cone beam CT (CBCT) images, and greatly reduces registration time.

Description

Three-dimensional craniofacial cone beam CT image registration method
Technical Field
The invention relates to the technical fields of oral clinical medicine and computer vision, in particular to a three-dimensional craniofacial cone beam computed tomography (Cone Beam Computed Tomography, CBCT) image registration method.
Background
Solving dense voxel-level correspondences between three-dimensional images is a key technology in medical image analysis, widely applied to tasks such as statistical shape analysis, landmark and segmentation transfer, and pre- and post-operative efficacy evaluation.
In the prior art, traditional non-rigid registration methods for three-dimensional images reduce the dense voxel-level appearance difference through online nonlinear iterative optimization; commonly used measures include mean squared error, normalized cross-correlation, and mutual information. Deformation models in non-rigid registration include elastic-body models, Demons, diffeomorphic maps, and B-spline free-form deformation models. Given the large number of parameters to be solved in three-dimensional image registration, the optimization of traditional registration methods usually carries a large computational cost and easily falls into local minima. Some methods use downsampling and statistical deformation models to reduce the number of parameters, but still rely on iterative parameter optimization. Non-rigid registration methods based on numerical solutions require manual annotation of key points and segmentation maps. A topology-preserving, fold-free, invertibly smooth one-to-one mapping is an important property that anatomical image registration should possess; for this reason, registration methods using diffeomorphic deformation models, including LDDMM, SyN, and diffeomorphic Demons, are used for anatomical structure registration.
In recent years, regression models such as support vector machines, random forests, and deep neural networks have been introduced into the non-rigid registration of three-dimensional images. Learning-based methods avoid online iterative optimization by learning a regression model from the input images to the registration deformation field, reducing registration time by orders of magnitude compared with traditional optimization methods. Supervised deep-learning registration methods require ground-truth key points and deformation fields during training, and the difficulty of manually annotating three-dimensional images limits the acquisition of training data. Unsupervised deep-learning registration methods use spatial transformer networks to reduce voxel-level appearance differences. Existing deep-learning registration methods reduce memory consumption by registering three-dimensional image blocks, or only solve the one-way mapping from an image to a reference image. To solve for an invertible one-to-one mapping, existing deep-learning registration methods based on diffeomorphic mappings require a re-parameterization layer to convert the velocity field predicted by the neural network into a bidirectional deformation field. Some methods use invertibility constraints to infer the bidirectional deformation field directly, avoiding intermediate layers, but rely on manually extracted features. Thus, there is currently no effective solution for directly predicting invertibly consistent deformation fields with a neural network.
Disclosure of Invention
To overcome the above drawbacks, the invention provides a three-dimensional craniofacial cone beam CT (CBCT) image registration method that performs efficient online registration of three-dimensional images and satisfies the requirement of inverse consistency.
To this end, a registration framework based on unsupervised learning is established; a fully convolutional neural network solves dense, consistent voxel-level correspondences between paired CBCT images, greatly reducing registration time compared with traditional iterative-optimization methods. The invention uses a convolutional neural network to regress the bidirectional voxel-level mapping between an input image V and a template image T, guarantees that the forward and reverse deformation fields satisfy inverse consistency by introducing a combined invertibility constraint, and strengthens closed-loop consistency of the deformation fields during training by constraining arbitrary pairs and triples of images in the dataset. During registration-network training, the system allows the user to specify a structure of interest (SOI) to enhance the registration of anatomical structures that are harder to register, such as the skull base, which has lower bone density and a blurred boundary with the surrounding soft tissue. For online registration of any given image pair, the trained convolutional neural network returns the forward and reverse deformation fields between each image and the template image T, and a pair of mutually inverse-consistent forward and reverse deformation fields is obtained by deformation-field composition.
Abbreviations used in the present invention: CT (Computed Tomography); CBCT (Cone Beam Computed Tomography).
The technical scheme provided by the invention is as follows:
A three-dimensional craniofacial cone beam CT (CBCT) image registration method establishes a registration fully convolutional neural network based on unsupervised learning; by solving consistent voxel-level correspondences between paired CBCT images, registration time is greatly reduced and image registration of three-dimensional craniofacial CBCT is effectively realized. The method comprises the following steps:
1) Defining a template image: the template image T is obtained by deforming a reference image with the average deformation field;
A set of three-dimensional images V = {V_i | i = 1, ..., N} is registered to a randomly selected reference image V_r, yielding deformation fields φ_i, where N is the number of images in the dataset. The template image T is obtained by deforming V_r with the average deformation field, expressed as Equation 1:

T = V_r ∘ φ̄,  where φ̄ = (1/N) Σ_{i=1}^{N} φ_i      (Equation 1)

In Equation 1, V_r ∘ φ̄ denotes warping the image V_r with the average deformation field φ̄.
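As an illustrative sketch of this template construction, the following 1-D stand-in (hypothetical names; the patent operates on 3-D CBCT volumes) averages per-image displacement fields and warps the reference signal by the mean field:

```python
import numpy as np

def warp_1d(image, disp):
    """Warp a 1-D image by a displacement field with linear interpolation:
    warped(x) = image(x + disp(x))."""
    x = np.arange(image.size, dtype=float) + disp
    x = np.clip(x, 0, image.size - 1)
    lo = np.floor(x).astype(int)
    hi = np.minimum(lo + 1, image.size - 1)
    w = x - lo
    return (1 - w) * image[lo] + w * image[hi]

# N toy "registrations" of a reference signal V_r: each phi_i is a
# displacement field, and the template T is V_r warped by their average.
rng = np.random.default_rng(0)
V_r = np.sin(np.linspace(0.0, np.pi, 64))
phis = [rng.normal(0.0, 0.5, 64) for _ in range(5)]  # phi_i, i = 1..N
phi_mean = np.mean(phis, axis=0)                     # average deformation field
T = warp_1d(V_r, phi_mean)                           # template image T
```

In 3-D the same idea applies per axis, with trilinear interpolation in place of the linear interpolation used here.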
2) The construction and pre-training of the registration network comprises the following steps:
21 Solving a smooth one-to-one mapping of voxel levels between a three-dimensional image V and a template image T by using a bidirectional registration network based on a fixed template image, wherein V and T are three-dimensional single-channel gray images;
22 Using convolutional neural networks to regress the input image and the output bi-directional deformation field:
h_{Θ,T}: V → [φ_f, φ_b]
wherein φ_f is the deformation field warping the input image V to the template image T, φ_b is the deformation field warping the template image T to the input image V, and Θ denotes the learnable parameters of the registration network h (e.g. the convolution-kernel weights of the convolutional neural network). The warped image pairs (V ∘ φ_f, T) and (T ∘ φ_b, V) are expected to have similar anatomical appearance (the degree of similarity is measured by Equation 4).
23) A convolutional neural network with a symmetric encoder-decoder structure and long skip connections estimates the six-channel forward and reverse deformation fields [φ_f, φ_b] from the input single-channel three-dimensional image V. The feature maps of each decoding layer are formed by concatenating the output of the previous decoding layer with the same-size feature maps of the encoding stage, which facilitates feature propagation and training convergence.

The encoder comprises six convolutional layers with 3×3×3 kernels and stride 1; each convolutional layer is followed by instance normalization, a Leaky ReLU activation, and a 2×2×2 max-pooling layer. The decoder comprises six convolutional layers with 3×3×3 kernels and stride 1; each convolutional layer is preceded by a ×2 upsampling layer and followed by instance normalization and a ReLU activation.
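The codec geometry can be sanity-checked with simple shape bookkeeping. The sketch below assumes an illustrative 128-voxel-per-axis input (the patent does not state the input resolution): six 2×2×2 poolings halve each axis six times, and six ×2 upsamplings restore it, so every decoder level finds a same-size encoder feature map to concatenate.

```python
# Encoder: six conv -> instance norm -> Leaky ReLU -> 2x2x2 max-pool levels,
# each halving the spatial size. Decoder: six x2-upsample -> conv levels,
# each doubling it and concatenating the same-size encoder feature map.
size, enc_sizes = 128, []
for _ in range(6):
    enc_sizes.append(size)   # feature-map size entering this encoder level
    size //= 2               # 2x2x2 max pooling halves each axis
for skip in reversed(enc_sizes):
    size *= 2                # x2 upsampling doubles each axis
    assert size == skip      # long skip connection: sizes match for concat
# the last layer maps to 6 channels: forward + reverse 3-D deformation fields
```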
24) The convolutional neural network is pre-trained with synthetic data: deformation fields are generated from randomly sampled B-spline deformation parameters, ensuring the generated fields are fold-free; the template image T is warped by each generated deformation field to produce a synthetic image V_syn, and the corresponding reverse deformation field is computed. The neural network is pre-trained using the synthetic images and their corresponding forward and reverse deformation fields.
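A minimal 1-D sketch of this synthetic-data idea, substituting linearly interpolated random control points for the B-spline parameters (an assumption made for brevity), with a fold check on the resulting field:

```python
import numpy as np

def random_smooth_disp(n, coarse=8, scale=0.3, rng=None):
    """Random coarse control points upsampled to a smooth 1-D displacement
    field -- a stand-in for randomly sampled B-spline deformation parameters."""
    rng = np.random.default_rng() if rng is None else rng
    ctrl = rng.normal(0.0, scale, coarse)
    return np.interp(np.linspace(0, coarse - 1, n), np.arange(coarse), ctrl)

def is_fold_free(disp):
    """A 1-D deformation x -> x + disp(x) is fold-free iff its derivative
    stays positive everywhere."""
    return bool(np.all(np.gradient(np.arange(disp.size) + disp) > 0))

disp = random_smooth_disp(64, rng=np.random.default_rng(1))
```

Small-amplitude smooth fields remain fold-free; pre-training then pairs each synthetic warped image with its known forward and reverse fields.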
3) Training the registration network constructed in the step 2) in an unsupervised mode to obtain a trained registration network; the method comprises the following steps:
For a training dataset V = {V_i | i = 1, ..., N} with distribution D_V, where N is the number of training images, the optimal network parameters Θ are found by minimizing the loss function:

Θ* = argmin_Θ E_{V∼D_V} [ L(V, T; Θ) ]      (Equation 2)

where h is the registration network, Θ its learnable parameters, V a training image, T the template image, and D_V the distribution of the training dataset.
The loss function for registration-network training is:

L(V, T; Θ) = α_sim L_sim + α_reg L_reg + α_inv L_inv + α_str L_str      (Equation 3)

wherein L_sim is the similarity loss, L_reg the deformation-field regularization term, L_inv the combined invertibility constraint on the deformation fields, and L_str the structure-of-interest (SOI) registration loss; Θ denotes the optimizable parameters of the convolutional neural network, and α_sim, α_reg, α_inv, α_str are the weights of the respective loss terms.
Each loss function is described in turn as follows:
31) The registration network predicts the forward and reverse deformation fields [φ_f, φ_b]. The similarity loss L_sim measures the voxel-level difference between the input image warped by the forward field, V_i ∘ φ_{f,i}, and the template image T, as well as between the template image warped by the reverse field, T ∘ φ_{b,i}, and the input image V_i; m is the number of images in one batch. The similarity loss L_sim is expressed as Equation 4:

L_sim = (1/m) Σ_{i=1}^{m} ( ‖V_i ∘ φ_{f,i} − T‖² + ‖T ∘ φ_{b,i} − V_i‖² )      (Equation 4)
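A numpy sketch of this similarity term, assuming a mean-squared-error measure (one of the measures named in the background) and taking the already-warped images as inputs:

```python
import numpy as np

def sim_loss(warped_V, T, warped_T, V):
    """Bidirectional voxel-level MSE over a batch of m pairs: V_i warped by
    phi_f,i is compared with T, and T warped by phi_b,i with V_i."""
    m = len(warped_V)
    return sum(np.mean((wv - T) ** 2) + np.mean((wt - v) ** 2)
               for wv, wt, v in zip(warped_V, warped_T, V)) / m

T = np.zeros((4, 4, 4))
V = [np.zeros((4, 4, 4)), np.ones((4, 4, 4))]
# perfect registration: every warped image matches its target exactly
perfect = sim_loss([T, T], T, V, V)
```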
32) The deformation-field regularization term L_reg ensures the smoothness of the forward and reverse deformation fields by penalizing their spatial gradients, expressed as Equation 5:

L_reg = (1/m) Σ_{i=1}^{m} ( ‖∇φ_{f,i}‖_F² + ‖∇φ_{b,i}‖_F² )      (Equation 5)

where φ_{f,i}, φ_{b,i} are the forward and reverse deformation fields, ‖·‖_F is the Frobenius norm, and m is the number of images in a batch.
33) Combined invertibility constraint L_inv on the deformation fields:

The invention uses the combined invertibility constraint to make the forward and reverse deformation fields satisfy forward-reverse consistency and closed-loop consistency; L_inv is expressed as Equation 6:

L_inv = (1/m) Σ_{i=1}^{m} ( ‖φ_{b,i} ∘ φ_{f,i} − Id‖_F² + ‖φ_{f,i} ∘ φ_{b,i} − Id‖_F² ) + Σ_c ‖ φ_{c_k} ∘ ⋯ ∘ φ_{c_1} − Id ‖_F²      (Equation 6)

The first term of L_inv requires the bidirectional deformation fields [φ_f, φ_b] to be mutually inverse; m is the number of images in a training batch and ‖·‖_F is the Frobenius norm. The second term requires that the composition of the k deformation fields along any closed loop c through k images {V_i | i = 0, ..., k−1} is the identity transformation. For a training set of N three-dimensional images, the number of closed loops of length ≥ 2 is combinatorially large; to simplify training, only closed loops formed by two or three volumes are considered, with K set to 3. The second term thus requires that the forward and reverse deformation fields of any two images are mutually inverse and that the deformation fields of any three images satisfy closed-loop consistency.

In Equation 6, φ_{c_k} ∘ ⋯ ∘ φ_{c_1} denotes the composition of the k deformation fields along the closed loop c. For the composition of deformation fields, the standard coordinate field Q is warped by the successive deformation fields into Q′; Q′ − Q is the combined result of the deformation fields.
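Deformation-field composition and the mutual-inverse condition can be illustrated in 1-D with displacement fields: composing a field with its inverse should return the identity, i.e. a zero residual displacement. The names below are illustrative stand-ins, not the patent's 3-D implementation:

```python
import numpy as np

def interp_disp(disp, x):
    """Sample a 1-D displacement field at fractional coordinates x."""
    return np.interp(np.clip(x, 0, disp.size - 1), np.arange(disp.size), disp)

def compose(d1, d2):
    """Displacement of applying d1 then d2:
    (phi2 o phi1)(x) = x + d1(x) + d2(x + d1(x)), returned as a displacement."""
    x = np.arange(d1.size, dtype=float)
    return d1 + interp_disp(d2, x + d1)

# A translation and its exact inverse: the closed loop of length 2 should
# come back to the identity, i.e. a residual displacement of zero.
d_f = np.full(32, 1.5)        # "forward" field
d_b = np.full(32, -1.5)       # "reverse" field
residual = compose(d_f, d_b)
```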
34) Structure-of-interest (SOI) registration loss L_str:

In the registration of three-dimensional craniofacial images, some anatomical structures, such as the skull base, have relatively blurred boundaries with the surrounding soft tissue owing to lower bone density and device-dependent noise, and are therefore difficult to register. The invention uses a structure-of-interest (SOI) registration loss L_str during network training to improve the registration of such fine structures, expressed as Equation 7:

L_str = (1/m) Σ_{i=1}^{m} ( ‖M ⊙ (V_i ∘ φ_{f,i} − T)‖² + ‖M ⊙ (T ∘ φ_{b,i} − V_i)‖² )      (Equation 7)

where M is the mask of the structure of interest (SOI), in which a voxel value of 1 marks the structure of interest and the remaining voxels are 0; m is the number of images in a batch; V_i is an input image of the network; T is the template image; φ_{f,i} is the forward deformation field output by the network and φ_{b,i} the reverse deformation field. The operator ⊙ denotes element-wise multiplication.
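A numpy sketch of this masked loss: only voxels inside the SOI mask contribute, so errors outside the structure of interest are ignored. The mask and images are toy data:

```python
import numpy as np

def soi_loss(warped_V, T, warped_T, V, M):
    """Masked bidirectional loss: M is 1 inside the structure of interest
    and 0 elsewhere; the mask is applied by element-wise multiplication."""
    m = len(warped_V)
    return sum(np.mean((M * (wv - T)) ** 2) + np.mean((M * (wt - v)) ** 2)
               for wv, wt, v in zip(warped_V, warped_T, V)) / m

T = np.zeros((4, 4, 4))
M = np.zeros((4, 4, 4))
M[1:3, 1:3, 1:3] = 1.0                              # toy SOI mask
bad_outside = T.copy(); bad_outside[0, 0, 0] = 9.0  # error outside the SOI
bad_inside = T.copy();  bad_inside[1, 1, 1] = 9.0   # error inside the SOI
```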
4) On-line registration of three-dimensional images to be registered (paired CBCT images):
In the online registration stage, for any pair of a floating image V_M and a fixed image V_F, denoted (V_M, V_F), the registration network trained in step 3) returns the forward and reverse deformation fields [φ_{f,M}, φ_{b,M}] of V_M and [φ_{f,F}, φ_{b,F}] of V_F with respect to the template, and the final registration deformation fields φ_{M→F} and φ_{F→M} are obtained by deformation-field composition:

φ_{M→F} = φ_{b,F} ∘ φ_{f,M},   φ_{F→M} = φ_{b,M} ∘ φ_{f,F}

φ_{M→F} warps the floating image to the fixed image, and φ_{F→M} warps the fixed image to the floating image; V_M ∘ φ_{M→F} should have an appearance similar to V_F, and V_F ∘ φ_{F→M} similar to V_M. Obtaining the deformation fields completes the registration: after warping the floating image with the deformation field produced by registration, the more similar the result is to the fixed image in appearance, the better the registration.
Through the steps, registration of three-dimensional craniofacial Cone Beam CT (CBCT) images is realized.
The invention has the beneficial effects that:
The three-dimensional craniofacial cone beam CT (CBCT) image registration method provided by the invention solves dense voxel-level correspondences for a pair of three-dimensional images, satisfies the requirement of inverse consistency, and realizes efficient online registration of three-dimensional craniofacial CBCT images.
Drawings
FIG. 1 is a diagram of a registration network training framework corresponding to the method of the present invention;
fig. 2 is a block diagram of an online registration flow of the present invention.
FIG. 3 is a schematic illustration of the registration deformation fields and registration results for a pair of images in an embodiment of the present invention;

wherein (a) is the input image pair (V_M, V_F); (b) and (c) are the forward and reverse deformation fields of V_M and V_F, respectively; (d) is the forward deformation field φ_{M→F} and the reverse deformation field φ_{F→M}; (e) shows the warped images;
FIG. 4 illustrates the overlap of a pair of three-dimensional images before registration, after forward registration, and after reverse registration in the sagittal, coronal, and axial planes in an embodiment of the present invention;

wherein (a) is the sagittal-plane overlap; (b) the coronal-plane overlap; (c) the axial-plane overlap; and (d) slices of the deformation fields φ_f and φ_b.
Detailed Description
The invention is further described below by way of examples with reference to the accompanying drawings, which in no way limit the scope of the invention.
The three-dimensional craniofacial cone beam CT (CBCT) image registration method provided by the invention uses a convolutional neural network to regress the bidirectional deformation fields of an input image, reduces the voxel-level appearance difference between the target image and the warped image with a spatial transformer network, and guarantees the inverse consistency of the forward and reverse deformation fields through the combined invertibility constraint. It avoids the iterative optimization of traditional registration methods during online registration and can greatly reduce registration time.
The following is a further description of embodiments of the invention with reference to the drawings and examples.
Step 1: defining template images
A set of acquired oral craniofacial three-dimensional cone beam CT images V = {V_i | i = 1, ..., N} is registered to a randomly selected reference image V_r, yielding deformation fields φ_i, where N is the number of images in the dataset. The template image T is obtained by deforming V_r with the average deformation field (Equation 1).
Step 2: registration network construction and pre-training
The invention solves the smooth one-to-one mapping of voxel level between a three-dimensional image V and a template image T by using a bidirectional registration network based on a fixed template image, V and T are three-dimensional single-channel gray level images, and a convolution neural network is used for regressing an input image and an output bidirectional deformation field h Θ,T :V→[φ f ,φ b ],φ f : v- & gt T is the deformation field from the input image V to the template image T, phi b : T-V is the deformation field of the template image T deformed to the input image V, and Θ is the learnable parameter of the registration network hNumbers, such as convolution kernel weights of a convolutional neural network. Image pairAnd +.>Is expected to have a similar anatomical appearance. The optimal parameter theta is obtained in training by measuring the registration similarity in the forward direction and the reverse direction.
The invention uses a symmetric encoder-decoder structure with long skip connections to infer the six-channel forward and reverse deformation fields [φ_f, φ_b] from the input single-channel three-dimensional image V. The encoder comprises six convolutional layers with 3×3×3 kernels and stride 1, each followed by instance normalization, a Leaky ReLU activation, and a 2×2×2 max-pooling layer. The decoder comprises six convolutional layers with 3×3×3 kernels and stride 1, each preceded by a ×2 upsampling layer and followed by instance normalization and a ReLU activation. The feature maps of each decoding layer are formed by concatenating the output of the previous decoding layer with the same-size feature maps of the encoding stage, facilitating feature propagation and training convergence.
The method pre-trains the convolutional neural network with synthetic data: deformation fields are generated from randomly sampled B-spline deformation parameters, ensuring the generated fields are fold-free; the template image T is warped by each generated deformation field to produce a synthetic image V_syn, and the corresponding reverse deformation field is computed. The neural network is pre-trained using the synthetic images and their corresponding forward and reverse deformation fields.
Step 3: unsupervised mode training registration network
The present invention uses the combined invertibility constraint to ensure invertibility and closed-loop consistency of the deformation fields. For a training dataset V = {V_i | i = 1, ..., N} with distribution D_V, where N is the number of training images, the optimal network parameters Θ are found by minimizing the loss function (Equation 2), where Θ denotes the learnable parameters of the registration network h, V a training image, T the template image, and D_V the distribution of the training dataset.
The loss function for registration-network training is given by Equation 3:

L(V, T; Θ) = α_sim L_sim + α_reg L_reg + α_inv L_inv + α_str L_str      (Equation 3)

wherein L_sim is the similarity loss, L_reg the deformation-field regularization term, L_inv the combined invertibility constraint on the deformation fields, and L_str the structure-of-interest (SOI) registration loss; Θ denotes the optimizable parameters of the convolutional neural network, and α_sim, α_reg, α_inv, α_str are the weights of the respective loss terms.
Each loss function is described in turn as follows:
1) The registration network predicts the forward and reverse deformation fields [φ_f, φ_b]. The similarity loss L_sim measures the voxel-level difference between the input image warped by the forward field, V_i ∘ φ_{f,i}, and the template image T, as well as between the template image warped by the reverse field, T ∘ φ_{b,i}, and the input image V_i, where m is the number of images in one batch (Equation 4).
2) The regularization term L_reg ensures the smoothness of the forward and reverse deformation fields by using the spatial gradient of the deformation field as a regularizer (Equation 5), where φ_{f,i}, φ_{b,i} are the forward and reverse deformation fields, ‖·‖_F is the Frobenius norm, and m is the number of images in a batch.
3) The invention uses the combined invertibility constraint to make the forward and reverse deformation fields satisfy forward-reverse consistency and closed-loop consistency (Equation 6). The first term of L_inv requires the bidirectional deformation fields [φ_f, φ_b] to be mutually inverse; m is the number of images in a training batch and ‖·‖_F is the Frobenius norm. The second term requires that the composition of the k deformation fields along any closed loop c through k images {V_i | i = 0, ..., k−1} is the identity transformation. For a training set of N three-dimensional images, the number of closed loops of length ≥ 2 is combinatorially large; to simplify training, only closed loops formed by two or three volumes are considered, with K set to 3. The second term thus requires that the forward and reverse deformation fields of any two images are mutually inverse and that the deformation fields of any three images satisfy closed-loop consistency. For the composition of deformation fields, the standard coordinate field Q is warped by the successive deformation fields into Q′; Q′ − Q is the combined result of the deformation fields.
4) In the registration of three-dimensional craniofacial images, some anatomical structures, such as the skull base, have relatively blurred boundaries with the surrounding soft tissue owing to lower bone density and device-dependent noise, and are therefore difficult to register. The invention uses the structure-of-interest (SOI) registration loss L_str during network training to improve the registration of such fine structures (Equation 7), where M is the mask of the structure of interest, in which a voxel value of 1 marks the structure of interest and the remaining voxels are 0; m is the number of images in a batch; V_i is an input image of the network; T is the template image; φ_{f,i} and φ_{b,i} are the forward and reverse deformation fields output by the network; and ⊙ denotes element-wise multiplication.
Step 4: online registration of paired CBCT images
In the online registration stage, given any pair of floating and fixed images (V_M, V_F), the registration network returns the forward and reverse deformation fields [φ_{f,M}, φ_{b,M}] of V_M and [φ_{f,F}, φ_{b,F}] of V_F, and the final deformation fields φ_{M→F} and φ_{F→M} are obtained by deformation-field composition.
To verify the registration accuracy of the registration network on paired CBCT images, the mean squared distance between the network-predicted forward and reverse deformation fields and the ground-truth deformation fields was computed on synthetic data; the average error is smaller than 0.25 mm. The deformation fields also show good invertibility: the average number of voxels with a Jacobian determinant less than or equal to 0 is 7.44, indicating that the deformation fields predicted by the registration method have good invertibility.
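The fold count reported above can be computed from the Jacobian of the deformation x → x + u(x). A numpy sketch for a (3, D, H, W) displacement field, with toy inputs:

```python
import numpy as np

def folded_voxel_count(u):
    """Number of voxels where det(I + du/dx) <= 0, i.e. where the
    deformation x -> x + u(x) folds; u has shape (3, D, H, W)."""
    grads = np.stack([np.stack(np.gradient(u[c], axis=(0, 1, 2)))
                      for c in range(3)])          # grads[c, a] = du_c / dx_a
    J = grads + np.eye(3)[:, :, None, None, None]  # Jacobian at every voxel
    det = np.linalg.det(np.moveaxis(J, (0, 1), (-2, -1)))
    return int(np.sum(det <= 0))

identity = np.zeros((3, 8, 8, 8))        # no displacement: no folds
x = np.arange(8, dtype=float)
reflect = np.zeros((3, 8, 8, 8))
reflect[0] = -2.0 * x[:, None, None]     # x + u_x = -x: folds everywhere
```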
With the method of the present invention, pairs of images (e.g. cone beam CT images) can be registered non-rigidly, rapidly, and efficiently. The system uses a convolutional neural network to regress the bidirectional deformation fields from the input images, avoiding the iterative solution of traditional registration methods and greatly reducing registration time. Through the combined invertibility constraint, the system further requires that the deformation fields of any pair or triple in the training dataset satisfy closed-loop consistency, so that the forward and reverse deformation fields predicted by the registration network are well invertible. The system uses the structure-of-interest (SOI) registration loss to enhance the registration of fine anatomical structures. It effectively addresses the high computational cost and local-minimum problems of traditional non-rigid registration methods and satisfies the requirement of inverse consistency of the deformation fields.
The above embodiments are only for illustrating the present invention and not for limiting the same, and various changes and modifications may be made by those skilled in the art without departing from the spirit and scope of the present invention, and therefore all equivalent technical solutions are also within the scope of the present invention, and the scope of the present invention is defined by the claims.

Claims (10)

1. A three-dimensional craniofacial cone beam CT image registration method, wherein cone beam CT is abbreviated CBCT; the registration method establishes a registration fully convolutional neural network based on unsupervised learning and effectively realizes image registration of three-dimensional craniofacial CBCT by solving dense, consistent voxel-level correspondences between paired CBCT images; the method comprises the following steps:
1) Defining a template image: the template image T is obtained by deforming a reference image with the average deformation field;
a set of three-dimensional image data sets v= { V i I=1,.. N registration to randomly selected reference image V r Obtaining a deformation field phi i The method comprises the steps of carrying out a first treatment on the surface of the Where N is the number of images of the dataset;
the template image T is an image obtained by deforming the average deformation field, and is expressed by formula 1:
in the formula 1 of the present invention,representing the utilization of deformation fields->For image V r Deforming;
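As an illustrative numerical sketch of step 1) (not the patented implementation), assuming deformation fields are stored as dense displacement fields u with warping V(x + u(x)) and that the average deformation field is their voxel-wise mean, the template construction might look like the following; all variable names are hypothetical toy stand-ins:

```python
import numpy as np
from scipy.ndimage import map_coordinates

def warp(image, disp):
    """Warp a 3D image by a displacement field disp of shape (3, D, H, W):
    output(x) = image(x + disp(x)), using linear interpolation."""
    coords = np.indices(image.shape).astype(float) + disp
    return map_coordinates(image, coords, order=1, mode='nearest')

# Toy data standing in for the reference image V_r and the per-image
# deformation fields phi_i (represented here as displacement fields u_i).
rng = np.random.default_rng(0)
V_r = rng.random((8, 8, 8))
u_list = [0.1 * rng.standard_normal((3, 8, 8, 8)) for _ in range(4)]

u_avg = np.mean(u_list, axis=0)   # average deformation field (voxel-wise mean)
T = warp(V_r, u_avg)              # template image of formula 1
```

With a zero displacement field, `warp` reduces to the identity, which is a quick sanity check on the convention used here.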
2) Building the registration network and pre-training it, comprising the following steps:
21) Using a bidirectional registration network based on the fixed template image to solve a smooth one-to-one voxel-level mapping between a three-dimensional image and the template image; the three-dimensional image and the template image are both three-dimensional single-channel gray-scale images;
22) Using a convolutional neural network to regress from the input image to the output bidirectional deformation fields:

h_{Θ,T}: V → [φ_f, φ_b]

wherein V is the input image; φ_f is the deformation field deforming the input image V to the template image T; φ_b is the deformation field deforming the template image T to the input image V; Θ is the set of learnable parameters of the registration network h;
23) Using a symmetric encoder-decoder structure with long residual connections, the six-channel forward and reverse deformation fields [φ_f, φ_b] are inferred from the single-channel input image V; each feature map of the decoding stage is formed from the feature map output by the previous decoding layer together with the same-sized feature map of the encoding stage;
24) Pre-training the convolutional neural network with synthetic images and the corresponding forward and reverse deformation field data: deformation fields are generated from randomly sampled B-spline deformation parameters, and the generated deformation fields are kept free of folding; the generated deformation field is used to deform the template image T to produce a synthetic image V_syn, and the corresponding reverse deformation field is computed;
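A minimal sketch of the pre-training data synthesis of step 24), under stated assumptions: cubic interpolation of a coarse random control grid stands in for the B-spline parameterization, and a small amplitude keeps the field smooth and essentially fold-free. Function and variable names are hypothetical:

```python
import numpy as np
from scipy.ndimage import zoom, map_coordinates

def random_smooth_displacement(shape, grid=4, scale=0.5, rng=None):
    """Random smooth displacement field: coarse random control points upsampled
    with cubic (spline) interpolation -- a stand-in for B-spline sampling."""
    rng = rng or np.random.default_rng()
    ctrl = scale * rng.standard_normal((3, grid, grid, grid))
    factors = [s / grid for s in shape]
    return np.stack([zoom(ctrl[c], factors, order=3) for c in range(3)])

def warp(image, disp):
    """Warp image by displacement field (linear interpolation)."""
    coords = np.indices(image.shape).astype(float) + disp
    return map_coordinates(image, coords, order=1, mode='nearest')

rng = np.random.default_rng(0)
T = rng.random((8, 8, 8))                         # toy template image
u = random_smooth_displacement(T.shape, rng=rng)  # synthetic forward field
V_syn = warp(T, u)                                # synthetic training image
```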
3) Training the registration network of step 2) in an unsupervised manner to obtain a trained registration network; the method comprises the following steps:
for having distributionV= { V i I=1,..a., N }, N is the number of images of the training dataset, minimizing the loss function by finding the optimal network optimizable parameter Θ, expressed as equation 2:
wherein h is a registration network; Θ is a learnable parameter of the registration network h; v is a training image; t is the template image;is the distribution of the training dataset;
the loss function for registration network training is expressed as formula 3:

L(Θ) = α_sim L_sim + α_reg L_reg + α_inv L_inv + α_str L_str   (formula 3)

wherein L_sim is the similarity measure loss; L_reg is the deformation field regularization term; L_inv is the combined invertibility constraint term of the deformation fields; L_str is the structure-of-interest (SOI) registration loss; Θ is the set of optimizable parameters of the convolutional neural network; α_sim, α_reg, α_inv, α_str are the weights of the corresponding loss terms;
4) Performing online registration on the three-dimensional CBCT image to be registered:
for any pair of floating images V M And a fixed image V F Is denoted as (V) M ,V F ),Returning V respectively through the trained registration network in the step 3) M And V F Forward and reverse deformation fields of (a)And obtaining a final registration deformation field phi through deformation field combination M→F And phi F→M Expressed as:
wherein phi is used M→F Representing that the floating image can be deformed to a fixed image, using phi F→M The fixed image may be deformed to a floating image,and V is equal to M Has similar apparent characteristics->And V is equal to F Has similar apparent characteristics;
the obtained deformation fields realize the image registration: the floating image is deformed with the deformation field generated by registration, and the resulting image is the registered image;
through the steps, the registration of the three-dimensional craniofacial CBCT image is realized.
2. The method of three-dimensional craniofacial cone-beam CT image registration according to claim 1, wherein in step 22), the learnable parameters Θ of the registration network h are the convolution kernel weights of the convolutional neural network.
3. The method for three-dimensional craniofacial cone-beam CT image registration of claim 1, wherein in step 22), the image pairs (V ∘ φ_f, T) and (T ∘ φ_b, V) are required to have similar anatomical appearance, and the optimal parameters Θ are obtained in training by measuring the forward and reverse registration similarity.
4. The method of three-dimensional craniofacial cone-beam CT image registration according to claim 1, wherein in step 23), the encoder comprises six convolution layers with 3×3 convolution kernels and stride 1, each followed by instance normalization, a Leaky ReLU activation function, and a 2×2 max pooling layer.
5. The method of three-dimensional craniofacial cone-beam CT image registration according to claim 1, wherein in step 23), the decoder comprises six convolution layers with 3×3 convolution kernels and stride 1; each convolution layer is preceded by a ×2 upsampling layer and followed by an instance normalization and a ReLU activation function layer.
6. The method for registration of three-dimensional craniofacial cone-beam CT images of claim 1, wherein in the loss function of the registration network training in step 3), the similarity measure loss L_sim is expressed as formula 4:

L_sim = (1/m) Σ_{i=1}^{m} ( ‖V_i ∘ φ_{f,i} − T‖²_F + ‖T ∘ φ_{b,i} − V_i‖²_F )   (formula 4)

where m is the number of images; φ_{f,i} is the forward deformation field; φ_{b,i} is the reverse deformation field; V_i ∘ φ_{f,i} denotes the input image deformed with the forward deformation field φ_{f,i}; T ∘ φ_{b,i} denotes the template image deformed with the reverse deformation field φ_{b,i}.
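A minimal NumPy sketch of formula 4 (illustrative only; it assumes the similarity measure is the squared Frobenius distance and displacement-field warping, and all names are hypothetical):

```python
import numpy as np
from scipy.ndimage import map_coordinates

def warp(image, disp):
    """Warp image by a displacement field disp of shape (3, D, H, W)."""
    coords = np.indices(image.shape).astype(float) + disp
    return map_coordinates(image, coords, order=1, mode='nearest')

def sim_loss(images, T, u_f, u_b):
    """Formula 4 sketch: sum of forward (V∘phi_f vs T) and reverse
    (T∘phi_b vs V) squared Frobenius distances, averaged over m images."""
    total = 0.0
    for V, uf, ub in zip(images, u_f, u_b):
        total += np.sum((warp(V, uf) - T) ** 2)  # forward direction
        total += np.sum((warp(T, ub) - V) ** 2)  # reverse direction
    return total / len(images)

rng = np.random.default_rng(0)
T = rng.random((6, 6, 6))
zero = np.zeros((3, 6, 6, 6))
loss = sim_loss([T.copy()], T, [zero], [zero])  # identical image, identity fields
```

An image identical to the template under identity deformation gives zero loss, while any intensity mismatch makes the loss strictly positive.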
7. The method for registration of three-dimensional craniofacial cone-beam CT images of claim 1, wherein in the loss function of the registration network training in step 3), the deformation field regularization term L_reg ensures the smoothness of the forward and reverse deformation fields, using the spatial gradients of the deformation fields as the regularization term, expressed as formula 5:

L_reg = (1/m) Σ_{i=1}^{m} ( ‖∇φ_{f,i}‖²_F + ‖∇φ_{b,i}‖²_F )   (formula 5)

wherein φ_{f,i} and φ_{b,i} are the forward and reverse deformation fields, ‖·‖_F is the Frobenius norm, and m is the number of images.
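Formula 5 can be sketched with finite differences standing in for the spatial gradient (an assumption; the patent does not specify the discretization). The function name is hypothetical:

```python
import numpy as np

def grad_reg_loss(u_f_list, u_b_list):
    """Formula 5 sketch: summed squared spatial gradients (smoothness penalty)
    of the forward and reverse displacement fields, each (3, D, H, W),
    averaged over the m forward/reverse pairs."""
    total = 0.0
    for u in list(u_f_list) + list(u_b_list):
        # finite differences of each displacement component along x, y, z
        for g in np.gradient(u, axis=(1, 2, 3)):
            total += np.sum(g ** 2)
    return total / len(u_f_list)

# A spatially constant displacement is perfectly smooth: zero penalty.
const = np.ones((3, 5, 5, 5))
smooth = grad_reg_loss([const], [const])
```

A constant field incurs no penalty; any spatial variation of the field does.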
8. The method for registration of three-dimensional craniofacial cone-beam CT images of claim 1, wherein in the loss function of the registration network training in step 3), the combined invertibility constraint term L_inv of the deformation fields is expressed as formula 6:

L_inv = (1/m) Σ_{i=1}^{m} ( ‖φ_{f,i} ∘ φ_{b,i} − Id‖²_F + ‖φ_{b,i} ∘ φ_{f,i} − Id‖²_F ) + Σ_c ‖φ_{c,k} ∘ ⋯ ∘ φ_{c,1} − Id‖²_F   (formula 6)

the first term of L_inv requires the bidirectional deformation fields [φ_f, φ_b] to be mutually inverse, where m is the number of images in a training batch and ‖·‖_F is the Frobenius norm;

in the second term of L_inv, φ_{c,k} ∘ ⋯ ∘ φ_{c,1} is the composition of k deformation fields along a closed loop c formed by k images {V_{c,i} | i = 0, …, k−1}; this composition is required to be an identity transformation; for a training set with N three-dimensional images, the number of closed loops of length ≥ 2 grows combinatorially with N.
9. The method of three-dimensional craniofacial cone-beam CT image registration according to claim 8, wherein only closed loops formed by two or three volume data are considered, so that k takes a value of at most 3.
10. The method for registration of three-dimensional craniofacial cone-beam CT images of claim 1, wherein in the loss function of the registration network training in step 3), the structure-of-interest SOI registration loss L_str improves the registration of fine structures, expressed as formula 7:

L_str = (1/m) Σ_{i=1}^{m} ( ‖M ⊙ (V_i ∘ φ_{f,i} − T)‖²_F + ‖M ⊙ (T ∘ φ_{b,i} − V_i)‖²_F )   (formula 7)

wherein M is the mask of the structure of interest SOI, in which a voxel value of 1 denotes the structure of interest and the remaining voxel values are 0; m is the number of images; V_i is the network input image; T is the template image; φ_{f,i} is the forward deformation field output by the network; φ_{b,i} is the reverse deformation field output by the network; the operator ⊙ denotes element-wise multiplication.
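A minimal sketch of the masked loss of formula 7 (illustrative; same displacement-field and squared-Frobenius assumptions as above, hypothetical names):

```python
import numpy as np
from scipy.ndimage import map_coordinates

def warp(image, disp):
    """Warp image by a displacement field disp of shape (3, D, H, W)."""
    coords = np.indices(image.shape).astype(float) + disp
    return map_coordinates(image, coords, order=1, mode='nearest')

def soi_loss(images, T, u_f, u_b, M):
    """Formula 7 sketch: similarity loss restricted to the SOI mask M
    (voxel value 1 inside the structure of interest, 0 elsewhere)."""
    total = 0.0
    for V, uf, ub in zip(images, u_f, u_b):
        total += np.sum((M * (warp(V, uf) - T)) ** 2)  # forward, masked
        total += np.sum((M * (warp(T, ub) - V)) ** 2)  # reverse, masked
    return total / len(images)

rng = np.random.default_rng(0)
T = rng.random((6, 6, 6))
V = T + 1.0                      # differs from T everywhere
zero = np.zeros((3, 6, 6, 6))
M = np.zeros((6, 6, 6))
M[2:4, 2:4, 2:4] = 1.0           # small structure of interest
loss_in = soi_loss([V], T, [zero], [zero], M)
```

Only mismatch inside the mask contributes: with an all-zero mask the loss vanishes even though the images differ everywhere.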
CN201911029712.5A 2019-10-28 2019-10-28 Three-dimensional craniofacial cone beam CT image registration method Active CN112734814B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911029712.5A CN112734814B (en) 2019-10-28 2019-10-28 Three-dimensional craniofacial cone beam CT image registration method


Publications (2)

Publication Number Publication Date
CN112734814A CN112734814A (en) 2021-04-30
CN112734814B true CN112734814B (en) 2023-10-20

Family

ID=75589463

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911029712.5A Active CN112734814B (en) 2019-10-28 2019-10-28 Three-dimensional craniofacial cone beam CT image registration method

Country Status (1)

Country Link
CN (1) CN112734814B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113344876B (en) * 2021-06-08 2023-05-12 安徽大学 Deformable registration method between CT and CBCT
CN116416289B (en) * 2023-06-12 2023-08-25 湖南大学 Multimode image registration method, system and medium based on depth curve learning

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106096616A (en) * 2016-06-08 2016-11-09 四川大学华西医院 A kind of nuclear magnetic resonance image feature extraction based on degree of depth study and sorting technique
EP3246875A2 (en) * 2016-05-18 2017-11-22 Siemens Healthcare GmbH Method and system for image registration using an intelligent artificial agent
CN107507234A (en) * 2017-08-29 2017-12-22 北京大学 Cone beam computed tomography image and x-ray image method for registering
CN107730542A (en) * 2017-08-29 2018-02-23 北京大学 Cone beam computed tomography image corresponds to and method for registering
US10304193B1 (en) * 2018-08-17 2019-05-28 12 Sigma Technologies Image segmentation and object detection using fully convolutional neural network


Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Segmentation accuracy of teeth in cone beam computed tomography; Liu Yi; James MAH; Xu Tianmin; Journal of Peking University (Health Sciences), No. 01; full text *



Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant