WO2024001140A1 - Vertebral body sub-region segmentation method and apparatus, and storage medium - Google Patents

Vertebral body sub-region segmentation method and apparatus, and storage medium Download PDF

Info

Publication number
WO2024001140A1
WO2024001140A1 PCT/CN2022/143887 CN2022143887W WO2024001140A1 WO 2024001140 A1 WO2024001140 A1 WO 2024001140A1 CN 2022143887 W CN2022143887 W CN 2022143887W WO 2024001140 A1 WO2024001140 A1 WO 2024001140A1
Authority
WO
WIPO (PCT)
Prior art keywords
vertebral body
mask
region segmentation
sub
vertebral
Prior art date
Application number
PCT/CN2022/143887
Other languages
French (fr)
Chinese (zh)
Inventor
李危石
邹达
齐欢
吕维加
Original Assignee
北京大学第三医院(北京大学第三临床医学院)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京大学第三医院(北京大学第三临床医学院) filed Critical 北京大学第三医院(北京大学第三临床医学院)
Publication of WO2024001140A1 publication Critical patent/WO2024001140A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • G06T7/0012Biomedical image inspection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/90Dynamic range modification of images or parts thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/11Region-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10072Tomographic images
    • G06T2207/10081Computed x-ray tomography [CT]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20036Morphological image processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30004Biomedical image processing
    • G06T2207/30008Bone
    • G06T2207/30012Spine; Backbone

Definitions

  • the present application relates to the field of medical image processing, and in particular to a vertebral body sub-region segmentation method, device and storage medium.
  • the vertebral body subregions of the spine mainly include bony endplates, cortical bone side walls, and cancellous bone, all of which play an important supporting role in the human body.
  • the bone quality of the bony endplate and the bone area adjacent to the endplate is one of the key factors affecting the efficacy of spinal interbody fusion.
  • Analyzing the vertebral bony endplates, cortical bone lateral plates, and cancellous bone areas (hereinafter referred to as vertebral subregions) from aspects such as bone density, mechanics, and morphological structure will help establish quantitative evaluation indicators for bone quality.
  • Assist clinical diagnosis and surgical planning Based on three-dimensional medical images (such as computed tomography, CT), accurate three-dimensional segmentation and three-dimensional reconstruction of vertebral body sub-regions can be performed.
  • Existing technical solutions generally segment vertebral body images based on underlying image processing methods such as morphological algorithms and similarity indicators, and can automatically generate segmentation masks for vertebral body sub-regions.
  • the bony endplate segmented by existing methods mainly has the following problems.
  • the endplate segmentation area cannot match the complex endplate anatomical shape well;
  • the bone density of subregions with different thicknesses and shapes (hereinafter referred to as endplate subregions) within the same endplate range is also different.
  • the implants implanted in the intervertebral spine during spinal surgery often only come into contact with a certain endplate sub-region and do not contact the entire endplate.
  • Existing methods cannot achieve segmentation of customized endplate subregions; thirdly, existing methods cannot separate and exclude non-osseous endplate regions such as osteophytes.
  • this application aims to provide a vertebral body sub-region segmentation method, device and storage medium, and obtain a three-dimensional segmentation mask of the vertebral body sub-region through the pre-trained vertebral body sub-region segmentation neural network model.
  • This application solves problems not covered by vertebral body sub-region segmentation methods in the prior art, including: the inability to achieve customized endplate and sub-endplate bone sub-region segmentation that matches the complex anatomical shape of the vertebral bony endplate; The segmentation performance of the vertebral body lateral plate and vertebral body cancellous bone area is poor, and the generalization ability is not high.
  • This application can be expanded and applied in the field of spinal local bone density calculation.
  • a vertebral body sub-region segmentation method including the following steps:
  • the neural network model is used to output the mask regression results of the upper and lower bony endplates of the vertebral body, cortex Bone lateral plate mask regression results, cancellous bone area mask regression results, and vertebral body mask regression results based on fusion;
  • the vertebral body mask regression results are post-processed through morphological operations and connectivity tests to obtain a three-dimensional segmentation mask of the upper and lower bony endplates, cortical bone lateral plates, and cancellous bone areas of the vertebral body to complete vertebral body sub-region segmentation;
  • the vertebral body subregion includes the upper and lower bony endplates of the vertebral body, cortical bone lateral plates, and cancellous bone areas.
  • the pre-trained neural network model is a bifurcated multi-task convolutional neural network structure, including an encoder, three decoders and a MAX fusion unit: where,
  • the encoder is used to receive preprocessed image data to obtain feature maps
  • the three decoders are respectively connected to the encoder and used to output the upper and lower bony endplate mask regression results, cortical bone lateral plate mask regression results and vertebral cancellous bone mask regression results based on the feature map;
  • the MAX fusion unit is connected to three encoders respectively and used to output the vertebral mask regression results.
  • the pre-training process of the neural network includes:
  • upper and lower bony endplate mask labels, cortical bone lateral plate mask labels, vertebral body cancellous bone mask labels are constructed respectively, and the vertebral body mask label is obtained by fusion;
  • the image data after constructing the label is augmented to obtain training samples to expand the training data set;
  • the binary mask method was used to annotate the voxels in the training sample image, and the upper and lower bony endplate mask labels, cortical bone lateral plate mask labels, and vertebral cancellous bone mask were constructed for the training sample image.
  • Label
  • the vertebral body mask label is constructed by fusing the upper and lower bony endplate mask labels, cortical bone lateral plate mask labels, and vertebral body cancellous bone mask labels.
  • the endplate mask label is constructed, including customizing the thickness and shape of the endplate area to obtain the endplate mask label.
  • the gradient descent method is used to iterate the model parameters, including:
  • the weight of cross entropy and Dice function is used as the loss function of each output result, where the cross entropy function is:
  • the Dice function is:
  • a is an output result in the neural network model
  • b is the labeling result
  • i is the voxel position index
  • is the total prime number.
  • the MAX fusion unit is used to output the vertebral mask regression results, including: based on the position index of each voxel of the preprocessed spinal vertebral image, taking the maximum value of the corresponding index of the three decoders, and calculating the position index of each voxel.
  • the three decoders perform MAX fusion on the maximum value of the corresponding index, and output the vertebral mask regression result.
  • the preprocessing of the spinal vertebra image data includes resampling and pixel value normalization; among which,
  • the resampling process includes: dividing the spatial resolution of the obtained spinal vertebral body image by the preset spatial resolution to obtain the resampling ratio of the image data in three dimensions; based on the resampling ratio, using linear interpolation method to obtain the resampling The resulting image data with fixed spatial resolution.
  • Pixel value normalization processing includes: mapping the original pixel value range [M, N] to the preset value range [P, Q] through a linear function; where M is the minimum pixel value of the CT image, and N is the maximum pixel value of the CT image. , P is the lower bound of the preset value range, and Q is the upper bound of the preset value range.
  • the mask regression results are post-processed through morphological operations and connectivity tests, including: using a three-dimensional morphological opening operation with a convolution kernel of 3*3*3, and using the sliding window method to calculate the neighbors around each voxel. Domains are morphologically eroded and dilated to remove fine-grained noise in the mask to remove areas of abnormal bone structure including vertebral osteophytes.
  • the skimage toolkit uses the skimage toolkit to calculate the connectivity of the upper and lower bony endplate masks, and retain the two largest connected areas in the results, corresponding to the upper and lower bony endplates respectively; the cortical bone lateral plate mask and the vertebral cancellous The bone mask retains only the largest connected area.
  • this application also provides a vertebral body sub-region segmentation device, including: a memory, a processor, and a computer program stored on the memory and executable on the processor, and the processor executes the The computer program implements the aforementioned vertebral body sub-region segmentation method.
  • a computer-readable storage medium is also provided.
  • a computer program is stored on the storage medium, and the computer program can be executed by a processor to implement the aforementioned vertebral body sub-region segmentation method.
  • This application can also be extended to the field of local bone density calculation to calculate the average CT value within the segmented area. Based on the CT value, quantitative calculation and analysis of bone mineral content can be performed through automated positioning analysis of muscle fat.
  • this application divides the vertebral body into upper and lower bones. There are three regions: sexual endplate, cortical bone lateral plate and vertebral cancellous bone. It can achieve personalized endplate thickness segmentation, endplate sub-region segmentation matching the "implant-endplate” contact surface, and remove abnormal bone structures such as osteophytes. Segmenting the vertebral body more accurately further expands the application scope of the vertebral body segmentation method.
  • this application proposes a machine learning technology method based on a pre-trained neural network model, and uses the data augmentation method to simulate the vertebral image features obtained in different scenarios. Increase the adaptability of the prediction model to image contrast, image noise, vertebral body posture, and image layer thickness, thereby obtaining stronger generalization ability. Compared with the existing technology, this method can learn image features independently without using a specific feature extraction method.
  • this application adopts a data-driven modeling method, which can improve the performance of the prediction model by adding training data, and can add expert opinions to the model training process through data annotation. , and continue to optimize.
  • Figure 1 Schematic flow chart of the vertebral body sub-region segmentation method according to the embodiment of the present application
  • Figure 2 Schematic structural diagram of the vertebral body sub-region segmentation neural network model according to the embodiment of the present application
  • Figure 3 Structural diagram of a neural network model for training vertebral body sub-region segmentation according to the embodiment of the present application
  • Figure 4 Comparison of the original spine image and the corresponding data augmentation result according to the embodiment of the present application.
  • Figure 5 Structural diagram of the vertebral body sub-region segmentation method according to the embodiment of the present application
  • Figure 6 A schematic diagram of the vertebral body osteophyte removal method according to the embodiment of the present application, which removes the outer ring through an erosion algorithm to remove peripheral osteophytes;
  • This application obtains the vertebral body sub-region segmentation mask by preprocessing the spinal vertebral body image data, constructing mask labels, neural network model prediction and post-processing operations; the vertebral body sub-regions in this application include: vertebral body bony end plates, lateral cortical plates, and cancellous bone areas of the vertebral body.
  • the spinal vertebral body image data is CT image data.
  • One embodiment of the present application provides a vertebral body sub-region segmentation method, as shown in Figure 1, including the following steps:
  • Step 1 Preprocess the obtained spinal vertebral body image data
  • preprocessing includes resampling and pixel value normalization. Divide the spatial resolution of the spine vertebral body image by the preset spatial resolution to obtain the resampling ratio of the image data in three dimensions; according to the resampling ratio, use a linear interpolation method to obtain the resampled image with a fixed Image data of spatial resolution; and map the original pixel value range to a preset value range through a linear function.
  • preprocessing it can eliminate the negative impact of different spatial resolutions and extreme pixel values of the original image data on subsequent steps.
  • resampling requires obtaining the spatial resolution of the input spine vertebral body CT image, that is, the physical spatial size corresponding to each voxel.
  • Image data based on the DICOM protocol will save this information as part of the metadata.
  • the spatial resolution of the CT image can be obtained through the image data based on the DICOM protocol.
  • the target spatial resolution of resampling is a certain fixed spatial size, such as 150*90*90 mm.
  • first preset a spatial resolution, such as 1*1*1 mm divide the spatial resolution of the input CT image by the preset spatial resolution, and obtain the weight of the three dimensions of the original image data.
  • Sampling ratio as a specific embodiment, a linear interpolation method (trilinear interpolation) can be used to obtain CT image data with a fixed spatial size after resampling.
  • the pixel values of the image need to be normalized, and the original pixel value range [M, N] is mapped to a preset value range [P, Q] through a linear function.
  • M is the minimum pixel value of the CT image
  • N is the maximum pixel value of the CT image
  • P is the lower bound of the preset value range
  • Q is the upper bound of the preset value range
  • use (M,P ) and (N, Q) fit the linear function.
  • the image data is represented with a unified spatial resolution, thereby eliminating the structural differences caused by different spatial resolutions (such as data layer thickness and reconstruction methods), making the sub-region segmentation model Focus the learning direction of feature representation on the semantics of the image itself.
  • the purpose of pixel value normalization is to further eliminate the negative impact of extreme pixel values on subsequent steps; for example, some metal implants have abnormally high pixel values in CT images and need to be suppressed by pixel value normalization.
  • the linear interpolation method (trilinear interpolation) is used for normalization processing, which can achieve faster processing speed while retaining image characteristics.
  • Step 2 Input the pre-processed image data into the pre-trained neural network model to obtain the sub-region segmentation mask regression results corresponding to the spinal vertebrae; wherein, the neural network model is used to output the upper and lower bony endplate masks of the vertebral body.
  • the four predicted outputs are subjected to regression iterative calculations to obtain the final vertebral upper and lower bony endplate mask regression results, cortical bone lateral plate mask regression results, cancellous bone region mask regression results, and vertebral fusion-based vertebral mask regression results.
  • Volume mask regression results are subjected to regression iterative calculations to obtain the final vertebral upper and lower bony endplate mask regression results, cortical bone lateral plate mask regression results, cancellous bone region mask regression results, and vertebral fusion
  • this application adopts a bifurcated multi-task convolutional neural network structure, as shown in Figure 2.
  • the multi-task learning strategy helps improve the generalization ability of convolutional neural networks and suppress over-fitting.
  • the bifurcated structure consists of an encoder, three decoders and a MAX fusion unit; the encoder is used to receive the preprocessed image data to obtain feature maps; the outputs of the three decoders and a MAX fusion unit It is the four outputs of the model; the three decoders are connected to the encoder respectively, and are used to output the upper and lower bony endplate mask regression results, cortical bone lateral plate mask regression results and vertebral cancellous bone mask based on the feature map. Regression results; the MAX fusion unit is connected to three encoders respectively to output the vertebral mask regression results.
  • this application draws on the UNet structure and uses skip connections to transfer the feature map of the encoder to three decoders, thereby ensuring that the local feature map can be effectively transferred to the decoder and making up for the information caused by downsampling. lost.
  • this application uses the MAX fusion unit to perform voxel-level fusion of the outputs of the three decoders; that is, for each voxel position index, the maximum value of the corresponding index of the three decoders is obtained, which is equivalent to completing Cone subregion classification voting based on channel maxima.
  • the three-channel MAX fusion output can restore the complete vertebral mask and filter out invalid feature areas.
  • three types of training labels need to be constructed for the image data used for training: upper and lower bony endplate mask labels, cortical bone lateral plate mask labels, and vertebral body cancellous Bone mask label.
  • the vertebral mask label can be obtained by fusing the aforementioned three labels in space.
  • the endplate segmentation area is basically consistent with the anatomical shape of the endplate of different spinal segments, and image segmentation can be performed based on the neural network model; and the endplate segmentation area can achieve a uniform thickness effect.
  • the endplate segmentation area can be eliminated by using a three-dimensional morphological erosion algorithm to eliminate non-interesting interference areas such as osteophytes; upper and lower bony endplates
  • the mask label can be expanded to a customized endplate sub-region shape, and the shape and size of the "implant-endplate" contact surface under the geometric model can be obtained through the pre-designed geometric model of intervertebral implants such as cages. Matching mask labels for superior and inferior bony endplate subregions.
  • the way to create mask labels can be manually annotated by experts, and a binary mask can be used to represent the area of interest, where the label 1 represents the voxels in the area of interest, and the label 0 represents the voxels in the non-interest area.
  • the image data used for training are three-dimensional medical images, including computed tomography, CT, MRI and other imaging equipment to collect three-dimensional data.
  • Figure 3 is a schematic structural diagram of the neural network model for training vertebral body sub-region segmentation in this application;
  • three-dimensional data augmentation methods include: random exponential transformation and logarithmic transformation of pixel values, random three-dimensional radial transformation (including translation, stretching, shrinkage, rotation, shearing, etc.), random salt and pepper noise Disturbance, random elastic deformation, etc.; through data augmentation, the training data set is greatly enriched and the generalization ability of the model is increased.
  • the expanded training data set is used to iteratively train the model parameters.
  • the gradient descent method is used to iterate the model parameters as a whole, by calculating the loss functions of the four outputs of the model and the corresponding labels, and assigning weights to the loss functions of the four outputs, for example, weighting according to the ratio of 1:1:1:1, The total loss function of the model is obtained; the gradient descent method is used for the total loss function of the model, and the model parameters are updated and iteratively optimized at a certain learning rate.
  • This application uses cross entropy and Dice function weighted at 1:1 as the loss function of each output; where,
  • the cross entropy function is:
  • the Dice function is:
  • a is an output result in the neural network model
  • b is the labeling result
  • i is the pixel position index
  • is the total number of pixels.
  • this application greatly enhances the generalization ability of the vertebral body segmentation method through the expansion of the training sample set and iterative parameter training based on the mask label data annotated by experts, and solves the problems that existing technologies have on vertebral body image quality. It has strong dependence, poor generalization ability, and is difficult to adapt to problems such as different CT equipment, different scanning parameters, and different spine morphologies; and this application adds expert opinions to better improve segmentation performance and improve reliability. Expandability.
  • loss function and its weight distribution ratio selected in this application are only examples related to the solution of this application and do not constitute a limitation on the application of the solution of this application.
  • the types of loss functions actually used and each loss function The corresponding weights can be changed.
  • Step 3 Post-process the mask regression results through morphological operations and connectivity tests to obtain a three-dimensional segmentation mask of the upper and lower bony endplates, cortical bone lateral plates, and cancellous bone areas of the vertebral body, and complete the vertebral body subdivision. Region segmentation.
  • the upper and lower bony endplate mask regression results, cortical bone lateral plate mask regression results, and vertebral body cancellous bone mask regression results output from the vertebral body subregion segmentation neural network model need to be post-processed.
  • the three-dimensional morphological opening operation is first used to process each mask regression result to remove the fine-grained noise in the mask;
  • the convolution kernel of the opening operation is 3*3*3, and the essence of the three-dimensional morphological opening operation is
  • the sliding window method is used to morphologically erode and expand the neighborhood around each voxel to remove abnormal bone structure areas including vertebral osteophytes.
  • the schematic diagram of osteophyte removal is shown in Figure 6. Then, use the connectivity test to process the three mask regression results.
  • the skimage toolkit can be used to calculate the connectivity of the upper and lower bony endplate masks, and retain the two largest connected areas in the results, corresponding to There are two upper and lower bony endplates; the cortical bone lateral plate mask only retains the largest connected area; the vertebral cancellous bone mask retains only the largest connected area. After morphological operations and connectivity testing, the final vertebral sub-region segmentation mask area can be obtained.
  • the third embodiment of the present application provides a computer-readable storage medium.
  • a computer program is stored on the storage medium.
  • the computer program can be executed by a processor to realize the vertebral subregion described in any of the above embodiments of the present application. Segmentation method.
  • this application provides a vertebral body sub-region segmentation method: As shown in Figure 5, the pre-processed image data is input into the pre-trained vertebral body sub-region segmentation neural network model, and four prediction outputs are obtained. , including sub-region segmentation mask regression results and vertebral body segmentation mask regression results; the mask regression results are post-processed through morphological operations and connectivity tests to obtain the upper and lower vertebral body bony endplates, cortical bone lateral plates, loose The three-dimensional segmentation mask of vertebral body sub-regions such as the bone region is used to complete the segmentation of vertebral body sub-regions.
  • This application uses artificial intelligence technology to promote the application of smart medical care in clinical settings; it solves the shortcomings of traditional methods such as poor generalization ability, poor scalability, and limited application scope. And this segmentation method can be extended to the field of local bone density calculation to calculate the average CT value in the segmented area. Based on the CT value, through the automatic positioning analysis of muscle fat, quantitative calculation and analysis of bone mineral content can be performed, thereby achieving Bone density calculation measurement.
  • the process of implementing the method in the above embodiments can be completed by instructing relevant hardware through a computer program, and the program can be stored in a computer-readable storage medium.
  • the computer-readable storage medium is a magnetic disk, an optical disk, a read-only memory or a random access memory, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Medical Informatics (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Radiology & Medical Imaging (AREA)
  • Quality & Reliability (AREA)
  • Apparatus For Radiation Diagnosis (AREA)

Abstract

Disclosed are a vertebral body sub-region segmentation method and apparatus, and a storage medium, relating to the technical field of medical image processing. The segmentation method comprises: preprocessing obtained spine vertebral body image data; inputting the preprocessed image data into a pre-trained neural network model to obtain a sub-region segmentation mask regression result corresponding to a spine vertebral body; and performing post-processing on the mask regression result by means of morphological operation and connectivity test to obtain three-dimensional segmentation masks for vertebral body superior and inferior bony endplate regions having a uniform and customizable thickness, and cortical bone side plate and cancellous bone regions, thereby completing vertebral body sub-region segmentation. The problems in the prior art that a vertebral body sub-region segmentation method cannot implement personalized endplate and inferior bony endplate individual sub-region segmentation matching the complex anatomical form of the vertebral body bony endplates, the segmentation performance for the vertebral body side plate and vertebral body cancellous bone regions is poor, and the generalization ability is low are solved.

Description

一种椎体亚区域分割方法、装置及存储介质A vertebral body sub-region segmentation method, device and storage medium 技术领域Technical field
本申请涉及医学图像处理领域,尤其涉及一种椎体亚区域分割方法、装置及存储介质。The present application relates to the field of medical image processing, and in particular to a vertebral body sub-region segmentation method, device and storage medium.
背景技术Background technique
脊柱椎体亚区域主要包括骨性终板、皮质骨侧壁和松质,它们都对人体起着重要的支撑作用。其中,骨性终板及其邻近终板下骨区域的骨质量还是影响脊柱椎间融合术疗效的关键因素之一。终板骨质量越差,椎间融合术中出现终板损伤,术后出现终板塌陷和融合器下沉移位的风险越高。从骨密度、力学、形态结构等方面对椎体骨性终板、皮质骨侧板和松质骨区域(下文部分内容简称椎体亚区域)进行分析有助于建立骨质量的量化评价指标,辅助临床诊断和手术计划。基于三维医学影像(例如计算机断层成像,CT),可对椎体亚区域进行精准的三维分割和三维重建。现有技术方案一般基于形态学算法和相似度指标等底层图像处理方法对椎体图像进行分割,可自动生成椎体亚区域的分割掩膜。The vertebral body subregions of the spine mainly include bony endplates, cortical bone side walls, and cancellous bone, all of which play an important supporting role in the human body. Among them, the bone quality of the bony endplate and the bone area adjacent to the endplate is one of the key factors affecting the efficacy of spinal interbody fusion. The worse the endplate bone quality, the higher the risk of endplate damage during interbody fusion, and postoperative endplate collapse and cage subsidence and displacement. Analyzing the vertebral bony endplates, cortical bone lateral plates, and cancellous bone areas (hereinafter referred to as vertebral subregions) from aspects such as bone density, mechanics, and morphological structure will help establish quantitative evaluation indicators for bone quality. Assist clinical diagnosis and surgical planning. Based on three-dimensional medical images (such as computed tomography, CT), accurate three-dimensional segmentation and three-dimensional reconstruction of vertebral body sub-regions can be performed. Existing technical solutions generally segment vertebral body images based on underlying image processing methods such as morphological algorithms and similarity indicators, and can automatically generate segmentation masks for vertebral body sub-regions.
现有方法分割的骨性终板主要存在以下问题。第一,终板分割区域无法与复杂的终板解剖外形很好地匹配;第二,实际上在同一终板范围内不同厚度和不同形状的亚区域(下文简称终板亚区域)骨密度也各不相同,脊柱手术植入椎间的内植物往往只和某个终板亚区域接触,而不会和整个终板都接触。现有方法无法实现对自定义终板亚区域的分割;第三,现有方法无法对骨赘等非骨性终板区域进行分离排除。The bony endplate segmented by existing methods mainly has the following problems. First, the endplate segmentation area cannot match the complex endplate anatomical shape well; secondly, in fact, the bone density of subregions with different thicknesses and shapes (hereinafter referred to as endplate subregions) within the same endplate range is also different. Different from each other, the implants implanted in the intervertebral spine during spinal surgery often only come into contact with a certain endplate sub-region and do not contact the entire endplate. Existing methods cannot achieve segmentation of customized endplate subregions; thirdly, existing methods cannot separate and exclude non-osseous endplate regions such as osteophytes.
另一方面,现有方法对椎体侧板和椎体松质骨区域的分割性能较差。对椎体图像质量有较强依赖性,即泛化能力较差,难以在不同的成像设备(如CT设备的品牌、型号)、不同的扫描参数(电压、辐射剂量、扫描层 厚、重建核函数)、不同脊柱形态(结构异常,如畸形、骨折等;椎体角度异常,如侧弯、前后凸等)等场景中均保持较高的分割准确度。On the other hand, existing methods have poor segmentation performance for the vertebral body lateral plates and vertebral body cancellous bone regions. There is a strong dependence on the quality of the vertebral body image, that is, the generalization ability is poor, and it is difficult to adapt to different imaging equipment (such as the brand and model of CT equipment), different scanning parameters (voltage, radiation dose, scanning layer thickness, reconstruction kernel). Function), different spinal morphology (structural abnormalities, such as deformities, fractures, etc.; abnormal vertebral body angles, such as scoliosis, anterior and posterior kyphosis, etc.) and other scenarios maintain high segmentation accuracy.
上述问题均制约了现有方法在临床上的广泛应用。The above problems have restricted the widespread clinical application of existing methods.
发明内容Contents of the invention
鉴于上述的分析,本申请旨在提供一种椎体亚区域分割方法、装置及存储介质,通过预训练的椎体亚区域分割神经网络模型,得到椎体亚区域的三维分割掩膜。本申请解决了现有技术中椎体亚区域分割方法未涉及的问题,包括:无法实现与椎体骨性终板复杂的解剖形态相匹配的自定义终板及终板下骨亚区域分割;对椎体侧板和椎体松质骨区域的分割性能较差,且泛化能力不高。本申请可以在脊柱局部骨密度计算领域进行拓展应用。In view of the above analysis, this application aims to provide a vertebral body sub-region segmentation method, device and storage medium, and obtain a three-dimensional segmentation mask of the vertebral body sub-region through the pre-trained vertebral body sub-region segmentation neural network model. This application solves problems not covered by vertebral body sub-region segmentation methods in the prior art, including: the inability to achieve customized endplate and sub-endplate bone sub-region segmentation that matches the complex anatomical shape of the vertebral bony endplate; The segmentation performance of the vertebral body lateral plate and vertebral body cancellous bone area is poor, and the generalization ability is not high. This application can be expanded and applied in the field of spinal local bone density calculation.
本申请的目的主要是通过以下技术方案实现的:The purpose of this application is mainly achieved through the following technical solutions:
一方面,提供一种椎体亚区域分割方法,包括以下步骤:On the one hand, a vertebral body sub-region segmentation method is provided, including the following steps:
对获取得到的脊柱椎体影像数据进行预处理;Preprocess the obtained spinal vertebral body image data;
将预处理后的影像数据输入预训练的神经网络模型,得到脊柱椎体对应的亚区域分割掩膜回归结果;其中,神经网络模型用于输出椎体上下骨性终板掩膜回归结果、皮质骨侧板掩膜回归结果、松质骨区域掩膜回归结果以及基于融合得到的椎体掩膜回归结果;Input the pre-processed image data into the pre-trained neural network model to obtain the sub-region segmentation mask regression results corresponding to the spinal vertebrae; among them, the neural network model is used to output the mask regression results of the upper and lower bony endplates of the vertebral body, cortex Bone lateral plate mask regression results, cancellous bone area mask regression results, and vertebral body mask regression results based on fusion;
通过形态学操作和连通性测试将椎体掩膜回归结果进行后处理,得到椎体上下骨性终板、皮质骨侧板、松质骨区域的三维分割掩膜,完成椎体亚区域分割;The vertebral body mask regression results are post-processed through morphological operations and connectivity tests to obtain a three-dimensional segmentation mask of the upper and lower bony endplates, cortical bone lateral plates, and cancellous bone areas of the vertebral body to complete vertebral body sub-region segmentation;
椎体亚区域包括椎体上下骨性终板、皮质骨侧板和松质骨区域。The vertebral body subregion includes the upper and lower bony endplates of the vertebral body, cortical bone lateral plates, and cancellous bone areas.
进一步的,预训练的神经网络模型为分叉式多任务卷积神经网络结构,包括一个编码器、三个解码器和一个MAX融合单元:其中,Further, the pre-trained neural network model is a bifurcated multi-task convolutional neural network structure, including an encoder, three decoders and a MAX fusion unit: where,
编码器用于接收预处理后的影像数据得到特征图;The encoder is used to receive preprocessed image data to obtain feature maps;
三个解码器分别与编码器相连,分别用于基于特征图输出上下骨性终板掩膜回归结果、皮质骨侧板掩膜回归结果和椎体松质骨掩膜回归结果;The three decoders are respectively connected to the encoder and used to output the upper and lower bony endplate mask regression results, cortical bone lateral plate mask regression results and vertebral cancellous bone mask regression results based on the feature map;
MAX融合单元分别与三个编码器连接,用于输出椎体掩膜回归结果。The MAX fusion unit is connected to three encoders respectively and used to output the vertebral mask regression results.
进一步的,神经网络的预训练过程包括:Further, the pre-training process of the neural network includes:
为预处理后的影像数据分别构建上下骨性终板掩膜标签、皮质骨侧板掩膜标签、椎体松质骨掩膜标签以及融合得到椎体掩膜标签;For the preprocessed image data, upper and lower bony endplate mask labels, cortical bone lateral plate mask labels, vertebral body cancellous bone mask labels are constructed respectively, and the vertebral body mask label is obtained by fusion;
通过模拟不同场景下获取的影像特征对构建标签后的影像数据进行增广得到训练样本,以扩充训练数据集;By simulating the image features obtained in different scenarios, the image data after constructing the label is augmented to obtain training samples to expand the training data set;
计算神经网络模型的四个输出结果与对应标签的损失函数,为四个输出的损失函数分配权重得到神经网络模型总的损失函数;基于神经网络模型总的损失函数采用梯度下降法进行模型参数的迭代训练。Calculate the loss functions of the four output results of the neural network model and the corresponding labels, and assign weights to the four output loss functions to obtain the total loss function of the neural network model; based on the total loss function of the neural network model, the gradient descent method is used to determine the model parameters. Iterative training.
进一步的,使用二值掩膜法对训练样本图像中的体素进行标注,为训练样本图像构建得到上下骨性终板掩膜标签、皮质骨侧板掩膜标签、椎体松质骨掩膜标签;Furthermore, the binary mask method was used to annotate the voxels in the training sample image, and the upper and lower bony endplate mask labels, cortical bone lateral plate mask labels, and vertebral cancellous bone mask were constructed for the training sample image. Label;
通过融合上下骨性终板掩膜标签、皮质骨侧板掩膜标签、椎体松质骨掩膜标签,构建得到椎体掩膜标签。The vertebral body mask label is constructed by fusing the upper and lower bony endplate mask labels, cortical bone lateral plate mask labels, and vertebral body cancellous bone mask labels.
其中,构建终板掩膜标签,包括自定义终板区域的厚度和形状以得到终板掩膜标签。Among them, the endplate mask label is constructed, including customizing the thickness and shape of the endplate area to obtain the endplate mask label.
进一步的,采用梯度下降法进行模型参数迭代,包括:Further, the gradient descent method is used to iterate the model parameters, including:
采用交叉熵和Dice函数的加权作为每个输出结果的损失函数,其中,交叉熵函数为:The weight of cross entropy and Dice function is used as the loss function of each output result, where the cross entropy function is:
Figure PCTCN2022143887-appb-000001
Figure PCTCN2022143887-appb-000001
Dice函数为:The Dice function is:
Figure PCTCN2022143887-appb-000002
Figure PCTCN2022143887-appb-000002
其中,a为神经网络模型中一个输出结果,b为标注结果,i为体素位置索引,|n|为总体素数。Among them, a is an output result in the neural network model, b is the labeling result, i is the voxel position index, and |n| is the total prime number.
进一步的,MAX融合单元用于输出椎体掩膜回归结果,包括:基于预处理后的脊柱椎体影像的每个体素位置索引,取三个解码器对应索引的最大值,对每个体素的三个解码器对应索引的最大值进行MAX融合,输出椎体掩膜回归结果。Further, the MAX fusion unit is used to output the vertebral mask regression results, including: based on the position index of each voxel of the preprocessed spinal vertebral image, taking the maximum value of the corresponding index of the three decoders, and calculating the position index of each voxel. The three decoders perform MAX fusion on the maximum value of the corresponding index, and output the vertebral mask regression result.
进一步的,对脊柱椎体影像数据进行预处理包括重采样处理和像素值归一化处理;其中,Further, the preprocessing of the spinal vertebra image data includes resampling and pixel value normalization; among which,
重采样处理包括:将获取得到的脊柱椎体影像的空间分辨率除以预设的空间分辨率,得到影像数据在三个维度的重采样比率;根据重采样比率,采用线性插值方法得到重采样后的具有固定空间分辨率的影像数据。The resampling process includes: dividing the spatial resolution of the obtained spinal vertebral body image by the preset spatial resolution to obtain the resampling ratio of the image data in three dimensions; based on the resampling ratio, using linear interpolation method to obtain the resampling The resulting image data with fixed spatial resolution.
像素值归一化处理包括:将原始像素值范围[M,N]通过线性函数映射到预设的值域[P,Q];其中M为CT影像最小像素值,N为CT影像最大像素值,P为预设值域的下界,Q为预设值域的上界。Pixel value normalization processing includes: mapping the original pixel value range [M, N] to the preset value range [P, Q] through a linear function; where M is the minimum pixel value of the CT image, and N is the maximum pixel value of the CT image. , P is the lower bound of the preset value range, and Q is the upper bound of the preset value range.
进一步的,通过形态学操作和连通性测试将掩膜回归结果进行后处理,包括:使用卷积核为3*3*3的三维形态学开运算,通过滑动窗口法对每个体素周围的邻域进行形态学腐蚀和膨胀,去除掩膜中的细颗粒噪声,以去除包括椎体骨赘在内的非正常骨质结构区域。Further, the mask regression results are post-processed through morphological operations and connectivity tests, including: using a three-dimensional morphological opening operation with a convolution kernel of 3*3*3, and using the sliding window method to calculate the neighbors around each voxel. Domains are morphologically eroded and dilated to remove fine-grained noise in the mask to remove areas of abnormal bone structure including vertebral osteophytes.
使用skimage工具包对上下骨性终板掩膜进行连通性计算,保留结果中最大的两块连通区域,分别对应上、下两块骨性终板;皮质骨侧板掩膜与椎体松质骨掩膜只保留最大的一块连通区。Use the skimage toolkit to calculate the connectivity of the upper and lower bony endplate masks, and retain the two largest connected areas in the results, corresponding to the upper and lower bony endplates respectively; the cortical bone lateral plate mask and the vertebral cancellous The bone mask retains only the largest connected area.
另一方面,本申请还提供一种椎体亚区域分割装置,包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现前述的椎体亚区域分割方法。On the other hand, this application also provides a vertebral body sub-region segmentation device, including: a memory, a processor, and a computer program stored on the memory and executable on the processor, and the processor executes the The computer program implements the aforementioned vertebral body sub-region segmentation method.
第三方面,还提供一种计算机可读存储介质,存储介质上存储有计算机程序,所述计算机程序可被处理器执行,实现前述的椎体亚区域分割方法。In a third aspect, a computer-readable storage medium is also provided. A computer program is stored on the storage medium, and the computer program can be executed by a processor to implement the aforementioned vertebral body sub-region segmentation method.
本申请还可以往局部骨密度计算领域进行扩展,计算分割区域内的平均CT值,在CT值基础上,通过肌肉脂肪的自动化定位分析,进行骨矿物含量的定量化计算和分析。This application can also be extended to the field of local bone density calculation to calculate the average CT value within the segmented area. Based on the CT value, quantitative calculation and analysis of bone mineral content can be performed through automated positioning analysis of muscle fat.
本技术方案的有益效果:Beneficial effects of this technical solution:
1.针对现有方法不能实现与复杂终板解剖结构匹配的个性化终板亚区域分割,对椎体侧壁和松质骨区域分割性能较差的问题,本申请将椎体划分为上下骨性终板,皮质骨侧板和椎体松质骨三个区域。可实现个性化终板厚度分割、与“植入物-终板”接触面匹配的终板亚区域分割,去除骨赘等非正常骨质结构。更加准确的对椎体进行分割,进一步扩大了椎体分割方法的应用范围。1. In view of the problem that existing methods cannot achieve personalized endplate sub-region segmentation that matches the complex endplate anatomy and have poor performance in segmenting vertebral body side walls and cancellous bone regions, this application divides the vertebral body into upper and lower bones. There are three regions: sexual endplate, cortical bone lateral plate and vertebral cancellous bone. It can achieve personalized endplate thickness segmentation, endplate sub-region segmentation matching the "implant-endplate" contact surface, and remove abnormal bone structures such as osteophytes. Segmenting the vertebral body more accurately further expands the application scope of the vertebral body segmentation method.
2.针对现有技术泛化能力较差的问题,本申请提出一种基于预训练的神经网络模型的机器学习技术方法,并且通过数据增广法,模拟不同场景下获取的椎体影像特征,增加预测模型对影像对比度、影像噪声、椎体位姿、影像层厚的适应度,从而获得更强的泛化能力。相比于现有技术,本方法可自主学习影像特征,无需使用特定的特征提取方法。2. In view of the problem of poor generalization ability of the existing technology, this application proposes a machine learning technology method based on a pre-trained neural network model, and uses the data augmentation method to simulate the vertebral image features obtained in different scenarios. Increase the adaptability of the prediction model to image contrast, image noise, vertebral body posture, and image layer thickness, thereby obtaining stronger generalization ability. Compared with the existing technology, this method can learn image features independently without using a specific feature extraction method.
3.针对现有技术不具备可拓展性的问题,本申请采用数据驱动的建模方式,可通过增加训练数据的方法提升预测模型性能,可将专家意见通过数据标注的方式添加到模型训练过程中,不断优化。3. In view of the problem that the existing technology is not scalable, this application adopts a data-driven modeling method, which can improve the performance of the prediction model by adding training data, and can add expert opinions to the model training process through data annotation. , and continue to optimize.
本申请的其他特征和优点将在随后的说明书中阐述,并且,部分的从说明书中变得显而易见,或者通过实施本申请而了解。本申请的目的和其他优点可通过在所写的说明书、权利要求书、以及附图中所特别指出的结构来实现和获得。Additional features and advantages of the invention will be set forth in the description which follows, and in part will be apparent from the description, or may be learned by practice of the invention. The objectives and other advantages of the application may be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
附图说明Description of drawings
附图仅用于示出具体实施例的目的,而并不认为是对本申请的限制,在整个附图中,相同的参考符号表示相同的部件。The drawings are for the purpose of illustrating specific embodiments only and are not to be construed as limiting the application. Throughout the drawings, the same reference characters represent the same components.
图1:本申请实施例的椎体亚区域分割方法的示意流程图;Figure 1: Schematic flow chart of the vertebral body sub-region segmentation method according to the embodiment of the present application;
图2:本申请实施例的椎体亚区域分割神经网络模型的结构示意图;Figure 2: Schematic structural diagram of the vertebral body sub-region segmentation neural network model according to the embodiment of the present application;
图3:本申请实施例的一种训练椎体亚区域分割神经网络模型的结构示意图;Figure 3: Structural diagram of a neural network model for training vertebral body sub-region segmentation according to the embodiment of the present application;
图4:本申请实施例的脊椎影像原始图与对应的数据增广结果的对比图;Figure 4: Comparison of the original spine image and the corresponding data augmentation result according to the embodiment of the present application;
图5:本申请实施例的椎体亚区域分割方法的结构示意图;Figure 5: Structural diagram of the vertebral body sub-region segmentation method according to the embodiment of the present application;
图6:本申请实施例的椎体骨赘去除方法,通过腐蚀算法去除外圈从而去除外围骨赘示意图;Figure 6: A schematic diagram of the vertebral body osteophyte removal method according to the embodiment of the present application, which removes the outer ring through an erosion algorithm to remove peripheral osteophytes;
具体实施方式Detailed ways
下面结合附图来具体描述本申请的优选实施例,其中,附图构成本申请一部分,并与本申请的实施例一起用于阐释本申请的原理,并非用于限定本申请的范围。Preferred embodiments of the present application will be described in detail below with reference to the accompanying drawings. The drawings constitute a part of the present application and are used together with the embodiments of the present application to illustrate the principles of the present application, but are not intended to limit the scope of the present application.
本申请通过对脊柱椎体影像数据进行预处理、构建掩膜标签、神经网络模型预测和后处理操作,得到椎体亚区域分割掩膜;本申请的椎体亚区 域包括:椎体骨性终板、皮质骨侧板和椎体松质骨区域。示例性的,脊柱椎体影像数据为CT影像数据。This application obtains the vertebral body sub-region segmentation mask by preprocessing the spinal vertebral body image data, constructing mask labels, neural network model prediction and post-processing operations; the vertebral body sub-regions in this application include: vertebral body bony end plates, lateral cortical plates, and cancellous bone areas of the vertebral body. For example, the spinal vertebral body image data is CT image data.
本申请的一个实施例,提供一种椎体亚区域分割方法,如图1所示,包括以下步骤:One embodiment of the present application provides a vertebral body sub-region segmentation method, as shown in Figure 1, including the following steps:
步骤1:对获取得到的脊柱椎体影像数据进行预处理;Step 1: Preprocess the obtained spinal vertebral body image data;
具体的,预处理包括重采样和像素值归一化处理。将脊柱椎体影像的空间分辨率除以预设的空间分辨率,得到所述影像数据在三个维度的重采样比率;根据所述重采样比率,采用线性插值方法得到重采样后的具有固定空间分辨率的影像数据;并将原始像素值范围通过线性函数映射到预设的值域。Specifically, preprocessing includes resampling and pixel value normalization. Divide the spatial resolution of the spine vertebral body image by the preset spatial resolution to obtain the resampling ratio of the image data in three dimensions; according to the resampling ratio, use a linear interpolation method to obtain the resampled image with a fixed Image data of spatial resolution; and map the original pixel value range to a preset value range through a linear function.
具体的,在得到脊柱椎体影像数据后,通过对其进行预处理操作,能够消除原始影像数据的不同空间分辨率和极端像素值对后续步骤的负面影响。Specifically, after obtaining the spinal vertebral body image data, preprocessing it can eliminate the negative impact of different spatial resolutions and extreme pixel values of the original image data on subsequent steps.
首先,对原始影像数据进行重采样:重采样需要得到输入的脊柱椎体CT影像的空间分辨率,即每个体素对应的物理空间尺寸。基于DICOM协议的影像数据会将其作为元数据(metadata)的一部分保存该信息,作为一个具体的实施例,可以通过基于DICOM协议的影像数据获得CT影像的空间分辨率。重采样的目标空间分辨率为某一固定的空间尺寸,例如150*90*90毫米。以此为目的,首先预设一个空间分辨率,例如1*1*1毫米,将输入的CT影像的空间分辨率除以预设好的空间分辨率,得到原始影像数据所在三个维度的重采样比率;作为一个具体的实施例,可以采用线性插值方法(trilinear interpolation),经过重采样,得到具有固定空间尺寸的CT影像数据。First, resample the original image data: resampling requires obtaining the spatial resolution of the input spine vertebral body CT image, that is, the physical spatial size corresponding to each voxel. Image data based on the DICOM protocol will save this information as part of the metadata. As a specific embodiment, the spatial resolution of the CT image can be obtained through the image data based on the DICOM protocol. The target spatial resolution of resampling is a certain fixed spatial size, such as 150*90*90 mm. For this purpose, first preset a spatial resolution, such as 1*1*1 mm, divide the spatial resolution of the input CT image by the preset spatial resolution, and obtain the weight of the three dimensions of the original image data. Sampling ratio; as a specific embodiment, a linear interpolation method (trilinear interpolation) can be used to obtain CT image data with a fixed spatial size after resampling.
对输入的CT影像数据进行重采样后,需要对影像的像素值做归一化处理,将原始像素值范围[M,N]通过线性函数映射到某一预设的值域[P, Q],例如[-1,1],其中M为CT影像最小像素值,N为CT影像最大像素值,P为预设值域的下界,Q为预设值域的上界,使用(M,P)和(N,Q)这两点拟合线性函数。After resampling the input CT image data, the pixel values of the image need to be normalized, and the original pixel value range [M, N] is mapped to a preset value range [P, Q] through a linear function. , for example [-1,1], where M is the minimum pixel value of the CT image, N is the maximum pixel value of the CT image, P is the lower bound of the preset value range, Q is the upper bound of the preset value range, use (M,P ) and (N, Q) fit the linear function.
需要说明的是,经过重采样后,将影像数据以统一的空间分辨率表示,从而消除不同空间分辨率(例如数据层厚和重建方法的不同)带来的结构化差异,使得亚区域分割模型将特征表征的学习方向集中于影像的语义本身。像素值归一化的目的是进一步消除极端像素值对后续步骤的负面影响;例如,某些金属植入物在CT影像中具有异常高的像素值,需要通过像素值归一化进行抑制。采用线性插值方法(trilinear interpolation)进行归一化处理,能够在保留影像特征的基础上具有较快的处理速度。It should be noted that after resampling, the image data is represented with a unified spatial resolution, thereby eliminating the structural differences caused by different spatial resolutions (such as data layer thickness and reconstruction methods), making the sub-region segmentation model Focus the learning direction of feature representation on the semantics of the image itself. The purpose of pixel value normalization is to further eliminate the negative impact of extreme pixel values on subsequent steps; for example, some metal implants have abnormally high pixel values in CT images and need to be suppressed by pixel value normalization. The linear interpolation method (trilinear interpolation) is used for normalization processing, which can achieve faster processing speed while retaining image characteristics.
步骤2:将所述预处理后的影像数据输入预训练的神经网络模型,得到脊柱椎体对应的亚区域分割掩膜回归结果;其中,神经网络模型用于输出椎体上下骨性终板掩膜回归结果、皮质骨侧板掩膜回归结果、松质骨区域掩膜回归结果以及基于融合得到的椎体掩膜回归结果;四个所述掩膜回归结果即为神经网络模型的预测输出;对四个所述预测输出经过回归迭代计算,得到最终的椎体上下骨性终板掩膜回归结果、皮质骨侧板掩膜回归结果、松质骨区域掩膜回归结果以及基于融合得到的椎体掩膜回归结果。Step 2: Input the pre-processed image data into the pre-trained neural network model to obtain the sub-region segmentation mask regression results corresponding to the spinal vertebrae; wherein, the neural network model is used to output the upper and lower bony endplate masks of the vertebral body. Membrane regression results, cortical bone lateral plate mask regression results, cancellous bone area mask regression results and vertebral body mask regression results based on fusion; the four mask regression results are the prediction output of the neural network model; The four predicted outputs are subjected to regression iterative calculations to obtain the final vertebral upper and lower bony endplate mask regression results, cortical bone lateral plate mask regression results, cancellous bone region mask regression results, and vertebral fusion-based vertebral mask regression results. Volume mask regression results.
具体地,本申请采用了分叉式多任务卷积神经网络结构,如图2所示。多任务学习的策略有助于提升卷积神经网络的泛化能力,抑制过拟合现象。分叉式结构由一个编码器、三个解码器和一个MAX融合单元组成;其中,编码器用于接收所述预处理后的影像数据得到特征图;其中三个解码器和一个MAX融合单元的输出即为模型的四种输出;三个解码器分别与编码器相连,用于基于特征图输出上下骨性终板掩膜回归结果、皮质骨 侧板掩膜回归结果和椎体松质骨掩膜回归结果;MAX融合单元分别与三个编码器连接,用于输出椎体掩膜回归结果。Specifically, this application adopts a bifurcated multi-task convolutional neural network structure, as shown in Figure 2. The multi-task learning strategy helps improve the generalization ability of convolutional neural networks and suppress over-fitting. The bifurcated structure consists of an encoder, three decoders and a MAX fusion unit; the encoder is used to receive the preprocessed image data to obtain feature maps; the outputs of the three decoders and a MAX fusion unit It is the four outputs of the model; the three decoders are connected to the encoder respectively, and are used to output the upper and lower bony endplate mask regression results, cortical bone lateral plate mask regression results and vertebral cancellous bone mask based on the feature map. Regression results; the MAX fusion unit is connected to three encoders respectively to output the vertebral mask regression results.
作为一个具体的实施例,本申请借鉴UNet结构,使用跳跃连接的方式将编码器的特征图传递给三个解码器,从而保证局部特征图能够有效地传递到解码器,弥补下采样造成的信息丢失。为了进一步抑制模型过拟合现象,本申请采用MAX融合单元将三个解码器的输出进行体素级融合;即对每个体素位置索引取三个解码器对应索引的最大值,相当于完成了基于通道最大值的椎体亚区域分类投票。三个通道的MAX融合输出能够还原出完整的椎体掩膜,过滤无效特征区域。As a specific embodiment, this application draws on the UNet structure and uses skip connections to transfer the feature map of the encoder to three decoders, thereby ensuring that the local feature map can be effectively transferred to the decoder and making up for the information caused by downsampling. lost. In order to further suppress the over-fitting phenomenon of the model, this application uses the MAX fusion unit to perform voxel-level fusion of the outputs of the three decoders; that is, for each voxel position index, the maximum value of the corresponding index of the three decoders is obtained, which is equivalent to completing Cone subregion classification voting based on channel maxima. The three-channel MAX fusion output can restore the complete vertebral mask and filter out invalid feature areas.
为了训练本申请所述的椎体亚区域分割神经网络模型,需要为训练用的图像数据构建三种训练标签:上下骨性终板掩膜标签、皮质骨侧板掩膜标签、椎体松质骨掩膜标签。椎体掩膜标签可通过在空间中融合前述的三种标签获得。In order to train the vertebral body sub-region segmentation neural network model described in this application, three types of training labels need to be constructed for the image data used for training: upper and lower bony endplate mask labels, cortical bone lateral plate mask labels, and vertebral body cancellous Bone mask label. The vertebral mask label can be obtained by fusing the aforementioned three labels in space.
通过上述方法,终板分割区域与不同脊柱节段的终板解剖外形基本一致,可以根据神经网络模型进行图像分割;且终板分割区域可实现厚度均匀的效果,通过形态学计算,按照预先自定义的厚度进行分割,可覆盖骨性终板以及自定义厚度范围的终板下骨;终板分割区域可通过三维形态学腐蚀算法,消除骨赘等非感兴趣干扰区域;上下骨性终板掩膜标签可以拓展为自定义终板亚区域形状,通过预先设计的融合器等椎体间植入物的几何模型,得到该几何模型下与“植入物-终板”接触面的形状大小匹配的上下骨性终板亚区域的掩膜标签。Through the above method, the endplate segmentation area is basically consistent with the anatomical shape of the endplate of different spinal segments, and image segmentation can be performed based on the neural network model; and the endplate segmentation area can achieve a uniform thickness effect. Through morphological calculation, according to the pre-automatic Segmentation with a defined thickness can cover the bony endplate and the bone under the endplate in a custom thickness range; the endplate segmentation area can be eliminated by using a three-dimensional morphological erosion algorithm to eliminate non-interesting interference areas such as osteophytes; upper and lower bony endplates The mask label can be expanded to a customized endplate sub-region shape, and the shape and size of the "implant-endplate" contact surface under the geometric model can be obtained through the pre-designed geometric model of intervertebral implants such as cages. Matching mask labels for superior and inferior bony endplate subregions.
具体的,创建掩膜标签的方式可以通过专家手动标注,可使用二值掩膜表示感兴趣区域,其中标注1代表感兴趣区域内的体素,标注0代表非感兴趣区域体素。训练用的图像数据为三维医学影像,包括计算机断层成像,CT、MRI等影像设备采集到三维数据。Specifically, the way to create mask labels can be manually annotated by experts, and a binary mask can be used to represent the area of interest, where the label 1 represents the voxels in the area of interest, and the label 0 represents the voxels in the non-interest area. The image data used for training are three-dimensional medical images, including computed tomography, CT, MRI and other imaging equipment to collect three-dimensional data.
图3是本申请训练椎体亚区域分割神经网络模型的结构示意图;Figure 3 is a schematic structural diagram of the neural network model for training vertebral body sub-region segmentation in this application;
首先对图像数据进行数据增广。具体的,如图4所示,三维数据增广方法包括:像素值随机指数变换和对数变换、随机三维度放射变换(包括平移、拉伸、收缩、旋转、剪切等)、随机椒盐噪声扰动、随机弹性形变等;通过数据增广极大的丰富了训练数据集,增加了模型的泛化能力。First, perform data augmentation on the image data. Specifically, as shown in Figure 4, three-dimensional data augmentation methods include: random exponential transformation and logarithmic transformation of pixel values, random three-dimensional radial transformation (including translation, stretching, shrinkage, rotation, shearing, etc.), random salt and pepper noise Disturbance, random elastic deformation, etc.; through data augmentation, the training data set is greatly enriched and the generalization ability of the model is increased.
通过数据增广扩充训练数据集,模拟不同场景下获取的CT影像特征,增加模型对影像对比度、影像噪声、椎体位姿、影像层厚的适应度,从而获得更强的泛化能力,能够应用于更广泛的临床场景。Expand the training data set through data augmentation, simulate CT image features acquired in different scenarios, and increase the model's adaptability to image contrast, image noise, vertebral body posture, and image layer thickness, thereby obtaining stronger generalization ability and being able to Applicable to a wider range of clinical scenarios.
其次,利用扩充后的训练数据集,对模型参数进行迭代训练。具体的,模型参数迭代整体采用梯度下降法,通过计算模型的四个输出与对应标签的损失函数,并为四种输出的损失函数分配权重,例如按照1:1:1:1的比例加权,得到模型总的损失函数;对模型总的损失函数采用梯度下降法,以一定的学习率对模型参数进行更新迭代优化。本申请采用交叉熵和Dice函数按1:1加权作为每个输出的损失函数;其中,Secondly, the expanded training data set is used to iteratively train the model parameters. Specifically, the gradient descent method is used to iterate the model parameters as a whole, by calculating the loss functions of the four outputs of the model and the corresponding labels, and assigning weights to the loss functions of the four outputs, for example, weighting according to the ratio of 1:1:1:1, The total loss function of the model is obtained; the gradient descent method is used for the total loss function of the model, and the model parameters are updated and iteratively optimized at a certain learning rate. This application uses cross entropy and Dice function weighted at 1:1 as the loss function of each output; where,
交叉熵函数为:The cross entropy function is:
Figure PCTCN2022143887-appb-000003
Figure PCTCN2022143887-appb-000003
Dice函数为:The Dice function is:
Figure PCTCN2022143887-appb-000004
Figure PCTCN2022143887-appb-000004
其中,a为神经网络模型中一个输出结果,b为标注结果,i为像素位置索引,|n|为总像素数。Among them, a is an output result in the neural network model, b is the labeling result, i is the pixel position index, and |n| is the total number of pixels.
需要说明的是,本申请通过训练样本集扩充和根据专家标注的掩膜标签数据进行参数迭代训练,极大的增强了椎体分割方法的泛化能力,解决了现有技术对椎体图像质量有较强的依赖性,泛化能力较差,难以适应 不同的CT设备、不同的扫描参数和不同脊柱形态等问题;且本申请增加了专家意见,更好的改善了分割性能,提高了可拓展性。It should be noted that this application greatly enhances the generalization ability of the vertebral body segmentation method through the expansion of the training sample set and iterative parameter training based on the mask label data annotated by experts, and solves the problems that existing technologies have on vertebral body image quality. It has strong dependence, poor generalization ability, and is difficult to adapt to problems such as different CT equipment, different scanning parameters, and different spine morphologies; and this application adds expert opinions to better improve segmentation performance and improve reliability. Expandability.
本领域的技术人员可以理解,本申请选择的损失函数及其权重分配比例仅仅是与本申请方案相关的实例,并不构成应用本申请方案的限定,实际使用的损失函数种类和每个损失函数对应的权重可以改变。Those skilled in the art can understand that the loss function and its weight distribution ratio selected in this application are only examples related to the solution of this application and do not constitute a limitation on the application of the solution of this application. The types of loss functions actually used and each loss function The corresponding weights can be changed.
步骤3:通过形态学操作和连通性测试将所述掩膜回归结果进行后处理,得到椎体上下骨性终板、皮质骨侧板、松质骨区域的三维分割掩膜,完成椎体亚区域分割。Step 3: Post-process the mask regression results through morphological operations and connectivity tests to obtain a three-dimensional segmentation mask of the upper and lower bony endplates, cortical bone lateral plates, and cancellous bone areas of the vertebral body, and complete the vertebral body subdivision. Region segmentation.
由椎体亚区域分割神经网络模型输出得到的上下骨性终板掩膜回归结果、皮质骨侧板掩膜回归结果、椎体松质骨掩膜回归结果需要进行后处理。具体的,首先使用三维形态学开运算对每个掩膜回归结果进行处理,去除掩膜中的细颗粒噪声;开运算的卷积核为3*3*3,三维形态学开运算的本质是使用滑动窗口法对每个体素周围的邻域进行形态学腐蚀和膨胀,以去除包括椎体骨赘在内的非正常骨质结构区域,去除骨赘的示意图如图6所示。然后,使用连通性测试对三种掩膜回归结果进行处理,具体的,可以使用skimage工具包对上下骨性终板掩膜进行连通性计算,并保留结果中最大的两块连通区域,分别对应上、下两块骨性终板;皮质骨侧板掩膜仅保留最大的一块连通区域;椎体松质骨掩膜仅保留最大的一块连通区。经过形态学操作和连通性测试,即可得到最终的椎体亚区域分割掩膜区。The upper and lower bony endplate mask regression results, cortical bone lateral plate mask regression results, and vertebral body cancellous bone mask regression results output from the vertebral body subregion segmentation neural network model need to be post-processed. Specifically, the three-dimensional morphological opening operation is first used to process each mask regression result to remove the fine-grained noise in the mask; the convolution kernel of the opening operation is 3*3*3, and the essence of the three-dimensional morphological opening operation is The sliding window method is used to morphologically erode and expand the neighborhood around each voxel to remove abnormal bone structure areas including vertebral osteophytes. The schematic diagram of osteophyte removal is shown in Figure 6. Then, use the connectivity test to process the three mask regression results. Specifically, you can use the skimage toolkit to calculate the connectivity of the upper and lower bony endplate masks, and retain the two largest connected areas in the results, corresponding to There are two upper and lower bony endplates; the cortical bone lateral plate mask only retains the largest connected area; the vertebral cancellous bone mask retains only the largest connected area. After morphological operations and connectivity testing, the final vertebral sub-region segmentation mask area can be obtained.
本申请的另一个实施例,提供了一种椎体亚区域分割装置,包括:存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,且处理器执行计算机程序时实现本申请上述任一实施例所述的椎体亚区域分割方法。Another embodiment of the present application provides a vertebral body sub-region segmentation device, including: a memory, a processor, and a computer program stored in the memory and executable on the processor, and the present invention is implemented when the processor executes the computer program. Apply the vertebral body sub-region segmentation method described in any of the above embodiments.
本申请的第三个实施例,提供了一种计算机可读存储介质,存储介质上存储有计算机程序,计算机程序可被处理器执行,实现本申请上述任一实施例所述的椎体亚区域分割方法。The third embodiment of the present application provides a computer-readable storage medium. A computer program is stored on the storage medium. The computer program can be executed by a processor to realize the vertebral subregion described in any of the above embodiments of the present application. Segmentation method.
综上所述,本申请提供的一种椎体亚区域分割方法:如图5所示,将预处理后的影像数据输入到预先训练的椎体亚区域分割神经网络模型,得到四个预测输出,包括亚区域分割掩膜回归结果和椎体分割掩膜回归结果;通过形态学操作和连通性测试对掩膜回归结果进行后处理,得到椎体上下骨性终板、皮质骨侧板、松质骨区域等椎体亚区域的三维分割掩膜,完成椎体亚区域分割。本申请使用人工智能技术促进了智慧医疗在临床中的应用;解决了传统方法泛化能力差、可拓展性差、应用范围局限的缺点。且该分割方法可以向局部骨密度计算领域进行扩展,计算分割区域内的平均CT值,在CT值基础上,通过肌肉脂肪的自动化定位分析,进行骨矿物含量的定量化计算和分析,从而实现骨密度计算测量。To sum up, this application provides a vertebral body sub-region segmentation method: As shown in Figure 5, the pre-processed image data is input into the pre-trained vertebral body sub-region segmentation neural network model, and four prediction outputs are obtained. , including sub-region segmentation mask regression results and vertebral body segmentation mask regression results; the mask regression results are post-processed through morphological operations and connectivity tests to obtain the upper and lower vertebral body bony endplates, cortical bone lateral plates, loose The three-dimensional segmentation mask of vertebral body sub-regions such as the bone region is used to complete the segmentation of vertebral body sub-regions. This application uses artificial intelligence technology to promote the application of smart medical care in clinical settings; it solves the shortcomings of traditional methods such as poor generalization ability, poor scalability, and limited application scope. And this segmentation method can be extended to the field of local bone density calculation to calculate the average CT value in the segmented area. Based on the CT value, through the automatic positioning analysis of muscle fat, quantitative calculation and analysis of bone mineral content can be performed, thereby achieving Bone density calculation measurement.
本领域技术人员可以理解,实现上述实施例中方法的全部或部分流程,可以通过计算机程序来指令相关的硬件来完成,所述的程序可存储于计算机可读存储介质中。其中,所述计算机可读存储介质为磁盘、光盘、只读存储记忆体或随机存储记忆体等。Those skilled in the art can understand that all or part of the process of implementing the method in the above embodiments can be completed by instructing relevant hardware through a computer program, and the program can be stored in a computer-readable storage medium. Wherein, the computer-readable storage medium is a magnetic disk, an optical disk, a read-only memory or a random access memory, etc.
以上所述,仅为本申请较佳的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到的变化或替换,都应涵盖在本申请的保护范围之内。The above are only preferred specific implementations of the present application, but the protection scope of the present application is not limited thereto. Any person familiar with the technical field can easily think of changes or modifications within the technical scope disclosed in the present application. Replacements shall be covered by the protection scope of this application.

Claims (10)

  1. 一种椎体亚区域分割方法,其特征在于,包括以下步骤:A vertebral body sub-region segmentation method, characterized by including the following steps:
    对获取得到的脊柱椎体影像数据进行预处理;Preprocess the obtained spinal vertebral body image data;
    将所述预处理后的影像数据输入预训练的神经网络模型,得到所述脊柱椎体对应的亚区域分割掩膜回归结果;其中,所述神经网络模型用于输出椎体上下骨性终板掩膜回归结果、皮质骨侧板掩膜回归结果、松质骨区域掩膜回归结果以及基于融合得到的椎体掩膜回归结果;The pre-processed image data is input into the pre-trained neural network model to obtain the sub-region segmentation mask regression results corresponding to the spinal vertebral body; wherein the neural network model is used to output the upper and lower bony endplates of the vertebral body. Mask regression results, cortical bone lateral plate mask regression results, cancellous bone area mask regression results, and vertebral body mask regression results based on fusion;
    通过形态学操作和连通性测试将所述椎体掩膜回归结果进行后处理,得到椎体上下骨性终板、皮质骨侧板、松质骨区域的三维分割掩膜,完成椎体亚区域分割;The vertebral body mask regression results are post-processed through morphological operations and connectivity tests to obtain a three-dimensional segmentation mask of the upper and lower bony endplates, cortical bone lateral plates, and cancellous bone areas of the vertebral body to complete the vertebral body sub-regions. segmentation;
    所述椎体亚区域包括椎体上下骨性终板、皮质骨侧板和松质骨区域。The vertebral body sub-region includes the upper and lower bony endplates of the vertebral body, cortical bone lateral plates and cancellous bone areas.
  2. 根据权利要求1所述的椎体亚区域分割方法,其特征在于,所述预训练的神经网络模型为分叉式多任务卷积神经网络结构,包括一个编码器、三个解码器和一个MAX融合单元:其中,The vertebral body sub-region segmentation method according to claim 1, characterized in that the pre-trained neural network model is a bifurcated multi-task convolutional neural network structure, including an encoder, three decoders and a MAX Fusion unit: where,
    所述编码器用于接收所述预处理后的影像数据得到特征图;The encoder is used to receive the preprocessed image data to obtain a feature map;
    三个所述解码器分别与所述编码器相连,分别用于基于所述特征图输出上下骨性终板掩膜回归结果、皮质骨侧板掩膜回归结果和椎体松质骨掩膜回归结果;The three decoders are respectively connected to the encoder, and are respectively used to output upper and lower bony endplate mask regression results, cortical bone lateral plate mask regression results and vertebral cancellous bone mask regression based on the feature map. result;
    所述MAX融合单元分别与三个所述编码器连接,用于输出椎体掩膜回归结果。The MAX fusion unit is connected to the three encoders respectively, and is used to output the vertebral mask regression results.
  3. 根据权利要求2所述的椎体亚区域分割方法,其特征在于,所述神经网络的预训练过程包括:The vertebral body sub-region segmentation method according to claim 2, characterized in that the pre-training process of the neural network includes:
    为预处理后的影像数据分别构建上下骨性终板掩膜标签、皮质骨侧板掩膜标签、椎体松质骨掩膜标签以及融合得到椎体掩膜标签;For the preprocessed image data, upper and lower bony endplate mask labels, cortical bone lateral plate mask labels, vertebral body cancellous bone mask labels are constructed respectively, and the vertebral body mask label is obtained by fusion;
    通过模拟不同场景下获取的影像特征对构建标签后的影像数据进行 增广得到训练样本,以扩充训练数据集;By simulating the image features obtained in different scenarios, the image data after constructing the label is augmented to obtain training samples to expand the training data set;
    计算所述神经网络模型的四个输出结果与对应标签的损失函数,为四个输出的损失函数分配权重得到所述神经网络模型总的损失函数;基于所述神经网络模型总的损失函数采用梯度下降法进行模型参数的迭代训练。Calculate the four output results of the neural network model and the loss function of the corresponding label, assign weights to the four output loss functions to obtain the total loss function of the neural network model; use the gradient based on the total loss function of the neural network model The descent method performs iterative training of model parameters.
  4. 根据权利要求3所述的椎体亚区域分割方法,其特征在于,使用二值掩膜法对训练样本图像中的体素进行标注,为所述训练样本图像构建得到所述上下骨性终板掩膜标签、皮质骨侧板掩膜标签、椎体松质骨掩膜标签;The vertebral body sub-region segmentation method according to claim 3, characterized in that the voxels in the training sample image are marked using a binary mask method, and the upper and lower bony endplates are constructed for the training sample image. Mask label, cortical bone lateral plate mask label, vertebral cancellous bone mask label;
    通过融合所述上下骨性终板掩膜标签、皮质骨侧板掩膜标签、椎体松质骨掩膜标签,构建得到所述椎体掩膜标签。The vertebral body mask label is constructed by fusing the upper and lower bony endplate mask labels, cortical bone lateral plate mask labels, and vertebral body cancellous bone mask labels.
    其中,构建所述终板掩膜标签,包括自定义终板区域的厚度和形状以得到终板掩膜标签。Wherein, constructing the endplate mask label includes customizing the thickness and shape of the endplate area to obtain the endplate mask label.
  5. 根据权利要求3或4所述的椎体亚区域分割方法,其特征在于,所述采用梯度下降法进行模型参数迭代,包括:The vertebral body sub-region segmentation method according to claim 3 or 4, characterized in that the gradient descent method is used to iterate the model parameters, including:
    采用交叉熵和Dice函数的加权作为每个输出结果的损失函数,其中,交叉熵函数为:The weight of cross entropy and Dice function is used as the loss function of each output result, where the cross entropy function is:
    Figure PCTCN2022143887-appb-100001
    Figure PCTCN2022143887-appb-100001
    Dice函数为:The Dice function is:
    Figure PCTCN2022143887-appb-100002
    Figure PCTCN2022143887-appb-100002
    其中,a为神经网络模型中一个输出结果,b为标注结果,i为体素位置索引,|n|为总体素数。Among them, a is an output result in the neural network model, b is the labeling result, i is the voxel position index, and |n| is the total prime number.
  6. 根据权利要求2-4任一项所述的椎体亚区域分割方法,其特征在于,所述MAX融合单元用于输出椎体掩膜回归结果,包括:基于所述预处 理后的脊柱椎体影像的每个体素位置索引,取三个解码器对应索引的最大值,对每个体素的所述三个解码器对应索引的最大值进行MAX融合,输出椎体掩膜回归结果。The vertebral body sub-region segmentation method according to any one of claims 2 to 4, characterized in that the MAX fusion unit is used to output a vertebral body mask regression result, including: based on the preprocessed spinal vertebral body For each voxel position index of the image, take the maximum value of the corresponding indexes of the three decoders, perform MAX fusion on the maximum value of the corresponding indexes of the three decoders for each voxel, and output the vertebral mask regression result.
  7. 根据权利要求1所述的椎体亚区域分割方法,其特征在于,所述对脊柱椎体影像数据进行预处理包括重采样处理和像素值归一化处理;其中,The vertebral body sub-region segmentation method according to claim 1, wherein the preprocessing of the spinal vertebral body image data includes resampling processing and pixel value normalization processing; wherein,
    所述重采样处理包括:将获取得到的脊柱椎体影像的空间分辨率除以预设的空间分辨率,得到所述影像数据在三个维度的重采样比率;根据所述重采样比率,采用线性插值方法得到重采样后的具有固定空间分辨率的影像数据。The resampling process includes: dividing the obtained spatial resolution of the spine vertebral body image by a preset spatial resolution to obtain a resampling ratio of the image data in three dimensions; according to the resampling ratio, using The linear interpolation method obtains resampled image data with a fixed spatial resolution.
    所述像素值归一化处理包括:将原始像素值范围[M,N]通过线性函数映射到预设的值域[P,Q];其中M为CT影像最小像素值,N为CT影像最大像素值,P为预设值域的下界,Q为预设值域的上界。The pixel value normalization process includes: mapping the original pixel value range [M, N] to the preset value range [P, Q] through a linear function; where M is the minimum pixel value of the CT image, and N is the maximum CT image. Pixel value, P is the lower bound of the preset value range, Q is the upper bound of the preset value range.
  8. 根据权利要求1所述的椎体亚区域分割方法,其特征在于,所述通过形态学操作和连通性测试将所述掩膜回归结果进行后处理,包括:使用卷积核为3*3*3的三维形态学开运算,通过滑动窗口法对每个体素周围的邻域进行形态学腐蚀和膨胀,去除掩膜中的细颗粒噪声,以去除包括椎体骨赘在内的非正常骨质结构区域。The vertebral body sub-region segmentation method according to claim 1, wherein the post-processing of the mask regression results through morphological operations and connectivity testing includes: using a convolution kernel of 3*3* 3's three-dimensional morphological opening operation uses a sliding window method to morphologically erode and expand the neighborhood around each voxel to remove fine-grained noise in the mask to remove abnormal bone including vertebral osteophytes. Structural area.
    使用skimage工具包对上下骨性终板掩膜进行连通性计算,保留结果中最大的两块连通区域,分别对应上、下两块骨性终板;皮质骨侧板掩膜与椎体松质骨掩膜只保留最大的一块连通区。Use the skimage toolkit to perform connectivity calculations on the upper and lower bony endplate masks, and retain the two largest connected areas in the results, corresponding to the upper and lower bony endplates respectively; the cortical bone lateral plate mask and the vertebral cancellous The bone mask retains only the largest connected area.
  9. 一种椎体亚区域分割装置,其特征在于,包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现如权利要求1至8中任一项所述的椎体亚区域分割方法。A vertebral body sub-region segmentation device, characterized in that it includes: a memory, a processor, and a computer program stored in the memory and executable on the processor, which is implemented when the processor executes the computer program The vertebral body sub-region segmentation method according to any one of claims 1 to 8.
  10. 一种计算机可读存储介质,其特征在于,所述存储介质上存储有计算机程序,所述计算机程序可被处理器执行,实现权利要求1-8中任一项所述的椎体亚区域分割方法。A computer-readable storage medium, characterized in that a computer program is stored on the storage medium, and the computer program can be executed by a processor to implement the vertebral body sub-region segmentation described in any one of claims 1-8. method.
PCT/CN2022/143887 2022-06-28 2022-12-30 Vertebral body sub-region segmentation method and apparatus, and storage medium WO2024001140A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210744141.9 2022-06-28
CN202210744141.9A CN114998301B (en) 2022-06-28 2022-06-28 Vertebral body sub-region segmentation method and device and storage medium

Publications (1)

Publication Number Publication Date
WO2024001140A1 true WO2024001140A1 (en) 2024-01-04

Family

ID=83037349

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/143887 WO2024001140A1 (en) 2022-06-28 2022-12-30 Vertebral body sub-region segmentation method and apparatus, and storage medium

Country Status (2)

Country Link
CN (1) CN114998301B (en)
WO (1) WO2024001140A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114998301B (en) * 2022-06-28 2022-11-29 北京大学第三医院(北京大学第三临床医学院) Vertebral body sub-region segmentation method and device and storage medium
CN115797698B (en) * 2022-12-13 2023-07-14 北京大学第三医院(北京大学第三临床医学院) Bone sagittal balance parameter detection system
CN115880319B (en) * 2023-02-16 2023-07-21 博志生物科技(深圳)有限公司 Automatic segmentation method and device for vertebral endplate and adjacent cancellous bone
CN117058172A (en) * 2023-08-24 2023-11-14 吉林大学 CT image multi-region segmentation method and device, electronic equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109800631A (en) * 2018-12-07 2019-05-24 天津大学 Fluorescence-encoded micro-beads image detecting method based on masked areas convolutional neural networks
CN111462086A (en) * 2020-03-31 2020-07-28 北京推想科技有限公司 Image segmentation method and device and training method and device of neural network model
WO2021189843A1 (en) * 2020-09-22 2021-09-30 平安科技(深圳)有限公司 Vertebra positioning method and apparatus for ct image, and device and medium
CN114240848A (en) * 2021-11-24 2022-03-25 慧影医疗科技(北京)股份有限公司 Bone mineral density measuring method, system, storage medium and electronic equipment
CN114998301A (en) * 2022-06-28 2022-09-02 北京大学第三医院(北京大学第三临床医学院) Vertebral body sub-region segmentation method and device and storage medium

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2014158628A (en) * 2013-02-20 2014-09-04 Univ Of Tokushima Image processor, image processing method, control program, and recording medium
US10262424B2 (en) * 2015-12-18 2019-04-16 The Johns Hopkins University Method for deformable 3D-2D registration using multiple locally rigid registrations
CN108510507A (en) * 2018-03-27 2018-09-07 哈尔滨理工大学 A kind of 3D vertebra CT image active profile dividing methods of diffusion-weighted random forest
CA3095408C (en) * 2018-03-30 2023-04-11 Perkinelmer Health Sciences, Inc. Systems and methods for automated detection and segmentation of vertebral centrum(s) in 3d images
CN108711187B (en) * 2018-08-03 2021-12-31 华侨大学 Method for establishing human lumbar vertebra three-dimensional simulation model by registering and fusing CT (computed tomography) and MRI (magnetic resonance imaging) signals
EP3751516B1 (en) * 2019-06-11 2023-06-28 Holo Surgical Inc. Autonomous multidimensional segmentation of anatomical structures on three-dimensional medical imaging
CN113689454A (en) * 2020-05-18 2021-11-23 傅豪 3D CT vertebral body segmentation algorithm based on convolutional neural network
CN113327235B (en) * 2021-06-04 2022-07-01 杭州健培科技有限公司 Image processing method and device based on CT image sarcopenia evaluation
CN113515875B (en) * 2021-07-20 2023-12-29 北京工业大学 Bone biomechanics modeling method, system and device based on multi-modal image

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109800631A (en) * 2018-12-07 2019-05-24 天津大学 Fluorescence-encoded micro-beads image detecting method based on masked areas convolutional neural networks
CN111462086A (en) * 2020-03-31 2020-07-28 北京推想科技有限公司 Image segmentation method and device and training method and device of neural network model
WO2021189843A1 (en) * 2020-09-22 2021-09-30 平安科技(深圳)有限公司 Vertebra positioning method and apparatus for ct image, and device and medium
CN114240848A (en) * 2021-11-24 2022-03-25 慧影医疗科技(北京)股份有限公司 Bone mineral density measuring method, system, storage medium and electronic equipment
CN114998301A (en) * 2022-06-28 2022-09-02 北京大学第三医院(北京大学第三临床医学院) Vertebral body sub-region segmentation method and device and storage medium

Also Published As

Publication number Publication date
CN114998301A (en) 2022-09-02
CN114998301B (en) 2022-11-29

Similar Documents

Publication Publication Date Title
WO2024001140A1 (en) Vertebral body sub-region segmentation method and apparatus, and storage medium
CN113506334B (en) Multi-mode medical image fusion method and system based on deep learning
CN113450294A (en) Multi-modal medical image registration and fusion method and device and electronic equipment
CN108309334B (en) Data processing method of spine X-ray image
CN112037200A (en) Method for automatically identifying anatomical features and reconstructing model in medical image
CN111524109A (en) Head medical image scoring method and device, electronic equipment and storage medium
CN116258726A (en) Temporal-mandibular joint MRI image important structure segmentation method based on deep learning
CN113838048B (en) Center positioning and ligament length calculating method for pre-operation dead center of cruciate ligament
CN113436128B (en) Dual-discriminator multi-mode MR image fusion method, system and terminal
CN106780492A (en) A kind of extraction method of key frame of CT pelvises image
Hou et al. An MRI image automatic diagnosis model for lumbar disc herniation using semi-supervised learning
CN111640127A (en) Accurate clinical diagnosis navigation method for orthopedics department
CN111145289A (en) Extraction method and device of pelvis three-dimensional data
CN115252233B (en) Automatic planning method for acetabular cup in total hip arthroplasty based on deep learning
CN114663363B (en) Deep learning-based hip joint medical image processing method and device
Liu et al. Prior-based 3D U-Net: A model for knee-cartilage segmentation in MRI images
Kurochka et al. An algorithm of segmentation of a human spine X-ray image with the help of Mask R-CNN neural network for the purpose of vertebrae localization
CN114359308A (en) Aortic dissection method based on edge response and nonlinear loss
EP3905129A1 (en) Method for identifying bone images
Amara et al. Augmented reality for medical practice: a comparative study of deep learning models for ct-scan segmentation
Kumar et al. Medical image fusion based on type-2 fuzzy sets with teaching learning based optimization
Aal et al. Survey: Automatic recognition of musculoskeletal disorders from radiographs
Huang et al. 3d hand bones and tissue estimation from a single 2d x-ray image via a two-stream deep neural network
Lin et al. ARF-NET: Method of spinal segmentation in deformed spine radiographs
Lewis Improving Gross Anatomy: Enhancing the Information Content of Cadaveric CT Scans and Models

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22949215

Country of ref document: EP

Kind code of ref document: A1