CN113902674A - Medical image segmentation method and electronic equipment - Google Patents
- Publication number
- CN113902674A CN113902674A CN202111035523.6A CN202111035523A CN113902674A CN 113902674 A CN113902674 A CN 113902674A CN 202111035523 A CN202111035523 A CN 202111035523A CN 113902674 A CN113902674 A CN 113902674A
- Authority
- CN
- China
- Prior art keywords
- medical image
- image data
- training
- pyramid
- scale
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/0002—Inspection of images, e.g. flaw detection
- G06T7/0012—Biomedical image inspection
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/11—Region-based segmentation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/136—Segmentation; Edge detection involving thresholding
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10072—Tomographic images
- G06T2207/10081—Computed x-ray tomography [CT]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
Abstract
The present disclosure provides a medical image segmentation method and an electronic device. The method comprises the following steps: acquiring medical image data and preprocessing it to obtain processed medical image data; inputting the processed medical image data into a medical image segmentation model; in the medical image segmentation model, obtaining a pyramid bottom feature f1 of the medical image from the processed medical image data; obtaining a prediction score p from the pyramid bottom feature f1; and obtaining a segmentation result P of the medical image according to the prediction score p. In the embodiments of this specification, the medical image segmentation result P is ultimately obtained from the medical image data and a pre-trained medical image segmentation model.
Description
Technical Field
The disclosure relates to the technical field of image semantic segmentation in computer vision, and in particular to a medical image segmentation method.
Background
With the development of science and technology, medical image segmentation has gradually moved from earlier techniques such as the Otsu threshold method, the watershed method, GraphCut, and active-contour-based methods to deep learning methods with better performance and stronger generalization. However, the field still faces several difficulties. First, the amount of data available for training a medical image segmentation model is often insufficient. Second, the targets to be segmented in medical images vary greatly in scale, making them difficult for a model to identify accurately. Finally, most target organs in existing medical images have blurred boundaries and are easily confused with organs in nearby regions, which further complicates recognition.
Disclosure of Invention
In view of the above, an object of the present disclosure is to provide a medical image segmentation method and an electronic device.
In view of the above, the present disclosure provides a medical image segmentation method, which includes:
acquiring medical image data;
preprocessing the medical image data to obtain processed medical image data;
inputting the processed medical image data into a medical image segmentation model;
in the medical image segmentation model, obtaining a pyramid bottom feature f1 of the medical image from the processed medical image data, and obtaining a prediction score p from the pyramid bottom feature f1;
and obtaining a segmentation result P of the medical image according to the prediction score p.
Optionally, the training process of the medical image segmentation model includes:
acquiring medical image data for training;
acquiring the number c of classes to be segmented of medical image data for training;
acquiring a real Mask corresponding to the medical image data for training, wherein the real Mask is a result obtained by labeling the medical image data for training pixel by pixel in advance;
according to the medical image data for training, carrying out thresholding, normalization and data enhancement processing on the medical image data for training to obtain the processed medical image data for training;
performing initial multi-scale feature extraction on the processed medical image data for training to obtain a multi-scale feature pyramid of the processed medical image data for training;
performing multi-scale feature fusion on the multi-scale feature pyramid of the processed medical image data for training to obtain an enhanced feature pyramid for training;
obtaining the bottom feature f of the pyramid for training according to the enhanced feature pyramid for training1′;
Extracting weights of the training enhanced feature pyramid aiming at the number c of all the classes to be segmented to obtain scale self-adaptive attention class weights;
obtaining a multi-scale self-adaptive segmentation loss function according to the scale self-adaptive attention category weight;
according to the pyramid tower bottom characteristic f for training1', obtaining a class segmentation score p' and a target feature;
obtaining an intra-class consistency constraint loss function and an inter-class difference constraint loss function according to the target characteristics;
obtaining a segmentation loss function according to the class segmentation fraction p' and a real Mask corresponding to the medical image data for training;
and training to obtain the medical image segmentation model according to the multi-scale self-adaptive segmentation loss function, the intra-class consistency constraint loss function, the inter-class difference constraint loss function and the segmentation loss function.
Optionally, obtaining the scale-adaptive attention category weight includes:
according to a certain characteristic f in the enhanced characteristic pyramid for trainingi' after convolution operation and average pooling, the data is inputted to a first full-connection network FC1Activated using the ReLU activation function and re-input into the second fully connected network FC2Obtaining the scale self-adaptive attention category weight w by using Sigmoid activation function activationi。
Optionally, obtaining the target feature includes:
according to the pyramid tower bottom characteristic f for training1' after 2d convolution operation and self-adaptive pooling are carried out on the target feature f, flattening operation and dimension replacement operation are carried out, and then 1d convolution operation is carried out to obtain the target feature f1 c′。
Optionally, the processed medical image data is obtained by thresholding and normalizing the medical image data.
Optionally, obtaining the pyramid bottom feature f1 of the medical image specifically comprises:
extracting initial multi-scale features of the processed medical image data to obtain a multi-scale feature pyramid;
performing multi-scale feature fusion on the multi-scale feature pyramid to obtain an enhanced feature pyramid;
obtaining the pyramid bottom feature f1 of the medical image according to the enhanced feature pyramid.
Optionally, the prediction score p is obtained by performing a 2d convolution operation on the pyramid bottom feature f1 of the medical image.
Optionally, the size of the prediction score p is c × H × W, where c is the number of classes to be segmented of the medical image data for training, H is the length of the prediction score p, and W is the width of the prediction score p.
In the above, p_lij, with l ∈ [1, c], i ∈ [1, H], j ∈ [1, W], is the prediction score of the l-th class to be segmented in the prediction score vector p_ij at pixel position (i, j) in the prediction score p, and c is the number of classes to be segmented of the medical image data for training.
Based on the same inventive concept, the present disclosure also provides an electronic device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor executes the computer program to implement the medical image segmentation method according to any one of the above aspects.
As can be seen from the foregoing, in the medical image segmentation method provided by the present disclosure, medical image data is acquired and preprocessed, then input into a pre-trained medical image segmentation model to obtain a prediction score p, and a segmentation result P is obtained from the prediction score p. During model training, the method additionally applies data enhancement preprocessing to the medical image data, which greatly increases the amount of data entering the model and benefits model learning. Initial multi-scale feature extraction and multi-scale feature fusion during training improve the trained model's ability to segment targets at different scales. Target feature enhancement during training gives the processed medical image data small intra-class differences and large inter-class differences, which also benefits learning. Together, these training measures provide sufficient data during training, and the trained model has strong segmentation capability and strong classification performance.
Drawings
In order to more clearly illustrate the technical solutions of the present disclosure or the related art, the drawings needed in the description of the embodiments or the related art are briefly introduced below. Obviously, the drawings described below are only embodiments of the present disclosure; those skilled in the art can obtain other drawings from them without creative effort.
Fig. 1 is a flowchart of a medical image segmentation method according to an embodiment of the present disclosure;
fig. 2 is a schematic diagram illustrating a training process of a medical image segmentation model according to an embodiment of the disclosure;
fig. 3 is an electronic device structure according to an embodiment of the disclosure.
Detailed Description
For the purpose of promoting a better understanding of the objects, aspects and advantages of the present disclosure, reference is made to the following detailed description taken in conjunction with the accompanying drawings.
It is to be noted that technical terms or scientific terms used in the embodiments of the present disclosure should have a general meaning as understood by those having ordinary skill in the art to which the present disclosure belongs, unless otherwise defined. The use of "first," "second," and similar terms in the embodiments of the disclosure is not intended to indicate any order, quantity, or importance, but rather to distinguish one element from another. The word "comprising" or "comprises", and the like, means that the element or item listed before the word covers the element or item listed after the word and its equivalents, but does not exclude other elements or items. The terms "connected" or "coupled" and the like are not restricted to physical or mechanical connections, but may include electrical connections, whether direct or indirect. "upper", "lower", "left", "right", and the like are used merely to indicate relative positional relationships, and when the absolute position of the object being described is changed, the relative positional relationships may also be changed accordingly.
As described in the Background section, medical image segmentation methods in the related art suffer from insufficient training data for the segmentation model, large scale differences among the targets to be segmented, and easy confusion between target organs and nearby organs. These issues make medical image segmentation inconvenient, and the segmentation results obtained are often unsatisfactory.
In view of the above, an embodiment of the present disclosure provides a medical image segmentation method that obtains a segmentation result P of a medical image based on a medical image segmentation model pre-trained on medical image data, so that the segmentation result is more accurate.
Hereinafter, the technical means of the embodiments of the present disclosure will be described in detail by specific examples.
Referring to fig. 1, a medical image segmentation method according to an embodiment of the present disclosure includes the following steps:
step S101, acquiring medical image data.
In this step, the medical image data is in a currently common format such as DICOM or NIfTI; these files contain patient information, slice information, image information, and so on. Therefore, the first step of the medical image segmentation method is to acquire the medical image data, i.e., to read the image data needed by this embodiment from the medical image files. Data in these formats can be read with the corresponding Python libraries, such as nibabel and pydicom.
Step S103, preprocessing the medical image data to obtain processed medical image data.
In this step, the medical image data acquired in step S101 covers a wide numerical range. If it were normalized directly, the contrast between the targets to be segmented could be insufficient in the subsequent segmentation, or some targets could become unidentifiable, causing problems in the segmentation result. The threshold in thresholding is typically set by the physician based on clinical experience. Let the medical image data acquired in step S101 be O, the thresholding lower bound be min, the thresholding upper bound be max, and the thresholded data be C; the thresholding may then be represented as clipping O to the range [min, max], i.e., C = min(max(O, min), max).
after thresholding the medical image data, because the difference between the medical image data is large, the data ranges corresponding to different targets may have large difference after thresholding, and are not easy to process, and in order to process the acquired medical image data more uniformly, the embodiment performs normalization processing on the medical image data, so that the medical image data is easy to convert into common image formats, such as PNG, JPEG, and the like. In this embodiment, a normalization scheme is adopted to normalize the maximum and minimum values, and C is recordedminIs the minimum value of thresholded data, CmaxFor the maximum value of the thresholded data, the normalized data is N, then the normalization can be expressed as:
and performing thresholding and normalization processing on the medical image data to obtain processed medical image data.
Step S105, inputting the processed medical image data into a medical image segmentation model.
In this step, the medical image segmentation model is trained in advance, and a specific training process is described in detail in the embodiments described later.
Step S107: in the medical image segmentation model, obtain the pyramid bottom feature f1 of the medical image from the processed medical image data, and obtain a prediction score p from f1. In this step, the processed medical image data undergoes initial multi-scale feature extraction in the model to obtain a multi-scale feature pyramid; multi-scale feature fusion is performed on the pyramid to obtain an enhanced feature pyramid; the pyramid bottom feature f1 is taken from the enhanced feature pyramid; and the prediction score p is obtained from f1.
In this embodiment, the pyramid bottom feature f1 passes through a 2d convolution to obtain a prediction score map, which is the prediction score p, of size c × H × W, where c is the number of classes to be segmented of the medical image data for training, H is the length of p, and W is the width of p.
Step S109: obtain the segmentation result P of the medical image according to the prediction score p.
In the present embodiment, the input to the medical image segmentation model is the processed medical image data, and the model outputs the corresponding prediction score p; the segmentation result P of the medical image is computed outside the model. The calculation formula is P_ij = argmax_{l ∈ [1, c]} p_lij, where p_lij, with l ∈ [1, c], i ∈ [1, H], j ∈ [1, W], is the prediction score of the l-th class to be segmented in the prediction score vector p_ij at pixel position (i, j) in the prediction score p, and c is the number of classes to be segmented of the medical image data for training.
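Assuming the calculation is a per-pixel argmax over the class axis, the postprocessing can be sketched as follows (a minimal NumPy illustration with a synthetic score map):

```python
import numpy as np

def scores_to_segmentation(p: np.ndarray) -> np.ndarray:
    """p has shape (c, H, W); return P of shape (H, W) with P_ij = argmax_l p_lij."""
    return np.argmax(p, axis=0)

# Synthetic prediction score map: c = 3 classes, 2x2 image
p = np.array([
    [[0.90, 0.10], [0.20, 0.30]],  # class 0 scores
    [[0.05, 0.70], [0.30, 0.30]],  # class 1 scores
    [[0.05, 0.20], [0.50, 0.40]],  # class 2 scores
])
P = scores_to_segmentation(p)
print(P)  # [[0 1]
          #  [2 2]]
```

Each entry of P is the index of the class with the highest score at that pixel.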
Take abdominal CT as an example, segmenting the large intestine, small intestine, and cecum. The medical image data is read from the abdominal CT with pydicom, thresholded and normalized to obtain processed abdominal CT data, and input into the pre-trained medical image segmentation model. In the model, initial multi-scale feature extraction produces a multi-scale feature pyramid of the abdominal CT; multi-scale feature fusion produces an enhanced feature pyramid; the pyramid bottom feature is taken and passed through a 2d convolution; and the model outputs prediction scores for the large intestine, small intestine, and cecum. These scores are converted into segmentation results with the calculation formula in step S109. The segmentation results are index values of the large intestine, small intestine, and cecum in the abdominal CT; to visualize their extents more intuitively, RGB color values are usually assigned to the index values, finally presenting the extents of the large intestine, small intestine, and cecum in different colors on one image.
In some embodiments, the medical image segmentation model is obtained by pre-training, and the present embodiment will describe the training process of the medical image segmentation model in detail. As shown in fig. 2, the training process of the model includes the following steps:
in step S201, medical image data for training is acquired.
In this step, the medical image data is in a currently common format such as DICOM or NIfTI; these files contain patient information, slice information, image information, and so on. Therefore, the first step is to acquire the medical image data, i.e., to read the image data needed by this embodiment from the medical image files. Data in these formats can be read with the corresponding Python libraries, such as nibabel and pydicom. During training, a large amount of training data is input and divided into batches of size B; each iteration forwards one batch and back-propagates the error.
Step S203, acquiring the number c of classes to be segmented of the medical image data for training.
The number c of classes to be segmented of the medical image data for training is the number of classes to be segmented in the medical image data at inference time. If c needs to change, the entire model does not need to be retrained; transfer training can restart from the step of extracting weights for the c classes from the enhanced feature pyramid for training, i.e., the step that obtains the scale-adaptive attention class weights.
Step S205: acquire the real Mask corresponding to the medical image data for training, which is a result obtained by labeling the medical image data for training pixel by pixel in advance.
In this embodiment, the real Mask corresponding to the medical image data may be a result labeled in advance by a physician in professional labeling software; if the data comes from an open-source dataset, the dataset publisher is responsible for collation and labeling.
And step S207, performing thresholding, normalization and data enhancement processing on the medical image data for training according to the medical image data for training to obtain the processed medical image data for training.
In this embodiment, the thresholding and normalization of the medical image data for training are the same as in the inference process and are not repeated here. The quantity and diversity of medical image data often cannot meet the needs of deep learning, so the data needs to be expanded to improve model accuracy. This embodiment adopts data enhancement operations such as random scale transformation, random cropping, random horizontal flipping, and normalization; one or more of these operations can be selected in practice.
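Two of the enhancement operations named above, random horizontal flipping and random cropping, can be sketched as follows (a minimal NumPy illustration; the crop size and probabilities are hypothetical, and the same transform is applied to the image and its Mask so labels stay aligned):

```python
import numpy as np

rng = np.random.default_rng(0)

def augment(image: np.ndarray, mask: np.ndarray, crop: int):
    """Random horizontal flip + random crop, applied identically to image and mask."""
    if rng.random() < 0.5:
        # Horizontal flip: reverse the width axis of both arrays
        image, mask = image[:, ::-1], mask[:, ::-1]
    h, w = image.shape
    top = rng.integers(0, h - crop + 1)
    left = rng.integers(0, w - crop + 1)
    return (image[top:top + crop, left:left + crop],
            mask[top:top + crop, left:left + crop])

img = np.arange(64, dtype=float).reshape(8, 8)
msk = (img > 30).astype(int)  # stand-in for a pixel-wise real Mask
a_img, a_msk = augment(img, msk, crop=4)
```

Because the flip and crop are shared, the augmented mask still matches the augmented image pixel for pixel.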
Step S209, performing initial multi-scale feature extraction on the processed medical image data for training to obtain a multi-scale feature pyramid of the processed medical image data for training.
In this step, initial multi-scale feature extraction improves the model's ability to segment targets of different scales. The method uses a classical convolutional neural network for this step: by stacking convolution modules, the network progressively extracts high-level features, forming feature maps at different scales along the way and yielding a multi-scale feature pyramid structure.
In this embodiment, a High-Resolution Network (HRNet) performs the initial multi-scale feature extraction, producing the multi-scale feature pyramid of the processed medical image data for training. The feature maps from bottom to top in this pyramid are denoted s1, s2, s3, s4, with scales 1/4, 1/8, 1/16, and 1/32 of the original image; together, these four multi-scale features form the multi-scale feature pyramid. At this point the pyramid's information is insufficient for the medical image segmentation task of this disclosure, because features at different scales have their own strengths and weaknesses: high-resolution feature maps carry more accurate and richer position information but weaker semantic information, while low-resolution feature maps carry stronger semantic information but lose more position information. This embodiment therefore applies multi-scale feature fusion to the pyramid so that the feature maps at different scales complement each other.
Step S211 performs multi-scale feature fusion on the multi-scale feature pyramid of the processed medical image data for training to obtain an enhanced feature pyramid for training.
In this embodiment, a Feature Pyramid Network (FPN) structure performs the multi-scale feature fusion; the fused multi-scale features are denoted f1′, f2′, f3′, f4′, each at 1/4 the scale of the original image. The multi-scale feature pyramid thus undergoes multi-scale feature fusion to yield the enhanced feature pyramid, in which each level has uniformly good classification capability while retaining or recovering a certain amount of position information.
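The top-down fusion of an FPN can be sketched as follows (a simplified NumPy illustration using nearest-neighbor upsampling and element-wise addition; a real FPN also applies 1x1 lateral and 3x3 output convolutions, omitted here):

```python
import numpy as np

def upsample2x(x: np.ndarray) -> np.ndarray:
    """Nearest-neighbor 2x upsampling of a (C, H, W) feature map."""
    return x.repeat(2, axis=1).repeat(2, axis=2)

def fpn_fuse(pyramid):
    """pyramid: [s1, s2, s3, s4] from fine to coarse, each (C, H, W).
    Returns fused features [f1', f2', f3', f4'] at the input resolutions."""
    fused = [pyramid[-1]]                       # start from the coarsest level
    for s in reversed(pyramid[:-1]):            # top-down pathway
        fused.append(s + upsample2x(fused[-1])) # merge with the level below
    return list(reversed(fused))                # back to fine-to-coarse order

C = 8
pyr = [np.ones((C, 32 // 2**i, 32 // 2**i)) for i in range(4)]  # 32, 16, 8, 4
enh = fpn_fuse(pyr)
```

Each fused level thus accumulates semantic information propagated down from the coarser levels while keeping its own resolution.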
Step S213, obtaining the bottom feature f of the training pyramid according to the enhanced feature pyramid for training1′。
In this embodiment, the bottom feature of the enhanced feature pyramid has the best localization capability, and because it has undergone multi-scale fusion it also has good classification capability, so this embodiment uses the pyramid bottom feature f1′ as the final segmentation feature.
Step S215: extract, from the enhanced feature pyramid for training, weights for each of the c classes to be segmented to obtain the scale-adaptive attention class weights.
The scale of the targets to be segmented varies greatly: different targets on the same slice can differ widely in scale, and the same target can change scale considerably across slices. Meanwhile, features at different scales detect targets of different sizes with different ability: larger-scale feature maps lose less position information and detect small targets better, while smaller-scale feature maps are better suited to large targets. Feature maps of appropriate scales should therefore be used to detect and segment targets of different scales. To this end, the method uses a lightweight small network to extract, for each feature map scale, a weight for each target to be segmented. Each scale's feature map adaptively generates a weight suited to that scale, which is multiplied with each target to be segmented in the feature map of the corresponding scale. In this way, feature maps at different scales adaptively emphasize particular classes to be segmented, each playing to its strengths, which improves the segmentation accuracy of the network.
In the present embodiment, weight generation uses a Squeeze-and-Excitation operation as in SENet. The adaptive weights of the feature maps of different scales are generated independently, but the procedure is identical, so this embodiment describes weight generation in detail only for a feature map of one scale:
For a feature fi′ of a certain scale, i ∈ {1, 2, 3, 4}, of size ci × h × w, where ci is the number of channels of fi′, h is the height of fi′ and w is the width of fi′, the scale-adaptive attention class weight wi may be expressed as:

wi = Sigmoid(FC2(ReLU(FC1(AvgPool(Conv(fi′))))))
according to a certain characteristic f in the enhanced characteristic pyramid for trainingi' after convolution operation and average pooling, the data is inputted to a first full-connection network FC1Activated using the ReLU activation function and re-input into the second fully connected network FC2Obtaining the scale self-adaptive attention category weight w by using Sigmoid activation function activationi。
Step S217: obtaining a multi-scale adaptive segmentation loss function according to the scale-adaptive attention class weights.
The convolution network corresponding to the feature map of each scale is optimized independently, which improves the classification capability of the medical image segmentation model for segmentation targets of different scales; finally, each scale's feature map has a corresponding convolution network suited to that scale, and scale-adaptive segmentation can be completed.
In this embodiment, the feature map fi′ of each scale is convolved to obtain a feature map with channel number c, i.e., the segmentation scores, denoted oi, where c is the number of classes to be segmented. The error Li between the segmentation scores of all target classes to be segmented in oi (corresponding to fi′) and the real Mask corresponding to the medical image data for training input into the model is calculated as:

Li = Σj=1..c l(oij, Mj)
where c is the number of target classes to be segmented, oij is the segmentation score of oi on target class j, Mj is the real Mask corresponding to the medical image data for training for target class j, and l(x, y) denotes a loss function between x and y; this embodiment uses a cross-entropy loss function, here the loss between oij and Mj.
On this basis, the adaptive attention class weight wi corresponding to each scale's feature map, obtained in the previous step, must also be taken into account. The error Li for each scale's feature map is then calculated as:

Li = Σj=1..c wij · l(oij, Mj)

where wij is the weight of the scale corresponding to fi′ on target class j to be segmented.
Finally, the multi-scale adaptive segmentation loss function Li is obtained.
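The weighted per-scale loss above can be sketched as follows. This is a hedged numpy illustration under assumptions: the per-class loss l(·,·) is taken as a pixel-averaged binary cross-entropy on sigmoid score maps, and all shapes and names are illustrative, not the patent's.

```python
import numpy as np

def bce(score, mask, eps=1e-7):
    # pixel-wise binary cross-entropy l(o_ij, M_j), averaged over pixels
    p = np.clip(score, eps, 1.0 - eps)
    return float(-np.mean(mask * np.log(p) + (1.0 - mask) * np.log(1.0 - p)))

def scale_loss(scores, masks, weights):
    # L_i = sum_j w_ij * l(o_ij, M_j) over the c classes of one scale
    c = scores.shape[0]
    return sum(weights[j] * bce(scores[j], masks[j]) for j in range(c))

c, H, W = 2, 4, 4
masks = np.zeros((c, H, W)); masks[0, :2, :] = 1.0   # toy real Mask
good = np.where(masks > 0.5, 0.9, 0.1)   # scores that match the masks
bad = 1.0 - good                          # scores that contradict the masks
w = np.ones(c)                            # attention class weights w_ij
L_good = scale_loss(good, masks, w)
L_bad = scale_loss(bad, masks, w)
```

As expected, scores that agree with the real Mask produce a much smaller Li than scores that contradict it.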
Step S219: obtaining the class segmentation score p′ and the target features according to the pyramid bottom feature f1′ for training.
In the present embodiment, the feature map obtained after a 2d convolution of the pyramid bottom feature f1′ for training is referred to as the class segmentation score p′. Target feature extraction proceeds as follows: the pyramid bottom feature f1′ for training undergoes a 2d convolution operation and adaptive pooling, then a flattening operation and a dimension permutation operation, and finally a 1d convolution operation to obtain the target feature f1c′.
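The pool → flatten → permute → 1d-convolution pipeline can be sketched at the shape level. This is an assumption-laden sketch, not the patent's network: the 2d convolution is omitted, adaptive pooling assumes divisible spatial sizes, and the kernel-size-1 1d convolution is written as a per-position linear map `proj`.

```python
import numpy as np

rng = np.random.default_rng(1)

def adaptive_avg_pool(f, out=4):
    # average-pool each channel of f (c1, h, w) to an (out, out) grid;
    # for brevity this sketch assumes h and w are divisible by `out`
    c1, h, w = f.shape
    return f.reshape(c1, out, h // out, out, w // out).mean(axis=(2, 4))

def target_features(f1, proj):
    pooled = adaptive_avg_pool(f1)                # (c1, 4, 4)
    flat = pooled.reshape(pooled.shape[0], -1)    # flatten: (c1, 16)
    seq = flat.T                                  # permute: (16, c1)
    return seq @ proj.T                           # kernel-size-1 1d conv: (16, c)

c1, c = 8, 3
f1 = rng.standard_normal((c1, 16, 16))            # pyramid bottom feature
proj = rng.standard_normal((c, c1))               # 1d-conv weights
feats = target_features(f1, proj)
```

The permutation moves spatial positions to the leading axis so that the final 1d convolution mixes channels independently at every position.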
Step S221: obtaining an intra-class consistency constraint loss function and an inter-class difference constraint loss function according to the target features.
When training the medical image segmentation model, the embodiments of the present disclosure expect that, among the medical image data of the same batch entering the model, features of the same class differ little while features of different classes differ greatly, which makes the model easier to learn. To reduce the differences within a class, the difference between same-class features can be measured by similarity: by computing the gap between the similarity of same-class features and 1, an unsupervised loss function is introduced as a new regularization term that makes the consistency between same-class features as large as possible. Similarly, to reduce the consistency of features across classes, a new regularization term is introduced by computing the gap between the similarity of different-class features and 0, which makes the difference between features of different classes as large as possible.
In this embodiment, the different medical images are different medical image pictures in the same batch. For a certain target class k to be segmented, k ∈ {1, ..., c}, where c is the number of target classes to be segmented, let B′ be the number of medical image data in each batch entering the medical image segmentation model during training, and let each medical image be Bq, q ∈ {1, ..., B′}; the feature extracted from medical image Bq for target class k is denoted fqk. The similarity between features is measured by cosine similarity, denoted S(x, y) and calculated as:

S(x, y) = (x · y) / (‖x‖ · ‖y‖)

For two images Bm and Bn of the same batch, m, n ∈ {1, ..., B′}, the similarity S(fmk, fnk) between their class-k features should be close to 1. l(x, y) denotes a function measuring the dissimilarity between x and y; here x is S(fmk, fnk), y is 1, and the absolute value loss is used:

l(x, y) = |x − y|

The intra-class feature consistency constraint loss Lcom is then calculated as:

Lcom = Σk=1..c Σm<n l(S(fmk, fnk), 1)

The intra-class feature consistency constraint loss Lcom is obtained according to this formula.
The inter-class feature difference constraint loss Ldiv is computed in the same way as the intra-class feature consistency constraint loss Lcom, except that the similarity between features of different classes should be as close to 0 as possible. The inter-class feature difference constraint loss Ldiv can therefore be expressed as:

Ldiv = Σk≠k′ Σm,n l(S(fmk, fnk′), 0)

The inter-class feature difference constraint loss Ldiv is obtained according to this formula.
Step S223: obtaining a segmentation loss function according to the class segmentation score p′ and the real Mask corresponding to the medical image data for training.
Since the pyramid bottom feature f1′ has the best positioning capability and an improved classification capability, the method of the present disclosure uses the real Mask corresponding to the medical image data for training to compute a separate segmentation loss for it, enhancing the precision of the medical image segmentation model.
In this embodiment, the class segmentation score p′ obtained from the pyramid bottom feature f1′ for training in step S219 produces a segmentation loss Lseg with respect to the corresponding real Mask of the medical image data for training, here denoted M. In the present embodiment, the segmentation loss Lseg is calculated with a simple cross-entropy loss function.
Step S225: training the medical image segmentation model according to the multi-scale adaptive segmentation loss function, the intra-class consistency constraint loss function, the inter-class difference constraint loss function and the segmentation loss function.
In this embodiment, the total loss L is calculated as:

L = Σi=1..s (αi · Li + βi · Lcom + γi · Ldiv + λi · Lseg)

where s is the number of multi-scale feature maps, 4 in this embodiment; αi is the weight of the scale-adaptive segmentation loss produced by the i-th scale, βi is the weight of the intra-class feature consistency constraint loss Lcom produced by the i-th scale, γi is the weight of the inter-class feature difference constraint loss Ldiv produced by the i-th scale, and λi is the weight of the final segmentation loss Lseg produced by the i-th scale.
Finally, the four loss functions are combined and the network is trained with them simultaneously to obtain the medical image segmentation model.
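The weighted combination of the four losses is a one-line sum. In this sketch the per-scale weight names `alpha`, `beta`, `gamma`, `lam` and the example weight values are illustrative assumptions; the source does not specify the weight symbols or values.

```python
def total_loss(L_scale, L_com, L_div, L_seg, alpha, beta, gamma, lam):
    # L = sum_i (alpha_i*L_i + beta_i*L_com + gamma_i*L_div + lam_i*L_seg)
    # over the s scales; alpha/beta/gamma/lam name the per-scale weights
    s = len(L_scale)
    return sum(alpha[i] * L_scale[i] + beta[i] * L_com +
               gamma[i] * L_div + lam[i] * L_seg for i in range(s))

# s = 4 scales, as in the embodiment; toy loss values and weights
L = total_loss([1.0, 2.0, 3.0, 4.0], L_com=0.5, L_div=0.25, L_seg=0.125,
               alpha=[1.0] * 4, beta=[0.1] * 4, gamma=[0.1] * 4, lam=[0.25] * 4)
```

In practice the weights would be tuned so that no single term dominates early training.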
It should be noted that the method of the embodiments of the present disclosure may be executed by a single device, such as a computer or a server. The method of the embodiment can also be applied to a distributed scene and completed by the mutual cooperation of a plurality of devices. In such a distributed scenario, one of the devices may only perform one or more steps of the method of the embodiments of the present disclosure, and the devices may interact with each other to complete the method.
It should be noted that the above describes some embodiments of the disclosure. Other embodiments are within the scope of the following claims. In some cases, the actions or steps recited in the claims may be performed in a different order than in the embodiments described above and still achieve desirable results. In addition, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In some embodiments, multitasking and parallel processing may also be possible or may be advantageous.
Based on the same inventive concept, corresponding to any of the above embodiments, the present disclosure further provides an electronic device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and when the processor executes the program, the medical image segmentation method according to any of the above embodiments is implemented.
Fig. 3 is a schematic diagram illustrating a more specific hardware structure of an electronic device according to this embodiment, where the electronic device may include: a processor 1010, a memory 1020, an input/output interface 1030, a communication interface 1040, and a bus 1050. Wherein the processor 1010, memory 1020, input/output interface 1030, and communication interface 1040 are communicatively coupled to each other within the device via bus 1050.
The processor 1010 may be implemented by a general-purpose CPU (Central Processing Unit), a microprocessor, an Application Specific Integrated Circuit (ASIC), or one or more Integrated circuits, and is configured to execute related programs to implement the technical solutions provided in the embodiments of the present disclosure.
The Memory 1020 may be implemented in the form of a ROM (Read Only Memory), a RAM (Random Access Memory), a static storage device, a dynamic storage device, or the like. The memory 1020 may store an operating system and other application programs, and when the technical solution provided by the embodiments of the present specification is implemented by software or firmware, the relevant program codes are stored in the memory 1020 and called to be executed by the processor 1010.
The input/output interface 1030 is used for connecting an input/output module to input and output information. The i/o module may be configured as a component in a device (not shown) or may be external to the device to provide a corresponding function. The input devices may include a keyboard, a mouse, a touch screen, a microphone, various sensors, etc., and the output devices may include a display, a speaker, a vibrator, an indicator light, etc.
The communication interface 1040 is used for connecting a communication module (not shown in the drawings) to implement communication interaction between the present apparatus and other apparatuses. The communication module can realize communication in a wired mode (such as USB, network cable and the like) and also can realize communication in a wireless mode (such as mobile network, WIFI, Bluetooth and the like).
It should be noted that although the above-mentioned device only shows the processor 1010, the memory 1020, the input/output interface 1030, the communication interface 1040 and the bus 1050, in a specific implementation, the device may also include other components necessary for normal operation. In addition, those skilled in the art will appreciate that the above-described apparatus may also include only those components necessary to implement the embodiments of the present description, and not necessarily all of the components shown in the figures.
The electronic device of the above embodiment is used to implement the corresponding medical image segmentation method in any of the foregoing embodiments, and has the beneficial effects of the corresponding method embodiment, which are not described herein again.
Those of ordinary skill in the art will understand that: the discussion of any embodiment above is meant to be exemplary only, and is not intended to intimate that the scope of the disclosure, including the claims, is limited to these examples; within the idea of the present disclosure, also technical features in the above embodiments or in different embodiments may be combined, steps may be implemented in any order, and there are many other variations of the different aspects of the embodiments of the present disclosure as described above, which are not provided in detail for the sake of brevity.
In addition, well-known power/ground connections to Integrated Circuit (IC) chips and other components may or may not be shown in the provided figures for simplicity of illustration and discussion, and so as not to obscure the embodiments of the disclosure. Furthermore, devices may be shown in block diagram form in order to avoid obscuring embodiments of the present disclosure, and this also takes into account the fact that specifics with respect to implementation of such block diagram devices are highly dependent upon the platform within which the embodiments of the present disclosure are to be implemented (i.e., specifics should be well within purview of one skilled in the art). Where specific details (e.g., circuits) are set forth in order to describe example embodiments of the disclosure, it should be apparent to one skilled in the art that the embodiments of the disclosure can be practiced without, or with variation of, these specific details. Accordingly, the description is to be regarded as illustrative instead of restrictive.
While the present disclosure has been described in conjunction with specific embodiments thereof, many alternatives, modifications, and variations of these embodiments will be apparent to those of ordinary skill in the art in light of the foregoing description. For example, other memory architectures (e.g., dynamic RAM (DRAM)) may use the discussed embodiments.
The disclosed embodiments are intended to embrace all such alternatives, modifications and variances which fall within the broad scope of the appended claims. Therefore, any omissions, modifications, equivalents, improvements, and the like that may be made within the spirit and principles of the embodiments of the disclosure are intended to be included within the scope of the disclosure.
Claims (10)
1. A medical image segmentation method comprises the following steps:
acquiring medical image data;
preprocessing the medical image data to obtain processed medical image data;
inputting the processed medical image data into a medical image segmentation model;
obtaining a pyramid bottom feature f1 of the medical image according to the processed medical image data in the medical image segmentation model; and obtaining a prediction score p according to the pyramid bottom feature f1 of the medical image;
and obtaining a segmentation result P of the medical image according to the prediction score p.
2. The method of claim 1, wherein the training process of the medical image segmentation model comprises:
acquiring medical image data for training;
acquiring the number c of classes to be segmented of medical image data for training;
acquiring a real Mask corresponding to the medical image data for training, wherein the real Mask corresponding to the medical image data for training is the result of pre-labeling the medical image data for training pixel by pixel;
performing thresholding, normalization and data enhancement processing on the medical image data for training to obtain the processed medical image data for training;
performing initial multi-scale feature extraction on the processed medical image data for training to obtain a multi-scale feature pyramid of the processed medical image data for training;
performing multi-scale feature fusion on the multi-scale feature pyramid of the processed medical image data for training to obtain an enhanced feature pyramid for training;
obtaining the pyramid bottom feature f1′ for training according to the enhanced feature pyramid for training;
extracting weights from the enhanced feature pyramid for training for each of the c classes to be segmented to obtain the scale-adaptive attention class weights;
obtaining a multi-scale self-adaptive segmentation loss function according to the scale self-adaptive attention category weight;
obtaining a class segmentation score p′ and target features according to the pyramid bottom feature f1′ for training;
obtaining an intra-class consistency constraint loss function and an inter-class difference constraint loss function according to the target characteristics;
obtaining a segmentation loss function according to the class segmentation fraction p' and a real Mask corresponding to the medical image data for training;
and training to obtain the medical image segmentation model according to the multi-scale self-adaptive segmentation loss function, the intra-class consistency constraint loss function, the inter-class difference constraint loss function and the segmentation loss function.
3. The method of claim 2, wherein deriving the scale-adaptive attention class weight comprises:
a certain feature fi′ in the enhanced feature pyramid for training undergoes a convolution operation and average pooling, is input into a first fully connected network FC1 and activated with the ReLU activation function, and is then input into a second fully connected network FC2 and activated with the Sigmoid activation function to obtain the scale-adaptive attention class weight wi.
4. The method of claim 2, wherein obtaining the target feature comprises:
the pyramid bottom feature f1′ for training undergoes a 2d convolution operation and adaptive pooling, then a flattening operation and a dimension permutation operation, and then a 1d convolution operation to obtain the target feature f1c′.
5. The method of claim 1, wherein the processed medical image data is obtained by thresholding and normalizing the medical image data.
6. The method of claim 1, wherein obtaining the pyramid bottom feature f1 of the medical image specifically comprises:
extracting initial multi-scale features of the processed medical image data to obtain a multi-scale feature pyramid;
performing multi-scale feature fusion on the multi-scale feature pyramid to obtain an enhanced feature pyramid;
obtaining the pyramid bottom feature f1 of the medical image according to the enhanced feature pyramid.
7. The method according to claim 1, wherein the prediction score p is obtained by a 2d convolution operation on the pyramid bottom feature of the medical image.
8. The method of claim 7, wherein the prediction score p has size c × H × W; c is the number c of classes to be segmented of the medical image data for training; H is the length of the prediction score p, and W is the width of the prediction score p.
9. The method of claim 1, wherein the final segmentation result P of the medical image is obtained by calculating

Pij = argmax over l of plij, l ∈ [1, c], i ∈ [1, H], j ∈ [1, W]

wherein plij is the value of the prediction score of the l-th class to be segmented in the prediction score vector pij at pixel position (i, j) of the prediction score p, and c is the number c of classes to be segmented of the medical image data for training.
10. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the method of any one of claims 1 to 8 when executing the program.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111035523.6A CN113902674B (en) | 2021-09-02 | 2021-09-02 | Medical image segmentation method and electronic equipment |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113902674A true CN113902674A (en) | 2022-01-07 |
CN113902674B CN113902674B (en) | 2024-05-24 |
Family
ID=79188485
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111035523.6A Active CN113902674B (en) | 2021-09-02 | 2021-09-02 | Medical image segmentation method and electronic equipment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113902674B (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105069790A (en) * | 2015-08-06 | 2015-11-18 | 潍坊学院 | Rapid imaging detection method for gear appearance defect |
US20200184278A1 (en) * | 2014-03-18 | 2020-06-11 | Z Advanced Computing, Inc. | System and Method for Extremely Efficient Image and Pattern Recognition and Artificial Intelligence Platform |
CN112085744A (en) * | 2020-09-04 | 2020-12-15 | 厦门大学 | Renal tumor segmentation method based on multi-scale feature learning |
WO2021017372A1 (en) * | 2019-08-01 | 2021-02-04 | 中国科学院深圳先进技术研究院 | Medical image segmentation method and system based on generative adversarial network, and electronic equipment |
CN113205523A (en) * | 2021-04-29 | 2021-08-03 | 浙江大学 | Medical image segmentation and identification system, terminal and storage medium with multi-scale representation optimization |
Non-Patent Citations (3)
Title |
---|
He Chao; Zhang Yinhui; He Zifen: "Semantic segmentation of workpiece targets with multi-scale feature fusion", Journal of Image and Graphics, no. 03, 16 March 2020 (2020-03-16) *
Hu Jinmei; Li Zuoyong: "Medical image segmentation algorithm based on improved mathematical morphology", Computer Simulation, vol. 28, no. 5, 15 May 2011 (2011-05-15) *
Chen Siyu; Xu Wenjun; Tian Baoyu; He Zhiqiang: "An adaptive classification and correction algorithm for propagation models based on neural networks", 2010 Annual Conference on Communication Theory and Signal Processing, China Institute of Communications, 31 December 2010 (2010-12-31) *
Also Published As
Publication number | Publication date |
---|---|
CN113902674B (en) | 2024-05-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Yap et al. | Deep learning in diabetic foot ulcers detection: A comprehensive evaluation | |
US10482603B1 (en) | Medical image segmentation using an integrated edge guidance module and object segmentation network | |
Liu et al. | A deep spatial contextual long-term recurrent convolutional network for saliency detection | |
KR20210048523A (en) | Image processing method, apparatus, electronic device and computer-readable storage medium | |
CN111448581A (en) | System and method for image processing using deep neural networks | |
George et al. | Computer assisted recognition of breast cancer in biopsy images via fusion of nucleus-guided deep convolutional features | |
CN112488923A (en) | Image super-resolution reconstruction method and device, storage medium and electronic equipment | |
Chen et al. | Multi-SVM based Dempster–Shafer theory for gesture intention understanding using sparse coding feature | |
Sharma et al. | Li-SegPNet: Encoder-decoder mode lightweight segmentation network for colorectal polyps analysis | |
CN111694954B (en) | Image classification method and device and electronic equipment | |
CN111709481A (en) | Tobacco disease identification method, system, platform and storage medium | |
CN114548213A (en) | Model training method, image recognition method, terminal device, and computer medium | |
CN113850796A (en) | Lung disease identification method and device based on CT data, medium and electronic equipment | |
Yadav et al. | End-to-end bare-hand localization system for human–computer interaction: a comprehensive analysis and viable solution | |
CN111414823B (en) | Human body characteristic point detection method and device, electronic equipment and storage medium | |
CN113781387A (en) | Model training method, image processing method, device, equipment and storage medium | |
Aboulmira et al. | Ensemble learning methods for deep learning: Application to skin lesions classification | |
CN116168439A (en) | Lightweight lip language identification method and related equipment | |
CN113902674B (en) | Medical image segmentation method and electronic equipment | |
CN113111879B (en) | Cell detection method and system | |
Saleh et al. | Identification of facial skin diseases from face phenotypes using FSDNet in uncontrolled environment | |
CN111598144B (en) | Training method and device for image recognition model | |
CN114332463A (en) | MR brain tumor image example segmentation method, device, equipment and storage medium | |
Liu et al. | Histopathologic cancer detection by dense-attention network with incorporation of prior knowledge | |
Johnson et al. | A study on eye fixation prediction and salient object detection in supervised saliency |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||