CN115131565B - Histological image segmentation model based on semi-supervised learning - Google Patents

Histological image segmentation model based on semi-supervised learning

Info

Publication number
CN115131565B
CN115131565B
Authority
CN
China
Prior art keywords
model
loss function
training
level
segmentation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210858624.1A
Other languages
Chinese (zh)
Other versions
CN115131565A (en)
Inventor
邓有朋
金强国
苏苒
孟昭鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Original Assignee
Tianjin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University filed Critical Tianjin University
Priority to CN202210858624.1A priority Critical patent/CN115131565B/en
Publication of CN115131565A publication Critical patent/CN115131565A/en
Application granted granted Critical
Publication of CN115131565B publication Critical patent/CN115131565B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06N: COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00: Computing arrangements based on biological models
    • G06N 3/02: Neural networks
    • G06N 3/08: Learning methods
    • G06N 3/082: Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
    • G06V: IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V 10/00: Arrangements for image or video recognition or understanding
    • G06V 10/20: Image preprocessing
    • G06V 10/26: Segmentation of patterns in the image field; cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; detection of occlusion
    • G06V 10/70: Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V 10/77: Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; blind source separation
    • G06V 10/774: Generating sets of training patterns; bootstrap methods, e.g. bagging or boosting
    • G06V 10/7753: Incorporation of unlabelled data, e.g. multiple instance learning [MIL]
    • G06V 10/776: Validation; performance evaluation
    • G06V 10/82: Arrangements for image or video recognition or understanding using neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Multimedia (AREA)
  • Databases & Information Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Medical Informatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • General Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a histological image segmentation model based on semi-supervised learning, comprising a teacher model, a student model, a hierarchical consistency enforcement module, and a total loss function for supervising training. The student model is trained on labeled and unlabeled data, and the teacher model is trained on unlabeled data. When training on unlabeled data, the hierarchical consistency enforcement module applies a multi-level consistency loss that constrains the teacher model's segmentation predictions to be consistent with the segmentation predictions obtained from perturbed variants of the multi-level latent representations of the student model's encoder. The histological image segmentation model is effective: the proposed hierarchical consistency enforcement module and multi-level consistency loss strengthen the invariance of the model's segmentation predictions by adding perturbations to the model's multi-level latent representations.

Description

Histological image segmentation model based on semi-supervised learning
Technical Field
The invention belongs to the technical field of image segmentation, and particularly relates to a histological image segmentation model based on semi-supervised learning.
Background
Accurate segmentation of cells and glands in histological images is an indispensable but challenging task in computer-aided diagnosis. Deep-learning-based segmentation methods achieve advanced performance by relying on large amounts of labeled data [1]. A persistent problem in histological image analysis, however, is that improving the performance of deep learning models requires large quantities of high-quality, well-annotated data. Unlike natural images, medical images must be annotated by experts with domain knowledge, which makes acquiring well-labeled data time-consuming and labor-intensive.
In recent years, to ease the annotation burden, more and more research has been devoted to semi-supervised medical image segmentation, which uses a limited amount of labeled data together with a large amount of unlabeled data [2,4,5]. How to promote consistency between labeled and unlabeled data, however, remains a significant challenge for semi-supervised learning. Current research focuses on designing perturbations for consistency training on labeled and unlabeled data [2,4,5], but existing consistency training methods mainly apply perturbations to the input space and the high-level feature space, ignoring perturbations in the hierarchical latent feature spaces of deep network architectures. Moreover, in the Mean-Teacher architecture commonly used for consistency training, the teacher model generates the training targets for the student model; it is difficult to guarantee during training that the teacher performs better than the student, and a low-performance teacher model poses a serious challenge to training the Mean-Teacher architecture.
Disclosure of Invention
To address the deficiencies of the prior art, the invention aims to provide a histological image segmentation model based on semi-supervised learning that solves the annotation difficulty in computer-aided diagnosis and the problem of a low-performance teacher model interfering with training in the Mean-Teacher architecture.
The aim of the invention is achieved by the following technical scheme.
A histological image segmentation model based on semi-supervised learning, comprising: a teacher model, a student model, a hierarchical consistency enforcement (HCE) module, and a total loss function for supervising training;
the teacher model and the student model have the same structure: both adopt DeepLabV3+ with atrous convolution and consist of an encoder and a main decoder, wherein the encoder comprises a convolution block CB and four residual blocks RB1, RB2, RB3 and RB4 taken from a pre-trained ResNet34, and RB3 and RB4 use atrous convolution;
the student model is trained on labeled and unlabeled data, and the teacher model is trained on unlabeled data; when training on unlabeled data, the HCE module uses a multi-level consistency loss to enforce consistency between the teacher model's segmentation predictions and the segmentation predictions obtained from perturbed variants of the multi-level latent representations of the student model's encoder; the variant predictions are obtained as follows: the output latent representation z_h of each of the residual blocks RB2, RB3 and RB4 in the student model is perturbed to obtain a variant z̃_h, which is passed through an auxiliary decoder to generate a segmentation prediction;
for labeled data, the student model is trained with a supervised loss function L_seg;
the multi-level consistency loss comprises a learnable hierarchical consistency loss L_lh_c and a self-guided hierarchical consistency loss L_sgh_c;
the total loss function L_total is:

L_total = L_seg + λ_h_c · (L_lh_c + λ_sgh_c · L_sgh_c)

where λ_sgh_c is the coefficient of L_sgh_c with λ_sgh_c ∈ [0, 1], and λ_h_c is a ramp-up weight [its expression is given only as an image in the source] in which q is a scaling factor, v is the current training iteration, and T is the total number of training iterations.
In the above technical solution, the update strategy for the weights of the teacher model is: for each training batch, the teacher model's weights are updated from the teacher model's weights in the previous training batch and the student model's weights in the current training batch. The weights θ′_t of the teacher model in the t-th training batch are:

θ′_t = α · θ′_{t−1} + (1 − α) · θ_t

where θ′_{t−1} denotes the weights of the teacher model in the (t−1)-th training batch, θ_t denotes the weights of the student model in the t-th training batch (updated by gradient descent throughout training), and α is the decay rate of the exponential moving average.
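A minimal PyTorch sketch of this update rule, assuming both models are torch.nn.Module instances with identical architectures (function and variable names here are illustrative, not from the patent):

```python
import torch

@torch.no_grad()
def update_teacher_ema(teacher, student, alpha=0.99):
    """EMA update: theta'_t = alpha * theta'_{t-1} + (1 - alpha) * theta_t."""
    for t_param, s_param in zip(teacher.parameters(), student.parameters()):
        t_param.mul_(alpha).add_(s_param, alpha=1.0 - alpha)
```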
In the above technical solution, the supervised loss function L_seg is composed of a cross-entropy loss L_ce and a variance-constrained cross loss L_var:

L_seg = L_ce + λ_var · L_var

where L_ce denotes the cross-entropy loss, L_var denotes the variance-constrained cross loss, and λ_var denotes the weight of the variance-constrained cross loss.
In the above technical solution, for the labeled data B_l of each training batch, the variance-constrained cross loss L_var is expressed as:

L_var = (1/D) · Σ_{d=1…D} (1/|B_d|) · Σ_{j=1…|B_d|} (p_j − u_d)²

where D denotes the number of segmentation instances in the labeled data B_l of the training batch, B_d denotes all pixels contained in the d-th segmentation instance of the batch, |B_d| denotes the number of pixels in B_d, p_j denotes the predicted probability that the j-th pixel of B_d belongs to the correct class (j = 1 … |B_d|), and u_d denotes the average of the predicted probabilities over all pixels in B_d.
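A sketch of this term following the definitions above, where `probs` holds each pixel's predicted probability for its correct class and `instance_ids` assigns each pixel to a segmentation instance (0 for background); all names are illustrative:

```python
import torch

def variance_constrained_loss(probs, instance_ids):
    """L_var: mean, over instances, of the variance of the correct-class
    probabilities p_j around their per-instance mean u_d."""
    losses = []
    for d in instance_ids.unique():
        if d == 0:  # skip background pixels
            continue
        p = probs[instance_ids == d]     # p_j for all pixels in B_d
        u = p.mean()                     # u_d
        losses.append(((p - u) ** 2).mean())
    return torch.stack(losses).mean() if losses else probs.new_zeros(())
```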
In the above technical solution, the perturbation operation is chosen at random, specifically dropout or a feature-level noise layer.
In the above technical solution, all auxiliary decoders have the same structure, comprising an atrous spatial pyramid pooling (ASPP) layer and an upsampling layer.
In the above technical solution, the learnable hierarchical consistency loss L_lh_c is expressed as:

L_lh_c = (1/|B_u|) · Σ_{k=1…|B_u|} [ L_mse(ŷ_k, y′_k) + Σ_{h=1…H} L_mse(ŷ_k^h, y′_k) ]

where B_u denotes any training batch of unlabeled data, |B_u| denotes the number of pixels in B_u, L_mse denotes the mean squared error function, H denotes the number of levels in the HCE module, ŷ_k denotes the segmentation prediction of the student model's main decoder for the k-th pixel, ŷ_k^h denotes the segmentation prediction of the student model's h-th auxiliary decoder for the k-th pixel (h = 1 … H), and y′_k denotes the learnable prediction probability of the teacher model for the k-th pixel, expressed as:

y′_k = u′_k · ŷ_k + (1 − u′_k) · ỹ_k

where u′_k denotes the uncertainty of the teacher model's prediction for the k-th pixel [its expression is given only as an image in the source] and ỹ_k denotes the teacher model's segmentation prediction for the k-th pixel.
In the above technical solution, the self-guided hierarchical consistency loss L_sgh_c is expressed as:

L_sgh_c = (1/|B_u|) · Σ_{k=1…|B_u|} (1/H) · Σ_{h=1…H} L_mse(ŷ_k^h, ŷ_k)
the beneficial effects of the invention are as follows:
(1) The invention provides a histological image segmentation model based on semi-supervised learning, and the results of two embodiments demonstrate its effectiveness;
(2) The invention provides a hierarchical consistency enforcement module and a multi-level consistency loss, which strengthen the invariance of the model's segmentation predictions by adding perturbations to the model's multi-level latent representations.
Drawings
FIG. 1 is a block diagram of a histological image segmentation model based on semi-supervised learning;
FIG. 2 shows the experimental results after training.
Detailed Description
The technical scheme of the invention is further described below with reference to specific embodiments.
Example 1
A histological image segmentation model based on semi-supervised learning comprises: a teacher model, a student model, a hierarchical consistency enforcement (HCE) module, and a total loss function for supervising training. The teacher model and the student model have the same structure: both adopt DeepLabV3+ [8] with atrous convolution, consisting of an encoder and a main decoder (g). The encoder comprises a convolution block CB and four residual blocks RB1, RB2, RB3 and RB4 taken from a pre-trained ResNet34; RB3 and RB4 use atrous convolution with dilation rates set to 2 and 4, respectively. The parameters of the teacher model and the student model are independent.
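A compact PyTorch sketch of such an encoder, assuming torchvision's pre-trained ResNet34 and treating its four stages as RB1 to RB4; converting RB3/RB4 to atrous convolution by dropping their strides and dilating their 3×3 convolutions is one plausible reading of the description, not the patent's reference code:

```python
import torch.nn as nn
from torchvision.models import resnet34

def make_dilated(stage, rate):
    """Convert a ResNet stage to atrous convolution: remove its stride and
    dilate its 3x3 convolutions by `rate`."""
    for m in stage.modules():
        if isinstance(m, nn.Conv2d):
            if m.stride == (2, 2):
                m.stride = (1, 1)
            if m.kernel_size == (3, 3):
                m.dilation, m.padding = (rate, rate), (rate, rate)

class HCEEncoder(nn.Module):
    def __init__(self):
        super().__init__()
        backbone = resnet34(weights="IMAGENET1K_V1")  # pre-trained ResNet34
        # convolution block CB, then residual blocks RB1..RB4
        self.cb = nn.Sequential(backbone.conv1, backbone.bn1,
                                backbone.relu, backbone.maxpool)
        self.rb1, self.rb2 = backbone.layer1, backbone.layer2
        self.rb3, self.rb4 = backbone.layer3, backbone.layer4
        make_dilated(self.rb3, 2)  # RB3: dilation rate 2
        make_dilated(self.rb4, 4)  # RB4: dilation rate 4

    def forward(self, x):
        z0 = self.rb1(self.cb(x))
        z1 = self.rb2(z0)  # z_1, z_2, z_3: latent representations fed to the HCE module
        z2 = self.rb3(z1)
        z3 = self.rb4(z2)
        return z1, z2, z3
```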
the update strategy of the weights in the teacher model is as follows: for each training batch, the weight update of the teacher model is based on the weight of the teacher model in the previous training batch and the weight of the student model in the present training batch, and the update strategy is as follows: weights θ 'for teacher model in the t training batch' t The method comprises the following steps:
θ′ t =αθ′ t-1 +(1-α)θ t
wherein, θ' t-1 Representing the weight, θ, of the teacher model in the t-1 th training lot t Representing the weight of the student model in the t training batch, alpha represents updating the student model alpha using gradient descent during the overall training process t An exponential moving average (Exponential Moving Average), in this example α=0.99.
The student model is trained on labeled and unlabeled data, and the teacher model is trained on unlabeled data. When training on unlabeled data, the hierarchical consistency enforcement (HCE) module uses the multi-level consistency loss to enforce consistency between the teacher model's segmentation predictions and the segmentation predictions obtained from perturbed variants of the multi-level latent representations of the student model's encoder.
For labeled data, the student model is trained with the supervised loss function L_seg, which is composed of a cross-entropy loss (Cross Entropy Loss) L_ce and a variance-constrained cross loss (Variance Constrained Cross Loss) L_var:

L_seg = L_ce + λ_var · L_var

where L_ce denotes the cross-entropy loss [9], L_var denotes the variance-constrained cross loss, and λ_var denotes its weight; in this embodiment λ_var = 0.1.
The variance-constrained cross loss L_var [9] applies a local constraint to pixels belonging to the same segmentation instance, addressing the problem that the model cannot completely segment an entire instance when the instance has uneven color or texture. For the labeled data B_l of each training batch, L_var is expressed as:

L_var = (1/D) · Σ_{d=1…D} (1/|B_d|) · Σ_{j=1…|B_d|} (p_j − u_d)²

where D denotes the number of segmentation instances in the labeled data B_l of the training batch, B_d denotes all pixels contained in the d-th segmentation instance, |B_d| denotes the number of pixels in B_d, p_j denotes the predicted probability that the j-th pixel of B_d belongs to the correct class (j = 1 … |B_d|), and u_d denotes the average of the predicted probabilities over all pixels in B_d.
The segmentation predictions for the variants of the multi-level latent representations of the student model's encoder are obtained as follows: as shown in FIG. 1, the output latent representation z_h of each of the residual blocks RB2, RB3 and RB4 in the student model (z_1, z_2 and z_3 in FIG. 1) is perturbed to obtain a variant z̃_h (z̃_1, z̃_2 and z̃_3), which is then passed through the corresponding auxiliary decoder shown in FIG. 1 to generate a segmentation prediction.
The perturbation operation is chosen at random, specifically dropout or a feature-level noise layer [11].
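A minimal sketch of these two perturbations, assuming feature maps of shape (N, C, H, W); the uniform noise range is an assumption borrowed from the feature-level noise of [11]:

```python
import random
import torch
import torch.nn.functional as F

def perturb(z, drop_p=0.5, noise_range=0.3):
    """Randomly perturb a latent representation z_h into a variant:
    either channel dropout or feature-level uniform noise."""
    if random.random() < 0.5:
        return F.dropout2d(z, p=drop_p, training=True)
    noise = z.new_empty(z.shape).uniform_(-noise_range, noise_range)
    return z + z * noise  # noise scaled by the feature magnitude
```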
All auxiliary decoders have the same structure, comprising an atrous spatial pyramid pooling (Atrous Spatial Pyramid Pooling, ASPP) layer and an upsampling layer, where the four sampling rates of the ASPP layer are set to 6, 8, 18 and 24, respectively.
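A sketch of one auxiliary decoder under these settings, reusing torchvision's ASPP block; the class count and upsampling factor are illustrative assumptions:

```python
import torch.nn as nn
from torchvision.models.segmentation.deeplabv3 import ASPP

class AuxDecoder(nn.Module):
    """Auxiliary decoder: ASPP with rates (6, 8, 18, 24) followed by
    a 1x1 classifier and bilinear upsampling to input resolution."""
    def __init__(self, in_channels, num_classes=2, scale=8):
        super().__init__()
        self.aspp = ASPP(in_channels, atrous_rates=[6, 8, 18, 24])  # outputs 256 channels
        self.classifier = nn.Conv2d(256, num_classes, kernel_size=1)
        self.up = nn.Upsample(scale_factor=scale, mode="bilinear", align_corners=False)

    def forward(self, z_tilde):
        return self.up(self.classifier(self.aspp(z_tilde)))
```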
The hierarchical consistency enforcement (HCE) module provides a stronger constraint for student model training, thereby promoting the generalization of the student network. Furthermore, to give the student model greater generalization capability, the invention does not impose constraints on the original latent representation of each level of the student model's encoder, but on variants of those representations.
The multi-level consistency loss comprises a learnable hierarchical consistency loss L_lh_c and a self-guided hierarchical consistency loss L_sgh_c.
To prevent the teacher model from producing high-uncertainty estimates and to strengthen hierarchical consistency, a learnable hierarchical consistency loss L_lh_c is proposed, expressed as:

L_lh_c = (1/|B_u|) · Σ_{k=1…|B_u|} [ L_mse(ŷ_k, y′_k) + Σ_{h=1…H} L_mse(ŷ_k^h, y′_k) ]

where B_u denotes any training batch of unlabeled data, |B_u| denotes the number of pixels in B_u, L_mse denotes the mean squared error (Mean Squared Error) function used to compute the difference between the teacher's and the student's segmentation predictions, and H denotes the number of levels in the HCE module (H = 3 in this embodiment); ŷ_k denotes the segmentation prediction of the student model's main decoder (g in FIG. 1) for the k-th pixel, ŷ_k^h denotes the segmentation prediction of the student model's h-th auxiliary decoder for the k-th pixel (h = 1 … H), and y′_k denotes the learnable prediction probability of the teacher model for the k-th pixel, which provides a more reliable prediction to guide the student model; y′_k is expressed as:

y′_k = u′_k · ŷ_k + (1 − u′_k) · ỹ_k

where u′_k denotes the uncertainty of the teacher model's prediction for the k-th pixel [its expression is given only as an image in the source] and ỹ_k denotes the teacher model's segmentation prediction for the k-th pixel.
As can be seen from equation (4), when the teacher model produces an unreliable result (high uncertainty), the teacher model's learnable prediction probability y′_k for the k-th pixel approximates the student main decoder's prediction ŷ_k for that pixel; conversely, when the teacher model is confident in its prediction (low uncertainty), y′_k equals the teacher model's own prediction ỹ_k and provides a certain prediction as the target for student model learning.
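A sketch of this behavior: the teacher's prediction is blended toward the student main decoder's prediction in proportion to the teacher's uncertainty. Since the source does not reproduce the expression for u′_k, per-pixel normalized entropy is used below purely as an illustrative stand-in:

```python
import math
import torch

def learnable_teacher_target(teacher_probs, student_main_probs, eps=1e-8):
    """y'_k = u'_k * y_hat_k + (1 - u'_k) * y_tilde_k: high teacher uncertainty
    pulls the target toward the student's main-decoder prediction."""
    c = teacher_probs.shape[1]  # number of classes
    # Illustrative uncertainty u'_k: per-pixel entropy normalized to [0, 1].
    entropy = -(teacher_probs * (teacher_probs + eps).log()).sum(dim=1, keepdim=True)
    u = entropy / math.log(c)
    return u * student_main_probs + (1.0 - u) * teacher_probs
```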
A self-guided hierarchical consistency loss L_sgh_c ensures that the outputs of the auxiliary decoders are consistent with the output of the student model's main decoder; it is expressed as:

L_sgh_c = (1/|B_u|) · Σ_{k=1…|B_u|} (1/H) · Σ_{h=1…H} L_mse(ŷ_k^h, ŷ_k)
as can be seen from the formula (7), the student model takes prediction of the main decoder as guidance through constraint of the self-guidance multi-level consistency loss function, and the inconsistency among all decoders is minimized, so that the characteristic representation capability of the student model is enhanced.
The total loss function L_total is:

L_total = L_seg + λ_h_c · (L_lh_c + λ_sgh_c · L_sgh_c)

where λ_sgh_c is the coefficient of L_sgh_c with λ_sgh_c ∈ [0, 1] (0.1 in this embodiment), and λ_h_c is a ramp-up weight [its expression is given only as an image in the source] in which q is a scaling factor (q = 0.1 in this embodiment), v is the current training iteration, and T is the total number of training iterations.
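Since the ramp-up expression survives only as an image, the sketch below uses the Gaussian ramp-up common in Mean-Teacher-style training [12], scaled by q; this matches the stated roles of q, v and T but is an assumption, not the patent's formula:

```python
import math

def lambda_h_c(v, T, q=0.1):
    """Consistency weight ramping up from ~0 to q over training;
    v = current iteration, T = total iterations."""
    return q * math.exp(-5.0 * (1.0 - min(v / T, 1.0)) ** 2)
```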
The learnable hierarchical consistency loss ensures that the student model's segmentation predictions stay consistent with the teacher model's segmentation predictions while preventing the teacher model from producing high-uncertainty predictions. The self-guided hierarchical consistency loss ensures that the segmentation predictions of the student model's auxiliary decoders stay consistent with those of the main decoder, thereby enhancing the student model's feature representation capability.
MoNuSeg [15] (a multi-organ nucleus segmentation dataset) and CRAG [16] (a colorectal adenocarcinoma dataset) were prepared as datasets. Image patches were cropped from the original images in a sliding-window manner; for MoNuSeg the patch size was set to 128×128, and for CRAG it was set to 480×480.
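A sketch of the sliding-window cropping, assuming numpy image arrays; the stride is not specified in the source and is left as a parameter:

```python
import numpy as np

def extract_patches(image, patch_size, stride):
    """Crop fixed-size patches in a sliding-window manner,
    e.g. patch_size=128 for MoNuSeg or patch_size=480 for CRAG."""
    h, w = image.shape[:2]
    patches = []
    for top in range(0, max(h - patch_size, 0) + 1, stride):
        for left in range(0, max(w - patch_size, 0) + 1, stride):
            patches.append(image[top:top + patch_size, left:left + patch_size])
    return np.stack(patches)
```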
Each dataset is divided into labeled and unlabeled data, with the labeled portion comprising 5%, 10% or 20% of the dataset and the remaining data treated as unlabeled.
The labeled and unlabeled data are fed into the student model of the histological image segmentation model for training, and the unlabeled data are fed into the teacher model for training.
For MoNuSeg, the number of images per training batch is set to 16 and the total number of training iterations to 500; for CRAG, the batch size is set to 8 and the total number of training iterations to 300. To prevent overfitting, online data augmentation is performed during training, including random scaling, flipping, rotation and affine transformation. The experimental results of the invention (HCE) are shown in Table 1 and FIG. 2; Table 1 compares the performance of the invention with state-of-the-art semi-supervised learning methods on MoNuSeg and CRAG.
TABLE 1: performance comparison with state-of-the-art semi-supervised learning methods on MoNuSeg and CRAG [table given only as an image in the source]
For MoNuSeg, the invention achieves the best results on both evaluation metrics (Dice and AJI). For CRAG, the invention achieves the best results on all evaluation metrics (F1, Dice, Haus). By encouraging multi-level consistency training, the invention continuously improves the segmentation performance and robustness of the model.
FIG. 2 shows the segmentation predictions of the invention and other semi-supervised learning methods on MoNuSeg and CRAG with 5% and 10% labeled data. The predictions show that the invention generalizes better to segmentation instances of different shapes, such as small nuclei or large glands.
Example 2
This embodiment is substantially the same as Embodiment 1, except that the labeled data comprise 50% of the dataset and the dataset is MoNuSeg. The experimental results are shown in Table 2, which compares the performance of the invention with fully supervised methods on the MoNuSeg dataset. The invention (HCE) achieves the best results on both the F1 and Dice evaluation metrics.
TABLE 2: performance comparison with fully supervised methods on MoNuSeg [table given only as an image in the source]
The foregoing has described exemplary embodiments of the invention. It should be understood that those skilled in the art may make simple variations, modifications or other equivalent arrangements without departing from the spirit of the invention.
[1] Sahasrabudhe, M., Christodoulidis, S., Salgado, R., Michiels, S., Loi, S., André, F., Paragios, N., Vakalopoulou, M.: Self-supervised Nuclei Segmentation in Histopathological Images Using Attention. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 393–402. Springer (2020)
[2] Yu, L., Wang, S., Li, X., Fu, C.W., Heng, P.A.: Uncertainty-aware self-ensembling model for semi-supervised 3D left atrium segmentation. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 605–613. Springer (2019)
[3] Graham, S., Chen, H., Gamper, J., Dou, Q., Heng, P.A., Snead, D., Tsang, Y.W., Rajpoot, N.: MILD-Net: Minimal information loss dilated network for gland instance segmentation in colon histology images. Medical Image Analysis 52, 199–211 (2019)
[4] Xie, Y., Zhang, J., Liao, Z., Verjans, J., Shen, C., Xia, Y.: Pairwise Relation Learning for Semi-supervised Gland Segmentation. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 417–427. Springer (2020)
[5] Li, X., Yu, L., Chen, H., Fu, C.W., Xing, L., Heng, P.A.: Transformation-consistent self-ensembling model for semi-supervised medical image segmentation. IEEE Transactions on Neural Networks and Learning Systems, pp. 1–12 (2020)
[6] Li, X., Yu, L., Chen, H., Fu, C.W., Xing, L., Heng, P.A.: Transformation-consistent self-ensembling model for semi-supervised medical image segmentation. IEEE Transactions on Neural Networks and Learning Systems, pp. 1–12 (2020). https://doi.org/10.1109/TNNLS.2020.2995319
[7] Li, Y., Chen, J., Xie, X., Ma, K., Zheng, Y.: Self-loop uncertainty: A novel pseudo-label for semi-supervised medical image segmentation. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 614–623. Springer (2020)
[8] Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: DeepLab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Transactions on Pattern Analysis and Machine Intelligence 40(4), 834–848 (2017)
[9] Qu, H., Yan, Z., Riedlinger, G.M., De, S., Metaxas, D.N.: Improving nuclei/gland instance segmentation in histopathology images by full resolution neural network and spatial constrained loss. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 378–386. Springer (2019)
[10] Raza, S.E.A., Cheung, L., Shaban, M., Graham, S., Epstein, D., Pelengaris, S., Khan, M., Rajpoot, N.M.: Micro-Net: A unified model for segmentation of various objects in microscopy images. Medical Image Analysis 52, 160–173 (2019)
[11] Ouali, Y., Hudelot, C., Tami, M.: Semi-supervised semantic segmentation with cross-consistency training. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12674–12684 (2020)
[12] Tarvainen, A., Valpola, H.: Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results. In: Advances in Neural Information Processing Systems, pp. 1195–1204 (2017)
[13] Verma, V., Lamb, A., Kannala, J., Bengio, Y., Lopez-Paz, D.: Interpolation Consistency Training for Semi-supervised Learning. In: Proceedings of the 28th International Joint Conference on Artificial Intelligence, pp. 3635–3641. IJCAI'19, AAAI Press (2019)
[14] Vu, T.H., Jain, H., Bucher, M., Cord, M., Pérez, P.: ADVENT: Adversarial entropy minimization for domain adaptation in semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2517–2526 (2019)
[15] Kumar, N., Verma, R., Anand, D., Zhou, Y., Onder, O.F., Tsougenis, E., Chen, H., Heng, P.A., Li, J., Hu, Z., et al.: A multi-organ nucleus segmentation challenge. IEEE Transactions on Medical Imaging 39(5), 1380–1391 (2019)
[16] Awan, R., Sirinukunwattana, K., Epstein, D., Jefferyes, S., Qidwai, U., Aftab, Z., Mujeeb, I., Snead, D., Rajpoot, N.: Glandular morphometrics for objective grading of colorectal adenocarcinoma histology images. Scientific Reports 7(1), 1–12 (2017)
[17] Xiang, T., Zhang, C., Liu, D., Song, Y., Huang, H., Cai, W.: BiO-Net: Learning Recurrent Bidirectional Connections for Encoder-Decoder Architecture. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 74–84. Springer (2020)
[18] Xie, Y., Lu, H., Zhang, J., Shen, C., Xia, Y.: Deep segmentation-emendation model for gland instance segmentation. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 469–477. Springer (2019)
[19] Xie, Y., Zhang, J., Liao, Z., Verjans, J., Shen, C., Xia, Y.: Pairwise Relation Learning for Semi-supervised Gland Segmentation. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 417–427. Springer (2020)
[20] Yu, L., Wang, S., Li, X., Fu, C.W., Heng, P.A.: Uncertainty-aware self-ensembling model for semi-supervised 3D left atrium segmentation. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 605–613. Springer (2019)

Claims (6)

1. A histological image segmentation model based on semi-supervised learning, comprising: a teacher model, a student model, a hierarchical consistency enforcement (HCE) module, and a total loss function for supervising training;
wherein the teacher model and the student model have the same structure: both adopt DeepLabV3+ with atrous convolution and consist of an encoder and a main decoder, the encoder comprising a convolution block and four residual blocks RB1, RB2, RB3 and RB4 taken from a pre-trained ResNet34, RB3 and RB4 using atrous convolution;
wherein the student model is trained on labeled and unlabeled data and the teacher model is trained on unlabeled data; when training on unlabeled data, the HCE module uses a multi-level consistency loss to enforce consistency between the teacher model's segmentation predictions and the segmentation predictions obtained from perturbed variants of the multi-level latent representations of the student model's encoder, the variant predictions being obtained as follows: the output latent representation z_h of each of the residual blocks RB2, RB3 and RB4 in the student model is perturbed to obtain a variant z̃_h, which is passed through an auxiliary decoder to generate a segmentation prediction;
wherein for labeled data the student model is trained with a supervised loss function L_seg;
wherein the multi-level consistency loss comprises a learnable hierarchical consistency loss L_lh_c and a self-guided hierarchical consistency loss L_sgh_c; the learnable hierarchical consistency loss L_lh_c is expressed as:

L_lh_c = (1/|B_u|) · Σ_{k=1…|B_u|} [ L_mse(ŷ_k, y′_k) + Σ_{h=1…H} L_mse(ŷ_k^h, y′_k) ]

where B_u denotes any training batch of unlabeled data, |B_u| denotes the number of pixels in B_u, L_mse denotes the mean squared error function, H denotes the number of levels in the HCE module, ŷ_k denotes the segmentation prediction of the student model's main decoder for the k-th pixel, ŷ_k^h denotes the segmentation prediction of the student model's h-th auxiliary decoder for the k-th pixel (h = 1 … H), and y′_k denotes the learnable prediction probability of the teacher model for the k-th pixel, expressed as:

y′_k = u′_k · ŷ_k + (1 − u′_k) · ỹ_k

where u′_k denotes the uncertainty of the teacher model's prediction for the k-th pixel [its expression is given only as an image in the source] and ỹ_k denotes the teacher model's segmentation prediction for the k-th pixel;
wherein the self-guided hierarchical consistency loss L_sgh_c is expressed as:

L_sgh_c = (1/|B_u|) · Σ_{k=1…|B_u|} (1/H) · Σ_{h=1…H} L_mse(ŷ_k^h, ŷ_k);
and wherein the total loss function L_total is:

L_total = L_seg + λ_h_c · (L_lh_c + λ_sgh_c · L_sgh_c)

where λ_sgh_c is the coefficient of L_sgh_c with λ_sgh_c ∈ [0, 1], and λ_h_c is a ramp-up weight [its expression is given only as an image in the source] in which q is a scaling factor, v is the current training iteration, and T is the total number of training iterations.
2. The histological image segmentation model according to claim 1, wherein the update strategy for the weights of the teacher model is: for each training batch, the teacher model's weights are updated from the teacher model's weights in the previous training batch and the student model's weights in the current training batch; the weights θ′_t of the teacher model in the t-th training batch are:

θ′_t = α · θ′_{t−1} + (1 − α) · θ_t

where θ′_{t−1} denotes the weights of the teacher model in the (t−1)-th training batch, θ_t denotes the weights of the student model in the t-th training batch (updated by gradient descent throughout training), and α is the decay rate of the exponential moving average.
3. The histological image segmentation model according to claim 1, wherein the supervised loss function L_seg is composed of a cross-entropy loss L_ce and a variance-constrained cross loss L_var:

L_seg = L_ce + λ_var · L_var

where L_ce denotes the cross-entropy loss, L_var denotes the variance-constrained cross loss, and λ_var denotes the weight of the variance-constrained cross loss.
4. The histological image segmentation model according to claim 3, wherein for the labeled data B_l of each training batch the variance-constrained cross loss L_var is expressed as:

L_var = (1/D) · Σ_{d=1…D} (1/|B_d|) · Σ_{j=1…|B_d|} (p_j − u_d)²

where D denotes the number of segmentation instances in the labeled data B_l of the training batch, B_d denotes all pixels contained in the d-th segmentation instance, |B_d| denotes the number of pixels in B_d, p_j denotes the predicted probability that the j-th pixel of B_d belongs to the correct class (j = 1 … |B_d|), and u_d denotes the average of the predicted probabilities over all pixels in B_d.
5. The histological image segmentation model according to claim 4, wherein the perturbation operation is chosen at random, specifically dropout or a feature-level noise layer.
6. The histological image segmentation model according to claim 5, wherein all auxiliary decoders have the same structure, comprising an atrous spatial pyramid pooling (ASPP) layer and an upsampling layer.
CN202210858624.1A 2022-07-20 2022-07-20 Histological image segmentation model based on semi-supervised learning Active CN115131565B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210858624.1A CN115131565B (en) 2022-07-20 2022-07-20 Histological image segmentation model based on semi-supervised learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210858624.1A CN115131565B (en) 2022-07-20 2022-07-20 Histological image segmentation model based on semi-supervised learning

Publications (2)

Publication Number Publication Date
CN115131565A CN115131565A (en) 2022-09-30
CN115131565B true CN115131565B (en) 2023-05-02

Family

ID=83384021

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210858624.1A Active CN115131565B (en) 2022-07-20 2022-07-20 Histological image segmentation model based on semi-supervised learning

Country Status (1)

Country Link
CN (1) CN115131565B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116188481B (en) * 2023-04-27 2023-10-20 深圳市大数据研究院 Image segmentation model training method, image processing method, system and electronic equipment
CN116205289B (en) * 2023-05-05 2023-07-04 海杰亚(北京)医疗器械有限公司 Animal organ segmentation model training method, segmentation method and related products
CN117333874A (en) * 2023-10-27 2024-01-02 江苏新希望科技有限公司 Image segmentation method, system, storage medium and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109741332A (en) * 2018-12-28 2019-05-10 天津大学 A kind of image segmentation and mask method of man-machine coordination
WO2021157863A1 (en) * 2020-02-05 2021-08-12 주식회사 스파이더코어 Autoencoder-based graph construction for semi-supervised learning
CN113256639A (en) * 2021-05-27 2021-08-13 燕山大学 Coronary angiography blood vessel image segmentation method based on semi-supervised average teacher model
CN114283329A (en) * 2021-11-16 2022-04-05 华能盐城大丰新能源发电有限责任公司 Semi-supervised remote sensing image semantic segmentation method and equipment based on strong transformation

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070292005A1 (en) * 2006-06-14 2007-12-20 Motorola, Inc. Method and apparatus for adaptive hierarchical processing of print images
US20210279525A1 (en) * 2020-03-05 2021-09-09 Samsung Electronics Company, Ltd. Hierarchy-preserving learning for multi-label classification

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109741332A (en) * 2018-12-28 2019-05-10 天津大学 A kind of image segmentation and mask method of man-machine coordination
WO2021157863A1 (en) * 2020-02-05 2021-08-12 주식회사 스파이더코어 Autoencoder-based graph construction for semi-supervised learning
CN113256639A (en) * 2021-05-27 2021-08-13 燕山大学 Coronary angiography blood vessel image segmentation method based on semi-supervised average teacher model
CN114283329A (en) * 2021-11-16 2022-04-05 华能盐城大丰新能源发电有限责任公司 Semi-supervised remote sensing image semantic segmentation method and equipment based on strong transformation

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Cascade knowledge diffusion network for skin lesion diagnosis and segmentation; Qiangguo Jin et al.; Elsevier; pp. 1–13 *
Hierarchical consistency regularized mean teacher for semi-supervised 3D left atrium segmentation; Shumeng Li et al.; IEEE; full text *
增量跨模态检索方法 (Incremental cross-modal retrieval method); 江朝杰 et al.; full text *

Also Published As

Publication number Publication date
CN115131565A (en) 2022-09-30

Similar Documents

Publication Publication Date Title
CN115131565B (en) Histological image segmentation model based on semi-supervised learning
Abdar et al. A review of uncertainty quantification in deep learning: Techniques, applications and challenges
CN111832501B (en) Remote sensing image text intelligent description method for satellite on-orbit application
CN112132149B (en) Semantic segmentation method and device for remote sensing image
Chen et al. Weakly supervised histopathology image segmentation with sparse point annotations
CN111340738A (en) Image rain removing method based on multi-scale progressive fusion
Cai et al. A robust interclass and intraclass loss function for deep learning based tongue segmentation
Yang et al. Attention-based dynamic alignment and dynamic distribution adaptation for remote sensing cross-domain scene classification
Li et al. Two‐stage single image dehazing network using swin‐transformer
Bi et al. Entropy-weighted reconstruction adversary and curriculum pseudo labeling for domain adaptation in semantic segmentation
Xu et al. EI-HCR: An efficient end-to-end hybrid consistency regularization algorithm for semisupervised remote sensing image segmentation
CN113850012A (en) Data processing model generation method, device, medium and electronic equipment
CN116580243A (en) Cross-domain remote sensing scene classification method for mask image modeling guide domain adaptation
CN116148864A (en) Radar echo extrapolation method based on DyConvGRU and Unet prediction refinement structure
Hudagi et al. Bayes-probabilistic-based fusion method for image inpainting
Wang et al. Single image rain removal via cascading attention aggregation network on challenging weather conditions
CN115797642A (en) Self-adaptive image semantic segmentation algorithm based on consistency regularization and semi-supervision field
Huang et al. ICMiF: Interactive cascade microformers for cross-domain person re-identification
Zhang et al. Scale-progressive multi-patch network for image dehazing
Chen et al. Improving semantic segmentation with knowledge reasoning network
Wang et al. MFCANet: A road scene segmentation network based on Multi-Scale feature fusion and context information aggregation
Wang Semi-supervised semantic segmentation network based on knowledge distillation
Zhengpeng et al. A multimodal feature fusion image dehazing method with scene depth prior
Chen et al. Focal ViT: image transformer catches up with CNN on small datasets
Li et al. A multi-grained unsupervised domain adaptation approach for semantic segmentation

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant