CN114972341A

CN114972341A - WSI image classification method, system and medium based on Bayesian assisted learning

Info

Publication number: CN114972341A
Application number: CN202210895166.9A
Authority: CN
Inventors: 余晋刚; 吴梓浩; 吴锦全
Original assignee: South China University of Technology SCUT
Current assignee: Guangzhou Fangxin Medical Technology Co ltd
Priority date: 2022-07-28
Filing date: 2022-07-28
Publication date: 2022-08-30
Anticipated expiration: 2042-07-28
Also published as: CN114972341B

Abstract

The invention discloses a WSI image classification method, a system and a medium based on Bayesian assisted learning, wherein the method comprises the following steps: acquiring a WSI image and performing threshold segmentation to obtain a pathological image block set; constructing a WSI classification model based on a Bayesian decision theory, wherein the WSI classification model comprises an image block feature extractor, an image block classifier and a feature fusion and classification module; initializing a WSI classification model, and iteratively training the WSI classification model on the pathological image block set by using an EM (effective ray spectroscopy) method to obtain a trained WSI classification model; and inputting the WSI image to be diagnosed into the trained WSI classification model, and outputting the class label of the WSI image to be diagnosed. The WSI classification model is constructed based on the Bayesian decision theory, the image block classifier is introduced to perform an auxiliary learning task, information in the WSI image is fully extracted, and meanwhile, the EM method is used for performing dynamic learning in the iterative training process, so that the WSI classification performance and the model robustness are improved.

Description

WSI image classification method, system and medium based on Bayesian assisted learning

Technical Field

The invention belongs to the technical field of digital pathological full-section image classification, and particularly relates to a WSI image classification method, a system and a medium based on Bayesian assisted learning.

Background

Pathological diagnosis is recognized as a gold standard for clinical cancer diagnosis, and has important significance for the establishment of a treatment scheme and prognosis of cancer patients. In the traditional method, a pathologist carries out pathological diagnosis by dragging and magnifying different positions of a pathological slide by means of a microscope; the diagnosis process is complicated and has strong subjectivity, and the consistency of the diagnosis results given by different doctors to difficult cases is poor. With the progress of the slide digital scanning technology, digital pathology full-section images (WSI) are easier to obtain, and the design of automated and intelligent computer-aided diagnosis methods for digital pathology images is also receiving wide attention. However, the digital pathological full-section image classification diagnosis still faces some technical difficulties, which restrict the application of the auxiliary diagnosis method: 1) the WSI has ultrahigh resolution, such as 100,000x100,000 pixels, and is difficult to directly process by adopting a common deep learning method, so the WSI is generally divided into a plurality of small image blocks (patch), and then classification diagnosis is carried out on the WSI based on the patch, but the WSI has ultrahigh resolution, so the WSI is divided into a great number of patches, and on average, each WSI has 8000 to 15000 patches; when network training is carried out, a large number of intermediate feature graphs need to be stored, so that memory overflow is easily caused, and therefore, under the condition that only WSI integral class labels are provided, all patches cannot be used for carrying out end-to-end training on the feature extractor and the fusion model; 2) the fine pixel level label of the WSI is difficult to obtain, and usually only an integral class label can be obtained, and the label information corresponding to each patch cannot be known; 3) the WSI image has abundant changes of cell morphology, structure and texture, has high heterogeneity, and is less in data volume compared with other medical image data, so that the phenomenon of overfitting of a deep learning model is easily caused.

Aiming at the problems, the CLAM method proposed by Lu.M.Y and the like extracts patch features by utilizing a ResNet-50 network pre-trained by an ImageNet data set, and then learns a fusion module based on an attention mechanism through the patch features to finally realize the classification of WSI; however, the process of training the feature extractor and the fusion model by the method is completely split, so that the classifier is not favorable for learning feature information which is beneficial to distinguishing different types of WSI, and the feature information cannot be sufficiently coupled to improve the classification performance of the WSI, so that the classification performance of the WSI is limited by the quality of the patch features finally. Selecting one with the highest prediction score from all the patches to represent the WSI image based on the maximum pooling multi-instance learning method, and training the whole network by using the patches, wherein the method can simultaneously train a feature extractor and a fusion model; however, training can be performed only through a small number of patches, information in a large number of WSI images is lost, so that the stability of the model is poor, noise is high, the overall performance depends on selection strategies such as clustering, and the learning of a high-quality feature extractor is not facilitated. Hou et al put forward a method for updating Patch classifier with EM algorithm in the document "Patch-based constrained neural network for whole slice image classification", supposing that all Patch class labels in WSI are consistent with WSI whole label, then training a Patch-level classifier, i.e. Patch feature extractor, generating probability graph of WSI through the classifier, and performing Gaussian smoothing processing on the probability graph, selecting higher confidence Patch from the probability graph according to threshold value, using the higher confidence Patch to retrain the classifier, iterating until the selected Patch is consistent, then using the trained classifier to retrain prediction for all patches in WSI, counting prediction class distribution as WSI whole feature, and using the whole feature training logic classifier to perform classification regression on WSI; however, the method assumes that it is unreasonable that all patch category labels in the WSI are consistent with the WSI overall label, and a background area is also included in the label, so that irrelevant noise is introduced, and the performance of the classifier is reduced; meanwhile, the process of training the feature extractor and the fusion model is completely split, which is not beneficial to learning feature information which is beneficial to distinguishing different types of WSI by the classifier, and enables the two structures to be mutually independent and not to be fully coupled so as to improve the classification performance of the WSI; finally, the adoption of the class distribution histogram of patch as the WSI feature is not reasonable, and is not enough to represent rich information in the WSI image.

Disclosure of Invention

The invention mainly aims to overcome the defects of the prior art and provide a WSI image classification method, a system and a medium based on Bayesian assisted learning, which solve the technical problem of poor classification performance caused by the fact that all patches can not be used for updating a feature extractor and a fusion model simultaneously, construct a WSI classification model based on Bayesian theorem, introduce image block classification as an assisted learning task training feature extractor, and fully utilize information in a WSI image to learn a high-quality feature extractor; and meanwhile, the EM method is used for dynamic learning in the iterative training process so as to improve the classification performance of the WSI and the robustness of the model.

In order to achieve the purpose, the invention adopts the following technical scheme:

the first purpose is to provide a WSI image classification method based on Bayesian assisted learning, which comprises the following steps:

acquiring a WSI image and a corresponding class label, converting the WSI image into an HSV space, extracting a foreground area according to saturation, and dividing the foreground area into a plurality of image blocks to obtain a pathological image block set;

constructing a WSI classification model based on a Bayesian decision theory, wherein the WSI classification model comprises an image block feature extractor, an image block classifier and a feature fusion and classification module; the feature fusion and classification module comprises an attention fusion module and a WSI classifier;

the image block feature extractor is used for extracting features of image blocks in the pathological image block set; the image block classifier is used for giving a pseudo label to the image block; the attention fusion module is used for acquiring attention weight corresponding to the image block characteristics; the WSI classifier is used for classifying the WSI images;

initializing a WSI classification model, iteratively training the WSI classification model on the pathological image block set by using an EM (effective ray spectroscopy) method, and optimizing model parameters until the model converges to obtain a trained WSI classification model;

and inputting the WSI image to be diagnosed of the cancer patient into the trained WSI classification model, outputting the classification result of the WSI image to be diagnosed and drawing a probability thermodynamic diagram.

As an optimal technical scheme, the WSI classification model is built based on a Bayesian decision theory and is built in a pathological image block set

The training learning parameters areθ=(θ ₁ ,θ ₂ ,θ ₃ ) WSI classification modelF _θ Wherein

Is shown asiA sheet of the WSI image is printed,

is shown asiIn a WSI imagenThe number of the image blocks is one,N _i is shown asiThe number of image blocks of a WSI image block,y _i is shown asiA category label corresponding to the WSI image,θ ₁ are parameters of the image block feature extractor,θ ₂ for the parameters of the feature fusion and classification module,θ ₃ for the parameters of the image block classifier, based on the input WSI imageXPredicting category labelsy= F _θ (X) The method specifically comprises the following steps:

will train the learning parameters asθ=(θ ₁ ,θ ₂ ,θ ₃ ) WSI classification modelF _θ Defined as the maximum likelihood estimation problem, expressed as:

wherein the content of the first and second substances,θ ^* representing theoretical parameters of the WSI classification modelDI represents the number of WSI images in the pathological image block set;

introducing hidden variables

A class label set for representing the image block is obtainedObjective function of WSI classification model:

wherein the content of the first and second substances,

representing each image block

The implicit true category label is that the user is,Crepresenting a set of category labels contained in the WSI image.

As a preferred technical scheme, the image block feature extractor adopts a ResNet-50 network as a backbone network;

the initialization WSI classification model specifically comprises the following steps:

pre-training an image block feature extractor on an ImageNet data set, and initializing parameters of the image block feature extractor;

the initialized image block feature extractor encodes an input image block into a 1024-dimensional feature vector;

and initializing the parameters of the image block classifier, the attention fusion module and the WSI classifier by adopting random initialization.

As a preferred technical scheme, before iteratively training a WSI classification model on a pathological image block set by using an EM method, an initialized image block feature extractor is used to extract initialized image block features of the pathological image block set, and then an attention fusion module and a WSI classifier are trained by using the initialized image block features and class labels corresponding to WSI images, and parameters of a feature fusion and classification module are updated;

and classifies the image blocks

Fraction value of corresponding category label of all image blocks belonging to WSI image in medium prediction

The number of the given 1 s is 1,

is shown asiIn a WSI imagenPrediction score values for image blocks.

As a preferred technical scheme, the step of iteratively training the WSI classification model by the EM method comprises E-step and M-step;

the E-step is based on the input WSI imageX _i And corresponding category labely _i And parameters of the WSI classification modelθ ^t() Assigning a pseudo label to each image block in the set of pathological images, whereinθ ^t() Denotes the firsttModel parameters of the round iteration;

defining an objective function based on a WSI classification model

And according to the equation

And decomposing the objective function of the WSI classification model into:

wherein the content of the first and second substances,

the set of image block classes representing the prediction by the WSI classification model is equal to the a posteriori probability of the true label underlying the image block,J(θ,θ ^t() ) Indicating WSI Classification model usagetParameters of a wheel iteration modelθ ^t() Training to obtain model parameters ofθThe objective function of (1);

and further expanding to obtain an objective function expansion formula of the WSI classification model:

determining parameters of the M-step for updating the image block feature extractor according to an objective function expansion of a WSI classification modelθ ₁ And parameters of image block classifierθ ₃ Then according to the parameters of image block feature extractorθ ₁ Updating parameters of a feature fusion and classification moduleθ ₂ 。

As a preferable technical scheme, the E-step is specifically as follows:

inputting the pathological image block data set into a WSI classification model, and using an image block feature extractor

Extracting image block features:

wherein the content of the first and second substances,

is shown asiIn a WSI imagenPage image block characteristics;

inputting the image data into an attention fusion module omega to obtain attention weight values of all image blocks in the WSI imagea _n } _i ：

Wherein the leafa _n } _i Is shown asiIn a WSI imagenAttention weight of the image blocks;

simultaneous input of image block features into image block classifier

：

The attention weighted value of the same image block is readya _n } _i And score value

Performing dot product to obtain integral fraction of pathological image block sets _n } _i ：

Making an integral score of a pathological image block sets _n } _i Sorting in descending order, and assigning pseudo label to image block according to threshold lambdaz _n } _i ；

The M-step comprises M1-step and M2-step, and specifically comprises the following steps:

m1-step: pseudo label for using image blockz _n } _i Training image block feature extractor by full-supervision training method

And an image block classifier

；

M2-step: fixed image block feature extractor

The parameters of the image block are extracted again, and the updated image block features are usedClass label retraining feature fusion and classification module corresponding to WSI image

The parameter (c) of (c).

As a preferred technical scheme, the loss function of the WSI classification model is constructed by adopting cross entropy, and comprises an L1 loss function and an L2 loss function;

the L1 loss function is used to train the updated WSI classifier, and is expressed as:

wherein N represents the total number of classes predicted by the WSI classification model,

representing a WSI classification model predicting that a WSI image belongs to a categoryiThe score of (a) is calculated,y _i e {0,1} represents a category label corresponding to the WSI image subjected to 0-1 encoding;

the L2 loss function is used to train the update image block classifier, represented as:

wherein N +1 represents the total number of classes predicted by the image block classifier,

representing image block classifier to predict image block to belong to WSI image real categoryiThe value of the fraction of (c) is,z _i e {0,1} represents the image block pseudo label after 0-1 encoding.

As a preferred technical solution, the mapping probabilistic thermodynamic diagram is used for diagnosing cancer patients, and specifically includes:

inputting the WSI image to be diagnosed of the cancer patient into the trained WSI classification model, and outputting a classification result;

drawing the classification resultFractional values of all image blocks in the WSI image to be diagnosed predicted by the image block classifier

Obtaining the probability values of the image blocks belonging to the various categories corresponding to the WSI image to be diagnosed through the normalization of the softmax function;

creating an all-0 matrix, and downsampling the length and the width of the matrix for a WSI image to be diagnosednDoubling;

performing coordinate conversion according to the position of the image block in the WSI image to be diagnosed, and filling the probability value of the image block into the corresponding position of the matrix to obtain a probability matrix;

and constructing a color mapping table corresponding to different probability values, mapping colors to corresponding positions of the WSI image to be diagnosed according to the probability values of the probability matrix, and obtaining a probability thermodynamic diagram.

The second purpose is to provide a WSI image classification system based on Bayesian assisted learning, which is applied to the WSI image classification method based on Bayesian assisted learning and comprises a data acquisition module, a model construction module, a model training module and a diagnosis module;

the data acquisition module is used for acquiring a WSI image and a corresponding class label, converting the WSI image into an HSV space, extracting a foreground area according to saturation and dividing the foreground area into a plurality of image blocks to obtain a pathological image block set;

the model construction module is used for constructing a WSI classification model based on a Bayesian decision theory, and the WSI classification model comprises an image block feature extractor, an image block classifier and a feature fusion and classification module; the feature fusion and classification module comprises an attention fusion module and a WSI classifier;

the model training module is used for initializing the WSI classification model, iteratively training the WSI classification model on the pathological image block set by using an EM (effective electromagnetic radiation) method, and optimizing model parameters until the model converges to obtain the trained WSI classification model;

the diagnosis module is used for inputting the WSI images to be diagnosed of the cancer patients into the trained WSI classification model, outputting the classification results of the WSI images to be diagnosed and drawing a probability thermodynamic diagram.

A third object is to provide a computer-readable storage medium, which stores a program that, when executed by a processor, implements the above-mentioned WSI image classification method based on bayesian-assisted learning.

Compared with the prior art, the invention has the following advantages and beneficial effects:

firstly, the invention trains the feature extractor in the WSI classification model by introducing an image block classification auxiliary task, so that rich feature expression with discrimination capability can be better learned from highly heterogeneous digital pathology full-section image data, and the accuracy of WSI image classification can be substantially improved;

secondly, the problem modeling and formula derivation are carried out by combining the Bayesian decision theory, and the method is different from other WSI classification deep learning methods.

Thirdly, a better image block classifier can be obtained according to the introduced auxiliary learning task, the WSI image can be conveniently scanned, a corresponding probability thermodynamic diagram is obtained, and a better visualization effect is achieved; and the visualization result can help a doctor to locate an interested region in the WSI image, thereby being beneficial to the diagnosis of cancer.

Drawings

In order to more clearly illustrate the technical solutions in the embodiments of the present application, the drawings needed to be used in the description of the embodiments are briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present application, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without creative efforts.

FIG. 1 is a flowchart of a WSI image classification method based on Bayesian-assisted learning according to an embodiment of the present invention;

FIG. 2 is a schematic flow chart of iterative training of a WSI classification model by using an EM method according to an embodiment of the present invention;

FIG. 3 is a class activation diagram generated by the image block classifier for different iterations according to an embodiment of the present invention;

FIG. 4 is a probability thermodynamic diagram obtained by predicting all patches in a WSI image by an image block classifier under different iteration times in the embodiment of the present invention;

FIG. 5 is a diagram illustrating the structure of a WSI image classification system based on Bayesian-assisted learning according to an embodiment of the present invention;

fig. 6 is a block diagram of a computer-readable storage medium according to an embodiment of the present invention.

Detailed Description

In order to make the technical solutions better understood by those skilled in the art, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application. It is to be understood that the embodiments described are only a few embodiments of the present application and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.

Reference in the specification to "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the specification. The appearances of the phrase in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. It is explicitly and implicitly understood by one skilled in the art that the embodiments described herein can be combined with other embodiments.

Referring to fig. 1, in an embodiment of the present application, a WSI image classification method based on bayesian-assisted learning is provided, including the following steps:

s1, acquiring the WSI image and the corresponding class label, converting the WSI image into an HSV space, extracting a foreground region according to saturation, and dividing the foreground region into a plurality of image blocks to obtain a pathological image block set;

firstly, acquiring a digital pathological full-section WSI imageX _i And its corresponding category labely _i Then the WSI image is processedX _i Converting the space into HSV space, extracting a foreground region according to the saturation, extracting an effective pathological tissue part from the foreground region, and removing most of the remaining white background region, so that a large amount of storage cost can be saved, and meanwhile, the noise of a data set is reduced; then, the tissue region of the WSI image is divided into 256 × 256 sized image blocks (patch) at 20 × magnification to obtain a pathological image block set, so that the subsequent processing can be performed using the deep neural network.

S2, constructing a WSI classification model based on a Bayesian decision theory, wherein the WSI classification model comprises an image block feature extractor, an image block classifier and a feature fusion and classification module; the feature fusion and classification module comprises an attention fusion module and a WSI classifier;

the image block feature extractor is used for extracting features of image blocks in the pathological image block set; the image block classifier is used for giving a false label to the image block; the attention fusion module is used for acquiring attention weight corresponding to the image block characteristics; the WSI classifier is used for classifying the WSI images.

Based on Bayesian decision theory, by using the pathological image block set

Is shown asiA sheet of the WSI image is printed,

is shown asiIn a WSI imagenThe number of image blocks is one,N _i is shown asiThe number of image blocks of a WSI image block,θ ₁ are parameters of the image block feature extractor,θ ₂ for the parameters of the feature fusion and classification module,θ ₃ parameters of image block classifier are used to make WSI classification modelF _θ Capable of being based on inputted WSI imageXPredicting category labelsy = F _θ (X) The method specifically comprises the following steps:

will train the learning parameters asθ=(θ ₁ ,θ ₂ ,θ ₃ ) WSI classification modelF _θ Is defined as the Maximum Likelihood Estimation (MLE) problem, expressed as:

wherein the content of the first and second substances,θ ^* theoretical parameters representing the WSI classification modelDI represents the number of WSI images in the pathological image block set;

the classification process of each WSI image is mutually independent, and then hidden variables are introduced into the maximum likelihood estimation function

And representing a class label set of the image block to obtain an objective function of the WSI classification model:

wherein the content of the first and second substances,

representing each image block

Due to hidden variablesZ _i Is unknown and therefore cannot directly optimize the objective function to obtain the normType parameters, therefore the EM method is used for iterative optimization until the model stops converging.

S3, initializing the WSI classification model, iteratively training the WSI classification model on the pathological image block set by using an EM (effective ray spectroscopy) method, and optimizing model parameters until the model converges to obtain the trained WSI classification model;

in the embodiment, the image block feature extractor adopts a ResNet-50 network as a backbone network;

in the stage of initializing the WSI classification model, pre-training an image block feature extractor on an ImageNet data set, and initializing parameters of the image block feature extractor; the initialized image block feature extractor can encode the input image block into a 1024-dimensional feature vector;

and initializing parameters of the image block classifier, the attention fusion module and the WSI classifier by adopting random initialization.

Considering that if the E-Step of the first iteration is directly performed by using the parameters obtained by random initialization, the pseudo label of each image block is predicted, the quality of the pseudo label is poor, and the performance of the next iteration process is possibly further influenced; and feature extractor

And feature fusion and classification module

The combination of the two is actually a multi-class attention multi-instance learning model, and since the attention multi-instance learning model is used in a WSI image classification task, all image blocks are usually encoded into feature vectors by using a pre-training network, and then a subsequent attention fusion module is trained, which is equivalent to freezing parameters of a preceding feature extractor, before performing E-step of a first iteration, initialized image block features of a pathological image block set are extracted by using the initialized image block feature extractor, and then the attention fusion module and the WSI classifier are trained by using the initialized image block features and class labels corresponding to the WSI images, which is equivalent to updating once first

The parameters of (1); and classifies the image blocks

The number of the given 1 s is given to the target,

is shown asiIn a WSI imagenPrediction score values for image blocks.

As shown in fig. 2, the process of iteratively training the WSI classification model using the EM method is described in detail below:

the step of iteratively training the WSI classification model by the EM method comprises E-step and M-step;

where the E-step key is to estimate the posterior probability distribution

I.e. from the input WSI imageX _i And corresponding category labely _i And parameters of the WSI classification modelθ ^t() Assigning a pseudo label to each image block in the set of pathological images, whereinθ ^t() Is shown astThe model parameters of the round iteration are specifically as follows:

Extracting image block features:

wherein the content of the first and second substances,

is shown asiIn a WSI imagenPage image block characteristics;

simultaneous input of image block features into image block classifier

：

Making an integral score of a pathological image block sets _n } _i Sorting in descending order, and assigning pseudo label to image block according to threshold lambdaz _n } _i (ii) a Image with score value higher than threshold value λ in the present embodimentThe method comprises the steps that a block is endowed with a class label corresponding to a WSI image, image blocks with the score values lower than a threshold lambda are regarded as irrelevant to classification, and the image blocks are classified into a background class;

in practical process, whole fraction of all image blocks in different WSI imagess _n } _i The value range is different, the threshold lambda aiming at different WSI images is difficult to confirm, and the threshold has high influence on the quality of the pseudo label, so the invention adopts an approximate method according to a great Chinese styles _n } _i Selecting a top-N image block with the highest score to be endowed with a WSI real category label, and dividing the top-N image block with the lowest score into a background category; since the threshold change only affects the category of the image blocks with the scores in the middle, and the average number of the image blocks in the WSI image is 8000 to 15000, the image blocks at the head end and the tail end cannot be affected; on the other hand, if all image blocks are used to train the feature extractor

The introduction of noise is inevitable and this way reduces the number of image blocks used for training and thus improves their quality, so that

Features that are beneficial for WSI classification are easier to learn.

M-step is to update the parameters of the WSI classification model so as to make the objective function of the WSI classification model reach the maximum;

defining an objective function based on a WSI classification model

And according to the equation

And decomposing the objective function of the WSI classification model into:

wherein the content of the first and second substances,

the set of image block classes representing the prediction by the WSI classification model is equal to the a posteriori probability of the true label underlying the image block,J(θ,θ ^t() ) Indicating WSI Classification model usagetParameters of a wheel iteration modelθ ^t() Training to obtain model parameters ofθCan be understood as: from an input WSI imageX _i And the parameters areθ ^t() The WSI classification model predicts that the WSI class label is equal to the corresponding class label of the WSI imagey _i And according to the input WSI imageX _i With the parameters ofθ ^t() The WSI classification model and the corresponding class label of the WSI imagey _i The prediction image block class label is equal to the likelihood that the image block implies a true label. The maximum likelihood estimation is to find the optimal model parameters

So that the probability that the predicted WSI class label and image block class label are equal to the respective true labels is maximized, and thereforeJ(θ,θ ^t() ) Is also an objective function to be optimized;

therefore, according to the expansion, M-step first optimizes the parameters of the first term, i.e. updates the parameters of the image block feature extractorθ ₁ And parameters of image block classifierθ ₃ Then the updated parameters are used againθ ₁ To optimizeParameters of two termsθ ₂ I.e. based on the parameters of the image block feature extractorθ ₁ Updating parameters of a feature fusion and classification moduleθ ₂ And completing the parameter updating of the whole WSI classification model, specifically comprising the following steps:

the M-step is divided into two steps according to the expansion, the first step M1-step: pseudo label for using image blockz _n } _i Training image block feature extractor by full-supervision training method

And an image block classifier

；

Second step M2-step: fixed image block feature extractor

The pathological image blocks are encoded again, and the encoded image block features and the class labels corresponding to the WSI images are used for retraining the parameters of the attention fusion module and the WSI classifierθ ₂ 。

In iterative training, a loss function of the WSI classification model is constructed by adopting cross entropy, and the loss function comprises an L1 loss function and an L2 loss function;

wherein N represents the total number of categories predicted by the WSI classification model,

the L2 loss function is used to train the update graph block classifier, represented as:

representing image block classifier to predict image block to belong to WSI image real categoryiThe value of the fraction of (c) is,z _i e {0,1} represents the image block pseudo label after 0-1 coding.

And S4, inputting the WSI image to be diagnosed of the cancer patient into the trained WSI classification model, outputting the classification result of the WSI image to be diagnosed and drawing a probability thermodynamic diagram.

The probabilistic thermodynamic diagram can clearly reflect which part of the image belongs to the category 1 and which part of the image belongs to the category 2 in the binary problem; if the multi-classification problem is analyzed, a probability thermodynamic diagram for performing two-classification on each class and other classes needs to be drawn separately; in this embodiment, two classification tasks on the Camelyon data set to determine whether a cancer patient has breast cancer metastasis are taken as examples, and specific steps for drawing a probability thermodynamic diagram are described as follows:

after a trained WSI classification model is obtained, the WSI image to be diagnosed is input, and the image block classifier is used for predicting the score values of all image blocks in the WSI image

After the value range of the score value is changed into 0,1 after the normalization of the softmax function]Taking the normalized scores as probability values of the image blocks belonging to all the categories, wherein the sum of the scores of the image blocks belonging to all the categories is 1;

then, an all-0 matrix is created, the length and width of which are downsampled for the WSI image to be diagnosednObtaining the length and width of the matrix by down-sampling 64 times in the embodiment;

according to the position of an image block in the WSI image to be diagnosed, simple coordinate conversion is carried out, the probability value predicted by each image block is filled in the corresponding position of a matrix, and finally a probability matrix is obtained, wherein the values in the probability matrix are all decimals between 0 and 1, and the size of the probability matrix is 64 times of the size of the WSI image to be diagnosed;

In this embodiment, a color mapping table from blue to red is constructed, and the color mapping table corresponds to different probability values, where a probability value of 0 is represented by pure blue, a probability value of 1 is represented by pure red, and a probability value of 0 to 1 is mapped to a gradient color between pure blue and pure red; finally, a probability thermodynamic diagram equal to the size of the WSI image 1/64 to be diagnosed is obtained, the probability of breast cancer metastasis of each region in the WSI image is reflected, the probability that metastasis occurs is higher when the color is red, and the probability value that metastasis occurs is lower when the color is blue; the pathologist can directly and quickly lock the area with the breast cancer metastasis from the red areas and then carry out more careful examination, thereby accelerating the diagnosis process.

The existing WSI image weak supervision classification method does not fully learn the characteristics of the image, and more, the WSI classification accuracy is improved by learning a better fusion model, such as a multi-example learning model, and fusing information of different patches; however, as can be seen from the machine learning principle, only better features are learned to enable the system to generate better pattern discrimination capability. Therefore, the invention adopts a new idea, effectively processes image data with high heterogeneity by directly learning more discriminative image characteristics, and essentially improves the accuracy of WSI classification diagnosis.

The invention provides a WSI image classification method (BAL) based on Bayesian assisted learning, which is characterized in that image block classification is used as an auxiliary task, a WSI classification model is constructed by applying Bayesian decision theory, and model parameters are updated by using EM algorithm, so that end-to-end learning from a feature extractor to a feature fusion and classification module is realized. Compared with the prior art, the method has the advantages that the features of the image blocks are effectively and dynamically updated, and the features with rich judgment information are finally learned, so that the accuracy of WSI classification is improved; in addition, the invention provides a more reasonable strategy for labeling the false labels, and the local information of the image block classification and the global information required by the WSI classification are fully combined, so that the image block classification labels with high quality are screened out and used for training the image block classifier; meanwhile, the image block classifier obtained by the auxiliary task learning can be used for generating a probability thermodynamic diagram corresponding to the WSI, so that the accurate positioning of a cancerous region is realized, and the interpretability of the WSI auxiliary diagnosis system is greatly improved.

To verify the effectiveness of the method of the invention, experiments were performed on open datasets of three different cancer types, the three datasets being the Camelyon16 dataset, the TCGA-NSCLC dataset and the TCGA-RCC dataset, respectively;

the camellyon 16 data set is a public data set for pathological biopsy of the sentinel lymph node of the breast cancer, and comprises 270H & E staining WSIs in a training set and 129 WSI images in a testing set, and is used for diagnosing whether the sentinel lymph node of the breast cancer is transferred or not and performing a classification task; in the test, 15% of samples are randomly extracted from a training set to serve as a verification set, the rest 85% of data are used as training, and a test set divided by an original data set is reserved for testing.

The TCGA-NSCLC data set is a non-small cell lung cancer digital pathological full-section image data set, comprises 535 lung adenocarcinoma WSIs and 512 lung squamous carcinoma WSIs, and 1047 WSI images in total, and is used for typing two subtypes of non-small cell lung cancer and performing a classification task; in order to carry out more sufficient tests, the training/testing set is divided by common 5-fold cross validation, the proportion of adenocarcinoma/squamous carcinoma of each fold is ensured to be the same as that of the original data set, and in each division, 20% of samples are randomly selected from the training set to serve as a validation set for selecting models and hyper-parameters.

TCGA-RCC is a renal cell carcinoma digital pathology whole slice image dataset, comprises 297 renal papillary cell carcinoma WSI, 121 renal chromophobe cell carcinoma WSI and 519 renal clear cell carcinoma WSI, and 937 WSI images in total, and is used for typing three subtypes of renal cell carcinoma and performing three classification tasks; the test was also performed using a 5-fold cross-validation method, as described in the TCGA-NSCLC data set.

The test selects MAX-MIL, ABMIL, EM-CNN, CLAM and TransMIL methods in the prior art to carry out the test on three public data sets, and the results are shown in the following table:

from the above table, the final performance of the method of the present invention exceeds that of the existing method on three data sets, wherein BAL (t0) represents the model of the method of the present invention when just initialized, BAL (t3) represents the model of the method of the present invention after 3 rounds of EM iteration, it can be seen that the performance of the model just initialized is not as good as that of the current optimal methods such as CLAMs and transamls, but after 3 rounds of EM iteration, the performance of the model is greatly improved and finally exceeds that of other methods, which proves that the iterative updating of the feature extractor and the feature fusion and classification model by using the EM algorithm is effective and can help for WSI classification.

In addition, as shown in fig. 3, the given 6 image blocks are images containing breast cancer lymph node metastasis, it can be seen that the class activation map can better cover the tumor region in the patch, and as the EM iteration proceeds from left to right, the class activation map has better coverage degree in the tumor region, and the region outline is more obvious, so that it should be proved that the discrimination capability of the image block classifier in the EM iteration is improved, so that the network can learn better feature expression, and the image block classifier obtained by the auxiliary task can realize better tumor region localization capability, which is deficient in other WSI classification methods.

In addition, as shown in fig. 4, 3 WSIs are selected from the Camelyon16 data set of breast cancer lymph node metastasis, and probability thermodynamic diagrams of the WSIs drawn at different iteration times, so that as the iteration progresses, the probability thermodynamic diagrams can better cover the region of the WSI image where the breast cancer metastasis occurs, and help a doctor quickly locate a region of interest in the WSI image to assist the doctor in diagnosis.

It should be noted that, for the sake of simplicity, the foregoing method embodiments are described as a series of acts or combinations, but those skilled in the art should understand that the present invention is not limited by the described order of acts, as some steps may be performed in other orders or simultaneously according to the present invention.

Based on the same idea as the WSI image classification method based on Bayesian assisted learning in the embodiment, the invention also provides a WSI image classification system based on Bayesian assisted learning, and the system can be used for executing the WSI image classification method based on Bayesian assisted learning. For convenience of illustration, the structural diagram of the WSI image classification system based on bayesian-assisted learning in the embodiment of the present invention only shows the parts related to the embodiment of the present invention, and those skilled in the art will understand that the illustrated structure does not constitute a limitation to the apparatus, and may include more or less components than those illustrated, or combine some components, or arrange different components.

Referring to fig. 5, in another embodiment of the present application, a WSI image classification system based on bayesian-assisted learning is provided, the system includes a data acquisition module, a model construction module, a model training module, and a diagnosis module;

the data acquisition module is used for acquiring a WSI image and a corresponding class label, converting the WSI image into an HSV space, extracting a foreground region according to saturation and dividing the foreground region into a plurality of image blocks to obtain a pathological image block set;

the model construction module is used for constructing a WSI classification model based on a Bayesian decision theory, and comprises an image block feature extractor, an image block classifier and a feature fusion and classification module; the feature fusion and classification module comprises an attention fusion module and a WSI classifier;

It should be noted that, the WSI image classification system based on bayesian-aided learning of the present invention corresponds to the WSI image classification method based on bayesian-aided learning of the present invention one to one, and the technical features and the beneficial effects thereof described in the above embodiment of the WSI image classification method based on bayesian-aided learning are both applicable to the embodiment of the WSI image classification system based on bayesian-aided learning, and specific contents can be referred to the description in the embodiment of the method of the present invention, which is not repeated herein, and thus the present invention is declared.

In addition, in the implementation of the WSI image classification system based on bayesian-assisted learning in the foregoing embodiment, the logical division of each program module is only an example, and in practical applications, the foregoing function allocation may be performed by different program modules according to needs, for example, due to configuration requirements of corresponding hardware or convenience of implementation of software, that is, the internal structure of the WSI image classification system based on bayesian-assisted learning is divided into different program modules to perform all or part of the functions described above.

Referring to fig. 6, in an embodiment, a computer-readable storage medium is provided, in which a program is stored in a memory, and when the program in the memory is executed by a processor, the method for classifying a WSI image based on bayesian-assisted learning is implemented, and specifically, the method includes:

the image block feature extractor is used for extracting features of image blocks in the pathological image block set; the image block classifier assigns a false label to the image block; the attention fusion module is used for acquiring attention weight corresponding to the image block characteristics; the WSI classifier is used for classifying the WSI images;

It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a non-volatile computer-readable storage medium, and can include the processes of the embodiments of the methods described above when the program is executed. Any reference to memory, storage, database, or other medium used in the embodiments provided herein may include non-volatile and/or volatile memory, among others. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), Rambus Direct RAM (RDRAM), direct bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).

The technical features of the above embodiments can be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the above embodiments are not described, but should be considered as the scope of the present specification as long as there is no contradiction between the combinations of the technical features.

The above embodiments are preferred embodiments of the present invention, but the present invention is not limited to the above embodiments, and any other changes, modifications, substitutions, combinations, and simplifications which do not depart from the spirit and principle of the present invention should be construed as equivalents thereof, and all such changes, modifications, substitutions, combinations, and simplifications are intended to be included in the scope of the present invention.

Claims

1. The WSI image classification method based on Bayesian assisted learning is characterized by comprising the following steps of:

the image block feature extractor is used for extracting features of image blocks in the pathological image block set; the image block classifier is used for giving a false label to the image block; the attention fusion module is used for acquiring attention weight corresponding to the image block characteristics; the WSI classifier is used for classifying the WSI images;

2. The WSI image classification method based on Bayesian assisted learning as per claim 1, wherein the WSI classification model is constructed based on Bayesian decision theory and is constructed by collecting pathological image blocks

Is shown asiA sheet of the WSI image is printed,

is shown asiIn a WSI imagenThe number of image blocks is one,N _i is shown asiThe number of image blocks of a WSI image block,y _i is shown asiA category label corresponding to the WSI image,θ ₁ are parameters of the image block feature extractor,θ ₂ for the parameters of the feature fusion and classification module,θ ₃ for the parameters of the image block classifier, based on the input WSI imageXPredicting category labelsy= F _θ (X) The method specifically comprises the following steps:

introducing hidden variables

wherein the content of the first and second substances,

representing each image block

3. The WSI image classification method based on Bayesian assisted learning of claim 2, wherein the image block feature extractor adopts a ResNet-50 network as a backbone network;

4. The WSI image classification method based on Bayesian assisted learning according to claim 3, wherein before an EM method is used for iterative training of a WSI classification model on a pathological image block set, an initialized image block feature of the pathological image block set is extracted by using an initialized image block feature extractor, and then an attention fusion module and a WSI classifier are trained by using the initialized image block feature and a class label corresponding to the WSI image, so that parameters of a feature fusion and classification module are updated;

and classifies the image blocks

The number of the given 1 s is given to the target,

is shown asiIn a WSI imagenPrediction score values for image blocks.

5. The WSI image classification method based on Bayesian assistant learning of claim 4, wherein the step of iteratively training WSI classification models by EM method comprises E-step and M-step;

the E-step is based on the input WSI imageX _i And corresponding category labely _i And parameters of the WSI classification modelθ ^t() Assigning a pseudo label to each image block in the set of pathological images, whereinθ ^t() Is shown astModel parameters of the round iteration;

defining an objective function based on a WSI classification model

And according to the equation

And decomposing the objective function of the WSI classification model into:

wherein the content of the first and second substances,

the set of image block classes representing the prediction of the WSI classification model is equal to the a posteriori probability of the true label underlying the image block,J(θ,θ ^t() ) Indicating WSI Classification model usagetParameters of a wheel iteration modelθ ^t() Training to obtain model parameters ofθThe objective function of (1);

6. The WSI image classification method based on Bayesian assisted learning as recited in claim 5, wherein the E-step is specifically:

Extracting image block features:

wherein the content of the first and second substances,

is shown asiIn a WSI imagenPage image block characteristics;

simultaneous input of image block features into image block classifier

：

And an image block classifier

；

M2-step: fixed image block feature extractor

The parameters of the method are used for extracting the characteristics of the pathological image block again, and the updated image block characteristics and the class label corresponding to the WSI image are used for retraining the characteristic fusion and classification module

The parameter (c) of (c).

7. The WSI image classification method based on Bayesian assistant learning of claim 6, wherein the loss function of the WSI classification model is constructed by adopting cross entropy, and comprises an L1 loss function and an L2 loss function;

representing that an image block classifier predicts that an image block belongs to a WSI image true categoryiThe value of the fraction of (c) is,z _i e {0,1} represents the image block pseudo label after 0-1 encoding.

8. The WSI classification diagnosis method based on Bayesian assisted learning according to claim 7, wherein the drawing probability thermodynamic diagram specifically comprises:

the fraction values of all image blocks in the WSI image to be diagnosed, which are predicted by the image block classifier in the classification result

creating an all-0 matrix, wherein the length and the width of the matrix are down-sampled for the WSI image to be diagnosednDoubling;

9. The WSI image classification system based on Bayesian assisted learning is characterized by being applied to the WSI image classification method based on Bayesian assisted learning in any one of claims 1-8, and comprising a data acquisition module, a model construction module, a model training module and a diagnosis module;

10. A computer-readable storage medium storing a program, wherein the program, when executed by a processor, implements the bayesian-aided learning-based WSI image classification method of any of claims 1-8.