WO2021073279A1 - Staining normalization method and system for digital pathological image, electronic device and storage medium - Google Patents

Staining normalization method and system for digital pathological image, electronic device and storage medium Download PDF

Info

Publication number
WO2021073279A1
WO2021073279A1 PCT/CN2020/112366 CN2020112366W WO2021073279A1 WO 2021073279 A1 WO2021073279 A1 WO 2021073279A1 CN 2020112366 W CN2020112366 W CN 2020112366W WO 2021073279 A1 WO2021073279 A1 WO 2021073279A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
gaussian mixture
hsd
mixture model
staining
Prior art date
Application number
PCT/CN2020/112366
Other languages
French (fr)
Chinese (zh)
Inventor
南洋
王佳平
李风仪
侯晓帅
谢春梅
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2021073279A1 publication Critical patent/WO2021073279A1/en

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H30/00ICT specially adapted for the handling or processing of medical images
    • G16H30/40ICT specially adapted for the handling or processing of medical images for processing medical images, e.g. editing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/90Determination of colour characteristics
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]

Definitions

  • This application relates to the field of image processing of digital medicine, and in particular to a method, system, electronic device, and storage medium for normalizing digital pathological image staining.
  • kidney diseases Due to the wide variety of kidney diseases, the etiology and pathogenesis are complex, the clinical manifestations of many kidney diseases are not completely consistent with the histological changes of the kidney, and the treatment plans and the development results of the disease are also very different.
  • results of renal pathological examinations have become a gold indicator for the diagnosis of kidney diseases.
  • pathologists perform renal biopsy, they need to obtain some important medical indicators based on visual and empirical observations. Now they plan to assist pathologists in reading pictures through artificial intelligence.
  • the size of the feature block generated by the cutting is different.
  • the existing normalization of coloring is usually achieved through methods such as confrontation generation networks and variational autoencoders. Although this method can perform style transfer, However, uncontrollable noise points are easily generated, which may change the image structure. Therefore, a more stable and natural dyeing normalization method is urgently needed.
  • This application provides a method, system, electronic device and storage medium for staining and normalization of digital pathological images. Its main purpose is to solve the problem that the current staining normalization can be achieved by confrontation generation networks, variational autoencoders and other methods. Perform style transfer, but it is easy to produce uncontrollable noise points, which may change the problem of image structure, and effectively improve the stability of the stained normalized image.
  • the present application provides a method for normalizing digital pathological image staining, which is applied to digital pathological images, and includes the following steps:
  • I R is the two-dimensional matrix of the R channel in the RGB image
  • I G is the two-dimensional matrix of the G channel in the RGB image
  • I B is the two-dimensional matrix of the B channel in the RGB image
  • S120 Perform HSD conversion on the RGB image I(x, y) according to a preset conversion rule, and convert the RGB image into an HSD image;
  • S130 Use the HSD image to continuously train a deep convolutional Gaussian mixture model to extract Gaussian mixture models for solving images of different styles, until an optimal deep convolutional Gaussian mixture model is obtained;
  • S140 Perform staining normalization on the HE stained digital pathology image of the histopathology to be detected by the optimal deep convolution Gaussian mixture model.
  • this application provides a digital pathological image staining normalization system, including:
  • the RGB unit performs data analysis on the pre-stored digital pathological slice image to generate an RGB image I(x,y),
  • I R is the two-dimensional matrix of the R channel in the RGB image
  • I G is the two-dimensional matrix of the G channel in the RGB image
  • I B is the two-dimensional matrix of the B channel in the RGB image
  • the HSD unit performs HSD conversion on the RGB image I(x, y) according to a preset conversion rule, and converts the RGB image into an HSD image;
  • the convolutional neural network unit uses the HSD image to continuously train a deep convolutional Gaussian mixture model to extract Gaussian mixture models for solving different styles of images, until an optimal deep convolutional Gaussian mixture model is obtained;
  • the staining normalization unit performs staining normalization on the HE stained digital pathology image of the histopathology to be detected through the optimal depth convolution Gaussian mixture model.
  • the present application provides an electronic device, which includes a memory, a processor, and a coloring normalization program for a digital pathological image stored in the memory, and the coloring of the digital pathological image is normalized
  • the program is executed by the processor, the following steps are performed:
  • I R is the two-dimensional matrix of the R channel in the RGB image
  • I G is the two-dimensional matrix of the G channel in the RGB image
  • I B is the two-dimensional matrix of the B channel in the RGB image
  • S120 Perform HSD conversion on the RGB image I(x, y) according to a preset conversion rule, and convert the RGB image into an HSD image;
  • S130 Use the HSD image to continuously train a deep convolutional Gaussian mixture model to extract Gaussian mixture models for solving images of different styles, until an optimal deep convolutional Gaussian mixture model is obtained;
  • S140 Perform staining normalization on the HE stained digital pathology image of the histopathology to be detected by the optimal deep convolution Gaussian mixture model.
  • the present application also provides a computer-readable storage medium, wherein the computer-readable storage medium stores a staining normalization program for a digital pathological image, and the staining of the digital pathological image is unified.
  • the transformation program is executed by the processor, the following steps are implemented:
  • I R is the two-dimensional matrix of the R channel in the RGB image
  • I G is the two-dimensional matrix of the G channel in the RGB image
  • I B is the two-dimensional matrix of the B channel in the RGB image
  • S120 Perform HSD conversion on the RGB image I(x, y) according to a preset conversion rule, and convert the RGB image into an HSD image;
  • S130 Use the HSD image to continuously train a deep convolutional Gaussian mixture model to extract Gaussian mixture models for solving images of different styles, until an optimal deep convolutional Gaussian mixture model is obtained;
  • S140 Perform staining normalization on the HE stained digital pathology image of the histopathology to be detected by the optimal deep convolution Gaussian mixture model.
  • the digital pathological image staining normalization method, system, electronic device, and computer-readable storage medium proposed in this application generate RGB image I(x,y) by analyzing the digital pathological slice image, and then calculate the RGB image I(x,y) according to the conversion rules.
  • the RGB image I(x,y) is subjected to HSD transformation, and then the RGB image is converted into an HSD image.
  • the HSD image is used to continuously train the deep convolutional Gaussian mixture model to extract the Gaussian mixture model for solving different styles of images, and then through the depth
  • the convolutional Gaussian mixture model performs staining normalization on HE stained digital pathological images of histopathology to be detected, which effectively improves the effectiveness and stability of staining normalization.
  • Fig. 1 is a flowchart of an embodiment of a method for normalizing digital pathological image staining according to the present application
  • FIG. 2 is a schematic diagram of RGB image generation according to an embodiment of a method for normalizing staining of digital pathological images according to the present application;
  • Figure 3 is a system framework diagram of an embodiment of the application
  • Fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
  • the size of the feature block generated by the cutting is different.
  • the existing normalization of coloring is usually achieved through methods such as confrontation generation networks and variational autoencoders. Although this method can perform style transfer, However, it is easy to produce uncontrollable noise points, which may change the image structure.
  • this application provides a method for normalizing digital pathology image staining, which combines DCGMM with deep convolutional neural networks to enhance the interpretability of the model and improve the stability of training, so as to perform on the premise of preserving structural information Style transfer provides a guarantee for the accuracy of medical diagnosis.
  • Fig. 1 is a flowchart of a method for normalizing staining of digital pathological images in this application.
  • the method for normalizing staining of digital pathological images includes the following steps:
  • I R is the two-dimensional matrix of the R channel in the RGB image
  • I G is the two-dimensional matrix of the G channel in the RGB image
  • I B is the two-dimensional matrix of the B channel in the RGB image
  • the pre-stored digital pathological slice image can be a digital pathological slice image that is scanned by a slice scanner to a computer for storage, or it can be a digital pathological slice image extracted from a pathological slice database. Read these digital pathology slice images used as training materials into the opencv image processing library, and generate color images in RGB space in the opencv image processing library, that is, RGB images;
  • FIG. 2 is a schematic diagram of RGB image generation according to an embodiment of a method for normalizing staining of digital pathological images of the present application. As shown in Figure 2, the pathological slice on the left is finally processed into a color image in the RGB space on the right.
  • step S120 Perform HSD transformation on the RGB image I(x, y) generated in step S110 according to the preset conversion rule, and convert the RGB image into HSD image; among them, the conversion process of HSD into RGB image into HSD image, RGB image , HSD images are all images in a specific format.
  • RGB images the mixed information of color (chromatic) and intensity (intensity) hinders the standardization of color recognition, and HSD images will not hinder the standardization of color recognition. Therefore, before image training, the RGB image needs to be converted to HSD. image;
  • the conversion rules are:
  • I R represents the two-dimensional matrix of the R channel in the RGB image
  • I G represents the two-dimensional matrix of the G channel in the RGB image
  • I S represents the two-dimensional matrix of the S channel
  • S130 Use the HSD image to continuously train the deep convolutional Gaussian mixture model to extract the Gaussian mixture model for solving different style images, until the optimal solution is obtained, and the optimal deep convolutional Gaussian mixture model is obtained;
  • GMM Gaussian Mixture Model, Gaussian Mixture Model
  • the essence of the image is a three-dimensional matrix.
  • the characteristics of some objects in the image can be expressed by a single Gaussian model or a mixture of multiple single Gaussian models.
  • GMM is to use Gaussian probability density function to accurately quantify something, and decompose a class of objects into a number of models based on Gaussian probability density function (normal distribution curve).
  • Gaussian probability density function normal distribution curve
  • the gray distribution value in each channel is generally multi-peak.
  • GMM Global System for Mobile Imaging
  • the extraction of ⁇ in the E-step is completed through a deep convolutional neural network, and the solution of the M-step is regarded as an optimization problem, and ⁇ k that minimizes the negative log-likelihood function is obtained , ⁇ k , ⁇ k , the negative log likelihood function is regarded as Loss, and the optimization problem is solved.
  • each pixel in the original image is a certain type of probability value p(x),
  • ⁇ k is the probability of taking the k-th Gaussian distribution
  • N represents the multivariate normal distribution with the mean ⁇ k and the covariance matrix ⁇ k.
  • p(x) is the category represented by a certain point in the current image. The formula indicates that for each point x in the image, it can be expressed as a superposition of k Gaussian mixture models, so it is necessary to use a neural network to replace the traditional E-step to calculate the parameter ⁇ , and then use the existing ⁇ to calculate ⁇ Evaluate with ⁇ to solve the category corresponding to each point in the image.
  • S133 Solve ⁇ k , ⁇ k , ⁇ k from the vector ⁇ and the HSD image in combination with the probability, and repeatedly train the deep convolutional Gaussian mixture model.
  • the deep convolutional Gaussian mixture model obtains gamma through the forward propagation of the network, and then solves ⁇ k , ⁇ k , ⁇ k through gamma, and then calculates the log-likelihood function, and then minimizes the log-likelihood Function, use gradient descent algorithm to update ⁇ k , ⁇ k , ⁇ k ; make the negative log likelihood function as small as possible, that is, keep updating until ⁇ k , ⁇ k , ⁇ k are obtained to maximize the log likelihood function;
  • gamma is the posterior probability
  • DCGMM deep convolutional Gaussian mixture model
  • S134 Training the deep convolutional Gaussian mixture model (DCGMM) repeatedly based on the deep learning convolutional neural network until the optimal deep convolutional Gaussian mixture model is obtained, so that DCGMM can predict which GMM each pixel of the current picture belongs to according to the current picture A single Gaussian model, and then transform the HSD of the pixel to be transformed into the HSD value of the template image to complete the style transfer, such as judging which category the current pixel belongs to (background, cell, tissue).
  • DCGMM deep convolutional Gaussian mixture model
  • the neural network layers used include: convolutional layer, batch processing layer, activation layer , Pooling layer and Upsampling layer;
  • the size of the image I is (H*W*C)
  • W represents the weight of the convolution kernel and filter in the convolution layer
  • b represents the bias term.
  • the convolutional layer is based on the kernel and extracts the key features of the image.
  • the Dilated Convolution (Conv dilated ) is also used in the middle layer of the network.
  • the convolutional layer adds a hyperparameter expansion rate (Dilation Rate), which represents the number of intervals between the convolution kernel filters.
  • Dilation Rate represents the number of intervals between the convolution kernel filters.
  • the principle of the batch layer is:
  • I is the input feature image
  • formula (5) standardizes I
  • E(I) and Var(I) are the mean and variance of I respectively
  • formula (6) performs scaling transformation
  • ⁇ and ⁇ are scaling factors and Offset. Batch processing can prevent gradient explosion and gradient disappearance, and speed up model convergence;
  • the principle of the activation layer is:
  • the formula (7) indicates that when I is a positive number, no transformation is performed, and when I is a negative number, it is output with a certain probability.
  • the ReLU layer makes the convolution operation nonlinear, so that the model can fit complex actual results;
  • the principle of the pooling layer is:
  • Maximum pooling down-samples the input tensor, selecting the maximum value of each region, and reducing the tensor dimension after maximum pooling;
  • the feature vector ⁇ that is consistent with the length and width of the input image is output.
  • the digital pathological image staining normalization method in this embodiment generates RGB image I (x, y) by analyzing the digital pathological slice image data, and performs HSD transformation on the RGB image I (x, y) according to the conversion rule, and then Convert RGB images to HSD images, use HSD images to continuously train the deep convolutional Gaussian mixture model to extract Gaussian mixture models for different styles of images, and then use the deep convolutional Gaussian mixture model to stain digital pathology images to be detected with HE staining. Unified, effectively improve the effectiveness and stability of dyeing normalization.
  • S140 Perform staining normalization on the HE stained digital pathology image of the histopathology to be detected through the optimal deep convolution Gaussian mixture model.
  • DCGMM After training the DCGMM, apply DCGMM to the image to be tested. DCGMM automatically calculates the category of each pixel in the image to be detected. According to the category of each pixel, perform H, The conversion of S and D completes the normalization of dyeing.
  • the conversion includes changing the average value, whitening and color transformation;
  • the color change transformation refers to scaling the whitening Gaussian distribution through singular value decomposition (SVD) to obtain the same covariance matrix as the template image.
  • FIG. 3 is a frame diagram of the digital pathological image staining normalization system involved in the digital pathological image staining normalization method according to an embodiment of the application.
  • the digital pathological image staining normalization system involved in this embodiment includes an RGB unit, an HSD unit, and a convolutional neural network unit;
  • the RGB unit is used to scan the pathological slice through the slice scanner into a computer for storage to generate a digital pathological slice image, so that the digital pathological slice image is read into the opencv image processing library, and the RGB space color image is generated in the opencv image processing library. That is, the RGB image I(x,y),
  • I R represents the two-dimensional matrix of the R channel in the RGB image
  • I G represents the two-dimensional matrix of the G channel in the RGB image
  • I S represents the two-dimensional matrix of the S channel
  • the HSD unit is used to perform HSD conversion on the RGB image I(x,y), and convert the RGB image into an HSD image.
  • the conversion rules are:
  • Convolutional neural network unit includes E-step module and M-step module, composed of many neural network layers;
  • the E-step module is used to complete the extraction of ⁇ in E-step through the deep convolutional neural network.
  • this planning operation generally includes: convolution, pooling, non-linear activation function, etc.;
  • the probability is:
  • the deep convolutional Gaussian mixture model obtains gamma through the forward propagation of the network, and then solves ⁇ k , ⁇ k , ⁇ k through gamma, and then calculates the log-likelihood function, and then minimizes the log-likelihood Function, use gradient descent algorithm to update ⁇ k , ⁇ k , ⁇ k ; make the negative log likelihood function as small as possible, that is, keep updating until ⁇ k , ⁇ k , ⁇ k are obtained to maximize the log likelihood function;
  • gamma is the posterior probability
  • the neural network layer includes a convolutional layer, a batch processing layer, an activation layer, a pooling layer and an Upsampling layer.
  • the principle of the convolutional layer is:
  • the size of the image I is (H*W*C)
  • W represents the weight of the convolution kernel and filter in the convolutional layer
  • b represents the bias term.
  • the convolutional layer is based on the kernel and extracts the key features of the image.
  • the Dilated Convolution (Conv dilated ) is also used in the middle layer of the network.
  • the convolutional layer adds a hyperparameter expansion rate (Dilation Rate), which represents the number of intervals between the convolution kernel filters.
  • Dilation Rate represents the number of intervals between the convolution kernel filters.
  • the principle of the batch layer is:
  • I is the input feature image
  • the formula Standardize I, E(I) and Var(I) are the mean and variance of I, respectively
  • the formula F BN-recale ⁇ *F BN-normalization + ⁇ for scaling transformation, ⁇ and ⁇ are the scaling factor and bias, respectively Shift. Batch processing can prevent gradient explosion and gradient disappearance, and speed up model convergence;
  • the principle of the activation layer is:
  • This formula indicates that when I is a positive number, no transformation is done, and when I is a negative number, it outputs with a certain probability.
  • the ReLU layer makes the convolution operation nonlinear, so that the model can fit complex actual results;
  • the principle of the pooling layer is:
  • Maximum pooling down-samples the input tensor, selecting the maximum value of each region, and reducing the tensor dimension after maximum pooling;
  • the feature vector ⁇ that is consistent with the length and width of the input image is output.
  • the staining normalization unit performs staining normalization on the HE stained digital pathology image of the histopathology to be detected through the optimal depth convolution Gaussian mixture model.
  • the electronic device 40 may be a terminal device with arithmetic functions such as a server, a tablet computer, a portable computer, a desktop computer, and the like.
  • the electronic device 40 includes a processor 41, a memory 42, a computer program 43, a network interface, and a communication bus.
  • the electronic device 40 may be a tablet computer, a desktop computer, or a smart phone, but is not limited thereto.
  • the memory 42 includes at least one type of readable storage medium.
  • the at least one type of readable storage medium may be a non-volatile storage medium such as flash memory, hard disk, multimedia card, card-type memory, and the like.
  • the readable storage medium may be an internal storage unit of the electronic device 40, such as a hard disk of the electronic device 40.
  • the readable storage medium may also be an external memory of the electronic device 40, such as a plug-in hard disk equipped on the electronic device 40, a smart media card (SMC), a secure digital ( Secure Digital, SD card, Flash Card, etc.
  • the readable storage medium of the memory 42 is generally used to store the computer program 43 installed in the electronic device 40 and the like.
  • the processor 41 may be a central processing unit (CPU), microprocessor or other data processing chip in some embodiments, and is used to run program codes or processing data stored in the memory 42, such as digital pathological image staining. Normalization procedures, etc.
  • CPU central processing unit
  • microprocessor or other data processing chip in some embodiments, and is used to run program codes or processing data stored in the memory 42, such as digital pathological image staining. Normalization procedures, etc.
  • the network interface may optionally include a standard wired interface and a wireless interface (such as a WI-FI interface), and is generally used to establish a communication connection between the electronic device 40 and other electronic devices.
  • a standard wired interface such as a WI-FI interface
  • WI-FI interface wireless interface
  • the communication bus is used to realize the connection and communication between these components.
  • FIG. 4 only shows the electronic device 40 with the components 41-43, but it should be understood that it is not required to implement all the illustrated components, and more or fewer components may be implemented instead.
  • the memory 42 as a computer storage medium may include an operating system and a digital pathological image staining normalization program; the processor 41 executes the digital pathological image staining normalization program stored in the memory 42 The following steps are implemented when a program is changed:
  • S110 Scan the pathological slice into a computer through a slice scanner for storage to generate a digital pathological slice image, perform data analysis on the digital pathological slice image, and generate an RGB image I(x,y),
  • I R is the two-dimensional matrix of the R channel in the RGB image
  • I G is the two-dimensional matrix of the G channel in the RGB image
  • I B is the two-dimensional matrix of the B channel in the RGB image
  • S130 Use the HSD image to continuously train the deep convolutional Gaussian mixture model to extract the Gaussian mixture model for solving different style images, until the optimal solution is obtained, and the optimal deep convolutional Gaussian mixture model is obtained;
  • S140 Perform staining normalization on the HE stained digital pathology image of the histopathology to be detected through the optimal deep convolution Gaussian mixture model.
  • the embodiments of the present application also provide a computer-readable storage medium.
  • the computer-readable storage medium may be non-volatile or volatile.
  • the computer-readable storage medium includes digital pathological image staining and normalization.
  • the normalization program based on the digital pathological image staining is executed by the processor to achieve the following operations:
  • S110 Scan the pathological slice into a computer through a slice scanner for storage to generate a digital pathological slice image, perform data analysis on the digital pathological slice image, and generate an RGB image I(x,y),
  • I R is the two-dimensional matrix of the R channel in the RGB image
  • I G is the two-dimensional matrix of the G channel in the RGB image
  • I B is the two-dimensional matrix of the B channel in the RGB image
  • S130 Use the HSD image to continuously train the deep convolutional Gaussian mixture model to extract the Gaussian mixture model for solving different style images, until the optimal solution is obtained, and the optimal deep convolutional Gaussian mixture model is obtained;
  • S140 Perform staining normalization on the HE stained digital pathology image of the histopathology to be detected through the optimal deep convolution Gaussian mixture model.
  • the conversion rule is:
  • H, S, D are different channels of the image in the HSD space.
  • the vector ⁇ of posterior probability is extracted through the convolutional neural network, where the vector ⁇ is a k-dimensional vector with a value between (0,1);
  • Convolutional neural networks include convolutional layers, batch processing layers, activation layers, pooling layers, and Upsampling layers.
  • the process of repeatedly training the deep convolutional Gaussian mixture model includes:
  • the deep convolutional Gaussian mixture model obtains gamma through the forward propagation of the network, and solves ⁇ k , ⁇ k , ⁇ k through gamma.
  • the log-likelihood function is used to continuously update until ⁇ k , ⁇ k , and ⁇ k are obtained to maximize the log-likelihood function.
  • the process of staining and normalizing the HE stained digital pathology image of the histopathology to be detected by the deep convolution Gaussian mixture model includes:
  • the deep convolutional Gaussian mixture model automatically calculates the category of each pixel in the image to be detected, and compares the original image and the image to be tested according to the category of each pixel. Perform the conversion of H, S, D for the areas of the same category in the same category to complete the normalization of dyeing;
  • the conversion of H, S, D includes at least average value, whitening and color conversion.
  • the specific implementation of the computer-readable storage medium of the present application is substantially the same as the specific implementation of the above-mentioned digital pathological image staining normalization method and electronic device, and will not be repeated here.

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Radiology & Medical Imaging (AREA)
  • Epidemiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Primary Health Care (AREA)
  • Public Health (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

A staining normalization method and system for a digital pathological image, an electronic device and a storage medium. The staining normalization method for a digital pathological image comprises: performing data parsing on a pre-stored digital pathological slice image to generate an RGB image I(x,y) (S110); performing HSD transform on the RGB image I(x,y) according to a preset conversion rule, and converting the RGB image into an HSD image (S120); using the HSD image to continuously train a deep convolutional Gaussian mixture model to extract and solve Gaussian mixture models for different styles of images until the optimal deep convolutional Gaussian mixture model is obtained (S130); and using the optimal deep convolutional Gaussian mixture model to perform staining normalization on a histopathology HE stained digital pathological image to be detected (S140). By using the staining normalization method for a digital pathological image, the effectiveness and stability of staining normalization may be effectively improved.

Description

数字病理图像染色归一化方法、系统、电子装置及存储介质Digital pathological image staining normalization method, system, electronic device and storage medium
本申请要求于2019年10月18日提交中国专利局、申请号为201910993386.3,发明名称为“数字病理图像染色归一化方法、电子装置及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application filed with the Chinese Patent Office on October 18, 2019, the application number is 201910993386.3, and the invention title is "Digital Pathology Image Staining Normalization Method, Electronic Device and Storage Medium", and its entire content Incorporated in this application by reference.
技术领域Technical field
本申请涉及数字医疗的图像处理领域,尤其涉及一种数字病理图像染色归一化方法、系统、电子装置及存储介质。This application relates to the field of image processing of digital medicine, and in particular to a method, system, electronic device, and storage medium for normalizing digital pathological image staining.
背景技术Background technique
由于肾脏疾病的种类繁多,病因及发病机制复杂,许多肾脏疾病的临床表现与肾脏的组织学改变并不完全一致,其治疗方案及病情的发展结果也差别极大。目前,肾脏病理检查结果已经成为肾脏疾病诊断的金指标。病理医生在进行肾穿活检时,需要凭肉眼以及经验观测得到一些重要的医学指标,现拟通过人工智能辅助病理医生进行阅片工作。然而,发明人发现,不同技术员在进行制片染色过程中无法保证每批量产出的玻片都呈同样的颜色分布,这会对数字医疗的AI+医疗的AI模型产生较大的干扰,造成预测结果的波动。Due to the wide variety of kidney diseases, the etiology and pathogenesis are complex, the clinical manifestations of many kidney diseases are not completely consistent with the histological changes of the kidney, and the treatment plans and the development results of the disease are also very different. At present, the results of renal pathological examinations have become a gold indicator for the diagnosis of kidney diseases. When pathologists perform renal biopsy, they need to obtain some important medical indicators based on visual and empirical observations. Now they plan to assist pathologists in reading pictures through artificial intelligence. However, the inventor found that different technicians cannot guarantee that the slides produced in each batch will have the same color distribution during the production and dyeing process. This will cause greater interference to the AI + medical AI model of digital medical care and cause predictions. Fluctuations in results.
目前,图像经过特征块切割后,切割生成的特征块大小不一。这时,就需要采取归一化的操作来统一特征块图像的尺寸,现有的染色归一化通常通过对抗生成网络,变分自动编码器等方法实现,这种方法虽然能够进行风格迁移,但皆容易产生不可控的噪声点,有可能改变图像结构,所以亟需一种更稳定更自然的染色归一化方法。At present, after the image is cut by the feature block, the size of the feature block generated by the cutting is different. At this time, it is necessary to adopt a normalization operation to unify the size of the feature block image. The existing normalization of coloring is usually achieved through methods such as confrontation generation networks and variational autoencoders. Although this method can perform style transfer, However, uncontrollable noise points are easily generated, which may change the image structure. Therefore, a more stable and natural dyeing normalization method is urgently needed.
发明内容Summary of the invention
本申请提供一种数字病理图像染色归一化方法、系统、电子装置及存储介质,其主要目的在于解决目前现有染色归一化通过对抗生成网络,变分自动编码器等方法实现,虽然能够进行风格迁移,但皆容易产生不可控的噪声点,有可能改变图像结构的问题,有效提高染色归一化图像的稳定性。This application provides a method, system, electronic device and storage medium for staining and normalization of digital pathological images. Its main purpose is to solve the problem that the current staining normalization can be achieved by confrontation generation networks, variational autoencoders and other methods. Perform style transfer, but it is easy to produce uncontrollable noise points, which may change the problem of image structure, and effectively improve the stability of the stained normalized image.
为实现上述目的,本申请提供一种数字病理图像染色归一化方法,应用于数字病理图像,包括如下步骤:In order to achieve the above objective, the present application provides a method for normalizing digital pathological image staining, which is applied to digital pathological images, and includes the following steps:
S110:对预存的数字病理切片图像进行数据解析,生成RGB图像I(x,y),S110: Perform data analysis on the pre-stored digital pathological slice image to generate an RGB image I(x,y),
Figure PCTCN2020112366-appb-000001
其中,I R为RGB图像中的R通道的二维矩阵,I G为RGB图像中G通道的二维矩阵,I B为RGB图像中B通道的二维矩阵;
Figure PCTCN2020112366-appb-000001
Among them, I R is the two-dimensional matrix of the R channel in the RGB image, I G is the two-dimensional matrix of the G channel in the RGB image, and I B is the two-dimensional matrix of the B channel in the RGB image;
S120:根据预设的转换规则对所述RGB图像I(x,y)进行HSD变换,将所述RGB图像转换为HSD图像;S120: Perform HSD conversion on the RGB image I(x, y) according to a preset conversion rule, and convert the RGB image into an HSD image;
S130:利用所述HSD图像持续训练深度卷积高斯混合模型来提取求解不同风格图像的高斯混合模型,直至得到最优的深度卷积高斯混合模型;S130: Use the HSD image to continuously train a deep convolutional Gaussian mixture model to extract Gaussian mixture models for solving images of different styles, until an optimal deep convolutional Gaussian mixture model is obtained;
S140:通过所述最优的深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化。S140: Perform staining normalization on the HE stained digital pathology image of the histopathology to be detected by the optimal deep convolution Gaussian mixture model.
为了实现上述目的,本申请提供一种数字病理图像染色归一化系统,包括:In order to achieve the above objectives, this application provides a digital pathological image staining normalization system, including:
RGB单元,对预存的数字病理切片图像进行数据解析,生成RGB图像I(x,y),The RGB unit performs data analysis on the pre-stored digital pathological slice image to generate an RGB image I(x,y),
Figure PCTCN2020112366-appb-000002
其中,I R为RGB图像中的R通道的二维矩阵,I G为RGB图像中G通道的二维矩阵,I B为RGB图像中B通道的二维矩阵;
Figure PCTCN2020112366-appb-000002
Among them, I R is the two-dimensional matrix of the R channel in the RGB image, I G is the two-dimensional matrix of the G channel in the RGB image, and I B is the two-dimensional matrix of the B channel in the RGB image;
HSD单元,根据预设的转换规则对所述RGB图像I(x,y)进行HSD变换,将所述RGB图像转换为HSD图像;The HSD unit performs HSD conversion on the RGB image I(x, y) according to a preset conversion rule, and converts the RGB image into an HSD image;
卷积神经网络单元,利用所述HSD图像持续训练深度卷积高斯混合模型来提取求解不同风格图像的高斯混合模型,直至得到最优的深度卷积高斯混合模型;The convolutional neural network unit uses the HSD image to continuously train a deep convolutional Gaussian mixture model to extract Gaussian mixture models for solving different styles of images, until an optimal deep convolutional Gaussian mixture model is obtained;
染色归一化单元,通过所述最优的深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化。The staining normalization unit performs staining normalization on the HE stained digital pathology image of the histopathology to be detected through the optimal depth convolution Gaussian mixture model.
为实现上述目的,本申请提供一种电子装置,该电子装置包括:存储器、处理器及存储在所述存储器中的数字病理图像的染色归一化程序,所述数字病理图像的染色归一化程序被所述处理器执行时实行如下步骤:In order to achieve the above objective, the present application provides an electronic device, which includes a memory, a processor, and a coloring normalization program for a digital pathological image stored in the memory, and the coloring of the digital pathological image is normalized When the program is executed by the processor, the following steps are performed:
S110:对预存的数字病理切片图像进行数据解析,生成RGB图像I(x,y),S110: Perform data analysis on the pre-stored digital pathological slice image to generate an RGB image I(x,y),
Figure PCTCN2020112366-appb-000003
其中,I R为RGB图像中的R通道的二维矩阵,I G为RGB图像中G通道的二维矩阵,I B为RGB图像中B通道的二维矩阵;
Figure PCTCN2020112366-appb-000003
Among them, I R is the two-dimensional matrix of the R channel in the RGB image, I G is the two-dimensional matrix of the G channel in the RGB image, and I B is the two-dimensional matrix of the B channel in the RGB image;
S120:根据预设的转换规则对所述RGB图像I(x,y)进行HSD变换,将所述RGB图像转换为HSD图像;S120: Perform HSD conversion on the RGB image I(x, y) according to a preset conversion rule, and convert the RGB image into an HSD image;
S130:利用所述HSD图像持续训练深度卷积高斯混合模型来提取求解不同风格图像的高斯混合模型,直至得到最优的深度卷积高斯混合模型;S130: Use the HSD image to continuously train a deep convolutional Gaussian mixture model to extract Gaussian mixture models for solving images of different styles, until an optimal deep convolutional Gaussian mixture model is obtained;
S140:通过所述最优的深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化。S140: Perform staining normalization on the HE stained digital pathology image of the histopathology to be detected by the optimal deep convolution Gaussian mixture model.
此外,为实现上述目的,本申请还提供一种计算机可读存储介质,其中,所述计算机可读存储介质中存储有数字病理图像的染色归一化程序,所述数字病理图像的染色归一化程序被处理器执行时,实现如下步骤:In addition, in order to achieve the above object, the present application also provides a computer-readable storage medium, wherein the computer-readable storage medium stores a staining normalization program for a digital pathological image, and the staining of the digital pathological image is unified. When the transformation program is executed by the processor, the following steps are implemented:
S110:对预存的数字病理切片图像进行数据解析,生成RGB图像I(x,y),S110: Perform data analysis on the pre-stored digital pathological slice image to generate an RGB image I(x,y),
Figure PCTCN2020112366-appb-000004
其中,I R为RGB图像中的R通道的二维矩阵,I G为RGB图像中G通道的二维矩阵,I B为RGB图像中B通道的二维矩阵;
Figure PCTCN2020112366-appb-000004
Among them, I R is the two-dimensional matrix of the R channel in the RGB image, I G is the two-dimensional matrix of the G channel in the RGB image, and I B is the two-dimensional matrix of the B channel in the RGB image;
S120:根据预设的转换规则对所述RGB图像I(x,y)进行HSD变换,将所述RGB图像转换为HSD图像;S120: Perform HSD conversion on the RGB image I(x, y) according to a preset conversion rule, and convert the RGB image into an HSD image;
S130:利用所述HSD图像持续训练深度卷积高斯混合模型来提取求解不同风格图像的高斯混合模型,直至得到最优的深度卷积高斯混合模型;S130: Use the HSD image to continuously train a deep convolutional Gaussian mixture model to extract Gaussian mixture models for solving images of different styles, until an optimal deep convolutional Gaussian mixture model is obtained;
S140:通过所述最优的深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化。S140: Perform staining normalization on the HE stained digital pathology image of the histopathology to be detected by the optimal deep convolution Gaussian mixture model.
本申请提出的数字病理图像染色归一化方法、系统、电子装置及计算机可读存储介质,通过将数字病理切片图像进行数据解析,生成RGB图像I(x,y),根据转换规则对所述RGB图像I(x,y)进行HSD变换,再将所述RGB图像转换为HSD图像,利用所述HSD图像持续训练深度卷积高斯混合模型来提取求解不同风格图像的高斯混合模型,进而通过深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化,有效提高染色归一化的有效性和稳定性。The digital pathological image staining normalization method, system, electronic device, and computer-readable storage medium proposed in this application generate RGB image I(x,y) by analyzing the digital pathological slice image, and then calculate the RGB image I(x,y) according to the conversion rules. The RGB image I(x,y) is subjected to HSD transformation, and then the RGB image is converted into an HSD image. The HSD image is used to continuously train the deep convolutional Gaussian mixture model to extract the Gaussian mixture model for solving different styles of images, and then through the depth The convolutional Gaussian mixture model performs staining normalization on HE stained digital pathological images of histopathology to be detected, which effectively improves the effectiveness and stability of staining normalization.
为了实现上述以及相关目的,本申请的一个或多个方面包括后面将详细说明的特征。下面的说明以及附图详细说明了本申请的某些示例性方面。然而,这些方面指示的仅仅是可使用本申请的原理的各种方式中的一些方式。此外,本申请旨在包括所有这些方面以及它们的等同物。In order to achieve the above and related objects, one or more aspects of the present application include features that will be described in detail later. The following description and drawings illustrate some exemplary aspects of the present application in detail. However, these aspects indicate only some of the various ways in which the principles of the present application can be used. Furthermore, this application is intended to include all these aspects and their equivalents.
附图说明Description of the drawings
图1为本申请数字病理图像染色归一化方法实施例的流程图;Fig. 1 is a flowchart of an embodiment of a method for normalizing digital pathological image staining according to the present application;
图2为本申请数字病理图像染色归一化方法实施例的RGB图像生成示意图;FIG. 2 is a schematic diagram of RGB image generation according to an embodiment of a method for normalizing staining of digital pathological images according to the present application;
图3为本申请实施例的系统框架图;Figure 3 is a system framework diagram of an embodiment of the application;
图4为根据本申请实施例的电子装置的结构示意图;Fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present application;
本申请目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization, functional characteristics, and advantages of the purpose of this application will be further described in conjunction with the embodiments and with reference to the accompanying drawings.
具体实施方式Detailed ways
应当理解,此处所描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。It should be understood that the specific embodiments described here are only used to explain the application, and not used to limit the application.
目前图像经过特征块切割后,切割生成的特征块大小不一。这时,就需要采取归一化的操作来统一特征块图像的尺寸,现有的染色归一化通常通过对抗生成网络,变分自动编码器等方法实现,这种方法虽然能够进行风格迁移,但皆容易产生不可控的噪声点,有可能改变图像结构,为了避免染色过程中无法保证每批量产出的玻片都呈同样的颜色分布,这会对AI模型产生较大的干扰,造成预测结果的波动的情况,所以本申请提供数字病理图像染色归一化方法,通过DCGMM结合深度卷积神经网络,增强模型的可解释性,提高训练的稳定性,以在保留结构信息的前提下进行风格迁移,为医学诊断的精确性提供了保证。At present, after the feature block is cut in the image, the size of the feature block generated by the cutting is different. At this time, it is necessary to adopt a normalization operation to unify the size of the feature block image. The existing normalization of coloring is usually achieved through methods such as confrontation generation networks and variational autoencoders. Although this method can perform style transfer, However, it is easy to produce uncontrollable noise points, which may change the image structure. In order to avoid the inability to ensure that the glass slides produced in each batch have the same color distribution during the dyeing process, this will cause greater interference to the AI model and cause predictions Due to the fluctuation of the results, this application provides a method for normalizing digital pathology image staining, which combines DCGMM with deep convolutional neural networks to enhance the interpretability of the model and improve the stability of training, so as to perform on the premise of preserving structural information Style transfer provides a guarantee for the accuracy of medical diagnosis.
图1为本申请数字病理图像染色归一化方法的流程图,在本实施例中,数字病理图像染色归一化方法包括如下步骤:Fig. 1 is a flowchart of a method for normalizing staining of digital pathological images in this application. In this embodiment, the method for normalizing staining of digital pathological images includes the following steps:
S110:对预存的数字病理切片图像进行数据解析,生成RGB图像I(x,y),S110: Perform data analysis on the pre-stored digital pathological slice image to generate an RGB image I(x, y),
Figure PCTCN2020112366-appb-000005
其中,I R为RGB图像中的R通道的二维矩阵,I G为RGB图像中G通道的二维矩阵,I B为RGB图像中B通道的二维矩阵;
Figure PCTCN2020112366-appb-000005
Among them, I R is the two-dimensional matrix of the R channel in the RGB image, I G is the two-dimensional matrix of the G channel in the RGB image, and I B is the two-dimensional matrix of the B channel in the RGB image;
预存的数字病理切片图像可以是通过切片扫描仪将病理切片扫描到计算机中进行存储生成数字病理切片图像,也可以是从病理切片数据库中提取的数字病理切片图像。将这些用作训练素材的数字病理切片图像读入opencv图像处理库,在opencv图像处理库中生成RGB空间的彩色图像,即RGB图像;The pre-stored digital pathological slice image can be a digital pathological slice image that is scanned by a slice scanner to a computer for storage, or it can be a digital pathological slice image extracted from a pathological slice database. Read these digital pathology slice images used as training materials into the opencv image processing library, and generate color images in RGB space in the opencv image processing library, that is, RGB images;
图2为本申请数字病理图像染色归一化方法实施例的RGB图像生成示意图。如图2所示,左边的病理切片最终被处理为右边的RGB空间的彩色图像。FIG. 2 is a schematic diagram of RGB image generation according to an embodiment of a method for normalizing staining of digital pathological images of the present application. As shown in Figure 2, the pathological slice on the left is finally processed into a color image in the RGB space on the right.
S120:根据预设的转换规则对步骤S110生成的RGB图像I(x,y)进行HSD变换,将RGB图像转换为HSD图像;其中,HSD变换为RGB图像转换为HSD图像的变换过程,RGB图像、HSD图像均为特定格式的图像。S120: Perform HSD transformation on the RGB image I(x, y) generated in step S110 according to the preset conversion rule, and convert the RGB image into HSD image; among them, the conversion process of HSD into RGB image into HSD image, RGB image , HSD images are all images in a specific format.
由于RGB图像中,颜色(chromatic)与强度(intensity)的混合信息妨碍了染色识别的标准化,而HSD图像不会妨碍染色识别标准化,因此,在进行图片训练前,需要先将RGB图像转换为HSD图像;In RGB images, the mixed information of color (chromatic) and intensity (intensity) hinders the standardization of color recognition, and HSD images will not hinder the standardization of color recognition. Therefore, before image training, the RGB image needs to be converted to HSD. image;
其中,转换的规则为:
Figure PCTCN2020112366-appb-000006
Among them, the conversion rules are:
Figure PCTCN2020112366-appb-000006
其中I R代表了RGB图像中的R通道的二维矩阵I G代表RGB图像中G通道的二维矩阵,I S代表S通道的二维矩阵; Where I R represents the two-dimensional matrix of the R channel in the RGB image, I G represents the two-dimensional matrix of the G channel in the RGB image, and I S represents the two-dimensional matrix of the S channel;
S130:利用HSD图像持续训练深度卷积高斯混合模型来提取求解不同风格图像的高斯混合模型,直至得出最优解,得到最优的深度卷积高斯混合模型;S130: Use the HSD image to continuously train the deep convolutional Gaussian mixture model to extract the Gaussian mixture model for solving different style images, until the optimal solution is obtained, and the optimal deep convolutional Gaussian mixture model is obtained;
GMM(Gaussian Mixture Model,高斯混合模型),GMM (Gaussian Mixture Model, Gaussian Mixture Model),
图像的本质是三维矩阵,图像中一些物体的特征(如细胞、组织、基液对应的颜色,形态特征等)都可以由一个单一的高斯模型或多个单一高斯模型混合进行表达。GMM就是用高斯概率密度函数精确地量化某一事物,将一类对象分解为若干的基于高斯概率密度函数(正态分布曲线)形成的模型。对于病理图像,每个通道中的灰度分布值一般为多峰,通过将直方图的多峰特性看作为多个高斯分布的叠加,能够进行图像分割,区分图像中每个点所属的类别,为求解不同风格染色图像的GMM(Gaussian mixturemodel高斯混合模型),需要求解π k,∑ k,μ k,传统方法中,通过EM(Expectation-Maximization algorithm,EM,最大期望值)算法对GMM的参数进行求解,首先,执行E-step进行后验概率γ的提取, 然后利用提取的γ执行M-step计算求得使对数似然函数最大的π k,∑ k,μ kThe essence of the image is a three-dimensional matrix. The characteristics of some objects in the image (such as the color of cells, tissues, base fluid, morphological characteristics, etc.) can be expressed by a single Gaussian model or a mixture of multiple single Gaussian models. GMM is to use Gaussian probability density function to accurately quantify something, and decompose a class of objects into a number of models based on Gaussian probability density function (normal distribution curve). For pathological images, the gray distribution value in each channel is generally multi-peak. By treating the multi-peak characteristics of the histogram as the superposition of multiple Gaussian distributions, image segmentation can be performed to distinguish the category to which each point in the image belongs. In order to solve the GMM (Gaussian mixture model Gaussian mixture model) of different styles of stained images, it is necessary to solve π k , ∑ k , μ k . In the traditional method, the parameters of GMM are processed by the EM (Expectation-Maximization algorithm, EM, maximum expected value) algorithm. To solve the problem, first, perform E-step to extract the posterior probability γ, and then use the extracted γ to perform M-step calculation to obtain π k , ∑ k , and μ k that maximize the log-likelihood function.
本申请中,通过深度卷积神经网络完成E-step中对γ的提取,并将M-step的求解看为一个最优化问题,求使得求得使负的对数似然函数最小的π k,∑ k,μ k,即将负的对数似然函数看为Loss,求解最优化问题。 In this application, the extraction of γ in the E-step is completed through a deep convolutional neural network, and the solution of the M-step is regarded as an optimization problem, and π k that minimizes the negative log-likelihood function is obtained , ∑ k , μ k , the negative log likelihood function is regarded as Loss, and the optimization problem is solved.
具体的,原图中的每个像素点为某一类的概率值p(x),Specifically, each pixel in the original image is a certain type of probability value p(x),
Figure PCTCN2020112366-appb-000007
Figure PCTCN2020112366-appb-000007
其中π k为取第k个高斯分布的概率,N代表具有均值μ k和协方差矩阵∑k的多元正态分布。在针对组织病理HE染色(hematoxylin-eosin staining,苏木精—伊红染色法,简称HE染色)数字病理图像的GMM求解中,p(x)为当前图像中的某一点所代表的类别,上述公式表示对于图像中的每个点x,都可以表示为由k个高斯混合模型的叠加,于是需要通过神经网络代替传统的E-step进行参数γ的计算,然后,通过现有的γ对μ和∑进行估算,求解图像中每个点对应的类别。 Where π k is the probability of taking the k-th Gaussian distribution, and N represents the multivariate normal distribution with the mean μ k and the covariance matrix Σk. In the GMM solution of digital pathology images for histopathological HE staining (hematoxylin-eosin staining, HE staining for short), p(x) is the category represented by a certain point in the current image. The formula indicates that for each point x in the image, it can be expressed as a superposition of k Gaussian mixture models, so it is necessary to use a neural network to replace the traditional E-step to calculate the parameter γ, and then use the existing γ to calculate μ Evaluate with ∑ to solve the category corresponding to each point in the image.
S131:通过卷积神经网络提取后验概率的向量γ,通过归化操作将HSD图像中每个像素点转化为一个值处于(0,1)之间的k维向量γ,该值代表了3种类别分别对应的概率值,即将输入图像按组织结构分离为不同的通道;在本申请中设置k=3,分别对应图像中的3种类别,分别对应细胞核,组织,背景3个区域;该规划操作一般包括:卷积、池化、非线性激活函数等;S131: Extract the posterior probability vector γ through the convolutional neural network, and convert each pixel in the HSD image into a k-dimensional vector γ with a value between (0, 1) through the normalization operation, and this value represents 3 The probability value corresponding to each category, that is, the input image is separated into different channels according to the tissue structure; in this application, k=3 is set to correspond to the three categories in the image, corresponding to the nucleus, tissue, and background respectively; this Planning operations generally include: convolution, pooling, nonlinear activation functions, etc.;
S132:获取图像的像素点X={x 1,x 2,…,x p},该图像中的像素点x 1,x 2,…,x p由第k个高斯分布生成的概率为: S132: Acquire the pixel point X of the image = {x 1 ,x 2 ,...,x p }, the probability that the pixel point x 1 , x 2 ,..., x p in the image is generated by the kth Gaussian distribution is:
Figure PCTCN2020112366-appb-000008
Figure PCTCN2020112366-appb-000008
其中,隐变量z代表了具有均值μ=[μ 1,..,μ k],协方差∑=σ 2I的颜色的分布; Among them, the hidden variable z represents the distribution of colors with mean μ=[μ 1 ,.., μ k ] and covariance ∑=σ 2 I;
S133:结合概率从向量γ以及HSD图像中求解π k,∑ k,μ k,反复训练深度卷积高斯混合模型。 S133: Solve π k , ∑ k , μ k from the vector γ and the HSD image in combination with the probability, and repeatedly train the deep convolutional Gaussian mixture model.
具体的,深度卷积高斯混合模型(DCGMM)通过网络的正向传播得到gamma,再通过gamma求解π k,∑ k,μ k,然后计算对数似然函数,接着通过最小化对数似然函数,利用梯度下降算法来更新π k,∑ k,μ k;使得负对数似然函数越小越好,即持续更新直至得到π k,∑ k,μ k使得对数似然函数最大;其中,gamma为后验概率; Specifically, the deep convolutional Gaussian mixture model (DCGMM) obtains gamma through the forward propagation of the network, and then solves π k , ∑ k , μ k through gamma, and then calculates the log-likelihood function, and then minimizes the log-likelihood Function, use gradient descent algorithm to update π k , ∑ k , μ k ; make the negative log likelihood function as small as possible, that is, keep updating until π k , ∑ k , μ k are obtained to maximize the log likelihood function; Among them, gamma is the posterior probability;
计算求得使公式(2)中的对数似然函数最小的π k,∑ k,μ k,其计算详情如下所示: Calculate and obtain π k , ∑ k , μ k that minimize the log-likelihood function in formula (2). The calculation details are as follows:
for i=1,2,…,k and j=1,2..,p:for i=1,2,...,k and j=1,2...,p:
μ i=E(x i|argmax(γ j)=i) μ i =E(x i |argmax(γ j )=i)
i=E[(x j-E[x j])(x j-E[x j]) T)|argmax(γ j)=i] i =E[(x j -E[x j ])(x j -E[x j ]) T )|argmax(γ j )=i]
Figure PCTCN2020112366-appb-000009
Figure PCTCN2020112366-appb-000009
z=N(μ,∈·∑)z=N(μ,∈·∑)
结合公式(1),其对数似然函数为:Combined with formula (1), its log likelihood function is:
Figure PCTCN2020112366-appb-000010
Figure PCTCN2020112366-appb-000010
S134:基于深度学习卷积神经网络反复训练深度卷积高斯混合模型(DCGMM),直至得到最优的深度卷积高斯混合模型,使DCGMM能够根据当前图片预测当前图片的各个像素点所属GMM中哪一个单一高斯模型,然后通过将待转变像素点的HSD转换为模板图像的HSD值,从而完成风格迁移,比如判断当前像素属于哪一类(背景、细胞、组织)。S134: Training the deep convolutional Gaussian mixture model (DCGMM) repeatedly based on the deep learning convolutional neural network until the optimal deep convolutional Gaussian mixture model is obtained, so that DCGMM can predict which GMM each pixel of the current picture belongs to according to the current picture A single Gaussian model, and then transform the HSD of the pixel to be transformed into the HSD value of the template image to complete the style transfer, such as judging which category the current pixel belongs to (background, cell, tissue).
在通过卷积神经网络提取后验概率的向量γ,基于深度学习卷积神经网络反复训练深度卷积高斯混合模型过程中,所运用的神经网络层包括:卷积层、批处理层、激活层、池化层及Upsampling层;In the process of extracting the posterior probability vector γ through the convolutional neural network, and repeatedly training the deep convolutional Gaussian mixture model based on the deep learning convolutional neural network, the neural network layers used include: convolutional layer, batch processing layer, activation layer , Pooling layer and Upsampling layer;
卷积层原理为:The principle of the convolutional layer is:
F con(I)=I·W+b     (4) F con (I)=I·W+b (4)
公式(4)中,图像I大小为(H*W*C),W表示卷积层中卷积核和滤波器的权重,b表示偏置项。卷积层基于kernel,提取图像关键特征。In formula (4), the size of the image I is (H*W*C), W represents the weight of the convolution kernel and filter in the convolution layer, and b represents the bias term. The convolutional layer is based on the kernel and extracts the key features of the image.
除正常卷积外,为增加神经网络的感受野(Reception Field),空洞卷积(Dilated Convolution,Conv dilated)也被使用在网络的中间层,该卷积层增加了一个超参数膨胀率(Dilation Rate),代表了卷积核filter的间隔数量,相比于最大池化层(MaxPoolingLayer),空洞卷积可以保留内部数据结构,避免使用池化层造成信息损失的前提下,增大了感受野; In addition to the normal convolution, in order to increase the Reception Field of the neural network, the Dilated Convolution (Conv dilated ) is also used in the middle layer of the network. The convolutional layer adds a hyperparameter expansion rate (Dilation Rate), which represents the number of intervals between the convolution kernel filters. Compared with the MaxPoolingLayer, the hole convolution can retain the internal data structure, avoiding the use of the pooling layer to cause information loss, and increasing the receptive field ;
批处理层原理为:The principle of the batch layer is:
Figure PCTCN2020112366-appb-000011
Figure PCTCN2020112366-appb-000011
F BN-recale=γ*F BN-normalization+β    (6) F BN-recale =γ*F BN-normalization +β (6)
I为输入的特征图像,公式(5)对I做标准化处理,E(I)和Var(I)分别为I的均值和方差;公式(6)进行缩放变换,γ和β分别为缩放因子和偏移量。批处理可以防止梯度爆炸和梯度消失,加快模型收敛速度;I is the input feature image, formula (5) standardizes I, E(I) and Var(I) are the mean and variance of I respectively; formula (6) performs scaling transformation, γ and β are scaling factors and Offset. Batch processing can prevent gradient explosion and gradient disappearance, and speed up model convergence;
激活层原理为:The principle of the activation layer is:
F ReLU=max(α×I,I)      (7) F ReLU =max(α×I,I) (7)
公式(7)表示,当I为正数时,不做任何变换,当I为负数时以一定概率输出。ReLU层使卷积操作获得非线性,使模型可以拟合复杂的实际结果;The formula (7) indicates that when I is a positive number, no transformation is performed, and when I is a negative number, it is output with a certain probability. The ReLU layer makes the convolution operation nonlinear, so that the model can fit complex actual results;
池化层原理为:The principle of the pooling layer is:
Figure PCTCN2020112366-appb-000012
Figure PCTCN2020112366-appb-000012
最大池化对输入张量进行降采样,选取每个区域的最大值,经过最大池化,减小张量维度;Maximum pooling down-samples the input tensor, selecting the maximum value of each region, and reducing the tensor dimension after maximum pooling;
UpSampling层原理为:The principle of UpSampling layer is:
通过采用双线性插值方法,即在原有图像像素的基础上在像素点之间采用合适的插值算法插入新的元素,来达到放大特征图尺寸的目的;By adopting the bilinear interpolation method, that is, on the basis of the original image pixels, using a suitable interpolation algorithm to insert new elements between the pixels, to achieve the purpose of enlarging the size of the feature map;
网络经过上采样及下采样步骤后,经过最后一层Softmax激活层,假设有一个数组V共含j个元素,Vi表示V中的第i个元素,其计算方法如下:After the network goes through the up-sampling and down-sampling steps, it passes through the last Softmax activation layer. Assuming that there is an array V containing j elements in total, Vi represents the i-th element in V. The calculation method is as follows:
Figure PCTCN2020112366-appb-000013
Figure PCTCN2020112366-appb-000013
经过激活函数激活后,即输出了与输入图像长宽保持一致的特征向量γ。After the activation function is activated, the feature vector γ that is consistent with the length and width of the input image is output.
本实施例中数字病理图像染色归一化方法,通过将数字病理切片图像进行数据解析,生成RGB图像I(x,y),根据转换规则对RGB图像I(x,y)进行HSD变换,再将RGB图像转换为HSD图像,利用HSD图像持续训练深度卷积高斯混合模型来提取求解不同风格图像的高斯混合模型,进而通过深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化,有效提高染色归一化的有效性和稳定性。The digital pathological image staining normalization method in this embodiment generates RGB image I (x, y) by analyzing the digital pathological slice image data, and performs HSD transformation on the RGB image I (x, y) according to the conversion rule, and then Convert RGB images to HSD images, use HSD images to continuously train the deep convolutional Gaussian mixture model to extract Gaussian mixture models for different styles of images, and then use the deep convolutional Gaussian mixture model to stain digital pathology images to be detected with HE staining. Unified, effectively improve the effectiveness and stability of dyeing normalization.
S140:通过最优的深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化。S140: Perform staining normalization on the HE stained digital pathology image of the histopathology to be detected through the optimal deep convolution Gaussian mixture model.
在训练好DCGMM之后,在待测试图像上应用DCGMM,DCGMM自动计算待检测图像中每个像素所属的类别,根据每个像素所属类别分别对原图和待测试图像中同一类别的区域进行H、S、D的转换,完成染色归一化。After training the DCGMM, apply DCGMM to the image to be tested. DCGMM automatically calculates the category of each pixel in the image to be detected. According to the category of each pixel, perform H, The conversion of S and D completes the normalization of dyeing.
其中,转换包括改变平均值,白化和色彩变换;色彩变化变换是指通过奇异值分解(SVD)对白化高斯分布进行缩放以获得与模板图像相同的协方差矩阵。Among them, the conversion includes changing the average value, whitening and color transformation; the color change transformation refers to scaling the whitening Gaussian distribution through singular value decomposition (SVD) to obtain the same covariance matrix as the template image.
图3为本申请实施例的数字病理图像染色归一化方法中涉及的数字病理图像染色归一化系统的框架图。如图3所示,本实施例涉及的数字病理图像染色归一化系统包括RGB单元、HSD单元、卷积神经网络单元;FIG. 3 is a frame diagram of the digital pathological image staining normalization system involved in the digital pathological image staining normalization method according to an embodiment of the application. As shown in FIG. 3, the digital pathological image staining normalization system involved in this embodiment includes an RGB unit, an HSD unit, and a convolutional neural network unit;
RGB单元用于将通过切片扫描仪将病理切片扫描到计算机中进行存储生成数字病理 切片图像,使将数字病理切片图像读入opencv图像处理库,在opencv图像处理库中生成RGB空间的彩色图像,即RGB图像I(x,y),
Figure PCTCN2020112366-appb-000014
The RGB unit is used to scan the pathological slice through the slice scanner into a computer for storage to generate a digital pathological slice image, so that the digital pathological slice image is read into the opencv image processing library, and the RGB space color image is generated in the opencv image processing library. That is, the RGB image I(x,y),
Figure PCTCN2020112366-appb-000014
其中I R代表了RGB图像中的R通道的二维矩阵I G代表RGB图像中G通道的二维矩阵,I S代表S通道的二维矩阵; Where I R represents the two-dimensional matrix of the R channel in the RGB image, I G represents the two-dimensional matrix of the G channel in the RGB image, and I S represents the two-dimensional matrix of the S channel;
HSD单元用于将RGB图像I(x,y)进行HSD变换,将RGB图像转换为HSD图像,其中,转换的规则为:
Figure PCTCN2020112366-appb-000015
The HSD unit is used to perform HSD conversion on the RGB image I(x,y), and convert the RGB image into an HSD image. The conversion rules are:
Figure PCTCN2020112366-appb-000015
卷积神经网络单元包括E-step模块和M-step模块,由众多神经网络层组成;Convolutional neural network unit includes E-step module and M-step module, composed of many neural network layers;
E-step模块用于通过深度卷积神经网络完成E-step中对γ的提取,具体的,通过卷积神经网络提取后验概率的向量γ,通过归化操作将HSD图像中每个像素点转化为一个值处于(0,1)之间的k维向量γ,该值代表了3种类别分别对应的概率值,即将输入图像按组织结构分离为不同的通道;在本申请中设置k=3,分别对应图像中的3种类别,分别对应细胞核,组织,背景3个区域;该规划操作一般包括:卷积、池化、非线性激活函数等;The E-step module is used to complete the extraction of γ in E-step through the deep convolutional neural network. Specifically, the vector γ of the posterior probability is extracted through the convolutional neural network, and each pixel in the HSD image is processed through the normalization operation. Converted into a k-dimensional vector γ with a value between (0,1), which represents the probability values corresponding to the three categories, that is, the input image is separated into different channels according to the organizational structure; in this application, k= 3. Corresponding to the three categories in the image, respectively corresponding to the three regions of cell nucleus, tissue, and background; this planning operation generally includes: convolution, pooling, non-linear activation function, etc.;
该M-step模块用于获取图像的像素点X={x 1,x 2,…,x p},该图像中的像素点x 1,x 2,…,x p由第k个高斯分布生成的概率为: The M-step module is used to obtain the pixel points X={x 1 ,x 2 ,...,x p } of the image, and the pixels x 1 ,x 2 ,...,x p in the image are generated by the kth Gaussian distribution The probability is:
Figure PCTCN2020112366-appb-000016
Figure PCTCN2020112366-appb-000016
其中,隐变量z代表了具有均值μ=[μ 1,..,μ k],协方差∑=σ 2I的颜色的分布; Among them, the hidden variable z represents the distribution of colors with mean μ=[μ 1 ,.., μ k ] and covariance ∑=σ 2 I;
结合概率从向量γ以及HSD图像中求解π k,∑ k,μ k,反复训练深度卷积高斯混合模型。 Combine probability to solve π k , ∑ k , μ k from the vector γ and HSD image, and train the deep convolutional Gaussian mixture model repeatedly.
具体的,深度卷积高斯混合模型(DCGMM)通过网络的正向传播得到gamma,再通过gamma求解π k,∑ k,μ k,然后计算对数似然函数,接着通过最小化对数似然函数,利用梯度下降算法来更新π k,∑ k,μ k;使得负对数似然函数越小越好,即持续更新直至得到π k,∑ k,μ k使得对数似然函数最大;其中,gamma为后验概率; Specifically, the deep convolutional Gaussian mixture model (DCGMM) obtains gamma through the forward propagation of the network, and then solves π k , ∑ k , μ k through gamma, and then calculates the log-likelihood function, and then minimizes the log-likelihood Function, use gradient descent algorithm to update π k , ∑ k , μ k ; make the negative log likelihood function as small as possible, that is, keep updating until π k , ∑ k , μ k are obtained to maximize the log likelihood function; Among them, gamma is the posterior probability;
计算求得使公式(2)中的对数似然函数最小的π k,∑ k,μ k,其计算详情如下所示: Calculate and obtain π k , ∑ k , μ k that minimize the log-likelihood function in formula (2). The calculation details are as follows:
for i=1,2,…,k and j=1,2..,p:for i=1,2,...,k and j=1,2...,p:
μ i=E(x i|argmax(γ j)=i) μ i =E(x i |argmax(γ j )=i)
i=E[(x j-E[x j])(x j-E[x j]) T)|argmax(γ j)=i] i =E[(x j -E[x j ])(x j -E[x j ]) T )|argmax(γ j )=i]
Figure PCTCN2020112366-appb-000017
Figure PCTCN2020112366-appb-000017
z=N(μ,∈·∑)z=N(μ,∈·∑)
结合公式(1),其对数似然函数为:Combined with formula (1), its log likelihood function is:
Figure PCTCN2020112366-appb-000018
Figure PCTCN2020112366-appb-000018
神经网络层包括卷积层、批处理层、激活层、池化层及Upsampling层,其中,卷积层原理为:The neural network layer includes a convolutional layer, a batch processing layer, an activation layer, a pooling layer and an Upsampling layer. The principle of the convolutional layer is:
F con(I)=I·W+b F con (I)=I·W+b
该公式中,图像I大小为(H*W*C),W表示卷积层中卷积核和滤波器的权重,b表示偏置项。卷积层基于kernel,提取图像关键特征。In this formula, the size of the image I is (H*W*C), W represents the weight of the convolution kernel and filter in the convolutional layer, and b represents the bias term. The convolutional layer is based on the kernel and extracts the key features of the image.
除正常卷积外,为增加神经网络的感受野(Reception Field),空洞卷积(Dilated Convolution,Conv dilated)也被使用在网络的中间层,该卷积层增加了一个超参数膨胀率(Dilation Rate),代表了卷积核filter的间隔数量,相比于最大池化层(MaxPoolingLayer),空洞卷积可以保留内部数据结构,避免使用池化层造成信息损失的前提下,增大了感受野; In addition to the normal convolution, in order to increase the Reception Field of the neural network, the Dilated Convolution (Conv dilated ) is also used in the middle layer of the network. The convolutional layer adds a hyperparameter expansion rate (Dilation Rate), which represents the number of intervals between the convolution kernel filters. Compared with the MaxPoolingLayer, the hole convolution can retain the internal data structure, avoiding the use of the pooling layer to cause information loss, and increasing the receptive field ;
批处理层原理为:The principle of the batch layer is:
Figure PCTCN2020112366-appb-000019
Figure PCTCN2020112366-appb-000019
F BN-recale=γ*F BN-normalizationF BN-recale =γ*F BN-normalization
I为输入的特征图像,公式
Figure PCTCN2020112366-appb-000020
对I做标准化处理,E(I)和Var(I)分别为I的均值和方差;公式F BN-recale=γ*F BN-normalization+β进行缩放变换,γ和β分别为缩放因子和偏移量。批处理可以防止梯度爆炸和梯度消失,加快模型收敛速度;
I is the input feature image, the formula
Figure PCTCN2020112366-appb-000020
Standardize I, E(I) and Var(I) are the mean and variance of I, respectively; the formula F BN-recale = γ*F BN-normalization + β for scaling transformation, γ and β are the scaling factor and bias, respectively Shift. Batch processing can prevent gradient explosion and gradient disappearance, and speed up model convergence;
激活层原理为:The principle of the activation layer is:
F ReLU=max(α×I,I) F ReLU =max(α×I,I)
该公式表示,当I为正数时,不做任何变换,当I为负数时以一定概率输出。ReLU层使卷积操作获得非线性,使模型可以拟合复杂的实际结果;This formula indicates that when I is a positive number, no transformation is done, and when I is a negative number, it outputs with a certain probability. The ReLU layer makes the convolution operation nonlinear, so that the model can fit complex actual results;
池化层原理为:The principle of the pooling layer is:
Figure PCTCN2020112366-appb-000021
Figure PCTCN2020112366-appb-000021
最大池化对输入张量进行降采样,选取每个区域的最大值,经过最大池化,减小张量维度;Maximum pooling down-samples the input tensor, selecting the maximum value of each region, and reducing the tensor dimension after maximum pooling;
UpSampling层原理为:The principle of UpSampling layer is:
通过采用双线性插值方法,即在原有图像像素的基础上在像素点之间采用合适的插值算法插入新的元素,来达到放大特征图尺寸的目的;By adopting the bilinear interpolation method, that is, on the basis of the original image pixels, using a suitable interpolation algorithm to insert new elements between the pixels, to achieve the purpose of enlarging the size of the feature map;
网络经过上采样及下采样步骤后,经过最后一层Softmax激活层,假设有一个数组V共含j个元素,Vi表示V中的第i个元素,其计算方法如下:After the network goes through the up-sampling and down-sampling steps, it passes through the last Softmax activation layer. Assuming that there is an array V containing j elements in total, Vi represents the i-th element in V. The calculation method is as follows:
Figure PCTCN2020112366-appb-000022
Figure PCTCN2020112366-appb-000022
经过激活函数激活后,即输出了与输入图像长宽保持一致的特征向量γ。After the activation function is activated, the feature vector γ that is consistent with the length and width of the input image is output.
染色归一化单元,通过所述最优的深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化。The staining normalization unit performs staining normalization on the HE stained digital pathology image of the histopathology to be detected through the optimal depth convolution Gaussian mixture model.
图4为根据本申请实施例的电子装置示意图,在本实施例中,电子装置40可以是服务器、平板计算机、便携计算机、桌上型计算机等具有运算功能的终端设备。4 is a schematic diagram of an electronic device according to an embodiment of the present application. In this embodiment, the electronic device 40 may be a terminal device with arithmetic functions such as a server, a tablet computer, a portable computer, a desktop computer, and the like.
该电子装置40包括:处理器41、存储器42、计算机程序43、网络接口及通信总线。The electronic device 40 includes a processor 41, a memory 42, a computer program 43, a network interface, and a communication bus.
电子装置40可以是平板电脑、台式电脑、智能手机,但不限于此。The electronic device 40 may be a tablet computer, a desktop computer, or a smart phone, but is not limited thereto.
存储器42包括至少一种类型的可读存储介质。至少一种类型的可读存储介质可为如闪存、硬盘、多媒体卡、卡型存储器等的非易失性存储介质。在一些实施例中,该可读存储介质可以是该电子装置40的内部存储单元,例如该电子装置40的硬盘。在另一些实施例中,该可读存储介质也可以是该电子装置40的外部存储器,例如该电子装置40上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。The memory 42 includes at least one type of readable storage medium. The at least one type of readable storage medium may be a non-volatile storage medium such as flash memory, hard disk, multimedia card, card-type memory, and the like. In some embodiments, the readable storage medium may be an internal storage unit of the electronic device 40, such as a hard disk of the electronic device 40. In other embodiments, the readable storage medium may also be an external memory of the electronic device 40, such as a plug-in hard disk equipped on the electronic device 40, a smart media card (SMC), a secure digital ( Secure Digital, SD card, Flash Card, etc.
在本实施例中,该存储器42的可读存储介质通常用于存储安装于该电子装置40的计算机程序43等。In this embodiment, the readable storage medium of the memory 42 is generally used to store the computer program 43 installed in the electronic device 40 and the like.
处理器41在一些实施例中可以是一中央处理器(Central Processing Unit,CPU),微处理器或其他数据处理芯片,用于运行存储器42中存储的程序代码或处理数据,例如数字病理图像染色归一化程序等。The processor 41 may be a central processing unit (CPU), microprocessor or other data processing chip in some embodiments, and is used to run program codes or processing data stored in the memory 42, such as digital pathological image staining. Normalization procedures, etc.
网络接口可选地可以包括标准的有线接口、无线接口(如WI-FI接口),通常用于在该电子装置40与其他电子设备之间建立通信连接。The network interface may optionally include a standard wired interface and a wireless interface (such as a WI-FI interface), and is generally used to establish a communication connection between the electronic device 40 and other electronic devices.
通信总线用于实现这些组件之间的连接通信。The communication bus is used to realize the connection and communication between these components.
图4仅示出了具有组件41-43的电子装置40,但是应理解的是,并不要求实施所有示出的组件,可以替代的实施更多或者更少的组件。FIG. 4 only shows the electronic device 40 with the components 41-43, but it should be understood that it is not required to implement all the illustrated components, and more or fewer components may be implemented instead.
在图4所示的装置实施例中,作为一种计算机存储介质的存储器42中可以包括操作系统、以及数字病理图像染色归一化程序;处理器41执行存储器42中存储的数字病理图 像染色归一化程序时实现如下步骤:In the device embodiment shown in FIG. 4, the memory 42 as a computer storage medium may include an operating system and a digital pathological image staining normalization program; the processor 41 executes the digital pathological image staining normalization program stored in the memory 42 The following steps are implemented when a program is changed:
S110:通过切片扫描仪将病理切片扫描到计算机中进行存储生成数字病理切片图像,对数字病理切片图像进行数据解析,生成RGB图像I(x,y),S110: Scan the pathological slice into a computer through a slice scanner for storage to generate a digital pathological slice image, perform data analysis on the digital pathological slice image, and generate an RGB image I(x,y),
Figure PCTCN2020112366-appb-000023
其中,I R为RGB图像中的R通道的二维矩阵,I G为RGB图像中G通道的二维矩阵,I B为RGB图像中B通道的二维矩阵;
Figure PCTCN2020112366-appb-000023
Among them, I R is the two-dimensional matrix of the R channel in the RGB image, I G is the two-dimensional matrix of the G channel in the RGB image, and I B is the two-dimensional matrix of the B channel in the RGB image;
S120:根据预设的转换规则对RGB图像I(x,y)进行HSD变换,将RGB图像转换为HSD图像;S120: Perform HSD conversion on the RGB image I(x, y) according to the preset conversion rule, and convert the RGB image into an HSD image;
S130:利用HSD图像持续训练深度卷积高斯混合模型来提取求解不同风格图像的高斯混合模型,直至得出最优解,得到最优的深度卷积高斯混合模型;S130: Use the HSD image to continuously train the deep convolutional Gaussian mixture model to extract the Gaussian mixture model for solving different style images, until the optimal solution is obtained, and the optimal deep convolutional Gaussian mixture model is obtained;
S140:通过最优的深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化。S140: Perform staining normalization on the HE stained digital pathology image of the histopathology to be detected through the optimal deep convolution Gaussian mixture model.
此外,本申请实施例还提供一种计算机可读存储介质,所述计算机可读存储介质可以是非易失性,也可以是易失性,该计算机可读存储介质中包括数字病理图像染色归一化程序,该基于数字病理图像染色归一化程序被处理器执行时实现如下操作:In addition, the embodiments of the present application also provide a computer-readable storage medium. The computer-readable storage medium may be non-volatile or volatile. The computer-readable storage medium includes digital pathological image staining and normalization. The normalization program based on the digital pathological image staining is executed by the processor to achieve the following operations:
S110:通过切片扫描仪将病理切片扫描到计算机中进行存储生成数字病理切片图像,对数字病理切片图像进行数据解析,生成RGB图像I(x,y),S110: Scan the pathological slice into a computer through a slice scanner for storage to generate a digital pathological slice image, perform data analysis on the digital pathological slice image, and generate an RGB image I(x,y),
Figure PCTCN2020112366-appb-000024
其中,I R为RGB图像中的R通道的二维矩阵,I G为RGB图像中G通道的二维矩阵,I B为RGB图像中B通道的二维矩阵;
Figure PCTCN2020112366-appb-000024
Among them, I R is the two-dimensional matrix of the R channel in the RGB image, I G is the two-dimensional matrix of the G channel in the RGB image, and I B is the two-dimensional matrix of the B channel in the RGB image;
S120:根据预设的转换规则对RGB图像I(x,y)进行HSD变换,将RGB图像转换为HSD图像;S120: Perform HSD conversion on the RGB image I(x, y) according to the preset conversion rule, and convert the RGB image into an HSD image;
S130:利用HSD图像持续训练深度卷积高斯混合模型来提取求解不同风格图像的高斯混合模型,直至得出最优解,得到最优的深度卷积高斯混合模型;S130: Use the HSD image to continuously train the deep convolutional Gaussian mixture model to extract the Gaussian mixture model for solving different style images, until the optimal solution is obtained, and the optimal deep convolutional Gaussian mixture model is obtained;
S140:通过最优的深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化。S140: Perform staining normalization on the HE stained digital pathology image of the histopathology to be detected through the optimal deep convolution Gaussian mixture model.
该转换规则为:The conversion rule is:
Figure PCTCN2020112366-appb-000025
H、S、D为图像在HSD空间下的不同通道。
Figure PCTCN2020112366-appb-000025
H, S, D are different channels of the image in the HSD space.
通过卷积神经网络提取后验概率的向量γ,其中,向量γ为一个值处于(0,1)之间的k维向量;The vector γ of posterior probability is extracted through the convolutional neural network, where the vector γ is a k-dimensional vector with a value between (0,1);
获取HSD图像的像素点X={x 1,x 2,…,x p},计算HSD图像中的像素点x 1,x 2,…,x p由第k个高斯分布生成的概率: Get the pixel point X={x 1 ,x 2 ,...,x p } of the HSD image, and calculate the probability that the pixel point x 1 ,x 2 ,...,x p in the HSD image is generated by the kth Gaussian distribution:
Figure PCTCN2020112366-appb-000026
z k为具有均值μ=[μ 1,..,μ k],协方差∑=σ 2I的颜色的分布;
Figure PCTCN2020112366-appb-000026
z k is the distribution of colors with mean μ=[μ 1 ,.., μ k ] and covariance ∑=σ 2 I;
结合概率从向量γ以及HSD图像中求解π k,∑ k,μ k,持续训练深度卷积高斯混合模型,以利用梯度下降算法不断更新π k,∑ k,μ k Combine probability to solve π k , ∑ k , μ k from the vector γ and HSD image, and continuously train the deep convolution Gaussian mixture model to continuously update π k , ∑ k , μ k using the gradient descent algorithm.
卷积神经网络包括卷积层、批处理层、激活层、池化层、及Upsampling层。Convolutional neural networks include convolutional layers, batch processing layers, activation layers, pooling layers, and Upsampling layers.
在结合概率从向量γ以及HSD图像中求解π k,∑ k,μ k,反复训练深度卷积高斯混合模型过程中,包括: In the process of solving π k , ∑ k , μ k from the vector γ and HSD image in combination with probability, the process of repeatedly training the deep convolutional Gaussian mixture model includes:
深度卷积高斯混合模型通过网络的正向传播得到gamma,通过gamma求解π k,∑ k,μ kThe deep convolutional Gaussian mixture model obtains gamma through the forward propagation of the network, and solves π k , ∑ k , μ k through gamma.
不断更新π k,∑ k,μ k过程中,利用对数似然函数持续更新直至得到π k,∑ k,μ k使得对数似然函数最大。 In the process of continuously updating π k , ∑ k , and μ k , the log-likelihood function is used to continuously update until π k , ∑ k , and μ k are obtained to maximize the log-likelihood function.
通过深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化的过程包括:The process of staining and normalizing the HE stained digital pathology image of the histopathology to be detected by the deep convolution Gaussian mixture model includes:
在待测试图片上应用训练完成的深度卷积高斯混合模型,深度卷积高斯混合模型自动计算待检测图像中每个像素所属的类别,并根据每个像素所属类别分别对原图和待测试图像中同一类别的区域进行H、S、D的转换,完成染色归一化;Apply the trained deep convolutional Gaussian mixture model on the image to be tested. The deep convolutional Gaussian mixture model automatically calculates the category of each pixel in the image to be detected, and compares the original image and the image to be tested according to the category of each pixel. Perform the conversion of H, S, D for the areas of the same category in the same category to complete the normalization of dyeing;
H、S、D的转换至少包括平均值、白化和色彩变换。The conversion of H, S, D includes at least average value, whitening and color conversion.
本申请之计算机可读存储介质的具体实施方式与上述数字病理图像染色归一化方法、电子装置的具体实施方式大致相同,在此不再赘述。The specific implementation of the computer-readable storage medium of the present application is substantially the same as the specific implementation of the above-mentioned digital pathological image staining normalization method and electronic device, and will not be repeated here.
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、装置、物品或者方法不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、装置、物品或者方法所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、装置、物品或者方法中还存在另外的相同要素。It should be noted that in this article, the terms "include", "include" or any other variants thereof are intended to cover non-exclusive inclusion, so that a process, device, article or method including a series of elements not only includes those elements, It also includes other elements not explicitly listed, or elements inherent to the process, device, article, or method. If there are no more restrictions, the element defined by the sentence "including a..." does not exclude the existence of other identical elements in the process, device, article, or method that includes the element.
上述本申请实施例序号仅仅为了描述,不代表实施例的优劣。通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在如上所述的一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是计算机,服务器,或者网络设备等)执行本申请各个实施例所述的方法。The serial numbers of the foregoing embodiments of the present application are only for description, and do not represent the superiority or inferiority of the embodiments. Through the description of the above embodiments, those skilled in the art can clearly understand that the method of the above embodiments can be implemented by means of software plus the necessary general hardware platform. Of course, it can also be implemented by hardware, but in many cases the former is better.的实施方式。 Based on this understanding, the technical solution of this application essentially or the part that contributes to the existing technology can be embodied in the form of a software product, and the computer software product is stored in a storage medium (such as ROM/RAM) as described above. , Magnetic disk, optical disk), including a number of instructions to make a terminal device (which may be a computer, a server, or a network device, etc.) execute the method described in each embodiment of the present application.
以上仅为本申请的优选实施例,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利保护范围内。The above are only the preferred embodiments of the application, and do not limit the scope of the patent for this application. Any equivalent structure or equivalent process transformation made using the content of the description and drawings of the application, or directly or indirectly applied to other related technical fields , The same reason is included in the scope of patent protection of this application.

Claims (20)

  1. 一种数字病理图像的染色归一化方法,应用于数字病理图像的处理,包括如下步骤:A staining normalization method for digital pathological images, applied to the processing of digital pathological images, includes the following steps:
    S110:对预存的数字病理切片图像进行数据解析,生成RGB图像I(x,y),
    Figure PCTCN2020112366-appb-100001
    其中,I R为RGB图像中的R通道的二维矩阵,I G为RGB图像中G通道的二维矩阵,I B为RGB图像中B通道的二维矩阵;
    S110: Perform data analysis on the pre-stored digital pathological slice image to generate an RGB image I(x,y),
    Figure PCTCN2020112366-appb-100001
    Among them, I R is the two-dimensional matrix of the R channel in the RGB image, I G is the two-dimensional matrix of the G channel in the RGB image, and I B is the two-dimensional matrix of the B channel in the RGB image;
    S120:根据预设的转换规则对所述RGB图像I(x,y)进行HSD变换,将所述RGB图像转换为HSD图像;S120: Perform HSD conversion on the RGB image I(x, y) according to a preset conversion rule, and convert the RGB image into an HSD image;
    S130:利用所述HSD图像持续训练深度卷积高斯混合模型来提取求解不同风格图像的高斯混合模型,直至得到最优的深度卷积高斯混合模型;S130: Use the HSD image to continuously train a deep convolutional Gaussian mixture model to extract Gaussian mixture models for solving images of different styles until an optimal deep convolutional Gaussian mixture model is obtained;
    S140:通过所述最优的深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化。S140: Perform staining normalization on the HE stained digital pathology image of the histopathology to be detected by the optimal deep convolution Gaussian mixture model.
  2. 根据权利要求1所述的数字病理图像的染色归一化方法,其中,所述转换规则为:The staining normalization method for digital pathological images according to claim 1, wherein the conversion rule is:
    Figure PCTCN2020112366-appb-100002
    其中,所述H、S、D为图像在HSD空间下的不同通道。
    Figure PCTCN2020112366-appb-100002
    Wherein, the H, S, D are different channels of the image in the HSD space.
  3. 根据权利要求1所述的数字病理图像的染色归一化方法,其中,利用所述HSD图像训练深度卷积高斯混合模型来提取求解不同风格图像的高斯混合模型的过程包括:The staining normalization method of digital pathological images according to claim 1, wherein the process of using the HSD image to train a deep convolutional Gaussian mixture model to extract and solve Gaussian mixture models for images of different styles comprises:
    通过卷积神经网络提取后验概率的向量γ;其中,所述向量γ为一个值处于(0,1)之间的k维向量;Extracting a vector γ of posterior probability through a convolutional neural network; wherein the vector γ is a k-dimensional vector with a value between (0, 1);
    获取所述HSD图像的像素点X={x 1,x 2,…,x p},计算所述HSD图像中的像素点x 1,x 2,…,x p由第k个高斯分布生成的概率: Obtain the pixel points X={x 1 , x 2 ,..., x p } of the HSD image, and calculate the pixels x 1 , x 2 ,..., x p in the HSD image generated by the k-th Gaussian distribution Probability:
    Figure PCTCN2020112366-appb-100003
    所述z k为具有均值μ=[μ 1,..,μ k],协方差∑=σ 2I的颜色的分布;
    Figure PCTCN2020112366-appb-100003
    Said z k is the distribution of colors with mean μ=[μ 1 ,.., μ k ] and covariance ∑=σ 2 I;
    结合所述概率从向量γ以及HSD图像中求解π k,∑ k,μ k,持续训练深度卷积高斯混合模型,以利用梯度下降算法不断更新所述π k,∑ k,μ k Solve π k , Σ k , μ k from the vector γ and the HSD image in combination with the probability, and continuously train the deep convolution Gaussian mixture model to continuously update the π k , Σ k , μ k using a gradient descent algorithm.
  4. 根据权利要求3所述的数字病理图像的染色归一化方法,其中,所述卷积神经网络包括卷积层、批处理层、激活层、池化层及Upsampling层。The staining normalization method for digital pathological images according to claim 3, wherein the convolutional neural network includes a convolutional layer, a batch processing layer, an activation layer, a pooling layer, and an Upsampling layer.
  5. 根据权利要求3所述的数字病理图像的染色归一化方法,其中,在结合所述概率从向量γ以及HSD图像中求解π k,∑ k,μ k,反复训练深度卷积高斯混合模型过程中,包括: The staining normalization method of digital pathological images according to claim 3, wherein the process of repetitive training of deep convolutional Gaussian mixture model is to solve π k , Σ k , μ k from the vector γ and the HSD image in combination with the probability Include:
    所述深度卷积高斯混合模型通过网络的正向传播得到gamma,通过gamma求解π k,∑ k,μ kThe deep convolutional Gaussian mixture model obtains gamma through the forward propagation of the network, and solves π k , ∑ k , and μ k through gamma.
  6. 根据权利要求3所述的数字病理图像的染色归一化方法,其中,不断更新所述π k,∑ k,μ k过程中,利用对数似然函数持续更新直至得到π k,∑ k,μ k使得所述对数似然函数最大,所述对数似然函数为: The staining normalization method of digital pathological images according to claim 3, wherein in the process of continuously updating the π k , Σ k , μ k , the log-likelihood function is used to continuously update until π k , Σ k , μ k maximizes the log likelihood function, and the log likelihood function is:
    Figure PCTCN2020112366-appb-100004
    Figure PCTCN2020112366-appb-100004
  7. 根据权利要求1所述的数字病理图像的染色归一化方法,其中,通过最优的深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化的过程包括:The method for staining normalization of digital pathology images according to claim 1, wherein the process of staining and normalizing the HE stained digital pathology image for histopathology to be detected through the optimal depth convolution Gaussian mixture model includes:
    在待检测组织病理HE染色数字病理图像上应用所述最优的深度卷积高斯混合模型,所述最优的深度卷积高斯混合模型自动计算待检测图像中每个像素所属的类别,并根据每个像素所属类别分别对原图和待测试图像中同一类别的区域进行H、S、D的转换,完成染色归一化。The optimal deep convolutional Gaussian mixture model is applied to the HE-stained digital pathological image of histopathology to be detected, and the optimal deep convolutional Gaussian mixture model automatically calculates the category to which each pixel in the image to be detected belongs, and according to Each pixel belongs to the same category of the original image and the image to be tested by the conversion of H, S, D to complete the normalization of dyeing.
  8. 根据权利要求7所述的数字病理图像的染色归一化方法,其中,所述H、S、D的转换至少包括平均值、白化和色彩变换。The staining normalization method of digital pathological images according to claim 7, wherein the conversion of H, S, D includes at least average value, whitening and color conversion.
  9. 一种数字病理图像的染色归一化系统,包括:A staining normalization system for digital pathological images, including:
    RGB单元,对预存的数字病理切片图像进行数据解析,生成RGB图像I(x,y),
    Figure PCTCN2020112366-appb-100005
    其中,I R为RGB图像中的R通道的二维矩阵,I G为RGB图像中G通道的二维矩阵,I B为RGB图像中B通道的二维矩阵;
    The RGB unit performs data analysis on the pre-stored digital pathological slice image to generate an RGB image I(x,y),
    Figure PCTCN2020112366-appb-100005
    Among them, I R is the two-dimensional matrix of the R channel in the RGB image, I G is the two-dimensional matrix of the G channel in the RGB image, and I B is the two-dimensional matrix of the B channel in the RGB image;
    HSD单元,根据预设的转换规则对所述RGB图像I(x,y)进行HSD变换,将所述RGB图像转换为HSD图像;The HSD unit performs HSD conversion on the RGB image I(x, y) according to a preset conversion rule, and converts the RGB image into an HSD image;
    卷积神经网络单元,利用所述HSD图像持续训练深度卷积高斯混合模型来提取求解不同风格图像的高斯混合模型,直至得到最优的深度卷积高斯混合模型;The convolutional neural network unit uses the HSD image to continuously train a deep convolutional Gaussian mixture model to extract Gaussian mixture models for solving different styles of images, until an optimal deep convolutional Gaussian mixture model is obtained;
    染色归一化单元,通过所述最优的深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化。The staining normalization unit performs staining normalization on the HE stained digital pathology image of the histopathology to be detected through the optimal depth convolution Gaussian mixture model.
  10. 一种电子装置,该电子装置包括:存储器、处理器及存储在所述存储器中的数字病理图像的染色归一化程序,所述数字病理图像的染色归一化程序被所述处理器执行时实行如下步骤:An electronic device comprising: a memory, a processor, and a staining normalization program for a digital pathological image stored in the memory; when the staining normalization program for the digital pathological image is executed by the processor Perform the following steps:
    S110:对预存的数字病理切片图像进行数据解析,生成RGB图像I(x,y),
    Figure PCTCN2020112366-appb-100006
    其中,I R为RGB图像中的R通道的二维矩阵,I G为RGB图像中G通道的二维矩阵,I B为RGB图像中B通道的二维矩阵;
    S110: Perform data analysis on the pre-stored digital pathological slice image to generate an RGB image I(x,y),
    Figure PCTCN2020112366-appb-100006
    Among them, I R is the two-dimensional matrix of the R channel in the RGB image, I G is the two-dimensional matrix of the G channel in the RGB image, and I B is the two-dimensional matrix of the B channel in the RGB image;
    S120:根据预设的转换规则对所述RGB图像I(x,y)进行HSD变换,将所述RGB图像转换为HSD图像;S120: Perform HSD conversion on the RGB image I(x, y) according to a preset conversion rule, and convert the RGB image into an HSD image;
    S130:利用所述HSD图像持续训练深度卷积高斯混合模型来提取求解不同风格图像的高斯混合模型,直至得到最优的深度卷积高斯混合模型;S130: Use the HSD image to continuously train a deep convolutional Gaussian mixture model to extract Gaussian mixture models for solving images of different styles until an optimal deep convolutional Gaussian mixture model is obtained;
    S140:通过所述最优的深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化。S140: Perform staining normalization on the HE stained digital pathology image of the histopathology to be detected by the optimal deep convolution Gaussian mixture model.
  11. 根据权利要求10所述的电子装置,其中,所述转换规则为:The electronic device according to claim 10, wherein the conversion rule is:
    Figure PCTCN2020112366-appb-100007
    其中,所述H、S、D为图像在HSD空间下的不同通道。
    Figure PCTCN2020112366-appb-100007
    Wherein, the H, S, D are different channels of the image in the HSD space.
  12. 根据权利要求10所述的电子装置,其中,利用所述HSD图像训练深度卷积高斯混合模型来提取求解不同风格图像的高斯混合模型的过程包括:The electronic device according to claim 10, wherein the process of using the HSD image to train a deep convolutional Gaussian mixture model to extract and solve the Gaussian mixture model for images of different styles comprises:
    通过卷积神经网络提取后验概率的向量γ;其中,所述向量γ为一个值处于(0,1)之间的k维向量;Extracting a vector γ of posterior probability through a convolutional neural network; wherein the vector γ is a k-dimensional vector with a value between (0, 1);
    获取所述HSD图像的像素点X={x 1,x 2,…,x p},计算所述HSD图像中的像素点x 1,x 2,…,x p由第k个高斯分布生成的概率: Obtain the pixel points X={x 1 , x 2 ,..., x p } of the HSD image, and calculate the pixels x 1 , x 2 ,..., x p in the HSD image generated by the kth Gaussian distribution Probability:
    Figure PCTCN2020112366-appb-100008
    所述z k为具有均值μ=[μ 1,..,μ k],协方差∑=σ 2I的颜色的分布;
    Figure PCTCN2020112366-appb-100008
    Said z k is the distribution of colors with mean μ=[μ 1 ,.., μ k ] and covariance ∑=σ 2 I;
    结合所述概率从向量γ以及HSD图像中求解π k,∑ k,μ k,持续训练深度卷积高斯混合模型,以利用梯度下降算法不断更新所述π k,∑ k,μ k Solve π k , Σ k , μ k from the vector γ and the HSD image in combination with the probability, and continuously train the deep convolution Gaussian mixture model to continuously update the π k , Σ k , μ k using a gradient descent algorithm.
  13. 根据权利要求12所述的电子装置,其中,在结合所述概率从向量γ以及HSD图像中求解π k,∑ k,μ k,反复训练深度卷积高斯混合模型过程中,包括: The electronic device according to claim 12, wherein the process of solving π k , Σ k , μ k from the vector γ and the HSD image in combination with the probability, and repeatedly training the deep convolutional Gaussian mixture model comprises:
    所述深度卷积高斯混合模型通过网络的正向传播得到gamma,通过gamma求解π k,∑ k,μ kThe deep convolutional Gaussian mixture model obtains gamma through the forward propagation of the network, and solves π k , ∑ k , and μ k through gamma.
  14. 根据权利要求12所述的电子装置,其中,不断更新所述π k,∑ k,μ k过程中,利用对数似然函数持续更新直至得到π k,∑ k,μ k使得所述对数似然函数最大,所述对数似然函数为: The electronic device according to claim 12, wherein in the process of continuously updating the π k , Σ k , and μ k , the log likelihood function is used to continuously update until π k , Σ k , and μ k are obtained such that the logarithm The likelihood function is the largest, and the log likelihood function is:
    Figure PCTCN2020112366-appb-100009
    Figure PCTCN2020112366-appb-100009
  15. 根据权利要求10所述的电子装置,其中,通过最优的深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化的过程包括:The electronic device according to claim 10, wherein the process of staining and normalizing the HE stained digital pathology image of the histopathology to be detected by the optimal deep convolution Gaussian mixture model comprises:
    在待检测组织病理HE染色数字病理图像上应用所述最优的深度卷积高斯混合模型,所述最优的深度卷积高斯混合模型自动计算待检测图像中每个像素所属的类别,并根据每个像素所属类别分别对原图和待测试图像中同一类别的区域进行H、S、D的转换,完成染色归一化。The optimal deep convolutional Gaussian mixture model is applied to the HE-stained digital pathological image of histopathology to be detected, and the optimal deep convolutional Gaussian mixture model automatically calculates the category to which each pixel in the image to be detected belongs, and according to Each pixel belongs to the same category of the original image and the image to be tested by the conversion of H, S, D to complete the normalization of dyeing.
  16. 一种计算机可读存储介质,所述计算机可读存储介质中存储有数字病理图像的染色归一化程序,所述数字病理图像的染色归一化程序被处理器执行时,实现如下步骤:A computer-readable storage medium in which a staining normalization program of a digital pathological image is stored, and when the staining normalization program of a digital pathological image is executed by a processor, the following steps are implemented:
    S110:对预存的数字病理切片图像进行数据解析,生成RGB图像I(x,y),
    Figure PCTCN2020112366-appb-100010
    其中,I R为RGB图像中的R通道的二维矩阵,I G为RGB图像中G通道的二维矩阵,I B为RGB图像中B通道的二维矩阵;
    S110: Perform data analysis on the pre-stored digital pathological slice image to generate an RGB image I(x,y),
    Figure PCTCN2020112366-appb-100010
    Among them, I R is the two-dimensional matrix of the R channel in the RGB image, I G is the two-dimensional matrix of the G channel in the RGB image, and I B is the two-dimensional matrix of the B channel in the RGB image;
    S120:根据预设的转换规则对所述RGB图像I(x,y)进行HSD变换,将所述RGB图像转换为HSD图像;S120: Perform HSD conversion on the RGB image I(x, y) according to a preset conversion rule, and convert the RGB image into an HSD image;
    S130:利用所述HSD图像持续训练深度卷积高斯混合模型来提取求解不同风格图像的高斯混合模型,直至得到最优的深度卷积高斯混合模型;S130: Use the HSD image to continuously train a deep convolutional Gaussian mixture model to extract Gaussian mixture models for solving images of different styles until an optimal deep convolutional Gaussian mixture model is obtained;
    S140:通过所述最优的深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化。S140: Perform staining normalization on the HE stained digital pathology image of the histopathology to be detected by the optimal deep convolution Gaussian mixture model.
  17. 根据权利要求16所述的计算机可读存储介质,其中,所述转换规则为:The computer-readable storage medium according to claim 16, wherein the conversion rule is:
    Figure PCTCN2020112366-appb-100011
    其中,所述H、S、D为图像在HSD空间下的不同通道。
    Figure PCTCN2020112366-appb-100011
    Wherein, the H, S, D are different channels of the image in the HSD space.
  18. 根据权利要求16所述的计算机可读存储介质,其中,利用所述HSD图像训练深度卷积高斯混合模型来提取求解不同风格图像的高斯混合模型的过程包括:The computer-readable storage medium according to claim 16, wherein the process of using the HSD image to train a deep convolutional Gaussian mixture model to extract and solve the Gaussian mixture model for images of different styles comprises:
    通过卷积神经网络提取后验概率的向量γ;其中,所述向量γ为一个值处于(0,1)之间的k维向量;Extracting a vector γ of posterior probability through a convolutional neural network; wherein the vector γ is a k-dimensional vector with a value between (0, 1);
    获取所述HSD图像的像素点X={x 1,x 2,…,x p},计算所述HSD图像中的像素点x 1,x 2,…,x p由第k个高斯分布生成的概率: Obtain the pixel points X={x 1 , x 2 ,..., x p } of the HSD image, and calculate the pixels x 1 , x 2 ,..., x p in the HSD image generated by the k-th Gaussian distribution Probability:
    Figure PCTCN2020112366-appb-100012
    所述z k为具有均值μ=[μ 1,..,μ k],协方差∑=σ 2I的颜色的分布;
    Figure PCTCN2020112366-appb-100012
    Said z k is the distribution of colors with mean μ=[μ 1 ,.., μ k ] and covariance ∑=σ 2 I;
    结合所述概率从向量γ以及HSD图像中求解π k,∑ k,μ k,持续训练深度卷积高斯混合模型,以利用梯度下降算法不断更新所述π k,∑ k,μ k Solve π k , Σ k , μ k from the vector γ and the HSD image in combination with the probability, and continuously train the deep convolution Gaussian mixture model to continuously update the π k , Σ k , μ k using a gradient descent algorithm.
  19. 根据权利要求18所述的计算机可读存储介质,其中,在结合所述概率从向量γ以及HSD图像中求解π k,∑ k,μ k,反复训练深度卷积高斯混合模型过程中,包括: The computer-readable storage medium according to claim 18, wherein the process of solving π k , Σ k , μ k from the vector γ and the HSD image in combination with the probability, and repeatedly training the deep convolutional Gaussian mixture model comprises:
    所述深度卷积高斯混合模型通过网络的正向传播得到gamma,通过gamma求解π k,∑ k,μ kThe deep convolutional Gaussian mixture model obtains gamma through the forward propagation of the network, and solves π k , ∑ k , and μ k through gamma.
  20. 根据权利要求16所述的计算机可读存储介质,其中,通过最优的深度卷积高斯混合模型对待检测组织病理HE染色数字病理图像进行染色归一化的过程包括:16. The computer-readable storage medium according to claim 16, wherein the process of staining and normalizing the HE stained digital pathology image of the histopathology to be detected by the optimal deep convolution Gaussian mixture model comprises:
    在待检测组织病理HE染色数字病理图像上应用所述最优的深度卷积高斯混合模型,所述最优的深度卷积高斯混合模型自动计算待检测图像中每个像素所属的类别,并根据每 个像素所属类别分别对原图和待测试图像中同一类别的区域进行H、S、D的转换,完成染色归一化。The optimal deep convolutional Gaussian mixture model is applied to the HE stained digital pathology image of histopathology to be detected, and the optimal deep convolutional Gaussian mixture model automatically calculates the category to which each pixel in the image to be detected belongs, and according to Each pixel belongs to the same category of the original image and the image to be tested by the conversion of H, S, D to complete the normalization of dyeing.
PCT/CN2020/112366 2019-10-18 2020-08-31 Staining normalization method and system for digital pathological image, electronic device and storage medium WO2021073279A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910993386.3 2019-10-18
CN201910993386.3A CN111028923B (en) 2019-10-18 2019-10-18 Digital pathological image staining normalization method, electronic device and storage medium

Publications (1)

Publication Number Publication Date
WO2021073279A1 true WO2021073279A1 (en) 2021-04-22

Family

ID=70201150

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/112366 WO2021073279A1 (en) 2019-10-18 2020-08-31 Staining normalization method and system for digital pathological image, electronic device and storage medium

Country Status (2)

Country Link
CN (1) CN111028923B (en)
WO (1) WO2021073279A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114627010A (en) * 2022-03-04 2022-06-14 透彻影像(北京)科技有限公司 Dyeing space migration method based on dyeing density map
CN115410050A (en) * 2022-11-02 2022-11-29 杭州华得森生物技术有限公司 Tumor cell detection equipment based on machine vision and method thereof

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111028923B (en) * 2019-10-18 2024-01-30 平安科技(深圳)有限公司 Digital pathological image staining normalization method, electronic device and storage medium
CN113393454A (en) * 2021-07-02 2021-09-14 北京邮电大学 Method and device for segmenting pathological target examples in biopsy tissues
CN117058014B (en) * 2023-07-14 2024-03-29 北京透彻未来科技有限公司 LAB color space matching-based dyeing normalization system and method

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170177971A1 (en) * 2015-12-17 2017-06-22 Research & Business Foundation Sungkyunkwan University Method of detecting color object by using noise and system for detecting light emitting apparatus by using noise
CN108596237A (en) * 2018-04-19 2018-09-28 北京邮电大学 A kind of endoscopic polyp of colon sorting technique of LCI laser based on color and blood vessel
CN109544507A (en) * 2018-10-18 2019-03-29 清影医疗科技(深圳)有限公司 A kind of pathological image processing method and system, equipment, storage medium
CN109919959A (en) * 2019-01-24 2019-06-21 天津大学 Tone mapping image quality evaluating method based on color, naturality and structure
CN111028923A (en) * 2019-10-18 2020-04-17 平安科技(深圳)有限公司 Digital pathological image dyeing normalization method, electronic device and storage medium

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2001290568A1 (en) * 2000-08-25 2002-03-04 Amnis Corporation Measuring the velocity of small moving objects such as cells
EP4300424A3 (en) * 2017-08-04 2024-03-06 Ventana Medical Systems, Inc. Automatic assay assessment and normalization for image processing
CN108172278B (en) * 2018-01-24 2021-09-10 北京航空航天大学 HE staining pathological image color normalization method
CN108876735B (en) * 2018-06-01 2021-11-02 武汉大学 Real image blind denoising method based on depth residual error network
CN110322396B (en) * 2019-06-19 2022-12-23 怀光智能科技(武汉)有限公司 Pathological section color normalization method and system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170177971A1 (en) * 2015-12-17 2017-06-22 Research & Business Foundation Sungkyunkwan University Method of detecting color object by using noise and system for detecting light emitting apparatus by using noise
CN108596237A (en) * 2018-04-19 2018-09-28 北京邮电大学 A kind of endoscopic polyp of colon sorting technique of LCI laser based on color and blood vessel
CN109544507A (en) * 2018-10-18 2019-03-29 清影医疗科技(深圳)有限公司 A kind of pathological image processing method and system, equipment, storage medium
CN109919959A (en) * 2019-01-24 2019-06-21 天津大学 Tone mapping image quality evaluating method based on color, naturality and structure
CN111028923A (en) * 2019-10-18 2020-04-17 平安科技(深圳)有限公司 Digital pathological image dyeing normalization method, electronic device and storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114627010A (en) * 2022-03-04 2022-06-14 透彻影像(北京)科技有限公司 Dyeing space migration method based on dyeing density map
CN115410050A (en) * 2022-11-02 2022-11-29 杭州华得森生物技术有限公司 Tumor cell detection equipment based on machine vision and method thereof

Also Published As

Publication number Publication date
CN111028923A (en) 2020-04-17
CN111028923B (en) 2024-01-30

Similar Documents

Publication Publication Date Title
WO2021073279A1 (en) Staining normalization method and system for digital pathological image, electronic device and storage medium
US10489909B2 (en) Method of automatically detecting microaneurysm based on multi-sieving convolutional neural network
Janowczyk et al. Deep learning for digital pathology image analysis: A comprehensive tutorial with selected use cases
CN111462042B (en) Cancer prognosis analysis method and system
US11562491B2 (en) Automatic pancreas CT segmentation method based on a saliency-aware densely connected dilated convolutional neural network
CN110472616A (en) Image-recognizing method, device, computer equipment and storage medium
CN102651128B (en) Image set partitioning method based on sampling
CN112257704A (en) Cervical fluid-based cell digital image classification method based on deep learning detection model
CN109544507A (en) A kind of pathological image processing method and system, equipment, storage medium
CN110533583B (en) Self-adaptive image augmentation system based on cervical fluid-based cells
CN110322436B (en) Medical image processing method, device, storage medium and equipment
WO2021136368A1 (en) Method and apparatus for automatically detecting pectoralis major region in molybdenum target image
US20230051951A1 (en) Method for training image processing model
CN113160185A (en) Method for guiding cervical cell segmentation by using generated boundary position
CN113781488A (en) Tongue picture image segmentation method, apparatus and medium
Khattar et al. Computer assisted diagnosis of skin cancer: a survey and future recommendations
CN113538422A (en) Pathological image automatic classification method based on dyeing intensity matrix
CN112132841A (en) Medical image cutting method and device
Yuan et al. Explore double-opponency and skin color for saliency detection
CN116721289A (en) Cervical OCT image classification method and system based on self-supervision cluster contrast learning
Dogra et al. Novel seed selection techniques for MR brain image segmentation using graph cut
CN116189785A (en) Spatial domain identification method based on spatial transcriptomics data feature extraction
CN114170224B (en) System and method for cellular pathology classification using generative staining normalization
US11455753B1 (en) Systems and methods to process electronic images to adjust attributes of the electronic images
CN115601360A (en) Chromosome structure abnormality auxiliary identification method and system and computer equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20876443

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20876443

Country of ref document: EP

Kind code of ref document: A1