WO2015124772A1 - Group sparsity model for image unmixing - Google Patents

Group sparsity model for image unmixing Download PDF

Info

Publication number
WO2015124772A1
WO2015124772A1 PCT/EP2015/053745 EP2015053745W WO2015124772A1 WO 2015124772 A1 WO2015124772 A1 WO 2015124772A1 EP 2015053745 W EP2015053745 W EP 2015053745W WO 2015124772 A1 WO2015124772 A1 WO 2015124772A1
Authority
WO
WIPO (PCT)
Prior art keywords
group
tissue
image
markers
unmixing
Prior art date
Application number
PCT/EP2015/053745
Other languages
French (fr)
Inventor
Srinivas Chukka
Ting Chen
Original Assignee
Ventana Medical Systems, Inc.
F. Hoffmann-La Roche Ag
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ventana Medical Systems, Inc., F. Hoffmann-La Roche Ag filed Critical Ventana Medical Systems, Inc.
Priority to EP19170902.1A priority Critical patent/EP3550514B1/en
Priority to EP15708459.1A priority patent/EP3108448B1/en
Publication of WO2015124772A1 publication Critical patent/WO2015124772A1/en
Priority to US15/243,899 priority patent/US10540762B2/en
Priority to US16/590,942 priority patent/US11113809B2/en
Priority to US17/391,398 priority patent/US11893731B2/en
Priority to US18/389,714 priority patent/US20240135532A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • G06T7/0012Biomedical image inspection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/90Determination of colour characteristics
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/69Microscopic objects, e.g. biological cells or cellular parts
    • G06V20/698Matching; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10024Color image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10056Microscopic image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30004Biomedical image processing
    • G06T2207/30024Cell structures in vitro; Tissue sections in vitro
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30204Marker

Definitions

  • the present subject disclosure relates to digital pathology. More particularly, the present subject disclosure relates to color unmixing methods and systems for a multiplex IHC image that can accommodate any number of stain colors.
  • Multiplex immunohistochemistry (IHC) staining is an emerging technique for the detection of multiple biomarkers within a single tissue section and has become more popular due to its significant efficiencies and the rich diagnostic information it has.
  • a multiplex IHC slide has the potential advantage of simultaneously identifying multiple biomarkers in one tissue section as opposed to single biomarker labeling in multiple slides. Therefore, it is often used for the simultaneous assessment of multiple hallmarks of cancerous tissue.
  • a cancerous tissue slide is stained by the multiplex assay to identify biomarkers.
  • tumors in human often contain infiltrates (e.g., T-cells or B-cells) of immune cells, which may prevent the development of tumors or favor the outgrowth of tumors.
  • multiple stains are used to target different type of immune cells and the population distribution of each type of the immune cells are used to study the clinical outcome of the patients.
  • the stained slide is then imaged, for example, using a CCD color camera mounted on a microscope or a scanner.
  • the cells are stained, for example, with chromogenic dyes, fluorescent markers and/or quantum dots, and then imaged.
  • the image is unmixed to obtain the constituent dyes and/or the proportions of each dye in the color mixture, as a prerequisite step for multiplex image analysis, for example, multiplex IHC image analysis.
  • multiplex image analysis for example, multiplex IHC image analysis.
  • each pixel of the color mixture ysR 3 is a linear combination of the pure stain colors and solves a linear system to obtain the combination weights bsR M .
  • This technique is most widely used in the current digital pathology domain, however, the maximum number of stains that can be solved is limited to three as the linear system is deficient for not enough equations (X being a 3*M matrix).
  • the color unmixing problem may be formulated into a non-negative matrix factorization and color decomposition performed in a fully automated manner, wherein no reference stain color selection is required.
  • a color space may be divided into several systems with up to three colors by solving a convex framework, with a linear system being used to solve each individual system. Due to the
  • Sparse models for high dimensional multi-spectral image unmixing adopt the L 0 norm to regularize the combination weights b of the reference colors hence leading to a solution that only a small number of reference colors are contributed to the stain color mixture, but these are also designed for multi-spectral images and do not use any prior biological information about the biomarkers, which may lead to undesired solutions for real data.
  • these methods cannot be applied to RGB images due to the image acquisition system, i.e. multi-spectral imaging instead of a CCD color camera to capture the image using a set of spectral narrow-band filters.
  • the number of filters K can be as many as dozens or hundreds, leading to a multichannel image that provides much richer information than the brightfield RGB image.
  • the linear system constructed from it is always an over-determined system with X being a KxM(K» M) matrix that leads to a unique solution, however, the scanning process in the multi-spectral imaging system is very time consuming and only a single field of view manually selected by a technician can be scanned instead of the whole slide, thereby limiting the usage of such methods.
  • the present invention provides in particular for a tissue analysis system and method, a system for unmixing a tissue image and a digital storage medium that stores digitally encoded instructions executable by a processor of a tissue analysis system for configuring the tissue analysis system for execution of a method of the invention.
  • the subject disclosure solves the above-identified problems in the current state of the art by providing systems and methods for unmixing multiplex IHC images having a number of stain contributions greater than a number of color channels, such as an RGB brightfield image.
  • Operations disclosed herein include obtaining reference colors from the training images, modeling a RGB image unmixing problem using a group sparsity framework, in which the fractions of stain contributions from colocation markers are modeled within a same group and fractions of stain contributions from non-colocation markers are modeled in different groups, providing co-localization information of the markers to the group sparsity model, solving this group sparsity model using an algorithm such as a Group Lasso, yielding a least square solution within each group which corresponds to the unmixing of the colocation markers, and yielding a sparse solution among the groups that correspond to the unmixing of non-colocation markers.
  • Reduction of the model to sparse unmixing without colocalization constraint is enabled by setting only one member in each group, and generating spars
  • the subject disclosure provides a method for unmixing an image, the method comprising generating a group sparsity model wherein a fraction of a stain contribution from colocation markers is assigned within a single group and a fraction of a stain contribution from non-colocation markers is assigned within separate groups, and solving the group sparsity model using group lasso algorithm to yield a least squares solution within each group and sparse solution among groups.
  • the method may be a computer-implemented method.
  • the subject disclosure provides a system for unmixing an image, comprising a processor, and a memory coupled to the
  • the memory to store digitally encoded instructions that, when executed by the processor, cause the processor to perform operations including generating a group sparsity framework using known co-location information of a plurality of biomarkers within an image of a tissue section; wherein a fraction of each stain contribution is assigned to a different group based on the known co- location information, and solving the group sparsity model using group lasso algorithm to yield a least squares solution for each group.
  • the subject disclosure provides a digital storage medium to store digitally encoded instructions executable by a processor to perform operations including modeling an RGB image unmixing problem using a group sparsity framework, in which fractions of stain contributions from a plurality of colocation markers are modeled within a same group and fractions of stain contributions from a plurality of non-colocation markers are modeled in different groups, providing co-localization information of the plurality of colocation markers to the modeled group sparsity framework, solving the modeled framework using a group lasso to yield a least squares solution within each group, wherein the least squares solution corresponds to the unmixing of the colocation markers, and yielding a sparse solution among the groups that corresponds to the unmixing of the non-colocation markers.
  • FIG. 1 depicts a system for image unmixing using a group sparsity model
  • FIG. 2 depicts a framework of an unmixing algorithm, according to an exemplary embodiment of the subject disclosure.
  • FIGS. 3A-3C depict a simulated example of an image to be unmixed
  • FIG. 4 depicts an example of a clinical data set containing several different
  • cancer tissue samples that were used to test the disclosed operations, according to exemplary embodiments of the subject disclosure.
  • FIG. 5 depicts the unmixing examples of decomposing a multiplexed image into single stain channels using the prior art versus the group sparsity methods disclosed herein, in accordance with an exemplary embodiment of the subject disclosure.
  • FIG. 6 depicts an example of orange only cells, according to an exemplary
  • FIG. 7 depicts example images of nucleus co-localization cases, according to exemplary embodiments of the subject disclosure.
  • FIG. 8 depicts less background noise using the proposed sparse constrained method versus prior art methods for RGB images with a group size set to 1 , according to an exemplary embodiment of the subject disclosure.
  • Fig. 9 schematically shows an embodiment of a tissue analysis system.
  • FIG. 10 schematically shows a flow diagram of an embodiment of a tissue
  • the present disclosure relates, inter alia, to an analysis system, e.g. to a tissue analysis system.
  • the system may be suitable for analyzing biological specimen, for example, tissue provided on a slide.
  • image data encompasses raw image data
  • the image data may comprise a pixel matrix.
  • the analysis system may comprise a color data storage module that stores color data, e.g. color data indicative of a color of the stains. Such color data is also referred to as "reference data".
  • the color data may be descriptive of a single frequency or a characteristic spectral profile of the stain.
  • the color data storage module may store color data for each of a plurality of stains.
  • the plurality of stains may comprise at least 4, at least 10, at least 20 or at least 100 stains.
  • biomarker may be understood in the sense of a tissue feature (e.g. (a presence of) a particular cell type, for instance immune cells), in particular a tissue feature indicative of a medical condition.
  • the biomarker may be identifiable by the presence of a particular molecule, for instance
  • tissue feature instance a protein, in the tissue feature.
  • the term "marker” may be understood in the sense of a stain, dye or a tag (that allows a biomarker to be differentiated from ambient tissue and/or from other biomarkers).
  • the tag may be stained or dyed.
  • the tag may be an antibody, e.g. an antibody having an affinity to a protein of a particular biomarker.
  • a marker may have an affinity to a particular biomarker, e.g. to a particular molecule / protein / cell structure / cell (indicative of a particular biomarker).
  • the biomarker to which a marker has an affinity may be specific / unique for the respective marker.
  • a marker may mark tissue, i.e. a biomarker in the tissue, with a color.
  • the color of tissue marked by a respective marker may be specific / unique for the respective marker.
  • the present disclosure designates such relationships between a color and a marker as a "marker having a color” or as a "color of a marker.”
  • Any of the plurality of markers may have an affinity to at least one tissue feature selected from the group consisting of a tumor cell cytokeratin, a regulatory T-cell nucleus, a universal nucleus, a B-cell membrane, a universal T- cell membrane, and a cytotoxic T-cell membrane.
  • the analysis system may comprise a co-location data storage module that defines a plurality of groups of the markers (whose color data is stored by the color data storage module).
  • Each group may consist of markers having an affinity to a respective common tissue feature.
  • each marker of a respective group may have an affinity to a common tissue feature.
  • Each of the plurality of groups may consist of at least one and no more than three markers.
  • the plurality of groups may comprise each of the plurality of markers. In other words, the plurality of groups may be defined such that each individual marker of the plurality of markers belongs to at least one group of the plurality of groups.
  • the analysis system may comprise a tissue image data storage module that stores a plurality of pixels representative of a tissue image.
  • the tissue image may be an RGB, a CYMK image or other multi-channel color image (of a tissue sample).
  • the multi-channel color image may comprise from 1 to 10, e.g. from 3 to 5, (color) channels.
  • each pixel may comprise color information for any of a plurality of color channels, e.g. for each of a red, green and blue channel of an RGB image.
  • the pixels may represent the tissue image (at a resolution) such that (in at least 50%, at least 75% or at least 90% of all cases) at least one pixel is required to represent any individual biomarker of the biomarkers identifiable by the plurality of markers or the background.
  • the pixels may represent the tissue image (at a resolution) such that (in at least 50%, at least 75% or at least 90% of all cases) any individual biomarker of the biomarkers identifiable by the plurality of markers or the background occupies at least one pixel.
  • the pixels may represent the tissue image such that, by virtue of the image resolution and a biological co-location (in tissue) of the molecules / proteins / cell structures / cells to which the individual markers of the plurality of markers have an affinity, at most three markers are visible per pixel (for at least 50%, at least 75% or at least 90% of the pixels).
  • the expression "at most three markers” is not limited to the sense that at most three individual antibody tags (or other individual markers) are visible per pixel. Instead, the expression “at most three markers” may be understood in the sense that, of the plurality of markers, at most three of the plurality are visible per pixel.
  • the colors constituting an individual pixel may be limited to three colors selected from the group of tissue colors obtained by marking tissue with the plurality of markers.
  • the number of markers in the respective group may define an upper limit for the number of (different kinds of) stains, i.e. markers, per pixel.
  • the framework is not limited to the constraint of "at most three markers".
  • stain may be understood in the broad sense of any type of marker such as an antibody, dye or stain suitable to mark, i.e. "stain,” biomarkers in a tissue.
  • the analysis system may comprise a tissue image analysis module (for unmixing the tissue image).
  • the tissue image analysis module may (be configured to) calculate a linear combination of the colors of the markers of a respective group that yields a minimum difference between (the color information of) a respective pixel and the linear combination of colors.
  • the tissue image analysis module may calculate such a linear combination for any individual group of the groups and for any individual pixel of the plurality of pixels.
  • the tissue image analysis module may unmix (e.g. as understood in the art of imaging spectroscopy) a pixel into the colors of the markers of a respective group, e.g.
  • the aforementioned “minimum” difference need not be understood as a mathematical minimum or as an absolute minimum. Instead, the “minimum” difference may be "minimum” as determinable by data available to the tissue image analysis module.
  • the difference between a linear combination of the colors of the markers of a respective group and the respective pixel may be measured in a color space of the respective pixel, e.g. as a polynomial that comprises, for each (color) channel of the pixel, the difference between the respective linear combination and the value of the pixel in the respective channel as a variable.
  • the difference may be measured as a sum of the squares obtained by squaring, for each (color) channel of the pixel, the difference between the respective linear combination and the value of the pixel in the respective channel.
  • the tissue image analysis module may calculate a linear combination of the colors of the markers of a respective group that yields a minimum difference between a respective pixel and the linear combination by means of a least square algorithm.
  • the tissue image analysis module may affect the aforementioned calculating using any of the color data stored in the color data storage module, any of the co-location data stored in the co-location data storage module and/or any of the plurality of pixels stored in the tissue image data storage module. Accordingly, the tissue image analysis module may be configured to read any of the color data from the color data storage module, to read any of the co-location data from the co-location data storage module and/or to read any of the pixels from the tissue image data storage module.
  • the tissue image analysis module may determine a group for which the
  • tissue image analysis module may determine which group of markers has colors that can be combined to most closely match a respective pixel and may output (data indicative of) the (determined) group as an analysis result.
  • the (tissue) analysis system may comprise a (tissue) staining module that stains a tissue sample with any of the plurality of markers.
  • stain is not limited to an application of a stain (in a limited sense of the word) to a tissue, but may instead likewise comprise exposing the tissue to any type of markers such as antibodies or dyes to mark biomarkers in the tissue.
  • the tissue image data storage module may receive the plurality of pixels directly or indirectly from a source that need not be an element of the (tissue) analysis system.
  • the (tissue) analysis system may comprise a (tissue) imaging module, e.g. a (tissue) imaging module that images a tissue sample to obtain the plurality of pixels representative of a tissue image.
  • the tissue sample may be a tissue sample stained with any of the plurality of markers, e.g. by the (tissue) staining module.
  • the imaging module may utilize nonvisible electromagnetic radiation (UV light, for example), magnetic resonance, ultrasound or other imaging techniques to capture the tissue image.
  • the (tissue) imaging module may comprise a microscope and a (CCD) camera arranged to capture a plurality of (raw) pixels representative of an image of a tissue sample magnified by the microscope.
  • the plurality of pixels stored by the tissue image data storage module may be identical to and/or derived from raw pixels captured by the (tissue) imaging module.
  • image sensors or methods for capturing an image such as a digital image, may be utilized.
  • the (tissue) imaging module may comprise a bright-field illumination
  • module that effects bright-field illumination of the tissue sample and may effect capture of the plurality of pixels representative of an image of the tissue sample during bright-field illumination of the tissue sample.
  • the (tissue) imaging module may comprise a
  • the (tissue) imaging module may effect imaging, i.e. capture of the plurality of pixels representative of an image of the tissue sample, by means of a CCD camera selected from the group consisting of an RGB CCD camera and a CCD camera having at most five color channels.
  • the CCD camera may capture pixels in each of a red, green and blue channel or in each of a red, green, blue and UV channel.
  • the CCD camera may comprise a beam splitter for splitting incident light into the various (color) channels for capture.
  • Systems and/or methods described herein relate to unmixing more than three stains (for example, chromogenic dyes, fluorescent stains, or quantum dots), while preserving the biological constraints of the biomarkers.
  • Unlimited numbers of markers may be unmixed from a limited-channel image, such as an RGB image, without adding any mathematical complicity to the model.
  • Known co- localization information of different biomarkers within the same tissue section enables defining fixed upper bounds for the number of stains at one pixel.
  • a group sparsity model may be leveraged to explicitly model the fractions of stain contributions from the co-localized biomarkers into one group to yield a least squares solution within the group.
  • a sparse solution may be obtained among the groups to ensure that only a small number of groups with a total number of stains being less than the upper bound are activated. Results of applying these methods on a clinical data set containing a large number of multiplex IHC slides demonstrates better unmixing results than the prior art.
  • FIG. 1 depicts a system 100 for image unmixing using group sparsity modeling, according to an exemplary embodiment of the subject disclosure.
  • System 100 comprises a memory 1 10, which stores a plurality of processing modules or logical instructions that are executed by processor 105 coupled to electronic processing device 101 , for example a computer .
  • electronic processing device 101 which may be a computer, also includes user input and output devices such as a keyboard, mouse, stylus, and a display / touchscreen.
  • processor 105 executes logical instructions stored on memory 1 10, performing image analysis and other quantitative operations resulting in an output of results to a user operating electronic processing device 101 or via a network.
  • input data 102 may provide a means for inputting image data from one or more scanned IHC slides to memory 1 10.
  • Image data may include data related to color channels or frequency channels.
  • a biological specimen for example, a tissue section may need to be stained by means of application of a staining assay containing one or more different biomarkers associated with chromogenic stains for brightfield imaging or fluorophores for fluorescence imaging.
  • Staining assays can use chromogenic stains for brightfield imaging, organic fluorophores, quantum dots, or organic fluorophores together with quantum dots for fluorescence imaging, or any other combination of stains and viewing or imaging devices.
  • stains are specified to identity one or more types of biomarkers, for example, immune cells.
  • CD3 is a known universal marker for all the T-cells and CD8 only captures the cytotoxic T- cells in the membrane.
  • FoxP3 marks the regulatory T-cells in the nuclei and Hematoxylin (HTX) stains all the nuclei. Therefore, input 102 may further include co-location information for different biomarker, as co-localization information of the markers can be inferred from the biological knowledge.
  • CD3 and CD8 co-locate in the membrane while FoxP3 and HTX may appear in the same nucleus. Tumor markers on the tumor cell's cytoplasm region coexist with B-cell markers on the B-cell's membrane.
  • processing modules may be executed in order to analyze the image data and unmix the image using a group sparsity framework.
  • a pre-processing module 1 1 1 may be executed for converting an image such as an RGB image into an optical density (OD) space using the following formula derived from Beer's law based on the fact that the optical density is proportional to the stain
  • An unmixing module 1 12 may be invoked to unmix the optical density image O.
  • y be a pixel of O, where y represents a 3-dimensional column vector corresponding to the OD values converted from RGB. If there are M biomarkers available in the multiplex IHC slide, that provides M stain colors.
  • a typical unmixing problem may thus be formulated as the following: min
  • Each column of X corresponds to a reference stain color sampled from a control slide of pure stain or approximate pure stain. Reference colors may be stored in and retrieved from a reference database 1 13, or provided externally from a network.
  • This linear system has a solution only when the column of X is less than or equal to 3 for ysR 3 . Therefore, meaningful regularization may facilitate finding a solution for the linear system.
  • the biomarker co-localization information provides a partition of b into a set of groups gi ,g2, - - N being the total number of groups. Within each group, the biomarkers are known to have the co- localization possibility; these biomarkers are also refered herein as co- localization or colocation markers .
  • the method of the present invention adopts this biological information (input via input 102) to formulate the regularization term of the cost function.
  • Let g, be a qrdimensional column vector representing the combination weights of the stains within the it group and qi be the number of stains within the group g,. This provides qi + q2- .. + qN M. x, denotes the ith group of reference colors, resulting in a 3xq, matrix as depicted in FIG. 2.
  • FIG. 2 depicts a framework for group sparsity modeling, according to an
  • the RGB image y obtained from the imaging system or other input is represented as 220
  • the reference color matrix X includes a plurality of columns representing, from left to right, stains 1 to 6, and the unmixed image b corresponding to each stain may be represented by a matrix 224.
  • column 231 may represent a tumor cell cytokeratin (e.g. Dabsyl / Oscar)
  • column 232 may represent a regulatory T-cell nucleus (e.g. TAMRA / FP3)
  • column 233 may represent a universal nucleus (e.g.
  • column 234 may represent a B-cell membrane (e.g. Cy5 / CD20), column 5 may represent a universal T-cell membrane (e.g. Rho1 10 / CD3), and column 6 may represent a cytotoxic T-cell membrane (e.g. Cy5T / CD8).
  • Each of the six stains 231 -236 may be grouped into four different groups gi,g2, etc., where co-localized stains are in the same group, based on the groups depicted in key 226. Based on this biological co-localization information of the biomarkers, it may be straightforward to conclude that only two colors can co-exist at each pixel for this case.
  • the unmixing algorithm may perform operations including: activating only one group of stains with the contribution weights from the other groups marked as zero for each pixel, and, within the activated group, estimating the fractions of the contributions from each constituent stain.
  • the unmixing problem is solved by the unmixing method using a group sparsity framework so as to ensure the sparsity among the group but non-sparsity within the group.
  • Group sparsity and group lasso operations are further described in N. Simon, et al., "A Sparse Group Lasso". Journal of Computational and Graphical Statistics, vol. 22(2), pp.231 - 245, 2013. However, in accordance with the operations described herein, the following example may be used.
  • the unmixing problem may formulated as a group lasso criterion, i.e. as the following convex optimization problem or cost function where b - [ b h, - ⁇ Y - [g g ⁇ -. 9%]* and II ⁇ I is the Euclidean norm without squared.
  • the first term in Eqn.3 is the reconstruction error between the original RGB image and the unmixed images, and may be solved for the linear system as further described in A. C. Ruifrok and D. A. Johnston, "Quantification of Histochemical Staining by Color Deconvolution", Anal. Quant. Cytol.
  • the tuning parameter of the Lasso or group Lasso criterion i.e. the regularization parameter ⁇
  • the regularization parameter ⁇ can be chosen by cross-validation. For example, the image data is partitioned into complimentary subsets. The unmixing is then performed separately for the subsets using the Lasso or group Lasso criterion as described above with various choices for ⁇ and the one that leads to the least unmixing error is choosen.
  • the selection of a value for ⁇ can be performed by the steps of
  • the image data is divided into 10 subsets and respective 10 different values for ⁇ are chosen as candidate values.
  • the candidate value for ⁇ that yields a solution for the respective subset of the image data that has the lowest error is then selected for unmixing the entire image data or further image data that is acquired from the same or another tissue sample using the lasso or group lasso criterion.
  • the modules include logic that is executed by processor 105.
  • Logic refers to any information having the form of instruction signals and/or data that may be applied to affect the operation of a processor.
  • Software is one example of such logic.
  • the operations described herein may be implemented in a software language, for example, in C++, to provide fast computation. In such an embedment, it may cost about 7 seconds to unmix a 750 by 1400 image on an Intel Core i7 1.87GHZ PC.
  • processors are processors (processing units), microprocessors, digital signal processors, controllers and microcontrollers, etc.
  • Logic may be formed from signals stored on a computer-readable medium or digital storage medium such as memory 1 10 that, in an exemplary embodiment, may be a random access memory (RAM), read-only memories (ROM), erasable / electrically erasable programmable read-only memories (EPROMS/EEPROMS), flash memories, etc.
  • Logic may also comprise digital and/or analog hardware circuits, for example, hardware circuits comprising logical AND, OR, XOR, NAND, NOR, and other logical operations.
  • Logic may be formed from
  • logic may be
  • a particular logic unit is not limited to a single logical location on the network. Moreover, the modules need not be executed in any specific order.
  • the remaining figures depict simulated and actual results of how the unmixing operations described herein are empirically validated and compared to existing techniques.
  • the images depicted are from a CCD camera, i.e. RGB images, these operations may also be applied to spectral images such as those obtained via a spectral camera or scanner device, or any image comprising a mixture of the underlying co-localized biomarker expressions.
  • the disclosed operations are described with respect to cancerous tissue but may be applied to any tissue type and disease state.
  • FIGS. 3A-3C depict a simulated example of an image to be unmixed, and a
  • FIG. 3A depicts a simulated image comprising six colors following the stain co-localization and grouping of FIG. 2. The six colors depicted are yellow 331 , red 332, blue 333, red + cyan 334, blue + purple 335, and green 336.
  • FIG. 3B depicts the MSE compared to the ground truth unmixed channels for ⁇ within the range 0 to 20. The plot demonstrates that the system becomes stable when ⁇ >10. The example unmixing results are shown in FIG.
  • FIG. 3C depicting columns of colors yellow 331 , purple 337, blue 333, red 332, cyan 338, and green 336.
  • the first row shows the ground truth (i.e. ideal depiction) and the other rows showing the unmixed results by varying ⁇ .
  • 0, the system becomes deficient, hence unmixing errors are observed as shown in the second row of Fig 3C.
  • FIGS. 4-6 depict a clinical data set containing several different cancer issue
  • Fig. 4 yellow chromogen for tumor cell cytokeratin 441 , purple for regulatory T- cell nucleus 444, blue for universal nucleus 442, light blue for B-cell membrane, orange for universal T-cell membrane and dark green for cytotoxic T-cell membrane 443.
  • FIG. 5 depicts the unmixing examples of decomposing a multiplexed image 550 into single stain channels using the prior art 551 versus the group sparsity methods 552 disclosed herein, in accordance with an exemplary embodiment of the subject disclosure. It can be seen from FIG. 5 that pixel discontinuities and artifacts are observed from the color system unmixing results. Instead of solving a multiple three color system using the nearest neighbor assignment, a single system may be solved for all the pixels, hence leading to smoother unmixed images depicted by reference 552. Meanwhile, the algorithm maintains the biological constraints as wells as reduces the background noises.
  • FIG. 6 depicts an example of orange only cells (column 662), according to an exemplary embodiment of the subject disclosure.
  • the figure shows a section of multiplexed images 661 zoomed in, with three channels of the region depicted in the following columns: universal T- cell membrane channel (orange) 662, cytotoxic T-cell membrane channel (green) 663, and universal nucleus channel 664.
  • Images a-d represent different combinations of methods: (a) prior art without co-localization constraint, (b) group sparsity-based unmixing without co-localization constraint, (c) prior art with co- localization constraint, and (d) group sparsity-based unmixing with co-localization constraint. It can be seen that in the corresponding unmixed region of pure orange cells in column 662, the green signal is weak. However, strong green and orange signals exist in the other regions within the same image. This demonstrates that l_2 norm constraint is used within the group to linearly separate the color mixture into different stain contribution weights. Meanwhile, the prior art methods (a) and (c) are prone to unmixing errors due to the hard assignment of the system based on color similarity.
  • FIG. 7 depicts example images of nucleus co-localization cases, according to exemplary embodiments of the subject disclosure.
  • This figure shows the advantages of having biological co-localization constraints.
  • FIG. 6 shows multiplexed images 771 , 772, and 773, with three channels of the region depicted in the following columns: regulatory T-cell nucleus channel 774 (purple), and universal nucleus channel 775 (blue).
  • Images a-d represent different combinations of methods: (a) prior art without co-localization constraint, (b) group sparsity-based unmixing without co-localization constraint, (c) prior art with co-localization constraint, and (d) group sparsity-based unmixing with co- localization constraint.
  • the presence of nuclei is shown in both purple 774 and blue 775 channels if group sparsity is used. However, no purple is observed if no co-localization constraint is used.
  • the algorithm can also be used to less than three color unmixing.
  • the group size may be set to 1 , and compared with prior art methods for 2-stain unmixing. However, the group size may vary, for different applications, in various embodiments of the subject disclosure. The results are shown in FIG. 8, depicting much less background noise using the proposed sparse constrained method 884 versus prior art methods 883 for RGB images 881 and 882 with a group size set to 1.
  • Electronic processing devices such as computers, typically include known
  • a processor such as a processor, an operating system, system memory, memory storage devices, input-output controllers, input-output devices, and display devices. It will also be understood by those of ordinary skill in the relevant art that there are many possible configurations and components of an electronic processing device and may also include cache memory, a data backup unit, and many other devices.
  • Examples of input devices include a keyboard, a cursor control devices (e.g., a mouse), a microphone, a scanner, and so forth.
  • Examples of output devices include a display device (e.g., a monitor or projector), speakers, a printer, a network card, and so forth.
  • Display devices may include display devices that provide visual information, this information typically may be logically and/or physically organized as an array of pixels.
  • An interface controller may also be included that may comprise any of a variety of known or future software programs for providing input and output interfaces. For example, interfaces may include what are generally referred to as "Graphical User
  • Interfaces (often referred to as GUI's) that provide one or more graphical representations to a user. Interfaces are typically enabled to accept user inputs using means of selection or input known to those of ordinary skill in the related art.
  • the interface may also be a touch screen device.
  • applications on an electronic processing device may employ an interface that includes what are referred to as "command line interfaces" (often referred to as CLI's).
  • CLI's typically provide a text based interaction between an application and a user.
  • command line interfaces present output and receive input as lines of text through display devices.
  • some implementations may include what are referred to as a "shell” such as Unix Shells known to those of ordinary skill in the related art, or Microsoft Windows Powershell that employs object-oriented type programming architectures such as the Microsoft .NET framework.
  • interfaces may include one or more GUI's, CLI's or a combination thereof.
  • a processor may include a commercially available processor such as a Celeron, Core, or Pentium processor made by Intel Corporation, a SPARC processor made by Sun
  • Microsystems an Athlon, Sempron, Phenom, or Opteron processor made by AMD Corporation, or it may be one of other processors that are or will become available.
  • Some embodiments of a processor may include what is referred to as multi-core processor and/or be enabled to employ parallel processing technology in a single or multi-core configuration.
  • a multi-core architecture typically comprises two or more processor "execution cores".
  • each execution core may perform as an independent processor that enables parallel execution of multiple threads.
  • a processor may be configured in what is generally referred to as 32 or 64 bit architectures, or other architectural configurations now known or that may be developed in the future.
  • a processor typically executes an operating system, which may be, for example, a Windows type operating system from the Microsoft Corporation; the Mac OS X operating system from Apple Computer Corp.; a Unix or Linux-type operating system available from many vendors or what is referred to as an open source; another or a future operating system; or some combination thereof.
  • An operating system interfaces with firmware and hardware in a well-known manner, and facilitates the processor in coordinating and executing the functions of various programs that may be written in a variety of programming languages.
  • An operating system typically in cooperation with a processor, coordinates and executes functions of the other components of an electronic processing device.
  • An operating system also provides scheduling, input-output control, file and data management, memory management, and communication control and related services, all in accordance with known techniques.
  • System memory may include any of a variety of known or future memory storage devices that can be used to store the desired information and that can be accessed by an electronic processing device, such as a computer.
  • Computer readable media or digital storage media may include volatile and non-volatile, removable and non-removable media implemented in any method or technology for storage of information such as digitally encoded instructions, such as computer readable instructions, data structures, program modules, or other data. Examples include any commonly available random access memory (RAM), readonly memory (ROM), electronically erasable programmable read-only memory (EEPROM), digital versatile disks (DVD), magnetic medium, such as a resident hard disk or tape, an optical medium such as a read and write compact disc, or other memory storage device.
  • RAM random access memory
  • ROM readonly memory
  • EEPROM electronically erasable programmable read-only memory
  • DVD digital versatile disks
  • magnetic medium such as a resident hard disk or tape
  • an optical medium such as a read and write compact disc, or other memory storage device.
  • Memory storage devices may include any of a variety of known or future devices, including a compact disk drive, a tape drive, a removable hard disk drive, USB or flash drive, or a diskette drive. Such types of memory storage devices typically read from, and/or write to, a program storage medium such as, respectively, a compact disk, magnetic tape, removable hard disk, USB or flash drive, or floppy diskette. Any of these program storage media, or others now in use or that may later be developed, may be considered a computer program product. As will be appreciated, these program storage media typically store a software program and/or data. Software programs, also called control logic, typically are stored in system memory and/or the program storage device used in conjunction with memory storage device.
  • a program product comprising a digital storage medium having control logic (software program, including program code) stored therein.
  • the control logic when executed by a processor, causes the processor to perform functions described herein.
  • some functions are implemented primarily in hardware using, for example, a hardware state machine. Implementation of the hardware state machine so as to perform the functions described herein will be apparent to those skilled in the relevant arts.
  • Input-output controllers could include any of a variety of known devices for accepting and processing information from a user, whether a human or a machine, whether local or remote. Such devices include, for example, modem cards, wireless cards, network interface cards, sound cards, or other types of controllers for any of a variety of known input devices.
  • Output controllers could include controllers for any of a variety of known display devices for presenting information to a user, whether a human or a machine, whether local or remote.
  • the functional elements of an electronic processing device communicate with each other via a system bus.
  • Some embodiments of an electronic processing device may communicate with some functional elements using network or other types of remote communications.
  • an instrument control and/or a data processing application if implemented in software, may be loaded into and executed from system memory and/or a memory storage device.
  • instrument control and/or data processing applications may also reside in a read-only memory or similar device of the memory storage device, such devices not requiring that the instrument control and/or data processing applications first be loaded through input-output controllers. It will be understood by those skilled in the relevant art that the instrument control and/or data processing applications, or portions of it, may be loaded by a processor, in a known manner into system memory, or cache memory, or both, as advantageous for execution.
  • an electronic processing device may include one or more library files, experiment data files, and an internet client stored in system memory.
  • experiment data could include data related to one or more experiments or assays, such as detected signal values, or other values associated with one or more sequencing by synthesis (SBS) experiments or processes.
  • SBS sequencing by synthesis
  • an internet client may include an application enabled to access a remote service on another electronic processing device using a network and may for instance comprise what are generally referred to as "Web Browsers".
  • some commonly employed web browsers include Microsoft Internet Explorer available from Microsoft Corporation, Mozilla Firefox from the Mozilla Corporation, Safari from Apple Computer Corp., Google Chrome from the Google Corporation, or other type of web browser currently known in the art or to be developed in the future.
  • an internet client may include, or could be an element of, specialized software applications enabled to access remote information via a network such as a data processing application for biological applications.
  • a network may include one or more of the many various types of networks well known to those of ordinary skill in the art.
  • a network may include a local or wide area network that may employ what is commonly referred to as a TCP/IP protocol suite to communicate.
  • a network may include a network, such as a computer network, comprising a worldwide system of interconnected networks that is commonly referred to as the internet, or could also include various intranet architectures.
  • Firewalls also sometimes referred to as Packet Filters, or Border Protection Devices
  • firewalls may comprise hardware or software elements or some combination thereof and are typically designed to enforce security policies put in place by users, such as for instance network administrators, etc.
  • FIG. 9 schematically shows an embodiment of a tissue analysis system
  • tissue analysis system 900 comprises a color data storage module 910, a co-location data storage module 920, a tissue image data storage module 930, a tissue image analysis module 940, an optional tissue imaging module 950, an optional tissue staining module 960 and a communication bus 970 comprising a plurality of communication links 971 (for the sake of legibility, only one of the communication links bears a reference sign).
  • Communication bus 970 and the communication links 971 communicatively interconnect the aforementioned components 910-960.
  • FIG. 10 schematically shows a flow diagram 1000 of an embodiment of a tissue analysis method in accordance with the present disclosure, e.g. as described above.
  • flow diagram 1010 comprises an optional step 1010 of staining a tissue sample, an optional step 1020 of imaging the (stained) tissue sample, a step 1030 of storing color data, a step 1040 of storing coal location data, a step 1050 of storing a plurality of pixels representative of the tissue image, a step 1060 of unmixing a tissue image, a step 1070 of determining a group for which a minimum differences smallest and a step 1080 of outputting data indicative of a tissue feature as an analysis result.
  • the verb "may” is used to designate optionality / noncompulsoriness. In other words, something that "may” can, but need not.
  • the verb “comprise” may be understood in the sense of including. Accordingly, the verb “comprise” does not exclude the presence of other elements / actions.
  • relational terms such as “first,” “second,” “top,” “bottom” and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions.
  • any number of the respective elements e.g. as designating one, at least one, at least two, each or all of the respective elements.
  • the term "any” may be understood as designating any collection(s) of the respective elements, e.g. as designating one or more collections of the respective elements, a collection comprising one, at least one, at least two, each or all of the respective elements.
  • the respective collections need not comprise the same number of elements.
  • the expression “at least one” may, inter alia, be understood as one, two, three, four, five, ten, fifteen, twenty or one hundred. Similarly, the expression “at least one” may, inter alia, be understood as “one or more,” “two or more” or “five or more.”
  • expressions in parentheses may be understood as being optional.
  • quotation marks may emphasize that the expression in quotation marks may also be understood in a figurative sense.
  • quotation marks may identify a particular expression under discussion.
  • the specification may have presented the method and/or process of the present subject disclosure as a particular sequence of steps. However, to the extent that the method or process does not rely on the particular order of steps set forth herein, the method or process should not be limited to the particular sequence of steps described. As one of ordinary skill in the art would appreciate, other sequences of steps may be possible. Therefore, the particular order of the steps set forth in the specification should not be construed as limitations on the claims. In addition, the claims directed to the method and/or process of the present subject disclosure should not be limited to the

Landscapes

  • Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Medical Informatics (AREA)
  • Radiology & Medical Imaging (AREA)
  • Quality & Reliability (AREA)
  • Multimedia (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)
  • Investigating Or Analysing Materials By Optical Means (AREA)

Abstract

Systems and/or methods described herein relate to unmixing more than three stains, while preserving the biological constraints of the biomarkers. Unlimited numbers of markers may be unmixed from a limited-channel image, such as an RGB image, without adding any mathematical complicity to the model. Known co-localization information of different biomarkers within the same tissue section enables defining fixed upper bounds for the number of stains at one pixel. A group sparsity model may be leveraged to explicitly model the fractions of stain contributions from the co-localized biomarkers into one group to yield a least squares solution within the group. A sparse solution may be obtained among the groups to ensure that only a small number of groups with a total number of stains being less than the upper bound are activated.

Description

GROUP SPARSITY MODEL FOR IMAGE UNMIXING
[001] This U.S. Patent Application claims priority to U.S. Provisional Patent Application
Serial No. 61/943,265, filed February 21 , 2014, the content of which is hereby incorporated by reference herein in its entirety into this disclosure.
BACKGROUND OF THE SUBJECT DISCLOSURE
Field of the Subject Disclosure
[002] The present subject disclosure relates to digital pathology. More particularly, the present subject disclosure relates to color unmixing methods and systems for a multiplex IHC image that can accommodate any number of stain colors. Background of the Subject Disclosure
[003] Multiplex immunohistochemistry (IHC) staining is an emerging technique for the detection of multiple biomarkers within a single tissue section and has become more popular due to its significant efficiencies and the rich diagnostic information it has. A multiplex IHC slide has the potential advantage of simultaneously identifying multiple biomarkers in one tissue section as opposed to single biomarker labeling in multiple slides. Therefore, it is often used for the simultaneous assessment of multiple hallmarks of cancerous tissue. Often, a cancerous tissue slide is stained by the multiplex assay to identify biomarkers. For example, tumors in human often contain infiltrates (e.g., T-cells or B-cells) of immune cells, which may prevent the development of tumors or favor the outgrowth of tumors. In this scenario, multiple stains are used to target different type of immune cells and the population distribution of each type of the immune cells are used to study the clinical outcome of the patients. The stained slide is then imaged, for example, using a CCD color camera mounted on a microscope or a scanner.
In order to conduct accurate detection and classification of the cells, the cells are stained, for example, with chromogenic dyes, fluorescent markers and/or quantum dots, and then imaged. The image is unmixed to obtain the constituent dyes and/or the proportions of each dye in the color mixture, as a prerequisite step for multiplex image analysis, for example, multiplex IHC image analysis. Several techniques exist in the prior art to decompose each pixel of the RGB image into a collection of constituent stains and the fractions of the contributions from each of them. For example, color unmixing or deconvolution is used to unmix the RGB image with up to three stains in the converted optical density space. Given the reference color vectors X.GR3 of the pure stains, the method assumes that each pixel of the color mixture ysR3 is a linear combination of the pure stain colors and solves a linear system to obtain the combination weights bsRM. The linear system is denoted as y=Xb, where X = [XI , ... ,XM ](M≤ 3) is the matrix of reference colors. This technique is most widely used in the current digital pathology domain, however, the maximum number of stains that can be solved is limited to three as the linear system is deficient for not enough equations (X being a 3*M matrix). The color unmixing problem may be formulated into a non-negative matrix factorization and color decomposition performed in a fully automated manner, wherein no reference stain color selection is required. This method also solves for y = Xb and has the same limitation in dealing with large stain numbers. A color space may be divided into several systems with up to three colors by solving a convex framework, with a linear system being used to solve each individual system. Due to the
independent assignment of each pixel into different systems, the spatial continuity is lost in the unmixed images and artifacts such as holes are observed. Other methods may work for a larger number of stain colors, such as two-stage methods developed in the remote sensing domain to first learn the reference colors from the image context and then use them to unmix the image, however, these methods are designed to work for multi-spectral image unmixing which has more color channels than the RGB image. Sparse models for high dimensional multi-spectral image unmixing adopt the L0 norm to regularize the combination weights b of the reference colors hence leading to a solution that only a small number of reference colors are contributed to the stain color mixture, but these are also designed for multi-spectral images and do not use any prior biological information about the biomarkers, which may lead to undesired solutions for real data. Moreover, these methods cannot be applied to RGB images due to the image acquisition system, i.e. multi-spectral imaging instead of a CCD color camera to capture the image using a set of spectral narrow-band filters. The number of filters K can be as many as dozens or hundreds, leading to a multichannel image that provides much richer information than the brightfield RGB image. The linear system constructed from it is always an over-determined system with X being a KxM(K» M) matrix that leads to a unique solution, however, the scanning process in the multi-spectral imaging system is very time consuming and only a single field of view manually selected by a technician can be scanned instead of the whole slide, thereby limiting the usage of such methods.
[006] Therefore, there exists no numerical solution for unmixing an image having more unknown variable than the number of equations in the least squares system. To accurately unmix an IHC image and differentiate all the stains used is of tremendous clinical importance since it is the initial key step in multiplex IHC image analysis of digital pathology. Due to the limitations of a CCD color camera, an acquired RGB or brightfield image only contains three channels, the unmixing of which into more than three colors is a challenging task. Accordingly, a method for unmixing, which compensates for the limitations of the CCD color camera, is desirable.
SUMMARY OF THE SUBJECT DISCLOSURE
The present invention provides in particular for a tissue analysis system and method, a system for unmixing a tissue image and a digital storage medium that stores digitally encoded instructions executable by a processor of a tissue analysis system for configuring the tissue analysis system for execution of a method of the invention.
[007] The subject disclosure solves the above-identified problems in the current state of the art by providing systems and methods for unmixing multiplex IHC images having a number of stain contributions greater than a number of color channels, such as an RGB brightfield image. Operations disclosed herein include obtaining reference colors from the training images, modeling a RGB image unmixing problem using a group sparsity framework, in which the fractions of stain contributions from colocation markers are modeled within a same group and fractions of stain contributions from non-colocation markers are modeled in different groups, providing co-localization information of the markers to the group sparsity model, solving this group sparsity model using an algorithm such as a Group Lasso, yielding a least square solution within each group which corresponds to the unmixing of the colocation markers, and yielding a sparse solution among the groups that correspond to the unmixing of non-colocation markers. Reduction of the model to sparse unmixing without colocalization constraint is enabled by setting only one member in each group, and generating sparse unmixing results for more than two or three markers, in contrast to typical methods without sparse regularization.
[008] In one exemplary embodiment, the subject disclosure provides a method for unmixing an image, the method comprising generating a group sparsity model wherein a fraction of a stain contribution from colocation markers is assigned within a single group and a fraction of a stain contribution from non-colocation markers is assigned within separate groups, and solving the group sparsity model using group lasso algorithm to yield a least squares solution within each group and sparse solution among groups. The method, like all other methods disclosed herein, may be a computer-implemented method.
[009] In another exemplary embodiment, the subject disclosure provides a system for unmixing an image, comprising a processor, and a memory coupled to the
processor, the memory to store digitally encoded instructions that, when executed by the processor, cause the processor to perform operations including generating a group sparsity framework using known co-location information of a plurality of biomarkers within an image of a tissue section; wherein a fraction of each stain contribution is assigned to a different group based on the known co- location information, and solving the group sparsity model using group lasso algorithm to yield a least squares solution for each group.
[0010] In yet another exemplary embodiment, the subject disclosure provides a digital storage medium to store digitally encoded instructions executable by a processor to perform operations including modeling an RGB image unmixing problem using a group sparsity framework, in which fractions of stain contributions from a plurality of colocation markers are modeled within a same group and fractions of stain contributions from a plurality of non-colocation markers are modeled in different groups, providing co-localization information of the plurality of colocation markers to the modeled group sparsity framework, solving the modeled framework using a group lasso to yield a least squares solution within each group, wherein the least squares solution corresponds to the unmixing of the colocation markers, and yielding a sparse solution among the groups that corresponds to the unmixing of the non-colocation markers.
BRIEF DESCRIPTION OF THE DRAWINGS
[0011] The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
[0012] FIG. 1 depicts a system for image unmixing using a group sparsity model,
according to an exemplary embodiment of the subject disclosure.
[0013] FIG. 2 depicts a framework of an unmixing algorithm, according to an exemplary embodiment of the subject disclosure.
[0014] FIGS. 3A-3C depict a simulated example of an image to be unmixed, and
variations of a mean square error (MSE) of the regularization parameter λ, according to exemplary embodiments of the subject disclosure.
[0015] FIG. 4 depicts an example of a clinical data set containing several different
cancer tissue samples that were used to test the disclosed operations, according to exemplary embodiments of the subject disclosure.
[0016] FIG. 5 depicts the unmixing examples of decomposing a multiplexed image into single stain channels using the prior art versus the group sparsity methods disclosed herein, in accordance with an exemplary embodiment of the subject disclosure.
[0017] FIG. 6 depicts an example of orange only cells, according to an exemplary
embodiment of the subject disclosure.
[0018] FIG. 7 depicts example images of nucleus co-localization cases, according to exemplary embodiments of the subject disclosure.
[0019] FIG. 8 depicts less background noise using the proposed sparse constrained method versus prior art methods for RGB images with a group size set to 1 , according to an exemplary embodiment of the subject disclosure. [0020] Fig. 9 schematically shows an embodiment of a tissue analysis system.
[0021] Fig. 10 schematically shows a flow diagram of an embodiment of a tissue
analysis method.
DETAILED DESCRIPTION OF THE SUBJECT DISCLOSURE
[0022] Before elucidating the embodiments shown in the Figures, various
embodiments of the present disclosure will first be described in general terms.
[0023] The present disclosure relates, inter alia, to an analysis system, e.g. to a tissue analysis system. The system may be suitable for analyzing biological specimen, for example, tissue provided on a slide.
[0024] The term "image data" as understood herein encompasses raw image data
acquired from the biological tissue sample, such as by means of an optical sensor or sensor array, or pre-processed image data. In particular, the image data may comprise a pixel matrix.
[0025] The analysis system may comprise a color data storage module that stores color data, e.g. color data indicative of a color of the stains. Such color data is also referred to as "reference data". The color data may be descriptive of a single frequency or a characteristic spectral profile of the stain. The color data storage module may store color data for each of a plurality of stains. The plurality of stains may comprise at least 4, at least 10, at least 20 or at least 100 stains.
[0026] In the present disclosure, the term "biomarker" may be understood in the sense of a tissue feature (e.g. (a presence of) a particular cell type, for instance immune cells), in particular a tissue feature indicative of a medical condition. The biomarker may be identifiable by the presence of a particular molecule, for
instance a protein, in the tissue feature.
[0027] In the present disclosure, the term "marker" may be understood in the sense of a stain, dye or a tag (that allows a biomarker to be differentiated from ambient tissue and/or from other biomarkers). The tag may be stained or dyed. The tag may be an antibody, e.g. an antibody having an affinity to a protein of a particular biomarker. A marker may have an affinity to a particular biomarker, e.g. to a particular molecule / protein / cell structure / cell (indicative of a particular biomarker). The biomarker to which a marker has an affinity may be specific / unique for the respective marker. A marker may mark tissue, i.e. a biomarker in the tissue, with a color. The color of tissue marked by a respective marker may be specific / unique for the respective marker. For short, the present disclosure designates such relationships between a color and a marker as a "marker having a color" or as a "color of a marker."
[0028] Any of the plurality of markers may have an affinity to at least one tissue feature selected from the group consisting of a tumor cell cytokeratin, a regulatory T-cell nucleus, a universal nucleus, a B-cell membrane, a universal T- cell membrane, and a cytotoxic T-cell membrane.
[0029] The analysis system may comprise a co-location data storage module that defines a plurality of groups of the markers (whose color data is stored by the color data storage module). Each group may consist of markers having an affinity to a respective common tissue feature. In other words, each marker of a respective group may have an affinity to a common tissue feature. The common tissue feature to which (each of) the markers of a respective group have an
affinity may be unique for each respective group. Each of the plurality of groups may consist of at least one and no more than three markers. The plurality of groups may comprise each of the plurality of markers. In other words, the plurality of groups may be defined such that each individual marker of the plurality of markers belongs to at least one group of the plurality of groups.
[0030] The analysis system may comprise a tissue image data storage module that stores a plurality of pixels representative of a tissue image. The tissue image may be an RGB, a CYMK image or other multi-channel color image (of a tissue sample). The multi-channel color image may comprise from 1 to 10, e.g. from 3 to 5, (color) channels. As such, each pixel may comprise color information for any of a plurality of color channels, e.g. for each of a red, green and blue channel of an RGB image.
[0031] The pixels may represent the tissue image (at a resolution) such that (in at least 50%, at least 75% or at least 90% of all cases) at least one pixel is required to represent any individual biomarker of the biomarkers identifiable by the plurality of markers or the background. In other words, the pixels may represent the tissue image (at a resolution) such that (in at least 50%, at least 75% or at least 90% of all cases) any individual biomarker of the biomarkers identifiable by the plurality of markers or the background occupies at least one pixel. Similarly, the pixels may represent the tissue image such that, by virtue of the image resolution and a biological co-location (in tissue) of the molecules / proteins / cell structures / cells to which the individual markers of the plurality of markers have an affinity, at most three markers are visible per pixel (for at least 50%, at least 75% or at least 90% of the pixels). The expression "at most three markers" is not limited to the sense that at most three individual antibody tags (or other individual markers) are visible per pixel. Instead, the expression "at most three markers" may be understood in the sense that, of the plurality of markers, at most three of the plurality are visible per pixel. In other words, the colors constituting an individual pixel (in addition to the natural colors of the tissue) may be limited to three colors selected from the group of tissue colors obtained by marking tissue with the plurality of markers. As such, for any of the plurality of groups, the number of markers in the respective group may define an upper limit for the number of (different kinds of) stains, i.e. markers, per pixel. However,
mathematically, the framework is not limited to the constraint of "at most three markers". In the present context, the term "stain" may be understood in the broad sense of any type of marker such as an antibody, dye or stain suitable to mark, i.e. "stain," biomarkers in a tissue.
The analysis system may comprise a tissue image analysis module (for unmixing the tissue image). The tissue image analysis module may (be configured to) calculate a linear combination of the colors of the markers of a respective group that yields a minimum difference between (the color information of) a respective pixel and the linear combination of colors. The tissue image analysis module may calculate such a linear combination for any individual group of the groups and for any individual pixel of the plurality of pixels. In other words, the tissue image analysis module may unmix (e.g. as understood in the art of imaging spectroscopy) a pixel into the colors of the markers of a respective group, e.g. by finding a linear combination of the colors of the markers of a respective group that (closely / most closely) matches the color of a respective pixel (as represented by the color channels of the respective pixel). The aforementioned "minimum" difference need not be understood as a mathematical minimum or as an absolute minimum. Instead, the "minimum" difference may be "minimum" as determinable by data available to the tissue image analysis module. The difference between a linear combination of the colors of the markers of a respective group and the respective pixel may be measured in a color space of the respective pixel, e.g. as a polynomial that comprises, for each (color) channel of the pixel, the difference between the respective linear combination and the value of the pixel in the respective channel as a variable. For example, the difference may be measured as a sum of the squares obtained by squaring, for each (color) channel of the pixel, the difference between the respective linear combination and the value of the pixel in the respective channel. As such, the tissue image analysis module may calculate a linear combination of the colors of the markers of a respective group that yields a minimum difference between a respective pixel and the linear combination by means of a least square algorithm.
The tissue image analysis module may affect the aforementioned calculating using any of the color data stored in the color data storage module, any of the co-location data stored in the co-location data storage module and/or any of the plurality of pixels stored in the tissue image data storage module. Accordingly, the tissue image analysis module may be configured to read any of the color data from the color data storage module, to read any of the co-location data from the co-location data storage module and/or to read any of the pixels from the tissue image data storage module.
[0034] The tissue image analysis module may determine a group for which the
(aforementioned) minimum difference (between a respective pixel and a linear combination of the colors of the markers of the respective group) is smallest and may output (data indicative of) the tissue feature of the (determined) group as an analysis result. In other words, the tissue image analysis module may determine which group of markers has colors that can be combined to most closely match a respective pixel and may output (data indicative of) the (determined) group as an analysis result.
[0035] The (tissue) analysis system may comprise a (tissue) staining module that stains a tissue sample with any of the plurality of markers. In the present context, the verb "stain" is not limited to an application of a stain (in a limited sense of the word) to a tissue, but may instead likewise comprise exposing the tissue to any type of markers such as antibodies or dyes to mark biomarkers in the tissue.
[0036] The tissue image data storage module may receive the plurality of pixels directly or indirectly from a source that need not be an element of the (tissue) analysis system. In this respect, the (tissue) analysis system may comprise a (tissue) imaging module, e.g. a (tissue) imaging module that images a tissue sample to obtain the plurality of pixels representative of a tissue image. The tissue sample may be a tissue sample stained with any of the plurality of markers, e.g. by the (tissue) staining module. The imaging module may utilize nonvisible electromagnetic radiation (UV light, for example), magnetic resonance, ultrasound or other imaging techniques to capture the tissue image. The (tissue) imaging module may comprise a microscope and a (CCD) camera arranged to capture a plurality of (raw) pixels representative of an image of a tissue sample magnified by the microscope. The plurality of pixels stored by the tissue image data storage module may be identical to and/or derived from raw pixels captured by the (tissue) imaging module. One of ordinary skill in the art would recognize that other image sensors or methods for capturing an image, such as a digital image, may be utilized.
[0037] The (tissue) imaging module may comprise a bright-field illumination
module that effects bright-field illumination of the tissue sample and may effect capture of the plurality of pixels representative of an image of the tissue sample during bright-field illumination of the tissue sample.
[0038] As touched upon above, the (tissue) imaging module may comprise a
CCD camera, e.g. a CCD camera selected from the group consisting of an RGB CCD camera and a CCD camera having at most five color channels. The (tissue) imaging module may effect imaging, i.e. capture of the plurality of pixels representative of an image of the tissue sample, by means of a CCD camera selected from the group consisting of an RGB CCD camera and a CCD camera having at most five color channels. For example, the CCD camera may capture pixels in each of a red, green and blue channel or in each of a red, green, blue and UV channel. The CCD camera may comprise a beam splitter for splitting incident light into the various (color) channels for capture. [0039] The present disclosure relates, inter alia, to an analysis method, e.g. to a tissue analysis method. The method may be suitable for analyzing biological tissue provided on a slide. As such, the aforementioned discussion of an analysis system applies mutatis mutandis, to an analysis method employing the techniques described above.
[0040] The various embodiments of the present disclosure having been
described above in general terms, the embodiments shown in the Figures will now be elucidated.
[0041] Systems and/or methods described herein relate to unmixing more than three stains (for example, chromogenic dyes, fluorescent stains, or quantum dots), while preserving the biological constraints of the biomarkers. Unlimited numbers of markers may be unmixed from a limited-channel image, such as an RGB image, without adding any mathematical complicity to the model. Known co- localization information of different biomarkers within the same tissue section enables defining fixed upper bounds for the number of stains at one pixel. A group sparsity model may be leveraged to explicitly model the fractions of stain contributions from the co-localized biomarkers into one group to yield a least squares solution within the group. A sparse solution may be obtained among the groups to ensure that only a small number of groups with a total number of stains being less than the upper bound are activated. Results of applying these methods on a clinical data set containing a large number of multiplex IHC slides demonstrates better unmixing results than the prior art.
[0042] FIG. 1 depicts a system 100 for image unmixing using group sparsity modeling, according to an exemplary embodiment of the subject disclosure. System 100 comprises a memory 1 10, which stores a plurality of processing modules or logical instructions that are executed by processor 105 coupled to electronic processing device 101 , for example a computer . Besides processor 105 and memory 1 10, electronic processing device 101 , which may be a computer, also includes user input and output devices such as a keyboard, mouse, stylus, and a display / touchscreen. As will be explained in the following discussion, processor 105 executes logical instructions stored on memory 1 10, performing image analysis and other quantitative operations resulting in an output of results to a user operating electronic processing device 101 or via a network.
For instance, input data 102 may provide a means for inputting image data from one or more scanned IHC slides to memory 1 10. Image data may include data related to color channels or frequency channels. For instance, a biological specimen, for example, a tissue section may need to be stained by means of application of a staining assay containing one or more different biomarkers associated with chromogenic stains for brightfield imaging or fluorophores for fluorescence imaging. Staining assays can use chromogenic stains for brightfield imaging, organic fluorophores, quantum dots, or organic fluorophores together with quantum dots for fluorescence imaging, or any other combination of stains and viewing or imaging devices. In the analysis of biological specimens, for example, cancerous tissues, different stains are specified to identity one or more types of biomarkers, for example, immune cells. For instance, CD3 is a known universal marker for all the T-cells and CD8 only captures the cytotoxic T- cells in the membrane. FoxP3 marks the regulatory T-cells in the nuclei and Hematoxylin (HTX) stains all the nuclei. Therefore, input 102 may further include co-location information for different biomarker, as co-localization information of the markers can be inferred from the biological knowledge. For example, CD3 and CD8 co-locate in the membrane while FoxP3 and HTX may appear in the same nucleus. Tumor markers on the tumor cell's cytoplasm region coexist with B-cell markers on the B-cell's membrane.
[0044] Upon receiving this image data and co-location information, a plurality of
processing modules may be executed in order to analyze the image data and unmix the image using a group sparsity framework. A pre-processing module 1 1 1 may be executed for converting an image such as an RGB image into an optical density (OD) space using the following formula derived from Beer's law based on the fact that the optical density is proportional to the stain
concentration, using:
Oe = - _<¾(-½-)
J0,c where c is the index of the RGB color channels, l0 is the RGB value of the white points and O is the optical density image obtained. O is utilized to reference an image as further described below.
[0045] An unmixing module 1 12 may be invoked to unmix the optical density image O.
Let y be a pixel of O, where y represents a 3-dimensional column vector corresponding to the OD values converted from RGB. If there are M biomarkers available in the multiplex IHC slide, that provides M stain colors. Let b be the combination weight vector of the stains and bm,m = 1 , ...,M is the η¼ element of b. A typical unmixing problem may thus be formulated as the following: min |v— Xh\
[0046] Each column of X corresponds to a reference stain color sampled from a control slide of pure stain or approximate pure stain. Reference colors may be stored in and retrieved from a reference database 1 13, or provided externally from a network. This linear system has a solution only when the column of X is less than or equal to 3 for ysR3. Therefore, meaningful regularization may facilitate finding a solution for the linear system. The biomarker co-localization information provides a partition of b into a set of groups gi ,g2, - -
Figure imgf000019_0001
N being the total number of groups. Within each group, the biomarkers are known to have the co- localization possibility; these biomarkers are also refered herein as co- localization or colocation markers . The method of the present invention adopts this biological information (input via input 102) to formulate the regularization term of the cost function. Let g, be a qrdimensional column vector representing the combination weights of the stains within the it group and qi be the number of stains within the group g,. This provides qi + q2- .. + qN = M. x, denotes the ith group of reference colors, resulting in a 3xq, matrix as depicted in FIG. 2.
[0047] FIG. 2 depicts a framework for group sparsity modeling, according to an
exemplary embodiment of the subject disclosure. The framework is an elaboration of the unmixing algorithm y = X b. The RGB image y obtained from the imaging system or other input is represented as 220, the reference color matrix X includes a plurality of columns representing, from left to right, stains 1 to 6, and the unmixed image b corresponding to each stain may be represented by a matrix 224. For example, column 231 may represent a tumor cell cytokeratin (e.g. Dabsyl / Oscar), column 232 may represent a regulatory T-cell nucleus (e.g. TAMRA / FP3), column 233 may represent a universal nucleus (e.g. HTX), column 234 may represent a B-cell membrane (e.g. Cy5 / CD20), column 5 may represent a universal T-cell membrane (e.g. Rho1 10 / CD3), and column 6 may represent a cytotoxic T-cell membrane (e.g. Cy5T / CD8). Each of the six stains 231 -236 may be grouped into four different groups gi,g2, etc., where co-localized stains are in the same group, based on the groups depicted in key 226. Based on this biological co-localization information of the biomarkers, it may be straightforward to conclude that only two colors can co-exist at each pixel for this case. The unmixing algorithm, for an exemplary multiplex IHC image, may perform operations including: activating only one group of stains with the contribution weights from the other groups marked as zero for each pixel, and, within the activated group, estimating the fractions of the contributions from each constituent stain. As a result of these conditions the unmixing problem is solved by the unmixing method using a group sparsity framework so as to ensure the sparsity among the group but non-sparsity within the group. Group sparsity and group lasso operations are further described in N. Simon, et al., "A Sparse Group Lasso". Journal of Computational and Graphical Statistics, vol. 22(2), pp.231 - 245, 2013. However, in accordance with the operations described herein, the following example may be used.
For example, six stains may be available resulting in M = 6 based on the explanation above. Two of them are co-localized membrane stains and two are co-localized nucleus stains. One is tumor cytokeratin stain and the rest is a membrane stain but only for a B-cell. This information enables dividing the stains into four groups (N = 4) as shown in Fig.2. For instance, g2 contains b2 and b3 that are corresponding to the two co-located nucleus stains and each column of X2 is the reference color for the stain within the 2nd group. However, the 4th stain of the B-cell marker does not co-localize with other biomarkers, so g3 only has one single member b4.
Specifically to this example, the unmixing problem may formulated as a group lasso criterion, i.e. as the following convex optimization problem or cost function
Figure imgf000021_0001
where b - [b h, -■■ Y - [g g ■■■ -. 9%]* and II I is the Euclidean norm without squared. The first term in Eqn.3 is the reconstruction error between the original RGB image and the unmixed images, and may be solved for the linear system as further described in A. C. Ruifrok and D. A. Johnston, "Quantification of Histochemical Staining by Color Deconvolution", Anal. Quant. Cytol. Histol., 23:291 -299, 2001 , which minimize the least square error between the intensity of the raw image and the possible linear combination of the reference colors that approximate the raw image. The second term is the group sparsity constraint, λ is the regularization parameter that controls the amount of the group sparsity constraint in the second term. When this cost function is minimized, ideally only a very small number of groups are active in the results due to the group sparsity constraint. This equation may be solved by an alternative direction method of multipliers (ADMM) algorithm, as further described in S. Boyd, et al., "Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers," Foundations and Trends in Machine Learning, vol. 3(1 ), pp.1 -122, 2010. This model will act like a Lasso optimization method at the group level. The entire groups will be dropped out in the result when an optimal b is found. That is, in an exemplary embodiment, only a small number of g, are non-zero.
[0050] When the size of each group q, = 1 , the model becomes equivalent to a lasso as described above. In this case, no biological co-localization information is used in this model. However the sparsity constraints on the unmixed channels are retained which suppress the background noises and the system remains to be solvable. In the experimental results shown with respect to FIGS. 3 onwards, the efficacy of lasso unmixing is demonstrated by limiting the size of the group to 1 .
[0051] The tuning parameter of the Lasso or group Lasso criterion, i.e. the regularization parameter λ, can be chosen by cross-validation. For example, the image data is partitioned into complimentary subsets. The unmixing is then performed separately for the subsets using the Lasso or group Lasso criterion as described above with various choices for λ and the one that leads to the least unmixing error is choosen.
[0052]
[0053] For the data set having ground truth unmixing results, an error is calculated for each of the unmixing solutions and the choice of a value for λ that yields the lowest error is chosen for performing the unmixing of the complete image data using that choice of λ for the Lasso or group Lasso criterion. The ground truth can be obtained by manual annotation.
For example, the selection of a value for λ can be performed by the steps of
• dividing the image data into at least a first subset and a second subset,
• choosing a first value λι and a second value λ2 for the tuning parameter λ,
• calculating a first solution using the group Lasso or Lasso criterion for the first subset with the first value λι of the tuning parameter λ,
• calculating a first error of the unmixing result obtained from the first
solution,
• calculating a second solution of the group Lasso or Lasso criterion for the second subset with the second value λ2 of the tuning parameter,
• calculating a second error of the unmixing result obtained from the second solution,
• selecting one of the first value λι and second value λ2 of the tuning
parameter that results in the lowest error.
For example, the image data is divided into 10 subsets and respective 10 different values for λ are chosen as candidate values.
The candidate value for λ that yields a solution for the respective subset of the image data that has the lowest error is then selected for unmixing the entire image data or further image data that is acquired from the same or another tissue sample using the lasso or group lasso criterion.
As described above, the modules include logic that is executed by processor 105. "Logic", as used herein and throughout this disclosure, refers to any information having the form of instruction signals and/or data that may be applied to affect the operation of a processor. Software is one example of such logic. For example, the operations described herein may be implemented in a software language, for example, in C++, to provide fast computation. In such an embedment, it may cost about 7 seconds to unmix a 750 by 1400 image on an Intel Core i7 1.87GHZ PC. In should be understood by one skilled in the art that the language used for implementation of the disclosed operations may vary. Examples of processors are processors (processing units), microprocessors, digital signal processors, controllers and microcontrollers, etc. Logic may be formed from signals stored on a computer-readable medium or digital storage medium such as memory 1 10 that, in an exemplary embodiment, may be a random access memory (RAM), read-only memories (ROM), erasable / electrically erasable programmable read-only memories (EPROMS/EEPROMS), flash memories, etc. Logic may also comprise digital and/or analog hardware circuits, for example, hardware circuits comprising logical AND, OR, XOR, NAND, NOR, and other logical operations. Logic may be formed from
combinations of software and hardware. On a network, logic may be
programmed on a server, or a complex of servers. A particular logic unit is not limited to a single logical location on the network. Moreover, the modules need not be executed in any specific order.
The remaining figures depict simulated and actual results of how the unmixing operations described herein are empirically validated and compared to existing techniques. Although the images depicted are from a CCD camera, i.e. RGB images, these operations may also be applied to spectral images such as those obtained via a spectral camera or scanner device, or any image comprising a mixture of the underlying co-localized biomarker expressions. Moreover, the disclosed operations are described with respect to cancerous tissue but may be applied to any tissue type and disease state.
[0056] FIGS. 3A-3C depict a simulated example of an image to be unmixed, and a
mean square error (MSE) of the regularization parameter λ increasing. FIG. 3A depicts a simulated image comprising six colors following the stain co-localization and grouping of FIG. 2. The six colors depicted are yellow 331 , red 332, blue 333, red + cyan 334, blue + purple 335, and green 336. Referring to FIG. 3B, as the group sparsity regularization parameter increases, more accurate unmixing is achieved. FIG. 3B depicts the MSE compared to the ground truth unmixed channels for λ within the range 0 to 20. The plot demonstrates that the system becomes stable when λ>10. The example unmixing results are shown in FIG. 3C, depicting columns of colors yellow 331 , purple 337, blue 333, red 332, cyan 338, and green 336. In FIG. 3, the first row shows the ground truth (i.e. ideal depiction) and the other rows showing the unmixed results by varying λ. When λ = 0, the system becomes deficient, hence unmixing errors are observed as shown in the second row of Fig 3C.
[0057] FIGS. 4-6 depict a clinical data set containing several different cancer issue
samples that were used to test the disclosed operations, including colorectal cancer, non-small cell lung cancer, and breast cancer tissue samples that consist of 32 fields of view. The tissues were stained with following assay as shown in Fig. 4: yellow chromogen for tumor cell cytokeratin 441 , purple for regulatory T- cell nucleus 444, blue for universal nucleus 442, light blue for B-cell membrane, orange for universal T-cell membrane and dark green for cytotoxic T-cell membrane 443.
[0058] FIG. 5 depicts the unmixing examples of decomposing a multiplexed image 550 into single stain channels using the prior art 551 versus the group sparsity methods 552 disclosed herein, in accordance with an exemplary embodiment of the subject disclosure. It can be seen from FIG. 5 that pixel discontinuities and artifacts are observed from the color system unmixing results. Instead of solving a multiple three color system using the nearest neighbor assignment, a single system may be solved for all the pixels, hence leading to smoother unmixed images depicted by reference 552. Meanwhile, the algorithm maintains the biological constraints as wells as reduces the background noises.
[0059] Since the green cytotoxic T-cell membrane marker is the subset of the orange universal T-cell membrane marker, FIG. 6 depicts an example of orange only cells (column 662), according to an exemplary embodiment of the subject disclosure. The figure shows a section of multiplexed images 661 zoomed in, with three channels of the region depicted in the following columns: universal T- cell membrane channel (orange) 662, cytotoxic T-cell membrane channel (green) 663, and universal nucleus channel 664. Images a-d represent different combinations of methods: (a) prior art without co-localization constraint, (b) group sparsity-based unmixing without co-localization constraint, (c) prior art with co- localization constraint, and (d) group sparsity-based unmixing with co-localization constraint. It can be seen that in the corresponding unmixed region of pure orange cells in column 662, the green signal is weak. However, strong green and orange signals exist in the other regions within the same image. This demonstrates that l_2 norm constraint is used within the group to linearly separate the color mixture into different stain contribution weights. Meanwhile, the prior art methods (a) and (c) are prone to unmixing errors due to the hard assignment of the system based on color similarity.
FIG. 7 depicts example images of nucleus co-localization cases, according to exemplary embodiments of the subject disclosure. This figure shows the advantages of having biological co-localization constraints. As in FIG. 6, this figure shows multiplexed images 771 , 772, and 773, with three channels of the region depicted in the following columns: regulatory T-cell nucleus channel 774 (purple), and universal nucleus channel 775 (blue). Images a-d represent different combinations of methods: (a) prior art without co-localization constraint, (b) group sparsity-based unmixing without co-localization constraint, (c) prior art with co-localization constraint, and (d) group sparsity-based unmixing with co- localization constraint. The presence of nuclei is shown in both purple 774 and blue 775 channels if group sparsity is used. However, no purple is observed if no co-localization constraint is used. As a special case example, the algorithm can also be used to less than three color unmixing. The group size may be set to 1 , and compared with prior art methods for 2-stain unmixing. However, the group size may vary, for different applications, in various embodiments of the subject disclosure. The results are shown in FIG. 8, depicting much less background noise using the proposed sparse constrained method 884 versus prior art methods 883 for RGB images 881 and 882 with a group size set to 1.
[0061] Therefore, embodiments disclosed herein propose a novel color unmixing
strategy for multiplexed brightfield histopathology images based on a group sparsity model. The biological co-localization information of the bio-markers is explicitly defined in the regularization term to produce biologically meaningful unmixing results. The experiments of both synthetic and clinical data
demonstrate the efficacy of the proposed algorithm in terms of accuracy, stability and robustness.
[0062] Electronic processing devices, such as computers, typically include known
components, such as a processor, an operating system, system memory, memory storage devices, input-output controllers, input-output devices, and display devices. It will also be understood by those of ordinary skill in the relevant art that there are many possible configurations and components of an electronic processing device and may also include cache memory, a data backup unit, and many other devices. Examples of input devices include a keyboard, a cursor control devices (e.g., a mouse), a microphone, a scanner, and so forth. Examples of output devices include a display device (e.g., a monitor or projector), speakers, a printer, a network card, and so forth. Display devices may include display devices that provide visual information, this information typically may be logically and/or physically organized as an array of pixels. An interface controller may also be included that may comprise any of a variety of known or future software programs for providing input and output interfaces. For example, interfaces may include what are generally referred to as "Graphical User
Interfaces" (often referred to as GUI's) that provide one or more graphical representations to a user. Interfaces are typically enabled to accept user inputs using means of selection or input known to those of ordinary skill in the related art. The interface may also be a touch screen device. In the same or alternative embodiments, applications on an electronic processing device may employ an interface that includes what are referred to as "command line interfaces" (often referred to as CLI's). CLI's typically provide a text based interaction between an application and a user. Typically, command line interfaces present output and receive input as lines of text through display devices. For example, some implementations may include what are referred to as a "shell" such as Unix Shells known to those of ordinary skill in the related art, or Microsoft Windows Powershell that employs object-oriented type programming architectures such as the Microsoft .NET framework.
Those of ordinary skill in the related art will appreciate that interfaces may include one or more GUI's, CLI's or a combination thereof. A processor may include a commercially available processor such as a Celeron, Core, or Pentium processor made by Intel Corporation, a SPARC processor made by Sun
Microsystems, an Athlon, Sempron, Phenom, or Opteron processor made by AMD Corporation, or it may be one of other processors that are or will become available. Some embodiments of a processor may include what is referred to as multi-core processor and/or be enabled to employ parallel processing technology in a single or multi-core configuration. For example, a multi-core architecture typically comprises two or more processor "execution cores". In the present example, each execution core may perform as an independent processor that enables parallel execution of multiple threads. In addition, those of ordinary skill in the related will appreciate that a processor may be configured in what is generally referred to as 32 or 64 bit architectures, or other architectural configurations now known or that may be developed in the future.
[0064] A processor typically executes an operating system, which may be, for example, a Windows type operating system from the Microsoft Corporation; the Mac OS X operating system from Apple Computer Corp.; a Unix or Linux-type operating system available from many vendors or what is referred to as an open source; another or a future operating system; or some combination thereof. An operating system interfaces with firmware and hardware in a well-known manner, and facilitates the processor in coordinating and executing the functions of various programs that may be written in a variety of programming languages. An operating system, typically in cooperation with a processor, coordinates and executes functions of the other components of an electronic processing device. An operating system also provides scheduling, input-output control, file and data management, memory management, and communication control and related services, all in accordance with known techniques.
[0065] System memory may include any of a variety of known or future memory storage devices that can be used to store the desired information and that can be accessed by an electronic processing device, such as a computer. Computer readable media or digital storage media may include volatile and non-volatile, removable and non-removable media implemented in any method or technology for storage of information such as digitally encoded instructions, such as computer readable instructions, data structures, program modules, or other data. Examples include any commonly available random access memory (RAM), readonly memory (ROM), electronically erasable programmable read-only memory (EEPROM), digital versatile disks (DVD), magnetic medium, such as a resident hard disk or tape, an optical medium such as a read and write compact disc, or other memory storage device. Memory storage devices may include any of a variety of known or future devices, including a compact disk drive, a tape drive, a removable hard disk drive, USB or flash drive, or a diskette drive. Such types of memory storage devices typically read from, and/or write to, a program storage medium such as, respectively, a compact disk, magnetic tape, removable hard disk, USB or flash drive, or floppy diskette. Any of these program storage media, or others now in use or that may later be developed, may be considered a computer program product. As will be appreciated, these program storage media typically store a software program and/or data. Software programs, also called control logic, typically are stored in system memory and/or the program storage device used in conjunction with memory storage device. In some embodiments, a program product is described comprising a digital storage medium having control logic (software program, including program code) stored therein. The control logic, when executed by a processor, causes the processor to perform functions described herein. In other embodiments, some functions are implemented primarily in hardware using, for example, a hardware state machine. Implementation of the hardware state machine so as to perform the functions described herein will be apparent to those skilled in the relevant arts. Input-output controllers could include any of a variety of known devices for accepting and processing information from a user, whether a human or a machine, whether local or remote. Such devices include, for example, modem cards, wireless cards, network interface cards, sound cards, or other types of controllers for any of a variety of known input devices. Output controllers could include controllers for any of a variety of known display devices for presenting information to a user, whether a human or a machine, whether local or remote. In the presently described embodiment, the functional elements of an electronic processing device communicate with each other via a system bus. Some embodiments of an electronic processing device may communicate with some functional elements using network or other types of remote communications. As will be evident to those skilled in the relevant art, an instrument control and/or a data processing application, if implemented in software, may be loaded into and executed from system memory and/or a memory storage device. All or portions of the instrument control and/or data processing applications may also reside in a read-only memory or similar device of the memory storage device, such devices not requiring that the instrument control and/or data processing applications first be loaded through input-output controllers. It will be understood by those skilled in the relevant art that the instrument control and/or data processing applications, or portions of it, may be loaded by a processor, in a known manner into system memory, or cache memory, or both, as advantageous for execution. Also, an electronic processing device may include one or more library files, experiment data files, and an internet client stored in system memory. For example, experiment data could include data related to one or more experiments or assays, such as detected signal values, or other values associated with one or more sequencing by synthesis (SBS) experiments or processes. Additionally, an internet client may include an application enabled to access a remote service on another electronic processing device using a network and may for instance comprise what are generally referred to as "Web Browsers". In the present example, some commonly employed web browsers include Microsoft Internet Explorer available from Microsoft Corporation, Mozilla Firefox from the Mozilla Corporation, Safari from Apple Computer Corp., Google Chrome from the Google Corporation, or other type of web browser currently known in the art or to be developed in the future. Also, in the same or other embodiments an internet client may include, or could be an element of, specialized software applications enabled to access remote information via a network such as a data processing application for biological applications.
A network may include one or more of the many various types of networks well known to those of ordinary skill in the art. For example, a network may include a local or wide area network that may employ what is commonly referred to as a TCP/IP protocol suite to communicate. A network may include a network, such as a computer network, comprising a worldwide system of interconnected networks that is commonly referred to as the internet, or could also include various intranet architectures. Those of ordinary skill in the related arts will also appreciate that some users in networked environments may prefer to employ what are generally referred to as "firewalls" (also sometimes referred to as Packet Filters, or Border Protection Devices) to control information traffic to and from hardware and/or software systems. For example, firewalls may comprise hardware or software elements or some combination thereof and are typically designed to enforce security policies put in place by users, such as for instance network administrators, etc.
[0067] Figure 9 schematically shows an embodiment of a tissue analysis system
900 in accordance with the present disclosure, e.g. as described above.
[0068] In the illustrated embodiment, tissue analysis system 900 comprises a color data storage module 910, a co-location data storage module 920, a tissue image data storage module 930, a tissue image analysis module 940, an optional tissue imaging module 950, an optional tissue staining module 960 and a communication bus 970 comprising a plurality of communication links 971 (for the sake of legibility, only one of the communication links bears a reference sign). Communication bus 970 and the communication links 971 communicatively interconnect the aforementioned components 910-960.
[0069] Figure 10 schematically shows a flow diagram 1000 of an embodiment of a tissue analysis method in accordance with the present disclosure, e.g. as described above.
[0070] In the illustrated embodiment, flow diagram 1010 comprises an optional step 1010 of staining a tissue sample, an optional step 1020 of imaging the (stained) tissue sample, a step 1030 of storing color data, a step 1040 of storing coal location data, a step 1050 of storing a plurality of pixels representative of the tissue image, a step 1060 of unmixing a tissue image, a step 1070 of determining a group for which a minimum differences smallest and a step 1080 of outputting data indicative of a tissue feature as an analysis result.
[0071] In the present disclosure, the verb "may" is used to designate optionality / noncompulsoriness. In other words, something that "may" can, but need not. In the present disclosure, the verb "comprise" may be understood in the sense of including. Accordingly, the verb "comprise" does not exclude the presence of other elements / actions. In the present disclosure, relational terms such as "first," "second," "top," "bottom" and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions.
[0072] In the present disclosure, the term "any" may be understood as
designating any number of the respective elements, e.g. as designating one, at least one, at least two, each or all of the respective elements. Similarly, the term "any" may be understood as designating any collection(s) of the respective elements, e.g. as designating one or more collections of the respective elements, a collection comprising one, at least one, at least two, each or all of the respective elements. The respective collections need not comprise the same number of elements.
[0073] In the present disclosure, the expression "at least one" is used to
designate any (integer) number or range of (integer) numbers (that is technically reasonable in the given context). As such, the expression "at least one" may, inter alia, be understood as one, two, three, four, five, ten, fifteen, twenty or one hundred. Similarly, the expression "at least one" may, inter alia, be understood as "one or more," "two or more" or "five or more."
[0074] In the present disclosure, expressions in parentheses may be understood as being optional. As used in the present disclosure, quotation marks may emphasize that the expression in quotation marks may also be understood in a figurative sense. As used in the present disclosure, quotation marks may identify a particular expression under discussion.
[0075] In the present disclosure, many features are described as being optional, e.g. through the use of the verb "may" or the use of parentheses. For the sake of brevity and legibility, the present disclosure does not explicitly recite each and every permutation that may be obtained by choosing from the set of optional features. However, the present disclosure is to be interpreted as explicitly disclosing all such permutations. For example, a system described as having three optional features may be embodied in seven different ways, namely with just one of the three possible features, with any two of the three possible features or with all three of the three possible features.
[0076] The foregoing disclosure of the exemplary embodiments of the present subject disclosure has been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the subject disclosure to the precise forms disclosed. Many variations and modifications of the embodiments described herein will be apparent to one of ordinary skill in the art in light of the above disclosure. The scope of the subject disclosure is to be defined only by the claims appended hereto, and by their equivalents.
Further, in describing representative embodiments of the present subject disclosure, the specification may have presented the method and/or process of the present subject disclosure as a particular sequence of steps. However, to the extent that the method or process does not rely on the particular order of steps set forth herein, the method or process should not be limited to the particular sequence of steps described. As one of ordinary skill in the art would appreciate, other sequences of steps may be possible. Therefore, the particular order of the steps set forth in the specification should not be construed as limitations on the claims. In addition, the claims directed to the method and/or process of the present subject disclosure should not be limited to the
performance of their steps in the order written, and one skilled in the art can readily appreciate that the sequences may be varied and still remain within the spirit and scope of the present subject disclosure.

Claims

WHAT IS CLAIMED IS:
1. A method for spectral unmixing of an image obtained from a biological tissue sample being stained by multiple stains using a group lasso criterion comprising
- inputting (102) image data (y) obtained from the biological tissue sample,
- reading reference data from electronic memory (1 13, 910), the reference data being descriptive of the stain color of each one of the multiple stains,
- reading colocation data from electronic memory (920), the colocation data being descriptive of groups (g,) of the stains, each group comprising stains that can be collocated in the biological tissue sample, and each group forming a group for the group lasso criterion, at least one of the groups having a size of two or above (q, >1 ),
- calculating (1 12, 940) a solution of the group lasso criterion for obtaining the unmixed image using the reference data as a reference matrix (X).
2. A method for spectral unmixing of an image obtained from a biological tissue sample being stained by multiple stains using a lasso criterion, comprising
- inputting (102) image data (y) obtained from the biological tissue sample,
- reading reference data from electronic memory (1 13, 910), the reference data being descriptive of the stain color of each one of the multiple stains,
- calculating (1 12, 940) a solution of the lasso criterion for obtaining the unmixed image.
3. The method of claim 1 or 2, wherein the tuning parameter (λ) for the regularization term of the group Lasso or Lasso criterion is determined by cross-validation.
4. The method of claim 3, wherein the cross-validation is performed by dividing the image data into at least a first subset and a second subset, choosing a first value and a second value for the tuning parameter, calculating a first solution using the group Lasso or Lasso criterion for the first subset with the first value of the tuning parameter, calculating a first error of the unmixing result obtained from the first solution, calculating a second solution of the group Lasso or Lasso criterion for the second subset with the second value of the tuning parameter, calculating a second error of the unmixing result obtained from the second solution, selecting one of the first and second values of the tuning parameter that results in the lowest error.
5. The method of any one of the preceding claims, wherein the biological tissue sample is stained by a first number of stains, each stain having a different stain color, and further comprising acquiring the image data via a second number of color channels, the second number being below the first number.
6. The method of any one of the preceding claims, wherein the image data is acquired by bright field or fluorescence imaging.
7. A tissue analysis system for spectral unmixing of an image obtained from a biological tissue sample being configured to perform a method of any one of the preceding claims.
8. A tissue analysis system (100, 900), comprising: a color data storage module (1 13, 910) that stores, for each of a plurality of markers, color data indicative of a color (231 -236) of tissue marked by the respective marker; a co-location data storage module (920) that stores co-location data (226) defining a plurality of groups (gi to g4) of said markers, each group consisting of markers having an affinity to a respective common tissue feature (441 -444); a tissue image data storage module (930) that stores a plurality of pixels (220) representative of a tissue image, each pixel comprising color information; and a tissue image analysis module (1 12, 940) for unmixing said tissue image, wherein said tissue image analysis module is configured for reading said color data from said color data storage module, said co-location data from said co-location data storage module and said pixels from said tissue image data storage module and for calculating, for each of said pixels and for each of said groups, a linear combination (224) of the colors (231 -236) of the markers of the respective group that yields a minimum difference between said color information of the respective pixel (220) and said linear combination of colors, and wherein, for each of said pixels, said tissue image analysis module determines a group for which said minimum difference is smallest and outputs said tissue feature of said group as an analysis result.
9. The tissue analysis system of claim 8, wherein the step of calculating is performed using a group lasso or lasso criterion.
10. The tissue analysis system of claim 8 or 9, comprising: a tissue imaging module (950) that images a tissue sample to obtain said plurality of pixels representative of a tissue image.
1 1 . The tissue analysis system of claim 8, 9 or 10, comprising: a tissue staining module (960) that stains a tissue sample with each of said plurality of markers; and a tissue imaging module (950) that images said stained tissue sample to obtain said plurality of pixels representative of a tissue image.
12. The tissue analysis system of any one of claims 10 or 1 1 , wherein said tissue imaging module effects said imaging of said tissue sample during bright-field illumination of said tissue sample.
13. The tissue analysis system of any one of claims 10 to 12, wherein said tissue imaging module effects said imaging by means of a CCD camera selected from the group consisting of an RGB CCD camera and a CCD camera having at most five color channels.
14. The tissue analysis system of any one of the preceding claims 8 to 13, wherein each of said plurality of groups consists of at least one and no more than three markers.
15. The tissue analysis system of any one of the preceding claims 8 to 14, wherein, for each of said plurality of groups, the number of markers in the respective group defines an upper limit for a number of stains per pixel of said plurality of pixels.
The tissue analysis system of any one of the preceding claims 8 to 15, wherein each of said plurality of markers has an affinity to at least one tissue feature selected from the group consisting of a tumor cell cytokeratin, a regulatory T-cell nucleus, a universal nucleus, a B-cell membrane, a universal T-cell membrane, and a cytotoxic T-cell membrane.
17. A tissue analysis method, comprising: storing (1030), for each of a plurality of markers, color data indicative of a color of tissue marked by the respective marker; storing (1040) co-location data defining a plurality of groups of said markers, each group consisting of markers having an affinity to a respective common tissue feature; storing (1050) a plurality of pixels representative of a tissue image, each pixel comprising color information; unmixing (1060) said tissue image using said color data, said co-location data and said pixels by calculating, for each of said pixels and for each of said groups, a linear combination of the colors of the markers of the respective group that yields a minimum difference between said color information of the respective pixel and said linear combination of colors; determining (1070), for each of said pixels, a group for which said minimum difference is smallest; and outputting (1080) said tissue feature of said group as an analysis result.
18. The tissue analysis method of claim 17, comprising: imaging (1020) a tissue sample to obtain said plurality of pixels representative of a tissue image.
19. The tissue analysis method of claim 17, comprising: staining (1010) a tissue sample with each of said plurality of markers; and imaging (1020) said stained tissue sample to obtain said plurality of pixels representative of a tissue image.
20. The tissue analysis method of claim 18 or 19, wherein said imaging of said tissue sample is effected during bright-field illumination of said tissue sample.
21 . The tissue analysis method of any one of claims 18 to 20, wherein said imaging is effected by means of a CCD camera selected from the group consisting of an RGB CCD camera and a CCD camera having at most five color channels.
22. The tissue analysis method of any one of claims 17 to 21 , wherein each of said plurality of groups consists of at least one and no more than three markers.
23. The tissue analysis method of any one of claims 17 to 22, wherein, for each of said plurality of groups, the number of markers in the respective group defines an upper limit for a number of stains per pixel of said plurality of pixels.
24. The tissue analysis method of any one of claims 17 to 23, wherein each of said plurality of markers has an affinity to at least one tissue feature selected from the group consisting of a tumor cell cytokeratin, a regulatory T-cell nucleus, a universal nucleus, a B-cell membrane, a universal T-cell membrane, and a cytotoxic T-cell membrane.
25. A digital storage medium to store digitally encoded instructions executable by a processor to perform operations comprising:
generating a group sparsity model wherein a fraction of a stain
contribution from colocation markers is assigned within a single group and a fraction of a stain contribution from non-colocation markers is assigned within separate groups; and
solving the group sparsity model using an unmixing algorithm to yield a least squares solution within each group.
26. The digital storage medium of claim 25, wherein known co-location
information of the colocation and non-colocation markers within the image enables defining fixed upper bounds for the number of stains at one pixel of the image.
27. The digital storage medium of claim 25, obtaining reference colors from training images.
28. The digital storage medium of claim 25, wherein the group sparsity model is a group lasso model.
29. The digital storage medium of claim 28, wherein solving the group sparsity model within each group corresponds to an unmixing of the colocation markers.
30. The digital storage medium of claim 29, wherein yielding a sparse solution among each group corresponds to an unmixing of the non-colocation markers.
31. The digital storage medium of claim 25, further comprising reducing the group sparsity model to sparse unmixing without a colocation constraint.
32. The digital storage medium of claim 31 , further comprising setting only one member in each group, and generating sparse unmixing results for less or equal to three markers.
33. The digital storage medium of claim 25, further comprising converting the image into an optical density space.
34. The digital storage medium of claim 25, wherein the image is a brightfield multiplex IHC RGB image.
35. The digital storage medium of claim 34, wherein the converting comprises
Oc = - _Og(-^ ) using the following formula derived from Beer's law: 0,c where c is an index of the RGB color channels, l0 is an RGB value of white points and O is an optical density image obtained as a result.
36. The digital storage medium of claim 25, wherein the markers may
represent one or more of: a tumor cell cytokeratin, a regulatory T-cell nucleus, a universal nucleus, a B-cell membrane, a universal T-cell membrane, or a cytotoxic T-cell membrane, in any combination.
37. A system for unmixing an image, comprising:
a processor; and
a memory coupled to the processor, the memory to store digitally encoded instructions that, when executed by the processor, cause the processor to perform operations comprising:
generating a group sparsity framework using known co-location
information of a plurality of biomarkers within an image of a tissue section; wherein a fraction of each stain contribution is assigned to a different group based on the known co-location information; and solving the group sparsity model using an unmixing algorithm to yield a least squares solution for each group.
The system of claim 37, wherein the known co-location information enables defining a fixed upper bound for the number of stains at a singl pixel of the image.
The system of claim 38, wherein a sparse solution is obtained among the groups to ensure that only a small number of groups with a total number of stains being less than the upper bound are activated.
40. The system of claim 37, wherein the operations further comprise ensuring sparsity among the groups but non-sparsity within each group.
41 . The system of claim 37, wherein the unmixing algorithm utilizes a
regularization parameter that controls an amount of a group sparsity constraint.
42. The system of claim 37, wherein the unmixing algorithm is solved using an alternative direction method of multipliers (ADMM).
43. A digital storage medium to store digitally encoded instructions executable by a processor to perform operations comprising:
modeling an RGB image unmixing problem using a group sparsity
framework, in which fractions of stain contributions from a plurality of colocation markers are modeled within a same group and fractions of stain contributions from a plurality of non-colocation markers are modeled in different groups;
providing co-localization information of the plurality of colocation markers to the modeled group sparsity framework;
solving the modeled framework using a group lasso to yield a least
squares solution within each group, wherein the least squares solution corresponds to the unmixing of the colocation markers; and yielding a sparse solution among the groups that corresponds to the unmixing of the non-colocation markers.
The digital storage medium of claim 43, wherein a reduction of the model to sparse unmixing without a colocation constraint is enabled by setting only one member in each group, and generating sparse unmixing results for less than or equal to three markers or more than three markers.
PCT/EP2015/053745 2014-02-21 2015-02-23 Group sparsity model for image unmixing WO2015124772A1 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
EP19170902.1A EP3550514B1 (en) 2014-02-21 2015-02-23 Group sparsity model for image unmixing
EP15708459.1A EP3108448B1 (en) 2014-02-21 2015-02-23 Group sparsity model for image unmixing
US15/243,899 US10540762B2 (en) 2014-02-21 2016-08-22 Group sparsity model for image unmixing
US16/590,942 US11113809B2 (en) 2014-02-21 2019-10-02 Group sparsity model for image unmixing
US17/391,398 US11893731B2 (en) 2014-02-21 2021-08-02 Group sparsity model for image unmixing
US18/389,714 US20240135532A1 (en) 2014-02-21 2023-12-19 Group sparsity model for image unmixing

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201461943265P 2014-02-21 2014-02-21
US61/943,265 2014-02-21

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US15/243,899 Continuation US10540762B2 (en) 2014-02-21 2016-08-22 Group sparsity model for image unmixing

Publications (1)

Publication Number Publication Date
WO2015124772A1 true WO2015124772A1 (en) 2015-08-27

Family

ID=52630335

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2015/053745 WO2015124772A1 (en) 2014-02-21 2015-02-23 Group sparsity model for image unmixing

Country Status (3)

Country Link
US (4) US10540762B2 (en)
EP (2) EP3550514B1 (en)
WO (1) WO2015124772A1 (en)

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016120441A2 (en) 2015-01-30 2016-08-04 Ventana Medical Systems, Inc. Quality metrics for automatic evaluation of dual ish images
WO2016120440A1 (en) 2015-01-29 2016-08-04 Ventana Medical Systems, Inc. Dot detection, color classification of dots and counting of color classified dots
WO2016189065A1 (en) 2015-05-26 2016-12-01 Ventana Medical Systems, Inc. Method and system for assessing stain quality for in-situ hybridization and immunohistochemistry
CN107169137A (en) * 2017-06-09 2017-09-15 华东师范大学 A kind of semi-supervised hashing image searcher based on Group Lasso
WO2017174267A1 (en) * 2016-04-06 2017-10-12 F. Hoffmann-La Roche Ag Spatial index creation for ihc image analysis
WO2018115055A1 (en) 2016-12-22 2018-06-28 Ventana Medical Systems, Inc. Computer scoring based on primary stain and immunohistochemistry images
WO2019002281A1 (en) 2017-06-28 2019-01-03 Ventana Medical Systems, Inc. System level calibration
WO2019025533A1 (en) 2017-08-04 2019-02-07 Ventana Medical Systems, Inc. Automatic assay assessment and normalization for image processing
WO2019110567A1 (en) 2017-12-05 2019-06-13 Ventana Medical Systems, Inc. Method of computing tumor spatial and inter-marker heterogeneity
WO2019110583A1 (en) 2017-12-07 2019-06-13 Ventana Medical Systems, Inc. Deep-learning systems and methods for joint cell and region classification in biological images
WO2019110561A1 (en) 2017-12-06 2019-06-13 Ventana Medical Systems, Inc. Method of storing and retrieving digital pathology analysis results
WO2019121564A2 (en) 2017-12-24 2019-06-27 Ventana Medical Systems, Inc. Computational pathology approach for retrospective analysis of tissue-based companion diagnostic driven clinical trial studies
CN109949333A (en) * 2019-03-20 2019-06-28 北京暴雷科技有限公司 A kind of text seal isolation technics mixed based on color solution
WO2019197509A1 (en) 2018-04-13 2019-10-17 Ventana Medical Systems, Inc. Systems for cell shape estimation
WO2019219651A1 (en) 2018-05-15 2019-11-21 Ventana Medical Systems, Inc. Quantitation of signal in stain aggregates
US10540762B2 (en) 2014-02-21 2020-01-21 Ventana Medical Systems, Inc. Group sparsity model for image unmixing
WO2020081343A1 (en) 2018-10-15 2020-04-23 Ventana Medical Systems, Inc. Systems and methods for cell classification
EP3882851A1 (en) 2014-12-30 2021-09-22 Ventana Medical Systems, Inc. Method for co-expression analysis
CN116188423A (en) * 2023-02-22 2023-05-30 哈尔滨工业大学 Super-pixel sparse and unmixed detection method based on pathological section hyperspectral image

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2017099616A (en) * 2015-12-01 2017-06-08 ソニー株式会社 Surgical control device, surgical control method and program, and surgical system
US10572996B2 (en) * 2016-06-28 2020-02-25 Contextvision Ab Method and system for detecting pathological anomalies in a digital pathology image and method for annotating a tissue slide
CN109241843B (en) * 2018-08-02 2022-02-18 南京理工大学 Space-spectrum combined multi-constraint optimization non-negative matrix unmixing method
EP3608701A1 (en) * 2018-08-09 2020-02-12 Olympus Soft Imaging Solutions GmbH Method for providing at least one evaluation method for samples
US20230186470A1 (en) * 2021-12-15 2023-06-15 Ventana Medical Systems, Inc. Synthesis singleplex from multiplex brightfield imaging using generative adversarial network
DE102022130251A1 (en) 2022-11-16 2024-05-16 Carl Zeiss Microscopy Gmbh Microscope arrangement and method for examining a sample stained with multiple dyes

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014195193A1 (en) * 2013-06-03 2014-12-11 Ventana Medical Systems, Inc. Image adaptive physiologically plausible color separation

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102005022880B4 (en) * 2005-05-18 2010-12-30 Olympus Soft Imaging Solutions Gmbh Separation of spectrally or color superimposed image contributions in a multi-color image, especially in transmission microscopic multi-color images
US8199999B2 (en) * 2008-06-17 2012-06-12 Cambridge Research & Instrumentation, Inc. Image classifier training
PT2332087T (en) * 2008-07-25 2020-06-05 Fund D Anna Sommer Champalimaud E Dr Carlos Montez Champalimaud Systems and methods of treating, diagnosing and predicting the occurrence of a medical condition
US9779213B2 (en) * 2008-07-25 2017-10-03 Fundacao D. Anna Sommer Champalimaud E Dr. Carlos Montez Champalimaud System for evaluating a pathological stage of prostate cancer
US20130080134A1 (en) * 2008-07-25 2013-03-28 Fundação D. Anna Sommer Champalimaud e Dr. Carlos Montez Champalimaud Systems and methods for predicting favorable-risk disease for patients enrolled in active surveillance
CN102308212A (en) * 2008-12-04 2012-01-04 加利福尼亚大学董事会 Materials and methods for determining diagnosis and prognosis of prostate cancer
JP2010243970A (en) * 2009-04-10 2010-10-28 Nikon Corp Confocal microscope, image processing apparatus and program
JP2011022131A (en) * 2009-06-18 2011-02-03 Olympus Corp Medical diagnosis support device, image processing method, image processing program, and virtual microscope system
US20120010528A1 (en) * 2010-04-26 2012-01-12 Aureon Biosciences, Inc. Systems and methods for predicting disease progression in patients treated with radiotherapy
US8946389B2 (en) * 2011-04-25 2015-02-03 University Of Washington Compositions and methods for multiplex biomarker profiling
WO2013116735A1 (en) * 2012-02-01 2013-08-08 20/20 Gene Systems, Inc. Methods for predicting tumor response to targeted therapies
EP2943761B1 (en) * 2013-01-10 2023-11-29 Akoya Biosciences, Inc. Multispectral imaging system and methods
US9488639B2 (en) * 2013-02-25 2016-11-08 Flagship Biosciences, Inc. Cell-based tissue analysis
US9786050B2 (en) * 2013-03-15 2017-10-10 The Board Of Trustees Of The University Of Illinois Stain-free histopathology by chemical imaging
EP3053138B1 (en) * 2013-09-30 2018-08-22 Ventana Medical Systems, Inc. Systems and methods for adaptive histopathology image unmixing
EP3550514B1 (en) 2014-02-21 2020-11-25 Ventana Medical Systems, Inc. Group sparsity model for image unmixing
WO2019121564A2 (en) * 2017-12-24 2019-06-27 Ventana Medical Systems, Inc. Computational pathology approach for retrospective analysis of tissue-based companion diagnostic driven clinical trial studies

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014195193A1 (en) * 2013-06-03 2014-12-11 Ventana Medical Systems, Inc. Image adaptive physiologically plausible color separation

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
MARIAN-DANIEL IORDACHE ET AL: "Hyperspectral unmixing with sparse group lasso", GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2011 IEEE INTERNATIONAL, IEEE, 24 July 2011 (2011-07-24), pages 3586 - 3589, XP032061729, ISBN: 978-1-4577-1003-2, DOI: 10.1109/IGARSS.2011.6049999 *
NOAH SIMON ET AL: "A Sparse-Group Lasso", JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, vol. 22, no. 2, 1 April 2013 (2013-04-01), pages 231 - 245, XP055181162, ISSN: 1061-8600, DOI: 10.1080/10618600.2012.681250 *
SAMAROV DANIEL V ET AL: "Validating the LASSO algorithm by unmixing spectral signatures in multicolor phantoms", OPTICAL DIAGNOSTICS AND SENSING XII: TOWARD POINT-OF-CARE DIAGNOSTICS; AND DESIGN AND PERFORMANCE VALIDATION OF PHANTOMS USED IN CONJUNCTION WITH OPTICAL MEASUREMENT OF TISSUE IV, SPIE, 1000 20TH ST. BELLINGHAM WA 98225-6705 USA, vol. 8229, no. 1, 9 February 2012 (2012-02-09), pages 1 - 9, XP060002122, DOI: 10.1117/12.908133 *
TING CHEN ET AL: "Stain Unmixing in Brightfield Multiplex Immunohistochemistry Images", PROC. STMI 2014, 14 September 2014 (2014-09-14), pages 1 - 9, XP055181741, Retrieved from the Internet <URL:http://stmi2014.ece.cornell.edu/papers/STMI-O-2.pdf> [retrieved on 20150409] *

Cited By (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10540762B2 (en) 2014-02-21 2020-01-21 Ventana Medical Systems, Inc. Group sparsity model for image unmixing
US11893731B2 (en) 2014-02-21 2024-02-06 Ventana Medical Systems, Inc. Group sparsity model for image unmixing
US11113809B2 (en) 2014-02-21 2021-09-07 Ventana Medical Systems, Inc. Group sparsity model for image unmixing
EP4187486A1 (en) 2014-12-30 2023-05-31 Ventana Medical Systems, Inc. Method for co-expression analysis
EP3882851A1 (en) 2014-12-30 2021-09-22 Ventana Medical Systems, Inc. Method for co-expression analysis
US10235559B2 (en) 2015-01-29 2019-03-19 Ventana Medical Systems, Inc. Dot detection, color classification of dots and counting of color classified dots
WO2016120440A1 (en) 2015-01-29 2016-08-04 Ventana Medical Systems, Inc. Dot detection, color classification of dots and counting of color classified dots
WO2016120441A2 (en) 2015-01-30 2016-08-04 Ventana Medical Systems, Inc. Quality metrics for automatic evaluation of dual ish images
WO2016189065A1 (en) 2015-05-26 2016-12-01 Ventana Medical Systems, Inc. Method and system for assessing stain quality for in-situ hybridization and immunohistochemistry
CN109074645B (en) * 2016-04-06 2021-12-14 豪夫迈·罗氏有限公司 Spatial index creation for IHC image analysis
WO2017174267A1 (en) * 2016-04-06 2017-10-12 F. Hoffmann-La Roche Ag Spatial index creation for ihc image analysis
US10825195B2 (en) 2016-04-06 2020-11-03 Hoffman-La Roche, Inc. Spatial index creation for IHC image analysis
CN109074645A (en) * 2016-04-06 2018-12-21 豪夫迈·罗氏有限公司 Spatial index for IHC image analysis creates
WO2018115055A1 (en) 2016-12-22 2018-06-28 Ventana Medical Systems, Inc. Computer scoring based on primary stain and immunohistochemistry images
US11657503B2 (en) 2016-12-22 2023-05-23 Ventana Medical Systems, Inc. Computer scoring based on primary stain and immunohistochemistry images related application data
CN107169137A (en) * 2017-06-09 2017-09-15 华东师范大学 A kind of semi-supervised hashing image searcher based on Group Lasso
CN107169137B (en) * 2017-06-09 2019-10-08 华东师范大学 A kind of semi-supervised hashing image searcher based on Group Lasso
WO2019002281A1 (en) 2017-06-28 2019-01-03 Ventana Medical Systems, Inc. System level calibration
EP4300424A2 (en) 2017-08-04 2024-01-03 Ventana Medical Systems, Inc. Automatic assay assessment and normalization for image processing
US11715557B2 (en) 2017-08-04 2023-08-01 Ventana Medical Systems, Inc. Automatic assay assessment and normalization for image processing
US11990228B2 (en) 2017-08-04 2024-05-21 Ventana Medical Systems, Inc. Automatic assay assessment and normalization for image processing
WO2019025533A1 (en) 2017-08-04 2019-02-07 Ventana Medical Systems, Inc. Automatic assay assessment and normalization for image processing
US11526984B2 (en) 2017-12-05 2022-12-13 Ventana Medical Systems, Inc. Method of computing tumor spatial and inter-marker heterogeneity
WO2019110567A1 (en) 2017-12-05 2019-06-13 Ventana Medical Systems, Inc. Method of computing tumor spatial and inter-marker heterogeneity
WO2019110561A1 (en) 2017-12-06 2019-06-13 Ventana Medical Systems, Inc. Method of storing and retrieving digital pathology analysis results
US11682192B2 (en) 2017-12-07 2023-06-20 Ventana Medical Systems, Inc. Deep-learning systems and methods for joint cell and region classification in biological images
WO2019110583A1 (en) 2017-12-07 2019-06-13 Ventana Medical Systems, Inc. Deep-learning systems and methods for joint cell and region classification in biological images
US11972859B2 (en) 2017-12-24 2024-04-30 Ventana Medical Systems, Inc. Computational pathology approach for retrospective analysis of tissue-based companion diagnostic driven clinical trial studies
US11721427B2 (en) 2017-12-24 2023-08-08 Ventana Medical Systems, Inc. Computational pathology approach for retrospective analysis of tissue-based companion diagnostic driven clinical trial studies
WO2019121564A2 (en) 2017-12-24 2019-06-27 Ventana Medical Systems, Inc. Computational pathology approach for retrospective analysis of tissue-based companion diagnostic driven clinical trial studies
US11842483B2 (en) 2018-04-13 2023-12-12 Ventana Medical Systems, Inc. Systems for cell shape estimation
WO2019197509A1 (en) 2018-04-13 2019-10-17 Ventana Medical Systems, Inc. Systems for cell shape estimation
US11615532B2 (en) 2018-05-15 2023-03-28 Ventana Medical Systems, Inc. Quantitation of signal in stain aggregates
WO2019219651A1 (en) 2018-05-15 2019-11-21 Ventana Medical Systems, Inc. Quantitation of signal in stain aggregates
WO2020081343A1 (en) 2018-10-15 2020-04-23 Ventana Medical Systems, Inc. Systems and methods for cell classification
CN109949333A (en) * 2019-03-20 2019-06-28 北京暴雷科技有限公司 A kind of text seal isolation technics mixed based on color solution
CN109949333B (en) * 2019-03-20 2021-06-08 北京小蜻蜓智能科技有限公司 Character and seal separation method based on color unmixing
CN116188423B (en) * 2023-02-22 2023-08-08 哈尔滨工业大学 Super-pixel sparse and unmixed detection method based on pathological section hyperspectral image
CN116188423A (en) * 2023-02-22 2023-05-30 哈尔滨工业大学 Super-pixel sparse and unmixed detection method based on pathological section hyperspectral image

Also Published As

Publication number Publication date
US20160358335A1 (en) 2016-12-08
EP3108448A1 (en) 2016-12-28
US11113809B2 (en) 2021-09-07
US20240135532A1 (en) 2024-04-25
US20210366109A1 (en) 2021-11-25
EP3108448B1 (en) 2019-05-15
EP3550514A1 (en) 2019-10-09
EP3550514B1 (en) 2020-11-25
US11893731B2 (en) 2024-02-06
US10540762B2 (en) 2020-01-21
US20200034966A1 (en) 2020-01-30

Similar Documents

Publication Publication Date Title
US11893731B2 (en) Group sparsity model for image unmixing
US11205266B2 (en) Systems and methods for detection of structures and/or patterns in images
CN111448582B (en) System and method for single channel whole cell segmentation
AU2015265811B2 (en) An image processing method and system for analyzing a multi-channel image obtained from a biological tissue sample being stained by multiple stains
EP3055835B1 (en) Systems and methods for comprehensive multi-assay tissue analysis
US9996924B2 (en) Systems and methods for spectral unmixing of microscopic images using pixel grouping
CN108140249A (en) For showing the image processing system of the multiple images of biological sample and method
US10430945B2 (en) Systems and methods for color deconvolution
Hoque et al. Retinex model based stain normalization technique for whole slide image analysis
CN111656393A (en) Histological image analysis
Chen et al. Group sparsity model for stain unmixing in brightfield multiplex immunohistochemistry images
Chen et al. Stain unmixing in brightfield multiplex immunohistochemistry images

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15708459

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

REEP Request for entry into the european phase

Ref document number: 2015708459

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2015708459

Country of ref document: EP