WO2001020044A2 - Signal counting for in situ hybridization - Google Patents

Signal counting for in situ hybridization Download PDF

Info

Publication number
WO2001020044A2
WO2001020044A2 PCT/US2000/025465 US0025465W WO0120044A2 WO 2001020044 A2 WO2001020044 A2 WO 2001020044A2 US 0025465 W US0025465 W US 0025465W WO 0120044 A2 WO0120044 A2 WO 0120044A2
Authority
WO
WIPO (PCT)
Prior art keywords
signals
spot
signal
probe
images
Prior art date
Application number
PCT/US2000/025465
Other languages
French (fr)
Other versions
WO2001020044A3 (en
Inventor
Olli P. Kallioniemi
Juha Kononen
Lukas Budendorf
Edward R. Dougherty
Artyom M. Grigoryan
Original Assignee
The Government Of The United States Of America, As Represented By The Secretary, Department Of Health & Human Services, The National Institutes Of Health
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by The Government Of The United States Of America, As Represented By The Secretary, Department Of Health & Human Services, The National Institutes Of Health filed Critical The Government Of The United States Of America, As Represented By The Secretary, Department Of Health & Human Services, The National Institutes Of Health
Priority to AU74949/00A priority Critical patent/AU7494900A/en
Publication of WO2001020044A2 publication Critical patent/WO2001020044A2/en
Publication of WO2001020044A3 publication Critical patent/WO2001020044A3/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6809Methods for determination or identification of nucleic acids involving differential detection
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6816Hybridisation assays characterised by the detection means
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6841In situ hybridisation
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N15/00Investigating characteristics of particles; Investigating permeability, pore-volume or surface-area of porous materials
    • G01N15/10Investigating individual particles
    • G01N15/14Optical investigation techniques, e.g. flow cytometry
    • G01N15/1429Signal processing
    • G01N15/1433Signal processing using image recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/69Microscopic objects, e.g. biological cells or cellular parts
    • G06V20/695Preprocessing, e.g. image segmentation

Definitions

  • This invention relates to methods of counting probe signals in biological specimens, such as probe signals produced by in situ hybridization in cells or tissue sections.
  • Such diagnostic and prognostic cellular changes include the presence of tumor specific cell surface antigens (as in melanoma), and genetic abnormalities (such as activated oncogenes in tumors).
  • a variety of techniques have evolved to detect the presence of these cellular abnormalities, including immunophenotyping with monoclonal antibodies, in situ hybridization with probes, and DNA amplification using the polymerase chain reaction (PCR).
  • ISH in situ hybridization
  • labeled hybridizing agents such as DNA, RNA, or single stranded or double stranded DNA probes
  • the probes can be labeled by direct or indirect means.
  • direct labeling the label (a chromophore) is integral to the probe.
  • Direct labels include fluorescent dyes such as derivatives of rhodamine, fiuorescein, and Texas Red, or enzymatic reporters such as horseradish peroxidase or alkaline phosphatase.
  • Indirect labeling involves attaching a hapten (such as biotin, mercury or dioxygenin) to a probe through a linker. After hybridization, the hapten is detected using a labeled antibody or other specific binding protein.
  • FISH fluorescent in situ hybridization
  • Dyes such as fiuorescein isothiocyanate (FITC) are often used to label a sequence of probe DNA in FISH.
  • the probe hybridizes to a defined target nucleotide sequence of DNA in the cell, and the FITC fluoresces green when excited by a mercury arc lamp or Argon laser (in the case of laser scanning microscopy), so that the labeled probe can be visually detected when the probed tissue is viewed through a microscope.
  • Each chromosome containing the target DNA sequence will produce a fluorescent signal (sometimes called a spot) in every cell when the specimen is illuminated with suitable excitation.
  • specimens hybridized with a probe hybridizing to a specific region on chromosome 21 will produce two fluorescent signals in cells from normal individuals and three signals from Down's Syndrome (Trisomy 21) subjects who have an extra chromosome number 21.
  • a fluorescent probe that hybridizes to an X chromosome will give one fluorescent signal per cell in males (who have only one X chromosome) and two fluorescent signals in females (who have two X chromosomes).
  • FISH is an excellent method for detection of gene copy number alterations in cancer and other diseases.
  • FISH is a standard tool for analyzing congenital genetic alterations in clinical diagnostics (such as numerical chromosomal alterations, duplications, inversions and microdeletions).
  • characteristic gene amplifications or deletions are associated with the development and progression of the tumor.
  • FISH analysis of gene amplifications in cancer can have prognostic and therapeutic significance, such as the detection of the HER-2 oncogene amplification in breast cancer. Detection of this oncogene by FISH was recently approved by the Food and Drug Administration as a diagnostic tool for human breast cancer.
  • a significant impediment to the accurate counting of fluorescent signals is that the probes hybridize throughout a three-dimensional nucleus, and the probe signals have to be counted from different focal planes for each nucleus.
  • cells and probe signals can overlap in the two-dimensional view, and overlapping signals are seen as a single signal. Such overlapping results in undercounting of signals, which can make it appear that an amplified gene is less amplified, or that fewer copies of a normal gene are present.
  • U.S. Patent No. 5,523,207 discloses a two dye FISH method in which probes are labeled with a first dye (such as FITC) and a contour of the nucleus is labeled with a second dye (such as propidium iodide or PI). These two dyes allow the number of signals per visualized nucleus to be determined.
  • a first dye such as FITC
  • a second dye such as propidium iodide or PI
  • fluorescently tagged nucleic acid probe signals are counted in a region of interest in a biological specimen by determining a ratio of signals from a test probe to signals of a reference probe, and the region of interest includes multiple cells.
  • the reference probe may be a fluorescently labeled polynucleotide (such as DNA or RNA) that hybridizes to the region of interest in a gene, and the reference probe is a polynucleotide labeled with a different fluorescent color, and which hybridizes to a reference target.
  • the test probe may hybridize to a gene that is implicated or suspected to be involved in a particular disease, such as tumor development and progression.
  • the reference probe may, for example, recognize a centromere of the same chromosome on which the gene of interest is contained. An increased ratio of test probe signals to reference probe signals would then indicate an amplification in gene copy number, while a decrease in that ratio would indicate a relative loss of the gene of interest (such as a gene deletion).
  • determining a ratio between the quantities of test and reference signals the problem of measuring changes in gene copy number with respect to a cell or nucleus is avoided. Typically, a count of centromeres approximates a nucleus or cell count.
  • Particularly accurate counting of FISH signals can also be accomplished by obtaining successive contiguous images of the region of interest to distinguish overlapping signals from the biological specimen. Without distinguishing the overlapping signals, they would otherwise obscure one another, and diminish accuracy of the spot count.
  • the successive images are slices, such as digital microscopic optical sections from different depths of the biological specimen, obtained by confocal microscopy.
  • the successive images are transformed by detecting and representing the positional values in each image of fluorescent emission signal segments, which make up the probe signal, as an array of digital values. Signal segments below a certain value (e.g. a threshold number of pixels) may be eliminated.
  • the remaining digital signal segments may then be analyzed to combine contiguous fluorescent signal segments in successive optical sections into a single spot signal, which may be assigned to a particular optical section in which a strongest fluorescent signal segment is located, or a group of optical sections across which the contiguous signal segments have been detected.
  • This localization of the fluorescent signal to a particular optical section allows overlapping spots to be distinguished, both in the axial and transverse dimensions of the three-dimensional representation.
  • the system detects the location of fluorescent spot signals in three- dimensional space by performing a morphological top-hat transform to digital images of the different levels to obtain fluorescent intensity spikes that indicate a spot signal segment.
  • a threshold level of fluorescence intensity is determined to eliminate signal segments that are below a fluorescence intensity that would be associated with a valid spot signal. Remaining contiguous spot signal segments are segmented into a single spot signal, and the single spot signal may be assigned a location at a level associated with a greatest fluorescent intensity signal segment.
  • Certain disclosed embodiments also include devices for counting signals from in situ hybridization of probes in biological tissue, or determining a three- dimensional relationship between the signals, in which the device includes a confocal microscope, a digital camera positioned to obtain digital optical section images of different levels of the biological tissue, and a computer implemented system that detects and combines contiguous or adjacent signal segments (which are above a threshold) at the different levels, and separates vertically overlying, transversely overlapping, or non-contiguous signals from one another. After separating such signals from different levels into different spot signals, the computer implemented system then counts the spot signals, or compares their relative locations in three-dimensional space.
  • the device includes a confocal microscope, a digital camera positioned to obtain digital optical section images of different levels of the biological tissue, and a computer implemented system that detects and combines contiguous or adjacent signal segments (which are above a threshold) at the different levels, and separates vertically overlying, transversely overlapping, or non-contiguous signals from one another. After
  • Two or more different signal types may be used, one color for the test probe signal and a second color for the reference probe signal.
  • the ratio of the number of test probe signals to reference probe signals can then be determined, or an unexpected overlap of the signals (as in a genetic translocation) can be assessed.
  • the test probe signals and the reference probe signals may be obtained separately, for example by successively illuminating the tissue specimen with light of different colors that selectively causes the different dyes to fluoresce, by viewing the specimen through filters that filter out signals (such as a filter that removes colors except the color of interest), or exposing separate contiguous tissue sections to the different probes.
  • the test and reference probe signals can be obtained simultaneously, using multiple band-pass excitation and/or emission filters.
  • the FISH spots are accurately counted by obtaining the plurality of digital optical images at different levels of a biological sample, and constructing a three-dimensional image showing discrete fluorescent signal segments at different levels of the three-dimensional image.
  • the three-dimensional image is constructed by determining a location of a fluorescent signal segment of a particular color in the different levels of the biological sample, combining contiguous signal segments (which are above a preselected threshold) into a single spot signal, and separating non-contiguous signal segments into different spots signals.
  • the location of signal segments in each level is determined by the presence of a fluorescent brightness intensity spike that indicates an increase in image component intensity as compared to a background intensity.
  • the locations of signals of different colors can be similarly resolved in three-dimensional space, the number of spot signals of each color counted, and a ratio of the spot signals determined.
  • This three-dimensional imaging would also allow one to study the orientation or location of the spots in the nuclei to make conclusions about the presence of genetic rearrangements that do not change copy number of the signals, such as translocations and inversions.
  • Certain features can be implemented to increase the accuracy of the spot counting. For example, under certain circumstances, a group of spots may be packed so closely together as to form a cluster that is difficult to analyze using a standard algorithm. Clusters can be automatically identified and counted using a cluster calibration feature, or a user can specify that a particular region of an image is a cluster.
  • Another feature for increasing accuracy filters out false spots by, for instance, eliminating spots appearing at a same location in the test and reference probe images or filtering out image data indicating autofluorescence.
  • a user interface can be presented to allow a user to either select areas of particular interest for separate processing or specify areas not to be processed. In this way, the invention can benefit from information provided by a human operator.
  • the ratio of signals can be a ratio of spot signals from a test probe that recognizes a gene of interest, and a reference probe that recognizes a chromosomal locus having an expected quantity in the biological specimen.
  • the method can further include determining whether there is an increase in an expected ratio between the test signals and the reference signals, indicating an amplification of the gene of interest, or whether there is a decrease in the expected ratio between the test signals and the reference signals, indicating relative loss of the gene of interest.
  • This method is particularly applicable to high throughput techniques for performing automated FISH analysis of a large number of tissue, cell, or other specimens.
  • the specimens can, for example, be in a tissue or cell microarray that includes specimens from the same or different sources, such as tumors.
  • the method includes providing an array of biological samples, hybridizing the biological samples with a fluorescent test probe that hybridizes to a gene of interest in the biological samples and with a fluorescent reference probe that hybridizes to a chromosomal reference locus in the biological samples. Images are then obtained by confocal microscopy of contiguous sections at different depths of a plurality of the biological samples in the array, and fluorescent signal segments from the contiguous sections are detected. The contiguous signal segments in different sections are combined into a corresponding single spot signal, and the separate spot signals are resolved from one another. The distinct spot signals can then be counted.
  • the tissue array can include an array of many different tissue specimens, for example at least 50 tissue specimens, but hundreds or even thousands of tissue specimens can be included in the microarray.
  • the tissue arrays can be constructed by obtaining a plurality of donor specimens, placing each donor specimen in an assigned location in a recipient array, and obtaining a plurality of copies of the recipient array in a manner that each copy contains a plurality of donor specimens that maintain their assigned locations.
  • a set of tissue samples relating to a same type of tissue from a plurality of donor specimens can be included.
  • tissue arrays made by any method are suitable for use with the method of counting FISH spot signals.
  • FIGS 1A and IB are schematic side and top views illustrating the problem of viewing three-dimensional FISH images in two dimensions.
  • FIG. 2A is a schematic view illustrating the prior approach of counting two color FISH spot signals with respect to cell nuclei (the contours of the nuclei being shown around the spot signals).
  • FIG. 2B is a view similar to FIG. 2A, but shows counting two color FISH spot signals and determining a ratio of the different colored signals in a region of interest (such as a field of view FOV) instead of with respect to cell nuclei.
  • FIG. 2C is a photomicrograph of prostate tissue, illustrating a region of interest (ROI) within the tissue for purposes of calculating a ratio of test to reference probe signals.
  • FIG. 3 is a schematic view illustrating two color FISH, in which signal locations have been determined in three-dimensional space. .
  • FIGS. 4A and 4B are flow diagrams illustrating one embodiment of an automated system for counting FISH spot signals in accordance with the present invention.
  • FIG. 5 A is a flow diagram illustrating an algorithm for counting the FISH signals.
  • FIG. 5B is a series of optical sections 1-5, and a max image M, of prostate tissue subjected to two color FISH with a probe for the androgen receptor (labeled red) and a probe for the normal X chromosome (labeled green), in which red signals appear weaker than green signals.
  • FIG. 6A is a composite figure showing a print-out of optical sections of a series of eight confocal images (cuts 1-8), and a view of the image that would be seen (max-image) when the series of eight overlapping images would be viewed from above as a two-dimensional image.
  • FIG. 6B is a series of optical sections 1-5, and a max image M, of breast cancer tissue subjected to two color FISH with a ribosomal S6K probe (labeled red) and a probe for the chromosome 17 centromere (labeled green), in which red signals appear weaker than green signals.
  • red ribosomal S6K probe
  • green probe for the chromosome 17 centromere
  • FIG. 7 is a MATLAB graphical user interface (GUI) for displaying optical sections of a three-dimensional tissue specimen that has been subjected to FISH.
  • GUI MATLAB graphical user interface
  • FIG. 8 is a histogram of the number of occurrences of the various gray levels, or fluorescence intensities, in the max image, which is used to threshold valid pixels and eliminate non-informative image components.
  • FIG. 9A is a print-out of a MATLAB GUI illustrating processing spot signal segments from different levels, and combining contiguous signal segments above a threshold into a single spot signal (which can be assigned to a particular level or levels) for purposes of counting a spot signal.
  • FIG. 9B is a schematic view of a vertical side view of optical sections, illustrating how image components below a certain threshold are eliminated to separate otherwise vertically contiguous signal segments, and separate overlying signals from one another.
  • FIG. 9C is a view similar to FIG. 9B, showing how transversely and vertically overlapping signal segments are separated by eliminating signal segments below a threshold, and grouping the remaining signal segments into discrete spot signals.
  • FIG. 10 is a three-dimensional representation of signal segments in different optical sections, resolved into a spot in an assigned optical section, and a combined image of the signals projected as spheres into a max image, in which a volume of the spheres is proportional to an intensity of the signal.
  • FIG. 11 is a view of a tissue section mounted on a slide, for processing by
  • FIG. 12 is a view of a tissue microarray mounted on a slide, for processing by FISH.
  • FIG. 13 is a flowchart showing an overview of an algorithm used to count FISH spots in a set of digital image slices.
  • FIG. 14 is a data flow diagram of an algorithm used to count FISH spots in a set of digital image slices.
  • FIG. 15A is an image showing FISH spots relating to the X centromere in normal prostate tissue, including false spots caused by autofluorescence.
  • FIG. 15B is a graph showing intensities of spot candidates for purposes of removing spots exceeding a threshold intensity.
  • FIG. 15C is a graph showing intensities combined with area for spot candidates for purposes of removing spots exceeding a threshold.
  • FIG. 15D is the image of 15A modified to emphasize spot candidates identified as autofluorescent tissue particles.
  • FIG. 16A is a scatter plot of spot candidates for an image relating to a FISH experiment conducted for centromere 17.
  • FIG. 16B is a scatter plot of spot candidates for an image relating to a FISH experiment conducted for the gene HER-2 and identifies spot clusters on the plot.
  • FIG. 17 is an illustration of a user interface for selecting areas of interest to be considered when calculating a spot count for an image.
  • FIG. 18 is an illustration of a user interface depicting a three-dimensional representation of a spot candidate.
  • the present invention includes a method and apparatus for counting FISH spots, and particularly an automated, computer implemented method and device that helps avoid counting errors introduced by vertically or horizontally overlapping spots in two-dimensional projections of three-dimensional tissue sections.
  • FIG. 1 A shows a vertical section through a stack of contiguous horizontal confocal layers of a tissue section that has been subjected to FISH.
  • Hybridization probe signals are shown as spots in this figure, in which it can be seen in a vertical section that the spots are discrete and spaced along the z-axis.
  • FIG. 1 A shows a vertical section through a stack of contiguous horizontal confocal layers of a tissue section that has been subjected to FISH.
  • Hybridization probe signals are shown as spots in this figure, in which it can be seen in a vertical section that the spots are discrete and spaced along the z-axis.
  • FIG. 1 A shows a vertical section through a stack of contiguous horizontal confocal layers of a tissue section that has been
  • FIG. IB illustrates a similar problem in which even more probe spots appear throughout the z-depth of the tissue section, but appear as an indistinguishable blur when viewed in an x-y plane from above, as shown at the bottom of FIG. IB.
  • FIG. 2A shows the conventional technique of counting hybridization signals within each cell nucleus.
  • the red label (R) may, for example, be attached to a probe that hybridizes to a gene of interest (such as a hormone receptor gene that may be amplified in certain tumors).
  • the green label (G) in contrast, may be attached to a probe that hybridizes to a known chromosomal locus that is not expected to vary in disease states (such as the centromere of a chromosome on which the gene of interest is found).
  • the red label (R) is illustrated by a gray color in the schematic figure, while the green label (G) is represented by a darker color.
  • the gene of interest could be recognized by a probe for a gene on the X chromosome labeled with spectrum orange (to provide an orange-red spot signal), and the reference probe could be labeled with spectrum green (to provide a green spot signal) for the centromere of the X-chromosome.
  • a single green signal (G) would therefore be observed in the nuclei of the schematic representation of FIG. 2 A from male cells (which have only one X chromosome), while two green signals (G) would be seen in female cells.
  • Amplification of the gene of interest would be noted in certain cells in the schematic representation of FIG. 2A in which there is an increase in the ratio of red signals (R) to green signals (G).
  • the present invention adopts a different approach which has been found to be more accurate, and has the additional advantage of being more accurately automated.
  • This approach is shown in FIG. 2B, in which the ratios of probes are determined without reference to the cells (or the nucleus) in which the probes are contained.
  • the nuclear contours highlighted in FIG. 2A are absent in FIG. 2B to illustrate this difference.
  • FIG. 2B shows the FISH spots of FIG. 2A in a region of interest (such as the microscope field of view [FOV] shown in FIG. 2 A), but without reference to the nuclear contours illustrated in FIG. 2A.
  • the ratio of test probes (R) to reference probes (G) in the region of interest is the ratio that is calculated. It has been found that this ratio provides sufficient information (over a sufficiently large number of cells in a region of interest) to be informative about the relative amplification or deletion of a gene of interest.
  • a region of interest is any arbitrary informative region across which an informative ratio can be determined.
  • a region of interest is a microscopic field of view at low magnifications (e.g. 100-200X).
  • An entire microscope field of view can then be used for the image capturing and analysis at 400-1000X magnification (X40-100 objectives) for FISH analysis.
  • the thickness of the tissue sections used for FISH analysis is the same as in sections routinely used for histopathological analyses, ranging from 4-10 ⁇ in thickness.
  • Another example of a region of interest is an area which is selected for a specific analysis, as shown in FIG. 2C, which is a cross-sectional photomicrograph of tissue from a human prostate, as it would appear in a tissue microarray.
  • region of interest is the area of malignant cells that is outlined in black.
  • a region of interest in this example is any homogenous site of a specimen, where most, if not all, specimens carry a particular alteration. However, contamination with non-altered normal tissue in the region of interest can be tolerated, if the copy number alteration in the abnormal cells is substantial.
  • a view of the probe signals in three-dimensional space is constructed, as shown in FIG. 3.
  • This view shows the three-dimensional relationship of the test and reference probe signals, which are interspersed among one another in all three dimensions of the tissue section which has undergone FISH.
  • This three-dimensional view of the section can be further processed by a computer implemented system, as described below, to automatically count signals of each color, and to obtain a ratio of the different colored signals.
  • This approach is particularly useful for high-throughput analysis, for example, of tissue microarrays such as those disclosed in PCT publications WO US99/04000 and US99/04001.
  • the three-dimensional imaging can be used to detect many kinds of genetic rearrangements in cells other than deletions or amplifications.
  • differently colored probes for the bcr and abl genes could be used to detect a fusion of these genes which occurs in a genetic translocation associated with chronic myelogenous leukemia (CML).
  • CML chronic myelogenous leukemia
  • Such differentially labeled probes will flank translocation breakpoints, and produce a fusion after the translocation has occurred, but will be at separate loci if the translocation has not occurred. However, unless proximity or distance between the differentially labeled spots can be determined in three-dimensions, a measurement of the fusion will be inaccurate.
  • the method of determining a three-dimensional relationship of probe spot signals within a region of interest will permit a large scale quantitative analysis of three-dimensional distances between any two probe signals. For example, the three-dimensional coordinates of a red signal are determined, and the three-dimensional coordinates of a green signal are determined, and the coordinates are then analyzed to determine if they are overlapping (suggesting a translocation that has moved them into contiguous genetic loci) or separate (on different genetic loci than they would be normally).
  • FIG. 4A shows such an automated spot counter 10 in accordance with the present invention.
  • the device 10 includes an automated optical microscope 12 (such as a confocal microscope) having a motorized stage 14 for the movement of a slide 16 relative to the viewing region of the viewing portion 18 of the microscope, a camera 20 for obtaining electronic or digital images from the optical microscope, a processing system 22 for counting the spots, and a memory 24 and a high resolution color monitor 26 for the storage and display respectively of images processed by the device 10.
  • an automated optical microscope 12 such as a confocal microscope
  • a motorized stage 14 for the movement of a slide 16 relative to the viewing region of the viewing portion 18 of the microscope
  • a camera 20 for obtaining electronic or digital images from the optical microscope
  • a processing system 22 for counting the spots
  • a memory 24 and a high resolution color monitor 26 for the storage and display respectively of images processed by the device 10.
  • the classification device 10 is automated and computer implemented, and therefore also includes, in addition to the motorized stage 14, an automated apparatus for focussing, for changing lens objectives between high and low power, and for adjustment of the light incident of the slide, as well as circuitry for controlling the movement of the motorized stage, typically in response to a command from the processing system.
  • the microscope may also include an automated slide transport system for moving the slides containing the specimen to be classified on to and off of the motorized stage, and a bar code reader for reading encoded information from the slide.
  • An example of a microscope performing at least some of these functions is manufactured by Carl Zeiss, Inc. of Germany, or Atto Instruments of Rockville, Maryland.
  • the microscope is a confocal microscope from Atto Instruments, such as that shown in PCT WO 99/22261, a Laser Scanning Microscope LSM 510 from Carl Zeiss, Inc., or an Axioplan 2 microscope from Carl Zeiss, Inc., equipped with a CARV module available from Atto instruments.
  • An example of a camera 20 suitable for use with the invention is a Quantix CCD camera available from Photometries of Tuscon, Arizona.
  • the signal counting device 10 is shown in FIG. 4B with particular emphasis on the classification elements embodied in the processing system 22.
  • the processing system 22 may include an image processor and digitizer 42, and a general processor 46 with peripherals for printing, storage, etc.
  • the general processor 46 can be an INTEL PENTIUM microprocessor or similar microprocessor based microcomputer, although it may be another computer-type device suitable for efficient execution of the functions described herein.
  • the general processor 46 controls the functioning and the flow of data between components of the device 10, may cause execution of additional primary feature signal counting algorithms, and handles the storage of image and classification information.
  • the general processor 46 additionally controls peripheral devices such as a printer 48, a storage device 24, such as an optical or magnetic hard disk, a tape drive, etc., as well as other devices including a bar code reader 50, a slide marker 52, autofocus circuitry, a robotic slide handler, the stage 14, and a mouse 53.
  • peripheral devices such as a printer 48, a storage device 24, such as an optical or magnetic hard disk, a tape drive, etc.
  • other devices including a bar code reader 50, a slide marker 52, autofocus circuitry, a robotic slide handler, the stage 14, and a mouse 53.
  • the depicted devices include computer-readable media such as a hard disk to provide storage of data, data structures, computer-executable instructions, and the like.
  • computer-readable media refers to a hard disk, other types of media which are readable by a computer, such as removable magnetic disks, CDs, magnetic cassettes, flash memory cards, digital video disks, and the like, may also be used.
  • the image processor and digitizer 42 digitizes images from the digital camera 20 and performs a primary algorithmic classification on the images to filter out unwanted information.
  • the image processor 42 and the general computer 46 may each access read-only and/or random access memory, as would be readily apparent to one skilled in the art, for the storage and execution of software necessary to perform the functions described relative to that processing component. Further, each component 42 and 46 includes circuitry, integrated circuit chips, etc. for the control of communication or data transfer over the data bus 54, as well as other functions typical of similar processors.
  • the steps performed by the computer implemented system are illustrated in FIG. 5A.
  • the tissue is first prepared and subjected to FISH (60) in the conventional manner, for example, as described in Example 2.
  • the system then obtains multiple confocal optical sections (62), and the successive images are transformed by detecting and representing positional values in each image of fluorescent emission of a particular color as an array of digital values. Localized increases (spikes) in the digital values may represent signals in the optical section (64).
  • a histogram is then generated (66) which produces a saddle point (described in association with FIG. 8) that permits fluorescent intensity image components below a threshold value to be eliminated (68, 70).
  • those that are substantially contiguous are grouped into segments according to various criteria (72). Additional filtering can be performed (74), for example, to account for autofluorescence and spot clusters.
  • the spot signals With the spot signals then resolved in three-dimensional space, they are counted. Additionally, the counted signals can be combined into a single max image, which is similar to the two-dimensional view seen through a conventional microscope.
  • the max image is a convenient view for rapidly reviewing the FISH results in two dimensions.
  • signals from a second probe may also need to be detected (76).
  • an optical filter can be changed to detect the new color, or an incident beam of laser light of a different color or wavelength can be directed at the tissue section.
  • the signals of a different color are then resolved in three-dimensional space using steps 62-74 as previously described. Once no more signals of a different color are to be obtained, a ratio of the signals from the test probe to the reference probe is calculated to determine whether there is a gene copy alteration.
  • EXAMPLE 2 Example of Automated FISH Signal Counting From a Series of Confocal Images in High Throughput Analysis with Tissue Microarrays
  • Gene amplification is an important mechanism for the up-regulation of critical genes involved in cancer initiation and progression.
  • a number of important oncogenes have already been found to be activated by DNA amplification. These include the HER-2 (17ql2), C-MYC (8q24), PRAD1/CYCLIN D (1 lql3), FGFR-1 (8pl2), and FGFR-2 (10q24) oncogenes. All of these are examples of genes for which alterations in gene copy number could serve as an indicator or disease onset or progression.
  • tissue microarrays of the type shown in PCT publications WO US99/04000 and 04001 (which are fully incorporated by reference) as a high throughput technique for efficiently performing FISH on hundreds of tissue sample specimens.
  • An example of such a tissue microarray is also shown in FIG. 12.
  • an automated technique for counting probe signals in the tissue microarray specimens it would take many hours for each of the tissue specimens in the array to be examined and scored.
  • hundreds or even thousands of the tissue specimens in the array can be examined and scored in much less time.
  • Formalin-fixed and paraffin-embedded tumor and control specimens were obtained from the archives of the Institutes for Pathology, University of Basel (Switzerland) and the Tampere University Hospital (Finland). The least differentiated tumor area was selected to be sampled for the tissue microarray.
  • the specimens that were interpretable for at least one FISH probe included the following: I) transurethral resections from 32 patients with benign prostatic hyperplasia (BPH) to be used as controls; II) 223 primary tumors, including 64 cancers incidentally detected in transurethral resections for BPH; 145 clinically localized cancers from radical prostatectomies, and 14 transurethral resections from patients with primary, locally extensive disease; III) 54 local recurrences after hormonal therapy failure, including 31 transurethral resections from living patients and 23 specimens obtained from autopsies; IV) Sixty-two metastases collected at the autopsies from 47 patients who had undergone androgen deprivation by orchiectomy, and had subsequently died of end-stage metastatic prostate cancer. Metastatic tissue was sampled from pelvic lymph nodes (8), lung (21), liver (16), pleura (5), adrenal gland (5), kidney (2), mediastinal lymph nodes (1), peritoneum (1), stomach (1)
  • the prostate tissue microarray was constructed using a tissue arraying instrument that created holes in a recipient paraffin block, and acquired tissue cores from the donor block by a thin-walled needle with an inner diameter of 0.6mm, held in a X-Y precision guide.
  • the cylindrical sample was retrieved from the selected region in the donor and extruded directly into the recipient block with defined array coordinates.
  • a solid steel wire closely fit in the tube was used to transfer the tissue cores into the recipient block.
  • multiple 5 ⁇ m sections were cut with a microtome using an adhesive-coated tape sectioning system (Instrumedics, Hackensack, New Jersey). H&E stained sections were used for histologic verification of tumor tissue on the arrayed samples.
  • Fluorescence in-situ hybridization (FISH) Fluorescence in-situ hybridization
  • Two-color FISH to sections of the arrayed formalin- fixed samples was performed using a Spectrum Orange-labeled androgen receptor (AR) probe with corresponding FITC-labeled centromeric probes (Vysis, Downer's Grove, Illinois). The hybridization was performed according to the manufacturer's instructions.
  • AR Spectrum Orange-labeled androgen receptor
  • tissue treatment protocol was developed to allow formalin-fixed tumors on the array to be reliably analyzed by FISH: the slides of the prostate tissue microarray were first deparaffinized, acetylated in 0.2 N HC1, incubated in 1 M sodium thiocyanate solution at 80 degrees C for 30 minutes and immersed in a protease solution (0.5mg/ml in 0.9% NaCl) (Vysis, Downer's Grove, Illinois) for 10 minutes at 37 degrees C.
  • protease solution 0.5mg/ml in 0.9% NaCl
  • the slides were then post-fixed in 10% buffered formalin for 10 minutes, air dried, denaturated for 5 minutes at 73 degrees C in 70% formamide/2x SSC (SSC is 0.3M sodium chloride and 0.03M sodium citrate) solution and dehydrated in 70, 80, and 100% ethanol, followed by proteinase K (4 ⁇ g/ml phosphate buffered saline) (GIBCOBRL, Life Technologies Inc.,
  • Test/control signal ratios in the range between 1 and 3 were regarded as low level gains, and were not scored as evidence of specific gene amplification.
  • FIG. 5B shows a series of optical sections 1-5, and a max image M that combines the fluorescent signal segments of the optical sections 1-5.
  • a "signal segment" in an optical section in combination with signal segments from other optical sections, are selectively chosen to make up a complete probe signal seen in the Max view.
  • the red signal segments in the optical sections (and the red signals in the Max view) have weaker signals than the green signals.
  • Any of the two-dimensional optical sections 1-5, or the max image M can be analyzed to determine the ratio of red to green signal segments or signals. As shown in FIG.
  • the ratio of red to green is substantially 1 , throughout the optical sections 1 -5 and in the max image M. This indicates that there is not an amplification of the AR, at least in the tissue section illustrated in FIG. 5B.
  • the green signals appear as brighter spots, and the red signals are more difficult to detect.
  • a three-dimensional representation of the max image can be obtained by analyzing the optical sections that contain each colored signal segment, and separating spot signals that overlap in an axial (vertical) or transverse (horizontal) direction. The separated signals can then be counted, and a ratio of red to green signals more accurately determined.
  • FIG. 6A illustrates FISH-signal counting from a series of confocal images.
  • Cuts 1 through 8 are optical sections obtained by the confocal microscope filtered to obtain only the green signal segments at different levels, while the Max-image is a combination of the cuts 1-8 (and is similar to the superimposed images that would be obtained in conventional two-dimensional FISH).
  • the total number of separate green signal segments counted in cuts 1-8 was 393 (summing the number of signal segments shown in brackets above each section number 1-8).
  • the green signals appear as slightly darker spots, such as the one jutting atop cut #6.
  • the first microarray consisted of 372 ethanol-fixed primary breast cancers.
  • the second microarray consisted of 612 primary breast cancers from the years 1985-1995, from patients with complete clinico-pathological information, including an average of 5.4 years of follow-up. Both series of tumors were analyzed, with 668 cases being informative for all experimental and clinical data. Both tumor cohorts were obtained from the Institute of Pathology, University of Basel.
  • the tumor samples included 73.3% ductal, 13.6% lobular, 3% medullary, 2.6% mucinous, 1.5% cribriform, 1.4% tubular, 1.1% papillary carcinomas, 1.9% ductal carcinoma in situ, and 1.7% of other rare histological subtypes.
  • the grade distribution was 24% grade 1, 40% grade 2, and 36% grade 3.
  • the pT stage was pTl in 32%, pT2 in 51%, pT3 in 7%, and pT4 in 10%.
  • a SpectrumOrange labeled PAC probe specific for S6K and a SpectrumGreen labeled chromosome 17 centromere probe (Vysis, Downers Grove, IL) was used for copy number analysis.
  • Interphase FISH to breast cancer cell lines was done as previously described in Barlund et al., Genes Chrom. Cancer 20:372-376, 1997.
  • the hybridizations are evaluated using a Zeiss confocal fluorescence microscope, and following the algorithm shown in FIG. 5A.
  • the breast cancer tissue section showed a high-level SK6 gene amplification, with a higher number of red signals than green reference signals, as shown in FIG. 6B.
  • the red signals appear as brighter dots, such as the loose cluster of dots in the center of image 5.
  • FIG. 7 shows a graphical user interface (GUI) that is displayed by the system for processing the fluorescent images and counting signals associated with probes that have hybridized to a target nucleic acid sequence.
  • GUI graphical user interface
  • the histogram plots the relative frequency of gray levels (which corresponds to signal intensity) across the pixels of the image of each optical section, where the x-axis is the frequency of occurrence, and the y-axis is a brightness value.
  • This graph shows two modes, in which the first mode (the sharp spike) represents noise (primarily dark background), and the second mode (the broader peak) represents useful signals (such as brighter pixels associated with fluorescent intensity).
  • the histogram is smoothed, and the derivative of the resulting graph is calculated.
  • FIG. 9A shows an image which is displayed of contiguous signal segments in different optical sections 1-8 of a tissue stack (which corresponds to a microarray spot).
  • This FISH signal is designated spot 224, and the image intensity of the signal segments that are present on optical sections 1-8 can be seen in the small image panels across the bottom of the display.
  • Each small panel 1-8 is a section of the three-dimensional space within the tissue section, and the number of pixels where signal is present is shown by the white boxes in the display panels.
  • the center representation (with segments labeled la-8a) will be used for purposes of illustration to explain how the different signal segments are combined into a single spot signal, or how certain signal segments are discarded.
  • the relatively small brightness signals (which correlate with respectively few white pixels) in panels 1, 2, 3 and 4 are mapped into correspondingly small three- dimensional geometric boxes la, 2a, 3a and 4a, having a volume proportional to the area of the illuminated pixels in the panels 1-4.
  • the absence of any illuminated pixels in panel 5 corresponds to an absence of any geometric figure in space 5 a (or can indicate that any pixels were below a threshold value and therefore not considered for further data manipulation).
  • the area of the white pixels in panel 6 correspond to the volume 6a
  • the area of the white pixels in panel 8 corresponds to the volume 8a
  • the very large illuminated area in panel 7 corresponds to the figure 7a, which has the largest volume of any of the geometric figures.
  • One function of the algorithm is to eliminate background noise from actual probe signals. This can be done by setting a threshold value, below which a signal segment can be considered not to be a valid signal segment.
  • the signal segments in panels 1, 2, 3, 4 and 6 do not meet the threshold criteria for minimum illuminated pixels, and are therefore not considered when combining the signal segments into an actual signal.
  • the signal segments in panels 7 and 8 do, however, meet the criteria for minimum pixel illumination. Since more pixels are illuminated in panel 7 than in panel 8, the location of the signal is assigned to level 7, although alternatively it could be considered to reside in both levels 7 and 8.
  • FIGS. 9B and 9C Other signal overlap situations that the algorithm can resolve are also shown schematically in FIGS. 9B and 9C.
  • FIG. 9B a side view of optical sections 1-10 is shown, in which two spot signals vertically (axially with respect to the microscope lens) overlap one another.
  • the signal segments in sections 1, 5, 6 and 10 fall below a pre-selected threshold value, and are eliminated from further consideration.
  • the remaining signal segments in sections 2, 3, 7, 8 and 9 are above the threshold, and are grouped into a first spot signal at contiguous sections 2-4, and a second spot signal at contiguous sections 7-9.
  • the probable location of the top spot signal can be assigned to section 3. where the strongest signal segment of the top spot signal is found, or alternatively the spot signal can be considered to reside at sections 2-4.
  • the probable location of the bottom spot signal can be assigned to section 8, where the strongest signal segment of the bottom spot signal is found, or in sections 7-9 across which the signal segments extend.
  • FIG. 9C schematically illustrates how vertically and/or
  • a side view of optical sections 1 -8 is shown in which the signal segments in sections 1 and 8 do not meet a threshold, and are eliminated.
  • section 4 there are two horizontally non-contiguous signal segments.
  • the signal segment 4B (represented by a gray bar) does not meet a threshold value and is eliminated, while the segment 4A (represented by the black bar) does satisfy the threshold and remains for consideration.
  • the segment 5B (represented by the gray bar) meets the threshold, while the segment 5 A (represented by the black bar) does not meet the threshold and is eliminated.
  • FIG. 10 illustrates a three-dimensional representation of a tissue section that has been subjected to FISH and analyzed by the method of the present invention.
  • the three-dimensional space is divided into x-y-z coordinates, in which the z axis is associated with successive optical sections 1-8 of the tissue section, and the signal segments of all of the signals in the section are illustrated.
  • Signal segments are illustrated as small black cylinders, and the assigned location of each spot is illustrated as a gray colored larger cylinder, some of which are designated Al, A2, A3, A4, A5, A6, A7 and A8.
  • Each of these gray cylinders A is associated with a corresponding sphere B in the max plane, in which the volume of each sphere B can be proportional to the volume of the A cylinder with which it is associated.
  • the system additionally provides an opportunity for a user to provide guidance during spot counting.
  • the user can specify a particular area of interest by selecting it on the screen.
  • the max image e.g., image M of FIG. 5B
  • the user may select an area or areas via a pointing device (e.g., a mouse).
  • Counting is then limited to only the selected area or areas.
  • a pointing device e.g., a mouse
  • the system also provides a way to eliminate a specified area or areas selected via a pointing device (e.g., a mouse). Portions of the image within the specified area or areas (sometimes called “gated areas”) is ignored when spots are counted.
  • a pointing device e.g., a mouse
  • the method of the present invention facilitates the analysis and quantitation of FISH on tissue microarrays, by overcoming previous obstacles to automation.
  • This method applies dual-color FISH with different gene probes and corresponding chromosomal reference probes to tissue microarrays. Because each tissue section is a three-dimensional volume, a confocal microscope system is used to generate a three-dimensional stack of a number of fluorescent images (e.g., 24 images) over defined tumor areas of each of the 4 ⁇ m thick tissue specimens on the tissue microarrays.
  • the algorithm identifies signal segments on each level along the Z-axis, and differentiates signal segments which overlap vertically above the X-Y plane.
  • the algorithm can be refined to account for spot clusters, and a variety of filtering mechanisms can be used to remove signals related to, for example, auto- fluorescence and other false signals.
  • a variety of parameters used during the algorithm can be modified by the user to account for circumstances related to a particular image or set of images. For example, since the magnification used for an image can vary, the parameters of the algorithm can be modified to account for the relative size of a pixel. There are many other examples of parameters (e.g., numeric values) that can be varied to avoid a strict adherence to a particular value.
  • Various results of intermediary calculations are stored to facilitate efficient re-calculation when parameters are modified by the user.
  • the algorithm is provided with a set of digital image slices (e.g., a raster image) representing a set of observations (e.g., of a biological specimen subjected to FISH) taken at different depths along a z-axis via a confocal microscope.
  • the set of images is sometimes called a "stack.”
  • the stack typically consists of anywhere from 8-32 images, although more images may be used. Satisfactory results have been obtained with 24 images.
  • possible fluorescent image components are identified in the slices.
  • pixels meeting certain criteria are selected as possibly corresponding to a location within or near a spot and placed into a set of binarized images.
  • the criteria can include an intensity level, and the images may be subjected to various filters before constructing the binary image.
  • a binary projection image can be constructed from a stack of binary images by setting pixels in the binary projection image at locations (X,Y) whenever there is any pixel set in any of the stack of binary images at the same (X,Y) location (regardless of the Z location).
  • insignificant regions in the image slices are discarded. So, for example, single lone pixels in a slice, or a contiguous set of 2-3 pixels are ignored, removed, or labeled as insignificant.
  • a similar process can be applied to the projection image.
  • each contiguous region in the projection image is considered in turn. For example, minimal rectangles can be drawn around each substantially contiguous region in the projection image, numbered, and considered in turn.
  • contiguous regions in the image slices associated with the projection image region under consideration are analyzed and grouped into spot candidates.
  • the next region in the projection image is considered, until each has been considered.
  • filtering an augmentation can be applied to the candidate spots.
  • certain candidate spots may be false spots, or others may actually be clusters representing plural spots.
  • the spots are counted.
  • the spot count can then be used visa-vis other spot counts to provide a ratio (e.g., a ratio of genes to centromeres).
  • GUI Graphical User Interface
  • An implementation of the algorithm in a scenario with 24 image slices can have the following features:
  • a morphological top-hat transform is applied to each of the 24 images to yield 24 outputs, each possessing brightness intensity spikes jutting above an essentially flat background.
  • Each bright spike can correspond either to a signal segment, or to noise.
  • Each top-hat image is thresholded to produce a stack of 24 binary images showing spike locations. Morphological filters are applied to the binary images to eliminate noise, and touching spots are segmented.
  • Binary spot markers occurring as vertical neighbors in the stack are grouped into one final spot located at a particular assigned stack level.
  • Various parameters of the algorithm are set to fit the physical characteristics of the images. These include window size for the top-hat transform, threshold levels, and sizes of filter structuring elements.
  • X k o B is opening by the structuring element B , which is taken to be a circle fitting within a 7x7 square (roughly, a circle with a diameter of 7).
  • Other structuring elements can be used (e.g., the following examples show a 5x5 square) and can be specified by the user via a user interface feature.
  • the data of the top-hat transform * is stored for future processing in case the value of the threshold needs to be changed.
  • a projection of multiple slices of the image can be calculated as
  • the projection is calculated with reference to all available image slices, in which case the binary image B max is sometimes called the "projection binary image” because it is roughly equivalent to peering down through the stack of binary images.
  • the binary image B max is also a "maximum” image; however, in the example, the maximum is computed comparing only l's and 0's.
  • the obtained binary images B k are processed to find the separate regions revealed by the thresholded top-hat transform. Each actual spot is composed of some number of these regions on different binary slices. Each of these slice regions is referred to as a signal segment, so that each spot signal is composed of a union of signal segments.
  • the software can identify signal segments by locating substantially contiguous groups of pixels, number the signal segments, and calculate the following data:
  • B the number of pixels composing the signal segment
  • C the coordinates of the minimal rectangle which contains the signal segment.
  • the projection binary image B max is processed similarly, and its separate regions (projection binary image regions) are identified by locating substantially contiguous groups of pixels and numbered. The following are calculated for spot signals on the projection binary image:
  • B the number of pixels (l's) composing the region
  • C the coordinates of the minimal rectangle which contains the region
  • the algorithm attempts to identify each of the spots by integrating the signal segments into spot signals via analysis of the sizes, intensities, and locations (in 3 dimensions) of the signal segments.
  • a threshold e.g., 3 pixels
  • signal segments having 3 or fewer pixels can be ignored when determining whether the slice above and/or below a signal segment is contiguous with the particular signal segment.
  • Specif ing such a threshold can be particularly useful in resolving closely- neighboring spots, such as two spots having substantially similar (X,Y) coordinates but appearing in different slices of the image. Examples of such scenarios are shown in FIGS. 9B and 9C. As shown, there can be more than one spot in a vertical direction or in a horizontal direction. During processing, segments that are substantially vertically contiguous can be associated with a particular optical slice having the greatest signal intensity, which is often the center of a single spot. However, it may be that there is another image slice with great signal intensity in the same segment, in which case there may be more than one spot in the segment.
  • the information about a spot candidate includes a size (e.g., total number of pixels across the image slices) and an intensity (e.g., the brightest pixel in the region of the slice having the most pixels for the spot candidate).
  • a size e.g., total number of pixels across the image slices
  • an intensity e.g., the brightest pixel in the region of the slice having the most pixels for the spot candidate.
  • the slice data 1402 is used to construct a max image 1404 and the top-hat slice data 1406.
  • the max image 1404 is useful for presenting to the user to give the user an informative depiction of a combination of the image slices and for various user interface features. It can be constructed by taking the pixel with the greatest intensity out of the image slices in the z-plane.
  • the top-hat slice data 1406 can be generated as described above.
  • Max top- hat image 1408 can be generated by finding the greatest intensity of the top-hat slices for each (X,Y) location.
  • the max top-hat image 1408 need not be displayed, but is useful, in combination with the top-hat slice data 1406 to construct the histogram 1410.
  • a threshold 1412 is chosen with reference to the histogram 1410 as described above. Then, the thresholded slice data 1420 can be constructed by generating a binary image for each of the slices, where pixels in the slices are set if the slice image in the slice data 1402 exceeds the threshold 1412.
  • substantially-contiguous regions of the thresholded slice data can be identified to generate regions 1422.
  • the regions are filtered to produce filtered regions 1424 of the thresholded data.
  • a projected binary image 1426 can be constructed, and substantially-contiguous regions 1430 within the projected binary image 1426 found. These regions can be filtered to generate filtered regions 1440 of the projected image.
  • Z-axis analysis data 1428 can be generated with reference to the filtered regions 1424 and the regions 1430 and used to generate the candidate spots 1432, from which, false spots 1434 and clusters 1442 can be identified. Additional filtering can be performed to generate spot count 1444. Finally, display data 1450 can be used to present graphical results to the user.
  • the data flows in the diagram are general only, and often data in upstream data components can be relied upon by downstream data components (or vice versa), even if not specifically shown in FIG. 14. As the data components are generated, they are often saved to permit efficient re-calculation when a parameter is adjusted to account for a particular image scenario.
  • one feature relates to removing false spots relating to, for example, autofluorescence.
  • Two separate approaches relate to removing based on intensity and intensity in combination with area. For example, after candidate spots are identified, an intensity of the spot can be calculated. If the intensity exceeds a certain threshold (say, two standard deviations above the mean intensity), the spot can be discarded and not counted.
  • the intensity can be combined with (e.g., multiplied by) the area or volume of a spot. Again, if the value is over a particular threshold, the spot can be discarded and not counted.
  • the approaches can be combined. For example, the image in FIG.
  • FIG. 15A illustrates spot candidates in an image of normal prostate tissue in which the X centromere has been labeled via FISH. Some of the spot candidates are actually autofluorescent tissue, and can be eliminated from the spot count. The image can be presented as part of the user interface.
  • the graph in FIG. 15B shows the intensity of the 149 spot candidates identified in the image. A mean and standard deviation of the spot candidate's intensities is calculated, and used as a threshold (e.g., two standard deviations above the mean). Spot candidates having intensities above the threshold are discarded (e.g., identified as "small autofluorescent tissue particles").
  • 15C is similar to that of 15B, but also includes an area component (e.g., by multiplying the intensity by the number of pixels" and not included in the spot count).
  • spot candidates having a rating above the threshold are discarded (e.g., identified as "large autofluorescent tissue particles" and not included in the spot count). Eight candidates can be removed because they exceed the threshold.
  • the threshold in either case can be adjusted manually by a user if the default setting is inappropriate.
  • FIG. 15D shows the image of 15 A, but the small autofluorescent tissue particles, large autofluorescent tissue particles, and the true FISH signals are differentiated by presenting each as a different color. The image can be presented as part of a user interface for the system.
  • the image components 1510 are portrayed in the color green to indicate that they are considered true FISH signals.
  • the image components 1520 and 1521, among others, are portrayed in the color yellow to indicate that they are small autofluorescent tissue particles.
  • the image components 1530-1533, among others, are portrayed in the color blue to indicate that they are large autofluorescent tissue particles.
  • a variety of other colors or other ways of emphasizing and differentiating the image components related to autofluorescence can be used.
  • Still another feature than can be implemented is removal of spots appearing at substantially identical positions in two channels (e.g., red and green channels generated in two separate images, such as an image for a test probe and an image for a reference probe). Typically, such a situation means the spot can be ignored (e.g., not counted for either image).
  • Yet another feature relates to identifying clusters and estimating the number of spots in a cluster.
  • certain criteria e.g., the size of a projection binary image region or the total number of pixels in slice regions related to a projection binary image region
  • the software can be configured to accept user guidance as to which portions of the image are a cluster (e.g., via selection by pointing device). Determining the number of spots in a cluster can be difficult; however, one way of providing accurate estimates is to provide a calibration mechanism. For example, in one instance, 200 sets of clusters were analyzed manually to determine how many spots were in the clusters. These clusters were then subjected to the above-described algorithmic analysis.
  • a factor of 2.5 was determined to be appropriate for determining how many spots were in a cluster.
  • an estimate of 10 is provided.
  • Other, more complex, analysis can be done, such as, for example, counting the total number of pixels associated with the signal segments related to a projection binary image region.
  • a mapping between spots detected and actual spots can be used. Sometimes detection of a small number of spots (e.g., 3) may in fact indicate a tight cluster of (e.g., 9) spots, while detection of a larger number of spots (e.g. 7) may be a loose cluster of (e.g., 9) spots. Therefore, a uniform gain factor is not appropriate.
  • the appropriate method for handling clusters may vary depending on the target being counted. For example, certain genes have a higher propensity for clustering than do centromeres. The user can manually adjust the cluster detection and counting parameters manually. The number of spot signals is counted and the results are provided to the user.
  • the interactive options of the program allow the threshold Tto be changed, and process the spot counting of the fluorescent images from step 3, avoiding the repeated calculation of the top-hat transforms.
  • the output data using this algorithm can include:
  • Table of data for spot-cuts for L slices which consists of:
  • r.v.l. is the abbreviation of relative vertical location
  • y, op are the coordinates of the rectangles surrounding the spot-cuts. The last box indicates whether the spot-cut is below, on, or above the true spot.
  • FIGS. 16A and 16B show scatter plots generated for a FISH experiment in which the presence of centromere 17 and gene HER-2 were detected to generate a ratio indicating possible amplification of HER-2.
  • Spots relating to centromere 17 typically do not cluster and are shown in FIG. 16A. Spot candidates are represented by appearing in a position appropriate for their intensity and size (e.g., number of pixels).
  • the user can set a threshold parameter (e.g., 4) so that candidate spots having fewer than the specified number of pixels are excluded from the spot count. Also, a minimum intensity parameter can be set.
  • Spots relating to HER-2 are shown in FIG. 16B and are more difficult to count due to potential clustering. For purposes of illustration, areas of the plot roughly relating to background noise 1652, real signals 1654, and clusters 1656 are circled.
  • An image e.g., the maximum image 1404 is presented to the user, who can then verify that certain spot candidates are indeed spots, although there are of minimal intensity. The user selects the spot candidate shown and activates a user interface element indicating the spot candidate is of minimal intensity.
  • the spot candidates of minimal intensity are shown in red to facilitate choosing an appropriate minimum intensity threshold (e.g., by adjusting an appropriate parameter in the algorithm), below which spot candidates are not counted as spots.
  • an appropriate minimum intensity threshold e.g., by adjusting an appropriate parameter in the algorithm
  • the minimum intensity threshold is typically adjusted to barely include the minimum intensity candidates.
  • clusters 1656 can be used to adjust the cluster selection criteria.
  • the user can at any time click on a particular spot candidate (e.g., 1660) to navigate to information about the spot candidate, including a three-dimensional view of the spot candidate.
  • the user can then manually designate the spot candidate as a cluster or non-cluster.
  • the scatter plot 1650 thus presents a useful tool for allowing the user to more easily grasp the totality of the image data and easily navigate to appropriate user interface screens for analyzing and adjusting the data, as well as calibrating the system.
  • EXAMPLE 8 User Direction of Areas to be Analyzed
  • the system can also provide a user interface by which the user can guide the selection of areas to be analyzed. Although the system typically discards clearly aberrant data (e.g., a substantially total black or high-intensity image slice), the system may benefit from assistance from the user in determining which areas of an image are of interest or which are not. As described below, the user can select an area to be counted separately or an area not to be counted.
  • FIG. 17 shown a user interface 1702 presented to the user for designating areas of interest.
  • the user interface 1702 presents a max image (e.g., 1404).
  • a pointing device e.g., a mouse or trackball
  • the user can designate a particular area 1706 as one to be processed separately. Multiple areas can be added to an area set and processed separately. In this way, the user can direct the system to focus on particularly relevant portions of the image.
  • the system also supports designating areas as not to be processed or included in the spot count.
  • the algorithm can store the areas in a list and, for example, not count spots at a location (e.g., X, Y coordinates) within the area(s) designated.
  • the above feature is useful because areas of the image can sometimes be identified as debris, stroma, connective tissue, or blood vessels, which tend to interfere with determining an appropriate spot count in certain scenarios (e.g., some material may be particularly prone to autofluorescence).
  • FIG. 18 shows a user interface 1800, which depicts a spot candidate in three views, 1802, 1804, and 1806.
  • the views assist in determining, for example, whether the spot candidate is a cluster, as is likely the case in the illustrated example.
  • the view 1806 is additionally processed to give a smoother representation of the spot candidate.
  • the horizontally-contiguous region with the greatest area e.g., number of pixels
  • a status line 1818 indicates information about the spot candidate.
  • a strip along the interface indicates the max image, and the 15 image slices (numbered 0-14) in binary form after thresholding.
  • the image slice having the greatest area region is noted at 1820.
  • User interface controls 1832 and 1834 allow navigation to spot candidates and identified clusters, respectively.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • General Health & Medical Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • Immunology (AREA)
  • Biophysics (AREA)
  • Biotechnology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Genetics & Genomics (AREA)
  • Microbiology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Dispersion Chemistry (AREA)
  • Pathology (AREA)
  • Biomedical Technology (AREA)
  • Investigating, Analyzing Materials By Fluorescence Or Luminescence (AREA)
  • Apparatus Associated With Microorganisms And Enzymes (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Image Processing (AREA)

Abstract

Fluorescently tagged nucleic acid probe signals are counted in biological specimens by determining a ratio of signals from a test probe to signals of a reference probe. Probe signals need not be counted with reference to cells, nuclei, or nuclear contours. Gene amplification or deletion can thus be detected by analyzing the ratio. Successive image slices are obtained by confocal microscopy, and the images are digitized. The digital images are transformed and analyzed to combine contiguous fluorescent signal segments in successive optical sections to identify discrete probe signals, or spots. Spots overlapping in the axial and transverse dimensions of a three-dimensional representation of the biological specimens can be distinguisghed. A graphical user interface presents various features for consideration by a user, who can provide guidance to a computer system counting the spots. Various features directed to identifying spot clusters and autofluorescent material can increase accuracy of spot counting.

Description

SIGNAL COUNTING FOR IN SITU HYBRIDIZATION
FIELD OF THE INVENTION
This invention relates to methods of counting probe signals in biological specimens, such as probe signals produced by in situ hybridization in cells or tissue sections.
BACKGROUND OF THE INVENTION
Recent advances in molecular medicine have provided a greater opportunity to understand the genetic basis of disease, as well as the cellular mechanisms of disease, and select appropriate treatments with the greatest likelihood of success. Such diagnostic and prognostic cellular changes include the presence of tumor specific cell surface antigens (as in melanoma), and genetic abnormalities (such as activated oncogenes in tumors). A variety of techniques have evolved to detect the presence of these cellular abnormalities, including immunophenotyping with monoclonal antibodies, in situ hybridization with probes, and DNA amplification using the polymerase chain reaction (PCR).
One such technique for molecular diagnosis is in situ hybridization (ISH) in which labeled hybridizing agents (such as DNA, RNA, or single stranded or double stranded DNA probes) are exposed to intact tissue sections. The probes can be labeled by direct or indirect means. In direct labeling, the label (a chromophore) is integral to the probe. Direct labels include fluorescent dyes such as derivatives of rhodamine, fiuorescein, and Texas Red, or enzymatic reporters such as horseradish peroxidase or alkaline phosphatase. Indirect labeling involves attaching a hapten (such as biotin, mercury or dioxygenin) to a probe through a linker. After hybridization, the hapten is detected using a labeled antibody or other specific binding protein.
When fluorescent dyes are used as labels, the technique is referred to as fluorescent in situ hybridization (FISH). Dyes such as fiuorescein isothiocyanate (FITC) are often used to label a sequence of probe DNA in FISH. The probe hybridizes to a defined target nucleotide sequence of DNA in the cell, and the FITC fluoresces green when excited by a mercury arc lamp or Argon laser (in the case of laser scanning microscopy), so that the labeled probe can be visually detected when the probed tissue is viewed through a microscope. Each chromosome containing the target DNA sequence will produce a fluorescent signal (sometimes called a spot) in every cell when the specimen is illuminated with suitable excitation. For example, specimens hybridized with a probe hybridizing to a specific region on chromosome 21 will produce two fluorescent signals in cells from normal individuals and three signals from Down's Syndrome (Trisomy 21) subjects who have an extra chromosome number 21. Alternatively, a fluorescent probe that hybridizes to an X chromosome will give one fluorescent signal per cell in males (who have only one X chromosome) and two fluorescent signals in females (who have two X chromosomes).
FISH is an excellent method for detection of gene copy number alterations in cancer and other diseases. FISH is a standard tool for analyzing congenital genetic alterations in clinical diagnostics (such as numerical chromosomal alterations, duplications, inversions and microdeletions). In cancer, characteristic gene amplifications or deletions are associated with the development and progression of the tumor. FISH analysis of gene amplifications in cancer can have prognostic and therapeutic significance, such as the detection of the HER-2 oncogene amplification in breast cancer. Detection of this oncogene by FISH was recently approved by the Food and Drug Administration as a diagnostic tool for human breast cancer.
However, a limitation on the more widespread use of FISH technology has been that counting of the fluorescent spots is extremely tedious, inaccurate, often highly subjective, and subject to substantial intra-observer variability. It also requires a highly trained technician who can recognize the cells or tissue to be analyzed and recognize and count tiny fluorescent spots accurately. Finally, at most 100 or 200 cells are typically analyzed per specimen, and in the case of gene amplification, much less than that (such as 20 per specimen). This can lead to statistical inaccuracy in defining the correct copy number.
A significant impediment to the accurate counting of fluorescent signals is that the probes hybridize throughout a three-dimensional nucleus, and the probe signals have to be counted from different focal planes for each nucleus. However, cells and probe signals can overlap in the two-dimensional view, and overlapping signals are seen as a single signal. Such overlapping results in undercounting of signals, which can make it appear that an amplified gene is less amplified, or that fewer copies of a normal gene are present.
Given the tedium and subjectivity of signal counting, efforts have been made to automate this technique. For example, U.S. Patent No. 5,523,207 discloses a two dye FISH method in which probes are labeled with a first dye (such as FITC) and a contour of the nucleus is labeled with a second dye (such as propidium iodide or PI). These two dyes allow the number of signals per visualized nucleus to be determined. However, automated FISH spot counting using such techniques has been limited because FISH signals in the nuclei are often at different focal planes, resulting in interfering out-of-focus light. Moreover, automated detection of nuclear boundaries has been very difficult to perform in tissue sections. These factors have contributed to the inadequacy of existing algorithms for performing automated FISH.
It would therefore be helpful to provide a method and device for improving the accuracy of FISH spot counting.
SUMMARY OF THE DISCLOSURE
In one embodiment disclosed herein, fluorescently tagged nucleic acid probe signals are counted in a region of interest in a biological specimen by determining a ratio of signals from a test probe to signals of a reference probe, and the region of interest includes multiple cells. This is a contrast to prior approaches, in which probe signals have been counted with reference to cells or nuclei, and in which automated methods have counted probe signals with respect to stained nuclear contours. In certain embodiments, the reference probe may be a fluorescently labeled polynucleotide (such as DNA or RNA) that hybridizes to the region of interest in a gene, and the reference probe is a polynucleotide labeled with a different fluorescent color, and which hybridizes to a reference target. The test probe may hybridize to a gene that is implicated or suspected to be involved in a particular disease, such as tumor development and progression. The reference probe may, for example, recognize a centromere of the same chromosome on which the gene of interest is contained. An increased ratio of test probe signals to reference probe signals would then indicate an amplification in gene copy number, while a decrease in that ratio would indicate a relative loss of the gene of interest (such as a gene deletion). By determining a ratio between the quantities of test and reference signals, the problem of measuring changes in gene copy number with respect to a cell or nucleus is avoided. Typically, a count of centromeres approximates a nucleus or cell count. Particularly accurate counting of FISH signals can also be accomplished by obtaining successive contiguous images of the region of interest to distinguish overlapping signals from the biological specimen. Without distinguishing the overlapping signals, they would otherwise obscure one another, and diminish accuracy of the spot count. In particular embodiments, the successive images are slices, such as digital microscopic optical sections from different depths of the biological specimen, obtained by confocal microscopy. The successive images are transformed by detecting and representing the positional values in each image of fluorescent emission signal segments, which make up the probe signal, as an array of digital values. Signal segments below a certain value (e.g. a threshold number of pixels) may be eliminated. The remaining digital signal segments may then be analyzed to combine contiguous fluorescent signal segments in successive optical sections into a single spot signal, which may be assigned to a particular optical section in which a strongest fluorescent signal segment is located, or a group of optical sections across which the contiguous signal segments have been detected. This localization of the fluorescent signal to a particular optical section allows overlapping spots to be distinguished, both in the axial and transverse dimensions of the three-dimensional representation.
The system detects the location of fluorescent spot signals in three- dimensional space by performing a morphological top-hat transform to digital images of the different levels to obtain fluorescent intensity spikes that indicate a spot signal segment. A threshold level of fluorescence intensity is determined to eliminate signal segments that are below a fluorescence intensity that would be associated with a valid spot signal. Remaining contiguous spot signal segments are segmented into a single spot signal, and the single spot signal may be assigned a location at a level associated with a greatest fluorescent intensity signal segment. Certain disclosed embodiments also include devices for counting signals from in situ hybridization of probes in biological tissue, or determining a three- dimensional relationship between the signals, in which the device includes a confocal microscope, a digital camera positioned to obtain digital optical section images of different levels of the biological tissue, and a computer implemented system that detects and combines contiguous or adjacent signal segments (which are above a threshold) at the different levels, and separates vertically overlying, transversely overlapping, or non-contiguous signals from one another. After separating such signals from different levels into different spot signals, the computer implemented system then counts the spot signals, or compares their relative locations in three-dimensional space. Two or more different signal types (such as two or more distinguishable fluorescent dyes) may be used, one color for the test probe signal and a second color for the reference probe signal. The ratio of the number of test probe signals to reference probe signals can then be determined, or an unexpected overlap of the signals (as in a genetic translocation) can be assessed. The test probe signals and the reference probe signals may be obtained separately, for example by successively illuminating the tissue specimen with light of different colors that selectively causes the different dyes to fluoresce, by viewing the specimen through filters that filter out signals (such as a filter that removes colors except the color of interest), or exposing separate contiguous tissue sections to the different probes. Alternatively, the test and reference probe signals can be obtained simultaneously, using multiple band-pass excitation and/or emission filters. Once the test and reference probe spot signals have been counted, a ratio of test probe signals to reference probe signals is calculated (without reference to boundaries of a cell or nucleus) to determine whether there is a genetic alteration, such as an alteration in gene copy number.
Alternatively, in certain disclosed embodiments, the FISH spots are accurately counted by obtaining the plurality of digital optical images at different levels of a biological sample, and constructing a three-dimensional image showing discrete fluorescent signal segments at different levels of the three-dimensional image. The three-dimensional image is constructed by determining a location of a fluorescent signal segment of a particular color in the different levels of the biological sample, combining contiguous signal segments (which are above a preselected threshold) into a single spot signal, and separating non-contiguous signal segments into different spots signals. The location of signal segments in each level is determined by the presence of a fluorescent brightness intensity spike that indicates an increase in image component intensity as compared to a background intensity. The locations of signals of different colors can be similarly resolved in three-dimensional space, the number of spot signals of each color counted, and a ratio of the spot signals determined. This three-dimensional imaging would also allow one to study the orientation or location of the spots in the nuclei to make conclusions about the presence of genetic rearrangements that do not change copy number of the signals, such as translocations and inversions.
Certain features can be implemented to increase the accuracy of the spot counting. For example, under certain circumstances, a group of spots may be packed so closely together as to form a cluster that is difficult to analyze using a standard algorithm. Clusters can be automatically identified and counted using a cluster calibration feature, or a user can specify that a particular region of an image is a cluster.
Another feature for increasing accuracy filters out false spots by, for instance, eliminating spots appearing at a same location in the test and reference probe images or filtering out image data indicating autofluorescence. In addition, a user interface can be presented to allow a user to either select areas of particular interest for separate processing or specify areas not to be processed. In this way, the invention can benefit from information provided by a human operator.
In the copy number analysis, the ratio of signals can be a ratio of spot signals from a test probe that recognizes a gene of interest, and a reference probe that recognizes a chromosomal locus having an expected quantity in the biological specimen. The method can further include determining whether there is an increase in an expected ratio between the test signals and the reference signals, indicating an amplification of the gene of interest, or whether there is a decrease in the expected ratio between the test signals and the reference signals, indicating relative loss of the gene of interest. This method is particularly applicable to high throughput techniques for performing automated FISH analysis of a large number of tissue, cell, or other specimens. The specimens can, for example, be in a tissue or cell microarray that includes specimens from the same or different sources, such as tumors. In such embodiments, the method includes providing an array of biological samples, hybridizing the biological samples with a fluorescent test probe that hybridizes to a gene of interest in the biological samples and with a fluorescent reference probe that hybridizes to a chromosomal reference locus in the biological samples. Images are then obtained by confocal microscopy of contiguous sections at different depths of a plurality of the biological samples in the array, and fluorescent signal segments from the contiguous sections are detected. The contiguous signal segments in different sections are combined into a corresponding single spot signal, and the separate spot signals are resolved from one another. The distinct spot signals can then be counted. In particular embodiments, the tissue array can include an array of many different tissue specimens, for example at least 50 tissue specimens, but hundreds or even thousands of tissue specimens can be included in the microarray. The tissue arrays can be constructed by obtaining a plurality of donor specimens, placing each donor specimen in an assigned location in a recipient array, and obtaining a plurality of copies of the recipient array in a manner that each copy contains a plurality of donor specimens that maintain their assigned locations. For example, a set of tissue samples relating to a same type of tissue from a plurality of donor specimens can be included. However, tissue arrays made by any method are suitable for use with the method of counting FISH spot signals.
The foregoing and other objects, features, and advantages of the invention will become more apparent from the following detailed description of disclosed embodiments which proceeds with respect to the accompanying drawings.
BRIEF DESCRIPTION OF THE FIGURES
FIGS 1A and IB are schematic side and top views illustrating the problem of viewing three-dimensional FISH images in two dimensions. FIG. 2A is a schematic view illustrating the prior approach of counting two color FISH spot signals with respect to cell nuclei (the contours of the nuclei being shown around the spot signals).
FIG. 2B is a view similar to FIG. 2A, but shows counting two color FISH spot signals and determining a ratio of the different colored signals in a region of interest (such as a field of view FOV) instead of with respect to cell nuclei. FIG. 2C is a photomicrograph of prostate tissue, illustrating a region of interest (ROI) within the tissue for purposes of calculating a ratio of test to reference probe signals. FIG. 3 is a schematic view illustrating two color FISH, in which signal locations have been determined in three-dimensional space. .
FIGS. 4A and 4B are flow diagrams illustrating one embodiment of an automated system for counting FISH spot signals in accordance with the present invention.
FIG. 5 A is a flow diagram illustrating an algorithm for counting the FISH signals.
FIG. 5B is a series of optical sections 1-5, and a max image M, of prostate tissue subjected to two color FISH with a probe for the androgen receptor (labeled red) and a probe for the normal X chromosome (labeled green), in which red signals appear weaker than green signals. FIG. 6A is a composite figure showing a print-out of optical sections of a series of eight confocal images (cuts 1-8), and a view of the image that would be seen (max-image) when the series of eight overlapping images would be viewed from above as a two-dimensional image.
FIG. 6B is a series of optical sections 1-5, and a max image M, of breast cancer tissue subjected to two color FISH with a ribosomal S6K probe (labeled red) and a probe for the chromosome 17 centromere (labeled green), in which red signals appear weaker than green signals. The excess of red S6 kinase signals over green reference signals indicates amplification of kinase.
FIG. 7 is a MATLAB graphical user interface (GUI) for displaying optical sections of a three-dimensional tissue specimen that has been subjected to FISH. FIG. 8 is a histogram of the number of occurrences of the various gray levels, or fluorescence intensities, in the max image, which is used to threshold valid pixels and eliminate non-informative image components.
FIG. 9A is a print-out of a MATLAB GUI illustrating processing spot signal segments from different levels, and combining contiguous signal segments above a threshold into a single spot signal (which can be assigned to a particular level or levels) for purposes of counting a spot signal.
FIG. 9B is a schematic view of a vertical side view of optical sections, illustrating how image components below a certain threshold are eliminated to separate otherwise vertically contiguous signal segments, and separate overlying signals from one another.
FIG. 9C is a view similar to FIG. 9B, showing how transversely and vertically overlapping signal segments are separated by eliminating signal segments below a threshold, and grouping the remaining signal segments into discrete spot signals.
FIG. 10 is a three-dimensional representation of signal segments in different optical sections, resolved into a spot in an assigned optical section, and a combined image of the signals projected as spheres into a max image, in which a volume of the spheres is proportional to an intensity of the signal. FIG. 11 is a view of a tissue section mounted on a slide, for processing by
FISH.
FIG. 12 is a view of a tissue microarray mounted on a slide, for processing by FISH.
FIG. 13 is a flowchart showing an overview of an algorithm used to count FISH spots in a set of digital image slices.
FIG. 14 is a data flow diagram of an algorithm used to count FISH spots in a set of digital image slices.
FIG. 15A is an image showing FISH spots relating to the X centromere in normal prostate tissue, including false spots caused by autofluorescence. FIG. 15B is a graph showing intensities of spot candidates for purposes of removing spots exceeding a threshold intensity. FIG. 15C is a graph showing intensities combined with area for spot candidates for purposes of removing spots exceeding a threshold.
FIG. 15D is the image of 15A modified to emphasize spot candidates identified as autofluorescent tissue particles. FIG. 16A is a scatter plot of spot candidates for an image relating to a FISH experiment conducted for centromere 17.
FIG. 16B is a scatter plot of spot candidates for an image relating to a FISH experiment conducted for the gene HER-2 and identifies spot clusters on the plot.
FIG. 17 is an illustration of a user interface for selecting areas of interest to be considered when calculating a spot count for an image.
FIG. 18 is an illustration of a user interface depicting a three-dimensional representation of a spot candidate.
DETAILED DESCRIPTION OF SEVERAL EXAMPLES The present invention includes a method and apparatus for counting FISH spots, and particularly an automated, computer implemented method and device that helps avoid counting errors introduced by vertically or horizontally overlapping spots in two-dimensional projections of three-dimensional tissue sections. One of the problems solved by this invention is illustrated in FIG. 1 A, which shows a vertical section through a stack of contiguous horizontal confocal layers of a tissue section that has been subjected to FISH. Hybridization probe signals are shown as spots in this figure, in which it can be seen in a vertical section that the spots are discrete and spaced along the z-axis. However, when viewed from above (in an x-y plane) as shown at the bottom of FIG. 1A, the discrete spots overlap and can not clearly be counted as separate spots. FIG. IB illustrates a similar problem in which even more probe spots appear throughout the z-depth of the tissue section, but appear as an indistinguishable blur when viewed in an x-y plane from above, as shown at the bottom of FIG. IB.
Another problem with the prior art is illustrated in FIG. 2A, which shows the conventional technique of counting hybridization signals within each cell nucleus. This view illustrates two color FISH, in which two probes are labeled with different dyes that fluoresce with different colors. The red label (R) may, for example, be attached to a probe that hybridizes to a gene of interest (such as a hormone receptor gene that may be amplified in certain tumors). The green label (G), in contrast, may be attached to a probe that hybridizes to a known chromosomal locus that is not expected to vary in disease states (such as the centromere of a chromosome on which the gene of interest is found). The red label (R) is illustrated by a gray color in the schematic figure, while the green label (G) is represented by a darker color.
In a particular example, the gene of interest could be recognized by a probe for a gene on the X chromosome labeled with spectrum orange (to provide an orange-red spot signal), and the reference probe could be labeled with spectrum green (to provide a green spot signal) for the centromere of the X-chromosome. A single green signal (G) would therefore be observed in the nuclei of the schematic representation of FIG. 2 A from male cells (which have only one X chromosome), while two green signals (G) would be seen in female cells. Amplification of the gene of interest would be noted in certain cells in the schematic representation of FIG. 2A in which there is an increase in the ratio of red signals (R) to green signals (G).
As shown in FIG. 2A, it is conventional to count the number of signals in each nucleus of a large number of cells. This approach has been adopted because amplification or deletion of a gene occurs in large populations of cells, and significant changes in copy numbers of genes are often only detected by examining a large number of cells (for example, at least 200). Because the amplification has been considered to be a nuclear event, a change in the copy number of a gene with respect to each nucleus has been counted, both manually and in automated systems. Such systems have been difficult to automate, however, because cells and nuclei overlap (as shown by the overlapping nuclear contours in FIG. 2A), and the nuclear contours have been difficult to reliably recognize in automated systems.
The present invention adopts a different approach which has been found to be more accurate, and has the additional advantage of being more accurately automated. This approach is shown in FIG. 2B, in which the ratios of probes are determined without reference to the cells (or the nucleus) in which the probes are contained. The nuclear contours highlighted in FIG. 2A are absent in FIG. 2B to illustrate this difference. Hence, FIG. 2B shows the FISH spots of FIG. 2A in a region of interest (such as the microscope field of view [FOV] shown in FIG. 2 A), but without reference to the nuclear contours illustrated in FIG. 2A. In accordance with the present invention, the ratio of test probes (R) to reference probes (G) in the region of interest is the ratio that is calculated. It has been found that this ratio provides sufficient information (over a sufficiently large number of cells in a region of interest) to be informative about the relative amplification or deletion of a gene of interest.
A region of interest is any arbitrary informative region across which an informative ratio can be determined. In some instances, a region of interest is a microscopic field of view at low magnifications (e.g. 100-200X). An entire microscope field of view can then be used for the image capturing and analysis at 400-1000X magnification (X40-100 objectives) for FISH analysis. The thickness of the tissue sections used for FISH analysis is the same as in sections routinely used for histopathological analyses, ranging from 4-10 μ in thickness. Another example of a region of interest is an area which is selected for a specific analysis, as shown in FIG. 2C, which is a cross-sectional photomicrograph of tissue from a human prostate, as it would appear in a tissue microarray. In this figure, the region of interest (ROI) is the area of malignant cells that is outlined in black. A region of interest in this example is any homogenous site of a specimen, where most, if not all, specimens carry a particular alteration. However, contamination with non-altered normal tissue in the region of interest can be tolerated, if the copy number alteration in the abnormal cells is substantial.
In some embodiments, to further improve the ability of the system to accurately determine the ratio of test probe signals to reference probe signals, a view of the probe signals in three-dimensional space is constructed, as shown in FIG. 3. This view shows the three-dimensional relationship of the test and reference probe signals, which are interspersed among one another in all three dimensions of the tissue section which has undergone FISH. This three-dimensional view of the section can be further processed by a computer implemented system, as described below, to automatically count signals of each color, and to obtain a ratio of the different colored signals. This approach is particularly useful for high-throughput analysis, for example, of tissue microarrays such as those disclosed in PCT publications WO US99/04000 and US99/04001.
In yet other embodiments, the three-dimensional imaging can be used to detect many kinds of genetic rearrangements in cells other than deletions or amplifications. For example, differently colored probes for the bcr and abl genes could be used to detect a fusion of these genes which occurs in a genetic translocation associated with chronic myelogenous leukemia (CML). See Tkachuk et al., Science 250:559-562, 1990. Such differentially labeled probes will flank translocation breakpoints, and produce a fusion after the translocation has occurred, but will be at separate loci if the translocation has not occurred. However, unless proximity or distance between the differentially labeled spots can be determined in three-dimensions, a measurement of the fusion will be inaccurate. Hence the method of determining a three-dimensional relationship of probe spot signals within a region of interest (such as a nucleus or other region) will permit a large scale quantitative analysis of three-dimensional distances between any two probe signals. For example, the three-dimensional coordinates of a red signal are determined, and the three-dimensional coordinates of a green signal are determined, and the coordinates are then analyzed to determine if they are overlapping (suggesting a translocation that has moved them into contiguous genetic loci) or separate (on different genetic loci than they would be normally).
EXAMPLE 1 Device for Performing Automated FISH
FIG. 4A shows such an automated spot counter 10 in accordance with the present invention. Briefly, the device 10 includes an automated optical microscope 12 (such as a confocal microscope) having a motorized stage 14 for the movement of a slide 16 relative to the viewing region of the viewing portion 18 of the microscope, a camera 20 for obtaining electronic or digital images from the optical microscope, a processing system 22 for counting the spots, and a memory 24 and a high resolution color monitor 26 for the storage and display respectively of images processed by the device 10. In a disclosed embodiment, the classification device 10 is automated and computer implemented, and therefore also includes, in addition to the motorized stage 14, an automated apparatus for focussing, for changing lens objectives between high and low power, and for adjustment of the light incident of the slide, as well as circuitry for controlling the movement of the motorized stage, typically in response to a command from the processing system. The microscope may also include an automated slide transport system for moving the slides containing the specimen to be classified on to and off of the motorized stage, and a bar code reader for reading encoded information from the slide. An example of a microscope performing at least some of these functions is manufactured by Carl Zeiss, Inc. of Germany, or Atto Instruments of Rockville, Maryland. In particular embodiments, the microscope is a confocal microscope from Atto Instruments, such as that shown in PCT WO 99/22261, a Laser Scanning Microscope LSM 510 from Carl Zeiss, Inc., or an Axioplan 2 microscope from Carl Zeiss, Inc., equipped with a CARV module available from Atto instruments. An example of a camera 20 suitable for use with the invention, is a Quantix CCD camera available from Photometries of Tuscon, Arizona.
The signal counting device 10 is shown in FIG. 4B with particular emphasis on the classification elements embodied in the processing system 22. The processing system 22 may include an image processor and digitizer 42, and a general processor 46 with peripherals for printing, storage, etc. The general processor 46 can be an INTEL PENTIUM microprocessor or similar microprocessor based microcomputer, although it may be another computer-type device suitable for efficient execution of the functions described herein. The general processor 46 controls the functioning and the flow of data between components of the device 10, may cause execution of additional primary feature signal counting algorithms, and handles the storage of image and classification information. The general processor 46 additionally controls peripheral devices such as a printer 48, a storage device 24, such as an optical or magnetic hard disk, a tape drive, etc., as well as other devices including a bar code reader 50, a slide marker 52, autofocus circuitry, a robotic slide handler, the stage 14, and a mouse 53. Although a single processor system is shown, the invention could also be carried out in a variety of other systems. The depicted devices include computer-readable media such as a hard disk to provide storage of data, data structures, computer-executable instructions, and the like. Although the description of computer-readable media above refers to a hard disk, other types of media which are readable by a computer, such as removable magnetic disks, CDs, magnetic cassettes, flash memory cards, digital video disks, and the like, may also be used.
The image processor and digitizer 42 (hereinafter referred to as the image processor) digitizes images from the digital camera 20 and performs a primary algorithmic classification on the images to filter out unwanted information. The image processor 42 and the general computer 46 may each access read-only and/or random access memory, as would be readily apparent to one skilled in the art, for the storage and execution of software necessary to perform the functions described relative to that processing component. Further, each component 42 and 46 includes circuitry, integrated circuit chips, etc. for the control of communication or data transfer over the data bus 54, as well as other functions typical of similar processors. The steps performed by the computer implemented system are illustrated in FIG. 5A. The tissue is first prepared and subjected to FISH (60) in the conventional manner, for example, as described in Example 2. The system then obtains multiple confocal optical sections (62), and the successive images are transformed by detecting and representing positional values in each image of fluorescent emission of a particular color as an array of digital values. Localized increases (spikes) in the digital values may represent signals in the optical section (64). A histogram is then generated (66) which produces a saddle point (described in association with FIG. 8) that permits fluorescent intensity image components below a threshold value to be eliminated (68, 70). Among the remaining image components, those that are substantially contiguous are grouped into segments according to various criteria (72). Additional filtering can be performed (74), for example, to account for autofluorescence and spot clusters.
With the spot signals then resolved in three-dimensional space, they are counted. Additionally, the counted signals can be combined into a single max image, which is similar to the two-dimensional view seen through a conventional microscope. The max image is a convenient view for rapidly reviewing the FISH results in two dimensions.
In two-color FISH, signals from a second probe (usually of a different color) may also need to be detected (76). In that event, an optical filter can be changed to detect the new color, or an incident beam of laser light of a different color or wavelength can be directed at the tissue section. The signals of a different color are then resolved in three-dimensional space using steps 62-74 as previously described. Once no more signals of a different color are to be obtained, a ratio of the signals from the test probe to the reference probe is calculated to determine whether there is a gene copy alteration.
EXAMPLE 2 Example of Automated FISH Signal Counting From a Series of Confocal Images in High Throughput Analysis with Tissue Microarrays Gene amplification is an important mechanism for the up-regulation of critical genes involved in cancer initiation and progression. A number of important oncogenes have already been found to be activated by DNA amplification. These include the HER-2 (17ql2), C-MYC (8q24), PRAD1/CYCLIN D (1 lql3), FGFR-1 (8pl2), and FGFR-2 (10q24) oncogenes. All of these are examples of genes for which alterations in gene copy number could serve as an indicator or disease onset or progression.
This example uses tissue microarrays, of the type shown in PCT publications WO US99/04000 and 04001 (which are fully incorporated by reference) as a high throughput technique for efficiently performing FISH on hundreds of tissue sample specimens. An example of such a tissue microarray is also shown in FIG. 12. In the absence of an automated technique for counting probe signals in the tissue microarray specimens, it would take many hours for each of the tissue specimens in the array to be examined and scored. However, using the automated technique described herein, hundreds or even thousands of the tissue specimens in the array can be examined and scored in much less time. Prostate tumor microarray
Formalin-fixed and paraffin-embedded tumor and control specimens were obtained from the archives of the Institutes for Pathology, University of Basel (Switzerland) and the Tampere University Hospital (Finland). The least differentiated tumor area was selected to be sampled for the tissue microarray. The specimens that were interpretable for at least one FISH probe included the following: I) transurethral resections from 32 patients with benign prostatic hyperplasia (BPH) to be used as controls; II) 223 primary tumors, including 64 cancers incidentally detected in transurethral resections for BPH; 145 clinically localized cancers from radical prostatectomies, and 14 transurethral resections from patients with primary, locally extensive disease; III) 54 local recurrences after hormonal therapy failure, including 31 transurethral resections from living patients and 23 specimens obtained from autopsies; IV) Sixty-two metastases collected at the autopsies from 47 patients who had undergone androgen deprivation by orchiectomy, and had subsequently died of end-stage metastatic prostate cancer. Metastatic tissue was sampled from pelvic lymph nodes (8), lung (21), liver (16), pleura (5), adrenal gland (5), kidney (2), mediastinal lymph nodes (1), peritoneum (1), stomach (1), and ureter (1).
Construction and sectioning of tissue microarrays
The prostate tissue microarray was constructed using a tissue arraying instrument that created holes in a recipient paraffin block, and acquired tissue cores from the donor block by a thin-walled needle with an inner diameter of 0.6mm, held in a X-Y precision guide. The cylindrical sample was retrieved from the selected region in the donor and extruded directly into the recipient block with defined array coordinates. A solid steel wire closely fit in the tube was used to transfer the tissue cores into the recipient block. After the construction of the array block, multiple 5μm sections were cut with a microtome using an adhesive-coated tape sectioning system (Instrumedics, Hackensack, New Jersey). H&E stained sections were used for histologic verification of tumor tissue on the arrayed samples. Fluorescence in-situ hybridization (FISH)
Two-color FISH to sections of the arrayed formalin- fixed samples was performed using a Spectrum Orange-labeled androgen receptor (AR) probe with corresponding FITC-labeled centromeric probes (Vysis, Downer's Grove, Illinois). The hybridization was performed according to the manufacturer's instructions. The following tissue treatment protocol was developed to allow formalin-fixed tumors on the array to be reliably analyzed by FISH: the slides of the prostate tissue microarray were first deparaffinized, acetylated in 0.2 N HC1, incubated in 1 M sodium thiocyanate solution at 80 degrees C for 30 minutes and immersed in a protease solution (0.5mg/ml in 0.9% NaCl) (Vysis, Downer's Grove, Illinois) for 10 minutes at 37 degrees C. The slides were then post-fixed in 10% buffered formalin for 10 minutes, air dried, denaturated for 5 minutes at 73 degrees C in 70% formamide/2x SSC (SSC is 0.3M sodium chloride and 0.03M sodium citrate) solution and dehydrated in 70, 80, and 100% ethanol, followed by proteinase K (4μg/ml phosphate buffered saline) (GIBCOBRL, Life Technologies Inc.,
Rockville, Maryland) treatment for 7 minutes at 37 degrees C. The slides were then dehydrated and hybridized. The hybridization mixture contained 3μl of each of the probes and Cotl-DNA (lmg/ml; GIBCOBRL, LifeTechnologies Inc., Rockville, Maryland) in a hybridization mixture. After overnight hybridization at 37 degrees C in a humid chamber, slides were washed and counterstained with 0.2μM DAPI. FISH signals were scored with a Zeiss fluorescence microscope equipped with a double-band pass filter using x40-xl00 objectives. The relative number of gene signals in relation to the centromeric signals was evaluated for a region of interest. Criteria for gene amplification were as follows: at least 3 times more test probe signals than centromeric signals per cell in at least 10% of the tumor cells.
Test/control signal ratios in the range between 1 and 3 were regarded as low level gains, and were not scored as evidence of specific gene amplification.
Output from the two color AR FISH is shown in FIG. 5B, which shows a series of optical sections 1-5, and a max image M that combines the fluorescent signal segments of the optical sections 1-5. A "signal segment" in an optical section, in combination with signal segments from other optical sections, are selectively chosen to make up a complete probe signal seen in the Max view. The red signal segments in the optical sections (and the red signals in the Max view) have weaker signals than the green signals. Any of the two-dimensional optical sections 1-5, or the max image M, can be analyzed to determine the ratio of red to green signal segments or signals. As shown in FIG. 5B, the ratio of red to green (seen as a ratio of weaker to stronger spots) is substantially 1 , throughout the optical sections 1 -5 and in the max image M. This indicates that there is not an amplification of the AR, at least in the tissue section illustrated in FIG. 5B. In a gray scale version of FIG. 5B, the green signals appear as brighter spots, and the red signals are more difficult to detect. Alternatively, a three-dimensional representation of the max image can be obtained by analyzing the optical sections that contain each colored signal segment, and separating spot signals that overlap in an axial (vertical) or transverse (horizontal) direction. The separated signals can then be counted, and a ratio of red to green signals more accurately determined. Examples of FISH output from the green labeled probe, obtained by the three-dimensional analysis system of the present invention, are shown in FIG. 6A, which illustrates FISH-signal counting from a series of confocal images. Cuts 1 through 8 are optical sections obtained by the confocal microscope filtered to obtain only the green signal segments at different levels, while the Max-image is a combination of the cuts 1-8 (and is similar to the superimposed images that would be obtained in conventional two-dimensional FISH). The total number of separate green signal segments counted in cuts 1-8 was 393 (summing the number of signal segments shown in brackets above each section number 1-8). After automated analysis of the signal segments, with separation of vertically and horizontally overlapping signals, it was determined that there were 283 unique signals. In a gray scale version of FIG. 6A, the green signals appear as slightly darker spots, such as the one jutting atop cut #6.
EXAMPLE 3 S6K Amplification in Breast Cancer
In this example, the biological consequences were examined of genomic rearrangements at 17q23, a locus amplified in up to 20% of primary breast cancers as assessed by comparative genomic hybridization (CGH). An array of primary breast cancers was constructed, and used to determine S6K gene amplification frequencies in vivo.
Two cohorts of primary breast cancers were studied using the tissue microarray analyses. The first microarray consisted of 372 ethanol-fixed primary breast cancers. The second microarray consisted of 612 primary breast cancers from the years 1985-1995, from patients with complete clinico-pathological information, including an average of 5.4 years of follow-up. Both series of tumors were analyzed, with 668 cases being informative for all experimental and clinical data. Both tumor cohorts were obtained from the Institute of Pathology, University of Basel. The tumor samples included 73.3% ductal, 13.6% lobular, 3% medullary, 2.6% mucinous, 1.5% cribriform, 1.4% tubular, 1.1% papillary carcinomas, 1.9% ductal carcinoma in situ, and 1.7% of other rare histological subtypes. The grade distribution was 24% grade 1, 40% grade 2, and 36% grade 3. The pT stage was pTl in 32%, pT2 in 51%, pT3 in 7%, and pT4 in 10%. A SpectrumOrange labeled PAC probe specific for S6K and a SpectrumGreen labeled chromosome 17 centromere probe (Vysis, Downers Grove, IL) was used for copy number analysis. Interphase FISH to breast cancer cell lines was done as previously described in Barlund et al., Genes Chrom. Cancer 20:372-376, 1997. The hybridizations are evaluated using a Zeiss confocal fluorescence microscope, and following the algorithm shown in FIG. 5A.
The breast cancer tissue section showed a high-level SK6 gene amplification, with a higher number of red signals than green reference signals, as shown in FIG. 6B. In a gray scale version of 6B, the red signals appear as brighter dots, such as the loose cluster of dots in the center of image 5.
EXAMPLE 4 Graphical User Interface for Three-Dimensional FISH Signal Counting Although the invention can be implemented in a variety of computing environments, the following examples are implemented in MATLAB, which is available from Mathworks of Natick, Massachusetts. FIG. 7 shows a graphical user interface (GUI) that is displayed by the system for processing the fluorescent images and counting signals associated with probes that have hybridized to a target nucleic acid sequence. The image- displayed on the interface is an image of one of the optical sections of a tissue section that has undergone FISH (in this case slice=5 refers to the 5l of 8 contiguous optical sections at successively deeper depths of the tissue section; stack=7 refers to the number of the tissue section on the tissue array that is being processed; and threshold=26 refers to a threshold value of signal intensity below which intensity values are eliminated from calculations).
FIG. 8 is a histogram which illustrates how the threshold value (threshold=26) is determined in this example. The histogram plots the relative frequency of gray levels (which corresponds to signal intensity) across the pixels of the image of each optical section, where the x-axis is the frequency of occurrence, and the y-axis is a brightness value. This graph shows two modes, in which the first mode (the sharp spike) represents noise (primarily dark background), and the second mode (the broader peak) represents useful signals (such as brighter pixels associated with fluorescent intensity). To determine a saddle point (threshold value), the histogram is smoothed, and the derivative of the resulting graph is calculated. The point T=26 is the point at which the derivative of the curve changes value from + to -, and this is the value that is selected as the threshold. Pixels having a brightness below this level are eliminated from further data manipulation.
FIG. 9A shows an image which is displayed of contiguous signal segments in different optical sections 1-8 of a tissue stack (which corresponds to a microarray spot). This FISH signal is designated spot 224, and the image intensity of the signal segments that are present on optical sections 1-8 can be seen in the small image panels across the bottom of the display. Each small panel 1-8 is a section of the three-dimensional space within the tissue section, and the number of pixels where signal is present is shown by the white boxes in the display panels. Although several three-dimensional representations of the signal segments are shown in FIG. 9, the center representation (with segments labeled la-8a) will be used for purposes of illustration to explain how the different signal segments are combined into a single spot signal, or how certain signal segments are discarded. The relatively small brightness signals (which correlate with respectively few white pixels) in panels 1, 2, 3 and 4 are mapped into correspondingly small three- dimensional geometric boxes la, 2a, 3a and 4a, having a volume proportional to the area of the illuminated pixels in the panels 1-4. The absence of any illuminated pixels in panel 5 corresponds to an absence of any geometric figure in space 5 a (or can indicate that any pixels were below a threshold value and therefore not considered for further data manipulation). The area of the white pixels in panel 6 correspond to the volume 6a, and the area of the white pixels in panel 8 corresponds to the volume 8a, while the very large illuminated area in panel 7 corresponds to the figure 7a, which has the largest volume of any of the geometric figures.
One function of the algorithm is to eliminate background noise from actual probe signals. This can be done by setting a threshold value, below which a signal segment can be considered not to be a valid signal segment. The signal segments in panels 1, 2, 3, 4 and 6 do not meet the threshold criteria for minimum illuminated pixels, and are therefore not considered when combining the signal segments into an actual signal. The signal segments in panels 7 and 8 do, however, meet the criteria for minimum pixel illumination. Since more pixels are illuminated in panel 7 than in panel 8, the location of the signal is assigned to level 7, although alternatively it could be considered to reside in both levels 7 and 8.
Other signal overlap situations that the algorithm can resolve are also shown schematically in FIGS. 9B and 9C. In FIG. 9B a side view of optical sections 1-10 is shown, in which two spot signals vertically (axially with respect to the microscope lens) overlap one another. The signal segments in sections 1, 5, 6 and 10 fall below a pre-selected threshold value, and are eliminated from further consideration. The remaining signal segments in sections 2, 3, 7, 8 and 9 are above the threshold, and are grouped into a first spot signal at contiguous sections 2-4, and a second spot signal at contiguous sections 7-9. The probable location of the top spot signal can be assigned to section 3. where the strongest signal segment of the top spot signal is found, or alternatively the spot signal can be considered to reside at sections 2-4. The probable location of the bottom spot signal can be assigned to section 8, where the strongest signal segment of the bottom spot signal is found, or in sections 7-9 across which the signal segments extend. FIG. 9C schematically illustrates how vertically and/or horizontally
(transversely with respect to the optical axis of the microscope lens) overlapping spot signals can be resolved. A side view of optical sections 1 -8 is shown in which the signal segments in sections 1 and 8 do not meet a threshold, and are eliminated. In section 4, there are two horizontally non-contiguous signal segments. The signal segment 4B (represented by a gray bar) does not meet a threshold value and is eliminated, while the segment 4A (represented by the black bar) does satisfy the threshold and remains for consideration. Similarly, there are two non-contiguous signal segments in section 5. The segment 5B (represented by the gray bar) meets the threshold, while the segment 5 A (represented by the black bar) does not meet the threshold and is eliminated. Once the signal segments 4B and 5A are eliminated, the segments can be resolved into a top spot signal (comprising segments 2, 3 and 4A) and a bottom spot signal (comprising segments 5B, 6 and 7)
FIG. 10 illustrates a three-dimensional representation of a tissue section that has been subjected to FISH and analyzed by the method of the present invention. The three-dimensional space is divided into x-y-z coordinates, in which the z axis is associated with successive optical sections 1-8 of the tissue section, and the signal segments of all of the signals in the section are illustrated. Signal segments are illustrated as small black cylinders, and the assigned location of each spot is illustrated as a gray colored larger cylinder, some of which are designated Al, A2, A3, A4, A5, A6, A7 and A8. Each of these gray cylinders A is associated with a corresponding sphere B in the max plane, in which the volume of each sphere B can be proportional to the volume of the A cylinder with which it is associated. The system additionally provides an opportunity for a user to provide guidance during spot counting. For example, the user can specify a particular area of interest by selecting it on the screen. Typically, the max image (e.g., image M of FIG. 5B) is presented, and the user may select an area or areas via a pointing device (e.g., a mouse). Counting is then limited to only the selected area or areas. Such a feature can be particularly useful when the user recognizes that a certain area of the image relates to a region of interest.
The system also provides a way to eliminate a specified area or areas selected via a pointing device (e.g., a mouse). Portions of the image within the specified area or areas (sometimes called "gated areas") is ignored when spots are counted. EXAMPLE 5 Algorithm Overview
The method of the present invention facilitates the analysis and quantitation of FISH on tissue microarrays, by overcoming previous obstacles to automation. This method, as implemented in the algorithm of this example, applies dual-color FISH with different gene probes and corresponding chromosomal reference probes to tissue microarrays. Because each tissue section is a three-dimensional volume, a confocal microscope system is used to generate a three-dimensional stack of a number of fluorescent images (e.g., 24 images) over defined tumor areas of each of the 4 μm thick tissue specimens on the tissue microarrays. The algorithm identifies signal segments on each level along the Z-axis, and differentiates signal segments which overlap vertically above the X-Y plane.
The algorithm can be refined to account for spot clusters, and a variety of filtering mechanisms can be used to remove signals related to, for example, auto- fluorescence and other false signals. A variety of parameters used during the algorithm can be modified by the user to account for circumstances related to a particular image or set of images. For example, since the magnification used for an image can vary, the parameters of the algorithm can be modified to account for the relative size of a pixel. There are many other examples of parameters (e.g., numeric values) that can be varied to avoid a strict adherence to a particular value. Various results of intermediary calculations are stored to facilitate efficient re-calculation when parameters are modified by the user.
An overview of an implementation of the algorithm is shown in FIG. 13. Typically, the algorithm is provided with a set of digital image slices (e.g., a raster image) representing a set of observations (e.g., of a biological specimen subjected to FISH) taken at different depths along a z-axis via a confocal microscope. Thus, the set of images is sometimes called a "stack." The stack typically consists of anywhere from 8-32 images, although more images may be used. Satisfactory results have been obtained with 24 images. At 1302, possible fluorescent image components are identified in the slices.
For example, pixels meeting certain criteria are selected as possibly corresponding to a location within or near a spot and placed into a set of binarized images. The criteria can include an intensity level, and the images may be subjected to various filters before constructing the binary image.
At 1304, the resulting pixels in each slice are projected onto a projection image. For example, a binary projection image can be constructed from a stack of binary images by setting pixels in the binary projection image at locations (X,Y) whenever there is any pixel set in any of the stack of binary images at the same (X,Y) location (regardless of the Z location).
At 1306, insignificant regions in the image slices are discarded. So, for example, single lone pixels in a slice, or a contiguous set of 2-3 pixels are ignored, removed, or labeled as insignificant. A similar process can be applied to the projection image.
At 1308, each contiguous region in the projection image is considered in turn. For example, minimal rectangles can be drawn around each substantially contiguous region in the projection image, numbered, and considered in turn. At 1310, contiguous regions in the image slices associated with the projection image region under consideration are analyzed and grouped into spot candidates. At 1320, the next region in the projection image is considered, until each has been considered.
At 1322, filtering an augmentation can be applied to the candidate spots. For example, certain candidate spots may be false spots, or others may actually be clusters representing plural spots.
Finally, at 1324, the spots are counted. The spot count can then be used visa-vis other spot counts to provide a ratio (e.g., a ratio of genes to centromeres).
EXAMPLE 6
Details of Algorithm
In this method, the traditional signals-per-nucleus approach is replaced by an overall gene-to-chromosome ratio in a defined tumor area. Therefore, the ratio between the signals of gene probes and the reference probes are calculated irrespective of the number of nuclei. This new method for automated capturing and quantitation of FISH signals significantly advances the utility of tissue microarrays as a high-throughput tool for detection of relative gene copy number alterations during cancer development and progression.
Although the invention can be implemented in a variety of computing environments, the following examples are implemented in MATLAB, which is available from Mathworks of Natick, Massachusetts. Various MATLAB-based Graphical User Interface (GUI) tools allows visualization of three-dimensional shapes of spots as well as their two-dimensional projections on the slices. The dynamic interface provides for manipulation with many input parameters for the threshold, number of slices, and filtering. It also provides for visualization of differently colored spots, storage, displaying and printing data in the form of different 3-D images, diagrams, and tables.
An implementation of the algorithm in a scenario with 24 image slices can have the following features:
(1) A morphological top-hat transform is applied to each of the 24 images to yield 24 outputs, each possessing brightness intensity spikes jutting above an essentially flat background. Each bright spike can correspond either to a signal segment, or to noise.
(2) Each top-hat image is thresholded to produce a stack of 24 binary images showing spike locations. Morphological filters are applied to the binary images to eliminate noise, and touching spots are segmented.
(3) Binary spot markers occurring as vertical neighbors in the stack are grouped into one final spot located at a particular assigned stack level. Various parameters of the algorithm are set to fit the physical characteristics of the images. These include window size for the top-hat transform, threshold levels, and sizes of filter structuring elements.
The performance of the algorithm can be based on the following main steps: 1. Given L + 1 slices of the fluorescent images, Xn, Xi, ..., Xι, of sizes Nx , calculate the max-image Ymax: rmax(7, = maχ{Xk (j,i);k = 0,X,...,L}. The histogram, H(t), of this image is calculated and smoothed, and then the approximate value of the threshold, T, is determined as the minimal saddle-point between two modes of the histogram. The histogram is displayed on the screen and the threshold can be changed on-line. The chosen value of the threshold is used in the next steps for computing binary images.
2. For each image X^, k=0,l,...,L, the top-hat transform is calculated. Sometimes the top-hat transform is called "image minus opening." The calculation is
X, Xk ° B, (1)
where Xk o B is opening by the structuring element B , which is taken to be a circle fitting within a 7x7 square (roughly, a circle with a diameter of 7). Other structuring elements can be used (e.g., the following examples show a 5x5 square) and can be specified by the user via a user interface feature. The opening is defined by X o B = (X0 B) θ B , and calculated as follows:
(X0B)(j,i) = r n{X(j + j] ,i + i]);(j,i) e B}, (2) (X θ B)(j, i) = max{X(j +
Figure imgf000029_0001
,/ + , ); (j, 0 e B} . (3)
To calculate the top-hat transform in (1), the decomposition of the transform by two top-hat transforms with 1x5 and 5x1 structuring elements is used, which results in the fast performance of the transform 5x5:
X0 ø[ι 1 1 1 l] , (4)
Figure imgf000029_0002
Figure imgf000030_0001
Figure imgf000030_0002
The data of the top-hat transform * is stored for future processing in case the value of the threshold needs to be changed.
3. Each top-hat transform Yk is thresholded by the value T, resulting in a binary image Bk, k=0, 1,...,L,
Bk (j,i) = X, if Yk(j,i) > T, Bk(f,i) = 0, if Yk(j,i) ≤ T
A projection of multiple slices of the image can be calculated as
na O M l i Bk(j,i) = X ,
for at least one k. Typically, the projection is calculated with reference to all available image slices, in which case the binary image Bmax is sometimes called the "projection binary image" because it is roughly equivalent to peering down through the stack of binary images. In some sense, the binary image Bmax is also a "maximum" image; however, in the example, the maximum is computed comparing only l's and 0's.
4. The obtained binary images Bk, are processed to find the separate regions revealed by the thresholded top-hat transform. Each actual spot is composed of some number of these regions on different binary slices. Each of these slice regions is referred to as a signal segment, so that each spot signal is composed of a union of signal segments. The software can identify signal segments by locating substantially contiguous groups of pixels, number the signal segments, and calculate the following data:
A. coordinates of the signal segment center;
B. the number of pixels composing the signal segment; C. the coordinates of the minimal rectangle which contains the signal segment.
Signal segments composed of a single pixel are ignored during this procedure but their presence can be used during other steps.
5. The projection binary image Bmax is processed similarly, and its separate regions (projection binary image regions) are identified by locating substantially contiguous groups of pixels and numbered. The following are calculated for spot signals on the projection binary image:
A. coordinates of the projection binary image region center;
B. the number of pixels (l's) composing the region; C. the coordinates of the minimal rectangle which contains the region;
D. the number of signal segments from the different optical sections, which lie inside the surrounding rectangle of the region; and the largest such signal segment is marked.
The algorithm then attempts to identify each of the spots by integrating the signal segments into spot signals via analysis of the sizes, intensities, and locations (in 3 dimensions) of the signal segments. Typically, for purposes of determining whether a particular signal segment is of sufficient size to indicate a contiguous connection, a threshold (e.g., 3 pixels) is useful to designate as a cutoff. So, for example, signal segments having 3 or fewer pixels can be ignored when determining whether the slice above and/or below a signal segment is contiguous with the particular signal segment.
Specif ing such a threshold can be particularly useful in resolving closely- neighboring spots, such as two spots having substantially similar (X,Y) coordinates but appearing in different slices of the image. Examples of such scenarios are shown in FIGS. 9B and 9C. As shown, there can be more than one spot in a vertical direction or in a horizontal direction. During processing, segments that are substantially vertically contiguous can be associated with a particular optical slice having the greatest signal intensity, which is often the center of a single spot. However, it may be that there is another image slice with great signal intensity in the same segment, in which case there may be more than one spot in the segment.
Typically, the information about a spot candidate includes a size (e.g., total number of pixels across the image slices) and an intensity (e.g., the brightest pixel in the region of the slice having the most pixels for the spot candidate).
A data flow diagram for an implementation of the algorithm is shown in FIG. 14. The slice data 1402 is used to construct a max image 1404 and the top-hat slice data 1406. The max image 1404 is useful for presenting to the user to give the user an informative depiction of a combination of the image slices and for various user interface features. It can be constructed by taking the pixel with the greatest intensity out of the image slices in the z-plane. The top-hat slice data 1406 can be generated as described above. Max top- hat image 1408 can be generated by finding the greatest intensity of the top-hat slices for each (X,Y) location. The max top-hat image 1408 need not be displayed, but is useful, in combination with the top-hat slice data 1406 to construct the histogram 1410. A threshold 1412 is chosen with reference to the histogram 1410 as described above. Then, the thresholded slice data 1420 can be constructed by generating a binary image for each of the slices, where pixels in the slices are set if the slice image in the slice data 1402 exceeds the threshold 1412.
From the thresholded slice data 1420, substantially-contiguous regions of the thresholded slice data can be identified to generate regions 1422. The regions are filtered to produce filtered regions 1424 of the thresholded data.
Also from the thresholded slice data 1420, a projected binary image 1426 can be constructed, and substantially-contiguous regions 1430 within the projected binary image 1426 found. These regions can be filtered to generate filtered regions 1440 of the projected image.
Z-axis analysis data 1428 can be generated with reference to the filtered regions 1424 and the regions 1430 and used to generate the candidate spots 1432, from which, false spots 1434 and clusters 1442 can be identified. Additional filtering can be performed to generate spot count 1444. Finally, display data 1450 can be used to present graphical results to the user.
The data flows in the diagram are general only, and often data in upstream data components can be relied upon by downstream data components (or vice versa), even if not specifically shown in FIG. 14. As the data components are generated, they are often saved to permit efficient re-calculation when a parameter is adjusted to account for a particular image scenario.
In addition to the above-described filters, additional filtering mechanisms can be provided to improve accuracy of spot counting. For example, one feature relates to removing false spots relating to, for example, autofluorescence. Two separate approaches relate to removing based on intensity and intensity in combination with area. For example, after candidate spots are identified, an intensity of the spot can be calculated. If the intensity exceeds a certain threshold (say, two standard deviations above the mean intensity), the spot can be discarded and not counted. In addition, or alternatively, the intensity can be combined with (e.g., multiplied by) the area or volume of a spot. Again, if the value is over a particular threshold, the spot can be discarded and not counted. The approaches can be combined. For example, the image in FIG. 15A illustrates spot candidates in an image of normal prostate tissue in which the X centromere has been labeled via FISH. Some of the spot candidates are actually autofluorescent tissue, and can be eliminated from the spot count. The image can be presented as part of the user interface. The graph in FIG. 15B shows the intensity of the 149 spot candidates identified in the image. A mean and standard deviation of the spot candidate's intensities is calculated, and used as a threshold (e.g., two standard deviations above the mean). Spot candidates having intensities above the threshold are discarded (e.g., identified as "small autofluorescent tissue particles"). The graph in FIG. 15C is similar to that of 15B, but also includes an area component (e.g., by multiplying the intensity by the number of pixels" and not included in the spot count). Again, spot candidates having a rating above the threshold are discarded (e.g., identified as "large autofluorescent tissue particles" and not included in the spot count). Eight candidates can be removed because they exceed the threshold. The threshold in either case can be adjusted manually by a user if the default setting is inappropriate. FIG. 15D shows the image of 15 A, but the small autofluorescent tissue particles, large autofluorescent tissue particles, and the true FISH signals are differentiated by presenting each as a different color. The image can be presented as part of a user interface for the system. Eliminating the autofluorescent particles typically leads to more accurate spot count results. In a color version of the image, the image components 1510, among others, are portrayed in the color green to indicate that they are considered true FISH signals. The image components 1520 and 1521, among others, are portrayed in the color yellow to indicate that they are small autofluorescent tissue particles. The image components 1530-1533, among others, are portrayed in the color blue to indicate that they are large autofluorescent tissue particles. A variety of other colors or other ways of emphasizing and differentiating the image components related to autofluorescence can be used.
Still another feature than can be implemented is removal of spots appearing at substantially identical positions in two channels (e.g., red and green channels generated in two separate images, such as an image for a test probe and an image for a reference probe). Typically, such a situation means the spot can be ignored (e.g., not counted for either image).
Yet another feature relates to identifying clusters and estimating the number of spots in a cluster. In some cases, certain criteria (e.g., the size of a projection binary image region or the total number of pixels in slice regions related to a projection binary image region) indicates that a cluster of spots is present. And, the software can be configured to accept user guidance as to which portions of the image are a cluster (e.g., via selection by pointing device). Determining the number of spots in a cluster can be difficult; however, one way of providing accurate estimates is to provide a calibration mechanism. For example, in one instance, 200 sets of clusters were analyzed manually to determine how many spots were in the clusters. These clusters were then subjected to the above-described algorithmic analysis. Under certain circumstances, a factor of 2.5 was determined to be appropriate for determining how many spots were in a cluster. Thus, for example, if an area is identified as a cluster and the standard algorithmic analysis shows 4 spots, an estimate of 10 is provided. Other, more complex, analysis can be done, such as, for example, counting the total number of pixels associated with the signal segments related to a projection binary image region.
For clusters, a mapping between spots detected and actual spots can be used. Sometimes detection of a small number of spots (e.g., 3) may in fact indicate a tight cluster of (e.g., 9) spots, while detection of a larger number of spots (e.g. 7) may be a loose cluster of (e.g., 9) spots. Therefore, a uniform gain factor is not appropriate. The appropriate method for handling clusters may vary depending on the target being counted. For example, certain genes have a higher propensity for clustering than do centromeres. The user can manually adjust the cluster detection and counting parameters manually. The number of spot signals is counted and the results are provided to the user.
The interactive options of the program allow the threshold Tto be changed, and process the spot counting of the fluorescent images from step 3, avoiding the repeated calculation of the top-hat transforms.
The output data using this algorithm can include:
1. Original images of all slices.
2. Max-image.
3. Histogram of Max-image.
4. Max-image with colored spots. 5. Top-Hat transforms of the images.
6. Number of spots .
7. Max-image with colored spots after filtering out spots with less than some specified number of pixels.
8. Original images with colored spot-cuts. 9. Images with colored rectangles surrounding the spots.
10. 3-D view of each spot with sections lying on slices by vertical.
11. 2-D view of spot composition by projections. 12. 3-D view of a number of spots with their projections by slices.
13. Table of data for spot-cuts for L slices, which consists of:
Figure imgf000036_0001
where r.v.l. is the abbreviation of relative vertical location, xlefi, xrιghl, yboom , and y,op are the coordinates of the rectangles surrounding the spot-cuts. The last box indicates whether the spot-cut is below, on, or above the true spot.
14. Table of data for spots on the Max-image, which consists of:
Figure imgf000036_0002
EXAMPLE 7
Scatter Plot of Spot Candidates and Calibration Features
An additional user interface can be presented for assisting in analyzing and manipulating the data related to FISH experiments. For example, FIGS. 16A and 16B show scatter plots generated for a FISH experiment in which the presence of centromere 17 and gene HER-2 were detected to generate a ratio indicating possible amplification of HER-2.
Spots relating to centromere 17 typically do not cluster and are shown in FIG. 16A. Spot candidates are represented by appearing in a position appropriate for their intensity and size (e.g., number of pixels). The user can set a threshold parameter (e.g., 4) so that candidate spots having fewer than the specified number of pixels are excluded from the spot count. Also, a minimum intensity parameter can be set.
Spots relating to HER-2 are shown in FIG. 16B and are more difficult to count due to potential clustering. For purposes of illustration, areas of the plot roughly relating to background noise 1652, real signals 1654, and clusters 1656 are circled. To facilitate determining an appropriate threshold, the user is presented with a calibration feature. An image (e.g., the maximum image 1404) is presented to the user, who can then verify that certain spot candidates are indeed spots, although there are of minimal intensity. The user selects the spot candidate shown and activates a user interface element indicating the spot candidate is of minimal intensity.
Subsequently, when the scatter plot 1650 of FIG. 16B is shown, the spot candidates of minimal intensity are shown in red to facilitate choosing an appropriate minimum intensity threshold (e.g., by adjusting an appropriate parameter in the algorithm), below which spot candidates are not counted as spots. Thus some of the points 1658, typically near the bottom of the plot, are red. The minimum intensity threshold is typically adjusted to barely include the minimum intensity candidates.
Also useful is the illustration of clusters 1656, which can be used to adjust the cluster selection criteria. Additionally, the user can at any time click on a particular spot candidate (e.g., 1660) to navigate to information about the spot candidate, including a three-dimensional view of the spot candidate. At the user interface, the user can then manually designate the spot candidate as a cluster or non-cluster.
The scatter plot 1650 thus presents a useful tool for allowing the user to more easily grasp the totality of the image data and easily navigate to appropriate user interface screens for analyzing and adjusting the data, as well as calibrating the system.
EXAMPLE 8 User Direction of Areas to be Analyzed The system can also provide a user interface by which the user can guide the selection of areas to be analyzed. Although the system typically discards clearly aberrant data (e.g., a substantially total black or high-intensity image slice), the system may benefit from assistance from the user in determining which areas of an image are of interest or which are not. As described below, the user can select an area to be counted separately or an area not to be counted.
FIG. 17 shown a user interface 1702 presented to the user for designating areas of interest. Typically, the user interface 1702 presents a max image (e.g., 1404). By using a pointing device (e.g., a mouse or trackball), the user can designate a particular area 1706 as one to be processed separately. Multiple areas can be added to an area set and processed separately. In this way, the user can direct the system to focus on particularly relevant portions of the image.
The system also supports designating areas as not to be processed or included in the spot count. The algorithm can store the areas in a list and, for example, not count spots at a location (e.g., X, Y coordinates) within the area(s) designated.
The above feature is useful because areas of the image can sometimes be identified as debris, stroma, connective tissue, or blood vessels, which tend to interfere with determining an appropriate spot count in certain scenarios (e.g., some material may be particularly prone to autofluorescence).
EXAMPLE 9 User Interface Including Three-dimensional Representations of Spot Candidate Still another user interface feature can be presented by the system to assist in evaluation and manipulation of the image data. FIG. 18 shows a user interface 1800, which depicts a spot candidate in three views, 1802, 1804, and 1806. The views assist in determining, for example, whether the spot candidate is a cluster, as is likely the case in the illustrated example. The view 1806 is additionally processed to give a smoother representation of the spot candidate. The horizontally-contiguous region with the greatest area (e.g., number of pixels) is specially identified in the views with emphasis 1808. A status line 1818 indicates information about the spot candidate.
A strip along the interface indicates the max image, and the 15 image slices (numbered 0-14) in binary form after thresholding. The image slice having the greatest area region is noted at 1820. User interface controls 1832 and 1834 allow navigation to spot candidates and identified clusters, respectively.
In view of the many possible embodiments to which the principles of the invention may be applied, it should be recognized that the illustrated embodiments are examples of the invention, and should not be taken as a limitation on the scope of the invention. Rather, the scope of the invention is defined by the following claims. We therefore claim as our invention all that comes within the scope and spirit of these claims.

Claims

We claim:
1. A method for counting nucleic acid probe signals in a region of interest in a biological specimen, the method comprising: counting a number of test signals from a test probe; counting a number of reference signals from a reference probe; and determining a ratio of the counted test signals from the test probe to the counted reference signals from the reference probe, wherein the region of interest comprises multiple cells.
2. The method of claim 1, wherein the reference probe is a polynucleotide that hybridizes to a centromere, and the number of reference signals from the reference probe approximates a nucleus count in the biological specimen.
3. The method of claim 1, wherein the reference probe recognizes a target on a same chromosome as the test probe.
4. The method of claim 1 , wherein the test probe is a polynucleotide that hybridizes to a target sequence in a gene, and the reference probe is a polynucleotide that hybridizes to a reference sequence.
5. The method of claim 3, wherein the reference probe recognizes a centromere of the same chromosome on which the gene of interest is contained.
6. The method of claim 1, further comprising obtaining successive images of the region of interest to distinguish overlapping signals in the biological specimen.
7. The method of claim 6, wherein the successive images are optical sections of the region of interest.
8. The method of claim 7, wherein the optical sections are at different depths of the biological specimen.
9. The method of claim 8, wherein the successive images are transformed into digital representations in which contiguous signal segments in successive optical sections are combined into a single signal in a particular optical section in which a strongest signal segment is located.
10. The method of claim 6, wherein different successive images are obtained for the test probe signals and the reference probe signals, and a quantity of test probe signals and reference probe signals are determined.
11. The method of claim 6, wherein successive images are obtained which show distinguishable test probe signals and reference probe signals, and a quantity of the test probe signals and reference probe signals are determined.
12. The method of claim 6, wherein the successive images are obtained by confocal microscopy.
13. The method of claim 1 , wherein the ratio of signals is determined without reference to boundaries of a cell nucleus.
14. The method of claim 1 , wherein the ratio of signals is determined without reference to the boundaries of a cell.
15. A method of counting visible signals from probes used with in situ hybridization of biological samples, the method comprising: obtaining a plurality of images at different levels of the biological sample; constructing a three-dimensional image indicating discrete signals at different levels of the three-dimensional image; and counting the discrete signals at different levels of the three-dimensional image.
16. The method of claim 15, wherein the three-dimensional image is constructed by determining a location of a signal segment in the different levels of the biological sample, combining overlapping signal segments in contiguous levels into a single spot signal, and separating signal segments in non-contiguous levels into different spots.
17. The method of claim 16, wherein the location of signal segments is determined by the presence of an increase in brightness intensity that indicates an increase of signal as compared to a background signal.
18. The method of claim 17, wherein the probes display fluorescent signals, and the increase in brightness intensity is associated with an increase in fluorescence compared to the background signal.
19. The method of claim 15, wherein the signals comprise test signals from a test probe and reference signals from a reference probe.
20. The method of claim 19, wherein the test probe recognizes a gene of interest, and the reference probe recognizes a chromosomal locus having an expected quantity in the biological specimen.
21. The method of claim 20, further comprising determining a ratio between the test signals and the reference signals.
22. The method of claim 21, further comprising determining:
(a) whether there is an increase in an expected ratio between the test signal and the reference signal, indicating an amplification of the gene of interest; or
(b) whether there is a decrease in the expected ratio between the test signal and the reference signal, indicating relative loss of the gene of interest.
23. The method of claim 19, wherein the test probe is selected from the group consisting of probes that recognize genes implicated or suspected in the development or progression of a tumor.
24. The method of claim 15, wherein the biological samples are in a microarray.
25. The method of claim 24, wherein the microarray comprises a tissue microarray.
26. The method of claim 25, wherein the tissue microarray comprises tissue samples of a same tissue type taken from a plurality of donor specimens.
27. The method of claim 15, wherein the plurality of images consists of between eight and thirty two images at different levels of the biological sample.
28. The method of claim 15, further comprising: avoiding counting discrete signals having intensities exceeding a threshold intensity.
29. The method of claim 15, further comprising: avoiding counting discrete signals having a combined intensity and area exceeding a threshold value.
30. The method of claim 15, further comprising: avoiding counting discrete signals related to autofluorescent material.
31. The method of claim 15, further comprising: depicting a two-dimensional image representing the three-dimensional image for consideration by a user.
32. The method of claim 31 , further comprising: emphasizing discrete signals related to autofluorescent material in the two- dimensional image.
33. The method of claim 15 , further comprising : identifying a set of one or more discrete signals as a cluster; and counting the cluster as a number of discrete signals greater than the number of discrete signals in the set.
34. The method of claim 33 wherein the cluster is counted as a number of discrete signals indicated by applying a mapping to the number of discrete signals in the set.
35. The method of claim 33 wherein the cluster is counted as a number of discrete signals indicated by a function calibrated via manual counting of spots in a plurality of images.
36. The method of claim 33 wherein the cluster is counted as a number of discrete signals indicated by a gain factor applied to the number of discrete signals in the set.
37. The method of claim 15 wherein the plurality of images are a set of images taken during a first analysis of a first color channel, and a second set of images are taken of the biological sample for a second color channel, the method further comprising: avoiding counting discrete signals appearing at a same location in the set of images for the first color channel and the set of images in the second color channel.
38. The method of claim 15 wherein the plurality of images are a set of images taken for a test probe, and a second set of images are taken of the biological sample for a reference probe, the method further comprising: avoiding counting discrete signals appearing at a same location in the set of images for the test probe and the set of images for the reference probe.
39. The method of claim 15 further comprising: receiving a directive from a user indicating counting is to be avoided for a specified portion of the biological sample; and responsive to the directive, avoiding counting discrete signals for the specified portion of the biological sample.
40. The method of claim 15 further comprising: receiving a directive from a user indicating counting is to be performed separately for a specified portion of the biological sample; and responsive to the directive, separately counting discrete signals for the specified portion of the biological sample.
41. A high-throughput method of counting fluorescent in situ hybridization signals, comprising: providing an array of biological samples; hybridizing the biological samples with a fluorescent test probe that hybridizes to a gene of interest in the biological samples and with a fluorescent reference probe that hybridizes to a chromosomal reference locus in the biological samples; obtaining images by confocal microscopy of contiguous sections at different depths of a plurality of the biological samples; detecting fluorescent signal segments from the contiguous sections, and segmenting contiguous signals in different contiguous sections into a corresponding single spot signal; and projecting each spot signal into a two-dimensional plane, and counting the spots.
42. The method of claim 41 , wherein the fluorescent spot signals from the test probe are obtained and detected separately from the fluorescent spot signals from the reference probe.
43. The method of claim 42, wherein the fluorescent spot signals from the test probe are obtained and detected simultaneously with the fluorescent spot signals from the reference probe.
44. The method of claim 41 , wherein the array comprises an array of at least 6 different specimens.
45. The method of claim 44, wherein fluorescent in situ hybridization signals are counted in at least 50 different specimens.
46. The method of claim 41 , wherein the method is a computer implemented system.
47. The method of claim 41 , wherein providing an array of biological samples comprises providing an array of biological specimens: obtaining a plurality of donor specimens; placing each donor specimen in an assigned location in a recipient array; and obtaining a plurality of copies of the recipient array in a manner that each copy contains a plurality of donor specimens that maintain their assigned locations.
48. A method for counting signals from in situ hybridization of probes in biological tissue, the method comprising: obtaining digital images of different levels of the biological tissue with a confocal microscope; detecting spot signals at the different levels and separating overlying signals from one another.
49. The method of claim 48, wherein the spot signals are detected at different levels with a computer implemented system that detects fluorescent signals as signal segments from the different levels, wherein the levels are contiguous optical sections, and the system segments contiguous signal segments from different vertical levels into one spot signal, while separating non-contiguous signal segments from different levels into different spot signals.
50. The method of claim 49, wherein the computer implemented system projects the spot signals into a single level and counts the spot signals.
51. The method of claim 49, wherein the computer implemented system detects fluorescent signals as spot signals by
(a) performing a morphological transform to digital images of the different levels to obtain digital representations of brightness intensity that indicate a signal segment;
(b) eliminating signal segments below a threshold;
(c) grouping contiguous signal segments into a single spot.
52. The method of claim 51 , further comprising: avoiding signal segments having an intensity above a threshold.
53. The method of claim 51 , further comprising: identifying clusters within the signal segments; and counting the clusters each as a plurality of spots.
54. A device for counting signals from in situ hybridization of probes in biological tissue, the device comprising: a confocal microscope; a digital camera positioned to obtain digital optical sections of different levels of the biological tissue; and a computer implemented system that detects brightness signals at the different levels and separates overlapping brightness signals from one another.
55. The device of claim 54, wherein the computer implemented system detects fluorescent signals as signal segments from the different levels, wherein the levels are contiguous, and the system groups contiguous signal segments from different levels into one spot signal, while separating non-contiguous signal segments from different levels into different spot signals.
56. The device of claim 55, wherein the computer implemented system counts the spot signals.
57. The device of claim 55, wherein the computer implemented system detects fluorescent signals as spot signals by
(a) performing a morphological transform to digital images of the different levels to obtain a digital representation of brightness intensity that indicate a signal segment; (b) eliminating signal segments below a threshold;
(c) segmenting contiguous signal segments; and
(d) grouping vertical contiguous signal segments into a single spot signal at an assigned level associated with a greatest signal segment intensity.
58. A computer-readable medium comprising computer-executable instructions for performing the following: within a stack of image slices generated from a plurality of confocal microscopic observations of a FISH experiment as a plurality of depths along a z- axis, identifying possible fluorescent image components; projecting the possible fluorescent image components within the image slices onto a projection image; discarding insignificant contiguous possible fluorescent image components in the slices; for each contiguous region in the projection image, grouping regions of possible florescent image components associated with the contiguous region in the projection image into spot candidates; applying a filter to the spot candidates; and counting the remaining spot candidates as spots.
59. The computer-readable medium of claim 58 wherein insignificant contiguous possible fluorescent image components are determined by comparing a size of a contiguous possible fluorescent image component with a threshold.
60. A computer-generated user interface for presenting results of microscopic observation of biological tissue subjected to a FISH experiment, the user interface comprising: a scatter plot of sets of image components designated as spot candidates for the FISH experiment; wherein the scatter plot comprises points indicating a size and intensity of spot candidates.
61. The computer-generated user interface of claim 60 wherein at least one point is operable to receive a user interface activation to navigate to a user interface display of information for a spot candidate associated with the point.
62. The computer-generated user interface of claim 60 wherein at least one point is operable to receive a user interface activation to navigate to a user interface display of a three-dimensional depiction of a spot candidate associated with the point.
63. The computer-generated user interface of claim 60 further comprising: a display image depicting a view of the tissue subjected to the FISH experiment and a depiction of least one candidate spot thereon; wherein the depiction of the candidate spot is operable to receive a user interface activation to designate the spot as a minimal intensity spot; and wherein candidate spots designated as minimal intensity spots are visually emphasized when presenting the scatter plot.
PCT/US2000/025465 1999-09-17 2000-09-15 Signal counting for in situ hybridization WO2001020044A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU74949/00A AU7494900A (en) 1999-09-17 2000-09-15 Signal counting for in situ hybridization

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US15460199P 1999-09-17 1999-09-17
US60/154,601 1999-09-17

Publications (2)

Publication Number Publication Date
WO2001020044A2 true WO2001020044A2 (en) 2001-03-22
WO2001020044A3 WO2001020044A3 (en) 2002-05-10

Family

ID=22551987

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/025465 WO2001020044A2 (en) 1999-09-17 2000-09-15 Signal counting for in situ hybridization

Country Status (2)

Country Link
AU (1) AU7494900A (en)
WO (1) WO2001020044A2 (en)

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2826977A1 (en) * 2001-07-03 2003-01-10 Imstar S A METHOD AND SYSTEM FOR DETECTING INTER-CHROMOSOMAL IMBALANCE BY IN SITU HYBRIDIZATION OF FLUORESCENT PROBES (FISH) ON INTERPHASIC CELL CORES
WO2004113922A2 (en) * 2003-06-19 2004-12-29 Applied Precision, Llc System and method employing photokinetic techniques in cell biology imaging applications
WO2005076216A2 (en) * 2004-02-03 2005-08-18 Bioimagene, Inc. Method and system for automaticed digital image based flourescent in situ hybridization (fish) analysis
US7219016B2 (en) 2001-04-20 2007-05-15 Yale University Systems and methods for automated analysis of cells and tissues
GB2459384A (en) * 2008-04-24 2009-10-28 Univ Nottingham Trent Homogeneous immunoassay comprising z-axis fluorescence imaging
US7709222B2 (en) 2006-07-13 2010-05-04 Yale University Methods for making cancer prognoses based on subcellular localization of biomarkers
US8367351B2 (en) 2006-05-05 2013-02-05 Historx, Inc. Methods for determining signal transduction activity in tumors
US8428887B2 (en) 2003-09-08 2013-04-23 Ventana Medical Systems, Inc. Method for automated processing of digital images of tissue micro-arrays (TMA)
US8515683B2 (en) 2003-09-10 2013-08-20 Ventana Medical Systems, Inc. Method and system for automated detection of immunohistochemical (IHC) patterns
WO2014043253A1 (en) * 2012-09-11 2014-03-20 Neogenomics Laboratories Automated fish reader using learning machines
US8755580B2 (en) 2012-03-17 2014-06-17 Sony Corporation Flourescent dot counting in digital pathology images
US9042631B2 (en) 2013-01-24 2015-05-26 General Electric Company Method and systems for cell-level fish dot counting
JP2016507759A (en) * 2013-02-25 2016-03-10 フラッグシップ バイオサイエンシズ エルエルシーFlagship Biosciences Llc Cell-based tissue analysis
EP2450457B1 (en) * 2010-11-09 2017-08-16 Olympus Corporation Method of analyzing genetically abnormals cells
WO2019219651A1 (en) * 2018-05-15 2019-11-21 Ventana Medical Systems, Inc. Quantitation of signal in stain aggregates
CN111175267A (en) * 2020-01-18 2020-05-19 珠海圣美生物诊断技术有限公司 Cell interpretation method and system based on FISH technology
CN114463609A (en) * 2022-04-13 2022-05-10 珠海横琴圣澳云智科技有限公司 Method and device for determining fluorescent mixed points in cell microscopic image
CN114739970A (en) * 2022-06-09 2022-07-12 珠海横琴圣澳云智科技有限公司 Method and device for judging fluorescent signal point fracture
CN116144739A (en) * 2023-03-06 2023-05-23 中国医学科学院北京协和医院 Gene rearrangement signal recording method based on fluorescence in situ hybridization of separation probe

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5817462A (en) * 1995-02-21 1998-10-06 Applied Spectral Imaging Method for simultaneous detection of multiple fluorophores for in situ hybridization and multicolor chromosome painting and banding
WO1999044063A2 (en) * 1998-02-25 1999-09-02 The United States Of America, Represented By The Secretary, Department Of Health And Human Services Tumor tissue microarrays for rapid molecular profiling

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5817462A (en) * 1995-02-21 1998-10-06 Applied Spectral Imaging Method for simultaneous detection of multiple fluorophores for in situ hybridization and multicolor chromosome painting and banding
WO1999044063A2 (en) * 1998-02-25 1999-09-02 The United States Of America, Represented By The Secretary, Department Of Health And Human Services Tumor tissue microarrays for rapid molecular profiling

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
AUBELE ET AL.: "Comparative FISH analysis of numerical chromosome 7 abnormalities in 5-um and 15-um paraffin-embedded tissue sections from prostatic carcinoma" HISTOCHEMISTRY AND CELL BIOLOGY, vol. 107, no. 2, 1997, pages 121-126, XP001024090 *
RODENACKER ET AL.: "Groping for quantitative digital 3-D image analysis: an approach to quantitative fluorescence in situ hybridization in thick tissue sections of prostate carcinoma" ANALYTICAL CELLULAR PATHOLOGY, vol. 15, no. 1, 1997, pages 19-29, XP001024113 *

Cited By (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10217219B2 (en) 2001-04-20 2019-02-26 Yale University Systems and methods for automated analysis of cells and tissues
US11315250B2 (en) 2001-04-20 2022-04-26 Yale University Systems and methods for automated analysis of cells and tissues
US8639450B2 (en) 2001-04-20 2014-01-28 Yale University Systems and methods for automated analysis of cells and tissues
US8185320B2 (en) 2001-04-20 2012-05-22 Yale University Systems and methods for automated analysis of cells and tissues
US8121794B2 (en) 2001-04-20 2012-02-21 Yale University Systems and methods for automated analysis of cells and tissues
US7219016B2 (en) 2001-04-20 2007-05-15 Yale University Systems and methods for automated analysis of cells and tissues
US8036833B2 (en) 2001-04-20 2011-10-11 Yale University Systems and methods for automated analysis of cells and tissues
US7873480B2 (en) 2001-04-20 2011-01-18 Yale University Systems and methods for automated analysis of cells and tissues
WO2003004700A1 (en) * 2001-07-03 2003-01-16 Imstar Image Et Modelisation : Strategie, Analyse Et Realisation Method and system for detecting inter-chromosomal imbalance by fluorescent in situ hybridization (fish) on interphase nuclei
FR2826977A1 (en) * 2001-07-03 2003-01-10 Imstar S A METHOD AND SYSTEM FOR DETECTING INTER-CHROMOSOMAL IMBALANCE BY IN SITU HYBRIDIZATION OF FLUORESCENT PROBES (FISH) ON INTERPHASIC CELL CORES
US7683345B2 (en) 2003-06-19 2010-03-23 Applied Precision, Inc. System and method employing photokinetic techniques in cell biology imaging applications
US7608840B2 (en) 2003-06-19 2009-10-27 Applied Precision, Llc System and method employing photokinetic techniques in cell biology imaging applications
US7408176B2 (en) 2003-06-19 2008-08-05 Applied Precision, Llc System and method employing photokinetic techniques in cell biology imaging applications
WO2004113922A3 (en) * 2003-06-19 2005-06-02 Applied Precision Llc System and method employing photokinetic techniques in cell biology imaging applications
WO2004113922A2 (en) * 2003-06-19 2004-12-29 Applied Precision, Llc System and method employing photokinetic techniques in cell biology imaging applications
US8428887B2 (en) 2003-09-08 2013-04-23 Ventana Medical Systems, Inc. Method for automated processing of digital images of tissue micro-arrays (TMA)
US8515683B2 (en) 2003-09-10 2013-08-20 Ventana Medical Systems, Inc. Method and system for automated detection of immunohistochemical (IHC) patterns
WO2005076216A3 (en) * 2004-02-03 2005-10-27 Bioimagene Inc Method and system for automaticed digital image based flourescent in situ hybridization (fish) analysis
WO2005076216A2 (en) * 2004-02-03 2005-08-18 Bioimagene, Inc. Method and system for automaticed digital image based flourescent in situ hybridization (fish) analysis
US8367351B2 (en) 2006-05-05 2013-02-05 Historx, Inc. Methods for determining signal transduction activity in tumors
US8497080B2 (en) 2006-05-05 2013-07-30 Historx, Inc. Methods for determining signal transduction activity in tumors
US7709222B2 (en) 2006-07-13 2010-05-04 Yale University Methods for making cancer prognoses based on subcellular localization of biomarkers
GB2459384A (en) * 2008-04-24 2009-10-28 Univ Nottingham Trent Homogeneous immunoassay comprising z-axis fluorescence imaging
EP2450457B1 (en) * 2010-11-09 2017-08-16 Olympus Corporation Method of analyzing genetically abnormals cells
US8755580B2 (en) 2012-03-17 2014-06-17 Sony Corporation Flourescent dot counting in digital pathology images
WO2014043253A1 (en) * 2012-09-11 2014-03-20 Neogenomics Laboratories Automated fish reader using learning machines
US9378407B2 (en) 2012-09-11 2016-06-28 Neogenomics Laboratories, Inc. Automated fish reader using learning machines
US9042631B2 (en) 2013-01-24 2015-05-26 General Electric Company Method and systems for cell-level fish dot counting
EP2946190A4 (en) * 2013-02-25 2016-11-23 Flagship Biosciences Llc Cell-based tissue analysis
JP2016507759A (en) * 2013-02-25 2016-03-10 フラッグシップ バイオサイエンシズ エルエルシーFlagship Biosciences Llc Cell-based tissue analysis
WO2019219651A1 (en) * 2018-05-15 2019-11-21 Ventana Medical Systems, Inc. Quantitation of signal in stain aggregates
CN112424825A (en) * 2018-05-15 2021-02-26 文塔纳医疗系统公司 Quantification of signal in stained aggregates
US20220148176A1 (en) * 2018-05-15 2022-05-12 Ventana Medical Systems, Inc. Quantitation of Signal in Stain Agrregates
US11615532B2 (en) 2018-05-15 2023-03-28 Ventana Medical Systems, Inc. Quantitation of signal in stain aggregates
CN111175267A (en) * 2020-01-18 2020-05-19 珠海圣美生物诊断技术有限公司 Cell interpretation method and system based on FISH technology
CN114463609A (en) * 2022-04-13 2022-05-10 珠海横琴圣澳云智科技有限公司 Method and device for determining fluorescent mixed points in cell microscopic image
CN114739970A (en) * 2022-06-09 2022-07-12 珠海横琴圣澳云智科技有限公司 Method and device for judging fluorescent signal point fracture
CN116144739A (en) * 2023-03-06 2023-05-23 中国医学科学院北京协和医院 Gene rearrangement signal recording method based on fluorescence in situ hybridization of separation probe

Also Published As

Publication number Publication date
WO2001020044A3 (en) 2002-05-10
AU7494900A (en) 2001-04-17

Similar Documents

Publication Publication Date Title
US7835869B2 (en) Method and apparatus for computer controlled rare cell, including fetal cell, based diagnosis
WO2001020044A2 (en) Signal counting for in situ hybridization
US7522757B2 (en) Method and apparatus for computer controlled cell based diagnosis
US8559693B2 (en) Systems and methods for automated characterization of genetic heterogeneity in tissue samples
US20050265588A1 (en) Method and system for digital image based flourescent in situ hybridization (FISH) analysis
EP3182121A1 (en) Method and/or system for analyzing biological samples using a computer system
US20130078636A1 (en) Method for detecting and quantitating multiple subcellular components
US20140221227A1 (en) Automated cancer diagnostic methods using fish
US7901887B2 (en) Automated cancer diagnostic methods using fish
EP2327040B1 (en) A method and a system for determining a target in a biological sample by image analysis
JP2008518587A (en) Detection and quantification of subcellular components
WO2005076216A2 (en) Method and system for automaticed digital image based flourescent in situ hybridization (fish) analysis
Rodenacker et al. Groping for quantitative digital 3‐D image analysis: an approach to quantitative fluorescence in situ hybridization in thick tissue sections of prostate carcinoma
Grigoryan et al. Morphological spot counting from stacked images for automated analysis of gene copy numbers by fluorescence in situ hybridization
Grigoryan et al. Rapid analysis of gene copy numbers in cancer by FISH and automated spot counting from stacked confocal images

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP