AU741337B2 - Method and system for automated detection of clustered microcalcifications from digital mammograms - Google Patents

Method and system for automated detection of clustered microcalcifications from digital mammograms Download PDF

Info

Publication number
AU741337B2
AU741337B2 AU92955/98A AU9295598A AU741337B2 AU 741337 B2 AU741337 B2 AU 741337B2 AU 92955/98 A AU92955/98 A AU 92955/98A AU 9295598 A AU9295598 A AU 9295598A AU 741337 B2 AU741337 B2 AU 741337B2
Authority
AU
Australia
Prior art keywords
image
microcalcifications
pixel
dog
clustered
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
AU92955/98A
Other versions
AU9295598A (en
Inventor
Elton P. Amburn
Telford S. Berkey
Randy P. Broussard
Martin P. Desimio
Jeffrey W. Hoffmeister
Edward M. Ochoa
Thomas F. Rathbun
Steven K. Rogers
John E. Rosenstengel
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualia Computing Inc
Original Assignee
Qualia Computing Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualia Computing Inc filed Critical Qualia Computing Inc
Publication of AU9295598A publication Critical patent/AU9295598A/en
Application granted granted Critical
Publication of AU741337B2 publication Critical patent/AU741337B2/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B29WORKING OF PLASTICS; WORKING OF SUBSTANCES IN A PLASTIC STATE IN GENERAL
    • B29CSHAPING OR JOINING OF PLASTICS; SHAPING OF MATERIAL IN A PLASTIC STATE, NOT OTHERWISE PROVIDED FOR; AFTER-TREATMENT OF THE SHAPED PRODUCTS, e.g. REPAIRING
    • B29C49/00Blow-moulding, i.e. blowing a preform or parison to a desired shape within a mould; Apparatus therefor
    • B29C49/0015Making articles of indefinite length, e.g. corrugated tubes
    • B29C49/0021Making articles of indefinite length, e.g. corrugated tubes using moulds or mould parts movable in a closed path, e.g. mounted on movable endless supports
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/20Image enhancement or restoration by the use of local operators
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B6/00Apparatus for radiation diagnosis, e.g. combined with radiation therapy equipment
    • A61B6/50Clinical applications
    • A61B6/502Clinical applications involving diagnosis of breast, i.e. mammography
    • FMECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
    • F16ENGINEERING ELEMENTS AND UNITS; GENERAL MEASURES FOR PRODUCING AND MAINTAINING EFFECTIVE FUNCTIONING OF MACHINES OR INSTALLATIONS; THERMAL INSULATION IN GENERAL
    • F16LPIPES; JOINTS OR FITTINGS FOR PIPES; SUPPORTS FOR PIPES, CABLES OR PROTECTIVE TUBING; MEANS FOR THERMAL INSULATION IN GENERAL
    • F16L11/00Hoses, i.e. flexible pipes
    • F16L11/04Hoses, i.e. flexible pipes made of rubber or flexible plastics
    • F16L11/11Hoses, i.e. flexible pipes made of rubber or flexible plastics with corrugated wall
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • G06T7/0012Biomedical image inspection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/187Segmentation; Edge detection involving region growing; involving region merging; involving connected component labelling
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B29WORKING OF PLASTICS; WORKING OF SUBSTANCES IN A PLASTIC STATE IN GENERAL
    • B29CSHAPING OR JOINING OF PLASTICS; SHAPING OF MATERIAL IN A PLASTIC STATE, NOT OTHERWISE PROVIDED FOR; AFTER-TREATMENT OF THE SHAPED PRODUCTS, e.g. REPAIRING
    • B29C2791/00Shaping characteristics in general
    • B29C2791/004Shaping under special conditions
    • B29C2791/006Using vacuum
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B29WORKING OF PLASTICS; WORKING OF SUBSTANCES IN A PLASTIC STATE IN GENERAL
    • B29CSHAPING OR JOINING OF PLASTICS; SHAPING OF MATERIAL IN A PLASTIC STATE, NOT OTHERWISE PROVIDED FOR; AFTER-TREATMENT OF THE SHAPED PRODUCTS, e.g. REPAIRING
    • B29C2791/00Shaping characteristics in general
    • B29C2791/004Shaping under special conditions
    • B29C2791/007Using fluid under pressure
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B29WORKING OF PLASTICS; WORKING OF SUBSTANCES IN A PLASTIC STATE IN GENERAL
    • B29LINDEXING SCHEME ASSOCIATED WITH SUBCLASS B29C, RELATING TO PARTICULAR ARTICLES
    • B29L2023/00Tubular articles
    • B29L2023/18Pleated or corrugated hoses
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10116X-ray image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30004Biomedical image processing
    • G06T2207/30068Mammography; Breast

Description

WO 99/09887 PCT/US98/17886 -1- METHOD AND SYSTEM FOR AUTOMATED DETECTION OF CLUSTERED MICROCALCIFICATIONS FROM DIGITAL MAMMOGRAMS BACKGROUND OF THE INVENTION 1. Field of the Invention This invention relates to a method and system for automated detection of clustered microcalcifications from digital images without reduction of radiologist sensitivity.
2. Discussion of Background Mammography, along with physical examination, is the current procedure of choice for breast cancer screening. Screening mammography has been responsible for an estimated 30 to 35 percent reduction in breast cancer mortality rates. However, in 1996 approximately 185,700 new breast cancer cases were diagnosed and 44,300 women died from this disease. Women have about a 1 in 8 chance of being diagnosed with breast cancer, and 1 in 30 will die of this disease in her lifetime.
Although mammography is a well-studied and standardized methodology, for 10 to 30 percent of women diagnosed with breast cancer, their mammograms were interpreted as negative. Additionally, only 10 to 20 percent of patients referred for biopsy based on mammniographic findings prove to have cancer.
Further, estimates indicate the malignancies missed by radiologists are evident in two-thirds of the mammograms retrospectively. Missed detections may be attributed to several factors including: poor image quality, improper patient positioning, inaccurate interpretation, fibroglandular tissue obscuration, subtle nature of radiographic findings, eye fatigue, or oversight.
To increase sensitivity, a double reading has been suggested.
However, the growing increase in the number of screening mammograms makes this option unlikely. Alternatively, a computer-aided diagnosis (CAD or CADx) system may act as a "second reader" to assist the radiologist in detecting and diagnosing lesions. Several investigators have attempted to analyze mammographic abnormalities with digital computers. However, the known studies are believed to have achieved rates of true-positive detections versus false-positive detections that 2 are undesirably low.
Microcalcifications represent an ideal target for automated detection because subtle microcalcifications are often the first and sometimes the only radiographic findings in early, curable breast cancers, yet individual microcalcifications in a suspicious cluster have a fairly limited range of radiographic appearances. Between 30 and percent of breast carcinomas detected radiographically demonstrate microcalcifications on mammograms, and between and 80 percent of breast carcinomas reveal microcalcifications upon microscopic examination. Any increase in the detection rate of microcalcifications by mammography will lead to further improvements in its efficacy in the detection of early breast cancer.
the ability of physicians to diagnose cancer, the problem is that all CAD systems fail to detect some regions of interest that could be found by a human interpreter.
However, human interpreters also miss regions of interest that are subsequently shown to be indicators of cancers.
a false negative error while associating a normal region 25 with a cancer is termed a false positive error.
incorporated by practicing radiologists into their mammographic analyses. No existing CAD system can claim to find all of the suspicious regions detected by an average radiologist, and they tend to have unacceptably high false positive error rates. However, CAD systems are capable of \\melb files\homeS\KarraR\Keep\speci\92955-98 .doc 3/10/01 3 finding some suspicious regions that may be missed by radiologists.
Accordingly, it is an object of the present invention to provide a method for automated clustered microcalcification detection by digital image processing in screening mammography.
SUMMARY OF THE INVENTION According to the present invention there is provided a method for automated clustered microcalcification detection by digital image processing in screening mammography, comprising: storing a digital mammogram; applying a filtering algorithm to said digital representation to obtain a filtered image essentially comprising suspected microcalcifications; define coordinates representative of the location of said suspected microcalcifications; grouping said coordinates into clusters, said clusters representing microcalcifications located .within a predetermined distance of a predetermined number of other microcalcifications and comprising potential clustered microcalcifications; 25 computing features for each of said potential clustered microcalcifications; eliminating potential clustered microcalcifications based on said computed features for each of said potential clustered microcalcifications, and; indicating in said digital mammogram image the locations of those potential clustered microcalcifications remaining after the previous step.
\\melb-files\home\KarraR\Keep\speci\9295598.doc 3/10/01 3a BRIEF DESCRIPTION OF THE DRAWINGS Fig. 1 is a flow diagram illustrating the automated system for the detection of clustered microcalcifications in a digital mammogram; Figs. 2 and 3 are flow diagrams illustrating the autocropping method and system of the invention; Figs. 4-10 are flow diagrams illustrating in more detail the autocropping method and system of the invention; Fig. 11 is a flow diagram illustrating in greater detail the clustered **o \\melbf iles\home\KarraR\Keep\speci\9295598 .doe 3/10/01 WO 99/09887 PCTIUS98/1 7886 -4microcalcification detector of the invention; Fig. 12 is a schematic diagram illustrating a 3 x 3 cross-shaped median filter of the invention; Fig. 13 is a three-dimensional plot of a Difference of Gaussians (DoG) filter kernel; Fig. 14 is a cross-sectional view through the center of the DoG filter kernel of Fig. 13; Fig. 15 is a flow diagram illustrating the global thresholding portion of the microcalcification detection system; Fig. 16 is a flow diagram illustrating the dual local thresholding of the invention; Fig. 17 is a flow diagram illustrating combining the results of global and dual-local thresholding; Fig. 18 is a flow diagram illustrating the sloping local thresholding of the invention; Fig. 19 is a flow diagram illustrating the clustering method of the invention; Fig. 20 is a schematic diagram illustrating the clustering method of the invention; Fig. 21 is a flow diagram illustrating the feature computation process of the invention; Fig. 22 is a flow diagram illustrating a classifier having one discriminant function per class; Fig. 23 is a schematic diagram illustrating a multi-layer perceptron neural network for a two-class classifier; Fig. 24 is a histogram of testing results after detection and classification; Fig. 25 is a flow diagram illustrating the parameter optimization method of the invention; Fig. 26 is a plot of a free response receiver operating characteristic curve of the invention before classifying detections; WO 99/09887 PCT/US98/17886 Fig. 27 is a plot of a free response receiver operating characteristic curve of the invention after classifying detections; Fig. 28 is a plot of probability density functions showing the relationship between the probabilities of false negative and false positive detections; Fig. 29 is a plot of probability density functions showing the relationship between the probabilities of true negative and true positive detections; Fig. 30 is a Venn diagram showing the relationship between radiologist and CAD system detections; Fig. 31 is a flow diagram illustrating a method for incorporating computer-aided diagnosis detections with those of a human interpreter for optimal sensitivity; and Fig. 32 is a flow diagram illustrating an alternative embodiment of the invention that includes a density detector.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT Referring now to the drawings, wherein like reference numerals designate identical or corresponding parts throughout the several views, and more particularly to Fig. 1 thereof, there is shown a flow diagram illustrating a sequence of steps performed in order to detect the locations of clusters of microcalcifications within a digital mammogram.
In a first step 100, a digital mammogram is obtained using hardware such as digital mammography systems, or by digitizing mammography films using laser or charge-coupled device (CCD) digitizers. In an optimized cropping step 200, a rectangular analysis region containing breast tissue is segmented from the digital mammogram image and a binary mask corresponding to the breast tissue is created for use in later processing steps to decrease the time required for processing the mammogram image. The binary mask is also used to limit detections to areas of the image containing breast tissue.
Clustered microcalcifications are detected in a clustered microcalcification detection step 300. After first filtering the cropped image with a median filter to reduce noise, the image is filtered using an optimized difference of WO 99/09887 PCTIUS98/17886 -6- Gaussians (DoG) filter to enhance the microcalcifications. The DoG-filtered image is then subjected to optimized threshold tests to detect potential microcalcifications.
The detected microcalcifications are shrunk to single-pixel representations and detections outside of the breast area are removed. The remaining microcalcifications are grouped into clusters. Features are then computed for the clusters. Detected clusters are classified as either suspicious or non-suspicious in a classification step 400.
The parameters used by the autocropping, clustered microcalcification detection, and classification steps 200, 300, 400 are optimized in a parameteroptimizing step 500. The parameters are optimized by parameter-optimizing means that uses a genetic algorithm (GA) so as to maximize the true-positive detection rate while minimizing the false-positive detection rate. Of course, other optimization schemes may be used as well.
The detected clustered microcalcifications are stored in a list of image coordinates. The detection results are processed in a processing step 600 by simply adding an offset to each of the microcalcification coordinates to account for translation of the coordinates incurred as a result of the cropping procedure.
Detected clustered microcalcifications are indicated on the digital mammogram by means of rectangles drawn around the clustered microcalcifications in a display step 700. Other indicators may be used such as, for example, arrows pointing to suspected microcalcifications, or ellipses around suspected microcalcifications.
ACQUIRING A DIGITAL REPRESENTATION OF A MAMMOGRAM One method of obtaining digital mammograms comprises digitizing radiologic films by means of a laser or charge-coupled device (CCD) scanner.
Digital images obtained in this manner typically have a sample spacing of about 100 utm per pixel, with a gray-level resolution of 10 to 12 bits per pixel. In one embodiment of the present invention, radiologic films are scanned using a Model CX812T digitizer manufactured by Radiographic Digital Imaging of Compton, California, to produce digital images having 50 gim spacing per pixel and 12 bits of gray-level resolution per pixel. Another possible input source for digital images is a WO 99/09887 PCT[US98/1 7886 -7digital mammography unit from Trex Medical Corporation of Danbury, Connecticut, which has a spatial resolution of about 45 .lm per pixel and a gray-level resolution of 14 bits per pixel.
The digital images are stored as digital representations of the original mammogram images on computer-readable storage media. In a preferred embodiment, the digital representations or images are stored on a 2 GB hard drive of a general-purpose computer such as a PC having dual Pentium II microprocessors running at 200 MHZ, 512 MB of RAM memory, a ViewSonic PT813 monitor, a pointing device, and a Lexmark Optra S 1625 printer. The system operates within a Windows NT operating system.
AUTOCROPPING
As may be seen in Figs. 2 and 3, a digital mammogram image 190 is first cropped to segment an analysis region 296 from the image and produce a binary mask 298 corresponding to breast tissue in the analysis region. Preferably, the cropping is performed automatically, although it could be cropped manually. The image is cropped as a preliminary step because the breast tissue does not cover the whole radiographic film. Focusing the processing of the image on only that portion of the image which breast tissue reduces the time required to process the image.
Also, other items appearing on the film, such as labels and patient information, are excluded from consideration, and false-positive indications lying outside of the breast tissue area are eliminated.
Referring to Figs. 4 through 10, the autocropping process will be described in detail. The image is first subsampled from 50 Lm to 400 lm to reduce the amount of data to be processed in step 202. Of course, the image may be downsampled to other resolutions as desired. Not all of the original image data is needed to reliably segment the breast tissue from the remainder of the image.
Subsampling every eighth pixel in both the horizontal and vertical directions reduces the amount of data by 64 times. For purposes of segmenting the breast tissue from the rest of the image, the consequent loss of resolution is immaterial.
A white border twenty pixels in width is added around all sides of the WO 99/09887 PCT[US98/1 7886 -8subsampled image in step 204. White corresponds to the maximum pixel value possible given the number of bits used to represent each pixel. For images having 12 bits of gray-scale resolution, the maximum gray-scale value is 4095. The bordered image is then thresholded in step 206 with a relatively high threshold value such that most of the breast tissue is guaranteed to be less than the threshold to produce a binary image. In one embodiment of the invention, the threshold is set equal to a predetermined percentage of the gray-scale value of a pixel near the top middle portion of the image. The thresholded image is then inverted, that is, ones become zeroes and zeroes become ones, in step 208. The inverted image is then dilated in step 210. Dilation is a morphological operation in which each pixel in a binary image is turned on, that is, set to a value of one, if any of its neighboring pixels are on. If the pixel is already on, it is left on.
In step 212 the dilated image is cropped to the size of the largest blob.
Blobs are contiguous groups of pixels having the value one. This step 212 removes bright borders from the subsampled mammogram representation while ensuring that none of the breast area is reduced. Other techniques that threshold to find the border have a very difficult time dealing with bright areas in the breast adjacent to the border such as, for example, when breast implants are visible in the image. Pixels from the original image, resulting from step 202, corresponding to the locations of the pixels in the cropped blob, are selected for subsequent processing. Note that this is a simple subset of pixels from the input image.
The image from step 212 is histogram equalized in step 214. The average brightness of the image will vary widely from mammogram to mammogram.
Moreover, different digitizers having different optical density characteristics are an additional source of variability in brightness levels in the digital representation of the mammogram. The breast mask that is the output of the autocropper is mainly defined by means of a region-growing algorithm that requires a single contrast setting to work properly. However, it has been determined experimentally that a single contrast setting will not work for a wide range of image inputs. Therefore, each image is mapped into a normalized image space using an automatic histogram enhancement process, after which a single contrast setting works well.
WO 99/09887 PCT[US98/1 7886 -9- First, a histogram of the image is obtained. Typically, most of the data in the breast area will be in the lower histogram bins (corresponding to grayscale values of about 0 1000), with borders and labels being in the higher bins (corresponding to gray-scale values of about 4000 4095) for 12-bit data. The upper and lower bin values that contain the typical breast data are determined. The lower bin value is the first highest peak encountered when going from the lowest gray-scale value toward the highest gray-scale value. The upper bin is the last zero-value bin encountered when going from the highest gray-scale level toward the lowest grayscale value. Then the data are reduced to an eight-bit representation and linearly stretched over the range of the data type. For example, values in the lower bins are set to zero. Values of data in the upper bins are set to 255. The rest of the data are then linearly mapped between the lower and upper bins.
After the image has been histogram equalized, the equalized image may be considered to be a matrix. The image matrix is divided into left and right halves, of equal size if possible, and the brighter side is selected in a step 216. The sums of all the pixels in the left and right halves are computed. The sum values are then compared and the side having the greater sum is the brighter side.
Prior to region growing the brighter side, algorithm variables are initialized in step 218. The size of the region-grown mask is preliminarily checked in step 220. If it is large enough, then the mask is acceptable. Otherwise, processing continues to find the mask. The side of the image to be region grown is selected in step 222. In step 224 this region is searched to find its maximum gray-scale value.
This maximum value is used to find a pixel to start a region-growing algorithm.
Region growing is the process of grouping connected pixels sharing some like characteristic. The choice of characteristic influences the resultant region. The input to a region growing function is a gray-scale image and a starting point to begin growing. The output is a binary image with ones indicating pixels within the grown region, blobs. Region growing will create a single blob, but that blob may have within it internal holes, that is, pixels that are off. To grow a blob, each of the four nearest neighbors of a pixel of interest are looked at. The contrast ratio is computed for each nearest neighbor pixel. If the contrast ratio is less than a contrast ratio WO 99/09887 PCT/US98/17886 threshold, then the neighbor pixel is set to a one in a binary mask image. Otherwise, the neighbor pixel is set to zero. The region growing algorithm spirals outwardly from the starting or seed pixel, progressively looking at nearest neighbor pixels until done. To those skilled in the art, it is clear that other region growing algorithms may also be applied.
In step 226, region growing begins with the pixel identified from the previous step 224 to produce a binary mask. The size of the mask resulting from step 226 is computed in step 228 and checked in step 230. There may be three points of failure for this approach. First, the brightest point in the search region may be an artifact outside the breast. Therefore, if the resulting mask is not large enough pixels), then the search region is moved closer to the side of the image and searched again. This is repeated three times, each time lowering the contrast value threshold.
This corresponds to the path taken through steps 232 and 234. Second, the side selection approach may be in error. Therefore, if a valid breast mask is not found in the first side searched, then the other side of the breast is searched. This corresponds to the path taken through steps 236 and 238. Third, if a valid breast mask is not found on either side, then the whole breast is thresholded and the largest object is taken to be the breast mask in step 240.
Since a constant contrast value is used in the region-growing algorithm, some masks will be too large. Typically, there will be "tails" along the edge of the digitized mammogram image where extra light leaked in while the original mammogram film was being digitized. The tails are reduced by applying a series of erodes and then a series of dilates to the image. Erosion is a morphological operation in which each pixel in a binary image is turned off unless all of its neighbors are on. If the pixel is already off, it is left off. But first, the holes in the mask must be filled in or the multiple erodes may break the mask into disjoint sections. Thus, holes in the mask are closed in step 242 by means of a majority operation. The majority operation is a morphological operation in which each pixel in a binary image is turned on if a majority of its neighboring pixels are on. If the pixel is already on, it is left on.
However, another problem is that some smaller breast masks can not WO 99/09887 PCTIUS98/1 7886 -11undergo as many erodes as can larger breast masks. Therefore, as a fail-safe measure, the sum of the breast mask is taken before and after the erodes and dilates.
If the size is reduced too much by more than the original mask before the morphological operators is used. Thus, a duplicate copy of the mask is made in step 244 before the mask is eroded and dilated in steps 246 and 248, respectively. The size of the resultant mask is then computed in step 250 and compared with the size of the mask from step 242 in step 252. If the new size is less than half the old size, then the duplicate mask, from step 244, is selected in step 254 for subsequent processing.
Otherwise, the resultant mask from step 248 is used.
The original image (from step 202) is then cropped to the size of the breast mask just found (either from step 242 or step 248) in step 256. In case the resulting mask is too small for subsequent processing, a crop adjustment is always made in step 258. The adjustment comes in the form of increasing the size of the breast mask bounding box by including additional pixels from the original image in the cropped image.
The cropped image is then automatically histogram enhanced in step 260 as previously described above in connection with step 214. This enhanced image is passed through a loose region growing step 262 to produce a generous mask. This means that the image is subjected to a lower threshold to yield more "on" pixels. This mask is then subjected to hole-closing, eroding, and dilating in steps 264, 266, and 268, respectively, as above, but to a lesser degree.
The same steps described above are repeated one final time in steps 270 through 276, but the crop adjustments are less and the contrast value is increased for a tight region growing step 276. This tight region growing step 276 can afford the higher contrast value since it will be region growing in just the cropped image.
This results in a parsimonious estimate of breast tissue. The resulting mask is segmented to find the largest object in step 278 and its bounding box shrunk to just enclose the object in step 280. There may still be some holes in the breast mask.
Therefore, after crop adjustments in step 282, the mask is inverted in step 284 and the largest object is found in step 286. This largest object is extracted and then inverted in step 288 to obtain the penultimate mask.
WO 99/09887 PCT[US98/1 7886 -12- The final mask is obtained by closing holes in the penultimate mask with multiple majority operations and dilations in step 290. The image is then cropped to the size of the resulting mask and the autocropping is complete. An important result from the autocropper is the offset of the cropped image. This is the pixel location in the original image that corresponds to the pixel in the upper left pixel of the cropped image. Keeping track of all the cropping and crop adjustments determines this offset value.
The output of the autocropping process is a rectangular array of pixels representing a binary mask wherein the pixels corresponding to breast tissue are assigned a value of one while the remainder of the pixels are assigned a value of zero. Put another way, the binary mask is a silhouette of the breast made up of ones while the background is made up of zeroes.
Parameters of the autocropper may be optimized to obtain better breast masks. The procedure is described below in the optimization section.
DETECTION OF CLUSTERED MICROCALCIFICATIONS Turning now to Fig. 11, there is seen therein a flow diagram illustrating in greater detail the clustered microcalcification detection system 300 of the invention.
That portion of the digital representation of the mammogram corresponding to the analysis region 296, designated a cropped sub-image 302, produced in the cropping step 200, is first processed to reduce noise in a noise reduction step 310 to reduce digitization noise that contributes to false detections of microcalcifications. The noise-reduced image is then filtered using an optimized target-size-dependent difference of Gaussians (DoG) spatial kernel in step 320 to enhance differences between targets and background, thus creating global and local maxima in the filtered image. The optimized DoG-filtered image is then thresholded in step 340 to segment maxima that represent potential detections of microcalcifications.
The detected maxima are converted to single-pixel coordinate representations in a conversion step 350. The coordinate representations of the WO 99/09887 PCT/US98/1 7886 -13detected maxima are compared with the binary mask of the analysis area in a first false-positive removal step 360 to remove false detections outside the breast mask area. The remaining coordinate representations in the analysis area are clustered in a clustering step 370. Features are computed for the remaining clusters in a feature computation step 380 and used to remove non-suspicious detections in a classifying step 400 (Fig. The remaining detections are outputted as detected clustered microcalcifications in an outputting step 600 in the form of cluster coordinates.
Turning now to a more detailed discussion of the steps in the clustered microcalcification detection process, the digital mammogram image is first filtered to reduce noise in the image. Although the main limitation in image quality should be the granularity of the film emulsion, noise is introduced from the process of digitization. This noise may later be detected as a pseudocalcification. In this system, a cross-shaped median filter is used because it is well known to be extremely effective at removing single-pixel noise. The median filter is a non-linear spatial filter that replaces each pixel value with the median of the pixel values within a kernel of chosen size and shape centered at a pixel of interest. Referring to Fig. 12, it may be seen that the cross shape is formed by the set of pixels which include the center pixel and its four nearest neighbors. The cross shape preserves lines and corners better than typical block-shaped median filters and limits the possible substitution to the four nearest neighbors, thereby reducing the potential for edge displacement.
After noise has been reduced, the image is filtered with an optimized DoG kernel to enhance microcalcifications. Filtering is accomplished by convolving the noise-reduced image with the DoG kernel. In an alternative embodiment, filtering is accomplished by first obtaining the fast Fourier transforms (FFTs) of the noise-reduced image and the DoG kernel, then multiplying the FFTs together, and taking the inverse FFT of the result.
The DoG kernel was chosen because neurophysiological experiments provide evidence that the human visual pathway includes a set of"channels" that are spatial frequency selective. Essentially, at each point in the visual field, there are size-tuned filters or masks analyzing an image. The operation of these spatial WO 99/09887 PCT/US98/17886 -14receptive fields can be approximated closely by a DoG.
The 2-D Gaussian mask is given as: X- +y2) G(x,y) ce 20 (1 where c normalizes the sum of mask elements to unity, x and y are horizontal and vertical indices, and o is the standard deviation. Using Equation 1, the difference of two Gaussians with different a yields: -(x2 y y2) 202 (2) DoG(x,y) cle -ce It has been shown that when 02 1.60a, then the DoG filter's response closely matches the response of human spatial receptive filters. Therefore, with motivation from human physiology, let the ratio of the DoG standard deviation constants be 1:1.6. Then, for a target of size (average width) t pixels, use a 2 t/2 and, from the rule of thumb, oi 02/1.6.
Since microcalcifications typically range from 100 to 300 lmr in diameter, potential target sizes for the 50 ntm digitized mammograms correspond to 2 to 6 pixels. It has been found that a DoG kernel constructed using an optimization technique for selecting the target size parameter, such as the GA detailed below, has an optimized target size oft 6.01 pixels. The targetsize t will vary depending on such factors as the resolution and scale of the image to be processed. The impulse response of a DoG filter having t 6.01 pixels and 02 1.6oi is shown in Figs. 13 and 14.
Once the noised-reduced cropped image has been DoG filtered to enhance differences between targets and background, the DoG-filtered subimage contains differences in gray levels between potential microcalcifications and background. Although microcalcifications tend to be among the brightest objects in DoG-filtered subimages, they may exist within regions of high average gray levels and thus prove difficult to reliably segment. The thresholding process used in one WO 99/09887 PCT[US98/17886 embodiment of the invention that generally addresses these concerns involves pairwise pixel "ANDing" of the results of global histogram and locally adaptive thresholding. However, the preferred embodiment of the invention uses sloping local thresholding.
Since targets tend to exist within an image's higher gray levels, then the global threshold may be approximated by finding the level which segments a preselected percentage of the corresponding higher pixel levels in the image histogram. An embodiment of a global thresholding method is illustrated in Fig. Locally adaptive thresholding may be implemented by varying the high and low thresholds based on the local pixel value mean and standard deviation. An embodiment of a dual-local thresholding method is illustrated in Fig. 16.
After computing the image histogram, p(rk), the gray level threshold, g, used to segment a preselected upper fraction,f, of the histogram, is found using: f =1 -tp(rk) (3) k=o where rk is the k Ih gray level, 0 g g and gma is the maximum gray level in the image.
The locally adaptive thresholds, 11o and t hi, are found using tlo kIowN(x,y) PINN(x,y) (4) and thi khiOGN(x,y) PN(x,y) where kio and khi are used to preselect the multiple of the local standard deviation of gray-level intensities, and pNN(x,y) is the local gray-level mean of the N x N neighborhood centered on the pixel at of the DoG-filtered image. Other neighborhood shapes, such as rectangular, circular, and ellipsoidal, may also be used.
Pixels whose brightness or gray-level value falls within the threshold interval, that is, tio brightness thi, are set equal to one. Optimization off, ko, khi and N is WO 99/09887 PCT[US98/1 7886 -16discussed below in connection with the parameter-optimizing process. The results of the global thresholding process may be combined with the results of the local thresholding step by logically ANDing them as shown in Fig. 17. Alternatively, either thresholding method may be used alone.
The preferred thresholding means are illustrated in Fig. 18 wherein it may be seen that an N x N window is centered at a pixel x,y in the input image The mean, and standard deviation, of the digital mammogram image pixels under the window are computed. A local threshold value, is computed as: T(x,y) A Bp(x,y) Co(x,y) (6) where values for N, A, B, and C are computed during a parameter optimization stage, discussed below. Values for T(x,y) are computed for every x,y location in the image.
The digital mammogram has also been DoG filtered, producing an image Each pixel of the DoG-filtered image d(x,y) is compared to the threshold value Pixels in the locally thresholded image ls(x,y) are set to one where values of the DoG-filtered image are greater than the threshold, and set to zero elsewhere.
The advantage of this novel local sloping thresholding method over prior art thresholding methods is that the threshold is computed from the pixels in a pre-DoG-filtered image rather than from a post-DoG-filtered image. This eliminates the need for background trend correction. In conventional local thresholding, the threshold is computed as: T(x,y) Bg(x,y) Co(x,y) (7) from the mean and standard deviation of the DoG-filtered image. The problem of using a local threshold computed from the DoG-filtered image is that DoG-filtered images typically have mean values close to zero and standard deviations significantly affected by the presence of targets.
Local thresholds computed from the statistics of the DoG-filtered image suffer from the following adverse effects. First, since the mean value is close WO 99/09887 PCT/US98/17886 -17to zero, a degree of freedom is lost in the computation of the threshold, which becomes essentially a function of the standard deviation. Second, the absolute brightness of the input image is lost. To keep many spurious detections from occurring, it is desirable to have high thresholds in bright regions. However, the information about the local mean of the input image is not available in the DoGfiltered image. Finally, the standard deviations of DoG-filtered images are increased by detections of targets. This is so because when local bright spots of proper size exist in the original image, large gray-scale values result in the DoG-filtered image.
Thus, the presence of targets in a region increases the local standard deviation thereby raising the threshold of that region. The higher threshold reduces the probability of passing a bright spot to subsequent processing stages.
The novel local thresholding method just described solves the above problems by computing thresholds from the input image, which are then applied to the DoG-filtered image. Additionally, the threshold computed here includes an offset term A, which is independent of the local image mean.
After thresholding, detections are converted to single-pixel representations by computing the centroid or center of gravity of groups of contiguous pixels found by the thresholding process. Detections are thus represented as single pixels having a value of logical one while the remaining pixels have a value of logical zero.
False-positive detections outside of the breast area are removed by logically ANDing the binary mask from the autocropper with the single-pixel representations of the detections.
Calcifications associated with malignancies usually occur in clusters and can be extensive. The cluster detection module identifies clusters based on a clustering algorithm as depicted in Fig. 19. Specifically, a suspicious cluster is declared when at least PCsmin or more detected signals are separated by less than a nearest neighbor distance, Optimization of/Csmin and dn,, is discussed below in connection with the parameter optimizing process. Figure 20 illustrates the clustering process for the case wherein uCsi, 5 and 4.
Additional false-positive clustered microcalcifications are removed by WO 99/09887 PCT/US98/17886 -18means of a classifier, detailed below. Features are extracted for each of the potential clustered microcalcifications as shown in Fig. 21. The eight features computed for each of the potential clustered microcalcifications in a preferred embodiment are: 1. The larger eigenvalue (1 1 of the covariance matrix of the points in a cluster; 2. The smaller eigenvalue (12) of the covariance matrix of the points in a cluster; 3. The ratio of the smaller eigenvalue of the covariance matrix to the larger eigenvalue of the covariance matrix of the points in a cluster. Equivalent to the ratio of the minor axis to the major axis of an ellipse fitted to cover the points in a cluster; 4. Linear density calculated as the number of detected microcalcifications divided by the maximum interpoint distance; Standard deviation of the distances between points in a cluster; 6. Mean minus median of the distances between points in a cluster; 7. Range of points in cluster calculated as maximum interpoint distance minus the minimum interpoint distance; and 8. Density of a cluster calculated as the number of detections divided by the area of a box just large enough to enclose the detections.
Of course, other features could be computed for the potential microcalcification clusters, and the invention is not limited to the number or types of features enumerated herein.
CLASSIFYING DETECTIONS The cluster features are provided as inputs to the classifier, which classifies each potential clustered microcalcification as either suspicious or not suspicious. In practice, the clustered microcalcification detector is only able to locate regions of interest in the digital representation of the original mammogram that may be associated with cancer. In any detector, there is a tradeoff between locating as many potentially suspicious regions as possible versus reducing the number of normal regions falsely detected as being potentially suspicious. CAD WO 99/09887 PCT/US98/17886 -19systems are designed to provide the largest detection rates possible at the expense of detecting potentially significant numbers of regions that are actually normal. Many of these unwanted detections are removed from consideration by applying pattern recognition techniques.
Pattern recognition is the process of making decisions based on measurements. In this system, regions of interest or detections are located by a detector, and then accepted or rejected for display. The first step in the process is to characterize the detected regions. Toward this end, multiple measurements are computed from each of the detected regions. Each measurement is referred to as a feature. A collection of measurements for a detected region is referred to as a feature vector, wherein each element of the vector represents a feature value. The feature vector is input to a discriminant function.
Referring to Fig. 22, there may be seen therein a classifier having a feature vector x applied to a set of discriminant functions The classifier shown in Fig. 22 is designed with one discriminant function per class. A discriminant function computes a single value as a function of an input feature vector.
Discriminant functions may be learned from training data and implemented in a variety of functional forms. The output of a discriminant function is referred to as a test statistic. Classification is selecting a class according to the discriminant function with the greatest output value. The test statistic is compared to a threshold value.
For values of the test statistic above the threshold, the region or detection associated with the feature vector is retained and displayed as potentially suspicious. When the test statistic is below the threshold, the region is not displayed.
Many methods are available for designing discriminant functions.
One approach considered for this invention is a class of artificial neural networks.
Artificial neural networks require training, whereby the discriminant function is formed with the assistance of labeled training data.
In a preferred embodiment, the classification process is implemented by means of a multi-layer perceptron (MLP) neural network Of course, other classifier means could be used such as, for example, a statistical quadratric classifier.
Only potential clustered microcalcifications classified as suspicious are retained for WO 99/09887 PCT/US98/17886 eventual designation for a radiologist. Alternatively, it may be desirable to iteratively loop between MLP NN analysis of the individual microcalcification detections and the microcalcification clusters.
Referring to Fig. 23, a schematic diagram of an MLP NN may be seen therein. The MLP NN includes a first layer of J hidden layer nodes or perceptrons 410, and one output node or perceptron 420 for each class. The preferred embodiment of the invention uses two output nodes, one each for the class of suspicious detections and the class of non-suspicious detections. Of course, more or fewer classes could be used for classifying clusters of microcalcifications. Each computed feature x i is first multiplied by a weight wij, where i is an index representing the ith feature vector element, andj is an index representing the j h first layer node. The output yj of each first layer perceptron 410 is a nonlinear function of the weighted inputs and is given by: d Yv =f (wjx (8) where d represents the total number of features x i and is typically a saturating nonlinearity. In this embodiment, tanh(*). The first layer or hidden layer node outputs yj are then multiplied by a second layer of weights ujk and applied to the output layer nodes 420. The output of an output layer node 420 is a nonlinear function of the weighted inputs and is given by: z(y f( (ukx (9) where k is an index representing the kth output node.
The hyperbolic tangent function is used in a preferred embodiment of the system because it allows the MLP NN to be trained relatively faster as compared to other functions. However, functions other than the hyperbolic tangent may be used to provide the outputs from the perceptrons. For example, linear functions may be used, as well as smoothly varying nonlinear functions, such as the sigmoid WO 99/09887 PCT/US98/17886 -21function.
The weight values are obtained by training the network. Training consists of repeatedly presenting feature vectors of known class membership as inputs to the network. Weight values are adjusted with a back propagation algorithm to reduce the mean squared error between actual and desired network outputs.
Desired outputs of z, and z 2 for a suspicious input are +1 and respectively.
Desired outputs of z, and z 2 for non-suspicious inputs are -1 and respectively.
Other error metrics and output values may also be used.
In this embodiment of the system, the MLP NN is implemented by means of software running on a general-purpose computer. Alternatively, the MLP NN could also be implemented in a hardware configuration by means readily apparent to those with ordinary skill in the art.
After training, each detected clustered microcalcification is classified as either suspicious or not suspicious by means forming the difference z, z 2 then comparing the difference to a threshold, 0. For values of z, z 2 greater than or equal to the threshold 0, z, z 2 0, the classifier returns a value of +1 for suspicious clustered microcalcifications, and for values of z I z 2 0, the classifier returns a value of -1 for non-suspicious clustered microcalcifications.
In order to arrive at optimum values for the respective weights, and the number of first layer nodes, the MLP NN was trained with a training set of feature vectors derived from a database of 978 mamnmogram images.
To develop and test the CAD system of the invention, truth data was first generated. Truth data provides a categorization of the tissue in the digital images as a function of position. Truth data was generated by certified radiologists marking truth boxes over image regions associated with cancer. In addition to the mammogram images, the radiologists also had access to patient histories and pathology reports.
The radiologists identified 57 regions of interest, containing biopsyconfirmed cancers associated with clustered microcalcifications, by means of truth boxes. All 978 images were then processed by the microcalcification detector of the invention to produce a plurality of feature vectors, a subset of which were associated WO 99/09887 PCT/US98/17886 -22with the 57 truth boxes. Half of the subset feature vectors were randomly chosen, along with about three times as many feature vectors not associated with clustered microcalcifications, to comprise the training set of feature vectors. The MLP NN, having a predetermined number of hidden nodes, was then trained using the training set. The remaining feature vectors were used as a test database to evaluate the performance of the MLP NN after training. Training of the MLP NN was carried out by means of the Levenberg-Marquardt back propagation algorithm.
Alternatively, the MLP NN can be trained with other learning algorithms and may have nonlinearities other than the hyperbolic tangent in either or both layers. In an alternative embodiment with sigmoidal output nodes, the Bayes optimal solution of the problem of classifying clustered microcalcification detections as either suspicious or non-suspicious may be obtained.
In one run of the preferred embodiment during testing, before application of the MLP NN classifier to eliminate false-positive clustered microcalcifications, the detection procedure found about 93% of the true-positive clustered microcalcifications in both the training and test databases while indicating about 10 false-positive clustered microcalcifications per image. It was found that after an MLP NN classifier having 25 first layer nodes was used with the respective optimum weights found during training, 93% of the true-positive detections were retained while 57% of the false-positive detections were successfully removed.
Referring to Fig. 24, there may be seen a histogram of the results of testing on the testing database after classification by the MLP NN. Of course, the MLP NN of the invention may be operated with more or fewer first layer nodes as desired.
DISPLAYING DETECTIONS After the locations of clustered microcalcifications have been determined, they are indicated on the original digitized mammogram image, or a copy of the original image, by drawing rectangular boxes around microcalcifications.
Other means for indicating the locations of microcalcifications may be used, such as, for example, placing arrows in the image pointing at detections or drawing ellipses around the detections.
WO 99/09887 PCT/US98/1 7886 -23- The locations of clustered microcalcifications are passed to the display detections procedure as a list of row and column coordinates of the upper left and lower right pixels bounding each of the clusters. The minimum row and column coordinates and maximum row and column coordinates are computed for each cluster. Bounding boxes defined by the minimum and maximum row and column coordinates are added to the original digitized image, by means well known in the art. The resulting image is then stored as a computer-readable file, displayed on a monitor, or printed as a hard-copy image, as desired.
In one embodiment of the system, the resulting image is saved to a hard disk on a general-purpose computer having dual Pentium II processors and running a Windows NT operating system. The resulting image may be viewed on a VGA or SVGA monitor, such as a ViewSonic PT813 monitor, or printed as a hard-copy gray-scale image using a laser printer, such as a Lexmark Optra S 1625 Of course, other hardware elements may be used by those with ordinary skill in the art.
OPTIMIZING THE PARAMETERS Genetic algorithms (GAs) have been successfully applied to many diverse and difficult optimization problems. A preferred embodiment of this invention uses an implementation of a GA developed by Houck, et al. Genetic Algorithm for Function Optimization," Tech. Rep., NCSU-IE TR 95-09, 1995), which is incorporated by reference herein, to find promising parameter settings. The parameter optimization process of the invention is shown in Fig. 25. This is a novel application of optimization techniques as compared to current computer-aided diagnosis systems require hand tuning by experiment.
GAs search the solution space to maximize a fitness (objective) function by use of simulated evolutionary operators such as mutation and sexual recombination. In this embodiment, the fitness function to be maximized reflects the goals of maximizing the number of true-positive detections while minimizing the number of false-positive detections. GA use requires determination of several issues: objective function design, parameter set representation, population initialization, WO 99/09887 PCTIUS98/1 7886 -24choice of selection function, choice of genetic operators (reproduction mechanisms) for simulated evolution, and identification of termination criteria.
The design of the objective function is a key factor in the performance of any optimization algorithm. The function optimization problem for detecting clustered microcalcifications may be described as follows: given some finite domain, D, a particular set of cluster detection parameters x kio, khi, N, Csmin dnn} where x E D, and an objective functionfobj D Ot, where R denotes the set of real numbers, find the x in D that maximizes or minimizes fob. When sloping local thresholding is used in the cluster detector, the parameters N, A, B, and C are optimized. Radiologic imaging systems may be optimized to maximize the TP rate subject to the constraint of minimizing the FP rate. This objective may be recast into the functional form shown in the following equation: -FP(x) TP(x) TPin o FPp otherwise where maximization is the goal. For a particular set of cluster detection parameters, if the minimum acceptable TP rate, TPmn, is exceeded, the objective function returns the negative of the FP rate. Otherwise, if the TP rate falls below TPmin, the objective function returns a constant value, FPpenalty -10. Other objective functions may also be used.
Since a real-valued GA is an order of magnitude more efficient in CPU time than the binary GA, and provides higher precision with more consistent results across replications, this embodiment of the invention uses a floating-point representation of the GA.
This embodiment also seeds the initial population with some members known beforehand to be in an interesting part of the search space so as to iteratively improve existing solutions. Also, the number of members is limited to twenty so as to reduce the computational cost of evaluating objective functions.
In one embodiment of the invention, normalized geometric ranking is used, as discussed in greater detail in Houck, et al., supra, for the probabilistic WO 99/09887 PCT/US98/1 7886 selection process used to identify candidates for reproduction. Ranking is less prone to premature convergence caused by individuals that are far above average. The basic idea of ranking is to select solutions for the mating pool based on the relative fitness between solutions. This embodiment also uses the default genetic operation schemes of arithmetic crossover and nonuniform mutation included in Houck, et al.'s
GA.
This embodiment continues to search for solutions until the objective function converges. Alternatively, the search could be terminated after a predetermined number of generations. Although termination due to loss of population diversity and/or lack of improvement is efficient when crossover is the primary source of variation in a population, homogeneous populations can be succeeded with better (higher) fitness when using mutation. Crossover refers to generating new members of a population by combining elements from several of the most fit members. This corresponds to keeping solutions in the best part of the search space. Mutation refers to randomly altering elements from the most fit members. This allows the algorithm to exit an area of the search space that may be just a local maximum. Since restarting populations that may have converged proves useful, several iterations of the GA are run until a consistent lack of increase in average fitness is recognized.
Once potentially optimum solutions are found by using the GA, the most fit GA solution may be further optimized by local searches. An alternative embodiment of the invention uses the simplex method to further refine the optimized GA solution.
The autocropping system may also benefit from optimization of its parameters including contrast value, number of erodes, and number of dilates. The method for optimizing the autocropper includes the steps of generating breast masks by hand for some training data, selecting an initial population, and producing breast masks for training data. The method further includes the steps of measuring the percent of overlap of the hand-generated and automatically-generated masks as well as the fraction of autocropped breast tissue outside the hand-generated masks. The method further comprises selecting winning members, generating new members, and WO 99/09887 PCTIUS98/1 7886 -26iterating in a like manner as described above until a predetermined objective function converges.
In Figures 26 and 27, there may be seen therein free response receiver operating characteristic curves for the system of the invention for the outputs of the optimized microcalcification detector and the classifier, respectively. Figure 26 represents the performance of the optimized detector before classifying detections, while Fig. 27 represents the performance of the system after classifying detections.
Although the GA has been described above in connection with the parameter optimization portion of the preferred embodiment, other optimization techniques are suitable such as, for example, response surface methodology. Of course, processing systems other than those described herein may be optimized by the methods disclosed herein, including the GA.
INCORPORATING CAD SYSTEM OUTPUTS FOR OPTIMAL SENSITIVITY Performance metrics for detection of suspicious regions associated with cancer are often reported in terms of sensitivity and specificity. Sensitivity measures how well a system finds suspicious regions and is defined as the percentage of suspicious regions detected from the total number of suspicious regions in the cases reviewed. Sensitivity is defined as:
TP
Sensitivity (11) TP FN where TP is the number of regions reported as suspicious by a CAD system that are associated with cancers, and FN is the number of regions that are known to be cancerous that are not reported as suspicious. Specificity measures how well the system reports normal regions as normal. Specificity is defined as:
TN
Specificity TN(12) FP TN where TN represents regions correctly identified as not suspicious and FP represents regions reported as suspicious that are not cancerous.
WO 99/09887 PCT[US98/1 7886 -27- Current CAD systems increase specificity by reducing FP. However, FP and TP are coupled quantities. That is, a reduction of FP leads to a reduction of TP. This implies that some of the suspicious regions that could have been detected are missed when the objective is to maintain high specificity.
Figures 28 and 29 illustrate relationships between the quantities TP, FP, TN, and FN. A measurement from a screening mammography image is represented by test statistic, x. The probability density function ofx is represented by p(x) and the decision threshold is represented by 0. Ifx is greater than 0, a suspicious region is reported. Areas under the probability density functions represent probabilities of events. From Fig. 28 observe that increasing the threshold reduces the probability of FP decisions. However, observe from Fig. 29 that increasing the threshold simultaneously reduces the probability of TP decisions.
Another metric that exists for CAD systems is positive predictive value (PPV), which is defined as the probability that cancer actually exists when a region of interest is labeled as suspicious. PPV can be calculated from the following equation:
TP
PPV TP (13) TP FP Note that increasing TP or reducing FP increases PPV.
Radiologists and computers find different suspicious regions. Figure is a Venn diagram depicting a possible distribution of suspicious regions for man and machine detections. Some suspicious regions are found solely by a human interpreter or radiologist, some solely by a CAD system, some are found by both, and some are not found by either.
Referring to Fig. 31, there may be seen a preferred method for incorporating the outputs of a CAD system, and more particularly for the CAD system of the invention, with the observations of a human interpreter of a screening mammography image 10 for optimal sensitivity, wherein a radiologist examines the screening mammography image 10 in a step 20 and reports a set of suspicious regions 30 designated as S1. The CAD system then operates on the image 10 in a 28 step 40 and reports a set of suspicious regions designated as S2. The radiologist then examines set S2 and accepts or rejects members of set S2 as suspicious in a step 60, thereby forming a third set of suspicious regions 70 denoted as S3, which is a subset of S2. The radiologist then creates in a step 80 a set of workup regions denoted as S4 which is the union of sets S1 and S3. The workup regions 90 are then recommended for further examination such as taking additional mammograms with greater resolution, examining the areas of the breast tissue corresponding to the workup regions by means of ultrasound, or performing biopsies of the breast tissue.
While the invention has been described in connection 15 with detecting clustered microcalcifications in mammograms, it should be understood that the methods and systems described herein may also be applicable to other medical Simages such as chest x-rays.
While the form of apparatus herein described constitutes a preferred embodiment of this invention, it is to be understood that the invention is not limited to this precise form of apparatus, and that changes may be made therein without departing from the scope of the invention 25 which is defined in the appended claims.
It is to be understood that, if any prior art publication is referred to herein, such reference does not constitute an admission that the publication forms a part of the common general knowledge in the art, in Australia or in any other country.
For the purposes of this specification it will be \\melbfies\ho.eS\KaraR\Keep\speci\92955-98.doc 3/10/01 28a clearly understood that the word "comprising" means "including but not limited to", and that the word "comprises" has a corresponding meaning.
\\melb-files\home\KarraR\Keep\speci\92955-98.doc 3/10/01

Claims (5)

1. A method for automated clustered microcalcification detection by digital image processing in screening mammography, comprising: storing a digital mammogram; applying a filtering algorithm to said digital representation to obtain a filtered image essentially comprising suspected microcalcifications; define coordinates representative of the location of said suspected microcalcifications; grouping said coordinates into clusters, said clusters representing microcalcifications located within a predetermined distance of a predetermined number of other microcalcifications and comprising potential clustered microcalcifications; computing features for each of said potential clustered microcalcifications; eliminating potential clustered microcalcifications based on said computed features for each of said potential clustered microcalcifications, and; indicating in said digital mammogram image the locations of those potential clustered microcalcifications remaining after the previous step.
2. The method of claim 1, wherein the step of filtering comprises filtering the said digital mammogram image with a difference of Gaussians (DoG) filter to produce a DoG- filtered image. \\melbfiles\home\KaraR\Keep\speci\92955- 9 .doc 3/10/01 30
3. The method of claim 2, wherein the step of defining said coordinates of suspected microcalcifications comprises: determining a local threshold value T for each pixel location in the DoG-filtered image; for each pixel in the DoG-filtered image, comparing the gray level value of the pixel to its corresponding local threshold value; identifying selected pixels with a gray level value greater than the corresponding local threshold value; generating a locally thresholded image by assigning a value of to all locations associated with the selected pixels, and setting to all other pixel locations, thereby forming a multiplicity of groups of contiguous pixels with value of 1; computing the center of gravity identifying said (x,y) coordinates for each group of contiguous pixels.4. The method of claim 3, wherein the step of determining a local threshold value for each pixel comprises the steps of: defining a set of pixel locations in said DoG-filtered :image, said set of pixel locations centered at said (x,y) coordinates; computing the mean and standard deviation from the DoG- filtered image at said set of pixel locations; computing said local threshold value T according to the equation: T A B Cowhere A, B, and C are predetermined coefficients, and p and are the mean and standard deviation of pixel values at said set of pixel locations. \\melb-files\home\KarraR\Keep\speci\92955.98.doc 3/10/01 31 The method of claim 1, wherein the said step of eliminating potential clustered microcalcifications based on said computed features comprises the steps of: applying said computed features as inputs to a statistical quadratic classifier; retaining certain potential clustered microcalcifications as determined by the output of the statistical quadratic classifier.
6. The method of claim 1, wherein the said step of eliminating potential clustered microcalcifications based on said computed features comprises the steps of: Sapplying said computed features as inputs to a neural network; retaining certain potential clustered microcalcifications as determined by the output of the neural network.
7. A method for automated clustered microcalcification detection by digital image processing in screening mammography, substantially as herein described with reference to the accompanying diagrams 0 Dated this 3 rd day of October 2001 QUALIA COMPUTING INC. By their Patent Attorneys GRIFFITH HACK Fellows Institute of Patent and Trade Mark Attorneys of Australia \\melbfiles\home\KarraR\Keep\speci\92955.9adoc 3/10/01
AU92955/98A 1997-08-28 1998-08-28 Method and system for automated detection of clustered microcalcifications from digital mammograms Ceased AU741337B2 (en)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
US5780197P 1997-08-28 1997-08-28
US60/057801 1997-08-28
US6699697P 1997-11-28 1997-11-28
US60/066996 1997-11-28
US7676098P 1998-03-03 1998-03-03
US60/076760 1998-03-03
PCT/US1998/017886 WO1999009887A1 (en) 1997-08-28 1998-08-28 Method and system for automated detection of clustered microcalcifications from digital mammograms

Related Child Applications (1)

Application Number Title Priority Date Filing Date
AU77378/01A Division AU7737801A (en) 1997-08-28 2001-10-03 Method and system for automated detection of clustered microcalcifications from digital mammograms

Publications (2)

Publication Number Publication Date
AU9295598A AU9295598A (en) 1999-03-16
AU741337B2 true AU741337B2 (en) 2001-11-29

Family

ID=27369332

Family Applications (1)

Application Number Title Priority Date Filing Date
AU92955/98A Ceased AU741337B2 (en) 1997-08-28 1998-08-28 Method and system for automated detection of clustered microcalcifications from digital mammograms

Country Status (10)

Country Link
EP (1) EP1009283A1 (en)
JP (1) JP2003532934A (en)
KR (1) KR20010023427A (en)
CN (1) CN1273516A (en)
AU (1) AU741337B2 (en)
BR (1) BR9812021A (en)
CA (1) CA2297986A1 (en)
IL (1) IL134557A0 (en)
NO (1) NO20000914D0 (en)
WO (1) WO1999009887A1 (en)

Families Citing this family (64)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6233304B1 (en) * 1998-11-25 2001-05-15 General Electric Company Methods and apparatus for calcification scoring
FR2796740B1 (en) * 1999-07-19 2001-10-26 Ge Medical Syst Sa METHOD AND SYSTEM FOR MANAGING THE SATURATION ON A DIGITIZED RADIOGRAPHIC IMAGE
US6654506B1 (en) * 2000-01-25 2003-11-25 Eastman Kodak Company Method for automatically creating cropped and zoomed versions of photographic images
IT1320956B1 (en) * 2000-03-24 2003-12-18 Univ Bologna METHOD, AND RELATED EQUIPMENT, FOR THE AUTOMATIC DETECTION OF MICROCALCIFICATIONS IN DIGITAL SIGNALS OF BREAST FABRIC.
GB2370438A (en) 2000-12-22 2002-06-26 Hewlett Packard Co Automated image cropping using selected compositional rules.
JP2004526179A (en) 2000-12-22 2004-08-26 ヒューレット・パッカード・カンパニー Image composition evaluation method and apparatus
US7034848B2 (en) 2001-01-05 2006-04-25 Hewlett-Packard Development Company, L.P. System and method for automatically cropping graphical images
GB0116113D0 (en) 2001-06-30 2001-08-22 Hewlett Packard Co Tilt correction of electronic images
GB2378340A (en) 2001-07-31 2003-02-05 Hewlett Packard Co Generation of an image bounded by a frame or of overlapping images
US7783089B2 (en) * 2002-04-15 2010-08-24 General Electric Company Method and apparatus for providing mammographic image metrics to a clinician
US7149335B2 (en) * 2002-09-27 2006-12-12 General Electric Company Method and apparatus for enhancing an image
FR2863749B1 (en) * 2003-12-10 2006-04-07 Ge Med Sys Global Tech Co Llc RADIOLOGICAL IMAGE PROCESSING METHOD FOR DETECTION OF MICROCALCIFICATIONS
JP2008505704A (en) * 2004-07-09 2008-02-28 フィッシャー イメイジング コーポレイション Breast screening method in fusion mammography
WO2006013514A1 (en) * 2004-07-26 2006-02-09 Koninklijke Philips Electronics N.V. System and method for automated suspicious object boundary determination
US8203338B2 (en) * 2007-08-06 2012-06-19 Carestream Health, Inc. Linear structure verification in medical applications
US9474440B2 (en) 2009-06-18 2016-10-25 Endochoice, Inc. Endoscope tip position visual indicator and heat management system
US10130246B2 (en) 2009-06-18 2018-11-20 Endochoice, Inc. Systems and methods for regulating temperature and illumination intensity at the distal tip of an endoscope
US10524645B2 (en) 2009-06-18 2020-01-07 Endochoice, Inc. Method and system for eliminating image motion blur in a multiple viewing elements endoscope
KR101030594B1 (en) * 2009-07-02 2011-04-21 국립암센터 Method of diagnosing micro-calcification
EP2462561B1 (en) * 2009-08-03 2018-10-31 Volpara Health Technologies Limited A method and system for analysing tissue from images
US9706908B2 (en) 2010-10-28 2017-07-18 Endochoice, Inc. Image capture and video processing systems and methods for multiple viewing element endoscopes
US10663714B2 (en) 2010-10-28 2020-05-26 Endochoice, Inc. Optical system for an endoscope
US10517464B2 (en) 2011-02-07 2019-12-31 Endochoice, Inc. Multi-element cover for a multi-camera endoscope
US11676730B2 (en) 2011-12-16 2023-06-13 Etiometry Inc. System and methods for transitioning patient care from signal based monitoring to risk based monitoring
EP2817781B1 (en) * 2012-02-24 2017-07-19 Paul Scherrer Institut A system for non-invasively classification of different types of micro-calcifications in human tissue
US9636003B2 (en) 2013-06-28 2017-05-02 Endochoice, Inc. Multi-jet distributor for an endoscope
US10595714B2 (en) 2013-03-28 2020-03-24 Endochoice, Inc. Multi-jet controller for an endoscope
EP2994034B1 (en) 2013-05-07 2020-09-16 EndoChoice, Inc. White balance enclosure for use with a multi-viewing elements endoscope
US9949623B2 (en) 2013-05-17 2018-04-24 Endochoice, Inc. Endoscope control unit with braking system
US10064541B2 (en) 2013-08-12 2018-09-04 Endochoice, Inc. Endoscope connector cover detection and warning system
US9943218B2 (en) 2013-10-01 2018-04-17 Endochoice, Inc. Endoscope having a supply cable attached thereto
KR101341288B1 (en) 2013-10-24 2013-12-12 사회복지법인 삼성생명공익재단 Quality assurance system and the method for radiotherapy apparatus
US9968242B2 (en) 2013-12-18 2018-05-15 Endochoice, Inc. Suction control unit for an endoscope having two working channels
US9474497B2 (en) * 2014-01-15 2016-10-25 Agfa Healthcare Method and system for generating pre-scaled images for a series of mammography images
WO2015112747A2 (en) 2014-01-22 2015-07-30 Endochoice, Inc. Image capture and video processing systems and methods for multiple viewing element endoscopes
EP3076872B1 (en) * 2014-04-29 2018-06-13 Koninklijke Philips N.V. Device and method for tomosynthesis imaging
US11234581B2 (en) 2014-05-02 2022-02-01 Endochoice, Inc. Elevator for directing medical tool
US10198840B2 (en) * 2014-06-27 2019-02-05 Koninklijke Philips N.V. Silhouette display for visual assessment of calcified rib-cartilage joints
US10258222B2 (en) 2014-07-21 2019-04-16 Endochoice, Inc. Multi-focal, multi-camera endoscope systems
JP6665164B2 (en) 2014-08-29 2020-03-13 エンドチョイス インコーポレイテッドEndochoice, Inc. Endoscope assembly
EP3235241B1 (en) 2014-12-18 2023-09-06 EndoChoice, Inc. System for processing video images generated by a multiple viewing elements endoscope
US10271713B2 (en) 2015-01-05 2019-04-30 Endochoice, Inc. Tubed manifold of a multiple viewing elements endoscope
KR102355294B1 (en) * 2015-02-13 2022-01-25 한국외국어대학교 연구산학협력단 Image processing device and image processing method
US10376181B2 (en) 2015-02-17 2019-08-13 Endochoice, Inc. System for detecting the location of an endoscopic device during a medical procedure
US10078207B2 (en) 2015-03-18 2018-09-18 Endochoice, Inc. Systems and methods for image magnification using relative movement between an image sensor and a lens assembly
US10401611B2 (en) 2015-04-27 2019-09-03 Endochoice, Inc. Endoscope with integrated measurement of distance to objects of interest
ES2818174T3 (en) 2015-05-17 2021-04-09 Endochoice Inc Endoscopic image enhancement using contrast-limited adaptive histogram equalization (CLAHE) implemented in a processor
WO2016194338A1 (en) * 2015-06-04 2016-12-08 パナソニック株式会社 Image processing method and image processing device
EP3367950A4 (en) 2015-10-28 2019-10-02 Endochoice, Inc. Device and method for tracking the position of an endoscope within a patient's body
CN113425225A (en) 2015-11-24 2021-09-24 安多卓思公司 Disposable air/water and suction valve for endoscope
CN108882902B (en) * 2016-02-08 2022-08-30 医默观系统公司 System and method for visualization and characterization of objects in images
EP3419497B1 (en) 2016-02-24 2022-06-01 Endochoice, Inc. Circuit board assembly for a multiple viewing element endoscope using cmos sensors
US10292570B2 (en) 2016-03-14 2019-05-21 Endochoice, Inc. System and method for guiding and tracking a region of interest using an endoscope
US10993605B2 (en) 2016-06-21 2021-05-04 Endochoice, Inc. Endoscope system with multiple connection interfaces to interface with different video data signal sources
US10372876B2 (en) 2017-01-20 2019-08-06 Agfa Healthcare Inc. System and method for providing breast image data
CN109124585A (en) * 2018-08-13 2019-01-04 脱浩东 For handling the method, apparatus and system of mammary gland data
CN110111271B (en) * 2019-04-24 2021-06-22 北京理工大学 Single-pixel imaging method based on side suppression network
CN110633644A (en) * 2019-08-16 2019-12-31 杭州电子科技大学 Human body joint angle prediction method based on electromyographic wavelet packet decomposition and GABP
WO2021062292A1 (en) * 2019-09-26 2021-04-01 Etiometry Inc. Systems and methods for transitioning patient care from signal based monitoring to risk-based monitoring
CN110880169A (en) * 2019-10-16 2020-03-13 平安科技(深圳)有限公司 Method, device, computer system and readable storage medium for marking focus area
CN113744171B (en) * 2020-05-28 2023-11-14 上海微创卜算子医疗科技有限公司 Vascular calcification image segmentation method, system and readable storage medium
CN112013203B (en) * 2020-07-18 2021-09-24 淮阴工学院 Pipe network detection system based on DRNN neural network
US11954853B2 (en) * 2021-07-21 2024-04-09 GE Precision Healthcare LLC Systems and methods for fast mammography data handling
CN117522864B (en) * 2024-01-02 2024-03-19 山东旭美尚诺装饰材料有限公司 European pine plate surface flaw detection method based on machine vision

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5491627A (en) * 1993-05-13 1996-02-13 Arch Development Corporation Method and system for the detection of microcalcifications in digital mammograms
US5622171A (en) * 1990-08-28 1997-04-22 Arch Development Corporation Method and system for differential diagnosis based on clinical and radiological information using artificial neural networks
US5825910A (en) * 1993-12-30 1998-10-20 Philips Electronics North America Corp. Automatic segmentation and skinline detection in digital mammograms

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5572565A (en) * 1994-12-30 1996-11-05 Philips Electronics North America Corporation Automatic segmentation, skinline and nipple detection in digital mammograms

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5622171A (en) * 1990-08-28 1997-04-22 Arch Development Corporation Method and system for differential diagnosis based on clinical and radiological information using artificial neural networks
US5491627A (en) * 1993-05-13 1996-02-13 Arch Development Corporation Method and system for the detection of microcalcifications in digital mammograms
US5825910A (en) * 1993-12-30 1998-10-20 Philips Electronics North America Corp. Automatic segmentation and skinline detection in digital mammograms

Also Published As

Publication number Publication date
AU9295598A (en) 1999-03-16
CA2297986A1 (en) 1999-03-04
KR20010023427A (en) 2001-03-26
IL134557A0 (en) 2001-04-30
NO20000914L (en) 2000-02-24
NO20000914D0 (en) 2000-02-24
WO1999009887A1 (en) 1999-03-04
JP2003532934A (en) 2003-11-05
BR9812021A (en) 2000-09-26
CN1273516A (en) 2000-11-15
EP1009283A1 (en) 2000-06-21

Similar Documents

Publication Publication Date Title
AU741337B2 (en) Method and system for automated detection of clustered microcalcifications from digital mammograms
US6115488A (en) Method and system for combining automated detections from digital mammograms with observed detections of a human interpreter
US6137898A (en) Gabor filtering for improved microcalcification detection in digital mammograms
US6970587B1 (en) Use of computer-aided detection system outputs in clinical practice
US7308126B2 (en) Use of computer-aided detection system outputs in clinical practice
US6801645B1 (en) Computer aided detection of masses and clustered microcalcifications with single and multiple input image context classification strategies
Boumaraf et al. A new computer-aided diagnosis system with modified genetic feature selection for BI-RADS classification of breast masses in mammograms
US7181056B2 (en) Method, and corresponding apparatus, for automatic detection of regions of interest in digital images of biological tissue
Laroussi et al. Diagnosis of masses in mammographic images based on Zernike moments and Local binary attributes
AU5888500A (en) Computer aided detection of masses and clustered microcalcification strategies
Makandar et al. Classification of mass type based on segmentation techniques with support vector machine model for diagnosis of breast cancer
AU7737801A (en) Method and system for automated detection of clustered microcalcifications from digital mammograms
Patrocinio et al. Classifying clusters of microcalcification in digitized mammograms by artificial neural network
MXPA00002074A (en) Method and system for automated detection of clustered microcalcifications from digital mammograms
TW492858B (en) Method and system for automated detection of clustered microcalcifications from digital mammograms
Osman Breast Cancer Detection and Classification Using Advanced Computer-Aided Diagnosis System
Malathi et al. Classification of Multi-view Digital Mammogram Images Using SMO-WkNN.
Patrocinio et al. Classifier scheme for clustered microcalcifications in digitized mammograms by using Artificial Neural Networks
Mohammed Salih Image enhancement based breast cancer detection using artificial neural network
Faye et al. STATUS OF THESIS
Li et al. Robust Detection of Masses in Digitized Mammograms

Legal Events

Date Code Title Description
FGA Letters patent sealed or granted (standard patent)
MK14 Patent ceased section 143(a) (annual fees not paid) or expired