WO2012052106A1

WO2012052106A1 - Method for classifying patterns in image data records

Info

Publication number: WO2012052106A1
Application number: PCT/EP2011/004808
Authority: WO
Inventors: Iris Paternoster-Bieker
Original assignee: Iris Paternoster-Bieker
Priority date: 2010-09-29
Filing date: 2011-09-26
Publication date: 2012-04-26
Also published as: EP2622540A1; DE102010046867A1

Abstract

The invention relates to a method for classifying patterns in image data records, wherein the classification comprises a fuzzy c-means clustering, wherein the method comprises detection of training image data records, wherein the training image data records contain image data relating to different classes of patterns, wavelet transformation of the training image data records in order to obtain a set of wavelet coefficients for each training image data record, determination of a set of statistical measures for each training image data record from the associated set of wavelet coefficients, classification of all sets of statistical measures for the training image data records in order to form clusters of the sets of statistical measures, wherein each cluster has a cluster centre, and wherein each cluster centre is assigned to one or more of the classes of patterns, determination of the clusters which have not been uniquely assigned and are assigned to a plurality of classes of patterns, and reclassification of the sets of statistical measures for the training image data records of the clusters which have not been uniquely assigned in order to form further clusters of the sets of statistical measures for each cluster which has not been uniquely assigned, wherein each further cluster has a further cluster centre, and wherein each further cluster centre is assigned to one or more of the classes of patterns.

Description

The invention relates to a method for classifying patterns in image datasets, and to a computer program product.

Textures or patterns play an important role in the composition of natural images and in their analysis and classification in various image analysis applications (Porter, R.; Canagarajah, N.: Robust rotation invariant texture classification: wavelet, Gabor filter and GMRF based schemes. In: IEE Proc.-Vis. Image Signal Processing 144 (1997), No. 3, pp. 180-188). The field of application is broad: surface inspection, recognition of textured objects, OCR, document segmentation, tissue recognition in medical images, automated visual inspection, content-based image retrieval, and remote sensing (see Hiremath, PS, S.: Wavelet Based Features for Texture Classification. In: GVIP Journal 6 (2006), No. 3, pp. 55-58).

Most existing texture analysis methods are based on the assumption that the texture images have been taken with the same orientation and scale. This is a limitation when using these methods. In most cases it is very difficult or even impossible to ensure the same orientation, scaling and translation (Zhang, J.; Tan, T.: Brief review of invariant texture analysis methods. In: Pattern Recognition 35 (2002), pp. 735-747).

The methods for representing texture features can be roughly subdivided into statistical and structural approaches (Haralick, RM: Statistical and Structural Approaches to Texture. In: Proceedings of the IEEE 67 (1979), No. 5, pp. 786-804). In the statistical approach, textures are described by the statistical properties of the local gray values of the image. These values usually have constant or only slightly varying properties within a texture. Thus, different textures can be recognized by comparing these values.

Structural models assume that textures consist of texture primitives. Textures can be reproduced by applying positioning rules to these primitives (see Tuceryan, M.; Jain, AK: The Handbook of Pattern Recognition and Computer Vision. World Scientific Publishing Co., 1998, pp. 207-248, //www.cs.iupui.edu/~tuceryan/research/ComputerVision/texture-review.pdf). Structural texture analysis consists of two phases: a) determining the texture elements and b) defining the positioning rules. This approach is applicable to very regular textures. The texture primitives may be determined, e.g., by edge detection using the Laplacian-of-Gaussian or difference-of-Gaussian method. Once the primitives are fixed, either the primitives are evaluated statistically or the positioning rules are deciphered (Wu, J.: Rotation Invariant Classification of 3D Surface Texture Using Photometric Stereo. Thesis submitted for the Degree of Doctor of Philosophy, Department of Computer Science, School of Mathematical and Computer Sciences, Heriot-Watt University, Edinburgh, 2003).

Earlier texture analysis methods used first-order or second-order statistics. Later, model-based methods such as Gaussian Markov Random Fields, Gibbs Random Fields, or the Wold model were developed. The coefficients of these models are used to describe the texture. The main problem with this approach is the estimation of the coefficients and the selection of a best-fit model for the given texture; the coefficients are usually transformed. Further model-based methods include "Simultaneous Autoregressive (SAR)", "Circular Simultaneous Autoregressive (CSAR)", "Rotation Invariant SAR (RISAR)", "Multichannel Gabor Filter", "Fractals", "steerable pyramid" and "wavelet transform" (see, e.g., Zhang, J.; Tan, T.: Brief review of invariant texture analysis methods. In: Pattern Recognition 35 (2002), pp. 735-747; and Mandelbrot, BB: The Fractal Geometry of Nature. Freeman, New York, 1983).

Wavelet theory has evolved over time into a unified basis for various signal processing applications such as multiresolution signal processing, machine vision, subband coding and speech compression (Chitre, Y.; Dhawan, AP: M-band wavelet discrimination of natural textures. In: Pattern Recognition 32 (1999), pp. 773-789). As the name "multiresolution" implies, it deals with the representation and analysis of a signal (image) at several resolutions. The appeal of this method is that properties overlooked at one resolution can become visible at another. Until the late 1980s, interest in this method remained low, but it is currently difficult to keep track of the multitude of scientific work on wavelets (Gonzalez, RC; Woods, RE: Digital Image Processing. Pearson Education, Inc., Pearson Prentice Hall, New Jersey, 2008).

Illustratively, the motivation for a wavelet transform can be described as follows: When we look at an image, we see contiguous areas of similar texture and intensity level that make up the objects. If these objects are very small and have low contrast, we examine them at higher resolutions. If they are very large and have high contrast, a coarser view is sufficient. But if an image contains both small and large objects, or objects with low and high contrast, at the same time, it is advantageous to examine the image at different resolutions. This is the fundamental motivation for multiresolution processing (see Gonzalez, RC; Woods, RE: Digital Image Processing. Pearson Education, Inc., Pearson Prentice Hall, New Jersey, 2008).

A distinction is made between the so-called "continuous wavelet transformation" and the "discrete wavelet transformation". The latter is an infinite series expansion, so that a numerical calculation has to be truncated after a finite number of terms.

An example of a discrete wavelet function is the Haar wavelet, which is used in one embodiment of this invention. The Haar wavelet was proposed by Alfred Haar in 1910 and is the first wavelet known in the literature. It is also referred to as the "D2" wavelet, a special case of the Daubechies wavelets. The Haar wavelet is a particularly simple orthonormal wavelet, which makes it very easy to implement. Its disadvantage is that it is not continuous and therefore not differentiable. Depending on the signal, this property may be an advantage or a disadvantage.

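The discrete wavelet transform (wavelet series) of a function f(x) is defined by

$$ f(x) = \sum_{\mu=-\infty}^{\infty} \sum_{k=-\infty}^{\infty} d_{\mu,k} \, \psi_{\mu,k}(x) $$

with the wavelet coefficients

$$ d_{\mu,k} = \int_{-\infty}^{\infty} f(x) \, \psi_{\mu,k}(x) \, dx , $$

where \psi_{\mu,k} denotes the wavelet \psi scaled with index \mu and shifted with index k.

The Haar wavelet \psi(x) and its scaling function \varphi(x) are defined by

$$ \psi(x) = \begin{cases} 1 & 0 \le x < 1/2 \\ -1 & 1/2 \le x < 1 \\ 0 & \text{otherwise} \end{cases} \qquad \varphi(x) = \begin{cases} 1 & 0 \le x < 1 \\ 0 & \text{otherwise} \end{cases} $$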
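For illustration, one decomposition level of a two-dimensional Haar transform can be sketched as follows. This is a minimal sketch, not the implementation of the embodiment: the function name `haar_dwt2`, the orthonormal scaling by the square root of 2, the restriction to even image sizes and the naming of the four channels are assumptions made here.

```python
import numpy as np

def haar_dwt2(image):
    """One level of a 2-D Haar transform (illustrative sketch).

    Returns four channels: the approximation and three detail channels.
    The channel names are an assumed convention.
    """
    a = np.asarray(image, dtype=float)
    if a.shape[0] % 2 or a.shape[1] % 2:
        raise ValueError("both image dimensions must be even")
    s = np.sqrt(2.0)
    # Step 1: sums and differences of neighbouring pixels within each row.
    lo = (a[:, 0::2] + a[:, 1::2]) / s   # low-pass in horizontal direction
    hi = (a[:, 0::2] - a[:, 1::2]) / s   # high-pass in horizontal direction
    # Step 2: the same operation on neighbouring rows of both results.
    approx = (lo[0::2, :] + lo[1::2, :]) / s
    detail_v = (lo[0::2, :] - lo[1::2, :]) / s   # changes between rows
    detail_h = (hi[0::2, :] + hi[1::2, :]) / s   # changes between columns
    detail_d = (hi[0::2, :] - hi[1::2, :]) / s   # changes in both directions
    return approx, detail_v, detail_h, detail_d
```

Applying the function again to the approximation channel would yield the next coarser resolution level. Because each step is orthonormal, the sum of squared coefficients over all four channels equals that of the input image.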

The magnitude of the wavelet coefficients in a particular channel is greater for those images that have a strong texture property in orientation and frequency represented by the channel. Therefore, the texture of an image can be mapped with a feature vector formed by statistical measures of the wavelet coefficients from the respective channel (sub-band). This finally allows the use of the characteristics of the wavelet coefficients for texture classification.

However, a problem with conventional methods for texture classification is that the recognition rate of textures is insufficient. Against this background, the object of the invention is to provide a method for classifying patterns in image data sets that improves on conventional methods for texture classification.

The object underlying the invention is achieved with the features of the independent claims. Preferred embodiments of the invention are specified in the dependent patent claims.

A method is proposed for classifying patterns in image data sets, wherein the classification comprises fuzzy c-means clustering, the method comprising ("training phase"):
a) acquiring training image data sets, wherein the training image data sets contain image data of different classes of patterns,
b) wavelet transformation of the training image data sets to obtain a set of wavelet coefficients for each training image data set,
c) determining a set of statistical measures for each training image data set from the associated set of wavelet coefficients,
d) classifying all sets of statistical measures of the training image data sets to form clusters of the sets of statistical measures, each cluster having a cluster center, each cluster center being associated with one or more of the classes of patterns,
e) determining the clusters that have not been uniquely assigned and are associated with several classes of patterns,
f) reclassifying the sets of statistical measures of the training image data sets of the clusters that have not been uniquely assigned, to form further clusters of the sets of statistical measures for each cluster that has not been uniquely assigned, each further cluster having a further cluster center, each further cluster center being associated with one or more of the classes of patterns.

Embodiments of the invention have the advantage that patterns in image data sets can be classified reliably. If, in an initial classification pass according to steps a)-d), individual clusters are assigned to multiple classes of patterns when the cluster centers are determined, the classification is repeated for these non-unique clusters in steps e) and f), whereby the recognition rate increases significantly. The core of the invention is thus the cascaded repetition of steps e) and f) in combination with the determination of statistical measures from the coefficients of wavelet transformations and with fuzzy c-means clustering. This ensures that sets of statistical measures of the training image data sets can be uniquely assigned to a single class of patterns. This "training phase" thus creates a knowledge base which can then be used to classify real test image data sets.

Fuzzy sets can be used to solve clustering problems in general. The task in a clustering problem is to group data with similar properties and map them to the same cluster. Data points that lie close to each other have similar properties and are to be grouped into common clusters.

The application of fuzzy sets in a classification task relaxes the class membership, so that a data element can belong to all classes simultaneously, with different degrees of membership (see Tizhoosh, HR: Fuzzy Image Processing, Introduction to Theory and Practice. Springer-Verlag, Heidelberg, 1998). The idea of fuzzy c-means is that each of the N data elements x_i is assigned not to just one cluster but to each cluster j with a certain membership μ_ij. Instead of simply minimizing the distance to the cluster center for all data elements, the distance of each element to the cluster center is multiplied by its membership (cf. chapter "Fuzzy Logic" in: Kramer, O.: Computational Intelligence, An Introduction. Springer Verlag, Berlin Heidelberg, 2009, pp. 75-99).

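The following function is minimized:

$$ J = \sum_{i=1}^{N} \sum_{j=1}^{k} \mu_{ij}^{m} \, \lVert x_i - v_j \rVert^{2} $$

with the modifier "m" (also called the fuzzifier), where v_j denotes the center of cluster j and k the number of clusters. The membership function and the cluster centers are as follows:

$$ \mu_{ij} = \frac{1}{\sum_{l=1}^{k} \left( \frac{\lVert x_i - v_j \rVert}{\lVert x_i - v_l \rVert} \right)^{2/(m-1)}} , \qquad v_j = \frac{\sum_{i=1}^{N} \mu_{ij}^{m} \, x_i}{\sum_{i=1}^{N} \mu_{ij}^{m}} $$

After the memberships μ_ij have been initialized, the cluster centers are calculated according to the above equation in each step, and then the memberships to the clusters are updated. These two steps are carried out alternately until the sum of the changes in the membership values μ_ij falls below a value ε. This defines the cluster centers and thus the knowledge base.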
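The alternating update of cluster centers and memberships can be sketched as follows. This is a minimal sketch using the standard fuzzy c-means update rules; the function name `fuzzy_c_means`, the random initialization, the default values m = 2 and ε = 1e-5 and the iteration limit are assumptions and are not taken from the description.

```python
import numpy as np

def fuzzy_c_means(x, k, m=2.0, eps=1e-5, max_iter=300, seed=0):
    """Standard fuzzy c-means (illustrative sketch).

    x: array of shape (N, p), one feature vector per row.
    Returns (centers of shape (k, p), memberships of shape (N, k)).
    """
    x = np.asarray(x, dtype=float)
    rng = np.random.default_rng(seed)
    # Random initial memberships, normalised so that each row sums to 1.
    u = rng.random((x.shape[0], k))
    u /= u.sum(axis=1, keepdims=True)
    for _ in range(max_iter):
        # Cluster centers: mean of the data weighted by membership ** m.
        w = u ** m
        centers = (w.T @ x) / w.sum(axis=0)[:, None]
        # Distances of every element to every center (guarded against 0).
        d = np.linalg.norm(x[:, None, :] - centers[None, :, :], axis=2)
        d = np.fmax(d, 1e-12)
        # Membership update; rows sum to 1 by construction.
        inv = d ** (-2.0 / (m - 1.0))
        u_new = inv / inv.sum(axis=1, keepdims=True)
        change = np.abs(u_new - u).sum()
        u = u_new
        if change < eps:   # sum of membership changes below epsilon
            break
    return centers, u
```

A hard cluster assignment, as needed for comparing clusters with classes, can be obtained from the result with `u.argmax(axis=1)`.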

In the classification, the knowledge base is thus initially built on the basis of a training data set.

According to one embodiment of the invention, the method further comprises the following steps ("test phase"):
a) acquiring test image data sets,
b) wavelet transformation of the test image data sets to obtain a set of wavelet coefficients for each test image data set,
c) determining a set of statistical measures for each test image data set from the associated set of wavelet coefficients,
d) classifying all sets of statistical measures of the test image data sets to form clusters of the sets of statistical measures of the test image data sets, wherein the classification is performed using the previously determined cluster centers,
e) reclassifying the clusters of the test image data sets whose associated cluster centers were associated with several of the classes of the training image data sets, wherein the reclassification is performed using the previously determined further cluster centers.

As a result of these steps, the generated knowledge base in the form of the unambiguous assignment of cluster centers to sets of statistical measures is thus applied to test image data records to be analyzed after completion of the training phase. Step e) ensures that, even in the case of potential non-unique assignment of a set of statistical measures to one of the classes of patterns by the reclassification process, the corresponding set of statistical measures of the test image data set can be uniquely assigned to a single class of patterns.

In other words, during the test, the previously generated knowledge base is used in step d). In step e), the clusters determined in step e) of the training phase are reclassified, using in each case the knowledge base of the individual cluster from step f) of the training phase.

According to another embodiment of the invention, the image data of the training image data sets comprise a first number of different classes of patterns, wherein the number of clusters formed by the classification of the training image data sets corresponds to the first number, wherein for a non-unique cluster

- this cluster is assigned a second number of different classes of patterns,

- when this cluster is reclassified, the number of further clusters corresponds to the second number.

According to a further embodiment of the invention, steps e) and f) of the training phase are repeated in cascaded fashion until the number of uniquely assignable clusters has exceeded a predetermined minimum value. For example, the minimum value is at least 95%.

According to another embodiment of the invention, the statistical measures are chosen such that they are invariant with respect to rotation of the image data sets. The method can thus be carried out robustly, since the orientation of the image data sets no longer plays a role. For any orientation of the image data sets, the method provides the same precise classification result. If rotation invariance is to be achieved, the pairs of mutually diagonal channels from the wavelet transform are combined into one statistical feature (Porter, R.; Canagarajah, N.: Robust rotation invariant texture classification: wavelet, Gabor filter and GMRF based schemes. In: IEE Proc.-Vis. Image Signal Processing 144 (1997), No. 3, pp. 180-188).

It should also be noted that wavelet transformations are scale invariant, so that the scaling of the image data sets to be classified plays no role either.

According to a further embodiment of the invention, the statistical measures include an energy value and / or an entropy value and / or a standard deviation.

In detail, for example, the following statistical measures can be defined:

• Energy: In the analysis of the textures of wavelet-transformed images, the mean value of the magnitude of the wavelet coefficients is mostly used. According to Porter, R.; Canagarajah, N.: Robust rotation invariant texture classification: wavelet, Gabor filter and GMRF based schemes. In: IEE Proc.-Vis. Image Signal Processing 144 (1997), No. 3, pp. 180-188, the energy of the nth channel C_n is defined as follows:

$$ \mathrm{ENERGY}_{C_n} = \frac{1}{MN} \sum_{i=1}^{M} \sum_{j=1}^{N} \lvert x(i,j) \rvert $$

where M×N are the dimensions of the channel, i and j are the row and column indices within the nth channel, and x(i,j) is the wavelet coefficient at that position.

• Energy2: Instead of the above energy parameter, according to Chitre, Y.; Dhawan, AP: M-band wavelet discrimination of natural textures. In: Pattern Recognition 32 (1999), pp. 773-789, this value can be calculated as a second feature according to:

• Entropy: The third feature may be the entropy value in each channel. This feature has been described, e.g., by Chitre, Y.; Dhawan, AP: M-band wavelet discrimination of natural textures. In: Pattern Recognition 32 (1999), pp. 773-789, and is calculated according to:

• Standard deviation: This measure has been described, for example, by Manthalkar, R.; Biswas, PK; Chatterji, BN: Rotation and scale invariant texture features using discrete wavelet packet transform. In: Pattern Recognition Letters 24 (2003), pp. 2455-2462:
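As an illustration of how such measures form a feature vector, the following minimal sketch computes two of them per channel: the energy as the mean magnitude of the wavelet coefficients, as described above, and a standard deviation. The function names are hypothetical, and the use of the population standard deviation is an assumption that may differ from the definition in the cited paper.

```python
import numpy as np

def channel_energy(channel):
    """Mean magnitude of the coefficients: (1 / (M*N)) * sum |x(i, j)|."""
    x = np.asarray(channel, dtype=float)
    return np.abs(x).sum() / x.size

def channel_std(channel):
    """Population standard deviation of the coefficients (assumed form)."""
    x = np.asarray(channel, dtype=float)
    return x.std()

def feature_vector(channels):
    """Concatenate energy and standard deviation of every channel."""
    return np.array([measure(c)
                     for c in channels
                     for measure in (channel_energy, channel_std)])
```

For a list of channels from a wavelet decomposition, `feature_vector` returns one point in the feature space that is passed to the clustering.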

It should be noted that the training image data sets contain unique classes of patterns. In other words, a single image is not associated with two different classes; the class assignment of the images is unique.

According to a further embodiment of the invention, the method further comprises, during the training phase, determining the set of statistical measures, wherein steps d)-f) are performed for different sets of statistical measures, and the set of statistical measures for which the recognition rate of the training phase is maximized is selected. The set of statistical measures thus determined is then applied identically in the classification of the test images.

The determination of the set of statistical measures can be carried out, for example, as an intermediate step between steps c) and d). However, it can also be determined mathematically in advance (before step c) which sets of statistical measures should be used.

Thus, according to one embodiment of the invention, not all (rotationally invariant) statistical features need to be included in the classification. Depending on the task under investigation, some features may be more suitable for separating classes than others. An iterative method is possible in which the training data are classified with only a part of the statistical features. The selection giving the better result can then continue to be used. The statistical measures determined during training remain unchanged during the test.
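One possible form of such a selection is sketched below. It is a minimal sketch under assumptions: it tries every subset of feature indices exhaustively, and the callable `score`, which is expected to run the training classification with the given subset and return its recognition rate, is a hypothetical placeholder that is not defined in the description.

```python
from itertools import combinations

def select_features(n_features, score, min_size=1):
    """Return the feature subset with the highest recognition rate.

    score(subset) is a user-supplied (hypothetical) function that performs
    the training classification with the feature indices in `subset` and
    returns the recognition rate. On ties, the subset found first is kept.
    """
    best_subset, best_rate = None, float("-inf")
    for size in range(min_size, n_features + 1):
        for subset in combinations(range(n_features), size):
            rate = score(subset)
            if rate > best_rate:
                best_subset, best_rate = subset, rate
    return best_subset, best_rate
```

With 6 features this evaluates 63 subsets; for larger feature sets a greedy search may be preferable, since the number of subsets grows exponentially.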

It should be noted that a reduction of the features can lead to an acceleration of the method.

In another aspect, the invention relates to a computer program product having computer-executable instructions for performing the method steps as described above.

In the following, preferred embodiments of the invention are explained in more detail with reference to the drawings, which show:

Fig. 1 an overview of micrographs,

Fig. 2A part A of a tabular overview of the training phase using 6 wavelet features,

Fig. 2B part B of a tabular overview of the training phase using 6 wavelet features,

Fig. 3 schematic steps of the construction of a cascaded classification in the training phase,

Fig. 4 schematic steps of the construction of a cascaded classification in the test phase,

Fig. 5 a tabular overview of the wavelet error rate after the 2nd stage of the cascaded classification of the micrographs.

Figure 1 shows a graphical overview of various micrographs (Verlag Stahleisen (publisher): Guideline series for the evaluation of the microstructure of annealed hot-work steels, SEP 1614, Plate 2. Verlag Stahleisen, Düsseldorf, 1996). Micrographs of crude steel show microstructures that arise when the hot steel cools and that are a measure of the quality of the material. The samples are taken from the annealed material, ground, polished and etched in 3% alcoholic nitric acid to reveal micro-homogeneity. To evaluate the quality, the sample is viewed under a microscope at a magnification of 500:1 and assigned to an image of a guideline series. FIG. 1 shows the guideline series with the steps from GA1 to GF5. These 30 images were digitized, one image section was cut out of each in a standardized manner, and each section was rotated in 5 degree increments, so that a total of 2,160 image data sets were available. The images with rotation angles from 0 to 135 degrees were assigned to the training data. This gives a total of 840 images (28 images for each of the 30 classes). In addition, test data were generated that contain the images with an angle greater than 135 degrees, i.e. a total of 1,320 images.
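The image counts follow directly from the rotation scheme, as the following short check illustrates. The construction of the class labels as "G" followed by a letter A to F and a digit 1 to 5 is an assumption made here; the text itself only names individual labels such as GA1 and GF5.

```python
# 6 x 5 = 30 classes; the label pattern is an assumed convention.
classes = [f"G{letter}{digit}" for letter in "ABCDEF" for digit in range(1, 6)]

angles = list(range(0, 360, 5))                   # rotation in 5 degree steps
train_angles = [a for a in angles if a <= 135]    # 0, 5, ..., 135
test_angles = [a for a in angles if a > 135]      # 140, 145, ..., 355

n_total = len(classes) * len(angles)
n_train = len(classes) * len(train_angles)
n_test = len(classes) * len(test_angles)

print(len(angles), len(train_angles), len(test_angles))   # 72 28 44
print(n_total, n_train, n_test)                           # 2160 840 1320
```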

The exemplary classification of these 840 images in the training phase proceeded as described above using fuzzy c-means clustering: First, a wavelet transformation was performed for each image data set to obtain a set of wavelet coefficients for each training image data set. Then a set of statistical measures was determined for each training image data set from the associated set of wavelet coefficients. Subsequently, all sets of statistical measures of the training image data sets were classified to form clusters of the sets of statistical measures, each cluster having a cluster center, each cluster center being associated with one or more of the classes of patterns.

Figures 2A and 2B show a tabular overview of the training phase using 6 wavelet features. The recognition rate for pure wavelet features was 72.9%; 27.1% of the images (i.e. their sets of statistical measures) were assigned to an incorrect class. Only 24 clusters were unique; 2 clusters contained 2 classes, 2 clusters contained 3 classes, one cluster contained 4 classes and one contained 5 classes.

For example, cluster #9 shows a possible membership of both class GC5 and class GD3, with 56 sets of statistical measures (belonging to different images) being assigned to cluster #9.

From the training phase it is known which clusters consist of several classes. According to the invention, in a further step, the previously combined classes are separated by reclassifying those clusters that consist of several classes. The classification is carried out individually for each cluster. FIG. 3 shows the procedure schematically. First of all, it is assumed that the images used for training purposes consist uniquely of k classes, i.e. that each image can be uniquely assigned to a single class. These training image data are then classified according to steps b) through d) of the training phase, resulting in a number of k clusters of the sets of statistical measures of the training image data sets in step 302. In other words, the image data of the training image data sets comprise a number k of different classes of patterns, the number of clusters formed by the classification of the training image data sets corresponding to the number k.

For the uniquely identified clusters, which can only be assigned to one class, the knowledge base is updated accordingly. Illustratively, the knowledge base stores the cluster centers of the sets of a number p (p > 0) of statistical measures, each cluster center being uniquely assigned a class. Each set of statistical measures can be represented graphically as a point in a p-dimensional space, with the associated cluster center also lying in this p-dimensional space. Any further set of statistical measures (for example, from a test data set) that lies near this cluster center can thus be identified as belonging to that cluster center.

However, as FIG. 3 shows, there is also the possibility that individual clusters (for example cluster 2 and cluster m) have been assigned not to one class but to several classes. For these non-unique clusters, a reclassification is now performed in step 304 to form further clusters of sets of statistical measures for each cluster from step 302 that was not uniquely assigned, each further cluster having a further cluster center, each further cluster center being assigned to one or more of the classes of patterns. If cluster 2 consists of n different classes, a classification into n different classes is performed. Similarly, for cluster m consisting of l different classes, a classification into l different classes is performed. This results in further clusters in step 306. For the now uniquely identified clusters, which in turn can only be assigned to one class, the knowledge base is updated accordingly. Depending on the required recognition rate, this method can be continued in cascaded fashion for clusters that cannot be assigned uniquely until a desired recognition rate has been reached. This creates a new knowledge base for each cluster that is reclassified.
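The cascaded construction of the knowledge base can be sketched as follows. This is a minimal sketch under assumptions, not the implementation of the embodiment: the nested dictionary used as knowledge base, the function names `fcm` and `train_cascade`, the compact fuzzy c-means with m = 2, and the fixed depth limit `max_depth` (used here in place of a target recognition rate) are all choices made for illustration.

```python
import numpy as np

def fcm(x, k, m=2.0, eps=1e-5, max_iter=300, seed=0):
    """Compact fuzzy c-means; returns centers and hard cluster labels."""
    rng = np.random.default_rng(seed)
    u = rng.random((len(x), k))
    u /= u.sum(axis=1, keepdims=True)
    for _ in range(max_iter):
        w = u ** m
        centers = (w.T @ x) / w.sum(axis=0)[:, None]
        d = np.fmax(np.linalg.norm(x[:, None] - centers[None], axis=2), 1e-12)
        inv = d ** (-2.0 / (m - 1.0))
        u_new = inv / inv.sum(axis=1, keepdims=True)
        done = np.abs(u_new - u).sum() < eps
        u = u_new
        if done:
            break
    return centers, u.argmax(axis=1)

def train_cascade(x, y, depth=0, max_depth=3):
    """Build a nested knowledge base (assumed structure).

    x: feature vectors, shape (N, p); y: class label of each vector.
    """
    x, y = np.asarray(x, dtype=float), np.asarray(y)
    k = len(np.unique(y))                 # one cluster per class present
    centers, labels = fcm(x, k)
    node = {"centers": centers, "classes": [], "children": {}}
    for j in range(k):
        members = labels == j
        present = sorted(set(y[members].tolist()))
        node["classes"].append(present)   # classes found in cluster j
        # Non-unique cluster: reclassify its members with as many
        # further clusters as it contains classes.
        if len(present) > 1 and depth < max_depth:
            node["children"][j] = train_cascade(
                x[members], y[members], depth + 1, max_depth)
    return node
```

Each entry in `children` corresponds to the new knowledge base created for one reclassified cluster; clusters without an entry are either unique or were left unresolved at the depth limit.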

Since it is known from the training phase which clusters were not unambiguously assigned and for which a new knowledge base was generated, the method can be applied in the same way in the test execution. By way of example, "test execution" is understood to mean the application of the method for classifying real images.

This is shown in FIG. 4, where in step 400 test image data sets are acquired, to which the classification knowledge base generated in FIG. 3, step 300, is applied. In detail, a wavelet transformation of the test image data sets is first performed to obtain a set of wavelet coefficients for each test image data set, whereupon a set of statistical measures is determined for each test image data set from the associated set of wavelet coefficients. Thereafter, the assignments of cluster centers stored in the knowledge base generated in accordance with Figure 3 are used to classify all sets of statistical measures of the test image data sets to form clusters of the sets of statistical measures of the test image data sets. This results in different clusters 1..k in step 402.

Subsequently, as shown in FIG. 4, the clusters known from training to have multiple assignments are reclassified, the reclassification being performed using the previously determined further cluster centers. That is, the new knowledge bases generated in step 304 are used for this purpose. In step 404, this results in clusters of sets of statistical measures (i.e. of individual test images) that can be uniquely assigned to corresponding classes.
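The two-stage lookup in the test phase can be sketched as follows. This is a minimal sketch under assumptions: the nested dictionary layout of the knowledge base, the function name `classify` and all numerical values are hypothetical, and a nearest-center assignment is used, which corresponds to choosing the cluster with the highest fuzzy membership. Only the class names GC5 and GD3 of the ambiguous cluster are taken from the example above.

```python
import numpy as np

def classify(node, features):
    """Assign a feature vector to a class using a cascaded knowledge base."""
    f = np.asarray(features, dtype=float)
    # Stage 1: the nearest cluster center has the highest membership.
    j = int(np.argmin(np.linalg.norm(node["centers"] - f, axis=1)))
    child = node["children"].get(j)
    if child is not None:
        # Cluster known from training to be non-unique: reclassify with
        # the further cluster centers stored for this cluster.
        return classify(child, f)
    return node["classes"][j]

# Hand-built example knowledge base (all numbers are made up).
knowledge_base = {
    "centers": np.array([[0.0, 0.0], [10.0, 10.0]]),
    "classes": [["GA1"], ["GC5", "GD3"]],
    "children": {
        1: {
            "centers": np.array([[9.0, 10.0], [11.0, 10.0]]),
            "classes": [["GC5"], ["GD3"]],
            "children": {},
        }
    },
}

print(classify(knowledge_base, [0.5, 0.2]))    # ['GA1']
print(classify(knowledge_base, [10.8, 9.9]))   # ['GD3']
```

The function returns the list of classes stored for the final cluster, which contains exactly one class whenever that cluster was uniquely assigned during training.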

Referring to Figs. 2A and 2B, the separate classification was performed according to the method of Fig. 3 for the six hitherto ambiguous clusters of wavelet features. One cluster could be assigned completely correctly; for the others, a significant reduction of the remaining errors was recorded because the classes were better separated. FIG. 5 shows the non-unique clusters before and after the second classification. Of the previously 228 incorrect assignments, only 31 remain. This results in an improvement of the recognition rate from 72.9% to 96.3% for the cascaded classification of the training data from the wavelet features.
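The stated training recognition rates follow from the number of incorrect assignments among the 840 training images, as the following short calculation shows. The helper name `recognition_rate` is chosen here for illustration.

```python
def recognition_rate(n_images, n_errors):
    """Share of correctly assigned images in percent."""
    return 100.0 * (n_images - n_errors) / n_images

n_train = 840
first_stage = recognition_rate(n_train, 228)    # before reclassification
second_stage = recognition_rate(n_train, 31)    # after reclassification

print(round(first_stage, 1))    # 72.9
print(round(second_stage, 1))   # 96.3
```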

In a test phase, the above-mentioned 1,320 micrographs were classified. In the first stage, 72.5% were correctly assigned; 362 images were wrongly classified. Subsequently, as shown in Figure 4, the clusters known from training to have multiple assignments were reclassified. In the second stage, another 321 images could be assigned correctly. The remaining 42 correspond to an error rate of 3.2% and a recognition rate of 96.8%.

Thus, the method of a two-stage classification with fuzzy c-means clustering with the statistical features from the coefficients of wavelet transformation has the advantage that a very good classification result can be reliably achieved.

Claims


1. A method of classifying patterns in image data sets, the classification comprising fuzzy c-means clustering, the method comprising: a) acquiring training image data sets, wherein the training image data sets contain image data of different classes of patterns, b) wavelet transforming the training image data sets to obtain a set of wavelet coefficients for each training image data set, c) determining a set of statistical measures for each training image data set from the associated set of wavelet coefficients, d) classifying all sets of statistical measures of the training image data sets to form clusters of the sets of statistical measures, each cluster having a cluster center, each cluster center being associated with one or more of the classes of patterns, e) determining the non-unique clusters which are associated with multiple classes of patterns, f) reclassifying the sets of statistical measures of the training image data sets of the clusters not uniquely assigned to form further clusters of the sets of statistical measures for each non-unique cluster, each further cluster having a further cluster center, each further cluster center being associated with one or more of the classes of patterns.

2. The method of claim 1, further comprising: a) acquiring test image data sets, b) wavelet transforming the test image data sets to obtain a set of wavelet coefficients for each test image data set, c) determining a set of statistical measures for each test image data set from the associated set of wavelet coefficients, d) classifying all sets of statistical measures of the test image data sets to form clusters of the sets of statistical measures of the test image data sets, wherein the classification is done using the previously determined cluster centers, e) reclassifying the clusters of the test image data sets whose associated cluster centers were associated with multiple classes of the training image data sets, the reclassifying being done using the previously determined further cluster centers.

3. The method of claim 1, wherein the training image data sets contain image data of a first number of different classes of patterns, wherein the number of clusters formed by the classification of the training image data sets corresponds to the first number, wherein for a non-unique cluster

- this cluster is assigned a second number of different classes of patterns,

- when this cluster is reclassified, the number of further clusters corresponds to the second number.

4. The method of claim 1, wherein steps e) and f) are repeated in cascaded fashion until the number of uniquely assignable clusters has exceeded a predetermined minimum value.

5. The method of claim 4, wherein the minimum value is at least 95%.

6. The method of claim 1, wherein the statistical measures are invariant to rotation of the image data sets.

7. The method of claim 5, wherein the statistical measures include an energy value and / or an entropy value and / or a standard deviation.

8. The method of claim 1, further comprising determining the set of statistical measures, wherein steps d)-f) are performed for different predetermined sets of statistical measures, wherein the set of statistical measures for which the recognition rate is maximized is determined.

9. A computer program product having computer-executable instructions for performing the method steps of the preceding claims.