AU2003215441B2 - Method of identifying endmember spectral values from hyperspectral image data - Google Patents

Method of identifying endmember spectral values from hyperspectral image data Download PDF

Info

Publication number
AU2003215441B2
AU2003215441B2 AU2003215441A AU2003215441A AU2003215441B2 AU 2003215441 B2 AU2003215441 B2 AU 2003215441B2 AU 2003215441 A AU2003215441 A AU 2003215441A AU 2003215441 A AU2003215441 A AU 2003215441A AU 2003215441 B2 AU2003215441 B2 AU 2003215441B2
Authority
AU
Australia
Prior art keywords
data
endmember
endmembers
squares
estimates
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
AU2003215441A
Other versions
AU2003215441A1 (en
Inventor
Mark Berman
Robert Aidan Dunne
Harri Tapio Kiiveri
Ryan Lagerstrom
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Commonwealth Scientific and Industrial Research Organization CSIRO
Original Assignee
Commonwealth Scientific and Industrial Research Organization CSIRO
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from AUPS1552A external-priority patent/AUPS155202A0/en
Application filed by Commonwealth Scientific and Industrial Research Organization CSIRO filed Critical Commonwealth Scientific and Industrial Research Organization CSIRO
Priority to AU2003215441A priority Critical patent/AU2003215441B2/en
Publication of AU2003215441A1 publication Critical patent/AU2003215441A1/en
Application granted granted Critical
Publication of AU2003215441B2 publication Critical patent/AU2003215441B2/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/10Terrestrial scenes
    • G06V20/13Satellite images

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Astronomy & Astrophysics (AREA)
  • Remote Sensing (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Image Processing (AREA)
  • Investigating Or Analysing Materials By Optical Means (AREA)

Description

WO 03/088130 PCT/AU03/00409 METHOD OF IDENTIFYING ENDMEMBER SPECTRAL VALUES FROM HYPERSPECTRAL IMAGE DATA FIELD OF THE INVENTION [0001] The present invention relates to a method of identifying endmember spectral values from multispectral or hyperspectral image data, which is particularly useful in identifying different materials from multispectral or hyperspectral images.
BACKGROUND TO THE INVENTION [0002] It is known to collect remote sensing data to provide images of scenes to aid in broad scale discrimination of various features of land scanned including identifying mineral deposits and vegetation. Two examples of hyperspectral scanners are NASA's 224 band AVIRIS, which has bands spaced about every 10 nanometers in a range from 400 to 2500 nanometers, and the 128 band Australian commercial scanner, HyMap, which covers a similar wavelength range with about 16 nanometer resolution.
[0003] A goal is therefore to identify the components of each pixel in the hyperspectral image. This can be done by comparison with a library of spectra of "pure" materials. "Pure" materials in a hyperspectral image are often termed endmembers.
[0004] Depending on the resolution of the image obtained from the spectral scanner, an individual pixel may represent an area ranging in size from 5 to 10 meters across in images from an aircraft scan or 10 to 30 meters across from a satellite scan. Each pixel therefore will relate to a portion of a scene which will usually include a mixture of material components. It is not uncommon to find that not all of the pure spectral representations of endmembers are present in a scene.
[0005] Images are also subject to distortion due to noise from various sources including instruments, atmospheric interference, viewing geometry and topography of the area scanned. Corrections for these distortions are still not sufficiently accurate to allow for reliable comparisons to reference libraries. Also, many remotely sensed scenes contain materials not in libraries. Therefore, there are problems with matching spectra with ground-based libraries. There is consequently interest in identifying the component materials represented in a scanned scene, without reference to a library.
[0006] Similar problems occur in other fields where it is desired to determine WO 03/088130 PCT/AU03/00409 -2endmembers from multispectral, hyperspectral or other data where a signal is detected on a number of channels or bands. For example a similar problem occurs in the analysis ofproteomics and genomics array data where the signal represents cell or organism response across a range of proteins, cDNAs or oligonucleotides. In this context, each protein, cDNA or oligonucleotide is regarded as being equivalent to a wavelength or band in the hyperspectral or multispectral context. Similar problems also occur in fluorescence imaging such as fluorescence microscopy.
[0007] In the art the terms multispectral and hyperspectral, multidimensional and hyperdimensional etc. are used, with "hyper" generally meaning more than "multi". This distinction is not relevant for the purposes of this invention. For convenience, throughout the rest of the specification the term "multispectral" will be used to refer to both multispectral and hyperspectral data. The term "multidimensional" and other "multi" terms will likewise be used to mean more than one dimension.
[0008] Current solutions of finding endmembers often involve "whitening" or "sphering" the data and then fitting to the data a multidimensional simplex having a number of vertices equal to the number of endmembers.
[0009] The bands of a multispectral image are usually highly correlated. "Whitening" involves transforming the data to be uncorrelated with a constant variance and preferably an approximately Normal distribution of errors. It is also desirable to compress the dimensionality of the data to reduce calculation time.
[0010] A widely used algorithm to "whiten" the data is to compress the information into a smaller number of bands by use of the Minimum Noise Fraction (MNF) transform. This is disclosed in Green, Berman, Switzer, and Craig, M.
(1988). A transformation for ordering multispectral data in terms of image quality with implications for noise removal. IEEE Transactions on Geoscience and Remote Sensing, 26:65-74.
[0011] Simplex fitting using the pixel purity index (PPI) method is disclosed in Boardman, J. Kruse, and Green, R. (1995) Mapping target signatures via partial unmixing of AVIRIS data. In Green, R. (editor), Summaries of the Fifth Annual JPL Airborne Earth Science Workshop, volume 1, AVIRIS Workshop, pp 23-26. JPL Publ.
95-1, NASA, Pasadena, CA.
WO 03/088130 PCT/AU03/00409 -3- [0012] One of the main disadvantages of Boardman's method is that it requires considerable manual intervention in processing.
[0013] An alternative to Boardman's method is the N-FINDR algorithm by Winter, M.
(1999). Fast autonomous spectral endmember determination in hyperspectral data. In Proceedings of the 13 th International Committee on Applied Geologic Remote Sensing, Vancouver, vol. 2, pp 337-334. This process is fully automated. After transformation to dimensional subspace, this algorithm finds the M-dimensional simplex of maximum volume constrained to lie within the data cloud. Another alternative is to construct the minimum volume simplex enclosing the data cloud, which is provided by Craig, M. (1994). Minimum-volume transforms for remotely sensed data. IEEE Transactions on Geoscience and Remote Sensing, 32:542-552.
[0014] These solutions cannot satisfactorily deal with the common situation where pure or almost pure endmembers are absent from the scene. Furthermore, they do not deal well with noise in the data.
SUMMARY OF THE PRESENT INVENTION [0015] It is an object of the present invention to provide an improved method of identifying endmembers spectral values from multispectral data.
[0016] According to the present invention there is provided a method of identifying endmember spectral values from multispectral image data, where each multispectral data value is equal to a sum of mixing proportions of each endmember spectrum, said method including the steps of: processing the data to obtain a multidimensional simplex having a number of vertices equal to the number of endmembers, the position of each vertex representing a spectrum of one of the endmembers, wherein processing the data includes: providing starting estimates of each endmember spectrum for each image data value; estimating the mixing proportions for each data value from estimates of the spectra of all the endmembers; estimating the spectrum of each endmember from estimates of the mixing proportions of the spectra of all the endmembers for each image data value; WO 03/088130 PCT/AU03/00409 -4repeating estimation steps until a relative change in the regularised residual sum of squares is sufficiently small, the regularised residual sum of squares including a term which is a measure of the size of the simplex.
[0017] Preferably the term used in the regularised residual sum of squares is the sum of the squared distances between all of the simplex vertices.
[0018] Preferably the step of providing the starting estimates includes choosing starting points with a high pixel purity index score. More preferably the starting estimates are well separated.
[0019] Preferably the relative change in the regularised residual sum of the squares is regarded as sufficiently small when the ratio of successive values of regularised residual sum of squares is less than a tolerance. Preferably the tolerance is 0.99999.
[0020] Preferably processing the data includes whitening the data. Preferably whitening the data includes conducting a transform of the data into data that is not band correlated. Preferably processing the data includes removing bands that do not have a high signal to noise ratio.
[0021] Preferably the step of estimating the spectrum of each endmember is conducted using a linear estimation technique.
[0022] Preferably the step of estimating the mixing proportions is conducted using a quadratic programming minimisation technique.
BRIEF DESCRIPTION OF THE DRAWINGS [0023] In order to provide a better understanding a preferred cmbodiment of the present invention will now be described in detail, by way of example only, in relation to Figure 1 which is a diagrammatic representation of a simple example of the use of the method of the present invention.
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS [0024] Multispectral image data is obtained from a multispectral scanner, such as the AVIRIS airborne scanner. A typical scan for mineral applications includes a short wave infra red scan with wavelengths in the region of 2,000 to 2,500 nanometers. This WO 03/088130 PCT/AU03/00409 spectral range is useful for exhibiting distinctive shapes for important clay minerals.
Typically this will provide 10's of thousands to millions of pixels or even many more.
[0025] A MNF transform is performed on the relevant bands of data to produce variables which are uncorrelated and approximately Normally distributed with an estimated error variance of 1. It is usual to retain MNF bands with the highest signal to noise ratios.
[0026] We let d be the number of MNF bands retained, N is the number of pixels, and M is the number of endmembers (assumed to be less than or equal to d+l).
[0027] It is convenient to think of the MNF data as an Nx d matrix, whose ith row is written as Xi, and whosejth column is written as xj. Similarly, it will also be convenient to think of the unknown endmembers as an Mx d matrix, whose kth row is written as Ek and whosejth column is written as ej.
[0028] The MNF transformed data can be represented by the following formula:
M
X
i pkEk (1) k=1 [0029] Here a is an error vector, and thep i k are mixing proportions that satisfy the constraints of:
M
pk pik (2) k=1 [0030] If the error term is ignored then and tell us that the data lie inside a simplex in dimensional space, and the MNF representations of the M endmembers are at the vertices of the simplex.
[0031] A least squares minimisation of equation is conducted subject to the constraints and a term that constrains the size of the simplex, while being faithful to the model. It can be shown that without the constraints the solution converges to a simplex that is too large. This problem may be solved iteratively: given estimates of the WO 03/088130 PCT/AU03/00409 -6endmember spectra, the proportions for each pixel are estimated, which is a quadratic programming problem; and given estimates of the proportions, the endmember MNF spectra are estimated, which is a linear estimation problem.
[0032] The constraint is the addition of a term to the residual sum of squares which is a measure of the size of the simplex. A convenient term to add is the sum of squared distances between all of the simplex vertices. It can easily be shown that this is proportional to the sum of the variances of the simplex vertices over the d dimensions which is a quadratic function of the vertices and therefore computationally convenient.
The regularised solution minimises: d R=Z Pe)x- Pe) e/Dey}, (3) j=1 where is small, and where D=IM 1 T/M.
[0033] R is the regularised residual sum of squares; P is a Nx Mmatrix of proportions of M endmembers for all N pixels; IM is the Mx M identity matrix and 1 is the vector of length M, where every entry equals 1.
[0034] Formula is minimised iteratively.
[0035] In what follows, Pt will denote the estimated value ofP after the ith iteration, and either ej,I, or Ek,i, k= will denote the estimated endmembers after the Ith iteration.
1. Let ej,, denote the starting values for the algorithm and let 1=1.
2. Let Pt denote the value of P minimising d R, /Dem-l} (4) j=1 subject to This is done using a quadratic programming algorithm.
There are two things to note here. First, the second term.in is independent of P, and so only the first term needs to be minimised in this step. Second, we can WO 03/088130 WO 03/88130PCT/ATJO3/00409 -7separate the minimisation into N separate quadratic programming niinimisations for the data at, each of the N pixels. Specifically, for i1.
we find pikk-- which minimise Xi -Y P~EkJ] Ekl, kk=i subject to Let R!,I,mi denote the minimum value of R4, 1 achieved., 3. Let denote the value of ey minimising d
R
1 2 {(xi~jplej) T(Xiplej) +Xj~I 6 This minimisation can be separated into d separate minimisations, and straightforward matrix algebra can be used to show that e 1 ,I (PITPrFfD) 4 PITxij--l...,d (7) Let RI,2,min denote the minimum value of RI,2 achieved. It can also be shown that d Rlt.i 1 X]{PI(PIP+X (8) ~j=1 where IN is the N x N identity matrix.
4. Let ri= R,2,minIR. tn (9) Because each step in the iteration reduces R, we must have R12mIn R 1 or r1<1. When rj gets very close to 1, the algorithm stops. More specifically: if rt< tol, let 1=1+1 and go to 2. Otherwise, stop and let or equivalently E1,1, .,A1 be the final estimates of the endmembers, and let WO 03/088130 PCT/AU03/00409 -8the ith row of Pt give the final estimates of the proportions of each of the estimated endmembers present in the ith pixel.
[0036] The process is terminated when a ratio of successive values of the regularised residual sum of squares is less than a tolerance (tol). The default tolerance value is 0.99999. Using this value in typical examples, 20 to 100 iterations are required until the process stops.
[0037] Using this method the projections of all the data onto this hyperplane need not lie inside the simplex.
[0038] Figure 1 shows a simulated toy example with a true simplex as a solid line, an (unregularised) least squares solution as a dotted line and a regularised least squares solution as a broken line. The regularised least squares solution provides much better estimates of the true endmembers.
[0039] Most of the information about the simplex is contained in data on or near boundaries of the data cloud. So if only data nearest the convex hull of the data cloud is used computation becomes quicker. In high dimensional problems, points at or near the ends of random one dimensional projections of the data can be used. Alternatively points only on or near two dimensional convex hulls of all d(d-1)/2 MNF band pairs are used.
[0040] The starting points for the iterative process can determine the outcome of the iterative process. Good starting points should be well separated in MNF space. Points with high PPI scores can be useful. The PPI scores are the number of times the data at each pixel are at or near one of the ends of these projections.
[0041] Apart from the estimates of the endmembers an intennediate product of the algorithm is the endmember proportions in each pixel. The proportions give a meaningful idea of how much each endmember is represented in each pixel (assuming that mixing is proportional to area). This can be represented as images/maps. A particularly useful diagnostic is the maximum proportion of each estimated endmember in the scene. The lower the maximum proportions for each endmember spectrum the further the estimated endmember is from the data cloud and the confidence in the estimate will be correspondingly smaller. For endmember estimates having a maximum WO 03/088130 PCT/AU03/00409 -9proportion less than 0.5 it becomes difficult to estimate the true endmember spectrum.
[0042] Another useful by-product of the algorithm is an image showing the contribution of each pixel to the regularised residual sum of squares. If there are some large residuals and especially if they are spatially clustered it is an indication that the model is not fitting the data adequately, either because the chosen value of M is too small or if only the data on or near the boundaries of the data cloud are used then important observations may have been omitted from the algorithm. Additional observations can be added to the data used and the algorithm re-run to see whether fitting can be improved.
[0043] The present invention may be readily adapted to identify endmember spectral values from multispectral data from a variety of sources, such as remote sensing data, array data, etc.
[0044] Obvious adaptions of the preset invention to make the hereindescribed method suitable to identify endmembers from a nominated source are intended to fall within the scope of the preset invention.

Claims (11)

1. A method of identifying endmember spectra values from multispectral image data, where each multispectral data value is equal to a sum of mixing proportions of each endmember spectrum, said method including the steps of: processing the data to obtain a multidimensional simplex having a number of vertices equal to the number of endmembers, the position of each vertex representing a spectrum of one of the endmembers, wherein processing the data includes: providing starting estimates of each endmember spectrum for each image data value; estimating the mixing proportions for each data value from estimates of the spectra of all the endmembers; estimating the spectrum of each endmember from estimates of the mixing proportions of the spectra of all the endmembers for each image data value; repeating estimation steps until a relative change in the regularised residual sum of squares is sufficiently small, the regularised residual sum of squares including a term which is a measure of the size of the simplex.
2. A method according to claim 1, wherein the term used in the regularised residual sum of squares is the sum of the squared distances between all of the simplex vertices.
3. A method according to claim 1, wherein the step of providing the starting estimates includes choosing starting points with a high pixel purity index score.
4. A method according to claim 1, wherein more preferably the starting estimates are well separated.
A method according to claim 1, wherein the relative change in the regularised residual sum of the squares is regarded as sufficiently small when the ratio of successive values of regularised residual sum of squares is less than a tolerance.
6. A method according to claim 1, wherein the tolerance is 0.99999. WO 03/088130 PCT/AU03/00409 -11-
7. A method according to claim 1, wherein processing the data includes whitening the data.
8. A method according to claim 7, wherein whitening the data includes conducting a transform of the data into data that is not band correlated.
9. A method according to claim 1, wherein processing the data includes removing bands that do not have a high signal to noise ratio.
A method according to claim 1, wherein the step of estimating the spectrum of each endmember is conducted using a linear estimation technique.
11. A method according to claim 1, wherein the step of estimating the mixing proportions is conducted using a quadratic programming minimisation technique.
AU2003215441A 2002-04-05 2003-04-04 Method of identifying endmember spectral values from hyperspectral image data Ceased AU2003215441B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2003215441A AU2003215441B2 (en) 2002-04-05 2003-04-04 Method of identifying endmember spectral values from hyperspectral image data

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
AUPS1552 2002-04-05
AUPS1552A AUPS155202A0 (en) 2002-04-05 2002-04-05 Methods of identifying endmember spectral values for hyperspectral image data
AU2003215441A AU2003215441B2 (en) 2002-04-05 2003-04-04 Method of identifying endmember spectral values from hyperspectral image data
PCT/AU2003/000409 WO2003088130A1 (en) 2002-04-05 2003-04-04 Method of identifying endmember spectral values from hyperspectral image data

Publications (2)

Publication Number Publication Date
AU2003215441A1 AU2003215441A1 (en) 2003-10-27
AU2003215441B2 true AU2003215441B2 (en) 2007-04-05

Family

ID=34105067

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2003215441A Ceased AU2003215441B2 (en) 2002-04-05 2003-04-04 Method of identifying endmember spectral values from hyperspectral image data

Country Status (1)

Country Link
AU (1) AU2003215441B2 (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998002842A1 (en) * 1996-07-12 1998-01-22 The Government Of The United States Of America As Represented By The Secretary Of The Navy Intelligent hypersensor processing system (ihps)
US6208752B1 (en) * 1998-03-12 2001-03-27 The United States Of America As Represented By The Secretary Of The Navy System for eliminating or reducing exemplar effects in multispectral or hyperspectral sensors

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998002842A1 (en) * 1996-07-12 1998-01-22 The Government Of The United States Of America As Represented By The Secretary Of The Navy Intelligent hypersensor processing system (ihps)
US6208752B1 (en) * 1998-03-12 2001-03-27 The United States Of America As Represented By The Secretary Of The Navy System for eliminating or reducing exemplar effects in multispectral or hyperspectral sensors

Also Published As

Publication number Publication date
AU2003215441A1 (en) 2003-10-27

Similar Documents

Publication Publication Date Title
CA2478900C (en) Method of identifying endmember spectral values from hyperspectral image data
Zhang et al. A tensor decomposition-based anomaly detection algorithm for hyperspectral image
Alparone et al. Intersensor statistical matching for pansharpening: Theoretical issues and practical solutions
Huang et al. Spatial and spectral image fusion using sparse matrix factorization
Boardman Analysis, understanding, and visualization of hyperspectral data as convex sets in n space
Lennon et al. Independent component analysis as a tool for the dimensionality reduction and the representation of hyperspectral images
Boardman Geometric mixture analysis of imaging spectrometry data
Keshava A survey of spectral unmixing algorithms
Rogge et al. Integration of spatial–spectral information for the improved extraction of endmembers
Bioucas-Dias et al. Estimation of signal subspace on hyperspectral data
Nascimento et al. Hyperspectral unmixing algorithm via dependent component analysis
Settle On constrained energy minimization and the partial unmixing of multispectral images
US6958466B1 (en) Method and system for detecting targets known up to a simplex from multi-spectral and hyper-spectral imagery employing the normal compositional model
CN115144357A (en) Mineral identification method based on airborne mid-infrared hyperspectral remote sensing data of mining area
Boardman et al. Exploring the spectral variability of the earth as measured by AVIRIS in 1999
Nakhostin et al. Joint anomaly detection and spectral unmixing for planetary hyperspectral images
AU2003215441B2 (en) Method of identifying endmember spectral values from hyperspectral image data
Beaven et al. Comparison of Gaussian mixture and linear mixture models for classification of hyperspectral data
CN111046844B (en) Hyperspectral image classification method based on neighborhood selection constraint
Olson et al. A study of anomaly detection performance as a function of relative spectral abundances for graph-and statistics-based detection algorithms
Endsley Spectral unmixing algorithms based on statistical models
CN111639543A (en) Hyperspectral remote sensing image wetland classification method based on Markov random field
Berman et al. ICE: an automated statistical approach to identifying endmembers in hyperspectral images
Vincent et al. Subpixel target detection in hyperspectral imaging
CN117557918B (en) Remote sensing image spectrum similar ground object identification method based on matrix condition number

Legal Events

Date Code Title Description
FGA Letters patent sealed or granted (standard patent)
MK14 Patent ceased section 143(a) (annual fees not paid) or expired