CN107909039B - High-resolution remote sensing image earth surface coverage classification method based on parallel algorithm - Google Patents

High-resolution remote sensing image earth surface coverage classification method based on parallel algorithm Download PDF

Info

Publication number
CN107909039B
CN107909039B CN201711138873.9A CN201711138873A CN107909039B CN 107909039 B CN107909039 B CN 107909039B CN 201711138873 A CN201711138873 A CN 201711138873A CN 107909039 B CN107909039 B CN 107909039B
Authority
CN
China
Prior art keywords
remote sensing
sensing image
classification
resolution remote
surface coverage
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201711138873.9A
Other languages
Chinese (zh)
Other versions
CN107909039A (en
Inventor
钟燕飞
赵�济
吕鹏远
王晶
马爱龙
刘艳飞
伍丝琪
张良培
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan University WHU
Original Assignee
Wuhan University WHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan University WHU filed Critical Wuhan University WHU
Priority to CN201711138873.9A priority Critical patent/CN107909039B/en
Publication of CN107909039A publication Critical patent/CN107909039A/en
Application granted granted Critical
Publication of CN107909039B publication Critical patent/CN107909039B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/10Terrestrial scenes
    • G06V20/13Satellite images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/187Segmentation; Edge detection involving region growing; involving region merging; involving connected component labelling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10032Satellite or aerial image; Remote sensing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30181Earth observation
    • G06T2207/30184Infrastructure

Abstract

The invention discloses a high-resolution remote sensing image earth surface coverage classification method based on a parallel algorithm, which comprises the following steps: s1, segmenting the high-resolution remote sensing image data according to the number of the computers to obtain segmented high-resolution remote sensing image blocks; s2, distributing all high-resolution remote sensing image blocks to m processors based on an OpenMP parallel framework, and executing earth surface coverage classification processing concurrently; and S3, merging all the high-resolution remote sensing image block data according to a data segmentation principle to obtain a final earth surface coverage classification result. The method automatically segments the data according to the size of the data and the condition of using the memory of the computer, organizes the classification algorithm flow by using the configuration file, realizes the parallel classification algorithm, and thus can be suitable for the high-resolution earth surface coverage mapping task with extremely large data volume and finely divided ground object space.

Description

High-resolution remote sensing image earth surface coverage classification method based on parallel algorithm
Technical Field
The invention belongs to the technical field of remote sensing image processing, and particularly relates to a high-resolution remote sensing image earth surface coverage classification method based on a parallel algorithm.
Background
The national social and economic construction of land resource matters is a foundation on which human beings live, and with the development of a urbanization process, the land surface coverage/utilization which reflects the natural attributes and social attributes of land object types on the land surface is determined, so that the method has important effects on earth system modes, global environment changes, national soil protection, urban development decisions, hydraulic engineering construction and the like. Therefore, in the past decades, international society and various organizations have paid high attention to research on ground surface coverage/utilization and changes thereof, research projects are frequently on a few occasions, such as a core project of land utilization/ground surface coverage change of an international biosphere plan (IGBP) and an international global environmental change human factor plan (IHDP), a core project of ground surface coverage/utilization change of NASA in the united states, a project of national 863 plan focus project of global ground surface coverage remote sensing mapping and key technology research, and the like, and corresponding ground surface coverage products are successively released as research results, and according to relevant documents, about 21 sets of ground surface coverage products are currently available on a global scale, and at least 43 sets of ground surface coverage products are currently available on an area scale. Meanwhile, for the needs of construction and development, the country successively develops a second national land survey and a first national situation census project in a large scale, wherein the acquisition of land surface coverage/utilization is an important task for land survey and geographical national situation monitoring. In conclusion, large-scale global or regional surface coverage mapping has important research significance and application value.
The present global ground surface covering product with the highest resolution is a global ground surface covering product with 30 m spatial resolution which is manufactured based on Landsat TM data in China. Compared with Landsat (TM) data used by a 30-meter ground surface covering product, the high-resolution remote sensing image in the application of meter-level spatial resolution ground surface covering mapping has the characteristics, and in order to cover the same area, the high-resolution remote sensing image needs larger size and more pixels, so that the data volume has higher requirements on scale and fine division of ground object space, and the global ground surface covering mapping task based on the high-resolution remote sensing image becomes very difficult. The invention aims to provide a full-automatic large-scale earth surface coverage classification method based on parallel computation aiming at a high-resolution remote sensing image.
Disclosure of Invention
Aiming at the problems of large data volume and large calculated amount in the application of high-resolution remote sensing image earth surface coverage mapping, the invention adopts a processing mode of data parallel and algorithm calculation parallel to extract the spectral characteristics of the remote sensing image, fuses the existing maximum likelihood and support vector machine algorithm, and then carries out post-processing on the intermediate result by using a connected region labeling algorithm, thereby being capable of quickly obtaining a high-resolution earth surface coverage classification result.
In order to achieve the purpose, the technical scheme of the invention is a high-resolution remote sensing image earth surface coverage classification method based on a parallel algorithm, which comprises the following steps:
step one, segmenting the high-resolution remote sensing image data according to the number of computers to obtain segmented high-resolution remote sensing image blocks, and simultaneously recording the coordinates of the starting and ending positions of each high-resolution remote sensing image block in the following way,
and setting h as the length of the image and w as the width of the image, the width of the high-resolution remote sensing image block after segmentation
Figure GDA0002227951100000021
Length H-W r, where the ratio r-H/W, Mey is the memory number, StIs a data type, SsA safety factor is set;
distributing all the high-resolution remote sensing image blocks to Mey processors based on an OpenMP parallel framework, and executing earth surface coverage classification processing concurrently;
and thirdly, combining all the high-resolution remote sensing image block data according to the coordinates of the starting and stopping positions of each high-resolution remote sensing image block to obtain a final earth surface coverage classification result.
Further, the surface coverage classification processing in the second step includes the following steps:
step 1, extracting the characteristics of a high-resolution remote sensing image block, which comprises the following substeps;
step 1.1, extracting spectral characteristics of a high-resolution remote sensing image block;
step 1.2, calculating a normalized vegetation index as the feature of vegetation extraction;
step 1.3, calculating a normalized water body index as the characteristic of the extracted water body;
step 1.4, calculating texture features based on gray level co-occurrence moments as local spatial features;
step 1.5, fusing the extracted spectral features, the normalized vegetation indexes, the normalized water body indexes and the texture features in a vector superposition mode, and inputting the fused spectral features, the normalized vegetation indexes, the normalized water body indexes and the texture features as the features of subsequent classification;
step 2, constructing a classification algorithm fusing maximum likelihood and a support vector machine, classifying the earth surface coverage categories in the high-resolution remote sensing image block, and obtaining a classified image, wherein the classification algorithm comprises the following substeps:
step 2.1, acquiring the member probability of the earth surface coverage category based on a maximum likelihood classification algorithm;
step 2.2, based on the obtained membership probability of the feature category, judging the accuracy of the feature discrimination according to a threshold, wherein the method further comprises the following steps:
step 2.2.1, determining the optimal mark type of the current pixel according to the maximum posterior probability;
step 2.2.2, judging the accuracy of the pixel category, if the category accuracy is greater than a given threshold value, directly using a classification result formed by labels corresponding to the maximum probability in the maximum likelihood classification algorithm in the step 2.1, otherwise, jumping to the step 2.3;
step 2.3, distinguishing pixels which are difficult to accurately distinguish in the maximum likelihood classification in a nonlinear space by using a nonlinear support vector machine classification algorithm;
step 3, generating a segmentation object on the classification image based on a connected region labeling algorithm, and post-processing the classification result in the step 2 through a region merging strategy to obtain a ground surface coverage classification result in the high-resolution remote sensing image block,
3.1, obtaining a segmentation object O by using a classic eight-neighborhood connected region labeling algorithm on the basis of the classification result in the step 2;
step 3.2, defining the obtained segmentation object as O according to the spatial resolution of the high-resolution remote sensing and the ground feature characteristicsiWherein i is the number of categories; passing threshold T1Judgment of OiWhether it belongs to a noisy object, wheniGreater than a given threshold T1It is directly retained, otherwise it is merged into the neighboring segmented object.
Further, the processing of classifying the surface coverage in the second step further includes a step 4 of performing quality evaluation on the classification result of the surface coverage according to a Kappa coefficient, and the specific implementation manner is as follows:
and 4.1, calculating a confusion matrix,
Figure GDA0002227951100000031
where m represents the number of object classes, pijIndicates the ith row and the jth column have pijPredicting the number of the pixels actually belonging to the ith class as the jth class; the numerical value on the main diagonal line of the matrix is represented by the number of pixels belonging to a certain class in the surface real image and belonging to the same class in the corresponding classification image, namely the number of correctly classified pixels;
step 4.2, calculating producer precision based on the confusion matrix, formula
Figure GDA0002227951100000032
Wherein i and j are respectively expressed as the land feature types occupied by the ith type and the jth type in the confusion matrix, AjjRepresenting the values of the elements on the diagonal of the confusion matrix;
calculating the overall classification accuracy, formula
Figure GDA0002227951100000033
Wherein N represents the total number of true samples;
calculating the Kappa coefficient, wherein the formula Kappa is (d-q)/(N-q), wherein N is trueThe total number of real samples, d the number of samples on the diagonal in the confusion matrix, C the number of classes, AijRepresenting the values of the elements in the confusion matrix corresponding to the i, j positions, q being expressed as
Figure GDA0002227951100000034
And 4.3, when the Kappa value is larger than α, the consistency between the classification result and the ground reference information is large or the precision is high, the consistency is medium when the Kappa value is β - α, and the consistency is poor when the Kappa value is smaller than β.
Further, the calculation formula of the normalized vegetation index in step 1.2 is
Figure GDA0002227951100000041
Where ρ isnirIs the reflectance value rho of the near infrared band of the high-resolution multispectral remote sensing imageredIs the reflectance value of the red band.
Further, the calculation formula of the normalized water body index in the step 1.3 is as follows
Figure GDA0002227951100000042
Where ρ isnirIs the reflectance value rho of the near infrared band of the high-resolution multispectral remote sensing imagegreenIs the reflectance value of the green band of the image.
Further, the texture features in step 1.4 include entropy and homogeneity based on gray level co-occurrence moments, which are implemented as follows,
let Kx ═ {1, 2, …, Nx } be the horizontal direction spatial domain and Ky ═ {1, 2, …, Ny } be the vertical direction spatial domain, the image may be defined as grayscale pixel interval Kx × Ky ═ k, where entropy
Figure GDA0002227951100000043
Homogeneity of the mixture
Figure GDA0002227951100000044
In the formula Qi'j'And expressing the element values of the ith 'row and j' column in the normalized gray level co-occurrence matrix.
Further, β and α in step 4.3 have values of 0.4 and 0.8, respectively.
Compared with the prior art, the invention has the following advantages and beneficial effects:
(1) the adaptability is good, and the data can be automatically segmented according to the size of the data and the condition of using a computer memory, so that the method can be suitable for operating environments with different configurations;
(2) the parallel computing capability is realized, an adaptive parallel algorithm is added, and parallel processing can be performed according to the number of CPUs (central processing units) of the machine;
(3) the method has the characteristics of easiness in use, no need of auxiliary information and manual intervention, high calculation speed and capability of realizing automatic processing.
(4) The global earth surface coverage mapping application system based on the high-resolution remote sensing satellite has reusability, and organizes a classification algorithm process by using the configuration files, so that different users can repeatedly obtain corresponding earth surface coverage mapping results through the same configuration files.
The method disclosed by the invention fuses the extracted spectral features, the normalized vegetation index, the normalized water body index and the texture features in a vector superposition mode to obtain fused feature input used as subsequent classification, classifies the high-resolution remote sensing images by utilizing a classification algorithm of a maximum likelihood and support vector machine, performs post-processing on classification results through a region merging strategy, and finally calculates a Kappa coefficient to perform quality evaluation on the classification images to realize the judgment of classification precision.
Drawings
FIG. 1 is a flowchart of a surface coverage classification process according to the present invention;
FIG. 2 is a schematic diagram of a hyperplane in a non-linear support vector machine according to an embodiment of the present invention.
Detailed Description
For a better understanding of the technical solutions of the present invention, the present invention will be further described in detail with reference to the accompanying drawings and examples.
Step 1, establishing a parallel computing framework.
Step 1.1, data are parallel, and the high-resolution remote sensing image data need to be segmented before ground surface coverage classification due to the large data volume of the high-resolution remote sensing image. The data parallel is that before data operation, data is automatically segmented according to a certain strategy, a large standard data is segmented into a plurality of small data modules, calculation is carried out by a plurality of CPU cores and other calculation resources, after the calculation is finished, the data is automatically merged according to a data segmentation principle, and the data in a required format is returned. The data parallelism can be calculated using the following formula.
After automatically segmenting the data, the width of each small data module
Figure GDA0002227951100000051
The ratio r is h/w, where h is the length of the image, w is the width of the image, Mey is the number of memories, StBeing data type, e.g. integer, floating point, SsFor the safety factor, the safety factor in this embodiment is 4; the length H of each small data template is W r; and simultaneously recording the coordinates of the start-stop position of the small data module.
And 1.2, paralleling the algorithms. The algorithm parallelism is specific to an algorithm, and in the actual calculation process, the parallel framework based on OpenMP and the like is distributed to a plurality of processors to execute calculation concurrently. The invention adopts a flow customization mode to realize automatic earth surface coverage classification, and a user can select an extraction method, a classification model and a classification post-processing method of the features through a configuration file.
And 2, extracting the features of the segmented high-resolution remote sensing image block, wherein the step further comprises the following steps.
And 2.1, directly using the multispectral wave band of the high-resolution remote sensing image block to extract spectral features, and providing basic judgment on the ground objects through superposition of common spectral wave bands.
Step 2.2, calculating the normalized vegetation index (NDVI) as the feature of the extracted vegetation, wherein the calculation formula is
Figure GDA0002227951100000052
Where ρ isnirIs the reflectance value rho of the near infrared band of the high-resolution multispectral remote sensing imageredIs the reflectance value of the red band.
Step 2.3, calculating a normalized water body index (NDWI) as the characteristic of the extracted water body, wherein the calculation formula is
Figure GDA0002227951100000061
Where ρ isnirIs the reflectance value rho of the near infrared band of the high-resolution multispectral remote sensing imagegreenIs the reflectance value of the green band of the image.
Step 2.4, texture features based on a gray level co-occurrence matrix (GLCM) are calculated as local spatial features, so as to describe a spatial distribution relationship between image elements, where Kx ═ {1, 2, …, Nx } is a horizontal direction spatial domain, Ky ═ {1, 2, …, Ny } is a vertical direction spatial domain, Nx and Ny are known quantities for a given image, an image may be defined as a gray level pixel interval Kx ═ Ky, and entropy is used to describe a spatial distribution relationship between image elements
Figure GDA0002227951100000062
And homogeneity
Figure GDA0002227951100000063
As local space characteristics, measuring and constructing texture characteristic value statistics, wherein Q in the formulai'j'And expressing the element values of the ith 'row and j' column in the normalized gray level co-occurrence matrix. The gray level co-occurrence matrix is a matrix function of pixel distance and angle, and reflects the comprehensive information of the image in direction, interval, variation amplitude and speed by calculating the correlation between two points of gray levels in a certain distance and a certain direction in the image, and the calculation mode is the prior art, and can be specifically referred to in document [1]]The present invention is not described in detail.
[1] High range, Whiteon, texture feature extraction [ J ] based on gray level co-occurrence matrix computer system applications, 2010,19(6):195-198.
And 2.5, fusing the extracted spectral features, the normalized vegetation indexes, the normalized water body indexes and the texture features in a vector superposition mode to obtain fusion serving as feature input of subsequent classification.
And 3, constructing a classification algorithm fusing the maximum likelihood and the support vector machine, classifying the high-resolution remote sensing image blocks, and obtaining a classified image, wherein the specific implementation mode of the step is as follows:
and 3.1, providing member probability of the earth surface coverage type based on a maximum likelihood classification algorithm (MLC), and providing a decision basis for obtaining a final earth surface coverage type. The maximum likelihood classification method takes the distribution of remote sensing multi-waveband data as multi-dimensional normal distribution to construct a discriminant function. The basic idea is as follows: forming a certain point group in a plane or space by the data of various known pixels; each kind of data of every dimension forms a normal distribution on its own axis, the multidimensional data of this kind forms a multidimensional normal distribution of this kind, have various multidimensional distribution models, to any unknown data vector of classification, can reverse to ask it to belong to the probability of various kinds; comparing the probabilities, to see which class the probability is large, the data vector or the pixel is classified as the class, which can be expressed as:
Figure GDA0002227951100000071
wherein m is the number of bands; p (w)i) Is an m-dimensional normal distribution density function of the i-th class, from which the probability of the m-dimensional random variable x to have various possible values in the k-th class can be seen. The m-dimensional data vector for a pixel may be represented as:
Figure GDA0002227951100000072
in miRepresents class ωiMean value of each band
Figure GDA0002227951100000073
The mean vector formed is given by the following formula,
Figure GDA0002227951100000074
Cirepresenting a covariance matrix of the class, the formula being
Figure GDA0002227951100000075
Wherein n iskIs the number of pixels of the kth class; wkIs the class k intra-class dispersion matrix, as shown in the following formula,
Figure GDA0002227951100000076
in the formula, ωk11k22,…ωkmmIs the within-class variance of class k; and omegak12,…,ωk1mAnd omegak21,…,ωkm1The class-k covariance is the class-within covariance, from which the covariance matrix of the class can be derived.
Through simplification, the original equation can be simplified into the following formula,
Figure GDA0002227951100000077
wherein p (w)i) Represents class ωiF denotes the corresponding class ωiThe probability density of (c). Based on the maximum likelihood decision function value, by Pi(l)=exp(Di(f) K obtains the membership probability of the surface feature class, where K is the number of surface coverage classes. The maximum likelihood classification algorithm (MLC) is prior art, see in particular document [2]]。
[2] Soul' an remote sensing digital image processing [ M ] science publishers, 2004.
Step 3.2, based on the obtained membership probability of the feature category, judging the accuracy of the feature discrimination according to a threshold, wherein the method further comprises the following steps:
step 3.2.1, determining the optimal mark of the current pixel according to the maximum posterior probability, formula
Figure GDA0002227951100000081
Wherein k represents the optimal mark of the current pixel;
step 3.2.2, the accuracy of the pixel category is judged, and if the category accuracy is larger than a given threshold value, the label formed by the labels corresponding to the maximum probability in the maximum likelihood classification algorithm in the step 2.1 is directly usedAnd (4) sorting results, otherwise, jumping to the step 2.3. Formula Pi(k) T, where T is a set default accuracy threshold, usually 0.8, indicating that the current pel is likely to get the corresponding label.
And 3.3, distinguishing pixels which are difficult to accurately distinguish in the maximum likelihood classification in a nonlinear space by using a nonlinear Support Vector Machine (SVM) classification algorithm. Different from the maximum likelihood probability-based classification method, the support vector machine is a machine learning algorithm based on a statistical learning theory, and the generalization error of the model is reduced while the sample error is minimized by adopting the structure risk minimization principle, so that the generalization capability of the model is improved. The formula f (x) w · Φ (x) + b represents the discriminant function of the hyperplane, where w is the weight vector, b is the offset, and Φ (x) is the vector related to the parameter x. As shown in fig. 2, there is a two-dimensional plane with two different data, represented by circles and crosses. Since these data are linearly separable, the two types of data can be separated by a straight line, which in two dimensions corresponds to a hyperplane. The nonlinear support vector machine classification algorithm is the prior art, and can be specifically referred to in the literature [3 ].
[3] Leayage statistical learning method [ M ] qing university press, 2012.
And 4, generating a segmentation object based on a connected region labeling algorithm, and performing post-processing on the classification result in the step 2 through a region merging strategy, wherein the specific implementation mode of the step is as follows:
4.1, on the basis of the classification result in the step 2, obtaining a segmented object O by using a classic eight-neighborhood connected region labeling algorithm for collecting and searching a data structure;
and 4.2, on the basis of the classification result in the step 2, defining the obtained segmentation object as O according to the spatial resolution and the ground feature characteristics of the high-resolution remote sensing imageiAnd i is the number of categories. Passing threshold T1Judging whether it belongs to a noise object (T)1Typically 0.8), e.g. above a given threshold T1Directly preserving, otherwise combining into adjacent segmentation object, and obtaining classification by maximum voting strategy in object unitAs a result, the classification result in step 2 is improved, so that a more refined surface coverage classification result is obtained.
And 5, reversely merging data by using the recorded start-stop position coordinates of the small data modules according to the process shown in the step 1 to obtain a final classification result.
And 6, evaluating the quality of the classified images, wherein the specific implementation mode of the step is as follows:
and 6.1, calculating a confusion matrix, wherein the confusion matrix is obtained by comparing the position of each real earth surface pixel with the classification pixels at corresponding positions in the classification map, the columns of the confusion matrix represent the number of the pixels in each class in the classification map, and the rows represent the real attribution class of the data. The confusion matrix is as follows:
Figure GDA0002227951100000091
where m represents the number of object classes, pijIndicates the ith row and the jth column have pijAnd predicting the picture element which actually belongs to the ith class as the jth class. Therefore, the value on the main diagonal line of the matrix is represented by the number of pixels in the surface real image, which belong to the same class as the corresponding classification image, i.e., the number of correctly classified pixels. Therefore, the larger the value on the main diagonal line in the matrix is, the more the number of correctly classified pixels is, and the higher the classification precision is.
Step 6.2, calculating Producer's Accuracy (PA) based on the confusion matrix, and calculating the formula
Figure GDA0002227951100000092
Wherein i and j are respectively expressed as the ground feature types shown in the ith class and the jth class in the confusion matrix, AjjRepresenting the values of the elements on the diagonal of the confusion matrix; calculate the Overall classification Accuracy (OA), formula
Figure GDA0002227951100000093
Where N represents the pre-selected real sample total. Calculating KThe appa coefficient reflects the imbalance between classes, where the formula kappa (d-q)/(N-q), where N is the total number of true samples, d is the number of samples on the diagonal in the confusion matrix, C is the number of classes (which may be user-defined), aijRepresenting the values of the elements in the confusion matrix corresponding to the i, j positions, q being expressed as
Figure GDA0002227951100000094
Kappa number>0.80, that is, the consistency between the classification map and the ground reference information is very large or the precision is very high, the Kappa value is 0.40-0.80 to indicate that the consistency is medium, and the Kappa value is less than 0.40 to indicate that the consistency is poor.
The specific embodiments described herein are merely illustrative of the spirit of the invention. Those skilled in the art to which the invention relates may effect numerous modifications, additions or substitutions to the specific embodiments described, without departing from the spirit or ambit of the invention as defined in the accompanying claims.

Claims (6)

1. The method for classifying the earth surface coverage of the high-resolution remote sensing image based on the parallel algorithm is characterized by comprising the following steps of:
step one, segmenting the high-resolution remote sensing image data according to the number of computers to obtain segmented high-resolution remote sensing image blocks, and simultaneously recording the coordinates of the starting and ending positions of each high-resolution remote sensing image block in the following way,
and setting h as the length of the image and w as the width of the image, the width of the high-resolution remote sensing image block after segmentation
Figure FDA0002244241960000011
Length H-W r, where the ratio r-H/W, Mey is the memory number, StIs a data type, SsA safety factor is set;
distributing all the high-resolution remote sensing image blocks to Mey processors based on an OpenMP parallel framework, and executing earth surface coverage classification processing concurrently;
combining all the high-resolution remote sensing image block data according to the coordinates of the starting and stopping positions of each high-resolution remote sensing image block to obtain a final earth surface coverage classification result;
the earth surface coverage classification processing in the second step comprises the following steps:
step 1, extracting the characteristics of a high-resolution remote sensing image block, which comprises the following substeps;
step 1.1, extracting spectral characteristics of a high-resolution remote sensing image block;
step 1.2, calculating a normalized vegetation index as the feature of vegetation extraction;
step 1.3, calculating a normalized water body index as the characteristic of the extracted water body;
step 1.4, calculating texture features based on gray level co-occurrence moments as local spatial features;
step 1.5, fusing the extracted spectral features, the normalized vegetation indexes, the normalized water body indexes and the texture features in a vector superposition mode, and inputting the fused spectral features, the normalized vegetation indexes, the normalized water body indexes and the texture features as the features of subsequent classification;
step 2, constructing a classification algorithm fusing maximum likelihood and a support vector machine, classifying the earth surface coverage categories in the high-resolution remote sensing image block, and obtaining a classified image, wherein the classification algorithm comprises the following substeps:
step 2.1, acquiring the member probability of the earth surface coverage category based on a maximum likelihood classification algorithm;
step 2.2, based on the obtained membership probability of the feature category, judging the accuracy of the feature discrimination according to a threshold, wherein the method further comprises the following steps:
step 2.2.1, determining the optimal mark type of the current pixel according to the maximum posterior probability;
step 2.2.2, judging the accuracy of the pixel category, if the category accuracy is greater than a given threshold value, directly using a classification result formed by labels corresponding to the maximum probability in the maximum likelihood classification algorithm in the step 2.1, otherwise, jumping to the step 2.3;
step 2.3, distinguishing pixels which are difficult to accurately distinguish in the maximum likelihood classification in a nonlinear space by using a nonlinear support vector machine classification algorithm;
step 3, generating a segmentation object on the classification image based on a connected region labeling algorithm, and post-processing the classification result in the step 2 through a region merging strategy to obtain a ground surface coverage classification result in the high-resolution remote sensing image block,
3.1, obtaining a segmentation object O by using a classic eight-neighborhood connected region labeling algorithm on the basis of the classification result in the step 2;
step 3.2, defining the obtained segmentation object as O according to the spatial resolution of the high-resolution remote sensing and the ground feature characteristicsiWherein i is the number of categories; passing threshold T1Judgment of OiWhether it belongs to a noisy object, wheniGreater than a given threshold T1It is directly retained, otherwise it is merged into the neighboring segmented object.
2. The method for classifying the earth surface coverage of the high-resolution remote sensing image based on the parallel algorithm as claimed in claim 1, wherein the earth surface coverage classification processing in the second step further comprises a step 4 of performing quality evaluation on the earth surface coverage classification result according to a Kappa coefficient, and the specific implementation mode is as follows:
and 4.1, calculating a confusion matrix,
Figure FDA0002244241960000021
where m represents the number of object classes, pijIndicates the ith row and the jth column have pijPredicting the number of the pixels actually belonging to the ith class as the jth class; the numerical value on the main diagonal line of the matrix is represented by the number of pixels belonging to a certain class in the surface real image and belonging to the same class in the corresponding classification image, namely the number of correctly classified pixels;
step 4.2, calculating producer precision based on the confusion matrix, formula
Figure FDA0002244241960000022
Wherein i and j are respectively expressed as the land feature types occupied by the ith type and the jth type in the confusion matrix, AjjRepresenting the values of the elements on the diagonal of the confusion matrix;
calculating the overall classification accuracy, formula
Figure FDA0002244241960000023
Wherein N represents the total number of true samples;
calculating the Kappa coefficient, wherein the formula Kappa is (d-q)/(N-q), where N is the total number of real samples, d is the number of samples on the diagonal in the confusion matrix, C is the number of classes, aijRepresenting the values of the elements in the confusion matrix corresponding to the i, j positions, q being expressed as
Figure FDA0002244241960000024
And 4.3, when the Kappa value is larger than α, the consistency between the classification result and the ground reference information is large or the precision is high, the Kappa value is in the range of [ β ], the consistency is medium, and the consistency is poor when the Kappa value is smaller than β.
3. The method for classifying the earth surface coverage of the high-resolution remote sensing image based on the parallel algorithm as claimed in claim 2, wherein: the calculation formula of the normalized vegetation index in step 1.2 is
Figure FDA0002244241960000031
Where ρ isnirIs the reflectance value rho of the near infrared band of the high-resolution multispectral remote sensing imageredIs the reflectance value of the red band.
4. The method for classifying the earth surface coverage of the high-resolution remote sensing image based on the parallel algorithm as claimed in claim 3, wherein: the calculation formula of the normalized water body index in the step 1.3 is
Figure FDA0002244241960000032
Where ρ isnirIs the reflectance value rho of the near infrared band of the high-resolution multispectral remote sensing imagegreenIs the inverse of the green band of the imageAnd (4) a refractive index value.
5. The method for classifying the earth surface coverage of the high-resolution remote sensing image based on the parallel algorithm as claimed in claim 4, wherein: the texture features in step 1.4 include entropy and homogeneity based on gray level co-occurrence moments, which are implemented as follows,
let Kx ═ {1, 2, …, Nx } be the horizontal direction space domain, Ky ═ {1, 2, …, Ny } be the vertical direction space domain, the image is defined as the grayscale pixel interval Kx × Ky ═ k, where entropy
Figure FDA0002244241960000033
Homogeneity of the mixture
Figure FDA0002244241960000034
In the formula Qi'j'And expressing the element values of the ith 'row and j' column in the normalized gray level co-occurrence matrix.
6. The method for classifying the earth surface coverage of the high-resolution remote sensing image based on the parallel algorithm as claimed in claim 5, wherein the values of β and α in the step 4.3 are 0.4 and 0.8 respectively.
CN201711138873.9A 2017-11-16 2017-11-16 High-resolution remote sensing image earth surface coverage classification method based on parallel algorithm Active CN107909039B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711138873.9A CN107909039B (en) 2017-11-16 2017-11-16 High-resolution remote sensing image earth surface coverage classification method based on parallel algorithm

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711138873.9A CN107909039B (en) 2017-11-16 2017-11-16 High-resolution remote sensing image earth surface coverage classification method based on parallel algorithm

Publications (2)

Publication Number Publication Date
CN107909039A CN107909039A (en) 2018-04-13
CN107909039B true CN107909039B (en) 2020-03-10

Family

ID=61845743

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711138873.9A Active CN107909039B (en) 2017-11-16 2017-11-16 High-resolution remote sensing image earth surface coverage classification method based on parallel algorithm

Country Status (1)

Country Link
CN (1) CN107909039B (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108827880B (en) * 2018-04-23 2021-04-16 吉林大学 Earth surface coverage change detection method based on multispectral image and NDVI time sequence
CN108985260B (en) * 2018-08-06 2022-03-04 航天恒星科技有限公司 Remote sensing and meteorological integrated rice yield estimation method
CN109584284B (en) * 2018-12-13 2019-12-06 宁波大学 Hierarchical decision-making coastal wetland ground object sample extraction method
CN109657598B (en) * 2018-12-13 2020-05-05 宁波大学 Coastal wetland remote sensing classification method based on layering strategy
CN109657724B (en) * 2018-12-21 2021-08-03 浙江中烟工业有限责任公司 Parallel computing-based method for rapidly computing characteristic parameters of grooved filter sticks
CN110033044A (en) * 2019-04-17 2019-07-19 北京建筑大学 Ground mulching product integration or fusion method based on EAGLE matrix
CN110033499B (en) * 2019-04-21 2021-01-15 南京林业大学 Landsat OLI image-based water body mapping method
CN110796176B (en) * 2019-10-09 2022-12-16 武汉大学 High-resolution image classification method and system based on pixel pair and weighted voting
CN111199195B (en) * 2019-12-26 2023-07-18 中科禾信遥感科技(苏州)有限公司 Pond state full-automatic monitoring method and device based on remote sensing image
CN112651465B (en) * 2021-01-13 2024-02-02 北京观微科技有限公司 Equipment target interpretation method based on high-resolution remote sensing image
CN113222005B (en) * 2021-05-08 2022-05-27 兰州交通大学 Automatic updating method for land coverage
CN113223022B (en) * 2021-05-31 2022-04-12 湖南科技大学 Multivariate image segmentation method based on multivariate texture image analysis algorithm
CN114283335B (en) * 2021-12-27 2022-11-22 河南大学 Historical period remote sensing identification precision verification preparation method

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101256677A (en) * 2008-03-01 2008-09-03 深圳先进技术研究院 Automatic monitoring simulation and parallelization process method thereof
CN102073879A (en) * 2010-12-02 2011-05-25 南京大学 Method for identifying characteristic land categories of ocean remote sensing images of coast on basis of semi-supervised learning
CN103218783A (en) * 2013-04-17 2013-07-24 国家测绘地理信息局卫星测绘应用中心 Fast geometric correction method for satellite remote sensing image and based on control point image database
CN103400354A (en) * 2013-08-14 2013-11-20 山东大学 OpenMP-based geometric correcting and parallel processing method for remote-sensing images
CN104751477A (en) * 2015-04-17 2015-07-01 薛笑荣 Space domain and frequency domain characteristic based parallel SAR (synthetic aperture radar) image classification method
CN104881867A (en) * 2015-05-13 2015-09-02 华中科技大学 Method for evaluating quality of remote sensing image based on character distribution
CN107194313A (en) * 2017-04-19 2017-09-22 中国国土资源航空物探遥感中心 A kind of parallel intelligent object-oriented classification method

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6675164B2 (en) * 2001-06-08 2004-01-06 The Regents Of The University Of California Parallel object-oriented data mining system

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101256677A (en) * 2008-03-01 2008-09-03 深圳先进技术研究院 Automatic monitoring simulation and parallelization process method thereof
CN102073879A (en) * 2010-12-02 2011-05-25 南京大学 Method for identifying characteristic land categories of ocean remote sensing images of coast on basis of semi-supervised learning
CN103218783A (en) * 2013-04-17 2013-07-24 国家测绘地理信息局卫星测绘应用中心 Fast geometric correction method for satellite remote sensing image and based on control point image database
CN103400354A (en) * 2013-08-14 2013-11-20 山东大学 OpenMP-based geometric correcting and parallel processing method for remote-sensing images
CN104751477A (en) * 2015-04-17 2015-07-01 薛笑荣 Space domain and frequency domain characteristic based parallel SAR (synthetic aperture radar) image classification method
CN104881867A (en) * 2015-05-13 2015-09-02 华中科技大学 Method for evaluating quality of remote sensing image based on character distribution
CN107194313A (en) * 2017-04-19 2017-09-22 中国国土资源航空物探遥感中心 A kind of parallel intelligent object-oriented classification method

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Ore image segmentation by learning image and shape features;Dipti Prasad Mukherjee 等;《Pattern Recognition Letters》;20091231;第30卷(第6期);第615-622页 *
亚洲人造地表覆盖遥感精细化分类与分布特征分析;匡文慧 等;《中国科学:地球科学》;20161231;第46卷(第9期);第1162-1179页 *
傅里叶谱纹理和光谱信息结合的高分辨率遥感影像地表覆盖分类;佃袁勇 等;《武汉大学学报·信息科学版》;20170331;第42卷(第3期);第362-368页 *
基于动态分界点计算的并行几何校正算法;欧新良 等;《计算机研究与发展》;20061231;第43卷(第6期);第1115-1121页 *

Also Published As

Publication number Publication date
CN107909039A (en) 2018-04-13

Similar Documents

Publication Publication Date Title
CN107909039B (en) High-resolution remote sensing image earth surface coverage classification method based on parallel algorithm
Lv et al. Deep learning and superpixel feature extraction based on contractive autoencoder for change detection in SAR images
Lv et al. Very high resolution remote sensing image classification with SEEDS-CNN and scale effect analysis for superpixel CNN classification
Ghamisi et al. Multilevel image segmentation based on fractional-order Darwinian particle swarm optimization
CN110929607B (en) Remote sensing identification method and system for urban building construction progress
CN107067405B (en) Remote sensing image segmentation method based on scale optimization
Liu et al. Enhancing spectral unmixing by local neighborhood weights
CN107909015A (en) Hyperspectral image classification method based on convolutional neural networks and empty spectrum information fusion
CN110097101B (en) Remote sensing image fusion and coastal zone classification method based on improved reliability factor
Li et al. A hybrid method combining pixel-based and object-oriented methods and its application in Hungary using Chinese HJ-1 satellite images
Li et al. Adaptive scale selection for multiscale segmentation of satellite images
Shen et al. Optimizing multiscale segmentation with local spectral heterogeneity measure for high resolution remote sensing images
Yu et al. Analysis of large-scale UAV images using a multi-scale hierarchical representation
Peng et al. Object-based change detection from satellite imagery by segmentation optimization and multi-features fusion
Watanabe et al. Identifying the vegetation type in Google Earth images using a convolutional neural network: a case study for Japanese bamboo forests
Xiao et al. Segmentation of multispectral high-resolution satellite imagery using log Gabor filters
Cai et al. Improving sampling-based image matting with cooperative coevolution differential evolution algorithm
Yue et al. Texture extraction for object-oriented classification of high spatial resolution remotely sensed images using a semivariogram
CN106295498A (en) Remote sensing image target area detection apparatus and method
Liang et al. An extraction and classification algorithm for concrete cracks based on machine vision
CN112329559A (en) Method for detecting homestead target based on deep convolutional neural network
Zhang et al. Impervious surface extraction from high-resolution satellite image using pixel-and object-based hybrid analysis
Lv et al. Improved object-based convolutional neural network (IOCNN) to classify very high-resolution remote sensing images
Miao et al. Integration of satellite images and open data for impervious surface classification
Sutha Object based classification of high resolution remote sensing image using HRSVM-CNN classifier

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant