CN112052756A

CN112052756A - Post-earthquake high-resolution remote sensing image earthquake damage building detection method

Info

Publication number: CN112052756A
Application number: CN202010855069.8A
Authority: CN
Inventors: 王超; 王帅; 申祎; 张艳; 仇星; 石梵
Original assignee: Nanjing University of Information Science and Technology
Current assignee: Nanjing University of Information Science and Technology
Priority date: 2020-08-24
Filing date: 2020-08-24
Publication date: 2020-12-08
Anticipated expiration: 2040-08-24
Also published as: CN112052756B

Abstract

The invention discloses a method for detecting earthquake damage of a building by using post-earthquake high resolution remote sensing images. Firstly, extracting a potential building set through WJSEG segmentation and a group of non-building screening rules; secondly, a seismic damage visual dictionary model is constructed by utilizing the characteristics of spectrum, texture and geometric morphology, and a semantic gap from pixels to seismic damage characteristics is spanned; on the basis, a visual dictionary optimization strategy based on intra-class and inter-class penalty factors is designed, and information redundancy and evidence conflict are reduced; and finally, further dividing the buildings into intact buildings, partial earthquake damage buildings and ruins through a random forest classifier. The experimental result shows that the overall precision of the method reaches more than 85 percent, so that key decision support information can be provided for emergency response rescue after earthquake and reconstruction after disaster.

Description

Post-earthquake high-resolution remote sensing image earthquake damage building detection method

Technical Field

The invention belongs to the field of image detection, and particularly relates to a method for detecting earthquake damage buildings.

Background

After an earthquake occurs, the position of the earthquake-damaged building is judged quickly and accurately, and the important function is achieved for carrying out emergency rescue and disaster assessment after the earthquake. Compared with the traditional field investigation relying on ground personnel, the earthquake damage building detection based on the high-resolution remote sensing image has the advantages of strong real-time performance, wide coverage range, low safety risk and the like, and becomes one of the research hotspots in the field of remote sensing application.

At present, methods for extracting earthquake-damaged buildings from high-resolution remote sensing images can be mainly divided into two types:

(1) a classification method based on post-earthquake single-temporal images. The method converts the detection problem of the earthquake damage building into the classification problem of the remote sensing image after the earthquake, thereby breaking through the limitation caused by the dependence on the image before the earthquake. For example, leaf Xin and the like can achieve the total precision of more than 80% in earthquake damage building detection based on Quickbird and aerial remote sensing images after earthquake by analyzing the spatial correlation of the internal gradient of the roof of the building.

(2) A change detection method based on a pre-earthquake time phase image and a post-earthquake time phase image. The method introduces the pre-earthquake image, can extract the change information of the building as the earthquake damage detection basis, and therefore has higher detection precision. For example, Roberta and the like propose a method for detecting the change of earthquake damage buildings by combining multiple parameters and multiple descriptors, and the overall accuracy of the method can reach more than 96%. The limitations of such methods are the difficult acquisition of the pre-earthquake images, and the high-precision registration of the pre-earthquake and post-earthquake images. Therefore, the classification method based on the single-temporal image after the earthquake has more application and popularization in emergency response after the earthquake. However, since such methods lack pre-earthquake reference information and the post-earthquake scene is more complex in structure and spatial layout, how to perform effective abstract representation on the earthquake damage characteristics of the building is a key and difficult problem in determining whether to accurately distinguish the earthquake damage building from other ground objects.

Disclosure of Invention

In order to solve the technical problems mentioned in the background technology, the invention provides a method for detecting earthquake-damaged buildings by high resolution remote sensing images after an earthquake.

In order to achieve the technical purpose, the technical scheme of the invention is as follows:

a method for detecting earthquake damage of buildings by high-resolution remote sensing images after earthquake comprises the following steps:

(1) sequentially carrying out image segmentation and non-building object elimination on the obtained post-earthquake high-resolution remote sensing image to obtain a potential building object set;

(2) constructing a seismic hazard visual dictionary from three angles of spectrum, texture and geometric morphology based on the visual bag-of-words model;

(3) constructing an optimized earthquake damage visual dictionary model based on the intra-class penalty factors and the inter-class penalty factors;

(4) and based on the optimized earthquake damage visual vocabulary model and the potential building object set, a random forest classifier is adopted to obtain a final earthquake damage building detection result.

Further, in step (1), the non-building object culling follows the following rules:

(a) area regulation: the area of each object, namely the number N of pixels contained in each object is counted_pixelsIf N is present_pixelsIf the area is smaller than or equal to the preset area threshold value, the object is considered as a weak target and is removed;

(b) morphological constructionBuilding index rule: determining a separation threshold T for a morphological building index using a maximum inter-class variance method_MBICalculating the mean value of the morphological building index of all pixels in each object

Rejection satisfies

All of the objects of (1);

(c) the rectangle degree and the length-width ratio are regular: the degree of rectangularity of an object is defined as Rd ═ N_pixels/N_rectangleIn which N is_pixelsThe number of pixels contained in the minimum bounding rectangle of the object, N_rectangleIs the aspect ratio of the smallest circumscribed rectangle; if an object satisfies Rd < 0.8 and N_rectangleIf the distance is more than 5, the object is considered as a long and narrow object and is removed.

Further, in the step (2), the spectrum-based earthquake damage visual dictionary is constructed by respectively performing K-means clustering on the three-band images of the RGB model according to gray values to obtain any object R_i∈R_potVisual dictionary WR of seismic injury of K-dimensional spectrum corresponding to three wave bands of R, G, B_i、WG_i、WB_iWherein R is_potThe set of potential building objects obtained in the step (1);

the construction method of the earthquake damage visual dictionary based on the texture is that local homogeneity indexes J-value are adopted as a class of earthquake damage visual vocabulary, and the definition of the J-value is as follows:

J-value＝(S_T-S_W)/S_W

wherein S is_TIs the total variance, S, of all pixels within a window of a certain size_WIs the sum of variances of pixels belonging to the same gray level within a window of a certain size; setting the number of scales, extracting an object R by calculating a multi-scale J-value image set and adopting the same clustering strategy as the built spectrum seismic damage visual dictionary for each scale_iCorresponding J-value earthquake damage visual dictionary WJ_i；

Seism based on geometric morphologyThe method for constructing the vicious vision dictionary comprises the steps of adopting area attributes, diagonal attributes and normalized moment of inertia attributes in morphological attribute profile MAPs as a class of earthquake vicious vision vocabulary respectively, and designing different morphological attribute operators to enable the characteristics of a target on specific scale parameters and attributes to have maximum response to be distinguished from other ground features, so as to obtain an object R_iEarthquake damage visual dictionary WAREA of extracted area, diagonal and normalized moment of inertia attributes_i、WDIAG_i、WNMI_i；

Defining an object R by integrating a visual dictionary of earthquake damage based on spectrum, texture and geometric morphology_iInitial earthquake damage visual dictionary W_i＝[WR_i,WG_i,WB_i,WJ_i,WAREA_i,WDIAG_i,WNMI_i]。

Further, the specific process of step (3) is as follows:

(301) respectively calculating the structural similarity SSIM between any two wave bands or scale images under the same characteristic, and calculating the in-class penalty factor alpha of the spectral characteristic according to the structural similarity SSIM_RGBJ-value, within class penalty factor alpha_JWithin class penalty factor alpha for area attributes_AREAWithin class penalty factor alpha for diagonal attributes_DIAGAnd an intra-class penalty factor alpha for the normalized moment of inertia attribute_NMI；

(302) Respectively adopting a mean value fusion strategy to R, G, B three-band images, multi-scale J-values and multi-scale sections with three attributes of area, diagonal and normalized moment of inertia to obtain five types of fusion images IMG_RGB、IMG_J-value、IMG_AREA、IMG_DIAGAnd IMG_NMI；

(303) Respectively calculating the structural similarity SSIM between each type of fusion image and other types of fusion images, and calculating the inter-class penalty factor beta of the five types of characteristics according to the structural similarity SSIM_RGB、β_J、β_AREA、β_DIAGAnd beta_NMI；

(304) Constructing an optimized earthquake damage visual dictionary W based on the punishment factors in the classes and the punishment factors between the classes_i＝[α_RGBβ_RGBWR_i，α_RGBβ_RGB WG_i，α_RGBβ_RGB WB_i，α_Jβ_J WJ_i，α_AREAβ_AREA WAREA_i，α_DIAGβ_DIAGWDIAG_i，α_NMIβ_NMI WNMI_i]。

Further, the specific process of step (4) is as follows:

(401) dividing the objects in the potential building object set into four categories of 'intact buildings', 'partial earthquake damage buildings', 'ruins' and 'other ground objects';

(402) counting an optimized earthquake damage visual dictionary histogram of each object to serve as an input feature space of a random forest classifier;

(403) selecting a training sample set, and randomly and retractably extracting a plurality of sample subsets by using a Bagging method to ensure that the number of samples in each sample subset is equal to that of the samples in the training sample set;

(404) determining the number of the random features corresponding to the nodes, and respectively constructing a decision tree model for each sample subset;

(405) classifying each sample to be classified, voting according to the classification labels, and determining the final classification labels according to the number of votes.

Adopt the beneficial effect that above-mentioned technical scheme brought:

the method is based on the inherent boundary of the building, comprehensively utilizes the spectrum, the texture and the geometric morphological characteristics, and carries out multi-angle fine portrayal on the earthquake damage building by constructing and optimizing the earthquake damage visual dictionary model, thereby spanning the semantic gap from pixels to the building earthquake damage characteristics. On the basis, the earthquake damage semantic histogram is used for image expression, so that the subsequent training and recognition of the random forest classifier are facilitated. In experiments conducted on post-earthquake high-resolution remote sensing images in the Wenchuan and Yushu areas, the method provided by the invention shows excellent performance.

Drawings

FIG. 1 is a flow chart of the present invention;

FIG. 2 is an original image, a reference image and a sub-image of data sets 1 and 2;

FIG. 3 is a graph of the seismic hazard building extraction results of the data set 1;

FIG. 4 is a graph of seismic hazard building extraction results for data set 2;

FIG. 5 is a graph of the detection of buildings by seismic injury for sub-image 1 and sub-image 2 of data set 1;

FIG. 6 is a graph of the detection of buildings of seismic origin for sub-images 3 and 4 of data set 2;

fig. 7 is a graph of the overall accuracy versus K.

Detailed Description

The technical scheme of the invention is explained in detail in the following with the accompanying drawings.

The invention provides a method for detecting earthquake damage of a building by using post-earthquake high resolution remote sensing images, which mainly comprises the following four steps: (1) extracting a set of potential buildings; (2) constructing a multi-feature earthquake damage visual dictionary; (3) optimizing and expressing the image of the earthquake damage visual vocabulary; (4) earthquake damage building detection based on random forest classification. The specific implementation flow is shown in fig. 1.

(1) Potential building set extraction

The invention firstly adopts a high-resolution remote sensing image segmentation algorithm WJSEG to divide adjacent isolated pixels into object sets with semantic information, and specific implementation steps can be seen in documents WANG C, et al.A novel multi-scale segmentation algorithm for high resolution displaced images based on wavelet transform and improved JSEG algorithm, Optik-International Journal for Light and Electron Optics,2014,125(19):5588-_iniThereby providing a basic analysis unit for subsequent processing.

Considering the ROI (region of interest) of the present invention as a seismic structure, in the initial object set R_iniOn the basis, the non-building objects are subjected to primary screening, so that the number of subsequent analysis units is reduced, and the reduction of classification false detection rate is facilitated. By analysing dictionaries of non-building objectsType characteristics, the design screening rules are as follows:

area regulation: the area of each object, namely the number N of pixels contained in each object is counted_pixelsIf N is present_pixelsIf the target is less than or equal to 80, the target is regarded as a small target such as a vehicle, noise and the like, and is removed. In addition, it should be specifically noted that the area threshold may be modified accordingly according to the actual situation.

Morphological Building Index (MBI, Morphological Building Index) rule: the MBI value of the pixel is obtained by calculating a multi-scale difference sequence by utilizing the characteristic that the building pixel shows high-brightness characteristics in the gray-scale image after the top hat transformation. If the MBI is larger, the probability that the pixel belongs to the building is higher. On the basis, the maximum inter-class variance method is used for determining the separation threshold T of the MBI_MBI. Computing the MBI mean of all pixels in each object

Rejection satisfies

All of the objects of (1);

the rectangle degree and the length-width ratio are regular: the degree of rectangularity of an object is defined as Rd ═ N_pixels/N_rectangle(wherein N is_pixelsThe number of pixels contained in the minimum bounding rectangle of the object, N_rectangleThe aspect ratio of the smallest circumscribed rectangle). At this time, if an object satisfies Rd < 0.8 and N_rectangleIf the number is more than 5, the object is considered as a long and narrow target such as a road, a river channel and the like, and is removed.

Using the above 3 rules to discriminate R_iniAll objects in (1) and all objects which are not eliminated are combined into a potential building object set R_pot。

(2) Construction of multi-feature earthquake damage visual dictionary

In the post-earthquake high-resolution remote sensing image, earthquake damage semantic knowledge implied in fine underlying visual features brought by improvement of spatial resolution is very rich. Therefore, the earthquake damage Visual dictionary is constructed from three angles of spectrum, texture and geometric morphology based on a Visual Bag of Words model (BoVW, Bag of Visual Words), and spans from the bottom layer Visual features to the semantic gap of the earthquake damage features of the building. The Visual Bag of Words (BoVW) model is an image processing algorithm developed from the field of natural language processing, and can provide an effective semantic feature description means for detection and identification of a specific target. According to the BoVW theory, the building earthquake damage visual dictionary is specifically as follows.

In post-earthquake scenes, areas of low grey value are often present because earthquake damage to buildings is rougher than damage sections of intact buildings. Therefore, the invention firstly carries out K-means clustering on the three-band images of the RGB model according to the gray values. On the basis, any object R can be obtained_i∈R_potThe corresponding K-dimensional spectral earthquake damage visual dictionary in the R, G, B three bands is: WR (pulse Width modulation)_i＝[red_i1,red_i2,…,red_iK]、WG_i＝[green_i1,green_i2,…,green_iK]、WB_i＝[blue_i1,blue_i2,…,blue_iK]。

In the aspect of texture characteristics, a perfect building generally has the characteristics of small internal gray level difference and strong texture structure consistency; the inner texture of the earthquake-damaged building has disordered structure and large gray level difference. Therefore, the local homogeneity index J-value is used as a class of earthquake damage visual vocabulary. J-value can effectively describe the texture complexity inside an object, and has the advantages of multi-scale and rotation invariance, which is defined as follows:

J-value＝(S_T-S_W)/S_W

wherein S is_TIs the total variance, S, of all pixels within a window of a certain size_WIs the sum of the variances of pixels belonging to the same gray scale within a window of a certain size. The smaller the value of J-value, the lower the degree of homogeneity inside the object, the more complex the texture, and thus the higher the possibility of belonging to a seismic structure. Setting the number of scales as 4, calculating a multi-scale J-value image set, and adopting the same clustering strategy as an RGB dictionary for each scale to extractTo take object R_iThe corresponding J-value earthquake damage visual dictionary is WJ_i＝[J_i11,…,J_i1K,J_i21,…,J_i2K,…,J_i41,…,J_i4K]。

In the aspect of geometric morphology, the composition of a perfect building in an earthquake scene is single, the shape is regular, and the structure is compact; earthquake damage to buildings presents a top surface with a mixture of intact and damaged areas, and debris consisting of collapsed building debris.

Therefore, the area, the diagonal line and the normalized moment of inertia NMI (normalized Mutual information) in the morphological Attribute profiles MAPs (morphological Attribute profiles) are respectively used as a class of earthquake damage visual vocabulary, and the implementation steps of the three attributes can be referred to in the literature. MAPs theory is developed from set theory, and the method distinguishes the characteristic appearance of a target on a specific scale parameter and attribute from the maximum response of other ground objects by designing different morphological attribute operators. Wherein, the area attribute is used for describing the scale of the internal structure of the object; the diagonal attribute reflects the diagonal length of the minimum circumscribed rectangle of the object and is used for describing the rule degree of the building; the NMI attribute reflects the mass distribution of the object, which is used to describe the compactness of the building. Taking the NMI attribute as an example, attribute filtering will progressively suppress objects smaller than a given NMI scale parameter. As the scale parameter increases, pixels in a sound building are progressively suppressed because of the higher degree of compactness. Therefore, the change trends of the area earthquake damage visual words of the intact buildings and the earthquake damage buildings in the multi-scale attribute section are obviously different, and the area and diagonal attributes have similar principles. The number of scales and the clustering strategy are the same as the RGB dictionary, and an object R is defined_iThe extracted earthquake damage visual dictionary of area, diagonal and NMI attributes are respectively: WAREA_i＝[AREA_i11,…,AREA_i1K,AREA_i21,…,AREA_i2K,…,AREA_i41,…,AREA_i4K]、WDIAG_i＝[DIAG_i11,…,DIAG_i1K,DIAG_i21,…,DIAG_i2K,…,DIAG_i41,…,DIAG_i4K]、WNMI_i＝[NMI_i11,…,NMI_i1K,NMI_i21,…,NMI_i2K,…,NMI_i41,…,NMI_i4K]。

On the basis, a 3-class earthquake damage visual dictionary is integrated and defined as an object R_iThe initial earthquake damage visual dictionary is W_i＝[WR_i,WG_i,WB_i,WJ_i,WAREA_i,WDIAG_i,WNMI_i]。

(3) Earthquake damage visual vocabulary image optimized expression

According to BoVW theory, on the basis of the constructed initial earthquake damage visual dictionary, for R_potRespectively counting the occurrence frequency of each visual vocabulary and constructing a visual vocabulary histogram. After all objects are traversed, the initial earthquake damage visual dictionary expression of the original image can be realized.

Nevertheless, different earthquake damage visual vocabularies have different expression effects on earthquake damage characteristics of the building, and redundant information possibly existing among the earthquake damage visual vocabularies not only increases the calculation complexity, but also reduces the detection precision due to mutual conflict among the different vocabularies as earthquake damage evidences. Therefore, the invention provides an optimized earthquake damage visual dictionary model based on intra-class penalty factors and inter-class penalty factors, so as to distinguish the contribution of different visual vocabularies to image expression, and the specific steps are as follows:

step 1: and calculating the intra-class penalty factor. Since there is usually a strong correlation between images of the same kind, different scales or bands, a lot of redundant information is contained. For this purpose, firstly, the structural similarity ssim (structural similarity) is used to calculate the correlation between any two bands or scale images under the same feature. SSIM combines the use of vector means, variances, and covariances with the unique advantages of bounded, symmetric, and maximum values over traditional "distance" similarity measures. SSIM between arbitrary vectors x and y is defined as follows:

wherein the content of the first and second substances,μ_x，μ_y，

σ_xyx and y means, standard deviation, variance and covariance, respectively. The value interval of SSIM is [0,1 ]]The larger the value, the stronger the correlation between the two vectors, and the more redundant information. Taking the area attribute as an example, if p and q are serial numbers of any two different scales, the SSIM between the two different scales can be represented as SSIM_pqAnd obtaining the intra-class penalty factor of the area attribute as follows:

similarly, the in-class penalty factors for which J-value, diagonal attribute and NMI attribute are available are divided into alpha_J、α_DIAGAnd alpha_NMI. The same strategy is adopted to calculate R, G, B the correlation between the three-band images, and the in-class penalty factor alpha of the spectral feature can be obtained_RGB。

Step 2: an inter-class penalty factor is calculated. Adopting mean value fusion strategy for R, G, B three-band image, multi-scale J-value and three-attribute multi-scale section respectively to obtain IMG_RGB、IMG_J-value、IMG_AREA、IMG_DIAGAnd IMG_NMIFive types of fused images. Next, inter-class penalty factors for RGB features are obtained using the following formula:

wherein, SSIM_RGB&J，SSIM_RGB&AREA，SSIM_RGB&DIAGAnd SSIM_RGB&NMIRespectively, the SSIM values between the RGB features and the four other feature fusion images. By analogy, the inter-class penalty factor beta of other similar characteristics can be obtained_RGB、β_J、β_AREA、β_DIAG、β_NMI。

Step 3: updating visual dictionary and vision of earthquake damageA perceptual vocabulary histogram. Based on the punishment factors in and among classes, the constructed optimized earthquake damage visual dictionary is as follows: w_i＝[α_RGBβ_RGB WR_i，α_RGBβ_RGB WG_i，α_RGBβ_RGB WB_i，α_Jβ_J WJ_i，α_AREAβ_AREAWAREA_i，α_DIAGβ_DIAGWDIAG_i，α_NMIβ_NMI WNMI_i]And correspondingly updating the visual vocabulary histogram as the input of a subsequent random forest classifier.

(4) Earthquake damage building detection based on random forest classification

Optimizing a set R of seismic damage visual vocabulary models and potential building objects based on the extraction_potIn the invention, an integrated classifier random forest RF (random forest) is adopted to obtain a final earthquake damage building detection result. The method comprises the following specific steps:

step 1: r is to be_potThe objects in (1) are divided into four categories of 'intact buildings', 'partially earthquake damaged buildings', 'ruins' and 'other ground objects'. The reason for further identifying the ruins in earthquake-damaged buildings is that such areas are usually the primary targets for emergency rescue after earthquake, and the 'partial earthquake-damaged buildings' can be used as reference for evaluation and reconstruction of earthquake damage after the earthquake;

step 2: statistics of each object R_iThe optimized earthquake damage visual dictionary histogram is used as an input feature space of the random forest classifier;

step 3: selecting a training sample set, and randomly and repeatedly extracting C sample subsets by using a Bagging method to ensure that the number of samples in each subset is equal to that of the training sample set;

step 4: determining the number of the random features corresponding to the nodes, and respectively constructing a decision tree model for each sample subset;

step 5: classifying each sample to be classified, voting according to the classification labels, and determining the final classification labels according to the number of votes.

Analysis of experiments

Two groups of high-resolution remote sensing images acquired in a short time period after the earthquake are adopted in the experiment, and compared with the experiment results of two different methods, the high-resolution remote sensing images are analyzed.

The data set 1 is a post-earthquake QuickBild panchromatic-multispectral fusion remote sensing image in Wenchun area of Sichuan province in China, and comprises R, G, B three wave bands. The earthquake occurrence time is 12 days at 5 months in 2008, the highest earthquake magnitude is 8.0 levels, the image acquisition time is 3 days at 6 months in 2008, and the size is 1024 × 1024 pixels, as shown in (a) in fig. 2. The data set 2 is a post-earthquake GE01 panchromatic-multispectral fusion remote sensing image in Yushu area of Qinghai province of China, and comprises R, G, B three wave bands. The earthquake occurrence time is 4/2010, the highest earthquake magnitude is 7.1, the image acquisition time is 5/6/2010, and the size is 1024 × 1024 pixels, as shown in fig. 2 (b). In addition, the reference image is drawn manually using field investigation and visual interpretation. Some representative regions in the original image are extracted and represented by sub-images with different line-shaped frames in fig. 2, where (a) the solid line frame is sub-image 1 and the dashed line frame is sub-image 2, and (b) the solid line frame is sub-image 3 and the dashed line frame is sub-image 4, for further detailed discussion and analysis.

In both sets of experiments, K in the proposed method was set to 14 and 18, respectively, according to the trial and error method. Meanwhile, in order to analyze the effectiveness of the constructed spectrum, texture and geometric morphology visual earthquake damage dictionary, experiments are firstly carried out based on the proposed method and only three different earthquake damage dictionaries are respectively utilized. On the basis, experiments are carried out by utilizing all three earthquake damage visual dictionaries so as to analyze the complementarity of the earthquake damage visual dictionaries in the detection of earthquake damage buildings.

In addition, two advanced methods were chosen for comparative experiments: the method 1 comprises the steps of firstly optimizing an initial feature set by using an improved SEATH algorithm, and further obtaining a detection result by using a classification method based on membership. The initial feature set adopts all the spectral, texture and geometric morphological features extracted by the method, and other implementation steps are consistent with those of the original text. The method 2 adopts color and gradient characteristics to construct a visual bag-of-words model, and obtains the earthquake damage building detection result through classification of a Support Vector Machine (SVM). It should be noted that, in the following description,in order to ensure the consistency of precision evaluation, all comparison methods adopt an initial object set R_iniAs a basic analysis unit.

The experimental results include intact buildings, partially earthquake damaged buildings, ruins and other four categories, and are represented by different colors, as shown in fig. 3 and 4.

The quantitative accuracy evaluation results are shown in tables 1 and 2. The identification effect of the method is obviously superior to that of other comparison methods through visual observation and quantitative precision evaluation. The overall precision of the method reaches more than 85%, and the fluctuation is only 0.2%, so that the optimized earthquake damage visual dictionary constructed by the method is feasible, effective and reliable in earthquake damage building detection. In addition, after the three types of earthquake damage visual dictionaries are comprehensively utilized, the overall precision is higher than that of a detection result obtained by singly adopting a certain type of earthquake damage dictionary, so that the complementarity of the spectrum, the texture and the geometric morphological characteristics in the detection of the earthquake damage building is proved. Nevertheless, the overall accuracy is still significantly lower than that of the method of the present invention, that is, the intra-class and inter-class penalty factors introduced by the present invention help to reduce redundant information and evidence conflicts, thereby improving the accuracy of subsequent classification. Compared with the method of the invention, the method 1 adopts the same initial feature set for optimization, and the precision difference of more than 10% in two groups of experiments further proves the excellent performance of the optimized earthquake damage visual dictionary constructed by the invention. The method 2 ignores the role of geometric morphological characteristics in the detection of earthquake-damaged buildings, and the detection precision is obviously lower than that of the method and the fluctuation is obvious.

Table 1: data set 1 quantitative accuracy evaluation results

Method/index	Overall accuracy (%)	False detection Rate (%)	Missing rate (%)	Kappa coefficient
					The method of the invention	85.60％	5.29％	14.30％	0.741
Spectral features	72.70％	11.10％	27.30％	0.553
					Morphological characteristics	72.30％	11.30％	27.70％	0.547
Texture features	69.50％	12.80％	30.50％	0.502
					Not optimized	73.50％	10.90％	25.50％	0.556
Method 1	74.90％	10.00％	25.10％	0.596
					Method 2	82.80％	6.49％	17.20％	0.697

Table 2: data set 2 quantitative accuracy evaluation results

The recognition results of the representative sub-images are shown in fig. 5 and 6. As shown in fig. 5 and 6, the selected sub-images are all representative post-earthquake scenes, and buildings with different earthquake damage degrees are mixed with ground objects such as vegetation, roads, wastelands and the like, so that the development analysis of the sub-images is favorable for further verifying the effect of the method. Wherein, only the method of the invention makes correct judgment on non-building objects which have similar spectral characteristics and regular shapes with buildings, such as vegetation and bare land; roads and ruins are easy to be confused due to similar highlight features which are usually shown in images, and only the method 2 of the invention makes correct judgment; for the identification of the intact building, only the method of the invention makes correct judgment, and the other two methods generate missing detection or error detection; for the identification of part of earthquake-damaged buildings, only the method 2 of the invention makes correct judgment. Therefore, further analysis of the typical region after the earthquake shows that the method of the invention achieves more ideal effect and is obviously superior to the two comparison methods, and the result is consistent with the results of visual observation and quantitative evaluation.

In the earthquake damage visual word bag model constructed by the invention, the setting of the dictionary length K has a remarkable influence on the overall precision. Although the local optimal value can be determined by adopting a trial-and-error method, the degree of automation is low. For this reason, the overall accuracy was further analyzed as a function of K.

As shown in fig. 7, the overall accuracy in both experiments showed a tendency of gradually increasing first and gradually decreasing after reaching the peak as K increased. Meanwhile, dataset 1 and dataset 2 experiments achieved a maximum overall at K14 (overall 85.6%) and K18 (overall 85.8%), respectively. In addition, when K is taken as a value in the value taking intervals [12 and 20], the overall precision can reach more than 80%. Therefore, it is suggested that in practical applications, the optimal value can be determined manually or by trial and error in this interval.

Conclusion

On the premise of lacking pre-earthquake reference information, the invention provides a high-resolution remote sensing image earthquake damage building detection method based on an optimized visual dictionary. The method spans the semantic gap from the pixels to the building earthquake damage features, and constructs an efficient optimized visual earthquake damage dictionary model, thereby realizing multi-angle fine portrayal of the earthquake damage building. By carrying out experiments on a plurality of groups of high-resolution remote sensing images after the earthquake of different regions and different sensors, the overall accuracy of the method disclosed by the invention is up to more than 85%. Meanwhile, through further division of intact buildings, partial earthquake-damaged buildings and ruins in detection results, key decision support can be provided for emergency response rescue after earthquake and reconstruction after disaster.

The embodiments are only for illustrating the technical idea of the present invention, and the technical idea of the present invention is not limited thereto, and any modifications made on the basis of the technical scheme according to the technical idea of the present invention fall within the scope of the present invention.

Claims

1. A method for detecting earthquake damage of buildings by high-resolution remote sensing images after earthquake is characterized by comprising the following steps:

2. The method for detecting earthquake-damaged buildings according to the post-earthquake high-resolution remote sensing images as claimed in claim 1, wherein in the step (1), the non-building object elimination follows the following rules:

(b) morphological building index rules: determining a separation threshold T for a morphological building index using a maximum inter-class variance method_MBICalculating the mean value of the morphological building index of all pixels in each object

Rejection satisfies

All of the objects of (1);

(c) the rectangle degree and the length-width ratio are regular: the degree of rectangularity of an object is defined as Rd ═ N_pixels/N_rectangleIn which N is_pixelsThe number of pixels contained in the minimum bounding rectangle of the object, N_rectangleIs at a minimum outsideConnecting the length-width ratio of the rectangle; if an object satisfies Rd < 0.8 and N_rectangleIf the distance is more than 5, the object is considered as a long and narrow object and is removed.

3. The method for detecting earthquake damage buildings according to the post-earthquake high-resolution remote sensing images as claimed in claim 1, wherein in the step (2), the spectrum-based earthquake damage visual dictionary is constructed by respectively performing K-means clustering on the three-band images of the RGB model according to gray values to obtain any object R_i∈R_potVisual dictionary WR of seismic injury of K-dimensional spectrum corresponding to three wave bands of R, G, B_i、WG_i、WB_iWherein R is_potThe set of potential building objects obtained in the step (1);

J-value＝(S_T-S_W)/S_W

The method for constructing the earthquake damage visual dictionary based on the geometric morphology comprises the steps of adopting area attributes, diagonal attributes and normalized moment of inertia attributes in morphological attribute profile MAPs as a class of earthquake damage visual vocabularies respectively, and designing different morphological attribute operators to enable the characteristics of a target on specific scale parameters and attributes to appear and distinguish the maximum response of the target with other ground objects so as to obtain an object R_iEarthquake damage visual dictionary WAREA of extracted area, diagonal and normalized moment of inertia attributes_i、WDIAG_i、WNMI_i；

4. The method for detecting earthquake damage buildings by using post-earthquake high-resolution remote sensing images as claimed in claim 3, wherein the concrete process of the step (3) is as follows:

(304) Constructing an optimized earthquake damage visual dictionary W based on the punishment factors in the classes and the punishment factors between the classes_i＝[α_RGBβ_RGB WR_i，α_RGBβ_RGB WG_i，α_RGBβ_RGB WB_i，α_Jβ_J WJ_i，α_AREAβ_AREA WAREA_i，α_DIAGβ_DIAGWDIAG_i，α_NMIβ_NMI WNMI_i]。

5. The method for detecting earthquake damage buildings by using post-earthquake high-resolution remote sensing images as claimed in claim 1, wherein the specific process of the step (4) is as follows: