CN112927767B

CN112927767B - Multi-element geochemical anomaly identification method based on graph attention self-coding

Info

Publication number: CN112927767B
Application number: CN202110196133.0A
Authority: CN
Inventors: 关庆锋; 任书良; 姚尧
Original assignee: China University of Geosciences
Current assignee: China University of Geosciences
Priority date: 2021-02-22
Filing date: 2021-02-22
Publication date: 2022-05-13
Anticipated expiration: 2041-02-22
Also published as: CN112927767A

Abstract

The invention discloses a multivariate geochemical anomaly identification method based on graph attention self-coding, which comprises the following steps of: determining a composition threshold value K; constructing a geochemical element topological network; learning the multivariate geochemical characteristics; reconstructing a multi-element geochemical element background; and calculating multiple abnormal values. The invention introduces graph learning into chemical exploration anomaly detection, utilizes K neighbor to construct a geochemical element topological relation graph, and constructs and trains a graph attention self-encoder capable of simultaneously extracting element composition relation and space structure characteristics. And reconstructing a multi-element geochemical background based on the trained graph attention self-encoder, and calculating to obtain the abnormal value of each final sampling point. The invention expands the existing neural network model, so that the model can directly process sampling point data and can be applied to irregular areas, the learning performance of a geophysical prospecting background and the practicability of the model are greatly improved, and a practical and reliable geophysical prospecting abnormity identification method is provided for complex geological conditions.

Description

Multi-element geochemical anomaly identification method based on graph attention self-coding

Technical Field

The invention relates to the field of multivariate geochemical exploration anomaly identification and the field of artificial intelligence application, in particular to a multivariate geochemical anomaly identification method based on graph attention self-coding.

Background

Identification of multiple geochemical anomalies is one of important contents for mineral resource exploration, and anomaly information of the multiple geochemical anomalies helps geologists judge potential mineral deposits. The geochemical exploration anomaly has strong spatial heterogeneity, and the spatial heterogeneity of the geochemical exploration elements must be considered. The traditional anomaly identification methods such as fractal/multiple analysis, a kriging method, space factor analysis and the like consider the correlation of space neighbor samples and are remarkable in the identification of the chemical exploration anomalies. In recent years, convolutional neural networks have been introduced into the research field of geochemical anomaly identification and have achieved higher accuracy by virtue of having good and automatic learning capability for complex spatial features. The input data of the model must be based on grid data after the interpolation of the sampling points; this causes secondary errors in the data and also limits the application of the model in natural environments (irregularities). In addition, the convolution window for extracting the spatial structure features can only be rectangular and has no rotation invariance, which is not consistent with the distribution condition of element features in a real environment and also limits the capability of the convolution layer for extracting the spatial features. Therefore, there is a need for improving and expanding the existing convolutional self-coding, and solving the above problems to improve its capability of diversified background learning and anomaly identification.

Disclosure of Invention

The invention provides a multivariate geochemical anomaly recognition method based on graph attention self-coding, aiming at the problems that the prior convolutional self-encoder can not directly use sampling point data and can not consider real sampling points and irregular mining areas, and comprising the following steps:

s1, calculating the spatial correlation of the earth chemical elements under different distance thresholds, drawing a distance threshold and a line graph of the correlation, and selecting the distance threshold at the obvious turning point of the line graph as K of the K neighbor algorithm;

s2, normalizing the concentration data of the multiple geochemical elements, and performing edge connection on the multiple geochemical exploration sampling points based on a K neighbor algorithm to complete a topological graph;

s3, constructing a graph attention self-encoder model, training by using the topological graph in S2, and selecting a model parameter with the minimum reconstruction error as a final graph attention self-encoder model;

s4, inputting the topological graph in the S2 into a final graph attention self-encoder model obtained by S3 training, and outputting to obtain a background value of the multiple geochemical elements;

and S5, calculating Euclidean distances between the multi-element geochemical element concentration data and the multi-element geochemical element background values in the S2 to be used as abnormal values, mapping the abnormal values to a geographic space, and generating a multi-element abnormal exploration map.

Compared with the prior art, the invention has the following beneficial effects:

(1) the optimal distance threshold value K is determined by utilizing the Moran' I index, and the sampling points are topologically structured by utilizing the K neighbor algorithm, so that the sampling point data has attribute information and topological structure information and can be used for deep learning of the image.

(2) The method introduces the graph deep learning into the field of the chemical exploration anomaly identification, constructs a graph attention self-coding neural network, and provides a chemical exploration anomaly identification model considering real sampling points and irregular mining areas; the network model directly takes real geochemical sampling point data as input data, secondary errors caused by interpolation are avoided, the model can be applied to irregular areas, and the practicability of the geochemical model is improved.

(3) The structure responsible for space extraction in the self-coding constructed by the method is a graph attention layer, and the attribute characteristics and the structural characteristics of sampling points in the multi-element geochemical topological graph can be extracted; compared with the common convolutional layer, the convolutional layer has rotational invariance, can better extract the spatial characteristics of different regions and different structures, and provides an efficient method for carrying out anomaly identification and mineral deposit judgment by utilizing diversified probe data under complex geological and mineral-forming environments.

Drawings

The invention will be further described with reference to the accompanying drawings and examples, in which:

FIG. 1 is a flow chart of a method of the present invention;

FIG. 2 is a plot of data for 5 chemical elements of a Minxife ore band in accordance with an embodiment of the present invention;

FIG. 3 is a schematic diagram of the proposed detection anomaly identification based on graph attention self-coding;

FIG. 4 is a final anomaly result graph in an implementation example;

FIG. 5 is a graph showing comparative evaluation of model performance according to the present invention.

Detailed Description

For a more clear understanding of the technical features, objects and effects of the present invention, embodiments of the present invention will now be described in detail with reference to the accompanying drawings.

Referring to fig. 1, the method for recognizing multivariate geochemical anomalies based on graph attention self-coding of the present invention comprises the following steps:

s1, geological and mineralizing environment according to research areaSelecting geochemical element concentration data which is properly related to the detection of mineral product abnormity, and normalizing the geochemical element concentration data, wherein the final processing result is shown in a figure 2; in this example, the mineral product is detected as Fe, and the geochemical element which is related to the abnormality is Cu-Zn-Mn-Pb-Fe₂O₃；

S2, carrying out qualitative measurement on the spatial correlation and the aggregation of the geochemical elements by utilizing a Moran' I index, and selecting a threshold inflection point which can reserve enough spatial heterogeneity and contains enough background samples as an optimal K value of a K nearest neighbor algorithm;

s3, traversing all sampling points by using a K neighbor algorithm based on the determined K value, and performing edge connection establishment on neighbor sampling points within the distance K to form a multi-element geochemical topological graph; wherein, the sampling points are converted into nodes of a topological graph, and the concentration of multiple geochemical elements collected by the sampling points is used as a node attribute vector;

s4, constructing a graph attention self-coding model shown in the figure 3, inputting the obtained multivariate geochemical topological graph constructed in the S3 into the model for model training, observing reconstruction loss, stopping training when the error between input data and output data is not changed any more, and obtaining a completely trained graph attention self-coding model, wherein a loss function is specifically as follows:

n is the number of nodes (sampling points) in the graph, F is the characteristic number of the nodes,

is the kth original feature (element value) of the ith node (sample point),

for the kth reconstructed feature (element background value) of the ith node.

S5, as shown in FIG. 3, inputting the obtained multivariate geochemical topological graph constructed in the S3 into the model trained in the S4 again, and outputting to obtain a multivariate geochemical exploration background value of the region;

s6, calculating the Euclidean distance between the original data and the reconstructed background value of each sample as an abnormal value, wherein the formula is as follows:

f is the characteristic number of the node,

is the kth original feature (element value) of the ith node (sample point),

for the kth reconstructed feature (element background value) of the ith node.

And mapping the abnormal value to the geographical position of the sample to obtain a final regional mapping abnormal recognition map, as shown in fig. 4.

Evaluating an abnormal map of a known mining point area by adopting an ROC curve; when the area under the ROC curve to the whole graph area ratio (AUC) is greater than 0.5, it is demonstrated that the model is usable, with better model performance as the AUC approaches 1. The AUC value of the self-coding model constructed in this study was 0.797, which is shown in fig. 5, and is superior to other methods for identifying chemical anomalies.

The invention introduces graph learning into the abnormal identification of chemical exploration, and provides a graph attention self-coding model considering real sampling points and irregular mining areas; the network model directly takes the real geochemical sampling point data as input data, thereby avoiding secondary errors caused by interpolation, enabling the model to be applied in irregular areas and improving the practicability of the geochemical model. In addition, the structure responsible for space extraction in the self-encoding constructed by the method is an attention layer of the drawing, and the attribute characteristics and the structural characteristics of sampling points in the multi-element geochemical topological graph can be extracted; compared with the common convolutional layer, the convolutional layer has rotational invariance, can better extract the spatial characteristics of different regions and different structures, and provides an efficient method for carrying out anomaly identification and mineral deposit judgment by utilizing diversified probe data under complex geological and mineral-forming environments.

While the present invention has been described with reference to the embodiments shown in the drawings, the present invention is not limited to the embodiments, which are illustrative and not restrictive, and it will be apparent to those skilled in the art that various changes and modifications can be made therein without departing from the spirit and scope of the invention as defined in the appended claims.

Claims

1. The method for identifying the multivariate geochemical anomaly based on the graph attention self-coding is characterized by comprising the following steps of:

2. The method for identifying multivariate geochemical abnormalities based on graph attention self-encoding according to claim 1, wherein the step S1 is specifically to quantitatively measure the spatial correlation and aggregative property of geochemical elements by using Moran' I index, and finally select the optimal K value.

3. The method for identifying multivariate geochemical anomaly based on graph attention self-encoding according to claim 1, wherein the sampling points are converted into nodes of a topological graph in step S2, and the concentrations of the multivariate geochemical elements collected by the sampling points are used as node attribute vectors.

4. The method for multivariate geochemical anomaly recognition based on graph attention self-coding according to claim 1, wherein in step S3, the encoder and the decoder of the graph attention self-coder model are both composed of two graph convolutional layers with attention mechanism.

5. The method for multivariate geochemical anomaly recognition based on graph attention self-coding according to claim 1, wherein in step S3, the final graph attention self-coder model reconstruction error L_REThe calculation formula is as follows:

wherein N is the number of sampling points, F is the characteristic number of the sampling points,

for the kth original feature of the ith sample point,

the k element background value of the ith sample point.

6. The method for multivariate geochemical anomaly recognition based on graph attention self-coding according to claim 1, wherein the abnormal value calculation formula in step S5 is as follows:

wherein F is the characteristic number of the sampling points,

for the kth original feature of the ith sample point,

is the k element background value of the ith node.