CN112163374B - Processing method for multi-modal data intermediate layer fusion full-connection geological map prediction model - Google Patents

Processing method for multi-modal data intermediate layer fusion full-connection geological map prediction model Download PDF

Info

Publication number
CN112163374B
CN112163374B CN202011029251.4A CN202011029251A CN112163374B CN 112163374 B CN112163374 B CN 112163374B CN 202011029251 A CN202011029251 A CN 202011029251A CN 112163374 B CN112163374 B CN 112163374B
Authority
CN
China
Prior art keywords
data
remote sensing
full
matrix
basic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011029251.4A
Other languages
Chinese (zh)
Other versions
CN112163374A (en
Inventor
李超岭
刘园园
孙钰
李丰丹
付偲
刘畅
吕霞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Command Center Of Natural Resources Comprehensive Survey Of China Geological Survey
Development & Research Center Of China Geological Survey Bureau
Original Assignee
Command Center Of Natural Resources Comprehensive Survey Of China Geological Survey
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Command Center Of Natural Resources Comprehensive Survey Of China Geological Survey filed Critical Command Center Of Natural Resources Comprehensive Survey Of China Geological Survey
Priority to CN202011029251.4A priority Critical patent/CN112163374B/en
Publication of CN112163374A publication Critical patent/CN112163374A/en
Application granted granted Critical
Publication of CN112163374B publication Critical patent/CN112163374B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • G06F30/27Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • G06F18/253Fusion techniques of extracted features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Molecular Biology (AREA)
  • Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Geometry (AREA)
  • Medical Informatics (AREA)
  • Computer Hardware Design (AREA)
  • Remote Sensing (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Geophysics And Detection Of Objects (AREA)

Abstract

The application discloses a processing method of a multi-mode data intermediate layer fusion full-connection geological map prediction model, which comprises the following steps: performing complement and normalization processing on the evidence data; reading data information from the processed evidence data according to the training sampling point coordinates; acquiring basic data and remote sensing data of training sampling points; and judging whether the data is read into the memory once, if so, directly inputting the model training, and if not, grouping the data from the data storage path to input the model training. The invention has the advantages that: the method not only converts the original data into high-dimensional characteristic expression, but also performs characteristic fusion on the basic data characteristics and the remote sensing data characteristics, and utilizes the fully connected neural network to maximally acquire the common characteristics of the basic data and the remote sensing data in the high-dimensional space, thereby greatly improving the capability and precision of predicting the object, and enabling the adjacent relation among the distribution form and the distribution direction of the lithology of the filling unit in space, the distribution position and the geological object to be maximally embodied and expressed with accuracy.

Description

Processing method for multi-modal data intermediate layer fusion full-connection geological map prediction model
Technical Field
The invention relates to a processing method for a multi-mode data middle layer fusion full-connection geological map prediction model.
Background
The geological evidence data has the characteristics of multiple professions (relating to earth materialization and remote), multiple modes (vector data, raster data or space data and attribute data), multiple scales (the highest precision is 5 meters, the common precision is between 10 and 2000 meters), and the like, and can reflect the geological characteristics of the earth surface and the part of the depth (removing the covering layer). In the past, the common processing method is basically to process according to the profession and the same data type, and the data fusion is difficult to get rid of the dilemma of multiple skins, so that the mining of the data in the deep aspect is greatly limited.
Disclosure of Invention
The invention aims to overcome the defects and provide a processing method for a multi-mode data intermediate layer fusion full-connection geological map prediction model, which solves the technical problem of how to deeply mine geological evidence data.
In order to achieve the above purpose, the technical scheme adopted by the invention is as follows: a processing method for a multi-modal data intermediate layer fusion full-connection geological map prediction model is characterized by comprising the following steps:
performing complement and normalization processing on the evidence data;
reading data information from the processed evidence data according to the training sampling point coordinates;
acquiring basic data and remote sensing data of training sampling points;
and judging whether the data is read into the memory once, if so, directly inputting the model training, and if not, grouping the data from the data storage path to input the model training.
Further, the supplementing the data includes:
reading remote sensing data as a data matrix, and marking the data as A;
creating a data matrix consistent with the basic data processing range, wherein the initial values of data in the matrix are all 0, and the data matrix is marked as B;
reading data consistent with the basic data processing range row by row from the lower left corner of A and storing the data to the corresponding position in B;
if the upper half of the matrix in B has data of 0, the data is replaced by a first value which is not 0 in the column where the data is located, and if the lower half of the matrix in B has data of 0, the data is replaced by a last value which is not 0 in the column where the data is located;
if the left half part of the matrix in B has data of 0, the data is replaced by a first value which is not 0 in the row where the data is located, and if the right half part of the matrix in B has data of 0, the data is replaced by a last value which is not 0 in the row where the data is located;
b is evidence data after the complement.
Further, the normalization process employs an algorithm:
where X represents the data that the underlying data or telemetry data read on each channel or class of chemical elements, x_min represents the minimum value in the data matrix X, and x_max represents the maximum value in the data matrix X.
Further, the model includes:
input layer: the basic data and the remote sensing data of the training sampling points are simultaneously input into a full-connection hybrid input model through an input layer;
basic data feature extraction layer: the feature extraction layer is composed of three full-connection layers, full-connection operation is carried out through a full-connection neural network, the network features, namely basic data features, are extracted by utilizing weight values, and original data are converted into high-dimensional feature expression;
remote sensing data feature extraction layer: the feature extraction layer is composed of three full-connection layers, full-connection operation is carried out through a full-connection neural network, the network features, namely remote sensing data features, are extracted by utilizing weight values, and original data are converted into high-dimensional feature expression;
fusion layer: carrying out feature fusion on the basic data features and the remote sensing data features;
full tie layer: the method comprises the steps of obtaining common characteristics of basic data and remote sensing data in a high-dimensional space by using a fully connected neural network;
output layer: and outputting the probability values of the sampling points predicted as various types.
The beneficial effects of the invention are as follows:
the method is simple to realize, not only converts the original data into high-dimensional characteristic expression, but also performs characteristic fusion on the basic data characteristics and the remote sensing data characteristics, and utilizes the fully connected neural network to maximally acquire the commonality characteristics of the basic data and the remote sensing data in the high-dimensional space, thereby greatly improving the capability and the precision of the prediction object, and enabling the most basic geological object of the geological map, namely the filling unit lithology, to be maximally reflected and expressed in terms of the indexes such as the spatial distribution form, the distribution direction, the distribution position, the adjacent relation among the geological objects and the like. After the technology breaks through, the existing geological investigation working mode is thoroughly changed, and the novel geological mapping mode integrating geological knowledge map, geological big data and deep learning algorithm is innovated, so that the optimal geological route and geological map prediction cycle progressive advancing mapping mode is realized, and a fine-granularity prediction geological map is formed.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this application, illustrate embodiments of the application and together with the description serve to explain the application and do not constitute an undue limitation to the application. In the drawings:
FIG. 1 is a flow chart of a processing method of the multi-modal data middle-layer fusion fully-connected geologic map prediction model of the invention;
FIG. 2 is a schematic diagram of an embodiment of the method of FIG. 1;
FIG. 3 is a schematic diagram of various data processing scope embodiments;
FIG. 4 is a flow chart of evidence data normalization;
FIG. 5 is a network structure diagram of a multi-modal data middle layer fusion full-connection geologic map prediction model;
Detailed Description
Certain terms are used throughout the description and claims to refer to particular components. Those of skill in the art will appreciate that a hardware manufacturer may refer to the same component by different names. The description and claims do not take the form of an element differentiated by name, but rather by functionality. As used throughout the specification and claims, the word "comprise" is an open-ended term, and thus should be interpreted to mean "include, but not limited to. By "substantially" is meant that within an acceptable error range, a person skilled in the art is able to solve the technical problem within a certain error range, substantially achieving the technical effect. The description hereinafter sets forth the preferred embodiment for carrying out the present application, but is not intended to limit the scope of the present application in general, for the purpose of illustrating the general principles of the present application. The scope of the present application is defined by the appended claims.
Referring to fig. 1 to 5, a method for processing a multi-modal data middle layer fusion full-connection geological map prediction model according to the present invention includes:
step S101, performing complement and normalization processing on the data of the evidence;
step S102, reading data information from the processed evidence data according to the training sampling point coordinates;
step S103, basic data and remote sensing data of training sampling points are obtained;
step S104, judging whether the data is read into the memory once, if yes, directly inputting model training, and if no, inputting model training from the data storage path group.
In one embodiment, the supplementing the data includes:
reading remote sensing data as a data matrix, and marking the data as A;
creating a data matrix consistent with the basic data processing range, wherein the initial values of data in the matrix are all 0, and the data matrix is marked as B;
reading data consistent with the basic data processing range row by row from the lower left corner of A and storing the data to the corresponding position in B;
if the upper half of the matrix in B has data of 0, the data is replaced by a first value which is not 0 in the column where the data is located, and if the lower half of the matrix in B has data of 0, the data is replaced by a last value which is not 0 in the column where the data is located;
if the left half part of the matrix in B has data of 0, the data is replaced by a first value which is not 0 in the row where the data is located, and if the right half part of the matrix in B has data of 0, the data is replaced by a last value which is not 0 in the row where the data is located;
b is evidence data after the complement.
In one embodiment, the normalization process employs an algorithm:
where X represents the data that the underlying data or telemetry data read on each channel or class of chemical elements, x_min represents the minimum value in the data matrix X, and x_max represents the maximum value in the data matrix X.
In one embodiment, the model comprises:
input layer: the basic data and the remote sensing data of the training sampling points are simultaneously input into a full-connection hybrid input model through an input layer;
basic data feature extraction layer: the feature extraction layer is composed of three full-connection layers, full-connection operation is carried out through a full-connection neural network, the network features, namely basic data features, are extracted by utilizing weight values, and original data are converted into high-dimensional feature expression;
remote sensing data feature extraction layer: the feature extraction layer is composed of three full-connection layers, full-connection operation is carried out through a full-connection neural network, the network features, namely remote sensing data features, are extracted by utilizing weight values, and original data are converted into high-dimensional feature expression;
fusion layer: carrying out feature fusion on the basic data features and the remote sensing data features;
full tie layer: the method comprises the steps of obtaining common characteristics of basic data and remote sensing data in a high-dimensional space by using a fully connected neural network;
output layer: and outputting the probability values of the sampling points predicted as various types.
Method for processing spatial boundary consistency of various evidence data and supplementing evidence data
Because the data is multi-modal, and the source and processing mode of the data are various software, boundary data may not fill the whole frame area, data mismatch is caused, and blank areas (non-valued areas) appear, so as to fully utilize the used data, ensure that the predicted area can fill the whole frame area, and various evidence data space boundary consistency processes are required. In this example, 4 data of 1:50000 frames are taken as an example, and a method and a flow of data processing are described:
the data processing ranges of the evidence data after the four pictures are combined are recorded in the table 1, 20 ten thousand geochemical data and high magnetism 2500 are taken as basic data, remote sensing high three SAR data, a Digital Elevation Model (DEM) and remote sensing Landsat8 are taken as remote sensing data, and the processing ranges of the basic data and the remote sensing data are inconsistent. If the data processing ranges are inconsistent, the corresponding basic data and remote sensing data obtained according to the coordinate information of the sampling point may not coincide with the sampling point. As shown in fig. 3, the black area represents a data processing range, and fig. 3 (a) the data processing range is (4, 4); in fig. 3 (b), the data processing range is (5, 5), the white area is the position of the sampling point coordinate information, and under normal conditions, when the data is read, the position coordinate information of the white area in (a) is consistent and the data is correctly read when the data is read by taking the lower left corner as the origin and the position coordinate information of the white area in (a) and (b) is inconsistent when the data is read by taking the upper left corner as the origin and the data is read according to the sampling point coordinate information. To ensure consistency of data reading, the processing range of evidence data needs to be kept consistent.
TABLE 1 four-breadth-to-aggregate evidence data processing scope
Data Data processing range
Geochemistry of 1:20 ten thousand Wide 8114 pixels and high 7191 pixels
High magnetic_2500 Wide 8114 pixels and high 7191 pixels
ALOS SAR data Wide 8112 pixels and high 7189 pixels
Digital Elevation Model (DEM) Wide 8116 pixels and high 7190 pixels
Remote sensing Landsat8 Wide 8116 pixels and high 7190 pixels
And carrying out data complement processing on the remote sensing data by taking the basic data as a reference, so that the processing range of the remote sensing data is consistent with the basic data.
The data complement process is as follows:
(1) reading remote sensing data as a data matrix, and marking the data as A;
(2) creating a data matrix consistent with the basic data processing range, wherein the initial values of data in the matrix are all 0, and the data matrix is marked as B;
(3) reading data consistent with the basic data processing range row by row from the lower left corner of A and storing the data to the corresponding position in B;
(4) if the upper half of the matrix in B has data of 0, the data is replaced by a first value which is not 0 in the column where the data is located, and if the lower half of the matrix in B has data of 0, the data is replaced by a last value which is not 0 in the column where the data is located;
(5) if the left half part of the matrix in B has data of 0, the data is replaced by a first value which is not 0 in the row where the data is located, and if the right half part of the matrix in B has data of 0, the data is replaced by a last value which is not 0 in the row where the data is located;
(6) b is evidence data after the complement.
(2) Geological evidence data normalization
Geological evidence relates to a variety of specialized data, the dimensions of each of which are also substantially inconsistent. It is therefore desirable to change a dimensionless expression to a dimensionless (physical unit without actual data) expression, which facilitates the comparison and weighting of different units or magnitudes of the indicators, a process called normalization. The normalization changes the data into the decimal between (0, 1) or (-1, 1), which is a linear transformation, can ensure that the data can not be invalid after normalization processing, and can improve the expressive force of the data and realize data enhancement. The invention adopts the following normalization formula:
where X represents the data that the underlying data or telemetry data read on each channel or class of chemical elements, x_min represents the minimum value in the data matrix X, and x_max represents the maximum value in the data matrix X.
Normalization processing is carried out on the evidence data, firstly, floating point overflow or underflow during calculation is prevented from being caused by an excessively large or excessively small numerical range; secondly, different numerical ranges can cause different importance of different attributes to the model, and equalization of training of the model by the different attributes is achieved after data are normalized; thirdly, after normalization processing is carried out on the data, the solving speed of gradient descent can be increased, namely the convergence speed of the model is improved.
Referring to fig. 4, the basic data in the evidence data is the same as the remote sensing data in the processing manner, and the normalization processing data flow is specifically described below by taking normalization processing Digital Elevation Model (DEM) data as an example:
(1) obtaining a tif format file of the digital elevation model data from a specified evidence data storage path;
(2) reading the tif format file into a data matrix by using an imread function under an io module in the image packet, and recording the data matrix as A and the data matrix as a numpy array form;
(3) solving a maximum value B and a minimum value C in A by using a max method and a min method in a numpy library;
(4) calculation ofThe result is a result after normalization of Digital Elevation Model (DEM) data, the result is marked as D, and D is in a data matrix format;
(5) the D is a npy format file, i.e., a numpy array format file, and is stored under the folder of normalized data, and is stored under the name "dem.
(3) Data packet storage
Because the radius of the second sampling is larger, the obtained data volume of the training sampling point is overlarge, the problem that the data cannot be read into the memory for model training at one time may exist, when the problem occurs, the basic data and the remote sensing data of the training sampling point are required to be firstly put into the hard disk for storage, and the data is read into the memory from the hard disk during model training. And if the memory is allowed, directly entering the fourth step.
The method for storing data in groups is as follows:
(1) in order to prevent data reading errors, firstly, storing basic data and remote sensing data of training set sampling points obtained by expanding various mapping units and lithology after secondary sampling, and storing the data packets in basic and remote sensing data folders (for example, file storage paths:/datasets/PRB/training sampling point data/basic (remote sensing) data/Qh 3al fourth series riverbed/interval_data_1 (remote_data_1). Npy) corresponding to each mapping unit and lithology; each npy file stores basic data or remote sensing data of training set sampling points of the geologic body;
(2) and (3) grouping and synthesizing the basic and remote sensing data of the various mapping units and lithology obtained in the step (1) to obtain final training sampling point data, storing the final training sampling point data in a corresponding folder (a file storage path such as/data/PRB/training sampling point data/basic (remote sensing) data/inter_data_1 (remote_data_1). Npy), and storing the basic data or remote sensing data of the training sampling points of each mapping unit and lithology participating in training in each npy file.
And according to the mode of grouping and storing the data, the basic data and the remote sensing data of the training sampling points are stored, the basic data and the remote sensing data are grouped and read from a storage path during model training, and the data are read into a memory for model training.
(4) Modeling of multi-modal data middle layer fusion full-connection geological map prediction model
(1) Multimode data middle layer fusion full-connection geological map prediction model structure
Multimodal fusion refers to the process of integrating information from two or more modalities to make predictions. In the prediction process, a single mode cannot generally contain all effective information required for generating an accurate prediction result, and the multi-mode fusion process fuses information from two or more modes, so that information supplementation is realized, the coverage range of information contained in input data is widened, the accuracy of the prediction result is improved, and the robustness of a prediction model is improved. Currently, there are three main fusion modes for multi-modal data fusion: front-end fusion or data level fusion, back-end fusion or decision level fusion, and intermediate fusion. The intermediate fusion is to convert the data of different modes into high-dimensional characteristic expression and then fuse the high-dimensional characteristic expression with the intermediate layer of the model. The invention adopts an intermediate fusion mode. The network structure diagram of the multi-mode data middle layer fusion full-connection geological map prediction model is shown in fig. 5. Wherein the meaning of each layer is as follows:
input layer: the basic data and the remote sensing data of the training sampling points are simultaneously input into a full-connection hybrid input model through an input layer;
basic data feature extraction layer: the feature extraction layer is composed of three full-connection layers, full-connection operation is carried out through a full-connection neural network, the network features, namely basic data features, are extracted by utilizing weight values, and original data are converted into high-dimensional feature expression (3 layers of the full-connection layers are determined by the following (2) th point description);
remote sensing data feature extraction layer: the feature extraction layer is composed of three full-connection layers, full-connection operation is carried out through a full-connection neural network, the network features, namely remote sensing data features, are extracted by utilizing weight values, and original data are converted into high-dimensional feature expression;
fusion layer: carrying out feature fusion on the basic data features and the remote sensing data features;
full tie layer: the method comprises the steps of obtaining common characteristics of basic data and remote sensing data in a high-dimensional space by using a fully connected neural network;
output layer: and outputting the probability values of the sampling points predicted as various types.
(1) Full-connection layer number optimization determination for converting different-mode data into high-dimensional feature expression
The selection of the number of layers of the fully connected layer is an important index for improving modeling accuracy. Increasing the number of layers can lead to the increase of operation time, lower efficiency and overfitting, and insufficient layers can lead to insufficient extraction of characteristic information so as to influence model precision and recall rate. For geological maps predicted by map units and lithology, one project is typically on the scale of 2-4 joint tests, with the map units and lithology classification trees being between substantially 100-200. The prediction is compared with the calculation of the number of different full-connection layers, and the advantages of the three full-connection layers are obvious in the conversion of different mode data into high-dimensional feature expression extraction and the improvement of model precision and recall rate. Table 2 is a comparison table of the effect of using different full connection layer numbers on model accuracy under the same sample data conditions.
TABLE 2
From table 2, it can be found that the difference between the accuracy and the average recall value of the initial model obtained by training the 3 model structures on the same training set sampling point is obvious at the original sampling point and the training set sampling point. Because the accuracy of the initial model can have a certain influence on the training sampling point of the secondary modeling, if the accuracy of the initial model is too low, the reliability of the training sampling point of the secondary sampling and the label thereof can be low. As can be seen from table 2, the feature extraction of the basic data and the remote sensing data by using the 2-layer fully connected neural network is insufficient, and the fitting degree of the training set is insufficient, so that the model has the phenomenon of under fitting. The 3-layer fully-connected neural network is adopted to extract the characteristics of the basic data and the remote sensing data, the accuracy and average recall rate of the original sampling points in the initial model are improved by more than 15%, the accuracy and average recall rate of the original sampling points in the secondary model are improved by more than 13% -15%, and the recall rate of the training set of the final model is improved by more than 8% although the accuracy of the training set of the final model is improved by more than 95%. The effect of the 4-layer fully-connected neural network on the feature extraction of the basic data and the remote sensing data is obviously improved in comparison with that of the 2-layer fully-connected neural network, but compared with that of the 3-layer fully-connected neural network, all corresponding indexes are about 4% different, which shows that the increase of the number of layers of the fully-connected layers in a single branch for independently extracting the features of the basic data and the remote sensing data does not obviously increase the corresponding accuracy and average recall value. Therefore, in the geological map prediction model based on PRB data deep learning, the adoption of a 3-layer fully connected neural network for extracting the characteristics of the basic data and the remote sensing data is a proper choice.
Principle and method for determining network structure parameters
During modeling, the network structure parameters are selected according to the principle and method of selecting the network structure parameters in table 2 according to the number of the labels of the predicted objects and the number of geological evidence. Table 3 is an example of a parameter determination method and principle given according to a specific application.
TABLE 3 Table 3
As shown in table 3, the meaning of each parameter is as follows:
a. input-1 and Input-2 represent basic data and remote sensing data Input, respectively. The corresponding output dimension represents the corresponding data category. If the basic data comprises 39 kinds of chemical element data and ground high magnetic data, the output dimension is 40 dimensions. The remote sensing data comprise high-resolution third satellite data, landsat8 satellite data and digital elevation model data, and the total number of the data is 16, so that the output dimension is 16 dimensions.
b. The three full connection layers of Dense-1, dense-2 and Dense-3 are used as basic data extraction features, and the three full connection layers of Dense-4, dense-5 and Dense-6 are used as remote sensing data extraction features.
c. The Concate layer is a fusion layer.
d. Dense-7 acquires the commonality characteristic of the basic data and the remote sensing data in a high-dimensional space.
e. Output is the Output layer. The number of neurons of the first full-connection layer of each single mode extraction feature, namely parameters, is determined by the number of geologic body categories to be classified, the category number is set as A, the number of neurons of the first full-connection layer is set as B, and the relation between A and B meets the following formula: b=2n > a.
f. The activation function is also one of key elements of network model training, and is mainly like adding a nonlinear factor into a neural network, so that the network can solve more complex nonlinear problems, meanwhile, the activation function filters out some useless information in the forward propagation process, and updates network training parameters in the backward propagation process. The ReLU function has the characteristics of single-side inhibition, single-side straight-through, wider activation boundary, sparse network and the like, so that the activation function has a relatively high convergence rate; the Softmax activation function is generally used as an activation function of an output layer, is often used in combination with a cross entropy loss function, is also called a normalized exponential function, is generalized on multi-classification tasks by a two-classification function Sigmoid function, and aims to display multi-classification results in the form of probability. Therefore, in the present invention, the ReLU activation function is used in the fully connected layer, and the activation function generally used as the output layer is used in the output layer, often in combination with the cross entropy loss function. The function is also called a normalized exponential function, is generalized on multi-classification tasks by a classification function Sigmoid function, and aims to display multi-classification results in a probability form.
The fully connected hybrid input model selects SGD (random gradient descent algorithm) as a parameter optimizer, and compared with the classical gradient descent algorithm which traverses all training data when updating parameters, the random gradient descent algorithm approximates the average loss of all training samples by using the loss of a single training sample, thereby greatly accelerating the network training speed. In the selection of the learning rate, 0.001 is selected as the initial learning rate, and a method for attenuating the learning rate is adopted in the network training, namely, a larger learning rate is adopted at first, after each parameter update, the learning rate is reduced to make finer adjustment on the parameter when the parameter is updated next time.
In a specific application, the change of the number of the filling units can be flexibly changed according to the processing parameter requirements.
The beneficial effects of the invention are as follows:
the method is simple to realize, not only converts the original data into high-dimensional characteristic expression, but also performs characteristic fusion on the basic data characteristics and the remote sensing data characteristics, and utilizes the fully connected neural network to maximally acquire the commonality characteristics of the basic data and the remote sensing data in the high-dimensional space, thereby greatly improving the capability and the precision of the prediction object, and enabling the most basic geological object of the geological map, namely the filling unit lithology, to be maximally reflected and expressed in terms of the indexes such as the spatial distribution form, the distribution direction, the distribution position, the adjacent relation among the geological objects and the like. After the technology breaks through, the existing geological investigation working mode is thoroughly changed, and the novel geological mapping mode integrating geological knowledge map, geological big data and deep learning algorithm is innovated, so that the optimal geological route and geological map prediction cycle progressive advancing mapping mode is realized, and a fine-granularity prediction geological map is formed.
While the foregoing description illustrates and describes the preferred embodiments of the present application, it is to be understood that this application is not limited to the forms disclosed herein, but is not to be construed as an exclusive use of other embodiments, and is capable of many other combinations, modifications and environments, and adaptations within the scope of the teachings described herein, through the foregoing teachings or through the knowledge or skills of the relevant art. And that modifications and variations which do not depart from the spirit and scope of the present invention are intended to be within the scope of the appended claims.

Claims (1)

1. A processing method for a multi-modal data intermediate layer fusion full-connection geological map prediction model is characterized by comprising the following steps:
performing complement and normalization processing on the evidence data;
reading data information from the processed evidence data according to the training sampling point coordinates;
acquiring basic data and remote sensing data of training sampling points;
judging whether the data is read into the memory once or not, if yes, directly inputting model training, and if no, inputting model training from the data storage path group;
the supplementing the evidence data comprises the following steps:
reading remote sensing data as a data matrix, and marking the data as A;
creating a data matrix consistent with the basic data processing range, wherein the initial values of data in the matrix are all 0, and the data matrix is marked as B;
reading data consistent with the basic data processing range row by row from the lower left corner of A and storing the data to the corresponding position in B;
if the upper half of the matrix in B has data of 0, the data is replaced by a first value which is not 0 in the column where the data is located, and if the lower half of the matrix in B has data of 0, the data is replaced by a last value which is not 0 in the column where the data is located;
if the left half part of the matrix in B has data of 0, the data is replaced by a first value which is not 0 in the row where the data is located, and if the right half part of the matrix in B has data of 0, the data is replaced by a last value which is not 0 in the row where the data is located;
namely, the data matrix B after the complement value is obtained;
the normalization process adopts an algorithm:
wherein X represents data read by basic data or remote sensing data on each channel or each type of chemical element, X_MIN represents the minimum value in the data matrix X, and X_MAX represents the maximum value in the data matrix X;
the model comprises:
input layer: the basic data and the remote sensing data of the training sampling points are simultaneously input into a full-connection hybrid input model through an input layer;
basic data feature extraction layer: the feature extraction layer is composed of three full-connection layers, full-connection operation is carried out through a full-connection neural network, the network features, namely basic data features, are extracted by utilizing weight values, and original data are converted into high-dimensional feature expression;
remote sensing data feature extraction layer: the feature extraction layer is composed of three full-connection layers, full-connection operation is carried out through a full-connection neural network, the network features, namely remote sensing data features, are extracted by utilizing weight values, and original data are converted into high-dimensional feature expression;
fusion layer: carrying out feature fusion on the basic data features and the remote sensing data features;
full tie layer: the method comprises the steps of obtaining common characteristics of basic data and remote sensing data in a high-dimensional space by using a fully connected neural network;
output layer: and outputting the probability values of the sampling points predicted as various types.
CN202011029251.4A 2020-09-27 2020-09-27 Processing method for multi-modal data intermediate layer fusion full-connection geological map prediction model Active CN112163374B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011029251.4A CN112163374B (en) 2020-09-27 2020-09-27 Processing method for multi-modal data intermediate layer fusion full-connection geological map prediction model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011029251.4A CN112163374B (en) 2020-09-27 2020-09-27 Processing method for multi-modal data intermediate layer fusion full-connection geological map prediction model

Publications (2)

Publication Number Publication Date
CN112163374A CN112163374A (en) 2021-01-01
CN112163374B true CN112163374B (en) 2024-02-20

Family

ID=73864176

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011029251.4A Active CN112163374B (en) 2020-09-27 2020-09-27 Processing method for multi-modal data intermediate layer fusion full-connection geological map prediction model

Country Status (1)

Country Link
CN (1) CN112163374B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105512661A (en) * 2015-11-25 2016-04-20 中国人民解放军信息工程大学 Multi-mode-characteristic-fusion-based remote-sensing image classification method
CN109614880A (en) * 2018-11-19 2019-04-12 国家电网有限公司 A kind of multi-modal biological characteristic fusion method and device
CN110458282A (en) * 2019-08-06 2019-11-15 齐鲁工业大学 Multi-angle multi-mode fused image description generation method and system
CN110458043A (en) * 2019-07-20 2019-11-15 中国船舶重工集团公司第七二四研究所 A kind of SAR target identification method based on transfer learning and the output of full articulamentum
CN110490095A (en) * 2019-07-31 2019-11-22 中国人民解放军战略支援部队信息工程大学 A kind of multi-modal Fusion Features Modulation Identification method and system neural network based
CN110516723A (en) * 2019-08-15 2019-11-29 天津师范大学 A kind of multi-modal ground cloud atlas recognition methods based on the fusion of depth tensor
CN110796105A (en) * 2019-11-04 2020-02-14 中国矿业大学 Remote sensing image semantic segmentation method based on multi-modal data fusion
CN111275130A (en) * 2020-02-18 2020-06-12 上海交通大学 Deep learning prediction method, system, medium and device based on multiple modes

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108805270B (en) * 2018-05-08 2021-02-12 华中科技大学 Convolutional neural network system based on memory

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105512661A (en) * 2015-11-25 2016-04-20 中国人民解放军信息工程大学 Multi-mode-characteristic-fusion-based remote-sensing image classification method
CN109614880A (en) * 2018-11-19 2019-04-12 国家电网有限公司 A kind of multi-modal biological characteristic fusion method and device
CN110458043A (en) * 2019-07-20 2019-11-15 中国船舶重工集团公司第七二四研究所 A kind of SAR target identification method based on transfer learning and the output of full articulamentum
CN110490095A (en) * 2019-07-31 2019-11-22 中国人民解放军战略支援部队信息工程大学 A kind of multi-modal Fusion Features Modulation Identification method and system neural network based
CN110458282A (en) * 2019-08-06 2019-11-15 齐鲁工业大学 Multi-angle multi-mode fused image description generation method and system
CN110516723A (en) * 2019-08-15 2019-11-29 天津师范大学 A kind of multi-modal ground cloud atlas recognition methods based on the fusion of depth tensor
CN110796105A (en) * 2019-11-04 2020-02-14 中国矿业大学 Remote sensing image semantic segmentation method based on multi-modal data fusion
CN111275130A (en) * 2020-02-18 2020-06-12 上海交通大学 Deep learning prediction method, system, medium and device based on multiple modes

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
《Multi-modal deep learning for landform recognition》;Lin Du等;《ISPRS Journal of Photogrammetry and Remote Sensing》;20191009;第158卷;63-75 *

Also Published As

Publication number Publication date
CN112163374A (en) 2021-01-01

Similar Documents

Publication Publication Date Title
CN111191736B (en) Hyperspectral image classification method based on depth feature cross fusion
CN108920720B (en) Large-scale image retrieval method based on depth hash and GPU acceleration
CN108052911B (en) Deep learning-based multi-mode remote sensing image high-level feature fusion classification method
CN108009559B (en) Hyperspectral data classification method based on space-spectrum combined information
CN112862774B (en) Accurate segmentation method for remote sensing image building
CN112927253B (en) Rock core FIB-SEM image segmentation method based on convolutional neural network
Younis et al. Detection and annotation of plant organs from digitised herbarium scans using deep learning
CN110245683B (en) Residual error relation network construction method for less-sample target identification and application
CN112199886B (en) Processing method of PRB data deep learning geological map prediction model
CN113469119B (en) Cervical cell image classification method based on visual converter and image convolution network
CN113435253A (en) Multi-source image combined urban area ground surface coverage classification method
CN111652273A (en) Deep learning-based RGB-D image classification method
CN112101153A (en) Remote sensing target detection method based on receptive field module and multiple characteristic pyramid
CN114821340A (en) Land utilization classification method and system
CN113936214A (en) Karst wetland vegetation community classification method based on fusion of aerospace remote sensing images
CN114331122A (en) Key person risk level assessment method and related equipment
Zhang et al. SaltISCG: Interactive salt segmentation method based on CNN and graph cut
CN113313669B (en) Method for enhancing semantic features of top layer of surface defect image of subway tunnel
CN112163374B (en) Processing method for multi-modal data intermediate layer fusion full-connection geological map prediction model
CN113989287A (en) Urban road remote sensing image segmentation method and device, electronic equipment and storage medium
CN116824133A (en) Intelligent interpretation method for remote sensing image
CN115330703A (en) Remote sensing image cloud and cloud shadow detection method based on context information fusion
CN111461130B (en) High-precision image semantic segmentation algorithm model and segmentation method
CN115294346A (en) Gesture image recognition method for improving Incep-v 3 network
CN113592013A (en) Three-dimensional point cloud classification method based on graph attention network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20230717

Address after: 100037 Beijing City, Xicheng District Fuwai Street No. 45

Applicant after: DEVELOPMENT & RESEARCH CENTER OF CHINA GEOLOGICAL SURVEY BUREAU

Applicant after: Command center of natural resources comprehensive survey of China Geological Survey

Address before: 100037 Beijing City, Xicheng District Fuwai Street No. 45

Applicant before: DEVELOPMENT & RESEARCH CENTER OF CHINA GEOLOGICAL SURVEY BUREAU

TA01 Transfer of patent application right
CI02 Correction of invention patent application

Correction item: Applicant|Address|Applicant

Correct: Command center of natural resources comprehensive survey of China Geological Survey|100055 No.55, Honglian South Road, Xicheng District, Beijing

False: DEVELOPMENT & RESEARCH CENTER OF CHINA GEOLOGICAL SURVEY BUREAU|100037 Beijing City, Xicheng District Fuwai Street No. 45|China Geological Survey Natural Resources Comprehensive Survey Command Center

Number: 30-02

Volume: 39

CI02 Correction of invention patent application
GR01 Patent grant
GR01 Patent grant