CN116958752B - Power grid infrastructure archiving method, device and equipment based on IPKCNN-SVM - Google Patents

Info

Publication number
CN116958752B
CN116958752B (application CN202311212335.5A)
Authority
CN
China
Prior art keywords
layer
firefly
svm
training
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202311212335.5A
Other languages
Chinese (zh)
Other versions
CN116958752A (en)
Inventor
陈然
贺兰菲
蔡杰
周蠡
李智威
许汉平
柯方超
周英博
熊川羽
马莉
张赵阳
熊一
王巍
李吕满
舒思睿
何峰
饶曦
李晶晶
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hubei Keneng Electric Power Electronics Co ltd
Economic and Technological Research Institute of State Grid Hubei Electric Power Co Ltd
Original Assignee
Hubei Keneng Electric Power Electronics Co ltd
Economic and Technological Research Institute of State Grid Hubei Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hubei Keneng Electric Power Electronics Co ltd, Economic and Technological Research Institute of State Grid Hubei Electric Power Co Ltd filed Critical Hubei Keneng Electric Power Electronics Co ltd
Priority to CN202311212335.5A priority Critical patent/CN116958752B/en
Publication of CN116958752A publication Critical patent/CN116958752A/en
Application granted granted Critical
Publication of CN116958752B publication Critical patent/CN116958752B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/355Class or cluster creation or modification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/211Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/70Labelling scene content, e.g. deriving syntactic or semantic representations
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y04INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
    • Y04SSYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
    • Y04S10/00Systems supporting electrical power generation, transmission or distribution
    • Y04S10/50Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications

Abstract

The application provides a power grid infrastructure archiving method based on IPKCNN-SVM. First, text data are annotated for each image in a pre-training set from the images, dependency syntactic analysis is performed on each text datum to obtain sentence dependency relationships, and the sentence dependency relationships are learned through a self-attention mechanism to obtain an attention output, which is loaded into a CNN network as prior knowledge for pre-training. Each image in the training set is then input into the CNN network: image features are extracted through the input layer and propagated forward through a plurality of convolution layers; after forward propagation, backward propagation starts from the classifier of the last convolution layer, updating the parameters of each convolution layer. Finally, each image in the test set is input into the trained CNN network to complete the classified archiving of the power grid infrastructure buildings. The model provided by this design has a short training time and high classification accuracy.

Description

Power grid infrastructure archiving method, device and equipment based on IPKCNN-SVM
Technical Field
The application belongs to the technical field of electronic archiving, and particularly relates to an IPKCNN-SVM-based power grid infrastructure archiving method, device and equipment.
Background
As an important component supporting the development of power grid infrastructure, the management of grid infrastructure files has drawn wide attention. However, grid infrastructure files are complex and varied in type, and their number is increasing drastically, which raises the difficulty of managing them. Facing the impact of the digital transformation of grid infrastructure files, exploring a precise identification and classification method to ensure the archiving quality of infrastructure files is of great significance for promoting the development of power grid infrastructure.
In the prior art, research on automatic electronic archive classification systems based on text feature recognition proposes an intelligent recognition system built on text features and uses it to identify and classify electronic archives. However, such methods and strategies can only handle small-scale archives with low-dimensional features; when the number of archives grows and feature complexity rises sharply, they show obvious limitations: training time becomes long and classification accuracy is low.
Disclosure of Invention
The application aims to solve the problems in the prior art by providing an IPKCNN-SVM-based power grid infrastructure archiving method, device and equipment capable of shortening training time and improving classification accuracy.
In order to achieve the above object, the technical scheme of the present application is as follows:
an IPKCNN-SVM based power grid infrastructure archiving method, the method comprising:
s1, dividing the obtained multiple power grid infrastructure building images into a pre-training set, a training set and a testing set, and marking building categories in the images;
s2, marking text data of each image in the pre-training set through the images, performing dependency syntactic analysis on each text data to obtain sentence dependency relationship, then learning the sentence dependency relationship through a self-attention mechanism to obtain attention output, and loading the attention output into a CNN network as priori knowledge to perform pre-training;
s3, inputting each image in a training set into a CNN network for joint training, wherein the joint training specifically comprises the steps of firstly extracting image features through an input layer, then forward propagating the extracted image features through a plurality of convolution layers, and after the forward propagation is completed, starting to reversely propagate from a classifier of a last convolution layer, and updating parameters of each convolution layer;
s4, inputting each image in the test set into a trained CNN network to finish classified archiving of the power grid infrastructure.
The step S2 includes:
s21, marking text data of each image in the pre-training set through the images, then giving a dependency relationship label, and coding sentence dependency relationship in each text data by using a Stanford NLP syntax analyzer to obtain dependency relationship codes:
r = {r_1, r_2, …, r_h, …, r_m};
in the above formula, r is the dependency relationship code, r_h is the h-th dependency relationship, h = 1, 2, …, m, and m is the total number of dependency relationships;
s22, encoding and inputting the dependency relationship into a converter to obtain a characteristic representation:
s_i = tanh(w_i · s_{i-1} + b_i);
S = {s_1, s_2, …, s_i, …, s_n};
in the above formulas, S is the characteristic representation obtained after the dependency relationship code r is input into the converter; s_i and s_{i-1} are the characteristic representations output by the i-th and (i-1)-th fully connected layers respectively; w_i and b_i are the weight and bias of the i-th fully connected layer, i = 1, 2, …, n; s_1 is obtained by inputting the dependency relationship code r into the first fully connected layer; n is the total number of fully connected layers in the converter; the weight and bias of each fully connected layer differ; the number of neurons in each fully connected layer is the same as the number of dependency relationships; and tanh is the activation function;
s23, learning sentence dependency relationship through a self-attention mechanism to obtain attention output:
a_i = exp(s_i^T · s_w) / Σ_{j=1}^{n} exp(s_j^T · s_w);
D = Σ_{i=1}^{n} a_i · s_i;
in the above formulas, D is the attention output, a_i is the attention distribution weight of the i-th feature representation, s_i^T is the transpose of s_i, and s_w represents a context vector;
s24, the attention output is used as a feature vector to be input into a CNN network for pretraining.
In the joint training of step S3, the extracted image features are propagated forward through a plurality of convolution layers according to the following formula:
x_j^k = f( Σ_{p∈M_p} x_p^{k-1} * w^k + b^k );
in the above formula, x_j^k is the j-th output feature of the k-th convolution layer; x_p^{k-1} is the p-th input feature of the (k-1)-th convolution layer, p ∈ M_p, where M_p is the image feature set; w^k is the weight of the k-th convolution layer; b^k is the bias term of the k-th convolution layer; and f is the activation function;
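As an illustration of this forward pass, a short sketch with 1-D features is given below; the feature lengths, the kernel size, and the tanh activation are assumptions made only for the sketch:

```python
import numpy as np

def conv_layer_forward(x_prev, w, b, f=np.tanh):
    """One convolution layer: x^k = f(sum over p in M_p of x_p^{k-1} * w^k + b^k),
    with * taken as valid-mode 1-D convolution over all input features."""
    z = sum(np.convolve(x_p, w, mode="valid") for x_p in x_prev) + b
    return f(z)

rng = np.random.default_rng(1)
features = [rng.standard_normal(16) for _ in range(3)]  # M_p: three input features
w = 0.5 * rng.standard_normal(3)                        # assumed kernel of length 3
x_out = conv_layer_forward(features, w, b=0.1)          # output feature of the layer
```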
after forward propagation is completed, backward propagation starts from the classifier of the last convolution layer and the weight of each convolution layer is updated; the weight update formula of the last convolution layer is:
W' = W − η · E_FC · x_a;
wherein W' is the updated weight of the last convolution layer, W is its weight before updating, E_FC is the error between the output and the input of the classifier of the last convolution layer, η is the learning rate, and x_a is the output of the last convolution layer;
the weight update formula of the remaining convolution layers is:
W'_b = W_b − η · Σ_{l=1}^{L} (∂E_l/∂α_l) · α_l;
in the above formula, W'_b is the updated weight of convolution layer b, W_b is the weight of convolution layer b before updating, L is the number of convolution layers located after convolution layer b, ∂E_l/∂α_l is the partial derivative of the error E_l between the output and the input of the classifier of the l-th convolution layer located after convolution layer b, and α_l is the activation value generated by the activation function of the l-th convolution layer located after convolution layer b.
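The two update rules can be read as the following sketch over scalar weights; treating the per-layer classifier errors E_l and activation values α_l as given numbers, and reusing the learning rate η for the inner layers, are assumptions made purely for illustration:

```python
def update_last_layer(W, E_fc, eta, x_a):
    """Last convolution layer: W' = W - eta * E_FC * x_a."""
    return W - eta * E_fc * x_a

def update_inner_layer(W_b, eta, dE_dalpha, alpha):
    """Inner convolution layer b: W'_b = W_b - eta * sum_l (dE_l/dalpha_l) * alpha_l,
    summed over the L convolution-layer classifiers located after layer b."""
    return W_b - eta * sum(g * a for g, a in zip(dE_dalpha, alpha))

eta = 0.01                                   # assumed learning rate
W_last = update_last_layer(W=0.8, E_fc=0.12, eta=eta, x_a=0.5)
W_b = update_inner_layer(W_b=0.3, eta=eta,
                         dE_dalpha=[0.2, -0.05],  # dE_l/dalpha_l for L = 2 later layers
                         alpha=[0.7, 0.4])        # activation values alpha_l
```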
The classifier of the CNN network is an SVM classifier, and after the parameters of each convolution layer are updated, the parameters of the SVM classifier are optimized with a firefly algorithm improved by teaching-and-learning.
Optimizing the parameters of the SVM classifier with the teaching-and-learning improved firefly algorithm proceeds as follows:
setting the search space of the firefly algorithm to the value range of the SVM classifier parameters, initializing the positions, brightness and attraction of the fireflies, continuously optimizing the SVM parameters with the classification performance of the SVM classifier as firefly brightness, and updating the firefly positions by the following formulas iteratively until the brightness converges:
X(t+1) = X_fa(t+1) + ΔX_tlbo(t+1);
X_fa(t+1) = X_u(t) + β_uv(r_uv) · (X_v(t) − X_u(t)) + α_t · ε;
β_uv(r_uv) = β_0 · exp(−γ · r_uv^2);
ΔX_tlbo(t+1) = ΔX_1(t+1) + ΔX_2(t+1);
in the above formulas, X(t+1) is the final firefly position under the firefly algorithm improved with teaching-and-learning factors; X_fa(t+1) is the position after the firefly update; ΔX_tlbo(t+1) is the individual moving direction of the firefly at time t+1, obtained from the difference between firefly u and the average firefly position and the difference between firefly v and the average firefly position; ΔX_1(t+1) and ΔX_2(t+1) are, at time t+1, the difference between firefly u and the average firefly position and the difference between firefly v and the average firefly position; X_avg(t) is the average firefly position at time t; T_F1 and T_F2 are teaching factors taking 1 or 2 at random; t is the current iteration number; X_u(t) and X_v(t) are the spatial positions of fireflies u and v at time t; β_uv(r_uv) is the attraction between firefly u and firefly v; α_t is a constant, generally taken in [0, 1]; ε is a random vector; β_0 is the maximum attraction, generally taken as 1; γ is the light absorption coefficient, generally taken in [0.01, 100]; and r_uv is the Euclidean distance between firefly u and firefly v.
The parameters of the SVM classifier are its penalty factor ρ and width parameter σ.
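A compact sketch of this optimization over the two parameters follows; the brightness function is a stand-in for the SVM classifier's classification performance (e.g., cross-validated accuracy), and the population size, iteration count, parameter bounds, and the α_t, β_0, γ settings are assumptions within the ranges quoted above:

```python
import numpy as np

rng = np.random.default_rng(2)
bounds = np.array([[0.1, 100.0],    # assumed range of the penalty factor rho
                   [0.01, 10.0]])   # assumed range of the width parameter sigma

def brightness(pos):
    """Stand-in objective; in the method this is the SVM classifier's
    classification performance for (rho, sigma) = pos."""
    rho, sigma = pos
    return -((np.log(rho) - 1.0) ** 2 + (np.log(sigma) + 1.0) ** 2)

n_fireflies, n_iter = 12, 50
beta0, gamma, alpha_t = 1.0, 1.0, 0.2    # beta_0 = 1, gamma in [0.01, 100], alpha_t in [0, 1]
X = rng.uniform(bounds[:, 0], bounds[:, 1], size=(n_fireflies, 2))

for t in range(n_iter):
    I = np.array([brightness(x) for x in X])
    X_avg = X.mean(axis=0)               # average firefly position
    for u in range(n_fireflies):
        for v in range(n_fireflies):
            if I[v] <= I[u]:
                continue                 # firefly u moves only toward brighter v
            r_uv = np.linalg.norm(X[u] - X[v])
            beta = beta0 * np.exp(-gamma * r_uv ** 2)
            X_fa = X[u] + beta * (X[v] - X[u]) + alpha_t * rng.standard_normal(2)
            # teaching-and-learning correction built from the differences between
            # fireflies u, v and the average position (teaching factors 1 or 2):
            TF1, TF2 = rng.integers(1, 3, size=2)
            dX1 = rng.random() * (X[u] - TF1 * X_avg)
            dX2 = rng.random() * (X[v] - TF2 * X_avg)
            X[u] = np.clip(X_fa + dX1 + dX2, bounds[:, 0], bounds[:, 1])

rho_opt, sigma_opt = X[np.argmax([brightness(x) for x in X])]
```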
An IPKCNN-SVM-based power grid infrastructure archiving device comprises an image preprocessing module, a CNN network parameter training module and a classification module;
the image preprocessing module is used for dividing the obtained multiple power grid infrastructure building images into a pre-training set, a training set and a testing set, and marking building categories in the images;
the CNN network parameter training module is used for annotating text data for each image in the pre-training set from the images, performing dependency syntactic analysis on each text datum to obtain sentence dependency relationships, learning the sentence dependency relationships through a self-attention mechanism to obtain an attention output, loading the attention output into a CNN network for pre-training, then inputting each image in the training set into the CNN network, extracting image features through the input layer, propagating the extracted image features forward through a plurality of convolution layers, and, after forward propagation is completed, starting backward propagation from the classifier of the last convolution layer and updating the parameters of each convolution layer;
and the classification module is used for inputting each image in the test set into the trained CNN network to complete the classified archiving of the power grid infrastructure buildings.
The archiving device further comprises an SVM parameter optimizing module;
the SVM parameter optimizing module is used for optimizing the parameters of the SVM classifier with the teaching-and-learning improved firefly algorithm after the parameters of each convolution layer are updated.
An IPKCNN-SVM-based power grid infrastructure archiving device, the archiving device comprising a memory and a processor;
the memory is used for storing computer program codes and transmitting the computer program codes to the processor;
the processor is configured to perform the archiving method described above in accordance with instructions in the computer program code.
A computer readable storage medium having stored thereon a computer program which when executed by a processor implements the archiving method described above.
Compared with the prior art, the application has the beneficial effects that:
1. The application relates to an IPKCNN-SVM-based power grid infrastructure archiving method. The obtained plurality of power grid infrastructure building images is first divided into a pre-training set, a training set and a test set, and the building categories in the images are marked. Text data are annotated for each image in the pre-training set from the images, dependency syntactic analysis is performed on each text datum to obtain sentence dependency relationships, the sentence dependency relationships are learned through a self-attention mechanism to obtain an attention output, and the attention output is loaded into a CNN network as prior knowledge for pre-training. Each image in the training set is then input into the CNN network for joint training: image features are extracted through the input layer, the extracted image features are propagated forward through a plurality of convolution layers, backward propagation starts from the classifier of the last convolution layer, and the parameters of each convolution layer are updated. Finally, each image in the test set is input into the trained CNN network to complete the classified archiving of the power grid infrastructure buildings. On the one hand, this design learns the dependency relationships between words through a self-attention mechanism and fuses the resulting attention output into the convolutional neural network model as prior knowledge participating in pre-training; with this prior knowledge the model can better understand and capture the latent patterns and associated information in the text data, identify and extract relevant features more accurately, avoid time-consuming fitting of irrelevant features, and improve its convergence speed and classification accuracy when processing large amounts of text and data. On the other hand, a training mode combining forward propagation and backward propagation is introduced into the convolutional neural network, which effectively raises the training speed. The method provided by the application therefore has a short model training time and high classification accuracy.
2. The application relates to an IPKCNN-SVM-based power grid infrastructure archiving method which adopts an SVM classifier as the classifier of the CNN network and optimizes the parameters of the SVM classifier with a teaching-and-learning improved firefly algorithm. The firefly algorithm is a global optimization algorithm, and introducing teaching factors strengthens its global search capability during optimization, so that it can better search for the optimal solution in the SVM classifier parameter space, better optimize the SVM classifier parameters, and improve the model's ability to distinguish between categories. The application can therefore further improve the classification accuracy of the model.
Drawings
FIG. 1 is a flow chart of the archiving method according to the present application.
Fig. 2 shows the classification accuracy test results of the three models on power grid infrastructure building images with different signal-to-noise ratios.
Fig. 3 is a block diagram of the archiving device according to the present application.
Fig. 4 is a block diagram of the archiving equipment according to the present application.
Detailed Description
The present application will be described in further detail with reference to the following detailed description and the accompanying drawings.
Example 1:
referring to fig. 1, an IPKCNN-SVM-based power grid infrastructure archiving method is sequentially performed according to the following steps:
1. dividing the obtained multiple power grid infrastructure building images into a pre-training set, a training set and a testing set, and marking building categories in the images;
2. First, text data are annotated for each image in the pre-training set from the images and dependency relationship labels are assigned; the sentence dependency relationships in the text data are encoded with the Stanford NLP syntactic parser to obtain the dependency relationship code:
r = {r_1, r_2, …, r_h, …, r_m};
in the above formula, r is the dependency relationship code, r_h is the h-th dependency relationship, h = 1, 2, …, m, and m is the total number of dependency relationships;
then the dependency relationship code is input into the converter to obtain a characteristic representation:
s_i = tanh(w_i · s_{i-1} + b_i);
S = {s_1, s_2, …, s_i, …, s_n};
in the above formulas, S is the characteristic representation obtained after the dependency relationship code r is input into the converter; s_i and s_{i-1} are the characteristic representations output by the i-th and (i-1)-th fully connected layers respectively; w_i and b_i are the weight and bias of the i-th fully connected layer, i = 1, 2, …, n; s_1 is obtained by inputting the dependency relationship code r into the first fully connected layer; n is the total number of fully connected layers in the converter; the weight and bias of each fully connected layer differ; the number of neurons in each fully connected layer is the same as the number of dependency relationships; and tanh is the activation function;
the sentence dependency relationships are learned through a self-attention mechanism to obtain the attention output:
a_i = exp(s_i^T · s_w) / Σ_{j=1}^{n} exp(s_j^T · s_w);
D = Σ_{i=1}^{n} a_i · s_i;
in the above formulas, D is the attention output, a_i is the attention distribution weight of the i-th feature representation, s_i^T is the transpose of s_i, and s_w represents a context vector;
finally, the attention output of each image is input into the CNN network as a feature vector for pre-training;
3. Inputting each image in the training set into a CNN (convolutional neural network) for joint training, wherein the structure of the CNN is:
input layer → convolution layer 1 → pooling layer 1 → classifier 1 → convolution layer 2 → pooling layer 2 → classifier 2 → … → convolution layer a-1 → pooling layer a-1 → classifier a-1 → convolution layer a → pooling layer a → fully connected layer → classifier a;
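For readability, the chain above can be spelled out programmatically; the stage count a = 3 printed by this sketch is an assumed example value:

```python
def build_structure(a=3):
    """Spell out the interleaved chain: each of stages 1 .. a-1 contributes
    conv -> pool -> classifier, and stage a adds the fully connected layer."""
    layers = ["input layer"]
    for k in range(1, a):
        layers += [f"convolution layer {k}", f"pooling layer {k}", f"classifier {k}"]
    layers += [f"convolution layer {a}", f"pooling layer {a}",
               "fully connected layer", f"classifier {a}"]
    return " -> ".join(layers)

print(build_structure())
```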
after the feature vector of the text data of each image is input into the input layer of the CNN network, it is propagated forward through a plurality of convolution layers according to the following formula:
x_j^k = f( Σ_{p∈M_p} x_p^{k-1} * w^k + b^k );
in the above formula, x_j^k is the j-th output feature of the k-th convolution layer; x_p^{k-1} is the p-th input feature of the (k-1)-th convolution layer, p ∈ M_p, where M_p is the image feature set; w^k is the weight of the k-th convolution layer; b^k is the bias term of the k-th convolution layer; and f is the activation function;
4. After forward propagation is completed, backward propagation starts from the classifier of the last convolution layer and the weight of each convolution layer is updated; the weight update formula of the last convolution layer is:
W' = W − η · E_FC · x_a;
wherein W' is the updated weight of the last convolution layer, W is its weight before updating, E_FC is the error between the output and the input of the classifier of the last convolution layer, η is the learning rate, and x_a is the output of the last convolution layer;
the weight update formula of the remaining convolution layers is:
W'_b = W_b − η · Σ_{l=1}^{L} (∂E_l/∂α_l) · α_l;
in the above formula, W'_b is the updated weight of convolution layer b, W_b is the weight of convolution layer b before updating, L is the number of convolution layers located after convolution layer b, ∂E_l/∂α_l is the partial derivative of the error E_l between the output and the input of the classifier of the l-th convolution layer located after convolution layer b, and α_l is the activation value generated by the activation function of the l-th convolution layer located after convolution layer b;
5. The classifier adopted by the CNN network is replaced with an SVM classifier, the SVM classifier is trained with the output data of the fully connected layer, and the penalty factor ρ and width parameter σ of the SVM classifier are optimized with the teaching-and-learning improved firefly algorithm to obtain the trained IPKCNN-SVM model; the optimizing process is:
setting the search space of the firefly algorithm to the value range of the penalty factor ρ or the width parameter σ, initializing the positions, brightness and attraction of the fireflies, continuously optimizing the parameters with the classification performance of the SVM classifier as firefly brightness, and updating the firefly positions by the following formulas iteratively until the brightness converges:
X(t+1) = X_fa(t+1) + ΔX_tlbo(t+1);
X_fa(t+1) = X_u(t) + β_uv(r_uv) · (X_v(t) − X_u(t)) + α_t · ε;
β_uv(r_uv) = β_0 · exp(−γ · r_uv^2);
ΔX_tlbo(t+1) = ΔX_1(t+1) + ΔX_2(t+1);
in the above formulas, X(t+1) is the final firefly position under the firefly algorithm improved with teaching-and-learning factors; X_fa(t+1) is the position after the firefly update; ΔX_tlbo(t+1) is the individual moving direction of the firefly at time t+1, obtained from the difference between firefly u and the average firefly position and the difference between firefly v and the average firefly position; ΔX_1(t+1) and ΔX_2(t+1) are, at time t+1, the difference between firefly u and the average firefly position and the difference between firefly v and the average firefly position; X_avg(t) is the average firefly position at time t; T_F1 and T_F2 are teaching factors taking 1 or 2 at random; t is the current iteration number; X_u(t) and X_v(t) are the spatial positions of fireflies u and v at time t; β_uv(r_uv) is the attraction between firefly u and firefly v; α_t is a constant, generally taken in [0, 1]; ε is a random vector; β_0 is the maximum attraction, generally taken as 1; γ is the light absorption coefficient, generally taken in [0.01, 100]; and r_uv is the Euclidean distance between firefly u and firefly v;
6. Inputting each image in the test set into the trained IPKCNN-SVM model to complete the classified archiving of the power grid infrastructure buildings.
Performance test:
1. A plurality of power grid infrastructure building images obtained from a power grid infrastructure region was input into the IPKCNN-SVM model of the application, a CNN model and a CNN-SVM model for training and classification regression, and the training duration and classification accuracy of the three models were tested; the test results are shown in Table 1:
table 1 training duration test results for three models
As can be seen from Table 1, the training time of the CNN model is 92.8703 s, its test time is 0.477 s, and its accuracy is 91.11%; the CNN-SVM model is faster by 0.2563 s in training time and by 0.401 s in test time, showing that combining the SVM with the CNN improves the classification and recognition efficiency of the model, and its accuracy is 4.45% higher; compared with the CNN-SVM model, the IPKCNN-SVM model provided by the application reduces training time and test time by 29.3% and 23.68% respectively and improves accuracy by 1.66%, which shows that the training and classification time is significantly shortened while the accuracy is effectively improved.
2. Power grid infrastructure building images with different signal-to-noise ratios were input into the IPKCNN-SVM model of the application, the CNN model and the CNN-SVM model for training and classification regression, and the classification accuracy of the three models was tested; the test results are shown in Fig. 2;
As can be seen from Fig. 2, under the strong interference of 1 dB Gaussian white noise, the accuracy of the IPKCNN-SVM model of the application reaches 82.22%, whereas the accuracies of the CNN-SVM and CNN models are only 80% and 75%; as the noise interference gradually weakens, the accuracy of all three models improves steadily; under the small interference of 5 dB noise, the accuracy of the IPKCNN-SVM model reaches 97.18%, approaching its accuracy under noise-free conditions, while the accuracies of the CNN-SVM model and the CNN model under the small interference of 5 dB noise are 2.22% and 6.67% lower than that of the IPKCNN-SVM model.
Example 2:
Referring to fig. 3, an IPKCNN-SVM-based power grid infrastructure archiving device includes an image preprocessing module, a CNN network parameter training module, an SVM parameter optimizing module and a classification module. The image preprocessing module is used for dividing the obtained plurality of power grid infrastructure building images into a pre-training set, a training set and a test set and marking the building categories in the images; this module specifically executes step 1 of embodiment 1. The CNN network parameter training module is used for annotating text data for each image in the pre-training set from the images, performing dependency syntactic analysis on each text datum to obtain sentence dependency relationships, learning the sentence dependency relationships through a self-attention mechanism to obtain an attention output, loading the attention output into a CNN network for pre-training, then inputting each image in the training set into the CNN network, extracting image features through the input layer, propagating the extracted image features forward through a plurality of convolution layers, and, after forward propagation is completed, starting backward propagation from the classifier of the last convolution layer and updating the parameters of each convolution layer. The SVM parameter optimizing module is used for optimizing the parameters of the SVM classifier with the teaching-and-learning improved firefly algorithm, using the output data of the fully connected layer of the CNN network, after the parameters of each convolution layer are updated; this module specifically executes step 5 of embodiment 1. The classification module is used for inputting each image in the test set into the trained CNN network to complete the classified archiving of the power grid infrastructure buildings; this module specifically executes step 6 of embodiment 1.
Example 3:
Referring to fig. 4, an IPKCNN-SVM-based power grid infrastructure archiving equipment includes a memory and a processor;
the memory is used for storing computer program codes and transmitting the computer program codes to the processor; the processor is used for executing the archiving method according to the instructions in the computer program code;
a computer readable storage medium having stored thereon a computer program which when executed by a processor implements the archiving method described above.
It will be appreciated by those skilled in the art that embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein. The scheme in the embodiments of the application can be implemented in various computer languages, such as the object-oriented programming language Java and the scripting language JavaScript.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
While preferred embodiments of the present application have been described, additional variations and modifications in those embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. It is therefore intended that the following claims be interpreted as including the preferred embodiments and all such alterations and modifications as fall within the scope of the application.

Claims (6)

1. An IPKCNN-SVM-based power grid infrastructure archiving method is characterized in that:
the method comprises the following steps:
s1, dividing the obtained plurality of power grid infrastructure building images into a pre-training set, a training set and a test set, and marking the building categories in the images;
s2, annotating text data for each image in the pre-training set from the images, performing dependency syntactic analysis on each text datum to obtain sentence dependency relationships, then learning the sentence dependency relationships through a self-attention mechanism to obtain an attention output, and loading the attention output into a CNN network as prior knowledge for pre-training, specifically comprising the following steps:
s21, annotating text data for each image in the pre-training set from the images, then assigning dependency relationship labels, and encoding the sentence dependency relationships in each text datum with the Stanford NLP syntactic parser to obtain the dependency relationship code:
r = {r_1, r_2, …, r_h, …, r_m};
in the above formula, r is the dependency relationship code, r_h is the h-th dependency relationship, h = 1, 2, …, m, and m is the total number of dependency relationships;
s22, inputting the dependency relationship code into a converter to obtain a characteristic representation:
s_i = tanh(w_i · s_{i-1} + b_i);
S = {s_1, s_2, …, s_i, …, s_n};
in the above formulas, S is the characteristic representation obtained by inputting the dependency relationship code r into the converter; s_i and s_{i-1} respectively represent the characteristic representations output by the i-th and (i-1)-th fully connected layers; w_i and b_i are the weight and bias of the i-th fully connected layer, i = 1, 2, …, n; s_1 is obtained by inputting the dependency relationship code r into the first fully connected layer; n is the total number of fully connected layers in the converter; the weight and bias of each fully connected layer differ; the number of neurons in each fully connected layer is the same as the number of dependency relationships; and tanh is the activation function;
s23, learning the sentence dependency relationships through a self-attention mechanism to obtain the attention output:
a_i = exp(s_i^T · s_w) / Σ_{j=1}^{n} exp(s_j^T · s_w);
D = Σ_{i=1}^{n} a_i · s_i;
in the above formulas, D is the attention output, a_i is the attention distribution weight of the i-th feature, s_i^T is the transpose of s_i, and s_w represents a context vector;
s24, inputting the attention output as a feature vector into a CNN network for pre-training;
s3, inputting each image in the training set into the CNN network for joint training; specifically, image features are first extracted through the input layer, the extracted image features are then propagated forward through a plurality of convolution layers, and after forward propagation is completed, backward propagation starts from the classifier of the last convolution layer, updating the parameters of each convolution layer;
s4, replacing the classifier adopted by the CNN network with an SVM classifier, training the SVM classifier with the output data of the fully connected layer of the CNN network, optimizing the penalty factor ρ and width parameter σ of the SVM classifier with the teaching-and-learning improved firefly algorithm to obtain a trained IPKCNN-SVM model, and inputting each image in the test set into the trained IPKCNN-SVM model to complete the classified archiving of the power grid infrastructure buildings.
2. The IPKCNN-SVM based power grid infrastructure archiving method of claim 1, wherein:
in the joint training of step S3, the extracted image features are propagated forward through a plurality of convolution layers according to the following formula:
x_j^k = f( Σ_{p∈M_p} x_p^{k-1} * w^k + b^k );
in the above formula, x_j^k is the j-th output feature of the k-th convolution layer; x_p^{k-1} is the p-th input feature of the (k-1)-th convolution layer, p ∈ M_p, where M_p is the image feature set; w^k is the weight of the k-th convolution layer; b^k is the bias term of the k-th convolution layer; and f is the activation function;
after forward propagation is completed, backward propagation starts from the classifier of the last convolution layer and the weight of each convolution layer is updated; the weight update formula of the last convolution layer is:
W' = W − η · E_FC · x_a;
wherein W' is the updated weight of the last convolution layer, W is its weight before updating, E_FC is the error between the output and the input of the classifier of the last convolution layer, η is the learning rate, and x_a is the output of the last convolution layer;
the weight update formula of the remaining convolution layers is:
W'_b = W_b − η · Σ_{l=1}^{L} (∂E_l/∂α_l) · α_l;
in the above formula, W'_b is the updated weight of convolution layer b, W_b is the weight of convolution layer b before updating, L is the number of convolution layers located after convolution layer b, ∂E_l/∂α_l is the partial derivative of the error E_l between the output and the input of the classifier of the l-th convolution layer located after convolution layer b, and α_l is the activation value generated by the activation function of the l-th convolution layer located after convolution layer b.
3. The IPKCNN-SVM based power grid infrastructure archiving method of claim 1, wherein:
in the step S4, the optimization of the parameters of the SVM classifier by using the teaching and learning improved firefly algorithm is as follows:
setting the search space of the firefly algorithm to the value range of the SVM classifier parameters, initializing the positions, brightness and attraction of the fireflies, continuously optimizing the SVM parameters with the classification performance of the SVM classifier as firefly brightness, and updating the firefly positions by the following formulas iteratively until the brightness converges:
X(t+1) = X_fa(t+1) + ΔX_tlbo(t+1);
X_fa(t+1) = X_u(t) + β_uv(r_uv) · (X_v(t) − X_u(t)) + α_t · ε;
β_uv(r_uv) = β_0 · exp(−γ · r_uv^2);
ΔX_tlbo(t+1) = ΔX_1(t+1) + ΔX_2(t+1);
in the above formulas, X(t+1) is the final firefly position under the firefly algorithm improved with teaching-and-learning factors; X_fa(t+1) is the position after the firefly update; ΔX_tlbo(t+1) is the individual moving direction of the firefly at time t+1, obtained from the difference between firefly u and the average firefly position and the difference between firefly v and the average firefly position; ΔX_1(t+1) and ΔX_2(t+1) are, at time t+1, the difference between firefly u and the average firefly position and the difference between firefly v and the average firefly position; X_avg(t) is the average firefly position at time t; T_F1 and T_F2 are teaching factors taking 1 or 2 at random; t is the current iteration number; X_u(t) and X_v(t) are the spatial positions of fireflies u and v at time t; β_uv(r_uv) is the attraction between firefly u and firefly v; α_t is a constant, generally taken in [0, 1]; ε is a random vector; β_0 is the maximum attraction, generally taken as 1; γ is the light absorption coefficient, generally taken in [0.01, 100]; and r_uv is the Euclidean distance between firefly u and firefly v.
4. An IPKCNN-SVM-based power grid infrastructure archiving device, characterized in that:
the archiving device comprises an image preprocessing module, a CNN network parameter training module and a classification module;
the image preprocessing module is used for dividing the obtained multiple power grid infrastructure building images into a pre-training set, a training set and a testing set, and marking building categories in the images;
the CNN network parameter training module is used for annotating text data for each image in the pre-training set from the images, performing dependency syntactic analysis on each text datum to obtain sentence dependency relationships, learning the sentence dependency relationships through a self-attention mechanism to obtain an attention output, and loading the attention output into a CNN network for pre-training, specifically comprising the following steps:
s21, annotating text data for each image in the pre-training set from the images, then assigning dependency relationship labels, and encoding the sentence dependency relationships in each text datum with the Stanford NLP syntactic parser to obtain the dependency relationship code:
r = {r_1, r_2, …, r_h, …, r_m};
in the above formula, r is the dependency relationship code, r_h is the h-th dependency relationship, h = 1, 2, …, m, and m is the total number of dependency relationships;
s22, inputting the dependency relationship code into a converter to obtain a characteristic representation:
s_i = tanh(w_i · s_{i-1} + b_i);
S = {s_1, s_2, …, s_i, …, s_n};
in the above formulas, S is the characteristic representation obtained by inputting the dependency relationship code r into the converter; s_i and s_{i-1} respectively represent the characteristic representations output by the i-th and (i-1)-th fully connected layers; w_i and b_i are the weight and bias of the i-th fully connected layer, i = 1, 2, …, n; s_1 is obtained by inputting the dependency relationship code r into the first fully connected layer; n is the total number of fully connected layers in the converter; the weight and bias of each fully connected layer differ; the number of neurons in each fully connected layer is the same as the number of dependency relationships; and tanh is the activation function;
s23, learning the sentence dependency relationships through a self-attention mechanism to obtain the attention output:
a_i = exp(s_i^T · s_w) / Σ_{j=1}^{n} exp(s_j^T · s_w);
D = Σ_{i=1}^{n} a_i · s_i;
in the above formulas, D is the attention output, a_i is the attention distribution weight of the i-th feature, s_i^T is the transpose of s_i, and s_w represents a context vector;
s24, inputting the attention output as a feature vector into a CNN network for pre-training;
then inputting each image in the training set into the CNN network, extracting image features through the input layer, propagating the extracted image features forward through a plurality of convolution layers, and, after forward propagation is completed, starting backward propagation from the classifier of the last convolution layer and updating the parameters of each convolution layer;
the classification module is used for replacing the classifier adopted by the CNN network with an SVM classifier, training the SVM classifier with the output data of the fully connected layer of the CNN network, optimizing the penalty factor ρ and width parameter σ of the SVM classifier with the teaching-and-learning improved firefly algorithm to obtain a trained IPKCNN-SVM model, and inputting each image in the test set into the trained IPKCNN-SVM model to complete the classified archiving of the power grid infrastructure buildings.
5. An IPKCNN-SVM-based power grid infrastructure archiving device is characterized in that:
the archiving device includes a memory and a processor;
the memory is used for storing computer program codes and transmitting the computer program codes to the processor;
the processor is configured to perform the method of any one of claims 1 to 3 according to instructions in the computer program code.
6. A computer-readable storage medium, characterized in that: a computer program is stored on the computer-readable storage medium, and the computer program, when executed by a processor, implements the method of any one of claims 1 to 3.
CN202311212335.5A 2023-09-20 2023-09-20 Power grid infrastructure archiving method, device and equipment based on IPKCNN-SVM Active CN116958752B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311212335.5A CN116958752B (en) 2023-09-20 2023-09-20 Power grid infrastructure archiving method, device and equipment based on IPKCNN-SVM

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311212335.5A CN116958752B (en) 2023-09-20 2023-09-20 Power grid infrastructure archiving method, device and equipment based on IPKCNN-SVM

Publications (2)

Publication Number Publication Date
CN116958752A CN116958752A (en) 2023-10-27
CN116958752B true CN116958752B (en) 2023-12-15

Family

ID=88449568

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311212335.5A Active CN116958752B (en) 2023-09-20 2023-09-20 Power grid infrastructure archiving method, device and equipment based on IPKCNN-SVM

Country Status (1)

Country Link
CN (1) CN116958752B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107748895A (en) * 2017-10-29 2018-03-02 北京工业大学 UAV Landing landforms image classification method based on DCT CNN models
WO2019076227A1 (en) * 2017-10-20 2019-04-25 北京达佳互联信息技术有限公司 Human face image classification method and apparatus, and server
CN112001187A (en) * 2020-08-26 2020-11-27 重庆理工大学 Emotion classification system based on Chinese syntax and graph convolution neural network
WO2020244066A1 (en) * 2019-06-04 2020-12-10 平安科技(深圳)有限公司 Text classification method, apparatus, device, and storage medium
CN113158667A (en) * 2021-04-09 2021-07-23 杭州电子科技大学 Event detection method based on entity relationship level attention mechanism
CN114898151A (en) * 2022-05-13 2022-08-12 重庆理工大学 Image classification method based on deep learning and support vector machine fusion
CN115641473A (en) * 2022-10-21 2023-01-24 长沙理工大学 Remote sensing image classification method based on CNN-self-attention mechanism hybrid architecture
CN116363536A (en) * 2023-05-31 2023-06-30 国网湖北省电力有限公司经济技术研究院 Unmanned aerial vehicle inspection data-based power grid infrastructure equipment defect archiving method

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11205103B2 (en) * 2016-12-09 2021-12-21 The Research Foundation for the State University Semisupervised autoencoder for sentiment analysis
US20190095764A1 (en) * 2017-09-26 2019-03-28 Panton, Inc. Method and system for determining objects depicted in images

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019076227A1 (en) * 2017-10-20 2019-04-25 北京达佳互联信息技术有限公司 Human face image classification method and apparatus, and server
CN107748895A (en) * 2017-10-29 2018-03-02 北京工业大学 UAV Landing landforms image classification method based on DCT CNN models
WO2020244066A1 (en) * 2019-06-04 2020-12-10 平安科技(深圳)有限公司 Text classification method, apparatus, device, and storage medium
CN112001187A (en) * 2020-08-26 2020-11-27 重庆理工大学 Emotion classification system based on Chinese syntax and graph convolution neural network
CN113158667A (en) * 2021-04-09 2021-07-23 杭州电子科技大学 Event detection method based on entity relationship level attention mechanism
CN114898151A (en) * 2022-05-13 2022-08-12 重庆理工大学 Image classification method based on deep learning and support vector machine fusion
CN115641473A (en) * 2022-10-21 2023-01-24 长沙理工大学 Remote sensing image classification method based on CNN-self-attention mechanism hybrid architecture
CN116363536A (en) * 2023-05-31 2023-06-30 国网湖北省电力有限公司经济技术研究院 Unmanned aerial vehicle inspection data-based power grid infrastructure equipment defect archiving method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Xiakun Sun, Lan Liu, et al. Classification for Remote Sensing Data With Improved CNN-SVM Method. IEEE, 2019. *
Chi Tao, Wang Yang, Chen Ming. Hyperspectral image classification with multilayer locally-aware convolutional neural networks. Journal of Sichuan University (Natural Science Edition), 2020, (01). *

Also Published As

Publication number Publication date
CN116958752A (en) 2023-10-27

Similar Documents

Publication Publication Date Title
CN109992782B (en) Legal document named entity identification method and device and computer equipment
CN103049792B (en) Deep-neural-network distinguish pre-training
CN110750665A (en) Open set domain adaptation method and system based on entropy minimization
CN112000772B (en) Sentence-to-semantic matching method based on semantic feature cube and oriented to intelligent question and answer
CN112699247A (en) Knowledge representation learning framework based on multi-class cross entropy contrast completion coding
CN111477247A (en) GAN-based voice countermeasure sample generation method
CN114090780B (en) Prompt learning-based rapid picture classification method
CN112000770A (en) Intelligent question and answer oriented sentence-to-sentence matching method based on semantic feature map
CN111882042B (en) Neural network architecture automatic search method, system and medium for liquid state machine
CN114398976A (en) Machine reading understanding method based on BERT and gate control type attention enhancement network
CN113741886B (en) Sentence-level program repairing method and system based on graph
CN113641819A (en) Multi-task sparse sharing learning-based argument mining system and method
CN112463935B (en) Open domain dialogue generation method and system with generalized knowledge selection
CN116958752B (en) Power grid infrastructure archiving method, device and equipment based on IPKCNN-SVM
CN111414466A (en) Multi-round dialogue modeling method based on depth model fusion
CN113836903B (en) Enterprise portrait tag extraction method and device based on situation embedding and knowledge distillation
CN115840884A (en) Sample selection method, device, equipment and medium
CN113313250B (en) Neural network training method and system adopting mixed precision quantization and knowledge distillation
CN115422945A (en) Rumor detection method and system integrating emotion mining
CN115270795A (en) Small sample learning-based named entity recognition technology in environmental assessment field
CN114519104A (en) Action label labeling method and device
CN109815323B (en) Human-computer interaction training question-answer generation algorithm
CN109271494B (en) System for automatically extracting focus of Chinese question and answer sentences
Zhou et al. Short-spoken language intent classification with conditional sequence generative adversarial network
CN116991391B (en) Code generation and deficiency supplementing method based on large language model

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant