CN111597350A - Rail transit event knowledge graph construction method based on deep learning - Google Patents


Info

Publication number
CN111597350A
CN111597350A (application CN202010365826.3A)
Authority
CN
China
Prior art keywords
event
events
template
rail transit
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010365826.3A
Other languages
Chinese (zh)
Other versions
CN111597350B (en)
Inventor
黑新宏
彭伟
朱磊
赵钦
王一川
姬文江
姚燕妮
焦瑞
董林靖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xian University of Technology
Original Assignee
Xian University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xian University of Technology filed Critical Xian University of Technology
Priority to CN202010365826.3A priority Critical patent/CN111597350B/en
Publication of CN111597350A publication Critical patent/CN111597350A/en
Application granted granted Critical
Publication of CN111597350B publication Critical patent/CN111597350B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/247Thesauruses; Synonyms
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method for constructing a knowledge graph of rail transit events based on deep learning. Training data for an event recognition model are constructed by dictionary matching combined with manual labeling; a standard-event recognition model is trained with a BERT-BiLSTM-CRF algorithm to automatically extract standard-entry events from rail transit design standard texts; the events output by the event recognition model are unified with a word2vec model, cosine-similarity clustering and a logistic regression binary classification model; training data for an event relation model are constructed with a snowball algorithm; and a relation recognition model is trained with a BERT-BiLSTM-ATTENTION-SOFTMAX algorithm to automatically extract the relations between events. The method improves the informatization of rail transit construction design engineering and reduces the workload of graph construction.

Description

Rail transit event knowledge graph construction method based on deep learning
Technical Field
The invention belongs to the field of artificial intelligence, and particularly relates to a method for constructing a knowledge graph of rail transit events based on deep learning.
Background
With the rapid development of internet technology, many industries have been deeply integrated with emerging artificial intelligence techniques and have obtained remarkable results. Urban rail transit, as a hallmark of urban modernization, plays an important role in promoting urban economic development. Rail transit construction is complex engineering, characterized by large scale, long construction periods and huge investment. The early design and planning stage of a rail transit construction project is the foundation of the later engineering; only with complete early design planning can the later construction be guaranteed. However, in the design and planning stage, the referenced design standards are of many kinds, the amount of information in each standard entry is huge, and the degree of informatization of the whole rail transit construction engineering is low, which makes it difficult to query the content of a given standard. The design stage also places extremely high demands on the professional ability of designers, making design work very challenging. Therefore, a knowledge graph is needed to represent rail transit design specification knowledge and to promote the informatization of rail transit construction engineering.
At present, most knowledge graphs are entity knowledge graphs with entities at their core, but entity information is separated from its specific context, so its semantic information is one-sided. Compared with an entity, an event can express semantic information more clearly, and most specification entries of rail transit design standards are expressed as events. The design specifications are therefore expressed here in the form of an event knowledge graph. Most traditional knowledge graph construction methods have a low degree of automation and are time-consuming and labor-intensive; the method for constructing a knowledge graph of rail transit events based on deep learning proposed here improves the degree of automation and reduces the workload.
Disclosure of Invention
The invention aims to provide a method for constructing a knowledge graph of rail transit events based on deep learning. Expressing the specifications through an event knowledge graph makes the expressed content richer and semantically more accurate, and deep learning addresses the low automation, time consumption and labor cost of traditional graph construction techniques.
The technical scheme adopted by the invention is as follows: training data for a rail transit event recognition model are constructed by event-trigger dictionary matching combined with manual labeling; a standard-event recognition model is trained with a BERT-BiLSTM-CRF algorithm to automatically extract standard-entry events from rail transit design standard texts; the events output by the event recognition model are unified with a word2vec model, cosine-similarity clustering and a logistic regression binary classification model; training data for an event relation model are constructed with a snowball algorithm; and a relation recognition model is trained with a BERT-BiLSTM-ATTENTION-SOFTMAX algorithm to automatically extract the relations between events, forming a rail transit event knowledge graph. The event knowledge graph construction process comprises the following steps:
Step 1, construct training data for the event recognition model from the original text by event-trigger dictionary matching and manual labeling.
Step 2, extract a training set from the rail transit design standard events and preprocess it; divide the texts in the training set by standard entry and label them with parts of speech.
Step 3, train the rail transit design specification event recognition model on the text processed in step 2 with the BERT-BiLSTM-CRF algorithm.
Step 4, construct event relation training data from the original text with the snowball algorithm.
Step 5, extract a training set from the rail transit design specification event relations generated in step 4 and preprocess it; divide the texts in the training set into event pairs.
Step 6, train the relation recognition model on the text processed in step 5 with the BERT-BiLSTM-ATTENTION-SOFTMAX algorithm.
Step 7, preprocess the rail transit design specification and divide it into entries according to the specification.
Step 8, input the rail transit standard text preprocessed in step 7 into the event recognition model generated in step 3 and extract the events in the standard, each consisting of an event trigger word and event elements.
Step 9, unify the events identified in step 8.
Step 10, store the events unified in step 9 in an event database.
Step 11, store the events unified in step 9 in a database as "event element-relation-event trigger" triples.
Step 12, take events from the event database generated in step 10, form event pairs, input them into the event relation recognition model generated in step 6, and extract the relations between events in the specification.
Step 13, store the event pairs formed in step 12 and the event relations extracted in step 12 in a database as "event trigger-relation-event trigger" triples.
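The extraction stage above (steps 7 to 13) can be sketched as a simple orchestration. This is an illustrative outline, not the patent's implementation; `build_event_graph` and the three injected model functions are hypothetical names standing in for the trained models:

```python
# Hypothetical orchestration sketch of steps 7-13 (all names are illustrative).
def build_event_graph(spec_text, recognize_events, unify_events, recognize_relation):
    """Run entry splitting (step 7), event extraction (step 8),
    unification (step 9) and relation linking (steps 12-13)."""
    # Step 7: split the specification into entries (one entry per line here).
    entries = [e.strip() for e in spec_text.split("\n") if e.strip()]
    # Step 8: the event recognition model extracts events from each entry.
    events = [ev for entry in entries for ev in recognize_events(entry)]
    # Step 9: merge events that refer to the same thing.
    events = unify_events(events)
    # Steps 12-13: pair events and keep the relations the relation model finds.
    triples = []
    for i, e1 in enumerate(events):
        for e2 in events[i + 1:]:
            rel = recognize_relation(e1, e2)
            if rel is not None:
                triples.append((e1["trigger"], rel, e2["trigger"]))
    return events, triples
```

The model functions would be the trained BERT-BiLSTM-CRF and BERT-BiLSTM-ATTENTION-SOFTMAX models; any callables with the same interface can be substituted for testing.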
In step 1, an event consists of an event trigger word and event elements. Because most event trigger words are fixed words, manual labeling is accelerated by dictionary matching when constructing the model training data; the dictionary can be expanded by means of a synonym forest.
In step 3, the BERT-BiLSTM-CRF algorithm is used to train the event recognition model. The whole model consists of three parts: a BERT layer, a BiLSTM layer and a CRF layer. The BERT pre-training model is used to obtain word vectors containing the contextual feature information of the specifications; the BiLSTM layer performs feature extraction, exploiting the sequence information of the whole text; and the CRF layer learns the constraint conditions of the sentence and filters out invalid prediction sequences.
In step 4, a semi-supervised snowball algorithm is used to construct the training set for the event relation recognition model. The snowball algorithm comprises the following specific steps:
Step 4.1, manually mark a small number of event relations and add each of them to an event relation table.
Step 4.2, use the existing event relation table to match, in the original text, the original sentences containing the events in the table, and generate templates. Each template is a five-tuple of the form <left>, event 1 type, <middle>, event 2 type, <right>, where len is a freely settable length, <left> is the vector representation of the len words to the left of event 1, <middle> is the vector representation of the words between event 1 and event 2, and <right> is the vector representation of the len words to the right of event 2. The event 1 type and the event 2 type are, for example, numerical-definition events.
Step 4.3, cluster the generated templates: templates whose similarity exceeds the threshold 0.7 are clustered into one class, a new template is generated by averaging, and the new template is added to a rule base that stores the templates. From step 4.2 the template format can be written as
P = <l, E1, m, E2, r>
where E1 and E2 denote the event 1 type and the event 2 type of template P, l is the vector representation of the three words to the left of E1, m is the vector representation of the words between E1 and E2, and r is the vector representation of the three words to the right of E2. The similarity between templates is calculated as in the following example. Template 1:
P1 = <l1, E1, m1, E2, r1>
Template 2:
P2 = <l2, E1', m2, E2', r2>
If the condition E1 = E1' && E2 = E2' is satisfied, i.e. the event 1 type E1 of template P1 is identical to the event 1 type E1' of template P2 and the event 2 type E2 of template P1 is identical to the event 2 type E2' of template P2, then the similarity of template P1 and template P2 can be calculated by
Sim(P1, P2) = mu1*(l1 . l2) + mu2*(m1 . m2) + mu3*(r1 . r2)
where mu1, mu2, mu3 are weights; because the middle vectors m1 and m2 have the greatest influence on the similarity result, mu2 > mu1 = mu3 can be set. If the condition E1 = E1' && E2 = E2' is not satisfied, the similarity of template P1 and template P2 is recorded as 0.
Step 4.4, first scan the original text with the event recognition model trained in step 3 to identify the event types contained in the text; then match the original text with the templates in the rule base generated in step 4.3, and convert the matched text into the five-tuple template form.
Step 4.5, calculate the similarity between each new template generated in step 4.4 and the templates in the rule base; discard templates whose similarity is below the threshold 0.7, and add the events in templates whose similarity exceeds 0.7 to the event relation table.
Step 4.6, repeat steps 4.2-4.5 until the original text has been fully processed.
In step 6, the BERT-BiLSTM-ATTENTION-SOFTMAX algorithm is used to train the relation recognition model. The whole model consists of four parts: a BERT layer, a BiLSTM layer, an ATTENTION layer and a SOFTMAX layer. The BERT pre-training model is used to obtain word vectors containing the contextual feature information of the specifications; the BiLSTM layer performs feature extraction, exploiting the sequence information of the whole text; the ATTENTION layer calculates attention probabilities to highlight the importance of key words in the text; and the SOFTMAX layer generates the probability of each relation class, the class with the maximum probability being taken as the model prediction.
In step 9, there are texts in the standard that refer to the same event. To avoid a large amount of redundant information in the event database, an event unification algorithm is adopted, with the following specific steps:
Step 9.1, train a word2vec model on the original rail transit text.
Step 9.2, input the rail transit events into the word2vec model generated in step 9.1 to generate event vectors.
Step 9.3, calculate the similarity between events with the cosine function, and cluster events whose similarity value exceeds 0.8 into one class. The cosine function is as follows:
cos(A, B) = (A . B) / (|A| |B|) = (sum_i A_i B_i) / (sqrt(sum_i A_i^2) * sqrt(sum_i B_i^2))
Step 9.4, pair the new events generated in step 9.3 with all events in random combinations and calculate the similarity of each event pair.
Step 9.5, input each event pair and its similarity into the trained logistic regression binary classification model, and judge whether the events are similar. The logistic regression model is as follows:
h(x) = 1 / (1 + e^(-(w^T x + b)))
Step 9.6, according to the classification result of step 9.5: if the events are similar, discard one of them; if not, store both.
The beneficial effects of the invention are as follows:
Aiming at the complicated engineering information of the rail transit construction design stage, the defects of traditional knowledge graphs and the large workload of knowledge graph construction, the invention provides a method for constructing a knowledge graph of rail transit events based on deep learning. Training data for the rail transit event recognition model are constructed by event-trigger dictionary matching and manual labeling; a standard-event recognition model is trained with the BERT-BiLSTM-CRF algorithm to automatically extract standard-entry events from rail transit design standard texts; the events output by the event recognition model are unified with a word2vec model, cosine-similarity clustering and a logistic regression binary classification model; training data for the event relation model are constructed with the snowball algorithm; and a relation recognition model is trained with the BERT-BiLSTM-ATTENTION-SOFTMAX algorithm to automatically extract the relations between events, forming a rail transit event knowledge graph. The informatization of rail transit construction design engineering is improved and the workload of graph construction is reduced.
Drawings
FIG. 1 is a general flowchart of the method for constructing a knowledge graph of rail transit events based on deep learning according to the invention;
FIG. 2 shows the process of constructing the event training data set by dictionary matching and manual labeling;
FIG. 3 shows the process of building the specification event recognition model based on the BERT-BiLSTM-CRF algorithm;
FIG. 4 shows the process of unifying the events output by the event recognition model with the word2vec model, cosine-similarity clustering and the logistic regression binary classification model;
FIG. 5 shows the process of constructing the training data of the event relation model with the snowball algorithm;
FIG. 6 shows the process of building the relation recognition model based on the BERT-BiLSTM-ATTENTION-SOFTMAX algorithm.
Detailed Description
The present invention will be described in detail below with reference to the accompanying drawings and specific embodiments.
Referring to fig. 1, the method for constructing a knowledge graph of rail transit events based on deep learning specifically comprises the following steps:
Step 1, as shown in fig. 2, construct training data for the event recognition model from the original text by event-trigger dictionary matching and manual labeling. The pseudo code of the training set labeling algorithm is as follows:
(Pseudo code reproduced only as an image in the original publication; not shown here.)
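As an illustrative stand-in for the labeling pseudo code above (the figure is not reproduced), the dictionary-matching pre-labeling of step 1 can be sketched as follows. `TRIGGER_DICT`, the `B-TRG`/`O` tag set and the function name are assumptions, not the patent's notation; in practice annotators then correct and complete the automatic labels:

```python
# Hypothetical trigger dictionary; in the patent it is expanded via a synonym forest.
TRIGGER_DICT = {"set", "install", "provide"}

def auto_label(tokens, trigger_dict=TRIGGER_DICT):
    """Pre-label a tokenized sentence: B-TRG for dictionary hits, O elsewhere.
    Manual labeling then only corrects these labels instead of starting from scratch."""
    return ["B-TRG" if tok in trigger_dict else "O" for tok in tokens]
```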
Step 2, extract a training set from the rail transit design standard events and preprocess it; divide the texts in the training set by standard entry and label them with parts of speech.
Step 3, as shown in fig. 3, train the rail transit design specification event recognition model on the text processed in step 2 with the BERT-BiLSTM-CRF algorithm. The pseudo code for constructing the event recognition model is as follows:
(Pseudo code reproduced only as an image in the original publication; not shown here.)
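The CRF layer's role, filtering out invalid label sequences through learned transition constraints, can be illustrated with a minimal Viterbi decoder. This is a didactic sketch in log-score space, not the patent's BERT-BiLSTM-CRF implementation:

```python
def viterbi(emissions, transitions, labels):
    """Return the highest-scoring label sequence given per-token emission scores
    (list of {label: score} dicts) and label-to-label transition scores,
    as the CRF decoding step would produce."""
    n = len(emissions)
    best = [dict() for _ in range(n)]   # best[i][y]: best score ending at i with y
    back = [dict() for _ in range(n)]   # backpointers for path recovery
    for y in labels:
        best[0][y] = emissions[0][y]
    for i in range(1, n):
        for y in labels:
            prev = max(labels, key=lambda p: best[i - 1][p] + transitions[(p, y)])
            best[i][y] = best[i - 1][prev] + transitions[(prev, y)] + emissions[i][y]
            back[i][y] = prev
    last = max(labels, key=lambda y: best[n - 1][y])
    path = [last]
    for i in range(n - 1, 0, -1):
        last = back[i][last]
        path.append(last)
    return path[::-1]
```

With the transition score from O to I set very low, the decoder never predicts an I tag directly after an O tag, which is the kind of sentence constraint the CRF layer learns.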
and 4, as shown in fig. 5, constructing an event relation recognition model training set by adopting a semi-supervised snowball algorithm on the original text. The snowball algorithm comprises the following specific steps:
step 4.1, manually marking a small number of event relations to form an event relation table; each event relationship is to an event relationship table.
Step 4.2, matching the original sentence containing the event in the event relation table in the original text by using the existing event relation table, and generating a template; the format of the template is five-tuple form, which is < left >, event 1 type, < middle >, event 2 type, < right > respectively; len is a length which can be set arbitrarily, < left > is a vector representation of len words on the left side of the event 1, < middle > is a vector representation of words between the event 1 and the event 2, and < right > is a vector representation of len words on the right side of the event; the event 1 type is a numerical definition event, and the event 2 type is a numerical definition event.
Step 4.3, cluster the generated templates: templates whose similarity exceeds the threshold 0.7 are clustered into one class, a new template is generated by averaging, and the new template is added to a rule base that stores the templates. From step 4.2 the template format can be written as
P = <l, E1, m, E2, r>
where E1 and E2 denote the event 1 type and the event 2 type of template P, l is the vector representation of the three words to the left of E1, m is the vector representation of the words between E1 and E2, and r is the vector representation of the three words to the right of E2. The similarity between templates is calculated as in the following example. Template 1:
P1 = <l1, E1, m1, E2, r1>
Template 2:
P2 = <l2, E1', m2, E2', r2>
If the condition E1 = E1' && E2 = E2' is satisfied, i.e. the event 1 type E1 of template P1 is identical to the event 1 type E1' of template P2 and the event 2 type E2 of template P1 is identical to the event 2 type E2' of template P2, then the similarity of template P1 and template P2 can be calculated by
Sim(P1, P2) = mu1*(l1 . l2) + mu2*(m1 . m2) + mu3*(r1 . r2)
where mu1, mu2, mu3 are weights; because the middle vectors m1 and m2 have the greatest influence on the similarity result, mu2 > mu1 = mu3 can be set. If the condition E1 = E1' && E2 = E2' is not satisfied, the similarity of template P1 and template P2 is recorded as 0.
Step 4.4, first scan the original text with the event recognition model trained in step 3 to identify the event types contained in the text; then match the original text with the templates in the rule base generated in step 4.3, and convert the matched text into the five-tuple template form.
Step 4.5, calculate the similarity between each new template generated in step 4.4 and the templates in the rule base; discard templates whose similarity is below the threshold 0.7, and add the events in templates whose similarity exceeds 0.7 to the event relation table.
Step 4.6, repeat steps 4.2-4.5 until the original text has been fully processed.
Step 5, extract a training set from the rail transit design specification event relations generated in step 4 and preprocess it; divide the text into event pairs.
Step 6, train the relation recognition model on the text processed in step 5 with the BERT-BiLSTM-ATTENTION-SOFTMAX algorithm. As shown in fig. 6, the pseudo code for constructing the event relation recognition model is as follows:
(Pseudo code reproduced only as an image in the original publication; not shown here.)
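As an illustrative stand-in for the relation-model pseudo code, the roles of the ATTENTION and SOFTMAX layers can be sketched in isolation. The vectors, class names and weights below are toy assumptions, not the trained model:

```python
import math

def softmax(scores):
    """Turn raw scores into probabilities (numerically stabilized)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_pool(token_vecs, query):
    """ATTENTION layer sketch: weight token vectors by attention probabilities
    (dot-product scores against a query), highlighting the key words."""
    scores = [sum(t * q for t, q in zip(vec, query)) for vec in token_vecs]
    probs = softmax(scores)
    dim = len(token_vecs[0])
    return [sum(p * vec[d] for p, vec in zip(probs, token_vecs)) for d in range(dim)]

def predict_relation(token_vecs, query, class_weights, classes):
    """SOFTMAX layer sketch: score each relation class over the pooled
    representation and take the class with the maximum probability."""
    pooled = attention_pool(token_vecs, query)
    scores = [sum(w * x for w, x in zip(class_weights[c], pooled)) for c in classes]
    probs = softmax(scores)
    return classes[probs.index(max(probs))]
```

In the real model the token vectors come from the BERT and BiLSTM layers and the query and class weights are learned; here they are fixed for illustration.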
and 7, preprocessing the rail transit design specification to divide the items according to the specification.
And 8, inputting the rail transit standard text preprocessed in the step 7 into the event recognition model generated in the step 3, and extracting events in the standard, wherein the events comprise event trigger words and event elements.
Step 9, as shown in fig. 4, unify the events identified in step 8. There are texts in the specification that refer to the same event; to avoid a large amount of redundant information in the event database, an event unification algorithm is adopted, with the following specific steps:
Step 9.1, train a word2vec model on the original rail transit text.
Step 9.2, input the rail transit events into the word2vec model generated in step 9.1 to generate event vectors.
Step 9.3, calculate the similarity between events with the cosine function, and cluster events whose similarity value exceeds 0.8 into one class. The cosine function is as follows:
cos(A, B) = (A . B) / (|A| |B|) = (sum_i A_i B_i) / (sqrt(sum_i A_i^2) * sqrt(sum_i B_i^2))
Step 9.4, pair the new events generated in step 9.3 with all events in random combinations and calculate the similarity of each event pair.
Step 9.5, input each event pair and its similarity into the trained logistic regression binary classification model, and judge whether the events are similar. The logistic regression model is as follows:
h(x) = 1 / (1 + e^(-(w^T x + b)))
Step 9.6, according to the classification result of step 9.5: if the events are similar, discard one of them; if not, store both.
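Steps 9.3 to 9.6 can be sketched as a greedy clustering over event vectors. This illustration replaces the trained logistic regression classifier of step 9.5 with a fixed cosine threshold, and all names are assumptions:

```python
import math

def cosine(a, b):
    """Cosine similarity cos(A, B) = (A.B) / (|A| |B|)."""
    num = sum(x * y for x, y in zip(a, b))
    den = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return num / den if den else 0.0

def unify(events, vectors, threshold=0.8):
    """Greedy single-pass unification: keep one representative per group of
    events whose vector similarity exceeds the threshold (steps 9.3-9.6).
    `vectors` maps each event to its word2vec-derived event vector."""
    kept = []
    for ev in events:
        # Discard an event if it is too similar to one already kept.
        if all(cosine(vectors[ev], vectors[k]) <= threshold for k in kept):
            kept.append(ev)
    return kept
```

In the patent, the final keep/discard decision is made by the trained logistic regression binary classifier rather than a hard threshold.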
Step 10, store the events unified in step 9 in an event database.
Step 11, store the events unified in step 9 in a database as "event element-relation-event trigger" triples. For example, 'the track centre trackbed surface is used as an emergency evacuation channel' is stored in the graph database as <track centre trackbed surface, subject, used as> and <emergency evacuation channel, object, used as>.
Step 12, take events from the event database generated in step 10, form event pairs, input them into the event relation recognition model generated in step 6, and extract the relations between events in the specification.
Step 13, store the event pairs formed in step 12 and the event relations extracted in step 12 in a database as "event trigger-relation-event trigger" triples. For example, the event relation between 'the track centre trackbed surface is used as an emergency evacuation channel' and 'the end vehicles of the train should be provided with special end doors and alighting facilities' is stored in the graph database as <used as, conditional relation, provided with>.
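The triple storage of steps 11 and 13 can be sketched with a minimal in-memory store; a production system would use a graph database, and the class and relation names below are illustrative:

```python
# Minimal in-memory stand-in (not the patent's graph database) for storing
# "event element-relation-event trigger" and "event trigger-relation-event trigger"
# triples and querying them by relation.
class TripleStore:
    def __init__(self):
        self.triples = []

    def add(self, head, relation, tail):
        """Store one (head, relation, tail) triple."""
        self.triples.append((head, relation, tail))

    def by_relation(self, relation):
        """Return all (head, tail) pairs connected by the given relation."""
        return [(h, t) for h, r, t in self.triples if r == relation]
```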
In summary, the method constructs training data for the rail transit event recognition model by event-trigger dictionary matching and manual labeling; trains a standard-event recognition model with the BERT-BiLSTM-CRF algorithm to automatically extract standard-entry events from rail transit design standard texts; unifies the events output by the event recognition model with a word2vec model, cosine-similarity clustering and a logistic regression binary classification model; constructs training data for the event relation model with the snowball algorithm; and trains a relation recognition model with the BERT-BiLSTM-ATTENTION-SOFTMAX algorithm to automatically extract the relations between events, forming a rail transit event knowledge graph. The informatization of rail transit construction design engineering is improved and the workload of graph construction is reduced.

Claims (7)

1. A method for constructing a knowledge graph of rail transit events based on deep learning, characterized in that: training data for a rail transit event recognition model are constructed by event-trigger dictionary matching combined with manual labeling; a standard-event recognition model is trained with a BERT-BiLSTM-CRF algorithm to automatically extract standard-entry events from rail transit design standard texts; the events output by the event recognition model are unified with a word2vec model, cosine-similarity clustering and a logistic regression binary classification model; training data for an event relation model are constructed with a snowball algorithm; and a relation recognition model is trained with a BERT-BiLSTM-ATTENTION-SOFTMAX algorithm to automatically extract the relations between events, forming a rail transit event knowledge graph.
2. The method for constructing a knowledge graph of rail transit events based on deep learning according to claim 1, characterized by specifically comprising the following steps:
Step 1, construct training data for the event recognition model from the original text by event-trigger dictionary matching and manual labeling.
Step 2, extract a training set from the rail transit design standard events and preprocess it; divide the texts in the training set by standard entry and label them with parts of speech.
Step 3, train the rail transit design specification event recognition model on the text processed in step 2 with the BERT-BiLSTM-CRF algorithm.
Step 4, construct event relation training data from the original text with the snowball algorithm.
Step 5, extract a training set from the rail transit design specification event relations generated in step 4 and preprocess it; divide the texts in the training set into event pairs.
Step 6, train the relation recognition model on the text processed in step 5 with the BERT-BiLSTM-ATTENTION-SOFTMAX algorithm.
Step 7, preprocess the rail transit design specification and divide it into entries according to the specification.
Step 8, input the rail transit standard text preprocessed in step 7 into the event recognition model generated in step 3 and extract the events in the standard, each consisting of an event trigger word and event elements.
Step 9, unify the events identified in step 8.
Step 10, store the events unified in step 9 in an event database.
Step 11, store the events unified in step 9 in a database as "event element-relation-event trigger" triples.
Step 12, take events from the event database generated in step 10, form event pairs, input them into the event relation recognition model generated in step 6, and extract the relations between events in the specification.
Step 13, store the event pairs formed in step 12 and the event relations extracted in step 12 in a database as "event trigger-relation-event trigger" triples.
3. The method for constructing a knowledge graph of rail transit events based on deep learning according to claim 2, wherein in step 1 an event consists of an event trigger word and event elements; because most event trigger words are fixed words, manual labeling is accelerated by dictionary matching when constructing the model training data, and the dictionary can be expanded by means of a synonym forest.
4. The method for constructing the rail transit event knowledge graph based on deep learning according to claim 2, wherein in step 3 an event recognition model is trained with the BERT-BiLSTM-CRF algorithm; the whole model consists of three parts: a BERT layer, a BiLSTM layer, and a CRF layer. The BERT pre-training model produces word vectors that contain the contextual feature information of the specification; the BiLSTM layer performs feature extraction, exploiting the sequence information of the whole text; the CRF layer learns sentence-level constraints and filters out invalid predicted label sequences.
5. The method for constructing the rail transit event knowledge graph based on deep learning according to claim 2, wherein in step 4 a semi-supervised snowball algorithm is used to build the training set of the event relation recognition model. The snowball algorithm proceeds as follows:
Step 4.1, manually label a small number of event relations; each labeled event relation is added to an event relation table.
Step 4.2, using the existing event relation table, match in the original text the original sentences that contain the events in the table, and generate templates; a template is a quintuple of the form < left >, event 1 type, < middle >, event 2 type, < right >; len is a length that can be set arbitrarily, < left > is the vector representation of the len words to the left of event 1, < middle > is the vector representation of the words between event 1 and event 2, and < right > is the vector representation of the len words to the right of event 2; the event 1 type and event 2 type are numerically defined event categories.
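The quintuple construction in step 4.2 can be sketched as follows, using raw words in place of word vectors for readability; the sentence, spans, and type codes are invented examples.

```python
# Sketch of building the snowball template quintuple:
# (<left>, event 1 type, <middle>, event 2 type, <right>).
def build_template(tokens, e1_span, e2_span, e1_type, e2_type, length=3):
    """e1_span/e2_span are (start, end) token indices, with event 1 first."""
    left = tokens[max(0, e1_span[0] - length):e1_span[0]]
    middle = tokens[e1_span[1]:e2_span[0]]
    right = tokens[e2_span[1]:e2_span[1] + length]
    return (left, e1_type, middle, e2_type, right)

tokens = ["若", "发生", "火灾", "则", "启动", "排烟", "系统"]
# event 1 = "发生 火灾" (tokens 1..3), event 2 = "启动 排烟" (tokens 4..6)
tpl = build_template(tokens, (1, 3), (4, 6), e1_type=1, e2_type=2)
print(tpl)  # (['若'], 1, ['则'], 2, ['系统'])
```

A real implementation would replace the word lists with their embedding vectors before comparing templates.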
Step 4.3, cluster the generated templates: templates whose mutual similarity exceeds the threshold 0.7 are grouped into one class, a new template is generated by averaging, and the new template is added to the rule base that stores the templates. From the format given in step 4.2, a template can be written as

P = (< left >, E1, < middle >, E2, < right >)

where E1 and E2 denote the event 1 type and event 2 type of template P, < left > is the vector representation of the len (here 3) words to the left of E1, < middle > is the vector representation of the words between E1 and E2, and < right > is the vector representation of the 3 words to the right of E2. The similarity between two templates, for example template P1 = (< left >1, E1, < middle >1, E2, < right >1) and template P2 = (< left >2, E′1, < middle >2, E′2, < right >2), is calculated as follows. If the condition E1 = E′1 && E2 = E′2 is satisfied, i.e. the event 1 type E1 of template P1 is identical to the event 1 type E′1 of template P2 and the event 2 type E2 of template P1 is identical to the event 2 type E′2 of template P2, then the similarity of template P1 and template P2 can be calculated as

sim(P1, P2) = μ1·(< left >1 · < left >2) + μ2·(< middle >1 · < middle >2) + μ3·(< right >1 · < right >2)

where μ1, μ2, μ3 are weights; because the < middle > context has the greatest influence on the calculated similarity between templates, one can set μ2 > μ1 = μ3. If the condition E1 = E′1 && E2 = E′2 is not satisfied, the similarity of template P1 and template P2 is recorded as 0.
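The weighted template similarity of step 4.3 can be sketched in a few lines; the dot-product combination, the context vectors, and the weights μ = (0.2, 0.6, 0.2) (chosen so the middle-context weight dominates) are illustrative assumptions, not values from the patent.

```python
# Sketch of weighted template similarity: dot products of the <left>,
# <middle>, <right> context vectors, weighted by mu1, mu2, mu3; similarity
# is 0 when the two templates' event types differ.
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def template_similarity(p1, p2, mu=(0.2, 0.6, 0.2)):
    l1, e1a, m1, e2a, r1 = p1
    l2, e1b, m2, e2b, r2 = p2
    if e1a != e1b or e2a != e2b:  # event types must match
        return 0.0
    return mu[0] * dot(l1, l2) + mu[1] * dot(m1, m2) + mu[2] * dot(r1, r2)

p1 = ([1.0, 0.0], "A", [0.5, 0.5], "B", [0.0, 1.0])
p2 = ([1.0, 0.0], "A", [0.5, 0.5], "B", [0.0, 1.0])
p3 = ([1.0, 0.0], "A", [0.5, 0.5], "C", [0.0, 1.0])
print(template_similarity(p1, p2))  # ≈ 0.7 (0.2*1 + 0.6*0.5 + 0.2*1)
print(template_similarity(p1, p3))  # 0.0 (event 2 types differ)
```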
Step 4.4, first scan the original text with the event recognition model trained in step 3 to identify the event types contained in the text; then match the original text against the templates in the rule base generated in step 4.3, and convert each matched text into the quintuple form of a template;
step 4.5, similarity calculation is carried out on the new template generated in the step 4.4 and templates in the rule base, the template with the similarity smaller than the threshold value of 0.7 is discarded, and the event in the template with the similarity larger than the threshold value of 0.7 is added into the event relation table;
and 4.6, repeatedly executing the steps 4.2-4.5 until the original text processing is finished.
6. The method for constructing the rail transit event knowledge graph based on deep learning according to claim 2, wherein in step 6 a relation recognition model is trained with the BERT-BiLSTM-ATTENTION-SOFTMAX algorithm; the whole model consists of four parts: a BERT layer, a BiLSTM layer, an ATTENTION layer, and a SOFTMAX layer. The BERT pre-training model produces word vectors that contain the contextual feature information of the specification; the BiLSTM layer performs feature extraction, exploiting the sequence information of the whole text; the ATTENTION layer calculates attention probabilities to highlight the importance of key words in the text; the SOFTMAX layer generates the probability of each relation class, and the class with the maximum probability is taken as the model's prediction.
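The ATTENTION and SOFTMAX layers of claim 6 can be sketched in pure Python; the per-token vectors stand in for BiLSTM outputs, and the query vector, dimensions, and class scores are invented examples.

```python
# Sketch of attention pooling plus softmax classification: attention weights
# over per-token feature vectors produce one sentence vector, and a softmax
# over class scores yields relation probabilities.
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention_pool(hidden, query):
    """Weight each token vector by softmax(h . q) and sum the weighted vectors."""
    scores = [sum(h * q for h, q in zip(vec, query)) for vec in hidden]
    weights = softmax(scores)
    dim = len(hidden[0])
    return [sum(w * vec[d] for w, vec in zip(weights, hidden)) for d in range(dim)]

hidden = [[0.1, 0.9], [0.8, 0.2], [0.5, 0.5]]  # 3 tokens, feature dim 2
query = [1.0, 0.0]                              # assumed learned query vector
sent_vec = attention_pool(hidden, query)

class_scores = [0.3, 1.2, -0.5]                 # stand-in logits for 3 relations
probs = softmax(class_scores)
print(probs.index(max(probs)))  # 1: the maximum-probability class is predicted
```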
7. The method for constructing the rail transit event knowledge graph based on deep learning according to claim 2, wherein in step 9, the specification text contains passages that refer to the same event; to avoid a great deal of redundant information in the event database, an event unification algorithm is adopted, with the following specific steps:
step 9.1, training a word2vec model by using the original track traffic text;
step 9.2, input each rail transit event into the word2vec model generated in step 9.1 to generate an event vector;
step 9.3, calculating the similarity between the events by utilizing the cosine function value, and clustering the events into a class according to the similarity value of more than 0.8; the cosine function is as follows:
cos(A, B) = (A · B) / (‖A‖ ‖B‖) = Σᵢ aᵢbᵢ / (√(Σᵢ aᵢ²) · √(Σᵢ bᵢ²))
step 9.4, take the new events generated in step 9.3, combine all events pairwise, and calculate the similarity between the event pairs;
and 9.5, inputting the similarity of the event pair and the event into a trained logistic regression two-classification model, and judging the similarity of the event. The logistic regression mathematical model is as follows:
h_θ(x) = 1 / (1 + e^(−θᵀx))
and 9.6, according to the classification result of step 9.5: if the events are judged similar, discard one of them; if they are not similar, store both.
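Steps 9.3, 9.5, and 9.6 can be sketched together; the event vectors are invented, and the logistic-regression weights theta are assumed values standing in for the trained parameters from step 9.5.

```python
# Sketch of the event-unification decision path: cosine similarity between
# word2vec event vectors (step 9.3), a sigmoid classifier on that similarity
# (step 9.5), and the keep/discard decision (step 9.6).
import math

def cosine(a, b):
    """Step 9.3: cosine similarity between two event vectors."""
    num = sum(x * y for x, y in zip(a, b))
    den = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return num / den

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def is_similar(sim, theta=(-4.0, 6.0)):
    """Step 9.5: logistic regression on (bias, similarity); theta is assumed."""
    z = theta[0] + theta[1] * sim
    return sigmoid(z) >= 0.5

e1 = [1.0, 2.0, 0.0]
e2 = [2.0, 4.0, 0.0]   # same direction as e1
e3 = [0.0, 0.0, 1.0]   # orthogonal to e1
print(round(cosine(e1, e2), 6))    # 1.0
print(is_similar(cosine(e1, e2)))  # True  -> step 9.6 discards one event
print(is_similar(cosine(e1, e3)))  # False -> step 9.6 stores both events
```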
CN202010365826.3A 2020-04-30 2020-04-30 Rail transit event knowledge graph construction method based on deep learning Active CN111597350B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010365826.3A CN111597350B (en) 2020-04-30 2020-04-30 Rail transit event knowledge graph construction method based on deep learning

Publications (2)

Publication Number Publication Date
CN111597350A true CN111597350A (en) 2020-08-28
CN111597350B CN111597350B (en) 2023-06-02

Family

ID=72186939

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010365826.3A Active CN111597350B (en) 2020-04-30 2020-04-30 Rail transit event knowledge graph construction method based on deep learning

Country Status (1)

Country Link
CN (1) CN111597350B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018028077A1 (en) * 2016-08-11 2018-02-15 中兴通讯股份有限公司 Deep learning based method and device for chinese semantics analysis
CN107908671A (en) * 2017-10-25 2018-04-13 南京擎盾信息科技有限公司 Knowledge mapping construction method and system based on law data
CN110633409A (en) * 2018-06-20 2019-12-31 上海财经大学 Rule and deep learning fused automobile news event extraction method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
洪文兴 et al., "Automatic construction of a case knowledge graph for judicial cases", Journal of Chinese Information Processing *
项威, "A survey of event knowledge graph construction techniques and applications", Computer and Modernization *

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112131401B (en) * 2020-09-14 2024-02-13 腾讯科技(深圳)有限公司 Concept knowledge graph construction method and device
CN112131401A (en) * 2020-09-14 2020-12-25 腾讯科技(深圳)有限公司 Method and device for constructing concept knowledge graph
CN112733874A (en) * 2020-10-23 2021-04-30 招商局重庆交通科研设计院有限公司 Suspicious vehicle discrimination method based on knowledge graph reasoning
CN112418696A (en) * 2020-11-27 2021-02-26 北京工业大学 Method and device for constructing urban traffic dynamic knowledge map
CN112418696B (en) * 2020-11-27 2024-06-18 北京工业大学 Construction method and device of urban traffic dynamic knowledge graph
CN112463989A (en) * 2020-12-11 2021-03-09 交控科技股份有限公司 Knowledge graph-based information acquisition method and system
CN112800762A (en) * 2021-01-25 2021-05-14 上海犀语科技有限公司 Element content extraction method for processing text with format style
CN113268591A (en) * 2021-04-17 2021-08-17 中国人民解放军战略支援部队信息工程大学 Air target intention evidence judging method and system based on affair atlas
CN113535979A (en) * 2021-07-14 2021-10-22 中国地质大学(北京) Method and system for constructing knowledge graph in mineral field
CN113546426B (en) * 2021-07-21 2023-08-22 西安理工大学 Security policy generation method for data access event in game service
CN113546426A (en) * 2021-07-21 2021-10-26 西安理工大学 Security policy generation method for data access event in game service
CN113987164A (en) * 2021-10-09 2022-01-28 国网江苏省电力有限公司电力科学研究院 Project studying and judging method and device based on domain event knowledge graph
CN115269931A (en) * 2022-09-28 2022-11-01 深圳技术大学 Rail transit station data map system based on service drive and construction method thereof
CN115269931B (en) * 2022-09-28 2022-11-29 深圳技术大学 Rail transit station data map system based on service drive and construction method thereof

Also Published As

Publication number Publication date
CN111597350B (en) 2023-06-02

Similar Documents

Publication Publication Date Title
CN111597350A (en) Rail transit event knowledge map construction method based on deep learning
CN109271631B (en) Word segmentation method, device, equipment and storage medium
CN111931506B (en) Entity relationship extraction method based on graph information enhancement
CN111209401A (en) System and method for classifying and processing sentiment polarity of online public opinion text information
CN111858932A (en) Multiple-feature Chinese and English emotion classification method and system based on Transformer
CN114036933B (en) Information extraction method based on legal documents
CN111783399A (en) Legal referee document information extraction method
CN112906397B (en) Short text entity disambiguation method
CN111832293B (en) Entity and relation joint extraction method based on head entity prediction
CN112084336A (en) Entity extraction and event classification method and device for expressway emergency
CN110717045A (en) Letter element automatic extraction method based on letter overview
CN113204967B (en) Resume named entity identification method and system
CN111897917B (en) Rail transit industry term extraction method based on multi-modal natural language features
CN113239663B (en) Multi-meaning word Chinese entity relation identification method based on Hopkinson
CN113934909A (en) Financial event extraction method based on pre-training language and deep learning model
CN112818698A (en) Fine-grained user comment sentiment analysis method based on dual-channel model
CN111597349B (en) Rail transit standard entity relation automatic completion method based on artificial intelligence
CN114239574A (en) Miner violation knowledge extraction method based on entity and relationship joint learning
CN116432645A (en) Traffic accident named entity recognition method based on pre-training model
CN116010553A (en) Viewpoint retrieval system based on two-way coding and accurate matching signals
CN116522165B (en) Public opinion text matching system and method based on twin structure
CN116910272B (en) Academic knowledge graph completion method based on pre-training model T5
CN112651241A (en) Chinese parallel structure automatic identification method based on semi-supervised learning
Wu et al. One improved model of named entity recognition by combining BERT and BiLSTM-CNN for domain of Chinese railway construction
CN111522913A (en) Emotion classification method suitable for long text and short text

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant