CN111597350B - Rail transit event knowledge graph construction method based on deep learning - Google Patents

Rail transit event knowledge graph construction method based on deep learning

Info

Publication number
CN111597350B
CN111597350B (granted from application CN202010365826.3A)
Authority
CN
China
Prior art keywords
event
template
model
events
similarity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010365826.3A
Other languages
Chinese (zh)
Other versions
CN111597350A (en)
Inventor
黑新宏
彭伟
朱磊
赵钦
王一川
姬文江
姚燕妮
焦瑞
董林靖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xian University of Technology
Original Assignee
Xian University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xian University of Technology filed Critical Xian University of Technology
Priority to CN202010365826.3A priority Critical patent/CN111597350B/en
Publication of CN111597350A publication Critical patent/CN111597350A/en
Application granted granted Critical
Publication of CN111597350B publication Critical patent/CN111597350B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval of unstructured textual data
    • G06F16/36 Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367 Ontology
    • G06F16/35 Clustering; Classification
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/237 Lexical tools
    • G06F40/247 Thesauruses; Synonyms
    • G06F40/279 Recognition of textual entities
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT]
    • Y02D10/00 Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method for constructing a rail transit event knowledge graph based on deep learning. Training data for the event recognition model are built by dictionary matching combined with manual labeling; a specification-event recognition model is trained with the BERT-BiLSTM-CRF algorithm to automatically extract specification-clause events from rail transit design specification texts; a word2vec model, cosine-similarity clustering, and a logistic-regression binary classifier are used to unify the events output by the event recognition model; the Snowball algorithm is used to construct training data for the event relation model; and a relation recognition model is trained with the BERT-BiLSTM-ATTENTION-SOFTMAX algorithm to automatically extract the relations between events. The method improves the informatization of rail transit construction design engineering and reduces the workload of graph construction.

Description

Rail transit event knowledge graph construction method based on deep learning
Technical Field
The invention belongs to an important direction in the field of artificial intelligence, and particularly relates to a method for constructing a rail transit event knowledge graph based on deep learning.
Background
With the rapid development of Internet technology, many industries have been deeply integrated with emerging artificial intelligence technologies and have achieved remarkable results. Urban rail transit is a hallmark of urban modernization and plays an important role in promoting urban economic development. Rail transit construction is a complex engineering undertaking characterized by enormous scale, long construction periods, and huge investment. The early design and planning stage of a rail transit construction project is the foundation of the later works; only when the initial design and planning are completed can the later construction be guaranteed. However, at the design and planning stage, the design specification standards consulted are numerous and each specification clause carries a huge amount of information, while the degree of informatization of the whole rail transit construction project is low, which makes querying specification content difficult. The design stage also places very high demands on designers' professional ability, making design work very challenging. A knowledge graph is therefore needed to represent rail transit design specification knowledge and to promote the informatization of rail transit construction engineering.
Most current knowledge graphs are entity knowledge graphs centered on entities, but entity information is divorced from its specific context, so the semantic information it carries is one-sided. Compared with entities, events express semantic information more clearly, and the specification clauses of rail transit design standards naturally contain event expressions, so the design specifications are represented here in the form of an event knowledge graph. Moreover, most traditional knowledge graph construction methods have a low degree of automation and are time-consuming and labor-intensive. The proposed deep-learning-based method for constructing a rail transit event knowledge graph improves automation and reduces workload.
Disclosure of Invention
The invention aims to provide a method for constructing a rail transit event knowledge graph based on deep learning. Representing the specifications with an event knowledge graph makes the semantics of the represented content richer and more accurate, and deep learning addresses the low automation and the time and labor costs of traditional graph construction techniques.
The technical scheme adopted is as follows: training data for the rail transit event recognition model are constructed by event-trigger-word dictionary matching combined with manual labeling; a specification-event recognition model is trained with the BERT-BiLSTM-CRF algorithm to automatically extract specification-clause events from rail transit design specification texts; a word2vec model, cosine-similarity clustering, and a logistic-regression binary classifier are used to unify the events output by the event recognition model; the Snowball algorithm is used to construct training data for the event relation model; and a relation recognition model is trained with the BERT-BiLSTM-ATTENTION-SOFTMAX algorithm to automatically extract the relations between events, forming the rail transit event knowledge graph. The event knowledge graph construction process comprises the following steps:
Step 1: construct training data for the event recognition model from the original text by event-trigger-word dictionary matching and manual labeling.
Step 2: preprocess the rail transit design specification event-extraction training set, split the texts in the training set by specification clause, and tag the parts of speech of the texts.
Step 3: train the rail transit design specification event recognition model on the text processed in step 2 with the BERT-BiLSTM-CRF algorithm.
Step 4: construct event relation training data from the original text with the Snowball algorithm.
Step 5: preprocess the rail transit design specification event-relation extraction training set generated in step 4, splitting the texts in the training set into event pairs.
Step 6: train the relation recognition model on the text processed in step 5 with the BERT-BiLSTM-ATTENTION-SOFTMAX algorithm.
Step 7: preprocess the rail transit design specification to split it into specification clauses.
Step 8: input the specification text preprocessed in step 7 into the event recognition model generated in step 3, and extract the events in the specifications, where an event comprises an event trigger word and event elements.
Step 9: unify the events recognized in step 8.
Step 10: store the events unified in step 9 in an event database.
Step 11: store the events unified in step 9 in a graph database as triples of the form event element - relation - event trigger word.
Step 12: take events from the event database generated in step 10, form event pairs, input them into the event relation recognition model generated in step 6, and extract the relations between events in the specifications.
Step 13: store the event pairs from step 10 and the event relations extracted in step 12 in the graph database as triples of the form event trigger word - relation - event trigger word.
In step 1, an event consists of an event trigger word and event elements. Because most event trigger words come from a fixed vocabulary, dictionary matching is used to speed up the manual annotation used to build the model training data, and the dictionary can be expanded by means of a synonym forest.
In step 3, the event recognition model is trained with the BERT-BiLSTM-CRF algorithm. The whole model consists of three parts: a BERT layer, a BiLSTM layer, and a CRF layer. The BERT pre-trained model produces word vectors that carry the contextual features of the specifications, the BiLSTM layer performs feature extraction, and the CRF layer uses the sequence information of the whole text to learn sentence-level constraints and filter out invalid prediction sequences.
In step 4, the training set of the event relation recognition model is constructed with the semi-supervised Snowball algorithm. The specific steps of the Snowball algorithm are as follows:
Step 4.1: manually annotate a small number of event relations to form event relation tables; each event relation corresponds to one event relation table.
Step 4.2: use the existing event relation tables to match, in the original text, the original sentences that contain the events in the event relation tables, and generate templates. A template takes the form of a five-tuple <left, event-1 type, middle, event-2 type, right>, where len is a freely configurable length: <left> is the vector representation of the len words to the left of event 1, <middle> is the vector representation of the words between event 1 and event 2, and <right> is the vector representation of the len words to the right of event 2. The event-1 type and event-2 type are both numeric-limit events.
Step 4.3: cluster the generated templates; templates whose similarity exceeds the threshold 0.7 are merged into one class, a new template is generated from them by averaging, and the new template is added to the rule base that stores the templates. From step 4.2 the template format can be written as

P = <l, E1, m, E2, r>

where E1 and E2 denote the event-1 type and event-2 type of template P, l is the vector representation of the 3 words to the left of E1, m is the vector representation of the words between E1 and E2, and r is the vector representation of the 3 words to the right of E2. Similarity between templates is computed as in the following example.

Template 1: P1 = <l1, E1, m1, E2, r1>

Template 2: P2 = <l2, E1', m2, E2', r2>

If the condition E1 = E1' && E2 = E2' is satisfied, that is, the event-1 type E1 of template P1 is identical to the event-1 type E1' of template P2 and the event-2 type E2 of P1 is identical to the event-2 type E2' of P2, the similarity of P1 and P2 can be calculated as

sim(P1, P2) = μ1 (l1 · l2) + μ2 (m1 · m2) + μ3 (r1 · r2)

where μ1, μ2, μ3 are weights; because the middle vectors have the larger influence on the inter-template similarity, one may set μ2 > μ1 = μ3. If the condition E1 = E1' && E2 = E2' is not satisfied, the similarity of templates P1 and P2 is recorded as 0.
Step 4.4: first scan the original text with the event recognition model trained in step 3 to recognize the event types contained in the text; then match the original text against the templates in the rule base generated in step 4.3 and convert the matched text into the five-tuple template form.
Step 4.5: calculate the similarity between each new template generated in step 4.4 and the templates in the rule base; discard templates whose similarity is below the threshold 0.7, and add the events of templates whose similarity exceeds 0.7 to the event relation table.
Step 4.6: repeat steps 4.2-4.5 until the original text has been fully processed.
In step 6, the relation recognition model is trained with the BERT-BiLSTM-ATTENTION-SOFTMAX algorithm. The whole model consists of four parts: a BERT layer, a BiLSTM layer, an ATTENTION layer, and a SOFTMAX layer. The BERT pre-trained model produces word vectors that carry the contextual features of the specifications, the BiLSTM layer performs feature extraction, the ATTENTION layer uses the sequence information of the whole text to compute attention probabilities that highlight the importance of key words, and the SOFTMAX layer produces the probabilities of the relation classes, the class with the largest probability being taken as the model's prediction.
In step 9, the specification texts contain passages that refer to the same event. To avoid creating a large amount of redundant information in the event database, an event unification algorithm is applied, with the following specific steps:
Step 9.1: train a word2vec model on the original rail transit texts.
Step 9.2: feed the rail transit events into the word2vec model generated in step 9.1 to produce event vectors.
Step 9.3: calculate the similarity between events with the cosine function, and group events whose similarity exceeds 0.8 into one class. The cosine similarity of event vectors A and B is

cos(θ) = (A · B) / (‖A‖ ‖B‖)
Step 9.4: pairwise combine the new events produced in step 9.3 and calculate the similarity between each event pair.
Step 9.5: input the event pairs and the similarity between the events into the trained logistic-regression binary classification model to judge whether the events are similar. The logistic-regression model is

h_θ(x) = 1 / (1 + e^(−θᵀx))
Step 9.6: according to the classification result of step 9.5, discard one of the two events if they are similar, and store both if they are not.
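Steps 9.3 to 9.6 above can be sketched as a greedy cosine-similarity grouping. The event strings and vectors below are toy values standing in for word2vec output, and the single-pass discard strategy is a simplification of the patent's clustering plus logistic-regression check.

```python
import numpy as np

def cosine(a, b):
    """Cosine similarity of two event vectors (step 9.3)."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def unify(events, vectors, threshold=0.8):
    """Greedy single-pass unification: an event whose cosine similarity with an
    already-kept event exceeds the threshold is treated as a duplicate and
    discarded; otherwise it is kept (steps 9.3-9.6, simplified)."""
    kept, kept_vecs = [], []
    for ev, vec in zip(events, vectors):
        if any(cosine(vec, kv) > threshold for kv in kept_vecs):
            continue                      # a similar event is already stored: discard
        kept.append(ev)
        kept_vecs.append(vec)
    return kept

events = ["platform width >= 2.5 m", "platform width no less than 2.5 m", "tunnel clearance"]
vectors = [np.array([1.0, 0.1]), np.array([0.98, 0.12]), np.array([0.0, 1.0])]
print(unify(events, vectors))   # the near-duplicate second event is dropped
```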
The beneficial effects of the invention are as follows:
aiming at the problems of complex engineering information, defects of the traditional knowledge graph and large graph construction workload in the track traffic construction design stage, the invention provides a method for constructing a track traffic event knowledge graph based on deep learning. Constructing training data of a rail transit event recognition model by adopting an event trigger word dictionary matching mode and a manual labeling mode; training a canonical event recognition model by adopting a BERT-BiLSTM-CRF algorithm, and automatically extracting a canonical item event from a rail transit design canonical text; adopting a word2vec model, clustering cosine similarity, and logically regressing a two-class model to unify the events output by the event recognition model; adopting a nonowball algorithm to construct training data of an event relation model; and (3) training a relationship identification model by adopting a BERT-BiLSTM-ATTENTION-SOFTMAX algorithm, and automatically extracting the relationship between the events to form a rail traffic event knowledge graph. The informatization of the track traffic construction design engineering is improved, and the workload of constructing the map is reduced.
Drawings
FIG. 1 is the overall flow chart of the deep-learning-based rail transit event knowledge graph construction method;
FIG. 2 is the process of constructing the event training data set with dictionary matching and manual labeling in the deep-learning-based rail transit event knowledge graph construction method;
FIG. 3 is the process of building the specification-event recognition model with the BERT-BiLSTM-CRF algorithm in the deep-learning-based rail transit event knowledge graph construction method;
FIG. 4 is the process of unifying the events output by the event recognition model with a word2vec model, cosine-similarity clustering, and a logistic-regression binary classifier in the deep-learning-based rail transit event knowledge graph construction method;
FIG. 5 is the process of constructing the training data of the event relation model with the Snowball algorithm in the deep-learning-based rail transit event knowledge graph construction method;
FIG. 6 is the process of building the relation recognition model with the BERT-BiLSTM-ATTENTION-SOFTMAX algorithm in the deep-learning-based rail transit event knowledge graph construction method.
Detailed Description
The invention will be described in detail below with reference to the drawings and the detailed description.
Referring to fig. 1, the method for constructing the rail transit event knowledge graph based on deep learning specifically comprises the following steps:
Step 1: as shown in FIG. 2, construct training data for the event recognition model from the original text by event-trigger-word dictionary matching and manual labeling. (The pseudo code of the training-set labeling algorithm is provided as an image in the original filing.)
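Since the labeling pseudo code is only available as an image, here is a minimal sketch of how trigger-word dictionary matching could pre-label tags for subsequent manual correction; the dictionary entries, tokens, and tag names are hypothetical, not taken from the patent.

```python
def dict_prelabel(tokens, trigger_dict):
    """Assign B-TRG to tokens found in the trigger-word dictionary and O
    elsewhere; the output is a draft annotation to be corrected manually
    (step 1 of the method)."""
    return [(tok, "B-TRG" if tok in trigger_dict else "O") for tok in tokens]

# Hypothetical trigger words; the patent expands such a dictionary
# by means of a synonym forest.
triggers = {"设置", "采用", "铺设"}
tokens = ["车站", "应", "设置", "应急", "疏散", "通道"]
print(dict_prelabel(tokens, triggers))
```

Only the dictionary hits are labeled automatically; annotators then correct the draft and mark the event elements around each trigger.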
Step 2: preprocess the rail transit design specification event-extraction training set, split the texts in the training set by specification clause, and tag the parts of speech of the texts.
And 3, as shown in fig. 3, training a track traffic design specification event recognition model by using the BERT-BiLSTM-CRF algorithm on the text processed in the step 2. The pseudo code for constructing the event recognition model is as follows:
Figure BDA0002476714200000091
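The CRF layer's role of filtering invalid tag sequences can be illustrated with a small Viterbi decoder over BIO-style tags. This is a generic sketch with illustrative scores, not the patent's trained model: a large negative transition score forbids an I-EVT tag directly after O, so the decoder never emits that invalid sequence even when the per-token scores favor it.

```python
import numpy as np

def viterbi_decode(emissions, transitions):
    """Return the highest-scoring tag sequence.

    emissions: (seq_len, num_tags) per-token tag scores (e.g. BiLSTM output).
    transitions: (num_tags, num_tags) score of moving from tag i to tag j.
    """
    seq_len, num_tags = emissions.shape
    score = emissions[0].copy()              # best score ending in each tag
    backpointers = []
    for t in range(1, seq_len):
        # broadcast: previous score + transition + current emission
        total = score[:, None] + transitions + emissions[t][None, :]
        backpointers.append(total.argmax(axis=0))
        score = total.max(axis=0)
    best_tag = int(score.argmax())
    path = [best_tag]
    for bp in reversed(backpointers):
        best_tag = int(bp[best_tag])
        path.append(best_tag)
    return path[::-1]

# Tags: 0=O, 1=B-EVT, 2=I-EVT. Forbid the transition O -> I-EVT with a large
# negative score, so an I tag can never follow O.
transitions = np.zeros((3, 3))
transitions[0, 2] = -1e4

emissions = np.array([
    [2.0, 0.1, 0.1],   # token 0: strongly O
    [0.1, 0.3, 1.0],   # token 1: raw scores favour the invalid I-EVT
])
print(viterbi_decode(emissions, transitions))   # → [0, 1]: constraint forces B-EVT
```

Without the constraint (all-zero transitions) the same emissions would decode to the invalid sequence [O, I-EVT]; this is exactly the kind of error sequence the CRF layer filters out.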
Step 4: as shown in FIG. 5, construct the training set of the event relation recognition model from the original text with the semi-supervised Snowball algorithm. The specific steps of the Snowball algorithm are as follows:
Step 4.1: manually annotate a small number of event relations to form event relation tables; each event relation corresponds to one event relation table.
Step 4.2: use the existing event relation tables to match, in the original text, the original sentences that contain the events in the event relation tables, and generate templates. A template takes the form of a five-tuple <left, event-1 type, middle, event-2 type, right>, where len is a freely configurable length: <left> is the vector representation of the len words to the left of event 1, <middle> is the vector representation of the words between event 1 and event 2, and <right> is the vector representation of the len words to the right of event 2. The event-1 type and event-2 type are both numeric-limit events.
Step 4.3: cluster the generated templates; templates whose similarity exceeds the threshold 0.7 are merged into one class, a new template is generated from them by averaging, and the new template is added to the rule base that stores the templates. From step 4.2 the template format can be written as

P = <l, E1, m, E2, r>

where E1 and E2 denote the event-1 type and event-2 type of template P, l is the vector representation of the 3 words to the left of E1, m is the vector representation of the words between E1 and E2, and r is the vector representation of the 3 words to the right of E2. Similarity between templates is computed as in the following example.

Template 1: P1 = <l1, E1, m1, E2, r1>

Template 2: P2 = <l2, E1', m2, E2', r2>

If the condition E1 = E1' && E2 = E2' is satisfied, that is, the event-1 type E1 of template P1 is identical to the event-1 type E1' of template P2 and the event-2 type E2 of P1 is identical to the event-2 type E2' of P2, the similarity of P1 and P2 can be calculated as

sim(P1, P2) = μ1 (l1 · l2) + μ2 (m1 · m2) + μ3 (r1 · r2)

where μ1, μ2, μ3 are weights; because the middle vectors have the larger influence on the inter-template similarity, one may set μ2 > μ1 = μ3. If the condition E1 = E1' && E2 = E2' is not satisfied, the similarity of templates P1 and P2 is recorded as 0.
Step 4.4, firstly, scanning an original text by using the event recognition model trained in the step 3, recognizing event types contained in the text, then, matching the original text by using the template in the rule base generated in the step 4.3, and converting the matched text into a five-tuple form of the template;
step 4.5, calculating the similarity between the new template generated in the step 4.4 and the template in the rule base, discarding the template with the similarity smaller than the threshold value 0.7, and adding the event in the template with the similarity larger than the threshold value 0.7 into the event relation table;
and 4.6, repeatedly executing the steps 4.2-4.5 until the original text processing is finished.
Step 5: preprocess the rail transit design specification event-relation extraction training set generated in step 4, splitting the texts in the training set into event pairs.
Step 6: as shown in FIG. 6, train the relation recognition model on the text processed in step 5 with the BERT-BiLSTM-ATTENTION-SOFTMAX algorithm. (The pseudo code for building the event relation recognition model is provided as an image in the original filing.)
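The ATTENTION and SOFTMAX stages of step 6 can be sketched in NumPy. The dimensions and random weights are illustrative assumptions, with the BERT and BiLSTM outputs replaced by toy matrices; the sketch only shows attention-weighted pooling followed by a softmax over relation classes.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def attention_pool(H, w):
    """Score each time step of the BiLSTM output H (seq_len x hidden) against a
    learned vector w, normalize the scores with softmax, and return the
    attention-weighted sum: a single sentence vector emphasizing key words."""
    alpha = softmax(H @ w)          # (seq_len,) attention probabilities
    return alpha @ H                # (hidden,) weighted combination

rng = np.random.default_rng(0)
H = rng.normal(size=(5, 8))         # toy BiLSTM outputs for 5 tokens
w = rng.normal(size=8)              # toy attention parameter
sent = attention_pool(H, w)

W_out = rng.normal(size=(8, 4))     # 4 hypothetical relation classes
probs = softmax(sent @ W_out)       # SOFTMAX layer: class probabilities
print(int(probs.argmax()))          # the largest probability is the prediction
```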
Step 7: preprocess the rail transit design specification to split it into specification clauses.
Step 8: input the specification text preprocessed in step 7 into the event recognition model generated in step 3, and extract the events in the specifications, where an event comprises an event trigger word and event elements.
Step 9: as shown in FIG. 4, unify the events recognized in step 8. The specification texts contain passages that refer to the same event; to avoid creating a large amount of redundant information in the event database, an event unification algorithm is applied, with the following specific steps:
Step 9.1: train a word2vec model on the original rail transit texts.
Step 9.2: feed the rail transit events into the word2vec model generated in step 9.1 to produce event vectors.
Step 9.3: calculate the similarity between events with the cosine function, and group events whose similarity exceeds 0.8 into one class. The cosine similarity of event vectors A and B is

cos(θ) = (A · B) / (‖A‖ ‖B‖)
Step 9.4: pairwise combine the new events produced in step 9.3 and calculate the similarity between each event pair.
Step 9.5: input the event pairs and the similarity between the events into the trained logistic-regression binary classification model to judge whether the events are similar. The logistic-regression model is

h_θ(x) = 1 / (1 + e^(−θᵀx))
Step 9.6: according to the classification result of step 9.5, discard one of the two events if they are similar, and store both if they are not.
Step 10: store the events unified in step 9 in an event database.
Step 11: store the events unified in step 9 in a graph database as triples of the form event element - relation - event trigger word. For example, for the clause "the track center bed surface serves as the emergency evacuation channel", the triples <track center bed surface, subject, serves as> and <emergency evacuation channel, object, serves as> are stored in the graph database.
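The triple decomposition of step 11 can be sketched in plain Python using the example event from the text; the role names and helper function are assumptions for illustration, and a real system would write the resulting triples into a graph database.

```python
def event_to_triples(trigger, elements):
    """Decompose one event into (element, role, trigger) triples, following
    the 'event element - relation - event trigger word' form of step 11."""
    return [(elem, role, trigger) for role, elem in elements.items()]

# The example from the specification text: "the track center bed surface
# serves as the emergency evacuation channel" (trigger: "serves as").
triples = event_to_triples("serves as", {
    "subject": "track center bed surface",
    "object": "emergency evacuation channel",
})
for t in triples:
    print(t)
```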
Step 12: take events from the event database generated in step 10, form event pairs, input them into the event relation recognition model generated in step 6, and extract the relations between events in the specifications.
Step 13: store the event pairs from step 10 and the event relations extracted in step 12 in the graph database as triples of the form event trigger word - relation - event trigger word. For example, from "the track center bed surface serves as the emergency evacuation channel" and "the end cars of the train should be provided with dedicated end doors and detraining facilities", the event relation triple <serves as, condition relation, provided with> is stored in the graph database.
The invention constructs training data for the rail transit event recognition model by event-trigger-word dictionary matching combined with manual labeling; trains a specification-event recognition model with the BERT-BiLSTM-CRF algorithm to automatically extract specification-clause events from rail transit design specification texts; uses a word2vec model, cosine-similarity clustering, and a logistic-regression binary classifier to unify the events output by the event recognition model; uses the Snowball algorithm to construct training data for the event relation model; and trains a relation recognition model with the BERT-BiLSTM-ATTENTION-SOFTMAX algorithm to automatically extract the relations between events, forming the rail transit event knowledge graph. The method improves the informatization of rail transit construction design engineering and reduces the workload of graph construction.

Claims (2)

1. A method for constructing a rail transit event knowledge graph based on deep learning, characterized in that training data for the rail transit event recognition model are constructed by event-trigger-word dictionary matching combined with manual labeling; a specification-event recognition model is trained with the BERT-BiLSTM-CRF algorithm to automatically extract specification-clause events from rail transit design specification texts; a word2vec model, cosine-similarity clustering, and a logistic-regression binary classifier are used to unify the events output by the event recognition model; the Snowball algorithm is used to construct training data for the event relation model; and a relation recognition model is trained with the BERT-BiLSTM-ATTENTION-SOFTMAX algorithm to automatically extract the relations between events, forming the rail transit event knowledge graph;
the method specifically comprises the following steps:
step 1, constructing training data for the event recognition model from the original text by event-trigger-word dictionary matching and manual labeling, wherein in step 1 an event consists of an event trigger word and event elements; because most event trigger words come from a fixed vocabulary, dictionary matching is used to speed up the manual annotation used to build the model training data; the dictionary can be expanded by means of a synonym forest;
step 2, preprocessing the rail transit design specification event-extraction training set, splitting the texts in the training set by specification clause, and tagging the parts of speech of the texts;
step 3, training the rail transit design specification event recognition model on the text processed in step 2 with the BERT-BiLSTM-CRF algorithm; in step 3 the whole model consists of three parts: a BERT layer, a BiLSTM layer, and a CRF layer; the BERT pre-trained model produces word vectors that carry the contextual features of the specifications, the BiLSTM layer performs feature extraction, and the CRF layer uses the sequence information of the whole text to learn sentence-level constraints and filter out invalid prediction sequences;
step 4, constructing event relation training data from the original text with the Snowball algorithm, wherein in step 4 the training set of the event relation recognition model is constructed with the semi-supervised Snowball algorithm, whose specific steps are as follows:
step 4.1, manually annotating a small number of event relations to form event relation tables, each event relation corresponding to one event relation table;
step 4.2, using the existing event relation tables to match, in the original text, the original sentences that contain the events in the event relation tables, and generating templates; a template takes the form of a five-tuple <left, event-1 type, middle, event-2 type, right>, where len is a freely configurable length: <left> is the vector representation of the len words to the left of event 1, <middle> is the vector representation of the words between event 1 and event 2, and <right> is the vector representation of the len words to the right of event 2; the event-1 type and event-2 type are both numeric-limit events;
step 4.3, clustering the generated templates: templates whose similarity is greater than a threshold of 0.7 are clustered into one class, a new template is generated from each class by averaging, and the new template is added to a rule base that stores the templates; from the format given in step 4.2, a template P can be written as

P = < l_P, E1, m_P, E2, r_P >

where E1 and E2 are respectively the event 1 type and event 2 type of template P, l_P is the vector representation of the 3 words to the left of E1, m_P is the vector representation of the words between E1 and E2, and r_P is the vector representation of the 3 words to the right of E2; for the similarity calculation between templates, write template 1 as P1 = < l1, E1, m1, E2, r1 > and template 2 as P2 = < l1', E1', m1', E2', r1' >; if the condition E1 = E1' && E2 = E2' is satisfied, i.e. the event 1 type E1 of template P1 is identical to the event 1 type E1' of template P2 and the event 2 type E2 of template P1 is identical to the event 2 type E2' of template P2, the similarity of template P1 and template P2 is calculated as

Sim(P1, P2) = μ1·sim(l1, l1') + μ2·sim(m1, m1') + μ3·sim(r1, r1')

where μ1, μ2, μ3 are weights; because the middle vectors have the largest influence on the result of the similarity calculation between templates, the weights can be set so that μ2 > μ1 = μ3; if the condition E1 = E1' && E2 = E2' is not satisfied, the similarity of template P1 and template P2 is recorded as 0;
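The template similarity of step 4.3 can be sketched as follows, assuming sim(x, y) is cosine similarity between the corresponding < left >/< middle >/< right > vectors (the concrete weight values and example vectors are hypothetical):

```python
import math

# Sketch of the template similarity in step 4.3. Templates are 5-tuples
# (left, event1_type, middle, event2_type, right); similarity is a weighted
# sum of cosine similarities, gated on matching event types. The weights
# satisfy mu2 > mu1 = mu3, reflecting the larger influence of the middle
# vector; the specific values are hypothetical.

MU1, MU2, MU3 = 0.25, 0.5, 0.25

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def template_similarity(p1, p2):
    l1, e1, m1, e2, r1 = p1
    l2, f1, m2, f2, r2 = p2
    if e1 != f1 or e2 != f2:   # event types must match, otherwise similarity is 0
        return 0.0
    return MU1 * cosine(l1, l2) + MU2 * cosine(m1, m2) + MU3 * cosine(r1, r2)

p1 = ([1.0, 0.0], "A", [0.0, 1.0], "B", [1.0, 1.0])
p2 = ([1.0, 0.0], "A", [0.0, 1.0], "B", [1.0, 1.0])
print(template_similarity(p1, p2))  # identical templates -> mu1 + mu2 + mu3
```

Templates scoring above the 0.7 threshold would then be merged by averaging their vectors.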
step 4.4, first scanning the original text with the event recognition model trained in step 3 to identify the event types contained in the text, then matching the original text against the templates in the rule base generated in step 4.3, and converting each matched text into the five-tuple form of a template;
step 4.5, calculating the similarity between the new templates generated in step 4.4 and the templates in the rule base, discarding templates whose similarity is smaller than the threshold 0.7, and adding the events in templates whose similarity is greater than the threshold 0.7 to the event relation table;
step 4.6, repeatedly executing the steps 4.2-4.5 until the original text processing is finished;
step 5, preprocessing the rail transit design specification event relation extraction training set generated in step 4, dividing the texts in the training set into event pairs;
step 6, training a relation recognition model on the text processed in step 5 using the BERT-BiLSTM-ATTENTION-SOFTMAX algorithm; in step 6, the whole model consists of four parts: a BERT layer, a BiLSTM layer, an ATTENTION layer and a SOFTMAX layer; the BERT pre-training model obtains word vectors containing specification context feature information, the BiLSTM layer performs feature extraction, the ATTENTION layer uses the sequence information of the whole text to calculate attention probabilities that highlight the importance of key words in the text, and the SOFTMAX layer generates the probabilities of the relation categories, the category with the largest probability being taken as the model's prediction;
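The attention pooling and softmax classification of step 6 can be sketched in plain Python; the token vectors and scores below are hypothetical stand-ins for BiLSTM outputs:

```python
import math

# Sketch of the ATTENTION and SOFTMAX layers in step 6: attention turns
# per-token scores into probabilities that weight the token features, and
# softmax turns class scores into relation-category probabilities. All
# numbers here are hypothetical illustrations.

def softmax(scores):
    m = max(scores)                       # subtract max for numeric stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_pool(token_vectors, token_scores):
    """Weight token vectors by attention probabilities and sum them."""
    weights = softmax(token_scores)
    dim = len(token_vectors[0])
    return [sum(w * v[d] for w, v in zip(weights, token_vectors)) for d in range(dim)]

tokens = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]   # hypothetical BiLSTM outputs
scores = [0.1, 2.0, 0.3]                        # the key word receives the high score
pooled = attention_pool(tokens, scores)
class_probs = softmax([1.2, 0.3, -0.5])         # hypothetical relation-class scores
prediction = max(range(len(class_probs)), key=lambda i: class_probs[i])
print(pooled, prediction)
```

The pooled vector would normally feed a linear layer before the final softmax; that projection is omitted here for brevity.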
step 7, preprocessing the rail transit design specification, and dividing the rail transit design specification by specification items;
step 8, inputting the track traffic specification text preprocessed in the step 7 into the event recognition model generated in the step 3, and extracting an event in the specification, wherein the event comprises an event trigger word and an event element;
step 9, unifying the events identified in the step 8;
step 10, storing the event identified in the step 9 into an event database;
step 11, storing the event identified in step 9 into a graph database in the form of a triplet of event element-relationship-event trigger word;
step 12, taking out the events from the event database generated in the step 10, forming event pairs, inputting the event pairs into the event relation recognition model generated in the step 6, and extracting the relation among the events in the specification;
step 13, storing the event pairs from step 10 and the event relations extracted in step 12 into the graph database in the form of event trigger word-relation-event trigger word triples.
2. The method for constructing a rail transit event knowledge graph based on deep learning according to claim 1, wherein in step 9, texts referring to the same event exist in the specification text; to avoid creating a large amount of redundant information in the event database, an event unification algorithm is adopted, the specific steps of which are as follows:
step 9.1, training a word2vec model by utilizing a track traffic original text;
step 9.2, inputting the rail transit events into the word2vec model generated in step 9.1 to generate event vectors;
step 9.3, calculating the similarity between events using the cosine function value of their event vectors, and gathering events whose similarity value is greater than 0.8 into one class; for two event vectors A and B, the cosine function is:

cos(A, B) = ( Σ_i A_i·B_i ) / ( sqrt(Σ_i A_i²) · sqrt(Σ_i B_i²) )
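A minimal sketch of the cosine-similarity clustering in step 9.3; the event vectors below are hypothetical stand-ins for word2vec outputs, and the greedy single-pass strategy is one simple way to realize the "similarity > 0.8 into one class" rule:

```python
import math

# Events whose cosine similarity exceeds 0.8 are grouped into one class
# (step 9.3). The vectors are hypothetical word2vec-style event vectors.

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

def cluster_events(events, threshold=0.8):
    """Greedy single-pass clustering: attach each event to the first cluster
    whose representative is similar enough, otherwise start a new cluster."""
    clusters = []
    for name, vec in events:
        for cluster in clusters:
            if cosine_similarity(cluster[0][1], vec) > threshold:
                cluster.append((name, vec))
                break
        else:
            clusters.append([(name, vec)])
    return clusters

events = [("e1", [1.0, 0.1]), ("e2", [0.9, 0.2]), ("e3", [0.0, 1.0])]
print([len(c) for c in cluster_events(events)])  # e1 and e2 merge; e3 stands alone
```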
step 9.4, for the new events generated in step 9.3, forming all pairwise combinations of the events and calculating the similarity between each event pair;
step 9.5, inputting the event pairs and the similarity between them into a trained logistic regression classification model to judge whether the events are similar; the logistic regression mathematical model is:

h_θ(x) = 1 / (1 + e^(−θ^T x))
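The logistic regression decision of step 9.5 can be sketched as follows; the weight vector theta is a hypothetical example, as in practice it comes from the trained classifier:

```python
import math

# Sketch of the logistic regression classifier in step 9.5: the features x
# (here a bias term plus the pairwise similarity) are mapped through the
# sigmoid h(x) = 1 / (1 + exp(-theta . x)); a probability of at least 0.5
# is read as "the two events are similar". The weights theta are hypothetical.

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict_similar(features, theta):
    z = sum(t * x for t, x in zip(theta, features))
    return sigmoid(z) >= 0.5

theta = [-4.0, 8.0]                        # hypothetical bias and similarity weights
print(predict_similar([1.0, 0.9], theta))  # high similarity -> similar
print(predict_similar([1.0, 0.2], theta))  # low similarity  -> dissimilar
```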
step 9.6, according to the classification result of step 9.5, if two events are judged similar, one of them is discarded; if they are dissimilar, both events are stored.
CN202010365826.3A 2020-04-30 2020-04-30 Rail transit event knowledge graph construction method based on deep learning Active CN111597350B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010365826.3A CN111597350B (en) 2020-04-30 2020-04-30 Rail transit event knowledge graph construction method based on deep learning

Publications (2)

Publication Number Publication Date
CN111597350A CN111597350A (en) 2020-08-28
CN111597350B true CN111597350B (en) 2023-06-02

Family

ID=72186939

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010365826.3A Active CN111597350B (en) 2020-04-30 2020-04-30 Rail transit event knowledge graph construction method based on deep learning

Country Status (1)

Country Link
CN (1) CN111597350B (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112131401B (en) * 2020-09-14 2024-02-13 腾讯科技(深圳)有限公司 Concept knowledge graph construction method and device
CN112733874B (en) * 2020-10-23 2023-04-07 招商局重庆交通科研设计院有限公司 Suspicious vehicle discrimination method based on knowledge graph reasoning
CN112418696B (en) * 2020-11-27 2024-06-18 北京工业大学 Construction method and device of urban traffic dynamic knowledge graph
CN112463989A (en) * 2020-12-11 2021-03-09 交控科技股份有限公司 Knowledge graph-based information acquisition method and system
CN112800762A (en) * 2021-01-25 2021-05-14 上海犀语科技有限公司 Element content extraction method for processing text with format style
CN113268591B (en) * 2021-04-17 2022-11-01 中国人民解放军战略支援部队信息工程大学 Air target intention evidence judging method and system based on affair atlas
CN113535979A (en) * 2021-07-14 2021-10-22 中国地质大学(北京) Method and system for constructing knowledge graph in mineral field
CN113546426B (en) * 2021-07-21 2023-08-22 西安理工大学 Security policy generation method for data access event in game service
CN115269931B (en) * 2022-09-28 2022-11-29 深圳技术大学 Rail transit station data map system based on service drive and construction method thereof

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107729309B (en) * 2016-08-11 2022-11-08 中兴通讯股份有限公司 Deep learning-based Chinese semantic analysis method and device
CN107908671B (en) * 2017-10-25 2022-02-01 南京擎盾信息科技有限公司 Knowledge graph construction method and system based on legal data
CN110633409B (en) * 2018-06-20 2023-06-09 上海财经大学 Automobile news event extraction method integrating rules and deep learning

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant