CN113312501A - Construction method and device of safety knowledge self-service query system based on knowledge graph - Google Patents

Construction method and device of safety knowledge self-service query system based on knowledge graph Download PDF

Info

Publication number
CN113312501A
CN113312501A CN202110725884.7A CN202110725884A CN113312501A CN 113312501 A CN113312501 A CN 113312501A CN 202110725884 A CN202110725884 A CN 202110725884A CN 113312501 A CN113312501 A CN 113312501A
Authority
CN
China
Prior art keywords
knowledge
self
query
entity
construction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110725884.7A
Other languages
Chinese (zh)
Inventor
苏成
宋建炜
邓逸川
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sino Singapore International Joint Research Institute
Original Assignee
Sino Singapore International Joint Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sino Singapore International Joint Research Institute filed Critical Sino Singapore International Joint Research Institute
Priority to CN202110725884.7A priority Critical patent/CN113312501A/en
Publication of CN113312501A publication Critical patent/CN113312501A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295Named entity recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/08Construction

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Data Mining & Analysis (AREA)
  • Business, Economics & Management (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Primary Health Care (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Biomedical Technology (AREA)
  • Evolutionary Computation (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Computing Systems (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Animal Behavior & Ethology (AREA)
  • Databases & Information Systems (AREA)
  • Biophysics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a construction method of a safety knowledge self-service inquiry system based on a knowledge graph, which comprises the following steps: collecting and preprocessing safety knowledge data, and establishing a corpus for entity extraction and relationship extraction; extracting entities by adopting a BERT-BilSTM-CRF algorithm, extracting relations by adopting a BERT-CNN algorithm, generating triple data, storing the triple data in a graph database and constructing a knowledge graph; constructing a training sample set based on the sample query sentence, and training a naive Bayes classifier; generating a Cypher query statement based on natural language processing and a naive Bayesian classification algorithm, and constructing a self-service query end; and configuring the knowledge graph and the self-service query terminal to the mobile intelligent terminal to construct a self-service query system. The query system can be used for self-service query at any time and any place, is not limited by geographical conditions and network conditions, and can be used for accurately querying answers of problems, so that the low efficiency and the complexity of looking up construction specifications and searching webpages are avoided.

Description

Construction method and device of safety knowledge self-service query system based on knowledge graph
Technical Field
The invention relates to the field of safety management of construction sites, in particular to a construction method and a device of a safety knowledge self-service query system based on a knowledge graph.
Background
The engineering project construction is a comprehensive production activity of multiple categories, and has the characteristics of long construction period and multiple uncertain factors in the construction process. In recent years, although the construction safety situation of China is better, various safety accidents happen sometimes, the construction safety problem is not ignored, and the construction safety management level needs to be further improved. At present, construction safety specifications and construction safety accident reports are accumulated continuously, most of massive construction site safety knowledge is mainly stored in construction safety specification documents and accident safety reports of a website of a building department, most of the documents are paper documents or fragmented electronic information, traditional construction safety management cannot fully utilize unstructured information, the search and the query of the paper construction specifications or traditional webpage are very inefficient and tedious, and the self-service query cannot be completed.
Google formally proposed the concept of knowledge graph in 2012, and its original intention is to improve the search capability of search engine, improve the search quality and search experience of users. The earliest knowledge maps were the knowledge base used by Google to enhance its search engine functionality, and knowledge maps have now been used to refer broadly to a variety of large-scale knowledge bases. In essence, a knowledge graph is a knowledge base of a semantic network, and in brief, a knowledge graph is a multi-relation graph, in which "entities" are usually used to express "nodes" in the graph and "relations" are usually used to express "edges" in the graph. An entity refers to something in the real world, and a relationship is used to express some kind of connection between different entities. The inventor thinks that applying the knowledge map to the construction of the construction safety knowledge self-service query system can make efficient and accurate self-service query possible.
Disclosure of Invention
The invention aims to overcome the defects and shortcomings in the prior art and provides a construction method of a safety knowledge self-service inquiry system based on a knowledge graph. The self-service query system overcomes the defects and improves the practicability by the aid of the self-service query system, and the self-service query system is high in zero fragmentation information and variable factors in the civil industry, construction safety knowledge is mainly stored in construction safety standard documents and accident safety reports of websites of a residential building department, most of the construction safety standard documents are paper documents or partial electronic information, and the non-structural documents cause low query efficiency. The invention further aims to overcome the defects and shortcomings in the prior art and provide a construction device of a safety knowledge self-service inquiry system based on a knowledge graph.
The first purpose of the invention can be achieved by adopting the following technical scheme:
a construction method of a self-service inquiry system of safety knowledge based on a knowledge graph is applied to the configuration of an intelligent mobile terminal, and comprises the following steps:
s1, collecting construction safety knowledge data, preprocessing the data, and constructing a corpus for entity extraction and relationship extraction based on the preprocessed safety knowledge data and application data labeling software. And marking entity category labels on entities in the text by adopting a marking tool YEDDA, and marking corresponding labels on the entities in the text and the relations between the entities by manually marking the corpora extracted by the relations. Generally, the collected security data cannot be directly used by subsequent operations, and thus a preprocessing operation is required. The safety knowledge mainly comes from construction safety specification documents, construction safety accident reports and accident safety reports of a website of a department of housing construction, if the unstructured safety knowledge is paper documents, the unstructured safety knowledge is firstly converted into electronic data, and then TXT (text based translation) processing is carried out.
S2, calling a BERT-BilSTM-CRF algorithm to extract a construction safety knowledge entity;
entity extraction, also known as named entity learning or named entity recognition, refers to the automatic identification of entities from a corpus. The BERT-BilSTM-CRF algorithm comprises a BERT model, a BilSTM model and a CRF model. The text to be processed sequentially passes through the three models, firstly the text to be processed passes through a BERT model to obtain a corresponding dynamic word vector, then the dynamic word vector is input into a BilSTM module for further processing, then a CRF module is used for decoding the output result of the BilSTM module to obtain a prediction labeling sequence, and finally each entity in the sequence is extracted and classified, so that the whole process of entity extraction is completed. The text to be processed comes from a corpus, and the concrete operation steps of entity extraction are as follows:
s21, outputting the corresponding dynamic word vector by the text to be processed through a BERT model:
the BERT model is a pre-training language model based on a Transform encoder, the Transform encoder is composed of a Self-Attention mechanism part (Self-Attention), a summation and normalization part (Add & Nomal) and a Feed-Forward neural Network part (Feed-Forward Network), and the most core Self-Attention mechanism of the Transform encoder can be expressed as:
Figure BDA0003138611460000031
where matrices Q, K and V are input word vectors; dkTo input vector dimensions, QKTMeaning that the semantic relation between the input word vectors is calculated, softmax (·) denotes a normalization function.
S2.2, outputting a corresponding word label score vector by the dynamic word vector through a BilSTM model:
the BilSTM model consists of two Long Short Term Memory (LSTM) (Long Short Term memory) models, and the LSTM model is a Recurrent Neural network (RNN-work) model with a gate control unit. Word vectors generated by the BERT pre-training language model are respectively input into positive-sequence LSTM units and negative-sequence LSTM units according to the sequence from left to right and from right to left, and three gate control units in the LSTM can selectively memorize and transmit information transmitted by the previous node, so that the BilTM model can comprehensively output word label score vectors by considering information in two directions. The calculation process for the LSTM model is as follows:
Figure BDA0003138611460000041
Figure BDA0003138611460000042
Figure BDA0003138611460000043
Figure BDA0003138611460000044
Figure BDA0003138611460000045
Figure BDA0003138611460000046
wherein sigma is a sigmoid activation function, x is a word embedding vector, and i, f and o respectively represent an input gate, a forgetting gate and an output gate; omega is the weight matrix of each control gate in different states,
Figure BDA0003138611460000047
a weight matrix representing the input gate,
Figure BDA0003138611460000048
a weight matrix representing a forgetting gate,
Figure BDA0003138611460000049
a weight matrix representing the output gates,
Figure BDA00031386114600000410
a weight matrix representing alternate value layers; b the offset vector of each control gate in different states,
Figure BDA00031386114600000411
representing the offset vector of the input gate,
Figure BDA00031386114600000412
a bias vector representing a forgetting gate,
Figure BDA00031386114600000413
a bias vector representing the output gate is shown,
Figure BDA00031386114600000414
is the offset vector of the alternate value layer; h is the output, htIs the output at time t, ht-1Is the output at time t-1;
Figure BDA00031386114600000415
a transition matrix representing the state of the old time to the state of the new time, ctIndicating the state of the memory cell at time t; tanh (. cndot.) is a pitch function with an output of (-1, 1).
S2.3, outputting an optimal label sequence by the word label score vector through a CRF model:
a CRF model is a Markov random field, meaning that given a random variable X, a Markov random field of a random variable Y, that is, given an input observed sequence, can output a predicted sequence of maximum probability. In the entity extraction, the CRF model considers the constraint relation among the word labels, calculates the occurrence probability of different label sequences by using the score of each word label and the transfer score among the word labels, and selects the sequence with the maximum occurrence probability as the label sequence of the text. The model can be expressed as:
Figure BDA00031386114600000416
wherein z and y are each independentlyRepresenting an input sentence and a sequence of output tags;
Figure BDA0003138611460000051
indicates that the jth word is discriminated as a label yjThe probability of (a) of (b) being,
Figure BDA0003138611460000052
representing all possible conversions to label yjN is the sequence length;
for each possible label combination situation, the required conditional probability is obtained after normalization, and the calculation process is as follows:
Figure BDA0003138611460000053
wherein, ytrueRepresents the true label value, P (y | x) represents the probability that x's predicted label is y;
taking a log-likelihood function for P (y | x) to obtain an objective function to be optimized, wherein the likelihood function calculation process of the correct label sequence is shown as the following formula:
Figure BDA0003138611460000054
the group of sequences with the highest probability is output as the optimal tag sequence, which can be expressed as:
Figure BDA0003138611460000055
and extracting and classifying each entity in the optimal label sequence, thereby finishing entity extraction.
S3, calling a BERT-CNN algorithm to perform construction safety knowledge relation extraction and construction safety knowledge triple extraction:
the BERT-CNN algorithm comprises a BERT model and a CNN model, and the text to be processed sequentially passes through the BERT model and the CNN model to realize the extraction of the relationship. Firstly, a text to be processed is subjected to a BERT model to obtain a corresponding dynamic word vector and an entity position vector, the entity position vector represents the position of an entity in a sentence, then the dynamic word vector and the entity position vector are input into a CNN model for further processing, a convolution layer of the CNN extracts sentence-level features, a compressed feature vector representation is obtained by using a pooling layer, and finally the feature vector classifies the relation expressed by the sentence through a fully-connected neural network layer of the CNN, so that entity relation extraction is realized. CNN is a neural network specially used for processing data having a grid-like structure, for example, for processing time series data, which can be regarded as a one-dimensional grid formed by regularly sampling on a time axis, and image data, which can be regarded as a two-dimensional pixel grid. The text to be processed comes from a corpus, and the specific operation steps of relation extraction are as follows:
s3.1, outputting a corresponding dynamic word vector by the text to be processed through a BERT model;
and S3.2, inputting the word vector and the position vector representing the position of the entity into the CNN model to realize the extraction of the relation. The sequence of the first character of the entity in the text is coded into a vector, namely the vector is the position vector of the entity, and the purpose of doing so is to improve the accuracy of relation extraction. The convolution operation of CNN is expressed as:
Figure BDA0003138611460000061
wherein, S (k, l) represents a value of the output characteristic surface function S at (k, l), i.e., a result value of the convolution operation; k (p, q) represents the value of the convolution kernel K at (p, q); i (k-p, l-q) represents the value of the input characteristic surface function I at (k-p, l-q); m and n represent the width and height of the convolution kernel, respectively.
S4, storing the triad into a database:
the entity and relationship data are processed into triple data of the type "entity → relationship → entity" and the triples are stored into a database. A knowledge graph is a typical graph structure knowledge base, and a triplet is a general representation of the knowledge graph, and when the fact is expressed by a triplet such as (entity 1, relationship, entity 2), (entity, attribute value), a graph database is particularly suitable to be selected as a storage medium, such as Neo4j, FlockDB, GraphDB, and the like. Neo4j is the most widely used software of database at present, so that the research of subsequent construction safety knowledge inquiry is carried out based on the Neo4j database.
S5, repeating the steps S2 to S4 to complete the construction of the knowledge graph:
a large amount of triple knowledge can be obtained by processing texts in the database, and the triple data is stored into a database Neo4j by applying a load csv method, so that construction of a construction safety knowledge map is completed.
S6, constructing a self-service safety knowledge query terminal based on natural language processing and naive Bayesian classification algorithm, wherein the self-service security knowledge query terminal is constructed according to the following steps:
s6.1, constructing a training sample set, wherein the training sample set comprises problem sample sentences of different categories and classification labels thereof, and the problem sample sentences are problems which can be asked by a user and are related to construction safety knowledge or construction safety accidents, such as' what safety protection tools should be set at an elevator wellhead? "how many construction safety accidents occurred in 2018 of Guangdong province? "," # how casualties caused by a security accident? "and the like.
The training sample set has a large number of categories, and the questions of each category should include the most detailed questions, such as "? The form of this question may also be expressed as how many people died in the accident, how many workers died in the accident, etc. Firstly, determining the number of possible problem categories and labeling each category, wherein the label of the first category is 0, the label of the second category is 1, and so on; and then determining all possible questions of each category problem, and sorting all possible questions in the category into a training sample set of the category.
S6.2 training a naive Bayes classifier, comprising the following steps:
vectorizing sample sentences in the training sample set to obtain sample sentence vectors;
converting the sample statement vector into a data set RDD which can be operated in parallel;
training a classifier by using a naive Bayes classification algorithm based on a data set RDD, wherein naive Bayes is originated from a classical mathematical theory and has a solid mathematical basis and stable classification efficiency, training the classifier according to keywords and labels in question sentences in the data set RDD, classifying the question sentences according to the keywords in sample sentences by using the trained classifier, and outputting the labels corresponding to the question categories.
S6.3, constructing a query template:
applying a natural language processing toolkit HanLP to perform word segmentation on a query sentence input by a user;
converting the query sentence after word segmentation into a query sentence vector;
inputting the query statement vector into a trained naive Bayes classifier, and outputting a label corresponding to the query statement;
and constructing a Cypher query statement template based on the tags.
S6.4, generating a Cypher query statement:
and extracting entities in the query statement as query conditions, and completely filling the query template according to a format provided by the query template to generate the Cypher query statement.
S7, constructing a safety knowledge self-service inquiry system based on the knowledge graph:
the self-service query method comprises the steps that a safety knowledge map and a self-service query end are configured to a mobile intelligent terminal to construct a self-service query system, the self-service query system comprises a construction safety knowledge map and a self-service query end, the construction safety knowledge is stored in a triple form by the safety knowledge map, and the self-service query end queries in the safety knowledge map according to query sentences input by a user and returns an accurate query result.
Further, the safety knowledge in step S1 is from construction safety specification documents and construction safety accident reports, and the safety data is electronic when paper data is used.
Further, the entity labeling in step S1 may also be applied with other tools, such as Stanza of the NLP group of stanford university, paddlelac of hundred degrees, LTP of haar size.
Further, in the step S4, a graph database Neo4j is used to store the triple knowledge.
The other purpose of the invention can be achieved by adopting the following technical scheme:
a construction device of a self-service knowledge inquiry system based on a knowledge graph is used for configuring a mobile terminal, and comprises the following components:
the corpus construction module is used for collecting construction safety knowledge data and constructing a corpus for entity extraction and relation extraction based on safety knowledge data application data labeling software;
the entity extraction module is used for calling a BERT-BilSTM-CRF algorithm to extract the construction safety knowledge entity;
the relation extraction module is used for calling a BERT-CNN algorithm to extract the construction safety knowledge relation;
the triple data generation module is used for processing the entity and the relation data into triple data of an entity → relation → entity type and storing the triple into the database;
the knowledge graph construction module completes construction of the safety knowledge graph by repeatedly calling the entity extraction module, the relation extraction module and the triple data generation module;
the self-service query end construction module generates a Cypher query statement based on natural language processing and a naive Bayesian classification algorithm, and constructs a safety knowledge self-service query end;
and the self-service query system construction module is used for configuring the safety knowledge map and the self-service query end to the mobile intelligent terminal to construct the self-service query system.
Compared with the prior art, the invention has the following advantages and effects:
the invention provides a construction method of a safety knowledge self-service query system based on a knowledge graph. By configuring the self-service query system containing the safety knowledge map constructed by the invention on the intelligent mobile terminal, construction site managers and constructors can perform self-service query on construction safety knowledge anytime and anywhere without the limitation of construction site geographical position and network condition and looking up a large amount of paper document data in a data room, and can efficiently query required safety knowledge even in remote mountainous areas without networks, so that the safety management level of the construction site managers and the constructors can be improved, the safety accident rate is effectively reduced, and the self-service query system has better significance for improving the construction safety management level of the whole building industry; in addition, different from the traditional method of returning a webpage, the self-service query system can accurately query the query result of the corresponding problem, and effectively avoids the low efficiency and the complexity of looking up the construction specification and searching the webpage.
Drawings
FIG. 1 is a flow chart of a construction method of a safety knowledge self-service inquiry system based on knowledge graph disclosed by the invention;
FIG. 2 is a schematic diagram of a query flow when the self-service query system of security knowledge is applied according to an embodiment of the present invention;
FIG. 3 is a schematic diagram illustrating an example of a query when the security knowledge self-service query system is applied according to an embodiment of the present invention;
FIG. 4 is a flow chart of a prior art BERT-BilSTM-CRF algorithm.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Example one
The self-service inquiry system of the safety knowledge based on the knowledge graph comprises the safety knowledge graph and a self-service inquiry end of the safety knowledge, wherein the safety knowledge graph stores construction safety knowledge in the graph in a triple structure, and the self-service inquiry end of the safety knowledge can inquire in the safety knowledge graph according to inquiry sentences input by a user and return accurate inquiry results.
As shown in fig. 1, a construction safety knowledge self-service query system construction method based on a knowledge graph includes the following steps:
s1, constructing a safety knowledge corpus:
collecting construction safety knowledge data, such as ' high-altitude construction work safety technical specification ' JGJ80-2016 ', and textualizing the data to obtain construction safety knowledge data, labeling the data by using data labeling software, constructing a construction safety knowledge corpus for entity extraction and relationship extraction, and completing labeling of the categories of entities in the data and the relationships among the entities, wherein in the embodiment, a labeling tool ' YEDDA ' is used for labeling the entities in the text, and the linguistic data for relationship extraction is manually labeled to the entities in the text and the relationships among the entities by using corresponding labels.
S2 extracting entities based on BERT-BilSTM-CRF algorithm;
as shown in FIG. 4, the BERT-BilSTM-CRF algorithm comprises a BERT model, a BilSTM model and a CRF model, and texts to be processed from a construction safety knowledge corpus sequentially pass through the three models to realize entity extraction.
Taking the text "the elevator well head should be provided with the protective door" to be processed as an example, the following operations are specifically performed:
s2.1 the text to be processed is processed by the BERT model and the corresponding dynamic word vector is output
The BERT model is a pre-training language model based on a Transform encoder, and the most core self-attention mechanism of the Transform encoder can be expressed as:
Figure BDA0003138611460000111
where the matrix Q, K, V is an input word vector; dkFor the input vector dimension, divide by dkIs to obtain a more stable gradient, QKTThe semantic relation between input word vectors can be calculated, and the result is multiplied by a matrix V after being processed by a normalization function softmax (·), so that irrelevant words can be removed while the integrity of important words is kept; at this time, each word representation includes information of other words in the sentence, and the dynamic word vector in this embodiment is a 256-dimensional vector, which is more global than the conventional word vector representation.
S2.2, the dynamic word vector is processed by a BilSTM model, and a word tag score vector is output:
the BilSTM model consists of two LSTM models, the LSTM model is a recurrent neural network RNN model with a gate control unit, and the calculation process of the LSTM network is as follows:
Figure BDA0003138611460000112
Figure BDA0003138611460000113
Figure BDA0003138611460000114
Figure BDA0003138611460000115
Figure BDA0003138611460000116
Figure BDA0003138611460000117
wherein, sigma is sigmoid activation function, x is word embedding vector, i, f, o are respectively tableAn input display door, a forgetting door and an output door; omega is the weight matrix of each control gate in different states,
Figure BDA0003138611460000118
a weight matrix representing the input gate,
Figure BDA0003138611460000119
a weight matrix representing a forgetting gate,
Figure BDA00031386114600001110
a weight matrix representing the output gates,
Figure BDA00031386114600001111
a weight matrix representing alternate value layers; b the offset vector of each control gate in different states,
Figure BDA00031386114600001112
representing the offset vector of the input gate,
Figure BDA00031386114600001113
a bias vector representing a forgetting gate,
Figure BDA00031386114600001114
a bias vector representing the output gate is shown,
Figure BDA0003138611460000121
is the offset vector of the alternate value layer; h is the output, htIs the output at time t, ht-1Is the output at time t-1;
Figure BDA0003138611460000122
a transition matrix representing the state of the old time to the state of the new time, ctIndicating the state of the memory cell at time t; tanh (-) is a pitch function that adjusts the output value between-1 and 1.
The word vectors generated by the BERT model are respectively input into positive-sequence LSTM units and negative-sequence LSTM units according to the sequence from left to right and from right to left, and three gate control units in the LSTM can selectively memorize and transmit information transmitted by a previous node, so that the BilTM model can comprehensively output word tag score vectors by considering information in two directions.
S2.3, processing the word label score vector through a CRF model, and outputting an optimal label sequence of the text:
the model of the CRF model can be expressed as:
Figure BDA0003138611460000123
wherein z and y represent the input sentence and the output tag sequence, respectively;
Figure BDA0003138611460000124
indicates that the jth word is discriminated as a label yjThe probability of (a) of (b) being,
Figure BDA0003138611460000125
representing all possible conversions to label yjN is the sequence length;
for each possible label combination situation, the required conditional probability is obtained after normalization, and the calculation process is as follows:
Figure BDA0003138611460000126
wherein y istrueRepresents the true label value, P (y | x) represents the probability that x's predicted label is y;
taking a log-likelihood function for P (y | x), the likelihood function calculation process is shown as follows:
Figure BDA0003138611460000127
the group of sequences with the highest probability is output as the optimal tag sequence, which can be expressed as:
y*=argmaxs(z,ytrue)
and in the entity extraction, the CRF model considers the constraint relation among the word labels, calculates the occurrence probability of different label sequences by utilizing the score of each word label and the transfer score among the word labels, and selects the sequence with the maximum occurrence probability as the label sequence of the text. And extracting the entity elevator well mouth and the protection door from the text to be processed, namely the protection door which is required to be arranged at the elevator well mouth.
S3 carries out entity relation extraction based on BERT-CNN algorithm:
the text to be processed firstly outputs corresponding dynamic word vectors through a BERT model, and then the word vectors and position vectors representing the entity positions are input into a convolutional neural network CNN, so as to realize entity relationship extraction, wherein the relationship extraction specifically comprises the following steps:
s3.1, outputting the corresponding dynamic word vector through the BERT model for the text to be processed.
S3.2, inputting the word vector and the position vector representing the entity position into a Convolutional Neural Network (CNN), wherein the convolution operation of the CNN is represented as:
Figure BDA0003138611460000131
wherein, S (k, l) represents a value of the output characteristic surface function S at (k, l), i.e., a result value of the convolution operation; k (p, q) represents the value of the convolution kernel K at (p, q); i (k-p, l-q) represents the value of the input characteristic surface function I at (k-p, l-q); m and n represent the width and height of the convolution kernel, respectively.
The text to be processed, namely 'the elevator well mouth should be provided with the protective door', extracts the relation of 'setting' through the operation.
And S4, forming a triple according to the extracted entities and the extracted relations, wherein the triple formed by the text to be processed, namely the elevator well → the arrangement → the protective door, is stored in the graph database Neo4 j.
S5, repeating the steps S2 to S4 to complete the construction of the knowledge graph;
s6, constructing a self-service safety knowledge query terminal based on natural language processing and naive Bayesian classification algorithm, wherein the self-service security knowledge query terminal is constructed according to the following steps:
s6.1 construction of training sample set
According to the data characteristics of the knowledge graph in the construction safety field, a training sample set is constructed, the training sample set comprises sample sentences and classification labels thereof, a user query problem training sample set which is as comprehensive as possible is constructed, and each type of different problems in the data set is labeled. The sample sentence is a question about construction safety knowledge or construction safety accident that a user may ask, such as "what safety protection tool should be set at the elevator shaft mouth? "how many construction safety accidents occurred in 2018 of Guangdong province? "," # how casualties caused by a security accident? ".
The training sample set has a large number of categories, and the questions of each category should include the most detailed questions, such as "? The form of this question may also be expressed as how many people died in the accident, how many workers died in the accident, etc. We first need to determine the number of possible problem categories and label each category with a first category label of 0, a second category label of 1, and so on. The invention defines 12 types of problems according to the corpus content, the label number is from 0 to 11, and the keywords in the 12 types of problems are respectively: 0: accident time, 1: accident site, 2: casualty, 3: economic loss of accident, 4: accident cause, 5: accident resolution, 6: project construction unit in which accident occurs, 7: project construction unit of accident, 8: project supervision unit of accident, 9: accident type, 10: construction safety tool, 11: dangerous positions where safety accidents are likely to occur; then all possible question methods of each category problem are determined, all possible question sentences under the category are arranged into a training sample set of the category, so that the training sample sets of 12 categories are arranged in total, and a naive Bayes classifier is trained according to the large training sample set.
S6.2, segmenting words of sample sentences in the sample set into vectors, and training a naive Bayes classifier:
the sample sentences in the training sample set are natural sentences which cannot be directly read by a computer, the natural sentences need to be converted into vectors to obtain sample sentence vectors, and the vector conversion is carried out according to the following method: firstly, constructing a vocabulary data set, wherein the data set comprises characteristic words in all question sentences; and for the sample sentences in the training set, extracting the characteristic words of the sample sentences after word segmentation, comparing the characteristic words with the characteristic words in the vocubulary data set, setting the words as 1 if the characteristic words exist in the vocubulary data set, and setting the words as 0 if the characteristic words do not exist in the vocubulary data set, and converting all the sentences in the training set into vectors according to the idea for training a naive Bayes classifier.
The training steps are as follows:
vectorizing sample sentences in the training sample set to obtain sample sentence vectors;
converting the sample statement vector into a data set RDD which can be operated in parallel;
and training a classifier by using a naive Bayes classification algorithm based on a training data set RDD.
S6.3, constructing a query template:
in the embodiment, firstly, a natural language processing toolkit HanLP is applied to perform word segmentation on a query sentence input by a user, then, the query sentence after word segmentation is converted into a query sentence vector, then, the query sentence vector is input into a trained naive bayesian classifier, a label corresponding to the query sentence is output, and finally, a Cypher query sentence template is obtained through automatic matching of the label.
Assuming that the question that the user inputs the query is "what should be set at the elevator shaft mouth", the query template is constructed as "what should be set at a certain adjacent position" through the above operations.
S6.4, generating Cypher query sentences based on the query template:
after the query template exists, Cypher sentences capable of executing the query in the Neo4j are generated, and the required construction safety knowledge can be queried in the Neo4j database. The entity in the query sentence is extracted as the query condition, the query template is completely filled according to the format provided by the query template, a Cypher query sentence is generated, and the flow of generating the query language is shown in FIG. 3.
Assuming that the problem that the user inputs to be queried is what the elevator wellhead should be set, in this embodiment, a Cypher query template obtained by matching the query template according to the input query problem is: "Macth (p: Position) - [ r: set ] - > (t: Tool) where p.title ═ return t.name"; then extracting a related entity 'elevator well mouth' in the query question as a query condition; and then filling the query condition 'elevator well mouth' into a Cypher query template to obtain a complete Cypher query statement: "mach (p: Position) - [ r: set ] - > (t: Tool) where p.title ═ elevator wellhead' return t.name"; and finally, executing Cypher query statements, wherein the query answer returned by the self-service query system is 'safety door'.
S7, constructing a safety knowledge self-service inquiry system based on the knowledge graph:
the self-service inquiry system of the safety knowledge comprises a knowledge map and a self-service inquiry end, wherein the construction safety knowledge is stored in the knowledge map in a triple structure, and the self-service inquiry end inquires in the knowledge map according to inquiry sentences input by a user and returns an accurate inquiry result.
The self-service query system constructed by the invention is configured on intelligent mobile terminals such as a smart phone and a tablet, can automatically query construction safety knowledge at any time and any place by applying the self-service query system, is not limited by geographical positions and network conditions of a construction site, does not need to look up a large amount of paper document data in a data room, can efficiently query required safety knowledge even in remote mountainous areas without networks, and is different from the traditional query method of returning a webpage.
In order to better understand the self-service knowledge query system based on the knowledge graph provided by the invention, the following description is provided with reference to fig. 2 for the query process of the self-service query system, and it should be noted that the self-service query process is only described for providing a visual understanding of the construction method of the self-service knowledge query system based on the knowledge graph provided by the invention, and the invention is not limited.
As shown in fig. 2, to perform security knowledge query, a user needs to input a question to be queried on a mobile terminal, for convenience of description, the question of the query input by the user is referred to as a query statement, and after receiving the query statement input by the user, the self-service query system performs a series of processing on the query statement, and finally outputs a query result: firstly, natural language input by a user is subjected to word segmentation and vectorization through a HanLP natural language processing toolkit, a trained naive Bayes classifier is adopted to classify natural language query sentences input by the user, then query targets of the query sentences of the user are obtained according to classification results of the query sentences and are matched with a query template, then relevant entities in the query sentences are extracted as query conditions, the query template is filled into complete Cypher query sentences, and required construction safety knowledge can be queried in a Neo4j database after the complete query sentences exist. And finally, the self-service query system returns the answer of the query statement according to the Cypher query statement.
Particularly, the knowledge graph is a typical graph structure knowledge base, a graph database is particularly suitable to be selected as a storage medium, Neo4j is a graph database software which is most widely applied at present, so that the embodiment is developed based on the Neo4j database, and other graph databases can be used as storage media in the follow-up research, such as FlockDB, GraphDB and the like. In addition, in this embodiment, the user query is a text input query, and a voice input mode is further adopted in the follow-up research to improve the use experience of the user.
Example two
A construction device of a safety knowledge self-service inquiry system based on a knowledge graph is used for configuring a mobile terminal and comprises a corpus construction module, an entity extraction module, a relation extraction module, a triple data generation module, a knowledge graph construction library module, a self-service inquiry terminal construction module and a self-service inquiry system construction module. The functions of the constituent modules are as follows:
the corpus construction module is used for collecting construction safety knowledge data and constructing a corpus for entity extraction and relation extraction based on safety knowledge data application data labeling software;
the entity extraction module is used for calling a BERT-BilSTM-CRF algorithm to extract the construction safety knowledge entity;
the relation extraction module is used for calling a BERT-CNN algorithm to extract the construction safety knowledge relation;
the triple data generation module is used for processing the entity and the relation data into triple data of an entity → relation → entity type and storing the triple into the database;
the knowledge graph construction module completes construction of the safety knowledge graph by repeatedly calling the entity extraction module, the relation extraction module and the triple data generation module;
the self-service query end construction module generates a Cypher query statement based on natural language processing and a naive Bayesian classification algorithm, and constructs a safety knowledge self-service query end;
and the self-service query system construction module is used for configuring the safety knowledge map and the self-service query end to the mobile intelligent terminal to construct the self-service query system.
The self-service inquiry system of the safety knowledge constructed by the invention comprises a knowledge map and a self-service inquiry end, wherein the knowledge map stores the construction safety knowledge in a triple structure, and the self-service inquiry end inquires in the knowledge map according to inquiry sentences input by a user and returns an accurate inquiry result. When the knowledge graph is constructed, a corpus construction module is applied to process safety knowledge data to be processed, and a corpus used for entity extraction and relation extraction is constructed; then, calling an entity extraction module to extract the entity and calling a relationship extraction module to extract the relationship; then the triple data generation module processes the entity and the relation data into triple data of the type of entity → relation → entity, and stores the triple data into a database; and finally, calling a knowledge graph construction module to complete construction of the whole knowledge graph. The self-service query system comprising the safety knowledge map and the self-service query end is configured on intelligent terminals such as a smart phone and a tablet, and the self-service query system can be used for self-service query of construction safety knowledge at any time and any place.
It should be noted that, in the above device embodiment, each included unit is only divided according to functional logic, but is not limited to the above division as long as the corresponding function can be achieved; in addition, specific names of the functional units are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the present invention.
In addition, it can be understood by those skilled in the art that all or part of the steps in the above embodiments may be implemented by a program to instruct related hardware, and the corresponding program may be stored in a computer-readable storage medium, such as a ROM/RAM, a magnetic disk, or an optical disk.
The above embodiments are preferred embodiments of the present invention, but the present invention is not limited to the above embodiments, and any other changes, modifications, substitutions, combinations, and simplifications which do not depart from the spirit and principle of the present invention should be construed as equivalents thereof, and all such changes, modifications, substitutions, combinations, and simplifications are intended to be included in the scope of the present invention.

Claims (8)

1. A construction method of a self-service inquiry system of safety knowledge based on a knowledge graph is applied to the configuration of an intelligent mobile terminal, and is characterized in that the construction method of the self-service inquiry system comprises the following steps:
s1, collecting construction safety knowledge data, and constructing a corpus for entity extraction and relationship extraction based on safety knowledge data application data labeling software;
s2, calling a BERT-BilSTM-CRF algorithm to extract a construction safety knowledge entity;
s3, calling a BERT-CNN algorithm to extract the construction safety knowledge relationship;
s4, processing the entity and the relation data into triple data of the type of entity → relation → entity, and storing the triple into a database;
s5, repeating the steps S2 to S4 to complete the construction of the knowledge graph;
s6, generating a Cypher query statement based on natural language processing and a naive Bayesian classification algorithm, and constructing a safety knowledge self-service query end;
and S7, configuring the security knowledge graph and the self-service inquiry terminal to the mobile intelligent terminal to construct a self-service inquiry system.
2. The method for constructing a knowledge-graph-based secure knowledge self-service query system according to claim 1, wherein the BERT-BiLSTM-CRF algorithm in step S2 includes a BERT model, a BiLSTM model and a CRF model, and the text to be processed sequentially passes through the three models to realize entity extraction, the text to be processed is from a corpus, and the entity extraction process is as follows:
s21, outputting a corresponding dynamic word vector by the text to be processed through a BERT model:
the BERT model is a pre-training language model based on a Transform encoder, the Transform encoder is composed of a self-attention mechanism part, a summation and normalization part and a feedforward neural network part, and the most core self-attention mechanism of the Transform encoder can be expressed as follows:
Figure FDA0003138611450000011
where matrices Q, K and V are input word vectors; dkTo input vector dimensions, QKTRepresenting the semantic relation between the input word vectors, and softmax ((-)) representing a normalization function;
s2.2, outputting a corresponding word label score vector by the dynamic word vector through a BilSTM model:
the BilSTM model consists of two LSTM models, and the calculation process of the LSTM models is as follows:
Figure FDA0003138611450000021
Figure FDA0003138611450000022
Figure FDA0003138611450000023
Figure FDA0003138611450000024
Figure FDA0003138611450000025
Figure FDA0003138611450000026
wherein sigma is a sigmoid activation function, x is a word embedding vector, and i, f and o respectively represent an input gate, a forgetting gate and an output gate; omega is the weight matrix of each control gate in different states,
Figure FDA0003138611450000027
a weight matrix representing the input gate,
Figure FDA0003138611450000028
a weight matrix representing a forgetting gate,
Figure FDA0003138611450000029
a weight matrix representing the output gates,
Figure FDA00031386114500000210
a weight matrix representing alternate value layers; b is the offset vector of each control gate in different states,
Figure FDA00031386114500000211
representing the offset vector of the input gate,
Figure FDA00031386114500000212
a bias vector representing a forgetting gate,
Figure FDA00031386114500000213
a bias vector representing the output gate is shown,
Figure FDA00031386114500000214
is the offset vector of the alternate value layer; h is the output, htIs the output at time t, ht-1Is the output at time t-1;
Figure FDA00031386114500000215
a transition matrix representing the state of the old time to the state of the new time, ctIndicating the state of the memory cell at time t; tanh (·) is a value-adjusting function, and the output value of the value-adjusting function is positioned in the interval (-1, 1);
s2.3, outputting an optimal label sequence by the word label score vector through a CRF model:
wherein the CRF model is a Markov random field, and is expressed as:
Figure FDA00031386114500000216
wherein z and y represent the input sentence and the output tag sequence, respectively;
Figure FDA0003138611450000031
indicates that the jth word is discriminated as a label yjThe probability of (a) of (b) being,
Figure FDA0003138611450000032
representing all possible conversions to label yjN is the sequence length;
for each possible label combination situation, the required conditional probability is obtained after normalization, and the calculation process is as follows:
Figure FDA0003138611450000033
wherein, ytrueRepresents the true label value, P (y | x) represents the probability that x's predicted label is y;
taking a log-likelihood function for P (y | x), the likelihood function calculation process is shown as follows:
Figure FDA0003138611450000034
the group of sequences with the highest probability is output as the optimal tag sequence, which can be expressed as:
Figure FDA0003138611450000035
and extracting and classifying each entity in the optimal label sequence, thereby finishing entity extraction.
3. The method for constructing the safe self-help knowledge inquiry system based on the knowledge graph of claim 1, wherein the relational extraction in the step S3 is performed according to the following steps:
s3.1, outputting a corresponding dynamic word vector and an entity position vector by the text to be processed through a BERT model, wherein the dynamic word vector is represented by the vector of each character, and the position vector represents the position vector of the entity in a sentence;
s3.2, inputting the dynamic word vector and the position vector into a CNN model to realize relation extraction, wherein the convolution operation of the CNN is expressed as:
Figure FDA0003138611450000036
wherein, S (K, l) represents the value of the output characteristic surface function S at (K, l), namely the value of the result of convolution operation, K (p, q) represents the value of the convolution kernel function K at (p, q), I (K-p, l-q) represents the value of the input characteristic surface function I at (K-p, l-q), and m and n represent the width and height of the convolution kernel respectively.
4. The construction method of the safe self-help knowledge inquiry system based on the knowledge graph of claim 1, wherein the self-help inquiry terminal in the step S6 is constructed according to the following steps:
s6.1, constructing a training sample set, wherein the training sample set comprises question sample sentences of different categories and classification labels thereof, and the question sample sentences are questions which are possibly asked by a user and are related to the category construction safety knowledge;
s6.2 training a naive Bayes classifier: the training steps are as follows:
vectorizing sample sentences in the training sample set to obtain sample sentence vectors;
converting the sample statement vector into a data set RDD which can be operated in parallel;
training a classifier by using a naive Bayes classification algorithm based on a data set RDD;
s6.3, constructing a query template:
applying a natural language processing toolkit HanLP to perform word segmentation on a query sentence input by a user;
converting the query sentence after word segmentation into a query sentence vector;
inputting the query statement vector into a trained naive Bayes classifier, and outputting a label corresponding to the query statement;
constructing a Cypher query statement template based on the tags;
s6.4, generating a Cypher query statement:
and extracting entities in the query statement as query conditions, and completely filling the query template to generate the Cypher query statement.
5. The method for constructing the self-service knowledge base knowledge inquiry system based on the knowledge graph of any one of claims 1 to 4, wherein the safety knowledge in the step S1 is from construction safety specification documents and construction safety accident reports, and the safety data is paper data which is to be electronized in advance.
6. The method for constructing the safe self-help knowledge query system based on the knowledge graph of any one of claims 1 to 4, wherein in the step S1, a labeling tool YEDDA is called to label entities of the corpus with entity categories.
7. The method for constructing a safe knowledge self-service inquiry system based on knowledge graph according to any one of claims 1 to 4, wherein in the step S4, the graph database Neo4j is adopted to store the triple knowledge.
8. A construction device of a self-service knowledge inquiry system based on knowledge graph based on any one of claims 1 to 4, which is used for configuring a mobile terminal, and the construction device of the self-service inquiry system comprises:
the corpus construction module is used for collecting construction safety knowledge data and constructing a corpus for entity extraction and relation extraction based on safety knowledge data application data labeling software;
the entity extraction module is used for calling a BERT-BilSTM-CRF algorithm to extract the construction safety knowledge entity;
the relation extraction module is used for calling a BERT-CNN algorithm to extract the construction safety knowledge relation;
the triple data generation module is used for processing the entity and the relation data into triple data of an entity → relation → entity type and storing the triple into the database;
the knowledge graph construction module completes construction of the safety knowledge graph by repeatedly calling the entity extraction module, the relation extraction module and the triple data generation module;
the self-service query end construction module generates a Cypher query statement based on natural language processing and a naive Bayesian classification algorithm, and constructs a safety knowledge self-service query end;
and the self-service query system construction module is used for configuring the safety knowledge map and the self-service query end to the mobile intelligent terminal to construct the self-service query system.
CN202110725884.7A 2021-06-29 2021-06-29 Construction method and device of safety knowledge self-service query system based on knowledge graph Pending CN113312501A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110725884.7A CN113312501A (en) 2021-06-29 2021-06-29 Construction method and device of safety knowledge self-service query system based on knowledge graph

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110725884.7A CN113312501A (en) 2021-06-29 2021-06-29 Construction method and device of safety knowledge self-service query system based on knowledge graph

Publications (1)

Publication Number Publication Date
CN113312501A true CN113312501A (en) 2021-08-27

Family

ID=77380730

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110725884.7A Pending CN113312501A (en) 2021-06-29 2021-06-29 Construction method and device of safety knowledge self-service query system based on knowledge graph

Country Status (1)

Country Link
CN (1) CN113312501A (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113449083A (en) * 2021-08-31 2021-09-28 深圳市信润富联数字科技有限公司 Operation safety management method, device, equipment and storage medium
CN113806554A (en) * 2021-09-14 2021-12-17 上海云思智慧信息技术有限公司 Knowledge graph construction method for massive conference texts
CN114218333A (en) * 2021-11-26 2022-03-22 西南交通大学 Geological knowledge map construction method and device, electronic equipment and storage medium
CN114238653A (en) * 2021-12-08 2022-03-25 华东师范大学 Method for establishing, complementing and intelligently asking and answering knowledge graph of programming education
CN114900346A (en) * 2022-04-28 2022-08-12 上海微波技术研究所(中国电子科技集团公司第五十研究所) Network security testing method and system based on knowledge graph
CN115129719A (en) * 2022-06-28 2022-09-30 深圳市规划和自然资源数据管理中心 Knowledge graph-based qualitative position space range construction method
CN115438141A (en) * 2022-11-08 2022-12-06 智慧眼科技股份有限公司 Information retrieval method based on knowledge graph model
CN115713307A (en) * 2022-11-17 2023-02-24 西安电子科技大学广州研究院 Intelligent operator responsibility determination method and device
CN116108000A (en) * 2023-04-14 2023-05-12 成都安哲斯生物医药科技有限公司 Medical data management query method
CN116340530A (en) * 2023-02-17 2023-06-27 江苏科技大学 Intelligent design method based on mechanical knowledge graph
CN116484010A (en) * 2023-03-15 2023-07-25 北京擎盾信息科技有限公司 Knowledge graph construction method and device, storage medium and electronic device
CN116562265A (en) * 2023-07-04 2023-08-08 南京航空航天大学 Information intelligent analysis method, system and storage medium
CN117093693A (en) * 2023-08-23 2023-11-21 北京深维智信科技有限公司 Intelligent question-answering method based on NLP
CN117520483A (en) * 2024-01-04 2024-02-06 北京奇虎科技有限公司 Information verification method and device based on large model
CN114238653B (en) * 2021-12-08 2024-05-24 华东师范大学 Method for constructing programming education knowledge graph, completing and intelligently asking and answering

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111737496A (en) * 2020-06-29 2020-10-02 东北电力大学 Power equipment fault knowledge map construction method
CN112131393A (en) * 2020-08-11 2020-12-25 淮阴工学院 Construction method of medical knowledge map question-answering system based on BERT and similarity algorithm
CN112597272A (en) * 2020-11-17 2021-04-02 北京计算机技术及应用研究所 Expert field knowledge graph query method based on natural language question
CN113010693A (en) * 2021-04-09 2021-06-22 大连民族大学 Intelligent knowledge graph question-answering method fusing pointer to generate network

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111737496A (en) * 2020-06-29 2020-10-02 东北电力大学 Power equipment fault knowledge map construction method
CN112131393A (en) * 2020-08-11 2020-12-25 淮阴工学院 Construction method of medical knowledge map question-answering system based on BERT and similarity algorithm
CN112597272A (en) * 2020-11-17 2021-04-02 北京计算机技术及应用研究所 Expert field knowledge graph query method based on natural language question
CN113010693A (en) * 2021-04-09 2021-06-22 大连民族大学 Intelligent knowledge graph question-answering method fusing pointer to generate network

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113449083A (en) * 2021-08-31 2021-09-28 深圳市信润富联数字科技有限公司 Operation safety management method, device, equipment and storage medium
CN113806554A (en) * 2021-09-14 2021-12-17 上海云思智慧信息技术有限公司 Knowledge graph construction method for massive conference texts
CN114218333A (en) * 2021-11-26 2022-03-22 西南交通大学 Geological knowledge map construction method and device, electronic equipment and storage medium
CN114238653A (en) * 2021-12-08 2022-03-25 华东师范大学 Method for establishing, complementing and intelligently asking and answering knowledge graph of programming education
CN114238653B (en) * 2021-12-08 2024-05-24 华东师范大学 Method for constructing programming education knowledge graph, completing and intelligently asking and answering
CN114900346B (en) * 2022-04-28 2023-09-19 上海微波技术研究所(中国电子科技集团公司第五十研究所) Network security testing method and system based on knowledge graph
CN114900346A (en) * 2022-04-28 2022-08-12 上海微波技术研究所(中国电子科技集团公司第五十研究所) Network security testing method and system based on knowledge graph
CN115129719A (en) * 2022-06-28 2022-09-30 深圳市规划和自然资源数据管理中心 Knowledge graph-based qualitative position space range construction method
CN115438141A (en) * 2022-11-08 2022-12-06 智慧眼科技股份有限公司 Information retrieval method based on knowledge graph model
CN115713307A (en) * 2022-11-17 2023-02-24 西安电子科技大学广州研究院 Intelligent operator responsibility determination method and device
CN115713307B (en) * 2022-11-17 2024-02-06 西安电子科技大学广州研究院 Intelligent responsibility fixing method and device for operators
CN116340530A (en) * 2023-02-17 2023-06-27 江苏科技大学 Intelligent design method based on mechanical knowledge graph
CN116484010A (en) * 2023-03-15 2023-07-25 北京擎盾信息科技有限公司 Knowledge graph construction method and device, storage medium and electronic device
CN116484010B (en) * 2023-03-15 2024-01-16 北京擎盾信息科技有限公司 Knowledge graph construction method and device, storage medium and electronic device
CN116108000B (en) * 2023-04-14 2023-06-20 成都安哲斯生物医药科技有限公司 Medical data management query method
CN116108000A (en) * 2023-04-14 2023-05-12 成都安哲斯生物医药科技有限公司 Medical data management query method
CN116562265A (en) * 2023-07-04 2023-08-08 南京航空航天大学 Information intelligent analysis method, system and storage medium
CN116562265B (en) * 2023-07-04 2023-12-01 南京航空航天大学 Information intelligent analysis method, system and storage medium
CN117093693A (en) * 2023-08-23 2023-11-21 北京深维智信科技有限公司 Intelligent question-answering method based on NLP
CN117093693B (en) * 2023-08-23 2024-05-07 北京深维智信科技有限公司 Intelligent question-answering method based on NLP
CN117520483A (en) * 2024-01-04 2024-02-06 北京奇虎科技有限公司 Information verification method and device based on large model

Similar Documents

Publication Publication Date Title
CN113312501A (en) Construction method and device of safety knowledge self-service query system based on knowledge graph
US20220147836A1 (en) Method and device for text-enhanced knowledge graph joint representation learning
CN110298037B (en) Convolutional neural network matching text recognition method based on enhanced attention mechanism
CN110609891B (en) Visual dialog generation method based on context awareness graph neural network
CN110598005B (en) Public safety event-oriented multi-source heterogeneous data knowledge graph construction method
CN111209401A (en) System and method for classifying and processing sentiment polarity of online public opinion text information
CN109934261B (en) Knowledge-driven parameter propagation model and few-sample learning method thereof
CN112434535B (en) Element extraction method, device, equipment and storage medium based on multiple models
CN111522965A (en) Question-answering method and system for entity relationship extraction based on transfer learning
CN111259153B (en) Attribute-level emotion analysis method of complete attention mechanism
CN111831783B (en) Method for extracting chapter-level relation
CN113987147A (en) Sample processing method and device
CN113051922A (en) Triple extraction method and system based on deep learning
CN111581368A (en) Intelligent expert recommendation-oriented user image drawing method based on convolutional neural network
CN114911945A (en) Knowledge graph-based multi-value chain data management auxiliary decision model construction method
CN111026880A (en) Joint learning-based judicial knowledge graph construction method
CN113742733A (en) Reading understanding vulnerability event trigger word extraction and vulnerability type identification method and device
CN115599899A (en) Intelligent question-answering method, system, equipment and medium based on aircraft knowledge graph
CN116258137A (en) Text error correction method, device, equipment and storage medium
CN113886531A (en) Intelligent question and answer determining method and device, computer equipment and storage medium
CN114564563A (en) End-to-end entity relationship joint extraction method and system based on relationship decomposition
CN116340530A (en) Intelligent design method based on mechanical knowledge graph
CN115687609A (en) Zero sample relation extraction method based on Prompt multi-template fusion
CN115292490A (en) Analysis algorithm for policy interpretation semantics
CN115730597A (en) Multi-level semantic intention recognition method and related equipment thereof

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20210827

RJ01 Rejection of invention patent application after publication