CN109271506A - A kind of construction method of the field of power communication knowledge mapping question answering system based on deep learning - Google Patents
A kind of construction method of the field of power communication knowledge mapping question answering system based on deep learning Download PDFInfo
- Publication number
- CN109271506A CN109271506A CN201811441698.5A CN201811441698A CN109271506A CN 109271506 A CN109271506 A CN 109271506A CN 201811441698 A CN201811441698 A CN 201811441698A CN 109271506 A CN109271506 A CN 109271506A
- Authority
- CN
- China
- Prior art keywords
- answer
- question
- word
- entity
- candidate
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000013135 deep learning Methods 0.000 title claims abstract description 32
- 238000004891 communication Methods 0.000 title claims abstract description 18
- 238000010276 construction Methods 0.000 title claims abstract description 12
- 238000013507 mapping Methods 0.000 title abstract description 5
- 238000000034 method Methods 0.000 claims abstract description 41
- 239000013598 vector Substances 0.000 claims description 34
- 230000006870 function Effects 0.000 claims description 24
- 239000011159 matrix material Substances 0.000 claims description 20
- 230000008569 process Effects 0.000 claims description 18
- 238000004422 calculation algorithm Methods 0.000 claims description 17
- 238000012549 training Methods 0.000 claims description 17
- 230000011218 segmentation Effects 0.000 claims description 15
- 238000013528 artificial neural network Methods 0.000 claims description 12
- 238000004458 analytical method Methods 0.000 claims description 5
- 238000012545 processing Methods 0.000 claims description 5
- 238000000605 extraction Methods 0.000 claims description 4
- 238000003062 neural network model Methods 0.000 claims description 4
- 238000004364 calculation method Methods 0.000 claims description 3
- 238000001914 filtration Methods 0.000 claims description 3
- 238000011176 pooling Methods 0.000 claims description 3
- 238000007781 pre-processing Methods 0.000 claims description 3
- 238000012216 screening Methods 0.000 claims description 3
- 238000012163 sequencing technique Methods 0.000 claims description 3
- 238000007619 statistical method Methods 0.000 claims description 3
- 238000012360 testing method Methods 0.000 claims description 3
- 239000002585 base Substances 0.000 description 14
- 238000010801 machine learning Methods 0.000 description 6
- 238000003058 natural language processing Methods 0.000 description 6
- 238000011160 research Methods 0.000 description 6
- 210000004556 brain Anatomy 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 230000008901 benefit Effects 0.000 description 2
- 238000013527 convolutional neural network Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000007477 logistic regression Methods 0.000 description 2
- 238000012706 support-vector machine Methods 0.000 description 2
- 241001672694 Citrus reticulata Species 0.000 description 1
- 241000238558 Eucarida Species 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000009411 base construction Methods 0.000 description 1
- 239000012458 free base Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000012886 linear function Methods 0.000 description 1
- 230000008450 motivation Effects 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The construction method for the field of power communication knowledge mapping question answering system based on deep learning that the present invention relates to a kind of, realize step are as follows: step 1: semanteme parsing, the problem of being proposed to user with natural language q is pre-processed, therefrom extract the keyword of user query, the entity w as question sentence such as inquiry sentence focus of attention.Step 2: semantic expressiveness will pass through pretreated natural language problem vectorization, while the matching degree by the Candidate Set vectorization of answer a, for subsequent computational problem q and answer a.Step 3: calculated by semantic matching degree, inquiry and the methods of reasoning, find out most matched with problem q, most accurate answer a so that the question and answer are to (q, score S (q, a) highest a).Study through the invention by knowledge mapping construct question answering system the national grid communications field feasibility.
Description
Technical Field
The invention belongs to the application range of Knowledge maps in the power communication industry, and particularly relates to a method for constructing a Knowledge map question-answering system (Knowledge base responding) based on Deep learning (Deep learning).
Background
Knowledge Graph (Knowledge Graph): is essentially a structured semantic knowledge base with pairs of entities with attributes linked by relationships. Which contains a large number of entity-pair relationships that are used to symbolically describe concepts in the physical world and interrelationships between concepts. From a graph perspective, a knowledge graph is actually a conceptual network, with nodes of the network being entities in the real world, and edges of the network representing connections (relationships) between pairs of entities[1]。
The concept of a knowledge graph was first introduced in 2012 by Google corporation to enhance the knowledge base of its search engine functionality[2]The method aims to systematize the search keywords so that each keyword has a complete knowledge system, thereby improving the search quality[3]. Although the traditional search engine technology can quickly screen and sort a large number of webpages according to the query structure of a user, the efficiency of information retrieval is improved. However, as the user cannot quickly and accurately feed back specific answers to questions, the information retrieval method gradually fails to meet the needs of the user as the total amount of internet information increases explosively. The appearance of the knowledge graph provides a feasible scheme for solving the problem of user information retrieval under big data[4].
Common knowledge graphs can be divided into two broad categories, general knowledge graphs and industry knowledge graphs. The universal knowledge graph emphasizes that the knowledge graph is constructed on the basis of big data, and a typical large-scale knowledge base is provided with Freebase[5]、Wikidata[6]、DBpedia[7]、 YAGO[8]And the like, which not only contain large amounts of semi-structured and unstructured data, but also have high domain coverage. And industry knowledge graph usageThe knowledge base is constructed by data of a specific industry, so the data size is smaller than that of a general knowledge graph generally, but the attribute and the data mode of an entity are richer. Industry knowledge maps are more accurate than general knowledge maps, but are only applicable to specific areas. At present, industry knowledge maps are researched and developed in the industries of electronic commerce, finance and the like[9]。
Knowledge-graphs are typically represented by "triplets", i.e., G ═ E, R, S. Wherein E ═ { E ═ E1,e2,...,e|E|Represents a set of entities in the knowledge base, representing a total of | E | different entities; r ═ R1,r2,...,r|R|Represents the set of entity relationships between pairs of entities in the knowledge base, | R | is the number of entity relationships;what is represented is a set of triples of the knowledge-graph, i.e., a representation of the entire knowledge-base. The triple set of the knowledge-graph represents the relationship between entities or the relationship between concepts and attributes.
Question Answering System (Question Answering System): QA for short is an advanced form of information retrieval, and its simple and accurate interaction mode makes the question-answering system become another research hotspot in the application field of Natural Language Processing (NLP). Different from the traditional search engine, the question-answering system accepts that the user proposes questions (such as how large yaoming is. While the traditional network information query tool mainly searches based on matching of keywords, although the scheme can generally meet the requirements of users, due to the fact that the way of expressing the problems by the keywords lacks background information on the context of the problems, the retrieved results are often tens of thousands of related web pages which are sorted by PageRank, and the requirements of the users are often a small part of the related web pages. So that the question-answering system matches the keywordsThere are substantial differences between conventional search engines that are configured to return a collection of related documents[10]。
In recent years, many question-answering systems have appeared at home and abroad. A relatively sophisticated question-answering system developed abroad is the Start of the Artificial Intelligence laboratory of the university of Mandarin province[11]AnswerBus of michigan university[12]、IBM[13]A statistical-based question-answering system, etc. Compared with an English question-answering system, the Chinese question-answering system is more complex in construction process because a large-scale Chinese knowledge base is very difficult to construct. At present, the inquiry and answer system developed at home is NKI developed by Chinese academy of sciences[14]Remote intelligent question answering system designed by Shanghai university of transportation[15]And an automatic question-answering system based on WeChat developed by Kaiki, etc. of electronic science and technology university[16]And the like. Compared with the English question-answering system, the Chinese question-answering system has late research start and is not mature enough, and the research start is related to various factors such as the complexity of grammar, syntax and semantics of Chinese and the like[17]。
The question-answering system can be divided into the following categories according to its representation: chat robots (such as Siri), knowledge base-based question-answering systems, intelligent search engines, free text-based question-answering systems. The knowledge can be divided into a question-answering system in a limited domain and a question-answering system in an open domain according to the domain range to which the knowledge belongs. The question-answering system in the open field is more complex than that in the limited field, and the question-answering system in the limited field is more targeted and professional[18]。
Deep learning: as a branch of machine learning, it is a research focus in the field of machine learning in recent years. The advent of deep learning has led to a great deal of development in the subject areas of Natural Language Processing (NLP) and computer vision. The concept of deep learning stems from the study of artificial neural networks. In 2006, Hinton et al proposed the concept of "deep learning". As a branch of machine learning, deep learning is opposite to 'shallow learning', and the motivation is to establish and simulate a neural network for human brain to analyze and learn, and to simulate the neural networkInterpreting data in the human brain, e.g. processing of images, sounds and text[19]。
Before deep learning occurs, algorithms mainly adopted in the field of machine learning include logistic regression (logistic regression), Support Vector Machine (SVM), Hidden Markov Model (HMM), and the like, and all the models do not include a hidden node layer or only include a hidden layer, and therefore all the models belong to shallow learning models. In the last 80 th century, Zipser et al proposed a back propagation algorithm[20]The algorithm can automatically correct the parameters of the artificial neural network model in the training process, and greatly promotes the development of the artificial neural network model and machine learning. In 2011, researchers of microsoft corporation adopt a deep learning technology to greatly reduce the error rate of voice recognition, and a technical breakthrough is brought to voice recognition. In 2012, Google Brain introduced the Google Brain project and used a Deep Neural Network (DNN) with 10 hundred million nodes for training to intelligently recognize pictures. In addition, large internet companies such as Facebook and Baidu also invest deep learning into research, so as to see the weight of deep learning in the future machine learning field[20]。
Disclosure of Invention
With the increasing maturity of knowledge graph construction technology, knowledge base construction for structured and unstructured data has become possible. After the Chinese general knowledge graph (such as 'Baidu consciousness') is gradually formed, the Chinese industry knowledge graph also appears in succession. At present, common application of knowledge graph is basically staying in the aspects of 'search engine' and 'question-answering system'. In the application aspect of the question-answering system, compared with the traditional question-answering system, the question-answering system based on the knowledge map mostly has one or more knowledge bases, and can answer the questions posed by the users by utilizing a retrieval algorithm or a logical reasoning algorithm. Thus, the number of entities in the knowledge-graph directly determines how well such a question-answering system performs. The question-answering system based on the knowledge graph has the advantage of being capable of accurately reasoning and answering questions posed by users.
The invention provides a method for constructing a question-answering system based on an industry knowledge graph by utilizing various data sources of national power grid power communication. Because national power grid communication has a large amount of structured data and unstructured text data, in order to better perform entity recognition on the unstructured data in the process of constructing a knowledge graph, a Natural Language Processing (NLP) technology based on deep learning is adopted to perform word segmentation, named entity recognition, entity relationship recognition and the like on the text data. In general, the construction process of the knowledge-graph question-answering system based on deep learning can be divided into two parts: the first part is the construction of a knowledge graph based on deep learning; and the second part is the construction of a question-answering system based on a knowledge graph. In view of the fact that the construction process of the question-answering system is mainly described herein, the construction of the knowledge graph is omitted, namely, the knowledge graph based on national power grid communication is assumed to be constructed. A vector modeling method based on deep learning is adopted as a question-answering algorithm, and the method has the core idea that: and mapping the entity of the knowledge base and the question in the natural language into the same vector space, and searching out the optimal candidate answer by calculating the semantic similarity.
The invention adopts the following scheme to solve the technical problems:
a construction method of a knowledge graph question-answering system in the electric power communication field based on deep learning is characterized in that natural language questions put forward by users are expressed as q ═ omega1...ωnAll candidate answers to the question are represented as answer candidate set CqThe method specifically comprises the following steps:
step 1: semantic analysis, specifically, preprocessing a question q which is provided by a user through natural language, extracting keywords inquired by the user from the question q, and taking the focus concerned by an inquiry sentence and the like as an entity w of the inquiry sentence;
step 2: performing semantic representation, specifically vectorizing the preprocessed natural language questions, and vectorizing a candidate set of the answer a at the same time, wherein the candidate set is used for subsequently calculating the matching degree of the question q and the answer a;
and step 3: and finding out the most accurate answer a which is most matched with the question q through semantic matching degree calculation, query and reasoning so that the score S (q, a) of the question-answer pair (q, a) is the highest.
In the above method for constructing a knowledge graph question-answering system in the field of power communication based on deep learning, the step 1 specifically includes:
step 1.1, problem word segmentation and part of speech tagging: performing problem word segmentation and part-of-speech tagging by adopting a Chinese lexical analysis system; after word segmentation, the natural language question is expressed as q ═ ω1...ωnWherein w isnIs a Chinese word;
step 1.2, problem named entity recognition: screening out the named entities described in the question q according to the part of speech marked in the step 1.1, and using the named entities as keywords and focus of attention of the question; the common method for extracting the keywords is a statistical method, namely, the weights of the candidate words are determined, and the candidate words with large weights are screened out to be used as final keywords; the specific method for extracting the keywords comprises the following steps:
step 1.21, performing word segmentation processing on the text;
step 1.22, filtering stop words, namely, fictional words, tone words, punctuation marks and the like;
step 1.23, respectively calculating word frequency factors and position factors of the words according to the counted word frequency and position information;
step 1.24, calculating the weights of the words by using a specific word weight function, sequencing the words, and selecting the word with the larger weight as a keyword;
step 1.23, after the extraction process, identifying named entities so as to calculate the subject and the object of the sentence;
step 1.3, generating answer candidate set: according to the question qThe key words and the named entities are searched in a knowledge base, the entity nodes and the adjacent nodes within the range of 2-hops are searched, and an answer candidate set C of the question q is formed by the key words and the named entitiesq。
In the above method for constructing a knowledge graph question-answering system in the field of power communication based on deep learning, the step 2 specifically includes:
step 2.1, problem q vectorization: for preprocessed problem q ═ ω1...ωnEach word ω thereof is expressedjMapping into a vector wj of a low-dimensional space, namely vectorization of the word; embedding matrix W by wordsvConvert it into a d-dimensional distributed vector, i.e.
wj=Wvu(ωj)
Wherein (u (ω)j)∈{0,1}|V|) As the word omegajThe corresponding one-hot form of the corresponding one,is a word embedding matrix (word embedding matrix), and | V | represents the size of the vocabulary; in the training process of the MCCNNs network, WvIs continuously updated as a hyper-parameter;
then, a representation of the problem q is computed using a sliding window at the convolutional layer of the neural network; for example, for column i of MCCNNs, an n-dimensional vector of the problem q is calculated as follows:
wherein 2s +1 is the size of the sliding window,is a weight matrix of the convolutional layer,denotes a deviation vector, h (-) is notLinear functions (e.g., sigmoid function, tanh function, etc.);
finally, training at the maximum pooling level outputs a vector representation of the problem q of fixed size, where the output of the i-th column of active units of MCCNNs is:
wherein max is a maximum function; this results in a low-dimensional space vector f for the problem qi(q);
Step 2.2, vectorization of the candidate answer set: for three features of the answer, the answer path, the answer context information and the answer type, the vectorization process is respectively shown as follows:
for Answer Path (Answer Path): the represented is the incidence relation between the answer node and the named entity in the question; then the distributed representation g of the answer path1(a) The vector representation of (c) can be calculated using the following formula:
wherein, | |1Denotes the L1 norm, up(a)∈R|R|×1A binary vector is used to indicate whether each association of the answer path exists,is a parameter matrix, | R | is the number of incidence relations;
for Answer Context information (Answer Context): its role is a constraint to deal with the problem; taking the entity relationship and the entity corresponding to the answer entity in the range of 1 hop (hop) as the context information of the answer entity; the distributed expression of answer context information obtained in the same way is:
wherein u isc(a)∈R|C|×1A binary vector representing whether an inode exists, | C | represents the number of entity-pair relationships in which an answer context exists,is a parameter matrix;
for Answer Type (Answer Type): type is a special entity relationship, such as the type of time 2018-01-01 is datetime; in the same way, the corresponding distributed expression is obtained as:
wherein,for embedded type of matrix, ut(a)∈R|T|×1A binary variable representing whether an answer type exists, | T | is the number of answer types;
in combination with the above description, the final output function of the MCCNNs is obtained as:
in the above method for constructing a knowledge graph question-answering system in the field of power communication based on deep learning, the step 3 specifically includes:
step 3.1, training MCCNNs neural network models: for each correct answer a ∈ A to the question qqRandomly from the answer candidate set CqExtracting k wrong answers a' as samples, and taking the samples as negative samples of the training model parameters; then Hinge loss letterThe number may be defined as:
l(q,a,a')=(m-S(q,a)+S(q,a'))+
where S (·,) is the scoring function defined in step 2, m is an interval parameter used to normalize the interval between the two scores S (q, a) and S (q, a'), and (z)+Max {0, z }; the subjective function is then:
wherein | AqL represents the number of exact answers,is a set of k wrong answers;
training the model by adopting a back propagation algorithm of a neural network, and continuously updating the model by utilizing a gradient descent algorithm by calculating a gradient value of a hyper-parameter;
step 3.2, semantic matching and reasoning: all answer candidate sets C related to the question q are retrieved in the testing processq(ii) a For each candidate answerCalculate its scoreThe answer a with the highest score in the candidate answer set is the most accurate answer of the question q;
for a natural language question q, there may be more than one answer due to its accuracy, such as "model type of device"; therefore, a scale criterion is needed to determine the final answer; the formula of the rating criterion is as follows:
where m is the interval defined in step 3.1.
Therefore, the invention has the following advantages:
1. the application range of the knowledge graph mainly focuses on two aspects of a search engine and a question-answering system at present, and the main content of the invention is to construct the question-answering system in the telecommunication field by means of the data structure of the knowledge graph. The invention aims to research the feasibility of a question-answering system constructed by a knowledge graph in the field of national power grid communication.
2. Because national power grid communication has a large amount of structured data and unstructured text data, in the process of constructing the knowledge graph, in order to better perform entity recognition on the unstructured data, a deep learning MCCNNs-based neural network model is adopted to perform word segmentation, named entity recognition, entity relationship recognition and the like on the text data.
Drawings
FIG. 1 is a schematic workflow diagram of a question-answering system.
Fig. 2 is a schematic diagram of the operation of the deep learning based question-answering system.
Detailed Description
Suppose we represent the natural language question posed by the user as q ═ ω1...ωnAll candidate answers to the question are represented as answer candidate set CqThe operation principle of the conventional question-answering system based on deep learning is shown in fig. 2. The question-answering algorithm of the question-answering system based on deep learning is specifically implemented as follows:
step 1: semantic parsing is the initialization phase of the question-answering algorithm, i.e. the preprocessing process of the natural language question q. As is known, the grammar structure of chinese is more complex than that of english, so that processes such as word segmentation and part-of-speech tagging are required to be performed on a text, so that keywords of a problem and the focus of attention on the problem can be retrieved from the text.
Step 1.1 problem participles and part-of-speech tagging
The Chinese text word segmentation takes 'word' as a processing object, so that the Chinese text needs to be cut into parts according to the word and marked with a cut label. After the text is participled, the grammar category of each word in the sentence is determined, and the part of speech is determined and labeled. The above processes are respectively called Chinese text segmentation and part-of-speech tagging processes. Here we use the chinese lexical analysis system (ICTCLAS for short) to solve the above problem. After word segmentation, we express the natural language problem as q ═ ω1...ωnWherein w isnIs a Chinese word.
Step 1.2 problem named entity recognition
And (4) screening out the named entities described in the question q according to the part of speech marked in the step 1.1, and using the named entities as keywords and focus of attention of the question. The common method for extracting keywords is a statistical method, that is, by determining the weights of candidate words, a final keyword with a higher weight is selected from the candidate words. The basic flow of keyword extraction is as follows:
1) performing word segmentation processing on the text;
2) filtering stop words, namely, fictional words, tone words, punctuation marks and the like;
3) respectively calculating word frequency factors and position factors of the words according to the counted word frequency and position information;
4) and calculating the weights of the words by using a specific word weight function, sequencing the words, and selecting the words with larger weights as keywords.
Meanwhile, in the extraction process of the keywords, named entities are identified so as to calculate the subject and the object of the sentence.
Step 1.3 generating answer candidate set
According to the keywords and named entities of the question q, searching the entity node and the adjacent nodes within the range of 2-hops in a knowledge base to form an answer candidate set C of the question qq。
Step 2: semantic representation, namely the question q preprocessed in step 1 and a candidate answer set CqVectorization.
Here we use the multi-convolutional neural network MCCNNs (multi-convolutional neural networks) model shown in FIG. 2 to solve the problem of the Answer Path (Answer Path), Answer Context information (Answer Context), Answer Type (Answer Type) in three aspects of q and CqLearning is performed. The three aspects are respectively trained by three columns of activation units of the neural network, and sentence vectors of the final problem q are respectively marked as f1(q),f2(q),f3(q) is carried out. Meanwhile, for each candidate answer a of the question q, a vector representation form is calculated and is marked as g1(a),g2(a),g3(a) In that respect F is then1 T(q)g1(a)、Andand respectively representing the training output values of three aspects of answer paths, context information and answer types. Using the above vectors we can calculate the matching degree of the specific question-answer pair (q, a), and the calculation formula is:
the vectorization process for the question q and the candidate answer a is as follows:
step 2.1 problem q vectorization
For preprocessed problem q ═ ω1...ωnEach word ω thereof is expressedjVector w mapped into a low dimensional spacejI.e. vectorization of words. We embed matrix W by wordsvConvert it into a d-dimensional distributed vector, i.e.
wj=Wvu(ωj)
Wherein (u (ω)j)∈{0,1}|V|) As the word omegajThe corresponding one-hot form of the corresponding one,is a word embedding matrix (word embedding matrix), and | V | represents the size of the vocabulary. In the training process of the MCCNNs network, WvIs continuously updated as a hyper-parameter.
A representation of the problem q is then computed using a sliding window at the convolutional layer of the neural network. For example, for column i of MCCNNs, an n-dimensional vector of the problem q is calculated as follows:
wherein 2s +1 is the size of the sliding window,is a weight matrix of the convolutional layer,representing the deviation vector, h (-) is a non-linear function (e.g., sigmoid function, tanh function, etc.).
Finally, training at the maximum pooling level outputs a vector representation of the problem q of fixed size, where the output of the i-th column of active units of MCCNNs is:
where max is a maximum function. Thus we finally get the low dimensional space vector f of the problem qi(q)。
Step 2.2 vectorization of candidate answer sets
For three features of the answer: answer path, answer context information and answer type, we show the vectorization process:
1) for Answer Path (Answer Path): what is shown is the association of the answer node with the named entity in the question. Then the distributed representation g of the answer path1(a) The vector representation of (c) can be calculated using the following formula:
wherein | · | purple sweet1Represents L1Norm, up(a)∈R|R|×1A binary vector is used to indicate whether each association of the answer path exists,and the | R | is the number of the incidence relations.
2) For Answer Context information (Answer Context): which acts as a constraint for dealing with the problem. We take the answer entity corresponding to the entity relationships and entities in the range of 1 hop (hop) as the context information of the answer entity. In the same way we can get a distributed expression of answer context information as:
wherein u isc(a)∈R|C|×1A binary vector representing whether an inode exists, | C | represents the number of entity-pair relationships in which an answer context exists,is a parameter matrix.
3) For Answer Type (Answer Type): a type is a special entity relationship, such as a time 2018-01-01 type is datetime. In the same way we can get the corresponding distributed expression as:
wherein,for embedded type of matrix, ut(a)∈R|T|×1A binary variable representing whether an answer type exists, | T | is the number of answer types.
In conjunction with the above description, we have obtained the final output function of MCCNNs as:
and step 3: obtaining the most accurate answer a
Step 3.1 training MCCNNs neural network model
For each correct answer a ∈ A to the question qqRandomly from the answer candidate set CqAnd extracting k wrong answers a' as samples, and taking the samples as negative samples of the parameters of the training model. Then the Hinge loss function can be defined as:
l(q,a,a')=(m-S(q,a)+S(q,a'))+
where S (·,) is the scoring function defined in step 2, m is an interval parameter used to normalize the interval between the two scores S (q, a) and S (q, a'), and (z)+Max {0, z }. The subjective function is then:
wherein | AqL represents the number of exact answers,is a set of k wrong answers.
Here we train the model using the back-propagation algorithm of the neural network, updating it continuously with the gradient descent algorithm by calculating the gradient values of the hyper-parameters.
Step 3.2 semantic matching and reasoning
During the test, we have retrieved all answer candidate sets C for the question qq. For each candidate answerCalculate its scoreThe answer a with the highest score in the candidate answer set is the most accurate answer to the question q.
For a natural language question q, there may be more than one answer, such as "model type of device", due to its accuracy. Therefore, a criterium is needed to determine the final answer. The formula of the rating criterion is as follows:
where m is the interval defined in step 3.1.
By combining the steps, a specific process of the question-answering system algorithm based on deep learning is completed. Compared with the traditional vector modeling method, the vector modeling algorithm based on deep learning solves the semantic sequence of the problem which is not considered, and improves the capability of the algorithm for extracting features.
The specific embodiments described herein are merely illustrative of the spirit of the invention. Those skilled in the art to which the invention relates may modify, supplement or substitute the specific embodiments described, without however departing from the spirit of the invention or exceeding the scope defined by the appended claims.
Claims (4)
1. A construction method of a knowledge graph question-answering system in the electric power communication field based on deep learning is characterized in that natural language questions put forward by users are expressed as q ═ omega1...ωnAll candidate answers to the question are represented as answer candidate set CqThe method specifically comprises the following steps:
step 1: semantic analysis, specifically, preprocessing a question q which is provided by a user through natural language, extracting keywords inquired by the user from the question q, and taking the focus concerned by an inquiry sentence and the like as an entity w of the inquiry sentence;
step 2: performing semantic representation, specifically vectorizing the preprocessed natural language questions, and vectorizing a candidate set of the answer a at the same time, wherein the candidate set is used for subsequently calculating the matching degree of the question q and the answer a;
and step 3: and finding out the most accurate answer a which is most matched with the question q through semantic matching degree calculation, query and reasoning so that the score S (q, a) of the question-answer pair (q, a) is the highest.
2. The method for constructing the power communication field knowledge graph question-answering system based on deep learning according to claim 1, wherein the step 1 specifically comprises:
step 1.1, problem word segmentation and part of speech tagging: performing problem word segmentation and part-of-speech tagging by adopting a Chinese lexical analysis system; after word segmentation, the natural language question is expressed as q ═ ω1...ωnWherein w isnIs a Chinese word;
step 1.2, problem named entity recognition: screening out the named entities described in the question q according to the part of speech marked in the step 1.1, and using the named entities as keywords and focus of attention of the question; the common method for extracting the keywords is a statistical method, namely, the weights of the candidate words are determined, and the candidate words with large weights are screened out to be used as final keywords; the specific method for extracting the keywords comprises the following steps:
step 1.21, performing word segmentation processing on the text;
step 1.22, filtering stop words, namely, fictional words, tone words, punctuation marks and the like;
step 1.23, respectively calculating word frequency factors and position factors of the words according to the counted word frequency and position information;
step 1.24, calculating the weights of the words by using a specific word weight function, sequencing the words, and selecting the word with the larger weight as a keyword;
step 1.23, after the extraction process, identifying named entities so as to calculate the subject and the object of the sentence;
step 1.3, generating answer candidate set: according to the keywords and named entities of the question q, the entity nodes and the nodes within the range of 2-hops are searched in the knowledge baseNeighboring nodes, together forming a candidate set C of answers to the question qq。
3. The method for constructing the electric power communication field knowledge graph question-answering system based on deep learning according to claim 1, wherein the step 2 specifically comprises:
step 2.1, problem q vectorization: for preprocessed problem q ═ ω1...ωnEach word ω thereof is expressedjVector w mapped into a low dimensional spacejI.e. vectorization of the word; embedding matrix W by wordsvConvert it into a d-dimensional distributed vector, i.e.
wj=Wvu(ωj)
Wherein (u (ω)j)∈{0,1}|V|) As the word omegajThe corresponding one-hot form of the corresponding one,is a word embedding matrix (word embedding matrix), V represents the size of the vocabulary; in the training process of the MCCNNs network, WvIs continuously updated as a hyper-parameter;
then, a representation of the problem q is computed using a sliding window at the convolutional layer of the neural network; for example, for column i of MCCNNs, an n-dimensional vector of the problem q is calculated as follows:
wherein 2s +1 is the size of the sliding window,is a weight matrix of the convolutional layer,representing deviation vectors, h (-) is a nonlinear function (such as sigmoid function, tanh function, etc.);
finally, training at the maximum pooling level outputs a vector representation of the problem q of fixed size, where the output of the i-th column of active units of MCCNNs is:
wherein max is a maximum function; this results in a low-dimensional space vector f for the problem qi(q);
Step 2.2, vectorization of the candidate answer set: for three features of the answer, the answer path, the answer context information and the answer type, the vectorization process is respectively shown as follows:
for Answer Path (Answer Path): the represented is the incidence relation between the answer node and the named entity in the question; then the distributed representation g of the answer path1(a) The vector representation of (c) can be calculated using the following formula:
wherein | · | purple sweet1Denotes the L1 norm, up(a)∈R|R|×1A binary vector is used to indicate whether each association of the answer path exists,is a parameter matrix, | R | is the number of incidence relations;
for Answer Context information (Answer Context): its role is a constraint to deal with the problem; taking the entity relationship and the entity corresponding to the answer entity in the range of 1 hop (hop) as the context information of the answer entity; the distributed expression of answer context information obtained in the same way is:
wherein u isc(a)∈R|C|×1A binary vector representing whether an inode exists, | C | represents that an entity pair exists for an answer contextThe number of the series is,is a parameter matrix;
for Answer Type (Answer Type): type is a special entity relationship, such as the type of time 2018-01-01 is datetime; in the same way, the corresponding distributed expression is obtained as:
wherein,for embedded type of matrix, ut(a)∈R|T|×1A binary variable representing whether an answer type exists, | T | is the number of answer types;
in combination with the above description, the final output function of the MCCNNs is obtained as:
4. the method for constructing the electric power communication field knowledge graph question-answering system based on deep learning according to claim 1, wherein the step 3 specifically comprises:
step 3.1, training MCCNNs neural network models: for each correct answer a ∈ A to the question qqRandomly from the answer candidate set CqExtracting k wrong answers a' as samples, and taking the samples as negative samples of the training model parameters; then the Hinge loss function can be defined as:
l(q,a,a')=(m-S(q,a)+S(q,a'))+
wherein S (-) is the score function defined in step 2, m is the interval parameter for normalizing the two scores S (q, a) and S (q, a')
A spacing therebetween, and (z)+Max {0, z }; then subjective letterThe number is as follows:
wherein | AqL represents the number of exact answers,is a set of k wrong answers;
training the model by adopting a back propagation algorithm of a neural network, and continuously updating the model by utilizing a gradient descent algorithm by calculating a gradient value of a hyper-parameter;
step 3.2, semantic matching and reasoning: all answer candidate sets C related to the question q are retrieved in the testing processq(ii) a For each candidate answerCalculate its scoreThe answer a with the highest score in the candidate answer set is the most accurate answer of the question q;
for a natural language question q, there may be more than one answer due to its accuracy, such as "model type of device"; therefore, a scale criterion is needed to determine the final answer; the formula of the rating criterion is as follows:
where m is the interval defined in step 3.1.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811441698.5A CN109271506A (en) | 2018-11-29 | 2018-11-29 | A kind of construction method of the field of power communication knowledge mapping question answering system based on deep learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811441698.5A CN109271506A (en) | 2018-11-29 | 2018-11-29 | A kind of construction method of the field of power communication knowledge mapping question answering system based on deep learning |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109271506A true CN109271506A (en) | 2019-01-25 |
Family
ID=65186847
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811441698.5A Pending CN109271506A (en) | 2018-11-29 | 2018-11-29 | A kind of construction method of the field of power communication knowledge mapping question answering system based on deep learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109271506A (en) |
Cited By (67)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109978060A (en) * | 2019-03-28 | 2019-07-05 | 科大讯飞华南人工智能研究院(广州)有限公司 | A kind of training method and device of natural language element extraction model |
CN110046262A (en) * | 2019-06-10 | 2019-07-23 | 南京擎盾信息科技有限公司 | A kind of Context Reasoning method based on law expert's knowledge base |
CN110059160A (en) * | 2019-04-17 | 2019-07-26 | 东南大学 | A kind of knowledge base answering method and device based on context end to end |
CN110096573A (en) * | 2019-04-22 | 2019-08-06 | 腾讯科技(深圳)有限公司 | A kind of text resolution method and relevant device |
CN110120001A (en) * | 2019-05-08 | 2019-08-13 | 成都佳发安泰教育科技股份有限公司 | The method and system that a kind of knowledge based spectrum library mentions point in conjunction with memory curve |
CN110232113A (en) * | 2019-04-12 | 2019-09-13 | 中国科学院计算技术研究所 | A kind of method and system improving the accuracy of knowledge base question and answer |
CN110309283A (en) * | 2019-06-28 | 2019-10-08 | 阿里巴巴集团控股有限公司 | A kind of answer of intelligent answer determines method and device |
CN110334272A (en) * | 2019-05-29 | 2019-10-15 | 平安科技(深圳)有限公司 | The intelligent answer method, apparatus and computer storage medium of knowledge based map |
CN110516057A (en) * | 2019-08-23 | 2019-11-29 | 深圳前海微众银行股份有限公司 | A kind of petition letter problem answer method and device |
CN110569343A (en) * | 2019-08-16 | 2019-12-13 | 华东理工大学 | question and answer based clinical text structuring method |
CN110717018A (en) * | 2019-04-15 | 2020-01-21 | 中国石油大学(华东) | Industrial equipment fault maintenance question-answering system based on knowledge graph |
CN110717019A (en) * | 2019-08-28 | 2020-01-21 | 厦门快商通科技股份有限公司 | Question-answering processing method, question-answering system, electronic device and medium |
CN110795543A (en) * | 2019-09-03 | 2020-02-14 | 腾讯科技(深圳)有限公司 | Unstructured data extraction method and device based on deep learning and storage medium |
CN111061856A (en) * | 2019-06-06 | 2020-04-24 | 北京理工大学 | Knowledge perception-based news recommendation method |
CN111078875A (en) * | 2019-12-03 | 2020-04-28 | 哈尔滨工程大学 | Method for extracting question-answer pairs from semi-structured document based on machine learning |
CN111143539A (en) * | 2019-12-31 | 2020-05-12 | 重庆和贯科技有限公司 | Knowledge graph-based question-answering method in teaching field |
CN111177339A (en) * | 2019-12-06 | 2020-05-19 | 百度在线网络技术(北京)有限公司 | Dialog generation method and device, electronic equipment and storage medium |
CN111177406A (en) * | 2019-12-25 | 2020-05-19 | 毛彬 | Knowledge system automatic construction method and system based on wikidata |
CN111274371A (en) * | 2020-01-14 | 2020-06-12 | 东莞证券股份有限公司 | Intelligent man-machine conversation method and equipment based on knowledge graph |
CN111324709A (en) * | 2020-02-10 | 2020-06-23 | 广西师范大学 | Intelligent question answering method based on subject knowledge graph and convolutional neural network |
CN111324717A (en) * | 2020-02-24 | 2020-06-23 | 武汉大学 | Open domain question and answer oriented answer obtaining method and system based on machine reading understanding |
CN111460116A (en) * | 2020-03-19 | 2020-07-28 | 北京百度网讯科技有限公司 | Question answering method, question answering system, electronic equipment and storage medium |
CN111552786A (en) * | 2020-04-16 | 2020-08-18 | 重庆大学 | Question-answering working method based on keyword extraction |
CN111680134A (en) * | 2020-04-20 | 2020-09-18 | 重庆兆光科技股份有限公司 | Method for measuring question-answering consulting information by information entropy |
CN111708899A (en) * | 2020-06-13 | 2020-09-25 | 广州华建工智慧科技有限公司 | Engineering information intelligent searching method based on natural language and knowledge graph |
CN111739541A (en) * | 2019-03-19 | 2020-10-02 | 上海云思智慧信息技术有限公司 | Conference assistance method and system based on voice, storage medium and terminal |
CN111814477A (en) * | 2020-07-06 | 2020-10-23 | 重庆邮电大学 | Dispute focus discovery method and device based on dispute focus entity and terminal |
CN111949774A (en) * | 2020-07-08 | 2020-11-17 | 深圳鹏锐信息技术股份有限公司 | Intelligent question answering method and system |
CN112035678A (en) * | 2020-09-03 | 2020-12-04 | 中国银行股份有限公司 | Bank business knowledge graph construction method and device |
CN112052311A (en) * | 2019-05-20 | 2020-12-08 | 天津科技大学 | Short text question-answering method and device based on word vector technology and knowledge graph retrieval |
CN112084312A (en) * | 2020-07-23 | 2020-12-15 | 江苏海洋大学 | Intelligent customer service system constructed based on knowledge graph |
CN112101592A (en) * | 2020-09-08 | 2020-12-18 | 中国电力科学研究院有限公司 | Power secondary device defect diagnosis method, system, device and storage medium |
CN112214584A (en) * | 2019-07-10 | 2021-01-12 | 国际商业机器公司 | Finding answers using knowledge graphs with entity relationships |
WO2021038380A1 (en) * | 2019-08-26 | 2021-03-04 | International Business Machines Corporation | Knowledge graph-based query in artificial intelligence chatbot with base query element detection and graph path generation |
CN112487173A (en) * | 2020-12-18 | 2021-03-12 | 北京百度网讯科技有限公司 | Man-machine conversation method, device and storage medium |
CN112527999A (en) * | 2020-12-22 | 2021-03-19 | 江苏省农业科学院 | Extraction type intelligent question and answer method and system introducing agricultural field knowledge |
CN112579795A (en) * | 2020-12-28 | 2021-03-30 | 重庆邮电大学 | Intelligent question-answering method based on knowledge graph embedded representation |
CN112749567A (en) * | 2021-03-01 | 2021-05-04 | 哈尔滨理工大学 | Question-answering system based on reality information environment knowledge graph |
CN112766507A (en) * | 2021-01-20 | 2021-05-07 | 河海大学 | Complex question knowledge base question-answering method based on embedded and candidate subgraph pruning |
CN112800203A (en) * | 2021-02-05 | 2021-05-14 | 江苏实达迪美数据处理有限公司 | Question-answer matching method and system fusing text representation and knowledge representation |
CN112989005A (en) * | 2021-04-16 | 2021-06-18 | 重庆中国三峡博物馆 | Knowledge graph common sense question-answering method and system based on staged query |
CN113220861A (en) * | 2021-06-01 | 2021-08-06 | 中国电子科技集团公司第五十四研究所 | Knowledge map question-answering system for professional field |
CN113220848A (en) * | 2020-01-21 | 2021-08-06 | 华为技术有限公司 | Automatic question answering method and device for man-machine interaction and intelligent equipment |
CN113312917A (en) * | 2021-05-28 | 2021-08-27 | 国网江苏省电力有限公司电力科学研究院 | Entity relation extraction method and system based on knowledge reasoning |
CN113326361A (en) * | 2021-05-25 | 2021-08-31 | 武汉理工大学 | Knowledge question-answering method and system based on automobile industry map and electronic equipment |
CN113344122A (en) * | 2021-06-29 | 2021-09-03 | 复旦大学 | Operation flow diagnosis method and device and storage medium |
CN113420231A (en) * | 2021-05-25 | 2021-09-21 | 国网浙江省电力有限公司物资分公司 | Data recommendation algorithm applied to purchasing system |
CN113468311A (en) * | 2021-07-20 | 2021-10-01 | 四川启睿克科技有限公司 | Knowledge graph-based complex question and answer method, device and storage medium |
CN113590799A (en) * | 2021-08-16 | 2021-11-02 | 东南大学 | Weak supervision knowledge graph question-answering method based on multi-view reasoning |
CN113672720A (en) * | 2021-09-14 | 2021-11-19 | 国网天津市电力公司 | Power audit question and answer method based on knowledge graph and semantic similarity |
CN113779220A (en) * | 2021-09-13 | 2021-12-10 | 内蒙古工业大学 | Mongolian multi-hop question-answering method based on three-channel cognitive map and graph attention network |
CN113792152A (en) * | 2021-08-23 | 2021-12-14 | 南京信息工程大学 | Method for fusing triangular graph and knowledge graph |
CN113849613A (en) * | 2021-09-15 | 2021-12-28 | 北方工业大学 | Food safety question-answer reasoning method, system and electronic equipment |
CN114090746A (en) * | 2021-10-09 | 2022-02-25 | 北京百分点科技集团股份有限公司 | Knowledge graph-based answer query method and device and electronic equipment |
CN114120166A (en) * | 2021-10-14 | 2022-03-01 | 北京百度网讯科技有限公司 | Video question and answer method and device, electronic equipment and storage medium |
CN114153993A (en) * | 2022-02-07 | 2022-03-08 | 杭州远传新业科技有限公司 | Automatic knowledge graph construction method and system for intelligent question answering |
CN114168719A (en) * | 2021-06-22 | 2022-03-11 | 北京理工大学 | Interpretable multi-hop question-answering method and system based on knowledge graph embedding |
CN114186068A (en) * | 2021-11-04 | 2022-03-15 | 国网天津市电力公司 | Audit system basis question-answering method based on multi-level attention network |
CN114896407A (en) * | 2022-03-21 | 2022-08-12 | 武汉理工大学 | Question-answering method based on combination of semantic analysis and vector modeling |
CN116303976A (en) * | 2023-05-12 | 2023-06-23 | 中国人民解放军国防科技大学 | Penetration test question-answering method, system and medium based on network security knowledge graph |
CN116821712A (en) * | 2023-08-25 | 2023-09-29 | 中电科大数据研究院有限公司 | Semantic matching method and device for unstructured text and knowledge graph |
CN116992007A (en) * | 2023-09-28 | 2023-11-03 | 北京致远互联软件股份有限公司 | Limiting question-answering system based on question intention understanding |
CN117290489A (en) * | 2023-11-24 | 2023-12-26 | 烟台云朵软件有限公司 | Method and system for quickly constructing industry question-answer knowledge base |
CN117828060A (en) * | 2024-01-03 | 2024-04-05 | 国投人力资源服务有限公司 | Online question-answering method, system and storage medium based on semantic recognition |
CN117891929A (en) * | 2024-03-18 | 2024-04-16 | 南京华飞数据技术有限公司 | Knowledge graph intelligent question-answer information identification method of improved deep learning algorithm |
CN118093842A (en) * | 2024-04-25 | 2024-05-28 | 安徽省交通规划设计研究总院股份有限公司 | Optimization method and device of knowledge graph question-answering system |
WO2024146427A1 (en) * | 2023-01-04 | 2024-07-11 | 浙江阿里巴巴机器人有限公司 | Intelligent question answering methods and apparatus, device and storage medium |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103049433A (en) * | 2012-12-11 | 2013-04-17 | 微梦创科网络科技(中国)有限公司 | Automatic question answering method, automatic question answering system and method for constructing question answering case base |
-
2018
- 2018-11-29 CN CN201811441698.5A patent/CN109271506A/en active Pending
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103049433A (en) * | 2012-12-11 | 2013-04-17 | 微梦创科网络科技(中国)有限公司 | Automatic question answering method, automatic question answering system and method for constructing question answering case base |
Non-Patent Citations (2)
Title |
---|
DONG L, WEI F, ZHOU M, ET AL.: "Question answering over freebase with multi-column convolutional neural networks", 《PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING 》 * |
刘里、曾庆田: "自动问答系统研究综述", 《山东科技大学学报(自然科学版)》 * |
Cited By (103)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111739541A (en) * | 2019-03-19 | 2020-10-02 | 上海云思智慧信息技术有限公司 | Conference assistance method and system based on voice, storage medium and terminal |
CN109978060B (en) * | 2019-03-28 | 2021-10-22 | 科大讯飞华南人工智能研究院(广州)有限公司 | Training method and device of natural language element extraction model |
CN109978060A (en) * | 2019-03-28 | 2019-07-05 | 科大讯飞华南人工智能研究院(广州)有限公司 | A kind of training method and device of natural language element extraction model |
CN110232113A (en) * | 2019-04-12 | 2019-09-13 | 中国科学院计算技术研究所 | A kind of method and system improving the accuracy of knowledge base question and answer |
CN110717018A (en) * | 2019-04-15 | 2020-01-21 | 中国石油大学(华东) | Industrial equipment fault maintenance question-answering system based on knowledge graph |
CN110059160B (en) * | 2019-04-17 | 2021-02-09 | 东南大学 | End-to-end context-based knowledge base question-answering method and device |
CN110059160A (en) * | 2019-04-17 | 2019-07-26 | 东南大学 | A kind of knowledge base answering method and device based on context end to end |
CN110096573A (en) * | 2019-04-22 | 2019-08-06 | 腾讯科技(深圳)有限公司 | A kind of text resolution method and relevant device |
CN110096573B (en) * | 2019-04-22 | 2022-12-27 | 腾讯科技(深圳)有限公司 | Text parsing method and device |
CN110120001A (en) * | 2019-05-08 | 2019-08-13 | 成都佳发安泰教育科技股份有限公司 | The method and system that a kind of knowledge based spectrum library mentions point in conjunction with memory curve |
CN112052311A (en) * | 2019-05-20 | 2020-12-08 | 天津科技大学 | Short text question-answering method and device based on word vector technology and knowledge graph retrieval |
CN110334272A (en) * | 2019-05-29 | 2019-10-15 | 平安科技(深圳)有限公司 | The intelligent answer method, apparatus and computer storage medium of knowledge based map |
CN110334272B (en) * | 2019-05-29 | 2022-04-12 | 平安科技(深圳)有限公司 | Intelligent question-answering method and device based on knowledge graph and computer storage medium |
CN111061856B (en) * | 2019-06-06 | 2022-05-27 | 北京理工大学 | Knowledge perception-based news recommendation method |
CN111061856A (en) * | 2019-06-06 | 2020-04-24 | 北京理工大学 | Knowledge perception-based news recommendation method |
CN110046262B (en) * | 2019-06-10 | 2021-03-12 | 南京擎盾信息科技有限公司 | Context reasoning method based on legal expert knowledge base |
CN110046262A (en) * | 2019-06-10 | 2019-07-23 | 南京擎盾信息科技有限公司 | A kind of Context Reasoning method based on law expert's knowledge base |
CN110309283B (en) * | 2019-06-28 | 2023-03-21 | 创新先进技术有限公司 | Answer determination method and device for intelligent question answering |
CN110309283A (en) * | 2019-06-28 | 2019-10-08 | 阿里巴巴集团控股有限公司 | A kind of answer of intelligent answer determines method and device |
CN112214584B (en) * | 2019-07-10 | 2024-06-07 | 国际商业机器公司 | Using knowledge graphs to discover answers with entity relationships |
CN112214584A (en) * | 2019-07-10 | 2021-01-12 | 国际商业机器公司 | Finding answers using knowledge graphs with entity relationships |
CN110569343B (en) * | 2019-08-16 | 2023-05-09 | 华东理工大学 | Clinical text structuring method based on question and answer |
CN110569343A (en) * | 2019-08-16 | 2019-12-13 | 华东理工大学 | question and answer based clinical text structuring method |
CN110516057B (en) * | 2019-08-23 | 2022-10-28 | 深圳前海微众银行股份有限公司 | Petition question answering method and device |
CN110516057A (en) * | 2019-08-23 | 2019-11-29 | 深圳前海微众银行股份有限公司 | A kind of petition letter problem answer method and device |
GB2602902A (en) * | 2019-08-26 | 2022-07-20 | Ibm | Knowledge graph-based query in artificial intelligence chatbot with base query element detection and graph path generation |
US11328181B2 (en) | 2019-08-26 | 2022-05-10 | International Business Machines Corporation | Knowledge graph-based query in artificial intelligence chatbot with base query element detection and graph path generation |
WO2021038380A1 (en) * | 2019-08-26 | 2021-03-04 | International Business Machines Corporation | Knowledge graph-based query in artificial intelligence chatbot with base query element detection and graph path generation |
CN110717019A (en) * | 2019-08-28 | 2020-01-21 | 厦门快商通科技股份有限公司 | Question-answering processing method, question-answering system, electronic device and medium |
CN110795543B (en) * | 2019-09-03 | 2023-09-22 | 腾讯科技(深圳)有限公司 | Unstructured data extraction method, device and storage medium based on deep learning |
CN110795543A (en) * | 2019-09-03 | 2020-02-14 | 腾讯科技(深圳)有限公司 | Unstructured data extraction method and device based on deep learning and storage medium |
CN111078875A (en) * | 2019-12-03 | 2020-04-28 | 哈尔滨工程大学 | Method for extracting question-answer pairs from semi-structured document based on machine learning |
CN111177339A (en) * | 2019-12-06 | 2020-05-19 | 百度在线网络技术(北京)有限公司 | Dialog generation method and device, electronic equipment and storage medium |
CN111177406B (en) * | 2019-12-25 | 2023-09-26 | 中国人民解放军军事科学院军事科学信息研究中心 | Knowledge system automatic construction method and system based on wikidata |
CN111177406A (en) * | 2019-12-25 | 2020-05-19 | 毛彬 | Knowledge system automatic construction method and system based on wikidata |
CN111143539A (en) * | 2019-12-31 | 2020-05-12 | 重庆和贯科技有限公司 | Knowledge graph-based question-answering method in teaching field |
CN111274371A (en) * | 2020-01-14 | 2020-06-12 | 东莞证券股份有限公司 | Intelligent man-machine conversation method and equipment based on knowledge graph |
CN111274371B (en) * | 2020-01-14 | 2023-09-29 | 东莞证券股份有限公司 | Intelligent man-machine conversation method and equipment based on knowledge graph |
CN113220848A (en) * | 2020-01-21 | 2021-08-06 | 华为技术有限公司 | Automatic question answering method and device for man-machine interaction and intelligent equipment |
CN111324709A (en) * | 2020-02-10 | 2020-06-23 | 广西师范大学 | Intelligent question answering method based on subject knowledge graph and convolutional neural network |
CN111324717A (en) * | 2020-02-24 | 2020-06-23 | 武汉大学 | Open domain question and answer oriented answer obtaining method and system based on machine reading understanding |
CN111324717B (en) * | 2020-02-24 | 2023-06-27 | 武汉大学 | Open domain question-answering-oriented answer acquisition method and system based on machine reading understanding |
CN111460116A (en) * | 2020-03-19 | 2020-07-28 | 北京百度网讯科技有限公司 | Question answering method, question answering system, electronic equipment and storage medium |
CN111552786A (en) * | 2020-04-16 | 2020-08-18 | 重庆大学 | Question-answering working method based on keyword extraction |
CN111680134A (en) * | 2020-04-20 | 2020-09-18 | 重庆兆光科技股份有限公司 | Method for measuring question-answering consulting information by information entropy |
CN111680134B (en) * | 2020-04-20 | 2023-05-02 | 重庆兆光科技股份有限公司 | Method for measuring inquiry and answer consultation information by information entropy |
CN111708899A (en) * | 2020-06-13 | 2020-09-25 | 广州华建工智慧科技有限公司 | Engineering information intelligent searching method based on natural language and knowledge graph |
CN111708899B (en) * | 2020-06-13 | 2023-10-03 | 广州华建工智慧科技有限公司 | Engineering information intelligent searching method based on natural language and knowledge graph |
CN111814477A (en) * | 2020-07-06 | 2020-10-23 | 重庆邮电大学 | Dispute focus discovery method and device based on dispute focus entity and terminal |
CN111814477B (en) * | 2020-07-06 | 2022-06-21 | 重庆邮电大学 | Dispute focus discovery method and device based on dispute focus entity and terminal |
CN111949774A (en) * | 2020-07-08 | 2020-11-17 | 深圳鹏锐信息技术股份有限公司 | Intelligent question answering method and system |
CN112084312A (en) * | 2020-07-23 | 2020-12-15 | 江苏海洋大学 | Intelligent customer service system constructed based on knowledge graph |
CN112084312B (en) * | 2020-07-23 | 2023-11-14 | 江苏海洋大学 | Intelligent customer service system constructed based on knowledge graph |
CN112035678A (en) * | 2020-09-03 | 2020-12-04 | 中国银行股份有限公司 | Bank business knowledge graph construction method and device |
CN112101592A (en) * | 2020-09-08 | 2020-12-18 | 中国电力科学研究院有限公司 | Power secondary device defect diagnosis method, system, device and storage medium |
CN112487173A (en) * | 2020-12-18 | 2021-03-12 | 北京百度网讯科技有限公司 | Man-machine conversation method, device and storage medium |
CN112527999A (en) * | 2020-12-22 | 2021-03-19 | 江苏省农业科学院 | Extraction type intelligent question and answer method and system introducing agricultural field knowledge |
CN112527999B (en) * | 2020-12-22 | 2024-05-14 | 江苏省农业科学院 | Extraction type intelligent question-answering method and system for introducing knowledge in agricultural field |
CN112579795A (en) * | 2020-12-28 | 2021-03-30 | 重庆邮电大学 | Intelligent question-answering method based on knowledge graph embedded representation |
CN112766507B (en) * | 2021-01-20 | 2023-04-25 | 河海大学 | Complex problem knowledge base question-answering method based on embedded and candidate sub-graph pruning |
CN112766507A (en) * | 2021-01-20 | 2021-05-07 | 河海大学 | Complex question knowledge base question-answering method based on embedded and candidate subgraph pruning |
CN112800203A (en) * | 2021-02-05 | 2021-05-14 | 江苏实达迪美数据处理有限公司 | Question-answer matching method and system fusing text representation and knowledge representation |
CN112749567A (en) * | 2021-03-01 | 2021-05-04 | 哈尔滨理工大学 | Question-answering system based on reality information environment knowledge graph |
CN112989005B (en) * | 2021-04-16 | 2022-07-12 | 重庆中国三峡博物馆 | Knowledge graph common sense question-answering method and system based on staged query |
CN112989005A (en) * | 2021-04-16 | 2021-06-18 | 重庆中国三峡博物馆 | Knowledge graph common sense question-answering method and system based on staged query |
CN113420231A (en) * | 2021-05-25 | 2021-09-21 | 国网浙江省电力有限公司物资分公司 | Data recommendation algorithm applied to purchasing system |
CN113326361A (en) * | 2021-05-25 | 2021-08-31 | 武汉理工大学 | Knowledge question-answering method and system based on automobile industry map and electronic equipment |
CN113312917A (en) * | 2021-05-28 | 2021-08-27 | 国网江苏省电力有限公司电力科学研究院 | Entity relation extraction method and system based on knowledge reasoning |
CN113220861A (en) * | 2021-06-01 | 2021-08-06 | 中国电子科技集团公司第五十四研究所 | Knowledge map question-answering system for professional field |
CN113220861B (en) * | 2021-06-01 | 2022-09-02 | 中国电子科技集团公司第五十四研究所 | Knowledge map question-answering system for professional field |
CN114168719B (en) * | 2021-06-22 | 2024-08-06 | 北京理工大学 | Interpretable multi-hop question-answering method and system based on knowledge graph embedding |
CN114168719A (en) * | 2021-06-22 | 2022-03-11 | 北京理工大学 | Interpretable multi-hop question-answering method and system based on knowledge graph embedding |
CN113344122A (en) * | 2021-06-29 | 2021-09-03 | 复旦大学 | Operation flow diagnosis method and device and storage medium |
CN113468311A (en) * | 2021-07-20 | 2021-10-01 | 四川启睿克科技有限公司 | Knowledge graph-based complex question and answer method, device and storage medium |
CN113468311B (en) * | 2021-07-20 | 2023-09-19 | 四川启睿克科技有限公司 | Knowledge graph-based complex question and answer method, device and storage medium |
CN113590799A (en) * | 2021-08-16 | 2021-11-02 | 东南大学 | Weak supervision knowledge graph question-answering method based on multi-view reasoning |
CN113792152B (en) * | 2021-08-23 | 2023-07-04 | 南京信息工程大学 | Fusion method of triangular diagram and knowledge graph |
CN113792152A (en) * | 2021-08-23 | 2021-12-14 | 南京信息工程大学 | Method for fusing triangular graph and knowledge graph |
CN113779220A (en) * | 2021-09-13 | 2021-12-10 | 内蒙古工业大学 | Mongolian multi-hop question-answering method based on three-channel cognitive map and graph attention network |
CN113672720A (en) * | 2021-09-14 | 2021-11-19 | 国网天津市电力公司 | Power audit question and answer method based on knowledge graph and semantic similarity |
CN113849613A (en) * | 2021-09-15 | 2021-12-28 | 北方工业大学 | Food safety question-answer reasoning method, system and electronic equipment |
CN114090746A (en) * | 2021-10-09 | 2022-02-25 | 北京百分点科技集团股份有限公司 | Knowledge graph-based answer query method and device and electronic equipment |
CN114090746B (en) * | 2021-10-09 | 2022-12-06 | 北京百分点科技集团股份有限公司 | Knowledge graph-based answer query method and device and electronic equipment |
CN114120166B (en) * | 2021-10-14 | 2023-09-22 | 北京百度网讯科技有限公司 | Video question-answering method and device, electronic equipment and storage medium |
CN114120166A (en) * | 2021-10-14 | 2022-03-01 | 北京百度网讯科技有限公司 | Video question and answer method and device, electronic equipment and storage medium |
CN114186068A (en) * | 2021-11-04 | 2022-03-15 | 国网天津市电力公司 | Audit system basis question-answering method based on multi-level attention network |
CN114153993A (en) * | 2022-02-07 | 2022-03-08 | 杭州远传新业科技有限公司 | Automatic knowledge graph construction method and system for intelligent question answering |
CN114153993B (en) * | 2022-02-07 | 2022-05-06 | 杭州远传新业科技有限公司 | Automatic knowledge graph construction method and system for intelligent question answering |
CN114896407B (en) * | 2022-03-21 | 2024-07-26 | 武汉理工大学 | Question-answering method based on combination of semantic analysis and vector modeling |
CN114896407A (en) * | 2022-03-21 | 2022-08-12 | 武汉理工大学 | Question-answering method based on combination of semantic analysis and vector modeling |
WO2024146427A1 (en) * | 2023-01-04 | 2024-07-11 | 浙江阿里巴巴机器人有限公司 | Intelligent question answering methods and apparatus, device and storage medium |
CN116303976A (en) * | 2023-05-12 | 2023-06-23 | 中国人民解放军国防科技大学 | Penetration test question-answering method, system and medium based on network security knowledge graph |
CN116303976B (en) * | 2023-05-12 | 2023-08-08 | 中国人民解放军国防科技大学 | Penetration test question-answering method, system and medium based on network security knowledge graph |
CN116821712A (en) * | 2023-08-25 | 2023-09-29 | 中电科大数据研究院有限公司 | Semantic matching method and device for unstructured text and knowledge graph |
CN116821712B (en) * | 2023-08-25 | 2023-12-19 | 中电科大数据研究院有限公司 | Semantic matching method and device for unstructured text and knowledge graph |
CN116992007B (en) * | 2023-09-28 | 2023-12-08 | 北京致远互联软件股份有限公司 | Limiting question-answering system based on question intention understanding |
CN116992007A (en) * | 2023-09-28 | 2023-11-03 | 北京致远互联软件股份有限公司 | Limiting question-answering system based on question intention understanding |
CN117290489A (en) * | 2023-11-24 | 2023-12-26 | 烟台云朵软件有限公司 | Method and system for quickly constructing industry question-answer knowledge base |
CN117290489B (en) * | 2023-11-24 | 2024-02-23 | 烟台云朵软件有限公司 | Method and system for quickly constructing industry question-answer knowledge base |
CN117828060A (en) * | 2024-01-03 | 2024-04-05 | 国投人力资源服务有限公司 | Online question-answering method, system and storage medium based on semantic recognition |
CN117891929B (en) * | 2024-03-18 | 2024-05-17 | 南京华飞数据技术有限公司 | Knowledge graph intelligent question-answer information identification method of improved deep learning algorithm |
CN117891929A (en) * | 2024-03-18 | 2024-04-16 | 南京华飞数据技术有限公司 | Knowledge graph intelligent question-answer information identification method of improved deep learning algorithm |
CN118093842A (en) * | 2024-04-25 | 2024-05-28 | 安徽省交通规划设计研究总院股份有限公司 | Optimization method and device of knowledge graph question-answering system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109271506A (en) | A kind of construction method of the field of power communication knowledge mapping question answering system based on deep learning | |
CN110298037B (en) | Convolutional neural network matching text recognition method based on enhanced attention mechanism | |
WO2020000848A1 (en) | Knowledge graph automatic construction method and system for massive unstructured text | |
CN112347268A (en) | Text-enhanced knowledge graph joint representation learning method and device | |
CN109472024A (en) | A kind of file classification method based on bidirectional circulating attention neural network | |
Tang et al. | Multi-label patent categorization with non-local attention-based graph convolutional network | |
CN110222163A (en) | A kind of intelligent answer method and system merging CNN and two-way LSTM | |
CN110059181A (en) | Short text stamp methods, system, device towards extensive classification system | |
CN116127095A (en) | Question-answering method combining sequence model and knowledge graph | |
WO2020010834A1 (en) | Faq question and answer library generalization method, apparatus, and device | |
CN113515632B (en) | Text classification method based on graph path knowledge extraction | |
CN111858896B (en) | Knowledge base question-answering method based on deep learning | |
CN114818717B (en) | Chinese named entity recognition method and system integrating vocabulary and syntax information | |
CN111581364B (en) | Chinese intelligent question-answer short text similarity calculation method oriented to medical field | |
CN115599902B (en) | Oil-gas encyclopedia question-answering method and system based on knowledge graph | |
CN113962219A (en) | Semantic matching method and system for knowledge retrieval and question answering of power transformer | |
CN114707516B (en) | Long text semantic similarity calculation method based on contrast learning | |
CN114238653B (en) | Method for constructing programming education knowledge graph, completing and intelligently asking and answering | |
CN112417170B (en) | Relationship linking method for incomplete knowledge graph | |
CN115840812A (en) | Method and system for intelligently matching enterprises according to policy text | |
CN110851584A (en) | Accurate recommendation system and method for legal provision | |
CN112926325A (en) | Chinese character relation extraction construction method based on BERT neural network | |
CN111582506A (en) | Multi-label learning method based on global and local label relation | |
CN114239730B (en) | Cross-modal retrieval method based on neighbor ordering relation | |
CN114372454B (en) | Text information extraction method, model training method, device and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190125 |