CN110245238B - Graph embedding method and system based on rule reasoning and syntax mode - Google Patents

Graph embedding method and system based on rule reasoning and syntax mode Download PDF

Info

Publication number
CN110245238B
CN110245238B (application CN201910314357.XA)
Authority
CN
China
Prior art keywords
node
link
short-term memory
bilstm
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910314357.XA
Other languages
Chinese (zh)
Other versions
CN110245238A (en)
Inventor
贾雨葶
霍晓阳
傅洛伊
王新兵
严宇辰
田畅达
王睿杰
何小晟
朱元坤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Jiaotong University
Original Assignee
Shanghai Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Jiaotong University filed Critical Shanghai Jiaotong University
Priority to CN201910314357.XA priority Critical patent/CN110245238B/en
Publication of CN110245238A publication Critical patent/CN110245238A/en
Application granted granted Critical
Publication of CN110245238B publication Critical patent/CN110245238B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F 16/30 Information retrieval of unstructured textual data
    • G06F 16/33 Querying
    • G06F 16/332 Query formulation
    • G06F 16/3329 Natural language query formulation or dialogue systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F 16/30 Information retrieval of unstructured textual data
    • G06F 16/36 Creation of semantic tools, e.g. ontology or thesauri
    • G06F 16/367 Ontology
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 Computing arrangements based on biological models
    • G06N 3/02 Neural networks
    • G06N 3/04 Architecture, e.g. interconnection topology
    • G06N 3/045 Combinations of networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Computing Systems (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Software Systems (AREA)
  • Biophysics (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)

Abstract

The invention provides a graph embedding method and system based on rule reasoning and syntactic patterns. The method comprises the following steps. Step 1: extract entities/values from the original question using a conditional random field algorithm combined with a bidirectional long short-term memory network. Step 2: based on the entity/value information from Step 1 and the schema of a domain-specific knowledge graph, detect the possible structures of the query subgraph using a traditional rule reasoning method. Step 3: replace the entities/values in the original question with the class-label results of Step 1. Step 4: based on the structure-detection results of Step 2 and the link/attribute-extraction results of Step 3, form query subgraphs for all original questions using traditional rule reasoning. Step 5: for original questions that cannot form a connected query subgraph, construct a sentence-pattern graph, learn its representation with Node2Vec, and simulate the hidden links in the original question with the output of the bidirectional long short-term memory network. The user can thus quickly convert natural language into a database query statement.

Description

Graph embedding method and system based on rule reasoning and syntax mode
Technical Field
The invention relates to the field of exploratory search over text networks, and in particular to a graph embedding method and system based on rule reasoning and syntactic patterns.
Background
With the increasing popularity of knowledge graph technology, knowledge base question answering (KBQA) systems have attracted more and more attention. Current KBQA technology is imperfect: answer accuracy is low, most knowledge graphs are built as open-domain graphs, and QA systems targeting knowledge graphs in specific academic fields are lacking. Existing methods are insensitive to diversified question phrasings and yield unsatisfactory query results. The present method combines the advantages of rule reasoning with a graph embedding approach to recognize and construct subgraphs for the original question, greatly increasing the system's sensitivity and accuracy.
The first stage of KBQA is question understanding, which aims to convert natural language into a machine-interpretable form such as λ-DCS. A knowledge base query subgraph is first defined that can be mapped directly to λ-DCS, converting question understanding into a question-to-subgraph task.
Node2Vec is a framework for learning continuous feature representations for nodes in a network; it obtains representations by maximally preserving the network neighbourhood of each node in a d-dimensional feature space, and node neighbourhoods are generated by second-order random walks. BiLSTM refers to a bidirectional long short-term memory network, formed by combining a forward LSTM with a backward LSTM; it captures bidirectional semantic dependencies well and is commonly used to model context in natural language processing tasks.
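As a non-limiting illustration, the second-order (Node2Vec-style) biased random walk mentioned above can be sketched in a few lines of Python; the function name, graph layout, and parameter defaults below are assumptions for the example, not part of the patent:

```python
import random

def biased_walk(adj, start, length, p=1.0, q=1.0, seed=0):
    """One second-order biased random walk over an adjacency dict.

    adj: dict mapping node -> list of neighbour nodes.
    p: return parameter (1/p is the weight of stepping back to the previous node).
    q: in-out parameter (q > 1 keeps the walk local, BFS-like).
    """
    rng = random.Random(seed)
    walk = [start]
    while len(walk) < length:
        cur = walk[-1]
        nbrs = adj.get(cur, [])
        if not nbrs:
            break  # dead end
        if len(walk) == 1:
            walk.append(rng.choice(nbrs))
            continue
        prev = walk[-2]
        # Unnormalised transition weights of the second-order walk:
        # 1/p to return, 1 for common neighbours of prev, 1/q otherwise.
        weights = []
        for x in nbrs:
            if x == prev:
                weights.append(1.0 / p)
            elif x in adj.get(prev, []):
                weights.append(1.0)
            else:
                weights.append(1.0 / q)
        walk.append(rng.choices(nbrs, weights=weights, k=1)[0])
    return walk
```

With p < 1 the walk tends to backtrack; with q > 1 it stays near the start node, which is the neighbourhood-preserving behaviour described above.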
The prior art related to the present application is patent document CN108268580A, which discloses a question-answering method and device based on a knowledge graph. The method comprises: acquiring a natural query statement input by a user and identifying, against a knowledge graph, the globally unique identifier (GUID) of each entity in the statement, wherein the knowledge graph comprises attributes and attribute values of entities and the relations among entities; parsing the natural query statement into a syntax tree according to context-free grammar rules and deriving a logic expression from the syntax tree; generating a machine query statement for the knowledge graph from the logic expression and the entity GUIDs; and querying the knowledge graph with the machine query statement and feeding the question-answering result back to the user.
Disclosure of Invention
In view of the defects in the prior art, the invention aims to provide a graph embedding method and system based on rule reasoning and syntactic patterns.
The invention provides a graph embedding method based on rule reasoning and syntax mode, which comprises the following steps:
Step 1: extract entity or value information from the original question Q_f using a conditional random field algorithm and a bidirectional long short-term memory network, forming a class-label result;
Step 2: based on the entity or value information and the schema of the domain-specific knowledge graph, detect the possible structures of the query subgraph G using traditional rule reasoning, forming a structure-detection result;
Step 3: replace the entity or value information in the original question Q_f with the class-label result, then extract link information from the question by applying the conditional random field algorithm and the bidirectional long short-term memory network again, forming a link-extraction result;
Step 4: based on the structure-detection result and the link-extraction result, form a query subgraph G for the original question Q_f using traditional rule reasoning and sentence-pattern-graph embedding;
Step 5: for original questions Q_f that cannot form a query subgraph, construct a sentence-pattern graph SSG, learn a representation of the SSG using Node2Vec, and simulate the hidden link e_h in the original question Q_f with the output of the bidirectional long short-term memory network BiLSTM.
Preferably, the conditional random field algorithm is a typical discriminative model obtained by adding observations on the basis of a Markov random field;
the bidirectional long short-term memory network is formed by combining a forward LSTM with a backward LSTM and can capture bidirectional semantic dependencies.
Preferably, the structure detection checks, while searching the spatial structure, the structure of each link or attribute to see whether the extracted link or attribute is connected to only one tag; if so, the link or attribute is set as the connection between a known node and the final unknown node.
Preferably, the entity or value information extracted from the original question is replaced with the class-label result, so as to enhance the entity/value extraction result.
Preferably, the step 5 comprises:
Step 5.1: based on the entity extraction, replace each entity with its class type or value type, obtaining an enhanced question;
Step 5.2: convert each token in the enhanced question into a node and add an edge between each pair of adjacent nodes, constructing a sentence-pattern graph;
Step 5.3: embed the sentence-pattern graph SSG using Node2Vec, and take the distance between the embedded vectors of two connected class-type or value-type nodes as the embedding result of the link between them; the distance between the embedded vector of the unknown node and that of the node connected to it is:

e_h = h_unknown - h_connected

where e_h is the embedded vector of the hidden link in the original question, h_unknown is the embedding of the unknown node, and h_connected is the embedding of a node connected to the unknown node;
Step 5.4: after embedding, convert the word sequence W_1, W_2, ..., W_n of the enhanced question Q_e into the node sequence N_1, N_2, ..., N_n in the SSG as the input to the BiLSTM; the BiLSTM finds the hidden link e_h, and its output is the predicted embedding of e_h, expressed as:

e_predict = BiLSTM(Q_e)

where W denotes a word, the subscript n denotes the sequence index, BiLSTM denotes the bidirectional long short-term memory network, e_h is the embedded vector of the hidden link in the original question, and e_predict is the predicted embedding of e_h;
Step 5.5: optimize the parameters θ of the BiLSTM so that the distance between e_predict and e_h is minimal:

θ = argmin_θ ||e_predict - e_h||

For the output e_predict of a question Q in the test set, the entity-class set extracted from Q is v_set = {v_1, v_2, ..., v_n}, where n is the node index; for each class-type or value-type node V_j around a node in v_set, define a similarity score Similarity as follows:

Similarity(e_predict, e_V(i), V_j) = -α * θ(e_predict, e_V(i), V_j) - β * ||e_predict - e_V(i), V_j||

Select the V_j and corresponding link with the largest similarity score according to the calculation results;

where e_V(i) denotes the vector value of the edge connecting the i-th of the surrounding nodes to the current node, and α and β denote optimization parameters;
Step 5.6: merge all query subgraphs G formed at the different stages.
The invention also provides a graph embedding system based on rule reasoning and syntactic patterns, comprising:
Module 1: extracts entity or value information from the original question Q_f using a conditional random field algorithm and a bidirectional long short-term memory network, forming a class-label result;
Module 2: based on the entity or value information and the schema of the domain-specific knowledge graph, detects the possible structures of the query subgraph G using traditional rule reasoning, forming a structure-detection result;
Module 3: replaces the entity or value information in the original question Q_f with the class-label result, then extracts link information from the question by applying the conditional random field algorithm and the bidirectional long short-term memory network again, forming a link-extraction result;
Module 4: based on the structure-detection result and the link-extraction result, forms a query subgraph G for the original question Q_f using traditional rule reasoning and sentence-pattern-graph embedding;
Module 5: for original questions Q_f that cannot form a query subgraph, constructs a sentence-pattern graph SSG, learns a representation of the SSG using Node2Vec, and simulates the hidden link e_h in the original question Q_f with the output of the bidirectional long short-term memory network BiLSTM.
Preferably, the conditional random field algorithm is a typical discriminative model obtained by adding observations on the basis of a Markov random field;
the bidirectional long short-term memory network is formed by combining a forward LSTM with a backward LSTM and can capture bidirectional semantic dependencies.
Preferably, the structure detection checks, while searching the spatial structure, the structure of each link or attribute to see whether the extracted link or attribute is connected to only one tag; if so, the link or attribute is set as the connection between a known node and the final unknown node.
Preferably, the entity or value information extracted from the original question is replaced with the class-label result, so as to enhance the entity/value extraction result.
Preferably, said module 5 comprises:
Module 5.1: based on the entity extraction, replaces each entity with its class type or value type, obtaining an enhanced question;
Module 5.2: converts each token in the enhanced question into a node and adds an edge between each pair of adjacent nodes, constructing a sentence-pattern graph;
Module 5.3: embeds the sentence-pattern graph SSG using Node2Vec, and takes the distance between the embedded vectors of two connected class-type or value-type nodes as the embedding result of the link between them; the distance between the embedded vector of the unknown node and that of the node connected to it is:

e_h = h_unknown - h_connected

where e_h is the embedded vector of the hidden link in the original question, h_unknown is the embedding of the unknown node, and h_connected is the embedding of a node connected to the unknown node;
Module 5.4: after embedding, converts the word sequence W_1, W_2, ..., W_n of the enhanced question Q_e into the node sequence N_1, N_2, ..., N_n in the SSG as the input to the BiLSTM; the BiLSTM finds the hidden link e_h, and its output is the predicted embedding of e_h, expressed as:

e_predict = BiLSTM(Q_e)

where W denotes a word, the subscript n denotes the sequence index, BiLSTM denotes the bidirectional long short-term memory network, e_h is the embedded vector of the hidden link in the original question, and e_predict is the predicted embedding of e_h;
Module 5.5: optimizes the parameters θ of the BiLSTM so that the distance between e_predict and e_h is minimal:

θ = argmin_θ ||e_predict - e_h||

For the output e_predict of a question Q in the test set, the entity-class set extracted from Q is v_set = {v_1, v_2, ..., v_n}, where n is the node index; for each class-type or value-type node V_j around a node in v_set, a similarity score Similarity is defined as follows:

Similarity(e_predict, e_V(i), V_j) = -α * θ(e_predict, e_V(i), V_j) - β * ||e_predict - e_V(i), V_j||

The V_j and corresponding link with the largest similarity score are selected according to the calculation results;

where e_V(i) denotes the vector value of the edge connecting the i-th of the surrounding nodes to the current node, and α and β denote optimization parameters;
Module 5.6: merges all query subgraphs G formed at the different stages.
Compared with the prior art, the invention has the following beneficial effects:
1. RI-SSGE focuses on the structural detection of questions, avoiding the poor generalization of other models that rely on templates for various domain-specific knowledge graphs.
2. Through network representation learning, a new sentence-schema graph (SSG) is proposed, aiming to mimic human thinking.
Drawings
Other features, objects and advantages of the invention will become more apparent upon reading of the detailed description of non-limiting embodiments with reference to the following drawings:
FIG. 1 shows two extensions from G to G*;
FIG. 2 shows three types of query subgraphs G and the corresponding isomers;
FIG. 3 is a diagram of a portion of schemas and their structural information according to the present invention;
FIG. 4 is the RI-SSGE framework.
Fig. 5 is a structure classification.
FIG. 6 is a SSG build process.
Detailed Description
The present invention will be described in detail with reference to specific examples. The following examples will assist those skilled in the art in further understanding the invention, but are not intended to limit the invention in any way. It should be noted that various changes and modifications can be made by those skilled in the art without departing from the spirit of the invention, all of which fall within the scope of the present invention.
The invention implements, through a graph-embedding framework based on rule inference and syntactic patterns, the locating and extraction of entities in natural language, the judgment of query-subgraph structure based on inference rules, the conversion of sentence entities to category labels, and the construction of sentence-pattern graphs; specifically, the method comprises the following steps:
Step S1: extract entities/values from the original question Q_f using the conditional random field algorithm combined with a bidirectional long short-term memory network (CRF-BiLSTM);
Step S2: detect the possible structures of the query subgraph G using a traditional rule reasoning method, based on the entity/value information from the first step and the schema of the domain-specific knowledge graph, where Q_e refers to the enhanced question and graph-SSG refers to the sentence-pattern graph;
Step S3: replace the entities/values in the original question Q_f with the class-label results of the first step;
Step S4: based on the structure-detection results of the second step and the link/attribute-extraction results of the third step, RI-SSGE attempts to form G for all Q_f using traditional rule inference, where RI-SSGE refers to rule inference and sentence-pattern-graph embedding, Q_f refers to the original question, and G refers to the query subgraph.
Step S5: for Q_f that cannot form a query subgraph, construct a sentence-pattern graph (SSG) and learn its representation using Node2Vec. The representation of the SSG, together with the output of the BiLSTM, simulates the hidden link e_h in Q_f. Here Q_f refers to the original question. Node2Vec is a framework for learning continuous feature representations for nodes in a network, obtaining representations by maximally preserving each node's network neighbourhood in a d-dimensional feature space; node neighbourhoods are generated by second-order random walks. BiLSTM refers to the bidirectional long short-term memory network, formed by combining a forward LSTM with a backward LSTM, which captures bidirectional semantic dependencies well and is commonly used to model context in natural language processing tasks.
The step S1 includes:
Step S1.1: use CRF-BiLSTM to extract the entities/values from the original question Q_f. The CRF is a typical discriminative model obtained by adding observations (features) on the basis of a Markov random field, with good applications in word segmentation, part-of-speech tagging, named entity recognition, and related fields.
The step S2 includes:
Step S2.1: for questions from which links/attributes were successfully extracted in the first step, check the structure of each link/attribute while searching the spatial structure to see whether an extracted link/attribute is connected to only one tag. If so, this link/attribute may be the connection between a known node and the final unknown node.
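The check in step S2.1 can be sketched as follows; the function name and the representation of links as unordered label pairs are assumptions made for illustration only:

```python
def links_touching_one_tag(links, known_tags):
    """Return the links whose endpoints include exactly one known class/value tag.

    links: iterable of (label_a, label_b) pairs from the schema.
    Such a link is a candidate connection between a known node and
    the final unknown node of the query subgraph.
    """
    known = set(known_tags)
    return [(a, b) for a, b in links
            if (a in known) != (b in known)]  # XOR: exactly one endpoint known
```

A link whose two endpoints are both known (or both unknown) is skipped, since it cannot by itself fix the position of the unknown node.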
Take a question with three detected nodes as an example. According to the class/value-type labels of the three nodes and the precision of the first step, the query subgraph of the question is classified into one of three structures, which determine how the three known nodes connect to the unknown node. A question with three known class/value labels falls into one of the three structures below.
Structure 1: none of the three class labels is connected to the other two; the remaining unknown node must connect the three known nodes to form a connected subgraph G. If the rule inference method and the schema can uniquely determine the class of the unknown node, G is determined at this stage.
Structure 2: there is a link between two tags, and the remaining tag is not linked to those two in the schema. The unknown node must connect the isolated node with one of the two connected nodes. Such a Q_e proceeds to the next stage.
Structure 3: there are two links among the three tags; the remaining unknown node can be connected to any of the three nodes. It also proceeds to the next stage.
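The three-way classification above can be sketched by counting how many schema links join the three known tags; the function name and the unordered-pair link representation are illustrative assumptions:

```python
from itertools import combinations

def classify_structure(tags, schema_links):
    """Classify a three-tag question by the number of schema links among its tags.

    0 links -> structure 1 (unknown node must connect all three tags),
    1 link  -> structure 2 (unknown node joins the isolated tag to the pair),
    2 links -> structure 3 (chain; unknown node may attach to any node).
    """
    links = {frozenset(l) for l in schema_links}
    n = sum(1 for pair in combinations(tags, 2) if frozenset(pair) in links)
    return min(n, 2) + 1
```

frozenset makes the link direction irrelevant, matching the undirected connectivity check described in the text.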
The step S3 includes:
Step S3.1: replace the entities/values in the original question Q_f with the class-label results of the first step to enhance Q and improve the CRF-BiLSTM link/attribute-extraction results. Here CRF-BiLSTM refers to the conditional random field algorithm combined with a bidirectional long short-term memory network: the CRF is a typical discriminative model obtained by adding observations (features) on the basis of a Markov random field, with good applications in word segmentation, part-of-speech tagging, and named entity recognition; the BiLSTM is a bidirectional long short-term memory network formed by combining a forward LSTM with a backward LSTM, which captures bidirectional semantic dependencies well and is commonly used to model context in natural language processing tasks.
The step S4 includes:
Step S4.1: for the questions from which links/attributes were successfully extracted in the second and third steps, check the structure of each link/attribute while searching the spatial structure to see whether an extracted link/attribute is connected to only one tag. If so, this link/attribute may be the connection between a known node and the final unknown node.
The step S5 includes:
Step S5.1: based on the CRF-BiLSTM entity extraction, replace each entity with its class/value type, enhancing Q in the same way as in link extraction, where CRF-BiLSTM is the conditional random field algorithm combined with the bidirectional long short-term memory network as defined above.
Step S5.2: convert each token (word/class/value type) in Q_e into a node and add an edge e between each pair of adjacent nodes, e.g. (word1, word2), (word1, class1), (value1, word1), to construct a new graph-SSG; Q_e refers to the enhanced question, and graph-SSG refers to the sentence-pattern graph.
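Under stated assumptions (tokens already enhanced, repeated tokens sharing a node, no self-loops), the SSG construction of step S5.2 can be sketched as:

```python
def build_ssg(tokens):
    """Build a sentence-pattern graph (SSG): one node per distinct token of the
    enhanced question, one undirected edge between each pair of adjacent tokens.
    Returns (nodes, edges); edges are frozensets so direction does not matter."""
    nodes = list(dict.fromkeys(tokens))  # unique, order-preserving
    edges = {frozenset((a, b)) for a, b in zip(tokens, tokens[1:]) if a != b}
    return nodes, edges
```

The resulting graph is what Node2Vec embeds in the next step.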
Step S5.3: node2Vec is used to embed SSGs. The distance between the embedded vectors of two connected class/value type nodes is taken as the embedding result of the link between them. QeThe embedded vector of the medium hidden link is ehAnd is a label of BilSTM. From this, an unknown node (h) can be calculatedunknow1) And connecting to the unknown node (h)connecte) The distance between the embedded vectors of the nodes of (1), i.e.:
eh=hunknow1-hconnecte
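As a toy numerical illustration of the formula above (the vector values are invented for the example; in practice they come from the Node2Vec embedding of the SSG):

```python
import numpy as np

# Hypothetical 3-dimensional embeddings of the unknown node and of a
# node connected to it.
h_unknown = np.array([0.9, 0.1, 0.4])
h_connected = np.array([0.2, 0.5, 0.4])

# The hidden-link embedding is the difference of the two node embeddings.
e_h = h_unknown - h_connected
```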
Step S5.4: after embedding, convert the word sequence W_1, W_2, ..., W_n of the enhanced question Q_e into the node sequence N_1, N_2, ..., N_n in the SSG as the input to the BiLSTM. The BiLSTM finds the hidden link e_h; its output can be regarded as the predicted embedding of e_h:

e_predict = BiLSTM(Q_e)

where Q_e refers to the enhanced question, W denotes a word, and BiLSTM denotes the bidirectional long short-term memory network, formed by combining a forward LSTM with a backward LSTM, which captures bidirectional semantic dependencies well and is commonly used to model context in natural language processing tasks; e_h is the embedded vector of the hidden link in the original question, and e_predict is its predicted embedding.
Step S5.5: optimize the parameters θ of the BiLSTM so that the distance between e_predict and e_h is minimal:

θ = argmin_θ ||e_predict - e_h||

For the output e_predict of a question Q in the test set, the entity-class set extracted from Q is v_set = {v_1, v_2, ..., v_n}. For each class/value-type node around a node in v_set, define a similarity score Similarity:

Similarity(e_predict, e_V(i), V_j) = -α * θ(e_predict, e_V(i), V_j) - β * ||e_predict - e_V(i), V_j||

Based on the results, select the V_j and the corresponding link with the largest similarity value. Here BiLSTM refers to the bidirectional long short-term memory network, formed by combining a forward LSTM with a backward LSTM, which captures bidirectional semantic dependencies well and is commonly used to model context in natural language processing tasks; Q refers to the question; α and β are hyper-parameters acting as weights; θ on the left of the first equation denotes the final optimized BiLSTM parameters, and argmin_θ means θ is taken as the solution minimizing the expression; ||e_predict - e_h|| is the distance between the link vector predicted by the model and the actual link vector; e_predict is the link vector predicted by the model; e_h is the actual link vector; e_V(i) is the vector value of the edge connecting the i-th surrounding node to the current node; V_j denotes the current node j together with the set of nodes around it; and ||e_predict - e_V(i), V_j|| is the distance between the predicted link vector at node j and the edge vector of the i-th node around j.
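The similarity score can be sketched numerically as below. Note two assumptions not spelled out in the text: θ(·,·) inside the score is interpreted as the angle between the two vectors (the patent reuses the symbol θ for the BiLSTM parameters), and the function name and defaults are illustrative:

```python
import numpy as np

def similarity(e_predict, e_edge, alpha=1.0, beta=1.0):
    """Similarity = -alpha * angle(e_predict, e_edge) - beta * ||e_predict - e_edge||.

    e_predict: link embedding predicted by the BiLSTM.
    e_edge:    embedding of a candidate edge e_V(i) around node V_j.
    Larger (less negative) scores mean a closer match; the best-scoring
    candidate edge is selected as the hidden link.
    """
    cos = np.dot(e_predict, e_edge) / (
        np.linalg.norm(e_predict) * np.linalg.norm(e_edge))
    angle = np.arccos(np.clip(cos, -1.0, 1.0))  # clip guards rounding errors
    dist = np.linalg.norm(e_predict - e_edge)
    return -alpha * angle - beta * dist
```

An identical candidate scores 0 (the maximum); both a larger angle and a larger Euclidean distance push the score down.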
Step S5.6: combine all G formed at the different stages, where G refers to the query subgraph.
In a specific implementation, as shown in FIG. 1, the question is: which papers are published by Professor Pascal of the University of Michigan at the top-level conference AAAI 2017. The question is represented as a subgraph, where (a) Operation nodes represent operations on unknown nodes (in this question, counting the total number of papers), and (b) Multi-unknown nodes represent additional unknown nodes (in this question, finding the collaborators of Professor Pascal on the unknown papers).
The relationship between the query graph and chemical structure is shown in FIG. 2, which shows three isomers of pentane: N-pentane (n-pentane), Iso-pentane (isopentane), and Neo-pentane (neopentane).
Fig. 3 shows an abstracted skeleton diagram of the knowledge graph ("Part of schema"); the skeleton shows only a partial structure of the entire knowledge graph, Paper-Author-Field, while the bottom half of the knowledge base is shown as an exact data example.
Fig. 4 is a flow chart of the algorithm framework of the whole application, showing the entire process of converting an original natural-language question into a question subgraph.
As shown in FIG. 5, the possible cases of connecting edges among three nodes are classified: Case 1, Case 2, and Case 3 respectively represent three separate nodes, two connected nodes, and three connected nodes.
As shown in FIG. 6, the process of constructing the SSG graph, the main innovation of the invention, uses the neural network model to predict the key link author_is_in_Field, so as to construct an accurate question subgraph from the non-canonical original natural-language input of the user, such as "Where is author in Field" (which field does the author study).
The invention focuses mainly on the factual question-to-subgraph task (Q_f, G) and shows that more complex questions can easily be solved on this basis. A framework based on rule inference and sentence-schema-graph embedding (RI-SSGE) is proposed to solve the (Q_f, G) task. Inspired by isomeric structures in chemistry, RI-SSGE focuses on the structural detection of questions to avoid the poor generalization of other models based on templates for various domain-specific knowledge graphs. To address error propagation, RI-SSGE creatively combines the traditional rule-reasoning method with a graph-representation method, ensuring the performance of the whole framework. Observing that a person can mine hidden relations by combining the question with the knowledge-graph structure, a new sentence-schema graph (SSG) is proposed in the final network-representation-learning phase of RI-SSGE, aiming to mimic the way a person thinks. In addition to RI-SSGE, a new dataset named AceQG is introduced, containing 133,143 (factual question, subgraph) pairs over an open academic knowledge graph named AceKG. Experiments on the AceQG and Geoquery-880 datasets show that RI-SSGE outperforms other algorithms.
The foregoing description of specific embodiments of the present invention has been presented. It is to be understood that the present invention is not limited to the specific embodiments described above, and that various changes or modifications may be made by one skilled in the art within the scope of the appended claims without departing from the spirit of the invention. The embodiments and features of the embodiments of the present application may be combined with each other arbitrarily without conflict.

Claims (8)

1. A graph embedding method based on rule reasoning and syntactic schema, characterized by comprising the following steps:
step 1: extracting entity information or value information from the original question Q_f by using a conditional random field algorithm and a bidirectional long short-term memory network, to form a class label result;
step 2: based on the entity information or value information and the schema of the domain-specific knowledge graph, adopting the structure detection of traditional rule reasoning to obtain the possible structure of the query subgraph G, forming a structure detection result;
step 3: after replacement in the original question Q_f, extracting link information from the original question by using the conditional random field algorithm and the bidirectional long short-term memory network again, to form a link extraction result;
step 4: based on the structure detection result and the link extraction result, applying traditional rule reasoning and sentence-schema graph embedding to the original question Q_f to form a query subgraph G;
step 5: for the part of the original question Q_f that cannot form a query subgraph, constructing a sentence-schema graph SSG, learning a representation of the SSG by using Node2Vec, and simulating the hidden link in the original question Q_f with the output of the bidirectional long short-term memory network BiLSTM;
the step 5 comprises the following steps:
step 5.1: replacing the entity with its class type or value type based on the entity extraction, to obtain an enhanced question;
step 5.2: converting each token in the enhanced question into a node and adding an edge between adjacent nodes, to construct the sentence-schema graph;
step 5.3: embedding the sentence-schema graph SSG by using Node2Vec, and recording the distance between the embedded vectors of two connected class-type or value-type nodes as the embedding of the link between the nodes:
e_h = h_unknown - h_connected
where e_h refers to the embedded vector of the hidden link in the original question, h_unknown is the embedding of the unknown node, and h_connected is the embedding of a node connected to the unknown node;
step 5.4: after embedding, converting the word sequence W_1, W_2, ..., W_n of the enhanced question Q_e into the node sequence N_1, N_2, ..., N_n in the SSG as input to the BiLSTM, and finding the hidden e_h through the BiLSTM; the output of the BiLSTM is the predicted embedding of e_h, expressed as:
e_predict = BiLSTM(Q_e)
where W denotes a word, the subscript n denotes a serial number, BiLSTM denotes the bidirectional long short-term memory network, e_h refers to the embedded vector of the hidden link in the original question, and e_predict is the predicted embedding of e_h;
step 5.5: optimizing the parameters θ of the BiLSTM so that the absolute value of the distance between e_predict and e_h is minimized:
θ* = argmin_θ |e_predict - e_h|
For the output e_predict of a question Q in the test set, denote the extracted entity class set of the question Q as v_set = {V_1, V_2, ..., V_n}, where n is the serial number of the node; for each class-type or value-type node around a node in v_set, a similarity score Similarity is defined as follows:
Similarity(e_predict, e_V(i),Vj) = -α * θ(e_predict, e_V(i),Vj) - β * ||e_predict - e_V(i),Vj||
selecting, according to the calculation result of the similarity score, the node V_j and the corresponding link with the largest similarity score;
where e_V(i),Vj represents the vector value of the edge connecting node V_j and the i-th node around it; α and β respectively denote optimization parameters;
step 5.6: combining all the query subgraphs G formed at the different stages.
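A minimal numeric sketch of the link embedding of step 5.3 and the similarity score of step 5.5, assuming toy two-dimensional vectors in place of real Node2Vec output; the candidate links, α and β are hypothetical:

```python
import numpy as np

def link_embedding(h_unknown, h_connected):
    # Step 5.3: e_h = h_unknown - h_connected
    return h_unknown - h_connected

def angle(u, v):
    # θ(u, v): angle between two embedding vectors, in radians.
    cos = np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))
    return np.arccos(np.clip(cos, -1.0, 1.0))

def similarity(e_predict, e_candidate, alpha=1.0, beta=1.0):
    # Similarity = -α·θ(e_predict, e_cand) - β·||e_predict - e_cand||
    return (-alpha * angle(e_predict, e_candidate)
            - beta * np.linalg.norm(e_predict - e_candidate))

# Choose the candidate link with the largest similarity score.
e_predict = np.array([1.0, 0.0])
candidates = {"author_is_in_field": np.array([0.9, 0.1]),
              "author_wrote_paper": np.array([-0.5, 0.8])}
best = max(candidates, key=lambda k: similarity(e_predict, candidates[k]))
print(best)  # author_is_in_field
```

Because both the angle term and the distance term enter with negative weights, the score is largest for the candidate embedding closest in both direction and magnitude to e_predict.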
2. The method of claim 1, wherein the conditional random field algorithm is a classical discriminant model based on Markov random fields with added observed values;
the bidirectional long short-term memory network is formed by combining a forward long short-term memory network and a backward long short-term memory network, and can capture bidirectional semantic dependencies.
3. The graph embedding method based on rule reasoning and syntactic schema as claimed in claim 1, wherein the structure detection checks the structure of each link or attribute in the search space to see whether the extracted link or attribute is connected to only one tag; if so, the link or attribute is set as the connection between the known node and the final unknown node.
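The connectivity check just described can be sketched as follows, under the assumption of a simple dictionary-based schema representation (the field names and tag set are illustrative):

```python
# Hedged sketch of the claim-3 check: a schema link connected to exactly one
# extracted tag becomes the edge between that known node and the final
# unknown node.

def classify_links(schema_links, extracted_tags):
    edges = []
    for link in schema_links:
        known = [end for end in link["ends"] if end in extracted_tags]
        if len(known) == 1:  # connected to only one extracted tag
            edges.append((known[0], link["name"], "UNKNOWN"))
    return edges

links = [{"name": "author_is_in_field", "ends": ["Author", "Field"]},
         {"name": "paper_cites_paper", "ends": ["Paper", "Paper"]}]
print(classify_links(links, {"Author"}))
# [('Author', 'author_is_in_field', 'UNKNOWN')]
```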
4. The graph embedding method based on rule reasoning and syntactic schema as claimed in claim 1, wherein the replacement in the original question uses the class label result to replace the extracted entity information or value information, thereby enhancing the extraction result of the entity information or value information.
5. A graph embedding system based on rule reasoning and syntactic schema, characterized by comprising:
module 1: extracting entity information or value information from the original question Q_f by using a conditional random field algorithm and a bidirectional long short-term memory network, to form a class label result;
module 2: based on the entity information or value information and the schema of the domain-specific knowledge graph, adopting the structure detection of traditional rule reasoning to obtain the possible structure of the query subgraph G, forming a structure detection result;
module 3: after replacement in the original question Q_f, extracting link information from the original question by using the conditional random field algorithm and the bidirectional long short-term memory network again, to form a link extraction result;
module 4: based on the structure detection result and the link extraction result, applying traditional rule reasoning and sentence-schema graph embedding to the original question Q_f to form a query subgraph G;
module 5: for the part of the original question Q_f that cannot form a query subgraph, constructing a sentence-schema graph SSG, learning a representation of the SSG by using Node2Vec, and simulating the hidden link in the original question Q_f with the output of the bidirectional long short-term memory network BiLSTM;
the module 5 comprises:
module 5.1: replacing the entity with its class type or value type based on the entity extraction, to obtain an enhanced question;
module 5.2: converting each token in the enhanced question into a node and adding an edge between adjacent nodes, to construct the sentence-schema graph;
module 5.3: embedding the sentence-schema graph SSG by using Node2Vec, and recording the distance between the embedded vectors of two connected class-type or value-type nodes as the embedding of the link between the nodes:
e_h = h_unknown - h_connected
where e_h refers to the embedded vector of the hidden link in the original question, h_unknown is the embedding of the unknown node, and h_connected is the embedding of a node connected to the unknown node;
module 5.4: after embedding, converting the word sequence W_1, W_2, ..., W_n of the enhanced question Q_e into the node sequence N_1, N_2, ..., N_n in the SSG as input to the BiLSTM, and finding the hidden e_h through the BiLSTM; the output of the BiLSTM is the predicted embedding of e_h, expressed as:
e_predict = BiLSTM(Q_e)
where W denotes a word, the subscript n denotes a serial number, BiLSTM denotes the bidirectional long short-term memory network, e_h refers to the embedded vector of the hidden link in the original question, and e_predict is the predicted embedding of e_h;
module 5.5: optimizing the parameters θ of the BiLSTM so that the absolute value of the distance between e_predict and e_h is minimized:
θ* = argmin_θ |e_predict - e_h|
For the output e_predict of a question Q in the test set, denote the extracted entity class set of the question Q as v_set = {V_1, V_2, ..., V_n}, where n is the serial number of the node; for each class-type or value-type node around a node in v_set, a similarity score Similarity is defined as follows:
Similarity(e_predict, e_V(i),Vj) = -α * θ(e_predict, e_V(i),Vj) - β * ||e_predict - e_V(i),Vj||
selecting, according to the calculation result of the similarity score, the node V_j and the corresponding link with the largest similarity score;
where e_V(i),Vj represents the vector value of the edge connecting node V_j and the i-th node around it; α and β respectively denote optimization parameters;
module 5.6: combining all the query subgraphs G formed at the different stages.
6. The graph embedding system based on rule reasoning and syntactic schema according to claim 5, wherein the conditional random field algorithm is a classical discriminant model based on Markov random fields with added observed values;
the bidirectional long short-term memory network is formed by combining a forward long short-term memory network and a backward long short-term memory network, and can capture bidirectional semantic dependencies.
7. The graph embedding system based on rule reasoning and syntactic schema as claimed in claim 5, wherein the structure detection checks the structure of each link or attribute in the search space to see whether the extracted link or attribute is connected to only one tag; if so, the link or attribute is set as the connection between the known node and the final unknown node.
8. The graph embedding system based on rule reasoning and syntactic schema as claimed in claim 5, wherein the replacement in the original question uses the class label result to replace the extracted entity information or value information, thereby enhancing the extraction result of the entity information or value information.
CN201910314357.XA 2019-04-18 2019-04-18 Graph embedding method and system based on rule reasoning and syntax mode Active CN110245238B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910314357.XA CN110245238B (en) 2019-04-18 2019-04-18 Graph embedding method and system based on rule reasoning and syntax mode

Publications (2)

Publication Number Publication Date
CN110245238A CN110245238A (en) 2019-09-17
CN110245238B true CN110245238B (en) 2021-08-17

Family

ID=67883192

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910314357.XA Active CN110245238B (en) 2019-04-18 2019-04-18 Graph embedding method and system based on rule reasoning and syntax mode

Country Status (1)

Country Link
CN (1) CN110245238B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111046667B (en) * 2019-11-14 2024-02-06 深圳市优必选科技股份有限公司 Statement identification method, statement identification device and intelligent equipment
CN110866190B (en) * 2019-11-18 2021-05-14 支付宝(杭州)信息技术有限公司 Method and device for training neural network model for representing knowledge graph
CN111444348A (en) * 2020-02-27 2020-07-24 上海交通大学 Method, system and medium for constructing and applying knowledge graph architecture
CN111651613B (en) * 2020-07-08 2021-07-27 海南大学 Knowledge graph embedding-based dynamic recommendation method and system
CN112015868B (en) * 2020-09-07 2022-08-26 重庆邮电大学 Question-answering method based on knowledge graph completion
CN113255346B (en) * 2021-07-01 2021-09-14 湖南工商大学 Address element identification method based on graph embedding and CRF knowledge integration
WO2024073338A1 (en) * 2022-09-29 2024-04-04 Visa International Service Association Entity linking using subgraph matching

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105243064A (en) * 2014-06-23 2016-01-13 华为技术有限公司 Subgraph matching method and device
CN107330100A (en) * 2017-07-06 2017-11-07 北京大学深圳研究生院 Combine the two-way search method of image text of embedded space based on multi views
CN107491500A (en) * 2017-07-28 2017-12-19 中国人民大学 A kind of knowledge base complementing method of strong adaptability
CN107704637A (en) * 2017-11-20 2018-02-16 中国人民解放军国防科技大学 Knowledge graph construction method for emergency
CN108052547A (en) * 2017-11-27 2018-05-18 华中科技大学 Natural language question-answering method and system based on question sentence and knowledge graph structural analysis
CN108229582A (en) * 2018-02-01 2018-06-29 浙江大学 Entity recognition dual training method is named in a kind of multitask towards medical domain
CN108694469A (en) * 2018-06-08 2018-10-23 哈尔滨工程大学 A kind of Relationship Prediction method of knowledge based collection of illustrative plates
US10157226B1 (en) * 2018-01-16 2018-12-18 Accenture Global Solutions Limited Predicting links in knowledge graphs using ontological knowledge
CN109086356A (en) * 2018-07-18 2018-12-25 哈尔滨工业大学 The incorrect link relationship diagnosis of extensive knowledge mapping and modification method

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101306667B1 (en) * 2009-12-09 2013-09-10 한국전자통신연구원 Apparatus and method for knowledge graph stabilization
KR20140145018A (en) * 2013-06-12 2014-12-22 한국전자통신연구원 Knowledge index system and method thereof
US10984045B2 (en) * 2017-05-24 2021-04-20 International Business Machines Corporation Neural bit embeddings for graphs

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
Probabilistic Latent Document Network Embedding; Tuan M.V. Le et al.; 2014 IEEE International Conference on Data Mining; 20141217; pp. 270-279 *
Research on Intelligent Aided Design Platform Technology Based on BIM; Li Zhijie; China Doctoral Dissertations Full-text Database, Information Science and Technology; 20180715 (No. 07, 2018); pp. I138-81 *
Research on Graph Matching Algorithms Based on Edit-Distance Graph Embedding; Liu Yongqiang; China Masters' Theses Full-text Database, Information Science and Technology; 20160115 (No. 01, 2016); pp. I138-919 *
Structural Network Embedding Based on Recursive Projection; Meng Yawen et al.; Application Research of Computers; 20190129; Vol. 37, No. 3; pp. 659-661 *
Research Scope and Development Trends of Cyberspace Big Search; Fang Binxing et al.; Journal on Communications; 20151225; Vol. 36, No. 12; pp. 1-8 *

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant