CN104915448B - A kind of entity based on level convolutional network and paragraph link method - Google Patents


Info

Publication number
CN104915448B
CN104915448B (application number CN201510372795.3A)
Authority
CN
China
Prior art keywords
paragraph
entity
vectorization
sentence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510372795.3A
Other languages
Chinese (zh)
Other versions
CN104915448A (en)
Inventor
包红云
郑孙聪
许家铭
齐振宇
徐博
郝红卫
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Automation of Chinese Academy of Science
Original Assignee
Institute of Automation of Chinese Academy of Science
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Automation of Chinese Academy of Science filed Critical Institute of Automation of Chinese Academy of Science
Priority to CN201510372795.3A priority Critical patent/CN104915448B/en
Publication of CN104915448A publication Critical patent/CN104915448A/en
Application granted granted Critical
Publication of CN104915448B publication Critical patent/CN104915448B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical


Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2458 Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Fuzzy Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Databases & Information Systems (AREA)
  • Machine Translation (AREA)

Abstract

An entity and paragraph linking method based on a hierarchical convolutional network, including: converting word vectorization representations into sentence vectorization representations using a convolutional neural network; passing the sentence vectorization representations through a convolutional neural network again, taking the sentence order information into account, to obtain a paragraph vectorization representation; outputting the sentence and paragraph vectorization representations through Softmax, and training the convolutional neural network model with existing entities as supervision information; meanwhile, considering the pair-wise similarity information between the paragraph semantic vector features and the entity semantic vector features to further improve the training of the convolutional neural network model; and, given a test description paragraph, performing deep semantic feature extraction with the trained neural network model to obtain a vectorization representation of the test paragraph, which can then be directly linked to the target entity through the Softmax output based on this semantic representation.

Description

Entity and paragraph linking method based on hierarchical convolutional network
Technical Field
The invention relates to the technical field of knowledge base construction, in particular to an entity and paragraph linking method based on a hierarchical convolutional network.
Background
Today, widely used large-scale knowledge bases include Freebase, WordNet, YAGO, and the like. They all aim to build a global knowledge base and allow machines to access and obtain structured public information more conveniently. At the same time, these knowledge bases provide Application Programming Interfaces (APIs) so that people can query richer information about related entities. For example, when we retrieve the city name "Washington D.C." in the YAGO database, the returned results are as shown in Table 1 below:
TABLE 1
It can be seen that the returned information is highly structured organizational information. However, such structured information does not match the actual context and semantic information through which people understand an entity. Unlike the YAGO database, Freebase and WordNet return structured information and additionally return descriptive paragraphs related to the searched entity, as shown in Table 2 below:
TABLE 2
It can be seen that the descriptive paragraphs shown in Table 2 are more useful for the user to understand the specific context and semantic information of the queried entity words. However, the descriptive paragraph information in Freebase and WordNet is edited by humans, which limits the coverage of paragraph descriptions of entities under big data and consumes a great deal of time and manpower. Therefore, designing an efficient method for automatically linking entities with descriptive paragraphs is an urgent task for knowledge base construction in the big-data age.
As can be seen from the returned results in Table 2, the descriptive content does not necessarily include the query entity word itself, but only some related words that describe the entity from multiple aspects. Therefore, to solve this problem, an entity and paragraph linking method needs to start from two aspects: 1. capturing the topic information of the text from a given descriptive paragraph; 2. finding the important descriptive content related to the entity. Most conventional methods extract the topic information of paragraphs with topic-model methods such as Latent Dirichlet Allocation (LDA) and Probabilistic Latent Semantic Analysis (PLSA). The general problem with these methods is that the extracted topic information is derived from document-level word co-occurrence, is severely affected by the high sparsity of the short texts typical of social media, and loses the word-order information in the text.
In recent years, with the rise of deep neural networks, some researchers have tried to learn deep implicit semantic feature representations of descriptive paragraphs, using deep models together with word vectorization, to solve the entity-paragraph linking problem. However, when extracting semantic features of descriptive paragraphs, existing deep-model-based methods simply treat the entire paragraph as one long sentence, or directly take a weighted average over multiple sentence vectors to obtain a semantic vector. In fact, the order of the sentences in a paragraph also carries semantic and logical relationships.
On the other hand, capturing the descriptive cues in a paragraph that are closely related to the entity is also very important. The descriptive paragraph in the results returned in Table 2 above, although it does not directly contain the query entity word "Washington D.C.", contains many related words or phrases, such as "George Washington", "United States", and "Capital". Thus, a vectorized representation of entity features facilitates the task of linking entities with descriptive paragraphs.
Disclosure of Invention
In view of the above technical problems, a primary object of the present invention is to provide a method for linking entities and paragraphs based on a hierarchical convolutional network, so that entity words and descriptive paragraphs in the internet can be automatically linked without manual participation, which is helpful for building a semantic knowledge base under big data.
In order to achieve the above object, the present invention provides a method for linking an entity and a paragraph based on a hierarchical convolutional network, comprising the following steps:
converting word vectorization representations into sentence vectorization representations by means of a convolutional neural network, where the convolutional network helps extract important clues about the query entity from the description paragraphs;
the sentence vectorization representation passes through the convolutional neural network again and paragraph vectorization representation is obtained by considering the sentence sequence information;
the sentence vectorization representation and the paragraph vectorization representation are output through Softmax, and training of the convolutional neural network model is carried out by means of existing entities serving as supervision information;
simultaneously considering pair-wise similarity information between the paragraph semantic vector features and the entity semantic vector features to further improve the training of the convolutional neural network model;
given a test description paragraph, deep semantic feature extraction is carried out by utilizing the trained neural network model to obtain vectorization representation of the test paragraph, and then the test description paragraph can be directly linked to a target entity through Softmax output based on the semantic representation.
The entity and paragraph linking method of the invention divides the feature learning problem in entity-paragraph linking into four levels: a feature matrix layer obtained by word vectorization of the original text paragraph; a sentence vectorization feature layer obtained through a convolutional neural network; a paragraph vectorization feature layer obtained through a convolutional neural network; and an entity-word vectorization feature layer obtained by word-vector table lookup. Through the convolutional feature network and word-vector table lookup, the accuracy (ACC) of the entity and paragraph linking method on two text data sets is clearly superior to the other comparison methods; compared with the best comparison method (method two), the accuracy on the two data sets improves by 12.4% and 16.76%, respectively.
Drawings
FIG. 1 is a flowchart of a method for linking entities and paragraphs based on a hierarchical convolutional network, which is an embodiment of the present invention;
FIG. 2 is a block diagram of a method for linking entities and paragraphs based on a hierarchical convolutional network, which is an embodiment of the present invention;
fig. 3 is a performance diagram of an entity and paragraph linking method based on a hierarchical convolutional network according to an embodiment of the present invention.
Detailed Description
In order that the objects, technical solutions and advantages of the present invention will become more apparent, the present invention will be further described in detail with reference to the accompanying drawings in conjunction with the following specific embodiments.
The invention discloses an entity and paragraph linking method based on a hierarchical convolutional network, which can automatically link entity words and descriptive paragraphs in the Internet without manual participation. Considering the order information of the sentences in the paragraphs, and convoluting the vectorized representation of the sentences again to obtain the vectorized representation of the paragraphs. And then, the entity features are used as supervision information to guide parameter learning of the convolutional neural network model, and simultaneously, pair-wise similarity information between the depth semantic features of the paragraphs and entity semantic vectorization representation is considered to improve learning of the convolutional neural network model. Given a new descriptive paragraph, the trained convolutional neural network model can be used to extract its deep semantic features, and the corresponding entity link is obtained based on the feature output.
More specifically, the method first converts through a word-vectorized representation into a sentence-vectorized representation using a convolutional neural network. And then, the sentence vectorization representation is utilized to pass through the convolutional neural network again, and the sentence order information is considered to obtain paragraph vectorization representation. And the sentence vectorization representation and the paragraph vectorization representation are output through Softmax, and the training of the convolutional neural network model is carried out by taking an existing entity as supervision information. Meanwhile, the training of the convolutional neural network model is further improved by considering the pair-wise similarity information between the paragraph semantic vector features and the entity semantic vector features. Giving a test description section, extracting deep semantic features by using a trained neural network model to obtain vectorization representation of the test section, and directly linking the test description section to a target entity through Softmax output based on the semantic representation.
The entity and paragraph linking method based on the hierarchical convolutional network as an embodiment of the present invention is described in detail below with reference to the accompanying drawings.
Fig. 1 is a flowchart of a method for linking entities and paragraphs based on a hierarchical convolutional network according to an embodiment of the present invention.
Referring to fig. 1, in step S101, extracting vectorization representation features of each sentence in a paragraph to be processed through a convolutional neural network model and word vectorization representation;
according to an exemplary embodiment of the present invention, the step of extracting vectorization representation characteristics of each sentence in the paragraph to be processed through the convolutional neural network model and the word vectorization representation includes:
in step S1011, given a sentence in the paragraph to be processed, its word vectorization representation is obtained by table lookup, and the sentence is characterized in matrix form;
in step S1012, performing one-dimensional convolution on the sentence matrixing expression feature to obtain a feature matrix after convolution;
in step S1013, mean sampling is performed on the convolved feature matrix to compress the features, so as to obtain vectorized representation of the sentence.
According to an exemplary embodiment of the present invention, the step of obtaining the word vectorization representation by table lookup and characterizing the sentence in matrix form comprises:
Given a set of word vectors trained by word2vec, where $|V|$ is the dictionary size and $d$ is the dimension of each word vector, a sentence of length $n$ in any paragraph can be represented as:

$s = (x_1; x_2; \ldots; x_n)$ (1)

where $x_i \in \mathbb{R}^d$ is the vectorization representation of the $i$-th word, found in the word-vector set by table lookup. If a word $x_i$ is not in the trained word-vector set, it is represented by random initialization in this exemplary embodiment of the invention.
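The table-lookup step above, including random initialization for out-of-vocabulary words, can be sketched as follows (a minimal illustration with a toy vocabulary; the function name `sentence_matrix` and the initialization range are assumptions, not the patent's exact implementation):

```python
import numpy as np

def sentence_matrix(words, vocab, dim=100, rng=None):
    """Look up each word's vector; out-of-vocabulary words are
    randomly initialized, as described above (sketch, not the
    patent's exact procedure)."""
    if rng is None:
        rng = np.random.default_rng(0)
    rows = []
    for w in words:
        if w not in vocab:
            vocab[w] = rng.uniform(-0.25, 0.25, dim)  # random init, cached
        rows.append(vocab[w])
    return np.stack(rows)            # shape (n, d): one row per word

vocab = {"washington": np.full(100, 0.1)}
S = sentence_matrix(["washington", "capital"], vocab)  # "capital" is OOV
```

The matrix `S` then plays the role of the sentence representation $s$ in equation (1).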
In step S1012, performing the one-dimensional convolution on the sentence's matrix representation to obtain the convolved feature matrix includes:

Here, $x_{i:i+h_s-1}$ denotes the $h_s$ consecutive word features starting from the $i$-th word of sentence $s$. Given a one-dimensional convolution kernel $w^{(1)} \in \mathbb{R}^{h_s}$, the convolved feature of these $h_s$ consecutive word features is:

$c^{(1)}_i = f(w^{(1)} \cdot x_{i:i+h_s-1} + b^{(1)})$ (2)

where $b^{(1)}$ is a bias term, $f$ is an activation function, and $c^{(1)}_i \in \mathbb{R}^d$ is the convolved feature of the $h_s$ consecutive word features. The convolved feature matrix of the sentence is:

$C^{(1)} = [c^{(1)}_1; c^{(1)}_2; \ldots; c^{(1)}_{n-h_s+1}]$ (3)
in step S1013, the step of performing mean value sampling on the convolved feature matrix to compress the features to obtain vectorized representation of the sentence includes:
in this exemplary embodiment of the present invention, the step of sampling with the mean value is:
to this end, each convolution kernelA d-dimensional feature vector is generatedIf k convolution kernels are used, a vectorized representation of the sentence is finally obtained through one convolution layerThe dimension of the sentence vectorized representation is d · k.
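The sentence-level convolution and mean sampling of formulas (2)-(4) can be sketched as follows (a minimal numpy illustration; the kernel shape, the use of `tanh` for the unspecified activation f, and all names are assumptions):

```python
import numpy as np

def conv_mean_pool(X, kernels, b, h):
    """Slide each 1-D kernel w (shape (h,)) over the n-h+1 word windows
    of X (shape (n, d)); each window yields a d-dim vector (eq. (2)),
    mean sampling over windows compresses them (eq. (4)), and the k
    kernel outputs are concatenated into a d*k sentence vector."""
    n, d = X.shape
    outs = []
    for w in kernels:                                  # k kernels
        c = np.stack([np.tanh(w @ X[i:i + h] + b)      # (d,) per window
                      for i in range(n - h + 1)])
        outs.append(c.mean(axis=0))                    # mean sampling
    return np.concatenate(outs)                        # shape (d * k,)

X = np.random.default_rng(1).normal(size=(6, 100))     # 6 words, d = 100
s_vec = conv_mean_pool(X, kernels=[np.ones(3) / 3], b=0.0, h=3)
```

With one kernel (k = 1, as in the experiments reported later), the sentence vector has dimension d = 100.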
In step S102, learning a deep semantic feature of the paragraph by using a convolutional neural network structure and the sentence vectorization representation;
according to an exemplary embodiment of the present invention, the method for deep semantic feature learning of paragraphs includes:
in step S1021, using the sentence vector features in the paragraph, the paragraph is characterized in matrix form according to the order of the sentences in the paragraph;
in step S1022, performing one-dimensional convolution on the paragraph matrixing expression feature to obtain a feature matrix after convolution;
in step S1023, mean sampling is performed on the convolved feature matrix to compress the features and perform a linear transformation to obtain a vectorized representation of the paragraph.
According to an exemplary embodiment of the present invention, the step of characterizing the paragraph in matrix form according to the order of its sentences, using the sentence vector features in the paragraph, comprises:
Having obtained the vectorized representations of the $l$ sentences of the paragraph, the paragraph can be represented as:

$t = (s_1; s_2; \ldots; s_l)$ (5)
In step S1022, performing the one-dimensional convolution on the paragraph's matrix representation to obtain the convolved feature matrix includes:

Here, $s_{i:i+h_t-1}$ denotes the $h_t$ consecutive sentence features starting from the $i$-th sentence of paragraph $t$. Given a one-dimensional convolution kernel $w^{(2)} \in \mathbb{R}^{h_t}$, the convolved feature of these $h_t$ consecutive sentence features is:

$c^{(2)}_i = f(w^{(2)} \cdot s_{i:i+h_t-1} + b^{(2)})$ (6)

where $b^{(2)}$ is a bias term, $f$ is an activation function, and $c^{(2)}_i \in \mathbb{R}^{d \cdot k}$ is the convolved feature of the $h_t$ consecutive sentence features. The convolved features of the paragraph are:

$C^{(2)} = [c^{(2)}_1; c^{(2)}_2; \ldots; c^{(2)}_{l-h_t+1}]$ (7)
In step S1023, performing mean sampling on the convolved feature matrix to compress the features and applying one linear transformation to obtain the vectorized representation of the paragraph includes:

In this exemplary embodiment of the present invention, mean sampling is performed as:

$t' = \frac{1}{l-h_t+1} \sum_{i=1}^{l-h_t+1} c^{(2)}_i$ (8)

Up to this point, the convolution kernel $w^{(2)}$ generates a $d \cdot k$-dimensional feature vector $t'$. To facilitate computing the similarity between the paragraph features and the entity features, the dimensions of the two vectors must agree, so the paragraph vector is linearly transformed:

$z = W^{(3)} \cdot t'$ (9)

where $W^{(3)} \in \mathbb{R}^{d \times d \cdot k}$ is a linear transformation matrix, and the feature vector $z$ is the final paragraph feature vector in this exemplary embodiment of the invention.
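The paragraph-level convolution, mean sampling, and linear transformation of formulas (6)-(9) can be sketched in the same style (a hedged numpy illustration; function and variable names such as `paragraph_vector` and `W3` are assumptions):

```python
import numpy as np

def paragraph_vector(sentences, w2, b2, h_t, W3):
    """Convolve the ordered sentence vectors with a 1-D kernel w2
    (eq. (6)), mean-sample over the sentence windows (eq. (8)), then
    map to the entity-vector dimension with W3 (eq. (9)). tanh stands
    in for the unspecified activation f."""
    T = np.stack(sentences)                    # (l, d*k), in sentence order
    l = T.shape[0]
    c = np.stack([np.tanh(w2 @ T[i:i + h_t] + b2)
                  for i in range(l - h_t + 1)])
    t_pooled = c.mean(axis=0)                  # (d*k,)
    return W3 @ t_pooled                       # (d,) paragraph vector z

rng = np.random.default_rng(2)
sents = [rng.normal(size=100) for _ in range(5)]        # l=5, d*k=100
z = paragraph_vector(sents, w2=np.ones(2) / 2, b2=0.0, h_t=2,
                     W3=rng.normal(size=(100, 100)) * 0.01)
```

Because k = 1 in the toy sizes, W3 is square here; in general it has shape (d, d*k).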
In step S103, the vectorized representation of the sentence and the vectorized representation of the paragraph are respectively subjected to Softmax output to fit the entity to which the paragraph belongs;
according to an exemplary embodiment of the present invention, the method of fitting the vectorized representation of the sentence and the paragraph to the entities to which the paragraph belongs, respectively, comprises the steps of:
in step S1031, performing linear transformation on the sentence vector and the paragraph vector to obtain output vectors, and performing regularization using Dropout technology;
at step S1032, calculating a link probability of the candidate entity using a Softmax function;
according to an exemplary embodiment of the present invention, the step of performing linear transformation on the sentence vector and the paragraph vector to obtain an output vector and performing regularization using Dropout technique includes:
The sentence vector feature $s$ and the paragraph vector feature $z$ are each linearly transformed to obtain two output vectors:

$y^s = W^{(4)} \cdot (s \circ r) + b^{(4)}$ (10)

$y = W^{(5)} \cdot (z \circ r) + b^{(5)}$ (11)

where $W^{(4)} \in \mathbb{R}^{m \times d \cdot k}$ and $W^{(5)} \in \mathbb{R}^{m \times d}$ are weight matrices, $m$ is the number of entities in this exemplary embodiment of the invention, the symbol $\circ$ denotes element-wise multiplication, and $r$ is a random vector whose entries obey a Bernoulli distribution with probability $p$. The Dropout technique prevents overfitting and enhances the robustness of the neural network model.
In step S1032, the step of calculating the link probability of each candidate entity using the Softmax function includes:

A Softmax activation function is applied at both output layers (the sentence vector features and the paragraph vector features) to compute a probability value for each candidate entity word:

$p^s_i = \frac{\exp(y^s_i)}{\sum_{j=1}^{m} \exp(y^s_j)}$ (12)

$p_i = \frac{\exp(y_i)}{\sum_{j=1}^{m} \exp(y_j)}$ (13)

In equations (12) and (13), $p^s_i$ and $p_i$ denote the probability values of the $i$-th entity word at the sentence and paragraph output layers, respectively.
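The Dropout-masked linear output and Softmax of formulas (11) and (13) can be sketched as follows (a minimal illustration; the keep-probability semantics of the Bernoulli mask and all names are assumptions):

```python
import numpy as np

def softmax(y):
    e = np.exp(y - y.max())          # shift for numerical stability
    return e / e.sum()

def entity_probs(z, W5, b5, p=0.5, train=True, rng=None):
    """Linear layer with Dropout, then Softmax over m entities.
    During training each feature is kept with probability p (Bernoulli
    mask r, eq. (11)); at test time the mask is omitted."""
    if train:
        if rng is None:
            rng = np.random.default_rng(0)
        z = z * rng.binomial(1, p, size=z.shape)   # element-wise mask
    return softmax(W5 @ z + b5)

rng = np.random.default_rng(3)
probs = entity_probs(rng.normal(size=100),
                     W5=rng.normal(size=(409, 100)) * 0.01,  # m=409 (History set)
                     b5=np.zeros(409), train=False)
```

The output is a proper probability distribution over the m candidate entity words.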
In step S104, calculating pair-wise similarity information of the vectorized representation of the entity and the paragraph vectorized representation;
Given a set of entity words $E = \{e_1, e_2, \ldots, e_m\}$, initialized with word2vec, the similarity between the entity word set $E$ and the paragraph feature vector $z$ is:

$\mathrm{sim}(z, E) = \{z \cdot e_1, z \cdot e_2, \ldots, z \cdot e_m\}$ (14)

where the operator $z \cdot e$ denotes the similarity (dot product) between the paragraph feature vector $z$ and the corresponding entity word vector $e$.
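The pair-wise similarity of formula (14) is simply a set of dot products, e.g. (toy vectors, purely illustrative):

```python
import numpy as np

def similarity_to_entities(z, E):
    """sim(z, E) = {z.e1, ..., z.em}: dot products between the
    paragraph vector z and every entity word vector (rows of E)."""
    return E @ z                     # shape (m,)

E = np.array([[1.0, 0.0],            # m = 3 toy entity vectors, d = 2
              [0.0, 1.0],
              [1.0, 1.0]])
sim = similarity_to_entities(np.array([2.0, 3.0]), E)
```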
In step S105, error back-propagation training of the convolutional neural network model is carried out using the Softmax fit to the target entity words and the pair-wise similarity information between the paragraph feature vectors and the target entity words;
according to an exemplary embodiment of the present invention, the step of performing error back propagation on the trained convolutional neural network model by Softmax fitting of target entity words and pair-wise similarity information of the paragraph feature vectors and the target entity words comprises:
in step S1051, a target function is set according to the sentence feature and paragraph feature output and the fitting result of the Softmax to the target entity word in the training data set;
in step S1052, a target function is set according to the pair-wise similarity information between the paragraph feature and the target entity word;
in step S1053, a global objective constraint function is set;
in step S1054, parameters in the model are updated by a random gradient descent method;
According to an exemplary embodiment of the present invention, the step of setting objective functions for the sentence and paragraph feature outputs according to their Softmax fits to the target entity words in the training data set comprises:

Using formulas (10) and (11) and formulas (12) and (13), the objective constraint functions of the sentence vectorization features and the paragraph vectorization features are set as:

$L_s = -\sum_{s_i \in S} \log p^s(e^*_i \mid s_i)$ (15)

$L_{p1} = -\sum_{z_i \in T} \log p(e^*_i \mid z_i)$ (16)

where $L_s$ is the objective constraint function of the sentence vectorization features, $L_{p1}$ is the objective constraint function of the paragraph vectorization features, $S$ is the set of all sentences of all paragraphs in the corpus, $T$ is the set of all paragraphs, and $e^*_i$ is the correct entity word to which the $i$-th sentence (in (15)) or the $i$-th paragraph (in (16)) belongs.
In step S1052, the step of setting an objective function according to the pair-wise similarity information between the paragraph features and the target entity words includes:

To strengthen the semantic expressiveness of paragraphs and entities, a target constraint function is set that increases the similarity between a paragraph's vectorized features and the vectorized features of its correct entity word, while decreasing its similarity to the entity words it does not belong to:

$L_{p2} = \sum_{z} \sum_{e_j \neq e_r} \max(0,\; 1 - z \cdot e_r + z \cdot e_j)$ (17)

where $e_r$ is the correct entity word to which the given paragraph $z$ belongs.
In step S1053, the global objective constraint function is set as:

$L = L_s + (1-\alpha) \cdot L_{p1} + \alpha \cdot L_{p2}$ (18)

where $\alpha$ is a weighting coefficient used to balance the two paragraph-level constraints $L_{p1}$ and $L_{p2}$.
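The global constraint of formula (18) is a weighted sum of the three terms, e.g. (toy loss values, purely illustrative):

```python
def global_objective(L_s, L_p1, L_p2, alpha):
    """Global constraint of eq. (18): L = L_s + (1-alpha)*L_p1 +
    alpha*L_p2, with alpha trading off the two paragraph-level terms."""
    return L_s + (1.0 - alpha) * L_p1 + alpha * L_p2

total = global_objective(L_s=1.0, L_p1=2.0, L_p2=4.0, alpha=0.25)
# alpha = 0 keeps only the Softmax-fit term L_p1; alpha = 1 keeps only
# the pair-wise similarity term L_p2.
```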
In step S1054, the step of updating the parameters in the model by using a stochastic gradient descent method includes:
All model training parameters in the set target constraint function are uniformly denoted as $\theta$:

$\theta = (x, W^{(1)}, b^{(1)}, W^{(2)}, b^{(2)}, \alpha, W^{(3)}, W^{(4)}, b^{(4)}, W^{(5)}, b^{(5)}, E)$ (19)
In an exemplary embodiment of the invention, the objective function is optimized by error back propagation using stochastic gradient descent.
In step S106, deep semantic feature extraction is performed on the test descriptive paragraphs using the updated convolutional neural network model, and then linking is performed with corresponding entity words based on vectorized representation of the paragraphs.
According to an exemplary embodiment of the present invention, the step of performing deep semantic feature extraction on the test descriptive section by using the updated convolutional neural network model, and then linking with the corresponding entity word based on the vectorized representation of the section comprises:
in step S1061, a test paragraph text is given, and the vectorization features $s$ of the sentences in the paragraph are calculated according to formulas (2), (3) and (4);
in step S1062, calculating a vectorization feature z of the paragraph by using formulas (6), (7), (8) and (9);
in step S1063, using the generated vectorized feature $z$ of the paragraph, a linear transformation without Dropout and a Softmax function output the matching probability of each candidate entity word:

$y = W^{(5)} \cdot z + b^{(5)}$ (20)
and the entity word with the highest matching probability is the entity word belonging to the test paragraph.
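The test-time linking step of formula (20) can be sketched as follows (toy sizes; names are assumptions). Since Softmax is monotonic, picking the highest-probability entity reduces to an argmax over the logits:

```python
import numpy as np

def link_entity(z, W5, b5):
    """Test-time linking: linear transform without Dropout (eq. (20)),
    then pick the entity word with the highest matching probability."""
    y = W5 @ z + b5
    return int(np.argmax(y))         # index of the linked entity word

W5 = np.array([[1.0, 0.0],           # m = 3 entities, d = 2 toy sizes
               [0.0, 1.0],
               [0.5, 0.5]])
idx = link_entity(np.array([0.2, 0.9]), W5, b5=np.zeros(3))
```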
Fig. 2 is a schematic diagram of a framework of a method for linking entities and paragraphs based on a hierarchical convolutional network according to an embodiment of the present invention.
Referring to fig. 2, the entity and paragraph linking method based on the hierarchical convolutional network has four levels of feature vectorization representations, which are:
the first characteristic level is as follows: the method comprises the steps that an original text paragraph is expressed through word vectorization to obtain a feature matrix;
and (2) feature level two: sentence vectorization representation characteristics obtained through a convolutional neural network;
and (3) feature level three: paragraph vectorization representation characteristics obtained through a convolutional neural network;
feature level four: obtaining the vectorization expression characteristics of the entity words by using a word vector table look-up method;
the whole model training stage has three supervision information for guidance, which are respectively as follows:
Supervision information one: the fitting information of the sentence's vectorized representation features to the entity word the sentence belongs to, after a linear transformation and Softmax output;
Supervision information two: the fitting information of the paragraph's vectorized representation features to the entity word the paragraph belongs to, after a linear transformation and Softmax output;
Supervision information three: the pair-wise similarity information between the paragraph's vectorized representation features, after a linear transformation, and the entity words;
in order to accurately evaluate the link performance of the entity and the paragraph of the method, the method obtains the precision (ACC) of the method by comparing the consistency of the link results of the entity and the paragraph and the entity to which the paragraph really belongs. Given a descriptive paragraph sample x(i)The entity word linked by the method of the invention is e(i)And the paragraph is true the physical word isThe definition of accuracy is as follows:
wherein,is the number of descriptive paragraphs, δ (x, y) is an indicator function, 1 when x ≠ y, and 1 when x ≠ yThe number is 0.
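The accuracy measure described above can be computed as follows (a small illustration with hypothetical entity labels):

```python
def accuracy(predicted, gold):
    """ACC: the fraction of paragraphs whose linked entity word equals
    the true entity word (the indicator delta is 1 on a match)."""
    assert len(predicted) == len(gold)
    return sum(p == g for p, g in zip(predicted, gold)) / len(gold)

acc = accuracy(["e1", "e2", "e3", "e1"],   # linked entity words
               ["e1", "e2", "e1", "e1"])   # true entity words
```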
Two open text data sets were used in the experiments of the present invention:
history: the data set contains 409 entities, 1704 paragraphs.
Literature: the data set contains 445 entities, 2247 paragraphs.
The present invention performs no preprocessing on these text data sets (including stop-word removal and stemming operations). On average, each paragraph contains 4-6 sentences, and each paragraph contains only one entity word. Specific statistics of the data sets are shown in Table 3:
TABLE 3
The following comparative methods were used in the experiments of the invention:
Comparison method one: based on a bag-of-words model and logistic regression, this method directly applies logistic regression to the bag-of-words representation of the original text;
Comparison method two: based on a convolutional neural network, this method uses a traditional convolutional neural network model and simply treats the entity-paragraph linking problem as a classification problem.
The parameters used in the experiments of the invention are set as shown in table 4:
TABLE 4
Data set ρ hs ht d k
History 0.5 3 6 100 1
Literature 0.5 3 8 100 1
In Table 4, the parameter ρ is the Dropout ratio used in model training, $h_s$ is the window size of the convolution kernel for the sentence vectorization features, $h_t$ is the window size of the convolution kernel for the paragraph vectorization features, $d$ is the dimension of the word vectors, and $k$ is the number of convolution kernels for the sentence vectorization features.
In the experiments of the present invention, the average accuracy (ACC) is obtained over 50 runs of the entity and paragraph linking method; the final experimental results are shown in Table 5:
TABLE 5
Method History / precision value (%) Literature / precision value (%)
Comparison method 1 65.10±0.01 61.17±0.05
Comparison method two 77.01±3.92 74.50±10.3
The method of the invention 89.41±1.05 91.26±0.50
Table 5 shows the evaluation results of the precision value (ACC) of the entity and paragraph linking method on two text data sets by the method of the present invention, the first comparison method and the second comparison method. Test results show that the performance of the method is obviously superior to that of other comparison methods. And compared with the best comparison method two, the method improves the precision value on two data sets by 12.4 percent and 16.76 percent respectively.
The experiments also examined how the sliding word-window size of the convolution kernel used for sentence feature representation affects entity and paragraph linking accuracy; the results are shown in Fig. 3. The method of the invention performs best on both data sets when the word-window size is 3, and its accuracy declines once the window size exceeds 3. A sliding word-window size of 3 was therefore adopted for the sentence-feature convolution kernel in the experiments of the invention.
The above-mentioned embodiments are intended to illustrate the objects, technical solutions and advantages of the present invention in further detail, and it should be understood that the above-mentioned embodiments are only exemplary embodiments of the present invention and are not intended to limit the present invention, and any modifications, equivalents, improvements and the like made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (10)

1. An entity and paragraph linking method based on a hierarchical convolutional network, comprising the following steps:
extracting a vectorized representation feature for each sentence in a paragraph to be processed through a convolutional neural network model and word vectorization;
learning deep semantic features of the paragraph using a convolutional neural network structure and the sentence vectorized representations;
fitting, through Softmax, the entity to which the paragraph belongs from the sentence vectorized representations and the paragraph vectorized representation respectively;
calculating pair-wise similarity information between the vectorized representation of the entity and the vectorized representation of the paragraph;
training the convolutional neural network model by error back-propagation, fitting the target entity words through Softmax together with the pair-wise similarity information between the paragraph feature vectors and the target entity words;
and extracting deep semantic features of the paragraph to be processed with the updated convolutional neural network model, then linking the paragraph to the corresponding entity word based on its vectorized representation.
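The steps of claim 1 can be sketched as a single forward pass. This is a minimal illustration under many simplifying assumptions: numpy only, random toy weights, a tanh nonlinearity, and dimensions that are not taken from the patent:

```python
import numpy as np

rng = np.random.default_rng(0)

def conv_mean_pool(X, W, b):
    """One-dimensional convolution over the rows of X followed by mean
    pooling, used at both the sentence and the paragraph level.
    X: (n, d_in); W: (k, h*d_in); returns a (k,) feature vector."""
    n, d_in = X.shape
    h = W.shape[1] // d_in
    windows = np.stack([X[i:i + h].ravel() for i in range(n - h + 1)])
    return np.tanh(windows @ W.T + b).mean(axis=0)

def softmax(v):
    e = np.exp(v - v.max())
    return e / e.sum()

# Toy dimensions (illustrative, not from the patent).
d, h_s, h_t, k, n_entities = 100, 3, 6, 8, 5
W_s, b_s = rng.normal(size=(k, h_s * d)) * 0.1, np.zeros(k)
W_t, b_t = rng.normal(size=(k, h_t * k)) * 0.1, np.zeros(k)
A = rng.normal(size=(k, k)) * 0.1            # linear map to paragraph vector z
U = rng.normal(size=(n_entities, k)) * 0.1   # Softmax output layer

# A paragraph of 7 sentences, each of 10 words, as random word vectors.
paragraph = [rng.normal(size=(10, d)) for _ in range(7)]
sent_vecs = np.stack([conv_mean_pool(S, W_s, b_s) for S in paragraph])
z = A @ conv_mean_pool(sent_vecs, W_t, b_t)  # paragraph feature vector
probs = softmax(U @ z)                       # entity link probabilities
```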
2. The entity and paragraph linking method based on hierarchical convolutional network of claim 1, wherein the step of extracting vectorized representation features of each sentence in the paragraph to be processed through convolutional neural network model and word vectorization representation comprises:
given a sentence in the paragraph to be processed, obtaining word vectorized representations by table lookup and representing the sentence in matrix form;
performing one-dimensional convolution on the matrix representation of the sentence to obtain a convolved feature matrix;
and performing mean sampling on the convolved features to compress them, obtaining the vectorized representation of the sentence.
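A minimal numpy sketch of this sentence-level step (embedding lookup, one-dimensional convolution, mean sampling); the vocabulary, the word ids, all sizes, and the tanh nonlinearity are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(1)

d, h_s, k, vocab = 100, 3, 4, 50
E = rng.normal(size=(vocab, d))          # word-embedding lookup table
word_ids = [3, 17, 8, 42, 9]             # a 5-word sentence (toy ids)
S = E[word_ids]                          # matrix form of the sentence, (5, d)

W = rng.normal(size=(k, h_s * d)) * 0.1  # k convolution kernels, window h_s
b = np.zeros(k)
windows = np.stack([S[i:i + h_s].ravel() for i in range(len(word_ids) - h_s + 1)])
C = np.tanh(windows @ W.T + b)           # convolved feature matrix, (3, k)
s = C.mean(axis=0)                       # mean sampling -> sentence vector, (k,)
```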
3. The hierarchical convolutional network-based entity-to-paragraph linking method of claim 1, wherein the step of learning the deep semantic features of the paragraphs using the convolutional neural network structure and the sentence vectorization representation comprises:
representing the paragraph in matrix form using the sentence vector features, according to the order of the sentences in the paragraph;
performing one-dimensional convolution on the matrix representation of the paragraph to obtain a convolved feature matrix;
and performing mean sampling on the convolved features to compress them, then applying one linear transformation to obtain the vectorized representation of the paragraph.
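The paragraph-level step mirrors the sentence-level one, with sentence vectors in place of word vectors and one extra linear transformation. A sketch with toy dimensions (all sizes are assumptions, not taken from the patent):

```python
import numpy as np

rng = np.random.default_rng(2)

k, h_t, out_dim = 4, 2, 6
sent_vecs = rng.normal(size=(5, k))      # 5 sentence vectors in document order
W = rng.normal(size=(k, h_t * k)) * 0.1  # paragraph-level convolution kernels
b = np.zeros(k)
windows = np.stack([sent_vecs[i:i + h_t].ravel() for i in range(5 - h_t + 1)])
pooled = np.tanh(windows @ W.T + b).mean(axis=0)  # mean sampling, (k,)
A = rng.normal(size=(out_dim, k)) * 0.1
z = A @ pooled                           # paragraph vector after the linear map
```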
4. The method of claim 1, wherein the step of fitting the vectorized representation of the sentence and the vectorized representation of the paragraph to the entity to which the paragraph belongs via Softmax output respectively comprises:
performing a linear transformation on the sentence vectors and the paragraph vectors respectively to obtain output vectors, regularized using the Dropout technique;
and calculating the link probability of each candidate entity using the Softmax function.
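A sketch of the linear transformation with Dropout followed by Softmax. This is a simplified illustration (no rescaling of the surviving units, random toy weights, assumed dimensions), not the patent's exact training procedure:

```python
import numpy as np

rng = np.random.default_rng(3)

def softmax(v):
    e = np.exp(v - v.max())
    return e / e.sum()

k, n_entities, rho = 6, 5, 0.5
z = rng.normal(size=k)                    # a paragraph (or sentence) vector
U, c = rng.normal(size=(n_entities, k)), np.zeros(n_entities)

# Dropout regularization during training: zero each unit with probability rho.
mask = rng.random(k) > rho
probs = softmax(U @ (z * mask) + c)       # link probability per candidate entity
```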
5. The hierarchical convolutional network-based entity-paragraph linking method of claim 1, wherein the method of calculating pair-wise similarity information of the vectorized representation of the entity and the vectorized representation of the paragraph is as follows:
given a set of entity words E = {e_1, e_2, ..., e_m}, initialized using word2vec, the similarity between the entity word set E and a paragraph feature vector z is:
sim(z, E) = {z·e_1, z·e_2, ..., z·e_m}
wherein the operator z·e denotes the similarity between the paragraph feature vector z and the corresponding entity word e.
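The pair-wise similarity above is a set of dot products, computable in one matrix-vector product; a sketch with toy dimensions:

```python
import numpy as np

rng = np.random.default_rng(4)

k, m = 6, 4
z = rng.normal(size=k)       # paragraph feature vector
E = rng.normal(size=(m, k))  # entity-word vectors (word2vec-initialized in the patent)
sim = E @ z                  # sim(z, E) = {z.e_1, ..., z.e_m}
```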
6. The hierarchical convolutional network-based entity-paragraph linking method as claimed in claim 1, wherein the step of training the convolutional neural network model by error back propagation through Softmax fitting target entity words and pair-wise similarity information of the paragraph feature vectors and the target entity words comprises:
according to the sentence characteristic and paragraph characteristic output, setting a target function by utilizing the fitting result of the Softmax on the target entity words in the training data set;
setting a target function according to the pair-wise similarity information of the paragraph features and the target entity words;
setting a global target constraint function and carrying out unified fusion on the target functions;
and updating parameters in the convolutional neural network model by using a random gradient descent method.
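The last step of claim 6, the stochastic gradient descent update, can be sketched generically (the function name, learning rate, and toy shapes are assumptions; gradients here are placeholders rather than real back-propagated values):

```python
import numpy as np

rng = np.random.default_rng(6)

def sgd_step(params, grads, lr=0.01):
    """One stochastic-gradient-descent update of the model parameters,
    as used to train the convolutional neural network model."""
    return [p - lr * g for p, g in zip(params, grads)]

W = rng.normal(size=(4, 3))   # a toy weight matrix
gW = np.ones((4, 3))          # a placeholder gradient
(W_new,) = sgd_step([W], [gW], lr=0.1)
```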
7. The entity and paragraph linking method based on hierarchical convolutional network of claim 6, wherein the step of setting a target function according to the pair-wise similarity information between the paragraph feature and the target entity word comprises:
in order to enhance the semantic expression ability of the paragraph and the entity, the similarity between the paragraph vectorization feature and the corresponding belonging entity word vectorization feature is enhanced by setting a target constraint function, and the similarity between the paragraph vectorization feature and the corresponding non-belonging entity word vectorization feature is weakened, wherein the target constraint function is as follows:
$$L_{p2} = \sum_{i=1}^{|c|} \sum_{e_j \in E,\; e_j \neq e_r} \max\bigl(0,\; 1 - \mathrm{sim}(z^{(i)}, e_r^{(i)}) + \mathrm{sim}(z^{(i)}, e_j^{(i)})\bigr);$$
wherein e_r is the correct entity word to which the given paragraph z belongs, and e_r^{(i)} is the true entity word of the i-th paragraph.
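This target constraint term is a pair-wise ranking hinge loss; a minimal sketch (function name and toy inputs are illustrative):

```python
import numpy as np

def l_p2(Z, E, r):
    """Pair-wise ranking term of the target constraint function: for each
    paragraph vector z^(i) with true entity index r[i], penalize every wrong
    entity e_j whose similarity comes within a margin of 1 of the true
    entity e_r (hinge loss)."""
    loss = 0.0
    for z, ri in zip(Z, r):
        s = E @ z                                   # sim(z, e_j) for all j
        margins = np.maximum(0.0, 1.0 - s[ri] + s)  # per-entity hinge
        margins[ri] = 0.0                           # exclude e_j == e_r
        loss += margins.sum()
    return loss

# If the true entity scores far above all others, the loss vanishes.
E = np.eye(2)
Z = np.array([[5.0, 0.0]])
zero_loss = l_p2(Z, E, [0])   # true sim = 5, wrong sim = 0
```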
8. The entity and paragraph linking method based on hierarchical convolutional network of claim 6, wherein the step of setting global objective constraint function to uniformly fuse the objective functions comprises:
setting the global objective constraint function as follows:
L = L_s + (1 − α)·L_{p1} + α·L_{p2}
wherein L_s is the target constraint function for the sentence vectorization feature, and α is a weighting coefficient that balances the two constraints on the paragraph vectorization feature, namely the constraint term L_{p1}, in which the paragraph feature output fits the target entity words in the training data set via Softmax, and the pair-wise similarity constraint term L_{p2} between the paragraph features and the target entity words.
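The fusion above is a direct weighted sum of the three objectives; a one-line sketch (function name is illustrative):

```python
def global_objective(l_s, l_p1, l_p2, alpha):
    """L = L_s + (1 - alpha) * L_p1 + alpha * L_p2: alpha trades off the
    Softmax-fit term L_p1 against the pair-wise similarity term L_p2."""
    return l_s + (1.0 - alpha) * l_p1 + alpha * l_p2
```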
9. The entity and paragraph linking method based on hierarchical convolutional network of claim 1, wherein the step of performing deep semantic feature extraction on the paragraphs to be processed by using the updated convolutional neural network model and then linking with the corresponding entity words based on the vectorized representation of the paragraphs comprises:
giving a paragraph text to be processed, and calculating vectorization characteristics of sentences in the paragraph through a trained convolutional neural network model;
calculating vectorization characteristics of the paragraph through a trained convolutional neural network model;
and outputting the matching probabilities of the corresponding entity words from the generated paragraph vectorization features, using a linear transformation without Dropout followed by the Softmax function.
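At inference time Dropout is disabled and the plain linear transform feeds Softmax; a sketch with random toy weights in place of a trained model:

```python
import numpy as np

rng = np.random.default_rng(5)

def softmax(v):
    e = np.exp(v - v.max())
    return e / e.sum()

k, n_entities = 6, 5
z = rng.normal(size=k)                        # paragraph vector from the trained model
U, c = rng.normal(size=(n_entities, k)), np.zeros(n_entities)

# No Dropout at test time: the plain linear transformation is used.
probs = softmax(U @ z + c)                    # matching probability per entity word
linked_entity = int(np.argmax(probs))         # entity word linked to the paragraph
```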
10. The method of claim 1, wherein the convolutional neural network model uses a sentence-feature convolutional kernel with a sliding word window size of 3.
CN201510372795.3A 2015-06-30 2015-06-30 A kind of entity based on level convolutional network and paragraph link method Active CN104915448B (en)


Publications (2)

Publication Number Publication Date
CN104915448A CN104915448A (en) 2015-09-16
CN104915448B true CN104915448B (en) 2018-03-27





