CN112069408A - Recommendation system and method for fusion relation extraction - Google Patents

Publication number: CN112069408A
Authority: CN (China)
Prior art keywords: entity, module, vector, text, sentence
Legal status: Granted (Active)
Application number: CN202010931994.4A
Other languages: Chinese (zh)
Other versions: CN112069408B
Inventors: 刘琼昕, 宋祥, 卢士帅, 王佳升
Current Assignee: Beijing Institute of Technology (BIT)
Application filed by Beijing Institute of Technology (BIT)

Classifications

    • G06F16/9535: Search customisation based on user profiles and personalisation
    • G06F16/367: Ontology
    • G06F40/186: Templates
    • G06F40/242: Dictionaries
    • G06F40/284: Lexical analysis, e.g. tokenisation or collocates
    • G06F40/30: Semantic analysis
    • Y02D10/00: Energy efficient computing, e.g. low power processors, power management or thermal management


Abstract

The invention relates to a recommendation system and method with fused relation extraction, and belongs to the technical field of content recommendation. In the system, an article feature construction module constructs a text feature matrix, a basic entity feature matrix and an enhanced entity feature matrix, and from them obtains feature vectors for the target article and for the interactive articles in the user's historical behaviors; a user interest construction module obtains a user interest vector; a multilayer perceptron module obtains the probability that a user clicks the target article. The method comprises: 1) a relation extraction submodule predicts the relations between entities; 2) a word embedding set, a basic entity embedding set and an enhanced entity embedding set are acquired through knowledge extraction; 3) article features are constructed with a knowledge-aware CNN (KCNN); 4) user interests are constructed with an attention mechanism; 5) the feature vectors are spliced and the probability that the user clicks the target article is predicted. The system and method effectively reduce the influence of entities on the template, and their accuracy is higher than that of the prior art.

Description

Recommendation system and method for fusion relation extraction
Technical Field
The invention relates to a recommendation system and method that fuse relation extraction, and belongs to the technical fields of artificial intelligence, network big data and content recommendation.
Background
Content-based recommendation systems are widely applied in many fields and have broad development prospects. A recommendation system provides personalized service to the user, shortens the distance between the user and the network platform, and greatly improves user experience. In addition, the research concepts and processing methods of content-based recommendation systems have important reference value in fields such as advertisement placement and search engines.
For example, consider a news scenario. With the development of the World Wide Web (WWW), the vast majority of people read news on the internet. Abroad, Google News, Bing News and the like, and in China, online news websites such as Sina News and Tencent News, aggregate news from various sources and give readers an overview. Because the number of news items is huge, providing personalized news lists to different users is very important for alleviating information overload. News recommendation scenarios mainly adopt content-based recommendation methods, and in recent years better recommendation results have been obtained by fusing knowledge.
In addition, the invention relates to relation extraction, which aims to extract relations between entities automatically from unstructured text and provide deep text analysis for the user. It breaks away from traditional manual processing and can greatly improve efficiency and accuracy. The task also supplies key semantic information to dialogue systems, recommendation systems and the like, and provides technical support for natural language processing tasks such as semantic network labeling, machine translation and sentiment analysis, so it has great research significance.
Conventional content-based recommendation systems often draw on an existing knowledge graph to supplement knowledge, so the knowledge lacks pertinence. In addition, traditional relation extraction models use the WordNet dictionary only at the word level; the dictionary information is under-exploited and the deep relations between entities are missing.
Disclosure of Invention
The invention aims to solve the problem that the knowledge adopted by the existing content-based recommendation system is lack of pertinence when the knowledge is supplemented, further improve the prediction accuracy of the recommendation system, and provide a recommendation system and a recommendation method for extracting a fusion relationship.
The invention is realized by the following technical scheme.
The invention comprises a recommendation system with fused relation extraction and a corresponding recommendation method;
the recommendation system for extracting the fusion relationship comprises a knowledge extraction module, an article feature construction module, a user interest construction module and a multilayer perceptron module;
the knowledge extraction module comprises a word embedding module, a basic entity module and an enhanced entity module;
the basic entity module comprises a basic entity link sub-module, a filtering map sub-module and a knowledge representation learning sub-module; the enhanced entity module comprises an enhanced entity link sub-module, a relation extraction sub-module and a knowledge representation learning sub-module;
the relation extraction submodule comprises a sentence feature extractor, a template feature extractor and a threshold fusion device;
The modules of the recommendation system with fused relation extraction are connected as follows:
the knowledge extraction module is connected with the article characteristic construction module; the article characteristic construction module is connected with the user interest construction module; the article characteristic construction module, the user interest construction module and the multilayer perceptron module are connected;
in the knowledge extraction module, a word embedding module, a basic entity module and an enhanced entity module are in parallel relation;
a basic entity link submodule in the basic entity module is connected with a filtering map submodule, and the filtering map submodule is connected with a knowledge representation learning submodule; an enhanced entity link sub-module in the enhanced entity module is connected with a relationship extraction sub-module, and the relationship extraction sub-module is connected with a knowledge representation learning sub-module;
in the relation extraction submodule, a sentence feature extractor and a template feature extractor are respectively connected with a threshold fusion device;
The functions of the modules of the recommendation system with fused relation extraction are as follows:
the knowledge extraction module has the function of extracting knowledge required by the system;
the word embedding module receives a large-scale corpus and obtains the word embedding set; the basic entity module receives the text descriptions of the interactive articles in the historical behaviors and obtains the basic entity embedding set; the enhanced entity module receives the text descriptions of the interactive articles in the historical behaviors and obtains the enhanced entity embedding set;
in the basic entity module, the basic entity link submodule receives the text descriptions of the interactive articles in the historical behaviors and obtains an entity set; the filtering map submodule receives an external knowledge graph and removes the nodes that are not in the entity set; the knowledge representation learning submodule receives the filtered knowledge graph and obtains the basic entity embedding set. In the enhanced entity module, the enhanced entity link submodule receives the text descriptions of the interactive articles in the historical behaviors and obtains an entity set; the relation extraction submodule receives the text descriptions and the entity set and obtains the relevant knowledge graph; the knowledge representation learning submodule receives the relevant knowledge graph and obtains the enhanced entity embedding set;
in the relation extraction submodule, a sentence feature extractor receives a text to obtain sentence features; the template feature extractor receives the text to obtain template features; the threshold fusion device is used for receiving the sentence characteristics and the template characteristics to obtain text characteristics;
the article feature construction module is used for receiving the output of the knowledge extraction module, constructing a text feature matrix, a basic entity feature matrix and an enhanced entity feature matrix, and further obtaining a feature vector of a target article and a feature vector of an interactive article in user historical behaviors; the user interest construction module is used for receiving the output of the article characteristic construction module to obtain a user interest vector; the function of the multilayer perceptron module is to receive the output of the user interest building module and the feature vector of the target object in the output of the object feature building module to obtain the probability of the user clicking the target object;
the recommendation method for extracting the fusion relationship comprises the following steps:
step one, the relation extraction submodule predicts the relation between the entities and comprises the following substeps:
step 1.1, a sentence feature extractor acquires sentence features;
the sentence characteristic extractor is a relation extraction model which is one of an end-to-end model, a lexical model and a syntactic model;
the end-to-end model does not depend on external knowledge at all, and only sentences and entity pair information in the sentences are used;
the lexical model uses lexical information contained in the sentence, including named entity recognition, part-of-speech tags and WordNet hypernyms;
using syntactic information contained in the sentence based on a syntactic model, wherein the syntactic information comprises a phrase structure tree, a dependency tree and a shortest dependency path;
the sentence characteristic extractor utilizes semantic structure information contained in the sentence to extract characteristics, the characteristics are highly related to entity information, and finally sentence characteristics are obtained;
step 1.2, the template feature extractor obtains template features through entity replacement, word embedding, multi-head self-attention mechanism, bidirectional LSTM and attention mechanism operation, and specifically comprises the following steps:
step 1.2A, a template feature extractor replaces an entity in a text with an entity hypernym path through entity replacement operation to obtain a sentence with the entity replaced;
the method for obtaining the sentence with the replaced entity comprises the following steps:
step 1.2A1 setting the longest path length s;
step 1.2A2 initializing the path list to null;
step 1.2A3 obtaining a first hypernym of an entity in a WordNet dictionary;
step 1.2A4, determine whether the hypernym is empty and decide whether to jump to step 1.2A5, specifically:
if the hypernym is empty, go to step 1.2A5;
if the hypernym is not empty, add the hypernym to the list, assign it to the entity, and return to step 1.2A3;
step 1.2A5, calculate the length of the path list;
step 1.2A6, compare the length of the path list with the longest path length s and complete the entity replacement, specifically:
if the length of the path list is greater than or equal to s, take the first s items of the list and splice them to obtain the entity-replaced sentence;
otherwise, splice all list items to obtain the entity-replaced sentence;
Thus, through steps 1.2A1 to 1.2A6, each entity in the sentence is replaced by its hypernym path, giving the entity-replaced sentence;
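Steps 1.2A1 to 1.2A6 can be sketched in Python. The `hyp` dictionary below is a hypothetical stand-in for the first-hypernym lookups against the WordNet dictionary that the patent describes, and the space-join is one reading of the "splicing" operation:

```python
def hypernym_path(entity, hypernyms, s=4):
    """Build the hypernym path of an entity (steps 1.2A1-1.2A6)."""
    path = []                        # 1.2A2: initialise the path list to empty
    word = entity
    while True:
        h = hypernyms.get(word)      # 1.2A3: first hypernym of the current word
        if h is None:                # 1.2A4: hypernym empty -> go to 1.2A5
            break
        path.append(h)               # otherwise record it and climb further
        word = h
    if len(path) >= s:               # 1.2A5/1.2A6: truncate to the first s items
        path = path[:s]
    return " ".join(path) if path else entity

def replace_entities(sentence, entities, hypernyms, s=4):
    """Rewrite a sentence so every entity token becomes its hypernym path."""
    return " ".join(
        hypernym_path(tok, hypernyms, s) if tok in entities else tok
        for tok in sentence.split()
    )

hyp = {"dog": "canine", "canine": "carnivore", "carnivore": "animal",
       "animal": "organism", "organism": "entity"}
print(replace_entities("the dog barked", {"dog"}, hyp, s=3))
# prints: the canine carnivore animal barked
```

With s = 3, the five-step path for "dog" is truncated to its first three hypernyms before splicing.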
step 1.2B, converting the sentence with the replaced entity obtained in step 1.2A into a position vector;
wherein the elements in the position vector are defined as follows: if a word is an entity hypernym path, its template location flag is "0"; if the word is other words in the sentence, its template position flag is "1";
step 1.2C, carrying out word embedding operation on the sentences obtained in the step 1.2A and the position vectors obtained in the step 1.2B to obtain a word matrix and a template position mark matrix;
Specifically, a word table W ∈ R^{N×m_e} and a template position marker table T ∈ R^{2×m_t} are randomly initialized, where R is the real number field. Traversing each word x_i of the text and taking the corresponding rows of W and T gives the word vector e_i ∈ R^{m_e} of x_i and its template position marker vector t_i ∈ R^{m_t}. Splicing the word vectors of all words in the text gives the word matrix of the entity-replaced sentence, x_e = [e_1, e_2, …, e_n]; splicing the template position marker vectors of all words gives its template position marker matrix, x_t = [t_1, t_2, …, t_n];
where i runs from 1 to n, n is the length of the entity-replaced sentence, N is the total number of words in the word table, m_e is the dimension of the word vectors, and m_t is the dimension of the template position marker vectors;
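The row lookups of step 1.2C can be sketched with NumPy; the toy vocabulary, the dimensions m_e and m_t, and the random tables W and T are illustrative (in the patent both tables are randomly initialised and then trained):

```python
import numpy as np

rng = np.random.default_rng(0)
vocab = {"<unk>": 0, "animal": 1, "barked": 2, "the": 3}
m_e, m_t = 8, 2                          # word / marker embedding dims (illustrative)
W = rng.normal(size=(len(vocab), m_e))   # word table W in R^{N x m_e}
T = rng.normal(size=(2, m_t))            # template marker table T in R^{2 x m_t}

tokens  = ["the", "animal", "barked"]    # entity-replaced sentence from step 1.2A
markers = [1, 0, 1]                      # step 1.2B: "0" marks a hypernym-path word

x_e = np.stack([W[vocab.get(w, 0)] for w in tokens])  # word matrix, n x m_e
x_t = np.stack([T[m] for m in markers])               # marker matrix, n x m_t
print(x_e.shape, x_t.shape)              # (3, 8) (3, 2)
```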
step 1.2D, firstly, performing multi-head self-attention mechanism operation on the word matrix obtained in the step 1.2C, and then splicing the template position mark matrix to obtain the low-order characteristics of the text;
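One head of the multi-head self-attention in step 1.2D can be sketched as scaled dot-product attention over the word matrix; the projection matrices here are random stand-ins for trained parameters:

```python
import numpy as np

def self_attention_head(X, Wq, Wk, Wv):
    """Scaled dot-product attention for one head over the n x d word matrix X."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = Q @ K.T / np.sqrt(K.shape[1])          # n x n attention scores
    A = np.exp(scores - scores.max(axis=1, keepdims=True))
    A = A / A.sum(axis=1, keepdims=True)            # row-wise softmax over words
    return A @ V                                    # n x d_k contextual features

rng = np.random.default_rng(1)
n, d, dk = 4, 6, 3                                  # words, input dim, head dim
X = rng.normal(size=(n, d))
out = self_attention_head(X, *(rng.normal(size=(d, dk)) for _ in range(3)))
print(out.shape)                                    # (4, 3)
```

A multi-head version concatenates several such heads before splicing in the template position marker matrix.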
step 1.2E, performing bidirectional LSTM operation on the low-order features of the text acquired in the step 1.2D to obtain high-order features of the text;
step 1.2F, obtaining template characteristics by subjecting the high-order characteristics of the text obtained in step 1.2E to an attention mechanism, and specifically calculating the characteristics as shown in formulas (1) to (5):
M = tanh(H)        (1)
α = softmax(Mw)    (2)
r = H^T α          (3)
R′ = tanh(r)       (4)
R = dropout(R′)    (5)
where H ∈ R^{n×m_h} is the high-order feature of the text, H^T is its transpose, and m_h is the dimension of the high-order features; w ∈ R^{m_h×1} is a parameter to be trained; α ∈ R^{n×1} is the computed column vector of weights; r ∈ R^{m_h} is the weighted feature vector; the output of the attention mechanism is R ∈ R^{m_h}; softmax(·) and tanh(·) are activation functions; and dropout(·) denotes the operation of randomly replacing dimensions of a vector with 0;
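Equations (1) to (4) translate directly into NumPy; dropout in (5) is the identity at inference time, so it is omitted here:

```python
import numpy as np

def softmax_col(z):
    e = np.exp(z - z.max())
    return e / e.sum()

rng = np.random.default_rng(2)
n, m_h = 5, 4                        # sentence length, high-order feature dim
H = rng.normal(size=(n, m_h))        # high-order features from the bidirectional LSTM
w = rng.normal(size=(m_h, 1))        # trainable parameter w in R^{m_h x 1}

M = np.tanh(H)                       # (1) M = tanh(H)
alpha = softmax_col(M @ w)           # (2) alpha = softmax(Mw), weights over words
r = H.T @ alpha                      # (3) r = H^T alpha, weighted feature vector
R = np.tanh(r).ravel()               # (4) R' = tanh(r); dropout (5) skipped
print(R.shape)                       # (4,)
```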
step 1.3, a threshold fusion device fuses sentence characteristics and template characteristics to obtain text characteristics; the method specifically comprises the following steps:
Step 1.3A: map the sentence feature and the template feature into the same vector space so that their dimensions agree, i.e. C_g = tanh(W_mapc C + b_mapc), R_g = tanh(W_mapr R + b_mapr);
where C ∈ R^{m_c} is the sentence feature, R ∈ R^{m_h} is the template feature, W_mapc ∈ R^{m_g×m_c} and W_mapr ∈ R^{m_g×m_h} are mapping matrices, b_mapc, b_mapr ∈ R^{m_g} are bias vectors, m_c is the dimension of the sentence feature, and m_g is the vector dimension after mapping;
Step 1.3B: compute the threshold weights of C_g and R_g at dimension granularity, g_C = exp(C_g) / (exp(C_g) + exp(R_g)) and g_R = exp(R_g) / (exp(C_g) + exp(R_g)), where exp(·) denotes the element-wise exponential;
Step 1.3C: weight C_g and R_g by g_C and g_R respectively to obtain the text feature V = g_C ⊙ C_g + g_R ⊙ R_g, where ⊙ is element-wise multiplication;
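A sketch of the step-1.3 threshold fusion under one assumption: the threshold weights g_C and g_R are the exponential-normalised gates suggested by the patent's use of exp(·). Weight names and sizes are illustrative:

```python
import numpy as np

def threshold_fusion(C, R, Wc, bc, Wr, br):
    """Fuse sentence features C and template features R into text features V."""
    Cg = np.tanh(Wc @ C + bc)            # step 1.3A: map into a common m_g space
    Rg = np.tanh(Wr @ R + br)
    eC, eR = np.exp(Cg), np.exp(Rg)      # step 1.3B: per-dimension gate weights
    gC, gR = eC / (eC + eR), eR / (eC + eR)
    return gC * Cg + gR * Rg             # step 1.3C: V = g_C . Cg + g_R . Rg

rng = np.random.default_rng(3)
m_c, m_h, m_g = 6, 4, 5                  # sentence / template / mapped dims
C, R = rng.normal(size=m_c), rng.normal(size=m_h)
V = threshold_fusion(C, R,
                     rng.normal(size=(m_g, m_c)), np.zeros(m_g),
                     rng.normal(size=(m_g, m_h)), np.zeros(m_g))
print(V.shape)                           # (5,)
```

Because gC + gR = 1 in every dimension, V is a per-dimension convex combination of the two mapped features.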
Step 1.4: predict the relation from the text feature obtained in step 1.3 through a fully-connected network and the softmax(·) function, specifically:
first, taking the text feature V as input, the fully-connected network and softmax(·) give the probability distribution over relation classes, p(y|S) = softmax(W_S V + b_S); then the relation class corresponding to the maximum of the distribution, ŷ = argmax_y p(y|S), is taken as the prediction result;
where S denotes the sentence, W_S ∈ R^{m×m_g} is the mapping matrix between text features and relations, b_S ∈ R^m is the bias vector, m is the number of relation classes, and argmax_y denotes taking the y whose result is maximal;
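Step 1.4 reduces to a linear map, a softmax, and an argmax; W_S, b_S and the relation labels below are illustrative placeholders:

```python
import numpy as np

def predict_relation(V, Ws, bs, labels):
    """Map text feature V to p(y|S) and return the most probable relation."""
    logits = Ws @ V + bs
    p = np.exp(logits - logits.max())
    p = p / p.sum()                      # p(y|S) over the m relation classes
    return labels[int(np.argmax(p))], p  # y-hat = argmax_y p(y|S)

rng = np.random.default_rng(4)
m_g, m = 5, 3                            # text-feature dim, number of classes
V = rng.normal(size=m_g)
labels = ["hypernym", "part-of", "no-relation"]
y_hat, p = predict_relation(V, rng.normal(size=(m, m_g)), np.zeros(m), labels)
print(y_hat)
```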
step two, acquiring a word embedding set, a basic entity embedding set and an enhanced entity embedding set through knowledge extraction;
step 2.1, training a word embedding set from a large-scale corpus by using a word2vec word embedding method;
step 2.2, acquiring a basic entity embedding set;
First, the text is matched and disambiguated against a knowledge base to obtain the entity set contained in the text. Because the original knowledge graph is large, a subgraph is then extracted from it by removing the nodes that are not in the entity set, giving the basic knowledge graph. Finally, the knowledge representation learning method TransD maps the entities and relations of the basic knowledge graph into a low-dimensional vector space, giving the basic entity embedding set;
step 2.3, obtaining an enhanced entity embedding set;
First, the text is matched and disambiguated against a knowledge base to obtain the entity set contained in the text; the corresponding entities are then marked in the description text and the relation extraction submodule performs relation identification. After entity linking, one sentence may contain several entities; relations are predicted over all entity combinations, constructing the enhanced knowledge graph. Finally, the knowledge representation learning method TransD maps the entities and relations of the enhanced knowledge graph into a low-dimensional vector space, giving the enhanced entity embedding set;
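The TransD method used in steps 2.2 and 2.3 gives every entity and relation a second "projection" vector and scores a triple (h, r, t) by the distance ||h_perp + r - t_perp||; a minimal sketch, with random vectors standing in for learned embeddings:

```python
import numpy as np

def transd_score(h, t, r, hp, tp, rp):
    """TransD plausibility score of a triple (h, r, t); lower is more plausible.

    h, t, hp, tp are entity and entity-projection vectors (dim n);
    r, rp are relation and relation-projection vectors (dim m).
    """
    n, m = h.shape[0], r.shape[0]
    Mrh = np.outer(rp, hp) + np.eye(m, n)   # dynamic mapping matrix for h
    Mrt = np.outer(rp, tp) + np.eye(m, n)   # dynamic mapping matrix for t
    h_perp, t_perp = Mrh @ h, Mrt @ t       # project entities into relation space
    return float(np.linalg.norm(h_perp + r - t_perp))

rng = np.random.default_rng(5)
n, m = 6, 4                                 # entity / relation space dims
h, t, hp, tp = (rng.normal(size=n) for _ in range(4))
r, rp = rng.normal(size=m), rng.normal(size=m)
score = transd_score(h, t, r, hp, tp, rp)
print(score >= 0.0)                         # prints True
```

Training minimizes this score for observed triples relative to corrupted ones; the embedding sets are the resulting vectors.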
Step three: construct article features using a knowledge-aware convolutional neural network (KCNN);
step 3.1, constructing a text feature matrix on the basis of the word embedding set obtained in the step two; the method specifically comprises the following steps:
firstly, searching a vector corresponding to each word in an article description text in a word embedding set, and if not, randomly initializing the vector; then all vectors are spliced to obtain a text feature matrix;
3.2, constructing basic entity characteristics on the basis of the basic entity embedded set obtained in the second step; the method specifically comprises the following steps:
First, for each word in the article description text, look up the corresponding vector in the basic entity embedding set; if a word has no entry, substitute a zero vector. Then map each vector into the same vector space as the text features through the mapping function f_m(X) = ReLU(W_m X + b_m); finally, splice all vectors to obtain the basic entity feature matrix;
where W_m ∈ R^{d_w×d_e} is a transformation matrix, b_m ∈ R^{d_w} is a bias vector, d_w is the dimension of the word embeddings, and d_e is the dimension of the basic entity embeddings;
Step 3.3: construct the enhanced entity feature matrix on the basis of the enhanced entity embedding set obtained in step two. Specifically: first, for each word in the article description text, look up the corresponding vector in the enhanced entity embedding set; if a word has no entry, substitute a zero vector; then map each vector into the same vector space as the text features through the mapping function f_m(X) = ReLU(W_m X + b_m); finally, splice all vectors to obtain the enhanced entity feature matrix;
Step 3.4: stack the text feature matrix, the basic entity feature matrix and the enhanced entity feature matrix as the three-channel input of the KCNN model;
Step 3.5: apply multiple convolution kernels to construct the article feature vector, obtaining the feature vector of the article;
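The stack-convolve-pool pipeline of the KCNN article encoder described above can be sketched as follows; the sizes and the kernel count are illustrative:

```python
import numpy as np

def kcnn_features(w, e_base, e_enh, kernels):
    """Stack the three feature matrices as channels, convolve, max-pool."""
    I = np.stack([w, e_base, e_enh])          # 3 x n x d_w three-channel input
    feats = []
    for K in kernels:                         # K: one 3 x l x d_w convolution kernel
        l = K.shape[1]
        conv = [np.sum(I[:, i:i + l, :] * K)  # slide the window over the n words
                for i in range(I.shape[1] - l + 1)]
        feats.append(max(conv))               # max-pooling keeps one value per kernel
    return np.array(feats)                    # article feature vector

rng = np.random.default_rng(6)
n, d_w = 10, 8                                # words per description, embedding dim
w, eb, ee = (rng.normal(size=(n, d_w)) for _ in range(3))
kernels = [rng.normal(size=(3, l, d_w)) for l in (2, 3, 4)]
f = kcnn_features(w, eb, ee, kernels)
print(f.shape)                                # (3,)
```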
step four, constructing user interests by using an attention mechanism; the method comprises the following specific steps:
First, the degree of influence is computed with an attention network. The inputs of the attention network are the feature vector q_v of the target article and the feature vectors q_i^u of the interactive articles in the user's historical behaviors; after splicing, a fully-connected network outputs a raw weight s̃_i for each interactive article, and normalization gives the influence degree s_i = exp(s̃_i) / Σ_j exp(s̃_j). Finally, the interest feature vector of user u is constructed as u = Σ_i s_i · q_i^u.
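A sketch of the step-four interest construction. The patent's fully-connected attention network is replaced here by a dot-product score, an assumption made to keep the example self-contained; the softmax normalisation and the weighted sum follow the text:

```python
import numpy as np

def user_interest(q_v, history):
    """Attention-weighted sum of the user's interactive-article vectors."""
    raw = np.array([q_v @ q_i for q_i in history])   # raw weights (dot-product stand-in)
    s = np.exp(raw - raw.max())
    s = s / s.sum()                                  # normalised influence degrees s_i
    return sum(w_i * q_i for w_i, q_i in zip(s, history))  # interest vector u

rng = np.random.default_rng(7)
d = 5
q_v = rng.normal(size=d)                             # target-article feature vector
hist = [rng.normal(size=d) for _ in range(4)]        # interactive-article vectors
u = user_interest(q_v, hist)
print(u.shape)                                       # (5,)
```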
Step five: splice the interest feature vector u of user u with the feature vector q_v of the target article v, and predict the probability that user u clicks target article v through a multilayer perceptron, p(x) = MLP([u ; q_v]);
where MLP(·) denotes a multilayer perceptron using the ReLU nonlinear activation function, x denotes the input of the model, and [· ; ·] denotes the vector splicing operation;
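Step five as a one-hidden-layer sketch; the patent specifies only an MLP with ReLU activations, so the single hidden layer, the sigmoid output and all weights here are illustrative assumptions:

```python
import numpy as np

def click_probability(u, q_v, W1, b1, w2, b2):
    """p(x) = MLP([u ; q_v]) with one ReLU hidden layer and a sigmoid output."""
    x = np.concatenate([u, q_v])          # splice user interest and target article
    h = np.maximum(0.0, W1 @ x + b1)      # ReLU hidden layer
    z = w2 @ h + b2
    return 1.0 / (1.0 + np.exp(-z))       # click probability in (0, 1)

rng = np.random.default_rng(8)
d, hidden = 5, 8
u, q_v = rng.normal(size=d), rng.normal(size=d)
p = click_probability(u, q_v,
                      rng.normal(size=(hidden, 2 * d)), np.zeros(hidden),
                      rng.normal(size=hidden), 0.0)
print(0.0 < p < 1.0)                      # prints True
```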
advantageous effects
Compared with the prior art, the recommendation system and method for extracting the fusion relationship have the following beneficial effects:
1. Replacing each entity with its hypernym path makes fuller use of the WordNet dictionary information;
2. In obtaining the template features, the hypernym path of the entity and the template position markers effectively reduce the influence of the entity on the template;
3. The enhanced entity features obtained through relation extraction make the supplementary knowledge more targeted;
4. compared with the prior art, the accuracy of the recommendation system is improved.
Drawings
FIG. 1 is a schematic diagram of the module components of a recommendation system for fusion relationship extraction;
FIG. 2 is a schematic diagram of predicting relationships between entities using a relationship extraction sub-module;
FIG. 3 is a flow diagram of extracting knowledge;
FIG. 4 is a diagram of a build article feature structure;
FIG. 5 is a diagram of building a user interest structure;
FIG. 6 is a schematic diagram of a recommendation system for fused relationship extraction.
Detailed Description
The following describes a recommendation system and method for fusion relation extraction according to the present invention in detail with reference to the accompanying drawings and embodiments.
Example 1
In this embodiment, with reference to fig. 1 to fig. 6, the recommendation system and method for extracting a fusion relationship according to the present invention are described in a news scene.
The recommendation system with fused relation extraction is used to recommend news. News websites contain a huge amount of news, and within this mass it is important to surface the news each user finds interesting. A news website collects the news a user has clicked and browsed to form the user's historical interactive behaviors, analyzes the news headline texts, supplements knowledge, extracts the user's interests, and predicts which news the user will be interested in.
FIG. 1 is a schematic diagram of the module composition of the recommendation system with fused relation extraction. As can be seen from FIG. 1, the knowledge extraction module is connected with the article feature construction module; the article feature construction module is connected with the user interest construction module; the article feature construction module, the user interest construction module and the multilayer perceptron module are connected; in the knowledge extraction module, the word embedding module, the basic entity module and the enhanced entity module are parallel; in the basic entity module, the basic entity link submodule is connected with the filtering map submodule, and the filtering map submodule is connected with the knowledge representation learning submodule; in the enhanced entity module, the enhanced entity link submodule is connected with the relation extraction submodule, and the relation extraction submodule is connected with the knowledge representation learning submodule; in the relation extraction submodule, the sentence feature extractor and the template feature extractor are each connected with the threshold fusion device;
FIG. 2 is a schematic diagram of predicting relations between entities with the relation extraction submodule. As can be seen from FIG. 2, the relation extraction submodule consists of a sentence feature extractor, a template feature extractor and a threshold fusion device; the sentence feature extractor extracts sentence features; the template feature extractor obtains template features through entity replacement, word embedding, a multi-head self-attention mechanism, a bidirectional LSTM and an attention mechanism; the threshold fusion device fuses the sentence features and template features into text features; finally, the relation is predicted from the text features;
FIG. 3 is a flow diagram of knowledge extraction. The word embedding set S_w is trained from a large-scale corpus with the word2vec word embedding method. To obtain the basic entity embedding set, the text is first matched and disambiguated against a knowledge base to obtain the entity set contained in the text; a subgraph is then extracted from the existing knowledge graph by removing the nodes that are not in the entity set, giving the basic knowledge graph; finally, a knowledge representation learning method maps the entities and relations of the basic knowledge graph into a low-dimensional vector space, giving the basic entity embedding set S_b. To obtain the enhanced entity embedding set, the corresponding entities are first marked in the description text according to the entity set; the relation extraction submodule then performs relation identification, combining and predicting over all entities to construct the enhanced knowledge graph; finally, a knowledge representation learning method maps the entities and relations of the enhanced knowledge graph into a low-dimensional vector space, giving the enhanced entity embedding set S_e.
Fig. 4 is a diagram of building article features. As can be seen from FIG. 4, the text feature matrix w, the basic entity feature matrix e_base, and the enhanced entity feature matrix e_enhance are stacked as the three-channel input I = stack(w, e_base, e_enhance) of the KCNN model; convolution kernels of window size l over the d_w-dimensional embeddings are then used: first, a convolution operation is performed on the input I to obtain a feature vector; then, a maximum pooling operation on the feature vector gives the corresponding feature value; finally, the feature values obtained with the multiple convolution kernels are spliced to obtain the feature vector of the article;
wherein stack(·) represents a matrix stacking operation; l represents the convolution window size; d_w is the dimension of word embedding;
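The stack–convolve–max-pool–splice pipeline of Fig. 4 can be sketched in plain numpy as below; the number of kernels, their window sizes, and the absence of a bias/activation are illustrative simplifications, not values from the invention:

```python
import numpy as np

def kcnn_features(w, e_base, e_enh, kernels):
    """w, e_base, e_enh: aligned (n, d_w) matrices -> article feature vector."""
    I = np.stack([w, e_base, e_enh])                    # (3, n, d_w), 3 channels
    feats = []
    for h in kernels:                                   # each kernel h: (3, l, d_w)
        l, n = h.shape[1], I.shape[1]
        conv = [np.sum(I[:, i:i + l, :] * h)            # convolution over positions
                for i in range(n - l + 1)]
        feats.append(max(conv))                         # max-pooling -> one value
    return np.array(feats)                              # splice kernel outputs

rng = np.random.default_rng(0)
n, d_w = 6, 4
w, eb, ee = (rng.normal(size=(n, d_w)) for _ in range(3))
kernels = [rng.normal(size=(3, l, d_w)) for l in (2, 3)]  # two window sizes
v = kcnn_features(w, eb, ee, kernels)
```

With two kernels the article feature vector has two components, one per kernel; a real model would use many kernels per window size.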
FIG. 5 is a block diagram of building the user interest. As can be seen from FIG. 5, the present invention uses an attention network to compute the influence degrees: the input information is the feature vector q_v of the target item and the feature vectors q_i of the interacted items in the user historical behavior; after splicing, the weight values s′_i are output through the fully-connected network; the influence degrees s_i are then obtained through normalization; finally, the interest feature vector u of user u is constructed;
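The attention network of Fig. 5 can be sketched as follows; the scoring function (a single linear layer over the spliced vectors) and softmax normalization are assumptions consistent with the description, not the invention's exact network:

```python
import numpy as np

def user_interest(q_hist, q_v, w):
    """q_hist: (k, d) interacted-item features; q_v: (d,) target-item feature;
    w: (2d,) weights of a one-layer scoring network (illustrative)."""
    # splice each history item with the target item and score it
    scores = np.array([np.dot(np.concatenate([q_i, q_v]), w) for q_i in q_hist])
    s = np.exp(scores) / np.exp(scores).sum()   # normalized influence degrees s_i
    return s @ q_hist                           # interest feature vector u

rng = np.random.default_rng(1)
q_hist = rng.normal(size=(5, 8))                # 5 interacted items, d = 8
q_v = rng.normal(size=8)
u = user_interest(q_hist, q_v, rng.normal(size=16))
```

The interest vector is thus a weighted average of the history-item features, with weights conditioned on the target item.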
FIG. 6 is a schematic diagram of the recommendation system with fused relation extraction. As can be seen from FIG. 6, the recommendation system comprises four modules, namely knowledge extraction, article feature construction, user interest construction, and a multilayer perceptron, and finally obtains the probability p(x) that user u clicks the target article v;
the knowledge extraction module obtains a word embedding set, a basic entity embedding set and an enhanced entity embedding set; the article feature construction module is used for receiving the output of the knowledge extraction module to obtain a feature vector of a target article and a feature vector of an interactive article in user historical behaviors; the user interest construction module is used for receiving the output of the article characteristic construction module to obtain a user interest vector; the function of the multilayer perceptron module is to receive the output of the user interest building module and the feature vector of the target object in the output of the object feature building module to obtain the probability of the user clicking the target object;
example 2
In step 1.2A, the entity in the text is replaced with the hypernym path of the entity to obtain a sentence with the entity replaced. Taking the sentence "The bottom photo is from the New York public library" as an example, the process of obtaining a sentence with the entity replaced is described:
Step 1.2A1 sets the longest path length s to 8;
Step 1.2A2 initializes the path list L to empty, and initializes the entity to "photo";
Step 1.2A3 obtains the first hypernym "creation" of the entity in the WordNet dictionary;
Step 1.2A4 judges that "creation" is not empty, adds the hypernym "creation" to the list L, i.e., L = {"creation"}, and assigns "creation" to the entity, i.e., entity = "creation";
Step 1.2A5 loops steps 1.2A3 to 1.2A4 until the entity is "entity", whose first hypernym in the WordNet dictionary is empty, at which point the list L = {"creation", "artifact", "whole", "object", "physical_entity", "entity"};
Step 1.2A6 calculates the length of the path list L, which is 6;
Step 1.2A7 compares the length of the path list with the longest path length s; the list length 6 is smaller than the longest path length 8, so all list items are spliced with ".", from the topmost hypernym downward, i.e., the entity "photo" is replaced by the entity hypernym path "entity.physical_entity.object.whole.artifact.creation";
Similarly, through steps 1.2A1 to 1.2A7, the entity "library" is replaced with the entity hypernym path "entity.physical_entity.object.whole.artifact.structure";
Thus, the entities in the sentence are replaced by entity hypernym paths, giving the entity-replaced sentence "The bottom entity.physical_entity.object.whole.artifact.creation is from the New York public entity.physical_entity.object.whole.artifact.structure";
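The hypernym-path replacement of Example 2 can be sketched as below. So the sketch is self-contained, the WordNet lookup is replaced by a hand-written first-hypernym table (a real implementation would query WordNet instead); the "." separator follows the example sentence above:

```python
# Illustrative table standing in for WordNet's first-hypernym lookup.
HYPERNYM = {
    "photo": "creation", "creation": "artifact", "artifact": "whole",
    "whole": "object", "object": "physical_entity",
    "physical_entity": "entity",   # "entity" has no hypernym (top of WordNet)
}

def hypernym_path(entity, s=8):
    """Build the entity hypernym path as in steps 1.2A1-1.2A7."""
    path = []                                  # step 1.2A2: empty path list
    while entity in HYPERNYM:                  # stop once the first hypernym is empty
        entity = HYPERNYM[entity]
        path.append(entity)                    # step 1.2A4: append and ascend
    if len(path) >= s:                         # step 1.2A6: keep the first s items
        path = path[:s]
    return ".".join(reversed(path))            # splice from the root downward

assert hypernym_path("photo") == "entity.physical_entity.object.whole.artifact.creation"
```

Truncation with a smaller `s` keeps the s hypernyms nearest the original word, matching the "first s items of the list" rule in claim 2.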
example 3
In the specific implementation of step 1.2B, the entity-replaced sentence is converted into a position vector. Taking the sentence "The bottom entity.physical_entity.object.whole.artifact.creation is from the New York public entity.physical_entity.object.whole.artifact.structure" as an example, the process of obtaining the position vector is described:
each word in the sentence is traversed; "entity.physical_entity.object.whole.artifact.creation" and "entity.physical_entity.object.whole.artifact.structure" are entity hypernym paths, so their corresponding template position flag is "0"; the template position flags corresponding to the remaining words are "1"; the position vector "1101111110" is thus obtained;
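The position-vector construction of Example 3 can be sketched as below; treating any token containing the "." separator as an entity hypernym path is an assumption made for this illustration:

```python
def position_vector(sentence):
    """Flag '0' for entity hypernym-path tokens, '1' for ordinary words."""
    return "".join("0" if "." in tok else "1" for tok in sentence.split())

s = ("The bottom entity.physical_entity.object.whole.artifact.creation "
     "is from the New York public "
     "entity.physical_entity.object.whole.artifact.structure")
assert position_vector(s) == "1101111110"
```

The ten tokens of the example sentence yield the ten-flag vector "1101111110", with the two "0"s marking the replaced entities.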
example 4
In the specific implementation of step 1.2D, the input of the multi-head self-attention mechanism is Q = K = V = x_e ∈ R^{n×m_e} and the output is MultiHead(Q, K, V) ∈ R^{n×m_e}, as shown in formulas (6) to (8):

MultiHead(Q, K, V) = [head_1 : head_2 : … : head_r] W^M (6)

head_i = Attention(Q W_i^Q, K W_i^K, V W_i^V) (7)

Attention(Q, K, V) = softmax(Q K^T / √(m_e/r)) V (8)

wherein r is the number of mutually different subspaces, i.e., the number of heads; head_i ∈ R^{n×(m_e/r)} is the result of the i-th head; [:] denotes the splicing operation; the splicing result is multiplied by W^M ∈ R^{m_e×m_e}, the linear transformation matrix of the multi-head self-attention mechanism; W_i^Q, W_i^K, W_i^V ∈ R^{m_e×(m_e/r)} are the corresponding linear transformation matrices, randomly initialized; Attention(·) denotes the scaled dot-product attention mechanism; the superscript T denotes transposition; softmax(·) denotes a normalization function; n is the sentence length;
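Formulas (6) to (8) can be sketched in numpy as follows, with randomly initialized projection matrices; head count and matrix sizes are illustrative:

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))   # numerically stable
    return e / e.sum(axis=axis, keepdims=True)

def multi_head(x, r, rng):
    """x: (n, m_e) word matrix; r heads; returns (n, m_e) per formulas (6)-(8)."""
    n, m_e = x.shape
    d = m_e // r                                      # per-head subspace size
    heads = []
    for _ in range(r):
        Wq, Wk, Wv = (rng.normal(size=(m_e, d)) for _ in range(3))
        Q, K, V = x @ Wq, x @ Wk, x @ Wv
        A = softmax(Q @ K.T / np.sqrt(d))             # scaled dot-product attention
        heads.append(A @ V)                           # head_i: (n, d)
    WM = rng.normal(size=(m_e, m_e))                  # output transformation W^M
    return np.concatenate(heads, axis=1) @ WM         # [head_1:...:head_r] W^M

rng = np.random.default_rng(2)
x = rng.normal(size=(5, 8))                           # n = 5 words, m_e = 8
out = multi_head(x, r=2, rng=rng)
```

The output keeps the input shape (n, m_e), so it can be spliced with the template position mark matrix in step 1.2D.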
example 5
The experimental comparison results between the recommendation system with fused relation extraction and various baseline methods are shown below; the method of the invention achieves the best performance on the AUC index.
Table: comparison of the recommendation effect of the system with fused relation extraction
While the foregoing is directed to the preferred embodiment of the present invention, the invention is not limited to the embodiments and drawings disclosed herein. Equivalents and modifications made without departing from the spirit of the disclosure are to be considered within the scope of the invention.

Claims (7)

1. A recommendation system for fusion relationship extraction, characterized by: the system comprises a knowledge extraction module, an article feature construction module, a user interest construction module and a multilayer perceptron module;
the knowledge extraction module comprises a word embedding module, a basic entity module and an enhanced entity module;
the basic entity module comprises a basic entity link sub-module, a filtering map sub-module and a knowledge representation learning sub-module; the enhanced entity module comprises an enhanced entity link sub-module, a relation extraction sub-module and a knowledge representation learning sub-module;
the relation extraction submodule comprises a sentence feature extractor, a template feature extractor and a threshold fusion device;
the connection relation of each module in the recommendation system extracted by the fusion relation is as follows:
the knowledge extraction module is connected with the article feature construction module; the article feature construction module is connected with the user interest construction module; the article feature construction module and the user interest construction module are both connected with the multilayer perceptron module;
in the knowledge extraction module, a word embedding module, a basic entity module and an enhanced entity module are in parallel relation;
a basic entity link submodule in the basic entity module is connected with a filtering map submodule, and the filtering map submodule is connected with a knowledge representation learning submodule; an enhanced entity link sub-module in the enhanced entity module is connected with a relationship extraction sub-module, and the relationship extraction sub-module is connected with a knowledge representation learning sub-module;
in the relation extraction submodule, a sentence characteristic extractor and a template characteristic extractor are respectively connected with a threshold fusion device.
2. A recommendation method for fusion relation extraction is characterized in that: the method comprises the following steps:
step one, the relation extraction submodule predicts the relation between the entities and comprises the following substeps:
step 1.1, the sentence feature extractor obtains sentence features, specifically:
the sentence characteristic extractor utilizes semantic structure information contained in the sentence to extract characteristics, the characteristics are highly related to entity information, and finally sentence characteristics are obtained;
step 1.2, the template feature extractor obtains template features through entity replacement, word embedding, multi-head self-attention mechanism, bidirectional LSTM and attention mechanism operation, and specifically comprises the following steps:
step 1.2A, a template feature extractor replaces an entity in a text with an entity hypernym path through entity replacement operation to obtain a sentence with the entity replaced;
the method for obtaining the sentence with the replaced entity comprises the following steps:
step 1.2A1 setting the longest path length s;
step 1.2A2 initializing the path list to null;
step 1.2A3 obtaining a first hypernym of an entity in a WordNet dictionary;
step 1.2a4, determining whether the hypernym is empty, and deciding whether to jump to step 1.2a5, specifically:
if the hypernym is empty, then go to step 1.2A 5;
if the hypernym is not empty, adding the hypernym into the list, assigning the hypernym to the entity, and returning to the step 1.2A3 for execution;
step 1.2A5 calculating path list length;
step 1.2a6, comparing the length of the path list with the length s of the longest path, and completing the method for obtaining the sentence with the entity replaced, specifically:
if the length of the path list is greater than or equal to the length s of the longest path, intercepting the first s items of the list, and splicing the first s items to obtain a sentence with the entity replaced;
otherwise, if the length of the path list is smaller than the longest path length s, all list items are spliced with "." to obtain the sentence with the entity replaced;
so far, through the steps from step 1.2a1 to step 1.2a6, the entity in the sentence is replaced by the entity hypernym path, and a sentence with the entity replaced is obtained;
step 1.2B, converting the sentence with the replaced entity obtained in step 1.2A into a position vector;
wherein the elements in the position vector are defined as follows: if a word is an entity hypernym path, its template location flag is "0"; if the word is other words in the sentence, its template position flag is "1";
step 1.2C, carrying out word embedding operation on the sentences obtained in the step 1.2A and the position vectors obtained in the step 1.2B to obtain a word matrix and a template position mark matrix;
specifically, a word table W ∈ R^{N×m_e} and a template position mark table T ∈ R^{2×m_t} are randomly initialized; each word x_i of the text is traversed, and the corresponding rows of the word table W and the mark table T are taken to obtain the word vector e_i and the template position marker vector t_i of x_i; the word vectors of all words in the text are spliced to obtain the word matrix x_e = [e_1, e_2, ..., e_n] of the entity-replaced sentence; the template position marker vectors of all words in the text are spliced to obtain the template position mark matrix x_t = [t_1, t_2, ..., t_n] of the entity-replaced sentence; R is the real number field, and the superscripts N×m_e and 2×m_t denote the dimensions over R;
wherein i traverses from 1 to n, and n is the length of the entity-replaced sentence; N is the total number of words in the word table; e_i ∈ R^{m_e} denotes the word vector of x_i, and m_e is the dimension of the word vector; t_i ∈ R^{m_t} denotes the template position marker vector of x_i, and m_t is the dimension of the template position marker vector;
step 1.2D, firstly, performing multi-head self-attention mechanism operation on the word matrix obtained in the step 1.2C, and then splicing the template position mark matrix to obtain the low-order characteristics of the text;
step 1.2E, performing bidirectional LSTM operation on the low-order features of the text acquired in the step 1.2D to obtain high-order features of the text;
Step 1.2F, the template features are obtained by subjecting the high-order features of the text obtained in step 1.2E to an attention mechanism, specifically calculated as shown in formulas (1) to (5):

M = tanh(H) (1)

α = softmax(M w) (2)

r = H^T α (3)

R′ = tanh(r) (4)

R = dropout(R′) (5)

wherein H ∈ R^{n×m_h} is the high-order feature of the text; H^T is the transpose of H; m_h is the dimension of the high-order feature; w ∈ R^{m_h×1} is a parameter that needs to be trained; α ∈ R^{n×1} is the computed weight column vector; r ∈ R^{m_h×1} is the weighted feature vector; the output of the attention mechanism is R ∈ R^{m_h×1}; softmax(·) and tanh(·) are activation functions; dropout(·) denotes the operation of randomly replacing the values of certain dimensions of a vector with 0;
step 1.3, a threshold fusion device fuses sentence characteristics and template characteristics to obtain text characteristics; the method specifically comprises the following steps:
Step 1.3A, the sentence feature and the template feature are mapped to the same vector space, ensuring that their dimensions are the same, i.e., C_g = tanh(W_mapc C + b_mapc), R_g = tanh(W_mapr R + b_mapr);
wherein C ∈ R^{m_c×1} is the sentence feature; R ∈ R^{m_h×1} is the template feature; W_mapc ∈ R^{m_g×m_c} and W_mapr ∈ R^{m_g×m_h} are mapping matrices; b_mapc, b_mapr ∈ R^{m_g×1} are bias vectors; m_c is the dimension of the sentence feature; m_g is the vector dimension after mapping;
Step 1.3B, the threshold weights of C_g and R_g on dimension granularity are computed as g_C = exp(C_g) / (exp(C_g) + exp(R_g)) and g_R = exp(R_g) / (exp(C_g) + exp(R_g));
wherein exp(·) denotes the element-wise exponential operation;
Step 1.3C, C_g and R_g are weighted by g_C and g_R respectively to obtain the text feature V;
wherein V = g_C ⊙ C_g + g_R ⊙ R_g, and ⊙ is the element-by-element multiplication operation;
Step 1.4, the relation is predicted from the text feature obtained in step 1.3 through a fully-connected network and the softmax(·) function, specifically:
first, the text feature V is taken as input, and the probability distribution p(y|S) = softmax(W_S V + b_S) over the relation categories is obtained through the fully-connected network and the softmax function; then the relation category ŷ = argmax_y p(y|S) corresponding to the maximum of the probability distribution is taken as the prediction result;
wherein S represents the sentence; W_S ∈ R^{m×m_g} is the mapping matrix between text features and relations; b_S ∈ R^m is the bias vector; m is the number of relation categories; argmax_y(·) denotes the operation of taking the value of y corresponding to the maximum result;
step two, acquiring a word embedding set, a basic entity embedding set and an enhanced entity embedding set through knowledge extraction;
step 2.1, training a word embedding set from a large-scale corpus by using a word2vec word embedding method;
step 2.2, obtaining a basic entity embedding set, specifically:
firstly, the text is matched and disambiguated against a knowledge base to obtain the entity set contained in the text; because the original knowledge graph is large in scale, a subgraph is then extracted from it, and nodes that do not exist in the entity set are removed to obtain the basic knowledge graph; finally, the knowledge representation learning method TransD maps the entities and relations in the basic knowledge graph to a low-dimensional vector space to obtain the basic entity embedding set;
step 2.3, obtaining an enhanced entity embedding set, specifically:
firstly, matching and disambiguating a text and a knowledge base to obtain an entity set contained in the text; then marking out corresponding entities in the description text, and adopting a relationship extraction submodule to carry out relationship identification; after entity linkage, one sentence may contain a plurality of entities, all the entities are combined and predicted, and an enhanced knowledge graph is constructed; finally, mapping the entities and the relations in the enhanced knowledge graph to a low-dimensional vector space by adopting a knowledge representation learning method TransD to obtain an enhanced entity embedding set;
thirdly, constructing article characteristics by adopting a knowledge-aware convolutional neural network (KCNN);
step 3.1, constructing a text feature matrix on the basis of the word embedding set obtained in the step two; the method specifically comprises the following steps:
firstly, searching a vector corresponding to each word in an article description text in a word embedding set, and if not, randomly initializing the vector; then all vectors are spliced to obtain a text feature matrix;
3.2, constructing basic entity characteristics on the basis of the basic entity embedded set obtained in the second step; the method specifically comprises the following steps:
firstly, the vector corresponding to each word of the article description text is looked up in the basic entity embedding set, and a zero vector is used instead if it is not found; each vector is then mapped into the same vector space as the text features through the mapping function f_m(X) = ReLU(W_m X + b_m); finally, all vectors are spliced to obtain the basic entity feature matrix;
wherein W_m ∈ R^{d_w×d_e} is a transformation matrix; b_m ∈ R^{d_w×1} is a bias vector; d_w is the dimension of word embedding; d_e is the dimension of the basic entity embedding;
Step 3.3, an enhanced entity feature matrix is constructed on the basis of the enhanced entity embedding set obtained in step two; specifically:
firstly, the vector corresponding to each word of the article description text is looked up in the enhanced entity embedding set, and a zero vector is used instead if it is not found; each vector is then mapped into the same vector space as the text features through the mapping function f_m(X) = ReLU(W_m X + b_m); finally, all vectors are spliced to obtain the enhanced entity feature matrix;
Step 3.4, the text feature matrix, the basic entity feature matrix, and the enhanced entity feature matrix are stacked as the three-channel input of the KCNN model;
Step 3.5, the article feature vector is constructed using a plurality of convolution kernels to obtain the feature vector of the article;
Step four, the user interest is constructed by using an attention mechanism; specifically:
firstly, an attention network is adopted to calculate the influence degrees; the input information of the attention network is the feature vector q_v of the target item and the feature vectors q_i of the interacted items in the user historical behavior; after splicing, the weight values s′_i are output through the fully-connected network; the influence degrees s_i are then obtained through normalization; finally, the interest feature vector u of user u is constructed;
Step five, the interest feature vector u of user u and the feature vector q_v of the target item v are spliced, and the probability that user u clicks the target item v is predicted through the multilayer perceptron as p(x) = MLP([u : q_v]);
wherein MLP(·) represents the multilayer perceptron, using the ReLU nonlinear activation function; x represents the input of the model; and [:] represents the vector splicing operation.
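The click-probability step above can be sketched as follows; the hidden-layer size and the sigmoid output (to produce a probability) are illustrative assumptions, with ReLU hidden activations as stated in the claim:

```python
import numpy as np

def mlp_click_prob(u, q_v, weights):
    """p(x) = MLP([u : q_v]); weights: list of (W, b) layers, sigmoid output."""
    x = np.concatenate([u, q_v])                 # vector splicing [u : q_v]
    for W, b in weights[:-1]:
        x = np.maximum(W @ x + b, 0.0)           # ReLU hidden layers
    W, b = weights[-1]
    z = (W @ x + b)[0]                           # final scalar logit
    return 1.0 / (1.0 + np.exp(-z))              # click probability p(x)

rng = np.random.default_rng(3)
u, q_v = rng.normal(size=4), rng.normal(size=4)  # interest and item vectors
weights = [(rng.normal(size=(6, 8)), np.zeros(6)),   # 8 -> 6 hidden layer
           (rng.normal(size=(1, 6)), np.zeros(1))]   # 6 -> 1 output layer
p = mlp_click_prob(u, q_v, weights)
```

Items can then be ranked for a user by this probability.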
3. The system for recommending fused relation extraction as claimed in claim 1, wherein: the knowledge extraction module has the function of extracting knowledge required by the system;
the word embedding module receives the large-scale linguistic data to obtain a word embedding set; the basic entity module receives text description information of the interactive articles in the historical behaviors to obtain a basic entity embedded set; the method comprises the steps that an enhanced entity module receives text description information of interactive articles in historical behaviors to obtain an enhanced entity embedding set;
in the basic entity module, the basic entity link submodule receives the text description information of the interacted articles in the historical behavior to obtain an entity set; the filtering map submodule receives an external knowledge graph to obtain a knowledge graph with the nodes not in the entity set removed; the knowledge representation learning submodule receives the filtered knowledge graph to obtain the basic entity embedding set; in the enhanced entity module, the enhanced entity link submodule receives the text description information of the interacted articles in the historical behavior to obtain an entity set; the relation extraction submodule receives the text description information and the entity set of the interacted articles in the historical behavior to obtain a relevant knowledge graph; the knowledge representation learning submodule receives the relevant knowledge graph to obtain the enhanced entity embedding set;
in the relation extraction submodule, a sentence feature extractor receives a text to obtain sentence features; the template feature extractor receives the text to obtain template features; the threshold fusion device is used for receiving the sentence characteristics and the template characteristics to obtain the text characteristics.
4. The system for recommending fused relation extraction as claimed in claim 1, wherein: and the article characteristic construction module receives the output of the knowledge extraction module, constructs a text characteristic matrix, a basic entity characteristic matrix and an enhanced entity characteristic matrix, and further obtains a characteristic vector of the target article and a characteristic vector of the interactive article in the user historical behavior.
5. The system for recommending fused relation extraction as claimed in claim 1, wherein: the user interest building module is used for receiving the output of the article feature building module to obtain a user interest vector.
6. The system for recommending fused relation extraction as claimed in claim 1, wherein: the function of the multilayer perceptron module is to receive the output of the user interest building module and the feature vector of the target object in the output of the object feature building module, and obtain the probability of the user clicking the target object.
7. The recommendation method for fusion relationship extraction as claimed in claim 2, wherein:
in the step 1.1, a sentence feature extractor is a relation extraction model which is one of an end-to-end model, a lexical model and a syntactic model;
the end-to-end model does not depend on external knowledge at all, and uses only the sentence and the entity-pair information in the sentence;
the lexical model uses the lexical information contained in the sentence, including named entity recognition, part-of-speech tagging, and WordNet hypernyms;
the syntactic model uses the syntactic information contained in the sentence, including the phrase structure tree, the dependency tree, and the shortest dependency path.
CN202010931994.4A 2020-06-15 2020-09-08 Recommendation system and method for fusion relation extraction Active CN112069408B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2020105511776 2020-06-15
CN202010551177 2020-06-15

Publications (2)

Publication Number Publication Date
CN112069408A true CN112069408A (en) 2020-12-11
CN112069408B CN112069408B (en) 2023-06-09

Family

ID=73664114

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010931994.4A Active CN112069408B (en) 2020-06-15 2020-09-08 Recommendation system and method for fusion relation extraction

Country Status (1)

Country Link
CN (1) CN112069408B (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112836120A (en) * 2021-01-27 2021-05-25 深圳大学 Multi-mode knowledge graph-based movie recommendation method, system and terminal
CN113010778A (en) * 2021-03-10 2021-06-22 北京明略软件系统有限公司 Knowledge graph recommendation method and system based on user historical interest
CN113220994A (en) * 2021-05-08 2021-08-06 中国科学院自动化研究所 User personalized information recommendation method based on target object enhanced representation
CN113254550A (en) * 2021-06-29 2021-08-13 浙江大华技术股份有限公司 Knowledge graph-based recommendation method, electronic device and computer storage medium
CN113298661A (en) * 2021-07-28 2021-08-24 北京芯盾时代科技有限公司 Artificial intelligence based product recommendation method and device and electronic equipment
CN113553510A (en) * 2021-07-30 2021-10-26 华侨大学 Text information recommendation method and device and readable medium
CN114218380A (en) * 2021-12-03 2022-03-22 淮阴工学院 Multi-mode-based cold chain loading user portrait label extraction method and device
CN114265920A (en) * 2021-12-27 2022-04-01 北京易聊科技有限公司 Intelligent robot dialogue method and system based on signals and scenes
CN116049267A (en) * 2022-12-26 2023-05-02 上海朗晖慧科技术有限公司 Multi-dimensional intelligent identification chemical article searching and displaying method
CN116703529A (en) * 2023-08-02 2023-09-05 山东省人工智能研究院 Contrast learning recommendation method based on feature space semantic enhancement

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016101302A1 (en) * 2014-12-25 2016-06-30 广东电子工业研究院有限公司 User log and entity relationship gallery-based personalized recommendation system and recommendation method thereof
CN108304911A (en) * 2018-01-09 2018-07-20 中国科学院自动化研究所 Knowledge Extraction Method and system based on Memory Neural Networks and equipment
CN110837602A (en) * 2019-11-05 2020-02-25 重庆邮电大学 User recommendation method based on representation learning and multi-mode convolutional neural network
CN111061856A (en) * 2019-06-06 2020-04-24 北京理工大学 Knowledge perception-based news recommendation method

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112836120A (en) * 2021-01-27 2021-05-25 深圳大学 Multi-mode knowledge graph-based movie recommendation method, system and terminal
CN112836120B (en) * 2021-01-27 2024-03-22 深圳大学 Movie recommendation method, system and terminal based on multi-mode knowledge graph
CN113010778A (en) * 2021-03-10 2021-06-22 北京明略软件系统有限公司 Knowledge graph recommendation method and system based on user historical interest
CN113220994A (en) * 2021-05-08 2021-08-06 中国科学院自动化研究所 User personalized information recommendation method based on target object enhanced representation
CN113220994B (en) * 2021-05-08 2022-10-28 中国科学院自动化研究所 User personalized information recommendation method based on target object enhanced representation
CN113254550A (en) * 2021-06-29 2021-08-13 浙江大华技术股份有限公司 Knowledge graph-based recommendation method, electronic device and computer storage medium
CN113298661A (en) * 2021-07-28 2021-08-24 北京芯盾时代科技有限公司 Artificial intelligence based product recommendation method and device and electronic equipment
CN113553510B (en) * 2021-07-30 2023-06-20 华侨大学 Text information recommendation method and device and readable medium
CN113553510A (en) * 2021-07-30 2021-10-26 华侨大学 Text information recommendation method and device and readable medium
CN114218380A (en) * 2021-12-03 2022-03-22 淮阴工学院 Multi-mode-based cold chain loading user portrait label extraction method and device
CN114265920A (en) * 2021-12-27 2022-04-01 北京易聊科技有限公司 Intelligent robot dialogue method and system based on signals and scenes
CN116049267A (en) * 2022-12-26 2023-05-02 上海朗晖慧科技术有限公司 Multi-dimensional intelligent identification chemical article searching and displaying method
CN116049267B (en) * 2022-12-26 2023-07-18 上海朗晖慧科技术有限公司 Multi-dimensional intelligent identification chemical article searching and displaying method
CN116703529A (en) * 2023-08-02 2023-09-05 山东省人工智能研究院 Contrast learning recommendation method based on feature space semantic enhancement
CN116703529B (en) * 2023-08-02 2023-10-20 山东省人工智能研究院 Contrast learning recommendation method based on feature space semantic enhancement

Also Published As

Publication number Publication date
CN112069408B (en) 2023-06-09

Similar Documents

Publication Publication Date Title
CN112069408B (en) Recommendation system and method for fusion relation extraction
CN111753060B (en) Information retrieval method, apparatus, device and computer readable storage medium
CN110162749B (en) Information extraction method, information extraction device, computer equipment and computer readable storage medium
CN112199511B (en) Cross-language multi-source vertical domain knowledge graph construction method
CN109284357B (en) Man-machine conversation method, device, electronic equipment and computer readable medium
CN108959270B (en) Entity linking method based on deep learning
CN108628828B (en) Combined extraction method based on self-attention viewpoint and holder thereof
CN112131350B (en) Text label determining method, device, terminal and readable storage medium
CN108932342A (en) A kind of method of semantic matches, the learning method of model and server
CN113239700A (en) Text semantic matching device, system, method and storage medium for improving BERT
Dashtipour et al. Exploiting deep learning for Persian sentiment analysis
CN108875051A (en) Knowledge mapping method for auto constructing and system towards magnanimity non-structured text
CN111159485B (en) Tail entity linking method, device, server and storage medium
CN110619051B (en) Question sentence classification method, device, electronic equipment and storage medium
CN109189943B (en) Method for extracting capability knowledge and constructing capability knowledge map
CN112052684A (en) Named entity identification method, device, equipment and storage medium for power metering
CN114896388A (en) Hierarchical multi-label text classification method based on mixed attention
CN111666766A (en) Data processing method, device and equipment
CN116661805B (en) Code representation generation method and device, storage medium and electronic equipment
CN111488455A (en) Model training method, text classification method, system, device and medium
CN114328800A (en) Text processing method and device, electronic equipment and computer readable storage medium
CN115115432B (en) Product information recommendation method and device based on artificial intelligence
CN116842934A (en) Multi-document fusion deep learning title generation method based on continuous learning
CN117216617A (en) Text classification model training method, device, computer equipment and storage medium
CN116975271A (en) Text relevance determining method, device, computer equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant