CN107943847B - Enterprise relation extraction method, device and storage medium - Google Patents

Enterprise relation extraction method, device and storage medium

Info

Publication number
CN107943847B
Authority
CN
China
Prior art keywords
vector
sentence
trained
sample
word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201711061205.0A
Other languages
Chinese (zh)
Other versions
CN107943847A (en)
Inventor
徐冰
汪伟
罗傲雪
肖京
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Technology Shenzhen Co Ltd
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Technology Shenzhen Co Ltd filed Critical Ping An Technology Shenzhen Co Ltd
Priority to CN201711061205.0A
Priority to PCT/CN2018/076119
Publication of CN107943847A
Application granted
Publication of CN107943847B
Legal status: Active (current)
Anticipated expiration legal status

Links

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 - Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 - Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35 - Clustering; Classification
    • G06F16/353 - Clustering; Classification into predefined classes
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 - Handling natural language data
    • G06F40/20 - Natural language analysis
    • G06F40/279 - Recognition of textual entities
    • G06F40/289 - Phrasal analysis, e.g. finite state techniques or chunking
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06Q - INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 - Administration; Management
    • G06Q10/06 - Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063 - Operations research, analysis or management
    • G06Q10/0635 - Risk analysis of enterprise or organisation activities

Abstract

The invention discloses an enterprise relation extraction method, device and storage medium. The method comprises: extracting from a knowledge base sentences containing enterprise entities between which a relation exists, and using them as training sample sentences to establish a sample database; extracting from the sample database all training sample sentences containing a given enterprise entity pair and segmenting them, mapping each word to a word vector x_i and each sentence to a sentence vector S_i; computing with an LSTM the first hidden-layer state vector h_i and the second hidden-layer state vector h_i' of each word vector x_i, concatenating them into a combined hidden-layer state vector, and obtaining the feature vector T_i of each training sample sentence; substituting the feature vectors T_i into an average-vector expression to compute the average vector S; substituting the average vector S and the relation type of the enterprise entity pair into a softmax classification function to compute the weight a_i of each training sample sentence; and extracting a sentence containing two enterprise entities, obtaining its feature vector T_i through a bi-LSTM, and inputting it into the trained RNN model to predict the relation between the two enterprises. Labour cost is reduced, and the relation between the two enterprise entities is predicted more accurately.

Description

Enterprise relation extraction method, device and storage medium
Technical field
The present invention relates to the technical field of data and information processing, and more particularly to an enterprise relation extraction method, device and computer-readable storage medium.
Background technique
Identifying the associations between different enterprises in news, such as capital transactions, supply chains and cooperation, is of great significance for enterprise risk early warning. However, commonly used entity relation extraction methods require a large amount of training data to be labelled manually, and corpus annotation is generally very time-consuming and labour-intensive.
Summary of the invention
In view of the foregoing, the present invention provides an enterprise relation extraction method, device and computer-readable storage medium. A relation extraction model based on convolutional neural networks can be extended to distantly supervised data, which effectively reduces the model's dependence on manually labelled data; moreover, compared with semi-supervised or unsupervised approaches, this supervised enterprise relation extraction method achieves better precision and recall.
To achieve the above object, the present invention provides an enterprise relation extraction method, the method comprising:
Sample database establishment step: extracting from a knowledge base sentences containing enterprise entities between which a relation exists, and using these sentences as training sample sentences to establish a sample database;
Word segmentation step: extracting from the sample database all training sample sentences containing a given enterprise entity pair, segmenting each training sample sentence with a preset word segmentation tool, mapping each segmented word to a word vector x_i, and mapping each training sample sentence to a sentence vector S_i, as the input of the first layer of a recurrent neural network (RNN) model;
Concatenation step: in the second layer of the RNN model, computing from left to right, with a long short-term memory (LSTM) module, the first hidden-layer state vector h_i of the current word vector x_i, and computing from right to left the second hidden-layer state vector h_i' of the current word vector x_i; obtaining the combined hidden-layer state vector of each word in the training sample sentence by concatenating the two hidden-layer state vectors, and obtaining the feature vector T_i of each training sample sentence from the combined hidden-layer state vectors of all words in the training sample sentence;
Calculation step: in the third layer of the RNN model, expressing the average vector S of the enterprise entity pair with an average-vector expression, according to the feature vector T_i of each training sample sentence;
Weight determination step: in the last layer of the RNN model, substituting the average vector S and the relation type of the enterprise entity pair into a softmax classification function to compute the weight a_i of each training sample sentence, thereby obtaining a trained RNN model;
Prediction step: extracting from current text a sentence containing two enterprise entities, obtaining the feature vector T_i of the sentence through a bidirectional long short-term memory (bi-LSTM) module, and inputting this feature vector T_i into the above trained RNN model to predict the relation between the two enterprise entities.
Preferably, the word segmentation step comprises:
Representing each segmented word in one-hot form to obtain an initial word vector, and assigning each training sample sentence a sentence ID; mapping the sentence ID to the initial sentence vector of the corresponding training sample sentence; inputting the initial sentence vector and the initial word vectors of the left and right adjacent words of a given word in the training sample sentence into the continuous bag-of-words (CBOW) model, and predicting the word vector x_i of that word; updating the sentence vector of the training sample sentence with every prediction, until the word vector x_i of every word in the training sample sentence has been predicted; and taking the last updated sentence vector as the sentence vector S_i of the training sample sentence.
Preferably, the concatenation step comprises:
Computing, from left to right, the first hidden-layer state vector h_i of the current word vector x_i from the hidden-layer state vector h_{i-1} of the previous word vector x_{i-1}, and computing, from right to left, the second hidden-layer state vector h_i' of the current word vector x_i from the hidden-layer state vector h_{i+1} of the following word vector x_{i+1}.
Preferably, the average-vector expression is:
S = sum(a_i * T_i) / n
where a_i represents the weight of a training sample sentence and is the value to be determined, T_i represents the feature vector of each training sample sentence, and n represents the number of training sample sentences.
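As an illustration of this expression, the following is a minimal NumPy sketch of computing S for one enterprise entity pair; the array shapes and values are assumptions for illustration, not taken from the patent.

```python
import numpy as np

# Hypothetical shapes: 5 training sample sentences of one entity pair, 128-dim features.
T = np.random.rand(5, 128)   # feature vectors T_i of the training sample sentences
a = np.ones(5)               # weights a_i (the values to be learned during training)
n = len(T)

S = (a[:, None] * T).sum(axis=0) / n   # S = sum(a_i * T_i) / n
print(S.shape)                          # (128,)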
Preferably, the softmax classification function has the expression:
σ(z)_j = e^(z_j) / Σ_{k=1}^{K} e^(z_k), for j = 1, ..., K
where K represents the number of enterprise relation types, S represents the average vector of the enterprise entity pair, z_j represents the enterprise relation type of the enterprise entity pair, and σ(z)_j represents the probability of the relation type to be predicted among all enterprise relation types.
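For concreteness, a small NumPy sketch of a standard softmax layer over K relation types is shown below; the weight matrix W and bias b are assumed classifier parameters that the text does not name explicitly.

```python
import numpy as np

def softmax(z):
    z = z - z.max()          # subtract the maximum for numerical stability
    e = np.exp(z)
    return e / e.sum()

K = 4                         # number of enterprise relation types (illustrative)
W = np.random.rand(K, 128)    # hypothetical classifier weights mapping S to K relation scores
b = np.zeros(K)
S = np.random.rand(128)       # average vector of the enterprise entity pair

probs = softmax(W @ S + b)    # sigma(z)_j: probability of each relation type
print(probs, probs.argmax())
```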
In addition, the present invention also provides an electronic device, the device comprising a memory, a processor, and an enterprise relation extraction program stored in the memory and runnable on the processor; when the enterprise relation extraction program is executed by the processor, the following steps can be realised:
Sample database establishment step: extracting from a knowledge base sentences containing enterprise entities between which a relation exists, and using these sentences as training sample sentences to establish a sample database;
Word segmentation step: extracting from the sample database all training sample sentences containing a given enterprise entity pair, segmenting each training sample sentence with a preset word segmentation tool, mapping each segmented word to a word vector x_i, and mapping each training sample sentence to a sentence vector S_i, as the input of the first layer of a recurrent neural network (RNN) model;
Concatenation step: in the second layer of the RNN model, computing from left to right, with a long short-term memory (LSTM) module, the first hidden-layer state vector h_i of the current word vector x_i, and computing from right to left the second hidden-layer state vector h_i' of the current word vector x_i; obtaining the combined hidden-layer state vector of each word in the training sample sentence by concatenating the two hidden-layer state vectors, and obtaining the feature vector T_i of each training sample sentence from the combined hidden-layer state vectors of all words in the training sample sentence;
Calculation step: in the third layer of the RNN model, expressing the average vector S of the enterprise entity pair with an average-vector expression, according to the feature vector T_i of each training sample sentence;
Weight determination step: in the last layer of the RNN model, substituting the average vector S and the relation type of the enterprise entity pair into a softmax classification function to compute the weight a_i of each training sample sentence, thereby obtaining a trained RNN model;
Prediction step: extracting from current text a sentence containing two enterprise entities, obtaining the feature vector T_i of the sentence through a bidirectional long short-term memory (bi-LSTM) module, and inputting this feature vector T_i into the above trained RNN model to predict the relation between the two enterprise entities.
Preferably, the concatenation step comprises:
Computing, with the LSTM module, from left to right, the first hidden-layer state vector h_i of the current word vector x_i from the hidden-layer state vector h_{i-1} of the previous word vector x_{i-1}, and computing, from right to left, the second hidden-layer state vector h_i' of the current word vector x_i from the hidden-layer state vector h_{i+1} of the following word vector x_{i+1}.
Preferably, the average-vector expression is:
S = sum(a_i * T_i) / n
where a_i represents the weight of a training sample sentence and is the value to be determined, T_i represents the feature vector of each training sample sentence, and n represents the number of training sample sentences.
Preferably, the softmax classification function has the expression:
σ(z)_j = e^(z_j) / Σ_{k=1}^{K} e^(z_k), for j = 1, ..., K
where K represents the number of enterprise relation types, S represents the average vector of the enterprise entity pair, z_j represents the enterprise relation type of the enterprise entity pair, and σ(z)_j represents the probability of the relation type to be predicted among all enterprise relation types.
In addition, to achieve the above object, the present invention also provides a computer-readable storage medium. The computer-readable storage medium contains an enterprise relation extraction program, and when the enterprise relation extraction program is executed by a processor, any of the steps of the enterprise relation extraction method described above can be realised.
According to the enterprise relation extraction method, electronic device and computer-readable storage medium proposed by the present invention, sentences in the knowledge base that contain an enterprise entity pair with a relation are extracted from unstructured text and used as training sample sentences to establish a sample database. All training sample sentences containing a given enterprise entity pair are then extracted from the sample database and segmented, the sentence vector S_i of each training sample sentence is obtained, and the feature vector T_i of each training sample sentence is computed with the LSTM module. The average vector S of the enterprise entity pair is then obtained from the feature vectors T_i of the training sample sentences, the average vector S is substituted into the softmax classification function, and the weight a_i of each training sample sentence is determined according to the relation type of the enterprise entity pair, yielding a trained recurrent neural network model. Finally, a sentence containing two enterprise entities is extracted from current text, the feature vector T of the sentence is obtained through the bi-LSTM module, and this feature vector T is input into the trained recurrent neural network model to predict the relation between the two enterprise entities. This improves the ability to recognise relations between different enterprises in news and reduces the dependence on manual annotation of training data.
Detailed description of the invention
Fig. 1 is a schematic diagram of a preferred embodiment of the electronic device of the present invention;
Fig. 2 is a module diagram of a preferred embodiment of the enterprise relation extraction program in Fig. 1;
Fig. 3 is a flowchart of a preferred embodiment of the enterprise relation extraction method of the present invention;
Fig. 4 is a frame diagram of the prediction module of the present invention.
The realisation of the objectives, functional features and advantages of the present invention will be further described with reference to the accompanying drawings in conjunction with the embodiments.
Specific embodiment
It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention and are not intended to limit the present invention.
As shown in Fig. 1, it is a schematic diagram of a preferred embodiment of the electronic device 1 of the present invention.
In the present embodiment, the electronic device 1 may be a server, a smartphone, a tablet computer, a personal computer, a portable computer, or another electronic device with computing capability.
The electronic device 1 comprises a memory 11, a processor 12, a knowledge base 13, a network interface 14 and a communication bus 15. The knowledge base 13 is stored in the memory 11, and sentences containing enterprise entity pairs are extracted from the knowledge base 13 as training sample sentences to establish the sample database.
The network interface 14 may optionally comprise a standard wired interface and a wireless interface (such as a Wi-Fi interface). The communication bus 15 is used to realise connection and communication between these components.
The memory 11 comprises at least one type of readable storage medium. The at least one type of readable storage medium may be a non-volatile storage medium such as a flash memory, a hard disk, a multimedia card or a card-type memory. In some embodiments, the memory 11 may be an internal storage unit of the electronic device 1, for example a hard disk of the electronic device 1. In other embodiments, the memory 11 may also be an external storage unit of the electronic device 1, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) card or a flash card equipped on the electronic device 1.
In the present embodiment, the memory 11 may be used not only to store application software installed on the electronic device 1 and various kinds of data, such as the enterprise relation extraction program 10, the knowledge base 13 and the sample database, but also to temporarily store data that have been output or will be output.
In some embodiments, the processor 12 may be a central processing unit (CPU), a microprocessor or another data processing chip, and is used to run program code or process data stored in the memory 11, for example to execute the computer program code of the enterprise relation extraction program 10 and the training of each kind of model.
Preferably, the electronic device 1 may further comprise a display, which may be referred to as a display screen or display unit. In some embodiments, the display may be an LED display, a liquid crystal display, a touch-control liquid crystal display, an organic light-emitting diode (OLED) touch device, or the like. The display is used to show the information processed in the electronic device 1 and a visual working interface, for example to display the model training results and the optimal values of the weights a_i.
Preferably, the electronic device 1 may further comprise a user interface. The user interface may comprise an input unit such as a keyboard and a voice output device such as a loudspeaker or earphones; optionally, the user interface may also comprise a standard wired interface and a wireless interface.
In the apparatus embodiment shown in Fig. 1, the memory 11, which is a kind of computer storage medium, stores the program code of the enterprise relation extraction program 10; when the processor 12 executes the program code of the enterprise relation extraction program 10, the following steps are realised:
Sample database establishment step: extracting from a knowledge base sentences containing enterprise entities between which a relation exists, and using these sentences as training sample sentences to establish a sample database;
Word segmentation step: extracting from the sample database all training sample sentences containing a given enterprise entity pair, segmenting each training sample sentence with a preset word segmentation tool, mapping each segmented word to a word vector x_i, and mapping each training sample sentence to a sentence vector S_i, as the input of the first layer of a recurrent neural network (RNN) model;
Concatenation step: in the second layer of the RNN model, computing from left to right, with a long short-term memory (LSTM) module, the first hidden-layer state vector h_i of the current word vector x_i, and computing from right to left the second hidden-layer state vector h_i' of the current word vector x_i; obtaining the combined hidden-layer state vector of each word in the training sample sentence by concatenating the two hidden-layer state vectors, and obtaining the feature vector T_i of each training sample sentence from the combined hidden-layer state vectors of all words in the training sample sentence;
Calculation step: in the third layer of the RNN model, expressing the average vector S of the enterprise entity pair with an average-vector expression, according to the feature vector T_i of each training sample sentence;
Weight determination step: in the last layer of the RNN model, substituting the average vector S and the relation type of the enterprise entity pair into a softmax classification function to compute the weight a_i of each training sample sentence, thereby obtaining a trained RNN model;
Prediction step: extracting from current text a sentence containing two enterprise entities, obtaining the feature vector T_i of the sentence through a bidirectional long short-term memory (bi-LSTM) module, and inputting this feature vector T_i into the above trained RNN model to predict the relation between the two enterprise entities.
In the present embodiment, it is assumed that if two enterprise entities have a certain relation in the knowledge base, then an unstructured sentence containing these two enterprise entities can represent this relation. Therefore, when we need to identify the association between two particular enterprise entities in news, all unstructured sentences containing the two enterprise entities are extracted from the knowledge base and used as training sample sentences to establish the sample database. The knowledge base is built by collecting unstructured sentences containing any two enterprise entities from historical news data. For example, if the association between two particular enterprise entities in news needs to be identified, all unstructured sentences containing the two enterprise entities are extracted from the knowledge base, and these sentences are used as training sample sentences to establish a sample database. The relations of an enterprise entity pair include capital transactions, supply chain, cooperation and the like. For example, the sentence "Foxconn is a supplier of Mobike" contains the enterprise entity pair "Foxconn" and "Mobike", and the relation "supplier" between the enterprise entities belongs to the supply-chain relation type.
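A minimal sketch of this distant-supervision style construction of the sample database is given below; the function and field names are assumptions for illustration, not identifiers from the patent.

```python
def build_sample_database(knowledge_base, entity_pairs):
    """knowledge_base: list of raw sentences; entity_pairs: {(e1, e2): relation_type}."""
    sample_db = []
    for (e1, e2), relation in entity_pairs.items():
        for sentence in knowledge_base:
            # A sentence mentioning both entities is assumed to express their relation.
            if e1 in sentence and e2 in sentence:
                sample_db.append({"sentence": sentence, "pair": (e1, e2), "relation": relation})
    return sample_db

kb = ["Foxconn is a supplier of Mobike", "Foxconn opened a new factory"]
pairs = {("Foxconn", "Mobike"): "supply chain"}
print(build_sample_database(kb, pairs))   # only the first sentence becomes a training sample sentence
```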
All training sample sentences containing a given enterprise entity pair are extracted from the sample database; each training sample sentence includes the names of the two enterprise entities and the relation type of the enterprise entity pair, and each training sample sentence is segmented with a word segmentation tool. Word segmentation tools such as the Stanford Chinese word segmenter or jieba may be used to segment each training sample sentence. Each segmented word is represented in one-hot form to obtain an initial word vector. In the one-hot representation, each word is represented as a very long vector whose dimensionality equals the size of the vocabulary; only one dimension has the value 1 and all the others are 0, and that dimension represents the current word. For example, all training sample sentences containing Foxconn and Mobike are extracted from the sample database, and each training sample sentence includes the names of the two enterprise entities Foxconn and Mobike and the relation type of the enterprise entity pair (supplier). Segmenting "Foxconn is a supplier of Mobike" gives the result "Foxconn | is | Mobike | 's | supplier". Suppose the initial word vector of "Foxconn" is [0100000000] and the initial word vector of "is" is [0010000000]. Each training sample sentence is then assigned a sentence ID, and the sentence ID is mapped to the initial sentence vector of the corresponding training sample sentence.
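A sketch of the segmentation and one-hot mapping step with the jieba tool named above; the tiny single-sentence vocabulary is only for illustration.

```python
import jieba   # the jieba segmenter mentioned above; assumed to be installed

sentence = "富士康是摩拜单车的供应商"        # "Foxconn is a supplier of Mobike"
words = jieba.lcut(sentence)                 # e.g. ['富士康', '是', '摩拜单车', '的', '供应商']

vocab = {w: i for i, w in enumerate(dict.fromkeys(words))}   # toy vocabulary from one sentence

def one_hot(word):
    v = [0] * len(vocab)
    v[vocab[word]] = 1
    return v

initial_word_vectors = [one_hot(w) for w in words]
print(words)
print(initial_word_vectors[0])               # initial word vector of '富士康' (Foxconn)
```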
The initial sentence vector and the initial word vectors of the left and right adjacent words of a given word in the training sample sentence are input into the continuous bag-of-words (CBOW) model, and the word vector x_i of that word is predicted. The initial sentence vector is then replaced by the first updated sentence vector; the first updated sentence vector and the initial word vectors of the left and right adjacent words of the next word in the training sample sentence are input into the CBOW model, the word vector x_{i+1} of that word is predicted, and the first updated sentence vector is replaced by the second updated sentence vector. Training is repeated in this way, and the sentence vector of the training sample sentence is updated at every step, until the word vector x_i, i = (0, 1, 2, 3, ..., m), of every word in the training sample sentence has been predicted; the sentence vector obtained from the last update is taken as the sentence vector S_i, i = (0, 1, 2, 3, ..., n), of the training sample sentence, which serves as the input of the first layer of the recurrent neural network (RNN) model. For example, the initial word vectors of "Foxconn" (the available word adjacent on the left of "is") and "Mobike" (the available word adjacent on the right), together with the initial sentence vector, are input into the CBOW model; the word vector x_2 of "is" is predicted, and the initial sentence vector is updated once to obtain the first updated sentence vector. Then the initial (or current) word vector of "is" (the available word adjacent on the left of "Mobike"), the initial word vector of "'s" (the available word adjacent on the right) and the first updated sentence vector are input into the CBOW model; the word vector x_3 of "Mobike" is predicted, and the first updated sentence vector is updated to obtain the second updated sentence vector. Training is repeated in this way until the word vectors x_i of all the above available words have been predicted and the sentence vector S_i of the training sample sentence has been obtained through the updates. Throughout this process, the sentence ID of each news sentence remains unchanged.
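This joint learning of a per-sentence vector alongside CBOW-style word vectors resembles the paragraph-vector approach; the sketch below uses gensim's Doc2Vec as an off-the-shelf stand-in, not the patent's own implementation, and the documents and hyperparameters are assumptions.

```python
from gensim.models.doc2vec import Doc2Vec, TaggedDocument

# Each training sample sentence is tagged with its sentence ID, whose vector is
# trained together with the word vectors from the surrounding context words.
docs = [
    TaggedDocument(words=["富士康", "是", "摩拜单车", "的", "供应商"], tags=["sent_0"]),
    TaggedDocument(words=["富士康", "为", "摩拜单车", "供货"], tags=["sent_1"]),
]
model = Doc2Vec(documents=docs, vector_size=50, window=1, min_count=1, dm=1, epochs=20)

word_vec = model.wv["是"]        # a trained word vector x_i
sent_vec = model.dv["sent_0"]    # a trained sentence vector S_i (model.docvecs in older gensim)
print(word_vec.shape, sent_vec.shape)
```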
In the second layer of the RNN model, a long short-term memory (LSTM) module computes, from left to right, the first hidden-layer state vector h_i of the current word vector x_i from the hidden-layer state vector h_{i-1} of the previous word vector x_{i-1}, and computes, from right to left, the second hidden-layer state vector h_i' of the current word vector x_i from the hidden-layer state vector h_{i+1} of the following word vector x_{i+1}. The two hidden-layer state vectors are concatenated with the Concatenate function to obtain the combined hidden-layer state vector of each word in the training sample sentence, and the feature vector T_i, i = (0, 1, 2, 3, ..., n), of each training sample sentence is obtained from the combined hidden-layer state vectors of all words in the training sample sentence. For example, for the sentence "Foxconn is a supplier of Mobike", the LSTM computes, from left to right, the first hidden-layer state vector h_2 of the word vector x_2 of "is" from the hidden-layer state vector h_1 of the word vector x_1 of "Foxconn", and computes, from right to left, the second hidden-layer state vector h_2' of the word vector x_2 of "is" from the hidden-layer state vector h_3 of the word vector x_3 of "Mobike". The two hidden-layer state vectors (h_2 and h_2') are concatenated with the Concatenate function to obtain the combined hidden-layer state vector of each word in the training sample sentence, and the feature vector T_i of each training sample sentence is obtained from the combined hidden-layer state vectors of all words in the training sample sentence.
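A sketch of this second layer using PyTorch's bidirectional LSTM; the pooling used to collapse the per-word combined states into the sentence feature vector T_i is an assumption, since the text does not specify it.

```python
import torch
import torch.nn as nn

embed_dim, hidden_dim, seq_len = 50, 64, 5
bilstm = nn.LSTM(input_size=embed_dim, hidden_size=hidden_dim,
                 bidirectional=True, batch_first=True)

word_vectors = torch.randn(1, seq_len, embed_dim)   # x_1 ... x_5 of one training sample sentence
combined, _ = bilstm(word_vectors)                  # (1, 5, 2*hidden_dim): [h_i ; h_i'] per word
T_i, _ = combined.max(dim=1)                        # assumed max pooling -> feature vector T_i
print(T_i.shape)                                    # torch.Size([1, 128])
```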
In the third layer of the RNN model, the average vector S of the enterprise entity pair is expressed, according to the feature vector T_i of each training sample sentence, with the average-vector formula S = sum(a_i * T_i) / n, where a_i represents the weight of a training sample sentence and is the value to be determined, T_i represents the feature vector of each training sample sentence of the enterprise entity pair, and n represents the number of training sample sentences.
In the last layer of the RNN model, the average vector S is substituted into the softmax classification function:
σ(z)_j = e^(z_j) / Σ_{k=1}^{K} e^(z_k), for j = 1, ..., K
where K represents the number of enterprise relation types, S represents the average vector of the enterprise entity pair, z_j represents the enterprise relation type of the enterprise entity pair, and σ(z)_j represents the probability of the relation type to be predicted among all enterprise relation types. The weight a_i of each training sample sentence is determined according to the relation type of the enterprise entity pair in the training sample sentences. Through continual learning, the weights a_i of the training sample sentences are optimised, so that effective sentences receive higher weights and noisy sentences receive smaller weights.
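One way to realise these last two layers is a small PyTorch module in which the sentence weights a_i are produced by a learned scoring layer and optimised jointly with the softmax classifier against the pair's known relation type. The scoring-layer parameterisation below is an assumption; the text only states that the weights are optimised through learning.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class PairRelationClassifier(nn.Module):
    def __init__(self, feat_dim=128, num_relations=4):
        super().__init__()
        self.score = nn.Linear(feat_dim, 1)              # unnormalised score per sample sentence
        self.classify = nn.Linear(feat_dim, num_relations)

    def forward(self, T):                                # T: (n_sentences, feat_dim)
        a = torch.softmax(self.score(T).squeeze(-1), 0)  # sentence weights a_i (sum to 1)
        S = (a.unsqueeze(-1) * T).sum(0)                 # weighted average vector S
        return self.classify(S), a                       # relation scores and learned weights

model = PairRelationClassifier()
T = torch.randn(50, 128)                                 # features of 50 sample sentences of one pair
logits, weights = model(T)
loss = F.cross_entropy(logits.unsqueeze(0), torch.tensor([2]))   # known relation type of the pair
loss.backward()                                          # training pushes noisy sentences to low weight
print(weights.shape, logits.shape)
```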
In the present embodiment, once the RNN model has been determined, relation prediction can be carried out on any unstructured sentence containing an enterprise entity pair, and the model's prediction is not tied to specific enterprise names.
Sentences containing the two enterprise entities whose relation is to be predicted are extracted from current text, and these sentences are segmented to obtain sentence vectors. For example, S_1, S_2, S_3, S_4 denote the vector set of the sentences corresponding to the two enterprise entities. The feature vectors T_1, T_2, T_3, T_4 of the sentences are extracted with a bidirectional long short-term memory (bi-LSTM) module, and the feature vector of each sentence is input into the trained RNN model to obtain the relation prediction result between the two enterprise entities.
In the enterprise relation extraction method proposed by the above embodiment, sentences in the knowledge base that contain an enterprise entity pair with a relation are extracted from unstructured text as training sample sentences to establish a sample database. All training sample sentences containing a given enterprise entity pair are extracted from the sample database and segmented, the sentence vector S_i of each training sample sentence is obtained, and the feature vector T_i of each training sample sentence is computed with the LSTM module. The average vector S of the training sample sentences is computed with the average-vector formula, the average vector S is substituted into the softmax classification function, and the weight a_i of each training sample sentence is determined according to the relation type of the enterprise entity pair. Finally, a sentence containing two enterprise entities is extracted from current text, the feature vector T_i of the sentence is obtained through the bi-LSTM module, and this feature vector T_i is input into the trained RNN model to predict the relation between the two enterprise entities. This not only removes the cumbersome step of manually annotating training data, but also achieves better precision and recall than other supervision paradigms.
As shown in Fig. 2, it is a module diagram of a preferred embodiment of the enterprise relation extraction program 10 in Fig. 1. A module in the present invention refers to a series of computer program instruction segments that complete a specific function.
In the present embodiment, the enterprise relation extraction program 10 comprises an establishment module 110, a word segmentation module 120, a concatenation module 130, a calculation module 140, a weight determination module 150 and a prediction module 160. The functions or operation steps realised by the modules 110-160 are similar to those described above and are not described in detail here; by way of example:
The establishment module 110 is used to extract from the knowledge base sentences containing enterprise entities between which a relation exists, and to use these sentences as training sample sentences to establish a sample database;
The word segmentation module 120 is used to extract from the sample database all training sample sentences containing a given enterprise entity pair, to segment each training sample sentence with a preset word segmentation tool, to map each segmented word to a word vector x_i, and to map each training sample sentence to a sentence vector S_i, as the input of the first layer of the RNN model;
The concatenation module 130 is used, in the second layer of the RNN model, to compute from left to right, with the LSTM module, the first hidden-layer state vector h_i of the current word vector x_i, and to compute from right to left the second hidden-layer state vector h_i' of the current word vector x_i; the combined hidden-layer state vector of each word in the training sample sentence is obtained by concatenating the two hidden-layer state vectors, and the feature vector T_i of each training sample sentence is obtained from the combined hidden-layer state vectors of all words in the training sample sentence;
The calculation module 140 is used, in the third layer of the RNN model, to express the average vector S of the enterprise entity pair with the average-vector expression, according to the feature vector T_i of each training sample sentence;
The weight determination module 150 is used, in the last layer of the RNN model, to substitute the average vector S and the relation type of the enterprise entity pair into the softmax classification function to compute the weight a_i of each training sample sentence of the enterprise entity pair, thereby obtaining a trained RNN model;
The prediction module 160 is used to extract from current text a sentence containing two enterprise entities, to obtain the feature vector T_i of the sentence through the bi-LSTM module, and to input this feature vector T_i into the above trained RNN model to predict the relation between the two enterprise entities.
As shown in Fig. 3, it is a flowchart of a preferred embodiment of the enterprise relation extraction method of the present invention.
In the present embodiment, when the processor 12 executes the computer program of the enterprise relation extraction program 10 stored in the memory 11, the following steps of the enterprise relation extraction method are realised:
Step S10: extracting from the knowledge base sentences containing enterprise entities between which a relation exists, as training sample sentences, to establish a sample database;
Step S20: extracting from the sample database all training sample sentences containing a given enterprise entity pair, segmenting each training sample sentence with a preset word segmentation tool, mapping each segmented word to a word vector x_i, and mapping each training sample sentence to a sentence vector S_i, as the input of the first layer of the RNN model;
Step S30: in the second layer of the RNN model, computing from left to right, with the LSTM module, the first hidden-layer state vector h_i of the current word vector x_i, and computing from right to left the second hidden-layer state vector h_i' of the current word vector x_i; obtaining the combined hidden-layer state vector of each word in the training sample sentence by concatenating the two hidden-layer state vectors, and obtaining the feature vector T_i of each training sample sentence from the combined hidden-layer state vectors of all words in the training sample sentence;
Step S40: in the third layer of the RNN model, expressing the average vector S of the enterprise entity pair with the average-vector expression, according to the feature vector T_i of each training sample sentence;
Step S50: in the last layer of the RNN model, substituting the average vector S and the relation type of the enterprise entity pair into the softmax classification function to compute the weight a_i of each training sample sentence of the enterprise entity pair, thereby obtaining a trained RNN model;
Step S60: extracting from current text a sentence containing two enterprise entities, obtaining the feature vector T_i of the sentence through the bi-LSTM module, and inputting this feature vector T_i into the above trained RNN model to predict the relation between the two enterprise entities.
In the present embodiment, it is assumed that if two enterprise entities have a certain relation in the knowledge base, then an unstructured sentence containing these two enterprise entities can represent this relation. When we need to identify the association between two particular enterprise entities in news, all unstructured sentences containing the two enterprise entities are extracted from the knowledge base and used as training sample sentences to establish the sample database. The knowledge base is built by collecting unstructured sentences containing any two enterprise entities from historical news data. For example, if the association between two particular enterprise entities in news needs to be identified, all unstructured sentences containing the two enterprise entities are extracted from the knowledge base, and these sentences are used as training sample sentences to establish a sample database. The relations of an enterprise entity pair include capital transactions, supply chain, cooperation and the like. For example, sentences containing the enterprise entity pair "Foxconn" and "Mobike" are extracted from unstructured text as training sample sentences; the sentence "Foxconn is a supplier of Mobike" contains the enterprise entity pair "Foxconn" and "Mobike", and the relation "supplier" between the enterprise entities belongs to the supply-chain relation type.
All training sample sentences containing a given enterprise entity pair are extracted from the sample database; each training sample sentence includes the names of the two enterprise entities and the relation type of the enterprise entity pair, and each training sample sentence is segmented with a word segmentation tool. For example, all training sample sentences containing Foxconn and Mobike are extracted from the sample database, and each training sample sentence includes the names of the two enterprise entities Foxconn and Mobike and the relation type of the enterprise entity pair (supplier). Word segmentation tools such as the Stanford Chinese word segmenter or jieba are used to segment each training sample sentence. For example, segmenting "Foxconn is a supplier of Mobike" gives the result "Foxconn | is | Mobike | 's | supplier". Each segmented word is represented in one-hot form to obtain an initial word vector. In the one-hot representation, each word is represented as a very long vector whose dimensionality equals the size of the vocabulary; only one dimension has the value 1 and all the others are 0, and that dimension represents the current word. For example, the initial word vector of "Foxconn" is [0100000000] and the initial word vector of "is" is [0010000000]. Each training sample sentence is then assigned a sentence ID, and the sentence ID is mapped to the initial sentence vector of the corresponding training sample sentence.
The initial sentence vector and the initial word vectors of the left and right adjacent words of a given word in the training sample sentence are input into the continuous bag-of-words (CBOW) model, and the word vector x_i of that word is predicted. The initial sentence vector is then replaced by the first updated sentence vector; the first updated sentence vector and the initial word vectors of the left and right adjacent words of the next word in the training sample sentence are input into the CBOW model, the word vector x_{i+1} of that word is predicted, and the first updated sentence vector is replaced by the second updated sentence vector. Training is repeated in this way, and the sentence vector of the training sample sentence is updated at every step, until the word vector x_i, i = (0, 1, 2, 3, ..., m), of every word in the training sample sentence has been predicted; the sentence vector obtained from the last update is taken as the sentence vector S_i, i = (0, 1, 2, 3, ..., n), of the training sample sentence. For example, for the sentence "Foxconn is a supplier of Mobike", the initial word vectors of "Foxconn" (the available word adjacent on the left of "is") and "Mobike" (the available word adjacent on the right), together with the initial sentence vector, are input into the CBOW model; the word vector x_2 of "is" is predicted, and the initial sentence vector is updated once to obtain the first updated sentence vector. Then the initial (or current) word vector of "is" (the available word adjacent on the left of "Mobike"), the initial word vector of "'s" (the available word adjacent on the right) and the first updated sentence vector are input into the CBOW model; the word vector x_3 of "Mobike" is predicted, and the first updated sentence vector is updated to obtain the second updated sentence vector. Training is repeated in this way until the word vectors x_i of all the above available words have been predicted and the sentence vector S_i of the training sample sentence has been obtained through the updates. Throughout this process, the sentence ID of each news sentence remains unchanged.
In the second layer of the RNN model, the LSTM module then computes, from left to right, the first hidden-layer state vector h_i of the current word vector x_i from the hidden-layer state vector h_{i-1} of the previous word vector x_{i-1}, and computes, from right to left, the second hidden-layer state vector h_i' of the current word vector x_i from the hidden-layer state vector h_{i+1} of the following word vector x_{i+1}. The two hidden-layer state vectors are concatenated with the Concatenate function to obtain the combined hidden-layer state vector of each word in the training sample sentence, and the feature vector T_i, i = (0, 1, 2, 3, ..., n), of each training sample sentence is obtained from the combined hidden-layer state vectors of all words in the training sample sentence. For example, for the sentence "Foxconn is a supplier of Mobike", the LSTM computes, from left to right, the first hidden-layer state vector h_2 of the word vector x_2 of "is" from the hidden-layer state vector h_1 of the word vector x_1 of "Foxconn", and computes, from right to left, the second hidden-layer state vector h_2' of the word vector x_2 of "is" from the hidden-layer state vector h_3 of the word vector x_3 of "Mobike". The two hidden-layer state vectors (h_2 and h_2') are concatenated with the Concatenate function to obtain the combined hidden-layer state vector of each word in the training sample sentence, and the feature vector T_i of each training sample sentence is obtained from the combined hidden-layer state vectors of all words in the training sample sentence.
In the third layer of the RNN model, the average vector S of the enterprise entity pair is expressed, according to the feature vector T_i of each training sample sentence, with the average-vector formula S = sum(a_i * T_i) / n, where a_i represents the weight of a training sample sentence, T_i represents the feature vector of each training sample sentence of the enterprise entity pair, and n represents the number of training sample sentences. Suppose 50,000 training sample sentences of the entity pair "Foxconn" and "Mobike" are extracted from the knowledge base; then the feature vector T_i, i = (0, 1, 2, 3, ..., n), of every training sample sentence is substituted into the average-vector formula S = sum(a_i * T_i) / n to compute the average vector S of the entity pair "Foxconn" and "Mobike", where n equals 50,000.
In the last layer of the RNN model, the average vector S is then substituted into the softmax classification function:
σ(z)_j = e^(z_j) / Σ_{k=1}^{K} e^(z_k), for j = 1, ..., K
where K represents the number of enterprise relation types, S represents the average vector of the enterprise entity pair, z_j represents the enterprise relation type of the enterprise entity pair, and σ(z)_j represents the probability of the relation type to be predicted among all enterprise relation types. The weight a_i of each training sample sentence is determined according to the relation type of the enterprise entity pair in the training sample sentences. Through continual iterative learning, the weights a_i of the training sample sentences are optimised, so that effective sentences receive higher weights and noisy sentences receive smaller weights, thereby obtaining a reliable RNN model.
In the present embodiment, once the RNN model has been determined, relation prediction can be carried out on any unstructured sentence containing an enterprise entity pair, and the model's prediction is not tied to specific enterprise names.
Finally, as shown in Fig. 4, which is a frame diagram of the prediction module of the present invention, sentences containing the two enterprise entities whose relation is to be predicted are extracted from current text; for example, sentences containing "Ping An Group" and "Bank of China" are extracted from news, and these sentences are segmented to obtain sentence vectors. For example, S_1, S_2, S_3, S_4 denote the vector set of the sentences corresponding to the two enterprise entities. The feature vectors T_1, T_2, T_3, T_4 of the sentences are extracted with the bi-LSTM module; then the weight of each T_i within the whole sentence set is assigned by computing the similarity between T_i and the relation-type vector r; finally, the weighted sentence vectors are summed, and the softmax classifier predicts the relation between "Ping An Group" and "Bank of China".
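A sketch of the weighting described for Fig. 4, assuming the similarity between each feature vector T_i and the relation-type vector r is a dot product (the exact similarity function is not given in the text); the classifier matrix W is a hypothetical stand-in for the trained softmax layer.

```python
import torch

T = torch.randn(4, 128)           # T_1..T_4: bi-LSTM features of the extracted sentences
r = torch.randn(128)              # embedding of a candidate relation type
W = torch.randn(3, 128)           # hypothetical softmax classifier over 3 relation types

a = torch.softmax(T @ r, dim=0)   # weight of each T_i from its similarity to r
S = (a.unsqueeze(-1) * T).sum(0)  # weighted combination over the sentence set
probs = torch.softmax(W @ S, dim=0)
print(probs)                      # probability of each relation type for the enterprise pair
```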
In the enterprise relation extraction method proposed by the above embodiment, sentences in the knowledge base that contain an enterprise entity pair with a relation are extracted from unstructured text as training sample sentences, and a sample database is established. All training sample sentences containing a given enterprise entity pair are extracted from the sample database and segmented, the sentence vector S_i of each training sample sentence is obtained, and the feature vector T_i of each training sample sentence is computed with the LSTM module. The average vector S of the enterprise entity pair is then expressed with the average-vector formula, the average vector S is substituted into the softmax classification function, and the weight a_i of each training sample sentence is determined according to the relation type of the enterprise entity pair, yielding a trained RNN model. Finally, a sentence containing two enterprise entities is extracted from current text, the feature vector T_i of the sentence is obtained through the bi-LSTM module, and this feature vector T_i is input into the trained RNN model to predict the relation between the two enterprise entities. This improves the ability to recognise relations between different enterprises in news and the early warning of enterprise risk, and removes the cumbersome step of manually annotating training data.
In addition, an embodiment of the present invention also proposes a computer-readable storage medium. The computer-readable storage medium contains an enterprise relation extraction program 10, and when the enterprise relation extraction program 10 is executed by a processor, the following operations are realised:
Sample database establishment step: extracting from a knowledge base sentences containing enterprise entities between which a relation exists, and using these sentences as training sample sentences to establish a sample database;
Word segmentation step: extracting from the sample database all training sample sentences containing a given enterprise entity pair, segmenting each training sample sentence with a preset word segmentation tool, mapping each segmented word to a word vector x_i, and mapping each training sample sentence to a sentence vector S_i, as the input of the first layer of the RNN model;
Concatenation step: in the second layer of the RNN model, computing from left to right, with the LSTM module, the first hidden-layer state vector h_i of the current word vector x_i, and computing from right to left the second hidden-layer state vector h_i' of the current word vector x_i; obtaining the combined hidden-layer state vector of each word in the training sample sentence by concatenating the two hidden-layer state vectors, and obtaining the feature vector T_i of each training sample sentence from the combined hidden-layer state vectors of all words in the training sample sentence;
Calculation step: in the third layer of the RNN model, expressing the average vector S of the enterprise entity pair with the average-vector expression, according to the feature vector T_i of each training sample sentence;
Weight determination step: in the last layer of the RNN model, substituting the average vector S and the relation type of the enterprise entity pair into the softmax classification function to compute the weight a_i of each training sample sentence, thereby obtaining a trained RNN model;
Prediction step: extracting from current text a sentence containing two enterprise entities, obtaining the feature vector T_i of the sentence through the bi-LSTM module, and inputting this feature vector T_i into the above trained RNN model to predict the relation between the two enterprise entities.
Preferably, the word segmentation step comprises:
Representing each segmented word in one-hot form to obtain an initial word vector, and assigning each training sample sentence a sentence ID; mapping the sentence ID to the initial sentence vector of the corresponding training sample sentence; inputting the initial sentence vector and the initial word vectors of the left and right adjacent words of a given word in the training sample sentence into the continuous bag-of-words model, and predicting the word vector x_i of that word; updating the sentence vector of the training sample sentence with every prediction, until the word vector x_i of every word in the training sample sentence has been predicted; and taking the last updated sentence vector as the sentence vector S_i of the training sample sentence.
Preferably, the concatenation step comprises:
Computing, from left to right, the first hidden-layer state vector h_i of the current word vector x_i from the hidden-layer state vector h_{i-1} of the previous word vector x_{i-1}, and computing, from right to left, the second hidden-layer state vector h_i' of the current word vector x_i from the hidden-layer state vector h_{i+1} of the following word vector x_{i+1}.
Preferably, the average-vector expression is:
S = sum(a_i * T_i) / n
where a_i represents the weight of a training sample sentence and is the value to be determined, T_i represents the feature vector of each training sample sentence of the enterprise entity pair, and n represents the number of training sample sentences.
Preferably, the softmax classification function has the expression:
σ(z)_j = e^(z_j) / Σ_{k=1}^{K} e^(z_k), for j = 1, ..., K
where K represents the number of enterprise relation types, S represents the average vector of the enterprise entity pair, z_j represents the enterprise relation type of the enterprise entity pair, and σ(z)_j represents the probability of the relation type to be predicted among all enterprise relation types.
The specific embodiment of the computer-readable storage medium of the present invention is substantially the same as the above specific embodiment of the enterprise relation extraction method, and is not described in detail here.
The serial numbers of the above embodiments of the invention are for description only and do not represent the advantages or disadvantages of the embodiments.
Through the above description of the embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be realised by means of software plus the necessary general hardware platform, and of course also by hardware, but in many cases the former is the better implementation. Based on this understanding, the technical solution of the present invention, in essence or in the part contributing to the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium as described above (such as a ROM/RAM, a magnetic disk or an optical disc) and includes several instructions for causing a terminal device (which may be a mobile phone, a computer, a server, a network device or the like) to execute the methods described in the embodiments of the present invention.
The above is only a preferred embodiment of the present invention and is not intended to limit the scope of the invention. Any equivalent structural or process transformation made using the contents of the description and drawings of the present invention, whether applied directly or indirectly in other related technical fields, is likewise included within the scope of protection of the present invention.

Claims (8)

1. An enterprise relation extraction method, characterised in that the method comprises:
a sample database establishment step: extracting from a knowledge base sentences containing enterprise entities between which a relation exists, as training sample sentences, to establish a sample database;
a word segmentation step: extracting from the sample database all training sample sentences containing a given enterprise entity pair, segmenting each training sample sentence with a preset word segmentation tool, mapping each segmented word to a word vector x_i, and mapping each training sample sentence to a sentence vector S_i, as the input of the first layer of a recurrent neural network model;
a concatenation step: in the second layer of the recurrent neural network model, computing from left to right, with a long short-term memory module, the first hidden-layer state vector h_i of the current word vector x_i, and computing from right to left the second hidden-layer state vector h_i' of the current word vector x_i; obtaining the combined hidden-layer state vector of each word in the training sample sentence by concatenating the two hidden-layer state vectors, and obtaining the feature vector T_i of each training sample sentence from the combined hidden-layer state vectors of all words in the training sample sentence;
a calculation step: in the third layer of the recurrent neural network model, expressing, according to the feature vector T_i of each training sample sentence of the enterprise entity pair, the average vector S of the enterprise entity pair with the average-vector expression S = sum(a_i * T_i) / n, where a_i represents the weight of each training sample sentence and is the value to be determined, T_i represents the feature vector of each training sample sentence, and n represents the number of training sample sentences;
a weight determination step: in the last layer of the recurrent neural network model, substituting the average vector S and the relation type of the enterprise entity pair into a softmax classification function to compute the weight a_i of each training sample sentence, thereby obtaining a trained recurrent neural network model;
a prediction step: extracting from current text a sentence containing two enterprise entities, obtaining the feature vector T_i of the sentence through a bidirectional long short-term memory module, and inputting this feature vector T_i into the above trained recurrent neural network model to predict the relation between the two enterprise entities.
2. The enterprise relation extraction method according to claim 1, characterised in that the word segmentation step comprises:
representing each segmented word in one-hot form to obtain an initial word vector, and assigning each training sample sentence a sentence ID; mapping the sentence ID to the initial sentence vector of the corresponding training sample sentence; inputting the initial sentence vector and the initial word vectors of the left and right adjacent words of a given word in the training sample sentence into a continuous bag-of-words model, and predicting the word vector x_i of that word; updating the sentence vector of the training sample sentence with every prediction, until the word vector x_i of every word in the training sample sentence has been predicted; and taking the last updated sentence vector as the sentence vector S_i of the training sample sentence.
3. The enterprise relationship extraction method according to claim 1, characterized in that the concatenation step comprises:
from left to right, computing the first hidden state vector h_i of the current word vector x_i from the hidden state vector h_{i-1} of the previous word vector x_{i-1}; and from right to left, computing the second hidden state vector h_i' of the current word vector x_i from the hidden state vector h_{i+1} of the next word vector x_{i+1}.
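The two recurrences in claim 3 can be made concrete with explicit LSTM cells: one pass consumes the word vectors left to right, the other right to left, and the two states of each word are concatenated. The PyTorch fragment below is a sketch with toy dimensions and random inputs; none of it comes from the patent itself.

```python
# Illustrative two-pass recurrence for claim 3 (PyTorch LSTM cells, toy dimensions).
import torch
import torch.nn as nn

emb_dim, hidden_dim, seq_len = 100, 128, 6
words = torch.randn(seq_len, emb_dim)                      # word vectors x_1 .. x_n of one sentence

fwd_cell = nn.LSTMCell(emb_dim, hidden_dim)                # left-to-right pass
bwd_cell = nn.LSTMCell(emb_dim, hidden_dim)                # right-to-left pass

h, c = torch.zeros(1, hidden_dim), torch.zeros(1, hidden_dim)
forward_states = []
for i in range(seq_len):                                   # h_i depends on x_i and h_{i-1}
    h, c = fwd_cell(words[i].unsqueeze(0), (h, c))
    forward_states.append(h)

h, c = torch.zeros(1, hidden_dim), torch.zeros(1, hidden_dim)
backward_states = [None] * seq_len
for i in reversed(range(seq_len)):                         # h_i' depends on x_i and h_{i+1}
    h, c = bwd_cell(words[i].unsqueeze(0), (h, c))
    backward_states[i] = h

# Composite hidden state of each word: [h_i ; h_i']
composite = [torch.cat([f, b], dim=-1) for f, b in zip(forward_states, backward_states)]
```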
4. The enterprise relationship extraction method according to claim 1, characterized in that the softmax classification function has the expression:
σ(z)_j = e^{z_j} / sum_{k=1}^{K} e^{z_k}
where K is the number of enterprise relationship types, S is the average vector of the enterprise entity pair, z_j is the score corresponding to the relationship type of the enterprise entity pair, and σ(z)_j is the probability of the relationship type to be predicted among all enterprise relationship types.
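As a numeric illustration of the classification in claim 4, the fragment below maps the average vector S to one score per relationship type and normalises the scores with a softmax. The linear projection from S to the scores z, the dimensions, and all names are assumptions made for the example only.

```python
# Toy softmax over K relationship types (illustrative; the projection W, b is assumed).
import numpy as np

K = 4                                        # number of enterprise relationship types
S = np.random.randn(256)                     # average vector of the entity pair
W, b = np.random.randn(K, 256), np.zeros(K)  # hypothetical classifier parameters

z = W @ S + b                                # one score z_j per relationship type
sigma = np.exp(z) / np.exp(z).sum()          # sigma(z)_j = e^{z_j} / sum_k e^{z_k}
predicted_type = int(np.argmax(sigma))       # relationship type with the highest probability
```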
5. An electronic device, characterized in that the device comprises a memory and a processor, an enterprise relationship extraction program being stored on the memory, and the enterprise relationship extraction program, when executed by the processor, implementing the following steps:
A sample database establishment step: extracting from a knowledge base, as training sample sentences, sentences containing enterprise entities between which a relationship exists, and building a sample database from them;
A word segmentation step: extracting from the sample database all training sample sentences that contain a given enterprise entity pair, segmenting each training sample sentence with a preset word segmentation tool, mapping each segmented word to a word vector x_i and each training sample sentence to a sentence vector S_i, and using these as the input of the first layer of a recurrent neural network model;
A concatenation step: in the second layer of the recurrent neural network model, computing with a long short-term memory module the first hidden state vector h_i of the current word vector x_i from left to right and the second hidden state vector h_i' of the current word vector x_i from right to left, concatenating the two hidden state vectors to obtain the composite hidden state vector of each word in the training sample sentence, and then obtaining the feature vector T_i of each training sample sentence from the composite hidden state vectors of all the words in that sentence;
A calculation step: in the third layer of the recurrent neural network model, expressing the average vector S of the enterprise entity pair from the feature vector T_i of each of its training sample sentences by the average-vector expression S = sum(a_i * T_i) / n, where a_i is the weight of each training sample sentence and is the quantity to be solved for, T_i is the feature vector of each training sample sentence, and n is the number of training sample sentences;
A weight determination step: in the last layer of the recurrent neural network model, substituting the average vector S and the relationship type of the enterprise entity pair into a softmax classification function to compute the weight a_i of each training sample sentence, thereby obtaining a trained recurrent neural network model;
A prediction step: extracting from a current text a sentence containing two enterprise entities, obtaining the feature vector T_i of the sentence with a bidirectional long short-term memory module, inputting this feature vector T_i into the trained recurrent neural network model, and predicting the relationship between the two enterprise entities.
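For the prediction step, a new sentence mentioning two enterprises is segmented, mapped to word indices, and passed through the trained model as a single-sentence bag. The helper below reuses the hypothetical RelationExtractor and word_to_id names from the sketch after claim 1 and is likewise only an illustration, not the patent's implementation.

```python
# Illustrative inference helper; RelationExtractor and word_to_id are the hypothetical
# names introduced in the sketch after claim 1, relation_names is a toy label list.
import torch

def predict_relation(model, word_to_id, segmented_sentence, relation_names):
    ids = torch.tensor([[word_to_id[w] for w in segmented_sentence]])  # (1, seq_len)
    with torch.no_grad():
        log_probs = model([ids])      # the sentence forms a one-element bag at prediction time
    return relation_names[int(log_probs.argmax())]

# Example call with toy labels:
# predict_relation(model, word_to_id,
#                  ["company_A", "acquires", "company_B"],
#                  ["parent-subsidiary", "supplier", "competitor", "none"])
```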
6. The electronic device according to claim 5, characterized in that the concatenation step comprises:
from left to right, computing the first hidden state vector h_i of the current word vector x_i from the hidden state vector h_{i-1} of the previous word vector x_{i-1}; and from right to left, computing the second hidden state vector h_i' of the current word vector x_i from the hidden state vector h_{i+1} of the next word vector x_{i+1}.
7. The electronic device according to claim 5, characterized in that the softmax classification function has the expression:
σ(z)_j = e^{z_j} / sum_{k=1}^{K} e^{z_k}
where K is the number of enterprise relationship types, S is the average vector of the enterprise entity pair, z_j is the score corresponding to the relationship type of the enterprise entity pair, and σ(z)_j is the probability of the relationship type to be predicted among all enterprise relationship types.
8. A computer-readable storage medium, characterized in that the computer-readable storage medium contains an enterprise relationship extraction program which, when executed by a processor, implements the steps of the enterprise relationship extraction method according to any one of claims 1 to 5.
CN201711061205.0A 2017-11-02 2017-11-02 Business connection extracting method, device and storage medium Active CN107943847B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201711061205.0A CN107943847B (en) 2017-11-02 2017-11-02 Business connection extracting method, device and storage medium
PCT/CN2018/076119 WO2019085328A1 (en) 2017-11-02 2018-02-10 Enterprise relationship extraction method and device, and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711061205.0A CN107943847B (en) 2017-11-02 2017-11-02 Business connection extracting method, device and storage medium

Publications (2)

Publication Number Publication Date
CN107943847A (en) 2018-04-20
CN107943847B (en) 2019-05-17

Family

ID=61934111

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711061205.0A Active CN107943847B (en) 2017-11-02 2017-11-02 Business connection extracting method, device and storage medium

Country Status (2)

Country Link
CN (1) CN107943847B (en)
WO (1) WO2019085328A1 (en)

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108876044B (en) * 2018-06-25 2021-02-26 中国人民大学 Online content popularity prediction method based on knowledge-enhanced neural network
CN108920587B (en) * 2018-06-26 2021-09-24 清华大学 Open domain visual question-answering method and device fusing external knowledge
CN108985501B (en) * 2018-06-29 2022-04-29 平安科技(深圳)有限公司 Index feature extraction-based stock index prediction method, server and storage medium
CN109243616A * 2018-06-29 2019-01-18 东华大学 Joint relation extraction and structuring system for breast electronic health records based on deep learning
CN110737758B (en) * 2018-07-03 2022-07-05 百度在线网络技术(北京)有限公司 Method and apparatus for generating a model
CN109063032B (en) * 2018-07-16 2020-09-11 清华大学 Noise reduction method for remote supervision and retrieval data
CN109597851B (en) * 2018-09-26 2023-03-21 创新先进技术有限公司 Feature extraction method and device based on incidence relation
CN109376250A * 2018-09-27 2019-02-22 中山大学 Joint entity and relation extraction method based on reinforcement learning
CN109582956B (en) * 2018-11-15 2022-11-11 中国人民解放军国防科技大学 Text representation method and device applied to sentence embedding
CN109710768B (en) * 2019-01-10 2020-07-28 西安交通大学 Tax payer industry two-level classification method based on MIMO recurrent neural network
CN112036181A (en) * 2019-05-14 2020-12-04 上海晶赞融宣科技有限公司 Entity relationship identification method and device and computer readable storage medium
CN110209836B (en) * 2019-05-17 2022-04-26 北京邮电大学 Remote supervision relation extraction method and device
CN111950279B (en) * 2019-05-17 2023-06-23 百度在线网络技术(北京)有限公司 Entity relationship processing method, device, equipment and computer readable storage medium
CN110188201A (en) * 2019-05-27 2019-08-30 上海上湖信息技术有限公司 A kind of information matching method and equipment
CN110188202B (en) * 2019-06-06 2021-07-20 北京百度网讯科技有限公司 Training method and device of semantic relation recognition model and terminal
CN110427624B (en) * 2019-07-30 2023-04-25 北京百度网讯科技有限公司 Entity relation extraction method and device
CN110619053A (en) * 2019-09-18 2019-12-27 北京百度网讯科技有限公司 Training method of entity relation extraction model and method for extracting entity relation
CN110879938A (en) * 2019-11-14 2020-03-13 中国联合网络通信集团有限公司 Text emotion classification method, device, equipment and storage medium
CN111382843B (en) * 2020-03-06 2023-10-20 浙江网商银行股份有限公司 Method and device for establishing enterprise upstream and downstream relationship identification model and mining relationship
CN111476035B (en) * 2020-05-06 2023-09-05 中国人民解放军国防科技大学 Chinese open relation prediction method, device, computer equipment and storage medium
CN111581387B (en) * 2020-05-09 2022-10-11 电子科技大学 Entity relation joint extraction method based on loss optimization
CN111680127A (en) * 2020-06-11 2020-09-18 暨南大学 Annual report-oriented company name and relationship extraction method
CN111784488B (en) * 2020-06-28 2023-08-01 中国工商银行股份有限公司 Enterprise fund risk prediction method and device
CN112418320B (en) * 2020-11-24 2024-01-19 杭州未名信科科技有限公司 Enterprise association relation identification method, device and storage medium
CN113486630B (en) * 2021-09-07 2021-11-19 浙江大学 Supply chain data vectorization and visualization processing method and device
CN113806538B (en) * 2021-09-17 2023-08-22 平安银行股份有限公司 Label extraction model training method, device, equipment and storage medium
CN116562303B (en) * 2023-07-04 2023-11-21 之江实验室 Reference resolution method and device for reference external knowledge

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106372058A (en) * 2016-08-29 2017-02-01 中译语通科技(北京)有限公司 Short text emotion factor extraction method and device based on deep learning
CN106407211A (en) * 2015-07-30 2017-02-15 富士通株式会社 Method and device for classifying semantic relationships among entity words
CN106569998A (en) * 2016-10-27 2017-04-19 浙江大学 Text named entity recognition method based on Bi-LSTM, CNN and CRF
CN106855853A (en) * 2016-12-28 2017-06-16 成都数联铭品科技有限公司 Entity relation extraction system based on deep neural network
CN107220237A * 2017-05-24 2017-09-29 南京大学 Enterprise entity relation extraction method based on convolutional neural networks

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160217393A1 (en) * 2013-09-12 2016-07-28 Hewlett-Packard Development Company, L.P. Information extraction
CN107194422A * 2017-06-19 2017-09-22 中国人民解放军国防科学技术大学 Convolutional neural network relation classification method combining forward and reverse instances

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106407211A (en) * 2015-07-30 2017-02-15 富士通株式会社 Method and device for classifying semantic relationships among entity words
CN106372058A (en) * 2016-08-29 2017-02-01 中译语通科技(北京)有限公司 Short text emotion factor extraction method and device based on deep learning
CN106569998A (en) * 2016-10-27 2017-04-19 浙江大学 Text named entity recognition method based on Bi-LSTM, CNN and CRF
CN106855853A (en) * 2016-12-28 2017-06-16 成都数联铭品科技有限公司 Entity relation extraction system based on deep neural network
CN107220237A * 2017-05-24 2017-09-29 南京大学 Enterprise entity relation extraction method based on convolutional neural networks

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
An Improved Method for Chinese Company Name and Abbreviation Recognition; Lei Meng et al.; Knowledge Management in Organizations; 2017-07-12; pp. 435-447
Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification; Peng Zhou et al.; Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics; 2016-08-12; pp. 207-212
Classifying Relation via Bidirectional Recurrent Neural Network Based on Local Information; Xiaoyun Hou et al.; Web Technologies and Applications; 2016-09-17; pp. 420-430
Research on Semantic Relation Classification Based on LSTM; Hu Xinchen; China Master's Theses Full-text Database, Information Science and Technology; 2016-02-15; Vol. 2016, No. 02; pp. I138-2096
Research on Denoising in Distantly Supervised Person Relation Extraction; Huang Beijing et al.; Computer Applications and Software; 2017-07-31; Vol. 34, No. 7; pp. 11-18

Also Published As

Publication number Publication date
CN107943847A (en) 2018-04-20
WO2019085328A1 (en) 2019-05-09

Similar Documents

Publication Publication Date Title
CN107943847B (en) Business connection extracting method, device and storage medium
CN107330011B Multi-strategy fusion based named entity recognition method and device
CN107832299B (en) Title rewriting processing method and device based on artificial intelligence and readable medium
CN110489555A Language model pre-training method combining word class information
CN108563703A Charge determination method, device, computer equipment and storage medium
CN108647205A Fine-grained sentiment analysis model construction method, device and readable storage medium
CN108304468A Text classification method and text classification device
CN113051356B (en) Open relation extraction method and device, electronic equipment and storage medium
CN104809105B Maximum entropy based method and system for recognizing event arguments and argument roles
CN113378970B (en) Sentence similarity detection method and device, electronic equipment and storage medium
CN116097250A (en) Layout aware multimodal pre-training for multimodal document understanding
WO2021139316A1 (en) Method and apparatus for establishing expression recognition model, and computer device and storage medium
CN110059924A Contract clause checking method, device, equipment and computer readable storage medium
CN115392237B (en) Emotion analysis model training method, device, equipment and storage medium
CN113707299A (en) Auxiliary diagnosis method and device based on inquiry session and computer equipment
CN107943788A Enterprise abbreviation generation method, device and storage medium
CN113821622B (en) Answer retrieval method and device based on artificial intelligence, electronic equipment and medium
CN113627797B (en) Method, device, computer equipment and storage medium for generating staff member portrait
CN110489765A (en) Machine translation method, device and computer readable storage medium
CN117290515A (en) Training method of text annotation model, method and device for generating text graph
CN112632227A (en) Resume matching method, resume matching device, electronic equipment, storage medium and program product
CN116681082A (en) Discrete text semantic segmentation method, device, equipment and storage medium
CN116821373A (en) Map-based prompt recommendation method, device, equipment and medium
CN116628162A (en) Semantic question-answering method, device, equipment and storage medium
CN115510188A (en) Text keyword association method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant