CN108052683A - Knowledge graph representation learning method based on a cosine metric rule - Google Patents
Knowledge graph representation learning method based on a cosine metric rule
- Publication number
- CN108052683A CN108052683A CN201810058745.1A CN201810058745A CN108052683A CN 108052683 A CN108052683 A CN 108052683A CN 201810058745 A CN201810058745 A CN 201810058745A CN 108052683 A CN108052683 A CN 108052683A
- Authority
- CN
- China
- Prior art keywords
- entity
- vector
- candidate
- Prior art date
- 2018-01-22
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/367—Ontology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/02—Knowledge representation; Symbolic representation
- G06N5/022—Knowledge engineering; Knowledge acquisition
Abstract
The present invention discloses a knowledge graph representation learning method based on a cosine metric rule. First, the entities and relations of the knowledge graph are randomly embedded into two vector spaces. Next, a candidate-entity statistics rule is used to collect the triple set and the candidate entity vector set associated with the relation in question. Score functions between the target vector and the candidate entities are then constructed from cosine similarity and used to evaluate the candidate entities. Finally, the candidate entity vectors of all related triples are trained jointly with the target vector through a loss function, which is minimized by the stochastic gradient descent algorithm. When the optimization target is reached, the optimal representation of every entity vector and relation vector in the knowledge graph is obtained, so that the connections between entities and relations are expressed more faithfully and the method can be applied effectively to large-scale knowledge graph completion.
Description
Technical field
The present invention relates to the technical field of knowledge graphs, and in particular to a knowledge graph representation learning method based on a cosine metric rule.
Background technology
With the arrival of the big data era, the dividend of data has driven artificial intelligence technology to develop at an unprecedented pace. Knowledge engineering, with knowledge graphs as its main representative, and related fields such as machine learning, with representation learning as its main representative, have both made rapid progress. On the one hand, as representation learning exhausts the dividend of big data, the performance of representation learning models tends toward a bottleneck. On the other hand, large knowledge graphs continue to emerge, yet these treasure troves of human prior knowledge have not yet been effectively exploited by representation learning. Fusing knowledge graphs with representation learning has therefore become an important way to further improve the performance of representation learning models. The symbolism represented by knowledge graphs and the connectionism represented by representation learning are increasingly departing from their separate tracks of development and moving forward together on a new road of collaboration.
A knowledge graph is essentially a semantic network that expresses various entities, concepts, and the semantic relations between them. Compared with traditional knowledge representations (such as ontologies and traditional semantic networks), a knowledge graph offers high entity/concept coverage, diverse semantic relations, a friendly structure (typically expressed in RDF format), and higher quality, so that knowledge graphs have increasingly become the dominant form of knowledge representation in the era of big data and artificial intelligence.
The triple is a general representation form for knowledge graphs. Its basic forms include (entity 1, relation, entity 2) and (concept, attribute, attribute value). Entities are the most basic elements of a knowledge graph, and different relations exist between different entities. Concepts mainly cover sets, object types, and categories of things, such as geography or people; attributes refer to the attributive features and characteristics an object possesses, such as nationality or date of birth; attribute values are the values corresponding to those attributes, such as China or 1993-01-12. A triple is usually written as (head, relation, tail), abbreviated (h, r, t), where r denotes the relation between the head entity h and the tail entity t. For example, the knowledge that Paris is the capital of France can be expressed in a knowledge graph by the triple (Paris, is-the-capital-of, France). A small sketch of this triple form is given below.
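As an illustration of the (h, r, t) form just described, here is a minimal Python sketch; the facts and the helper name are chosen for illustration and are not taken from the patent:

```python
# A minimal sketch of the (head, relation, tail) triple form described above.
# The facts and the helper name are illustrative, not taken from the patent.
triples = {
    ("Paris", "is_capital_of", "France"),
    ("France", "located_in", "Europe"),
}

def relations_between(head, tail, kg):
    """Return every relation r such that (head, r, tail) is a fact in the graph kg."""
    return {r for (h, r, t) in kg if h == head and t == tail}

print(relations_between("Paris", "France", triples))  # {'is_capital_of'}
```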
Representation learning for knowledge graphs aims to embed the entities and relations of a knowledge graph into a low-dimensional vector space and to express them as dense, low-dimensional, real-valued vectors. Its key lies in reasonably defining, for each fact (triple (h, r, t)) in the knowledge graph, a loss function f_r(h, t) over the vector representations of the two entities h and t. In general, when the fact (h, r, t) holds, f_r(h, t) is expected to be minimized. Considering all facts of the entire knowledge graph, the vector representations of entities and relations can then be learned by minimizing the overall loss. Different representation learning methods define their loss functions according to different principles and techniques. The translation models currently represented by the TransE model have attracted wide attention for their outstanding performance and small number of model parameters. However, although existing translation models can handle simple 1-to-1 relations effectively, they remain limited for complex 1-to-N, N-to-1, and N-to-N relations. As a result, existing knowledge graph representation learning methods cannot be applied well to large-scale knowledge graphs. A minimal sketch of the translation scoring idea referred to here is given below.
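For context only, the following is a minimal sketch of the generic translation-model (TransE-style) score mentioned above; it is the baseline idea the invention improves on, not the cosine-based score of the invention itself, and the embedding dimension is an arbitrary choice:

```python
import numpy as np

def transe_score(h_vec, r_vec, t_vec, norm=1):
    """Generic translation-model dissimilarity ||h + r - t||: the lower the value,
    the more plausible the triple. This is the TransE-style baseline idea referred
    to in the background, not the cosine-based score of the present method."""
    return np.linalg.norm(h_vec + r_vec - t_vec, ord=norm)

h, r, t = np.random.rand(3, 50)  # toy 50-dimensional embeddings for one triple
print(transe_score(h, r, t))
```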
Summary of the invention
The problem to be solved by the present invention is that existing knowledge graph representation learning methods cannot handle complex 1-to-N, N-to-1, and N-to-N relations well. The invention provides a knowledge graph representation learning method based on a cosine metric rule.
To solve the above problem, the present invention is realized through the following technical solution:
A knowledge graph representation learning method based on a cosine metric rule includes the following steps:
Step 1: using random vector generation, embed the entity set and the relation set of the knowledge graph into an entity vector space and a relation vector space respectively, obtaining entity vectors and relation vectors.
Step 2: using the candidate-entity statistics rule, obtain the candidate entity vector sets of a randomly selected triple, and randomly generate error entity vector sets from the candidate entity vector sets.
Step 3: construct the score function between the target vector and each candidate entity vector using cosine similarity, and standardize the range of its values.
Step 4: using the score functions, build a margin-based loss function that separates the candidate entity vector sets from the error entity vector sets, and then let the candidate entity vector sets impose a unified constraint on the target vector according to the loss function.
Step 5: optimize the value of the loss function with an optimization algorithm, so that the score function values of the candidate entity vector sets approach 1 and the score function values of the error entity vector sets approach 0; the learned optimal vector representations of entities and relations are obtained when the optimization target is reached (an end-to-end sketch of these five steps follows this list).
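The following is a minimal, self-contained end-to-end sketch of steps 1–5, written in PyTorch under simplifying assumptions: a toy triple set, only the tail side of the loss, and illustrative values for the dimension, α, γ, and learning rate. It is one reading of the method, not the patented implementation itself:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
dim = 16                                   # embedding dimension (illustrative)
ent = torch.nn.Embedding(5, dim)           # step 1: entity vector space
rel = torch.nn.Embedding(4, dim)           # step 1: relation vector space
opt = torch.optim.SGD(list(ent.parameters()) + list(rel.parameters()), lr=0.01)

# Toy triple set (head, relation, tail) given by indices; illustrative only.
triples = [(0, 1, 2), (0, 1, 3), (1, 1, 2), (0, 2, 4)]
alpha, gamma = 0.5, 0.4                    # range parameter and margin (assumed values)

def score(g, e):
    # Step 3: f(g, e) = alpha + (1 - alpha) * cos<g, e>, the cosine metric rule.
    return alpha + (1 - alpha) * F.cosine_similarity(g, e, dim=-1)

for epoch in range(100):
    loss = torch.tensor(0.0)
    for h, r, t in triples:
        # Step 2: candidate tails share (h, r, .); the remaining entities act as error tails.
        cand_t = {tt for (hh, rr, tt) in triples if hh == h and rr == r}
        err_t = [e for e in range(5) if e not in cand_t]
        g_t = ent.weight[h] + rel.weight[r]          # target vector g_t = h0 + r0
        # Step 4: margin-based loss over candidate / error tail pairs (head side analogous).
        for ct in cand_t:
            for et in err_t:
                loss = loss + torch.clamp(
                    gamma + score(g_t, ent.weight[et]) - score(g_t, ent.weight[ct]), min=0)
    opt.zero_grad()
    loss.backward()                                   # step 5: stochastic gradient descent
    opt.step()

print(float(loss))                                    # loss after the last update
```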
The specific sub-steps of the above step 2 are as follows:
Step 2.1: randomly select one triple from the triple set of the knowledge graph.
Step 2.2: find in the triple set all tail entity vectors that simultaneously match the head entity vector and the relation vector of the selected triple, and form the candidate tail entity vector set from the tail entity vectors found; likewise, find in the triple set all head entity vectors that simultaneously match the tail entity vector and the relation vector of the selected triple, and form the candidate head entity vector set from the head entity vectors found.
Step 2.3: perform a random replacement operation on the candidate tail entity vector set to generate the error tail entity vector set; likewise, perform a random replacement operation on the candidate head entity vector set to generate the error head entity vector set. (A sketch of these sub-steps is given below.)
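A plain-Python sketch of sub-steps 2.1–2.3 over an (h, r, t) triple set; the function names and the corruption size k are chosen here for illustration:

```python
import random

def candidate_sets(triple, kg):
    """Sub-steps 2.1-2.2: given the selected triple (h0, r0, t0), collect every tail
    that matches (h0, r0, .) and every head that matches (., r0, t0) in the triple set kg."""
    h0, r0, t0 = triple
    cand_tails = {t for (h, r, t) in kg if h == h0 and r == r0}
    cand_heads = {h for (h, r, t) in kg if r == r0 and t == t0}
    return cand_tails, cand_heads

def corrupt(candidates, entities, k):
    """Sub-step 2.3: random replacement that admits only entities which are NOT
    already candidates, forming the error entity set."""
    pool = [e for e in entities if e not in candidates]
    return set(random.sample(pool, min(k, len(pool))))
```

The same helpers are applied to the Fig. 2 example later in the detailed description.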
The score functions constructed in the above step 3 are as follows.
The score function f_t(g_t, t) of a candidate tail entity vector is
$$f_t(g_t, t) = \alpha + (1-\alpha)\,\frac{g_t \cdot t}{\lVert g_t\rVert\,\lVert t\rVert},$$
the score function f_h(g_h, h) of a candidate head entity vector is
$$f_h(g_h, h) = \alpha + (1-\alpha)\,\frac{g_h \cdot h}{\lVert g_h\rVert\,\lVert h\rVert},$$
the score function f_t'(g_t, t') of an error tail entity vector is
$$f_t'(g_t, t') = \alpha + (1-\alpha)\,\frac{g_t \cdot t'}{\lVert g_t\rVert\,\lVert t'\rVert},$$
and the score function f_h'(g_h, h') of an error head entity vector is
$$f_h'(g_h, h') = \alpha + (1-\alpha)\,\frac{g_h \cdot h'}{\lVert g_h\rVert\,\lVert h'\rVert}.$$
In the above formulas, α is the parameter that standardizes the range of the score function values; g_t = h_0 + r_0 is the target vector for tail entities and g_h = t_0 − r_0 is the target vector for head entities, where h_0, t_0, and r_0 are the head entity vector, tail entity vector, and relation vector of the selected triple; t is a candidate tail entity vector, h is a candidate head entity vector, t' is an error tail entity vector, and h' is an error head entity vector.
The above range-standardizing parameter satisfies α ∈ [0, 1].
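A NumPy sketch of this cosine-based score, under the assumption that the denominator is the product of the two vector norms as in the standard cosine similarity; α = 0.5 is an illustrative choice that maps the score onto [0, 1]:

```python
import numpy as np

def cosine_score(g, e, alpha=0.5):
    """f(g, e) = alpha + (1 - alpha) * cos<g, e>.  With alpha in [0, 1] the cosine
    similarity is shifted into a standardized range; alpha = 0.5 maps it onto [0, 1]."""
    cos = np.dot(g, e) / (np.linalg.norm(g) * np.linalg.norm(e))
    return alpha + (1 - alpha) * cos

h0, r0, t = np.random.rand(3, 32)   # toy 32-dimensional vectors for one triple
g_t = h0 + r0                       # tail target vector g_t = h0 + r0
print(cosine_score(g_t, t))         # candidate tail score f_t(g_t, t)
```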
The loss function L constructed in the above step 4 is
$$L = \sum_{t \in t_c,\; t' \in t'_c}\bigl[\gamma + f_t'(g_t, t') - f_t(g_t, t)\bigr]_+ \;+\; \sum_{h \in h_c,\; h' \in h'_c}\bigl[\gamma + f_h'(g_h, h') - f_h(g_h, h)\bigr]_+,$$
where γ is the set margin value; f_t'(g_t, t') denotes the score function of an error tail entity vector, f_t(g_t, t) the score function of a candidate tail entity vector, f_h'(g_h, h') the score function of an error head entity vector, and f_h(g_h, h) the score function of a candidate head entity vector; g_t = h_0 + r_0 and g_h = t_0 − r_0 are the tail and head target vectors, with h_0, t_0, and r_0 the head entity vector, tail entity vector, and relation vector of the selected triple; t_c and t'_c are the candidate and error tail entity vector sets with elements t and t', and h_c and h'_c are the candidate and error head entity vector sets with elements h and h'.
The optimization algorithm used in the above step 5 is the stochastic gradient descent algorithm. (A sketch of this loss follows.)
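A NumPy sketch of the margin-based loss above, summing the hinge term [γ + f' − f]_+ over candidate/error pairs for both the tail and head sides; the helper names and default values are illustrative:

```python
import numpy as np

def cosine_score(g, e, alpha=0.5):
    cos = np.dot(g, e) / (np.linalg.norm(g) * np.linalg.norm(e))
    return alpha + (1 - alpha) * cos

def margin_loss(g_t, cand_t, err_t, g_h, cand_h, err_h, gamma=0.4):
    """L = sum_[t, t'] [gamma + f_t'(g_t, t') - f_t(g_t, t)]_+
         + sum_[h, h'] [gamma + f_h'(g_h, h') - f_h(g_h, h)]_+"""
    loss = 0.0
    for t in cand_t:
        for t_bad in err_t:
            loss += max(0.0, gamma + cosine_score(g_t, t_bad) - cosine_score(g_t, t))
    for h in cand_h:
        for h_bad in err_h:
            loss += max(0.0, gamma + cosine_score(g_h, h_bad) - cosine_score(g_h, h))
    return loss
```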
Compared with the prior art, the present invention has the following features:
First, a candidate-entity statistics rule is proposed, which collects by counting the candidate entity vector sets of the relation in question.
Second, by introducing the cosine similarity between two vectors, the cosine value between the target vector and a candidate entity vector is computed as the measure of the difference between the two individuals. Unlike the Euclidean distance, which simply computes the distance between two vectors, cosine similarity focuses on the difference in direction between them. This overcomes the shortcomings of existing models when handling complex 1-to-N, N-to-1, and N-to-N relations, enriches the expressive power of entities and relations, and improves model performance overall.
Third, a unified constraint on the target vector is formed by combining all candidate entities of the related triples, which improves the interaction between the candidate entity vectors and the target vector.
Description of the drawings
Fig. 1 is a flowchart of the knowledge graph representation learning method based on a cosine metric rule according to the present invention.
Fig. 2 is an example diagram of entities and relation triples in a knowledge graph.
Fig. 3 is an example diagram of the training objective of the knowledge graph representation learning method based on a cosine metric rule according to the present invention, where (a) shows the state before training and (b) the state after training.
Specific embodiment
To make the objects, technical solutions, and advantages of the present invention clearer, the present invention is described in more detail below in conjunction with specific examples and with reference to the accompanying drawings.
The invention discloses a knowledge graph representation learning method based on a cosine metric rule. As shown in Fig. 1, the entities and relations of the knowledge graph are first randomly embedded into two vector spaces. Next, the candidate-entity statistics rule is used to collect the triple set and candidate entity vector set of the relation in question, and error entity vector sets are randomly generated from the candidate entity vector sets. The score functions between the target vector and the candidate entities are then constructed from cosine similarity and used to evaluate the candidate entities. Finally, the candidate entity vectors of all related triples are trained jointly with the target vector through the loss function, and the loss function is minimized by the stochastic gradient descent algorithm. When the optimization target is reached, the optimal representation of every entity vector and relation vector in the knowledge graph is obtained, so that the connections between entities and relations are expressed better and the method can be applied well to large-scale knowledge graph completion. By training uniformly with cosine similarity, the present invention overcomes the inability of existing models to handle complex 1-to-N, N-to-1, and N-to-N relations well, and has strong feasibility and good practicability.
The present invention considers the structured triple information in a knowledge graph and represents knowledge in the typical triple form (head, relation, tail), where the relation connects the head and tail entities and characterizes the association between the two entities. Fig. 2 is an example diagram of a typical knowledge graph triple structure. The circles represent entity nodes (such as "Tang**", "Yi**", and "Di**"), and the edge between two entities represents a relation (such as "nationality", "president", or "daughter"). It can also be seen that multiple relations exist between the entity "Tang**" and "United States", and that the relations "daughter" and "nationality" each correspond to multiple entity pairs.
The knowledge graph representation learning method based on a cosine metric rule uses the knowledge graph of Fig. 2 as the training set; Fig. 2 contains 5 entities, 4 relations, and 8 triples. The specific implementation of the method includes the following steps.
Step 1: using random vector generation, embed the entities and relations of the knowledge graph into two vector spaces respectively.
Step 1.1: express the entity set {"Tang**", "Ma*", "Di**", "Yi**", "United States"} of the Fig. 2 knowledge graph as {e1, e2, e3, e4, e5} and the relation set {"wife", "daughter", "president", "nationality"} as {r1, r2, r3, r4}. The Fig. 2 knowledge graph contains the triple set {(e1, r1, e2), (e1, r2, e3), (e1, r2, e4), (e1, r3, e5), (e2, r2, e3), (e3, r3, e5), (e4, r3, e5), (e5, r4, e1)}.
Step 1.2: embed them into the entity vector space and the relation vector space respectively, obtaining the entity vectors {e1, e2, e3, e4, e5} and the relation vectors {r1, r2, r3, r4}. (A sketch of this step is given below.)
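A NumPy sketch of this random embedding step for the Fig. 2 graph; the dimension, random seed, and distribution are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(42)
dim = 32                                   # embedding dimension, chosen for illustration

entities = ["e1", "e2", "e3", "e4", "e5"]  # Tang**, Ma*, Di**, Yi**, United States
relations = ["r1", "r2", "r3", "r4"]       # wife, daughter, president, nationality
triples = [("e1", "r1", "e2"), ("e1", "r2", "e3"), ("e1", "r2", "e4"), ("e1", "r3", "e5"),
           ("e2", "r2", "e3"), ("e3", "r3", "e5"), ("e4", "r3", "e5"), ("e5", "r4", "e1")]

# Step 1.2: embed the entity set and the relation set into two separate vector spaces.
ent_vec = {e: rng.normal(size=dim) for e in entities}
rel_vec = {r: rng.normal(size=dim) for r in relations}
```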
Step 2: using the candidate-entity statistics rule, obtain the triple set and candidate entity vector sets of the relation in question, and randomly generate error entity vector sets.
The candidate-entity statistics rule is as follows.
First, randomly select a triple from the knowledge graph, e.g. (h0, r0, t0).
Step 2.1: by random selection, obtain the triple (Tang**, daughter, Di**), i.e. (e1, r2, e3).
Second, match in the training set the triple sets (h0, r0, t_c) and (h_c, r0, t0) that contain (h0, r0, ·) and (·, r0, t0) respectively, where t_c is the set of all candidate tail entities for the target vector g_t = h0 + r0, containing |t_c| different tail entities, and h_c is the set of all candidate head entities for the target vector g_h = t0 − r0, containing |h_c| different head entities.
Step 2.2: according to (e1, r2, ·), match the triple set {(e1, r2, e3), (e1, r2, e4)} and obtain the candidate tail entity set {e3, e4}.
Step 2.3: according to (·, r2, e3), match the triple set {(e1, r2, e3), (e2, r2, e3)} and obtain the candidate head entity set {e1, e2}.
Third, randomly generate the error entity sets corresponding to the candidate entity sets. During the generation of an error entity set, each generated error entity is compared against the candidate entities in the candidate entity set, and only an error entity that does not appear in the candidate entity set is admitted into the error entity set.
Step 2.4: randomly generate the error tail entity set {e5, e2} and the error head entity set {e4, e5}. (A sketch of this step on the Fig. 2 graph follows.)
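A plain-Python sketch of this candidate-entity statistics step applied to the Fig. 2 triple set: the printed candidate sets should be {e3, e4} and {e1, e2}, while the error sets are random draws, so {e5, e2} and {e4, e5} are just one possible outcome:

```python
import random

entities = ["e1", "e2", "e3", "e4", "e5"]
triples = [("e1", "r1", "e2"), ("e1", "r2", "e3"), ("e1", "r2", "e4"), ("e1", "r3", "e5"),
           ("e2", "r2", "e3"), ("e3", "r3", "e5"), ("e4", "r3", "e5"), ("e5", "r4", "e1")]

h0, r0, t0 = "e1", "r2", "e3"    # the randomly selected triple (Tang**, daughter, Di**)

cand_tails = {t for (h, r, t) in triples if h == h0 and r == r0}   # -> {'e3', 'e4'}
cand_heads = {h for (h, r, t) in triples if r == r0 and t == t0}   # -> {'e1', 'e2'}

# Error sets: random replacements drawn only from entities outside the candidate set.
err_tails = set(random.sample([e for e in entities if e not in cand_tails], len(cand_tails)))
err_heads = set(random.sample([e for e in entities if e not in cand_heads], len(cand_heads)))
print(cand_tails, cand_heads, err_tails, err_heads)
```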
Step 3: construct the score functions between the target vector and the candidate entity vectors using cosine similarity, and standardize the range of their values, generally to [0, 1].
Step 3.1: using the cosine formula, the cosine value cos⟨a, b⟩ of vectors a and b is expressed as
$$\cos\langle a, b\rangle = \frac{a \cdot b}{\lVert a\rVert\,\lVert b\rVert}.$$
Step 3.2: based on the above cosine formula, construct the score functions.
From the cosine similarity between the target vector and the candidate tail entity vector, the score function f_t(g_t, t) of a candidate tail entity is constructed as
$$f_t(g_t, t) = \alpha + (1-\alpha)\,\frac{g_t \cdot t}{\lVert g_t\rVert\,\lVert t\rVert}.$$
From the cosine similarity between the target vector and the candidate head entity vector, the score function f_h(g_h, h) of a candidate head entity is constructed as
$$f_h(g_h, h) = \alpha + (1-\alpha)\,\frac{g_h \cdot h}{\lVert g_h\rVert\,\lVert h\rVert}.$$
Here α ∈ [0, 1] is the parameter that standardizes the range of the score function values, g_t = h0 + r0 is the tail target vector, g_h = t0 − r0 is the head target vector, t is a candidate tail entity vector, and h is a candidate head entity vector.
From the cosine similarity between the target vector and the error tail entity vector, the score function f_t'(g_t, t') of an error tail entity is constructed as
$$f_t'(g_t, t') = \alpha + (1-\alpha)\,\frac{g_t \cdot t'}{\lVert g_t\rVert\,\lVert t'\rVert}.$$
From the cosine similarity between the target vector and the error head entity vector, the score function f_h'(g_h, h') of an error head entity is constructed as
$$f_h'(g_h, h') = \alpha + (1-\alpha)\,\frac{g_h \cdot h'}{\lVert g_h\rVert\,\lVert h'\rVert}.$$
Here α ∈ [0, 1] is the parameter that standardizes the range of the score function values, g_t = h0 + r0, g_h = t0 − r0, t' is an error tail entity vector, and h' is an error head entity vector.
Step 4: using the score functions, build the margin-based loss function that separates the candidate entity vector sets from the error entity vector sets, and then let the candidate entity vector sets impose a unified constraint on the target vector according to the loss function.
Step 4.1: the margin-based loss function is constructed as
$$L = \sum_{t \in t_c,\; t' \in t'_c}\bigl[\gamma + f_t'(g_t, t') - f_t(g_t, t)\bigr]_+ \;+\; \sum_{h \in h_c,\; h' \in h'_c}\bigl[\gamma + f_h'(g_h, h') - f_h(g_h, h)\bigr]_+,$$
where [γ + f' − f]_+ = max(0, γ + f' − f); γ is the set margin value; f_t'(g_t, t') is the score function of an error tail entity vector, f_t(g_t, t) the score function of a candidate tail entity vector, f_h'(g_h, h') the score function of an error head entity vector, and f_h(g_h, h) the score function of a candidate head entity vector; g_t = h0 + r0 and g_h = t0 − r0 are the tail and head target vectors, with h0, t0, and r0 the head entity vector, tail entity vector, and relation vector of the selected triple; t_c and t'_c are the candidate and error tail entity vector sets with elements t and t', and h_c and h'_c are the candidate and error head entity vector sets with elements h and h'.
Step 4.2: for the triple set {(e1, r2, e3), (e1, r2, e4)} and the candidate tail entity set {e3, e4} obtained in step 2.2, compute the scores with the score functions of step 3.2:
Step 4.2.1: the score of (e1, r2, e3) is f_t(g_t, e3), and the score of its corresponding error triple (e1, r2, e5) is f_t'(g_t, e5);
Step 4.2.2: the score of (e1, r2, e4) is f_t(g_t, e4), and the score of its corresponding error triple (e1, r2, e2) is f_t'(g_t, e2).
Step 4.3: for the triple set {(e1, r2, e3), (e2, r2, e3)} and the candidate head entity set {e1, e2} obtained in step 2.3, compute the scores with the score functions of step 3.2:
Step 4.3.1: the score of (e1, r2, e3) is f_h(g_h, e1), and the score of its corresponding error triple (e4, r2, e3) is f_h'(g_h, e4);
Step 4.3.2: the score of (e2, r2, e3) is f_h(g_h, e2), and the score of its corresponding error triple (e5, r2, e3) is f_h'(g_h, e5).
Step 4.4: substitute the scores computed in steps 4.2 and 4.3 into the loss function L to obtain the loss value. Through the margin-based loss function, all candidate entity vectors of the related triples form a unified constraint on the target vector. (A worked sketch of this step is given below.)
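A NumPy sketch of the score and loss computation of steps 4.2–4.4 for this toy example; the vectors are random initial embeddings, so the printed value is arbitrary and only the structure of the computation is illustrated (γ = 0.4 is an assumed margin):

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 32
vec = {name: rng.normal(size=dim) for name in ["e1", "e2", "e3", "e4", "e5", "r2"]}

def score(g, e, alpha=0.5):
    return alpha + (1 - alpha) * np.dot(g, e) / (np.linalg.norm(g) * np.linalg.norm(e))

g_t = vec["e1"] + vec["r2"]    # tail target vector for (e1, r2, .)
g_h = vec["e3"] - vec["r2"]    # head target vector for (., r2, e3)
gamma = 0.4                    # margin, value chosen for illustration

loss = 0.0
# Step 4.2: candidate tails e3, e4 against their error tails e5, e2.
for cand, err in [("e3", "e5"), ("e4", "e2")]:
    loss += max(0.0, gamma + score(g_t, vec[err]) - score(g_t, vec[cand]))
# Step 4.3: candidate heads e1, e2 against their error heads e4, e5.
for cand, err in [("e1", "e4"), ("e2", "e5")]:
    loss += max(0.0, gamma + score(g_h, vec[err]) - score(g_h, vec[cand]))
print(loss)                    # Step 4.4: the loss value L for this selected triple
```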
Step 5: optimize the loss value with the optimization algorithm, so that the score function values of the candidate entity vector sets approach 1 and the score function values of the error entity vector sets approach 0, and the vector representations of entities and relations reach the optimization target.
Step 5.1: the optimization algorithm uses stochastic gradient descent to minimize the loss function and obtain the optimal scores of all candidate entity vectors, so that the learned optimal vector representations of entities and relations reach the optimization target. (A sketch of one update step follows.)
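A minimal PyTorch sketch of one stochastic gradient descent update on the tail-side loss of the toy example (the head side is analogous); the learning rate, margin, and dimension are illustrative assumptions, not values given by the patent:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
dim = 32
ent = torch.nn.Embedding(5, dim)           # e1 .. e5 at indices 0 .. 4
rel = torch.nn.Embedding(4, dim)           # r1 .. r4 at indices 0 .. 3
opt = torch.optim.SGD(list(ent.parameters()) + list(rel.parameters()), lr=0.01)

def score(g, e, alpha=0.5):
    return alpha + (1 - alpha) * F.cosine_similarity(g, e, dim=0)

E = ent.weight                              # entity vectors (autograd tracks the slices)
g_t = E[0] + rel.weight[1]                  # g_t = e1 + r2
gamma = 0.4

loss = (torch.clamp(gamma + score(g_t, E[4]) - score(g_t, E[2]), min=0) +   # e3 vs error e5
        torch.clamp(gamma + score(g_t, E[1]) - score(g_t, E[3]), min=0))    # e4 vs error e2
opt.zero_grad()
loss.backward()     # gradients flow into the candidate, error, head, and relation vectors
opt.step()          # one stochastic gradient descent step toward the optimization target
print(float(loss))
```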
The translation principle adopted by the knowledge graph representation learning method of the present invention is shown in Fig. 3. The basic idea is as follows: first, construct the training target vector g = h + r (or g = t − r) from the triple (h, r, t) and obtain the candidate entity vector set, such as {e1, e2, e3, e4} in Fig. 3; next, compute the scores using the cosine similarity between two vectors; finally, use the stochastic gradient descent algorithm to adjust the candidate entity vectors according to the loss, while adjusting the target vector according to the overall change direction of the preferred entities. By using the cosine similarity between two vectors, the present invention can better compute the difference between the target vector and a candidate entity vector. Unlike the Euclidean distance, which simply computes the distance between two vectors, cosine similarity focuses on the difference in direction between them. This overcomes the shortcomings of existing models when handling complex 1-to-N, N-to-1, and N-to-N relations, enriches the expressive power of entities and relations, and improves model performance overall.
By using the cosine similarity between two vectors, the present invention can better compute the similarity between the target vector and a candidate entity vector. Based on an embedding model of entity vectors and relation vectors, the candidate-entity statistics rule is used to obtain the triple set and candidate entity vector sets of the relation in question; the cosine similarity between the target vector and the candidate entity vectors is introduced, which enhances the ability of the model to express complex relations such as 1-to-N, N-to-1, and N-to-N, and a dedicated score function between the target vector and the candidate entity vectors is constructed. Finally, a new loss function is constructed and optimized by the stochastic gradient descent algorithm. When the optimal optimization target is reached, the optimal entity vectors and relation vectors of the knowledge graph are obtained, so that entities and relations are represented better, the connections between entities and relations are preserved, and the method can be applied well to large-scale knowledge graph completion.
It should be noted that although the above embodiments of the present invention are illustrative, they do not limit the present invention, and therefore the present invention is not limited to the above specific embodiments. Without departing from the principles of the present invention, every other embodiment obtained by those skilled in the art under the enlightenment of the present invention shall be regarded as falling within the protection scope of the present invention.
Claims (6)
1. A knowledge graph representation learning method based on a cosine metric rule, characterized in that it includes the following steps:
step 1: using random vector generation, embedding the entity set and the relation set of the knowledge graph into an entity vector space and a relation vector space respectively, to obtain entity vectors and relation vectors;
step 2: using a candidate-entity statistics rule, obtaining the candidate entity vector sets of a randomly selected triple, and randomly generating error entity vector sets from the candidate entity vector sets;
step 3: constructing the score function between the target vector and each candidate entity vector using cosine similarity, and standardizing the range of its values;
step 4: using the score functions, building a margin-based loss function that separates the candidate entity vector sets from the error entity vector sets, and then letting the candidate entity vector sets impose a unified constraint on the target vector according to the loss function;
step 5: optimizing the value of the loss function with an optimization algorithm, so that the score function values of the candidate entity vector sets approach 1 and the score function values of the error entity vector sets approach 0, the learned optimal vector representations of entities and relations being obtained when the optimization target is reached.
2. The knowledge graph representation learning method based on a cosine metric rule according to claim 1, characterized in that the specific sub-steps of step 2 are as follows:
step 2.1: randomly selecting one triple from the triple set of the knowledge graph;
step 2.2: finding in the triple set all tail entity vectors that simultaneously match the head entity vector and the relation vector of the selected triple, and forming the candidate tail entity vector set from the tail entity vectors found; and likewise finding in the triple set all head entity vectors that simultaneously match the tail entity vector and the relation vector of the selected triple, and forming the candidate head entity vector set from the head entity vectors found;
step 2.3: performing a random replacement operation on the candidate tail entity vector set to generate the error tail entity vector set; and likewise performing a random replacement operation on the candidate head entity vector set to generate the error head entity vector set.
3. The knowledge graph representation learning method based on a cosine metric rule according to claim 1 or 2, characterized in that the score functions constructed in step 3 are as follows:
the score function f_t(g_t, t) of a candidate tail entity vector is
$$f_t(g_t, t) = \alpha + (1-\alpha)\,\frac{g_t \cdot t}{\lVert g_t\rVert\,\lVert t\rVert},$$
the score function f_h(g_h, h) of a candidate head entity vector is
$$f_h(g_h, h) = \alpha + (1-\alpha)\,\frac{g_h \cdot h}{\lVert g_h\rVert\,\lVert h\rVert},$$
the score function f_t'(g_t, t') of an error tail entity vector is
$$f_t'(g_t, t') = \alpha + (1-\alpha)\,\frac{g_t \cdot t'}{\lVert g_t\rVert\,\lVert t'\rVert},$$
and the score function f_h'(g_h, h') of an error head entity vector is
$$f_h'(g_h, h') = \alpha + (1-\alpha)\,\frac{g_h \cdot h'}{\lVert g_h\rVert\,\lVert h'\rVert},$$
where α is the parameter that standardizes the range of the score function values; g_t = h_0 + r_0 is the target vector for tail entities and g_h = t_0 − r_0 is the target vector for head entities; h_0, t_0, and r_0 are the head entity vector, tail entity vector, and relation vector of the selected triple; t is a candidate tail entity vector, h is a candidate head entity vector, t' is an error tail entity vector, and h' is an error head entity vector.
4. The knowledge graph representation learning method based on a cosine metric rule according to claim 3, characterized in that the range-standardizing parameter of the score function satisfies α ∈ [0, 1].
5. The knowledge graph representation learning method based on a cosine metric rule according to claim 1, characterized in that the loss function L constructed in step 4 is
$$L = \sum_{t \in t_c,\; t' \in t'_c}\bigl[\gamma + f_t'(g_t, t') - f_t(g_t, t)\bigr]_+ \;+\; \sum_{h \in h_c,\; h' \in h'_c}\bigl[\gamma + f_h'(g_h, h') - f_h(g_h, h)\bigr]_+,$$
where γ is the set margin value; f_t'(g_t, t') denotes the score function of an error tail entity vector, f_t(g_t, t) the score function of a candidate tail entity vector, f_h'(g_h, h') the score function of an error head entity vector, and f_h(g_h, h) the score function of a candidate head entity vector; g_t = h_0 + r_0 is the target vector for tail entities and g_h = t_0 − r_0 is the target vector for head entities; h_0, t_0, and r_0 are the head entity vector, tail entity vector, and relation vector of the selected triple; t is a candidate tail entity vector, t_c is the candidate tail entity vector set, t' is an error tail entity vector, t'_c is the error tail entity vector set, h is a candidate head entity vector, h_c is the candidate head entity vector set, h' is an error head entity vector, and h'_c is the error head entity vector set.
6. The knowledge graph representation learning method based on a cosine metric rule according to claim 1, characterized in that the optimization algorithm in step 5 is the stochastic gradient descent algorithm.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810058745.1A CN108052683B (en) | 2018-01-22 | 2018-01-22 | Knowledge graph representation learning method based on cosine measurement rule |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810058745.1A CN108052683B (en) | 2018-01-22 | 2018-01-22 | Knowledge graph representation learning method based on cosine measurement rule |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108052683A true CN108052683A (en) | 2018-05-18 |
CN108052683B CN108052683B (en) | 2021-08-03 |
Family
ID=62127689
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810058745.1A Active CN108052683B (en) | 2018-01-22 | 2018-01-22 | Knowledge graph representation learning method based on cosine measurement rule |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108052683B (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110188147A (en) * | 2019-05-22 | 2019-08-30 | 厦门无常师教育科技有限公司 | The document entity relationship of knowledge based map finds method and system |
CN111597276A (en) * | 2020-05-07 | 2020-08-28 | 科大讯飞(苏州)科技有限公司 | Entity alignment method, device and equipment |
CN113360675A (en) * | 2021-06-25 | 2021-09-07 | 中关村智慧城市产业技术创新战略联盟 | Knowledge graph specific relation completion method based on Internet open world |
CN114219089A (en) * | 2021-11-11 | 2022-03-22 | 山东人才发展集团信息技术有限公司 | Construction method and equipment of new-generation information technology industry knowledge graph |
US20220383164A1 (en) * | 2021-05-25 | 2022-12-01 | Accenture Global Solutions Limited | Methods and Systems for Generating Example-Based Explanations of Link Prediction Models in Knowledge Graphs |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160080422A1 (en) * | 2014-09-12 | 2016-03-17 | International Business Machines Corporation | Transforming business policies to information technology security control terms for improved system compliance |
CN106934042A (en) * | 2017-03-16 | 2017-07-07 | 中国人民解放军国防科学技术大学 | A kind of knowledge mapping represents model and its method |
CN106951499A (en) * | 2017-03-16 | 2017-07-14 | 中国人民解放军国防科学技术大学 | A kind of knowledge mapping method for expressing based on translation model |
CN107590139A (en) * | 2017-09-21 | 2018-01-16 | 桂林电子科技大学 | A kind of knowledge mapping based on circular matrix translation represents learning method |
US20180060734A1 (en) * | 2016-08-31 | 2018-03-01 | International Business Machines Corporation | Responding to user input based on confidence scores assigned to relationship entries in a knowledge graph |
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160080422A1 (en) * | 2014-09-12 | 2016-03-17 | International Business Machines Corporation | Transforming business policies to information technology security control terms for improved system compliance |
US20180060734A1 (en) * | 2016-08-31 | 2018-03-01 | International Business Machines Corporation | Responding to user input based on confidence scores assigned to relationship entries in a knowledge graph |
CN106934042A (en) * | 2017-03-16 | 2017-07-07 | 中国人民解放军国防科学技术大学 | A kind of knowledge mapping represents model and its method |
CN106951499A (en) * | 2017-03-16 | 2017-07-14 | 中国人民解放军国防科学技术大学 | A kind of knowledge mapping method for expressing based on translation model |
CN107590139A (en) * | 2017-09-21 | 2018-01-16 | 桂林电子科技大学 | A kind of knowledge mapping based on circular matrix translation represents learning method |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110188147A (en) * | 2019-05-22 | 2019-08-30 | 厦门无常师教育科技有限公司 | The document entity relationship of knowledge based map finds method and system |
CN111597276A (en) * | 2020-05-07 | 2020-08-28 | 科大讯飞(苏州)科技有限公司 | Entity alignment method, device and equipment |
CN111597276B (en) * | 2020-05-07 | 2023-09-29 | 科大讯飞(苏州)科技有限公司 | Entity alignment method, device and equipment |
US20220383164A1 (en) * | 2021-05-25 | 2022-12-01 | Accenture Global Solutions Limited | Methods and Systems for Generating Example-Based Explanations of Link Prediction Models in Knowledge Graphs |
CN113360675A (en) * | 2021-06-25 | 2021-09-07 | 中关村智慧城市产业技术创新战略联盟 | Knowledge graph specific relation completion method based on Internet open world |
CN113360675B (en) * | 2021-06-25 | 2024-02-13 | 中关村智慧城市产业技术创新战略联盟 | Knowledge graph specific relationship completion method based on Internet open world |
CN114219089A (en) * | 2021-11-11 | 2022-03-22 | 山东人才发展集团信息技术有限公司 | Construction method and equipment of new-generation information technology industry knowledge graph |
Also Published As
Publication number | Publication date |
---|---|
CN108052683B (en) | 2021-08-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108052683A (en) | A kind of knowledge mapping based on cosine measurement rule represents learning method | |
CN110309267B (en) | Semantic retrieval method and system based on pre-training model | |
CN112131404B (en) | Entity alignment method in four-risk one-gold domain knowledge graph | |
CN111198950B (en) | Knowledge graph representation learning method based on semantic vector | |
CN110334219B (en) | Knowledge graph representation learning method based on attention mechanism integrated with text semantic features | |
CN108492200B (en) | User attribute inference method and device based on convolutional neural network | |
CN110069638B (en) | Knowledge graph combined representation learning method combining rules and paths | |
CN107885760A (en) | It is a kind of to represent learning method based on a variety of semantic knowledge mappings | |
CN108009285B (en) | Forest Ecology man-machine interaction method based on natural language processing | |
CN110046262A (en) | A kind of Context Reasoning method based on law expert's knowledge base | |
CN110347847A (en) | Knowledge mapping complementing method neural network based | |
CN114036307B (en) | Knowledge graph entity alignment method and device | |
CN112988917B (en) | Entity alignment method based on multiple entity contexts | |
WO2024007476A1 (en) | Design concept generation network construction method and automatic concept scheme generation method | |
CN113360621A (en) | Scene text visual question-answering method based on modal inference graph neural network | |
CN110909172B (en) | Knowledge representation learning method based on entity distance | |
CN112559734A (en) | Presentation generation method and device, electronic equipment and computer readable storage medium | |
CN108427717A (en) | It is a kind of based on the alphabetic class family of languages medical treatment text Relation extraction method gradually extended | |
CN113987203A (en) | Knowledge graph reasoning method and system based on affine transformation and bias modeling | |
CN114911879A (en) | Criminal case knowledge map relation reasoning method based on graph attention network | |
CN118036726A (en) | Knowledge graph link prediction method based on heuristic information and graph neural network | |
CN107402919A (en) | Machine translation data selecting method and machine translation data selection system based on figure | |
CN116882403A (en) | Geographic naming entity multi-target matching method | |
Dong et al. | Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval | |
CN115034837A (en) | Product sales prediction method, equipment and medium based on knowledge graph |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
EE01 | Entry into force of recordation of patent licensing contract | Application publication date: 20180518; Assignee: GUANGXI XINKAI ZHIQUAN TECHNOLOGY Co.,Ltd.; Assignor: GUILIN University OF ELECTRONIC TECHNOLOGY; Contract record no.: X2023980045064; Denomination of invention: A Knowledge Graph Representation Learning Method Based on Cosine Metric Rules; Granted publication date: 20210803; License type: Common License; Record date: 20231101 ||