CN106156488A - Knowledge graph based on Bayes's personalized ordering link Forecasting Methodology - Google Patents
Knowledge graph based on Bayes's personalized ordering link Forecasting Methodology Download PDFInfo
- Publication number
- CN106156488A CN106156488A CN201610460412.2A CN201610460412A CN106156488A CN 106156488 A CN106156488 A CN 106156488A CN 201610460412 A CN201610460412 A CN 201610460412A CN 106156488 A CN106156488 A CN 106156488A
- Authority
- CN
- China
- Prior art keywords
- relation
- link
- knowledge graph
- main body
- bayes
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16Z—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS, NOT OTHERWISE PROVIDED FOR
- G16Z99/00—Subject matter not provided for in other main groups of this subclass
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The present invention discloses a kind of knowledge graph based on Bayes's personalized ordering link Forecasting Methodology.The method considers the common factor of relation between prediction task subject and object by proposing a potential feature link forecast model, and utilizes Bayes's personalized ordering to optimize the forecast model of relation between subject and object.The inventive method can solve the problem that in knowledge graph link prediction the occuring simultaneously on the impact predicted the outcome of relation between subject and object, and avoids overfitting.
Description
Technical field
The present invention relates to the Link Recommendation of knowledge graph, utilize Bayes's personalized ordering model to optimize the suitable of recommended links
Sequence, belongs to computer technology, information network, data mining interleaving techniques application.
Background technology
Knowledge graph is an entity information storehouse that can deposit any data, often it is characterized in that being and other entity
Relevant.Resource description framework (RDF) is the framework that a capture inter-entity is the most mutual.One RDF data collection is equivalent to
One isomery figure, the most each summit and limit may belong to different entity class, and category information is trapped in various entity type and pass
Taxonomical hierarchy structure between set type.From the natural language text of knowledge graph, set up the knowledge graph that can recommend need a large amount of
Work because in extracting entity, relationship map and link prediction these three significant process, extracting data and semantic role mark
Note is the most noisy.Therefore method for designing must reduce noise, this method to have considered below: (1) believable data
Source;(2) probits obtained by a natural language processing engine, have expressed the correct probability analyzed;(3) want
There is main body and the object of priori.Therefore the present invention uses the incorporation model of Bayes's personalization rank algorithm.
Link prediction in knowledge graph is a machine learning method, utilizes the priori of subject and object to assess three
Trust value between key element.
Summary of the invention
Technical problem: it is an object of the invention to provide a kind of knowledge graph based on Bayes's personalized ordering link prediction side
Method, in solution knowledge graph link prediction, between subject and object, the common factor of set of relationship, on the impact predicted the outcome, utilizes Bayes
Personalized ordering optimizes the forecast model of relation between subject and object.
Technical scheme: knowledge graph link method based on Bayes's personalized ordering of the present invention propose one based on
The link forecast model of feature carrys out the link in prediction knowledge figure knowledge graph, and utilizes Bayes's personalized ordering to optimize this mould
Type.
Knowledge graph link method based on Bayes's personalized ordering of the present invention comprises the following steps:
Step 1) set up knowledge graph, specifically comprise the following steps that
Step 11) three elements figure G=(S, P, O) of definition knowledge graph, (s, p, o)=1 represents that main body S and object O exist to G
Relation P, (s, p, o)=0 represents that main body S and object O do not exist relation P, wherein s ∈ S, p ∈ P, o ∈ O to G.Described three elements bag
Containing the relation between main body, object and Subjective and Objective, described s is main body, and described o is object, and described p is the pass between subject and object
System, described S is host complex, and described O is object set, and described P is the set of relation between subject and object.
Step 12) definition bipartite graph Gp(Sp,Op), for relation p between subject and object, Gp(Sp,Op)=0 represents main body sp
With object opBetween there is not relation p, Gp(sp,op)=1 represents main body spWith object opBetween there is relation p, described sp∈Sp,op∈
Op, described bipartite graph is that the vertex set of figure may be partitioned into two mutually disjoint subsets, and two that in figure, each edge depends on
Summit all belongs to the mutually disjoint subset of the two, and the summit in two subsets is non-conterminous.
Step 2) establish the link forecast model, specifically comprise the following steps that
Step 21) link prediction be conceptualized as, to each relation p with at GpIn entity (s, o), target is to set up one
Forecast model Mp(s, o), xs,o=Mp(s, o), described p ∈ P, s ∈ S, o ∈ O, described xs,oIt it is the result of forecast model.
Step 22) use logistic functionIt is used as standards of grading, describedIt is the probability of main body s and object o opening relationships p, describedIt is s Yu o opening relationships p.
Step 3) definition feature based forecast modelWherein spWith opIt is definition respectively
Bipartite graph Gp(Sp,OpSubject and object in), s ∈ S, p ∈ P, o ∈ O, described S are host complex, and described O is object set,
Described P is the set of relation between subject and object, and described T is transposition operation.Described xs,oIt is the result of forecast model,Value
The biggest, spAnd opIn low gt, more for similar, the probability establishing the link relation is the biggest;DescribedDescribed U and V is low dimensional vector, and described IR is real number field, and described K is potential feature
Number.Described vector space is such a set, and the most any two elements are added another yuan that may make up in this set
Element, arbitrary element obtains another element in this set with Arbitrary Digit after being multiplied, described low-dimensional is that independent parameter number is less.
Step 4) utilize Bayes's personalized ordering algorithm to be optimized, step is as follows:
Step 41) set up order standardDescribed Dp
It is training sample set, describedRepresent to exist between s with o and link, describedRepresent not exist between s with o and link, described
σ (.) is logistic function, and described log istic function isDescribed a is function variable, describedFor the standardization of data, in order to avoid overfitting, described λ is a constant controlling matching speed, described
Θp={ Up,Vp,bp, described Up∈IR|S|×K,Vp∈IR|O|×K,bp∈IR|O|×1, described | S | represents host complex S interior element
Number, described | O | represents host complex O interior element number.
Step 42) initialize UpSized by for m × K value be all 0 matrix, initialization VpSized by be all 0 for n × K value
Matrix, bpSized by for n × 1 value be all 0 vector.Described n is the element number in object set O, and described m is the theme set
Element number in S, the number of the potential feature of described K.
Step 43) for eachIntegrating step 3) updateValue, finally return
Return Up, Vp, bpValue, described renewalFormula be:
Described
Described
Described
Described
Described
Described α is a constant constant for Schistosomiasis control speed, and described λ is a constant controlling matching speed.
Step 5) according to step 2) and step 3) calculate the probability of the relation of establishing the linkDescribedIt it is the probability of main body s and object o opening relationships p;.
Step 6) definition prediction threshold values ε, whenTime, then there is relation p between prediction main body s and object o, whenTime, then there is not relation p between prediction main body s and object o.
Beneficial effect: the present invention proposes a kind of knowledge graph based on Bayes's personalized ordering link Forecasting Methodology, specifically
Have the beneficial effect that:
1) we achieve one for the probability assessing knowledge graph three main points in link prediction.Specifically, I
Used for reference successful method, in commending system field, use the algorithm of knowledge graph, and a prominent benchmark dataset entered
Go and assessed thoroughly.
2) we have proposed the Link Recommendation model prediction task that a feature based embeds, and utilize Bayes personalized
Optimisation technique based on rank algorithm sets up learning model for each predicate.
3) we apply a linear regression model (LRM) opening up for the induced subgraph of each predicate and original knowledge graph
Flutter the relation between structure quantitative analysis precision of prediction.Our research indicate that, such as the index such as cluster coefficients and average degree, can
Using use as the index affecting precision of prediction.
Accompanying drawing explanation
Fig. 1 is knowledge graph Link Recommendation method flow based on Bayes's personalization algorithm.
Detailed description of the invention
The link prediction specific embodiment that the present invention uses Bayes's personalized ordering algorithm below is retouched in more detail
State.
With reference to the accompanying drawings 1, knowledge graph link problems is defined as bipartite graph, uses the following step that is embodied as:
(1) relation between the main body of Input knowledge figure, object and Subjective and Objective.
(2) bipartite graph G is builtp(Sp,OpIn), with main body spWith object opCollect the vertex set as bipartite graph, main body and visitor
Between body, relation p is as the limit collection of bipartite graph.Gp(Sp,Op)=0 represents summit spWith summit opBetween there is not limit p, Gp(sp,op)=1
Represent summit spWith summit opBetween there is limit p, described sp∈Sp,op∈Op。
Complete after knowledge graph link problems is defined as bipartite graph, forecast model to be established the link, specifically comprise the following steps that
(1) forecast model of feature based is set up
(2) logistic function is usedIt is used as standards of grading.
In being embodied as,It is worth the biggest, spAnd opIn low gt, more for similar, establish the link relation
Probability is the biggest.DescribedDescribed U and V is low dimensional vector, and described IR is real number field, institute
Stating the number that K is potential feature, described vector space is such a set, and the most any two elements are added and may make up this set
Another interior element, arbitrary element obtains another element in this set with Arbitrary Digit after being multiplied, described low-dimensional is independent ginseng
Keep count of less.DescribedIt is the probability of main body s and object o opening relationships p, describedIt is that s Yu o sets up pass
It is p.
After link forecast model has been set up, Bayes's personalized ordering algorithm to be utilized is optimized, and is embodied as step
Rapid as follows:
(1) U is initializedpSized by for m × K value be all 0 matrix, initialization VpSized by for n × K value be all 0 square
Battle array, bpSized by for n × 1 value be all 0 vector.Described n is the element number in object set O, and described m is the theme and gathers S
Interior element number, the number of the potential feature of described K.
(2) for eachUpdateValue, finally return to Up, Vp, bpValue,
Described renewalFormula be:
Described
Described
Described
Described
Described
Described α is a constant constant for Schistosomiasis control speed, and described λ is a constant controlling matching speed.
DescribedIt it is order standard.
Described DpIt is training sample set, describedRepresent to exist between s with o and link, describedRepresent between s and o
There is not link, described σ (.) is logistic function, and described logistic function isDescribed a is that function becomes
Amount, describedFor the standardization of data, in order to avoid overfitting, described λ is one and controls the normal of matching speed
Number, described Θp={ Up,Vp,bp, described Up∈IR|S|×K,Vp∈IR|O|×K,bp∈IR|O|×1, described | S | represents host complex S
Interior element number, described | O | represents host complex O interior element number.
In being embodied as, according to the probability of linking relationshipSet prediction threshold values ε, when
Time, then there is relation p between prediction main body s and object o, whenTime, then do not exist between prediction main body s and object o
Relation p, describedIt it is the probability of main body s and object o opening relationships p.
Claims (4)
1. knowledge graph based on a Bayes's personalized ordering link Forecasting Methodology, it is characterised in that the method includes following step
Rapid:
Step 1) set up knowledge graph;
Step 2) establish the link forecast model;
Step 3) definition feature based forecast modelWherein spWith opIt is two points of definition respectively
Figure Gp(Sp,OpSubject and object in), s ∈ S, p ∈ P, o ∈ O, described s are main bodys, and described o is object, described p be main body with
Relation between object, described S is host complex, and described O is object set, and described P is the set of relation between subject and object, institute
Stating T is transposition operation;Described xs,oIt is the result of forecast model,It is worth the biggest, spAnd opIn low gt, more for phase
Seemingly, the probability establishing the link relation is the biggest;DescribedDescribed U and V is low dimensional vector,
Described IR is real number field, and described K is the number of potential feature;
Step 4) utilize Bayes's personalized ordering algorithm to be optimized;
Step 5) according to step 2) and step 3) calculate the probability of the relation of establishing the linkDescribedIt is
Main body s and the probability of object o opening relationships p;
Step 6) definition prediction threshold values ε, whenTime, then there is relation p between prediction main body s and object o, whenTime, then there is not relation p between prediction main body s and object o.
Knowledge graph based on Bayes's personalized ordering the most according to claim 1 link Forecasting Methodology, it is characterised in that
Described step 1) set up knowledge graph, specifically comprise the following steps that
Step 11) three elements figure G=(S, P, O) of definition knowledge graph, (s, p, o)=1 represents that main body S and object O exist relation to G
(s, p, o)=0 represents that main body S and object O do not exist relation P, wherein s ∈ S, and p ∈ P, o ∈ O, described three elements comprise master for P, G
Relation between body, object and Subjective and Objective, described S is host complex, and described O is object set, and described P is to close between subject and object
The set of system,
Step 12) definition bipartite graph Gp(Sp,Op), for relation p between subject and object, Gp(Sp,Op)=0 represents main body spWith visitor
Body opBetween there is not relation p, Gp(sp,op)=1 represents main body spWith object opBetween there is relation p, described sp∈Sp,op∈Op, institute
State the vertex set that bipartite graph is figure and may be partitioned into two mutually disjoint subsets, and two summits that in figure, each edge depends on are all
Belonging to the mutually disjoint subset of the two, the summit in two subsets is non-conterminous.
Knowledge graph based on Bayes's personalized ordering the most according to claim 1 link Forecasting Methodology, it is characterised in that
Described step 2) establish the link forecast model, specifically comprise the following steps that
Step 21) link prediction be conceptualized as, to each relation p with at GpIn entity (s, o), target is to set up a prediction
Model Mp(s, o), xs,o=Mp(s, o), described p ∈ P, s ∈ S, o ∈ O, described xs,oIt it is the result of forecast model;
Step 22) use logistic functionIt is used as standards of grading, described
It is the probability of main body s and object o opening relationships p, describedIt is s Yu o opening relationships p.
Knowledge graph based on Bayes's personalized ordering the most according to claim 1 link Forecasting Methodology, it is characterised in that
Described step 4) utilize Bayes's personalized ordering algorithm to be optimized, step is as follows:
Step 41) set up order standardDescribed DpIt it is instruction
Practice sample set, describedRepresent to exist between s with o and link, describedRepresent not exist between s with o and link, described σ (.)
Being logistic function, described logistic function isDescribed a is function variable, describedWith
In the standardization of data, in order to avoid overfitting, described λ is a constant controlling matching speed, described Θp={ Up,Vp,
bp, described Up∈IR|S|×K,Vp∈IR|O|×K,bp∈IR|O|×1, described | S | represents host complex S interior element number, described | O |
Representing host complex O interior element number, described vector space is such a set, and the most any two elements are added and may make up this
Another element in set, arbitrary element obtains another element in this set with Arbitrary Digit after being multiplied, described low-dimensional is only
Vertical number of parameters is less;
Step 42) initialize UpSized by for m × K value be all 0 matrix, initialization VpSized by for n × K value be all 0 square
Battle array, bpSized by for n × 1 value be all 0 vector, described n is the element number in object set O, described m be the theme gather S
Interior element number, the number of the potential feature of described K;
Step 43) for eachIntegrating step 3) updateValue, finally return to Up,
Vp, bpValue, described renewalFormula be:
Described
Described
Described
Described
Described
Described α is a constant constant for Schistosomiasis control speed, and described λ is a constant controlling matching speed.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610460412.2A CN106156488A (en) | 2016-06-22 | 2016-06-22 | Knowledge graph based on Bayes's personalized ordering link Forecasting Methodology |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610460412.2A CN106156488A (en) | 2016-06-22 | 2016-06-22 | Knowledge graph based on Bayes's personalized ordering link Forecasting Methodology |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106156488A true CN106156488A (en) | 2016-11-23 |
Family
ID=57353622
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610460412.2A Pending CN106156488A (en) | 2016-06-22 | 2016-06-22 | Knowledge graph based on Bayes's personalized ordering link Forecasting Methodology |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106156488A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106682095A (en) * | 2016-12-01 | 2017-05-17 | 浙江大学 | Subjectterm and descriptor prediction and ordering method based on diagram |
CN106997488A (en) * | 2017-03-22 | 2017-08-01 | 扬州大学 | A kind of action knowledge extraction method of combination markov decision process |
CN109086373A (en) * | 2018-07-23 | 2018-12-25 | 东南大学 | A method of the fair link forecast assessment system of building |
-
2016
- 2016-06-22 CN CN201610460412.2A patent/CN106156488A/en active Pending
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106682095A (en) * | 2016-12-01 | 2017-05-17 | 浙江大学 | Subjectterm and descriptor prediction and ordering method based on diagram |
CN106682095B (en) * | 2016-12-01 | 2019-11-08 | 浙江大学 | The prediction of subject description word and sort method based on figure |
CN106997488A (en) * | 2017-03-22 | 2017-08-01 | 扬州大学 | A kind of action knowledge extraction method of combination markov decision process |
CN109086373A (en) * | 2018-07-23 | 2018-12-25 | 东南大学 | A method of the fair link forecast assessment system of building |
CN109086373B (en) * | 2018-07-23 | 2021-01-12 | 东南大学 | Method for constructing fair link prediction evaluation system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Manna et al. | A fuzzy rank-based ensemble of CNN models for classification of cervical cytology | |
Lei et al. | Shallow convolutional neural network for image classification | |
US20180276542A1 (en) | Recommendation Result Generation Method and Apparatus | |
CN111881342A (en) | Recommendation method based on graph twin network | |
WO2023065859A1 (en) | Item recommendation method and apparatus, and storage medium | |
CN109034228B (en) | Image classification method based on differential privacy and hierarchical relevance propagation | |
Bai et al. | Weakly supervised object localization via transformer with implicit spatial calibration | |
CN107239532B (en) | Data mining method and device | |
Nguyen et al. | A deep manifold-regularized learning model for improving phenotype prediction from multi-modal data | |
CN113378938B (en) | Edge transform graph neural network-based small sample image classification method and system | |
CN112529071B (en) | Text classification method, system, computer equipment and storage medium | |
Wu | Image retrieval method based on deep learning semantic feature extraction and regularization softmax | |
CN106156488A (en) | Knowledge graph based on Bayes's personalized ordering link Forecasting Methodology | |
Wang et al. | Research on maize disease recognition method based on improved resnet50 | |
Zhang et al. | A bagging dynamic deep learning network for diagnosing COVID-19 | |
Li et al. | Piecewise convolutional neural networks with position attention and similar bag attention for distant supervision relation extraction | |
Zhang et al. | Skip-attention encoder–decoder framework for human motion prediction | |
CN110363283A (en) | User property prediction technique and relevant apparatus based on deep learning | |
Aoyagi | A Bayesian learning coefficient of generalization error and Vandermonde matrix-type singularities | |
Jiang et al. | On consensus-optimality trade-offs in collaborative deep learning | |
CN112541128A (en) | Personalized news recommendation method based on characteristic bidirectional dynamic cooperation | |
Yi et al. | OCRNet for Diabetic Foot Ulcer Segmentation Combined with Edge Loss | |
Hannak et al. | Efficient graph signal recovery over big networks | |
Sun et al. | Randomized nonlinear two-dimensional principal component analysis network for object recognition | |
Laskar et al. | Artificial Neural Networks and Gene Expression Programing based age estimation using facial features |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20161123 |