CN106156488A - Knowledge graph based on Bayes's personalized ordering link Forecasting Methodology - Google Patents

Knowledge graph based on Bayes's personalized ordering link Forecasting Methodology Download PDF

Info

Publication number
CN106156488A
CN106156488A CN201610460412.2A CN201610460412A CN106156488A CN 106156488 A CN106156488 A CN 106156488A CN 201610460412 A CN201610460412 A CN 201610460412A CN 106156488 A CN106156488 A CN 106156488A
Authority
CN
China
Prior art keywords
relation
link
knowledge graph
main body
bayes
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610460412.2A
Other languages
Chinese (zh)
Inventor
陈清子
陈志�
岳文静
刘亚威
王梦伊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Post and Telecommunication University
Nanjing University of Posts and Telecommunications
Original Assignee
Nanjing Post and Telecommunication University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Post and Telecommunication University filed Critical Nanjing Post and Telecommunication University
Priority to CN201610460412.2A priority Critical patent/CN106156488A/en
Publication of CN106156488A publication Critical patent/CN106156488A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16ZINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS, NOT OTHERWISE PROVIDED FOR
    • G16Z99/00Subject matter not provided for in other main groups of this subclass

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention discloses a kind of knowledge graph based on Bayes's personalized ordering link Forecasting Methodology.The method considers the common factor of relation between prediction task subject and object by proposing a potential feature link forecast model, and utilizes Bayes's personalized ordering to optimize the forecast model of relation between subject and object.The inventive method can solve the problem that in knowledge graph link prediction the occuring simultaneously on the impact predicted the outcome of relation between subject and object, and avoids overfitting.

Description

Knowledge graph based on Bayes's personalized ordering link Forecasting Methodology
Technical field
The present invention relates to the Link Recommendation of knowledge graph, utilize Bayes's personalized ordering model to optimize the suitable of recommended links Sequence, belongs to computer technology, information network, data mining interleaving techniques application.
Background technology
Knowledge graph is an entity information storehouse that can deposit any data, often it is characterized in that being and other entity Relevant.Resource description framework (RDF) is the framework that a capture inter-entity is the most mutual.One RDF data collection is equivalent to One isomery figure, the most each summit and limit may belong to different entity class, and category information is trapped in various entity type and pass Taxonomical hierarchy structure between set type.From the natural language text of knowledge graph, set up the knowledge graph that can recommend need a large amount of Work because in extracting entity, relationship map and link prediction these three significant process, extracting data and semantic role mark Note is the most noisy.Therefore method for designing must reduce noise, this method to have considered below: (1) believable data Source;(2) probits obtained by a natural language processing engine, have expressed the correct probability analyzed;(3) want There is main body and the object of priori.Therefore the present invention uses the incorporation model of Bayes's personalization rank algorithm.
Link prediction in knowledge graph is a machine learning method, utilizes the priori of subject and object to assess three Trust value between key element.
Summary of the invention
Technical problem: it is an object of the invention to provide a kind of knowledge graph based on Bayes's personalized ordering link prediction side Method, in solution knowledge graph link prediction, between subject and object, the common factor of set of relationship, on the impact predicted the outcome, utilizes Bayes Personalized ordering optimizes the forecast model of relation between subject and object.
Technical scheme: knowledge graph link method based on Bayes's personalized ordering of the present invention propose one based on The link forecast model of feature carrys out the link in prediction knowledge figure knowledge graph, and utilizes Bayes's personalized ordering to optimize this mould Type.
Knowledge graph link method based on Bayes's personalized ordering of the present invention comprises the following steps:
Step 1) set up knowledge graph, specifically comprise the following steps that
Step 11) three elements figure G=(S, P, O) of definition knowledge graph, (s, p, o)=1 represents that main body S and object O exist to G Relation P, (s, p, o)=0 represents that main body S and object O do not exist relation P, wherein s ∈ S, p ∈ P, o ∈ O to G.Described three elements bag Containing the relation between main body, object and Subjective and Objective, described s is main body, and described o is object, and described p is the pass between subject and object System, described S is host complex, and described O is object set, and described P is the set of relation between subject and object.
Step 12) definition bipartite graph Gp(Sp,Op), for relation p between subject and object, Gp(Sp,Op)=0 represents main body sp With object opBetween there is not relation p, Gp(sp,op)=1 represents main body spWith object opBetween there is relation p, described sp∈Sp,op∈ Op, described bipartite graph is that the vertex set of figure may be partitioned into two mutually disjoint subsets, and two that in figure, each edge depends on Summit all belongs to the mutually disjoint subset of the two, and the summit in two subsets is non-conterminous.
Step 2) establish the link forecast model, specifically comprise the following steps that
Step 21) link prediction be conceptualized as, to each relation p with at GpIn entity (s, o), target is to set up one Forecast model Mp(s, o), xs,o=Mp(s, o), described p ∈ P, s ∈ S, o ∈ O, described xs,oIt it is the result of forecast model.
Step 22) use logistic functionIt is used as standards of grading, describedIt is the probability of main body s and object o opening relationships p, describedIt is s Yu o opening relationships p.
Step 3) definition feature based forecast modelWherein spWith opIt is definition respectively Bipartite graph Gp(Sp,OpSubject and object in), s ∈ S, p ∈ P, o ∈ O, described S are host complex, and described O is object set, Described P is the set of relation between subject and object, and described T is transposition operation.Described xs,oIt is the result of forecast model,Value The biggest, spAnd opIn low gt, more for similar, the probability establishing the link relation is the biggest;DescribedDescribed U and V is low dimensional vector, and described IR is real number field, and described K is potential feature Number.Described vector space is such a set, and the most any two elements are added another yuan that may make up in this set Element, arbitrary element obtains another element in this set with Arbitrary Digit after being multiplied, described low-dimensional is that independent parameter number is less.
Step 4) utilize Bayes's personalized ordering algorithm to be optimized, step is as follows:
Step 41) set up order standardDescribed Dp It is training sample set, describedRepresent to exist between s with o and link, describedRepresent not exist between s with o and link, described σ (.) is logistic function, and described log istic function isDescribed a is function variable, describedFor the standardization of data, in order to avoid overfitting, described λ is a constant controlling matching speed, described Θp={ Up,Vp,bp, described Up∈IR|S|×K,Vp∈IR|O|×K,bp∈IR|O|×1, described | S | represents host complex S interior element Number, described | O | represents host complex O interior element number.
Step 42) initialize UpSized by for m × K value be all 0 matrix, initialization VpSized by be all 0 for n × K value Matrix, bpSized by for n × 1 value be all 0 vector.Described n is the element number in object set O, and described m is the theme set Element number in S, the number of the potential feature of described K.
Step 43) for eachIntegrating step 3) updateValue, finally return Return Up, Vp, bpValue, described renewalFormula be:
Described
Described
Described
Described
Described
Described α is a constant constant for Schistosomiasis control speed, and described λ is a constant controlling matching speed.
Step 5) according to step 2) and step 3) calculate the probability of the relation of establishing the linkDescribedIt it is the probability of main body s and object o opening relationships p;.
Step 6) definition prediction threshold values ε, whenTime, then there is relation p between prediction main body s and object o, whenTime, then there is not relation p between prediction main body s and object o.
Beneficial effect: the present invention proposes a kind of knowledge graph based on Bayes's personalized ordering link Forecasting Methodology, specifically Have the beneficial effect that:
1) we achieve one for the probability assessing knowledge graph three main points in link prediction.Specifically, I Used for reference successful method, in commending system field, use the algorithm of knowledge graph, and a prominent benchmark dataset entered Go and assessed thoroughly.
2) we have proposed the Link Recommendation model prediction task that a feature based embeds, and utilize Bayes personalized Optimisation technique based on rank algorithm sets up learning model for each predicate.
3) we apply a linear regression model (LRM) opening up for the induced subgraph of each predicate and original knowledge graph Flutter the relation between structure quantitative analysis precision of prediction.Our research indicate that, such as the index such as cluster coefficients and average degree, can Using use as the index affecting precision of prediction.
Accompanying drawing explanation
Fig. 1 is knowledge graph Link Recommendation method flow based on Bayes's personalization algorithm.
Detailed description of the invention
The link prediction specific embodiment that the present invention uses Bayes's personalized ordering algorithm below is retouched in more detail State.
With reference to the accompanying drawings 1, knowledge graph link problems is defined as bipartite graph, uses the following step that is embodied as:
(1) relation between the main body of Input knowledge figure, object and Subjective and Objective.
(2) bipartite graph G is builtp(Sp,OpIn), with main body spWith object opCollect the vertex set as bipartite graph, main body and visitor Between body, relation p is as the limit collection of bipartite graph.Gp(Sp,Op)=0 represents summit spWith summit opBetween there is not limit p, Gp(sp,op)=1 Represent summit spWith summit opBetween there is limit p, described sp∈Sp,op∈Op
Complete after knowledge graph link problems is defined as bipartite graph, forecast model to be established the link, specifically comprise the following steps that
(1) forecast model of feature based is set up
(2) logistic function is usedIt is used as standards of grading.
In being embodied as,It is worth the biggest, spAnd opIn low gt, more for similar, establish the link relation Probability is the biggest.DescribedDescribed U and V is low dimensional vector, and described IR is real number field, institute Stating the number that K is potential feature, described vector space is such a set, and the most any two elements are added and may make up this set Another interior element, arbitrary element obtains another element in this set with Arbitrary Digit after being multiplied, described low-dimensional is independent ginseng Keep count of less.DescribedIt is the probability of main body s and object o opening relationships p, describedIt is that s Yu o sets up pass It is p.
After link forecast model has been set up, Bayes's personalized ordering algorithm to be utilized is optimized, and is embodied as step Rapid as follows:
(1) U is initializedpSized by for m × K value be all 0 matrix, initialization VpSized by for n × K value be all 0 square Battle array, bpSized by for n × 1 value be all 0 vector.Described n is the element number in object set O, and described m is the theme and gathers S Interior element number, the number of the potential feature of described K.
(2) for eachUpdateValue, finally return to Up, Vp, bpValue, Described renewalFormula be:
Described
Described
Described
Described
Described
Described α is a constant constant for Schistosomiasis control speed, and described λ is a constant controlling matching speed. DescribedIt it is order standard.
Described DpIt is training sample set, describedRepresent to exist between s with o and link, describedRepresent between s and o There is not link, described σ (.) is logistic function, and described logistic function isDescribed a is that function becomes Amount, describedFor the standardization of data, in order to avoid overfitting, described λ is one and controls the normal of matching speed Number, described Θp={ Up,Vp,bp, described Up∈IR|S|×K,Vp∈IR|O|×K,bp∈IR|O|×1, described | S | represents host complex S Interior element number, described | O | represents host complex O interior element number.
In being embodied as, according to the probability of linking relationshipSet prediction threshold values ε, when Time, then there is relation p between prediction main body s and object o, whenTime, then do not exist between prediction main body s and object o Relation p, describedIt it is the probability of main body s and object o opening relationships p.

Claims (4)

1. knowledge graph based on a Bayes's personalized ordering link Forecasting Methodology, it is characterised in that the method includes following step Rapid:
Step 1) set up knowledge graph;
Step 2) establish the link forecast model;
Step 3) definition feature based forecast modelWherein spWith opIt is two points of definition respectively Figure Gp(Sp,OpSubject and object in), s ∈ S, p ∈ P, o ∈ O, described s are main bodys, and described o is object, described p be main body with Relation between object, described S is host complex, and described O is object set, and described P is the set of relation between subject and object, institute Stating T is transposition operation;Described xs,oIt is the result of forecast model,It is worth the biggest, spAnd opIn low gt, more for phase Seemingly, the probability establishing the link relation is the biggest;DescribedDescribed U and V is low dimensional vector, Described IR is real number field, and described K is the number of potential feature;
Step 4) utilize Bayes's personalized ordering algorithm to be optimized;
Step 5) according to step 2) and step 3) calculate the probability of the relation of establishing the linkDescribedIt is Main body s and the probability of object o opening relationships p;
Step 6) definition prediction threshold values ε, whenTime, then there is relation p between prediction main body s and object o, whenTime, then there is not relation p between prediction main body s and object o.
Knowledge graph based on Bayes's personalized ordering the most according to claim 1 link Forecasting Methodology, it is characterised in that Described step 1) set up knowledge graph, specifically comprise the following steps that
Step 11) three elements figure G=(S, P, O) of definition knowledge graph, (s, p, o)=1 represents that main body S and object O exist relation to G (s, p, o)=0 represents that main body S and object O do not exist relation P, wherein s ∈ S, and p ∈ P, o ∈ O, described three elements comprise master for P, G Relation between body, object and Subjective and Objective, described S is host complex, and described O is object set, and described P is to close between subject and object The set of system,
Step 12) definition bipartite graph Gp(Sp,Op), for relation p between subject and object, Gp(Sp,Op)=0 represents main body spWith visitor Body opBetween there is not relation p, Gp(sp,op)=1 represents main body spWith object opBetween there is relation p, described sp∈Sp,op∈Op, institute State the vertex set that bipartite graph is figure and may be partitioned into two mutually disjoint subsets, and two summits that in figure, each edge depends on are all Belonging to the mutually disjoint subset of the two, the summit in two subsets is non-conterminous.
Knowledge graph based on Bayes's personalized ordering the most according to claim 1 link Forecasting Methodology, it is characterised in that Described step 2) establish the link forecast model, specifically comprise the following steps that
Step 21) link prediction be conceptualized as, to each relation p with at GpIn entity (s, o), target is to set up a prediction Model Mp(s, o), xs,o=Mp(s, o), described p ∈ P, s ∈ S, o ∈ O, described xs,oIt it is the result of forecast model;
Step 22) use logistic functionIt is used as standards of grading, described It is the probability of main body s and object o opening relationships p, describedIt is s Yu o opening relationships p.
Knowledge graph based on Bayes's personalized ordering the most according to claim 1 link Forecasting Methodology, it is characterised in that Described step 4) utilize Bayes's personalized ordering algorithm to be optimized, step is as follows:
Step 41) set up order standardDescribed DpIt it is instruction Practice sample set, describedRepresent to exist between s with o and link, describedRepresent not exist between s with o and link, described σ (.) Being logistic function, described logistic function isDescribed a is function variable, describedWith In the standardization of data, in order to avoid overfitting, described λ is a constant controlling matching speed, described Θp={ Up,Vp, bp, described Up∈IR|S|×K,Vp∈IR|O|×K,bp∈IR|O|×1, described | S | represents host complex S interior element number, described | O | Representing host complex O interior element number, described vector space is such a set, and the most any two elements are added and may make up this Another element in set, arbitrary element obtains another element in this set with Arbitrary Digit after being multiplied, described low-dimensional is only Vertical number of parameters is less;
Step 42) initialize UpSized by for m × K value be all 0 matrix, initialization VpSized by for n × K value be all 0 square Battle array, bpSized by for n × 1 value be all 0 vector, described n is the element number in object set O, described m be the theme gather S Interior element number, the number of the potential feature of described K;
Step 43) for eachIntegrating step 3) updateValue, finally return to Up, Vp, bpValue, described renewalFormula be:
Described
Described
Described
Described
Described
Described α is a constant constant for Schistosomiasis control speed, and described λ is a constant controlling matching speed.
CN201610460412.2A 2016-06-22 2016-06-22 Knowledge graph based on Bayes's personalized ordering link Forecasting Methodology Pending CN106156488A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610460412.2A CN106156488A (en) 2016-06-22 2016-06-22 Knowledge graph based on Bayes's personalized ordering link Forecasting Methodology

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610460412.2A CN106156488A (en) 2016-06-22 2016-06-22 Knowledge graph based on Bayes's personalized ordering link Forecasting Methodology

Publications (1)

Publication Number Publication Date
CN106156488A true CN106156488A (en) 2016-11-23

Family

ID=57353622

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610460412.2A Pending CN106156488A (en) 2016-06-22 2016-06-22 Knowledge graph based on Bayes's personalized ordering link Forecasting Methodology

Country Status (1)

Country Link
CN (1) CN106156488A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106682095A (en) * 2016-12-01 2017-05-17 浙江大学 Subjectterm and descriptor prediction and ordering method based on diagram
CN106997488A (en) * 2017-03-22 2017-08-01 扬州大学 A kind of action knowledge extraction method of combination markov decision process
CN109086373A (en) * 2018-07-23 2018-12-25 东南大学 A method of the fair link forecast assessment system of building

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106682095A (en) * 2016-12-01 2017-05-17 浙江大学 Subjectterm and descriptor prediction and ordering method based on diagram
CN106682095B (en) * 2016-12-01 2019-11-08 浙江大学 The prediction of subject description word and sort method based on figure
CN106997488A (en) * 2017-03-22 2017-08-01 扬州大学 A kind of action knowledge extraction method of combination markov decision process
CN109086373A (en) * 2018-07-23 2018-12-25 东南大学 A method of the fair link forecast assessment system of building
CN109086373B (en) * 2018-07-23 2021-01-12 东南大学 Method for constructing fair link prediction evaluation system

Similar Documents

Publication Publication Date Title
Manna et al. A fuzzy rank-based ensemble of CNN models for classification of cervical cytology
Lei et al. Shallow convolutional neural network for image classification
US20180276542A1 (en) Recommendation Result Generation Method and Apparatus
CN111881342A (en) Recommendation method based on graph twin network
WO2023065859A1 (en) Item recommendation method and apparatus, and storage medium
CN109034228B (en) Image classification method based on differential privacy and hierarchical relevance propagation
Bai et al. Weakly supervised object localization via transformer with implicit spatial calibration
CN107239532B (en) Data mining method and device
Nguyen et al. A deep manifold-regularized learning model for improving phenotype prediction from multi-modal data
CN113378938B (en) Edge transform graph neural network-based small sample image classification method and system
CN112529071B (en) Text classification method, system, computer equipment and storage medium
Wu Image retrieval method based on deep learning semantic feature extraction and regularization softmax
CN106156488A (en) Knowledge graph based on Bayes's personalized ordering link Forecasting Methodology
Wang et al. Research on maize disease recognition method based on improved resnet50
Zhang et al. A bagging dynamic deep learning network for diagnosing COVID-19
Li et al. Piecewise convolutional neural networks with position attention and similar bag attention for distant supervision relation extraction
Zhang et al. Skip-attention encoder–decoder framework for human motion prediction
CN110363283A (en) User property prediction technique and relevant apparatus based on deep learning
Aoyagi A Bayesian learning coefficient of generalization error and Vandermonde matrix-type singularities
Jiang et al. On consensus-optimality trade-offs in collaborative deep learning
CN112541128A (en) Personalized news recommendation method based on characteristic bidirectional dynamic cooperation
Yi et al. OCRNet for Diabetic Foot Ulcer Segmentation Combined with Edge Loss
Hannak et al. Efficient graph signal recovery over big networks
Sun et al. Randomized nonlinear two-dimensional principal component analysis network for object recognition
Laskar et al. Artificial Neural Networks and Gene Expression Programing based age estimation using facial features

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20161123