CN107273418A - A kind of across Noumenon property chain inference method based on cloud platform - Google Patents

A kind of across Noumenon property chain inference method based on cloud platform Download PDF

Info

Publication number
CN107273418A
CN107273418A CN201710331029.1A CN201710331029A CN107273418A CN 107273418 A CN107273418 A CN 107273418A CN 201710331029 A CN201710331029 A CN 201710331029A CN 107273418 A CN107273418 A CN 107273418A
Authority
CN
China
Prior art keywords
attribute
reasoning
chain
ontology
network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710331029.1A
Other languages
Chinese (zh)
Inventor
陈华钧
陈曦
张宁豫
吴朝晖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University ZJU
Original Assignee
Zhejiang University ZJU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University ZJU filed Critical Zhejiang University ZJU
Priority to CN201710331029.1A priority Critical patent/CN107273418A/en
Publication of CN107273418A publication Critical patent/CN107273418A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/374Thesaurus
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Evolutionary Computation (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)

Abstract

The present invention discloses a kind of across Noumenon property chain inference method based on cloud platform, including:(1) the knowledge mapping H of unified form is obtained using semantic interlink method;(2) attribute chain inference network C is obtained using knowledge mapping;(3) object and attribute are replaced with into corresponding id, forms knowledge mapping H ', attribute chain inference network C ';(4) MapReduce frameworks, the parallel inference according to C ' to knowledge mapping H ' carry out attribute chains are used, and changes renewal C ';(5) the reasoning results are preserved into hdfs, and added it in knowledge mapping H ';(6) circulation step (4) and (5), (7) merge the reasoning results of successive ignition generation on hdfs untill the reasoning new until not producing is true.This method can support the parallel inference across the mass knowledge of body, with very strong autgmentability, have good practical value for the application of extensive semantic data reasoning.

Description

一种基于云平台的跨本体属性链推理方法A cross-ontology attribute chain reasoning method based on cloud platform

技术领域technical field

本发明涉及计算机语义推理技术,具体涉及一种基于云平台的跨本体属性链推理方法。The invention relates to computer semantic reasoning technology, in particular to a cloud platform-based cross-ontology attribute chain reasoning method.

背景技术Background technique

随着语义网的不断发展,建立在资源描述框架之上的语义web描述语言OWL已被广泛地应用于各个领域的本体建模和推理上,包括生命科学、媒体信息、语义时空数据、社交网络等,各领域的语义数据也随之呈爆炸性增长。以链接开放数据(Linked Open Data)工程为例,它提出了链接数据(Linked data)的概念,其宗旨在于号召人们将现有数据发布成语义链接数据,以此将不同数据源可以互联起来。目前为止它已经包含了超过295个数据源和310亿条三元组记录。With the continuous development of the semantic web, the semantic web description language OWL based on the resource description framework has been widely used in ontology modeling and reasoning in various fields, including life sciences, media information, semantic spatiotemporal data, and social networks. And so on, the semantic data in various fields also showed explosive growth. Take the Linked Open Data project as an example. It proposes the concept of Linked data. Its purpose is to call on people to publish existing data as semantically linked data, so as to interconnect different data sources. So far it has contained more than 295 data sources and 31 billion triple records.

这些海量语义数据之间存在着许多隐含的复杂关联关系,可以通过对已有的语义信息进行推理得到其中潜在的语义信息,这些隐藏的语义关系在实际中有着十分重要的意义。例如:生物医药工作者可以利用语义推理的方法得出药物关联关系从而辅助新药的开发,网站数据分析者可以利用用户信息推理互联起来。There are many hidden complex relationships among these massive semantic data, and the potential semantic information can be obtained by inferring the existing semantic information. These hidden semantic relationships are of great significance in practice. For example, biomedical workers can use semantic reasoning methods to obtain drug associations to assist the development of new drugs, and website data analysts can use user information to reason and connect.

然而,现有的语义推理机往往缺乏良好的可扩展性,仅能对小规模本体进行处理,随着OWL本体数据量的不断增长,上述单机环境下运行的推理引擎由于需要将大量本体数据载入内存,在对大规模跨本体数据进行OWL推理时,存在内存溢出、计算性能和可扩展性不足等问题,传统的语义推理机已经难以处理如此海量的语义信息。另一方面,已提出的一些并行推理技术也不能有效的解决大规模复杂语义推理的问题。语义研究领域迫切需要一个高性能的可处理复杂语义关联的推理引擎来改变这种困境。However, the existing semantic inference engines often lack good scalability and can only process small-scale ontology. With the continuous growth of OWL ontology data, the reasoning engine running in the above-mentioned stand-alone environment needs to load a large amount of ontology data. When performing OWL reasoning on large-scale cross-ontology data, there are problems such as memory overflow, insufficient computing performance and scalability, and traditional semantic reasoning machines have been difficult to handle such massive semantic information. On the other hand, some parallel reasoning techniques that have been proposed cannot effectively solve the problem of large-scale complex semantic reasoning. The field of semantic research urgently needs a high-performance reasoning engine that can handle complex semantic associations to change this dilemma.

发明内容Contents of the invention

有鉴于此,本发明提供了一种基于云平台的跨本体属性链推理方法。相比其他方法,本发明实现了通过属性链推理的方法来高效地发现海量语义数据中存在的复杂语义关联信息,而且具有很强的扩展能力。In view of this, the present invention provides a cross-ontology attribute chain reasoning method based on a cloud platform. Compared with other methods, the present invention realizes efficient discovery of complex semantic association information existing in massive semantic data through attribute chain reasoning, and has strong expansion capability.

一种基于云平台的跨本体属性链推理方法,包括以下步骤:A method for cross-ontology attribute chain reasoning based on a cloud platform, comprising the following steps:

(1)利用语义链接方法对多个领域的本体数据进行融合,得到一个统一格式的知识图谱H;(1) Use the semantic linking method to fuse ontology data in multiple fields to obtain a knowledge graph H in a unified format;

(2)利用知识图谱获得一系列表达单个本体内实体与实体之间关系的推理规则A,并采用语义本体描述语言的Owl2属性链元语进行跨本体实体与实体之间复杂关系的建模,得到表达跨本体实体与实体之间关系的多条属性链推理规则B,A与B形成属性链推理网络C;(2) Use the knowledge graph to obtain a series of inference rules A expressing the relationship between entities in a single ontology, and use the Owl2 attribute chain primitive of the semantic ontology description language to model the complex relationship between cross-ontology entities and entities, Obtain multiple attribute chain inference rules B expressing the relationship between entities across ontology entities, and A and B form an attribute chain inference network C;

(3)将知识图谱与属性链推理网络C中的每一个属性对象分配一个与其对应的属性id号,每一个实体对象分配一个与其对应的实体id号,形成知识图谱H′、属性链推理网络C′;(3) Assign a corresponding attribute id number to each attribute object in the knowledge graph and attribute chain inference network C, and assign a corresponding entity id number to each entity object to form knowledge graph H' and attribute chain inference network C';

(4)运用MapReduce框架,按照属性链推理网络C′对知识图谱H′进行属性链的并行推理,并修改更新属性链推理网络C′;(4) Using the MapReduce framework, according to the attribute chain reasoning network C', perform attribute chain parallel reasoning on the knowledge map H', and modify and update the attribute chain reasoning network C';

(5)将步骤(4)推理得到的一系列结果保存至hdfs中,并将其添加至知识图谱H′中;(5) Save a series of results obtained by reasoning in step (4) into hdfs, and add them to the knowledge map H';

(6)判断本次的推理结果与上一次的推理结果是否一致,若是,执行步骤(7);若否,跳转执行步骤(4);(6) Judging whether the reasoning result of this time is consistent with the reasoning result of the last time, if so, execute step (7); if not, jump to execute step (4);

(7)结束推理,合并hdfs上多次迭代生成的推理结果,并去除推理结果中重复的三元组,然后根据属性映射表和实体映射表,还原成相应的文本三元组,将这个结果作为最后的推理结果返回。(7) End the reasoning, merge the reasoning results generated by multiple iterations on hdfs, and remove the repeated triples in the reasoning results, and then restore the corresponding text triples according to the attribute mapping table and entity mapping table, and convert the result Returned as the final inference result.

单个本体内实体和实体的关系可以通过属性来表达,而在跨本体的推理中,这种简单关系可能演化成为非常复杂的链式关系,利用推理规则可以有效的刻画这些推理关系。The relationship between entities and entities in a single ontology can be expressed by attributes, and in cross-ontology reasoning, this simple relationship may evolve into a very complex chain relationship, and these reasoning relationships can be effectively described by using inference rules.

异构跨领域的本体之间存在着语义鸿沟,利用语义链接方法可以将不同本体中表示相同对象的实体以及关系关联起来,以便进行下一步的推理实施。There is a semantic gap between heterogeneous and cross-domain ontologies. The semantic link method can be used to associate entities and relationships representing the same object in different ontologies for the next step of reasoning and implementation.

在采用语义链接方法进行跨领域本体的语义融合时,通过设计多种相似性特征函数计算实体之间的距离,从而进行实体链接和融合。相似性特征函数为:When the semantic linking method is used for semantic fusion of cross-domain ontology, the distance between entities is calculated by designing a variety of similarity feature functions, so as to carry out entity linking and fusion. The similarity feature function is:

Similarity(X,Y)=Jac(X,Y)+Cos(X,Y),Similarity(X,Y)=Jac(X,Y)+Cos(X,Y),

X,Y分别是两个实体的description描述信息,Jac(X,Y)表示其Jaccard相似性,Cossine(X,Y)表示其余弦相似度,当Similarity(X,Y)大于0.8时,进行实体链接。X, Y are the description description information of the two entities respectively, Jac(X, Y) represents the Jaccard similarity, Cossine(X, Y) represents the cosine similarity, when Similarity(X, Y) is greater than 0.8, the entity Link.

所述的属性链推理规则B可以有效地表达推理关系,为后续的推理方法提供规则输入。所述的属性链推理网络C可以有效地刻画跨本体实体之间的可能关系。通过属性链以及属性链网络对复杂的推理过程进行简化,这不仅可以有效的简化推理复杂性,还可以同时提高推理过程的并行效果,为实施分布式的推理算法提供基础。The attribute chain reasoning rule B can effectively express the reasoning relationship and provide rule input for the subsequent reasoning method. The attribute chain reasoning network C can effectively describe the possible relationship between entities across ontology. The complex reasoning process is simplified through the attribute chain and the attribute chain network, which can not only effectively simplify the reasoning complexity, but also improve the parallel effect of the reasoning process at the same time, and provide a basis for implementing distributed reasoning algorithms.

利用id号替换相应的文本对象与属性对象,这样能够大大地提高推理的效率。Use the id number to replace the corresponding text object and attribute object, which can greatly improve the efficiency of reasoning.

所述步骤(3)的具体过程为:The concrete process of described step (3) is:

构建属性映射表,为每一个属性对象分配一个属性id号;Build an attribute mapping table and assign an attribute id number to each attribute object;

构建实体映射表,为每一个实体对象分配一个实体id号;Build an entity mapping table and assign an entity id number to each entity object;

用属性id替换属性链推理网络C中的属性链对象,形成属性链推理网络C′;Replace the attribute chain object in the attribute chain reasoning network C with the attribute id to form the attribute chain reasoning network C';

遍历知识图谱H中的每一个三元组,用实体id和属性id替换相应的头节点、尾节点和关系节点,形成知识图谱H′。Traverse each triple in the knowledge graph H, and replace the corresponding head node, tail node and relationship node with the entity id and attribute id to form the knowledge graph H'.

步骤(4)中,采用MapReduce框架进行推理可以大大提高推理的可能性和可扩展性,具体的实现过程为:In step (4), using the MapReduce framework for reasoning can greatly improve the possibility and scalability of reasoning. The specific implementation process is as follows:

(4-1)Map阶段:以(行号,三元组)键值对作为输入,输出(链接属性id键值,三元组)键值对;(4-1) Map stage: take (line number, triplet) key-value pair as input, and output (link attribute id key value, triplet) key-value pair;

(4-2)Reduce阶段:以Map阶段输出的(链接属性id键值,三元组)键值对作为本阶段的输入,融合id键值相同的三元组,输出(_,新三元组或待处理的三元组);(4-2) Reduce stage: use the (link attribute id key value, triplet) key-value pair output in the Map stage as the input of this stage, fuse triplets with the same id key value, and output (_, new triplet groups or pending triples);

(4-3)合并更新属性链推理网络C′中相邻的属性链对象,并为属性链对象重新分配一个新id;(4-3) Merge and update the adjacent attribute chain objects in the attribute chain reasoning network C′, and reassign a new id for the attribute chain objects;

(4-4)检查是否有新的三元组输出,若是,跳转执行步骤(4-1),若否,输出推理结果。(4-4) Check whether there is a new triple output, if so, jump to step (4-1), if not, output the reasoning result.

传统的语义推理方法都是基于单机的,面对跨本体的海量语义数据具有明显缺陷;而本发明基于云平台的跨本体属性链推理方法利用了云平台的可扩展的优势,可以处理大规模的语义数据,具体优势体现如下:The traditional semantic reasoning method is based on a stand-alone machine, and has obvious defects in the face of massive semantic data across ontology; however, the cloud platform-based cross-ontology attribute chain reasoning method of the present invention takes advantage of the scalability of the cloud platform and can handle large-scale The specific advantages of semantic data are as follows:

(1)本发明利用属性链以及属性链网络对复杂的跨本体推理过程进行简化建模,克服了传统推理器只能完成既定推理规则的语义推理,使得推理的灵活性和可用性大大增强。(1) The present invention uses attribute chains and attribute chain networks to simplify the modeling of the complex cross-ontology reasoning process, which overcomes the semantic reasoning that traditional reasoners can only complete established reasoning rules, and greatly enhances the flexibility and usability of reasoning.

(2)MapReduce作为一个大规模数据计算框架,便于进行并行算法的处理,同时HDFS也可以为大规模的知识图谱提供存储基础,本发明利用HDFS进行大规模语义数据的存储,同时通过MapReduce并行框架进行跨本体属性链的并行推理,大大提高了处理速度。(2) MapReduce, as a large-scale data computing framework, facilitates the processing of parallel algorithms, and HDFS can also provide a storage basis for large-scale knowledge graphs. The present invention utilizes HDFS to store large-scale semantic data, and simultaneously uses the MapReduce parallel framework Perform parallel reasoning across ontology attribute chains, greatly improving processing speed.

附图说明Description of drawings

图1是本发明基于云平台的跨本体属性链推理方法的流程图;Fig. 1 is the flow chart of the cross-ontology attribute chain reasoning method based on cloud platform of the present invention;

图2是实施例1中跨本体海量生物学知识图谱。FIG. 2 is a cross-ontology massive biological knowledge graph in Example 1.

具体实施方式detailed description

为了更为具体地描述本发明,下面结合附图及具体实施方式对本发明的技术方案进行详细说明。In order to describe the present invention more specifically, the technical solutions of the present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.

参见图1,本发明基于云平台的跨本体属性链推理方法,包括:Referring to Fig. 1, the cross-ontology attribute chain reasoning method based on the cloud platform of the present invention includes:

S01,利用语义链接方法对多个领域的本体数据进行融合,得到一个统一格式的知识图谱H。S01, using the semantic linking method to fuse ontology data in multiple fields to obtain a knowledge graph H in a unified format.

S02,利用知识图谱获得一系列表达单个本体内实体与实体之间关系的推理规则A,并采用语义本体描述语言的Owl2属性链元语进行跨本体实体与实体之间复杂关系的建模,得到表达跨本体实体与实体之间关系的多条属性链推理规则B,A与B形成属性链推理网络C。S02, using the knowledge graph to obtain a series of inference rules A expressing the relationship between entities and entities in a single ontology, and using the Owl2 attribute chain primitive of the semantic ontology description language to model the complex relationship between entities and entities across ontology, and get Multiple attribute chain reasoning rules B expressing the relationship between ontology entities and entities, A and B form attribute chain reasoning network C.

S03,将知识图谱与属性链推理网络C中的每一个属性对象分配一个与其对应的属性id号,每一个实体对象分配一个与其对应的实体id号,形成知识图谱H′、属性链推理网络C′。S03, assign a corresponding attribute id number to each attribute object in the knowledge graph and attribute chain inference network C, and assign a corresponding entity id number to each entity object to form knowledge graph H' and attribute chain inference network C '.

本步骤具体为:This step is specifically:

构建属性映射表,为每一个属性对象分配一个属性id号;Build an attribute mapping table and assign an attribute id number to each attribute object;

构建实体映射表,为每一个实体对象分配一个实体id号;Build an entity mapping table and assign an entity id number to each entity object;

用属性id替换属性链推理网络C中的属性链对象,形成属性链推理网络C′;Replace the attribute chain object in the attribute chain reasoning network C with the attribute id to form the attribute chain reasoning network C';

遍历知识图谱H中的每一个三元组,用实体id和属性id替换相应的头节点、尾节点和关系节点,形成知识图谱H′。Traverse each triple in the knowledge graph H, and replace the corresponding head node, tail node and relationship node with the entity id and attribute id to form the knowledge graph H'.

S04,运用MapReduce框架,按照属性链推理网络C′对知识图谱H′进行属性链的并行推理,并修改更新属性链推理网络C′。S04. Use the MapReduce framework to perform parallel reasoning on attribute chains on the knowledge map H' according to the attribute chain reasoning network C', and modify and update the attribute chain reasoning network C'.

本步骤具体为:This step is specifically:

(4-1)Map阶段:以(行号,三元组)键值对作为输入,输出(链接属性id键值,三元组)键值对;(4-1) Map stage: take (line number, triplet) key-value pair as input, and output (link attribute id key value, triplet) key-value pair;

(4-2)Reduce阶段:以Map阶段输出的(链接属性id键值,三元组)键值对作为本阶段的输入,融合id键值相同的三元组,输出(_,新三元组或待处理的三元组);(4-2) Reduce stage: use the (link attribute id key value, triplet) key-value pair output in the Map stage as the input of this stage, fuse triplets with the same id key value, and output (_, new triplet groups or pending triples);

(4-3)合并更新属性链推理网络C′中相邻的属性链对象,并为属性链对象重新分配一个新id;(4-3) Merge and update the adjacent attribute chain objects in the attribute chain reasoning network C′, and reassign a new id for the attribute chain objects;

(4-4)检查是否有新的三元组输出,若是,跳转执行步骤(4-1),若否,输出推理结果。(4-4) Check whether there is a new triple output, if so, jump to step (4-1), if not, output the reasoning result.

S05,将S04推理得到的一系列结果保存至hdfs中,并将其添加至知识图谱H′中。S05, save a series of results obtained by reasoning in S04 into hdfs, and add them to the knowledge map H'.

S06,判断本次的推理结果与上一次的推理结果是否一致,若是,执行S07;若否,跳转执行S04。S06, judging whether the reasoning result of this time is consistent with the reasoning result of the last time, if yes, go to S07; if not, go to S04.

S07,结束推理,合并hdfs上多次迭代生成的推理结果,并去除推理结果中重复的三元组,然后根据属性映射表和实体映射表,还原成相应的文本三元组,将这个结果作为最后的推理结果返回。S07, end the reasoning, merge the reasoning results generated by multiple iterations on hdfs, and remove the repeated triples in the reasoning results, and then restore the corresponding text triples according to the attribute mapping table and the entity mapping table, and use this result as The final inference result is returned.

实施例1Example 1

本实例首先通过语义链接和融合的方法将多个跨本体的知识库进行语义融合,这里使用跨本体海量生物医学知识图谱为例,如图2所示,该图谱集成了20种不同的知识库,包含接近50亿的三元组数据。将该知识图谱以三元组的形式存储在HDFS文件系统中(如表1),以便进行并行处理。In this example, multiple cross-ontology knowledge bases are semantically fused by means of semantic linking and fusion. Here we use the cross-ontology massive biomedical knowledge graph as an example. As shown in Figure 2, the graph integrates 20 different knowledge bases , containing nearly 5 billion triples of data. The knowledge map is stored in the HDFS file system in the form of triples (as shown in Table 1) for parallel processing.

表1Table 1

Subjectsubject RelationRelation Objectobject Subject_text1Subject_text1 Relation_text1Relation_text1 Object_text1Object_text1 Subject_text2Subject_text2 Relation_text2Relation_text2 Object_text2Object_text2 Subject_text3Subject_text3 Relation_text3Relation_text3 Object_text3Object_text3 Subject_textNSubject_textN Relation_textNRelation_textN Object_textNObject_textN

构建好以上的知识图谱之后,可以通过知识图谱得到一系列推理规则来表达单个本体内实体和实体的关系,对于跨本体之间的实体关系通过多条链路推理规则表达,从而通过构建一个属性链推理网络来有效的进行刻画跨本体实体之间的可能关系。After constructing the above knowledge graph, a series of inference rules can be obtained through the knowledge graph to express the relationship between entities and entities in a single ontology, and the relationship between entities across ontologies can be expressed through multiple link inference rules, so that by constructing an attribute Chain reasoning network to effectively characterize possible relationships between entities across ontology.

随后重写推理网络和知识图谱,并将知识图谱运用MapReduce算法框架按照推理网络进行属性链的并行迭代推理,并修改相应的推理网络以便进行下一轮推理。利用本发明方法完成生物医学跨本体的草药(Herb)和基因(Gene)关联发现的推理,真实的推理结果显示得出的关联实体对具有很高的准确性,同时也具有很高的计算运行效率。Then rewrite the reasoning network and knowledge map, and use the MapReduce algorithm framework to perform parallel iterative reasoning of attribute chains in accordance with the reasoning network, and modify the corresponding reasoning network for the next round of reasoning. Utilize the method of the present invention to complete the reasoning of herb (Herb) and gene (Gene) association discovery across ontology in biomedicine, the real reasoning result shows that the associated entity pair obtained has very high accuracy, and also has very high calculation operation efficiency.

以上所述的具体实施方式对本发明的技术方案和有益效果进行了详细说明,应理解的是以上所述仅为本发明的最优选实施例,并不用于限制本发明,凡在本发明的原则范围内所做的任何修改、补充和等同替换等,均应包含在本发明的保护范围之内。The above-mentioned specific embodiments have described the technical solutions and beneficial effects of the present invention in detail. It should be understood that the above-mentioned are only the most preferred embodiments of the present invention, and are not intended to limit the present invention. Any modifications, supplements and equivalent replacements made within the scope shall be included in the protection scope of the present invention.

Claims (3)

1.一种基于云平台的跨本体属性链推理方法,包括以下步骤:1. A cross-ontology attribute chain reasoning method based on cloud platform, comprising the following steps: (1)利用语义链接方法对多个领域的本体数据进行融合,得到一个统一格式的知识图谱H;(1) Use the semantic linking method to fuse ontology data in multiple fields to obtain a knowledge graph H in a unified format; (2)利用知识图谱获得一系列表达单个本体内实体与实体之间关系的推理规则A,并采用语义本体描述语言的Owl2属性链元语进行跨本体实体与实体之间复杂关系的建模,得到表达跨本体实体与实体之间关系的多条属性链推理规则B,A与B形成属性链推理网络C;(2) Use the knowledge graph to obtain a series of inference rules A expressing the relationship between entities in a single ontology, and use the Owl2 attribute chain primitive of the semantic ontology description language to model the complex relationship between cross-ontology entities and entities, Obtain multiple attribute chain inference rules B expressing the relationship between entities across ontology entities, and A and B form an attribute chain inference network C; (3)将知识图谱与属性链推理网络C中的每一个属性对象分配一个与其对应的属性id号,每一个实体对象分配一个与其对应的实体id号,形成知识图谱H′、属性链推理网络C′;(3) Assign a corresponding attribute id number to each attribute object in the knowledge graph and attribute chain inference network C, and assign a corresponding entity id number to each entity object to form knowledge graph H' and attribute chain inference network C'; (4)运用MapReduce框架,按照属性链推理网络C′对知识图谱H′进行属性链的并行推理,并修改更新属性链推理网络C′;(4) Using the MapReduce framework, according to the attribute chain reasoning network C', perform attribute chain parallel reasoning on the knowledge map H', and modify and update the attribute chain reasoning network C'; (5)将步骤(4)推理得到的一系列结果保存至hdfs中,并将其添加至知识图谱H′中;(5) Save a series of results obtained by reasoning in step (4) into hdfs, and add them to the knowledge map H'; (6)判断本次的推理结果与上一次的推理结果是否一致,若是,执行步骤(7);若否,跳转执行步骤(4);(6) Judging whether the reasoning result of this time is consistent with the reasoning result of the last time, if so, execute step (7); if not, jump to execute step (4); (7)结束推理,合并hdfs上多次迭代生成的推理结果,并去除推理结果中重复的三元组,然后根据属性映射表和实体映射表,还原成相应的文本三元组,将这个结果作为最后的推理结果返回。(7) End the reasoning, merge the reasoning results generated by multiple iterations on hdfs, and remove the repeated triples in the reasoning results, and then restore the corresponding text triples according to the attribute mapping table and entity mapping table, and convert the result Returned as the final inference result. 2.如权利要求1所述的基于云平台的跨本体属性链推理方法,其特征在于,所述步骤(3)的具体过程为:2. the cross-ontology attribute chain reasoning method based on cloud platform as claimed in claim 1, is characterized in that, the concrete process of described step (3) is: 构建属性映射表,为每一个属性对象分配一个属性id号;Build an attribute mapping table and assign an attribute id number to each attribute object; 构建实体映射表,为每一个实体对象分配一个实体id号;Build an entity mapping table and assign an entity id number to each entity object; 用属性id替换属性链推理网络C中的属性链对象,形成属性链推理网络C′;Replace the attribute chain object in the attribute chain reasoning network C with the attribute id to form the attribute chain reasoning network C′; 遍历知识图谱H中的每一个三元组,用实体id和属性id替换相应的头节点、尾节点和关系节点,形成知识图谱H′。Traverse each triple in the knowledge graph H, and replace the corresponding head node, tail node and relationship node with the entity id and attribute id to form the knowledge graph H'. 3.如权利要求1所述的基于云平台的跨本体属性链推理方法,其特征在于,步骤(4)的具体步骤为:3. the cross-ontology attribute chain reasoning method based on cloud platform as claimed in claim 1, is characterized in that, the concrete steps of step (4) are: (4-1)Map阶段:以(行号,三元组)键值对作为输入,输出(链接属性id键值,三元组)键值对;(4-1) Map stage: take (line number, triplet) key-value pair as input, and output (link attribute id key value, triplet) key-value pair; (4-2)Reduce阶段:以Map阶段输出的(链接属性id键值,三元组)键值对作为本阶段的输入,融合id键值相同的三元组,输出(_,新三元组或待处理的三元组);(4-2) Reduce stage: use the (link attribute id key value, triplet) key-value pair output in the Map stage as the input of this stage, fuse triplets with the same id key value, and output (_, new triplet groups or pending triples); (4-3)合并更新属性链推理网络C′中相邻的属性链对象,并为属性链对象重新分配一个新id;(4-3) Merge and update the adjacent attribute chain objects in the attribute chain reasoning network C′, and reassign a new id for the attribute chain objects; (4-4)检查是否有新的三元组输出,若是,跳转执行步骤(4-1),若否,输出推理结果。(4-4) Check whether there is a new triple output, if so, jump to step (4-1), if not, output the reasoning result.
CN201710331029.1A 2017-05-11 2017-05-11 A kind of across Noumenon property chain inference method based on cloud platform Pending CN107273418A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710331029.1A CN107273418A (en) 2017-05-11 2017-05-11 A kind of across Noumenon property chain inference method based on cloud platform

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710331029.1A CN107273418A (en) 2017-05-11 2017-05-11 A kind of across Noumenon property chain inference method based on cloud platform

Publications (1)

Publication Number Publication Date
CN107273418A true CN107273418A (en) 2017-10-20

Family

ID=60074124

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710331029.1A Pending CN107273418A (en) 2017-05-11 2017-05-11 A kind of across Noumenon property chain inference method based on cloud platform

Country Status (1)

Country Link
CN (1) CN107273418A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109840284A (en) * 2018-12-21 2019-06-04 中科曙光南京研究院有限公司 Family's affiliation knowledge mapping construction method and system
CN110119814A (en) * 2019-04-29 2019-08-13 武汉开目信息技术股份有限公司 Knowledge rule modeling and inference method based on object relationship chain
CN113190689A (en) * 2021-05-25 2021-07-30 广东电网有限责任公司广州供电局 Construction method, device, equipment and medium of electric power safety knowledge graph

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105550190A (en) * 2015-06-26 2016-05-04 许昌学院 Knowledge graph-oriented cross-media retrieval system
CN106445913A (en) * 2016-09-06 2017-02-22 中南大学 MapReduce-based semantic inference method and system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105550190A (en) * 2015-06-26 2016-05-04 许昌学院 Knowledge graph-oriented cross-media retrieval system
CN106445913A (en) * 2016-09-06 2017-02-22 中南大学 MapReduce-based semantic inference method and system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
XI CHEN等: "BioTCM-SE: A Semantic Search Engine for the Information Retrieval of Modern Biology and Traditional Chinese Medicine", 《COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE》 *
陈曦等: "一种基于Hadoop的语义大数据分布式推理框架", 《计算机研究与发展》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109840284A (en) * 2018-12-21 2019-06-04 中科曙光南京研究院有限公司 Family's affiliation knowledge mapping construction method and system
CN110119814A (en) * 2019-04-29 2019-08-13 武汉开目信息技术股份有限公司 Knowledge rule modeling and inference method based on object relationship chain
CN110119814B (en) * 2019-04-29 2022-04-29 武汉开目信息技术股份有限公司 Knowledge rule modeling and reasoning method based on object relation chain
CN113190689A (en) * 2021-05-25 2021-07-30 广东电网有限责任公司广州供电局 Construction method, device, equipment and medium of electric power safety knowledge graph
CN113190689B (en) * 2021-05-25 2023-04-18 广东电网有限责任公司广州供电局 Construction method, device, equipment and medium of electric power safety knowledge graph

Similar Documents

Publication Publication Date Title
Angles et al. An introduction to graph data management
Casado et al. Emerging trends and technologies in big data processing
Kumar Kaliyar Graph databases: A survey
CN106611037A (en) Method and device for distributed diagram calculation
CN103279546B (en) Graph data query method
CN103279543A (en) Path mode inquiring system for massive image data
CN107273418A (en) A kind of across Noumenon property chain inference method based on cloud platform
CN109635119B (en) Industrial big data integration system based on ontology fusion
Hu et al. Towards big linked data: a large-scale, distributed semantic data storage
CN106445913A (en) MapReduce-based semantic inference method and system
Kim et al. Scalable owl-horst ontology reasoning using spark
Gombos et al. Spar (k) ql: SPARQL evaluation method on Spark GraphX
KR20150136734A (en) Data parallel inference method and apparatus thereof
Shibla et al. Improving efficiency of DBSCAN by parallelizing kd-tree using spark
Hassan Storage structures in the era of big data: from data warehouse to lakehouse
Urbani et al. WebPIE: a web-scale parallel inference engine
Erraji et al. Toward a smart approach of migration from relational system database to nosql system: Transformation rules of structure
Bouhissi et al. Toward Data Integration in the Era of Big Data: Role of Ontologies
Gupta et al. A new relevance measure for heterogeneous networks
JP5890000B1 (en) Hybrid rule inference apparatus and method
Almukahel et al. Hybrid approach using fuzzy logic and MapReduce to achieve meaningful used big data
Zhao et al. A Methodology for Indexing Temporal RDF Data.
CN113590582A (en) Distributed graph database optimization method and device, electronic equipment and storage medium
Keswani et al. Enhanced approach to attain competent Big Data pre-processing
Xhafa et al. Performance evaluation of mahout clustering algorithms using a twitter streaming dataset

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20171020

RJ01 Rejection of invention patent application after publication