WO2021190091A1 - Knowledge map construction method and device based on knowledge node belonging degree - Google Patents

Knowledge map construction method and device based on knowledge node belonging degree Download PDF

Info

Publication number
WO2021190091A1
WO2021190091A1 PCT/CN2021/071056 CN2021071056W WO2021190091A1 WO 2021190091 A1 WO2021190091 A1 WO 2021190091A1 CN 2021071056 W CN2021071056 W CN 2021071056W WO 2021190091 A1 WO2021190091 A1 WO 2021190091A1
Authority
WO
WIPO (PCT)
Prior art keywords
knowledge
knowledge node
node
target
subordinate
Prior art date
Application number
PCT/CN2021/071056
Other languages
French (fr)
Chinese (zh)
Inventor
董润华
徐国强
Original Assignee
深圳壹账通智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳壹账通智能科技有限公司 filed Critical 深圳壹账通智能科技有限公司
Publication of WO2021190091A1 publication Critical patent/WO2021190091A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology

Definitions

  • This application relates to the field of computer technology, and in particular to a method and device for constructing a knowledge graph based on the degree of belonging of a knowledge node.
  • Knowledge graph is a large-scale semantic network, which uses entities or concepts as knowledge nodes and is connected by semantic relations. By exploring the associations between entities and integrating semi-structured and unstructured data, knowledge graphs can help machines understand data, explain phenomena, and reason about knowledge, thereby discovering deep relationships and realizing intelligent search and intelligent interaction. At present, it is widely used in banking, insurance, securities, courts, logistics, finance and taxation and other fields.
  • the knowledge graph is composed of multiple knowledge nodes, and multiple knowledge nodes are related to each other, and their interrelationships are represented by the lines in the knowledge graph.
  • the inventor realizes that at present, to construct a knowledge graph of a vertical field, a large amount of manual monitoring is required in the knowledge node acquisition stage to ensure that the acquired knowledge node conforms to the current field. For example, based on the seed map, all the knowledge nodes in the vertical domain are in the seed map, and then according to the knowledge node in the vertical domain, go to the encyclopedia page to search for the next knowledge node associated with it; in the search process, according to each encyclopedia page page (target The tag of the knowledge node) jumps to another encyclopedia webpage (the next knowledge node), and each time the number of jumps increases by 1, it may pull out the content of multiple (for example, 1000) encyclopedia webpages, and whether these contents belong to The seed map needs to be manually deleted, selected and identified.
  • a method for constructing a knowledge graph based on the degree of belonging of a knowledge node comprising: obtaining a target knowledge node and a subordinate knowledge node of the target knowledge node; the target knowledge node and the subordinate knowledge node are in a preset domain Knowledge node; according to the path length from the target knowledge node to the subordinate knowledge node, calculate the degree of belonging of the subordinate knowledge node to the target knowledge node; wherein the path length is from the target knowledge node to the The number of intermediate knowledge nodes connected in the subordinate knowledge node is determined; when the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than the threshold, the subordinate knowledge node is deleted, and the information in the preset domain Remaining knowledge nodes; construct a knowledge graph according to the remaining knowledge nodes in the preset domain.
  • a knowledge graph construction device based on the degree of belonging of a knowledge node, the device comprising: a knowledge node selection module for obtaining a target knowledge node and a subordinate knowledge node of the target knowledge node; the target knowledge node and the subordinate knowledge A node is a knowledge node in a preset field; a degree calculation module for calculating the degree of belonging from the subordinate knowledge node to the target knowledge node according to the path length from the target knowledge node to the subordinate knowledge node; wherein , The path length is determined by the number of intermediate knowledge nodes connected from the target knowledge node to the subordinate knowledge node; the node deletion module is used to determine the degree of belonging of the subordinate knowledge node to the target knowledge node When the value is lower than the threshold, delete the subordinate knowledge node and obtain the remaining knowledge nodes in the preset domain; the knowledge graph construction module is used to construct a knowledge graph based on the remaining knowledge nodes in the preset domain.
  • a computer device includes a memory and a processor, the memory stores a computer program, and when the processor executes the computer program, the method for constructing a knowledge graph based on the degree of belonging of a knowledge node is implemented, and the knowledge based on the degree of belonging of a knowledge node
  • the graph construction method includes the following steps: obtaining a target knowledge node and subordinate knowledge nodes of the target knowledge node; the target knowledge node and the subordinate knowledge node are knowledge nodes in a preset domain; The path length of the subordinate knowledge node is calculated, and the degree of belonging of the subordinate knowledge node to the target knowledge node is calculated; wherein, the path length is from the target knowledge node to the intermediate knowledge node connected to the subordinate knowledge node The number is determined; when the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than a threshold, delete the subordinate knowledge node, and obtain the remaining knowledge nodes in the preset domain; according to the preset domain The remaining knowledge nodes of the building a
  • a computer-readable storage medium having a computer program stored thereon, which when executed by a processor realizes the above-mentioned method for constructing a knowledge graph based on the degree of belonging of a knowledge node, the method for constructing a knowledge graph based on the degree of belonging of a knowledge node includes the following Step: Obtain a target knowledge node and subordinate knowledge nodes of the target knowledge node; the target knowledge node and the subordinate knowledge node are knowledge nodes in a preset domain; according to the difference between the target knowledge node and the subordinate knowledge node Path length, calculating the degree of belonging of the subordinate knowledge node to the target knowledge node; wherein the path length is determined by the number of intermediate knowledge nodes connected from the target knowledge node to the subordinate knowledge node; When the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than a threshold, delete the subordinate knowledge node, and obtain the remaining knowledge nodes in the preset domain; construct based on the remaining knowledge nodes in the preset domain
  • This application calculates the degree of belonging of the subordinate knowledge node to the target knowledge node to eliminate the subordinate knowledge nodes with low relevance, ensuring the accuracy of the constructed knowledge graph, and calculates the degree of belonging to the knowledge entry. Streamlined and improved the efficiency of knowledge graph construction.
  • FIG. 1 is a schematic flowchart of a method for constructing a knowledge graph based on the degree of belonging of a knowledge node in an embodiment.
  • Fig. 2 is a schematic flowchart of the steps of obtaining a target knowledge node and a subordinate knowledge node of the target knowledge node in an embodiment.
  • Fig. 3 is a schematic structural diagram of the connection relationship between knowledge nodes and knowledge nodes in a specific embodiment.
  • Figure 4 is a schematic diagram of the entity and entity attributes of a knowledge entry in a specific embodiment.
  • Fig. 5 is a structural block diagram of an apparatus for constructing a knowledge graph based on the degree of belonging of a knowledge node in an embodiment.
  • Fig. 6 is an internal structure diagram of a computer device in an embodiment.
  • the technical solution of this application can be applied to the fields of artificial intelligence, smart city, digital medical care, blockchain and/or big data technology.
  • the data involved in this application such as knowledge node information and/or knowledge graph, can be stored in a database, or can be stored in a blockchain, which is not limited in this application.
  • a method for constructing a knowledge graph based on the degree of belonging of a knowledge node is provided.
  • the method is applied to a server for illustration. It is understandable that the method can also be applied.
  • the terminal it can also be applied to a system including a terminal and a server, and is realized through the interaction between the terminal and the server.
  • the method includes the following steps.
  • Step S110 Obtain a target knowledge node and a subordinate knowledge node of the target knowledge node; the target knowledge node and the subordinate knowledge node are knowledge nodes in a preset domain.
  • the knowledge node is each node that constitutes the knowledge graph, and each knowledge node is connected to each other in the knowledge graph according to the subordination relationship.
  • the knowledge node can be understood as an entity concept, as shown in Figure 3, T1, T2, C1, C2, C3, C4, and C5 are knowledge nodes.
  • the arrows indicate the connection relationship between the knowledge nodes.
  • the current knowledge node points to its subordinate knowledge nodes through the arrow.
  • Knowledge nodes have corresponding knowledge items, and each knowledge item includes the value of the entity and the value of the entity attribute.
  • "currency" in Figure 4 is an entity, and the value of the entity of "currency" and the value of the entity attribute pass through the solid line.
  • the table in the box area 401 is displayed.
  • the elliptical area 402 includes "Chinese name”, “foreign name”, “function”, “essence”... “Affiliation” is the value of the entity, and the dotted area 403 includes "currency”" " “Attribution” is the value of the entity attribute.
  • Step S120 Calculate the degree of belonging of the subordinate knowledge node to the target knowledge node according to the path length from the target knowledge node to the subordinate knowledge node; wherein, the path length is from the target knowledge node to the The number of intermediate knowledge nodes connected in the subordinate knowledge nodes is determined.
  • the subordinate knowledge node is the next level knowledge node, the next level knowledge node... or the next N level knowledge node of the target knowledge node.
  • the degree of belonging of the subordinate knowledge node to the target knowledge node is determined by the length of the path connecting the subordinate knowledge node to the target knowledge node; of course, the degree of belonging of the subordinate knowledge node to the target knowledge node is determined by the number of paths that can be connected with the subordinate knowledge to the target knowledge node . That is, the degree of belonging is used to characterize the degree of association between the subordinate knowledge node and the target knowledge node. The greater the number of intermediate knowledge nodes connected from the target knowledge node to the subordinate knowledge node, the smaller the degree of belonging. The more connection paths from the target knowledge node to the subordinate knowledge node, the greater the degree of belonging.
  • the target knowledge node is A
  • the subordinate knowledge node is D1
  • the path from A to D1 is AB-D1, that is, there is a node B between A and D1
  • another subordinate knowledge node of the target knowledge node A is D2, from A to D1
  • the path of D2 is ABC-D2, that is, there are two nodes B and C between A and D2
  • the degree of belonging of the subordinate knowledge node D1 to the target knowledge node A is greater than the degree of belonging of the subordinate knowledge node D2 to the target knowledge node A.
  • there is one path from target knowledge node A to subordinate knowledge node D1 and there are two paths from target knowledge node A to subordinate knowledge node D2.
  • the degree of belonging of subordinate knowledge node D1 to target knowledge node A is less than that of subordinate knowledge node D2 to target knowledge The degree of belonging of node A.
  • Step S130 when the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than the threshold, delete the subordinate knowledge node, and obtain the remaining knowledge nodes in the preset domain.
  • the threshold value needs to be set. The higher the threshold value is, the stricter the condition for filtering subordinate knowledge nodes. Of course, the threshold value cannot be set too high to prevent subordinate knowledge nodes with high correlation with the target knowledge node from being filtered out. .
  • the degree of belonging of the subordinate knowledge node to the target knowledge node is less than the threshold, the subordinate knowledge node does not belong to the domain determined by the target knowledge node and needs to be eliminated.
  • the degree of belonging from the subordinate node to the target knowledge node is not less than a threshold, the subordinate knowledge node belongs to the domain determined by the target knowledge node.
  • the remaining knowledge nodes are knowledge nodes that are determined from the preset knowledge items, and subordinate knowledge nodes that do not meet the requirements are removed.
  • the non-conformity means that the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than Threshold.
  • Step S140 construct a knowledge graph according to the remaining knowledge nodes in the preset domain.
  • the corresponding knowledge items can be obtained, the corresponding knowledge items can be obtained through the knowledge nodes, and the value of the entity and the value of the entity attribute of the knowledge item can be obtained to complete the construction of the knowledge graph. .
  • the value of the entity and the value of the entity attribute of the knowledge item can be obtained from the encyclopedia database, and the value of the entity and the value of the entity attribute of the knowledge item can also be collected from the web page.
  • the above-mentioned knowledge graph construction method based on the degree of belonging of the knowledge node, by calculating the degree of belonging of the subordinate knowledge node to the target knowledge node, removes the subordinate knowledge node with low degree of relevance, ensuring the accuracy of the constructed knowledge graph, and passes
  • the method of calculating the degree of belonging is used to streamline knowledge items, which improves the efficiency of knowledge graph construction.
  • the step S110 includes: obtaining the association relationship between the knowledge node and the knowledge node according to the encyclopedia database; obtaining the target in the preset field according to the association relationship between the knowledge node and the knowledge node Knowledge node and subordinate knowledge node.
  • the knowledge node corresponds to the knowledge item.
  • the knowledge node is obtained according to the encyclopedia database, and the knowledge item is first obtained from the encyclopedia database.
  • the encyclopedia database is a database with open source data tables such as Wikipedia, and the knowledge in the data table is obtained through the open source data table interface
  • the relationship between items and knowledge items, the encyclopedia database includes knowledge items in multiple fields.
  • the association relationship between knowledge items is mapped to the association relationship between knowledge nodes.
  • the composition of the knowledge item and the association relationship between the knowledge item is structured data, and the structured data is data with a fixed pattern and subordination relationship, such as data in a database.
  • Semi-structured data data with a basic fixed structure pattern, such as log files, XML documents, JSON documents, etc.
  • Unstructured data data without a fixed pattern, such as document data such as WORD, PDF, PPT, EXCEL, etc., pictures and videos in various formats.
  • the target knowledge node may be the largest knowledge concept in the preset field, and the subordinate knowledge node is a small category or sub-category under the target knowledge node.
  • the target knowledge node is finance, it may exist with the finance.
  • the subordinate knowledge nodes of subordinate relationship include currency, RMB, and U.S. dollar.
  • step S120 includes: step S121, obtaining intermediate knowledge nodes on the connection path of the target knowledge node to the subordinate knowledge node; step S122, calculating the target knowledge node and The number of knowledge nodes that the intermediate knowledge node leads to connections; step S123, calculate the reciprocal of the number of knowledge nodes that lead to connections, to obtain the link weights of the target knowledge node and the intermediate knowledge nodes; step S124, According to the link weight of the target knowledge node and the intermediate knowledge node, the degree of belonging of the subordinate knowledge node to the target knowledge node is calculated.
  • the knowledge node and the knowledge node are connected by edges.
  • T1, T2, C1, C2, C3, C4, and C5 are knowledge nodes
  • the arrows from T1 to C3, C4 are the edges of the knowledge node
  • T2 to C1 The arrows of C4, C5 are the edges of the knowledge node
  • the arrows from C3 to C1, C2 are the edges of the knowledge node
  • the arrows from C4 to C1 are the edges of the knowledge node
  • the intermediate knowledge node is on a connection path except for the target knowledge
  • a connection path T1-C3-C2 T1 is the target knowledge node
  • C2 is the subordinate knowledge node
  • the intermediate knowledge node is not C3.
  • the number of knowledge nodes derived from the knowledge node is equal to the number of edges derived from the knowledge node. For example, as shown in Figure 3, two edges are derived from T1, then the connected knowledge node derived from T1 The number of is 2.
  • each knowledge node On the connection path from the target knowledge node to the subordinate knowledge node, except for the subordinate knowledge node, each knowledge node counts the number of knowledge nodes that lead to the connection, and calculates the link weight of the knowledge node according to the number of knowledge nodes that lead to the connection.
  • the link weight is equal to the reciprocal of the number of knowledge nodes that lead to connections. For example, as shown in Figure 3, C3 leads to knowledge nodes C2 and C1, and the number of knowledge nodes that C3 leads to connections is 2, then the link weight of C3 is 1/2 .
  • This implementation eliminates sub-knowledge nodes with low relevance by calculating the degree of belonging of the subordinate knowledge node to the target knowledge node, without manual participation in the construction of the graph, and the degree of belonging threshold can be modified to change the recall rate and accuracy of the domain Spend.
  • This application can specify different fields, and can quickly build a knowledge map of vertical fields.
  • the step S134 includes: calculating the product of the link weights of the target knowledge node and the intermediate knowledge node of each connection path to obtain the weight of each connection path; the connection path Is the connection path from the subordinate knowledge node to the target knowledge node; calculates the sum of the weights of all the connection paths to obtain the degree of belonging of the subordinate knowledge node to the target knowledge node.
  • T1, T2, C1, C2, C3, C4, and C5 are nodes, and the number of knowledge nodes connected to each knowledge node is equal to the number of knowledge nodes derived from it.
  • the number of arrows such as the number of T2 leading arrows is 3, the number of T1 leading arrows is 2, the number of C3 leading arrows is 2, the number of C4 leading arrows is 1, and the link weight of T2 is equal to 1/3 ,
  • the link weight of T1 is equal to 1/2
  • the link weight of C3 is equal to 1/2
  • the link weight of C4 is equal to 1; there are two paths from T2 to C1, one of which must pass through C4, then the path T2, C4, C1
  • the subordinate knowledge node is an end knowledge node; the degree of belonging of the subordinate knowledge node to the target knowledge node is calculated according to the path length from the target knowledge node to the subordinate knowledge node , Including: calculating the degree of belonging of the end knowledge node to the target knowledge node according to the path length from the target knowledge node to the end knowledge node.
  • the end knowledge node is a knowledge node with no edges or zero edges. Since the end knowledge node is connected with the target knowledge node through multi-level knowledge nodes, the correlation between the end knowledge node and the target knowledge node is relatively small or not at all Associated, the end knowledge node needs to be pruned at this time to realize the optimization of the knowledge graph.
  • the method for constructing a knowledge graph based on the degree of belonging of a knowledge node further includes: judging whether the degree of belonging of the subordinate knowledge node to the target knowledge node is less than a threshold.
  • the threshold value can be set as required, so as to expand and reduce the classification of the field. In one of the embodiments, the threshold can be set to 0.1.
  • step S140 includes: obtaining remaining knowledge nodes in the preset domain; according to the remaining knowledge nodes, obtaining corresponding information boxes through crawler technology to construct knowledge items corresponding to the remaining knowledge nodes The value of the entity and the value of the entity attribute.
  • the corresponding information box is obtained in the Internet page by crawling technology.
  • the knowledge item is "currency”
  • the entity and entity attributes are constructed for currency
  • the value of the currency entity and the value of entity attributes are obtained through crawler technology.
  • the first step is to splice the request url such as: https://baike.baidu .com/item/currency
  • the second step uses xpath to obtain the information box data (the content in the solid line box 401 in Figure 4)
  • the third step saves the obtained information box data to the mongodb database.
  • structured data can be obtained by using crawler technology without parsing semi-structured data (news portal), which improves the efficiency of constructing a knowledge graph of vertical domains.
  • a device for constructing a knowledge graph based on the degree of belonging of a knowledge node including: a knowledge node selection module 210, a degree of belonging calculation module 220, a node deletion module 230, and a knowledge graph constructing module 240.
  • the knowledge node selection module 210 is configured to obtain a target knowledge node and subordinate knowledge nodes of the target knowledge node; the target knowledge node and the subordinate knowledge node are knowledge nodes in a preset domain.
  • the degree of belonging calculation module 220 is configured to calculate the degree of belonging from the subordinate knowledge node to the target knowledge node according to the length of the path from the target knowledge node to the subordinate knowledge node; wherein, the path length is determined by the target knowledge node. The number of intermediate knowledge nodes connected from the knowledge node to the subordinate knowledge node is determined.
  • the node deletion module 230 is configured to delete the subordinate knowledge node when the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than a threshold value, and obtain the remaining knowledge nodes in the preset domain.
  • the knowledge graph construction module 240 is configured to construct a knowledge graph according to the remaining knowledge nodes in the preset domain.
  • the knowledge node selection module 210 includes: an association relationship acquisition unit for acquiring an association relationship between a knowledge node and a knowledge node according to an encyclopedia database; a knowledge node acquisition unit for acquiring an association relationship between a knowledge node and a knowledge node according to the knowledge The relationship between the node and the knowledge node is to obtain the target knowledge node and the subordinate knowledge node in the preset field.
  • the membership degree calculation module 220 includes: an intermediate knowledge node acquisition module for acquiring intermediate knowledge nodes on the connection path between the target knowledge node and the subordinate knowledge node; calculating the number of connected knowledge nodes A unit for calculating the number of knowledge nodes that are derived from the target knowledge node and the intermediate knowledge node; a link weight calculation unit for calculating the reciprocal of the number of knowledge nodes that are derived from the intermediate knowledge node to obtain the target The link weight of the knowledge node and the intermediate knowledge node; a membership degree calculation unit for calculating the membership degree of the subordinate knowledge node to the target knowledge node according to the link weight of the target knowledge node and the intermediate knowledge node .
  • the belonging degree calculation unit includes: a weight calculation subunit for calculating the product of the link weights of the target knowledge node and the intermediate knowledge node of each connection path to obtain each of the The weight of the connection path; the connection path is the connection path from the subordinate knowledge node to the target knowledge node; the membership degree calculation subunit is used to calculate the sum of the weights of all the connection paths to obtain the subordinate knowledge node The degree of belonging to the target knowledge node.
  • the subordinate knowledge node is an end knowledge node; the membership degree calculation module 220 is further configured to calculate the end knowledge node according to the path length from the target knowledge node to the end knowledge node The degree of belonging to the target knowledge node.
  • the knowledge graph construction device further includes: a judging module, configured to judge whether the degree of belonging of the subordinate knowledge node to the target knowledge node is less than a threshold; wherein the threshold is 0.1.
  • the knowledge graph construction module 240 includes: a remaining knowledge node acquiring unit, configured to acquire remaining knowledge nodes in the preset domain; a value construction unit, configured to pass through the remaining knowledge nodes
  • the crawler technology obtains the corresponding information box to construct the value of the entity and the value of the entity attribute of the knowledge item corresponding to the remaining knowledge node.
  • Each module in the apparatus for constructing a knowledge graph based on the degree of belonging of a knowledge node can be implemented in whole or in part by software, hardware, and a combination thereof.
  • the above-mentioned modules may be embedded in the form of hardware or independent of the processor in the computer equipment, or may be stored in the memory of the computer equipment in the form of software, so that the processor can call and execute the operations corresponding to the above-mentioned modules.
  • a computer device is provided.
  • the computer device may be a server, and its internal structure diagram may be as shown in FIG. 6.
  • the computer equipment includes a processor, a memory, and a network interface connected through a system bus.
  • the processor of the computer device is used to provide calculation and control capabilities.
  • the memory of the computer device includes a non-volatile storage medium and an internal memory.
  • the non-volatile storage medium stores an operating system, a computer program, and a database.
  • the internal memory provides an environment for the operation of the operating system and computer programs in the non-volatile storage medium.
  • the database of the computer device is used to store the data of the knowledge item and the association relationship between the knowledge item.
  • the network interface of the computer device is used to communicate with an external terminal through a network connection.
  • FIG. 6 is only a block diagram of part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied.
  • the specific computer device may Including more or fewer parts than shown in the figure, or combining some parts, or having a different arrangement of parts.
  • a computer device including a memory and a processor, and a computer program is stored in the memory.
  • the processor executes the computer program, the method for constructing a knowledge graph based on the degree of belonging of a knowledge node is implemented, which is based on the knowledge node
  • the method for constructing the knowledge graph of the degree of belonging includes the following steps: obtaining a target knowledge node and subordinate knowledge nodes of the target knowledge node; the target knowledge node and the subordinate knowledge node are knowledge nodes in a preset domain; according to the target The length of the path from the knowledge node to the subordinate knowledge node is calculated, and the degree of belonging of the subordinate knowledge node to the target knowledge node is calculated; wherein, the path length is determined by the connection between the target knowledge node and the subordinate knowledge node The number of intermediate knowledge nodes is determined; when the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than a threshold, the subordinate knowledge node is deleted, and the remaining knowledge nodes
  • the processor further implements the following steps when executing the computer program: judging whether the degree of belonging of the subordinate knowledge node to the target knowledge node is less than a threshold; wherein the threshold is 0.1.
  • a computer-readable storage medium on which a computer program is stored.
  • the knowledge graph construction method includes the following steps: acquiring a target knowledge node and subordinate knowledge nodes of the target knowledge node; the target knowledge node and the subordinate knowledge node are knowledge nodes in a preset domain; The path length of the subordinate knowledge node calculates the degree of belonging of the subordinate knowledge node to the target knowledge node; wherein the path length is from the target knowledge node to the intermediate knowledge node connected to the subordinate knowledge node The number of is determined; when the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than the threshold, the subordinate knowledge node is deleted, and the remaining knowledge nodes in the preset domain are obtained; according to the preset domain The remaining knowledge nodes inside construct a knowledge graph.
  • the following steps are further implemented: judging whether the degree of belonging of the subordinate knowledge node to the target knowledge node is less than a threshold; wherein the threshold is 0.1.
  • the storage medium involved in this application such as a computer-readable storage medium, may be non-volatile or volatile.
  • Non-volatile memory may include read-only memory (Read-Only Memory, ROM), magnetic tape, floppy disk, flash memory, or optical memory, etc.
  • Volatile memory may include Random Access Memory (RAM) or external cache memory.
  • RAM can be in various forms, such as static random access memory (SRAM) or dynamic random access memory (Dynamic Random Access Memory, DRAM).

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A knowledge map construction method and device based on a knowledge node belonging degree. The method comprises: obtaining a target knowledge node and a slave knowledge node of the target knowledge node (S110), wherein the target knowledge node and the slave knowledge node are knowledge nodes in a preset field; calculating a belonging degree from the slave knowledge node to the target knowledge node according to a path length from the target knowledge node to the slave knowledge node (S120), wherein the path length is determined by the number of intermediate knowledge nodes connected from the target knowledge node to the slave knowledge node; when the belonging degree from the slave knowledge node to the target knowledge node is lower than a threshold, deleting the slave knowledge node, and obtaining the remaining knowledge nodes in the preset field (S130); and constructing a knowledge map according to the remaining knowledge nodes in the preset field (S140). The use of the method can improve the construction efficiency of the knowledge map.

Description

基于知识节点所属度的知识图谱构建方法和装置Method and device for constructing knowledge graph based on degree of belonging of knowledge node
本申请要求于2020年3月26日提交中国专利局、申请号为202010224365.8,发明名称为“基于知识节点所属度的知识图谱构建方法和装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on March 26, 2020, the application number is 202010224365.8, and the invention title is "Knowledge Graph Construction Method and Apparatus Based on the Degree of Knowledge Node Ownership", the entire content of which is incorporated by reference Incorporated in this application.
技术领域Technical field
本申请涉及计算机技术领域,特别是涉及一种基于知识节点所属度的知识图谱构建方法和装置。This application relates to the field of computer technology, and in particular to a method and device for constructing a knowledge graph based on the degree of belonging of a knowledge node.
背景技术Background technique
知识图谱是一种大规模语义网络,以实体或者概念作为知识节点,通过语义关系相连接。通过发掘实体之间的关联,将半结构化、非结构化的数据整合,知识图谱可以帮助机器理解数据、解释现象、知识推理,从而发掘深层关系、实现智慧搜索与智能交互。目前广泛应用在银行、保险、证券、法院、物流、财税等领域。知识图谱由多个知识节点构成,并且多个知识节点相互关联,其互相关联关系通过知识图谱中的连线表示。发明人意识到,目前,构建垂直领域的知识图谱,需要在知识节点获取阶段进行大量人工监控,以确保获取的知识节点符合当前领域。例如,基于种子图谱,种子图谱中都是垂直领域的知识节点,再根据垂直领域的知识节点去百科网页中搜索与其关联的下一知识节点;在搜索过程中,根据每个百科网页页面(目标知识节点)的标签tag跳转到另一个百科网页(下一知识节点),跳转数每加1,可能拉出新扩充的多条(例如1000条)百科网页的内容,而这些内容是否属于种子图谱需要人工进行删选识别。Knowledge graph is a large-scale semantic network, which uses entities or concepts as knowledge nodes and is connected by semantic relations. By exploring the associations between entities and integrating semi-structured and unstructured data, knowledge graphs can help machines understand data, explain phenomena, and reason about knowledge, thereby discovering deep relationships and realizing intelligent search and intelligent interaction. At present, it is widely used in banking, insurance, securities, courts, logistics, finance and taxation and other fields. The knowledge graph is composed of multiple knowledge nodes, and multiple knowledge nodes are related to each other, and their interrelationships are represented by the lines in the knowledge graph. The inventor realizes that at present, to construct a knowledge graph of a vertical field, a large amount of manual monitoring is required in the knowledge node acquisition stage to ensure that the acquired knowledge node conforms to the current field. For example, based on the seed map, all the knowledge nodes in the vertical domain are in the seed map, and then according to the knowledge node in the vertical domain, go to the encyclopedia page to search for the next knowledge node associated with it; in the search process, according to each encyclopedia page page (target The tag of the knowledge node) jumps to another encyclopedia webpage (the next knowledge node), and each time the number of jumps increases by 1, it may pull out the content of multiple (for example, 1000) encyclopedia webpages, and whether these contents belong to The seed map needs to be manually deleted, selected and identified.
然而,发明人发现,现有垂直领域的知识图谱,存在与所属领域不相关的知识节点,导致知识图谱在应用时容易推导出错误的结论。However, the inventor found that the existing knowledge graphs of vertical fields have knowledge nodes that are not related to the field, which makes it easy to derive wrong conclusions when applying the knowledge graphs.
技术问题technical problem
基于此,有必要针对上述技术问题,提供一种能够提高知识图谱构建效率的基于知识节点所属度的知识图谱构建方法、装置、计算机设备和存储介质。Based on this, it is necessary to address the above technical problems and provide a method, device, computer equipment and storage medium for constructing a knowledge graph based on the degree of belonging of a knowledge node that can improve the efficiency of constructing a knowledge graph.
技术解决方案Technical solutions
一种基于知识节点所属度的知识图谱构建方法,所述方法包括:获取目标知识节点和所述目标知识节点的从属知识节点;所述目标知识节点和所述从属知识节点为预设领域内的知识节点;根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度;其中,所述路径长度由所述目标知识节点到所述从属知识节点中所连接的中间知识节点的个数确定;在所述从属知识节点到所述目标知识节点的所属度低于阈值时,删除所述从属知识节点,获取所述预设领域内的剩余知识节点;根据所述预设领域内的剩余知识节点构建知识图谱。A method for constructing a knowledge graph based on the degree of belonging of a knowledge node, the method comprising: obtaining a target knowledge node and a subordinate knowledge node of the target knowledge node; the target knowledge node and the subordinate knowledge node are in a preset domain Knowledge node; according to the path length from the target knowledge node to the subordinate knowledge node, calculate the degree of belonging of the subordinate knowledge node to the target knowledge node; wherein the path length is from the target knowledge node to the The number of intermediate knowledge nodes connected in the subordinate knowledge node is determined; when the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than the threshold, the subordinate knowledge node is deleted, and the information in the preset domain Remaining knowledge nodes; construct a knowledge graph according to the remaining knowledge nodes in the preset domain.
一种基于知识节点所属度的知识图谱构建装置,所述装置包括:知识节点选取模块,用于获取目标知识节点和所述目标知识节点的从属知识节点;所述目标知识节点和所述从属知识节点为预设领域内的知识节点;所属度计算模块,用于根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度;其中,所述路径长度由所述目标知识节点到所述从属知识节点中所连接的中间知识节点的个数确定;节点删除模块,用于在所述从属知识节点到所述目标知识节点的所属度低于阈值时,删除所述从属知识节点,获取所述预设领域内的剩余知识节点;知识图谱构建模块,用于根据所述预设领域内的剩余知识节点构建知识图谱。A knowledge graph construction device based on the degree of belonging of a knowledge node, the device comprising: a knowledge node selection module for obtaining a target knowledge node and a subordinate knowledge node of the target knowledge node; the target knowledge node and the subordinate knowledge A node is a knowledge node in a preset field; a degree calculation module for calculating the degree of belonging from the subordinate knowledge node to the target knowledge node according to the path length from the target knowledge node to the subordinate knowledge node; wherein , The path length is determined by the number of intermediate knowledge nodes connected from the target knowledge node to the subordinate knowledge node; the node deletion module is used to determine the degree of belonging of the subordinate knowledge node to the target knowledge node When the value is lower than the threshold, delete the subordinate knowledge node and obtain the remaining knowledge nodes in the preset domain; the knowledge graph construction module is used to construct a knowledge graph based on the remaining knowledge nodes in the preset domain.
一种计算机设备,包括存储器和处理器,所述存储器存储有计算机程序,所述处理器执行所述计算机程序时实现上述基于知识节点所属度的知识图谱构建方法,该基于知识节点所属度的知识图谱构建方法包括以下步骤:获取目标知识节点和所述目标知识节点的从属知识节点;所述目标知识节点和所述从属知识节点为预设领域内的知识节点;根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度;其中,所述路径长度由所述目标知识节点到所述从属知识节点中所连接的中间知识节点的个数确定;在所述从属知识节点到所述目标知识节点的所属度低于阈值时,删除所述从属知识节点,获取所述预设领域内的剩余知识节点;根据所述预设领域内的剩余知识节点构建知识图谱。A computer device includes a memory and a processor, the memory stores a computer program, and when the processor executes the computer program, the method for constructing a knowledge graph based on the degree of belonging of a knowledge node is implemented, and the knowledge based on the degree of belonging of a knowledge node The graph construction method includes the following steps: obtaining a target knowledge node and subordinate knowledge nodes of the target knowledge node; the target knowledge node and the subordinate knowledge node are knowledge nodes in a preset domain; The path length of the subordinate knowledge node is calculated, and the degree of belonging of the subordinate knowledge node to the target knowledge node is calculated; wherein, the path length is from the target knowledge node to the intermediate knowledge node connected to the subordinate knowledge node The number is determined; when the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than a threshold, delete the subordinate knowledge node, and obtain the remaining knowledge nodes in the preset domain; according to the preset domain The remaining knowledge nodes of the building a knowledge graph.
一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现上述基于知识节点所属度的知识图谱构建方法,该基于知识节点所属度的知识图谱构建方法包括以下步骤:获取目标知识节点和所述目标知识节点的从属知识节点;所述目标知识节点和所述从属知识节点为预设领域内的知识节点;根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度;其中,所述路径长度由所述目标知识节点到所述从属知识节点中所连接的中间知识节点的个数确定;在所述从属知识节点到所述目标知识节点的所属度低于阈值时,删除所述从属知识节点,获取所述预设领域内的剩余知识节点;根据所述预设领域内的剩余知识节点构建知识图谱。A computer-readable storage medium having a computer program stored thereon, which when executed by a processor realizes the above-mentioned method for constructing a knowledge graph based on the degree of belonging of a knowledge node, the method for constructing a knowledge graph based on the degree of belonging of a knowledge node includes the following Step: Obtain a target knowledge node and subordinate knowledge nodes of the target knowledge node; the target knowledge node and the subordinate knowledge node are knowledge nodes in a preset domain; according to the difference between the target knowledge node and the subordinate knowledge node Path length, calculating the degree of belonging of the subordinate knowledge node to the target knowledge node; wherein the path length is determined by the number of intermediate knowledge nodes connected from the target knowledge node to the subordinate knowledge node; When the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than a threshold, delete the subordinate knowledge node, and obtain the remaining knowledge nodes in the preset domain; construct based on the remaining knowledge nodes in the preset domain Knowledge graph.
有益效果Beneficial effect
本申请通过计算从属知识节点到目标知识节点的所属度,来对关联度低的从属知识节点进行剔除,保证了构建的知识图谱的准确性,并且,通过计算所属度的方式来对知识条目进行精简,提高了知识图谱构建效率。This application calculates the degree of belonging of the subordinate knowledge node to the target knowledge node to eliminate the subordinate knowledge nodes with low relevance, ensuring the accuracy of the constructed knowledge graph, and calculates the degree of belonging to the knowledge entry. Streamlined and improved the efficiency of knowledge graph construction.
附图说明Description of the drawings
图1为一个实施例中基于知识节点所属度的知识图谱构建方法的流程示意图。FIG. 1 is a schematic flowchart of a method for constructing a knowledge graph based on the degree of belonging of a knowledge node in an embodiment.
图2为一个实施例中获取目标知识节点和所述目标知识节点的从属知识节点步骤的流程示意图。Fig. 2 is a schematic flowchart of the steps of obtaining a target knowledge node and a subordinate knowledge node of the target knowledge node in an embodiment.
图3为一个具体实施例中知识节点与知识节点连接关系的结构示意图。Fig. 3 is a schematic structural diagram of the connection relationship between knowledge nodes and knowledge nodes in a specific embodiment.
图4为一个具体实施例中知识条目的实体和实体属性的示意图。Figure 4 is a schematic diagram of the entity and entity attributes of a knowledge entry in a specific embodiment.
图5为一个实施例中基于知识节点所属度的知识图谱构建装置的结构框图。Fig. 5 is a structural block diagram of an apparatus for constructing a knowledge graph based on the degree of belonging of a knowledge node in an embodiment.
图6为一个实施例中计算机设备的内部结构图。Fig. 6 is an internal structure diagram of a computer device in an embodiment.
本发明的实施方式Embodiments of the present invention
为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。In order to make the purpose, technical solutions, and advantages of this application clearer and clearer, the following further describes the application in detail with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present application, and are not used to limit the present application.
本申请的技术方案可应用于人工智能、智慧城市、数字医疗、区块链和/或大数据技术领域。可选的,本申请涉及的数据如知识节点的信息和/或知识图谱等可存储于数据库中,或者可以存储于区块链中,本申请不做限定。The technical solution of this application can be applied to the fields of artificial intelligence, smart city, digital medical care, blockchain and/or big data technology. Optionally, the data involved in this application, such as knowledge node information and/or knowledge graph, can be stored in a database, or can be stored in a blockchain, which is not limited in this application.
在一个实施例中,如图1所示,提供了一种基于知识节点所属度的知识图谱构建方法,本实施例以该方法应用于服务器进行举例说明,可以理解的是,该方法也可以应用于终端,还可以应用于包括终端和服务器的系统,并通过终端和服务器的交互实现。本实施例中,该方法包括以下步骤。In one embodiment, as shown in FIG. 1, a method for constructing a knowledge graph based on the degree of belonging of a knowledge node is provided. In this embodiment, the method is applied to a server for illustration. It is understandable that the method can also be applied. For the terminal, it can also be applied to a system including a terminal and a server, and is realized through the interaction between the terminal and the server. In this embodiment, the method includes the following steps.
步骤S110,获取目标知识节点和所述目标知识节点的从属知识节点;所述目标知识节点和所述从属知识节点为预设领域内的知识节点。Step S110: Obtain a target knowledge node and a subordinate knowledge node of the target knowledge node; the target knowledge node and the subordinate knowledge node are knowledge nodes in a preset domain.
其中,知识节点是构成知识图谱的每一个节点,每个知识节点在知识图谱中根据从属关系相互连接,知识节点可以理解为一个实体概念,如图3所示,T1、T2、C1、C2、C3、C4和C5为知识节点,箭头表示知识节点之间的连接关系,当前知识节点通过箭头指向其从属知识节点。知识节点具有对应的知识条目,每个知识条目包括实体的值和实体属性的值,例如,图4中的“货币”为一个实体,“货币”的实体的值和实体属性的值通过实线框区域401内的表格进行展示,椭圆区域402内包括“中文名”“外文名”“作用”“本质”……“归属”都为实体的值,虚线区域403内包括“货币”……“归属”都为实体属性的值。Among them, the knowledge node is each node that constitutes the knowledge graph, and each knowledge node is connected to each other in the knowledge graph according to the subordination relationship. The knowledge node can be understood as an entity concept, as shown in Figure 3, T1, T2, C1, C2, C3, C4, and C5 are knowledge nodes. The arrows indicate the connection relationship between the knowledge nodes. The current knowledge node points to its subordinate knowledge nodes through the arrow. Knowledge nodes have corresponding knowledge items, and each knowledge item includes the value of the entity and the value of the entity attribute. For example, "currency" in Figure 4 is an entity, and the value of the entity of "currency" and the value of the entity attribute pass through the solid line. The table in the box area 401 is displayed. The elliptical area 402 includes "Chinese name", "foreign name", "function", "essence"... "Affiliation" is the value of the entity, and the dotted area 403 includes "currency"..." "Attribution" is the value of the entity attribute.
步骤S120,根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度;其中,所述路径长度由所述目标知识节点到所述从属知识节点中所连接的中间知识节点的个数确定。Step S120: Calculate the degree of belonging of the subordinate knowledge node to the target knowledge node according to the path length from the target knowledge node to the subordinate knowledge node; wherein, the path length is from the target knowledge node to the The number of intermediate knowledge nodes connected in the subordinate knowledge nodes is determined.
其中,所述从属知识节点为目标知识节点的下一级知识节点、下二级知识节点……或者下N级知识节点。从属知识节点到目标知识节点的所属度与从属知识节点连接到目标知识节点的路径长度决定;当然,从属知识节点到目标知识节点的所属度与可与从属知识到目标知识节点连接路径的数目决定。即,所属度用于表征从属知识节点到目标知识节点的关联程度,所述目标知识节点到所述从属知识节点中所连接的中间知识节点的个数越多,则所属度越小,所述目标知识节点到所述从属知识节点的连接路径越多,则所属度越大。Wherein, the subordinate knowledge node is the next level knowledge node, the next level knowledge node... or the next N level knowledge node of the target knowledge node. The degree of belonging of the subordinate knowledge node to the target knowledge node is determined by the length of the path connecting the subordinate knowledge node to the target knowledge node; of course, the degree of belonging of the subordinate knowledge node to the target knowledge node is determined by the number of paths that can be connected with the subordinate knowledge to the target knowledge node . That is, the degree of belonging is used to characterize the degree of association between the subordinate knowledge node and the target knowledge node. The greater the number of intermediate knowledge nodes connected from the target knowledge node to the subordinate knowledge node, the smaller the degree of belonging. The more connection paths from the target knowledge node to the subordinate knowledge node, the greater the degree of belonging.
例如,目标知识节点为A,从属知识节点为D1,从A到D1的路径有A-B-D1,即A到D1中间存在一个节点B;目标知识节点A另一个从属知识节点为D2,从A到D2的路径有A-B-C-D2,即A到D2中间存在两个个节点B、C;则从属知识节点D1到目标知识节点A的所属度大于从属知识节点D2到目标知识节点A的所属度。例如,目标知识节点A到从属知识节点D1有一条路径,目标知识节点A到从属知识节点D2有两条路径,则从属知识节点D1到目标知识节点A的所属度小于从属知识节点D2到目标知识节点A的所属度。For example, the target knowledge node is A, the subordinate knowledge node is D1, and the path from A to D1 is AB-D1, that is, there is a node B between A and D1; another subordinate knowledge node of the target knowledge node A is D2, from A to D1 The path of D2 is ABC-D2, that is, there are two nodes B and C between A and D2; then the degree of belonging of the subordinate knowledge node D1 to the target knowledge node A is greater than the degree of belonging of the subordinate knowledge node D2 to the target knowledge node A. For example, there is one path from target knowledge node A to subordinate knowledge node D1, and there are two paths from target knowledge node A to subordinate knowledge node D2. Then the degree of belonging of subordinate knowledge node D1 to target knowledge node A is less than that of subordinate knowledge node D2 to target knowledge The degree of belonging of node A.
步骤S130,在所述从属知识节点到所述目标知识节点的所属度低于阈值时,删除所述从属知识节点,获取所述预设领域内的剩余知识节点。Step S130, when the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than the threshold, delete the subordinate knowledge node, and obtain the remaining knowledge nodes in the preset domain.
其中,所述阈值需要设置,阈值设置越高,对从属知识节点过滤的条件越严格,当然,阈值也不能设置得太高,以防与目标知识节点相关度高的从属知识节点也被过滤掉。当所述从属知识节点到所述目标知识节点的所属度小于阈值时,则所述从属知识节点不属于目标知识节点确定的领域,需要剔除。当所述从属节点到所述目标知识节点的所属度不小于阈值时,则所述从属知识节点属于所述目标知识节点确定的领域。剩余知识节点为从预设知识条目确定的知识节点中,剔除不符合要求的从属知识节点剩余的知识节点,所述不符合要求即所述从属知识节点到所述目标知识节点的所属度低于阈值。The threshold value needs to be set. The higher the threshold value is, the stricter the condition for filtering subordinate knowledge nodes. Of course, the threshold value cannot be set too high to prevent subordinate knowledge nodes with high correlation with the target knowledge node from being filtered out. . When the degree of belonging of the subordinate knowledge node to the target knowledge node is less than the threshold, the subordinate knowledge node does not belong to the domain determined by the target knowledge node and needs to be eliminated. When the degree of belonging from the subordinate node to the target knowledge node is not less than a threshold, the subordinate knowledge node belongs to the domain determined by the target knowledge node. The remaining knowledge nodes are knowledge nodes that are determined from the preset knowledge items, and subordinate knowledge nodes that do not meet the requirements are removed. The non-conformity means that the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than Threshold.
步骤S140,根据所述预设领域内的剩余知识节点构建知识图谱。Step S140, construct a knowledge graph according to the remaining knowledge nodes in the preset domain.
其中,知识节点和知识条目存在一一对应关系,根据知识节点能够获取相应的知识条目,通过知识节点获取对应知识条目,并获取知识条目的实体的值和实体属性的值,完成知识图谱的构建。Among them, there is a one-to-one correspondence between knowledge nodes and knowledge items. According to the knowledge nodes, the corresponding knowledge items can be obtained, the corresponding knowledge items can be obtained through the knowledge nodes, and the value of the entity and the value of the entity attribute of the knowledge item can be obtained to complete the construction of the knowledge graph. .
其中,可以从百科数据库中获取知识条目的实体的值和实体属性的值,也可以从网页中收集知识条目的实体的值和实体属性的值。Among them, the value of the entity and the value of the entity attribute of the knowledge item can be obtained from the encyclopedia database, and the value of the entity and the value of the entity attribute of the knowledge item can also be collected from the web page.
上述基于知识节点所属度的知识图谱构建方法,通过计算从属知识节点到目标知识节点的所属度,来对关联度低的从属知识节点进行剔除,保证了构建的知识图谱的准确性,并且,通过计算所属度的方式来对知识条目进行精简,提高了知识图谱构建效率。The above-mentioned knowledge graph construction method based on the degree of belonging of the knowledge node, by calculating the degree of belonging of the subordinate knowledge node to the target knowledge node, removes the subordinate knowledge node with low degree of relevance, ensuring the accuracy of the constructed knowledge graph, and passes The method of calculating the degree of belonging is used to streamline knowledge items, which improves the efficiency of knowledge graph construction.
在其中一个实施例中,所述步骤S110包括:根据百科数据库,获取知识节点和知识节点之间的关联关系;根据所述知识节点和知识节点之间的关联关系,获取预设领域内的目标知识节点和从属知识节点。In one of the embodiments, the step S110 includes: obtaining the association relationship between the knowledge node and the knowledge node according to the encyclopedia database; obtaining the target in the preset field according to the association relationship between the knowledge node and the knowledge node Knowledge node and subordinate knowledge node.
其中,知识节点与知识条目对应,根据百科数据库获取知识节点,先从百科数据库中获取知识条目,百科数据库为包括维基百科等具有开源数据表的数据库,通过开源数据表的接口获取数据表中知识条目和知识条目之间的关联关系,百科数据库中包括多个领域的知识条目。通过知识条目之间的关联关系映射到知识节点之间的关联关系。其中,知识条目和知识条目之间的关联关系的组成为结构化数据,结构化数据是具有固定模式和从属关系的数据,如数据库中数据。半结构化数据:有基本固定结构模式的数据,例如日志文件、XML文档、JSON文档等。非结构化数据:没有固定模式的数据,如WORD、PDF、PPT、EXCEL等文档数据,各种格式的图片、视频等。Among them, the knowledge node corresponds to the knowledge item. The knowledge node is obtained according to the encyclopedia database, and the knowledge item is first obtained from the encyclopedia database. The encyclopedia database is a database with open source data tables such as Wikipedia, and the knowledge in the data table is obtained through the open source data table interface The relationship between items and knowledge items, the encyclopedia database includes knowledge items in multiple fields. The association relationship between knowledge items is mapped to the association relationship between knowledge nodes. Among them, the composition of the knowledge item and the association relationship between the knowledge item is structured data, and the structured data is data with a fixed pattern and subordination relationship, such as data in a database. Semi-structured data: data with a basic fixed structure pattern, such as log files, XML documents, JSON documents, etc. Unstructured data: data without a fixed pattern, such as document data such as WORD, PDF, PPT, EXCEL, etc., pictures and videos in various formats.
例如,在维基百科中通过“page”、“categoryLinks”和“redirect”三张表来提取分类知识条目和知识条目之间的关联关系,其中,在“page”(页面)表中保存知识条目,在“categoryLinks”(类别链接)和“redirect”(重定向)表中保存知识条目之间的关联关系。For example, in Wikipedia, three tables of "page", "categoryLinks" and "redirect" are used to extract the relationship between classified knowledge entries and knowledge entries. Among them, the knowledge entries are stored in the "page" table. The association relationship between the knowledge items is stored in the "categoryLinks" and "redirect" tables.
其中,目标知识节点可为所述预设领域内的最大的知识概念,从属知识节点为所述目标知识节点下的小类或者子类,例如,目标知识节点为金融,则与所述金融存在从属关系的从属知识节点包括货币、人民币、美元。Among them, the target knowledge node may be the largest knowledge concept in the preset field, and the subordinate knowledge node is a small category or sub-category under the target knowledge node. For example, if the target knowledge node is finance, it may exist with the finance. The subordinate knowledge nodes of subordinate relationship include currency, RMB, and U.S. dollar.
在其中一个实施例中,如图2所示,步骤S120包括:步骤S121,获取所述目标知识节点到所述从属知识节点连接路径上的中间知识节点;步骤S122,计算所述目标知识节点和所述中间知识节点引出连接的知识节点的个数;步骤S123,计算所述引出连接的知识节点的个数的倒数,得到所述目标知识节点和所述中间知识节点的链接权重;步骤S124,根据所述目标知识节点和所述中间知识节点的链接权重,计算所述从属知识节点到所述目标知识节点的所属度。In one of the embodiments, as shown in FIG. 2, step S120 includes: step S121, obtaining intermediate knowledge nodes on the connection path of the target knowledge node to the subordinate knowledge node; step S122, calculating the target knowledge node and The number of knowledge nodes that the intermediate knowledge node leads to connections; step S123, calculate the reciprocal of the number of knowledge nodes that lead to connections, to obtain the link weights of the target knowledge node and the intermediate knowledge nodes; step S124, According to the link weight of the target knowledge node and the intermediate knowledge node, the degree of belonging of the subordinate knowledge node to the target knowledge node is calculated.
其中,知识节点与知识节点通过边相连,例如,图3中,T1、T2、C1、C2、C3、C4和C5为知识节点,T1到C3、C4的箭头为知识节点的边,T2到C1、C4、C5的箭头为知识节点的边,C3到C1、C2的箭头为知识节点的边,C4到C1的箭头为知识节点的边;中间知识节点是在一条连接路径上除了所述目标知识节点和所述从属知识节点的其它知识节点,如图3中,一条连接路径T1-C3-C2,T1是目标知识节点,C2是从属知识节点,则中间知识节点未C3。其中,所述知识节点引出连接的知识节点的个数等于从所述知识节点引出的边的个数,例如,如图3所示,从T1引出两条边,则T1引出的连接的知识节点的个数为2。在目标知识节点到从属知识节点的连接路径上,除了从属知识节点,每个知识节点都计算引出连接的知识节点的个数,根据引出连接的知识节点的个数计算所述知识节点的链接权重,链接权重等于引出连接的知识节点的个数的倒数,例如,如图3中,C3引出知识节点C2和C1,C3引出连接的知识节点的个数为2,则C3的链接权重1/2。Among them, the knowledge node and the knowledge node are connected by edges. For example, in Figure 3, T1, T2, C1, C2, C3, C4, and C5 are knowledge nodes, the arrows from T1 to C3, C4 are the edges of the knowledge node, and T2 to C1 The arrows of C4, C5 are the edges of the knowledge node, the arrows from C3 to C1, C2 are the edges of the knowledge node, and the arrows from C4 to C1 are the edges of the knowledge node; the intermediate knowledge node is on a connection path except for the target knowledge The node and other knowledge nodes of the subordinate knowledge node, as shown in Figure 3, a connection path T1-C3-C2, T1 is the target knowledge node, C2 is the subordinate knowledge node, then the intermediate knowledge node is not C3. Wherein, the number of knowledge nodes derived from the knowledge node is equal to the number of edges derived from the knowledge node. For example, as shown in Figure 3, two edges are derived from T1, then the connected knowledge node derived from T1 The number of is 2. On the connection path from the target knowledge node to the subordinate knowledge node, except for the subordinate knowledge node, each knowledge node counts the number of knowledge nodes that lead to the connection, and calculates the link weight of the knowledge node according to the number of knowledge nodes that lead to the connection. , The link weight is equal to the reciprocal of the number of knowledge nodes that lead to connections. For example, as shown in Figure 3, C3 leads to knowledge nodes C2 and C1, and the number of knowledge nodes that C3 leads to connections is 2, then the link weight of C3 is 1/2 .
本实施通过计算所述从属知识节点到所述目标知识节点的所属度来对关联度小的子知识节点进行剔除,无需人工参与构建图谱,并且可以修改所属度阈值来改变领域的召回率和精准度。本申请能够指定不同的领域,可以快速构建垂直领域知识图谱。This implementation eliminates sub-knowledge nodes with low relevance by calculating the degree of belonging of the subordinate knowledge node to the target knowledge node, without manual participation in the construction of the graph, and the degree of belonging threshold can be modified to change the recall rate and accuracy of the domain Spend. This application can specify different fields, and can quickly build a knowledge map of vertical fields.
在其中一个实施例中,所述步骤S134包括:计算每一条连接路径的所述目标知识节点和所述中间知识节点的链接权重之积,得到每一条所述连接路径的权重;所述连接路径为所述从属知识节点到所述目标知识节点的连接路径;计算所有所述连接路径的权重之和,得到所述从属知识节点到所述目标知识节点的所属度。In one of the embodiments, the step S134 includes: calculating the product of the link weights of the target knowledge node and the intermediate knowledge node of each connection path to obtain the weight of each connection path; the connection path Is the connection path from the subordinate knowledge node to the target knowledge node; calculates the sum of the weights of all the connection paths to obtain the degree of belonging of the subordinate knowledge node to the target knowledge node.
例如,如图3所示,计算C1到T2的所属度过程如下,T1、T2、C1、C2、C3、C4和C5为节点,每个知识节点连接的知识节点的个数等于从从其引出箭头的个数,如T2引出箭头的个数为3,T1引出箭头的个数为2,C3引出箭头的个数为2,C4引出箭头的个数为1,T2的链接权重等于1/3,T1的链接权重等于1/2,C3的链接权重等于1/2,C4的链接权重等于1;T2到C1有两条路径,其中一条路径要经过C4,那么T2、C4、C1这条路径的权重为1/3×1=1/3,T2、C1这条路径的权重为1/3,则C1到T2的所属度为1/3+1/3=2/3。For example, as shown in Figure 3, the process of calculating the degree of belonging of C1 to T2 is as follows, T1, T2, C1, C2, C3, C4, and C5 are nodes, and the number of knowledge nodes connected to each knowledge node is equal to the number of knowledge nodes derived from it. The number of arrows, such as the number of T2 leading arrows is 3, the number of T1 leading arrows is 2, the number of C3 leading arrows is 2, the number of C4 leading arrows is 1, and the link weight of T2 is equal to 1/3 , The link weight of T1 is equal to 1/2, the link weight of C3 is equal to 1/2, and the link weight of C4 is equal to 1; there are two paths from T2 to C1, one of which must pass through C4, then the path T2, C4, C1 The weight of is 1/3×1=1/3, the weight of the path T2 and C1 is 1/3, then the degree of belonging of C1 to T2 is 1/3+1/3=2/3.
在其中一个实施例中,所述从属知识节点为末端知识节点;所述根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度,包括:根据所述目标知识节点到所述末端知识节点的路径长度,计算所述末端知识节点到所述目标知识节点的所属度。In one of the embodiments, the subordinate knowledge node is an end knowledge node; the degree of belonging of the subordinate knowledge node to the target knowledge node is calculated according to the path length from the target knowledge node to the subordinate knowledge node , Including: calculating the degree of belonging of the end knowledge node to the target knowledge node according to the path length from the target knowledge node to the end knowledge node.
其中,末端知识节点是不存在边或者边为零的知识节点,由于末端知识节点是与目标知识节点之间通过多级知识节点连接,末端知识节点与目标知识节点的关联度比较小或者完全没有关联,此时需要对末端知识节点进行剪枝,以实现对知识图谱进行优化。Among them, the end knowledge node is a knowledge node with no edges or zero edges. Since the end knowledge node is connected with the target knowledge node through multi-level knowledge nodes, the correlation between the end knowledge node and the target knowledge node is relatively small or not at all Associated, the end knowledge node needs to be pruned at this time to realize the optimization of the knowledge graph.
在其中一个实施例中,所述基于知识节点所属度的知识图谱构建方法还包括:判断所述从属知识节点到所述目标知识节点的所属度是否小于阈值。其中,阈值可以根据需要设置,从而对所述领域的分类进行扩大和缩小。在其中一个实施例中,阈值可以设为0.1。In one of the embodiments, the method for constructing a knowledge graph based on the degree of belonging of a knowledge node further includes: judging whether the degree of belonging of the subordinate knowledge node to the target knowledge node is less than a threshold. Wherein, the threshold value can be set as required, so as to expand and reduce the classification of the field. In one of the embodiments, the threshold can be set to 0.1.
在其中一个实施例中,步骤S140包括:获取所述预设领域内的剩余知识节点;根据所述剩余知识节点,通过爬虫技术获取对应的信息盒子,以构建所述剩余知识节点对应的知识条目的实体的值和实体属性的值。In one of the embodiments, step S140 includes: obtaining remaining knowledge nodes in the preset domain; according to the remaining knowledge nodes, obtaining corresponding information boxes through crawler technology to construct knowledge items corresponding to the remaining knowledge nodes The value of the entity and the value of the entity attribute.
其中,根据所述剩余知识节点对应的知识条目,在互联网页面中通过爬虫技术获取对应的信息盒子。例如,知识条目为“货币”,对货币构建实体和实体属性,通过爬虫技术获取货币的实体的值和实体属性的值,具体的,第一步拼接请求url如:https://baike.baidu.com/item/货币,第二步使用xpath获取信息盒子数据(如图4中实线框401中的内容),第三步将获取下来的信息盒数据保存到mongodb数据库中。Wherein, according to the knowledge item corresponding to the remaining knowledge node, the corresponding information box is obtained in the Internet page by crawling technology. For example, the knowledge item is "currency", the entity and entity attributes are constructed for currency, and the value of the currency entity and the value of entity attributes are obtained through crawler technology. Specifically, the first step is to splice the request url such as: https://baike.baidu .com/item/currency, the second step uses xpath to obtain the information box data (the content in the solid line box 401 in Figure 4), and the third step saves the obtained information box data to the mongodb database.
本实施例中,基于知识可以使用爬虫技术获取到结构化的数据,无需解析半结构化数据(新闻门户网站),提高了构建垂直领域知识图谱的效率。In this embodiment, based on knowledge, structured data can be obtained by using crawler technology without parsing semi-structured data (news portal), which improves the efficiency of constructing a knowledge graph of vertical domains.
应该理解的是,虽然图1-2的流程图中的各个步骤按照箭头的指示依次显示,但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,这些步骤可以以其它的顺序执行。而且,图1-2中的至少一部分步骤可以包括多个步骤或者多个阶段,这些步骤或者阶段并不必然是在同一时刻执行完成,而是可以在不同的时刻执行,这些步骤或者阶段的执行顺序也不必然是依次进行,而是可以与其它步骤或者其它步骤中的步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that although the various steps in the flowcharts of FIGS. 1-2 are displayed in sequence as indicated by the arrows, these steps are not necessarily performed in sequence in the order indicated by the arrows. Unless specifically stated in this article, the execution of these steps is not strictly limited in order, and these steps can be executed in other orders. Moreover, at least part of the steps in Figure 1-2 can include multiple steps or multiple stages. These steps or stages are not necessarily executed at the same time, but can be executed at different times. The execution of these steps or stages The sequence is not necessarily performed sequentially, but may be performed alternately or alternately with other steps or at least a part of the steps or stages in other steps.
在一个实施例中,如图5所示,提供了一种基于知识节点所属度的知识图谱构建装置,包括:知识节点选取模块210、所属度计算模块220、节点删除模块230和知识图谱构建模块240。In one embodiment, as shown in FIG. 5, a device for constructing a knowledge graph based on the degree of belonging of a knowledge node is provided, including: a knowledge node selection module 210, a degree of belonging calculation module 220, a node deletion module 230, and a knowledge graph constructing module 240.
知识节点选取模块210,用于获取目标知识节点和所述目标知识节点的从属知识节点;所述目标知识节点和所述从属知识节点为预设领域内的知识节点。The knowledge node selection module 210 is configured to obtain a target knowledge node and subordinate knowledge nodes of the target knowledge node; the target knowledge node and the subordinate knowledge node are knowledge nodes in a preset domain.
所属度计算模块220,用于根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度;其中,所述路径长度由所述目标知识节点到所述从属知识节点中所连接的中间知识节点的个数确定。The degree of belonging calculation module 220 is configured to calculate the degree of belonging from the subordinate knowledge node to the target knowledge node according to the length of the path from the target knowledge node to the subordinate knowledge node; wherein, the path length is determined by the target knowledge node. The number of intermediate knowledge nodes connected from the knowledge node to the subordinate knowledge node is determined.
节点删除模块230,用于在所述从属知识节点到所述目标知识节点的所属度低于阈值时,删除所述从属知识节点,获取所述预设领域内的剩余知识节点。The node deletion module 230 is configured to delete the subordinate knowledge node when the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than a threshold value, and obtain the remaining knowledge nodes in the preset domain.
知识图谱构建模块240,用于根据所述预设领域内的剩余知识节点构建知识图谱。The knowledge graph construction module 240 is configured to construct a knowledge graph according to the remaining knowledge nodes in the preset domain.
在其中一个实施例中,所述知识节点选取模块210包括:关联关系获取单元,用于根据百科数据库,获取知识节点和知识节点之间的关联关系;知识节点获取单元,用于根据所述知识节点和知识节点之间的关联关系,获取预设领域内的目标知识节点和从属知识节点。In one of the embodiments, the knowledge node selection module 210 includes: an association relationship acquisition unit for acquiring an association relationship between a knowledge node and a knowledge node according to an encyclopedia database; a knowledge node acquisition unit for acquiring an association relationship between a knowledge node and a knowledge node according to the knowledge The relationship between the node and the knowledge node is to obtain the target knowledge node and the subordinate knowledge node in the preset field.
在其中一个实施例中,所述所属度计算模块220包括:中间知识节点获取模块,用于获取所述目标知识节点到所述从属知识节点连接路径上的中间知识节点;连接知识节点个数计算单元,用于计算所述目标知识节点和所述中间知识节点引出连接的知识节点的个数;链接权重计算单元,用于计算所述引出连接的知识节点的个数的倒数,得到所述目标知识节点和所述中间知识节点的链接权重;所属度计算单元,用于根据所述目标知识节点和所述中间知识节点的链接权重,计算所述从属知识节点到所述目标知识节点的所属度。In one of the embodiments, the membership degree calculation module 220 includes: an intermediate knowledge node acquisition module for acquiring intermediate knowledge nodes on the connection path between the target knowledge node and the subordinate knowledge node; calculating the number of connected knowledge nodes A unit for calculating the number of knowledge nodes that are derived from the target knowledge node and the intermediate knowledge node; a link weight calculation unit for calculating the reciprocal of the number of knowledge nodes that are derived from the intermediate knowledge node to obtain the target The link weight of the knowledge node and the intermediate knowledge node; a membership degree calculation unit for calculating the membership degree of the subordinate knowledge node to the target knowledge node according to the link weight of the target knowledge node and the intermediate knowledge node .
在其中一个实施例中,所述所属度计算单元包括:权重计算子单元,用于计算每一条连接路径的所述目标知识节点和所述中间知识节点的链接权重之积,得到每一条所述连接路径的权重;所述连接路径为所述从属知识节点到所述目标知识节点的连接路径;所属度计算子单元,用于计算所有所述连接路径的权重之和,得到所述从属知识节点到所述目标知识节点的所属度。In one of the embodiments, the belonging degree calculation unit includes: a weight calculation subunit for calculating the product of the link weights of the target knowledge node and the intermediate knowledge node of each connection path to obtain each of the The weight of the connection path; the connection path is the connection path from the subordinate knowledge node to the target knowledge node; the membership degree calculation subunit is used to calculate the sum of the weights of all the connection paths to obtain the subordinate knowledge node The degree of belonging to the target knowledge node.
在其中一个实施例中,所述从属知识节点为末端知识节点;所述所属度计算模块220,还用于根据所述目标知识节点到所述末端知识节点的路径长度,计算所述末端知识节点到所述目标知识节点的所属度。In one of the embodiments, the subordinate knowledge node is an end knowledge node; the membership degree calculation module 220 is further configured to calculate the end knowledge node according to the path length from the target knowledge node to the end knowledge node The degree of belonging to the target knowledge node.
在其中一个实施例中,所述知识图谱构建装置,还包括:判断模块,用于判断所述从属知识节点到所述目标知识节点的所属度是否小于阈值;其中,所述阈值为0.1。In one of the embodiments, the knowledge graph construction device further includes: a judging module, configured to judge whether the degree of belonging of the subordinate knowledge node to the target knowledge node is less than a threshold; wherein the threshold is 0.1.
在其中一个实施例中,所述知识图谱构建模块240包括:剩余知识节点获取单元,用于获取所述预设领域内的剩余知识节点;值构建单元,用于根据所述剩余知识节点,通过爬虫技术获取对应的信息盒子,以构建所述剩余知识节点对应的知识条目的实体的值和实体属性的值。In one of the embodiments, the knowledge graph construction module 240 includes: a remaining knowledge node acquiring unit, configured to acquire remaining knowledge nodes in the preset domain; a value construction unit, configured to pass through the remaining knowledge nodes The crawler technology obtains the corresponding information box to construct the value of the entity and the value of the entity attribute of the knowledge item corresponding to the remaining knowledge node.
关于基于知识节点所属度的知识图谱构建装置的具体限定可以参见上文中对于基于知识节点所属度的知识图谱构建方法的限定,在此不再赘述。上述基于知识节点所属度的知识图谱构建装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中,也可以以软件形式存储于计算机设备中的存储器中,以便于处理器调用执行以上各个模块对应的操作。For the specific definition of the knowledge graph construction device based on the degree of belonging of a knowledge node, please refer to the above restriction on the method of constructing a knowledge graph based on the degree of belonging of a knowledge node, which will not be repeated here. Each module in the apparatus for constructing a knowledge graph based on the degree of belonging of a knowledge node can be implemented in whole or in part by software, hardware, and a combination thereof. The above-mentioned modules may be embedded in the form of hardware or independent of the processor in the computer equipment, or may be stored in the memory of the computer equipment in the form of software, so that the processor can call and execute the operations corresponding to the above-mentioned modules.
在一个实施例中,提供了一种计算机设备,该计算机设备可以是服务器,其内部结构图可以如图6所示。该计算机设备包括通过系统总线连接的处理器、存储器和网络接口。其中,该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器包括非易失性存储介质、内存储器。该非易失性存储介质存储有操作系统、计算机程序和数据库。该内存储器为非易失性存储介质中的操作系统和计算机程序的运行提供环境。该计算机设备的数据库用于存储知识条目和知识条目之间的关联关系的数据。该计算机设备的网络接口用于与外部的终端通过网络连接通信。该计算机程序被处理器执行时以实现一种基于知识节点所属度的知识图谱构建方法。In one embodiment, a computer device is provided. The computer device may be a server, and its internal structure diagram may be as shown in FIG. 6. The computer equipment includes a processor, a memory, and a network interface connected through a system bus. Among them, the processor of the computer device is used to provide calculation and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, a computer program, and a database. The internal memory provides an environment for the operation of the operating system and computer programs in the non-volatile storage medium. The database of the computer device is used to store the data of the knowledge item and the association relationship between the knowledge item. The network interface of the computer device is used to communicate with an external terminal through a network connection. When the computer program is executed by the processor, a method for constructing a knowledge graph based on the degree of belonging of the knowledge node is realized.
本领域技术人员可以理解,图6中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定,具体的计算机设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。Those skilled in the art can understand that the structure shown in FIG. 6 is only a block diagram of part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied. The specific computer device may Including more or fewer parts than shown in the figure, or combining some parts, or having a different arrangement of parts.
在一个实施例中,提供了一种计算机设备,包括存储器和处理器,存储器中存储有计算机程序,该处理器执行计算机程序时实现上述基于知识节点所属度的知识图谱构建方法,该基于知识节点所属度的知识图谱构建方法包括以下步骤:获取目标知识节点和所述目标知识节点的从属知识节点;所述目标知识节点和所述从属知识节点为预设领域内的知识节点;根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度;其中,所述路径长度由所述目标知识节点到所述从属知识节点中所连接的中间知识节点的个数确定;在所述从属知识节点到所述目标知识节点的所属度低于阈值时,删除所述从属知识节点,获取所述预设领域内的剩余知识节点;根据所述预设领域内的剩余知识节点构建知识图谱。In one embodiment, a computer device is provided, including a memory and a processor, and a computer program is stored in the memory. When the processor executes the computer program, the method for constructing a knowledge graph based on the degree of belonging of a knowledge node is implemented, which is based on the knowledge node The method for constructing the knowledge graph of the degree of belonging includes the following steps: obtaining a target knowledge node and subordinate knowledge nodes of the target knowledge node; the target knowledge node and the subordinate knowledge node are knowledge nodes in a preset domain; according to the target The length of the path from the knowledge node to the subordinate knowledge node is calculated, and the degree of belonging of the subordinate knowledge node to the target knowledge node is calculated; wherein, the path length is determined by the connection between the target knowledge node and the subordinate knowledge node The number of intermediate knowledge nodes is determined; when the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than a threshold, the subordinate knowledge node is deleted, and the remaining knowledge nodes in the preset domain are obtained; according to the The remaining knowledge nodes in the preset field construct a knowledge graph.
在一个实施例中,处理器执行计算机程序时还实现以下步骤:判断所述从属知识节点到所述目标知识节点的所属度是否小于阈值;其中,所述阈值为0.1。In an embodiment, the processor further implements the following steps when executing the computer program: judging whether the degree of belonging of the subordinate knowledge node to the target knowledge node is less than a threshold; wherein the threshold is 0.1.
在一个实施例中,提供了一种计算机可读存储介质,其上存储有计算机程序,计算机程序被处理器执行时实现上述基于知识节点所属度的知识图谱构建方法,该基于知识节点所属度的知识图谱构建方法包括以下步骤:获取目标知识节点和所述目标知识节点的从属知识节点;所述目标知识节点和所述从属知识节点为预设领域内的知识节点;根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度;其中,所述路径长度由所述目标知识节点到所述从属知识节点中所连接的中间知识节点的个数确定;在所述从属知识节点到所述目标知识节点的所属度低于阈值时,删除所述从属知识节点,获取所述预设领域内的剩余知识节点;根据所述预设领域内的剩余知识节点构建知识图谱。In one embodiment, a computer-readable storage medium is provided, on which a computer program is stored. When the computer program is executed by a processor, the method for constructing a knowledge graph based on the degree of belonging of a knowledge node is realized. The knowledge graph construction method includes the following steps: acquiring a target knowledge node and subordinate knowledge nodes of the target knowledge node; the target knowledge node and the subordinate knowledge node are knowledge nodes in a preset domain; The path length of the subordinate knowledge node calculates the degree of belonging of the subordinate knowledge node to the target knowledge node; wherein the path length is from the target knowledge node to the intermediate knowledge node connected to the subordinate knowledge node The number of is determined; when the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than the threshold, the subordinate knowledge node is deleted, and the remaining knowledge nodes in the preset domain are obtained; according to the preset domain The remaining knowledge nodes inside construct a knowledge graph.
在一个实施例中,计算机程序被处理器执行时还实现以下步骤:判断所述从属知识节点到所述目标知识节点的所属度是否小于阈值;其中,所述阈值为0.1。In one embodiment, when the computer program is executed by the processor, the following steps are further implemented: judging whether the degree of belonging of the subordinate knowledge node to the target knowledge node is less than a threshold; wherein the threshold is 0.1.
可选的,本申请涉及的存储介质如计算机可读存储介质可以是非易失性的,也可以是易失性的。Optionally, the storage medium involved in this application, such as a computer-readable storage medium, may be non-volatile or volatile.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成,所述的计算机程序可存储于一非易失性计算机可读取存储介质中,该计算机程序在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和易失性存储器中的至少一种。非易失性存储器可包括只读存储器(Read-Only Memory,ROM)、磁带、软盘、闪存或光存储器等。易失性存储器可包括随机存取存储器(Random Access Memory,RAM)或外部高速缓冲存储器。作为说明而非局限,RAM可以是多种形式,比如静态随机存取存储器(Static Random Access Memory,SRAM)或动态随机存取存储器(Dynamic Random Access Memory,DRAM)等。A person of ordinary skill in the art can understand that all or part of the processes in the above-mentioned embodiment methods can be implemented by instructing relevant hardware through a computer program. The computer program can be stored in a non-volatile computer readable storage. In the medium, when the computer program is executed, it may include the processes of the above-mentioned method embodiments. Wherein, any reference to memory, storage, database or other media used in the embodiments provided in this application may include at least one of non-volatile and volatile memory. Non-volatile memory may include read-only memory (Read-Only Memory, ROM), magnetic tape, floppy disk, flash memory, or optical memory, etc. Volatile memory may include Random Access Memory (RAM) or external cache memory. As an illustration and not a limitation, RAM can be in various forms, such as static random access memory (SRAM) or dynamic random access memory (Dynamic Random Access Memory, DRAM).
以上实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above embodiments can be combined arbitrarily. In order to make the description concise, all possible combinations of the technical features in the above embodiments are not described. However, as long as there is no contradiction in the combination of these technical features, they should be It is considered as the range described in this specification.
以上所述实施例仅表达了本申请的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本申请构思的前提下,还可以做出若干变形和改进,这些都属于本申请的保护范围。因此,本申请专利的保护范围应以所附权利要求为准。The above-mentioned embodiments only express several implementation manners of the present application, and the description is relatively specific and detailed, but it should not be understood as a limitation on the scope of the invention patent. It should be pointed out that for those of ordinary skill in the art, without departing from the concept of this application, several modifications and improvements can be made, and these all fall within the protection scope of this application. Therefore, the scope of protection of the patent of this application shall be subject to the appended claims.

Claims (20)

  1. 一种基于知识节点所属度的知识图谱构建方法,其中,所述方法包括:A method for constructing a knowledge graph based on the degree of belonging of a knowledge node, wherein the method includes:
    获取目标知识节点和所述目标知识节点的从属知识节点;所述目标知识节点和所述从属知识节点为预设领域内的知识节点;Acquiring a target knowledge node and a subordinate knowledge node of the target knowledge node; the target knowledge node and the subordinate knowledge node are knowledge nodes in a preset domain;
    根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度;其中,所述路径长度由所述目标知识节点到所述从属知识节点中所连接的中间知识节点的个数确定;According to the path length from the target knowledge node to the subordinate knowledge node, the degree of belonging of the subordinate knowledge node to the target knowledge node is calculated; wherein the path length is from the target knowledge node to the subordinate knowledge node Determine the number of intermediate knowledge nodes connected in;
    在所述从属知识节点到所述目标知识节点的所属度低于阈值时,删除所述从属知识节点,获取所述预设领域内的剩余知识节点;When the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than a threshold, delete the subordinate knowledge node, and obtain the remaining knowledge nodes in the preset domain;
    根据所述预设领域内的剩余知识节点构建知识图谱。Construct a knowledge graph according to the remaining knowledge nodes in the preset domain.
  2. 根据权利要求1所述的方法,其中,所述获取目标知识节点和所述目标知识节点的从属知识节点,包括:The method according to claim 1, wherein said acquiring the target knowledge node and the subordinate knowledge nodes of the target knowledge node comprises:
    根据百科数据库,获取知识节点和知识节点之间的关联关系;According to the encyclopedia database, obtain the relationship between the knowledge node and the knowledge node;
    根据所述知识节点和知识节点之间的关联关系,获取预设领域内的目标知识节点和从属知识节点。According to the association relationship between the knowledge node and the knowledge node, the target knowledge node and the subordinate knowledge node in the preset field are obtained.
  3. 根据权利要求1所述的方法,其中,所述根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度,包括:The method according to claim 1, wherein the calculating the degree of belonging of the subordinate knowledge node to the target knowledge node according to the length of the path from the target knowledge node to the subordinate knowledge node comprises:
    获取所述目标知识节点到所述从属知识节点连接路径上的中间知识节点;Acquiring an intermediate knowledge node on a connection path from the target knowledge node to the subordinate knowledge node;
    计算所述目标知识节点和所述中间知识节点引出连接的知识节点的个数;Calculating the number of knowledge nodes that are connected by the target knowledge node and the intermediate knowledge node;
    计算所述引出连接的知识节点的个数的倒数,得到所述目标知识节点和所述中间知识节点的链接权重;Calculate the reciprocal of the number of knowledge nodes that lead to connections, and obtain the link weights of the target knowledge nodes and the intermediate knowledge nodes;
    根据所述目标知识节点和所述中间知识节点的链接权重,计算所述从属知识节点到所述目标知识节点的所属度。According to the link weight of the target knowledge node and the intermediate knowledge node, the degree of belonging of the subordinate knowledge node to the target knowledge node is calculated.
  4. 根据权利要求3所述的方法,其中,所述根据所述目标知识节点和所述中间知识节点的链接权重,计算所述从属知识节点到所述目标知识节点的所属度,包括:The method according to claim 3, wherein the calculating the degree of belonging of the subordinate knowledge node to the target knowledge node according to the link weight of the target knowledge node and the intermediate knowledge node comprises:
    计算每一条连接路径的所述目标知识节点和所述中间知识节点的链接权重之积,得到每一条所述连接路径的权重;所述连接路径为所述从属知识节点到所述目标知识节点的连接路径;Calculate the product of the link weights of the target knowledge node and the intermediate knowledge node for each connection path to obtain the weight of each connection path; the connection path is the distance from the subordinate knowledge node to the target knowledge node Connection path
    计算所有所述连接路径的权重之和,得到所述从属知识节点到所述目标知识节点的所属度。Calculate the sum of the weights of all the connection paths to obtain the degree of belonging of the subordinate knowledge node to the target knowledge node.
  5. 根据权利要求1所述的方法,其中,所述从属知识节点为末端知识节点;The method according to claim 1, wherein the subordinate knowledge node is an end knowledge node;
    所述根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度,包括:The calculating the degree of belonging of the subordinate knowledge node to the target knowledge node according to the path length from the target knowledge node to the subordinate knowledge node includes:
    根据所述目标知识节点到所述末端知识节点的路径长度,计算所述末端知识节点到所述目标知识节点的所属度。According to the path length from the target knowledge node to the end knowledge node, the degree of belonging of the end knowledge node to the target knowledge node is calculated.
  6. 根据权利要求1所述的方法,其中,还包括:判断所述从属知识节点到所述目标知识节点的所属度是否小于阈值;其中,所述阈值为0.1。The method according to claim 1, further comprising: determining whether the degree of belonging of the subordinate knowledge node to the target knowledge node is less than a threshold; wherein the threshold is 0.1.
  7. 根据权利要求1所述的方法,其中,所述根据所述预设领域内的剩余知识节点构建知识图谱,包括:The method according to claim 1, wherein the constructing a knowledge graph based on the remaining knowledge nodes in the preset domain comprises:
    获取所述预设领域内的剩余知识节点;Acquiring remaining knowledge nodes in the preset domain;
    根据所述剩余知识节点,通过爬虫技术获取对应的信息盒子,以构建所述剩余知识节点对应的知识条目的实体的值和实体属性的值。According to the remaining knowledge node, the corresponding information box is obtained through crawler technology to construct the value of the entity and the value of the entity attribute of the knowledge item corresponding to the remaining knowledge node.
  8. 一种基于知识节点所属度的知识图谱构建装置,其中,所述装置包括:A device for constructing a knowledge graph based on the degree of belonging of a knowledge node, wherein the device includes:
    知识节点选取模块,用于获取目标知识节点和所述目标知识节点的从属知识节点;所述目标知识节点和所述从属知识节点为预设领域内的知识节点;The knowledge node selection module is used to obtain a target knowledge node and subordinate knowledge nodes of the target knowledge node; the target knowledge node and the subordinate knowledge node are knowledge nodes in a preset domain;
    所属度计算模块,用于根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度;其中,所述路径长度由所述目标知识节点到所述从属知识节点中所连接的中间知识节点的个数确定;The degree of belonging calculation module is used to calculate the degree of belonging of the subordinate knowledge node to the target knowledge node according to the length of the path from the target knowledge node to the subordinate knowledge node; wherein, the path length is determined by the target knowledge node. Determining the number of intermediate knowledge nodes connected from the node to the subordinate knowledge node;
    节点删除模块,用于在所述从属知识节点到所述目标知识节点的所属度低于阈值时,删除所述从属知识节点,获取所述预设领域内的剩余知识节点;A node deletion module, configured to delete the subordinate knowledge node when the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than a threshold value, and obtain the remaining knowledge nodes in the preset domain;
    知识图谱构建模块,用于根据所述预设领域内的剩余知识节点构建知识图谱。The knowledge graph construction module is used to construct a knowledge graph according to the remaining knowledge nodes in the preset domain.
  9. 一种计算机设备,包括存储器和处理器,所述存储器存储有计算机程序,其中,所述处理器执行所述计算机程序时实现基于知识节点所属度的知识图谱构建方法,该基于知识节点所属度的知识图谱构建方法包括以下步骤:A computer device includes a memory and a processor, the memory stores a computer program, wherein the processor implements a method for constructing a knowledge graph based on the degree of belonging of a knowledge node when the processor executes the computer program, and the method for constructing a knowledge graph based on the degree of belonging of a knowledge node is The knowledge graph construction method includes the following steps:
    获取目标知识节点和所述目标知识节点的从属知识节点;所述目标知识节点和所述从属知识节点为预设领域内的知识节点;Acquiring a target knowledge node and a subordinate knowledge node of the target knowledge node; the target knowledge node and the subordinate knowledge node are knowledge nodes in a preset domain;
    根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度;其中,所述路径长度由所述目标知识节点到所述从属知识节点中所连接的中间知识节点的个数确定;According to the path length from the target knowledge node to the subordinate knowledge node, the degree of belonging of the subordinate knowledge node to the target knowledge node is calculated; wherein the path length is from the target knowledge node to the subordinate knowledge node Determine the number of intermediate knowledge nodes connected in;
    在所述从属知识节点到所述目标知识节点的所属度低于阈值时,删除所述从属知识节点,获取所述预设领域内的剩余知识节点;When the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than a threshold, delete the subordinate knowledge node, and obtain the remaining knowledge nodes in the preset domain;
    根据所述预设领域内的剩余知识节点构建知识图谱。Construct a knowledge graph according to the remaining knowledge nodes in the preset domain.
  10. 根据权利要求9所述的计算机设备,其中,执行所述获取目标知识节点和所述目标知识节点的从属知识节点时,包括:The computer device according to claim 9, wherein the execution of the acquisition target knowledge node and the subordinate knowledge node of the target knowledge node comprises:
    根据百科数据库,获取知识节点和知识节点之间的关联关系;According to the encyclopedia database, obtain the relationship between the knowledge node and the knowledge node;
    根据所述知识节点和知识节点之间的关联关系,获取预设领域内的目标知识节点和从属知识节点。According to the association relationship between the knowledge node and the knowledge node, the target knowledge node and the subordinate knowledge node in the preset field are obtained.
  11. 根据权利要求9所述的计算机设备,其中,执行所述根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度时,包括:8. The computer device according to claim 9, wherein the calculation of the degree of belonging of the subordinate knowledge node to the target knowledge node according to the path length from the target knowledge node to the subordinate knowledge node comprises:
    获取所述目标知识节点到所述从属知识节点连接路径上的中间知识节点;Acquiring an intermediate knowledge node on a connection path from the target knowledge node to the subordinate knowledge node;
    计算所述目标知识节点和所述中间知识节点引出连接的知识节点的个数;Calculating the number of knowledge nodes that are connected by the target knowledge node and the intermediate knowledge node;
    计算所述引出连接的知识节点的个数的倒数,得到所述目标知识节点和所述中间知识节点的链接权重;Calculate the reciprocal of the number of knowledge nodes that lead to connections, and obtain the link weights of the target knowledge nodes and the intermediate knowledge nodes;
    根据所述目标知识节点和所述中间知识节点的链接权重,计算所述从属知识节点到所述目标知识节点的所属度。According to the link weight of the target knowledge node and the intermediate knowledge node, the degree of belonging of the subordinate knowledge node to the target knowledge node is calculated.
  12. 根据权利要求9所述的计算机设备,其中,所述从属知识节点为末端知识节点;The computer device according to claim 9, wherein the subordinate knowledge node is an end knowledge node;
    执行所述根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度时,包括:The execution of calculating the degree of belonging of the subordinate knowledge node to the target knowledge node according to the path length from the target knowledge node to the subordinate knowledge node includes:
    根据所述目标知识节点到所述末端知识节点的路径长度,计算所述末端知识节点到所述目标知识节点的所属度。According to the path length from the target knowledge node to the end knowledge node, the degree of belonging of the end knowledge node to the target knowledge node is calculated.
  13. 根据权利要求9所述的计算机设备,其中,所述处理器执行所述基于知识节点所属度的知识图谱构建方法时,还包括:The computer device according to claim 9, wherein, when the processor executes the method for constructing a knowledge graph based on the degree of belonging of a knowledge node, the method further comprises:
    判断所述从属知识节点到所述目标知识节点的所属度是否小于阈值;其中,所述阈值为0.1。Determine whether the degree of belonging of the subordinate knowledge node to the target knowledge node is less than a threshold; wherein the threshold is 0.1.
  14. 根据权利要求9所述的计算机设备,其中,执行所述根据所述预设领域内的剩余知识节点构建知识图谱时,包括:The computer device according to claim 9, wherein the execution of the construction of the knowledge graph based on the remaining knowledge nodes in the preset domain comprises:
    获取所述预设领域内的剩余知识节点;Acquiring remaining knowledge nodes in the preset domain;
    根据所述剩余知识节点,通过爬虫技术获取对应的信息盒子,以构建所述剩余知识节点对应的知识条目的实体的值和实体属性的值。According to the remaining knowledge node, the corresponding information box is obtained through crawler technology to construct the value of the entity and the value of the entity attribute of the knowledge item corresponding to the remaining knowledge node.
  15. 一种计算机可读存储介质,其上存储有计算机程序,其中,所述计算机程序被处理器执行时实现基于知识节点所属度的知识图谱构建方法,该基于知识节点所属度的知识图谱构建方法包括以下步骤:A computer-readable storage medium having a computer program stored thereon, wherein when the computer program is executed by a processor, a method for constructing a knowledge graph based on the degree of belonging of a knowledge node is implemented, and the method for constructing a knowledge graph based on the degree of belonging of a knowledge node includes The following steps:
    获取目标知识节点和所述目标知识节点的从属知识节点;所述目标知识节点和所述从属知识节点为预设领域内的知识节点;Acquiring a target knowledge node and a subordinate knowledge node of the target knowledge node; the target knowledge node and the subordinate knowledge node are knowledge nodes in a preset domain;
    根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度;其中,所述路径长度由所述目标知识节点到所述从属知识节点中所连接的中间知识节点的个数确定;According to the path length from the target knowledge node to the subordinate knowledge node, the degree of belonging of the subordinate knowledge node to the target knowledge node is calculated; wherein the path length is from the target knowledge node to the subordinate knowledge node Determine the number of intermediate knowledge nodes connected in;
    在所述从属知识节点到所述目标知识节点的所属度低于阈值时,删除所述从属知识节点,获取所述预设领域内的剩余知识节点;When the degree of belonging of the subordinate knowledge node to the target knowledge node is lower than a threshold, delete the subordinate knowledge node, and obtain the remaining knowledge nodes in the preset domain;
    根据所述预设领域内的剩余知识节点构建知识图谱。Construct a knowledge graph according to the remaining knowledge nodes in the preset domain.
  16. 根据权利要求15所述的计算机可读存储介质,其中,执行所述获取目标知识节点和所述目标知识节点的从属知识节点时,包括:The computer-readable storage medium according to claim 15, wherein the execution of the acquisition target knowledge node and the subordinate knowledge node of the target knowledge node comprises:
    根据百科数据库,获取知识节点和知识节点之间的关联关系;According to the encyclopedia database, obtain the relationship between the knowledge node and the knowledge node;
    根据所述知识节点和知识节点之间的关联关系,获取预设领域内的目标知识节点和从属知识节点。According to the association relationship between the knowledge node and the knowledge node, the target knowledge node and the subordinate knowledge node in the preset field are obtained.
  17. 根据权利要求15所述的计算机可读存储介质,其中,执行所述根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度时,包括:The computer-readable storage medium according to claim 15, wherein when performing the calculation of the degree of belonging of the subordinate knowledge node to the target knowledge node based on the path length from the target knowledge node to the subordinate knowledge node ,include:
    获取所述目标知识节点到所述从属知识节点连接路径上的中间知识节点;Acquiring an intermediate knowledge node on a connection path from the target knowledge node to the subordinate knowledge node;
    计算所述目标知识节点和所述中间知识节点引出连接的知识节点的个数;Calculating the number of knowledge nodes that are connected by the target knowledge node and the intermediate knowledge node;
    计算所述引出连接的知识节点的个数的倒数,得到所述目标知识节点和所述中间知识节点的链接权重;Calculate the reciprocal of the number of knowledge nodes that lead to connections, and obtain the link weights of the target knowledge nodes and the intermediate knowledge nodes;
    根据所述目标知识节点和所述中间知识节点的链接权重,计算所述从属知识节点到所述目标知识节点的所属度。According to the link weight of the target knowledge node and the intermediate knowledge node, the degree of belonging of the subordinate knowledge node to the target knowledge node is calculated.
  18. 根据权利要求15所述的计算机可读存储介质,其中,所述从属知识节点为末端知识节点;The computer-readable storage medium according to claim 15, wherein the subordinate knowledge node is an end knowledge node;
    执行所述根据所述目标知识节点到所述从属知识节点的路径长度,计算所述从属知识节点到所述目标知识节点的所属度时,包括:The execution of calculating the degree of belonging of the subordinate knowledge node to the target knowledge node according to the path length from the target knowledge node to the subordinate knowledge node includes:
    根据所述目标知识节点到所述末端知识节点的路径长度,计算所述末端知识节点到所述目标知识节点的所属度。According to the path length from the target knowledge node to the end knowledge node, the degree of belonging of the end knowledge node to the target knowledge node is calculated.
  19. 根据权利要求15所述的计算机可读存储介质,其中,所述计算机程序被处理器执行实现所述基于知识节点所属度的知识图谱构建方法时,还包括:The computer-readable storage medium according to claim 15, wherein, when the computer program is executed by a processor to implement the method for constructing a knowledge graph based on the degree of belonging of a knowledge node, the method further comprises:
    判断所述从属知识节点到所述目标知识节点的所属度是否小于阈值;其中,所述阈值为0.1。Determine whether the degree of belonging of the subordinate knowledge node to the target knowledge node is less than a threshold; wherein the threshold is 0.1.
  20. 根据权利要求15所述的计算机可读存储介质,其中,执行所述根据所述预设领域内的剩余知识节点构建知识图谱时,包括:The computer-readable storage medium according to claim 15, wherein the execution of the construction of the knowledge graph based on the remaining knowledge nodes in the preset field comprises:
    获取所述预设领域内的剩余知识节点;Acquiring remaining knowledge nodes in the preset domain;
    根据所述剩余知识节点,通过爬虫技术获取对应的信息盒子,以构建所述剩余知识节点对应的知识条目的实体的值和实体属性的值。According to the remaining knowledge node, the corresponding information box is obtained through crawler technology to construct the value of the entity and the value of the entity attribute of the knowledge item corresponding to the remaining knowledge node.
PCT/CN2021/071056 2020-03-26 2021-01-11 Knowledge map construction method and device based on knowledge node belonging degree WO2021190091A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010224365.8A CN111444352A (en) 2020-03-26 2020-03-26 Knowledge graph construction method and device based on knowledge node membership
CN202010224365.8 2020-03-26

Publications (1)

Publication Number Publication Date
WO2021190091A1 true WO2021190091A1 (en) 2021-09-30

Family

ID=71652489

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/071056 WO2021190091A1 (en) 2020-03-26 2021-01-11 Knowledge map construction method and device based on knowledge node belonging degree

Country Status (2)

Country Link
CN (1) CN111444352A (en)
WO (1) WO2021190091A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114153990A (en) * 2021-12-07 2022-03-08 支付宝(杭州)信息技术有限公司 Knowledge production pipeline construction method, system and device
CN115309789A (en) * 2022-10-11 2022-11-08 浩鲸云计算科技股份有限公司 Method for generating associated data graph in real time based on intelligent dynamic business object
CN116932780A (en) * 2023-09-13 2023-10-24 之江实验室 Astronomical knowledge graph construction method, resource searching method, device and medium
CN117172322A (en) * 2023-11-03 2023-12-05 中国标准化研究院 Method for establishing digital rural knowledge graph

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111444352A (en) * 2020-03-26 2020-07-24 深圳壹账通智能科技有限公司 Knowledge graph construction method and device based on knowledge node membership
CN113836244B (en) * 2021-09-27 2023-04-07 天弘基金管理有限公司 Sample acquisition method, model training method, relation prediction method and device

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103729402A (en) * 2013-11-22 2014-04-16 浙江大学 Method for establishing mapping knowledge domain based on book catalogue
CN103870578A (en) * 2014-03-21 2014-06-18 联想(北京)有限公司 Method for displaying associated information between users in network application and electronic equipment
CN106354728A (en) * 2015-07-16 2017-01-25 富士通株式会社 Method and device for generating association intensity between objects with semantic graph
US20180285764A1 (en) * 2017-03-29 2018-10-04 Futurewei Technologies, Inc. Knowledge network platform
CN108647788A (en) * 2018-05-14 2018-10-12 暨南大学 A kind of automatic improved method of association type knowledge base
CN109189936A (en) * 2018-08-13 2019-01-11 天津科技大学 A kind of label semanteme learning method measured based on network structure and semantic dependency
CN111444352A (en) * 2020-03-26 2020-07-24 深圳壹账通智能科技有限公司 Knowledge graph construction method and device based on knowledge node membership

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103729402A (en) * 2013-11-22 2014-04-16 浙江大学 Method for establishing mapping knowledge domain based on book catalogue
CN103870578A (en) * 2014-03-21 2014-06-18 联想(北京)有限公司 Method for displaying associated information between users in network application and electronic equipment
CN106354728A (en) * 2015-07-16 2017-01-25 富士通株式会社 Method and device for generating association intensity between objects with semantic graph
US20180285764A1 (en) * 2017-03-29 2018-10-04 Futurewei Technologies, Inc. Knowledge network platform
CN108647788A (en) * 2018-05-14 2018-10-12 暨南大学 A kind of automatic improved method of association type knowledge base
CN109189936A (en) * 2018-08-13 2019-01-11 天津科技大学 A kind of label semanteme learning method measured based on network structure and semantic dependency
CN111444352A (en) * 2020-03-26 2020-07-24 深圳壹账通智能科技有限公司 Knowledge graph construction method and device based on knowledge node membership

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114153990A (en) * 2021-12-07 2022-03-08 支付宝(杭州)信息技术有限公司 Knowledge production pipeline construction method, system and device
CN115309789A (en) * 2022-10-11 2022-11-08 浩鲸云计算科技股份有限公司 Method for generating associated data graph in real time based on intelligent dynamic business object
CN116932780A (en) * 2023-09-13 2023-10-24 之江实验室 Astronomical knowledge graph construction method, resource searching method, device and medium
CN116932780B (en) * 2023-09-13 2024-01-09 之江实验室 Astronomical knowledge graph construction method, resource searching method, device and medium
CN117172322A (en) * 2023-11-03 2023-12-05 中国标准化研究院 Method for establishing digital rural knowledge graph
CN117172322B (en) * 2023-11-03 2024-03-12 中国标准化研究院 Method for establishing digital rural knowledge graph

Also Published As

Publication number Publication date
CN111444352A (en) 2020-07-24

Similar Documents

Publication Publication Date Title
WO2021190091A1 (en) Knowledge map construction method and device based on knowledge node belonging degree
US20230169117A1 (en) Query language interoperability in a graph database
WO2023273182A1 (en) Multi-source knowledge graph fusion-oriented entity alignment method and apparatus, and system
CN105893349B (en) Classification tag match mapping method and device
CN107391677B (en) Method and device for generating Chinese general knowledge graph with entity relation attributes
US20180232262A1 (en) Mapping heterogeneous application-program interfaces to a database
US20170139991A1 (en) Dynamic query plan based on skew
CN112711591B (en) Data blood margin determination method and device based on field level of knowledge graph
Pernelle et al. An automatic key discovery approach for data linking
DE102016125767A1 (en) Discovery of a connected entity
CN111444181B (en) Knowledge graph updating method and device and electronic equipment
US20130179467A1 (en) Calculating Property Caching Exclusions In A Graph Evaluation Query Language
US10019537B1 (en) System and method for data search in a graph database
US20170124221A1 (en) Message passing in a distributed graph database
US10997181B2 (en) Generating a data structure that maps two files
US20190179934A1 (en) Cloud based validation engine
Mastykash et al. Internet social environmental platforms data representation
De Virgilio Smart RDF data storage in graph databases
US11238084B1 (en) Semantic translation of data sets
CN112257959A (en) User risk prediction method and device, electronic equipment and storage medium
Bai et al. Fixing the inconsistencies in fuzzy spatiotemporal RDF graph
US20220156245A1 (en) System and method for managing custom fields
US20190080015A1 (en) Parsing nested javascript object notation requests
GB2544281A (en) Method and system for data validation in knowledge extraction apparatus
CN106933844B (en) Construction method of reachability query index facing large-scale RDF data

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21776934

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 190123)

122 Ep: pct application non-entry in european phase

Ref document number: 21776934

Country of ref document: EP

Kind code of ref document: A1