WO2018228259A1 - Relationship diagram processing method and apparatus - Google Patents

Relationship diagram processing method and apparatus Download PDF

Info

Publication number
WO2018228259A1
WO2018228259A1 PCT/CN2018/090178 CN2018090178W WO2018228259A1 WO 2018228259 A1 WO2018228259 A1 WO 2018228259A1 CN 2018090178 W CN2018090178 W CN 2018090178W WO 2018228259 A1 WO2018228259 A1 WO 2018228259A1
Authority
WO
WIPO (PCT)
Prior art keywords
relationship
virtual
association
core nodes
diagram
Prior art date
Application number
PCT/CN2018/090178
Other languages
French (fr)
Chinese (zh)
Inventor
许凌志
钱伟红
张洪
Original Assignee
阿里巴巴集团控股有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 阿里巴巴集团控股有限公司 filed Critical 阿里巴巴集团控股有限公司
Publication of WO2018228259A1 publication Critical patent/WO2018228259A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9538Presentation of query results

Definitions

  • the present application relates to a data processing technology, and more particularly to a relationship processing method and apparatus.
  • the method of starting aggregation from a node is mainly to merge the relationships of the same data type to achieve the purpose of simplifying the complex relationship diagram.
  • it is simplified from the node.
  • the combination only according to the data type can not reflect the relationship between the two nodes. relationship.
  • the present application provides a method and a device for processing a relationship diagram, which can simplify the complex relationship diagram and have an associated prominent backbone.
  • the present application provides a relationship diagram processing method, including:
  • the core node being a virtual node formed by a node in a relationship diagram to be simplified or a cluster in a relationship diagram;
  • the similarity calculation is performed on multiple association relationships between the core nodes, and the virtual association relationship between the core nodes is obtained by aggregation, so that the obtained virtual association relationship is used as the relationship between the core nodes in the relationship diagram to be simplified.
  • the method further includes: storing the associated relationship between the aggregated virtual association relationship and the pre-aggregation.
  • the method further includes: when the virtual association relationship after the aggregation is triggered, and expanding the selected virtual association relationship according to the associated association relationship corresponding to the virtual association relationship.
  • the expanding the selected aggregated virtual association relationship includes:
  • the similarity calculation is performed on multiple association relationships between the core nodes, and the virtual association relationship between the core nodes is obtained by the aggregation:
  • the different dimensions include any combination of the following: a time dimension, a relationship attribute dimension, and a behavior mode dimension.
  • the application also provides an implementation diagram device, including: a division module, an acquisition module, and an aggregation module; wherein
  • a dividing module configured to determine a plurality of core nodes in the relationship diagram to be simplified, the core node being a virtual node formed by a node in a relationship diagram or a cluster in a relationship diagram to be simplified;
  • An obtaining module configured to acquire multiple association relationships between core nodes
  • the aggregation module is configured to perform similarity calculation on multiple association relationships between the core nodes, and obtain a virtual association relationship between the core nodes to obtain the virtual association relationship between the core nodes in the relationship diagram to be simplified. Relationship.
  • the device further includes:
  • a storage module configured to correspondingly store the associated virtual association relationship and the pre-aggregation association relationship
  • the expansion module is configured to trigger the virtual association relationship after the aggregation, and expand the selected virtual relationship after the aggregation according to the association relationship corresponding to the virtual association relationship.
  • the expansion module is configured to: read all the pre-aggregation association relationships corresponding to the aggregated virtual association relationship, and display the read association relationship.
  • the present application further provides a relationship diagram processing apparatus, including a memory and a processor, wherein the memory stores an executable instruction: determining a plurality of core nodes in a relationship diagram to be simplified, the core node being a relationship to be simplified a virtual node formed by a node in the graph or a cluster in the relationship graph; acquiring multiple association relationships between the core nodes; performing similarity calculation on multiple association relationships between the core nodes, and obtaining the core nodes by aggregation
  • the virtual association relationship between the virtual associations is taken as the relationship between the core nodes in the relationship diagram to be simplified.
  • the solution provided by the present application includes: determining a plurality of core nodes in a relationship diagram to be simplified, the core node being a virtual node formed by a node in a relationship diagram or a relationship diagram to be simplified; acquiring between each core node Multiple association relationships; similarity calculations are performed on multiple association relationships between core nodes, and the virtual association relationship between the core nodes is obtained by aggregation, so that the obtained virtual association relationship is regarded as the core node in the relationship diagram to be simplified. Relationship between.
  • the present application utilizes the similarity between the core nodes in the relationship diagram to merge similar relationships to abstract new virtual association relationships, thereby simplifying the complex relationship diagram and highlighting the backbone context.
  • the present application further provides a method for expanding an aggregated independent track, which implements local information expansion in a relationship diagram, so that the relationship between two nodes or between two clusters is separated from the complex relationship diagram, and analyzed. It is also clearer and more convenient.
  • FIG. 2(a) is a schematic diagram of an embodiment of dividing a complex relationship diagram to be simplified into a plurality of clusters in the present application;
  • 2(b) is a schematic diagram of an embodiment in which a complex relationship diagram to be simplified is divided into a plurality of core nodes in the present application;
  • FIG. 3 is a schematic diagram of an embodiment of a similar independent rail of the present application.
  • FIG. 4 is a schematic diagram of an embodiment of a simplified relationship diagram of the present application.
  • FIG. 5 is a schematic diagram of an embodiment of the present application after the independent rails of FIG. 4 are aggregated;
  • FIG. 6 is a schematic diagram of an embodiment of developing an independent rail after aggregation in FIG. 5 in the present application;
  • FIG. 7 is a schematic structural diagram of a device for implementing a simplified relationship diagram of the present application.
  • a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
  • processors CPUs
  • input/output interfaces network interfaces
  • memory volatile and non-volatile memory
  • the memory may include non-persistent memory, random access memory (RAM), and/or non-volatile memory in a computer readable medium, such as read only memory (ROM) or flash memory.
  • RAM random access memory
  • ROM read only memory
  • Memory is an example of a computer readable medium.
  • Computer readable media includes both permanent and non-persistent, removable and non-removable media.
  • Information storage can be implemented by any method or technology.
  • the information can be computer readable instructions, data structures, modules of programs, or other data.
  • Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read only memory. (ROM), electrically erasable programmable read only memory (EEPROM), flash memory or other memory technology, compact disk read only memory (CD-ROM), digital versatile disk (DVD) or other optical storage, Magnetic tape cartridges, magnetic tape storage or other magnetic storage devices or any other non-transportable media can be used to store information that can be accessed by a computing device.
  • computer readable media does not include non-transitory computer readable media, such as modulated data signals and carrier waves.
  • the present application provides a relationship diagram processing method, as shown in FIG. 1, including the following steps:
  • Step 100 Determine a plurality of core nodes in the relationship diagram to be simplified.
  • the core node is a virtual node formed by a node in a relationship diagram to be simplified or a cluster in a relationship diagram.
  • this step is to divide the relationship to be simplified into several feature areas.
  • the feature area may be a small area composed of core nodes, or may be divided into several clusters. Specifically, find some core nodes in the relationship diagram to be simplified or divide the relationship diagram to be simplified into several clusters.
  • the method for calculating the core node may include, but is not limited to, pagerank, k-core, and the like.
  • pagerank is an algorithm invented by Google to evaluate the importance of web pages. The principle can also be used to judge the centrality of points in a relational network; k-core is another algorithm for evaluating the central degree of the network's European midpoint. .
  • the simplified diagram is divided into three areas: cluster A, cluster B, and cluster C.
  • the core calculated from the graph to be simplified is displayed. Node A, core node B and core node C.
  • Step 101 Acquire multiple association relationships between core nodes.
  • the path between the core node A and the core node B, or between the cluster A and the cluster B, having no common internal vertices is called For independent rails.
  • the independent track expresses the relationship between two core nodes or clusters, and describes how the two core backbones are related, that is, the relationship information.
  • the line segment represented by the double line is an independent track between the cluster or the core node. Among them, the number of nodes included in an independent track is called the degree of this independent track, denoted by N.
  • N the independent track with less degree of independent track
  • Step 102 Perform similarity calculation on multiple association relationships between the core nodes, and obtain a virtual association relationship between the core nodes to obtain the virtual association relationship as a relationship between the core nodes in the relationship diagram to be simplified. .
  • the similarity of the association relationship can be calculated and aggregated through different dimensional relationships, for example, the time dimension, that is, the similarity is the simultaneous segment, and/or the relationship attribute dimension, that is, the similarity is the same attribute, and/or the behavior mode dimension. That is, the similarity is the peer.
  • the similarity of the calculated association relationship may be determined according to the relationship type of the independent track expressed by the association relationship. For example, the two independent tracks between the independent track A and the independent track B are travel related. It should be noted that the judgment of the similarity of different business scenarios is different, depending on different strategies. The technical solutions provided by the present application are easy to understand, and the specific policies are not used to limit the scope of protection of the present application, and details are not described herein again.
  • the aggregation may be performed according to the combination of the similarities of the association relationships calculated in the three dimensions, such as the indirect relationship of the same segment with the attribute, the indirect relationship of the same segment with the peer, and the like.
  • FIG. 3 is a schematic diagram of an embodiment of a method for aggregating similar independent tracks. As shown in FIG. 3, from top to bottom, the similarities are described in terms of the same attribute, the peer, and the simultaneous segment.
  • the first merge mode in FIG. 3 it is assumed that there are two independent tracks between the core node A and the core node B, and one of the independent tracks indicates that the device with the IMEI of #3eedf3ed passes the QQ number of 3443223 at the core node A.
  • the login with the core node B, the other independent track indicates that the device with the IMEI #3eedf3ed has logged in between the core node A and the core node B through the QQ number 2222222, and the two independent tracks exist on the same device.
  • the similarity of the same attribute of the login so it is aggregated into an independent track with the IMEI.
  • Another example As shown in the second merge mode in Figure 3, it is assumed that there are two independent tracks between cluster A and cluster B.
  • One of the independent tracks means: around 14:45 on March 15, 16th, flight CA1232 arrives, another An independent track indicates that the flight CA1232 arrived at around 14:45 on March 18th, 16th.
  • the two independent tracks have the similarity of the same flight as the flight, so they are aggregated into independent tracks of the same flight.
  • the aggregation manner provided by the present application greatly simplifies the indirect relationship between the core nodes or the clusters, so that the core nodes or clusters are between The connection is clearer.
  • the method of the present application further includes:
  • the virtual association relationship after the storage aggregation is as follows: the relationship between the independent track representation and the pre-aggregation similar relationship, such as the connection relationship represented by the independent track, that is, the storage of the associated virtual association relationship and the pre-aggregation association relationship .
  • Each aggregated generated edge that is, the aggregated independent track, contains a drilldown field in which the similar independent track connections it contains are stored.
  • the method of the present application further includes:
  • the selected aggregated virtual relationship is expanded, for example, as an independent track.
  • the edge of an aggregate that is, when the independent track is expanded, that is, the user wants to see which aggregated independent track clicks on which independent track is located, at this time, the aggregated independent track is read. All sub-independent tracks under the drilldown field, then display these sub-independent tracks.
  • This application uses the similarity between the core nodes or clusters in the relationship diagram to merge similar relationships to abstract new virtual associations, thus simplifying the complex relationship and highlighting the backbone. Thread.
  • the present application also provides a method for expanding the independent track after aggregation, realizing the local information expansion in the relationship diagram, so that the relationship between the two core nodes of Jiacheng and Bcheng is separated from the complex relationship diagram, and analyzed. It is also clearer and more convenient.
  • FIG. 4 is a schematic diagram of an embodiment of a simplified relationship diagram of the present application. As shown in FIG. 4, it is assumed that the feature area divided by the complex relationship diagram includes three core nodes: A City, B City, and C City. A complex indirect relationship is created between the three core nodes by train. Taking the relationship indicated by the thick solid line in Figure 4 as an example, it is shown that A and B are in the same section of the train HB4540 (A) and the train HB1590 (B). There are other similar simultaneous periods between A and B. Independent track of different trains.
  • FIG. 5 is a schematic diagram of an embodiment in which the independent rails in FIG. 4 are aggregated in the present application.
  • a complex indirect relationship is abstracted into a traveling relationship of simultaneous segments, as shown in FIG. 5, between core nodes.
  • the relationship has become simple and clear.
  • the correspondence between the aggregated independent track and the similar independent track connection relationship before the aggregation is stored.
  • the dotted line between the A city and the B city is the indirect relationship edge after the similar independent track is merged.
  • the data field corresponding to the edge describes all the sub-independent tracks it contains through the drilldown.
  • the corresponding relationship includes the first correspondence between the city and the city, and the second correspondence between the city and the city.
  • FIG. 6 is a schematic diagram of an embodiment of deploying an independent rail after aggregation in FIG. 5 in the present application, and it is assumed that an association relationship between the city and the city B needs to be expanded, as shown in FIG. 6 , according to the first correspondence relationship.
  • the aggregated independent tracks are shown in the diagram. In this way, the local information expansion in the relationship diagram is realized, so that the relationship between the two core nodes of Jiacheng and Bcheng is separated from the complex relationship diagram, and the analysis is also more clear and convenient.
  • the present application further provides a relationship diagram processing apparatus, including at least a memory and a processor, wherein the memory stores an executable instruction: determining a plurality of core nodes in a relationship diagram to be simplified, the core node is to be simplified a virtual node formed by a node in a relationship diagram or a relationship diagram; acquiring multiple association relationships between core nodes; performing similarity calculation on multiple association relationships between core nodes, and obtaining cores by aggregation A virtual association relationship between nodes, so that the obtained virtual association relationship is used as a relationship between core nodes in the relationship diagram to be simplified.
  • a relationship diagram processing apparatus including at least a memory and a processor, wherein the memory stores an executable instruction: determining a plurality of core nodes in a relationship diagram to be simplified, the core node is to be simplified a virtual node formed by a node in a relationship diagram or a relationship diagram; acquiring multiple association relationships between core nodes; performing similarity calculation on multiple association relationships between core nodes, and obtaining cores
  • FIG. 7 is a schematic structural diagram of a device for implementing a simplified relationship diagram of the present application. As shown in FIG. 7, the method includes at least: a dividing module, an acquiring module, and an aggregation module;
  • a dividing module configured to determine a plurality of core nodes in the relationship diagram to be simplified, the core node being a virtual node formed by a node in a relationship diagram or a cluster in a relationship diagram to be simplified;
  • An obtaining module configured to acquire multiple association relationships between core nodes
  • the aggregation module is configured to perform similarity calculation on multiple association relationships between the core nodes, and obtain a virtual association relationship between the core nodes to obtain the virtual association relationship between the core nodes in the relationship diagram to be simplified. Relationship.
  • the aggregation module is specifically configured to: calculate a similarity of the association relationship by using a relationship of different dimensions, and perform aggregation to obtain the virtual association relationship.
  • the device of the present application further includes: a storage module, configured to correspondingly store the associated relationship between the aggregated virtual association relationship and the pre-aggregation.
  • the device of the present application further includes:
  • the expansion module is configured to trigger the virtual association relationship after the aggregation, and expand the selected virtual relationship after the aggregation according to the association relationship corresponding to the virtual association relationship.
  • the expansion module is configured to: read all the pre-aggregation association relationships corresponding to the aggregated virtual association relationship, and display the read association relationship.
  • the present application utilizes the similarity between the indirect relationships between the core nodes or the clusters in the relationship diagram, and merges the similar relationships to abstract new virtual correspondences, thereby simplifying the complex relationship diagram and highlighting the backbone. Thread.
  • the present application further provides a solution for unfolding the aggregated relationship information, and implements local information expansion in the relationship diagram, so that the relationship between the two core nodes of the city and the city is separated from the complex relationship diagram.
  • the analysis is also more clear and convenient.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A relationship diagram processing method and apparatus. The method comprises: determining a plurality of core nodes in a relationship diagram to be simplified, the core nodes being nodes in the relationship diagram to be simplified or virtual nodes constituted by clusters in the relationship diagram (100); obtaining a plurality of association relationships between the core nodes (101); and calculating the similarity of the plurality of association relationships between the core nodes, and performing aggregation to obtain virtual association relationships between the core nodes, so as to use the obtained virtual association relationships as the relationships between the core nodes in the relationship diagram to be simplified (102). In the method, by using the similarity between indirect relationships of core nodes or clusters in a relationship diagram, similar relationships are combined, so as to abstract new virtual corresponding relationships, thereby simplifying the complex relationship diagram, and highlighting the backbone.

Description

一种关系图处理方法及装置Method and device for processing relationship diagram
本申请要求2017年06月16日递交的申请号为201710459378.1、发明名称为“一种关系图处理方法及装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。The present application claims priority to Chinese Patent Application No. PCT Application No. No. No. No. No. No. No. No. No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No No
技术领域Technical field
本申请涉及图数据处理技术,尤指一种关系图处理方法及装置。The present application relates to a data processing technology, and more particularly to a relationship processing method and apparatus.
背景技术Background technique
随着互联网数据的快速膨胀,在很多领域都产生了很多大并且复杂的图数据,如社交网络等。为了简化这种图数据的复杂关系图,现有技术中,都是从节点入手进行简化,常见的方法有:利用节点之间的相似性进行聚合,从而达到对复杂关系图的简化;或者裁剪复杂关系图中无关的叶子节点。With the rapid expansion of Internet data, many large and complex graph data, such as social networks, have been produced in many fields. In order to simplify the complex relationship diagram of the graph data, in the prior art, the simplification is started from the node, and the common methods are: using the similarity between the nodes to perform aggregation, thereby simplifying the complex relationship graph; or cutting Unrelated leaf nodes in complex diagrams.
其中,从节点入手进行聚合的方法,主要是对数据类型相同的关系进行合并以达到简化复杂关系图的目的。一方面是从节点入手进行简化,对于复杂的关系图,由于节点繁多,实现起来必然费时费力;另一方面,仅根据数据类型进行合并,不能很好地体现出关联的两个节点之间的关系。Among them, the method of starting aggregation from a node is mainly to merge the relationships of the same data type to achieve the purpose of simplifying the complex relationship diagram. On the one hand, it is simplified from the node. For complex diagrams, due to the large number of nodes, it will take time and effort to implement it. On the other hand, the combination only according to the data type can not reflect the relationship between the two nodes. relationship.
上述两种相关技术的方法,虽然能让复杂关系图中的核心节点或者集群更加突出,尤其对于核心骨干之间间接关系较为简单的图数据比较有效。但是,对于到核心骨干之间关系复杂的复杂关系图,这些方法的简化效果就差强人意了。Although the above two related technologies can make the core nodes or clusters in the complex relationship diagram more prominent, especially the graph data with relatively simple indirect relationship between the core backbones is effective. However, for complex relationships with complex relationships between the core backbones, the simplification of these methods is less than satisfactory.
发明内容Summary of the invention
为了解决上述技术问题,本申请提供了一种关系图处理方法及装置,能够将复杂关系图化繁为简,有关联的突出主干脉络。In order to solve the above technical problem, the present application provides a method and a device for processing a relationship diagram, which can simplify the complex relationship diagram and have an associated prominent backbone.
为了达到本申请目的,本申请提供了一种关系图处理方法,包括:In order to achieve the purpose of the present application, the present application provides a relationship diagram processing method, including:
确定待简化的关系图中的多个核心节点,该核心节点为待简化的关系图中的节点或关系图中的集群所构成的虚拟节点;Determining a plurality of core nodes in the relationship diagram to be simplified, the core node being a virtual node formed by a node in a relationship diagram to be simplified or a cluster in a relationship diagram;
获取各核心节点之间的多个关联关系;Obtaining multiple association relationships between core nodes;
对各核心节点之间的多个关联关系进行相似度计算,聚合得到各核心节点之间的虚拟关联关系,以将得到的虚拟关联关系作为待简化关系图中核心节点之间的关系。The similarity calculation is performed on multiple association relationships between the core nodes, and the virtual association relationship between the core nodes is obtained by aggregation, so that the obtained virtual association relationship is used as the relationship between the core nodes in the relationship diagram to be simplified.
可选地,所述方法还包括:对应存储所述聚合后的虚拟关联关系与所述聚合前的关联关系。Optionally, the method further includes: storing the associated relationship between the aggregated virtual association relationship and the pre-aggregation.
可选地,所述方法还包括:当触发所述聚合后的虚拟关联关系,根据所述虚拟关联关系对应的所属关联关系,展开选定的聚合后的虚拟关联关系。Optionally, the method further includes: when the virtual association relationship after the aggregation is triggered, and expanding the selected virtual association relationship according to the associated association relationship corresponding to the virtual association relationship.
可选地,所述展开选定的聚合后的虚拟关联关系包括:Optionally, the expanding the selected aggregated virtual association relationship includes:
读取所述聚合后的虚拟关联关系对应的所有所述聚合前的关联关系,并显示读取到的所述关联关系。Reading all the pre-aggregation association relationships corresponding to the aggregated virtual association relationship, and displaying the read association relationship.
可选地,所述对各核心节点之间的多个关联关系进行相似度计算,聚合得到各核心节点之间的虚拟关联关系包括:Optionally, the similarity calculation is performed on multiple association relationships between the core nodes, and the virtual association relationship between the core nodes is obtained by the aggregation:
通过不同维度的关系计算所述关联关系的相似性并进行聚合得到所述虚拟关联关系。Calculating the similarity of the association relationship through different dimensional relationships and performing aggregation to obtain the virtual association relationship.
可选地,所述不同维度包括以下任意组合:时间维度、关系属性维度、行为模式维度。Optionally, the different dimensions include any combination of the following: a time dimension, a relationship attribute dimension, and a behavior mode dimension.
本申请还提供了一种实现关系图装置,包括:划分模块、获取模块,聚合模块;其中,The application also provides an implementation diagram device, including: a division module, an acquisition module, and an aggregation module; wherein
划分模块,用于确定待简化的关系图中的多个核心节点,该核心节点为待简化的关系图中的节点或关系图中的集群所构成的虚拟节点;a dividing module, configured to determine a plurality of core nodes in the relationship diagram to be simplified, the core node being a virtual node formed by a node in a relationship diagram or a cluster in a relationship diagram to be simplified;
获取模块,用于获取各核心节点之间的多个关联关系;An obtaining module, configured to acquire multiple association relationships between core nodes;
聚合模块,用于对各核心节点之间的多个关联关系进行相似度计算,聚合得到各核心节点之间的虚拟关联关系,以将得到的虚拟关联关系作为待简化关系图中核心节点之间的关系。The aggregation module is configured to perform similarity calculation on multiple association relationships between the core nodes, and obtain a virtual association relationship between the core nodes to obtain the virtual association relationship between the core nodes in the relationship diagram to be simplified. Relationship.
可选地,所述装置还包括:Optionally, the device further includes:
存储模块,用于对应存储所述聚合后的虚拟关联关系与所述聚合前的关联关系;a storage module, configured to correspondingly store the associated virtual association relationship and the pre-aggregation association relationship;
展开模块,用于当触发所述聚合后的虚拟关联关系,根据所述虚拟关联关系对应的所属关联关系,展开选定的聚合后的虚拟关联关系。The expansion module is configured to trigger the virtual association relationship after the aggregation, and expand the selected virtual relationship after the aggregation according to the association relationship corresponding to the virtual association relationship.
可选地,所述展开模块具体用于:读取所述聚合后的虚拟关联关系对应的所有所述聚合前的关联关系,并显示读取到的所述关联关系。Optionally, the expansion module is configured to: read all the pre-aggregation association relationships corresponding to the aggregated virtual association relationship, and display the read association relationship.
本申请又提供了一种关系图处理装置,包括存储器和处理器,其中,存储器中存储有以下可执行指令:确定待简化的关系图中的多个核心节点,该核心节点为待简化的关系图中的节点或关系图中的集群所构成的虚拟节点;获取各核心节点之间的多个关联关 系;对各核心节点之间的多个关联关系进行相似度计算,聚合得到各核心节点之间的虚拟关联关系,以将得到的虚拟关联关系作为待简化关系图中核心节点之间的关系。The present application further provides a relationship diagram processing apparatus, including a memory and a processor, wherein the memory stores an executable instruction: determining a plurality of core nodes in a relationship diagram to be simplified, the core node being a relationship to be simplified a virtual node formed by a node in the graph or a cluster in the relationship graph; acquiring multiple association relationships between the core nodes; performing similarity calculation on multiple association relationships between the core nodes, and obtaining the core nodes by aggregation The virtual association relationship between the virtual associations is taken as the relationship between the core nodes in the relationship diagram to be simplified.
本申请提供的方案包括:确定待简化的关系图中的多个核心节点,该核心节点为待简化的关系图中的节点或关系图中的集群所构成的虚拟节点;获取各核心节点之间的多个关联关系;对各核心节点之间的多个关联关系进行相似度计算,聚合得到各核心节点之间的虚拟关联关系,以将得到的虚拟关联关系作为待简化关系图中核心节点之间的关系。本申请利用关系图中核心节点之间的关联关系间的相似性,对相似的关系进行了合并,以抽象出新的虚拟关联关系,从而将复杂关系图化繁为简,突出了主干脉络。The solution provided by the present application includes: determining a plurality of core nodes in a relationship diagram to be simplified, the core node being a virtual node formed by a node in a relationship diagram or a relationship diagram to be simplified; acquiring between each core node Multiple association relationships; similarity calculations are performed on multiple association relationships between core nodes, and the virtual association relationship between the core nodes is obtained by aggregation, so that the obtained virtual association relationship is regarded as the core node in the relationship diagram to be simplified. Relationship between. The present application utilizes the similarity between the core nodes in the relationship diagram to merge similar relationships to abstract new virtual association relationships, thereby simplifying the complex relationship diagram and highlighting the backbone context.
进一步地,本申请还提供了展开聚合后的独立轨的方法,实现了关系图中的局部信息展开,使得两个节点之间或者两个集群之间的关系从复杂关系图中剥离出来,分析起来也更清晰方便了。Further, the present application further provides a method for expanding an aggregated independent track, which implements local information expansion in a relationship diagram, so that the relationship between two nodes or between two clusters is separated from the complex relationship diagram, and analyzed. It is also clearer and more convenient.
本申请的其它特征和优点将在随后的说明书中阐述,并且,部分地从说明书中变得显而易见,或者通过实施本申请而了解。本申请的目的和其他优点可通过在说明书、权利要求书以及附图中所特别指出的结构来实现和获得。Other features and advantages of the present application will be set forth in the description which follows. The objectives and other advantages of the present invention can be realized and obtained by the structure of the invention.
附图说明DRAWINGS
附图用来提供对本申请技术方案的进一步理解,并且构成说明书的一部分,与本申请的实施例一起用于解释本申请的技术方案,并不构成对本申请技术方案的限制。The drawings are used to provide a further understanding of the technical solutions of the present application, and constitute a part of the specification, which is used together with the embodiments of the present application to explain the technical solutions of the present application, and does not constitute a limitation of the technical solutions of the present application.
图1为本申请关系图处理方法的流程图;1 is a flow chart of a method for processing a relationship diagram of the present application;
图2(a)为本申请中将待简化复杂关系图划分为若干集群的实施例的示意图;2(a) is a schematic diagram of an embodiment of dividing a complex relationship diagram to be simplified into a plurality of clusters in the present application;
图2(b)为本申请中将待简化复杂关系图为划分为若干核心节点的实施例的示意图;2(b) is a schematic diagram of an embodiment in which a complex relationship diagram to be simplified is divided into a plurality of core nodes in the present application;
图3为本申请聚合相似的独立轨的实施例的示意图;3 is a schematic diagram of an embodiment of a similar independent rail of the present application;
图4为本申请简化后的关系图的实施例的示意图;4 is a schematic diagram of an embodiment of a simplified relationship diagram of the present application;
图5为本申请对图4中的独立轨进行聚合后的实施例的示意图;FIG. 5 is a schematic diagram of an embodiment of the present application after the independent rails of FIG. 4 are aggregated; FIG.
图6为本申请中对图5中聚合后的某独立轨进行展开的实施例的示意图;6 is a schematic diagram of an embodiment of developing an independent rail after aggregation in FIG. 5 in the present application;
图7为本申请实现关系图简化的装置的组成结构示意图。FIG. 7 is a schematic structural diagram of a device for implementing a simplified relationship diagram of the present application.
具体实施方式detailed description
为使本申请的目的、技术方案和优点更加清楚明白,下文中将结合附图对本申请的实施例进行详细说明。需要说明的是,在不冲突的情况下,本申请中的实施例及实施例 中的特征可以相互任意组合。In order to make the objects, technical solutions and advantages of the present application more clear, the embodiments of the present application will be described in detail below with reference to the accompanying drawings. It should be noted that the embodiments in the present application and the features in the embodiments may be arbitrarily combined with each other without conflict.
在本申请一个典型的配置中,计算设备包括一个或多个处理器(CPU)、输入/输出接口、网络接口和内存。In a typical configuration of the present application, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
内存可能包括计算机可读介质中的非永久性存储器,随机存取存储器(RAM)和/或非易失性内存等形式,如只读存储器(ROM)或闪存(flash RAM)。内存是计算机可读介质的示例。The memory may include non-persistent memory, random access memory (RAM), and/or non-volatile memory in a computer readable medium, such as read only memory (ROM) or flash memory. Memory is an example of a computer readable medium.
计算机可读介质包括永久性和非永久性、可移动和非可移动媒体可以由任何方法或技术来实现信息存储。信息可以是计算机可读指令、数据结构、程序的模块或其他数据。计算机的存储介质的例子包括,但不限于相变内存(PRAM)、静态随机存取存储器(SRAM)、动态随机存取存储器(DRAM)、其他类型的随机存取存储器(RAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、快闪记忆体或其他内存技术、只读光盘只读存储器(CD-ROM)、数字多功能光盘(DVD)或其他光学存储、磁盒式磁带,磁带磁磁盘存储或其他磁性存储设备或任何其他非传输介质,可用于存储可以被计算设备访问的信息。按照本文中的界定,计算机可读介质不包括非暂存电脑可读媒体(transitory media),如调制的数据信号和载波。Computer readable media includes both permanent and non-persistent, removable and non-removable media. Information storage can be implemented by any method or technology. The information can be computer readable instructions, data structures, modules of programs, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read only memory. (ROM), electrically erasable programmable read only memory (EEPROM), flash memory or other memory technology, compact disk read only memory (CD-ROM), digital versatile disk (DVD) or other optical storage, Magnetic tape cartridges, magnetic tape storage or other magnetic storage devices or any other non-transportable media can be used to store information that can be accessed by a computing device. As defined herein, computer readable media does not include non-transitory computer readable media, such as modulated data signals and carrier waves.
在附图的流程图示出的步骤可以在诸如一组计算机可执行指令的计算机系统中执行。并且,虽然在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤。The steps illustrated in the flowchart of the figures may be executed in a computer system such as a set of computer executable instructions. Also, although logical sequences are shown in the flowcharts, in some cases the steps shown or described may be performed in a different order than the ones described herein.
当图数据比较大时,在复杂关系图上展示的核心节点或者集群之间的联系需要通过多层间接关系才能关联起来。为了将复杂关系图化繁为简,有关联的突出主干脉络,本申请提供一种关系图处理方法,如图1所示,包括以下步骤:When the graph data is relatively large, the connections between the core nodes or clusters displayed on the complex graph need to be related through multiple indirect relationships. In order to simplify the complex relationship diagram and to associate with the prominent backbone, the present application provides a relationship diagram processing method, as shown in FIG. 1, including the following steps:
步骤100:确定待简化的关系图中的多个核心节点。Step 100: Determine a plurality of core nodes in the relationship diagram to be simplified.
其中,核心节点为待简化的关系图中的节点或关系图中的集群所构成的虚拟节点。The core node is a virtual node formed by a node in a relationship diagram to be simplified or a cluster in a relationship diagram.
换句话说,本步骤就是要将将待简化关系图划分为若干特征区域。特征区域可以是由核心节点构成的若干小区域,也可以是划分出的若干集群。具体地,找出待简化关系图中的若干核心节点或者将待简化关系图划分成若干集群。In other words, this step is to divide the relationship to be simplified into several feature areas. The feature area may be a small area composed of core nodes, or may be divided into several clusters. Specifically, find some core nodes in the relationship diagram to be simplified or divide the relationship diagram to be simplified into several clusters.
其中,将关系图划分成若干集群有很多方法,比如:社区发现方法如LPA(Label Propagation Algorithm)、SLPA(Speaker-listener Label Propagation Algorithm)等;再如:基于均衡多标签传播的重叠社团发现算法如BMLPA(Multiplex Ligation-dependent Probe Amplification);又如:社团划分算法如Fast Unfolding等。Among them, there are many methods for dividing the relationship diagram into several clusters, such as: community discovery methods such as LPA (Label Propagation Algorithm), SLPA (Speaker-listener Label Propagation Algorithm), etc.; another example: overlapping community discovery algorithm based on balanced multi-tag propagation Such as BMLPA (Multiplex Ligation-dependent Probe Amplification); another example: community partitioning algorithms such as Fast Unfolding.
其中,计算核心节点的方法可以包括但不限于:pagerank,k-core等。其中,pagerank是google发明的一种评估网页重要程度的算法,其原理也可以用来判断关系网络中点的中心程度;k-core是另一种评估关系网路欧中点的中心程度的算法。The method for calculating the core node may include, but is not limited to, pagerank, k-core, and the like. Among them, pagerank is an algorithm invented by Google to evaluate the importance of web pages. The principle can also be used to judge the centrality of points in a relational network; k-core is another algorithm for evaluating the central degree of the network's European midpoint. .
如图2(a)所示,展示了待简化关系图被划分成集群A,集群B和集群C三个区域,如图2(b)所示,显示了从待简化关系图计算出的核心节点A,核心节点B和核心节点C。As shown in Figure 2(a), the simplified diagram is divided into three areas: cluster A, cluster B, and cluster C. As shown in Figure 2(b), the core calculated from the graph to be simplified is displayed. Node A, core node B and core node C.
步骤101:获取各核心节点之间的多个关联关系。Step 101: Acquire multiple association relationships between core nodes.
如图2(a)或图2(b)所示,以顶点A和顶点B为例,核心节点A与核心节点B,或者集群A与集群B之间的、没有公共内部顶点的路径互称为独立轨。独立轨表达了两个核心节点或者集群之间的关联关系,描述了两个核心骨干是如何关联起来的,也就是关联关系信息。如图2(a)或图2(b)所示,双线表示的线段即为集群或者核心节点之间的独立轨。其中,一条独立轨中所包含的节点数量称为这条独立轨的度,用N表示。实际应用中,往往关注较近关系的独立轨,也就是独立轨的度较小的独立轨,比如N<=2等。N的取值取决于应用场景,比如:在同行场景中,根据经验N的值取2是比较合适的,这里涵盖了两个人可能共乘一趟车的情况(N=1),也涵盖了用户在同一时段通过不同的车次到达同一个地方的情况(N=2);再如:在同住的场景中,N的取值为1比较合适,因为同住往往代表着同住在一个地点如酒店等。As shown in FIG. 2(a) or FIG. 2(b), taking the vertex A and the vertex B as an example, the path between the core node A and the core node B, or between the cluster A and the cluster B, having no common internal vertices is called For independent rails. The independent track expresses the relationship between two core nodes or clusters, and describes how the two core backbones are related, that is, the relationship information. As shown in Fig. 2(a) or Fig. 2(b), the line segment represented by the double line is an independent track between the cluster or the core node. Among them, the number of nodes included in an independent track is called the degree of this independent track, denoted by N. In practical applications, it is often concerned with the independent track of the closer relationship, that is, the independent track with less degree of independent track, such as N<=2. The value of N depends on the application scenario. For example, in a peer scenario, it is appropriate to take 2 according to the value of experience N. This covers the case where two people may share a car (N=1), and also covers The situation where the user arrives at the same place through different trains at the same time (N=2); for example: in the scene of living together, the value of N is more appropriate, because living together often means living in one place. Such as hotels.
如何获取独立轨,在图论中有很多算法,其具体算法并不用于限定本发明的保护范围,这里不再赘述。How to obtain an independent track, there are many algorithms in the graph theory, and the specific algorithm is not used to limit the protection scope of the present invention, and details are not described herein again.
步骤102:对各核心节点之间的多个关联关系进行相似度计算,聚合得到各核心节点之间的虚拟关联关系,以将得到的虚拟关联关系作为待简化关系图中核心节点之间的关系。Step 102: Perform similarity calculation on multiple association relationships between the core nodes, and obtain a virtual association relationship between the core nodes to obtain the virtual association relationship as a relationship between the core nodes in the relationship diagram to be simplified. .
本步骤中,可以通过不同维度的关系计算关联关系的相似性并进行聚合,比如:时间维度即相似性是同时段、和/或关系属性维度即相似性是同属性、和/或行为模式维度即相似性是同行为。In this step, the similarity of the association relationship can be calculated and aggregated through different dimensional relationships, for example, the time dimension, that is, the similarity is the simultaneous segment, and/or the relationship attribute dimension, that is, the similarity is the same attribute, and/or the behavior mode dimension. That is, the similarity is the peer.
其中,计算关联关系的相似性可以根据关联关系所表达的独立轨的关系类型进行相似性判断,例如独立轨A和独立轨B之间的两条独立轨都是出行相关。需要说明的是,不同业务场景相似性的判断是不一样的,取决于不同的策略。基于本申请提供的技术方案是容易理解的,具体策略并不用于限定本申请的保护范围,这里不再赘述。The similarity of the calculated association relationship may be determined according to the relationship type of the independent track expressed by the association relationship. For example, the two independent tracks between the independent track A and the independent track B are travel related. It should be noted that the judgment of the similarity of different business scenarios is different, depending on different strategies. The technical solutions provided by the present application are easy to understand, and the specific policies are not used to limit the scope of protection of the present application, and details are not described herein again.
本申请中,可以根据三个维度计算出的关联关系的相似性的组合进行聚合,比如聚合同时段同属性的间接关系、聚合同时段同行为的间接关系,等等。In the present application, the aggregation may be performed according to the combination of the similarities of the association relationships calculated in the three dimensions, such as the indirect relationship of the same segment with the attribute, the indirect relationship of the same segment with the peer, and the like.
图3为本申请聚合相似的独立轨的实施例的示意图,如图3所示,从上至下分别是按照同属性、同行为和同时段三个方面描述的相似关系的合并。FIG. 3 is a schematic diagram of an embodiment of a method for aggregating similar independent tracks. As shown in FIG. 3, from top to bottom, the similarities are described in terms of the same attribute, the peer, and the simultaneous segment.
比如:如图3中的第一个合并方式,假设核心节点A与核心节点B之间存在两条独立轨,其中一条独立轨表示:IMEI为#3eedf3ed的设备通过QQ号码为3443223在核心节点A与核心节点B之间进行过登录,另一条独立轨表示:IMEI为#3eedf3ed的设备通过QQ号码为2222222在核心节点A与核心节点B之间进行过登录,两条独立轨存在采用同一设备进行登录的同属性的相似性,因此聚合为同IMEI的独立轨。For example, as shown in the first merge mode in FIG. 3, it is assumed that there are two independent tracks between the core node A and the core node B, and one of the independent tracks indicates that the device with the IMEI of #3eedf3ed passes the QQ number of 3443223 at the core node A. The login with the core node B, the other independent track indicates that the device with the IMEI #3eedf3ed has logged in between the core node A and the core node B through the QQ number 2222222, and the two independent tracks exist on the same device. The similarity of the same attribute of the login, so it is aggregated into an independent track with the IMEI.
再如:如图3中的第二个合并方式,假设集群A与集群B之间存在两条独立轨,其中一条独立轨表示:16年3月15日14:45左右,航班CA1232到达,另一条独立轨表示:16年3月18日14:45左右,航班CA1232到达,两条独立轨存在同航班飞行的同行为的相似性,因此聚合为同航班飞行的独立轨。Another example: As shown in the second merge mode in Figure 3, it is assumed that there are two independent tracks between cluster A and cluster B. One of the independent tracks means: around 14:45 on March 15, 16th, flight CA1232 arrives, another An independent track indicates that the flight CA1232 arrived at around 14:45 on March 18th, 16th. The two independent tracks have the similarity of the same flight as the flight, so they are aggregated into independent tracks of the same flight.
又如:如图3中的第三个合并方式,假设集群A与集群B之间存在两条独立轨,其中一条独立轨表示:16年3月15日14:45左右,航班G124到达,另一条独立轨表示:16年3月15日14:45左右,航班MU1122到达,两条独立轨存在同时段到达的同时段的相似性,因此聚合为同时段到达的独立轨。Another example: As shown in the third merge mode in Figure 3, it is assumed that there are two independent tracks between cluster A and cluster B. One of the independent tracks means: around 14:45 on March 15, 16th, flight G124 arrives, and another An independent track indicates that the flight MU1122 arrives at around 14:45 on March 15th, 16th. The two independent tracks have the similarity of the simultaneous segments arriving at the same time, so they are aggregated into independent tracks that arrive at the same time.
当核心节点或集群之间如图3所示的相似关系很多时,通过本申请提供的聚合方式,很大程度上简化了核心节点或集群之间的间接关系,使得核心节点或集群之间的联系更加清晰了。When there are many similar relationships between the core nodes or the clusters as shown in FIG. 3, the aggregation manner provided by the present application greatly simplifies the indirect relationship between the core nodes or the clusters, so that the core nodes or clusters are between The connection is clearer.
可选地,本申请方法还包括:Optionally, the method of the present application further includes:
存储聚合后的虚拟关联关系如以独立轨表现与聚合前各相似关联关系如以独立轨表现的连接关系的对应关系,即对应存储所述聚合后的虚拟关联关系与所述聚合前的关联关系。每一个聚合后生成的边即聚合后的独立轨都含有一个drilldown字段,该drilldown字段中存储了其包含的相似独立轨连接。The virtual association relationship after the storage aggregation is as follows: the relationship between the independent track representation and the pre-aggregation similar relationship, such as the connection relationship represented by the independent track, that is, the storage of the associated virtual association relationship and the pre-aggregation association relationship . Each aggregated generated edge, that is, the aggregated independent track, contains a drilldown field in which the similar independent track connections it contains are stored.
按照本申请独立轨之间的相似性合并简化后的关系图,实现了清晰展示核心节点之间的关联关系。但是,关系图分析时依然可能需要查看某两两核心节点之间的详细关联关系。可选地,本申请方法还包括:According to the similarity between the independent tracks of the present application and the simplified relationship diagram, the relationship between the core nodes is clearly displayed. However, it is still possible to view the detailed association between two core nodes when the graph is analyzed. Optionally, the method of the present application further includes:
根据已存储的对应关系,展开选定的聚合后的虚拟关联关系如以独立轨表现。比如,当触发某个聚合后的边即独立轨展开时,也即用户想查看哪个聚合后的独立轨就点击哪个独立轨所在的边,此时,会读取到该聚合后的独立轨的drilldown字段下的所有子独立轨,然后将这些子独立轨显示出来即可。According to the stored correspondence, the selected aggregated virtual relationship is expanded, for example, as an independent track. For example, when the edge of an aggregate is triggered, that is, when the independent track is expanded, that is, the user wants to see which aggregated independent track clicks on which independent track is located, at this time, the aggregated independent track is read. All sub-independent tracks under the drilldown field, then display these sub-independent tracks.
本申请利用关系图中核心节点或集群之间的关联关系间的相似性,对相似的关系进行了合并,以抽象出新的虚拟关联关系,从而将复杂关系图化繁为简,突出了主干脉络。This application uses the similarity between the core nodes or clusters in the relationship diagram to merge similar relationships to abstract new virtual associations, thus simplifying the complex relationship and highlighting the backbone. Thread.
进一步地,本申请还提供了展开聚合后的独立轨的方法,实现了关系图中的局部信息展开,使得甲城和乙城两个核心节点之间的关系从复杂关系图中剥离出来,分析起来也更清晰方便了。Further, the present application also provides a method for expanding the independent track after aggregation, realizing the local information expansion in the relationship diagram, so that the relationship between the two core nodes of Jiacheng and Bcheng is separated from the complex relationship diagram, and analyzed. It is also clearer and more convenient.
下面结合一个实施例来看看本申请中对独立轨的聚合和展开的实现。The implementation of the aggregation and expansion of the independent tracks in this application is examined below in conjunction with an embodiment.
图4为本申请简化后的关系图的实施例的示意图,如图4所示,假设复杂关系图划分出的特征区域包括三个核心节点:A城、B城和C城。三个核心节点之间通过火车产生了复杂的间接关系。以图4中的粗实线表示的关系为例,表示了甲和乙同时段坐了火车HB4540(甲)和火车HB1590(乙),甲和乙之间还有其他的相似的同时段坐过不同火车的独立关系轨。4 is a schematic diagram of an embodiment of a simplified relationship diagram of the present application. As shown in FIG. 4, it is assumed that the feature area divided by the complex relationship diagram includes three core nodes: A City, B City, and C City. A complex indirect relationship is created between the three core nodes by train. Taking the relationship indicated by the thick solid line in Figure 4 as an example, it is shown that A and B are in the same section of the train HB4540 (A) and the train HB1590 (B). There are other similar simultaneous periods between A and B. Independent track of different trains.
图5为本申请对图4中的独立轨进行聚合后的实施例的示意图,本实施例中,将复杂的间接关系抽象成了同时段的出行关系,如图5所示,核心节点之间的关系变得简单清晰了。这里,会存储聚合后的独立轨与聚合前各相似独立轨连接关系的对应关系,如图5所示,甲城和乙城之间的虚线边即为合并相似独立轨后的间接关系边,该边对应的数据字段中通过drilldown描述了其包含的所有子独立轨,这里对应关系包括甲城与乙城之间的第一对应关系、丙城与乙城之间的第二对应关系。FIG. 5 is a schematic diagram of an embodiment in which the independent rails in FIG. 4 are aggregated in the present application. In this embodiment, a complex indirect relationship is abstracted into a traveling relationship of simultaneous segments, as shown in FIG. 5, between core nodes. The relationship has become simple and clear. Here, the correspondence between the aggregated independent track and the similar independent track connection relationship before the aggregation is stored. As shown in FIG. 5, the dotted line between the A city and the B city is the indirect relationship edge after the similar independent track is merged. The data field corresponding to the edge describes all the sub-independent tracks it contains through the drilldown. The corresponding relationship includes the first correspondence between the city and the city, and the second correspondence between the city and the city.
图6为本申请中对图5中聚合后的某独立轨进行展开的实施例的示意图,假设需要展开甲城与乙城之间的关联关系,如图6所示,按照第一对应关系将聚合后的独立轨展现在关系图中。这样,实现了关系图中的局部信息展开,使得甲城和乙城两个核心节点之间的关系从复杂关系图中剥离出来,分析起来也更清晰方便了。FIG. 6 is a schematic diagram of an embodiment of deploying an independent rail after aggregation in FIG. 5 in the present application, and it is assumed that an association relationship between the city and the city B needs to be expanded, as shown in FIG. 6 , according to the first correspondence relationship. The aggregated independent tracks are shown in the diagram. In this way, the local information expansion in the relationship diagram is realized, so that the relationship between the two core nodes of Jiacheng and Bcheng is separated from the complex relationship diagram, and the analysis is also more clear and convenient.
本申请还提供一种用于关系图处理装置,至少包括存储器和处理器,其中,存储器中存储有以下可执行指令:确定待简化的关系图中的多个核心节点,该核心节点为待简化的关系图中的节点或关系图中的集群所构成的虚拟节点;获取各核心节点之间的多个关联关系;对各核心节点之间的多个关联关系进行相似度计算,聚合得到各核心节点之间的虚拟关联关系,以将得到的虚拟关联关系作为待简化关系图中核心节点之间的关系。The present application further provides a relationship diagram processing apparatus, including at least a memory and a processor, wherein the memory stores an executable instruction: determining a plurality of core nodes in a relationship diagram to be simplified, the core node is to be simplified a virtual node formed by a node in a relationship diagram or a relationship diagram; acquiring multiple association relationships between core nodes; performing similarity calculation on multiple association relationships between core nodes, and obtaining cores by aggregation A virtual association relationship between nodes, so that the obtained virtual association relationship is used as a relationship between core nodes in the relationship diagram to be simplified.
图7为本申请实现关系图简化的装置的组成结构示意图,如图7所示,至少包括:划分模块、获取模块,聚合模块;其中,FIG. 7 is a schematic structural diagram of a device for implementing a simplified relationship diagram of the present application. As shown in FIG. 7, the method includes at least: a dividing module, an acquiring module, and an aggregation module;
划分模块,用于确定待简化的关系图中的多个核心节点,该核心节点为待简化的关系图中的节点或关系图中的集群所构成的虚拟节点;a dividing module, configured to determine a plurality of core nodes in the relationship diagram to be simplified, the core node being a virtual node formed by a node in a relationship diagram or a cluster in a relationship diagram to be simplified;
获取模块,用于获取各核心节点之间的多个关联关系;An obtaining module, configured to acquire multiple association relationships between core nodes;
聚合模块,用于对各核心节点之间的多个关联关系进行相似度计算,聚合得到各核心节点之间的虚拟关联关系,以将得到的虚拟关联关系作为待简化关系图中核心节点之间的关系。The aggregation module is configured to perform similarity calculation on multiple association relationships between the core nodes, and obtain a virtual association relationship between the core nodes to obtain the virtual association relationship between the core nodes in the relationship diagram to be simplified. Relationship.
可选地,聚合模块具体用于:通过不同维度的关系计算所述关联关系的相似性并进行聚合得到所述虚拟关联关系。Optionally, the aggregation module is specifically configured to: calculate a similarity of the association relationship by using a relationship of different dimensions, and perform aggregation to obtain the virtual association relationship.
进一步地,本申请装置还包括:存储模块,用于对应存储所述聚合后的虚拟关联关系与所述聚合前的关联关系。Further, the device of the present application further includes: a storage module, configured to correspondingly store the associated relationship between the aggregated virtual association relationship and the pre-aggregation.
进一步地,本申请装置还包括:Further, the device of the present application further includes:
展开模块,用于当触发所述聚合后的虚拟关联关系,根据所述虚拟关联关系对应的所属关联关系,展开选定的聚合后的虚拟关联关系。The expansion module is configured to trigger the virtual association relationship after the aggregation, and expand the selected virtual relationship after the aggregation according to the association relationship corresponding to the virtual association relationship.
可选地,展开模块具体用于:读取所述聚合后的虚拟关联关系对应的所有所述聚合前的关联关系,并显示读取到的所述关联关系。Optionally, the expansion module is configured to: read all the pre-aggregation association relationships corresponding to the aggregated virtual association relationship, and display the read association relationship.
本申请利用关系图中核心节点或集群之间的间接关系间的相似性,对相似的关系进行了合并,以抽象出新的虚拟对应关系,从而将复杂关系图化繁为简,突出了主干脉络。The present application utilizes the similarity between the indirect relationships between the core nodes or the clusters in the relationship diagram, and merges the similar relationships to abstract new virtual correspondences, thereby simplifying the complex relationship diagram and highlighting the backbone. Thread.
进一步地,本申请还提供了展开聚合后的聚合关系信息的方案,实现了关系图中的局部信息展开,使得甲城和乙城两个核心节点之间的关系从复杂关系图中剥离出来,分析起来也更清晰方便了。Further, the present application further provides a solution for unfolding the aggregated relationship information, and implements local information expansion in the relationship diagram, so that the relationship between the two core nodes of the city and the city is separated from the complex relationship diagram. The analysis is also more clear and convenient.
虽然本申请所揭露的实施方式如上,但所述的内容仅为便于理解本申请而采用的实施方式,并非用以限定本申请。任何本申请所属领域内的技术人员,在不脱离本申请所揭露的精神和范围的前提下,可以在实施的形式及细节上进行任何的修改与变化,但本申请的专利保护范围,仍须以所附的权利要求书所界定的范围为准。The embodiments disclosed in the present application are as described above, but the description is only for the purpose of understanding the present application, and is not intended to limit the present application. Any modifications and changes in the form and details of the embodiments may be made by those skilled in the art without departing from the spirit and scope of the disclosure. The scope defined by the appended claims shall prevail.

Claims (10)

  1. 一种关系图处理方法,其特征在于,包括:A method for processing a relationship graph, comprising:
    确定待简化的关系图中的多个核心节点,该核心节点为待简化的关系图中的节点或关系图中的集群所构成的虚拟节点;Determining a plurality of core nodes in the relationship diagram to be simplified, the core node being a virtual node formed by a node in a relationship diagram to be simplified or a cluster in a relationship diagram;
    获取各核心节点之间的多个关联关系;Obtaining multiple association relationships between core nodes;
    对各核心节点之间的多个关联关系进行相似度计算,聚合得到各核心节点之间的虚拟关联关系,以将得到的虚拟关联关系作为待简化关系图中核心节点之间的关系。The similarity calculation is performed on multiple association relationships between the core nodes, and the virtual association relationship between the core nodes is obtained by aggregation, so that the obtained virtual association relationship is used as the relationship between the core nodes in the relationship diagram to be simplified.
  2. 根据权利要求1所述的关系图处理方法,其特征在于,所述方法还包括:对应存储所述聚合后的虚拟关联关系与所述聚合前的关联关系。The method of claim 1 , wherein the method further comprises: storing the associated virtual association relationship and the pre-aggregation association relationship.
  3. 根据权利要求2所述的关系图处理方法,其特征在于,所述方法还包括:当触发所述聚合后的虚拟关联关系,根据所述虚拟关联关系对应的所属关联关系,展开选定的聚合后的虚拟关联关系。The method of claim 2, wherein the method further comprises: when the aggregated virtual association relationship is triggered, and expanding the selected aggregation according to the associated association relationship corresponding to the virtual association relationship. After the virtual relationship.
  4. 根据权利要求3所述的关系图处理方法,其特征在于,所述展开选定的聚合后的虚拟关联关系包括:The method of processing a relationship diagram according to claim 3, wherein the expanding the selected aggregated virtual association relationship comprises:
    读取所述聚合后的虚拟关联关系对应的所有所述聚合前的关联关系,并显示读取到的所述关联关系。Reading all the pre-aggregation association relationships corresponding to the aggregated virtual association relationship, and displaying the read association relationship.
  5. 根据权利要求1、2或3所述的关系图处理方法,其特征在于,所述对各核心节点之间的多个关联关系进行相似度计算,聚合得到各核心节点之间的虚拟关联关系包括:The method for processing a relational graph according to claim 1, 2 or 3, wherein the similarity calculation is performed on a plurality of association relationships between the core nodes, and the virtual association relationship between the core nodes is obtained by aggregation. :
    通过不同维度的关系计算所述关联关系的相似性并进行聚合得到所述虚拟关联关系。Calculating the similarity of the association relationship through different dimensional relationships and performing aggregation to obtain the virtual association relationship.
  6. 根据权利要求5所述的关系图处理方法,其特征在于,所述不同维度包括以下任意组合:时间维度、关系属性维度、行为模式维度。The diagram processing method according to claim 5, wherein the different dimensions comprise any combination of the following: a time dimension, a relationship attribute dimension, and a behavior mode dimension.
  7. 一种关系图处理装置,其特征在于,包括:划分模块、获取模块,聚合模块;其中,A diagram processing device, comprising: a dividing module, an obtaining module, and an aggregation module; wherein
    划分模块,用于确定待简化的关系图中的多个核心节点,该核心节点为待简化的关系图中的节点或关系图中的集群所构成的虚拟节点;a dividing module, configured to determine a plurality of core nodes in the relationship diagram to be simplified, the core node being a virtual node formed by a node in a relationship diagram or a cluster in a relationship diagram to be simplified;
    获取模块,用于获取各核心节点之间的多个关联关系;An obtaining module, configured to acquire multiple association relationships between core nodes;
    聚合模块,用于对各核心节点之间的多个关联关系进行相似度计算,聚合得到各核心节点之间的虚拟关联关系,以将得到的虚拟关联关系作为待简化关系图中核心节点之间的关系。The aggregation module is configured to perform similarity calculation on multiple association relationships between the core nodes, and obtain a virtual association relationship between the core nodes to obtain the virtual association relationship between the core nodes in the relationship diagram to be simplified. Relationship.
  8. 根据权利要求7所述的关系图处理装置,其特征在于,所述装置还包括:The diagram processing device according to claim 7, wherein the device further comprises:
    存储模块,用于对应存储所述聚合后的虚拟关联关系与所述聚合前的关联关系;a storage module, configured to correspondingly store the associated virtual association relationship and the pre-aggregation association relationship;
    展开模块,用于当触发所述聚合后的虚拟关联关系,根据所述虚拟关联关系对应的所属关联关系,展开选定的聚合后的虚拟关联关系。The expansion module is configured to trigger the virtual association relationship after the aggregation, and expand the selected virtual relationship after the aggregation according to the association relationship corresponding to the virtual association relationship.
  9. 根据权利要求8所述的关系图处理装置,其特征在于,所述展开模块具体用于:读取所述聚合后的虚拟关联关系对应的所有所述聚合前的关联关系,并显示读取到的所述关联关系。The diagram processing device according to claim 8, wherein the expansion module is configured to: read all the pre-aggregation association relationships corresponding to the aggregated virtual association relationship, and display the read The association relationship.
  10. 一种关系图处理装置,包括存储器和处理器,其中,存储器中存储有以下可执行指令:确定待简化的关系图中的多个核心节点,该核心节点为待简化的关系图中的节点或关系图中的集群所构成的虚拟节点;获取各核心节点之间的多个关联关系;对各核心节点之间的多个关联关系进行相似度计算,聚合得到各核心节点之间的虚拟关联关系,以将得到的虚拟关联关系作为待简化关系图中核心节点之间的关系。A relationship diagram processing apparatus includes a memory and a processor, wherein the memory stores an executable instruction: determining a plurality of core nodes in a relationship diagram to be simplified, the core node being a node in a relationship diagram to be simplified or a virtual node formed by a cluster in a relationship graph; acquiring multiple association relationships between core nodes; performing similarity calculation on multiple association relationships between core nodes, and obtaining virtual association relationship between core nodes by aggregation To take the obtained virtual association relationship as the relationship between the core nodes in the relationship diagram to be simplified.
PCT/CN2018/090178 2017-06-16 2018-06-07 Relationship diagram processing method and apparatus WO2018228259A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710459378.1A CN109145178A (en) 2017-06-16 2017-06-16 A kind of relational graph processing method and processing device
CN201710459378.1 2017-06-16

Publications (1)

Publication Number Publication Date
WO2018228259A1 true WO2018228259A1 (en) 2018-12-20

Family

ID=64659549

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/090178 WO2018228259A1 (en) 2017-06-16 2018-06-07 Relationship diagram processing method and apparatus

Country Status (2)

Country Link
CN (1) CN109145178A (en)
WO (1) WO2018228259A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111177876B (en) * 2019-12-25 2023-06-20 支付宝(杭州)信息技术有限公司 Community discovery method and device and electronic equipment

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090262664A1 (en) * 2008-04-18 2009-10-22 Bonnie Berger Leighton Method for identifying network similarity by matching neighborhood topology
CN103383767A (en) * 2013-07-12 2013-11-06 西安交通大学 Tax evasion affiliated enterprise identification method based on taxpayer interest association network model
CN103580900A (en) * 2012-08-01 2014-02-12 上海宝信软件股份有限公司 Association analysis system based on event chains
CN104239500A (en) * 2014-09-10 2014-12-24 百度在线网络技术(北京)有限公司 Method and device for establishing health-care food associated knowledge base
CN105550307A (en) * 2015-12-14 2016-05-04 北京锐安科技有限公司 Method for generating net citizen identity relationship network graph
CN106161106A (en) * 2016-08-24 2016-11-23 北京奇艺世纪科技有限公司 A kind of node aggregation method and device being applied to oriented topological diagram

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8396827B2 (en) * 2006-12-29 2013-03-12 Sap Ag Relation-based hierarchy evaluation of recursive nodes
CN106612197B (en) * 2015-10-23 2020-09-22 阿里巴巴集团控股有限公司 Network thumbnail generation method and equipment
CN105426499A (en) * 2015-11-25 2016-03-23 成都数联铭品科技有限公司 Implementation method of data visualization
CN105786980B (en) * 2016-02-14 2019-12-20 广州神马移动信息科技有限公司 Method, device and equipment for merging different instances describing same entity

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090262664A1 (en) * 2008-04-18 2009-10-22 Bonnie Berger Leighton Method for identifying network similarity by matching neighborhood topology
CN103580900A (en) * 2012-08-01 2014-02-12 上海宝信软件股份有限公司 Association analysis system based on event chains
CN103383767A (en) * 2013-07-12 2013-11-06 西安交通大学 Tax evasion affiliated enterprise identification method based on taxpayer interest association network model
CN104239500A (en) * 2014-09-10 2014-12-24 百度在线网络技术(北京)有限公司 Method and device for establishing health-care food associated knowledge base
CN105550307A (en) * 2015-12-14 2016-05-04 北京锐安科技有限公司 Method for generating net citizen identity relationship network graph
CN106161106A (en) * 2016-08-24 2016-11-23 北京奇艺世纪科技有限公司 A kind of node aggregation method and device being applied to oriented topological diagram

Also Published As

Publication number Publication date
CN109145178A (en) 2019-01-04

Similar Documents

Publication Publication Date Title
US10496627B2 (en) Consistent ring namespaces facilitating data storage and organization in network infrastructures
US9665660B2 (en) Logical data flow mapping rules for (sub) graph isomorphism in a cluster computing environment
Özköse et al. Yesterday, today and tomorrow of big data
CN103678520B (en) A kind of multi-dimensional interval query method and its system based on cloud computing
JP6321681B2 (en) Method and apparatus for identifying website users
CN109033234B (en) Streaming graph calculation method and system based on state update propagation
US9372928B2 (en) System and method for parallel search on explicitly represented graphs
CN106844477B (en) To synchronous method after block catenary system, block lookup method and block chain
JP2017520047A (en) Data query method and apparatus
CN109936474B (en) Method and equipment for generating network topological graph
US20190097868A1 (en) Network management systems and methods
CN109918378A (en) A kind of remotely-sensed data storage method and storage system based on block chain
Verma et al. Big Data representation for grade analysis through Hadoop framework
CN111343241B (en) Graph data updating method, device and system
CN107621927B (en) Longitudinal extension method and device based on super-fusion system
US9754015B2 (en) Feature rich view of an entity subgraph
Aslay et al. Mining frequent patterns in evolving graphs
Deng et al. An efficient online direction-preserving compression approach for trajectory streaming data
US10585890B2 (en) Generation of query execution plans
WO2018228259A1 (en) Relationship diagram processing method and apparatus
Wu et al. Cloud-based connected component algorithm
US11250001B2 (en) Accurate partition sizing for memory efficient reduction operations
US11226993B2 (en) Method and apparatus for clustering data stream
CN109412149B (en) Power grid subgraph construction method based on regional division, topology analysis method and device
JP2020087455A (en) Recommendation generation method and recommendation generation device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18818722

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18818722

Country of ref document: EP

Kind code of ref document: A1