WO2021227130A1 - Heterogeneous network community detection method, device, computer apparatus, and storage medium - Google Patents

Info

Publication number
WO2021227130A1
WO2021227130A1 PCT/CN2020/092443 CN2020092443W WO2021227130A1 WO 2021227130 A1 WO2021227130 A1 WO 2021227130A1 CN 2020092443 W CN2020092443 W CN 2020092443W WO 2021227130 A1 WO2021227130 A1 WO 2021227130A1
Authority
WO
WIPO (PCT)
Prior art keywords
node
type
community
heterogeneous network
constraint
Prior art date
Application number
PCT/CN2020/092443
Other languages
French (fr)
Chinese (zh)
Inventor
王越
Original Assignee
深圳计算科学研究院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳计算科学研究院 filed Critical 深圳计算科学研究院
Publication of WO2021227130A1 publication Critical patent/WO2021227130A1/en

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90: Details of database functions independent of the retrieved data types
    • G06F16/901: Indexing; Data structures therefor; Storage structures
    • G06F16/9024: Graphs; Linked lists
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90: Details of database functions independent of the retrieved data types
    • G06F16/906: Clustering; Classification
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06Q: INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00: Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/01: Social networking

Definitions

  • This application relates to the field of heterogeneous networks, and in particular to a method, device, computer equipment, and storage medium for detecting a heterogeneous network community based on node constraints.
  • community detection is one of the more important steps.
  • the community detection algorithm can be used to search for a specific community structure.
  • the community structure based on k-core is more popular.
  • the community structure of the k-core is defined as follows: given a non-negative integer $k \ge 0$, the k-core of a network G is the largest subgraph $H_k$ such that every node $v \in H_k$ has degree at least k within $H_k$, i.e. $\deg_{H_k}(v) \ge k$.
  • Figure 1 is an example k-core community, where the subgraph {A,B,C,D} is a 3-core community, because in this subgraph each node has at least three neighbor nodes.
  • the graph {A,B,C,D,E,F,G,H,I} is a 1-core community, which consists of two connected subgraphs, {A,B,C,D,E,F,G} and {H,I}; each node in this graph has at least one neighbor node.
  • the larger the parameter k, the smaller the detected community tends to be, and the more tightly connected the nodes in the community are.
  • the k-core community is a community structure defined on homogeneous networks and cannot be applied to community detection in a heterogeneous network, because heterogeneous networks often have different types of nodes. In a heterogeneous graph, the k-core structure cannot be used directly to detect a community. Moreover, because a heterogeneous network contains different types of nodes, users often need different settings for different types of neighbor nodes. The definition of the k-core community has only one parameter k, so it cannot meet the needs of practical applications in this case.
  • Figure 2 is an example academic network, which contains three types of nodes: A: author, P: paper, and V: conference. Each node in the graph is marked with the type of the node.
  • edge A-P indicates that the author participated in writing the paper
  • edge P-V indicates that the paper was published in the conference.
  • suppose a user performs data-mining analysis on this heterogeneous network and wants to find an academic community of authors and papers in which each author has published at least 2 papers and each paper has at least 3 co-authors.
  • the dashed part in Figure 3 is an academic community in the heterogeneous network that meets the user's query needs.
  • the purpose of this application is to provide a heterogeneous network community detection method, device, computer equipment, and storage medium based on node constraints, aiming to solve the problem that the prior art offers neither community detection for multi-type nodes in heterogeneous networks nor a detection algorithm for such a community structure.
  • an embodiment of the present application provides a method for detecting a heterogeneous network community based on node constraints, which includes: presetting a triplet $\langle l_1, l_2, k \rangle$ to represent a type constraint s of the heterogeneous network G, where the triplet indicates that each node of type $l_1$ has at least k neighbor nodes of type $l_2$, with $l_1, l_2 \in L_G$, $k \ge 1$, and $L_G$ the set of node types of the heterogeneous network G;
  • if the type of a node v in the heterogeneous network G does not belong to the node type set $L_S$, or the node v violates at least one type constraint in the type constraint set S, the node v is determined to be an invalid node;
  • all invalid nodes in the heterogeneous network G are added to the to-be-deleted node set H and deleted, and each invalid node in H is paired with each of its neighbor nodes to form a message, which is added to the message queue Q;
  • an embodiment of the present application provides a heterogeneous network community detection device based on node constraints, which includes:
  • the preset unit is used to preset a type constraint s of the heterogeneous network G, expressed by the triplet $\langle l_1, l_2, k \rangle$, where the triplet indicates that each node of type $l_1$ has at least k neighbor nodes of type $l_2$, with $l_1, l_2 \in L_G$, $k \ge 1$, and $L_G$ the set of node types of the heterogeneous network G;
  • the second obtaining unit is configured to obtain, according to the type constraint set S, a node type set L S that meets the needs of the community;
  • the first judgment unit is used to judge whether the type of each node v in the heterogeneous network G belongs to the node type set $L_S$, and whether each node v satisfies each type constraint in the type constraint set S;
  • the second judgment unit is configured to determine the node v to be an invalid node if the type of the node v in the heterogeneous network G does not belong to the node type set $L_S$, or if the node v violates at least one type constraint in the type constraint set S;
  • the deleting unit is used to add all invalid nodes in the heterogeneous network G to the to-be-deleted node set H and delete them, to pair each invalid node in H with each of its neighbor nodes to form a message, and to add the message to the message queue Q;
  • the iterative judgment unit is used to iteratively judge, according to each type constraint in the type constraint set S, whether the messages in the message queue Q yield new invalid nodes, and if so, to delete the new invalid nodes so as to obtain the community that satisfies the query condition.
  • an embodiment of the present application provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and runnable on the processor; when the processor executes the computer program, it implements the heterogeneous network community detection method based on node constraints described in the first aspect above.
  • an embodiment of the present application also provides a computer-readable storage medium storing a computer program which, when executed by a processor, causes the processor to execute the heterogeneous network community detection method based on node constraints described in the first aspect above.
  • this method obtains the type constraint set S and the node type set $L_S$ that meet the community requirement, then uses S and $L_S$ to find and delete all invalid nodes in the heterogeneous network G, and finally obtains the community that satisfies the query condition.
  • the embodiments of the present application fulfill the needs of users for personalized community queries based on node constraints in a heterogeneous network.
  • Figure 1 is a schematic diagram of the k-core community provided by the prior art
  • Figure 2 is a schematic diagram of an academic network provided by the prior art
  • Figure 3 is a schematic diagram of the actual community that meets user needs in Figure 2;
  • Figure 4 is a schematic diagram of the 2-core community found based on the prior art
  • FIG. 5 is a schematic flowchart of a method for detecting a heterogeneous network community based on node constraints according to an embodiment of the application;
  • FIG. 6 is a schematic diagram of a sub-process of a method for detecting a heterogeneous network community based on node constraints according to an embodiment of the application;
  • FIG. 7 is a schematic diagram of another sub-process of the method for detecting a heterogeneous network community based on node constraints according to an embodiment of this application;
  • FIG. 8 is a schematic diagram of another sub-process of the method for detecting a heterogeneous network community based on node constraints according to an embodiment of the application;
  • FIG. 9 is a schematic diagram of a network for classifying and numbering nodes on an academic network according to an embodiment of the application.
  • FIG. 10 is a schematic block diagram of a heterogeneous network community detection device based on node constraints provided by an embodiment of the application;
  • FIG. 11 is a schematic block diagram of a computer device provided by an embodiment of the application.
  • as shown in FIG. 5, the method includes steps S501 to S507.
  • a heterogeneous network (or heterogeneous graph) can be represented by a four-tuple $G = \langle V_G, E_G, L_G, \phi \rangle$, where $V_G$ is the set of nodes; $E_G$ is the set of edges; $L_G$ is the set of types, covering the types of all nodes in the network; and $\phi$ is a function mapping each node of the network to a type in $L_G$: $\phi : V_G \to L_G$.
  • for a node set $H \subseteq V_G$, G[H] denotes the subgraph of G induced by H.
  • the set of nodes in this subgraph is H, and the set of edges is $(H \times H) \cap E_G$.
  • the neighbor set of a node v is $N_G(v) = \{u : (v,u) \in E_G\}$.
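The four-tuple above maps naturally onto a small data structure. The following is a minimal Python sketch, not the applicant's implementation: the class name `HeteroGraph`, its method names, and the toy node identifiers are all assumptions introduced for illustration. It stores the type mapping φ as a dictionary, the edges as adjacency sets, and exposes the neighbor set and the induced subgraph G[H].

```python
from dataclasses import dataclass, field


@dataclass
class HeteroGraph:
    """Hypothetical container for G = <V_G, E_G, L_G, phi>."""
    node_type: dict                              # phi: node -> type label
    adj: dict = field(default_factory=dict)      # node -> set of neighbours

    def add_edge(self, u, v):
        # Edges are undirected, so record both directions.
        self.adj.setdefault(u, set()).add(v)
        self.adj.setdefault(v, set()).add(u)

    def types(self):
        # L_G: the set of all node types present in the network.
        return set(self.node_type.values())

    def neighbors(self, v, node_type=None):
        # N_G(v), optionally restricted to neighbours of one type.
        ns = self.adj.get(v, set())
        if node_type is None:
            return set(ns)
        return {u for u in ns if self.node_type[u] == node_type}

    def induced_subgraph(self, nodes):
        # G[H]: node set H, edge set (H x H) intersected with E_G.
        nodes = set(nodes)
        sub = HeteroGraph({v: self.node_type[v] for v in nodes})
        for v in nodes:
            sub.adj[v] = self.adj.get(v, set()) & nodes
        return sub


# Toy graph (not taken from the figures): one author, one paper, one venue.
g = HeteroGraph({"a1": "A", "p1": "P", "v1": "V"})
g.add_edge("a1", "p1")
g.add_edge("p1", "v1")
sub = g.induced_subgraph({"a1", "p1"})
```

Adjacency sets keep neighbor lookups and the set intersection used for the induced subgraph simple; other representations would work equally well.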
  • each type of nodes may be connected or independent.
  • Each node in the heterogeneous network G has at least one neighbor node.
  • a community requirement that meets the user's query condition is found in a heterogeneous network according to the user's query condition, and the type constraint set S of the community requirement is obtained according to the community requirement.
  • the academic network shown in Figure 2 contains three types of nodes, namely: A (author), P (paper), and V (conference).
  • edge A-P means that the author participated in writing the paper
  • edge P-V means that the paper was published in the conference.
  • the type constraint set S is defined as the set of constraints that the community searched for by the user must satisfy, that is, all type constraints in S meet the community requirement; correspondingly, the type constraint set S also defines a set of node types $L_S$ related to the searched community, so the node type set $L_S$ also meets the community requirement.
  • $L_S = \{\, l \mid \dots$
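As a rough illustration of how the type constraint set S might be held in memory, the sketch below represents each triplet as a named tuple. The name `TypeConstraint` and the helper `node_types_of` are hypothetical. Because the definition of $L_S$ is cut off in the text, the helper assumes that $L_S$ contains every type named in some triplet of S; that reading is an assumption, not something the source confirms.

```python
from typing import NamedTuple


class TypeConstraint(NamedTuple):
    """Hypothetical encoding of a triplet <l1, l2, k>."""
    l1: str   # type of the constrained node
    l2: str   # type of the neighbours being counted
    k: int    # minimum number of such neighbours (k >= 1)


# The academic query from the text: every author has at least 2 papers,
# and every paper has at least 3 co-authors.
S = {TypeConstraint("A", "P", 2), TypeConstraint("P", "A", 3)}


def node_types_of(constraints):
    # ASSUMPTION: L_S is taken to be every type mentioned in any triplet.
    return {t for c in constraints for t in (c.l1, c.l2)}


L_S = node_types_of(S)
```

Under this assumption the conference type V is absent from `L_S` for the example query, so conference nodes would be treated as invalid.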
  • step S503 includes:
  • S504 Determine whether the type of each node v in the heterogeneous network G belongs to the node type set L S , and whether each node v satisfies each type constraint in the type constraint set S.
  • the node types of the academic network include: author A, paper P, and conference V.
  • the node v is determined to be an invalid node.
  • the invalid nodes in the heterogeneous network G are obtained according to the node type set $L_S$ and the type constraint set S, and these invalid nodes are added to the to-be-deleted node set H and deleted. Because deleting an invalid node affects the validity of its neighbor nodes, each invalid node in H is paired with each of its neighbor nodes to form a message, which is added to the message queue Q. The subsequent steps perform iterative judgment on the message queue Q.
  • S507 Iteratively judge whether a new invalid node is generated in the message queue Q according to each type constraint of the type constraint set S, and if so, delete the new invalid node, so as to obtain a community that satisfies the query condition.
  • after an invalid node is deleted, its neighbor nodes may become new invalid nodes, and the neighbors of a new invalid node may in turn be potential new invalid nodes. Therefore, in this embodiment, judgment and deletion are iterated until all invalid nodes are deleted.
  • the remaining nodes are valid nodes, and the subgraph composed of the remaining nodes is the community that satisfies S.
  • step S507 includes:
  • S703 Determine whether the neighbor node has been deleted from the heterogeneous network, if so, consume the corresponding message, if not, proceed to the next step;
  • This embodiment provides an algorithm that queries the community by iteratively deleting nodes.
  • the algorithm is based on a message-passing framework.
  • each message records a potentially invalid node u, because the deletion of a node v may affect the validity of its neighbor u;
  • the generated messages are stored in the message queue Q, and the messages in Q are consumed as the candidate set for detecting invalid nodes; the message queue Q therefore changes constantly during the iterations of the algorithm; when Q is empty, there are no more potentially invalid nodes and the algorithm ends.
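The iterative deletion described above can be sketched in Python as follows. This is an illustrative reading of the steps, not the applicant's code: the function name `detect_community`, the `(neighbor, deleted_node)` message layout, the choice of $L_S$ as every type named in some triplet, and the toy graph are all assumptions.

```python
from collections import deque


def detect_community(node_type, adj, constraints):
    """Return the nodes that survive iterative deletion of invalid nodes."""
    # ASSUMPTION: L_S is every type named in some triplet of S.
    allowed = {t for l1, l2, _ in constraints for t in (l1, l2)}
    alive = set(node_type)

    def valid(v):
        if node_type[v] not in allowed:
            return False
        for l1, l2, k in constraints:
            if node_type[v] == l1:
                n = sum(1 for u in adj.get(v, ())
                        if u in alive and node_type[u] == l2)
                if n < k:
                    return False
        return True

    queue, seen = deque(), set()

    def delete(v):
        alive.discard(v)
        for u in adj.get(v, ()):
            # Skip messages identical to ones already produced.
            if u in alive and (u, v) not in seen:
                seen.add((u, v))
                queue.append((u, v))

    # Initial pass: judge every node on the full graph, then delete.
    for v in [v for v in node_type if not valid(v)]:
        delete(v)

    # Consume messages until the queue is empty.
    while queue:
        u, _deleted = queue.popleft()
        if u in alive and not valid(u):
            delete(u)
    return alive


# Toy academic network (not Figure 2): authors a*, papers p*, venue v1.
edges = [("a1", "p1"), ("a2", "p1"), ("a3", "p1"),
         ("a1", "p2"), ("a2", "p2"), ("a3", "p2"), ("a5", "p2"),
         ("a1", "p3"), ("a4", "p3"), ("a5", "p3"), ("p1", "v1")]
adj = {}
for a, b in edges:
    adj.setdefault(a, set()).add(b)
    adj.setdefault(b, set()).add(a)
node_type = {v: {"a": "A", "p": "P", "v": "V"}[v[0]] for v in adj}
community = detect_community(node_type, adj, [("A", "P", 2), ("P", "A", 3)])
```

In this toy graph the deletions cascade: removing author a4 leaves paper p3 with two authors, removing p3 leaves author a5 with one paper, and removing a5 leaves the rest of the graph valid.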
  • step S507 further includes:
  • S802 Determine whether the new message is the same as a consumed or unconsumed message in the message queue; if not, add the new message to the message queue; if so, proceed to the next step.
  • when a neighbor node is deleted, each of its own neighbor nodes also becomes a potentially invalid node; the neighbor node and each of its own neighbor nodes form a new message, and the new message is consumed by the method above. A potentially invalid node may in turn have neighbor nodes that form further new messages, which are consumed in the same way; this is the iterative process of the algorithm, and once all messages are consumed, the community that satisfies the query condition is obtained.
  • the state of the message queue Q is obtained as:
  • Q = {(9,3),(2,3),(6,11),(12,6),(13,14),(1,2),(7,2),(7,6),(1,6)}: for the message (9,3), node 9 has 3 co-authors, so node 9 is a valid node; the message does not cause node 9 to be deleted, and the message is consumed.
  • Q = {(2,3),(6,11),(12,6),(13,14),(1,2),(7,2),(7,6),(1,6)}: for the message (2,3), node 2 is deleted, so the message generates the new potentially invalid nodes 1 and 7, and the message is consumed.
  • a new message is generated in the manner described above and it is determined whether to add it to the message queue; if it is added, the new message must in turn be judged.
  • Q = {(6,11),(12,6),(13,14),(1,2),(7,2),(7,6),(1,6)}: for the message (6,11), the message does not generate a new message about a potentially invalid node, and the message is consumed.
  • Q = {(12,6),(13,14),(1,2),(7,2),(7,6),(1,6)}: for the message (12,6), node 12 has two papers, so node 12 is a valid node; the message does not cause node 12 to be deleted, and the message is consumed.
  • for the message (13,14), node 13 has 3 co-authors, so node 13 is a valid node; the message does not cause node 13 to be deleted, and the message is consumed.
  • Q = {(1,2),(7,2),(7,6),(1,6)}: for the message (1,2), nodes 2 and 6 have been deleted, so node 1 has no neighbor nodes; the potentially invalid node 1 therefore becomes an invalid node and is deleted. Because node 1 has no neighbor nodes, deleting it does not generate a new potentially invalid node.
  • Q = {(7,2),(7,6),(1,6)}: for the message (7,2), nodes 2 and 6 have been deleted, so node 7 has no neighbor nodes; the potentially invalid node 7 therefore becomes an invalid node and is deleted. Because node 7 has no neighbor nodes, deleting it does not generate a new potentially invalid node.
  • Q = {(7,6),(1,6)}: for the message (7,6), node 7 has already been deleted, so there is no need to verify whether node 7 is valid, and the message is consumed.
  • once the message queue is empty, the remaining nodes in the academic network are valid nodes, and the subgraph formed by these valid nodes is the community that meets the query conditions.
  • step S507 further includes:
  • a heterogeneous network G there may be multiple connected subgraphs satisfying the type constraint set S, and each connected subgraph is connected by ineffective nodes.
  • Each connected subgraph of the heterogeneous network G can be used as Communities that meet the query conditions.
  • the acquiring all connected subgraphs that satisfy the query condition, and taking each connected subgraph as a community that satisfies the query condition further includes:
  • the nodes reserved in the heterogeneous network G are divided to obtain multiple connected subgraphs, and each connected subgraph is regarded as a community that satisfies the query condition.
  • ineffective nodes between multiple connected subgraphs in the heterogeneous network G can be deleted to obtain multiple separate connected subgraphs, or a single connected subgraph can be divided to obtain multiple connected subgraphs. Subgraphs, and each divided connected subgraph can also be used as a community that satisfies the query conditions.
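Splitting the retained nodes into connected subgraphs is a standard connected-components computation. The sketch below uses an iterative depth-first search; the function name `connected_components` and the toy path graph are assumptions for illustration, and the text does not specify which traversal the applicant uses.

```python
def connected_components(nodes, adj):
    """Group the retained nodes into connected subgraphs (as node sets)."""
    nodes = set(nodes)
    seen, components = set(), []
    for start in nodes:
        if start in seen:
            continue
        comp, stack = set(), [start]
        seen.add(start)
        while stack:
            v = stack.pop()
            comp.add(v)
            # Only follow edges whose other endpoint was retained.
            for u in adj.get(v, ()):
                if u in nodes and u not in seen:
                    seen.add(u)
                    stack.append(u)
        components.append(comp)
    return components


# Toy path 1-2-3-4-5 in which node 3 has been deleted as invalid.
adj = {1: {2}, 2: {1, 3}, 3: {2, 4}, 4: {3, 5}, 5: {4}}
parts = connected_components({1, 2, 4, 5}, adj)
```

Here the deleted node 3 was the only link between the two halves, so the retained nodes fall into two communities.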
  • the method further includes:
  • $N_G(v, l_2)$ denotes the set of nodes of type $l_2$ among the neighbor nodes of v.
  • the specific meaning of the definition is as follows: if the type of v is $l_1$ and v has at least k neighbor nodes of type $l_2$, then v satisfies the constraint s; if the type of v is not $l_1$, the type constraint s does not apply to v, which means that s places no requirement on the number of neighbors of type $l_2$ that v must have. In this case v is also considered to satisfy s (because the constraint has no effect).
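The satisfaction rule above, including the vacuous case, can be written as a short predicate. This is a minimal sketch; the function name `satisfies`, the argument layout, and the toy nodes are hypothetical.

```python
def satisfies(v, constraint, node_type, adj):
    """Check whether node v satisfies the triplet <l1, l2, k>."""
    l1, l2, k = constraint
    if node_type[v] != l1:
        # The constraint does not apply to v, so v satisfies it vacuously.
        return True
    # |N_G(v, l2)|: number of neighbours of v whose type is l2.
    count = sum(1 for u in adj.get(v, ()) if node_type[u] == l2)
    return count >= k


# Toy example: paper p1 has three authors and was published at venue v1.
node_type = {"p1": "P", "a1": "A", "a2": "A", "a3": "A", "v1": "V"}
adj = {"p1": {"a1", "a2", "a3", "v1"},
       "a1": {"p1"}, "a2": {"p1"}, "a3": {"p1"}, "v1": {"p1"}}
```

With these inputs, `satisfies("p1", ("P", "A", 3), node_type, adj)` holds, while `satisfies("a1", ("A", "P", 2), node_type, adj)` does not, because author a1 has only one paper.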
  • the embodiment of the present application also provides a heterogeneous network community detection device based on node constraints, which is used to perform any of the aforementioned node constraint-based heterogeneous network community detection methods.
  • specifically, please refer to FIG. 10, which is a schematic block diagram of a heterogeneous network community detection device based on node constraints provided by an embodiment of the present application.
  • a heterogeneous network community detection device 1000 based on node constraints includes: a preset unit 1001, a first acquisition unit 1002, a second acquisition unit 1003, a first judgment unit 1004, a second judgment unit 1005, a deleting unit 1006, and an iterative judgment unit 1007.
  • the preset unit 1001 is configured to preset a type constraint s of the heterogeneous network G, expressed by the triplet $\langle l_1, l_2, k \rangle$, where the triplet indicates that each node of type $l_1$ has at least k neighbor nodes of type $l_2$, with $l_1, l_2 \in L_G$, $k \ge 1$, and $L_G$ the set of node types of the heterogeneous network G;
  • the second obtaining unit 1003 is configured to obtain, according to the type constraint set S, a node type set L S that meets the needs of the community;
  • the first judgment unit 1004 is configured to determine whether the type of each node v in the heterogeneous network G belongs to the node type set $L_S$, and whether each node v satisfies each type constraint in the type constraint set S;
  • the second judgment unit 1005 is configured to determine the node v to be an invalid node if the type of the node v in the heterogeneous network G does not belong to the node type set $L_S$, or if the node v violates at least one type constraint in the type constraint set S;
  • the deleting unit 1006 is configured to add all invalid nodes in the heterogeneous network G to the to-be-deleted node set H and delete them, to pair each invalid node in H with each of its neighbor nodes to form a message, and to add the message to the message queue Q;
  • the iterative judgment unit 1007 is configured to iteratively judge, according to each type constraint in the type constraint set S, whether the messages in the message queue Q yield new invalid nodes, and if so, to delete the new invalid nodes so as to obtain the community that satisfies the query condition.
  • the device obtains the type constraint set S and the node type set $L_S$ that meet the community requirement, then uses S and $L_S$ to find and delete all invalid nodes in the heterogeneous network G, and finally obtains the community that satisfies the query condition, thereby meeting users' needs for personalized community queries based on node constraints in a heterogeneous network.
  • the foregoing device for detecting a heterogeneous network community based on node constraints can be implemented in the form of a computer program, which can be run on a computer device as shown in FIG. 11.
  • FIG. 11 is a schematic block diagram of a computer device according to an embodiment of the present application.
  • the computer device 1100 is a server, and the server may be an independent server or a server cluster composed of multiple servers.
  • the computer device 1100 includes a processor 1102, a memory, and a network interface 1105 connected through a system bus 1101, where the memory may include a non-volatile storage medium 1103 and an internal memory 1104.
  • the non-volatile storage medium 1103 can store an operating system 11031 and a computer program 11032.
  • the processor 1102 can execute a method for detecting a heterogeneous network community based on node constraints.
  • the processor 1102 is used to provide computing and control capabilities, and support the operation of the entire computer device 1100.
  • the internal memory 1104 provides an environment for running the computer program 11032 stored in the non-volatile storage medium 1103; when the computer program 11032 is executed by the processor 1102, the processor 1102 can execute the method for detecting heterogeneous network communities based on node constraints.
  • the network interface 1105 is used for network communication, such as providing data information transmission.
  • the structure shown in FIG. 11 is only a block diagram of the part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device 1100 to which the solution is applied; a specific computer device 1100 may include more or fewer components than shown in the figure, combine certain components, or have a different component arrangement.
  • the embodiment of the computer device shown in FIG. 11 does not limit the specific configuration of the computer device; the computer device may include more or fewer components than shown in the figure, combine some components, or arrange components differently.
  • the computer device may only include a memory and a processor. In such an embodiment, the structures and functions of the memory and the processor are consistent with the embodiment shown in FIG. 11, and will not be repeated here.
  • the processor 1102 may be a central processing unit (Central Processing Unit, CPU), and the processor 1102 may also be other general-purpose processors, digital signal processors (Digital Signal Processors, DSPs), Application Specific Integrated Circuit (ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components, etc.
  • the general-purpose processor may be a microprocessor or the processor may also be any conventional processor.
  • a computer-readable storage medium may be a non-volatile computer-readable storage medium.
  • the computer-readable storage medium stores a computer program, where the computer program is executed by a processor to implement the node constraint-based heterogeneous network community detection method of the embodiment of the present application.
  • the storage medium is a physical, non-transitory storage medium, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a magnetic disk, an optical disk, or any other medium that can store program code.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Economics (AREA)
  • Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Software Systems (AREA)
  • Information Transfer Between Computers (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present application discloses a heterogeneous network community detection method, a device, a computer apparatus, and a storage medium. The method comprises: pre-configuring a type constraint s of a heterogeneous network G, represented by a 3-tuple <l1, l2, k>; acquiring, according to a user's community requirement, a type constraint set S and a node type set L_S that meet the community requirement; determining whether the type of each node v in the heterogeneous network G belongs to the node type set L_S and whether each node v satisfies the type constraint set S, obtaining and deleting the invalid nodes, forming a corresponding message from each invalid node and each of its neighbor nodes, and adding the message to a message queue Q; and iteratively determining whether the messages in the message queue Q yield new invalid nodes, and deleting the new invalid nodes to obtain a community meeting the query condition. The present application meets the user demand for customized, node constraint-based community queries in heterogeneous networks.

Description

Heterogeneous network community detection method, device, computer equipment and storage medium
This application is based on Chinese patent application No. 202010401730.8, filed on May 13, 2020, and claims its priority; the entire content of that application is hereby incorporated into this application by reference.
Technical field
This application relates to the field of heterogeneous networks, and in particular to a method, device, computer equipment, and storage medium for detecting a heterogeneous network community based on node constraints.
Background
In graph data mining, community detection is one of the more important steps. A community detection algorithm can be used to search for a specific community structure; at present, community structures based on the k-core are the more popular choice.
However, because a heterogeneous network contains different types of nodes, users performing community detection often need to set custom counts for neighbor nodes of several different types. The current k-core-based community structure only supports community detection in homogeneous networks and cannot accommodate custom counts for multiple neighbor types.
The community structure of the k-core is defined as follows: given a non-negative integer $k \ge 0$, the k-core of a network G is the largest subgraph $H_k$ such that every node $v \in H_k$ has degree at least k within $H_k$, i.e. $\deg_{H_k}(v) \ge k$.
Figure 1 is an example k-core community, where the subgraph {A,B,C,D} is a 3-core community, because in this subgraph each node has at least three neighbor nodes. The graph {A,B,C,D,E,F,G,H,I} is a 1-core community, which consists of two connected subgraphs, {A,B,C,D,E,F,G} and {H,I}; each node in this graph has at least one neighbor node. The larger the parameter k, the smaller the detected community tends to be, and the more tightly connected the nodes in the community are.
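For readers unfamiliar with k-cores, the structure can be computed by repeatedly peeling off nodes whose degree falls below k. The sketch below is a generic illustration of that well-known peeling procedure, not code from the application; the function name `k_core` and the toy graph (a 4-clique with a two-node tail, which is not the graph of Figure 1) are assumptions.

```python
from collections import deque


def k_core(adj, k):
    """Return the node set of the k-core of an undirected graph.

    adj maps each node to the set of its neighbours (hypothetical format).
    """
    degree = {v: len(ns) for v, ns in adj.items()}
    removed = set()
    queue = deque(v for v, d in degree.items() if d < k)
    while queue:
        v = queue.popleft()
        if v in removed:
            continue
        removed.add(v)
        # Removing v lowers the degree of each remaining neighbour.
        for u in adj[v]:
            if u not in removed:
                degree[u] -= 1
                if degree[u] < k:
                    queue.append(u)
    return set(adj) - removed


# Toy graph: a 4-clique A-B-C-D with a tail D-E-F.
edges = [("A", "B"), ("A", "C"), ("A", "D"), ("B", "C"),
         ("B", "D"), ("C", "D"), ("D", "E"), ("E", "F")]
adj = {}
for a, b in edges:
    adj.setdefault(a, set()).add(b)
    adj.setdefault(b, set()).add(a)
core3 = k_core(adj, 3)
```

In this toy graph the 3-core is the clique {A, B, C, D}, the 1-core is the whole graph, and the 4-core is empty, which mirrors the observation that a larger k yields a smaller, denser community.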
The k-core community is a community structure defined on homogeneous networks and cannot be applied to community detection in a heterogeneous network, because heterogeneous networks often have different types of nodes. In a heterogeneous graph, the k-core structure cannot be used directly to detect a community. Moreover, because a heterogeneous network contains different types of nodes, users often need different settings for different types of neighbor nodes. The definition of the k-core community has only one parameter k, so it cannot meet the needs of practical applications in this case.
Figure 2 shows an example academic network containing three node types: A (author), P (paper) and V (venue, i.e. conference). Each node in the figure is labeled with its type. An edge A-P indicates that the author took part in writing the paper, and an edge P-V indicates that the paper was published at the conference. Suppose a user performing data-mining analysis on this heterogeneous network wishes to find an academic community of authors and papers in which every author has published at least 2 papers and every paper has at least 3 co-authors. The dashed part of Figure 3 is an academic community in the heterogeneous network that satisfies this query. The current k-core-based community structure, however, cannot uncover such a community: when k=1, the k-core community is the entire heterogeneous network; when k=2, the corresponding k-core community is the dashed part of Figure 4; and when k=3, the corresponding k-core community is the empty set. The current k-core-based community structure therefore cannot support community search and detection involving multiple node types in a heterogeneous network.
Consequently, the prior art has difficulty performing community detection over the multiple node types of a heterogeneous network, and provides no corresponding method for detecting such a community structure.
Summary of the Application
The purpose of this application is to provide a node-constraint-based heterogeneous network community detection method, device, computer apparatus and storage medium, aiming to solve the problem that the prior art provides neither community detection for the multiple node types of a heterogeneous network nor an algorithm for detecting such a community structure.
In a first aspect, an embodiment of the present application provides a node-constraint-based heterogeneous network community detection method, which includes: presetting a triple ⟨l_1, l_2, k⟩ to represent a type constraint s of a heterogeneous network G, where the triple ⟨l_1, l_2, k⟩ indicates that every node of type l_1 has at least k neighbor nodes of type l_2, l_1, l_2 ∈ L_G, k ≥ 1, and L_G is the set of node types of the heterogeneous network G;
obtaining the user's community requirement from the user's query condition, and obtaining a type constraint set S that satisfies the community requirement, where S = {s_1, s_2, ..., s_t} and s_1, s_2, ..., s_t denote different type constraints;
obtaining, from the type constraint set S, a node type set L_S that satisfies the community requirement;
determining whether the type of each node v in the heterogeneous network G belongs to the node type set L_S, and whether each node v satisfies every type constraint in the type constraint set S;
if the type of a node v in the heterogeneous network G does not belong to the node type set L_S, or the node v fails to satisfy at least one type constraint in the type constraint set S, determining the node v to be an invalid node;
adding all invalid nodes in the heterogeneous network G to a to-be-deleted node set H and deleting them, and forming a message from each invalid node in H together with each of its neighbor nodes and adding the messages to a message queue Q;
iteratively determining, according to each type constraint in the type constraint set S, whether the messages in the message queue Q give rise to new invalid nodes, and if so deleting the new invalid nodes, thereby obtaining the community that satisfies the query condition.
In a second aspect, an embodiment of the present application provides a node-constraint-based heterogeneous network community detection device, which includes:
a presetting unit, configured to preset a triple ⟨l_1, l_2, k⟩ to represent a type constraint s of a heterogeneous network G, where the triple ⟨l_1, l_2, k⟩ indicates that every node of type l_1 has at least k neighbor nodes of type l_2, l_1, l_2 ∈ L_G, k ≥ 1, and L_G is the set of node types of the heterogeneous network G;
a first obtaining unit, configured to obtain the user's community requirement from the user's query condition, and to obtain a type constraint set S that satisfies the community requirement, where S = {s_1, s_2, ..., s_t} and s_1, s_2, ..., s_t denote different type constraints;
a second obtaining unit, configured to obtain, from the type constraint set S, a node type set L_S that satisfies the community requirement;
a first determining unit, configured to determine whether the type of each node v in the heterogeneous network G belongs to the node type set L_S, and whether each node v satisfies every type constraint in the type constraint set S;
a second determining unit, configured to determine a node v in the heterogeneous network G to be an invalid node if the type of the node v does not belong to the node type set L_S, or if the node v fails to satisfy at least one type constraint in the type constraint set S;
a deleting unit, configured to add all invalid nodes in the heterogeneous network G to a to-be-deleted node set H and delete them, and to form a message from each invalid node in H together with each of its neighbor nodes and add the messages to a message queue Q;
an iterative determining unit, configured to iteratively determine, according to each type constraint in the type constraint set S, whether the messages in the message queue Q give rise to new invalid nodes, and if so to delete the new invalid nodes, thereby obtaining the community that satisfies the query condition.
In a third aspect, an embodiment of the present application further provides a computer apparatus, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor, when executing the computer program, implements the node-constraint-based heterogeneous network community detection method described in the first aspect.
In a fourth aspect, an embodiment of the present application further provides a computer-readable storage medium storing a computer program which, when executed by a processor, causes the processor to perform the node-constraint-based heterogeneous network community detection method described in the first aspect.
This application discloses a node-constraint-based heterogeneous network community detection method, device, computer apparatus and storage medium. The method includes: presetting a triple ⟨l_1, l_2, k⟩ to represent a type constraint s of a heterogeneous network G, where the triple indicates that every node of type l_1 has at least k neighbor nodes of type l_2, l_1, l_2 ∈ L_G, k ≥ 1, and L_G is the set of node types of G; obtaining the user's community requirement from the user's query condition and obtaining a type constraint set S = {s_1, s_2, ..., s_t} that satisfies it, where s_1, s_2, ..., s_t denote different type constraints; obtaining from S a node type set L_S that satisfies the community requirement; determining whether the type of each node v in G belongs to L_S and whether each node v satisfies every type constraint in S; determining a node v to be an invalid node if its type does not belong to L_S or if it fails to satisfy at least one type constraint in S; adding all invalid nodes in G to a to-be-deleted node set H and deleting them, and forming a message from each invalid node in H together with each of its neighbor nodes and adding the messages to a message queue Q; and iteratively determining, according to each type constraint in S, whether the messages in Q give rise to new invalid nodes, and if so deleting them, thereby obtaining the community that satisfies the query condition. Based on the user's community requirement, the method obtains the type constraint set S and the node type set L_S, uses them to find and delete all invalid nodes in the heterogeneous network G, and finally obtains the community that satisfies the query condition. The embodiments of the present application thus meet users' need for personalized, node-constraint-based community queries in a heterogeneous network.
Description of the Drawings
To explain the technical solutions of the embodiments of the present application more clearly, the drawings used in the description of the embodiments are briefly introduced below. The drawings described below show some embodiments of the present application; a person of ordinary skill in the art can obtain other drawings from them without creative effort.
Figure 1 is a schematic diagram of a k-core community provided by the prior art;
Figure 2 is a schematic diagram of an academic network provided by the prior art;
Figure 3 is a schematic diagram of the actual community in Figure 2 that satisfies the user's requirement;
Figure 4 is a schematic diagram of the 2-core community found by the prior art;
Figure 5 is a schematic flowchart of the node-constraint-based heterogeneous network community detection method provided by an embodiment of the present application;
Figure 6 is a schematic diagram of a sub-flow of the node-constraint-based heterogeneous network community detection method provided by an embodiment of the present application;
Figure 7 is a schematic diagram of another sub-flow of the node-constraint-based heterogeneous network community detection method provided by an embodiment of the present application;
Figure 8 is a schematic diagram of a further sub-flow of the node-constraint-based heterogeneous network community detection method provided by an embodiment of the present application;
Figure 9 is a schematic diagram of the academic network with its nodes classified by type and numbered, provided by an embodiment of the present application;
Figure 10 is a schematic block diagram of the node-constraint-based heterogeneous network community detection device provided by an embodiment of the present application;
Figure 11 is a schematic block diagram of the computer apparatus provided by an embodiment of the present application.
Detailed Description
The technical solutions in the embodiments of the present application are described clearly and completely below with reference to the drawings of the embodiments. The described embodiments are some, not all, of the embodiments of the present application. All other embodiments obtained by a person of ordinary skill in the art on the basis of the embodiments in this application without creative effort fall within the scope of protection of this application.
It should be understood that, when used in this specification and the appended claims, the terms "comprising" and "including" indicate the presence of the described features, integers, steps, operations, elements and/or components, but do not exclude the presence or addition of one or more other features, integers, steps, operations, elements, components and/or collections thereof.
It should also be understood that the terms used in this specification are only for the purpose of describing particular embodiments and are not intended to limit the application. As used in this specification and the appended claims, the singular forms "a", "an" and "the" are intended to include the plural forms unless the context clearly indicates otherwise.
It should be further understood that the term "and/or" as used in this specification and the appended claims refers to any and all possible combinations of one or more of the associated listed items, and includes such combinations.
Please refer to Figure 5, which is a schematic flowchart of the node-constraint-based heterogeneous network community detection method provided by an embodiment of the present application.
As shown in Figure 5, the method includes steps S501 to S507.
S501. Preset a triple ⟨l_1, l_2, k⟩ to represent a type constraint s of the heterogeneous network G, where the triple ⟨l_1, l_2, k⟩ indicates that every node of type l_1 has at least k neighbor nodes of type l_2, l_1, l_2 ∈ L_G, k ≥ 1, and L_G is the set of node types of the heterogeneous network G.
A heterogeneous network (also called a heterogeneous graph) can be represented by a four-tuple G = ⟨V_G, E_G, L_G, φ⟩, where V_G is the set of nodes; E_G is the set of edges; L_G is a type set containing all node types in the network; and φ is a function that maps each node of the network to a type in L_G, that is, φ: V_G → L_G.
For a node subset H ⊆ V_G, G[H] denotes the subgraph induced by H, whose node set is H and whose edge set is (H×H) ∩ E_G. For any node v, its neighbor nodes are the nodes that share an edge with v, denoted N_G(v) = {u : (v,u) ∈ E_G}, and the degree of node v is d_v = |N_G(v)|. In addition, N_G(v,l) denotes the neighbor nodes of v whose type is l, that is, N_G(v,l) = {u : (v,u) ∈ E_G ∧ φ(u) = l}.
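The notation above can be made concrete with a small sketch. The node names, the tiny edge set and the helper names `N`, `degree`, `N_typed` and `induced_edges` are all hypothetical choices made for illustration; edges are stored as unordered pairs written once.

```python
# Hypothetical four-tuple G = <V_G, E_G, L_G, phi> for a tiny academic network.
V_G = {"a1", "a2", "p1", "v1"}
E_G = {("a1", "p1"), ("a2", "p1"), ("p1", "v1")}  # undirected edges, each listed once
L_G = {"A", "P", "V"}
phi = {"a1": "A", "a2": "A", "p1": "P", "v1": "V"}  # node -> type in L_G

def N(v):
    """N_G(v): all nodes sharing an edge with v."""
    return {b if a == v else a for (a, b) in E_G if v in (a, b)}

def degree(v):
    """d_v = |N_G(v)|."""
    return len(N(v))

def N_typed(v, l):
    """N_G(v, l): neighbors of v whose type is l."""
    return {u for u in N(v) if phi[u] == l}

def induced_edges(H):
    """Edge set of G[H]: edges of E_G with both endpoints in H."""
    return {(a, b) for (a, b) in E_G if a in H and b in H}

p1_neighbors = N("p1")
p1_authors = N_typed("p1", "A")
sub_edges = induced_edges({"a1", "p1"})
```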
A heterogeneous network contains nodes of several different types; nodes of different types may be connected to one another or independent of one another, and every node in the heterogeneous network G has at least one neighbor node. To make it more convenient to impose type constraints (that is, restrictions) on the relationships between nodes, this step fixes the representation of a type constraint s as ⟨l_1, l_2, k⟩. For example, with l_1 = A, l_2 = P and k = 2, the type constraint s = ⟨A, P, 2⟩ states that every node of type A has at least 2 neighbor nodes of type P.
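One possible way to encode a type constraint and check it for a single node is sketched below. The class name `TypeConstraint`, the function `satisfies` and the toy adjacency map are assumptions made for illustration; the application itself only specifies the triple form.

```python
from typing import NamedTuple

class TypeConstraint(NamedTuple):
    """Triple <l1, l2, k>: every node of type l1 needs at least k neighbors of type l2."""
    l1: str
    l2: str
    k: int

def satisfies(v, adj, phi, s):
    """Check one constraint s for node v; a constraint on another type does not apply."""
    if phi[v] != s.l1:
        return True
    typed_neighbors = [u for u in adj[v] if phi[u] == s.l2]
    return len(typed_neighbors) >= s.k

# Hypothetical toy data: author a1 wrote two papers, author a2 wrote one.
adj = {"a1": {"p1", "p2"}, "a2": {"p1"}, "p1": {"a1", "a2"}, "p2": {"a1"}}
phi = {"a1": "A", "a2": "A", "p1": "P", "p2": "P"}
s = TypeConstraint("A", "P", 2)

a1_ok = satisfies("a1", adj, phi, s)  # two papers, constraint met
a2_ok = satisfies("a2", adj, phi, s)  # one paper, constraint violated
p1_ok = satisfies("p1", adj, phi, s)  # constraint is about type A, so it does not apply
```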
S502. Obtain the user's community requirement from the user's query condition, and obtain a type constraint set S that satisfies the community requirement, where S = {s_1, s_2, ..., s_t} and s_1, s_2, ..., s_t denote different type constraints.
In this embodiment, the community requirement corresponding to the user's query condition is identified for a heterogeneous network, and the type constraint set S of that community requirement is derived from it.
For example, the academic network shown in Figure 2 contains three node types: A (author), P (paper) and V (venue, i.e. conference). An edge A-P indicates that the author took part in writing the paper, and an edge P-V indicates that the paper was published at the conference. Suppose the user wishes to query an academic community of authors and papers in which every author has published at least 2 papers and every paper has at least 3 co-authors. The type constraint set satisfying this community requirement is then S = {s_1, s_2}, where s_1 = ⟨A, P, 2⟩ and s_2 = ⟨P, A, 3⟩.
S503. Obtain, from the type constraint set S, a node type set L_S that satisfies the community requirement.
In this embodiment, the type constraint set S defines the constraints that the community searched for by the user must satisfy, so all node types appearing in S are relevant to the community requirement. Correspondingly, S also determines the set L_S of node types relevant to the community being searched for, so the node type set L_S likewise satisfies the community requirement.
Specifically, L_S = {l | ⟨l, l', k⟩ ∈ S ∨ ⟨l', l, k⟩ ∈ S}. For a node v, if φ(v) ∉ L_S, then the node v cannot belong to a community that satisfies the constraints.
In an embodiment, as shown in Figure 6, step S503 includes:
S601. Obtain each type constraint in the type constraint set S;
S602. Obtain the node types in each type constraint;
S603. Add all the obtained node types to the node type set L_S.
In this embodiment, taking the academic community of the above example, each type constraint in its type constraint set S is obtained: s_1 = ⟨A, P, 2⟩ and s_2 = ⟨P, A, 3⟩. The node types in s_1 are A and P, and the node types in s_2 are P and A. Node type A and node type P are added to the node type set L_S, giving L_S = {A, P}.
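Steps S601 to S603 can be sketched as follows, under the assumption that each type constraint is stored as a plain `(l1, l2, k)` tuple; the function name `node_type_set` is hypothetical.

```python
def node_type_set(S):
    """Collect every node type mentioned in any constraint <l1, l2, k> of S."""
    L_S = set()
    for l1, l2, _k in S:   # S601: visit each type constraint
        L_S.add(l1)        # S602/S603: record both node types of the constraint
        L_S.add(l2)
    return L_S

# Type constraint set of the academic example: s1 = <A, P, 2>, s2 = <P, A, 3>.
S = [("A", "P", 2), ("P", "A", 3)]
L_S = node_type_set(S)
```

Because the venue type V appears in no constraint, it is absent from `L_S`, which is what later marks every venue node as invalid.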
S504. Determine whether the type of each node v in the heterogeneous network G belongs to the node type set L_S, and whether each node v satisfies every type constraint in the type constraint set S.
Taking the above academic network as an example, its node types are A (author), P (paper) and V (venue); its node type set is L_S = {A, P}, and its type constraint set is S = {s_1, s_2}, where s_1 = ⟨A, P, 2⟩ and s_2 = ⟨P, A, 3⟩. Comparing the type of each node v of the academic network with the node type set L_S determines whether the type of v belongs to L_S. Comparing each node v of the academic network against each type constraint in S determines whether v satisfies every type constraint in S.
S505. If the type of a node v in the heterogeneous network G does not belong to the node type set L_S, or the node v fails to satisfy at least one type constraint in the type constraint set S, determine the node v to be an invalid node.
In this embodiment, a node v in the heterogeneous network G is determined to be an invalid node as soon as its type does not belong to the node type set L_S, or it fails to satisfy any one of the type constraints in the type constraint set S.
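The test of steps S504 and S505 can be sketched as a single predicate. The function name `is_invalid`, the tuple encoding of constraints and the toy network below are assumptions for illustration (the network is not the one in Figure 2 or Figure 9).

```python
def is_invalid(v, adj, phi, S, L_S):
    """Return True if v has a type outside L_S or violates at least one constraint in S."""
    if phi[v] not in L_S:
        return True
    for l1, l2, k in S:
        if phi[v] == l1:
            typed = [u for u in adj[v] if phi[u] == l2]
            if len(typed) < k:
                return True
    return False

# Hypothetical toy network: paper p1 has three authors, p2 has two, both appear at venue v1.
adj = {
    "a1": {"p1", "p2"}, "a2": {"p1", "p2"}, "a3": {"p1"},
    "p1": {"a1", "a2", "a3", "v1"}, "p2": {"a1", "a2", "v1"},
    "v1": {"p1", "p2"},
}
phi = {v: v[0].upper() for v in adj}   # "a1" -> "A", "p1" -> "P", "v1" -> "V"
S = [("A", "P", 2), ("P", "A", 3)]
L_S = {"A", "P"}

invalid = {v for v in adj if is_invalid(v, adj, phi, S, L_S)}
```

Here `v1` is invalid because of its type, `p2` because it has only two authors, and `a3` because it has only one paper.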
S506. Add all invalid nodes in the heterogeneous network G to a to-be-deleted node set H and delete them, and form a message from each invalid node in H together with each of its neighbor nodes and add the messages to a message queue Q.
In this embodiment, the invalid nodes of the heterogeneous network G are obtained from the node type set L_S and the type constraint set S, added to the to-be-deleted node set H, and deleted. Because deleting an invalid node affects the validity of its neighbor nodes, each invalid node in H is paired with each of its neighbor nodes to form a message, and the messages are added to the message queue Q; the message queue Q is then examined iteratively in the subsequent step.
S507. Iteratively determine, according to each type constraint in the type constraint set S, whether the messages in the message queue Q give rise to new invalid nodes, and if so delete the new invalid nodes, thereby obtaining the community that satisfies the query condition.
In this embodiment, after an invalid node is deleted, a neighbor node of that invalid node (a node not yet classified as invalid) may become a new invalid node, and the neighbor nodes of that new invalid node may in turn be potential new invalid nodes. This embodiment therefore iterates the check and deletion until all invalid nodes have been deleted. The remaining nodes are the valid nodes, and the subgraph formed by the remaining nodes is the community that satisfies S.
In an embodiment, as shown in Figure 7, step S507 includes:
S701. Traverse the messages in the message queue Q;
S702. Obtain the neighbor node in the message;
S703. Determine whether the neighbor node has already been deleted from the heterogeneous network; if so, consume the corresponding message; if not, proceed to the next step;
S704. Determine whether the neighbor node satisfies every type constraint in the type constraint set S; if so, keep the neighbor node in the heterogeneous network G; if not, treat the neighbor node as a new invalid node and delete it from the heterogeneous network G;
S705. Consume the corresponding message and process the next message, until all messages in the message queue Q have been consumed.
This embodiment provides an algorithm that queries the community by iteratively deleting nodes. The algorithm is based on a message-passing framework: when an invalid node v is deleted, a message (u,v) is generated for each neighbor node u of v that has not yet been deleted, indicating that u's neighbor v has been deleted. The message records a potentially invalid node, because the deletion of v affects the validity of u. While the algorithm iteratively deletes nodes, the generated messages are stored in the message queue Q, and the messages in Q are consumed as the candidate set for detecting invalid nodes, so Q changes continuously during the iteration. When the message queue Q is empty, no potentially invalid nodes remain and the algorithm ends.
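The message-passing procedure described above can be sketched end to end as follows. This is a minimal illustration under stated assumptions, not the applicant's implementation: constraints are `(l1, l2, k)` tuples, the graph is a symmetric adjacency map, and the names `satisfies_all`, `detect_community` and the toy network are hypothetical.

```python
from collections import deque

def satisfies_all(v, adj, phi, S, alive):
    """Check every constraint of S that applies to v, counting only undeleted neighbors."""
    for l1, l2, k in S:
        if phi[v] == l1:
            count = sum(1 for u in adj[v] if u in alive and phi[u] == l2)
            if count < k:
                return False
    return True

def detect_community(adj, phi, S):
    """Return the nodes that remain after iteratively deleting invalid nodes."""
    L_S = {l for (l1, l2, _k) in S for l in (l1, l2)}
    alive = set(adj)
    # Initial invalid nodes (set H): wrong type or a violated constraint.
    H = {v for v in adj
         if phi[v] not in L_S or not satisfies_all(v, adj, phi, S, alive)}
    alive -= H
    # Message (u, v) records that u's neighbor v has been deleted.
    Q = deque((u, v) for v in H for u in adj[v])
    seen = set(Q)
    while Q:
        u, _v = Q.popleft()            # consuming the message
        if u not in alive:
            continue                   # u was already deleted
        if satisfies_all(u, adj, phi, S, alive):
            continue                   # u stays valid
        alive.discard(u)               # u is a new invalid node
        for w in adj[u]:
            msg = (w, u)
            if w in alive and msg not in seen:
                seen.add(msg)
                Q.append(msg)
    return alive

# Hypothetical toy network (not Figure 9): deleting p4 cascades to a4, a5 and then p3.
edges = [("a1", "p1"), ("a2", "p1"), ("a3", "p1"), ("a1", "p2"), ("a2", "p2"),
         ("a3", "p2"), ("a3", "p3"), ("a4", "p3"), ("a5", "p3"), ("a4", "p4"),
         ("a5", "p4"), ("p1", "v1"), ("p2", "v1"), ("p3", "v1"), ("p4", "v1")]
adj = {}
for a, b in edges:
    adj.setdefault(a, set()).add(b)
    adj.setdefault(b, set()).add(a)
phi = {v: v[0].upper() for v in adj}
S = [("A", "P", 2), ("P", "A", 3)]
community = detect_community(adj, phi, S)
```

In this toy run only `p4` and `v1` are invalid at first; the remaining deletions are triggered by messages, which is the cascade that a single initial pass would miss.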
In an embodiment, as shown in Figure 8, step S507 further includes:
S801. When the neighbor node is deleted from the heterogeneous network G, obtain each of that neighbor node's own neighbor nodes, and form a new message from the deleted neighbor node together with each of its own neighbor nodes;
S802. Determine whether the new message is identical to a message in the message queue that has already been consumed or is still unconsumed; if not, add the new message to the message queue; if so, proceed to the next step without adding it.
In this embodiment, each neighbor node of a deleted neighbor node is itself a potentially invalid node. The deleted neighbor node and each of its own neighbor nodes form a new message, which is then consumed by the method described above. A potentially invalid node may in turn have further neighbor nodes, which form yet further new messages that are consumed in the same way. This is the iterative process of the algorithm; once all messages have been consumed, the community that satisfies the query condition is obtained.
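The duplicate check of step S802 can be sketched with a set that remembers every message ever enqueued, whether consumed or not. The helper names `messages_for_deleted` and `enqueue_once`, and the restriction to undeleted neighbors, are assumptions made for illustration.

```python
from collections import deque

def messages_for_deleted(u, adj, alive):
    """S801: pair the deleted node u with each of its still-undeleted neighbors w as (w, u)."""
    return [(w, u) for w in sorted(adj[u]) if w in alive]

def enqueue_once(Q, seen, msg):
    """S802: add msg to Q only if it was never enqueued before (consumed or unconsumed)."""
    if msg in seen:
        return False
    seen.add(msg)
    Q.append(msg)
    return True

# Hypothetical state: paper p1 has just been deleted; a1, a2, a3 are its authors.
adj = {"p1": {"a1", "a2", "a3"}, "a1": {"p1"}, "a2": {"p1"}, "a3": {"p1"}}
alive = {"a1", "a3"}                       # a2 was deleted earlier
Q = deque([("a1", "p1")])                  # still unconsumed
seen = {("a1", "p1"), ("a9", "p9")}        # ("a9", "p9") stands for a consumed message

new_msgs = messages_for_deleted("p1", adj, alive)
added = [enqueue_once(Q, seen, m) for m in new_msgs]
```

Keeping `seen` separate from `Q` is what lets the check also cover messages that have already been consumed and removed from the queue.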
Taking the above academic network as an example, as shown in Figure 9, the nodes of the academic network are distinguished by type and numbered for ease of explanation. For this academic network the type constraint set is S = {s_1, s_2}, where s_1 = ⟨A, P, 2⟩ and s_2 = ⟨P, A, 3⟩, and the node type set is L_S = {A, P}, so the community being searched for contains only two node types: authors and papers.
Specifically, every node of the academic network is traversed to find the to-be-deleted node set H = {3,11,14,2,6}. The nodes {3,11,14} are of the venue type and are therefore determined to be invalid nodes. In addition, nodes {2,6} each have fewer than 3 co-authors and do not satisfy the constraint set S, so they are determined to be invalid nodes as well. The nodes in H are then deleted. Finally, each invalid node in H is paired with each of its neighbor nodes to form a message, and the messages are added to the message queue Q, whose state becomes:
Q = {(4,3),(9,3),(2,3),(6,11),(12,6),(13,14),(1,2),(7,2),(7,6),(1,6)}.
Potentially invalid nodes are detected from the message queue Q = {(4,3),(9,3),(2,3),(6,11),(12,6),(13,14),(1,2),(7,2),(7,6),(1,6)}. For message (4,3), node 4 has 3 co-authors, so node 4 is a valid node; the message does not cause node 4 to be deleted, and the message is consumed.
The message queue is now Q = {(9,3),(2,3),(6,11),(12,6),(13,14),(1,2),(7,2),(7,6),(1,6)}. For message (9,3), node 9 has 3 co-authors, so node 9 is a valid node; the message does not cause node 9 to be deleted, and the message is consumed.
The message queue is now Q = {(2,3),(6,11),(12,6),(13,14),(1,2),(7,2),(7,6),(1,6)}. For message (2,3), node 2 has been deleted, which makes node 1 and node 7 potentially invalid nodes, and the message is consumed. For potentially invalid nodes, new messages are generated in the manner described above and it is determined whether to add them to the message queue; any message that is added must in turn be examined.
The message queue is now Q = {(6,11),(12,6),(13,14),(1,2),(7,2),(7,6),(1,6)}. For message (6,11), node 6 has already been deleted, so the message produces no new messages about potentially invalid nodes, and the message is consumed.
The message queue is now Q = {(12,6),(13,14),(1,2),(7,2),(7,6),(1,6)}. For message (12,6), node 12 has two papers, so node 12 is a valid node; the message does not cause node 12 to be deleted, and the message is consumed.
The message queue is now Q = {(13,14),(1,2),(7,2),(7,6),(1,6)}. For message (13,14), node 13 has 3 co-authors, so node 13 is a valid node; the message does not cause node 13 to be deleted, and the message is consumed.
The message queue is now Q = {(1,2),(7,2),(7,6),(1,6)}. For message (1,2), nodes 2 and 6 have been deleted, leaving node 1 with no neighbor nodes, so the potentially invalid node 1 becomes an invalid node and is deleted. Because node 1 has no neighbor nodes, deleting it produces no new potentially invalid nodes.
The message queue is now Q = {(7,2),(7,6),(1,6)}. For message (7,2), nodes 2 and 6 have been deleted, leaving node 7 with no neighbor nodes, so the potentially invalid node 7 becomes an invalid node and is deleted. Because node 7 has no neighbor nodes, deleting it produces no new potentially invalid nodes.
The message queue is now Q = {(7,6),(1,6)}. For message (7,6), node 7 has already been deleted, so its validity need not be checked again, and the message is consumed.
The message queue is now Q = {(1,6)}. Node 1 has already been deleted, so its validity need not be checked again, and the message is consumed.
After processing according to the above algorithm, the message queue at this time is
Figure PCTCN2020092443-appb-000005
The nodes remaining in the academic network are the valid nodes, and the subgraph formed by these valid nodes is the community that satisfies the query conditions.
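The message-by-message walkthrough above can be sketched in code. The following is a minimal Python sketch, not the applicant's implementation: the function name `prune_invalid_nodes`, the dictionary-based graph representation, and the small author/paper graph are all illustrative assumptions (the academic network of the example is not reproduced here). Messages are stored as (neighbor to re-check, deleted node) pairs, and a message that was already enqueued is not enqueued again.

```python
from collections import deque


def prune_invalid_nodes(adjacency, node_type, constraints):
    """Return the nodes that survive iterative deletion of invalid nodes.

    adjacency:   dict node -> set of neighbors (assumed undirected/symmetric)
    node_type:   dict node -> type label
    constraints: iterable of (l1, l2, k) triples
    """
    constraints = list(constraints)
    adj = {v: set(ns) for v, ns in adjacency.items()}  # working copy
    allowed_types = {t for (l1, l2, _k) in constraints for t in (l1, l2)}

    def is_valid(v):
        if node_type[v] not in allowed_types:
            return False
        for l1, l2, k in constraints:
            if node_type[v] != l1:
                continue  # the constraint does not apply to v
            if sum(1 for u in adj[v] if node_type[u] == l2) < k:
                return False
        return True

    queue = deque()
    seen = set()  # every message ever enqueued, consumed or not

    def delete(v):
        # Remove v and notify each of its remaining neighbors.
        for u in adj.pop(v):
            adj[u].discard(v)
            message = (u, v)  # (neighbor to re-check, deleted node)
            if message not in seen:
                seen.add(message)
                queue.append(message)

    # Initial pass: collect the set H of invalid nodes, then delete them.
    for v in [v for v in adj if not is_valid(v)]:
        delete(v)

    # Consume messages until the queue is empty.
    while queue:
        neighbor, _deleted = queue.popleft()
        if neighbor not in adj:
            continue  # already deleted: just consume the message
        if not is_valid(neighbor):
            delete(neighbor)
    return set(adj)


# Hypothetical toy network: authors a1..a4, papers p1..p3.
graph = {
    "a1": {"p1", "p2"}, "a2": {"p1", "p2"}, "a3": {"p2", "p3"}, "a4": {"p3"},
    "p1": {"a1", "a2"}, "p2": {"a1", "a2", "a3"}, "p3": {"a3", "a4"},
}
types = {n: ("author" if n.startswith("a") else "paper") for n in graph}
# Every author needs >= 2 papers; every paper needs >= 2 authors.
rules = [("author", "paper", 2), ("paper", "author", 2)]
survivors = prune_invalid_nodes(graph, types, rules)
```

In this toy network only a4 is invalid at first; deleting it makes p3 invalid, which in turn makes a3 invalid, while p2 stays valid because it still has two authors.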
In an embodiment, step S107 further includes:
Obtaining all connected subgraphs that satisfy the query conditions, and taking each connected subgraph as a community that satisfies the query conditions.
A heterogeneous network G may contain multiple connected subgraphs that satisfy the type constraint set S, with the connected subgraphs joined to one another by invalid nodes. Each such connected subgraph of the heterogeneous network G can serve as a community that satisfies the query conditions.
In an embodiment, obtaining all connected subgraphs that satisfy the query conditions, and taking each connected subgraph as a community that satisfies the query conditions, further includes:
Dividing the nodes retained in the heterogeneous network G to obtain multiple connected subgraphs, and taking each connected subgraph as a community that satisfies the query conditions.
In this embodiment, the invalid nodes between multiple connected subgraphs in the heterogeneous network G can be deleted to obtain multiple separate connected subgraphs. Alternatively, a single connected subgraph can be divided into multiple connected subgraphs, and each connected subgraph obtained in this way can also serve as a community that satisfies the query conditions.
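One way to split the retained nodes into connected subgraphs is a breadth-first search over the remaining adjacency. The sketch below is illustrative only: the function name `connected_components` and the sample graph are assumptions, and the text does not prescribe a particular traversal.

```python
from collections import deque


def connected_components(adjacency):
    """Split an undirected graph (dict node -> set of neighbors) into
    its connected components, each returned as a set of nodes."""
    seen = set()
    components = []
    for start in adjacency:
        if start in seen:
            continue
        seen.add(start)
        component = {start}
        frontier = deque([start])
        while frontier:
            v = frontier.popleft()
            for u in adjacency[v]:
                if u not in seen:
                    seen.add(u)
                    component.add(u)
                    frontier.append(u)
        components.append(component)
    return components


# Hypothetical retained graph after all invalid nodes were deleted:
# two groups that are no longer linked to each other.
retained = {1: {2}, 2: {1, 3}, 3: {2}, 4: {5}, 5: {4}}
communities = connected_components(retained)
```

Each set in `communities` would then be reported as one community.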
In an embodiment, the method further includes:
Presetting that a node v in the heterogeneous network G satisfies the type constraint s if it meets the following condition: φ(v) ≠ l_1, or
|N_G(v, l_2)| ≥ k (Figure PCTCN2020092443-appb-000006)
where N_G(v, l_2) denotes the nodes of type l_2 among the neighbor nodes of v.
In this embodiment, the definition means the following. If v is of type l_1 and has at least k neighbor nodes of type l_2, then v satisfies s. If v is not of type l_1, the type constraint s is not defined on v; that is, s places no requirement on the number of l_2-type neighbors that v must have. In that case v is also considered to satisfy s, because the constraint does not apply.
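The definition can be written as a short predicate. This is a sketch under assumptions: the name `satisfies`, the tuple encoding of the triplet <l_1, l_2, k>, and the sample data are hypothetical. It shows in particular that a node whose type is not l_1 satisfies the constraint trivially.

```python
def satisfies(v, constraint, adjacency, node_type):
    """Check whether node v satisfies the type constraint (l1, l2, k)."""
    l1, l2, k = constraint
    if node_type[v] != l1:
        return True  # the constraint is not defined on v
    # N_G(v, l2): neighbors of v whose type is l2
    matching = {u for u in adjacency[v] if node_type[u] == l2}
    return len(matching) >= k


# Hypothetical data: one author with two papers.
adjacency = {"alice": {"p1", "p2"}, "p1": {"alice"}, "p2": {"alice"}}
node_type = {"alice": "author", "p1": "paper", "p2": "paper"}

ok = satisfies("alice", ("author", "paper", 2), adjacency, node_type)
too_strict = satisfies("alice", ("author", "paper", 3), adjacency, node_type)
not_applicable = satisfies("p1", ("author", "paper", 3), adjacency, node_type)
```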
An embodiment of the present application further provides a node-constraint-based heterogeneous network community detection device, which is used to perform any embodiment of the foregoing node-constraint-based heterogeneous network community detection method. Specifically, refer to FIG. 10, which is a schematic block diagram of the node-constraint-based heterogeneous network community detection device provided by an embodiment of the present application.
As shown in FIG. 10, the node-constraint-based heterogeneous network community detection device 1000 includes: a preset unit 1001, a first acquisition unit 1002, a second acquisition unit 1003, a first judgment unit 1004, a second judgment unit 1005, a deletion unit 1006, and an iterative judgment unit 1007.
The preset unit 1001 is configured to preset a type constraint s of the heterogeneous network G represented by a triplet <l_1, l_2, k>, where the triplet <l_1, l_2, k> indicates that each node of type l_1 has at least k neighbor nodes of type l_2, l_1, l_2 ∈ L_G, k ≥ 1, and L_G is the set of node types of the heterogeneous network G;
The first acquisition unit 1002 is configured to obtain the user's community requirement according to the user's query conditions, and to obtain a type constraint set S that satisfies the community requirement, where S = {s_1, s_2, ..., s_t} and s_1, s_2, ..., s_t denote different type constraints;
The second acquisition unit 1003 is configured to obtain, according to the type constraint set S, a node type set L_S that satisfies the community requirement;
The first judgment unit 1004 is configured to determine whether the type of each node v in the heterogeneous network G belongs to the node type set L_S, and whether each node v satisfies every type constraint in the type constraint set S;
The second judgment unit 1005 is configured to determine a node v to be an invalid node if the type of the node v in the heterogeneous network G does not belong to the node type set L_S, or if the node v fails to satisfy any one of the type constraints in the type constraint set S;
The deletion unit 1006 is configured to add all invalid nodes in the heterogeneous network G to a to-be-deleted node set H and delete them, and to form a corresponding message from each invalid node in the to-be-deleted node set H and each of its neighbor nodes and add the message to a message queue Q;
The iterative judgment unit 1007 is configured to iteratively determine, according to each type constraint in the type constraint set S, whether a new invalid node arises from the message queue Q, and if so, to delete the new invalid node, thereby obtaining the community that satisfies the query conditions.
Based on the user's community requirement, the device obtains the type constraint set S and the node type set L_S that satisfy the requirement, uses S and L_S to find and delete all invalid nodes in the heterogeneous network G, and finally obtains the community that satisfies the query conditions. This meets the user's need for personalized, node-constraint-based community queries in a heterogeneous network.
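The work of the second acquisition unit and the type check of the first judgment unit can be sketched as follows. This is a hedged illustration: the helper names `node_types_of` and `nodes_outside`, and the reading that both l_1 and l_2 of every constraint contribute to L_S, are assumptions, and the sample labels are invented.

```python
def node_types_of(constraints):
    """Build L_S: every node type mentioned in any (l1, l2, k) constraint.

    Assumption: both l1 and l2 of each constraint are added to L_S.
    """
    return {label for (l1, l2, _k) in constraints for label in (l1, l2)}


def nodes_outside(node_type, allowed_types):
    """Nodes whose type is not in L_S; these are invalid from the start."""
    return {v for v, t in node_type.items() if t not in allowed_types}


# Hypothetical constraint set S and node labelling.
S = [("author", "paper", 2), ("paper", "author", 2)]
labels = {"a1": "author", "p1": "paper", "v1": "venue"}

L_S = node_types_of(S)
initially_invalid = nodes_outside(labels, L_S)
```

Nodes that pass this type check would still need to be checked against each constraint in S before being kept.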
Those skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the device and units described above can be found in the corresponding processes of the foregoing method embodiments and are not repeated here.
The above node-constraint-based heterogeneous network community detection device can be implemented in the form of a computer program, which can run on a computer device as shown in FIG. 11.
Refer to FIG. 11, which is a schematic block diagram of a computer device provided by an embodiment of the present application. The computer device 1100 is a server, which may be a standalone server or a server cluster composed of multiple servers.
Referring to FIG. 11, the computer device 1100 includes a processor 1102, a memory, and a network interface 1105 connected through a system bus 1101, where the memory may include a non-volatile storage medium 1103 and an internal memory 1104.
The non-volatile storage medium 1103 can store an operating system 11031 and a computer program 11032. When executed, the computer program 11032 causes the processor 1102 to perform the node-constraint-based heterogeneous network community detection method.
The processor 1102 provides computing and control capabilities and supports the operation of the entire computer device 1100.
The internal memory 1104 provides an environment for running the computer program 11032 stored in the non-volatile storage medium 1103. When executed by the processor 1102, the computer program 11032 causes the processor 1102 to perform the node-constraint-based heterogeneous network community detection method.
The network interface 1105 is used for network communication, such as transmitting data. Those skilled in the art will understand that the structure shown in FIG. 11 is only a block diagram of the part of the structure related to the solution of the present application and does not limit the computer device 1100 to which the solution is applied. A specific computer device 1100 may include more or fewer components than shown in the figure, combine certain components, or have a different arrangement of components.
Those skilled in the art will understand that the embodiment of the computer device shown in FIG. 11 does not limit the specific configuration of the computer device. In other embodiments, the computer device may include more or fewer components than shown, combine certain components, or arrange the components differently. For example, in some embodiments the computer device may include only a memory and a processor; in such embodiments the structure and function of the memory and the processor are the same as in the embodiment shown in FIG. 11 and are not repeated here.
It should be understood that, in the embodiments of the present application, the processor 1102 may be a central processing unit (CPU). The processor 1102 may also be another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, and so on. The general-purpose processor may be a microprocessor or any conventional processor.
Another embodiment of the present application provides a computer-readable storage medium. The computer-readable storage medium may be a non-volatile computer-readable storage medium. It stores a computer program which, when executed by a processor, implements the node-constraint-based heterogeneous network community detection method of the embodiments of the present application.
The storage medium is a physical, non-transitory storage medium, for example a USB flash drive, a removable hard disk, a read-only memory (ROM), a magnetic disk, an optical disc, or any other physical storage medium that can store program code.
Those skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the equipment, devices, and units described above can be found in the corresponding processes of the foregoing method embodiments and are not repeated here.
The above are only specific implementations of the present application, and the protection scope of the present application is not limited to them. Any person skilled in the art can readily conceive of various equivalent modifications or replacements within the technical scope disclosed in the present application, and these modifications or replacements shall all fall within the protection scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (10)

  1. A heterogeneous network community detection method based on node constraints, characterized by comprising:
    presetting a type constraint s of a heterogeneous network G represented by a triplet <l_1, l_2, k>, wherein the triplet <l_1, l_2, k> indicates that each node of type l_1 has at least k neighbor nodes of type l_2, l_1, l_2 ∈ L_G, k ≥ 1, and L_G is the set of node types of the heterogeneous network G;
    obtaining a user's community requirement according to the user's query conditions, and obtaining a type constraint set S that satisfies the community requirement, wherein S = {s_1, s_2, ..., s_t}, and s_1, s_2, ..., s_t denote different type constraints;
    obtaining, according to the type constraint set S, a node type set L_S that satisfies the community requirement;
    determining whether the type of each node v in the heterogeneous network G belongs to the node type set L_S, and whether each node v satisfies every type constraint in the type constraint set S;
    if the type of a node v in the heterogeneous network G does not belong to the node type set L_S, or the node v fails to satisfy any one of the type constraints in the type constraint set S, determining the node v to be an invalid node;
    adding all invalid nodes in the heterogeneous network G to a to-be-deleted node set H and deleting them, and forming a corresponding message from each invalid node in the to-be-deleted node set H and each of its neighbor nodes and adding the message to a message queue Q;
    iteratively determining, according to each type constraint in the type constraint set S, whether a new invalid node arises from the message queue Q, and if so, deleting the new invalid node, thereby obtaining a community that satisfies the query conditions.
  2. The heterogeneous network community detection method based on node constraints according to claim 1, characterized in that iteratively determining, according to each type constraint in the type constraint set S, whether a new invalid node arises from the message queue Q, and if so, deleting the new invalid node, thereby obtaining a community that satisfies the query conditions, comprises:
    traversing the messages in the message queue Q;
    obtaining the neighbor node in the message;
    determining whether the neighbor node has already been deleted from the heterogeneous network, and if so, consuming the corresponding message, otherwise proceeding to the next step;
    determining whether the neighbor node satisfies every type constraint in the type constraint set S, and if so, retaining the neighbor node in the heterogeneous network G, otherwise treating the neighbor node as a new invalid node and deleting the new invalid node from the heterogeneous network G;
    consuming the corresponding message, and processing the next message until all messages in the message queue Q have been consumed.
  3. The heterogeneous network community detection method based on node constraints according to claim 2, characterized in that iteratively determining, according to each type constraint in the type constraint set S, whether a new invalid node arises from the message queue Q, and if so, deleting the new invalid node, thereby obtaining a community that satisfies the query conditions, further comprises:
    when the neighbor node is deleted from the heterogeneous network G, obtaining each neighbor node of that neighbor node, and forming a new message from that neighbor node and each of its own neighbor nodes;
    determining whether the new message is identical to a consumed or unconsumed message in the message queue, and if not, adding the new message to the message queue, otherwise proceeding to the next step.
  4. The heterogeneous network community detection method based on node constraints according to claim 1, characterized in that obtaining, according to the type constraint set S, a node type set L_S that satisfies the community requirement comprises:
    obtaining each type constraint in the type constraint set S;
    obtaining the node types in each type constraint;
    adding all the obtained node types to the node type set L_S.
  5. The heterogeneous network community detection method based on node constraints according to claim 1, characterized in that iteratively determining, according to each type constraint in the type constraint set S, whether a new invalid node arises from the message queue Q, and if so, deleting the new invalid node, thereby obtaining a community that satisfies the query conditions, further comprises:
    obtaining all connected subgraphs that satisfy the query conditions, and taking each connected subgraph as a community that satisfies the query conditions.
  6. The heterogeneous network community detection method based on node constraints according to claim 5, characterized in that obtaining all connected subgraphs that satisfy the query conditions, and taking each connected subgraph as a community that satisfies the query conditions, comprises:
    dividing the nodes retained in the heterogeneous network G to obtain multiple connected subgraphs, and taking each connected subgraph as a community that satisfies the query conditions.
  7. The heterogeneous network community detection method based on node constraints according to claim 1, characterized by further comprising:
    presetting that a node v in the heterogeneous network G satisfies the type constraint s if it meets the following condition:
    φ(v) ≠ l_1, or
    |N_G(v, l_2)| ≥ k (Figure PCTCN2020092443-appb-100001)
    where N_G(v, l_2) denotes the nodes of type l_2 among the neighbor nodes of node v.
  8. A heterogeneous network community detection device based on node constraints, characterized by comprising:
    a preset unit, configured to preset a type constraint s of a heterogeneous network G represented by a triplet <l_1, l_2, k>, wherein the triplet <l_1, l_2, k> indicates that each node of type l_1 has at least k neighbor nodes of type l_2, l_1, l_2 ∈ L_G, k ≥ 1, and L_G is the set of node types of the heterogeneous network G;
    a first acquisition unit, configured to obtain a user's community requirement according to the user's query conditions, and to obtain a type constraint set S that satisfies the community requirement, wherein S = {s_1, s_2, ..., s_t}, and s_1, s_2, ..., s_t denote different type constraints;
    a second acquisition unit, configured to obtain, according to the type constraint set S, a node type set L_S that satisfies the community requirement;
    a first judgment unit, configured to determine whether the type of each node v in the heterogeneous network G belongs to the node type set L_S, and whether each node v satisfies every type constraint in the type constraint set S;
    a second judgment unit, configured to determine a node v to be an invalid node if the type of the node v in the heterogeneous network G does not belong to the node type set L_S, or if the node v fails to satisfy any one of the type constraints in the type constraint set S;
    a deletion unit, configured to add all invalid nodes in the heterogeneous network G to a to-be-deleted node set H and delete them, and to form a corresponding message from each invalid node in the to-be-deleted node set H and each of its neighbor nodes and add the message to a message queue Q;
    an iterative judgment unit, configured to iteratively determine, according to each type constraint in the type constraint set S, whether a new invalid node arises from the message queue Q, and if so, to delete the new invalid node, thereby obtaining a community that satisfies the query conditions.
  9. A computer device, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, characterized in that the processor, when executing the computer program, implements the heterogeneous network community detection method based on node constraints according to any one of claims 1 to 7.
  10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program which, when executed by a processor, causes the processor to perform the heterogeneous network community detection method based on node constraints according to any one of claims 1 to 7.
PCT/CN2020/092443 2020-05-13 2020-05-27 Heterogeneous network community detection method, device, computer apparatus, and storage medium WO2021227130A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010401730.8A CN111597396B (en) 2020-05-13 2020-05-13 Heterogeneous network community detection method and device, computer equipment and storage medium
CN202010401730.8 2020-05-13

Publications (1)

Publication Number Publication Date
WO2021227130A1 true WO2021227130A1 (en) 2021-11-18

Family

ID=72185338

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/092443 WO2021227130A1 (en) 2020-05-13 2020-05-27 Heterogeneous network community detection method, device, computer apparatus, and storage medium

Country Status (2)

Country Link
CN (1) CN111597396B (en)
WO (1) WO2021227130A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112653765A (en) * 2020-12-24 2021-04-13 南京审计大学 Resource allocation method and device based on community overlapping and embedding analysis

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103605793A (en) * 2013-12-04 2014-02-26 西安电子科技大学 Heterogeneous social network community detection method based on genetic algorithm
CN104378771A (en) * 2014-11-28 2015-02-25 北京农业信息技术研究中心 Blackspot-prediction farmland time-varying heterogeneous network node deployment and interactive dispatching method
CN106372239A (en) * 2016-09-14 2017-02-01 电子科技大学 Social network event correlation analysis method based on heterogeneous network
US20170324583A1 (en) * 2013-07-26 2017-11-09 Silver Spring Networks, Inc. Techniques for routing from an endpoint with simultaneous associations to multiple networks
CN110334264A (en) * 2019-06-27 2019-10-15 北京邮电大学 A kind of community detection method and device for isomery dynamic information network

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8060463B1 (en) * 2005-03-30 2011-11-15 Amazon Technologies, Inc. Mining of user event data to identify users with common interests
CN106357418B (en) * 2015-07-15 2019-08-30 阿里巴巴集团控股有限公司 A kind of feature extracting method and device based on complex network
CN105631748A (en) * 2015-12-21 2016-06-01 西北工业大学 Parallel label propagation-based heterogeneous network community discovery method
CN108073944A (en) * 2017-10-18 2018-05-25 南京邮电大学 A kind of label based on local influence power propagates community discovery method
CN108009915B (en) * 2017-12-21 2020-01-17 连连银通电子支付有限公司 Marking method and related device for fraudulent user community
CN108809709A (en) * 2018-06-06 2018-11-13 山东大学 It is a kind of based on the close nature community discovery method propagated with label of node
US10482268B1 (en) * 2018-08-08 2019-11-19 Capital One Services, Llc Systems and methods for access management
CN109948000B (en) * 2019-01-09 2023-04-07 南方科技大学 Abnormal target detection method, device, equipment and storage medium for heterogeneous network
CN110611582A (en) * 2019-04-28 2019-12-24 中南大学 Opportunistic social network effective data transmission method based on node socialization
CN110457404B (en) * 2019-08-19 2021-11-02 电子科技大学 Social media account classification method based on complex heterogeneous network
CN111126510A (en) * 2020-01-02 2020-05-08 深圳计算科学研究院 Method for calculating similarity in heterogeneous network and related components thereof

Also Published As

Publication number Publication date
CN111597396A (en) 2020-08-28
CN111597396B (en) 2021-05-28

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20935805

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20935805

Country of ref document: EP

Kind code of ref document: A1