WO2020078370A1 - Community search method - Google Patents

Community search method Download PDF

Info

Publication number
WO2020078370A1
WO2020078370A1 PCT/CN2019/111419 CN2019111419W WO2020078370A1 WO 2020078370 A1 WO2020078370 A1 WO 2020078370A1 CN 2019111419 W CN2019111419 W CN 2019111419W WO 2020078370 A1 WO2020078370 A1 WO 2020078370A1
Authority
WO
WIPO (PCT)
Prior art keywords
community
node
nodes
search
result
Prior art date
Application number
PCT/CN2019/111419
Other languages
French (fr)
Chinese (zh)
Inventor
王朝坤
竺俊超
Original Assignee
清华大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 清华大学 filed Critical 清华大学
Publication of WO2020078370A1 publication Critical patent/WO2020078370A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying

Definitions

  • the present disclosure relates to the field of search technology, for example, to a community search method.
  • a community generally refers to a subgraph in which connections between internal nodes are closer than those between internal and external communities.
  • the community structure excavated from the network helps people to recommend friends, identify criminal groups and predict protein functions.
  • Community search local community discovery refers to given one or more nodes to find the community that contains them. Compared to global community discovery, it pays more attention to the local network structure and returns more personalized community results.
  • the current community search method is mainly based on k-clique, k-core, k-truss and other specific topological structures.
  • the community found based on the method of k-truss structure needs to satisfy the following properties: 1.
  • the number of triangles on each side is not less than k-2; 2. Any two sides can be reached by a series of adjacent triangles.
  • a typical method is to record the truss values of all adjacent edges around each node, and then organize the adjacent points of each node into a tree structure index according to the truss value of the edge, called TCP-index, and finally according to the given Node and k value, constantly find the neighbor nodes that can be expanded from the index, until it can not be expanded, you get a k-truss community containing the given node.
  • a typical method that comprehensively considers the topology structure and node attributes is to first supplement the edges of the original graph according to the attribute similarity between the nodes, thereby constructing a TA-graph, and then based on the k-truss structure on the TA-graph Conduct a community search and finally get the community containing the given node.
  • a technical problem to be solved by the embodiments of the present disclosure is to provide a community search method to solve the problems in the prior art.
  • the community search method includes:
  • the nodes are mapped to node variables, and the corresponding search conditions are written;
  • a single-item community search is performed for each search term
  • the node is mapped to a node variable according to the user's demand for community search, and writing out the corresponding search conditions includes:
  • the nodes that appear in the community are not allowed to be logically undecorated, and the nodes that the community must contain are not decorated;
  • the nodes that must appear in the community at the same time are connected with logic and the nodes that the community must contain and the nodes that are not allowed are also connected with logic and;
  • the community must contain at least one of several nodes that need to be represented by logical or connected nodes.
  • the conversion of the search condition into multiple search terms includes:
  • the performing a single-item community search for each search item includes:
  • the nodes that must be included in the community and the nodes that are not allowed to be included are organized into the necessary node set and the prohibited node set as the input of the single-condition community search process;
  • the extracted one or more common node variables are sorted into necessary node sets and prohibited node sets according to whether they can appear in the community as the input of the single-item community search process, the remaining Partly used to judge the output results;
  • the single-item community search is performed, using the necessary node set and the prohibited node set to search the community results from the network graph, so that the resulting community contains the necessary node set, while not including
  • the nodes in the prohibited node set include three implementation methods, namely: community search after filtering, weighted filtering, and filtering while searching.
  • the filtered community search method includes:
  • the weighted filtering method includes:
  • the necessary node set is classified into the community result C;
  • a02 according to the given network graph, get the derived subgraph corresponding to the community result. If the derived subgraph of the community result C has only one connected component and all the nodes in the derived subgraph have node degrees greater than or equal to the given threshold k, Then stop and return to the community results;
  • the nodes in the same component are grouped into the same group;
  • the neighbor nodes of all nodes in the community result C are classified into the candidate node set Candidate, and the nodes that already exist in the community result C are excluded;
  • a06 for each node in the candidate node set Candidate, record the number a of the connected connected components, the number b of the nodes in the community result C, the degree d of the node in the given network graph, and then according to a , B and d sort the nodes in the node set in multi-key descending order;
  • step a07 if the degree of the top candidate node c is less than the threshold k, then remove the node c from the candidate node set Candidate, go to step a06, otherwise, add node c to the community result, and join the node's neighbors Go to the candidate node set Candidate, and remove the node c from the candidate node set Candidate, go to step a02;
  • the nodes of the entire network graph are included in the community result C;
  • the performing community search on the new network graph with the necessary node set as input includes:
  • the derived subgraph of the community result C has only one connected component and all the nodes in the derived subgraph have node degrees greater than or equal to the given threshold k, Then stop and return to the community results;
  • the nodes in the same component are grouped into the same group;
  • the neighbor nodes of all nodes in the community result C are classified into the candidate node set Candidate, and the nodes that already exist in the community result C are excluded;
  • step b07 if the degree of the top candidate node c is less than the threshold k, then remove the node c from the candidate node set Candidate, go to step b06, otherwise, add node c to the community result, and join the node's neighbors Go to the candidate node set Candidate, and remove the node c from the candidate node set Candidate, go to step b02;
  • the manner of searching while filtering includes:
  • the nodes in the same component are grouped into the same group;
  • the neighbor nodes of all nodes in the community result C are classified into the candidate node set Candidate, and the nodes and banned nodes that already exist in the community result C are excluded from it;
  • c06 for each node in the candidate node set Candidate, record the number a of the connected different connected components, the number b of the nodes in the connection community result C, the number of nodes connected to non-prohibited nodes in the given network graph d; Sort the nodes in the node set in multi-key descending order according to a, b, and d;
  • step c07 if the degree of the top candidate node c is less than the threshold k, then remove the node c from the candidate node set Candidate, go to step c06, otherwise, add the node c to the community result C, and at the same time the neighbor node of the node Add to candidate node set Candidate, and remove node c from candidate node set Candidate, go to step c02;
  • step c10 Delete the node whose degree is lower than k in the derived subgraph of the community result C from the community result C. If the deleted node is a member of the necessary node set, stop and return to the empty set, otherwise go to step c09.
  • the present disclosure has the following advantages:
  • the present disclosure proposes a community search method, which expresses the search conditions in the form of a unified Boolean expression, which is convenient for users to express their search needs personally, and also facilitates the execution of community search under complex conditions; due to the consideration in the process of community search In addition, users do not want to appear in the community, and the community results obtained by the search are more in line with the user's expectations, making the community results more personalized; because it is allowed to consider that the community contains at least one such demand in a given node, a search condition may Obtain multiple different community results, and all meet the search conditions, which makes the user's choice of results more abundant; provides a variety of different implementation methods, you can choose according to actual needs.
  • FIG. 1 is a flowchart of an embodiment of a community search method of the present disclosure
  • FIG. 3 is a flowchart of another embodiment of the community search method of the present disclosure.
  • FIG. 6 is a flowchart of yet another embodiment of the community search method of the present disclosure.
  • FIG. 7 is a flowchart of yet another embodiment of the community search method of the present disclosure.
  • FIG. 8 is a flowchart of another embodiment of the community search method of the present disclosure.
  • FIG. 1 is a flowchart of an embodiment of a community search method of the present disclosure. As shown in FIG. 1, the community search method includes:
  • the node variable is also called Boolean variable, and the search condition is expressed by Boolean expression
  • FIG. 2 is a flowchart of another embodiment of the community search method of the present disclosure. As shown in FIG. 2, according to the user ’s demand for community search, the nodes are mapped to node variables, and the corresponding search conditions are written out as follows:
  • the nodes that appear in the community are not allowed to be logically undecorated, and the nodes that the community must contain are undecorated.
  • the symbol of the logical negation is
  • the nodes that must appear in the community at the same time are connected by logic and, the nodes that the community must contain and the nodes that are not allowed are also connected by logic and, the symbol of the logic and is " ⁇ ", for example: Boolean formula Indicates that the user wants the community to include node A and node B, and node C is not allowed. ;
  • the community must contain at least one of several nodes that need to be represented by logical or connected nodes.
  • the logical OR symbol is " ⁇ ", for example: the Boolean expression A ⁇ B ⁇ C indicates that the user wants the community to at least Contains one of node A, node B, and node C.
  • FIG. 3 is a flowchart of yet another embodiment of the community search method of the present disclosure. As shown in FIG. 3, the conversion of search conditions into multiple search terms includes:
  • each conjunction of the simplest and OR formula as the search term for example: the simplest and OR formula (A ⁇ B) ⁇ (C ⁇ D) contains two search terms, which are (A ⁇ B ) And (C ⁇ D), if you find that there are several search terms with the same node variable, you can merge these search terms into a new search term, for example: the most simple and or Search terms (A ⁇ B) and Contains the same node variable A, so you can extract the common node variable A, and merge these two conjunctions into In order to reduce the number of search terms, thereby reducing the number of subsequent single-item community search processes, to achieve the purpose of saving time overhead;
  • FIG. 4 is a flowchart of yet another embodiment of the community search method of the present disclosure. As shown in FIG. 4, performing a single-item community search for each search item includes:
  • the nodes that must be included in the community and the nodes that are not allowed to be included are organized into the necessary node set and the prohibited node set as the input of the single-item community search process, because it only contains the community Must appear in the node and nodes that are not allowed to appear;
  • the extracted one or more common node variables are organized into a necessary node set and a prohibited node set according to whether they can appear in the community as input to the single-item community search process , The remaining part is used to distinguish the output results, for example: in two search terms Combined search terms
  • the necessary node set ⁇ A ⁇ is used as the input to the single-item community search process, that is, to find the community containing node A
  • the discriminant as the output result is used to determine whether the community result contains node B or does not contain node D;
  • the single conditional community search described above uses the necessary node set and the prohibited node set to search the community results from the network graph, so that the resulting community contains the necessary node set, and the nodes that do not contain the prohibited node set include three implementation methods, namely: filtering Post-community search, weighted filtering, and filtering while searching.
  • the filtered community search method includes:
  • FIG. 5 is a flowchart of another embodiment of the community search method of the present disclosure. As shown in FIG. 5, the weighted filtering method includes:
  • FIG. 6 is a flowchart of another embodiment of the community search method of the present disclosure. As shown in FIG. 6, performing community search using the necessary node set as input for the new network diagram includes:
  • step 507 if the degree of the candidate node c ranked first is less than the threshold k, then the node c is removed from the candidate node set Candidate, go to step 506, otherwise, the node c is added to the community result, and the neighbor nodes of the node Go to the candidate node set Candidate, and remove the node c from the candidate node set Candidate, go to step 502;
  • step 510 Delete the node whose degree is lower than k in the derived subgraph of the community result C from the community result C. If the deleted node is a member of the necessary node set, stop and return to the empty set, otherwise go to step 509.
  • FIG. 7 is a flowchart of another embodiment of the community search method of the present disclosure. As shown in FIG. 7, the performing community search on the new network graph using the necessary node set as input includes:
  • the necessary node set is classified into the community result C;
  • the nodes in the same component are grouped into the same group;
  • the node c is removed from the candidate node set Candidate, go to step 606, otherwise, the node c is added to the community result, and the neighbor node of the node is also added Go to the candidate node set Candidate, and remove the node c from the candidate node set Candidate, go to step 602;
  • step 609 Delete the node whose degree is lower than k in the derived subgraph of the community result C from the community result C. If the deleted node is a member of the necessary node set, stop and return to the empty set, otherwise go to step 609.
  • FIG. 8 is a flowchart of another embodiment of the community search method of the present disclosure. As shown in FIG. 8, the manner of searching while filtering includes:
  • the necessary node set is classified into the community result C;
  • the nodes in the same component are grouped into the same group;
  • the node c is removed from the candidate node set Candidate, go to step 706, otherwise, the node c is added to the community result C, and the neighbor node of the node Add to candidate node set Candidate, and remove node c from candidate node set Candidate;
  • step 710. Delete the node whose degree is lower than k in the derived subgraph of the community result C. If the deleted node is a member of the necessary node set, stop and return to the empty set, otherwise jump to step 709.

Abstract

A community search method comprises: associating a node with a node variable according to a community search requirement of a user, and preparing a corresponding search condition (10); converting the search condition into multiple search items (20); performing a community search for each search item according to a single condition (30); and combining community search results for each single condition, acquiring a combination of the community results, and returning the combination thereof (40). In the method, all search conditions are converted into Boolean expressions, such that a user can easily customize the expression of a search requirement and can easily perform community searches with complex conditions. The method takes into account a community node that a user wishes to exclude, thereby further meeting user needs. Since a community is allowed to comprise at least one of given nodes, one search condition may result in multiple different community results meeting the search condition, providing the user with more options when selecting results. Multiple different implementation methods are provided, and can be selected according to actual requirements.

Description

社区搜索方法Community search method 技术领域Technical field
本公开涉及搜索技术领域,例如涉及一种社区搜索方法。The present disclosure relates to the field of search technology, for example, to a community search method.
背景技术Background technique
由大量节点和节点间的连接关系形成的网络结构广泛存在于计算机科学、生物学、社会学等领域。在网络相关的研究工作中,社区(community)受到人们的持续关注。社区一般指一个内部节点间联系相比于社区内部和外部的节点间联系更加紧密的子图。网络中挖掘出的社区结构有助于人们进行好友推荐,犯罪团伙识别以及蛋白质功能预测。社区搜索(局部社区发现)指的是给定一个或多个节点,寻找包含它们的社区,相比于全局社区发现,它更关注局部的网络结构,并返回更个性化的社区结果。The network structure formed by a large number of nodes and the connection relationship between nodes widely exists in the fields of computer science, biology, sociology and so on. In network-related research work, the community has received constant attention from people. A community generally refers to a subgraph in which connections between internal nodes are closer than those between internal and external communities. The community structure excavated from the network helps people to recommend friends, identify criminal groups and predict protein functions. Community search (local community discovery) refers to given one or more nodes to find the community that contains them. Compared to global community discovery, it pays more attention to the local network structure and returns more personalized community results.
目前的社区搜索方法主要是基于k-clique、k-core、k-truss等特定拓扑结构的,此外还有部分综合考虑了拓扑结构和节点属性的社区搜索方法。The current community search method is mainly based on k-clique, k-core, k-truss and other specific topological structures. In addition, there are some community search methods that comprehensively consider the topology and node attributes.
基于k-truss结构的方法找到的社区需要满足如下性质:1、每条边所在的三角形个数都不小于k-2;2、任意两条边都能通过一系列相邻的三角形到达。一种典型的方法是记录每个节点周围所有邻接边的truss值,而后将每个节点的邻接点按照边的truss值组织成树形结构的索引,称为TCP-index,最后根据给定的节点和k值,不断地从索引中找出可以扩展的邻居节点,直至无法扩展为止,就得到了包含给定节点的k-truss结构的社区。The community found based on the method of k-truss structure needs to satisfy the following properties: 1. The number of triangles on each side is not less than k-2; 2. Any two sides can be reached by a series of adjacent triangles. A typical method is to record the truss values of all adjacent edges around each node, and then organize the adjacent points of each node into a tree structure index according to the truss value of the edge, called TCP-index, and finally according to the given Node and k value, constantly find the neighbor nodes that can be expanded from the index, until it can not be expanded, you get a k-truss community containing the given node.
综合考虑拓扑结构和节点属性的典型方法,比如AGAR方法,就是根据节点间的属性相似度先对原图进行边的补充,从而构建一个TA-graph,再在TA-graph上依据k-truss结构进行社区搜索,最终得到包含给定节点的社区。A typical method that comprehensively considers the topology structure and node attributes, such as the AGAR method, is to first supplement the edges of the original graph according to the attribute similarity between the nodes, thereby constructing a TA-graph, and then based on the k-truss structure on the TA-graph Conduct a community search and finally get the community containing the given node.
目前的社区搜索方法的不足在于只能寻找包含给定节点集的社区。我们在实际进行社区搜索时,经常会遇到这样一些需求:1、社区不仅要包含某些给定点,同时不允许包含另一些给定点;2、社区至少要包含给定若干节点中的任意一个。现有的社区搜索方法无法满足上述需求。因此,现有技术需要改进。The disadvantage of current community search methods is that they can only find communities that contain a given set of nodes. When we actually conduct community search, we often encounter such needs: 1. The community must not only include certain given points, but also not allowed to include other given points; 2. The community must contain at least any one of the given nodes . Existing community search methods cannot meet the above needs. Therefore, the existing technology needs improvement.
上述背景技术内容仅用于帮助理解本公开,而并不代表承认或认可所提及的任何内容属于相对于本公开的公知常识的一部分。The above background content is only used to help understand the present disclosure, and does not mean to acknowledge or endorse that any content mentioned is part of the common general knowledge relative to the present disclosure.
发明内容Summary of the invention
本公开实施例所要解决的一个技术问题是:提供一种社区搜索方法,以解决现有技术存在的问题,所述社区搜索方法包括:A technical problem to be solved by the embodiments of the present disclosure is to provide a community search method to solve the problems in the prior art. The community search method includes:
根据用户对于社区搜索的需求,将节点对应于节点变量,写出对应的搜索条件;According to the user's needs for community search, the nodes are mapped to node variables, and the corresponding search conditions are written;
将搜索条件转化为多个搜索项;Convert search terms into multiple search terms;
对于每一搜索项进行单项条件的社区搜索;A single-item community search is performed for each search term;
将各个单项条件社区搜索的结果合并,即对社区结果取并集进行返回。Combine the results of each single-item community search, that is, take the union result of the community and return it.
在基于本公开上述社区搜索方法的另一个实施例中,所述根据用户对于社区搜索的需求,将节点对应于节点变量,写出对应的搜索条件包括:In another embodiment of the above community search method based on the present disclosure, the node is mapped to a node variable according to the user's demand for community search, and writing out the corresponding search conditions includes:
将用户需求中提及的节点对应为布尔变量;Corresponding to the nodes mentioned in user requirements as Boolean variables;
不允许出现在社区中的节点以逻辑非修饰,社区必须包含的节点无修饰;The nodes that appear in the community are not allowed to be logically undecorated, and the nodes that the community must contain are not decorated;
必须同时出现在社区中的节点以逻辑与连接,社区必须包含的节点和不允许的包含的节点,也用逻辑与连接;The nodes that must appear in the community at the same time are connected with logic and the nodes that the community must contain and the nodes that are not allowed are also connected with logic and;
社区必须包含若干个节点中至少一个需要以逻辑或连接的若干个节点来表示。The community must contain at least one of several nodes that need to be represented by logical or connected nodes.
在基于本公开上述社区搜索方法的另一个实施例中,所述将搜索条件转化为多个搜索项包括:In another embodiment of the above community search method based on the present disclosure, the conversion of the search condition into multiple search terms includes:
枚举满足搜索条件的节点变量取值组合,从而得出与搜索条件等价的主析取范式;Enumerate the value combinations of node variables that satisfy the search condition, so as to obtain the main disjunctive paradigm equivalent to the search condition;
将主析取范式通过Quine-McCluskey算法化简为最简与或式;The main disjunctive paradigm is reduced to the simplest OR by the Quine-McCluskey algorithm;
将最简与或式的每个合取项设置为搜索项;Set each conjunct of the simplest and or as the search term;
提取出不同合取项中出现次数最多的一个没有逻辑非修饰的变量,如果 含有该变量的合取项多于1,则把这些合取项合并成新的搜索项,重复此步骤直至没有合取项可以进行合并。。Extract the most frequent occurrence of a variable without logical undecoration among different conjuncts. If there are more than one conjunct with this variable, merge these conjuncts into a new search term, and repeat this step until there is no conjunct The items can be merged. .
在基于本公开上述社区搜索方法的另一个实施例中,所述对于每一搜索项进行单项条件的社区搜索包括:In another embodiment based on the above community search method of the present disclosure, the performing a single-item community search for each search item includes:
对于合取式形式的搜索项,将其中社区必须包含的节点和不允许包含的节点分别整理成必要节点集和禁止节点集作为单项条件社区搜索过程的输入;For search terms in conjunctive form, the nodes that must be included in the community and the nodes that are not allowed to be included are organized into the necessary node set and the prohibited node set as the input of the single-condition community search process;
对于由多个合取式合并得到的搜索项,将提取出的一个或者多个公共节点变量根据是否能够出现在社区中整理成必要节点集和禁止节点集作为单项条件社区搜索过程的输入,剩余部分用于对输出结果的判别;For search terms resulting from the combination of multiple conjunctions, the extracted one or more common node variables are sorted into necessary node sets and prohibited node sets according to whether they can appear in the community as the input of the single-item community search process, the remaining Partly used to judge the output results;
进行单项条件社区搜索,利用必要节点集和禁止节点集从网络图中搜索社区结果,使所得社区包含必要节点集,同时不包含禁止节点集中的节点。Carry out a single conditional community search, using the necessary node set and the prohibited node set to search the community results from the network graph, so that the resulting community contains the necessary node set, and does not include the nodes in the prohibited node set.
在基于本公开上述社区搜索方法的另一个实施例中,所述进行单项条件社区搜索,利用必要节点集和禁止节点集从网络图中搜索社区结果,使所得社区包含必要节点集,同时不包含禁止节点集中的节点包括三种实现方式,分别为:过滤后社区搜索的方式、加权过滤的方式、边搜索边过滤的方式。In another embodiment of the above community search method based on the present disclosure, the single-item community search is performed, using the necessary node set and the prohibited node set to search the community results from the network graph, so that the resulting community contains the necessary node set, while not including The nodes in the prohibited node set include three implementation methods, namely: community search after filtering, weighted filtering, and filtering while searching.
在基于本公开上述社区搜索方法的另一个实施例中,所述过滤后社区搜索的方式包括:In another embodiment of the above community search method based on the present disclosure, the filtered community search method includes:
从网络图中删除禁止节点集,得到不含禁止节点的网络图;Delete the forbidden node set from the network graph to get the network graph without prohibited nodes;
对新的网络图,用必要节点集作为输入进行社区搜索。For the new network graph, use the necessary node set as input to conduct a community search.
在基于本公开上述社区搜索方法的另一个实施例中,所述加权过滤的方式包括:In another embodiment of the above community search method based on the present disclosure, the weighted filtering method includes:
为网络图中所有节点赋以数值权重,令必要节点为1,禁止节点为-1,其余节点为0;Assign numerical weights to all nodes in the network graph, making the necessary nodes be 1, prohibiting nodes being -1, and remaining nodes being 0;
除了必要节点和禁止节点以外,迭代更新每个节点的权重,将其赋值为所有邻居节点权重的均值,即:In addition to the necessary and prohibited nodes, iteratively updates the weight of each node and assigns it to the average of the weights of all neighbor nodes, namely:
Figure PCTCN2019111419-appb-000001
Figure PCTCN2019111419-appb-000001
设定节点权重阈值λ,保留节点权重大于等于λ的节点,并将其在原网络图中的导出子图提取出来作为新的网络图;Set the node weight threshold λ, keep the nodes whose node weight is greater than or equal to λ, and extract the derived subgraph in the original network graph as a new network graph;
a01,将必要节点集划入社区结果C;a01, the necessary node set is classified into the community result C;
a02,根据给定的网络图,得到社区结果对应的导出子图,如果社区结果C的导出子图仅有一个连通分量并且所有节点在导出子图中的节点度数都大于等于给定阈值k,则停止并返回社区结果;a02, according to the given network graph, get the derived subgraph corresponding to the community result. If the derived subgraph of the community result C has only one connected component and all the nodes in the derived subgraph have node degrees greater than or equal to the given threshold k, Then stop and return to the community results;
a03,根据导出子图的连通分量,将同一个分量内的节点划入同一组;a03, according to the connected components of the derived subgraph, the nodes in the same component are grouped into the same group;
a04,将社区结果C中所有节点的邻居节点划入候选节点集Candidate,并排除已经存在于社区结果C中的节点;a04, the neighbor nodes of all nodes in the community result C are classified into the candidate node set Candidate, and the nodes that already exist in the community result C are excluded;
a05,若Candidate为空,则将社区结果C置为空集,并转到步骤a08;a05, if Candidate is empty, set the community result C to an empty set, and go to step a08;
a06,对候选节点集Candidate中的每个节点,记录其连接不同连通分量的个数a,连接社区结果C中的节点个数b,节点在给定网络图中的点度数d,之后根据a、b和d对节点集中的节点进行多关键字降序排序;a06, for each node in the candidate node set Candidate, record the number a of the connected connected components, the number b of the nodes in the community result C, the degree d of the node in the given network graph, and then according to a , B and d sort the nodes in the node set in multi-key descending order;
a07,若排在首位的候选节点c的度数小于阈值k,则将节点c从候选节点集Candidate中去除,转到步骤a06,否则,将节点c加入社区结果,同时将该节点的邻居节点加入到候选节点集Candidate,并把节点c从候选节点集Candidate中去除,转到步骤a02;a07, if the degree of the top candidate node c is less than the threshold k, then remove the node c from the candidate node set Candidate, go to step a06, otherwise, add node c to the community result, and join the node's neighbors Go to the candidate node set Candidate, and remove the node c from the candidate node set Candidate, go to step a02;
a08,将整个网络图的节点划入社区结果C;a08, the nodes of the entire network graph are included in the community result C;
a09,若社区结果C的导出子图的连通分量个数大于1,则停止并返回空集,如果社区结果C的导出子图中最小点的度数大于等于阈值k,则停止并返回社区结果C;a09, if the number of connected components of the derived subgraph of the community result C is greater than 1, stop and return to the empty set, if the degree of the smallest point in the derived subgraph of the community result C is greater than or equal to the threshold k, stop and return the community result C ;
a10,将社区结果C的导出子图中度数低于k的节点从社区结果C中删除,如果删除的节点是必要节点集的成员,那么停止并返回空集,否则转到步骤a09。a10. Delete the node whose degree is lower than k in the derived subgraph of the community result C from the community result C. If the deleted node is a member of the necessary node set, stop and return to the empty set, otherwise go to step a09.
在基于本公开上述社区搜索方法的另一个实施例中,所述用必要节点集作为输入对新的网络图进行社区搜索包括:In another embodiment of the above community search method based on the present disclosure, the performing community search on the new network graph with the necessary node set as input includes:
b01,将必要节点集划入社区结果C;b01, the necessary node set is classified into community result C;
b02,根据给定的网络图,得到社区结果对应的导出子图,如果社区结果C的导出子图仅有一个连通分量并且所有节点在导出子图中的节点度数都大于等于给定阈值k,则停止并返回社区结果;b02, according to the given network graph, get the derived subgraph corresponding to the community result. If the derived subgraph of the community result C has only one connected component and all the nodes in the derived subgraph have node degrees greater than or equal to the given threshold k, Then stop and return to the community results;
b03,根据导出子图的连通分量,将同一个分量内的节点划入同一组;b03, according to the connected components of the derived subgraph, the nodes in the same component are grouped into the same group;
b04,将社区结果C中所有节点的邻居节点划入候选节点集Candidate,并排除已经存在于社区结果C中的节点;b04, the neighbor nodes of all nodes in the community result C are classified into the candidate node set Candidate, and the nodes that already exist in the community result C are excluded;
b05,若Candidate为空,则将社区结果C置为空集,并转到步骤b08;b05, if Candidate is empty, set the community result C to an empty set, and go to step b08;
b06,对候选节点集Candidate中的每个节点,记录其连接不同连通分量的个数a,连接社区结果C中的节点个数b,节点在给定网络图中的点度数d,之后根据对节点集中的节点进行多关键字降序排序;b06, for each node in the candidate node set Candidate, record the number a of the connected connected components, the number b of the nodes in the community result C, the degree d of the node in the given network graph, and then according to the The nodes in the node set are sorted in multi-key descending order;
b07,若排在首位的候选节点c的度数小于阈值k,则将节点c从候选节点集Candidate中去除,转到步骤b06,否则,将节点c加入社区结果,同时将该节点的邻居节点加入到候选节点集Candidate,并把节点c从候选节点集Candidate中去除,转到步骤b02;b07, if the degree of the top candidate node c is less than the threshold k, then remove the node c from the candidate node set Candidate, go to step b06, otherwise, add node c to the community result, and join the node's neighbors Go to the candidate node set Candidate, and remove the node c from the candidate node set Candidate, go to step b02;
b08,将整个网络图的节点划入社区结果C;b08, the nodes of the entire network graph are included in the community result C;
b09,若社区结果C的导出子图的连通分量个数大于1,则停止并返回空集,如果社区结果C的导出子图中最小点的度数大于等于阈值k,则停止并返回社区结果C;b09, if the number of connected components of the derived subgraph of the community result C is greater than 1, stop and return to the empty set, if the degree of the smallest point in the derived subgraph of the community result C is greater than or equal to the threshold k, stop and return to the community result C ;
b10,将社区结果C的导出子图中度数低于k的节点从社区结果C中删除,如果删除的节点是必要节点集的成员,那么停止并返回空集,否则转到步骤b09。b10. Delete the node whose degree is lower than k in the derived subgraph of community result C from community result C. If the deleted node is a member of the necessary node set, stop and return to the empty set, otherwise go to step b09.
在基于本公开上述社区搜索方法的另一个实施例中,所述边搜索边过滤的方式包括:In another embodiment based on the above community search method of the present disclosure, the manner of searching while filtering includes:
c01,将必要节点集划入社区结果C;c01, the necessary node set is classified into community result C;
c02,根据给定的网络图,得到社区结果C对应的导出子图。如果社区结果C的导出子图仅有一个连通分量并且所有节点在导出子图中的节点度数都大于等于给定阈值k,则停止并返回社区结果C;c02. Obtain the derived subgraph corresponding to the community result C according to the given network graph. If the derived subgraph of the community result C has only one connected component and the degree of all nodes in the derived subgraph is greater than or equal to the given threshold k, then stop and return to the community result C;
c03,根据导出子图的连通分量,将同一个分量内的节点划入同一组;c03, according to the connected components of the derived subgraph, the nodes in the same component are grouped into the same group;
c04,将社区结果C中所有节点的邻居节点划入候选节点集Candidate,并从中排除已经存在于社区结果C中的节点和禁止节点;c04, the neighbor nodes of all nodes in the community result C are classified into the candidate node set Candidate, and the nodes and banned nodes that already exist in the community result C are excluded from it;
c05,若Candidate为空,则将社区结果C置为空集,并转到步骤c08;c05, if Candidate is empty, set the community result C to an empty set, and go to step c08;
c06,对于候选节点集Candidate中的每个节点,记录其连接不同连通分量的个数a,连接社区结果C中的节点个数b,节点在给定网络图中与非禁止节点连接的个数d;根据a、b和d对节点集中的节点进行多关键字降序排序;c06, for each node in the candidate node set Candidate, record the number a of the connected different connected components, the number b of the nodes in the connection community result C, the number of nodes connected to non-prohibited nodes in the given network graph d; Sort the nodes in the node set in multi-key descending order according to a, b, and d;
c07,若排在首位的候选节点c的度数小于阈值k,则将节点c从候选节点集Candidate中去除,转到步骤c06,否则,将节点c加入社区结果C,同时将该节点的邻居节点加入到候选节点集Candidate,并把节点c从候选节点集Candidate中去除,转到步骤c02;c07, if the degree of the top candidate node c is less than the threshold k, then remove the node c from the candidate node set Candidate, go to step c06, otherwise, add the node c to the community result C, and at the same time the neighbor node of the node Add to candidate node set Candidate, and remove node c from candidate node set Candidate, go to step c02;
c08,将整个网络图的节点划入社区结果C并删除禁止节点;c08, classify the nodes of the entire network graph into the community result C and delete the prohibited nodes;
c09,若社区结果C的导出子图的连通分量个数大于1,则停止并返回空集,如果社区结果C的导出子图中最小点的度数大于等于阈值k,检查此时社区结果C中是否含有禁止节点;c09, if the number of connected components of the derived subgraph of the community result C is greater than 1, stop and return to the empty set. If the degree of the smallest point in the derived subgraph of the community result C is greater than or equal to the threshold k, check the community result C at this time Whether it contains prohibited nodes;
c10,将社区结果C的导出子图中度数低于k的节点从社区结果C中删除,如果删除的节点是必要节点集的成员,则停止并返回空集,否则转到步骤c09。c10. Delete the node whose degree is lower than k in the derived subgraph of the community result C from the community result C. If the deleted node is a member of the necessary node set, stop and return to the empty set, otherwise go to step c09.
与现有技术相比,本公开具有以下优点:Compared with the prior art, the present disclosure has the following advantages:
本公开提出了一种社区搜索方法,将搜索条件以统一的布尔表达式形式进行表示,便于用户进行个性化表达搜索需求,也便于复杂条件下社区搜索的执行;由于在社区搜索的过程中考虑了用户不希望出现在社区内的节点,搜索得到的社区结果更符合用户的期望,使得社区结果更加个性化;由于允许考虑社区中至少包含给定节点中一个这样的需求,从而一个搜索条件可能 得到多个不同的社区结果,且都满足搜索条件,这使得用户对结果的选择更为丰富;提供了多种不同的实现方式,可以根据实际需要进行选择。The present disclosure proposes a community search method, which expresses the search conditions in the form of a unified Boolean expression, which is convenient for users to express their search needs personally, and also facilitates the execution of community search under complex conditions; due to the consideration in the process of community search In addition, users do not want to appear in the community, and the community results obtained by the search are more in line with the user's expectations, making the community results more personalized; because it is allowed to consider that the community contains at least one such demand in a given node, a search condition may Obtain multiple different community results, and all meet the search conditions, which makes the user's choice of results more abundant; provides a variety of different implementation methods, you can choose according to actual needs.
附图说明BRIEF DESCRIPTION
构成说明书的一部分的附图描述了本公开的实施例,并且连同描述一起用于解释本公开的原理。The drawings that form a part of the specification describe embodiments of the present disclosure, and together with the description serve to explain the principles of the present disclosure.
参照附图,根据下面的详细描述,可以更加清楚地理解本公开,其中:With reference to the drawings, the present disclosure can be more clearly understood from the following detailed description, in which:
图1为本公开的社区搜索方法的一个实施例的流程图;FIG. 1 is a flowchart of an embodiment of a community search method of the present disclosure;
图2为本公开的社区搜索方法的另一个实施例的流程图;2 is a flowchart of another embodiment of the community search method of the present disclosure;
图3为本公开的社区搜索方法的又一个实施例的流程图;3 is a flowchart of another embodiment of the community search method of the present disclosure;
图4为本公开的社区搜索方法的又一个实施例的流程图;4 is a flowchart of yet another embodiment of the community search method of the present disclosure;
图5为本公开的社区搜索方法的又一个实施例的流程图;5 is a flowchart of another embodiment of the community search method of the present disclosure;
图6为本公开的社区搜索方法的又一个实施例的流程图;6 is a flowchart of yet another embodiment of the community search method of the present disclosure;
图7为本公开的社区搜索方法的又一个实施例的流程图;7 is a flowchart of yet another embodiment of the community search method of the present disclosure;
图8为本公开的社区搜索方法的又一个实施例的流程图。FIG. 8 is a flowchart of another embodiment of the community search method of the present disclosure.
具体实施方式detailed description
现在将参照附图来详细描述本公开的各种示例性实施例。应注意到:除非另外具体说明,否则在这些实施例中阐述的部件和步骤的相对布置、数字表达式和数值不限制本公开的范围。Various exemplary embodiments of the present disclosure will now be described in detail with reference to the drawings. It should be noted that the relative arrangement of components and steps, numerical expressions, and numerical values set forth in these embodiments do not limit the scope of the present disclosure unless specifically stated otherwise.
对于相关领域普通技术人员已知的技术、方法和设备可能不作详细讨论,但在适当情况下,所述技术、方法和设备应当被视为说明书的一部分。Techniques, methods and equipment known to those of ordinary skill in the related art may not be discussed in detail, but where appropriate, the techniques, methods and equipment should be considered as part of the specification.
图1为本公开的社区搜索方法的一个实施例的流程图,如图1所示,所述社区搜索方法包括:FIG. 1 is a flowchart of an embodiment of a community search method of the present disclosure. As shown in FIG. 1, the community search method includes:
10,根据用户对于社区搜索的需求,将节点对应于节点变量,写出对应的搜索条件,节点变量亦称为布尔变量,搜索条件以布尔表达式表示;10. According to the user's needs for community search, correspond the node to the node variable, and write the corresponding search conditions. The node variable is also called Boolean variable, and the search condition is expressed by Boolean expression;
20,将搜索条件转化为多个搜索项;20. Convert the search criteria into multiple search terms;
30,对于每一搜索项进行单项条件的社区搜索;30. For each search term, conduct a single-item community search;
40,将各个单项条件社区搜索的结果合并,即对社区结果取并集进行返回。40. Combine the results of each single-item community search, that is, take the union result of the community results and return it.
图2为本公开的社区搜索方法的另一个实施例的流程图,如图2所示,所述根据用户对于社区搜索的需求,将节点对应于节点变量,写出对应的搜索条件包括:FIG. 2 is a flowchart of another embodiment of the community search method of the present disclosure. As shown in FIG. 2, according to the user ’s demand for community search, the nodes are mapped to node variables, and the corresponding search conditions are written out as follows:
101,将用户需求中提及的节点对应为布尔变量;101. Corresponding to the node mentioned in the user's needs as a Boolean variable;
102,不允许出现在社区中的节点以逻辑非修饰,社区必须包含的节点无修饰,所述逻辑非的表示符号为
Figure PCTCN2019111419-appb-000002
102, the nodes that appear in the community are not allowed to be logically undecorated, and the nodes that the community must contain are undecorated. The symbol of the logical negation is
Figure PCTCN2019111419-appb-000002
103,必须同时出现在社区中的节点以逻辑与连接,社区必须包含的节点和不允许的包含的节点,也用逻辑与连接,所述逻辑与的表示符号为“∧”,例如:布尔表达式
Figure PCTCN2019111419-appb-000003
表示用户希望社区必须包含节点A和节点B,不允许包含节点C。;
103, the nodes that must appear in the community at the same time are connected by logic and, the nodes that the community must contain and the nodes that are not allowed are also connected by logic and, the symbol of the logic and is "∧", for example: Boolean formula
Figure PCTCN2019111419-appb-000003
Indicates that the user wants the community to include node A and node B, and node C is not allowed. ;
104,社区必须包含若干个节点中至少一个需要以逻辑或连接的若干个节点来表示,所述逻辑或的表示符号为“∨”,例如:布尔表达式A∨B∨C表示用户希望社区至少包含节点A、节点B和节点C中的一个。104. The community must contain at least one of several nodes that need to be represented by logical or connected nodes. The logical OR symbol is "∨", for example: the Boolean expression A∨B∨C indicates that the user wants the community to at least Contains one of node A, node B, and node C.
图3为本公开的社区搜索方法的又一个实施例的流程图,如图3所示,所述将搜索条件转化为多个搜索项包括:FIG. 3 is a flowchart of yet another embodiment of the community search method of the present disclosure. As shown in FIG. 3, the conversion of search conditions into multiple search terms includes:
201,枚举满足搜索条件的节点变量取值组合,从而得出与搜索条件等价的主析取范式;201. Enumerate the value combinations of node variables that satisfy the search condition, so as to obtain a main disjunctive paradigm equivalent to the search condition;
202,将主析取范式通过Quine-McCluskey算法化简为最简与或式;202, the main disjunctive paradigm is reduced to the simplest OR by the Quine-McCluskey algorithm;
203,将最简与或式的每个合取项设置为搜索项,例如:最简与或式(A∧B)∨(C∧D)就包含两个搜索项,分别是(A∧B)和(C∧D),如果发现有若干个搜索项存在相同的节点变量,那么可以把这些搜索项合并为一个新的搜索项,例如:最简与或式
Figure PCTCN2019111419-appb-000004
的两个搜索项(A∧B)和
Figure PCTCN2019111419-appb-000005
含有相同节点变量A,于是可以将公共节点变量A提取,将这两个合取项合并为
Figure PCTCN2019111419-appb-000006
Figure PCTCN2019111419-appb-000007
为减少搜索项的个数,从而减少后续进行单项条件社区搜索过程的次数,达到节省时间开销的目的;
203. Set each conjunction of the simplest and OR formula as the search term, for example: the simplest and OR formula (A∧B) ∨ (C∧D) contains two search terms, which are (A∧B ) And (C∧D), if you find that there are several search terms with the same node variable, you can merge these search terms into a new search term, for example: the most simple and or
Figure PCTCN2019111419-appb-000004
Search terms (A∧B) and
Figure PCTCN2019111419-appb-000005
Contains the same node variable A, so you can extract the common node variable A, and merge these two conjunctions into
Figure PCTCN2019111419-appb-000006
Figure PCTCN2019111419-appb-000007
In order to reduce the number of search terms, thereby reducing the number of subsequent single-item community search processes, to achieve the purpose of saving time overhead;
204,提取出不同合取项中出现次数最多的一个没有逻辑非修饰的变量,如果含有该变量的合取项多于1,则把这些合取项合并成新的搜索项,重复此步骤直至没有合取项可以进行合并。204. Extract a variable with no logical undecoration that occurs most frequently among different conjuncts. If the conjuncts containing the variable are more than 1, then merge these conjuncts into a new search term and repeat this step until No conjuncts can be merged.
图4为本公开的社区搜索方法的又一个实施例的流程图,如图4所示,所述对于每一搜索项进行单项条件的社区搜索包括:FIG. 4 is a flowchart of yet another embodiment of the community search method of the present disclosure. As shown in FIG. 4, performing a single-item community search for each search item includes:
301,对于合取式形式的搜索项,将其中社区必须包含的节点和不允许包含的节点分别整理成必要节点集和禁止节点集作为单项条件社区搜索过程的输入,这是因为它只包含社区中必须出现的节点和不允许出现的节点;301. For search terms in conjunctive form, the nodes that must be included in the community and the nodes that are not allowed to be included are organized into the necessary node set and the prohibited node set as the input of the single-item community search process, because it only contains the community Must appear in the node and nodes that are not allowed to appear;
302,对于由多个合取式合并得到的搜索项,将提取出的一个或者多个公共节点变量根据是否能够出现在社区中整理成必要节点集和禁止节点集作为单项条件社区搜索过程的输入,剩余部分用于对输出结果的判别,例如:在两个搜索项
Figure PCTCN2019111419-appb-000008
合并成的搜索项
Figure PCTCN2019111419-appb-000009
中,必要节点集{A}作为单项条件社区搜索过程的输入,即用于寻找包含节点A的社区,
Figure PCTCN2019111419-appb-000010
作为输出结果的判别式,即用于判断社区结果是否含有节点B或者不含有节点D;
302. For the search terms resulting from the combination of multiple conjunctions, the extracted one or more common node variables are organized into a necessary node set and a prohibited node set according to whether they can appear in the community as input to the single-item community search process , The remaining part is used to distinguish the output results, for example: in two search terms
Figure PCTCN2019111419-appb-000008
Combined search terms
Figure PCTCN2019111419-appb-000009
In, the necessary node set {A} is used as the input to the single-item community search process, that is, to find the community containing node A,
Figure PCTCN2019111419-appb-000010
The discriminant as the output result is used to determine whether the community result contains node B or does not contain node D;
303,进行单项条件社区搜索,利用必要节点集和禁止节点集从网络图中搜索社区结果,使所得社区包含必要节点集,同时不包含禁止节点集中的节点。303. Perform a single conditional community search, and use the necessary node set and the prohibited node set to search the community results from the network graph, so that the resulting community contains the necessary node set, and does not include the nodes in the prohibited node set.
所述进行单项条件社区搜索,利用必要节点集和禁止节点集从网络图中搜索社区结果,使所得社区包含必要节点集,同时不包含禁止节点集中的节点包括三种实现方式,分别为:过滤后社区搜索的方式、加权过滤的方式、边搜索边过滤的方式。The single conditional community search described above uses the necessary node set and the prohibited node set to search the community results from the network graph, so that the resulting community contains the necessary node set, and the nodes that do not contain the prohibited node set include three implementation methods, namely: filtering Post-community search, weighted filtering, and filtering while searching.
所述过滤后社区搜索的方式包括:The filtered community search method includes:
从网络图中删除禁止节点集,得到不含禁止节点的网络图;Delete the forbidden node set from the network graph to get the network graph without prohibited nodes;
对新的网络图,用必要节点集作为输入进行社区搜索。For the new network graph, use the necessary node set as input to conduct a community search.
图5为本公开的社区搜索方法的又一个实施例的流程图,如图5所示,所述加权过滤的方式包括:FIG. 5 is a flowchart of another embodiment of the community search method of the present disclosure. As shown in FIG. 5, the weighted filtering method includes:
401,为网络图中所有节点赋以数值权重,令必要节点为1,禁止节点为-1,其余节点为0,即:401, assign numerical weights to all nodes in the network graph, make the necessary nodes to 1, prohibit nodes to be -1, and other nodes to be 0, namely:
Figure PCTCN2019111419-appb-000011
Figure PCTCN2019111419-appb-000011
402,除了必要节点和禁止节点以外,迭代更新每个节点的权重,将其赋值为所有邻居节点权重的均值,即:402. In addition to the necessary and prohibited nodes, iteratively updates the weight of each node and assigns it to the average of the weights of all neighboring nodes, namely:
Figure PCTCN2019111419-appb-000012
Figure PCTCN2019111419-appb-000012
403,设定节点权重阈值λ,保留节点权重大于等于λ的节点,并将其在原网络图中的导出子图提取出来作为新的网络图;403. Set a node weight threshold λ, retain nodes whose node weight is greater than or equal to λ, and extract their derived subgraph in the original network graph as a new network graph;
404,用必要节点集作为输入对新的网络图进行社区搜索。404. Perform community search on the new network graph with the necessary node set as input.
图6为本公开的社区搜索方法的又一个实施例的流程图,如图6所示,所述对新的网络图,用必要节点集作为输入进行社区搜索包括:FIG. 6 is a flowchart of another embodiment of the community search method of the present disclosure. As shown in FIG. 6, performing community search using the necessary node set as input for the new network diagram includes:
501,将必要节点集划入社区结果C;501, the necessary node set is classified into the community result C;
502,根据给定的网络图,得到社区结果对应的导出子图,如果社区结果C的导出子图仅有一个连通分量并且所有节点在导出子图中的节点度数都大于等于给定阈值k,则停止并返回社区结果;502. Obtain the derived subgraph corresponding to the community result according to the given network graph. If the derived subgraph of the community result C has only one connected component and the node degree of all nodes in the derived subgraph is greater than or equal to the given threshold k, Then stop and return to the community results;
503,根据导出子图的连通分量,将同一个分量内的节点划入同一组;503, according to the connected components of the derived subgraph, divide the nodes in the same component into the same group;
504,将社区结果C中所有节点的邻居节点划入候选节点集Candidate,并排除已经存在于社区结果C中的节点;504, classify the neighbor nodes of all nodes in the community result C into the candidate node set Candidate, and exclude nodes that already exist in the community result C;
505,若Candidate为空,则将社区结果C置为空集,并转到步骤508;505, if Candidate is empty, set the community result C to an empty set, and go to step 508;
506,对候选节点集Candidate中的每个节点,记录其连接不同连通分量的个数a,连接社区结果C中的节点个数b,节点在给定网络图中的点度数d,之后根据a、b和d对节点集中的节点进行多关键字降序排序;506. For each node in the candidate node set Candidate, record the number a of the connected connected components, the number b of the nodes in the community result C, the degree d of the node in the given network graph, and then according to a , B and d sort the nodes in the node set in multi-key descending order;
507,若排在首位的候选节点c的度数小于阈值k,则将节点c从候选节点集Candidate中去除,转至步骤506,否则,将节点c加入社区结果,同时将该节点的邻居节点加入到候选节点集Candidate,并把节点c从候选节点集Candidate中去除,转至步骤502;507, if the degree of the candidate node c ranked first is less than the threshold k, then the node c is removed from the candidate node set Candidate, go to step 506, otherwise, the node c is added to the community result, and the neighbor nodes of the node Go to the candidate node set Candidate, and remove the node c from the candidate node set Candidate, go to step 502;
508,将整个网络图的节点划入社区结果C;508, the nodes of the entire network graph are classified into the community result C;
509,若社区结果C的导出子图的连通分量个数大于1,则停止并返回空集,如果社区结果C的导出子图中最小点的度数大于等于阈值k,则停止并返回社区结果C;509, if the number of connected components of the derived subgraph of the community result C is greater than 1, stop and return to the empty set, if the degree of the smallest point in the derived subgraph of the community result C is greater than or equal to the threshold k, stop and return to the community result C ;
510,将社区结果C的导出子图中度数低于k的节点从社区结果C中删除,如果删除的节点是必要节点集的成员,那么停止并返回空集,否则转到步骤509。510. Delete the node whose degree is lower than k in the derived subgraph of the community result C from the community result C. If the deleted node is a member of the necessary node set, stop and return to the empty set, otherwise go to step 509.
图7为本公开的社区搜索方法的又一个实施例的流程图,如图7所示,所述用必要节点集作为输入对新的网络图进行社区搜索包括:FIG. 7 is a flowchart of another embodiment of the community search method of the present disclosure. As shown in FIG. 7, the performing community search on the new network graph using the necessary node set as input includes:
601,将必要节点集划入社区结果C;601, the necessary node set is classified into the community result C;
602,根据给定的网络图,得到社区结果对应的导出子图,如果社区结果C的导出子图仅有一个连通分量并且所有节点在导出子图中的节点度数都大于等于给定阈值k,则停止并返回社区结果;602. Obtain the derived subgraph corresponding to the community result according to the given network graph. If the derived subgraph of the community result C has only one connected component and the degree of all nodes in the derived subgraph is greater than or equal to the given threshold k, Then stop and return to the community results;
603,根据导出子图的连通分量,将同一个分量内的节点划入同一组;603, according to the connected components of the derived subgraph, the nodes in the same component are grouped into the same group;
604,将社区结果C中所有节点的邻居节点划入候选节点集Candidate,并排除已经存在于社区结果C中的节点;604, classify the neighbor nodes of all nodes in the community result C into the candidate node set Candidate, and exclude nodes that already exist in the community result C;
605,若Candidate,则将社区结果C置为空集,并转到步骤608;605, if Candidate, the community result C is set to an empty set, and go to step 608;
606,对候选节点集Candidate中的每个节点,记录其连接不同连通分量的个数a,连接社区结果C中的节点个数b,节点在给定网络图中的点度数d,之后根据a、b和d对节点集中的节点进行多关键字降序排序;606, for each node in the candidate node set Candidate, record the number a of the connected connected components, the number b of the nodes in the community result C, the degree d of the node in the given network graph, and then according to a , B and d sort the nodes in the node set in multi-key descending order;
607,若排在首位的候选节点c的度数小于阈值k,则将节点c从候选节点集Candidate中去除,转至步骤606,否则,将节点c加入社区结果,同时将该节点的邻居节点加入到候选节点集Candidate,并把节点c从候选节点集Candidate中去除,转到步骤602;607, if the degree of the candidate node c ranked first is less than the threshold k, the node c is removed from the candidate node set Candidate, go to step 606, otherwise, the node c is added to the community result, and the neighbor node of the node is also added Go to the candidate node set Candidate, and remove the node c from the candidate node set Candidate, go to step 602;
608,将整个网络图的节点划入社区结果C;608, the nodes of the entire network graph are classified into the community result C;
609,若社区结果C的导出子图的连通分量个数大于1,则停止并返回空集,如果社区结果C的导出子图中最小点的度数大于等于阈值k,则停止并返回社区结果C;609, if the number of connected components of the derived subgraph of the community result C is greater than 1, stop and return to the empty set, if the degree of the smallest point in the derived subgraph of the community result C is greater than or equal to the threshold k, stop and return to the community result C ;
610,将社区结果C的导出子图中度数低于k的节点从社区结果C中删除,如果删除的节点是必要节点集的成员,那么停止并返回空集,否则转到步骤609。610. Delete the node whose degree is lower than k in the derived subgraph of the community result C from the community result C. If the deleted node is a member of the necessary node set, stop and return to the empty set, otherwise go to step 609.
图8为本公开的社区搜索方法的又一个实施例的流程图,如图8所示,所述边搜索边过滤的方式包括:FIG. 8 is a flowchart of another embodiment of the community search method of the present disclosure. As shown in FIG. 8, the manner of searching while filtering includes:
701,将必要节点集划入社区结果C;701, the necessary node set is classified into the community result C;
702,根据给定的网络图,得到社区结果C对应的导出子图。如果社区结果C的导出子图仅有一个连通分量并且所有节点在导出子图中的节点度数都大于等于给定阈值k,则停止并返回社区结果C;702. Obtain the derived subgraph corresponding to the community result C according to the given network graph. If the derived subgraph of the community result C has only one connected component and the degree of all nodes in the derived subgraph is greater than or equal to the given threshold k, then stop and return to the community result C;
703,根据导出子图的连通分量,将同一个分量内的节点划入同一组;703, according to the connected components of the derived subgraph, the nodes in the same component are grouped into the same group;
704,将社区结果C中所有节点的邻居节点划入候选节点集Candidate,并从中排除已经存在于社区结果C中的节点和禁止节点;704, classify the neighbor nodes of all nodes in the community result C into the candidate node set Candidate, and exclude the nodes and prohibited nodes that already exist in the community result C from it;
705,若Candidate为空,则将社区结果C置为空集,并转到步骤708;705, if Candidate is empty, the community result C is set to an empty set, and go to step 708;
706,对于候选节点集Candidate中的每个节点,记录其连接不同连通分量的个数a,连接社区结果C中的节点个数b,节点在给定网络图中与非禁止节点连接的个数d;根据a、b和d对节点集中的节点进行多关键字降序排序;706, for each node in the candidate node set Candidate, record the number a of its connected components, the number b of nodes in the community result C, and the number of nodes connected to non-prohibited nodes in the given network graph d; Sort the nodes in the node set in multi-key descending order according to a, b, and d;
707,若排在首位的候选节点c的度数小于阈值k,则将节点c从候选节点集Candidate中去除,转到步骤706,否则,将节点c加入社区结果C,同时将该节点的邻居节点加入到候选节点集Candidate,并把节点c从候选节点集Candidate中去除;707, if the degree of the candidate node c ranked first is less than the threshold k, then the node c is removed from the candidate node set Candidate, go to step 706, otherwise, the node c is added to the community result C, and the neighbor node of the node Add to candidate node set Candidate, and remove node c from candidate node set Candidate;
708,将整个网络图的节点划入社区结果C并删除禁止节点;708, the nodes of the entire network graph are classified into the community result C and the prohibited nodes are deleted;
709,若社区结果C的导出子图的连通分量个数大于1,则停止并返回空集,如果社区结果C的导出子图中最小点的度数大于等于阈值k,则停止并返回社区结果C;709, if the number of connected components of the derived subgraph of the community result C is greater than 1, then stop and return to the empty set, if the degree of the smallest point in the derived subgraph of the community result C is greater than or equal to the threshold k, then stop and return to the community result C ;
710,将社区结果C的导出子图中度数低于k的节点删除,如果删除的节点是必要节点集的成员,则停止并返回空集,否则跳转到步骤709。710. Delete the node whose degree is lower than k in the derived subgraph of the community result C. If the deleted node is a member of the necessary node set, stop and return to the empty set, otherwise jump to step 709.
本说明书中各个实施例均采用递进的方式描述,每个实施例重点说明的都是与其它实施例的不同之处,各个实施例之间相同或相似的部分相互参见即可。对于系统实施例而言,由于其与方法实施例基本对应,所以描述的比较简单,相关之处参见方法实施例的部分说明即可。Each embodiment in this specification is described in a progressive manner. Each embodiment focuses on the differences from other embodiments, and the same or similar parts between the various embodiments may refer to each other. For the system embodiment, since it basically corresponds to the method embodiment, the description is relatively simple, and the relevant part can be referred to the description of the method embodiment.
本公开的描述是为了示例和描述起见而给出的,而并不是无遗漏的或者将本公开限于所公开的形式。很多修改和变化对于本领域的普通技术人员而言是显然的。选择和描述实施例是为了更好说明本公开的原理和实际应用,并且使本领域的普通技术人员能够理解本公开从而设计适于特定用途的带有各种修改的各种实施例。The description of the present disclosure is given for the sake of example and description, and is not exhaustive or limits the present disclosure to the disclosed form. Many modifications and changes will be apparent to those of ordinary skill in the art. The embodiments are selected and described in order to better explain the principles and practical applications of the present disclosure, and enable those of ordinary skill in the art to understand the present disclosure to design various embodiments with various modifications suitable for specific uses.

Claims (10)

  1. 一种社区搜索方法,其特征在于,包括:A method of community search, which includes:
    根据用户对于社区搜索的需求,将节点对应于节点变量,写出对应的搜索条件;According to the user's needs for community search, the nodes are mapped to node variables, and the corresponding search conditions are written;
    将搜索条件转化为多个搜索项;Convert search terms into multiple search terms;
    对于每一搜索项进行单项条件的社区搜索;A single-item community search is performed for each search term;
    将各个单项条件社区搜索的结果合并,即对社区结果取并集进行返回。Combine the results of each single-item community search, that is, take the union result of the community and return it.
  2. 根据权利要求1所述的社区搜索方法,其特征在于,所述根据用户对于社区搜索的需求,将节点对应于节点变量,写出对应的搜索条件包括:The community search method according to claim 1, wherein, according to the user's demand for community search, the node is corresponded to a node variable, and writing out the corresponding search condition includes:
    将用户需求中提及的节点对应为布尔变量;Corresponding to the nodes mentioned in user requirements as Boolean variables;
    不允许出现在社区中的节点以逻辑非修饰,社区必须包含的节点无修饰;The nodes that appear in the community are not allowed to be logically undecorated, and the nodes that the community must contain are not decorated;
    必须同时出现在社区中的节点以逻辑与连接,社区必须包含的节点和不允许的包含的节点,也用逻辑与连接;The nodes that must appear in the community at the same time are connected with logic and the nodes that the community must contain and the nodes that are not allowed are also connected with logic and;
    社区必须包含若干个节点中至少一个需要以逻辑或连接的若干个节点来表示。The community must contain at least one of several nodes that need to be represented by logical or connected nodes.
  3. 根据权利要求2所述的社区搜索方法,其特征在于,所述将搜索条件转化为多个搜索项包括:The community search method according to claim 2, wherein the conversion of the search condition into multiple search terms includes:
    枚举满足搜索条件的节点变量取值组合,从而得出与搜索条件等价的主析取范式;Enumerate the value combinations of node variables that satisfy the search condition, so as to obtain the main disjunctive paradigm equivalent to the search condition;
    将主析取范式通过Quine-McCluskey算法化简为最简与或式;The main disjunctive paradigm is reduced to the simplest OR by the Quine-McCluskey algorithm;
    将最简与或式的每个合取项设置为搜索项;Set each conjunct of the simplest and or as the search term;
    提取出不同合取项中出现次数最多的一个没有逻辑非修饰的变量,如果含有该变量的合取项多于1,则把这些合取项合并成新的搜索项,重复此步骤直至没有合取项可以进行合并。Extract the most frequent occurrence of a variable without logical undecoration among different conjuncts. If there are more than one conjunct with this variable, merge these conjuncts into a new search term, and repeat this step until there is no conjunct The items can be merged.
  4. 根据权利要求3所述的社区搜索方法,其特征在于,所述对于每一搜索项进行单项条件的社区搜索包括:The community search method according to claim 3, wherein the community search with a single condition for each search item includes:
    对于合取式形式的搜索项,将其中社区必须包含的节点和不允许包含的节点分别整理成必要节点集和禁止节点集作为单项条件社区搜索过程的输入;For search terms in conjunctive form, the nodes that must be included in the community and the nodes that are not allowed to be included are organized into the necessary node set and the prohibited node set as the input of the single-condition community search process;
    对于由多个合取式合并得到的搜索项,将提取出的一个或者多个公共节点变量根据是否能够出现在社区中整理成必要节点集和禁止节点集作为单项条件社区搜索过程的输入,剩余部分用于对输出结果的判别;For search terms resulting from the combination of multiple conjunctions, the extracted one or more common node variables are sorted into necessary node sets and prohibited node sets according to whether they can appear in the community as the input of the single-item community search process, the remaining Partly used to judge the output results;
    进行单项条件社区搜索,利用必要节点集和禁止节点集从网络图中搜索社区结果,使所得社区包含必要节点集,同时不包含禁止节点集中的节点。Carry out a single conditional community search, using the necessary node set and the prohibited node set to search the community results from the network graph, so that the resulting community contains the necessary node set, and does not include the nodes in the prohibited node set.
  5. 根据权利要求4所述的社区搜索方法,其特征在于,所述进行单项条件社区搜索,利用必要节点集和禁止节点集从网络图中搜索社区结果,使所得社区包含必要节点集,同时不包含禁止节点集中的节点包括三种实现方式,分别为:过滤后社区搜索的方式、加权过滤的方式、边搜索边过滤的方式。The community search method according to claim 4, characterized in that the single-item community search is performed to search the community results from the network graph using the necessary node set and the prohibited node set, so that the resulting community contains the necessary node set and does not include The nodes in the prohibited node set include three implementation methods, namely: community search after filtering, weighted filtering, and filtering while searching.
  6. 根据权利要求5所述的社区搜索方法,其特征在于,所述过滤后社区搜索的方式包括:The community search method according to claim 5, wherein the filtered community search method includes:
    从网络图中删除禁止节点集,得到不含禁止节点的网络图;Delete the forbidden node set from the network graph to get the network graph without prohibited nodes;
    对新的网络图,用必要节点集作为输入进行社区搜索。For the new network graph, use the necessary node set as input to conduct a community search.
  7. 根据权利要求5所述的社区搜索方法,其特征在于,所述加权过滤的方式包括:The community search method according to claim 5, wherein the weighted filtering method includes:
    为网络图中所有节点赋以数值权重,令必要节点为1,禁止节点为-1,其余节点为0;Assign numerical weights to all nodes in the network graph, making the necessary nodes be 1, prohibiting nodes being -1, and remaining nodes being 0;
    除了必要节点和禁止节点以外,迭代更新每个节点的权重,将其赋值为所有邻居节点权重的均值,即:In addition to the necessary and prohibited nodes, iteratively updates the weight of each node and assigns it to the average of the weights of all neighbor nodes, namely:
    Figure PCTCN2019111419-appb-100001
    Figure PCTCN2019111419-appb-100001
    设定节点权重阈值λ,保留节点权重大于等于λ的节点,并将其在原网络图中的导出子图提取出来作为新的网络图;Set the node weight threshold λ, keep the nodes whose node weight is greater than or equal to λ, and extract the derived subgraph in the original network graph as a new network graph;
    用必要节点集作为输入对新的网络图进行社区搜索。Use the necessary node set as input to conduct a community search on the new network graph.
  8. 根据权利要求6所述的社区搜索方法,其特征在于,所述对新的网络图,用必要节点集作为输入进行社区搜索包括:The community search method according to claim 6, wherein the performing community search on the new network graph using the necessary node set as input includes:
    a01,将必要节点集划入社区结果C;a01, the necessary node set is classified into the community result C;
    a02,根据给定的网络图,得到社区结果对应的导出子图,如果社区结果C的导出子图仅有一个连通分量并且所有节点在导出子图中的节点度数都大于等于给定阈值k,则停止并返回社区结果;a02, according to the given network graph, get the derived subgraph corresponding to the community result. If the derived subgraph of the community result C has only one connected component and all the nodes in the derived subgraph have node degrees greater than or equal to the given threshold k, Then stop and return to the community results;
    a03,根据导出子图的连通分量,将同一个分量内的节点划入同一组;a03, according to the connected components of the derived subgraph, the nodes in the same component are grouped into the same group;
    a04,将社区结果C中所有节点的邻居节点划入候选节点集Candidate,并排除已经存在于社区结果C中的节点;a04, the neighbor nodes of all nodes in the community result C are classified into the candidate node set Candidate, and the nodes that already exist in the community result C are excluded;
    a05,若Candidate为空,则将社区结果C置为空集,并转到步骤a08;a05, if Candidate is empty, set the community result C to an empty set, and go to step a08;
    a06,对候选节点集Candidate中的每个节点,记录其连接不同连通分量的个数a,连接社区结果C中的节点个数b,节点在给定网络图中的点度数d,之后根据a、b和d对节点集中的节点进行多关键字降序排序;a06, for each node in the candidate node set Candidate, record the number a of the connected connected components, the number b of the nodes in the community result C, the degree d of the node in the given network graph, and then according to a , B and d sort the nodes in the node set in multi-key descending order;
    a07,若排在首位的候选节点c的度数小于阈值k,则将节点c从候选节点集Candidate中去除,转到步骤a06,否则,将节点c加入社区结果,同时将该节点的邻居节点加入到候选节点集Candidate,并把节点c从候选节点集Candidate中去除,转到步骤a02;a07, if the degree of the top candidate node c is less than the threshold k, then remove the node c from the candidate node set Candidate, go to step a06, otherwise, add node c to the community result, and join the node's neighbors Go to the candidate node set Candidate, and remove the node c from the candidate node set Candidate, go to step a02;
    a08,将整个网络图的节点划入社区结果C;a08, the nodes of the entire network graph are included in the community result C;
    a09,若社区结果C的导出子图的连通分量个数大于1,则停止并返回空集,如果社区结果C的导出子图中最小点的度数大于等于阈值k,则停止并返回社区结果C;a09, if the number of connected components of the derived subgraph of the community result C is greater than 1, stop and return to the empty set, if the degree of the smallest point in the derived subgraph of the community result C is greater than or equal to the threshold k, stop and return the community result C ;
    a10,将社区结果C的导出子图中度数低于k的节点从社区结果C中删除,如果删除的节点是必要节点集的成员,那么停止并返回空集,否则转到步骤a09。a10. Delete the node whose degree is lower than k in the derived subgraph of the community result C from the community result C. If the deleted node is a member of the necessary node set, stop and return to the empty set, otherwise go to step a09.
  9. 根据权利要求7所述的社区搜索方法,其特征在于,所述用必要节点集作为输入对新的网络图进行社区搜索包括:The community search method according to claim 7, wherein the performing community search on the new network graph with the necessary node set as input includes:
    b01,将必要节点集划入社区结果C;b01, the necessary node set is classified into community result C;
    b02,根据给定的网络图,得到社区结果对应的导出子图,如果社区结果C的导出子图仅有一个连通分量并且所有节点在导出子图中的节点度数都大于等于给定阈值k,则停止并返回社区结果;b02, according to the given network graph, get the derived subgraph corresponding to the community result. If the derived subgraph of the community result C has only one connected component and all the nodes in the derived subgraph have node degrees greater than or equal to the given threshold k, Then stop and return to the community results;
    b03,根据导出子图的连通分量,将同一个分量内的节点划入同一组;b03, according to the connected components of the derived subgraph, the nodes in the same component are grouped into the same group;
    b04,将社区结果C中所有节点的邻居节点划入候选节点集Candidate,并排除已经存在于社区结果C中的节点;b04, the neighbor nodes of all nodes in the community result C are classified into the candidate node set Candidate, and the nodes that already exist in the community result C are excluded;
    b05,若Candidate为空,则将社区结果C置为空集,并转到步骤b08;b05, if Candidate is empty, set the community result C to an empty set, and go to step b08;
    b06,对候选节点集Candidate中的每个节点,记录其连接不同连通分量的个数a,连接社区结果C中的节点个数b,节点在给定网络图中的点度数d,之后根据对节点集中的节点进行多关键字降序排序;b06, for each node in the candidate node set Candidate, record the number a of the connected connected components, the number b of the nodes in the community result C, the degree d of the node in the given network graph, and then according to the The nodes in the node set are sorted in multi-key descending order;
    b07,若排在首位的候选节点c的度数小于阈值k,则将节点c从候选节点集Candidate中去除,转到步骤b06,否则,将节点c加入社区结果,同时将该节点的邻居节点加入到候选节点集Candidate,并把节点c从候选节点集Candidate中去除,转到步骤b02;b07, if the degree of the top candidate node c is less than the threshold k, then remove the node c from the candidate node set Candidate, go to step b06, otherwise, add node c to the community result, and join the node's neighbors Go to the candidate node set Candidate, and remove the node c from the candidate node set Candidate, go to step b02;
    b08,将整个网络图的节点划入社区结果C;b08, the nodes of the entire network graph are included in the community result C;
    b09,若社区结果C的导出子图的连通分量个数大于1,则停止并返回空集,如果社区结果C的导出子图中最小点的度数大于等于阈值k,则停止并返回社区结果C;b09, if the number of connected components of the derived subgraph of the community result C is greater than 1, stop and return to the empty set, if the degree of the smallest point in the derived subgraph of the community result C is greater than or equal to the threshold k, stop and return to the community result C ;
    b10,将社区结果C的导出子图中度数低于k的节点从社区结果C中删除,如果删除的节点是必要节点集的成员,那么停止并返回空集,否则转到步骤b09。b10. Delete the node whose degree is lower than k in the derived subgraph of community result C from community result C. If the deleted node is a member of the necessary node set, stop and return to the empty set, otherwise go to step b09.
  10. 根据权利要求5所述的社区搜索方法,其特征在于,所述边搜索边过滤的方式包括:The community search method according to claim 5, wherein the manner of searching while filtering includes:
    c01,将必要节点集划入社区结果C;c01, the necessary node set is classified into community result C;
    c02,根据给定的网络图,得到社区结果C对应的导出子图。如果社区结果C的导出子图仅有一个连通分量并且所有节点在导出子图中的节点度数都 大于等于给定阈值k,则停止并返回社区结果C;c02. Obtain the derived subgraph corresponding to the community result C according to the given network graph. If the derived subgraph of the community result C has only one connected component and the degree of all nodes in the derived subgraph is greater than or equal to the given threshold k, then stop and return to the community result C;
    c03,根据导出子图的连通分量,将同一个分量内的节点划入同一组;c03, according to the connected components of the derived subgraph, the nodes in the same component are grouped into the same group;
    c04,将社区结果C中所有节点的邻居节点划入候选节点集Candidate,并从中排除已经存在于社区结果C中的节点和禁止节点;c04, the neighbor nodes of all nodes in the community result C are classified into the candidate node set Candidate, and the nodes and banned nodes that already exist in the community result C are excluded from it;
    c05,若Candidate为空,则将社区结果C置为空集,并转到步骤c08;c05, if Candidate is empty, set the community result C to an empty set, and go to step c08;
    c06,对于候选节点集Candidate中的每个节点,记录其连接不同连通分量的个数a,连接社区结果C中的节点个数b,节点在给定网络图中与非禁止节点连接的个数d;根据a、b和d对节点集中的节点进行多关键字降序排序;c06, for each node in the candidate node set Candidate, record the number a of the connected different connected components, the number b of the nodes in the connection community result C, the number of nodes connected to non-prohibited nodes in the given network graph d; Sort the nodes in the node set in multi-key descending order according to a, b, and d;
    c07,若排在首位的候选节点c的度数小于阈值k,则将节点c从候选节点集Candidate中去除,转到步骤c06,否则,将节点c加入社区结果C,同时将该节点的邻居节点加入到候选节点集Candidate,并把节点c从候选节点集Candidate中去除,转到步骤c02;c07, if the degree of the top candidate node c is less than the threshold k, then remove the node c from the candidate node set Candidate, go to step c06, otherwise, add the node c to the community result C, and at the same time the neighbor node of the node Add to candidate node set Candidate, and remove node c from candidate node set Candidate, go to step c02;
    c08,将整个网络图的节点划入社区结果C并删除禁止节点;c08, classify the nodes of the entire network graph into the community result C and delete the prohibited nodes;
    c09,若社区结果C的导出子图的连通分量个数大于1,则停止并返回空集,如果社区结果C的导出子图中最小点的度数大于等于阈值k,检查此时社区结果C中是否含有禁止节点;c09, if the number of connected components of the derived subgraph of the community result C is greater than 1, stop and return to the empty set. If the degree of the smallest point in the derived subgraph of the community result C is greater than or equal to the threshold k, check the community result C at this time Whether it contains prohibited nodes;
    c10,将社区结果C的导出子图中度数低于k的节点从社区结果C中删除,如果删除的节点是必要节点集的成员,则停止并返回空集,否则转到步骤c09。c10. Delete the node whose degree is lower than k in the derived subgraph of the community result C from the community result C. If the deleted node is a member of the necessary node set, stop and return to the empty set, otherwise go to step c09.
PCT/CN2019/111419 2018-10-16 2019-10-16 Community search method WO2020078370A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811205006.7 2018-10-16
CN201811205006.7A CN109543077B (en) 2018-10-16 2018-10-16 Community search method

Publications (1)

Publication Number Publication Date
WO2020078370A1 true WO2020078370A1 (en) 2020-04-23

Family

ID=65843813

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/111419 WO2020078370A1 (en) 2018-10-16 2019-10-16 Community search method

Country Status (2)

Country Link
CN (1) CN109543077B (en)
WO (1) WO2020078370A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109543077B (en) * 2018-10-16 2020-07-31 清华大学 Community search method
CN113254797B (en) * 2021-04-19 2022-09-20 江汉大学 Searching method, device and processing equipment for social network community
CN116485587B (en) * 2023-04-21 2024-04-09 深圳润高智慧产业有限公司 Community service acquisition method, community service providing method, electronic device and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120317142A1 (en) * 2009-09-11 2012-12-13 University Of Maryland, College Park Systmen and method for data management in large data networks
CN104636978A (en) * 2015-02-12 2015-05-20 西安电子科技大学 Detection method for overlapping community based on multi-label propagation
CN106796611A (en) * 2014-08-29 2017-05-31 邻客音公司 User interface for generating search inquiry
CN109543077A (en) * 2018-10-16 2019-03-29 清华大学 Community search method

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170032044A1 (en) * 2006-11-14 2017-02-02 Paul Vincent Hayes System and Method for Personalized Search While Maintaining Searcher Privacy
KR20130098772A (en) * 2012-02-28 2013-09-05 삼성전자주식회사 Topic-based community index generation apparatus, topic-based community searching apparatus, topic-based community index generation method and topic-based community searching method
CN103425662B (en) * 2012-05-16 2017-08-25 腾讯科技(深圳)有限公司 Information search method and device in a kind of Web Community
US9461876B2 (en) * 2012-08-29 2016-10-04 Loci System and method for fuzzy concept mapping, voting ontology crowd sourcing, and technology prediction
US9652875B2 (en) * 2012-10-29 2017-05-16 Yahoo! Inc. Systems and methods for generating a dense graph
CN105224555B (en) * 2014-06-12 2019-12-10 北京搜狗科技发展有限公司 Searching method, device and system
JP6332243B2 (en) * 2015-11-18 2018-05-30 カシオ計算機株式会社 Information processing apparatus, electronic device, and program
JP6697247B2 (en) * 2015-11-18 2020-05-20 カシオ計算機株式会社 Information processing apparatus, program, and search display method
CN106530039A (en) * 2016-10-26 2017-03-22 深圳市亿家信息科技有限公司 Data processing realization method and system of intelligent community
CN108268603A (en) * 2017-12-22 2018-07-10 中国电子科技集团公司第三十研究所 A kind of community discovery method based on core member's identification
CN108319728A (en) * 2018-03-15 2018-07-24 深圳大学 A kind of frequent community search method and system based on k-star

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120317142A1 (en) * 2009-09-11 2012-12-13 University Of Maryland, College Park Systmen and method for data management in large data networks
CN106796611A (en) * 2014-08-29 2017-05-31 邻客音公司 User interface for generating search inquiry
CN104636978A (en) * 2015-02-12 2015-05-20 西安电子科技大学 Detection method for overlapping community based on multi-label propagation
CN109543077A (en) * 2018-10-16 2019-03-29 清华大学 Community search method

Also Published As

Publication number Publication date
CN109543077B (en) 2020-07-31
CN109543077A (en) 2019-03-29

Similar Documents

Publication Publication Date Title
WO2020078370A1 (en) Community search method
US6556710B2 (en) Image searching techniques
Gupta et al. Top-k interesting subgraph discovery in information networks
JP2006018829A (en) Automated classification generation
Mishra et al. Far efficient K-means clustering algorithm
US10135723B2 (en) System and method for supervised network clustering
Lee et al. Optimizing skyline queries over incomplete data
Huang et al. A link density clustering algorithm based on automatically selecting density peaks for overlapping community detection
Fahim A clustering algorithm based on local density of points
Petkos et al. Graph-based multimodal clustering for social multimedia
CN111061763A (en) Method and device for generating rule execution plan of rule engine
CN112214684B (en) Seed-expanded overlapping community discovery method and device
JP2003256427A (en) Image retrieval device
Zhang et al. Selecting the optimal groups: Efficiently computing skyline k-cliques
CN108280772B (en) Story context generation method based on event association in social network
WO2023206960A1 (en) Product recommendation method and apparatus based on content and collaborative filtering, and computer device
WO2023178767A1 (en) Enterprise risk detection method and apparatus based on enterprise credit investigation big data knowledge graph
JP4188567B2 (en) Document management system, document management method, and recording medium
Chen et al. Service recommendation: Similarity-based representative skyline
Bouanaka et al. An approach for an optimized web service selection based on skyline
US6671402B1 (en) Representing an image with weighted joint histogram
Min et al. Optimal sub-reducts in the dynamic environment
Vu et al. Density-based clustering with side information and active learning
Nguyen Scalable classification method based on rough sets
Pratibha et al. A context-aware recommender engine for smart kitchen

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19873794

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19873794

Country of ref document: EP

Kind code of ref document: A1