WO2021217933A1 - Community division method and apparatus for homogeneous network, and computer device and storage medium - Google Patents

Community division method and apparatus for homogeneous network, and computer device and storage medium Download PDF

Info

Publication number
WO2021217933A1
WO2021217933A1 PCT/CN2020/105766 CN2020105766W WO2021217933A1 WO 2021217933 A1 WO2021217933 A1 WO 2021217933A1 CN 2020105766 W CN2020105766 W CN 2020105766W WO 2021217933 A1 WO2021217933 A1 WO 2021217933A1
Authority
WO
WIPO (PCT)
Prior art keywords
node
community
label
transaction
network
Prior art date
Application number
PCT/CN2020/105766
Other languages
French (fr)
Chinese (zh)
Inventor
曹合心
蔡健
Original Assignee
深圳壹账通智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳壹账通智能科技有限公司 filed Critical 深圳壹账通智能科技有限公司
Publication of WO2021217933A1 publication Critical patent/WO2021217933A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/08Configuration management of networks or network elements
    • H04L41/0893Assignment of logical groups to network elements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/04Trading; Exchange, e.g. stocks, commodities, derivatives or currency exchange

Definitions

  • This application relates to the field of big data technology, and in particular to a method, device, computer equipment, and storage medium for dividing a homogeneous network into a community.
  • Homogeneous network refers to a network formed by the interconnection of entities of the same category, which can actually be a network composed of people or computers and computers. People or computers in a homogeneous network can be regarded as nodes, and the actual homogeneous network may have hundreds of millions of nodes. Some nodes in a homogeneous network may form a community node network due to close relationships. The community node network has important practical value, such as promotion based on core users in the community node network, identification of fraudulent gangs, etc., so it is often necessary to obtain information from A network of community nodes is divided into a homogeneous network. The community division of homogeneous networks involves data mining in the field of big data, which usually requires the processing of massive amounts of data.
  • the purpose of the embodiments of the present application is to propose a method, device, computer equipment, and storage medium for community division in a homogeneous network, so as to solve the problem of low accuracy of community division in a homogeneous network.
  • a method for community division of a homogeneous network includes:
  • the node influence value and the relative influence value iteratively update the community label set of each node; there is at least one community label in the community label set after the iterative update;
  • the homogeneous network is divided into at least one community node network according to the community label after the iterative update of each node.
  • an embodiment of the present application also provides a community division device for a homogeneous network, including:
  • Information receiving module for receiving financial transaction information
  • the network construction module is configured to use the transaction users in the financial transaction information as nodes, and construct a homogeneous network based on the transaction relationship and transaction data between the transaction users;
  • the label initialization module is used to initialize the community label of each node in the homogeneous network to obtain the community label set of each node in the homogeneous network;
  • a node calculation module configured to calculate each node using the transaction data as a weight to obtain the node influence value of each node and the relative influence value of each node on neighboring nodes;
  • the label update module is configured to iteratively update the community label set of each node according to the node influence value and the relative influence value; there is at least one community label in the community label set after the iterative update;
  • the network division module is used to divide the homogenous network into at least the same network according to the community label after the iterative update of each node when the community label in the community label set of each node no longer changes.
  • a network of community nodes is used to divide the homogenous network into at least the same network according to the community label after the iterative update of each node when the community label in the community label set of each node no longer changes.
  • an embodiment of the present application also provides a computer device, including a memory and a processor, and computer-readable instructions stored in the memory and capable of running on the processor, and the processor executes
  • the computer-readable instructions implement the following steps:
  • the node influence value and the relative influence value iteratively update the community label set of each node; there is at least one community label in the community label set after the iterative update;
  • the homogeneous network is divided into at least one community node network according to the community label after the iterative update of each node.
  • embodiments of the present application also provide a computer-readable storage medium, the computer-readable storage medium stores computer-readable instructions, and the computer-readable instructions implement the following steps when executed by a processor:
  • the node influence value and the relative influence value iteratively update the community label set of each node; there is at least one community label in the community label set after the iterative update;
  • the homogeneous network is divided into at least one community node network according to the community label after the iterative update of each node.
  • the embodiments of the present application mainly have the following beneficial effects: after receiving financial transaction information, the transaction user in the financial transaction information is used as a node, a homogeneous network is constructed based on the transaction relationship and transaction data between the transaction users, and initialized The community label of each node in the homogeneous network is obtained, and the community label set of each node in the homogeneous network is obtained; when calculating the node influence value of each node and the relative influence value of each node on neighboring nodes, the transaction data is used as the weight , Taking into account the importance of the node, the accuracy of the calculation result is improved; according to the node influence value and relative influence value, the community label set of each node is iteratively updated, and there is at least one community in the iteratively updated community label set Label, the community label identifies the degree of belonging of the node to a certain community; when the community label in the community label set of each node no longer changes in the iterative update, the community label
  • Figure 1 is an exemplary system architecture diagram to which the present application can be applied;
  • Fig. 2 is a flowchart of an embodiment of a method for community division of a homogeneous network according to the present application
  • Figure 3 is a schematic diagram of a homogeneous network in an embodiment
  • FIG. 4 is a flowchart of a specific implementation of step 204 in FIG. 2;
  • FIG. 5 is a flowchart of another specific implementation manner of step 204 in FIG. 2;
  • FIG. 6 is a flowchart of a specific implementation of step 205 in FIG. 2;
  • FIG. 7 is a flowchart of a specific implementation of step 206 in FIG. 2;
  • FIG. 8 is a schematic structural diagram of an embodiment of a device for dividing a community in a homogeneous network according to the present application.
  • Fig. 9 is a schematic structural diagram of an embodiment of a computer device according to the present application.
  • the system architecture 100 may include terminal devices 101, 102, 103, a network 104, and a server 105.
  • the network 104 is used to provide a medium for communication links between the terminal devices 101, 102, 103 and the server 105.
  • the network 104 may include various connection types, such as wired, wireless communication links, or fiber optic cables, and so on.
  • the user can use the terminal devices 101, 102, and 103 to interact with the server 105 through the network 104 to receive or send messages and so on.
  • Various communication client applications can be installed on the terminal devices 101, 102, 103.
  • the terminal devices 101, 102, 103 may be various electronic devices with display screens and support for web browsing, including but not limited to smart phones, tablet computers, e-book readers, MP3 players (Moving Picture Experts Group Audio Layer III, dynamic Video experts compress standard audio layer 3), MP4 (Moving Picture Experts Group Audio Layer IV, dynamic image experts compress standard audio layer 4) players, laptop portable computers and desktop computers, etc.
  • the server 105 may be a server that provides various services.
  • the community division method of the homogeneous network is generally executed by the server, and accordingly, the community division device of the homogeneous network is generally set in the server.
  • the numbers of terminal devices, networks, and servers in FIG. 1 are merely illustrative. There can be any number of terminal devices, networks, and servers according to implementation needs.
  • FIG. 2 there is shown a flowchart of an embodiment of a method for community division of a homogenous network according to the present application.
  • the community division method of the homogeneous network includes the following steps:
  • Step 201 Receive financial transaction information.
  • financial transaction information may be information that records financial transactions, including transaction users (user names or user accounts, etc.), transaction relationships between user accounts, and transaction amounts.
  • Financial transaction information may also include transaction time and other information; transactions; The relationship can be to record whether a financial transaction has occurred between transaction users, and the direction of transfer when a financial transaction occurs.
  • Financial transaction information is an important data for macroeconomic research and analysis.
  • financial transaction information is stored in the blockchain.
  • the financial transaction information can be stored in a node of a blockchain, and the financial transaction information can be shared between different platforms through the blockchain.
  • Blockchain is a new application mode of computer technology such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm.
  • Blockchain essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information for verification. The validity of the information (anti-counterfeiting) and the generation of the next block.
  • the blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
  • the electronic device (such as the server shown in FIG. 1) on which the community division method of the homogeneous network runs can receive the financial transaction information uploaded by each terminal through various wired connection methods or wireless connection methods.
  • Various applications or pages that can perform financial transactions are running in the terminal.
  • the user manipulates the terminal to perform financial transactions to generate financial transaction information, and the terminal uploads the financial transaction information to the server.
  • the financial transaction information may be stored in a database, and the server obtains the financial transaction information from the database.
  • the server can obtain financial transaction information from the server of a financial transaction platform such as a bank.
  • step 202 a transaction user in the financial transaction information is used as a node, and a homogenous network is constructed based on the transaction relationship and transaction data between the transaction users.
  • the server extracts the transaction relationship and transaction data between the transaction user and the transaction user from the financial transaction information.
  • the server regards the transaction users as nodes, and determines the relationship between the nodes corresponding to the transaction users according to the transaction relationship and transaction data between the transaction users, and builds a homogeneous network. If there is a transaction relationship between the transaction users corresponding to the two nodes, the two nodes can be directly connected, and the two nodes are adjacent to each other.
  • the step of using transaction users in financial transaction information as nodes and constructing a homogeneous network based on transaction relationships and transaction data between transaction users includes: identifying transaction users in financial transaction information and transactions between transaction users Relationship and transaction data; use the identified transaction users as the nodes in the homogenous network to be constructed; connect the nodes with transaction relationships through the node connection line, and mark the transaction data corresponding to the transaction relationship on the node connection line to obtain the same Quality network.
  • the server recognizes user names in financial transaction information and transaction data between users through a semantic recognition model, and obtains user names and transaction data between users in text format (TXT).
  • the semantic recognition model can use existing Model, such as the BERT model.
  • the user name expressed in text format is loaded into the homogeneous network to be constructed as a node.
  • the two nodes are connected by a node connection line, and the corresponding transaction data is marked on the connection line in a text format, thereby completing the construction of a homogeneous network.
  • the identified transaction users are used as nodes, nodes with transaction relationships are connected through node connection lines, and the corresponding transaction data is marked on the node connection lines, ensuring the accuracy of constructing a homogeneous network.
  • the financial transaction information is assigned an information identifier, corresponding to a certain range of transaction users.
  • the server reads the information identifier after receiving the financial transaction information, and queries whether the historical financial transaction data carrying the information identifier has been processed. If processing has been performed, extract the stored homogeneous network corresponding to the information identifier from the database, and update the homogeneous network according to the latest obtained financial transaction information.
  • the homogenous network constructed by the server can be as shown in FIG. 3, and this application uses the specific embodiment in FIG. 3 to explain the technical solution to be disclosed.
  • the node connection line adopts a directed line segment to identify the transaction relationship between the nodes, and the transaction relationship is represented by numbers. For example, if it is recognized that node A transfers 1500 yuan to node B, a directed line segment from node A to node B is constructed as a node connection line, and the transaction data 1500 is marked on the node connection line.
  • Step 203 Initialize the community label of each node in the homogeneous network to obtain the community label set of each node in the homogeneous network.
  • the community label can identify the degree of belonging of the node to a certain community; the community label set can be a set of node community labels.
  • the server assigns different community numbers to the nodes in the homogeneous network, and uniformly sets the attribution degree of the node to the community corresponding to the assigned community number as a default value.
  • the community number corresponds to a community, and a community is a collection of nodes with a certain relationship.
  • a community number and the attribution of the node to the community number constitute a community label of the node, and all the community labels of the node constitute the community label set of the node.
  • the step of initializing the community label of each node in the homogeneous network to obtain the community label set of each node in the homogeneous network includes: configuring a community number to each node in the homogeneous network; initializing each node in the homogeneous network; The degree of attribution of the node to the configured community number; based on the community number of each node and the degree of attribution corresponding to the community number, the community label of each node is generated; the community of each node is constructed according to the community label of each node Set of labels.
  • the server allocates a community number to each node in the homogeneous network, and the community number of each node is different from each other.
  • the server initializes the attribution degree of each node to the configured community number and sets it to 1 uniformly. In the later iterative update, the change interval of the attribution degree is [0,1].
  • the server combines the community number of the node and the attribution degree corresponding to the community number to obtain a community label of the node.
  • the node community label set is the node’s community label set. At the time of initialization, there is only one community label in the community label set of the node.
  • the community label of node A is l(a,1)
  • the community label set of node A is ⁇ l(a,1) ⁇ .
  • the community label sets of nodes B, C, and D are: ⁇ l(b,1) ⁇ , ⁇ l(c,1) ⁇ and ⁇ l(d,1) ⁇ .
  • the community number and corresponding attribution of each node are initialized, and a community label and a community label set are generated, which ensures the normal progress of subsequent community division.
  • Step 204 Calculate each node using the transaction data as a weight to obtain the node influence value of each node and the relative influence value of each node on adjacent nodes.
  • the node influence value may be the evaluation value of the influence of the node in the homogeneous network
  • the relative influence value may be the evaluation value of the influence of the node on an adjacent node in the homogeneous network.
  • nodes exert influence in a homogeneous network through transaction data.
  • the server uses the transaction data as the weight and calculates according to the preset node calculation method to obtain the node influence value of each node in the homogeneous network. Then according to the calculated node influence value, calculate the relative influence value of each node to its neighboring nodes.
  • Step 205 According to the node influence value and the relative influence value, the community label set of each node is iteratively updated; there is at least one community label in the community label set after the iterative update.
  • the server sorts the nodes in ascending order according to the node influence value, and uses the sort order as the update order of the community tag set of each node.
  • the server updates the community label set of the node based on the community label set of the neighboring nodes of the node.
  • the server obtains the relative influence value of the node's neighboring node on the node and the community label of the neighboring node, and calculates the attribution degree of the node to the community number in the community label of the neighboring node.
  • the server screens from the calculated attribution degrees according to the values, combines at least one attribution degree and its corresponding community number into a new community label of the node, and obtains an iteratively updated community label set. There is at least one community label in the community label set after iterative update, and the community numbers in the community labels are different from each other.
  • the server first iteratively updates the community tag set ⁇ of node A to obtain ⁇ '.
  • the community label set ⁇ of node B updates ⁇ based on ⁇ ′.
  • Step 206 When the community label in the community label set of each node no longer changes during the iterative update, divide the homogeneous network into at least one community node network according to the community label after the iterative update of each node.
  • the community node network can be a network composed of nodes that have certain close connections in financial transactions, which is a sub-network of a homogeneous network.
  • the server After the server completes a round of iterative update for each node in the homogeneous network, it compares whether the current community tag set of each node is consistent with the community tag set after the previous iteration; if it is inconsistent, continue to check The community tag set of each node is updated iteratively; if they are consistent, the community tag set of each node has not changed, then the iteration is stopped, and the community number of the community tag in the current community tag set of each node will be the same The nodes of the community number are included in the same community node network.
  • the server divides at least one community node network from the homogeneous network.
  • the node When there is more than one community label in the community label of a node, the node is divided into more than one community node network and becomes an overlapping node.
  • the community tag set has not changed in the iterative update, including: the number of community tags has not changed, the community number has not changed, and the degree of attribution corresponding to the community number has not changed.
  • the transaction users in the financial transaction information are used as nodes, a homogenous network is constructed based on the transaction relationship and transaction data between the transaction users, and the community tags of each node in the homogenous network are initialized to obtain The community label set of each node in the homogeneous network; when calculating the node influence value of each node and the relative influence value of each node on neighboring nodes, the transaction data is used as the weight, taking into account the importance of the node, and improving the calculation result According to the node’s influence value and relative influence value, the community label set of each node is iteratively updated. There is at least one community label in the community label set after iterative update.
  • the community label identifies the node’s influence on a certain community The belonging degree of the group; when the community label in the community label set of each node no longer changes in the iterative update, the homogeneous network is divided into at least one community node network according to the community label after the iterative update of each node. Nodes with more than one community label are divided into more than one community node network, which ensures the accuracy of community division for homogeneous networks.
  • calculating the node influence value may include:
  • step 2041 the transaction data is used as the weight, and the transaction weight value of each node is calculated according to the transaction relationship.
  • the transaction weight value can be a measure of the influence of the transaction data related to the node in the homogeneous network.
  • node A has three node connection lines (AB, AC, AD), and the transaction data on the three node connection lines are added to obtain the node transaction amount k A of node A.
  • Node A has 3 adjacent triangles (ABC, ADC, ABD).
  • the sides of the triangle are node connection lines.
  • the average transaction data on the three node connection lines can be used to obtain the triangle weight of each adjacent triangle, and then the three triangle weights Add up to get the weight e A of node A.
  • the transaction amount node k m e m and weights are added to obtain a weight value transactions node (k m + e m).
  • Step 2042 For each node in the homogeneous network, filter the maximum transaction weight value and the minimum transaction weight value from the transaction weight values corresponding to the neighboring nodes of the node.
  • the server selects the transaction weight value with the largest value and the transaction weight value with the smallest value from the transaction weight values of the node and its neighboring nodes, and uses the transaction weight value with the largest value as the largest transaction weight value. Use the smallest transaction weight value as the smallest transaction weight value
  • Step 2043 Calculate the node influence value of the node in the homogeneous network according to the transaction weight value, the maximum transaction weight value and the minimum transaction weight value of the node.
  • node influence value of node A can be calculated according to the following formula (1):
  • e A is the weight of node A
  • k A is the node transaction amount of node A
  • the node influence value of node A can be obtained as 0.5.
  • the server calculates the node influence value of the remaining nodes in the same way.
  • the transaction data is substituted into the calculation as the weight, which takes into account the differences in financial transactions between the users corresponding to the node, and improves the accuracy of the calculated node influence value.
  • calculating the relative influence value of each node on the neighboring node may include:
  • Step 2044 Calculate the similarity between each pair of adjacent nodes according to the transaction relationship and transaction data.
  • the similarity may be an evaluation value of the similarity of two nodes that are adjacent to each other.
  • node A and node C are adjacent nodes to each other.
  • node A and node C are directly connected by a node connection line, they belong to the first-degree relationship; when node A first passes through node D (or node B), and then arrives At node C, it belongs to the two-degree relationship between the two.
  • the server calculates the similarity Sim between node A and node C according to the following formula (2):
  • Step 2045 For each node in the homogeneous network, determine the maximum similarity of adjacent nodes from the similarity corresponding to the adjacent nodes of the node.
  • node C and node A there is similarity between node C and node A, there is similarity between node C and node B, and there is similarity between node C and node D.
  • node A is used as the main body to calculate the relative influence value of node A on adjacent node C, from the three similarities corresponding to node C, according to the numerical value, the largest similarity is selected as the maximum similarity of node C It is easy to find that the similarity between node C and node D is 0.45, which is the maximum similarity of node C.
  • Step 2046 Based on the node influence value of the node, the similarity between the node and the adjacent node, and the maximum similarity of the adjacent node, the relative influence value of the node on the adjacent node is calculated.
  • the relative influence value NNI A (C) of node A on node C is calculated according to the following formula (3):
  • NI A is the node influence value of node A
  • Sim(A,C) is the similarity between node A and node C
  • h is the adjacent node of node C
  • the similarity between adjacent nodes is calculated on the basis of the node influence value
  • the relative influence value of the node on the adjacent node is calculated on the basis of the similarity and the node influence value.
  • the foregoing step 205 may include:
  • Step 2051 Sort the nodes in ascending order according to the node influence value to obtain the update order sequence.
  • the update order is determined first.
  • the server sorts the nodes in ascending order according to the numerical value of the node's influence value, and uses the sorted ordered sequence as the update sequence sequence.
  • the arrangement order of the nodes determines the update order of the node community tag set.
  • the node with the largest node influence value is finally updated iteratively to ensure that its community label can be spread to the greatest extent and affect other nodes.
  • Step 2052 For each node in the update sequence sequence, obtain the relative influence value of each adjacent node of the node on the node and the first label set of the node in turn; the first label set is composed of the community label set of each adjacent node with the largest attribution Degree of community label composition.
  • the server when the server performs the iterative update according to the update sequence sequence, for the node that currently needs to be updated, it queries its neighboring nodes. From the community label set of each neighboring node, the server separately searches for the community label with the greatest degree of attribution, and combines the searched community labels to obtain the node’s first label set; the first label set is used to The community label set is updated. The server also needs to obtain the relative influence value of each neighboring node of the node on the node.
  • Each community label set has a main label, and the main label is the community label with the largest attribution in the community label set, that is, the first label set is composed of the main labels of each neighboring node.
  • node u ⁇ A,B,C,D ⁇ when updating the community label set of node u, first obtain the main label set L Ng of the neighboring nodes of node u, that is, the first label set:
  • L Ng ⁇ l(c 1 ,B 1 ),l(c 2 ,B 2 ),...l(c v ,B v ) ⁇ ,v ⁇ Ng(u)
  • l(c v , B v ) represents the main label of the adjacent node v
  • B v represents the degree of membership of the node v to the community number c v after the last iteration.
  • Step 2053 Based on the relative influence value, calculate the attribution degree of the node to the community number in each community label in the first label set to obtain the second label set.
  • the attribution degree of the node to the community number in each community label in the first label set is calculated by formula (3):
  • NNI v (u) is the relative influence value of the neighboring node v of node u on node u
  • B(c v ,v) is the attribution degree of node v to the community number c v
  • c is the value in L Ng A community number; in the calculation, all the attribution degrees in the first label set participate in the denominator calculation, and the attribution degrees with the community number c in the first label set participate in the numerator calculation
  • B′(c,u) is the pair of node u The degree of attribution of the community whose community number is c in the first label set.
  • a new label set L′ Ng is obtained (using
  • is equal to
  • the attribution degree of some community tags in L′ Ng may be less than 1/
  • the normalized community label set L′′ Ng is used as the second label set of the node.
  • Step 2054 From the second tag set, filter at least one community tag whose attribution degree meets the preset attribution degree condition.
  • the preset attribution degree condition may be a preset condition for screening attribution degree.
  • the server obtains the preset attribution degree condition, searches for the attribution degree that meets the preset attribution degree condition from the second set of tags according to the preset attribution degree condition, and filters the community tag where the found attribution degree is located.
  • the server can filter at least one community tag from the second tag set according to the preset attribution degree condition.
  • the server searches for the maximum attribution degree, obtains the preset interval adjustment value, uses the maximum attribution degree as the right end of the interval, and uses the difference between the maximum attribution degree and the preset interval adjustment value as the left end of the interval to obtain a closed interval .
  • the server screens the community tags whose attribution is within the closed interval.
  • Step 2055 Use the at least one community label selected as the community label set of the node after the iterative update.
  • the server obtains the community label set according to at least one community label that is screened out, and the newly obtained community label set is the community label set after the node is iteratively updated in this round.
  • the nodes are sorted in ascending order according to the node influence value, and the iterative update is performed according to the sort, which ensures that the node with the largest node influence value can have the greatest influence.
  • each node obtain the relative influence value of each adjacent node of the node on the node and the first label set of the node.
  • the first label set is the community label with the largest degree of belonging in the community label set of each adjacent node ;
  • Based on the relative influence value calculate the attribution degree of the node to the community number in each community label in the first label set to obtain the second label set.
  • the calculation of the second label set fully considers the influence of neighboring nodes; from the second label set At least one community label whose attribution degree meets the preset attribution degree condition is selected, so as to obtain the community label set after the node iteration, which ensures the accuracy of the updated community label set.
  • the foregoing step 206 may include:
  • Step 2061 After completing each round of iterative update for each node, determine the community label with the largest attribution in the community label set of each node as the main label of the community label set of each node.
  • the server After the server completes a round of iterative update of each node in the homogeneous network, it compares the attribution degree of the community tag in the community tag set of each node, finds the maximum attribution degree, and takes the community tag where the maximum attribution degree is located as The main label in the community label set of each node.
  • Step 2062 Compare the current primary label of each node with whether the primary label of each node has changed after the previous iteration.
  • the server obtains the primary label of each node after the previous iteration update, and compares whether the current primary label has changed with the primary label after the previous iteration update.
  • the change includes at least one of a change in the community number of the main tag and a change in the degree of attribution.
  • Step 2063 If there is no change, divide the homogenous network into at least one community node network according to the community number of the current community tag of each node.
  • the server reads the community number in the current community label set of each node, and divides the nodes with the same community number into a community node network. The server divides at least one community node network from the homogeneous network. If the main label of each node after the current iteration update has changed compared with the main label after the previous iteration update, the iterative update is continued.
  • a node can be divided into multiple community node networks. When there is more than one community label in the community label of a node, the node is divided into more than one community node network and becomes an overlapping node.
  • transaction users who have close connections in financial transactions can be divided into the same community node network through the embodiments of this application.
  • a homogeneous network containing fraudulent network groups can be stable after a certain number of iterations and updates, and nodes represented by fraudulent groups are classified into a community node network.
  • the main tags are filtered from the community tag set of each node; if the current main tag of each node has not changed compared with the main tag after the previous iteration, it is homogeneous
  • the network tends to be stable. According to the community number of the current community tag of each node, the community node network can be accurately divided from the homogeneous network.
  • this application provides an embodiment of a device for community division of a homogeneous network, and the device embodiment corresponds to the method embodiment shown in FIG. 2 ,
  • the device can be specifically applied to various electronic equipment.
  • the device 400 for community division of a homogeneous network in this embodiment includes: an information receiving module 401, a network construction module 402, a label initial module 403, a node calculation module 404, a label update module 405, and network division Module 406. in:
  • the information receiving module 401 is used to receive financial transaction information.
  • the network construction module 402 is configured to use transaction users in the financial transaction information as nodes, and construct a homogeneous network based on transaction relationships and transaction data between the transaction users.
  • the label initialization module 403 is used to initialize the community label of each node in the homogeneous network to obtain the community label set of each node in the homogeneous network.
  • the node calculation module 404 is configured to calculate each node using transaction data as a weight to obtain the node influence value of each node and the relative influence value of each node on adjacent nodes.
  • the label update module 405 is configured to iteratively update the community label set of each node according to the node influence value and the relative influence value; there is at least one community label in the community label set after the iterative update.
  • the network division module 406 is used to divide the homogeneous network into at least one community node network according to the community label after the iterative update of each node when the community label in the community label set of each node no longer changes .
  • the transaction data is used as the weight, taking into account the importance of the node, and improving the accuracy of the calculation; a node can be divided into at least one community node network , To ensure the accuracy of homogenous network community division.
  • the above-mentioned network construction module 402 is further used to identify transaction users in the financial transaction information and transaction relationships and transaction data between the transaction users; and use the identified transaction users as the to-be-built Nodes in a homogeneous network; nodes that have a transaction relationship are connected through a node connection line, and the transaction data corresponding to the transaction relationship is marked on the node connection line to obtain a homogenous network.
  • the identified transaction users are used as nodes, nodes with transaction relationships are connected through node connection lines, and the corresponding transaction data is marked on the node connection lines, ensuring the accuracy of constructing a homogeneous network.
  • the above-mentioned label initializing module 403 is further configured to configure community numbers for each node in the homogeneous network; initialize the attribution degree of each node to the configured community number; The community number of the node and the degree of belonging corresponding to the community number generate the community label of each node; and the community label set of each node is constructed according to the community label of each node.
  • the community number and corresponding attribution of each node are initialized, and a community label and a community label set are generated, which ensures the normal progress of subsequent community division.
  • the above-mentioned node calculation module 404 is further configured to use transaction data as a weight, and calculate the transaction weight value of each node according to the transaction relationship; for each node in the homogeneous network, the slave node's Among the transaction weight values corresponding to adjacent nodes, filter the maximum transaction weight value and the minimum transaction weight value; according to the transaction weight value, maximum transaction weight value and minimum transaction weight value of the node, calculate the node influence value of the node in the homogeneous network .
  • the transaction data is substituted into the calculation as the weight, which takes into account the difference in financial transactions between the users corresponding to the node, and improves the accuracy of the calculated node influence value.
  • the above-mentioned node calculation module 404 is further configured to calculate the similarity between each pair of adjacent nodes according to the transaction relationship and transaction data; for each node in the homogeneous network, the slave node The maximum similarity of adjacent nodes is determined from the similarity corresponding to adjacent nodes; based on the node influence value of the node, the similarity between the node and the adjacent node, and the maximum similarity of the adjacent node, the calculation of the node-to-adjacent node The relative influence value of.
  • the relative influence value is calculated based on the node influence value, and the calculation of the node influence value uses transaction data as a weight, which has high accuracy, thereby ensuring the accuracy of the relative influence value.
  • the label update module 405 is further configured to sort the nodes in ascending order according to the node influence value to obtain the update sequence sequence; for each node in the update sequence sequence, obtain the node in sequence The relative influence value of each neighboring node on the node and the first label set of the node; the first label set is composed of the community label with the largest attribution in the community label set of each neighboring node; based on the relative influence value, the node pair is calculated The attribution degree of the community number in each community tag in the first tag set is obtained, and the second tag set is obtained; from the second tag set, at least one community tag whose attribution degree meets the preset attribution degree condition is selected; The group label is used as the community label set of the node after iterative update.
  • the update sequence of the community label takes into account the node influence value, and the update method of the community label takes the relative influence value into consideration, thereby ensuring the accuracy of updating the community label set.
  • the above-mentioned network division module 406 is further configured to collect the community labels of each node with the community label with the largest degree of attribution after completing each round of iterative update of each node, and determine Is the main label of the community label set of each node; compare the current main label of each node with the main label of each node after the previous iteration; if there is no change, according to the community number of the current community label of each node, Divide the homogeneous network into at least one community node network.
  • the homogeneous network tends to be stable when the primary label of each node no longer changes, and the community node network can be accurately divided according to the community number of the current community label.
  • FIG. 9 is a block diagram of the basic structure of the computer device in this embodiment.
  • the computer device 5 includes a memory 51, a processor 52, and a network interface 53 that communicate with each other through a system bus.
  • the figure only shows the computer device 5 with components 51-53, and it is not required to implement all the illustrated components, and more or fewer components may be implemented instead. .
  • the memory 51 includes at least one type of computer-readable storage medium.
  • the computer-readable storage medium may be nonvolatile or volatile.
  • the computer-readable storage medium includes flash memory, hard disk, and multimedia card. , Card-type memory, random access memory, static random access memory, read-only memory, electrically erasable programmable read-only memory, programmable read-only memory, magnetic memory, magnetic disk, optical disk, etc.
  • the memory 51 may be an internal storage unit of the computer device 5, such as a hard disk or a memory, or an external storage device of the computer device 5, such as a plug-in hard disk, a smart memory card, a secure digital card, or a flash memory card. Wait.
  • the memory 51 may also include both the internal storage unit of the computer device 5 and its external storage device.
  • the memory 51 is generally used to store an operating system and various application software installed in the computer device 5, such as computer-readable instructions for a community division method of a homogeneous network.
  • the memory 51 can also be used to temporarily store various types of data that have been output or will be output.
  • the processor 52 may be a central processing unit (Central Processing Unit, CPU), a controller, a microcontroller, a microprocessor, or other data processing chips in some embodiments.
  • the processor 52 is generally used to control the overall operation of the computer device 5.
  • the processor 52 is configured to run computer-readable instructions or process data stored in the memory 51, for example, computer-readable instructions for running the community division method of the homogeneous network.
  • the network interface 53 may include a wireless network interface or a wired network interface, and the network interface 53 is generally used to establish a communication connection between the computer device 5 and other electronic devices.
  • the computer device provided in this embodiment can execute the steps of the above-mentioned method for community division of a homogeneous network.
  • the steps of the method for dividing a community of a homogeneous network may be the steps of the method for dividing a community of a homogeneous network in each of the foregoing embodiments.
  • the transaction data is used as the weight, taking into account the importance of the node, and improving the accuracy of the calculation; a node can be divided into at least one community node network , To ensure the accuracy of homogenous network community division.
  • This application also provides another implementation manner, that is, a computer-readable storage medium that stores computer-readable instructions for community division of a homogenous network.
  • the computer-readable instructions for the community division of the network may be executed by at least one processor, so that the at least one processor executes the steps of the community division method of the homogeneous network as described above.
  • the transaction data is used as the weight, taking into account the importance of the node, and improving the accuracy of the calculation; a node can be divided into at least one community node network , To ensure the accuracy of homogenous network community division.
  • the technical solution of this application essentially or the part that contributes to the existing technology can be embodied in the form of a software product, and the computer software product is stored in a storage medium (such as ROM/RAM, magnetic disk, The optical disc) includes several instructions to make a terminal device (which can be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) execute the methods described in the various embodiments of the present application.
  • a terminal device which can be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.

Abstract

The embodiments of the present application belong to the field of big data, and relate to a community division method and apparatus for a homogeneous network, and a computer device and a storage medium. The method comprises: receiving financial transaction information; taking a transaction user in the financial transaction information as a node, and constructing a homogeneous network according to a transaction relationship and transaction data; initializing a community tag of each node to obtain a community tag set; performing calculation by means of taking the transaction data as a weight, so as to obtain a node influence value of each node and a relative influence value of each node to an adjacent node; iteratively updating the community tag set of each node according to the node influence value and the relative influence value; and when a community tag in the community tag set of each node no longer changes, dividing the homogeneous network into at least one community node network according to the community tag. By means of the present application, the accuracy of community division is improved. In addition, the present application further relates to blockchain technology, and the financial transaction information can be stored in a blockchain.

Description

同质网络的社群划分方法、装置、计算机设备和存储介质Homogeneous network community division method, device, computer equipment and storage medium
本申请要求于2020年04月29日提交中国专利局、申请号为202010356641.6,发明名称为“同质网络的社群划分方法、装置、计算机设备和存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on April 29, 2020, the application number is 202010356641.6, and the invention title is "Methods, Apparatus, Computer Equipment, and Storage Media for Community Division of Homogeneous Networks". The entire content is incorporated into this application by reference.
技术领域Technical field
本申请涉及大数据技术领域,尤其涉及一种同质网络的社群划分方法、装置、计算机设备和存储介质。This application relates to the field of big data technology, and in particular to a method, device, computer equipment, and storage medium for dividing a homogeneous network into a community.
背景技术Background technique
同质网络指由同一类别的实体相互联系形成的网络,实际中可以是人与人或计算机与计算机组成的网络。同质网络中的人或者计算机可以看作节点,实际的同质网络可能会出现上亿个节点。同质网络中的部分节点可能会因关系紧密组成社群节点网络,社群节点网络具有重要的实用价值,比如根据社群节点网络中的核心用户进行推广、识别欺诈团伙等,因此常常需要从同质网络中划分出社群节点网络。同质网络的社群划分涉及大数据领域中的数据挖掘,通常需要对海量数据进行处理。Homogeneous network refers to a network formed by the interconnection of entities of the same category, which can actually be a network composed of people or computers and computers. People or computers in a homogeneous network can be regarded as nodes, and the actual homogeneous network may have hundreds of millions of nodes. Some nodes in a homogeneous network may form a community node network due to close relationships. The community node network has important practical value, such as promotion based on core users in the community node network, identification of fraudulent gangs, etc., so it is often necessary to obtain information from A network of community nodes is divided into a homogeneous network. The community division of homogeneous networks involves data mining in the field of big data, which usually requires the processing of massive amounts of data.
现有的对同质网络的划分技术通常采用结构化的分析手段,由分析人员基于已有经验,在复杂、繁琐的交易表单中进行筛选和比对,耗费大量人工成本。发明人意识到,基于结构化数据的分析方法无法挖掘多个节点之间的相互关联,损失大量连接信息,也没有考虑节点之间重要程度的差异,无法准确识别重要节点。此外,现有的分析方法对节点的划分是不重叠的,即节点单一地属于一个社群节点网络。然而在实际中,例如欺诈团伙,包括中间人、发起人和去向人等多个角色,必然有同时存在于多个社群节点网络的节点,而现有的分析方法无法确定一个节点是否是不同社群节点网络的重叠部分,导致对同质网络社群划分的准确性较低。Existing technologies for dividing homogeneous networks usually use structured analysis methods. Based on existing experience, analysts screen and compare complex and cumbersome transaction forms, which consumes a lot of labor costs. The inventor realizes that the analysis method based on structured data cannot mine the correlation between multiple nodes, loses a lot of connection information, and does not consider the difference in importance between nodes, and cannot accurately identify important nodes. In addition, the existing analysis methods do not overlap the division of nodes, that is, the nodes belong to a single community node network. However, in practice, for example, fraudulent gangs, including multiple roles such as intermediaries, initiators, and destinations, must have nodes that exist in multiple community node networks at the same time, and existing analysis methods cannot determine whether a node is a different community. The overlapping part of the group node network results in a lower accuracy of the homogenous network community division.
发明内容Summary of the invention
本申请实施例的目的在于提出一种同质网络的社群划分方法、装置、计算机设备和存储介质,用以解决对同质网络中社群划分准确性较低的问题。The purpose of the embodiments of the present application is to propose a method, device, computer equipment, and storage medium for community division in a homogeneous network, so as to solve the problem of low accuracy of community division in a homogeneous network.
一种同质网络的社群划分方法,所述方法包括:A method for community division of a homogeneous network, the method includes:
接收金融交易信息;Receive financial transaction information;
以所述金融交易信息中的交易用户作为节点,并基于所述交易用户间的交易关系和交易数据构建同质网络;Using transaction users in the financial transaction information as nodes, and constructing a homogeneous network based on the transaction relationship and transaction data between the transaction users;
初始化所述同质网络中各节点的社群标签,得到所述同质网络中各节点的社群标签集;Initialize the community label of each node in the homogeneous network to obtain the community label set of each node in the homogeneous network;
将所述交易数据作为权重对所述各节点进行计算,得到所述各节点的节点影响值,以及所述各节点对相邻节点的相对影响值;Calculating each node using the transaction data as a weight to obtain the node influence value of each node and the relative influence value of each node on neighboring nodes;
根据所述节点影响值和所述相对影响值,对所述各节点的社群标签集进行迭代更新;迭代更新后的社群标签集中至少存在一个社群标签;According to the node influence value and the relative influence value, iteratively update the community label set of each node; there is at least one community label in the community label set after the iterative update;
当所述各节点的社群标签集中的社群标签在迭代更新中不再变化时,根据所述各节点迭代更新后的社群标签将所述同质网络划分为至少一个社群节点网络。When the community label in the community label set of each node no longer changes during the iterative update, the homogeneous network is divided into at least one community node network according to the community label after the iterative update of each node.
为了解决上述技术问题,本申请实施例还提供一种同质网络的社群划分装置,包括:In order to solve the foregoing technical problems, an embodiment of the present application also provides a community division device for a homogeneous network, including:
信息接收模块,用于接收金融交易信息;Information receiving module for receiving financial transaction information;
网络构建模块,用于以所述金融交易信息中的交易用户作为节点,并基于所述交易用户间的交易关系和交易数据构建同质网络;The network construction module is configured to use the transaction users in the financial transaction information as nodes, and construct a homogeneous network based on the transaction relationship and transaction data between the transaction users;
标签初始模块,用于初始化所述同质网络中各节点的社群标签,得到所述同质网络中各节点的社群标签集;The label initialization module is used to initialize the community label of each node in the homogeneous network to obtain the community label set of each node in the homogeneous network;
节点计算模块,用于将所述交易数据作为权重对所述各节点进行计算,得到所述各节点的节点影响值,以及所述各节点对相邻节点的相对影响值;A node calculation module, configured to calculate each node using the transaction data as a weight to obtain the node influence value of each node and the relative influence value of each node on neighboring nodes;
标签更新模块,用于根据所述节点影响值和所述相对影响值,对所述各节点的社群标签集进行迭代更新;迭代更新后的社群标签集中至少存在一个社群标签;The label update module is configured to iteratively update the community label set of each node according to the node influence value and the relative influence value; there is at least one community label in the community label set after the iterative update;
网络划分模块,用于当所述各节点的社群标签集中的社群标签在迭代更新中不再变化时,根据所述各节点迭代更新后的社群标签将所述同质网络划分为至少一个社群节点网络。The network division module is used to divide the homogenous network into at least the same network according to the community label after the iterative update of each node when the community label in the community label set of each node no longer changes. A network of community nodes.
为了解决上述技术问题,本申请实施例还提供一种计算机设备,包括存储器和处理器,以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时实现如下步骤:In order to solve the above technical problems, an embodiment of the present application also provides a computer device, including a memory and a processor, and computer-readable instructions stored in the memory and capable of running on the processor, and the processor executes The computer-readable instructions implement the following steps:
接收金融交易信息;Receive financial transaction information;
以所述金融交易信息中的交易用户作为节点,并基于所述交易用户间的交易关系和交易数据构建同质网络;Using transaction users in the financial transaction information as nodes, and constructing a homogeneous network based on the transaction relationship and transaction data between the transaction users;
初始化所述同质网络中各节点的社群标签,得到所述同质网络中各节点的社群标签集;Initialize the community label of each node in the homogeneous network to obtain the community label set of each node in the homogeneous network;
将所述交易数据作为权重对所述各节点进行计算,得到所述各节点的节点影响值,以及所述各节点对相邻节点的相对影响值;Calculating each node using the transaction data as a weight to obtain the node influence value of each node and the relative influence value of each node on neighboring nodes;
根据所述节点影响值和所述相对影响值,对所述各节点的社群标签集进行迭代更新;迭代更新后的社群标签集中至少存在一个社群标签;According to the node influence value and the relative influence value, iteratively update the community label set of each node; there is at least one community label in the community label set after the iterative update;
当所述各节点的社群标签集中的社群标签在迭代更新中不再变化时,根据所述各节点迭代更新后的社群标签将所述同质网络划分为至少一个社群节点网络。When the community label in the community label set of each node no longer changes during the iterative update, the homogeneous network is divided into at least one community node network according to the community label after the iterative update of each node.
为了解决上述技术问题,本申请实施例还提供一种计算机可读存储介质,所述计算机可读存储介质存储有计算机可读指令,所述计算机可读指令被处理器执行时实现如下步骤:In order to solve the above technical problems, embodiments of the present application also provide a computer-readable storage medium, the computer-readable storage medium stores computer-readable instructions, and the computer-readable instructions implement the following steps when executed by a processor:
接收金融交易信息;Receive financial transaction information;
以所述金融交易信息中的交易用户作为节点,并基于所述交易用户间的交易关系和交易数据构建同质网络;Using transaction users in the financial transaction information as nodes, and constructing a homogeneous network based on the transaction relationship and transaction data between the transaction users;
初始化所述同质网络中各节点的社群标签,得到所述同质网络中各节点的社群标签集;Initialize the community label of each node in the homogeneous network to obtain the community label set of each node in the homogeneous network;
将所述交易数据作为权重对所述各节点进行计算,得到所述各节点的节点影响值,以及所述各节点对相邻节点的相对影响值;Calculating each node using the transaction data as a weight to obtain the node influence value of each node and the relative influence value of each node on neighboring nodes;
根据所述节点影响值和所述相对影响值,对所述各节点的社群标签集进行迭代更新;迭代更新后的社群标签集中至少存在一个社群标签;According to the node influence value and the relative influence value, iteratively update the community label set of each node; there is at least one community label in the community label set after the iterative update;
当所述各节点的社群标签集中的社群标签在迭代更新中不再变化时,根据所述各节点 迭代更新后的社群标签将所述同质网络划分为至少一个社群节点网络。When the community label in the community label set of each node no longer changes during the iterative update, the homogeneous network is divided into at least one community node network according to the community label after the iterative update of each node.
与现有技术相比,本申请实施例主要有以下有益效果:接收金融交易信息后,以金融交易信息中的交易用户作为节点,基于交易用户间的交易关系和交易数据构建同质网络,初始化同质网络中各节点的社群标签,得到同质网络中各节点的社群标签集;在计算各节点的节点影响值以及各节点对相邻节点的相对影响值时,将交易数据作为权重,考虑到了节点的重要程度,提高了计算结果的准确性;根据节点影响值和相对影响值,对各节点的社群标签集进行迭代更新,迭代更新后的社群标签集中至少存在一个社群标签,社群标签标识了节点对某个社群的归属度;当各节点的社群标签集中的社群标签在迭代更新中不再变化时,根据各节点迭代更新后的社群标签将同质网络划分为至少一个社群节点网络,存在多于一个社群标签的节点则被划分至多于一个的社群节点网络中,保证了对同质网络进行社群划分的准确性。Compared with the prior art, the embodiments of the present application mainly have the following beneficial effects: after receiving financial transaction information, the transaction user in the financial transaction information is used as a node, a homogeneous network is constructed based on the transaction relationship and transaction data between the transaction users, and initialized The community label of each node in the homogeneous network is obtained, and the community label set of each node in the homogeneous network is obtained; when calculating the node influence value of each node and the relative influence value of each node on neighboring nodes, the transaction data is used as the weight , Taking into account the importance of the node, the accuracy of the calculation result is improved; according to the node influence value and relative influence value, the community label set of each node is iteratively updated, and there is at least one community in the iteratively updated community label set Label, the community label identifies the degree of belonging of the node to a certain community; when the community label in the community label set of each node no longer changes in the iterative update, the community label after iterative update according to each node will be the same The qualitative network is divided into at least one community node network, and nodes with more than one community label are divided into more than one community node network, which ensures the accuracy of community division of the homogeneous network.
附图说明Description of the drawings
下面将对本申请实施例描述中所需要使用的附图作一个简单介绍,显而易见地,下面描述中的附图是本申请的一些实施例。The following will briefly introduce the drawings used in the description of the embodiments of the present application. Obviously, the drawings in the following description are some embodiments of the present application.
图1是本申请可以应用于其中的示例性系统架构图;Figure 1 is an exemplary system architecture diagram to which the present application can be applied;
图2根据本申请的同质网络的社群划分方法的一个实施例的流程图;Fig. 2 is a flowchart of an embodiment of a method for community division of a homogeneous network according to the present application;
图3是一个实施例中同质网络的示意图;Figure 3 is a schematic diagram of a homogeneous network in an embodiment;
图4是图2中步骤204的一种具体实施方式的流程图;FIG. 4 is a flowchart of a specific implementation of step 204 in FIG. 2;
图5是图2中步骤204的另一种具体实施方式的流程图;FIG. 5 is a flowchart of another specific implementation manner of step 204 in FIG. 2;
图6是图2中步骤205的一种具体实施方式的流程图;FIG. 6 is a flowchart of a specific implementation of step 205 in FIG. 2;
图7是图2中步骤206的一种具体实施方式的流程图;FIG. 7 is a flowchart of a specific implementation of step 206 in FIG. 2;
图8是根据本申请的同质网络的社群划分装置的一个实施例的结构示意图;FIG. 8 is a schematic structural diagram of an embodiment of a device for dividing a community in a homogeneous network according to the present application;
图9是根据本申请的计算机设备的一个实施例的结构示意图。Fig. 9 is a schematic structural diagram of an embodiment of a computer device according to the present application.
具体实施方式Detailed ways
本文中在申请的说明书中所使用的术语只是为了描述具体的实施例的目的,不是旨在于限制本申请。The terminology used in the specification of the application herein is only for the purpose of describing specific embodiments, and is not intended to limit the application.
下面将结合附图,对本申请实施例中的技术方案进行清楚、完整地描述。The technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the accompanying drawings.
如图1所示,系统架构100可以包括终端设备101、102、103,网络104和服务器105。网络104用以在终端设备101、102、103和服务器105之间提供通信链路的介质。网络104可以包括各种连接类型,例如有线、无线通信链路或者光纤电缆等等。用户可以使用终端设备101、102、103通过网络104与服务器105交互,以接收或发送消息等。终端设备101、102、103上可以安装有各种通讯客户端应用。终端设备101、102、103可以是具有显示屏并且支持网页浏览的各种电子设备,包括但不限于智能手机、平板电脑、电子书阅读器、MP3播放器(Moving Picture Experts Group Audio Layer III,动态影像专家压缩标准音频层面3)、MP4(Moving Picture Experts Group Audio Layer IV,动态影像专家压缩标准音频层面4)播放器、膝上型便携计算机和台式计算机等等。服务器105可以是提 供各种服务的服务器。As shown in FIG. 1, the system architecture 100 may include terminal devices 101, 102, 103, a network 104, and a server 105. The network 104 is used to provide a medium for communication links between the terminal devices 101, 102, 103 and the server 105. The network 104 may include various connection types, such as wired, wireless communication links, or fiber optic cables, and so on. The user can use the terminal devices 101, 102, and 103 to interact with the server 105 through the network 104 to receive or send messages and so on. Various communication client applications can be installed on the terminal devices 101, 102, 103. The terminal devices 101, 102, 103 may be various electronic devices with display screens and support for web browsing, including but not limited to smart phones, tablet computers, e-book readers, MP3 players (Moving Picture Experts Group Audio Layer III, dynamic Video experts compress standard audio layer 3), MP4 (Moving Picture Experts Group Audio Layer IV, dynamic image experts compress standard audio layer 4) players, laptop portable computers and desktop computers, etc. The server 105 may be a server that provides various services.
需要说明的是,本申请实施例所提供的同质网络的社群划分方法一般由服务器执行,相应地,同质网络的社群划分装置一般设置于服务器中。应该理解,图1中的终端设备、网络和服务器的数目仅仅是示意性的。根据实现需要,可以具有任意数目的终端设备、网络和服务器。继续参考图2,示出了根据本申请的同质网络的社群划分方法的一个实施例的流程图。所述的同质网络的社群划分方法,包括以下步骤:It should be noted that the community division method of the homogeneous network provided in the embodiments of the present application is generally executed by the server, and accordingly, the community division device of the homogeneous network is generally set in the server. It should be understood that the numbers of terminal devices, networks, and servers in FIG. 1 are merely illustrative. There can be any number of terminal devices, networks, and servers according to implementation needs. Continuing to refer to FIG. 2, there is shown a flowchart of an embodiment of a method for community division of a homogenous network according to the present application. The community division method of the homogeneous network includes the following steps:
步骤201,接收金融交易信息。Step 201: Receive financial transaction information.
其中,金融交易信息可以是记录金融交易的信息,可以包括交易用户(用户名称或用户账号等)、用户账号之间的交易关系和交易金额,金融交易信息还可以包括交易时间等其他信息;交易关系可以是记录交易用户之间是否发生了金融交易,以及发生金融交易时的转账流向。金融交易信息是宏观经济研究分析的重要数据。Among them, financial transaction information may be information that records financial transactions, including transaction users (user names or user accounts, etc.), transaction relationships between user accounts, and transaction amounts. Financial transaction information may also include transaction time and other information; transactions; The relationship can be to record whether a financial transaction has occurred between transaction users, and the direction of transfer when a financial transaction occurs. Financial transaction information is an important data for macroeconomic research and analysis.
在一个实施例中,金融交易信息存储于区块链中。为保证上述金融交易信息的私密和安全性,金融交易信息可以存储于一区块链的节点中,通过区块链实现金融交易信息在不同平台之间的共享。区块链是分布式数据存储、点对点传输、共识机制、加密算法等计算机技术的新型应用模式。区块链(Blockchain),本质上是一个去中心化的数据库,是一串使用密码学方法相关联产生的数据块,每一个数据块中包含了一批次网络交易的信息,用于验证其信息的有效性(防伪)和生成下一个区块。区块链可以包括区块链底层平台、平台产品服务层以及应用服务层。In one embodiment, financial transaction information is stored in the blockchain. In order to ensure the privacy and security of the above-mentioned financial transaction information, the financial transaction information can be stored in a node of a blockchain, and the financial transaction information can be shared between different platforms through the blockchain. Blockchain is a new application mode of computer technology such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm. Blockchain, essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information for verification. The validity of the information (anti-counterfeiting) and the generation of the next block. The blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
具体地,同质网络的社群划分方法运行于其上的电子设备(例如图1所示的服务器)可以通过各种有线连接方式或者无线连接方式接收各终端上传的金融交易信息。Specifically, the electronic device (such as the server shown in FIG. 1) on which the community division method of the homogeneous network runs can receive the financial transaction information uploaded by each terminal through various wired connection methods or wireless connection methods.
终端中运行着各种可以进行金融交易的应用或页面,用户操纵终端进行金融交易产生金融交易信息,终端将金融交易信息上传至服务器。Various applications or pages that can perform financial transactions are running in the terminal. The user manipulates the terminal to perform financial transactions to generate financial transaction information, and the terminal uploads the financial transaction information to the server.
在一个实施例中,金融交易信息可以存储于数据库中,服务器从数据库中获取金融交易信息。比如,服务器可以从银行等金融交易平台的服务器获取金融交易信息。In one embodiment, the financial transaction information may be stored in a database, and the server obtains the financial transaction information from the database. For example, the server can obtain financial transaction information from the server of a financial transaction platform such as a bank.
步骤202,以金融交易信息中的交易用户作为节点,并基于交易用户间的交易关系和交易数据构建同质网络。In step 202, a transaction user in the financial transaction information is used as a node, and a homogenous network is constructed based on the transaction relationship and transaction data between the transaction users.
具体地,服务器从金融交易信息中提取交易用户和交易用户间的交易关系和交易数据。服务器将交易用户作为节点,根据交易用户之间的交易关系和交易数据,确定交易用户所对应的节点之间的关系,构建同质网络。若两节点所对应的交易用户之间存在交易关系,则两节点可直接相连,且两个节点互为相邻节点。Specifically, the server extracts the transaction relationship and transaction data between the transaction user and the transaction user from the financial transaction information. The server regards the transaction users as nodes, and determines the relationship between the nodes corresponding to the transaction users according to the transaction relationship and transaction data between the transaction users, and builds a homogeneous network. If there is a transaction relationship between the transaction users corresponding to the two nodes, the two nodes can be directly connected, and the two nodes are adjacent to each other.
在一个实施例中,以金融交易信息中的交易用户作为节点,并基于交易用户间的交易关系和交易数据构建同质网络的步骤包括:识别金融交易信息中的交易用户以及交易用户间的交易关系和交易数据;将识别到的交易用户作为待构建的同质网络中的节点;通过节点连接线连接存在交易关系的节点,并将与交易关系对应的交易数据标注于节点连接线,得到同质网络。In one embodiment, the step of using transaction users in financial transaction information as nodes and constructing a homogeneous network based on transaction relationships and transaction data between transaction users includes: identifying transaction users in financial transaction information and transactions between transaction users Relationship and transaction data; use the identified transaction users as the nodes in the homogenous network to be constructed; connect the nodes with transaction relationships through the node connection line, and mark the transaction data corresponding to the transaction relationship on the node connection line to obtain the same Quality network.
具体地,服务器通过语义识别模型识别金融交易信息中的用户名称、用户之间的交易 数据,得到文本格式(TXT)的用户名称、用户之间的交易数据,其中语义识别模型可以采用已有的模型,例如BERT模型。在构建同质网络时,将文本格式表达的用户名称载入待构建的同质网络中作为节点。当节点之间存在交易关系时,以节点连接线连接这两个节点,并将对应的交易数据以文本格式标注于连接线上,从而完成同质网络的构建。Specifically, the server recognizes user names in financial transaction information and transaction data between users through a semantic recognition model, and obtains user names and transaction data between users in text format (TXT). The semantic recognition model can use existing Model, such as the BERT model. When constructing a homogeneous network, the user name expressed in text format is loaded into the homogeneous network to be constructed as a node. When there is a transaction relationship between nodes, the two nodes are connected by a node connection line, and the corresponding transaction data is marked on the connection line in a text format, thereby completing the construction of a homogeneous network.
本实施例中,将识别到的交易用户作为节点,通过节点连接线连接存在交易关系的节点并将对应的交易数据标注于节点连接线上,保证了构建同质网络的准确性。In this embodiment, the identified transaction users are used as nodes, nodes with transaction relationships are connected through node connection lines, and the corresponding transaction data is marked on the node connection lines, ensuring the accuracy of constructing a homogeneous network.
在一个实施例中,金融交易信息被分配有信息标识,对应一定范围的交易用户。服务器接收金融交易信息后读取信息标识,查询是否曾对携带该信息标识的历史金融交易数据进行处理。若曾进行过处理,从数据库中提取存储的与信息标识对应的同质网络,根据最新获取的金融交易信息对同质网络进行更新。In one embodiment, the financial transaction information is assigned an information identifier, corresponding to a certain range of transaction users. The server reads the information identifier after receiving the financial transaction information, and queries whether the historical financial transaction data carrying the information identifier has been processed. If processing has been performed, extract the stored homogeneous network corresponding to the information identifier from the database, and update the homogeneous network according to the latest obtained financial transaction information.
服务器构建的同质网络可以如图3所示,本申请以图3中的具体实施例解释所要公开的技术方案。具体地,参照图3,其中节点A、B、C和D分别是识别到的4个交易用户;节点连接线采用有向线段,标识了节点间的交易关系,交易关系用数字表示。比如,识别到节点A向节点B转账1500元,则搭建从节点A至节点B的有向线段作为节点连接线,并将交易数据1500标注于节点连接线上。The homogenous network constructed by the server can be as shown in FIG. 3, and this application uses the specific embodiment in FIG. 3 to explain the technical solution to be disclosed. Specifically, referring to Figure 3, where nodes A, B, C, and D are the four identified transaction users, respectively; the node connection line adopts a directed line segment to identify the transaction relationship between the nodes, and the transaction relationship is represented by numbers. For example, if it is recognized that node A transfers 1500 yuan to node B, a directed line segment from node A to node B is constructed as a node connection line, and the transaction data 1500 is marked on the node connection line.
步骤203,初始化同质网络中各节点的社群标签,得到同质网络中各节点的社群标签集。Step 203: Initialize the community label of each node in the homogeneous network to obtain the community label set of each node in the homogeneous network.
其中,社群标签可以标识节点对某个社群的归属度;社群标签集可以是一个节点社群标签组成的集合。Among them, the community label can identify the degree of belonging of the node to a certain community; the community label set can be a set of node community labels.
具体地,在构建完同质网络后,需要进行初始化。服务器向同质网络中的节点分配不同的社群号,并将节点对分配到的社群号所对应社群的归属度统一设置为默认值。社群号对应于一个社群,社群是具备某种关系的节点的集合。一个社群号和节点对该社群号的归属度组成节点的一个社群标签,节点全部的社群标签组成节点的社群标签集。Specifically, after the homogenous network is constructed, it needs to be initialized. The server assigns different community numbers to the nodes in the homogeneous network, and uniformly sets the attribution degree of the node to the community corresponding to the assigned community number as a default value. The community number corresponds to a community, and a community is a collection of nodes with a certain relationship. A community number and the attribution of the node to the community number constitute a community label of the node, and all the community labels of the node constitute the community label set of the node.
在一个实施例中,初始化同质网络中各节点的社群标签,得到同质网络中各节点的社群标签集的步骤包括:向同质网络中的各节点分别配置社群号;初始化各节点对配置的社群号的归属度;基于各节点的社群号以及与社群号对应的归属度,生成各节点的社群标签;根据各节点的社群标签分别构建各节点的社群标签集。In one embodiment, the step of initializing the community label of each node in the homogeneous network to obtain the community label set of each node in the homogeneous network includes: configuring a community number to each node in the homogeneous network; initializing each node in the homogeneous network; The degree of attribution of the node to the configured community number; based on the community number of each node and the degree of attribution corresponding to the community number, the community label of each node is generated; the community of each node is constructed according to the community label of each node Set of labels.
具体地,服务器向同质网络中的每个节点均配置一个社群号,各节点的社群号互不相同。服务器初始化各节点对配置的社群号的归属度,统一设置为1。在后期的迭代更新中,归属度的变化区间为[0,1]。服务器组合节点的社群号以及与社群号对应的归属度,得到节点的一个社群标签。节点社群标签的集合是节点的社群标签集。在初始化时,节点的社群标签集中仅有一个社群标签。Specifically, the server allocates a community number to each node in the homogeneous network, and the community number of each node is different from each other. The server initializes the attribution degree of each node to the configured community number and sets it to 1 uniformly. In the later iterative update, the change interval of the attribution degree is [0,1]. The server combines the community number of the node and the attribution degree corresponding to the community number to obtain a community label of the node. The node community label set is the node’s community label set. At the time of initialization, there is only one community label in the community label set of the node.
举例说明,参考图3,在初始化时,服务器向节点A、B、C和D依次分配社群号a、b、c和d,节点对社群号的归属度为B=1。则节点A的社群标签为l(a,1),节点A的社群标签集为{l(a,1)},同理可得节点B、C和D的社群标签集依次为:{l(b,1)}、{l(c,1)}和{l(d,1)}。For example, referring to FIG. 3, during initialization, the server assigns community numbers a, b, c, and d to nodes A, B, C, and D in sequence, and the node's attribution degree to the community number is B=1. Then the community label of node A is l(a,1), and the community label set of node A is {l(a,1)}. Similarly, the community label sets of nodes B, C, and D are: {l(b,1)}, {l(c,1)} and {l(d,1)}.
本实施例中,初始化各节点的社群号以及相应的归属度,并生成社群标签和社群标签集,保证了后续社群划分的正常进行。In this embodiment, the community number and corresponding attribution of each node are initialized, and a community label and a community label set are generated, which ensures the normal progress of subsequent community division.
步骤204,将交易数据作为权重对各节点进行计算,得到各节点的节点影响值,以及各节点对相邻节点的相对影响值。Step 204: Calculate each node using the transaction data as a weight to obtain the node influence value of each node and the relative influence value of each node on adjacent nodes.
其中,节点影响值可以是节点在同质网络中影响力的评估值;相对影响值可以是节点对同质网络中的一个相邻节点影响力的评估值。Among them, the node influence value may be the evaluation value of the influence of the node in the homogeneous network; the relative influence value may be the evaluation value of the influence of the node on an adjacent node in the homogeneous network.
具体地,节点通过交易数据在同质网络中产生影响力。服务器将交易数据作为权重,按照预设的节点计算方式进行计算,得到各节点在同质网络中的节点影响值。再根据计算得到的节点影响值,计算各节点对其相邻节点的相对影响值。Specifically, nodes exert influence in a homogeneous network through transaction data. The server uses the transaction data as the weight and calculates according to the preset node calculation method to obtain the node influence value of each node in the homogeneous network. Then according to the calculated node influence value, calculate the relative influence value of each node to its neighboring nodes.
步骤205,根据节点影响值和相对影响值,对各节点的社群标签集进行迭代更新;迭代更新后的社群标签集中至少存在一个社群标签。Step 205: According to the node influence value and the relative influence value, the community label set of each node is iteratively updated; there is at least one community label in the community label set after the iterative update.
具体地,服务器根据节点影响值对节点进行升序排序,将排序顺序作为各节点社群标签集的更新顺序。服务器是以节点的相邻节点的社群标签集,对节点的社群标签集进行更新。Specifically, the server sorts the nodes in ascending order according to the node influence value, and uses the sort order as the update order of the community tag set of each node. The server updates the community label set of the node based on the community label set of the neighboring nodes of the node.
在进行迭代更新时,节点自身原有的社群标签不对自身产生影响。服务器获取节点的相邻节点对节点的相对影响值和相邻节点的社群标签,计算节点对相邻节点社群标签中社群号的归属度。服务器根据数值从计算得到的归属度中进行筛选,将筛选到的至少一个归属度和与其对应的社群号组合为节点新的社群标签,得到迭代更新后的社群标签集。迭代更新后的社群标签集中,至少存在一个社群标签,且社群标签内的社群号互不相同。During the iterative update, the original community label of the node itself does not affect itself. The server obtains the relative influence value of the node's neighboring node on the node and the community label of the neighboring node, and calculates the attribution degree of the node to the community number in the community label of the neighboring node. The server screens from the calculated attribution degrees according to the values, combines at least one attribution degree and its corresponding community number into a new community label of the node, and obtains an iteratively updated community label set. There is at least one community label in the community label set after iterative update, and the community numbers in the community labels are different from each other.
参照图3,假设节点迭代更新的顺序为:A、B、C、D,服务器先对节点A的社群标签集α进行迭代更新,得到α′。接下来对节点B的社群标签集β进行更新时,节点A的社群标签集以α′为准对β进行更新。Referring to Fig. 3, assuming that the order of the iterative update of nodes is: A, B, C, D, the server first iteratively updates the community tag set α of node A to obtain α'. Next, when the community label set β of node B is updated, the community label set of node A updates β based on α′.
步骤206,当各节点的社群标签集中的社群标签在迭代更新中不再变化时,根据各节点迭代更新后的社群标签将同质网络划分为至少一个社群节点网络。Step 206: When the community label in the community label set of each node no longer changes during the iterative update, divide the homogeneous network into at least one community node network according to the community label after the iterative update of each node.
其中,社群节点网络可以是由金融交易中具有一定紧密联系的节点组成的网络,是同质网络的子网络。Among them, the community node network can be a network composed of nodes that have certain close connections in financial transactions, which is a sub-network of a homogeneous network.
具体地,服务器在对同质网络中的各节点完成一轮迭代更新后,比较当前各节点的社群标签集与上一轮迭代更新后的社群标签集是否一致;若不一致,则继续对各节点的社群标签集进行迭代更新;若一致,各节点的社群标签集未发生任何变化,则停止迭代,读取各节点当前社群标签集中社群标签的社群号,将具备相同社群号的节点划入同一个社群节点网络。服务器从同质网络中划分出至少一个社群节点网络。Specifically, after the server completes a round of iterative update for each node in the homogeneous network, it compares whether the current community tag set of each node is consistent with the community tag set after the previous iteration; if it is inconsistent, continue to check The community tag set of each node is updated iteratively; if they are consistent, the community tag set of each node has not changed, then the iteration is stopped, and the community number of the community tag in the current community tag set of each node will be the same The nodes of the community number are included in the same community node network. The server divides at least one community node network from the homogeneous network.
当节点的社群标签中存在多于一个的社群标签时,节点被划分到多于一个的社群节点网络,成为重叠节点。When there is more than one community label in the community label of a node, the node is divided into more than one community node network and becomes an overlapping node.
社群标签集在迭代更新中未发生变化,包括:社群标签数量未发生变化,社群号未发生变化,与社群号所对应的归属度未发生变化。The community tag set has not changed in the iterative update, including: the number of community tags has not changed, the community number has not changed, and the degree of attribution corresponding to the community number has not changed.
本实施例中,接收金融交易信息后,以金融交易信息中的交易用户作为节点,基于交 易用户间的交易关系和交易数据构建同质网络,初始化同质网络中各节点的社群标签,得到同质网络中各节点的社群标签集;在计算各节点的节点影响值以及各节点对相邻节点的相对影响值时,将交易数据作为权重,考虑到了节点的重要程度,提高了计算结果的准确性;根据节点影响值和相对影响值,对各节点的社群标签集进行迭代更新,迭代更新后的社群标签集中至少存在一个社群标签,社群标签标识了节点对某个社群的归属度;当各节点的社群标签集中的社群标签在迭代更新中不再变化时,根据各节点迭代更新后的社群标签将同质网络划分为至少一个社群节点网络,存在多于一个社群标签的节点则被划分至多于一个的社群节点网络中,保证了对同质网络进行社群划分的准确性。In this embodiment, after receiving the financial transaction information, the transaction users in the financial transaction information are used as nodes, a homogenous network is constructed based on the transaction relationship and transaction data between the transaction users, and the community tags of each node in the homogenous network are initialized to obtain The community label set of each node in the homogeneous network; when calculating the node influence value of each node and the relative influence value of each node on neighboring nodes, the transaction data is used as the weight, taking into account the importance of the node, and improving the calculation result According to the node’s influence value and relative influence value, the community label set of each node is iteratively updated. There is at least one community label in the community label set after iterative update. The community label identifies the node’s influence on a certain community The belonging degree of the group; when the community label in the community label set of each node no longer changes in the iterative update, the homogeneous network is divided into at least one community node network according to the community label after the iterative update of each node. Nodes with more than one community label are divided into more than one community node network, which ensures the accuracy of community division for homogeneous networks.
进一步的,如图4所示,上述步骤204中,计算节点影响值可以包括:Further, as shown in FIG. 4, in the foregoing step 204, calculating the node influence value may include:
步骤2041,将交易数据作为权重,根据交易关系计算各节点的交易权重值。In step 2041, the transaction data is used as the weight, and the transaction weight value of each node is calculated according to the transaction relationship.
其中,交易权重值可以是与节点相关的交易数据在同质网络中影响力的衡量值。Among them, the transaction weight value can be a measure of the influence of the transaction data related to the node in the homogeneous network.
具体地,参照图3,节点A共有三条节点连接线(A-B、A-C、A-D),将三条节点连接线上的交易数据相加得到节点A的节点交易金额k ASpecifically, referring to Fig. 3, node A has three node connection lines (AB, AC, AD), and the transaction data on the three node connection lines are added to obtain the node transaction amount k A of node A.
代入图3数据可得:k A=1000+1500+10000=12500; Substituting the data in Figure 3 can be obtained: k A =1000+1500+10000=12500;
节点A有3个邻接三角形(A-B-C、A-D-C、A-B-D),三角形的边是节点连接线,对三条节点连接线上的交易数据求平均值可得各邻接三角形的三角形权重,再将三个三角形权重相加得到节点A的权重e ANode A has 3 adjacent triangles (ABC, ADC, ABD). The sides of the triangle are node connection lines. The average transaction data on the three node connection lines can be used to obtain the triangle weight of each adjacent triangle, and then the three triangle weights Add up to get the weight e A of node A.
对于节点A的邻接三角形ABC有:For the adjacent triangle ABC of node A:
e ABC=(1500+10000+20000)/3=10500 e ABC =(1500+10000+20000)/3=10500
同理可得:e ADC=13666,e ABD=4166 The same principle can be obtained: e ADC = 13666, e ABD = 4166
e A=10500+13666+4166=28332 e A =10500+13666+4166=28332
同理可得:e B=39666,e C=46666,e D=40332。 In the same way, we can get: e B =39666, e C =46666, and e D =40332.
对于同质网络中的每个节点,将节点交易金额k m和权重e m分别相加,得到节点的交易权重值(k m+e m)。 For each node in the homogeneous network, the transaction amount node k m e m and weights are added to obtain a weight value transactions node (k m + e m).
步骤2042,对于同质网络中的各个节点,从节点的相邻节点所对应的交易权重值中,筛选最大交易权重值和最小交易权重值。Step 2042: For each node in the homogeneous network, filter the maximum transaction weight value and the minimum transaction weight value from the transaction weight values corresponding to the neighboring nodes of the node.
具体地,服务器从节点及其相邻节点的交易权重值中,筛选数值最大的交易权重值和数值最小的交易权重值,将数值最大的交易权重值作为最大交易权重值
Figure PCTCN2020105766-appb-000001
将数值最小的交易权重值作为最小交易权重值
Figure PCTCN2020105766-appb-000002
Specifically, the server selects the transaction weight value with the largest value and the transaction weight value with the smallest value from the transaction weight values of the node and its neighboring nodes, and uses the transaction weight value with the largest value as the largest transaction weight value.
Figure PCTCN2020105766-appb-000001
Use the smallest transaction weight value as the smallest transaction weight value
Figure PCTCN2020105766-appb-000002
对于图3,易得在对节点A进行计算时,节点A及其相邻节点中,最大交易权重值来自节点C,最小交易权重值来自节点A,其中
Figure PCTCN2020105766-appb-000003
Figure PCTCN2020105766-appb-000004
For Figure 3, when YIDA calculates node A, among node A and its neighboring nodes, the maximum transaction weight value comes from node C, and the minimum transaction weight value comes from node A, where
Figure PCTCN2020105766-appb-000003
Figure PCTCN2020105766-appb-000004
步骤2043,根据节点的交易权重值、最大交易权重值和最小交易权重值,计算节点在同质网络中的节点影响值。Step 2043: Calculate the node influence value of the node in the homogeneous network according to the transaction weight value, the maximum transaction weight value and the minimum transaction weight value of the node.
具体地,可按照下面的公式(1)计算节点A的节点影响值:Specifically, the node influence value of node A can be calculated according to the following formula (1):
Figure PCTCN2020105766-appb-000005
Figure PCTCN2020105766-appb-000005
其中,e A是节点A的权重,k A是节点A的节点交易金额,
Figure PCTCN2020105766-appb-000006
是节点A及其相邻节点中的最大交易权重值,
Figure PCTCN2020105766-appb-000007
是节点A及其相邻节点中的最小交易权重值。
Among them, e A is the weight of node A, k A is the node transaction amount of node A,
Figure PCTCN2020105766-appb-000006
Is the maximum transaction weight value in node A and its neighboring nodes,
Figure PCTCN2020105766-appb-000007
Is the minimum transaction weight value in node A and its neighboring nodes.
将具体数据代入可得:
Figure PCTCN2020105766-appb-000008
Substitute specific data to get:
Figure PCTCN2020105766-appb-000008
据此,可得节点A的节点影响值为0.5。服务器按照相同方法,计算其余各节点的节点影响值。According to this, the node influence value of node A can be obtained as 0.5. The server calculates the node influence value of the remaining nodes in the same way.
本实施例中,在进行节点分析、计算节点影响值时,将交易数据作为权重代入计算,考虑到了节点所对应的用户之间在金融交易时的差异,提高了计算的节点影响值的准确性。In this embodiment, when performing node analysis and calculating the node influence value, the transaction data is substituted into the calculation as the weight, which takes into account the differences in financial transactions between the users corresponding to the node, and improves the accuracy of the calculated node influence value. .
进一步的,如图5所示,上述步骤204中,计算各节点对相邻节点的相对影响值可以包括:Further, as shown in FIG. 5, in the foregoing step 204, calculating the relative influence value of each node on the neighboring node may include:
步骤2044,根据交易关系和交易数据,计算每对相邻节点间的相似度。Step 2044: Calculate the similarity between each pair of adjacent nodes according to the transaction relationship and transaction data.
其中,相似度可以是互为相邻节点的两个节点相似程度的评估值。Wherein, the similarity may be an evaluation value of the similarity of two nodes that are adjacent to each other.
具体地,在计算每对相邻节点间的相似度时,需要根据交易关系和交易数据计算两个相邻节点之间的路径权重。路径权重的计算需要考虑相邻节点的一度关系以及二度关系。Specifically, when calculating the similarity between each pair of adjacent nodes, it is necessary to calculate the path weight between the two adjacent nodes according to the transaction relationship and transaction data. The calculation of the path weight needs to consider the first-degree relationship and the second-degree relationship of adjacent nodes.
参考图3,节点A与节点C互为相邻节点,当节点A与节点C直接通过节点连接线相连时属于两者的一度关系;当节点A先通过节点D(或者节点B),再到达节点C时,属于两者的二度关系。Referring to Figure 3, node A and node C are adjacent nodes to each other. When node A and node C are directly connected by a node connection line, they belong to the first-degree relationship; when node A first passes through node D (or node B), and then arrives At node C, it belongs to the two-degree relationship between the two.
对于图3中的节点A与节点C,在一度关系时有A-C,一度路径权重为:对于第一网络节点A与第二网络节点C之间具有一条节点连接线时,即A-C,此时得出一度路径权重∑w AC1=10000; For node A and node C in Figure 3, there is AC when the relationship is first degree, and the path weight of the first degree is: for a node connection line between the first network node A and the second network node C, that is, AC, then we get The weight of the first-degree path ∑w AC1 =10000;
节点A与节点C在二度关系时,有A-D-C、A-B-C,此时得出二度路径权重
Figure PCTCN2020105766-appb-000009
Figure PCTCN2020105766-appb-000010
When node A and node C are in the second-degree relationship, there are ADC and ABC, and the second-degree path weight is obtained at this time
Figure PCTCN2020105766-appb-000009
Figure PCTCN2020105766-appb-000010
则节点A相对于节点C的路径权重s(A,C)=10000+13125=23125。Then the path weight of node A relative to node C is s(A, C)=10000+13125=23125.
以∑ x∈Ng(A)s(A,x)表示节点A的所有相邻节点相对于A的s之和,则有: Let ∑ x∈Ng(A) s(A,x) represent the sum of s of all adjacent nodes of node A relative to A, then:
x∈Ng(A)s(A,x)=s(A,D)+s(A,C)+s(A,B)=15750+23125+13625=52500, x∈Ng(A) s(A,x)=s(A,D)+s(A,C)+s(A,B)=15750+23125+13625=52500,
同理可得:∑ y∈Ng(C)s(C,y)=s(C,D)+s(C,A)+s(C,B)=42125+23125+34750=100000。 In the same way, we can get: ∑ y∈Ng(C) s(C,y)=s(C,D)+s(C,A)+s(C,B)=42125+23125+34750=100,000.
服务器按照下述的公式(2)计算节点A与节点C之间的相似度Sim:The server calculates the similarity Sim between node A and node C according to the following formula (2):
Figure PCTCN2020105766-appb-000011
Figure PCTCN2020105766-appb-000011
将上述数据代入有:Substitute the above data into:
Figure PCTCN2020105766-appb-000012
Figure PCTCN2020105766-appb-000012
0.32即为节点A与节点C的相似度,服务器按照相同方法计算相邻节点间的相似度。节点C有另两个相邻节点B和D,同理可得:Sim(B,C)=0.39,Sim(D,C)=0.45。0.32 is the similarity between node A and node C, and the server calculates the similarity between adjacent nodes in the same way. Node C has two other adjacent nodes B and D. The same principle can be obtained: Sim(B,C)=0.39, Sim(D,C)=0.45.
步骤2045,对于同质网络中的各个节点,从节点的相邻节点所对应的相似度中确定相邻节点的最大相似度。Step 2045: For each node in the homogeneous network, determine the maximum similarity of adjacent nodes from the similarity corresponding to the adjacent nodes of the node.
具体地,节点C与节点A之间有相似度,节点C与节点B之间有相似度,节点C与节点D之间有相似度。在以节点A作为主体,计算节点A对相邻节点C的相对影响值时, 从节点C所对应的三个相似度中,按照数值大小,筛选最大的相似度作为节点C的最大相似度
Figure PCTCN2020105766-appb-000013
易得节点C与节点D之间的相似度0.45是节点C的最大相似度。
Specifically, there is similarity between node C and node A, there is similarity between node C and node B, and there is similarity between node C and node D. When node A is used as the main body to calculate the relative influence value of node A on adjacent node C, from the three similarities corresponding to node C, according to the numerical value, the largest similarity is selected as the maximum similarity of node C
Figure PCTCN2020105766-appb-000013
It is easy to find that the similarity between node C and node D is 0.45, which is the maximum similarity of node C.
步骤2046,基于节点的节点影响值、节点与相邻节点间的相似度和相邻节点的最大相似度,计算节点对相邻节点的相对影响值。Step 2046: Based on the node influence value of the node, the similarity between the node and the adjacent node, and the maximum similarity of the adjacent node, the relative influence value of the node on the adjacent node is calculated.
具体地,参照图3,节点A对节点C的相对影响值NNI A(C)按照下述的公式(3)进行计算: Specifically, referring to Fig. 3, the relative influence value NNI A (C) of node A on node C is calculated according to the following formula (3):
Figure PCTCN2020105766-appb-000014
Figure PCTCN2020105766-appb-000014
其中,NI A是节点A的节点影响值,Sim(A,C)是节点A与节点C的相似度,h是节点C的相邻节点,
Figure PCTCN2020105766-appb-000015
是取节点C与其相邻节点的相似度的最大值,其中节点D与节点C具有最大相似度0.45,代入数据可得:
Among them, NI A is the node influence value of node A, Sim(A,C) is the similarity between node A and node C, h is the adjacent node of node C,
Figure PCTCN2020105766-appb-000015
Is to take the maximum similarity between node C and its neighboring nodes, where node D and node C have the maximum similarity of 0.45. Substituting the data can get:
Figure PCTCN2020105766-appb-000016
Figure PCTCN2020105766-appb-000016
本实施例中,在节点影响值的基础上计算相邻节点间的相似度,并在相似度和节点影响值的基础上计算节点对相邻节点的相对影响值,计算节点影响值时将交易数据作为权重保证了节点影响值的准确性,从而保证了最后计算相对影响值的准确性。In this embodiment, the similarity between adjacent nodes is calculated on the basis of the node influence value, and the relative influence value of the node on the adjacent node is calculated on the basis of the similarity and the node influence value. When calculating the node influence value, the transaction The data is used as the weight to ensure the accuracy of the node's influence value, thereby ensuring the accuracy of the final calculation of the relative influence value.
进一步的,如图6所示,上述步骤205可以包括:Further, as shown in FIG. 6, the foregoing step 205 may include:
步骤2051,根据节点影响值,将各节点进行升序排序,得到更新顺序序列。Step 2051: Sort the nodes in ascending order according to the node influence value to obtain the update order sequence.
具体地,在对节点的社群标签集进行迭代更新之前,先确定更新顺序。服务器根据节点影响值的数值大小,对各节点进行升序排序,以排序后的有序序列作为更新顺序序列。更新顺序序列中,节点的排列顺序决定了节点社群标签集的更新顺序。节点影响值最大的节点最后进行迭代更新,保证其社群标签可以最大程度地得到传播,影响其他节点。Specifically, before iteratively updating the community tag set of the node, the update order is determined first. The server sorts the nodes in ascending order according to the numerical value of the node's influence value, and uses the sorted ordered sequence as the update sequence sequence. In the update sequence sequence, the arrangement order of the nodes determines the update order of the node community tag set. The node with the largest node influence value is finally updated iteratively to ensure that its community label can be spread to the greatest extent and affect other nodes.
步骤2052,对于更新顺序序列的每个节点,依次获取节点的各相邻节点对节点的相对影响值以及节点的第一标签集;第一标签集由各相邻节点社群标签集中具有最大归属度的社群标签组成。Step 2052: For each node in the update sequence sequence, obtain the relative influence value of each adjacent node of the node on the node and the first label set of the node in turn; the first label set is composed of the community label set of each adjacent node with the largest attribution Degree of community label composition.
具体地,服务器在根据更新顺序序列进行迭代更新时,对于当前需要更新的节点,查询其相邻节点。服务器从各相邻节点的社群标签集中,分别查找具有最大归属度的社群标签,将查找到的各社群标签进行组合,得到节点的第一标签集;第一标签集用于对节点的社群标签集进行更新。服务器还需要获取节点的各相邻节点对节点的相对影响值。Specifically, when the server performs the iterative update according to the update sequence sequence, for the node that currently needs to be updated, it queries its neighboring nodes. From the community label set of each neighboring node, the server separately searches for the community label with the greatest degree of attribution, and combines the searched community labels to obtain the node’s first label set; the first label set is used to The community label set is updated. The server also needs to obtain the relative influence value of each neighboring node of the node on the node.
每个社群标签集均存在主标签,主标签是社群标签集中具有最大归属度的社群标签,即第一标签集由各相邻节点的主标签组成。Each community label set has a main label, and the main label is the community label with the largest attribution in the community label set, that is, the first label set is composed of the main labels of each neighboring node.
对于节点u∈{A,B,C,D},在更新节点u的社群标签集时,先获取节点u的相邻节点的主标签集合L Ng,即第一标签集: For node u∈{A,B,C,D}, when updating the community label set of node u, first obtain the main label set L Ng of the neighboring nodes of node u, that is, the first label set:
L Ng={l(c 1,B 1),l(c 2,B 2),...l(c v,B v)},v∈Ng(u) L Ng = {l(c 1 ,B 1 ),l(c 2 ,B 2 ),...l(c v ,B v )},v∈Ng(u)
其中l(c v,B v)表示相邻节点v的主标签,B v表示上轮迭代后,节点v对社群号c v这个社群的从属度。 Among them, l(c v , B v ) represents the main label of the adjacent node v, and B v represents the degree of membership of the node v to the community number c v after the last iteration.
步骤2053,基于相对影响值,计算节点对第一标签集中各社群标签内社群号的归属度, 得到第二标签集。Step 2053: Based on the relative influence value, calculate the attribution degree of the node to the community number in each community label in the first label set to obtain the second label set.
具体地,通过公式(3)计算节点对第一标签集中各社群标签内社群号的归属度:Specifically, the attribution degree of the node to the community number in each community label in the first label set is calculated by formula (3):
Figure PCTCN2020105766-appb-000017
Figure PCTCN2020105766-appb-000017
其中,NNI v(u)是节点u的相邻节点v,对节点u的相对影响值;B(c v,v)是节点v对社群号c v的归属度;c是L Ng中的一个社群号;在进行计算时,第一标签集中所有的归属度均参与分母运算,第一标签集中社群号为c的归属度参与分子运算;B′(c,u)是节点u对第一标签集中社群号为c的社群的归属度。 Among them, NNI v (u) is the relative influence value of the neighboring node v of node u on node u; B(c v ,v) is the attribution degree of node v to the community number c v ; c is the value in L Ng A community number; in the calculation, all the attribution degrees in the first label set participate in the denominator calculation, and the attribution degrees with the community number c in the first label set participate in the numerator calculation; B′(c,u) is the pair of node u The degree of attribution of the community whose community number is c in the first label set.
计算完节点对第一标签集内各社群号的归属度B′(c,u)后,将归属度B′(c,u)及与其对应的社群号进行组合,得到多个社群标签,根据多个标签得到节点的第二标签集。After calculating the attribution degree B′(c,u) of the node to each community number in the first label set, combine the attribution degree B′(c,u) and its corresponding community number to obtain multiple community tags , Obtain the second label set of the node according to multiple labels.
在一个实施例中,在根据公式(3)对一个节点完成计算后,得到新标签集L′ Ng(用|L′|表示L Ng′中新的社群标签个数,|L′|等于第一标签集中社群标签的数量,且等于节点u的相邻节点的数量): In one embodiment, after calculating a node according to formula (3), a new label set L′ Ng is obtained (using |L′| to represent the number of new community labels in L Ng ′, and |L′| is equal to The number of community tags in the first tag set is equal to the number of neighboring nodes of node u):
Figure PCTCN2020105766-appb-000018
Figure PCTCN2020105766-appb-000018
L′ Ng中部分社群标签的归属度可能小于1/|L′|,此类归属度过低,可将其所在的社群标签从|L′|中删除,得到社群标签集L″ Ng。此时L″ Ng中所有归属度相加不等于1,进行归一化: The attribution degree of some community tags in L′ Ng may be less than 1/|L′|. Such attribution is too low. You can delete the community tag where it belongs from |L′| to get the community tag set L″ Ng . At this time , the sum of all attribution degrees in L″Ng is not equal to 1, and normalization is performed:
Figure PCTCN2020105766-appb-000019
Figure PCTCN2020105766-appb-000019
将归一化之后的社群标签集L″ Ng作为节点的第二标签集。 The normalized community label set L″ Ng is used as the second label set of the node.
步骤2054,从第二标签集中,筛选归属度符合预设归属度条件的至少一个社群标签。Step 2054: From the second tag set, filter at least one community tag whose attribution degree meets the preset attribution degree condition.
其中,预设归属度条件可以是预设的筛选归属度的条件。Wherein, the preset attribution degree condition may be a preset condition for screening attribution degree.
具体地,服务器获取预设归属度条件,根据预设归属度条件,从第二标签集中查找符合预设归属度条件的归属度,筛选查找到的归属度所在的社群标签。服务器可根据预设归属度条件,从第二标签集中筛选到至少一个社群标签。Specifically, the server obtains the preset attribution degree condition, searches for the attribution degree that meets the preset attribution degree condition from the second set of tags according to the preset attribution degree condition, and filters the community tag where the found attribution degree is located. The server can filter at least one community tag from the second tag set according to the preset attribution degree condition.
在一个实施例中,服务器查找最大归属度,获取预设区间调节值,将最大归属度作为区间右端点,以最大归属度和预设区间调节值的差值作为区间左端点,得到一个闭区间。服务器筛选归属度在闭区间内的社群标签。In one embodiment, the server searches for the maximum attribution degree, obtains the preset interval adjustment value, uses the maximum attribution degree as the right end of the interval, and uses the difference between the maximum attribution degree and the preset interval adjustment value as the left end of the interval to obtain a closed interval . The server screens the community tags whose attribution is within the closed interval.
举例说明,第二标签集中有3个社群标签,归属度依次为:0.6、0.3和0.1。则0.6为最大归属度,预设区间调节值为0.3,则区间左端点为0.6-0.3=0.3,得到闭区间[0.3,0.6]。则筛选出两个社群标签,分别是归属度0.6和0.3所在的社群标签。For example, there are 3 community tags in the second tag set, and their attribution degrees are 0.6, 0.3, and 0.1 in order. Then 0.6 is the maximum attribution degree, the preset interval adjustment value is 0.3, then the left end of the interval is 0.6-0.3=0.3, and the closed interval [0.3, 0.6] is obtained. Then, two community tags are screened out, which are the community tags where the attribution degree is 0.6 and 0.3.
步骤2055,将筛选到的至少一个社群标签作为迭代更新后节点的社群标签集。Step 2055: Use the at least one community label selected as the community label set of the node after the iterative update.
具体地,服务器根据筛选到的至少一个社群标签得到社群标签集,新得到的社群标签集是节点在本轮迭代更新后的社群标签集。Specifically, the server obtains the community label set according to at least one community label that is screened out, and the newly obtained community label set is the community label set after the node is iteratively updated in this round.
本实施例中,根据节点影响值将各节点进行升序排序,按照排序进行迭代更新,保证了节点影响值最大的节点可以产生最大的影响。对每个节点更新时,获取节点的各相邻节点对节点的相对影响值以及节点的第一标签集,第一标签集内是各相邻节点社群标签集中 具有最大归属度的社群标签;基于相对影响值,计算节点对第一标签集中各社群标签内社群号的归属度,得到第二标签集,第二标签集的计算充分考虑了相邻节点的影响;从第二标签集中筛选归属度符合预设归属度条件的至少一个社群标签,从而得到节点迭代更新后的社群标签集,保证了更新社群标签集的准确性。In this embodiment, the nodes are sorted in ascending order according to the node influence value, and the iterative update is performed according to the sort, which ensures that the node with the largest node influence value can have the greatest influence. When updating each node, obtain the relative influence value of each adjacent node of the node on the node and the first label set of the node. The first label set is the community label with the largest degree of belonging in the community label set of each adjacent node ; Based on the relative influence value, calculate the attribution degree of the node to the community number in each community label in the first label set to obtain the second label set. The calculation of the second label set fully considers the influence of neighboring nodes; from the second label set At least one community label whose attribution degree meets the preset attribution degree condition is selected, so as to obtain the community label set after the node iteration, which ensures the accuracy of the updated community label set.
进一步的,如图7所示,上述步骤206可以包括:Further, as shown in FIG. 7, the foregoing step 206 may include:
步骤2061,在对各节点完成每轮迭代更新后,将各节点的社群标签集中具备最大归属度的社群标签,确定为各节点社群标签集的主标签。Step 2061: After completing each round of iterative update for each node, determine the community label with the largest attribution in the community label set of each node as the main label of the community label set of each node.
具体地,服务器对同质网络中的各节点完成一轮迭代更新后,比较各节点社群标签集内社群标签的归属度,查找最大归属度,将最大归属度所在的社群标签,作为各节点社群标签集中的主标签。Specifically, after the server completes a round of iterative update of each node in the homogeneous network, it compares the attribution degree of the community tag in the community tag set of each node, finds the maximum attribution degree, and takes the community tag where the maximum attribution degree is located as The main label in the community label set of each node.
步骤2062,比较各节点当前的主标签,与前次迭代后各节点的主标签是否发生变化。Step 2062: Compare the current primary label of each node with whether the primary label of each node has changed after the previous iteration.
具体地,服务器获取前次迭代更新后各节点的主标签,比较当前的主标签与前次迭代更新后的主标签是否发生变化。其中,变化包括主标签社群号变化和归属度变化中的至少一种。Specifically, the server obtains the primary label of each node after the previous iteration update, and compares whether the current primary label has changed with the primary label after the previous iteration update. Wherein, the change includes at least one of a change in the community number of the main tag and a change in the degree of attribution.
步骤2063,若未发生变化,根据各节点当前社群标签的社群号,将同质网络划分为至少一个社群节点网络。Step 2063: If there is no change, divide the homogenous network into at least one community node network according to the community number of the current community tag of each node.
具体地,若各节点在本轮迭代更新后的主标签相较前次迭代更新后的主标签未发生变化,对同质网络的运算已收敛,同质网络已稳定。服务器读取各节点当前社群标签集中的社群号,将具有相同社群号的节点划分入一个社群节点网络。服务器从同质网络中划分出至少一个社群节点网络。若各节点在本轮迭代更新后的主标签相较前次迭代更新后的主标签发生了变化,则继续进行迭代更新。Specifically, if the primary label of each node after the current iteration update has not changed compared with the primary label after the previous iteration update, the operation on the homogeneous network has converged, and the homogeneous network has stabilized. The server reads the community number in the current community label set of each node, and divides the nodes with the same community number into a community node network. The server divides at least one community node network from the homogeneous network. If the main label of each node after the current iteration update has changed compared with the main label after the previous iteration update, the iterative update is continued.
节点可被划入多个社群节点网络,当节点的社群标签中存在多于一个的社群标签时,节点被划分到多于一个的社群节点网络,成为重叠节点。A node can be divided into multiple community node networks. When there is more than one community label in the community label of a node, the node is divided into more than one community node network and becomes an overlapping node.
在实际应用中,在金融交易中具有紧密联系的交易用户可通过本申请的实施例被划分到同一个社群节点网络中。例如包含欺诈网络团体的同质网络,可在一定次数的迭代更新后稳定,欺诈团伙所代表的节点被划入一个社群节点网络中。In practical applications, transaction users who have close connections in financial transactions can be divided into the same community node network through the embodiments of this application. For example, a homogeneous network containing fraudulent network groups can be stable after a certain number of iterations and updates, and nodes represented by fraudulent groups are classified into a community node network.
本实施例中,在完成每轮迭代更新后,从各节点的社群标签集中筛选主标签;如果各节点当前的主标签,相较与前次迭代后的主标签未发生变化,则同质网络趋于稳定,根据各节点当前社群标签的社群号,可以准确地从同质网络划分出社群节点网络。In this embodiment, after each round of iterative update is completed, the main tags are filtered from the community tag set of each node; if the current main tag of each node has not changed compared with the main tag after the previous iteration, it is homogeneous The network tends to be stable. According to the community number of the current community tag of each node, the community node network can be accurately divided from the homogeneous network.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机可读指令来指令相关的硬件来完成,该计算机可读指令可存储于一计算机可读取存储介质中,该计算机可读指令在执行时,可包括如上述各方法的实施例的流程。Those of ordinary skill in the art can understand that all or part of the processes in the above-mentioned embodiment methods can be implemented by instructing relevant hardware through computer-readable instructions, which can be stored in a computer-readable storage medium. When the computer-readable instructions are executed, they may include the processes of the above-mentioned method embodiments.
虽然附图的流程图中的各个步骤按照箭头的指示依次显示,但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,其可以以其他的顺序执行。Although the steps in the flowchart of the drawings are shown in sequence as indicated by the arrows, these steps are not necessarily executed in sequence in the order indicated by the arrows. Unless explicitly stated in this article, the execution of these steps is not strictly limited in order, and they can be executed in other orders.
进一步参考图8,作为对上述图2所示方法的实现,本申请提供了一种同质网络的社 群划分装置的一个实施例,该装置实施例与图2所示的方法实施例相对应,该装置具体可以应用于各种电子设备中。With further reference to FIG. 8, as an implementation of the method shown in FIG. 2, this application provides an embodiment of a device for community division of a homogeneous network, and the device embodiment corresponds to the method embodiment shown in FIG. 2 , The device can be specifically applied to various electronic equipment.
如图4所示,本实施例所述的同质网络的社群划分装置400包括:信息接收模块401、网络构建模块402、标签初始模块403、节点计算模块404、标签更新模块405以及网络划分模块406。其中:As shown in FIG. 4, the device 400 for community division of a homogeneous network in this embodiment includes: an information receiving module 401, a network construction module 402, a label initial module 403, a node calculation module 404, a label update module 405, and network division Module 406. in:
信息接收模块401,用于接收金融交易信息。The information receiving module 401 is used to receive financial transaction information.
网络构建模块402,用于以金融交易信息中的交易用户作为节点,并基于交易用户间的交易关系和交易数据构建同质网络。The network construction module 402 is configured to use transaction users in the financial transaction information as nodes, and construct a homogeneous network based on transaction relationships and transaction data between the transaction users.
标签初始模块403,用于初始化同质网络中各节点的社群标签,得到同质网络中各节点的社群标签集。The label initialization module 403 is used to initialize the community label of each node in the homogeneous network to obtain the community label set of each node in the homogeneous network.
节点计算模块404,用于将交易数据作为权重对各节点进行计算,得到各节点的节点影响值,以及各节点对相邻节点的相对影响值。The node calculation module 404 is configured to calculate each node using transaction data as a weight to obtain the node influence value of each node and the relative influence value of each node on adjacent nodes.
标签更新模块405,用于根据节点影响值和相对影响值,对各节点的社群标签集进行迭代更新;迭代更新后的社群标签集中至少存在一个社群标签。The label update module 405 is configured to iteratively update the community label set of each node according to the node influence value and the relative influence value; there is at least one community label in the community label set after the iterative update.
网络划分模块406,用于当各节点的社群标签集中的社群标签在迭代更新中不再变化时,根据各节点迭代更新后的社群标签将同质网络划分为至少一个社群节点网络。The network division module 406 is used to divide the homogeneous network into at least one community node network according to the community label after the iterative update of each node when the community label in the community label set of each node no longer changes .
本实施例中,计算各节点的节点影响值以及相对影响值时,将交易数据作为权重,考虑到了节点的重要程度,提高了计算的准确性;一个节点可以被划分到至少一个社群节点网络中,保证了同质网络社群划分的准确性。In this embodiment, when calculating the node influence value and relative influence value of each node, the transaction data is used as the weight, taking into account the importance of the node, and improving the accuracy of the calculation; a node can be divided into at least one community node network , To ensure the accuracy of homogenous network community division.
在本实施例的一些可选的实现方式中,上述网络构建模块402进一步用于识别金融交易信息中的交易用户以及交易用户间的交易关系和交易数据;将识别到的交易用户作为待构建的同质网络中的节点;通过节点连接线连接存在交易关系的节点,并将与交易关系对应的交易数据标注于节点连接线,得到同质网络。In some optional implementations of this embodiment, the above-mentioned network construction module 402 is further used to identify transaction users in the financial transaction information and transaction relationships and transaction data between the transaction users; and use the identified transaction users as the to-be-built Nodes in a homogeneous network; nodes that have a transaction relationship are connected through a node connection line, and the transaction data corresponding to the transaction relationship is marked on the node connection line to obtain a homogenous network.
本实施例中,将识别到的交易用户作为节点,通过节点连接线连接存在交易关系的节点并将对应的交易数据标注于节点连接线上,保证了构建同质网络的准确性。In this embodiment, the identified transaction users are used as nodes, nodes with transaction relationships are connected through node connection lines, and the corresponding transaction data is marked on the node connection lines, ensuring the accuracy of constructing a homogeneous network.
在本实施例的一些可选的实现方式中,上述标签初始模块403进一步用于向同质网络中的各节点分别配置社群号;初始化各节点对配置的社群号的归属度;基于各节点的社群号以及与社群号对应的归属度,生成各节点的社群标签;根据各节点的社群标签分别构建各节点的社群标签集。In some optional implementations of this embodiment, the above-mentioned label initializing module 403 is further configured to configure community numbers for each node in the homogeneous network; initialize the attribution degree of each node to the configured community number; The community number of the node and the degree of belonging corresponding to the community number generate the community label of each node; and the community label set of each node is constructed according to the community label of each node.
本实施例中,初始化各节点的社群号以及相应的归属度,并生成社群标签和社群标签集,保证了后续社群划分的正常进行。In this embodiment, the community number and corresponding attribution of each node are initialized, and a community label and a community label set are generated, which ensures the normal progress of subsequent community division.
在本实施例的一些可选的实现方式中,上述节点计算模块404进一步用于将交易数据作为权重,根据交易关系计算各节点的交易权重值;对于同质网络中的各个节点,从节点的相邻节点所对应的交易权重值中,筛选最大交易权重值和最小交易权重值;根据节点的交易权重值、最大交易权重值和最小交易权重值,计算节点在同质网络中的节点影响值。In some optional implementations of this embodiment, the above-mentioned node calculation module 404 is further configured to use transaction data as a weight, and calculate the transaction weight value of each node according to the transaction relationship; for each node in the homogeneous network, the slave node's Among the transaction weight values corresponding to adjacent nodes, filter the maximum transaction weight value and the minimum transaction weight value; according to the transaction weight value, maximum transaction weight value and minimum transaction weight value of the node, calculate the node influence value of the node in the homogeneous network .
本实施例中,在进行节点分析、计算节点影响值时,将交易数据作为权重代入计算, 考虑到了节点所对应的用户之间在金融交易时的差异,提高了计算的节点影响值的准确性。In this embodiment, when performing node analysis and calculating the node influence value, the transaction data is substituted into the calculation as the weight, which takes into account the difference in financial transactions between the users corresponding to the node, and improves the accuracy of the calculated node influence value. .
在本实施例的一些可选的实现方式中,上述节点计算模块404进一步用于根据交易关系和交易数据,计算每对相邻节点间的相似度;对于同质网络中的各个节点,从节点的相邻节点所对应的相似度中确定相邻节点的最大相似度;基于节点的节点影响值、节点与相邻节点间的相似度和相邻节点的最大相似度,计算节点对相邻节点的相对影响值。In some optional implementations of this embodiment, the above-mentioned node calculation module 404 is further configured to calculate the similarity between each pair of adjacent nodes according to the transaction relationship and transaction data; for each node in the homogeneous network, the slave node The maximum similarity of adjacent nodes is determined from the similarity corresponding to adjacent nodes; based on the node influence value of the node, the similarity between the node and the adjacent node, and the maximum similarity of the adjacent node, the calculation of the node-to-adjacent node The relative influence value of.
本实施例中,基于节点影响值计算相对影响值,节点影响值的计算将交易数据作为权重,具有较高的准确性,从而保证了相对影响值的准确性。In this embodiment, the relative influence value is calculated based on the node influence value, and the calculation of the node influence value uses transaction data as a weight, which has high accuracy, thereby ensuring the accuracy of the relative influence value.
在本实施例的一些可选的实现方式中,上述标签更新模块405进一步用于根据节点影响值,将各节点进行升序排序,得到更新顺序序列;对于更新顺序序列的每个节点,依次获取节点的各相邻节点对节点的相对影响值以及节点的第一标签集;第一标签集由各相邻节点社群标签集中具有最大归属度的社群标签组成;基于相对影响值,计算节点对第一标签集中各社群标签内社群号的归属度,得到第二标签集;从第二标签集中,筛选归属度符合预设归属度条件的至少一个社群标签;将筛选到的至少一个社群标签作为迭代更新后节点的社群标签集。In some optional implementations of this embodiment, the label update module 405 is further configured to sort the nodes in ascending order according to the node influence value to obtain the update sequence sequence; for each node in the update sequence sequence, obtain the node in sequence The relative influence value of each neighboring node on the node and the first label set of the node; the first label set is composed of the community label with the largest attribution in the community label set of each neighboring node; based on the relative influence value, the node pair is calculated The attribution degree of the community number in each community tag in the first tag set is obtained, and the second tag set is obtained; from the second tag set, at least one community tag whose attribution degree meets the preset attribution degree condition is selected; The group label is used as the community label set of the node after iterative update.
本实施例中,社群标签的更新顺序考虑到了节点影响值,社群标签的更新方式考虑到了相对影响值,从而保证了更新社群标签集的准确性。In this embodiment, the update sequence of the community label takes into account the node influence value, and the update method of the community label takes the relative influence value into consideration, thereby ensuring the accuracy of updating the community label set.
在本实施例的一些可选的实现方式中,上述网络划分模块406进一步用于在对各节点完成每轮迭代更新后,将各节点的社群标签集中具备最大归属度的社群标签,确定为各节点社群标签集的主标签;比较各节点当前的主标签,与前次迭代后各节点的主标签是否发生变化;若未发生变化,根据各节点当前社群标签的社群号,将同质网络划分为至少一个社群节点网络。In some optional implementations of this embodiment, the above-mentioned network division module 406 is further configured to collect the community labels of each node with the community label with the largest degree of attribution after completing each round of iterative update of each node, and determine Is the main label of the community label set of each node; compare the current main label of each node with the main label of each node after the previous iteration; if there is no change, according to the community number of the current community label of each node, Divide the homogeneous network into at least one community node network.
本实施例中,各节点的主标签不再变化时同质网络趋于稳定,可以依据当前社群标签的社群号准确地划分出社群节点网络。In this embodiment, the homogeneous network tends to be stable when the primary label of each node no longer changes, and the community node network can be accurately divided according to the community number of the current community label.
为解决上述技术问题,本申请实施例还提供计算机设备。具体请参阅图9,图9为本实施例计算机设备基本结构框图。In order to solve the above technical problems, the embodiments of the present application also provide computer equipment. Please refer to FIG. 9 for details. FIG. 9 is a block diagram of the basic structure of the computer device in this embodiment.
所述计算机设备5包括通过系统总线相互通信连接存储器51、处理器52、网络接口53。图中仅示出了具有组件51-53的计算机设备5,并不要求实施所有示出的组件,可以替代的实施更多或者更少的组件。。The computer device 5 includes a memory 51, a processor 52, and a network interface 53 that communicate with each other through a system bus. The figure only shows the computer device 5 with components 51-53, and it is not required to implement all the illustrated components, and more or fewer components may be implemented instead. .
所述存储器51至少包括一种类型的计算机可读存储介质,所述计算机可读存储介质可以是非易失性,也可以是易失性,所述计算机可读存储介质包括闪存、硬盘、多媒体卡、卡型存储器、随机访问存储器、静态随机访问存储器、只读存储器、电可擦除可编程只读存储器、可编程只读存储器、磁性存储器、磁盘、光盘等。所述存储器51可以是所述计算机设备5的内部存储单元,例如硬盘或内存,也可以是所述计算机设备5的外部存储设备,例如插接式硬盘,智能存储卡,安全数字卡,闪存卡等。当然,所述存储器51还可以既包括所述计算机设备5的内部存储单元也包括其外部存储设备。本实施例中,所述存储器51通常用于存储安装于所述计算机设备5的操作系统和各类应用软件,例如同质网络的社群 划分方法的计算机可读指令等。此外,所述存储器51还可以用于暂时地存储已经输出或者将要输出的各类数据。The memory 51 includes at least one type of computer-readable storage medium. The computer-readable storage medium may be nonvolatile or volatile. The computer-readable storage medium includes flash memory, hard disk, and multimedia card. , Card-type memory, random access memory, static random access memory, read-only memory, electrically erasable programmable read-only memory, programmable read-only memory, magnetic memory, magnetic disk, optical disk, etc. The memory 51 may be an internal storage unit of the computer device 5, such as a hard disk or a memory, or an external storage device of the computer device 5, such as a plug-in hard disk, a smart memory card, a secure digital card, or a flash memory card. Wait. Of course, the memory 51 may also include both the internal storage unit of the computer device 5 and its external storage device. In this embodiment, the memory 51 is generally used to store an operating system and various application software installed in the computer device 5, such as computer-readable instructions for a community division method of a homogeneous network. In addition, the memory 51 can also be used to temporarily store various types of data that have been output or will be output.
所述处理器52在一些实施例中可以是中央处理器(Central Processing Unit,CPU)、控制器、微控制器、微处理器、或其他数据处理芯片。该处理器52通常用于控制所述计算机设备5的总体操作。本实施例中,所述处理器52用于运行所述存储器51中存储的计算机可读指令或者处理数据,例如运行所述同质网络的社群划分方法的计算机可读指令。The processor 52 may be a central processing unit (Central Processing Unit, CPU), a controller, a microcontroller, a microprocessor, or other data processing chips in some embodiments. The processor 52 is generally used to control the overall operation of the computer device 5. In this embodiment, the processor 52 is configured to run computer-readable instructions or process data stored in the memory 51, for example, computer-readable instructions for running the community division method of the homogeneous network.
所述网络接口53可包括无线网络接口或有线网络接口,该网络接口53通常用于在所述计算机设备5与其他电子设备之间建立通信连接。The network interface 53 may include a wireless network interface or a wired network interface, and the network interface 53 is generally used to establish a communication connection between the computer device 5 and other electronic devices.
本实施例中提供的计算机设备可以执行上述同质网络的社群划分方法的步骤。此处同质网络的社群划分方法的步骤可以是上述各个实施例的同质网络的社群划分方法中的步骤。The computer device provided in this embodiment can execute the steps of the above-mentioned method for community division of a homogeneous network. Here, the steps of the method for dividing a community of a homogeneous network may be the steps of the method for dividing a community of a homogeneous network in each of the foregoing embodiments.
本实施例中,计算各节点的节点影响值以及相对影响值时,将交易数据作为权重,考虑到了节点的重要程度,提高了计算的准确性;一个节点可以被划分到至少一个社群节点网络中,保证了同质网络社群划分的准确性。In this embodiment, when calculating the node influence value and relative influence value of each node, the transaction data is used as the weight, taking into account the importance of the node, and improving the accuracy of the calculation; a node can be divided into at least one community node network , To ensure the accuracy of homogenous network community division.
本申请还提供了另一种实施方式,即提供一种计算机可读存储介质,所述计算机可读存储介质存储有用于同质网络的社群划分的计算机可读指令,所述用于同质网络的社群划分的计算机可读指令可被至少一个处理器执行,以使所述至少一个处理器执行如上述的同质网络的社群划分方法的步骤。This application also provides another implementation manner, that is, a computer-readable storage medium that stores computer-readable instructions for community division of a homogenous network. The computer-readable instructions for the community division of the network may be executed by at least one processor, so that the at least one processor executes the steps of the community division method of the homogeneous network as described above.
本实施例中,计算各节点的节点影响值以及相对影响值时,将交易数据作为权重,考虑到了节点的重要程度,提高了计算的准确性;一个节点可以被划分到至少一个社群节点网络中,保证了同质网络社群划分的准确性。In this embodiment, when calculating the node influence value and relative influence value of each node, the transaction data is used as the weight, taking into account the importance of the node, and improving the accuracy of the calculation; a node can be divided into at least one community node network , To ensure the accuracy of homogenous network community division.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本申请各个实施例所述的方法。Through the description of the above implementation manners, those skilled in the art can clearly understand that the above-mentioned embodiment method can be implemented by means of software plus the necessary general hardware platform, of course, it can also be implemented by hardware, but in many cases the former is better.的实施方式。 Based on this understanding, the technical solution of this application essentially or the part that contributes to the existing technology can be embodied in the form of a software product, and the computer software product is stored in a storage medium (such as ROM/RAM, magnetic disk, The optical disc) includes several instructions to make a terminal device (which can be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) execute the methods described in the various embodiments of the present application.
显然,以上所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例,附图中给出了本申请的较佳实施例,但并不限制本申请的专利范围。本申请可以以许多不同的形式来实现凡是利用本申请说明书及附图内容所做的等效结构,直接或间接运用在其他相关的技术领域,均同理在本申请专利保护范围之内。Obviously, the above-described embodiments are only a part of the embodiments of the present application, rather than all of the embodiments. The drawings show preferred embodiments of the present application, but do not limit the patent scope of the present application. This application can be implemented in many different forms. Any equivalent structure made using the content of the description and drawings of this application, directly or indirectly used in other related technical fields, is similarly within the scope of patent protection of this application.

Claims (20)

  1. 一种同质网络的社群划分方法,包括下述步骤:A method of community division in a homogeneous network includes the following steps:
    接收金融交易信息;Receive financial transaction information;
    以所述金融交易信息中的交易用户作为节点,并基于所述交易用户间的交易关系和交易数据构建同质网络;Using transaction users in the financial transaction information as nodes, and constructing a homogeneous network based on the transaction relationship and transaction data between the transaction users;
    初始化所述同质网络中各节点的社群标签,得到所述同质网络中各节点的社群标签集;Initialize the community label of each node in the homogeneous network to obtain the community label set of each node in the homogeneous network;
    将所述交易数据作为权重对所述各节点进行计算,得到所述各节点的节点影响值,以及所述各节点对相邻节点的相对影响值;Calculating each node using the transaction data as a weight to obtain the node influence value of each node and the relative influence value of each node on neighboring nodes;
    根据所述节点影响值和所述相对影响值,对所述各节点的社群标签集进行迭代更新;迭代更新后的社群标签集中至少存在一个社群标签;According to the node influence value and the relative influence value, iteratively update the community label set of each node; there is at least one community label in the community label set after the iterative update;
    当所述各节点的社群标签集中的社群标签在迭代更新中不再变化时,根据所述各节点迭代更新后的社群标签将所述同质网络划分为至少一个社群节点网络。When the community label in the community label set of each node no longer changes during the iterative update, the homogeneous network is divided into at least one community node network according to the community label after the iterative update of each node.
  2. 根据权利要求1所述的同质网络的社群划分方法,其中,所述以所述金融交易信息中的交易用户作为节点,并基于所述交易用户间的交易关系和交易数据构建同质网络的步骤包括:The method for community division of a homogenous network according to claim 1, wherein the transaction users in the financial transaction information are used as nodes, and the homogenous network is constructed based on the transaction relationship and transaction data between the transaction users The steps include:
    识别所述金融交易信息中的交易用户以及所述交易用户间的交易关系和交易数据;Identifying the transaction user in the financial transaction information and the transaction relationship and transaction data between the transaction users;
    将识别到的交易用户作为待构建的同质网络中的节点;Use the identified transaction users as nodes in the homogeneous network to be constructed;
    通过节点连接线连接存在交易关系的节点,并将与所述交易关系对应的交易数据标注于所述节点连接线,得到同质网络。The nodes with transaction relationships are connected through node connecting lines, and transaction data corresponding to the transaction relationships are marked on the node connecting lines to obtain a homogeneous network.
  3. 根据权利要求1所述的同质网络的社群划分方法,其中,所述初始化所述同质网络中各节点的社群标签,得到所述同质网络中各节点的社群标签集的步骤包括:The method for dividing a community in a homogeneous network according to claim 1, wherein the step of initializing the community label of each node in the homogenous network to obtain the community label set of each node in the homogeneous network include:
    向所述同质网络中的各节点分别配置社群号;Respectively configuring a community number to each node in the homogeneous network;
    初始化所述各节点对配置的社群号的归属度;Initialize the attribution degree of each node to the configured community number;
    基于所述各节点的社群号以及与社群号对应的归属度,生成所述各节点的社群标签;Generating the community label of each node based on the community number of each node and the attribution degree corresponding to the community number;
    根据所述各节点的社群标签分别构建所述各节点的社群标签集。The community label set of each node is respectively constructed according to the community label of each node.
  4. 根据权利要求1所述的同质网络的社群划分方法,其中,所述将所述交易数据作为权重对所述各节点进行计算,得到所述各节点的节点影响值的步骤包括:The method for community division of a homogeneous network according to claim 1, wherein the step of calculating each node using the transaction data as a weight to obtain the node influence value of each node comprises:
    将所述交易数据作为权重,根据所述交易关系计算所述各节点的交易权重值;Using the transaction data as a weight, and calculating the transaction weight value of each node according to the transaction relationship;
    对于所述同质网络中的各个节点,从节点的相邻节点所对应的交易权重值中,筛选最大交易权重值和最小交易权重值;For each node in the homogeneous network, filter the maximum transaction weight value and the minimum transaction weight value from the transaction weight values corresponding to the neighboring nodes of the node;
    根据所述节点的交易权重值、所述最大交易权重值和所述最小交易权重值,计算所述节点在所述同质网络中的节点影响值。According to the transaction weight value of the node, the maximum transaction weight value and the minimum transaction weight value, the node influence value of the node in the homogeneous network is calculated.
  5. 根据权利要求4所述的同质网络的社群划分方法,其中,所述各节点对相邻节点的相对影响值的计算步骤包括:The method for dividing a community in a homogeneous network according to claim 4, wherein the step of calculating the relative influence value of each node on neighboring nodes comprises:
    根据交易关系和交易数据,计算每对相邻节点间的相似度;According to the transaction relationship and transaction data, calculate the similarity between each pair of adjacent nodes;
    对于所述同质网络中的各个节点,从节点的相邻节点所对应的相似度中确定相邻节点 的最大相似度;For each node in the homogeneous network, determine the maximum similarity of adjacent nodes from the similarity corresponding to the adjacent nodes of the node;
    基于所述节点的节点影响值、所述节点与所述相邻节点间的相似度和所述相邻节点的最大相似度,计算所述节点对所述相邻节点的相对影响值。Based on the node influence value of the node, the similarity between the node and the adjacent node, and the maximum similarity of the adjacent node, the relative influence value of the node on the adjacent node is calculated.
  6. 根据权利要求3所述的同质网络的社群划分方法,其中,所述根据所述节点影响值和所述相对影响值,对所述各节点的社群标签集进行迭代更新的步骤包括:The method for community division of a homogeneous network according to claim 3, wherein the step of iteratively updating the community label set of each node according to the node influence value and the relative influence value comprises:
    根据所述节点影响值,将所述各节点进行升序排序,得到更新顺序序列;According to the node influence value, sort the nodes in ascending order to obtain an update sequence sequence;
    对于所述更新顺序序列的每个节点,依次获取节点的各相邻节点对所述节点的相对影响值以及所述节点的第一标签集;所述第一标签集由所述各相邻节点社群标签集中具有最大归属度的社群标签组成;For each node in the update sequence sequence, the relative influence value of each adjacent node of the node on the node and the first label set of the node are sequentially obtained; the first label set is determined by the adjacent nodes The community label set is composed of the community label with the greatest degree of attribution;
    基于所述相对影响值,计算所述节点对所述第一标签集中各社群标签内社群号的归属度,得到第二标签集;Based on the relative influence value, calculating the attribution degree of the node to the community number in each community label in the first label set to obtain a second label set;
    从所述第二标签集中,筛选归属度符合预设归属度条件的至少一个社群标签;From the second tag set, filter at least one community tag whose attribution degree meets the preset attribution degree condition;
    将筛选到的至少一个社群标签作为迭代更新后所述节点的社群标签集。At least one community label selected is used as the community label set of the node after iterative update.
  7. 根据权利要求3所述的同质网络的社群划分方法,其中,所述当所述各节点的社群标签集中的社群标签在迭代更新中不再变化时,根据所述各节点迭代更新后的社群标签将所述同质网络划分为至少一个社群节点网络的步骤包括:The method for community division of a homogeneous network according to claim 3, wherein, when the community label in the community label set of each node no longer changes in the iterative update, iteratively update according to each node The step of dividing the homogeneous network into at least one community node network by the following community label includes:
    在对所述各节点完成每轮迭代更新后,将所述各节点的社群标签集中具备最大归属度的社群标签,确定为所述各节点社群标签集的主标签;After completing each round of iterative update for each node, determine the community label with the largest attribution in the community label set of each node as the main label of the community label set of each node;
    比较所述各节点当前的主标签,与前次迭代后所述各节点的主标签是否发生变化;Compare the current primary label of each node with the primary label of each node after the previous iteration;
    若未发生变化,根据所述各节点当前社群标签的社群号,将所述同质网络划分为至少一个社群节点网络。If there is no change, divide the homogenous network into at least one community node network according to the community number of the current community label of each node.
  8. 一种同质网络的社群划分装置,包括:A device for dividing a homogeneous network into a community, including:
    信息接收模块,用于接收金融交易信息;Information receiving module for receiving financial transaction information;
    网络构建模块,用于以所述金融交易信息中的交易用户作为节点,并基于所述交易用户间的交易关系和交易数据构建同质网络;The network construction module is configured to use the transaction users in the financial transaction information as nodes, and construct a homogeneous network based on the transaction relationship and transaction data between the transaction users;
    标签初始模块,用于初始化所述同质网络中各节点的社群标签,得到所述同质网络中各节点的社群标签集;The label initialization module is used to initialize the community label of each node in the homogeneous network to obtain the community label set of each node in the homogeneous network;
    节点计算模块,用于将所述交易数据作为权重对所述各节点进行计算,得到所述各节点的节点影响值,以及所述各节点对相邻节点的相对影响值;A node calculation module, configured to calculate each node using the transaction data as a weight to obtain the node influence value of each node and the relative influence value of each node on neighboring nodes;
    标签更新模块,用于根据所述节点影响值和所述相对影响值,对所述各节点的社群标签集进行迭代更新;迭代更新后的社群标签集中至少存在一个社群标签;The label update module is configured to iteratively update the community label set of each node according to the node influence value and the relative influence value; there is at least one community label in the community label set after the iterative update;
    网络划分模块,用于当所述各节点的社群标签集中的社群标签在迭代更新中不再变化时,根据所述各节点迭代更新后的社群标签将所述同质网络划分为至少一个社群节点网络。The network division module is used to divide the homogenous network into at least the same network according to the community label after the iterative update of each node when the community label in the community label set of each node no longer changes. A network of community nodes.
  9. 一种计算机设备,包括存储器和处理器,所述存储器中存储有计算机可读指令,所述处理器执行所述计算机可读指令时实现如下步骤:A computer device includes a memory and a processor. The memory stores computer readable instructions. When the processor executes the computer readable instructions, the following steps are implemented:
    接收金融交易信息;Receive financial transaction information;
    以所述金融交易信息中的交易用户作为节点,并基于所述交易用户间的交易关系和交易数据构建同质网络;Using transaction users in the financial transaction information as nodes, and constructing a homogeneous network based on the transaction relationship and transaction data between the transaction users;
    初始化所述同质网络中各节点的社群标签,得到所述同质网络中各节点的社群标签集;Initialize the community label of each node in the homogeneous network to obtain the community label set of each node in the homogeneous network;
    将所述交易数据作为权重对所述各节点进行计算,得到所述各节点的节点影响值,以及所述各节点对相邻节点的相对影响值;Calculating each node using the transaction data as a weight to obtain the node influence value of each node and the relative influence value of each node on neighboring nodes;
    根据所述节点影响值和所述相对影响值,对所述各节点的社群标签集进行迭代更新;迭代更新后的社群标签集中至少存在一个社群标签;According to the node influence value and the relative influence value, iteratively update the community label set of each node; there is at least one community label in the community label set after the iterative update;
    当所述各节点的社群标签集中的社群标签在迭代更新中不再变化时,根据所述各节点迭代更新后的社群标签将所述同质网络划分为至少一个社群节点网络。When the community label in the community label set of each node no longer changes during the iterative update, the homogeneous network is divided into at least one community node network according to the community label after the iterative update of each node.
  10. 根据权利要求9所述的计算机设备,其中,所述初始化所述同质网络中各节点的社群标签,得到所述同质网络中各节点的社群标签集的步骤包括:The computer device according to claim 9, wherein the step of initializing the community label of each node in the homogenous network to obtain the community label set of each node in the homogenous network comprises:
    向所述同质网络中的各节点分别配置社群号;Respectively configuring a community number to each node in the homogeneous network;
    初始化所述各节点对配置的社群号的归属度;Initialize the attribution degree of each node to the configured community number;
    基于所述各节点的社群号以及与社群号对应的归属度,生成所述各节点的社群标签;Generating the community label of each node based on the community number of each node and the attribution degree corresponding to the community number;
    根据所述各节点的社群标签分别构建所述各节点的社群标签集。The community label set of each node is respectively constructed according to the community label of each node.
  11. 根据权利要求9所述的计算机设备,其中,所述将所述交易数据作为权重对所述各节点进行计算,得到所述各节点的节点影响值的步骤包括:The computer device according to claim 9, wherein the step of calculating each node using the transaction data as a weight to obtain the node influence value of each node comprises:
    将所述交易数据作为权重,根据所述交易关系计算所述各节点的交易权重值;Using the transaction data as a weight, and calculating the transaction weight value of each node according to the transaction relationship;
    对于所述同质网络中的各个节点,从节点的相邻节点所对应的交易权重值中,筛选最大交易权重值和最小交易权重值;For each node in the homogeneous network, filter the maximum transaction weight value and the minimum transaction weight value from the transaction weight values corresponding to the neighboring nodes of the node;
    根据所述节点的交易权重值、所述最大交易权重值和所述最小交易权重值,计算所述节点在所述同质网络中的节点影响值。According to the transaction weight value of the node, the maximum transaction weight value and the minimum transaction weight value, the node influence value of the node in the homogeneous network is calculated.
  12. 根据权利要求11所述的计算机设备,其中,所述各节点对相邻节点的相对影响值的计算步骤包括:The computer device according to claim 11, wherein the step of calculating the relative influence value of each node on neighboring nodes comprises:
    根据交易关系和交易数据,计算每对相邻节点间的相似度;According to the transaction relationship and transaction data, calculate the similarity between each pair of adjacent nodes;
    对于所述同质网络中的各个节点,从节点的相邻节点所对应的相似度中确定相邻节点的最大相似度;For each node in the homogeneous network, determine the maximum similarity of adjacent nodes from the similarity corresponding to the adjacent nodes of the node;
    基于所述节点的节点影响值、所述节点与所述相邻节点间的相似度和所述相邻节点的最大相似度,计算所述节点对所述相邻节点的相对影响值。Based on the node influence value of the node, the similarity between the node and the adjacent node, and the maximum similarity of the adjacent node, the relative influence value of the node on the adjacent node is calculated.
  13. 根据权利要求10所述的计算机设备,其中,所述根据所述节点影响值和所述相对影响值,对所述各节点的社群标签集进行迭代更新的步骤包括:10. The computer device according to claim 10, wherein the step of iteratively updating the community tag set of each node according to the node influence value and the relative influence value comprises:
    根据所述节点影响值,将所述各节点进行升序排序,得到更新顺序序列;According to the node influence value, sort the nodes in ascending order to obtain an update sequence sequence;
    对于所述更新顺序序列的每个节点,依次获取节点的各相邻节点对所述节点的相对影响值以及所述节点的第一标签集;所述第一标签集由所述各相邻节点社群标签集中具有最大归属度的社群标签组成;For each node in the update sequence sequence, the relative influence value of each adjacent node of the node on the node and the first label set of the node are sequentially obtained; the first label set is determined by the adjacent nodes The community label set is composed of the community label with the greatest degree of attribution;
    基于所述相对影响值,计算所述节点对所述第一标签集中各社群标签内社群号的归属 度,得到第二标签集;Based on the relative influence value, calculate the attribution degree of the node to the community number in each community label in the first label set to obtain a second label set;
    从所述第二标签集中,筛选归属度符合预设归属度条件的至少一个社群标签;From the second tag set, filter at least one community tag whose attribution degree meets the preset attribution degree condition;
    将筛选到的至少一个社群标签作为迭代更新后所述节点的社群标签集。At least one community label selected is used as the community label set of the node after iterative update.
  14. 根据权利要求10所述的计算机设备,其中,所述当所述各节点的社群标签集中的社群标签在迭代更新中不再变化时,根据所述各节点迭代更新后的社群标签将所述同质网络划分为至少一个社群节点网络的步骤包括:The computer device according to claim 10, wherein when the community label in the community label set of each node no longer changes in the iterative update, the community label after the iterative update of each node is changed The step of dividing the homogeneous network into at least one community node network includes:
    在对所述各节点完成每轮迭代更新后,将所述各节点的社群标签集中具备最大归属度的社群标签,确定为所述各节点社群标签集的主标签;After completing each round of iterative update for each node, determine the community label with the largest attribution in the community label set of each node as the main label of the community label set of each node;
    比较所述各节点当前的主标签,与前次迭代后所述各节点的主标签是否发生变化;Compare the current primary label of each node with the primary label of each node after the previous iteration;
    若未发生变化,根据所述各节点当前社群标签的社群号,将所述同质网络划分为至少一个社群节点网络。If there is no change, divide the homogenous network into at least one community node network according to the community number of the current community label of each node.
  15. 一种计算机可读存储介质,,所述计算机可读存储介质上存储有计算机可读指令,所述计算机可读指令被处理器执行时实现如下步骤:A computer-readable storage medium having computer-readable instructions stored thereon, and when the computer-readable instructions are executed by a processor, the following steps are implemented:
    接收金融交易信息;Receive financial transaction information;
    以所述金融交易信息中的交易用户作为节点,并基于所述交易用户间的交易关系和交易数据构建同质网络;Using transaction users in the financial transaction information as nodes, and constructing a homogeneous network based on the transaction relationship and transaction data between the transaction users;
    初始化所述同质网络中各节点的社群标签,得到所述同质网络中各节点的社群标签集;Initialize the community label of each node in the homogeneous network to obtain the community label set of each node in the homogeneous network;
    将所述交易数据作为权重对所述各节点进行计算,得到所述各节点的节点影响值,以及所述各节点对相邻节点的相对影响值;Calculating each node using the transaction data as a weight to obtain the node influence value of each node and the relative influence value of each node on neighboring nodes;
    根据所述节点影响值和所述相对影响值,对所述各节点的社群标签集进行迭代更新;迭代更新后的社群标签集中至少存在一个社群标签;According to the node influence value and the relative influence value, iteratively update the community label set of each node; there is at least one community label in the community label set after the iterative update;
    当所述各节点的社群标签集中的社群标签在迭代更新中不再变化时,根据所述各节点迭代更新后的社群标签将所述同质网络划分为至少一个社群节点网络。When the community label in the community label set of each node no longer changes during the iterative update, the homogeneous network is divided into at least one community node network according to the community label after the iterative update of each node.
  16. 根据权利要求15所述的计算机可读存储介质,其中,所述初始化所述同质网络中各节点的社群标签,得到所述同质网络中各节点的社群标签集的步骤包括:15. The computer-readable storage medium according to claim 15, wherein the step of initializing the community label of each node in the homogenous network to obtain the community label set of each node in the homogenous network comprises:
    向所述同质网络中的各节点分别配置社群号;Respectively configuring a community number to each node in the homogeneous network;
    初始化所述各节点对配置的社群号的归属度;Initialize the attribution degree of each node to the configured community number;
    基于所述各节点的社群号以及与社群号对应的归属度,生成所述各节点的社群标签;Generating the community label of each node based on the community number of each node and the attribution degree corresponding to the community number;
    根据所述各节点的社群标签分别构建所述各节点的社群标签集。The community label set of each node is respectively constructed according to the community label of each node.
  17. 根据权利要求15所述的计算机可读存储介质,其中,所述将所述交易数据作为权重对所述各节点进行计算,得到所述各节点的节点影响值的步骤包括:The computer-readable storage medium according to claim 15, wherein the step of calculating each node using the transaction data as a weight to obtain the node influence value of each node comprises:
    将所述交易数据作为权重,根据所述交易关系计算所述各节点的交易权重值;Using the transaction data as a weight, and calculating the transaction weight value of each node according to the transaction relationship;
    对于所述同质网络中的各个节点,从节点的相邻节点所对应的交易权重值中,筛选最大交易权重值和最小交易权重值;For each node in the homogeneous network, filter the maximum transaction weight value and the minimum transaction weight value from the transaction weight values corresponding to the neighboring nodes of the node;
    根据所述节点的交易权重值、所述最大交易权重值和所述最小交易权重值,计算所述节点在所述同质网络中的节点影响值。According to the transaction weight value of the node, the maximum transaction weight value and the minimum transaction weight value, the node influence value of the node in the homogeneous network is calculated.
  18. 根据权利要求17所述的计算机可读存储介质,其中,所述各节点对相邻节点的相对影响值的计算步骤包括:18. The computer-readable storage medium according to claim 17, wherein the step of calculating the relative influence value of each node on neighboring nodes comprises:
    根据交易关系和交易数据,计算每对相邻节点间的相似度;According to the transaction relationship and transaction data, calculate the similarity between each pair of adjacent nodes;
    对于所述同质网络中的各个节点,从节点的相邻节点所对应的相似度中确定相邻节点的最大相似度;For each node in the homogeneous network, determine the maximum similarity of adjacent nodes from the similarity corresponding to the adjacent nodes of the node;
    基于所述节点的节点影响值、所述节点与所述相邻节点间的相似度和所述相邻节点的最大相似度,计算所述节点对所述相邻节点的相对影响值。Based on the node influence value of the node, the similarity between the node and the adjacent node, and the maximum similarity of the adjacent node, the relative influence value of the node on the adjacent node is calculated.
  19. 根据权利要求16所述的计算机可读存储介质,其中,所述根据所述节点影响值和所述相对影响值,对所述各节点的社群标签集进行迭代更新的步骤包括:16. The computer-readable storage medium according to claim 16, wherein the step of iteratively updating the community tag set of each node according to the node influence value and the relative influence value comprises:
    根据所述节点影响值,将所述各节点进行升序排序,得到更新顺序序列;According to the node influence value, sort the nodes in ascending order to obtain an update sequence sequence;
    对于所述更新顺序序列的每个节点,依次获取节点的各相邻节点对所述节点的相对影响值以及所述节点的第一标签集;所述第一标签集由所述各相邻节点社群标签集中具有最大归属度的社群标签组成;For each node in the update sequence sequence, the relative influence value of each adjacent node of the node on the node and the first label set of the node are sequentially obtained; the first label set is determined by the adjacent nodes The community label set is composed of the community label with the greatest degree of attribution;
    基于所述相对影响值,计算所述节点对所述第一标签集中各社群标签内社群号的归属度,得到第二标签集;Based on the relative influence value, calculating the attribution degree of the node to the community number in each community label in the first label set to obtain a second label set;
    从所述第二标签集中,筛选归属度符合预设归属度条件的至少一个社群标签;From the second tag set, filter at least one community tag whose attribution degree meets the preset attribution degree condition;
    将筛选到的至少一个社群标签作为迭代更新后所述节点的社群标签集。At least one community label selected is used as the community label set of the node after iterative update.
  20. 根据权利要求16所述的计算机可读存储介质,其中,所述当所述各节点的社群标签集中的社群标签在迭代更新中不再变化时,根据所述各节点迭代更新后的社群标签将所述同质网络划分为至少一个社群节点网络的步骤包括:The computer-readable storage medium according to claim 16, wherein, when the community label in the community label set of each node no longer changes during the iterative update, according to the community label after the iterative update of each node The step of dividing the homogeneous network into at least one community node network by the group label includes:
    在对所述各节点完成每轮迭代更新后,将所述各节点的社群标签集中具备最大归属度的社群标签,确定为所述各节点社群标签集的主标签;After completing each round of iterative update for each node, determine the community label with the largest attribution in the community label set of each node as the main label of the community label set of each node;
    比较所述各节点当前的主标签,与前次迭代后所述各节点的主标签是否发生变化;Compare the current primary label of each node with the primary label of each node after the previous iteration;
    若未发生变化,根据所述各节点当前社群标签的社群号,将所述同质网络划分为至少一个社群节点网络。If there is no change, divide the homogenous network into at least one community node network according to the community number of the current community label of each node.
PCT/CN2020/105766 2020-04-29 2020-07-30 Community division method and apparatus for homogeneous network, and computer device and storage medium WO2021217933A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010356641.6A CN111641517A (en) 2020-04-29 2020-04-29 Community division method and device for homogeneous network, computer equipment and storage medium
CN202010356641.6 2020-04-29

Publications (1)

Publication Number Publication Date
WO2021217933A1 true WO2021217933A1 (en) 2021-11-04

Family

ID=72330994

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/105766 WO2021217933A1 (en) 2020-04-29 2020-07-30 Community division method and apparatus for homogeneous network, and computer device and storage medium

Country Status (2)

Country Link
CN (1) CN111641517A (en)
WO (1) WO2021217933A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112116403A (en) * 2020-09-28 2020-12-22 中国建设银行股份有限公司 Information recommendation method, device and equipment
CN114124484B (en) * 2021-11-09 2024-04-05 招商银行股份有限公司 Network attack identification method, system, device, terminal equipment and storage medium

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8126820B1 (en) * 2007-01-30 2012-02-28 Intuit Inc. Community to support the definition and sharing of source trust level configurations
US20140059124A1 (en) * 2012-08-21 2014-02-27 Sijia Song Social Action by Quick Response (QR) Code
CN103678669A (en) * 2013-12-25 2014-03-26 福州大学 Evaluating system and method for community influence in social network
CN105913235A (en) * 2016-04-21 2016-08-31 中国农业银行股份有限公司 Client account transfer relation analysis method and system
CN106296426A (en) * 2016-08-01 2017-01-04 温州医科大学附属第医院 A kind of overlapping community determines method and device
CN106355506A (en) * 2016-08-15 2017-01-25 中南大学 Method for selecting the initial node with maximum influence in online social network
CN106951524A (en) * 2017-03-21 2017-07-14 哈尔滨工程大学 Overlapping community discovery method based on node influence power
CN108073944A (en) * 2017-10-18 2018-05-25 南京邮电大学 A kind of label based on local influence power propagates community discovery method
CN110503568A (en) * 2019-08-19 2019-11-26 阿里巴巴集团控股有限公司 A kind of community division method, system and equipment

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9292884B2 (en) * 2013-07-10 2016-03-22 Facebook, Inc. Network-aware product rollout in online social networks
CN108681936B (en) * 2018-04-26 2021-11-02 浙江邦盛科技有限公司 Fraud group identification method based on modularity and balanced label propagation
CN110880150A (en) * 2018-09-05 2020-03-13 华为技术有限公司 Community discovery method, device, equipment and readable storage medium
CN110647590A (en) * 2019-09-23 2020-01-03 税友软件集团股份有限公司 Target community data identification method and related device

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8126820B1 (en) * 2007-01-30 2012-02-28 Intuit Inc. Community to support the definition and sharing of source trust level configurations
US20140059124A1 (en) * 2012-08-21 2014-02-27 Sijia Song Social Action by Quick Response (QR) Code
CN103678669A (en) * 2013-12-25 2014-03-26 福州大学 Evaluating system and method for community influence in social network
CN105913235A (en) * 2016-04-21 2016-08-31 中国农业银行股份有限公司 Client account transfer relation analysis method and system
CN106296426A (en) * 2016-08-01 2017-01-04 温州医科大学附属第医院 A kind of overlapping community determines method and device
CN106355506A (en) * 2016-08-15 2017-01-25 中南大学 Method for selecting the initial node with maximum influence in online social network
CN106951524A (en) * 2017-03-21 2017-07-14 哈尔滨工程大学 Overlapping community discovery method based on node influence power
CN108073944A (en) * 2017-10-18 2018-05-25 南京邮电大学 A kind of label based on local influence power propagates community discovery method
CN110503568A (en) * 2019-08-19 2019-11-26 阿里巴巴集团控股有限公司 A kind of community division method, system and equipment

Also Published As

Publication number Publication date
CN111641517A (en) 2020-09-08

Similar Documents

Publication Publication Date Title
WO2022126971A1 (en) Density-based text clustering method and apparatus, device, and storage medium
WO2021174944A1 (en) Message push method based on target activity, and related device
CN111698247B (en) Abnormal account detection method, device, equipment and storage medium
US8117609B2 (en) System and method for optimizing changes of data sets
WO2021169115A1 (en) Risk control method, apparatus, electronic device, and computer-readable storage medium
CN106844407B (en) Tag network generation method and system based on data set correlation
CN110866546B (en) Method and device for evaluating consensus node
WO2022095352A1 (en) Abnormal user identification method and apparatus based on intelligent decision, and computer device
WO2021159738A1 (en) Data recommendation method and device based on medical field, and server and storage medium
CN110705683A (en) Random forest model construction method and device, electronic equipment and storage medium
CN109471978B (en) Electronic resource recommendation method and device
CA3036664A1 (en) Method for data structure relationship detection
CN110300084B (en) IP address-based portrait method and apparatus, electronic device, and readable medium
WO2021217933A1 (en) Community division method and apparatus for homogeneous network, and computer device and storage medium
CN110855648B (en) Early warning control method and device for network attack
CN112613917A (en) Information pushing method, device and equipment based on user portrait and storage medium
TW201944306A (en) Method and device for determining high-risk user
CN111784528A (en) Abnormal community detection method and device, computer equipment and storage medium
WO2019061664A1 (en) Electronic device, user's internet surfing data-based product recommendation method, and storage medium
CN104077723A (en) Social network recommending system and social network recommending method
CN110798467A (en) Target object identification method and device, computer equipment and storage medium
Syah Optimization Tree Based Inference to Customer Behaviors in Dynamic Control System
CN114579584A (en) Data table processing method and device, computer equipment and storage medium
WO2023024408A1 (en) Method for determining feature vector of user, and related device and medium
CN111832661B (en) Classification model construction method, device, computer equipment and readable storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20933497

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 16.02.2023)

122 Ep: pct application non-entry in european phase

Ref document number: 20933497

Country of ref document: EP

Kind code of ref document: A1