CN109977150B

CN109977150B - Classification method based on physical characteristics and implicit style characteristics of data

Info

Publication number: CN109977150B
Application number: CN201910205905.5A
Authority: CN
Inventors: 顾苏杭; 王惠宇; 高佳琴; 王士同
Original assignee: Changzhou Vocational Institute of Light Industry
Current assignee: Changzhou Vocational Institute of Light Industry
Priority date: 2019-03-18
Filing date: 2019-03-18
Publication date: 2023-11-10
Anticipated expiration: 2039-03-18
Also published as: CN109977150A

Abstract

The invention relates to the technical fields of pattern recognition, artificial intelligence and machine learning, and in particular to a classification method based on physical characteristics and implicit style characteristics of data, comprising the following steps: (1) mapping the data set into a social network containing C sub-networks by using a K-nearest-neighbor algorithm; (2) mining the implicit style characteristics of the data, namely authority and influence, in the constructed social network; (3) calculating the double-layer structure efficiency between each test sample and each node in the social network from the data distance characteristic and the authority style characteristic, and determining the allowed connection set of the test sample for each sub-network; (4) calculating the sum of the influence of all nodes in each allowed connection set; (5) assigning the test sample the label type of the sub-network whose allowed connection set has the largest sum of node influence.

Description

Classification method based on physical characteristics and implicit style characteristics of data

Technical Field

The invention relates to the technical fields of pattern recognition, artificial intelligence and machine learning, in particular to a classification method based on physical characteristics and implicit style characteristics of data.

Background

Data classification has long been an active research topic in machine learning, pattern recognition, data mining and related fields. Its combination with practical applications such as intelligent medical treatment, face recognition, intelligent traffic monitoring and market dynamics analysis has further promoted its development and broadened its prospects in military and civilian domains. The key to data classification is to select suitable data characteristics and to construct a high-accuracy classification model by means of a classification method.

Traditional classification methods, such as support vector machines, K-nearest neighbors, random forests, Bayesian classifiers, decision trees, artificial neural networks and Takagi-Sugeno-Kang (TSK) fuzzy classifiers, train classification models using the physical characteristics of the data (such as distance, color and similarity). However, in most practical data sets there are implicit associations between data samples, and each class of samples exhibits its own implicit style characteristics. Typical data sets include: (1) epileptic electroencephalogram (EEG) signals: the EEG waveforms of healthy subjects differ markedly from those of subjects with epilepsy; (2) handwriting data sets: the handwriting style of each writer differs significantly from that of other writers; (3) vowel recognition: each English vowel is pronounced differently from the others. At present, traditional classification methods consider only the physical characteristics of the data when training a classification model and ignore its implicit style characteristics, and no published domestic or foreign literature describes a classification method that both mines the implicit style characteristics of the data and trains a classification model using the physical and implicit style characteristics together. Existing classification methods therefore do not reflect the fact that data sets contain implicit style characteristics.

Disclosure of Invention

The invention aims to provide a classification method based on data physical characteristics and implicit style characteristics that is built on a social network, conforms to the fact that most practical data sets contain implicit style characteristics, and improves classification behavior and classification accuracy by mining these implicit style characteristics through the social network. In addition, the method does not need to generate a data classification model in the training stage: once the implicit style characteristics of the nodes in the social network have been determined, it can proceed to the classification stage.

In order to achieve the above object, an embodiment of the present invention provides a classification method based on physical characteristics and implicit style characteristics of data, comprising the following steps: mapping the data set into a social network containing C sub-networks by using a K-nearest-neighbor algorithm; mining the implicit style characteristics of the data, namely authority and influence, in the constructed social network; calculating the double-layer structure efficiency between each test sample and each node in the social network from the data distance characteristic and the authority style characteristic, and determining the allowed connection set of the test sample for each sub-network; calculating the sum of the influence of all nodes in each allowed connection set; and assigning the test sample the label type of the sub-network whose allowed connection set has the largest sum of node influence.

In the above technical solution, for a given data set $X = [x_1, x_2, \ldots, x_N]^T$, where $x_i \in R^d$, with label set $Y = [y_1, y_2, \ldots, y_N]^T$, mapping the given data set X into a social network G using a K-nearest-neighbor algorithm further comprises: mapping the data set X into a social network $G = \{g_1, g_2, \ldots, g_Q\}$ using the K-nearest-neighbor algorithm, where Q equals the number of classes contained in X and each sample $x_i$ in X corresponds to a node $v_i$ of G. According to the K-nearest-neighbor algorithm, if any two nodes $v_i$ and $v_j$ in G satisfy the following two conditions: (1) node $v_j$ is a neighbor node of node $v_i$, and (2) node $v_j$ and node $v_i$ have the same label, then a directed edge $e_{ij}$ with $v_i$ as its start point and $v_j$ as its end point is established between $v_i$ and $v_j$. In the resulting social network G, each sub-network corresponds to one data class in X, any two sub-networks $g_p$ and $g_q$ are independent of each other, and all nodes within a sub-network share the same label, which is the label of the corresponding data class.
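The construction rule above can be sketched in Python as follows. This is only an illustrative sketch, not the patented implementation: the function name `build_social_network` is hypothetical, and it assumes that the K nearest neighbors of a sample are searched among all samples by Euclidean distance and then filtered by label, which is one possible reading of the two conditions.

```python
import numpy as np

def build_social_network(X, y, k):
    """Return directed edges (i, j) where v_j is one of the k nearest
    neighbors of v_i and both nodes carry the same label."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    # Pairwise Euclidean distances between all samples.
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    np.fill_diagonal(dist, np.inf)  # a node is never its own neighbor
    edges = set()
    for i in range(len(X)):
        # Assumption: neighbors are searched among all samples, then filtered.
        for j in np.argsort(dist[i])[:k]:
            if y[j] == y[i]:
                edges.add((i, int(j)))  # directed edge e_ij from v_i to v_j
    return edges

# Toy data set with two well-separated classes (Q = 2 sub-networks).
X = [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [5.0, 5.0], [5.1, 5.0], [5.0, 5.1]]
y = [0, 0, 0, 1, 1, 1]
edges = build_social_network(X, y, k=2)
```

Because an edge is only created between nodes with the same label, the resulting graph splits into one sub-network per class, with no edges between sub-networks.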

According to the invention, mining the two implicit style characteristics of the data, authority and influence, in the constructed social network G further comprises: first mining, in the social network G constructed as described above, the authority $a_i$ of each node $v_i$ and the authority of each sub-network $g_q$, where each node is a data sample in X and each sub-network is a data class in X. The authority $a_i$ of node $v_i$ is calculated comprehensively from the out-degree, in-degree and degree of node $v_i$ in G. Accordingly, in the social network G, a node has higher authority if more other nodes connect to it, and it also has higher authority if it connects to more other nodes. The authority $a_i$ of node $v_i$ is calculated as

where the two component terms are calculated respectively as

In formulas (1) to (4), the degree terms denote the out-degree, the in-degree and the degree $d_i$ of node $v_i$, respectively. ζ denotes a very small positive value chosen so that outliers or noise samples in the data set X do not affect the classification performance of the method.

The authority of node $v_i$ is fuzzified by a fuzzification method to obtain the fuzzy weight $\omega_i$ of node $v_i$, calculated as

where N denotes the total number of samples contained in data set X. From equation (2), the fuzzy weight $\omega_i$ lies in the range (0, 1), and the higher the authority of node $v_i$, the higher its fuzzy weight.

Once the authority of every node has been determined, the authority of sub-network $g_q$ can be calculated as

where the node count of $g_q$ is the number of nodes contained in sub-network $g_q$, i.e. the number of samples in the data class corresponding to $g_q$, and $v_m$ denotes the m-th node contained in $g_q$.

Once the fuzzy weight of every node $v_i$ has been determined, the influence of node $v_i$ can be calculated as

where the iterated quantity denotes the influence of the i-th node $v_i$ at the h-th iteration, and α denotes the damping coefficient of the social network, typically set to α = 0.85. $\rho_j$ denotes the node density in the social network G, calculated by the following formula

where $d_{jk}$ denotes the distance between nodes $v_j$ and $v_k$, taken as the Euclidean distance, and $d_c$ denotes the cut-off distance, whose value can be set manually so that the number of nodes surrounding node $v_j$ accounts for 1-2% of all nodes in G. χ(·) denotes an indicator function: if $d_{jk} - d_c < 0$ then χ(·) = 1, otherwise χ(·) = 0.
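The density definition can be illustrated with a short Python sketch. The density formula itself is not reproduced in this text, so the sketch assumes $\rho_j = \sum_{k \neq j} \chi(d_{jk} - d_c)$, which is what the description of χ(·) and the cut-off distance suggests. The function name `node_density` and the exclusion of a node from its own count are assumptions.

```python
import numpy as np

def node_density(X, dc):
    """Count, for each node v_j, the other nodes lying closer than the
    cut-off distance dc (assumed form: rho_j = sum_k chi(d_jk - dc))."""
    X = np.asarray(X, dtype=float)
    # d_jk: pairwise Euclidean distances.
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    within = dist < dc               # chi(.) = 1 exactly when d_jk - dc < 0
    np.fill_diagonal(within, False)  # assumption: a node does not count itself
    return within.sum(axis=1)

# Three nearby points and one isolated point.
points = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [10.0, 10.0]]
rho = node_density(points, dc=1.5)
```

With this choice, an isolated sample such as the last point receives density 0, so it contributes little when density is propagated during the influence iteration.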

The iteration in equation (4) terminates when the number of iterations reaches the maximum value H or when the following condition is satisfied.

where $\|\cdot\|_2$ denotes the 2-norm and θ denotes a small threshold that can be set manually, e.g. $\theta = 10^{-4}$.
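The stopping rule can be sketched as a generic fixed-point loop in Python. Neither the influence update formula nor the exact termination condition is reproduced in this text, so the sketch makes two assumptions: the condition compares the influence vectors of two successive iterations in the 2-norm, and the update is passed in as a hypothetical callable `update`. The toy update used below is a stand-in, not the patent's influence formula.

```python
import numpy as np

def iterate_until_converged(update, initial, max_iter=100, theta=1e-4):
    """Apply `update` repeatedly until ||I_h - I_(h-1)||_2 < theta
    (assumed form of the condition) or max_iter (H) iterations are done."""
    current = np.asarray(initial, dtype=float)
    for h in range(1, max_iter + 1):
        nxt = np.asarray(update(current), dtype=float)
        if np.linalg.norm(nxt - current, ord=2) < theta:
            return nxt, h  # converged at iteration h
        current = nxt
    return current, max_iter  # stopped by the iteration cap H

# Stand-in update with fixed point 2.0 for every component.
influence, steps = iterate_until_converged(lambda v: 0.5 * v + 1.0, [0.0, 0.0])
```

Keeping both a threshold θ and a cap H guarantees that the loop ends even if the update converges slowly.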

As can be seen from formula (8), node influence is calculated using the density of the nodes in the social network G, i.e. according to the actual distribution of the samples in the data set. Node density is propagated continuously during the iteration, which gives node influence a dynamic character. In addition, the node fuzzy weight links node authority to node influence, so that the two are positively correlated: the higher the authority of a node, the higher its influence.

For the test set $T = [t_1, t_2, \ldots, t_M]^T$, where $t_m \in R^d$, calculating the allowed connection set between each test sample in the test set and each sub-network in the social network G according to the data physical characteristics and the data implicit style characteristics further comprises: when a test sample t in the test set T is embedded into the social network G, the double-layer structure efficiency $\Lambda_{t,j}$ is first calculated as

where $v_j$ denotes the j-th node in sub-network $g_q$, i.e. the j-th sample of the data class corresponding to $g_q$, and $d_{tj}$ denotes the distance between the test sample t and node $v_j$, taken as the Euclidean distance. γ denotes a balance coefficient: the higher its value, the greater the role played by node authority; the lower its value, the greater the role played by the physical characteristics of the data. From equation (10), the double-layer structure efficiency $\Lambda_{t,j}$ is determined jointly by the physical characteristics and the implicit style characteristics of the data. Using the double-layer structure efficiency, the allowed connection set can be determined and then used to calculate the sum of allowed-connection-set influence between the test sample t and each sub-network. The criterion for determining the allowed connection set is expressed as follows

where the double-layer structure efficiency $\Lambda_{t,j}$ is a function of the test sample t and node $v_j$, used to check whether establishing a connecting edge between t and $v_j$ raises or lowers the double-layer structure efficiency. Accordingly, generation of the allowed connection set can be described as follows

1) If there is a node $v_j$ such that $\Lambda_{t,j} \geq 1$, i.e. the double-layer structure efficiency is improved, the node $v_j$ is added to the allowed connection set.

2) If no node $v_j$ satisfies 1), i.e. the double-layer structure efficiency is reduced, the node whose $\Lambda_{t,j}$ is closest to 1 is added to the allowed connection set. In this case the allowed connection set contains only one node.
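Rules 1) and 2) can be sketched in Python as follows. The sketch takes the efficiency values of one sub-network as given, since the efficiency formula is not reproduced in this text; the function name `allowed_connection_set` and the representation of nodes by their index are assumptions.

```python
import numpy as np

def allowed_connection_set(efficiency):
    """Given the double-layer structure efficiency of every node in one
    sub-network, return the indices of the nodes in the allowed connection set."""
    lam = np.asarray(efficiency, dtype=float)
    improving = np.flatnonzero(lam >= 1.0)
    if improving.size > 0:
        # Rule 1: every node whose efficiency is at least 1 is allowed.
        return improving.tolist()
    # Rule 2: no node qualifies, so keep only the node closest to 1.
    return [int(np.argmin(np.abs(lam - 1.0)))]

set_a = allowed_connection_set([0.4, 1.2, 1.0, 0.9])  # rule 1 applies
set_b = allowed_connection_set([0.4, 0.8, 0.6])       # rule 2 applies
```

Rule 2 ensures that every sub-network contributes a non-empty allowed connection set, so each class always receives an influence sum in the next step.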

As a further improvement of the invention, calculating the sum of the influence of the nodes in each allowed connection set from the generated allowed connection sets further comprises: from the allowed connection sets, the sum of the influence of the nodes in each allowed connection set is calculated in order to determine the maximum influence sum. The sum of the influence of the nodes in each allowed connection set is calculated as

Assigning the test sample the sub-network label type corresponding to the largest sum of node influence, based on the above sums of influence of the nodes in each allowed connection set, further comprises: determining the maximum influence sum from the sums of influence of the nodes in each allowed connection set, calculated as follows

According to the maximum influence sum, the label type of the test sample is identified as the sub-network label type corresponding to that sum.
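The final decision can be sketched in Python as a sum followed by an arg-max. The sketch assumes that node influence values and the allowed connection set of each sub-network are already available; the function name `classify`, the dictionary layout and the numeric values are illustrative assumptions only.

```python
def classify(influence, allowed_sets):
    """Sum node influence over each sub-network's allowed connection set
    and return the label with the largest sum, plus all the sums."""
    totals = {
        label: sum(influence[j] for j in nodes)
        for label, nodes in allowed_sets.items()
    }
    best_label = max(totals, key=totals.get)
    return best_label, totals

# Hypothetical influence values for five nodes of the social network.
influence = [0.2, 0.5, 0.1, 0.4, 0.3]
# Allowed connection sets of one test sample: label -> node indices.
allowed_sets = {0: [0, 1], 1: [3]}
label, totals = classify(influence, allowed_sets)
```

Here the allowed connection set of label 0 has the larger influence sum, so the test sample would be assigned label 0.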

The invention has the beneficial effects that: the invention can accord with the fact that most actual data sets contain data implicit style characteristics, and the data implicit style characteristics are mined through the social network to improve the data classification behavior and improve the data classification precision.

Drawings

FIG. 1 is a flow chart of the present invention;

FIG. 2 is a flowchart of an algorithm of the present invention;

FIG. 3 is a schematic diagram of a social network of the present invention mapped from a data set containing two data classes;

FIG. 4 is a diagram showing node attributes in a social network in accordance with the present invention;

FIG. 5 is a schematic diagram of node authority in a social network in accordance with the present invention;

FIG. 6 is a schematic diagram of the influence of nodes in a social network according to the present invention;

FIG. 7 is a schematic diagram of an allowed connection set generated in accordance with the efficiency of the bilayer structure of the present invention;

FIG. 8 is a schematic diagram of the present invention for predicting test sample tag types based on the sum of maximum node impact of allowed connection sets.

Detailed Description

The present invention will be further described in detail with reference to the drawings and examples, which are only for the purpose of illustrating the invention and are not to be construed as limiting the scope of the invention.

The technical scheme of the invention is further and fully described below with reference to the specific embodiments and the attached drawings. It is apparent that the specific embodiments may be shown in the following drawings and are merely to illustrate the technical solution of the present invention, not to limit the specific application of the present invention.

As shown in FIG. 1, the social-network-based classification method using physical characteristics and implicit style characteristics of data comprises the following steps:

Step one: for a given data set $X = [x_1, x_2, \ldots, x_N]^T$, where $x_i \in R^d$, with label set $Y = [y_1, y_2, \ldots, y_N]^T$, map the data set X into a social network $G = \{g_1, g_2, \ldots, g_Q\}$ using a K-nearest-neighbor algorithm, where Q equals the number of classes contained in X and each sample $x_i$ in X corresponds to a node $v_i$ of G.

Further, in step one, according to the K-nearest-neighbor algorithm, if any two nodes $v_i$ and $v_j$ in G satisfy the following two conditions: node $v_j$ is a neighbor node of node $v_i$, and node $v_j$ and node $v_i$ have the same label, then a directed edge $e_{ij}$ with $v_i$ as its start point and $v_j$ as its end point is established between $v_i$ and $v_j$.

Further, in step one, in the social network G each sub-network corresponds to one data class in X, any two sub-networks $g_p$ and $g_q$ are independent of each other, and all nodes within a sub-network share the same label, which is the label of the corresponding data class.

Step two: mine the two implicit style characteristics of the data, authority and influence, in the social network G constructed in step one. That is, first mine the authority $a_i$ of each node $v_i$ in the social network G constructed in step one and the authority of each sub-network $g_q$, where each node is a data sample in X and each sub-network is a data class in X.

The authority $a_i$ of node $v_i$ is calculated comprehensively from the out-degree, in-degree and degree of node $v_i$ in G. Accordingly, in the social network G, a node has higher authority if more other nodes connect to it, and it also has higher authority if it connects to more other nodes. The authority $a_i$ of node $v_i$ is calculated as

where the two component terms are calculated respectively as

In the above formulas, the degree terms denote the out-degree, the in-degree and the degree $d_i$ of node $v_i$, respectively. ζ denotes a very small positive value chosen so that outliers or noise samples in the data set X do not affect the classification performance of the method.

where N denotes the total number of samples contained in data set X. From equation (5), the fuzzy weight $\omega_i$ lies in the range (0, 1), and the higher the authority of node $v_i$, the higher its fuzzy weight.

where $d_{jk}$ denotes the distance between nodes $v_j$ and $v_k$, taken as the Euclidean distance, and $d_c$ denotes the cut-off distance, whose value can be set manually so that the number of nodes surrounding node $v_j$ accounts for 1-2% of all nodes in G. χ(·) denotes an indicator function: if $d_{jk} - d_c < 0$ then χ(·) = 1, otherwise χ(·) = 0. The iteration in equation (4) terminates when the number of iterations reaches the maximum value H or when the following condition is satisfied.

As can be seen from formula (7), node influence is calculated using the density of the nodes in the social network G, i.e. according to the actual distribution of the samples in the data set. Node density is propagated continuously during the iteration, which gives node influence a dynamic character. In addition, the node fuzzy weight links node authority to node influence, so that the two are positively correlated: the higher the authority of a node, the higher its influence.

Step three: for the test set $T = [t_1, t_2, \ldots, t_M]^T$, where $t_m \in R^d$, calculate the allowed connection set between each test sample in the test set and each sub-network in the social network G according to the data physical characteristics and the data implicit style characteristics. Specifically, when a test sample t in the test set T is embedded into the social network G, the double-layer structure efficiency $\Lambda_{t,j}$ is first calculated as

where $v_j$ denotes the j-th node in sub-network $g_q$, i.e. the j-th sample of the data class corresponding to $g_q$, and $d_{tj}$ denotes the distance between the test sample t and node $v_j$, taken as the Euclidean distance. γ denotes a balance coefficient: the higher its value, the greater the role played by node authority; the lower its value, the greater the role played by the physical characteristics of the data. The double-layer structure efficiency $\Lambda_{t,j}$ is determined jointly by the physical characteristics and the implicit style characteristics of the data. Using the double-layer structure efficiency, the allowed connection set can be determined and then used to calculate the sum of allowed-connection-set influence between the test sample t and each sub-network. The criterion for determining the allowed connection set is expressed as follows

Step four: from the allowed connection sets described above, the sum of the influence of the nodes in each allowed connection set is calculated in order to determine the maximum influence sum. The sum of the influence of the nodes in each allowed connection set is calculated as

Step five: determine the maximum influence sum from the sums of influence of the nodes in each allowed connection set, calculated as follows

Further, according to the maximum influence sum, the label type of the test sample is identified as the sub-network label type corresponding to that sum.

As shown in FIG. 2, when the data set X and the test data set T are input, in the training stage the data set X is first mapped into the social network G by the K-nearest-neighbor algorithm of step one, and the authority and influence of each node and the authority of each sub-network in G are then mined by step two. Specifically, in step two the concentration of each node and the sub-network authority are calculated by formulas (1)-(4), and the influence of each node is calculated by formula (7) and formula (9). In the classification stage, when a test sample t is input, the allowed connection set between t and each sub-network is first established according to formula (10) and formula (11) of step three; the sum of node influence in each allowed connection set is then calculated according to formula (12) of step four; finally, the allowed connection set with the largest sum of node influence is determined according to formula (13) of step five, and the label type of the test sample is determined to be the label type corresponding to that allowed connection set.

As shown in FIG. 3, the input data set X contains two classes of data, labeled 0 and 1, so that mapping X into the social network G yields two mutually independent sub-networks, each drawn with its own marker (one of which is "■"); the label types of the two sub-networks are likewise 0 and 1.

As shown in FIG. 4, the degrees of some nodes and the distances between some nodes, taken as Euclidean distances, are shown in the social network G. The in-degree and out-degree of a node can be determined from the directed edges established between nodes.

As shown in FIG. 5, the authoritativeness of all nodes and subnetworks is shown in social network G, wherein the authoritativeness of a subnetwork is calculated from the authoritativeness of all nodes in the subnetwork.

As shown in FIG. 6, the influence of all nodes is shown in the social network G. During the iterative calculation of node influence, the density of each node is propagated through the social network, i.e. according to the actual distribution of the samples in the data set, which gives node influence a dynamic character.

As shown in FIG. 7, when a test sample is embedded into the established social network G, the allowed connection set between the test sample and each sub-network is determined through the double-layer structure efficiency. The test sample is denoted by 'o', and in the double-layer structure efficiency the physical characteristics and the implicit style characteristics of the data act together through the balance coefficient.

As shown in FIG. 8, because the sum of node influence of the allowed connection set between the test sample and the sub-network labeled 0 is greater than that of the allowed connection set between the test sample and the sub-network "■", the label type of the test sample is determined to be "0".

The foregoing has outlined and described the basic principles, features, and advantages of the present invention. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, and that the above embodiments and descriptions are merely illustrative of the principles of the present invention, and various changes and modifications may be made without departing from the spirit and scope of the invention, which is defined in the appended claims. The scope of the invention is defined by the appended claims and equivalents thereof.

Claims

1. The classification method based on the physical characteristics and the implicit style characteristics of the data is characterized by comprising the following steps:

step one: for a given data set $X = [x_1, x_2, \ldots, x_N]^T$, where $x_i \in R^d$, with label set $Y = [y_1, y_2, \ldots, y_N]^T$, mapping the given data set X into a social network G by using a K-nearest-neighbor algorithm, wherein the given data set is an epileptic electroencephalogram signal data set, a handwriting data set or a vowel recognition data set;

step two: mining the two implicit style characteristics of the data, data authority and data influence, in the constructed social network G;

step three: for the test set $T = [t_1, t_2, \ldots, t_M]^T$, where $t_m \in R^d$, calculating an allowed connection set between each test sample in the test set and each sub-network in the social network G according to the physical characteristics and the implicit style characteristics of the data;

step four: calculating the sum of the influence of the nodes in each allowed connection set according to the allowed connection set generated in the step three;

step five: assigning the test sample the sub-network label type corresponding to the largest sum of node influence, according to the sums of influence of the nodes in each allowed connection set calculated in step four;

wherein mining the two implicit style characteristics of the data, data authority and data influence, in the constructed social network G further comprises:

first mining, in the social network G constructed as described above, the authority $a_i$ of each node $v_i$ and the authority of each sub-network $g_q$, wherein each node is a data sample in the data set X and each sub-network is a data class in the data set X;

the authority $a_i$ of node $v_i$ is calculated comprehensively from the in-degree, out-degree and degree of node $v_i$ in the social network G, the authority $a_i$ of node $v_i$ being calculated as

where the two component terms are calculated respectively as

In the above formulas, the degree terms denote the out-degree, the in-degree and the degree $d_i$ of node $v_i$, respectively, and $\xi$ denotes a very small positive value chosen so that outliers or noise samples in the data set X do not affect the classification performance of the classification method;

where N denotes the total number of samples contained in data set X;

once the authority of every node has been determined, the authority of sub-network $g_q$ is calculated as

where the node count of $g_q$ is the number of nodes contained in sub-network $g_q$, i.e. the number of samples in the data class corresponding to $g_q$, and $v_m$ denotes the m-th node contained in $g_q$;

once the fuzzy weight of every node $v_i$ has been determined, the influence of node $v_i$ is calculated as

where the iterated quantity denotes the influence of the i-th node $v_i$ at the h-th iteration, and α denotes the damping coefficient of the social network, set to α = 0.85; $\rho_j$ denotes the node density in the social network G, calculated by the following formula

where $d_{jk}$ denotes the distance between nodes $v_j$ and $v_k$, taken as the Euclidean distance, and $d_c$ denotes the cut-off distance, whose value is set manually so that the number of nodes surrounding node $v_j$ accounts for 1-2% of all nodes in the social network G; χ(·) denotes an indicator function, i.e. if $d_{jk} - d_c < 0$ then χ(·) = 1, otherwise χ(·) = 0; the iteration in formula (7) terminates when the number of iterations reaches the maximum value H or when the following condition is satisfied;

where $\|\cdot\|_2$ denotes the 2-norm and θ denotes a threshold.

2. The classification method based on data physical characteristics and implicit style characteristics of claim 1, wherein, for a given data set $X = [x_1, x_2, \ldots, x_N]^T$, where $x_i \in R^d$, with label set $Y = [y_1, y_2, \ldots, y_N]^T$, mapping the given data set X into a social network G using a K-nearest-neighbor algorithm further comprises:

mapping the given data set X into a social network $G = \{g_1, g_2, \ldots, g_Q\}$ using the K-nearest-neighbor algorithm, where Q equals the number of classes contained in data set X and each sample $x_i$ in data set X corresponds to a node $v_i$ of the social network G;

according to the K-nearest-neighbor algorithm, if any two nodes $v_i$ and $v_j$ in the social network G satisfy the following conditions:

node $v_j$ is a neighbor node of node $v_i$, and node $v_j$ and node $v_i$ have the same label, then a directed edge $e_{ij}$ with $v_i$ as its start point and $v_j$ as its end point is established between $v_i$ and $v_j$.

3. The classification method based on data physical characteristics and implicit style characteristics according to claim 2, wherein, in said social network G, each sub-network corresponds to one data class in data set X, any two sub-networks $g_p$ and $g_q$ are independent of each other, and all nodes within a sub-network share the same label, which is the label of the corresponding data class.

4. The classification method based on data physical characteristics and implicit style characteristics as claimed in claim 3, wherein, for the test set $T = [t_1, t_2, \ldots, t_M]^T$, where $t_m \in R^d$, calculating an allowed connection set between each test sample in the test set and each sub-network in the social network G according to the data physical characteristics and the data implicit style characteristics further comprises:

when a test sample t in the test set T is embedded into the social network G, first calculating the double-layer structure efficiency $\Lambda_{t,j}$ as

where $v_j$ denotes the j-th node in sub-network $g_q$, i.e. the j-th sample of the data class corresponding to $g_q$, and $d_{tj}$ denotes the distance between the test sample t and node $v_j$, taken as the Euclidean distance; γ denotes a balance coefficient: the higher its value, the greater the role played by node authority, and the lower its value, the greater the role played by the physical characteristics of the data;

the criterion for determining the allowed connection set is expressed as follows

where the double-layer structure efficiency $\Lambda_{t,j}$ is a function of the test sample t and node $v_j$, used to check whether establishing a connecting edge between the test sample t and node $v_j$ raises or lowers the double-layer structure efficiency $\Lambda_{t,j}$.

5. The classification method based on data physical characteristics and implicit style characteristics of claim 4, wherein calculating the sum of the influence of the nodes in each allowed connection set from said generated allowed connection sets further comprises:

from the allowed connection sets, calculating the sum of the influence of the nodes in each allowed connection set in order to determine the maximum influence sum, the sum of the influence of the nodes in each allowed connection set being calculated as

6. The classification method based on data physical characteristics and implicit style characteristics according to claim 5, wherein assigning the test sample the sub-network label type corresponding to the largest sum of node influence, according to the sums of influence of the nodes in each allowed connection set, further comprises:

determining the maximum influence sum from the sums of influence of the nodes in each allowed connection set, calculated as follows