CN113051557B - Social network cross-platform malicious user detection method based on vertical federated learning - Google Patents
Social network cross-platform malicious user detection method based on vertical federated learning
- Publication number
- CN113051557B (application CN202110275639A)
- Authority
- CN
- China
- Prior art keywords
- data
- party
- passive
- active
- layer
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/50—Monitoring users, programs or devices to maintain the integrity of platforms, e.g. of processors, firmware or operating systems
- G06F21/55—Detecting local intrusion or implementing counter-measures
- G06F21/554—Detecting local intrusion or implementing counter-measures involving event detection and direct action
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/243—Classification techniques relating to the number of classes
- G06F18/24323—Tree-organised classifiers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/60—Protecting data
- G06F21/602—Providing cryptographic facilities or services
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/60—Protecting data
- G06F21/62—Protecting access to data via a platform, e.g. using keys or access control rules
- G06F21/6218—Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
- G06F21/6245—Protecting personal data, e.g. for financial or medical purposes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/01—Social networking
Abstract
A social network cross-platform malicious user detection method based on vertical federated learning comprises the following steps: step 1, constructing a social network cross-platform malicious user detection hierarchical architecture based on vertical federated learning; step 2, dividing the participants into an active party and a passive party, and preprocessing the sample data of the active party and the passive party at a data preprocessing layer to obtain structured data; step 3, mapping the structured data produced by the data preprocessing layer to the sample data shared by the active party and the passive party; step 4, cooperatively training a global model under the definition of machine learning, encrypting and decrypting the data of the active party and the passive party with homomorphic encryption, and thereby completing the federated learning layer training; step 5, updating the local model training parameters of each party and outputting the prediction result; and step 6, transmitting the prediction result obtained by the federated learning layer back to each participant at a data application layer, thereby achieving a high-quality malicious user detection effect.
Description
Technical Field
The invention belongs to the technical field of the Internet, and particularly relates to a social network cross-platform malicious user detection method based on vertical federated learning.
Background
Online Social Networks (OSNs) have developed rapidly: according to the 45th Statistical Report on China's Internet Development (March 2020), OSN users reached 904 million and Internet penetration reached 64.5%. While OSNs help people build social network application services, they have gradually become the primary target of malicious users attempting to carry out illegal activities and cause malicious harm, and these malicious behaviors bring adverse effects and great harm to society.
At present, traditional machine learning methods, such as semi-supervised clustering and support vector machine classifiers, rely on big-data techniques to extract and train on the behavioral features of malicious users, and achieve high-quality detection on OSN platforms.
Technical scheme 1: the article "Detecting Malicious Social Bots Based on Clickstream Sequences" by Shi P. et al. (IEEE Access, 2019) proposes a malicious user detection algorithm based on spatial and temporal features, using context-aware transition probability features between click streams.
Technical scheme 2: the paper "Malicious program multi-feature detection on the Android platform" by Wu et al. (Journal of Chinese Computer Systems) proposes a hybrid algorithm based on multiple feature types, constructing different classifiers from large-scale feature data to achieve efficient detection results.
However, the successful application of the above schemes relies on centralized social big data. In real application scenarios, malicious users exhibit dispersion, latency, complexity and other characteristics, and the data of any single party can hardly meet the detection requirements, so the data of two or even multiple parties must be jointly trained to achieve a satisfactory detection effect. Moreover, as laws and regulations mature, the emphasis on user privacy and data security has become a globally accepted trend; for example, the General Data Protection Regulation (GDPR) issued by the European Union explicitly prohibits gathering the user data of each party without user consent. Therefore, how to solve the problem of data fragmentation while complying with laws and regulations is undoubtedly an important research subject in current social network scenarios.
Disclosure of Invention
To solve the above technical problems, the invention provides a social network cross-platform malicious user detection method based on vertical federated learning, which fuses multi-party data for modeling and analysis on the premise of protecting the privacy of common user data, thereby achieving a high-quality malicious user detection effect.
To achieve this technical purpose, the adopted technical scheme is as follows: a social network cross-platform malicious user detection method based on vertical federated learning comprises the following steps:
step 1, constructing a social network cross-platform malicious user detection hierarchical architecture based on vertical federated learning, wherein the architecture comprises a data preprocessing layer, an encrypted sample alignment layer, a federated learning layer and a data application layer;
step 2, first selecting several participants and dividing them into an active party and a passive party, where the active party provides user sample data together with label values and a passive party provides only user sample data, and preprocessing the sample data of the active party and the passive party at the data preprocessing layer to obtain structured data;
step 3, at the sample alignment layer, mapping the structured data produced by the data preprocessing layer to the sample data common to the active party and the passive party, using a secure set-intersection scheme based on the RSA asymmetric encryption algorithm and a hash mechanism;
step 4, after the processing of step 3, the active party and the passive party have determined the sample data shared by both sides; using their local models, the active party and the passive party cooperatively train a global model under the definition of machine learning, and the data of the active party and the passive party are encrypted and decrypted with homomorphic encryption, completing the training of the federated learning layer;
step 5, after the federated learning layer is trained, the active party and the passive party update their local model training parameters and output the prediction result;
and step 6, encapsulating a data-calling interface in the data application layer, transmitting the prediction result obtained by the federated learning layer back to each participant, and letting each participant update and classify local data to obtain the malicious user detection result.
The specific implementation process of encrypting and decrypting the data of the active party and the passive party using homomorphic encryption comprises the following steps:
Step 2.1, the active party calculates the first-order gradient value and the second-order gradient value of the sensitive data, encrypts them using additively homomorphic encryption, and then sends the encrypted gradient values to the passive party;
Step 2.2, the passive party bins all of its features, maps each feature value into a bucket, aggregates the corresponding encrypted gradient information according to the binned feature values, and then sends the aggregated encrypted gradient information to the active party;
Step 2.3, the active party decrypts the received aggregated encrypted gradient information to obtain the optimal division Divide_max of the current node, and returns the feature ID and threshold ID of the current node to the passive party;
Step 2.4, the passive party receives the feature ID and threshold ID and divides the total sample space I of the current node, where I_R + I_L = I and I_L, I_R are the left and right sample spaces respectively; it records the record ID, feature ID and threshold ID of the current node, and sends the record ID and the divided left sample space I_L to the active party;
Step 2.5, the active party divides the current node according to the record ID and the left sample space I_L, and proceeds to the division of the next node;
Step 2.6, the processes (2.2) to (2.5) are iterated; after all the current decision trees have been constructed, the optimal weight w_j* of each leaf node in the decision trees is calculated, finishing the training;
Step 2.7, the active party sends the record ID of the current node and the feature's threshold to the passive party;
Step 2.8, the passive party compares the current node's feature value against the threshold to obtain a search decision and sends the search decision to the active party;
Step 2.9, the active party receives the search decision and moves to the corresponding child node, continuing until a leaf node is reached and a classification label and the label's optimal weight are obtained;
Step 2.10, the processes (2.7) to (2.9) are iterated; after all the decision trees have been traversed, a weighted summation is performed over the optimal weights corresponding to the obtained classification labels, finally yielding the label sets of normal users and malicious users.
The preprocessing operation converts each participant's data into structured data through data cleaning, random sampling, data binning, data normalization and similar operations.
The invention has the beneficial effects that:
(1) The invention provides a social network cross-platform malicious user detection method based on vertical federated learning, realized on the premise of ensuring user privacy and data security.
(2) The invention provides an effective problem-handling mechanism for unstructured data at the data preprocessing layer, used to solve the multi-source heterogeneity problem of the data.
(3) The invention constructs a data application layer; by encapsulating a data-calling interface, cross-platform malicious users in the social network can be detected in real time.
(4) The invention uses homomorphic encryption to encrypt and decrypt the data of the active party and the passive party during algorithm execution. The algorithm is an end-to-end detection algorithm that, under the premise of privacy protection, achieves the same accuracy as traditional machine learning methods. A regularization penalty term is added to the algorithm, improving the generalization ability and detection effect of the model, and sensitive data are encrypted, effectively guaranteeing the security and accuracy of the model.
Drawings
FIG. 1 is a hierarchical architecture for cross-platform malicious user detection in a social network according to the present invention;
FIG. 2 is a flow of a data pre-processing layer of the present invention;
FIG. 3 is a sample alignment layer flow of the present invention;
FIG. 4 is a federated learning layer flow of the present invention;
FIG. 5 is a flow chart of a malicious user detection algorithm for multi-party privacy protection according to the present invention;
FIG. 6 is a social network cross-platform malicious user detection framework of the present invention;
FIG. 7 is a malicious user detection page of the multimedia social network CyVOD desktop version of the present invention.
Detailed Description
With the rapid development of online social networks, social networks, while helping people build social network application services, have gradually become the primary target of malicious users attempting to carry out illegal activities and cause malicious harm. Malicious users can lurk across multiple social network platforms, attempting to steal users' privacy, infiltrate political topics and so on by publishing false information; these behaviors bring adverse effects and great harm to society. At present, existing machine learning detection methods achieve high-quality detection based on large-scale data. However, as laws and regulations mature, gathering the user data of all parties in one place is no longer permissible. Therefore, the invention relies on federated learning technology to fuse multi-party data for modeling and analysis on the premise of guaranteeing data security and user privacy protection, thereby achieving accurate detection of malicious users on social network platforms.
A social network cross-platform malicious user detection method based on vertical federated learning comprises the following steps:
Step 1: as shown in FIG. 1, a social network cross-platform malicious user detection hierarchical architecture based on vertical federated learning is constructed; the architecture comprises a data preprocessing layer, an encrypted sample alignment layer, a federated learning layer and a data application layer.
Data preprocessing layer: in actual application scenarios, due to differing functional requirements, technical levels, storage modes and so on, the data of each participant usually does not exist in a structured form; data preprocessing handles the conversion of raw data into structured data for the modeling process. As shown in FIG. 2, the preprocessing operation converts each participant's data into structured data by data cleansing, random sampling, data binning and data normalization.
Sample alignment layer: before the participants begin modeling, the sample alignment layer aligns the users shared by all participants using an encrypted ID-matching technique, on the premise of guaranteeing user security and privacy protection.
Federated learning layer: the federated learning layer performs model training through the exchange of encrypted parameters. After the common samples of the parties are determined, the participants can cooperatively train a global model under the machine learning definition. However, to prevent privacy leakage during model training, the federated learning layer needs to introduce a trusted coordinating party, which uses a privacy-protection technique (such as homomorphic encryption) to encrypt and decrypt sample data and to coordinate the training process.
Data application layer: after the federated learning layer finishes training, each participant updates its local training model and outputs the prediction result; the data application layer transmits the prediction result back to the terminal through an encapsulated data-calling interface, and the terminal updates and classifies local data, providing a basis for malicious user detection.
Step 2: first, several participants are selected and divided into an active party and a passive party; the party providing user sample data and label values acts as the active party, while a party providing only user sample data acts as the passive party. The sample data of the active party and the passive party are preprocessed at the data preprocessing layer to obtain structured data.
In the invention, data whose exposure would leak privacy during each participant's model training is called sensitive data. To guarantee the security of the sensitive data, a malicious user detection algorithm oriented to multi-party privacy protection is encapsulated in the federated learning layer, and a privacy-protection method (homomorphic encryption) is adopted to encrypt the sensitive data, so that multi-party training can be carried out without exposing any participant's data. At the same time, the roles played by the participants in the algorithm are defined as the active party and the passive party, respectively.
The active party: provides user sample data and label values, plays the role of coordinator in the training process, participates in the encryption and decryption of the sensitive data, and coordinates the training process.
The passive party: typically provides only user sample data.
Step 3: at the sample alignment layer, the structured data produced by the data preprocessing layer are mapped to the sample data common to the active party and the passive party, using a secure set-intersection scheme based on the RSA asymmetric encryption algorithm and a hash mechanism.
Step 4: after the processing of step 3, the active party and the passive party have determined the sample data shared by both sides; using their local models, the active party and the passive party cooperatively train a global model under the definition of machine learning, and the data of the active party and the passive party are encrypted and decrypted with homomorphic encryption, completing the training of the federated learning layer.
Step 5: after the federated learning layer is trained, the active party and the passive party update their local model training parameters, and the prediction result is output to the data application layer.
Step 6: a data-calling interface is encapsulated in the data application layer, the prediction result obtained by the federated learning layer is transmitted back to each participant, and each participant updates and classifies local data to obtain the malicious user detection result.
The invention sets the algorithm objective function as the sum of a loss function and a regularization penalty term; the regularization penalty term is introduced to control the complexity of the model and prevent overfitting, making the algorithm more efficient at classification during solving. The objective function is:

    Obj = \sum_{i=1}^{n} l(y_i, \hat{y}_i) + \sum_{t} \Omega(f_t)    (1)

where n is the number of user samples, t indexes the decision trees, l(y_i, \hat{y}_i) is the loss function expressing the residual between the true value y_i and the predicted value \hat{y}_i, and \Omega(f_t) is the regularization penalty term.
At the t-th iteration of the objective function, the structures and parameters of the trees of the first t-1 rounds are already determined. Following forward stagewise additive modeling, the predicted value of a sample at round t equals the predicted value of the previous t-1 trees plus a new decision tree f_t(x_i), as shown in formula (2):

    \hat{y}_i^{(t)} = \hat{y}_i^{(t-1)} + f_t(x_i)    (2)

Substituting formula (2) into formula (1), the expanded objective function is expressed as formula (3):

    Obj^{(t)} = \sum_{i=1}^{n} l(y_i, \hat{y}_i^{(t-1)} + f_t(x_i)) + \Omega(f_t)    (3)
Next, a second-order Taylor expansion is applied to formula (3), as shown in formula (4):

    Obj^{(t)} \approx \sum_{i=1}^{n} [ l(y_i, \hat{y}_i^{(t-1)}) + g_i f_t(x_i) + \frac{1}{2} h_i f_t^2(x_i) ] + \Omega(f_t)    (4)

where g_i and h_i are the first-order and second-order derivatives of the loss function with respect to \hat{y}_i^{(t-1)}.
The regularization penalty term of the proposed algorithm can be expressed as:

    \Omega(f_t) = \gamma T + \frac{1}{2} \lambda \sum_{j=1}^{T} w_j^2    (5)

where \gamma is a complexity parameter, T is the number of leaf nodes, and \lambda is the penalty-degree parameter on the leaf-node weights w_j. Substituting formula (5) into formula (4) and dropping constant terms, the objective function is further rewritten as:

    Obj^{(t)} = \sum_{j=1}^{T} [ ( \sum_{i \in I_j} g_i ) w_j + \frac{1}{2} ( \sum_{i \in I_j} h_i + \lambda ) w_j^2 ] + \gamma T    (6)
in the formula (6), lambda, gamma, g i 、h i Are all known numbers, only w j As an unknown number, I j The method comprises the steps of calculating the optimal weight of a leaf node j according to the process of solving the extreme value of a unitary quadratic function of samples falling on the same leaf node j in the sample division process
To obtain the optimal division of the sample space, each time a node is split its samples are divided into two disjoint sample spaces. Let I_L, I_R be the sample spaces of the left and right subtrees respectively, with I_R + I_L = I the total sample space of the current node. The sums of the first-order and second-order gradients on the left and right sides are then expressed as:

    G_L = \sum_{i \in I_L} g_i, \quad G_R = \sum_{i \in I_R} g_i, \quad H_L = \sum_{i \in I_L} h_i, \quad H_R = \sum_{i \in I_R} h_i    (7)
Finally, the split gain is obtained by subtracting the evaluation value before splitting from the evaluation value after splitting the leaf node, and the maximum over all candidate splits gives the optimal division of the sample space:

    Divide_max = \frac{1}{2} [ \frac{G_L^2}{H_L + \lambda} + \frac{G_R^2}{H_R + \lambda} - \frac{(G_L + G_R)^2}{H_L + H_R + \lambda} ] - \gamma    (8)
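The optimal leaf weight and split-gain computation described above can be sketched numerically as follows; the gradient values, lambda and gamma below are illustrative, not taken from the patent:

```python
def leaf_weight(G, H, lam):
    """Optimal leaf weight w* = -G / (H + lambda), from the quadratic in w_j."""
    return -G / (H + lam)

def split_gain(GL, HL, GR, HR, lam, gamma):
    """Gain of splitting the current node into left/right children."""
    def score(G, H):
        return G * G / (H + lam)
    return 0.5 * (score(GL, HL) + score(GR, HR) - score(GL + GR, HL + HR)) - gamma

# Toy first/second-order gradients for five samples on one node.
g = [0.9, 0.8, -0.7, -0.6, -0.5]
h = [0.25] * 5

# Candidate split: the first two samples go left, the rest go right.
GL, HL = sum(g[:2]), sum(h[:2])
GR, HR = sum(g[2:]), sum(h[2:])

gain = split_gain(GL, HL, GR, HR, lam=1.0, gamma=0.1)
w_left = leaf_weight(GL, HL, lam=1.0)
```

A positive gain means the split improves the objective more than the complexity cost \gamma; the training loop keeps the candidate with the largest gain.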
as can be seen from the implementation process of the algorithm, in the process of iterating the objective function t each time, the first derivative g of the prediction result y (t-1) of the loss function l relative to the previous t-1 trees is solved i And second derivative h i And according to g i And h i To obtain the optimal weight and the optimal partition. Therefore, we can easily find that the calculation of the optimal weights and optimal partitions depends on g i And h i And g is i And h i The computation depends on class label y in the sample i If g is directly used in the training process i And h i The exchange is carried out, there is a risk of privacy disclosure, so the algorithm herein sets g i And h i Must be calculated by the master and encrypted using additive homomorphism i And h i Encryption, so that the passive party cannot use the derivative information to deduce the label information during the training process.
The specific implementation process of encrypting and decrypting the data of the active party and the passive party using homomorphic encryption is shown in FIG. 5:
Step 2.1, the active party calculates the first-order gradient value g_i and the second-order gradient value h_i of the sensitive data, encrypts g_i and h_i using additively homomorphic encryption, and then sends the encrypted gradient values to the passive party;
Step 2.2, the passive party bins all of its features, maps each feature value into a bucket, aggregates the corresponding encrypted gradient information according to the binned feature values, and then sends the aggregated encrypted gradient information to the active party;
Step 2.3, the active party decrypts the received aggregated encrypted gradient information to obtain the optimal division Divide_max of the current node, and returns the feature ID and threshold ID of the current node to the passive party;
Step 2.4, the passive party receives the feature ID and threshold ID and divides the total sample space I of the current node, where I_R + I_L = I and I_L, I_R are the left and right sample spaces respectively; it records the record ID, feature ID and threshold ID of the current node, and sends the record ID and the divided left sample space I_L to the active party;
Step 2.5, the active party divides the current node according to the record ID and the left sample space I_L, and proceeds to the next node;
Step 2.6, the processes (2.2) to (2.5) are iterated; after all the current decision trees have been constructed, the optimal weight w_j* of each leaf node in the decision trees is calculated, finishing the training;
Step 2.7, the active party sends the record ID of the current node and the feature's threshold to the passive party;
Step 2.8, the passive party compares the current node's feature value against the threshold to obtain a search decision and sends the search decision to the active party;
Step 2.9, the active party receives the search decision and moves to the corresponding child node, continuing until a leaf node is reached and a classification label and the label's optimal weight are obtained;
Step 2.10, the processes (2.7) to (2.9) are iterated; after all the decision trees have been traversed, a weighted summation is performed over the optimal weights corresponding to the obtained classification labels, finally yielding the label sets of normal users and malicious users.
During algorithm training, more samples are gradually added to the left sample space, which is used to divide the current node, so the value with the maximum gain is readily found, that is, the optimal division is obtained.
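The prediction phase (steps 2.7 to 2.9) can be sketched as a joint tree traversal; the node layout, feature names and values below are hypothetical, and the cross-party message exchange is reduced to a function call:

```python
# The active party walks each tree; for split nodes whose feature belongs to
# the passive party, it asks the passive party only for the branch decision,
# so raw feature values never leave their owner.

# Internal node: (owner, feature, threshold, left, right); leaf: ("leaf", w).
tree = ("passive", "clicks_per_day", 50.0,
        ("active", "account_age_days", 30.0,
         ("leaf", 0.8),       # high weight -> leans malicious
         ("leaf", -0.2)),
        ("leaf", -0.6))

active_features = {"account_age_days": 12.0}
passive_features = {"clicks_per_day": 42.0}

def passive_decision(feature, threshold):
    # The passive party reveals only which branch to take, not the raw value.
    return "left" if passive_features[feature] <= threshold else "right"

def predict(node):
    while node[0] != "leaf":
        owner, feature, threshold, left, right = node
        if owner == "active":
            go_left = active_features[feature] <= threshold
        else:
            go_left = passive_decision(feature, threshold) == "left"
        node = left if go_left else right
    return node[1]   # leaf weight; summed over all trees, then thresholded

score = predict(tree)
```

Summing such leaf weights over all trees (step 2.10) and thresholding the total yields the normal/malicious label for each user.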
Example 1
In this embodiment, a conventional federated learning framework is extended and improved in combination with the multimedia social network CyVOD, and a social network cross-platform malicious user detection framework based on vertical federated learning is built, as shown in FIG. 6. Secure and compliant multi-party data are fused for modeling and analysis, achieving high-quality detection of malicious users and thereby maintaining the ecological environment of the social network.
The whole framework is divided into four parts: a data preprocessing stage, a sample alignment stage, a federated learning stage and a data application stage.
Data preprocessing stage: in this stage, the Android mobile party (active party) and the PC website party (passive party) of CyVOD are selected as data providers, and an OSN six-tuple (video, policy, guide, notification, post, false information) metadata experimental platform is built on this basis, covering the click actions of 68 users in total: the PC end contributes 28 static user-attribute features totaling 50,898 records, and the mobile end contributes 40 dynamic user-attribute features totaling 1,076,307 records. An effective problem-handling mechanism is set up in this stage; as shown in FIG. 2, data cleaning, random sampling, data binning, value normalization and other operations are performed on the raw data of all participants, further improving the robustness of the training process.
The invention adopts the following problem-handling mechanisms in the data preprocessing stage: (1) when duplicate or missing values occur, the sample data is processed by deletion or filling operations; (2) when the class distribution is unbalanced, the sample data is randomly sampled, improving the prediction and classification performance of the model; (3) when continuous feature variables appear, the sample data is binned, i.e., the continuous feature variables are discretized, improving the stability of the model; (4) when the data dimensions differ markedly in scale, the sample data is normalized, improving the training speed and convergence of the model.
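The four problem-handling mechanisms above can be roughly sketched in plain Python. This is a hedged illustration only: the function names, the mean-filling strategy and the equal-width binning are assumptions, since the patent does not fix concrete algorithms for these steps:

```python
import random

def fill_missing(values):
    """Mechanism (1): replace missing entries (None) with the column mean."""
    known = [v for v in values if v is not None]
    mean = sum(known) / len(known)
    return [mean if v is None else v for v in values]

def random_sample(rows, k, seed=42):
    """Mechanism (2): random sampling, e.g. to rebalance a skewed class."""
    rng = random.Random(seed)
    return rng.sample(rows, k)

def equal_width_bins(values, n_bins=4):
    """Mechanism (3): discretize a continuous feature into n_bins buckets."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins or 1.0
    return [min(int((v - lo) / width), n_bins - 1) for v in values]

def min_max_normalize(values):
    """Mechanism (4): scale a feature into [0, 1]."""
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1.0
    return [(v - lo) / span for v in values]

col = [3.0, None, 9.0, 6.0]
filled = fill_missing(col)        # None -> mean of 3, 9, 6 = 6.0
bins = equal_width_bins(filled)   # 4 equal-width buckets over [3, 9]
norm = min_max_normalize(filled)  # values scaled into [0, 1]
```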
A sample alignment stage: in this stage, a secure intersection scheme combining the RSA algorithm and a hash function is adopted to map the common sample IDs of the Android mobile party and the PC website party. As shown in FIG. 3, the Android mobile party first generates a public/private key pair with the RSA algorithm and transmits the public key to the PC website party. The PC website party hashes the local data IDs with a hash function so that user IDs are never transmitted in plaintext; it then encrypts them with the public key and sends the encrypted data samples to the Android mobile party. After receiving the passive party's data samples, the Android mobile party decrypts them with its local private key, hashes its own local data with the hash function, and takes the secure intersection with the received PC website party sample data. Finally, the Android mobile party sends the matched sample IDs to the PC website party, completing the sample alignment stage.
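The alignment flow can be sketched as follows. This is a simplified illustration of the described message sequence with toy RSA parameters; the primes, user IDs and exponent are assumptions, and a real deployment would use standards-grade key sizes and a blinded RSA-PSI variant so the ciphertexts protect the IDs beyond transport:

```python
import hashlib
from math import gcd

# --- Active party (Android mobile side): toy RSA key pair.
# Tiny primes for illustration; real deployments use >= 2048-bit moduli.
p, q = 32749, 32771
n, phi = p * q, (p - 1) * (q - 1)
e = 17
assert gcd(e, phi) == 1
d = pow(e, -1, phi)                 # private exponent, never leaves this party

def h(user_id):
    """Hash mapping so user IDs never travel in plaintext."""
    return int.from_bytes(hashlib.sha256(user_id.encode()).digest(), "big") % n

# --- Passive party (PC website side): hash local IDs, encrypt with the
# received public key (n, e), and send the ciphertexts over.
passive_ids = ["alice", "bob", "carol"]
ciphertexts = [pow(h(uid), e, n) for uid in passive_ids]

# --- Active party: decrypt with the private key, hash its own IDs, and take
# the intersection of the two hashed sets.
active_ids = ["bob", "carol", "dave"]
received = {pow(c, d, n) for c in ciphertexts}
local = {h(uid): uid for uid in active_ids}
common = sorted(local[x] for x in received & set(local))
print(common)                       # the matched sample IDs
```

The matched IDs (`common`) are what the active party sends back to close the alignment stage.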
A federated learning stage: as shown in FIG. 4, model training at this stage uses encrypted parameter exchange. After the Android mobile party and the PC website party have determined the sample data common to both sides, homomorphic encryption is introduced to encrypt sensitive data, preventing leakage of data and user privacy. The detailed process is as follows:
(1) The Android mobile party first computes the gradient values, encrypts them with additively homomorphic encryption, and sends the encrypted gradient values to the PC website party.
(2) The PC website party buckets all of its features, mapping each feature value into a bucket; it then aggregates the corresponding encrypted gradient information according to the bucketed feature values and sends the aggregation result to the Android mobile party.
(3) The Android mobile party decrypts the received aggregation result, obtains the optimal split of the current node, and returns the feature ID and threshold ID of the current node to the PC website party.
(4) The PC website party uses the received feature ID and threshold ID to divide the current sample space, records the current record ID, feature ID and threshold ID, and sends the record ID together with the divided left sample space to the Android mobile party.
(5) The Android mobile party splits the current node according to the record ID and the left sample space, then proceeds to split the next node.
(6) Steps (2) to (5) repeat until all the decision trees have been constructed, and the optimal weight of each leaf node is calculated.
(7) After training is finished, the Android mobile party sends the record ID of the current node and the feature threshold to the PC website party.
(8) The PC website party compares against the threshold at the current node to obtain a lookup decision and sends it to the Android mobile party.
(9) The Android mobile party receives the lookup decision and descends to the corresponding child node, continuing until each leaf node is reached and its classification label and weight are obtained.
(10) Steps (7) to (9) repeat until all the decision trees have been traversed; the Android mobile party then carries out weighted summation on the optimal weights corresponding to the traversed classification labels and outputs the label sets of normal users and malicious users.
A data application stage: after the federated learning layer has been trained, each participant updates its local training model and outputs a prediction result. At this stage, the prediction result is transmitted back to the terminal through the data calling interface encapsulated by CyVOD; the terminal updates and classifies the local data to obtain the malicious user detection result. As shown in FIG. 6, the PC website end marks the malicious users so that an administrator can deal with them in time.
The method can also be encapsulated in hardware equipment, so that the detection result is obtained directly with the hardware and the processing result is viewed on a display screen.
Claims (2)
1. A social network cross-platform malicious user detection method based on longitudinal federal learning is characterized in that: the method comprises the following steps:
step 1, constructing a social network cross-platform malicious user detection hierarchical architecture based on longitudinal federated learning, wherein the architecture comprises a data preprocessing layer, an encryption sample alignment layer, a federated learning layer and a data application layer;
step 2, firstly selecting a plurality of participants and dividing them into an active party and a passive party, wherein the participant providing both the sample data and the label values of users serves as the active party and the participant providing only the sample data of users serves as the passive party, and carrying out a preprocessing operation on the sample data of the active party and the passive party in the data preprocessing layer to obtain structured data;
step 3, for the structured data processed by the data preprocessing layer, mapping the common sample data of the active party and the passive party at the sample alignment layer using a secure intersection scheme based on the RSA asymmetric encryption algorithm and a hash mechanism;
step 4, after the processing of step 3, the active party and the passive party have determined the sample data shared by both sides; the active party and the passive party use their local models to cooperatively train a global model in the machine learning sense, and the data of the active party and the passive party are encrypted and decrypted by homomorphic encryption to complete the training of the federated learning layer;
the specific implementation process of encrypting and decrypting the data of the active side and the passive side by using homomorphic encryption comprises the following steps:
step 4.1, the active party calculates first-order and second-order gradient values of the sensitive data, encrypts them with additively homomorphic encryption, and then sends the encrypted gradient values to the passive party;
step 4.2, the passive party buckets all of its features, maps each feature value into a bucket, aggregates the corresponding encrypted gradient information according to the bucketed feature values, and then sends the aggregated encrypted gradient information to the active party;
step 4.3, the active party decrypts the received aggregated encrypted gradient information to obtain the optimal split Divide_max of the current node, and returns the feature ID and threshold ID of the current node to the passive party;
step 4.4, the passive party receives the feature ID and the threshold ID and divides the total sample space I of the current node, where I_R + I_L = I and I_L, I_R are the left and right sample spaces respectively; it records the record ID, feature ID and threshold ID of the current node and sends the record ID and the left sample space I_L to the active party;
step 4.5, the active party divides the current node according to the record ID and the left sample space I_L, and proceeds to the division of the next node;
step 4.6, iterating the processes (4.2) to (4.5); after the construction of all the current decision trees is completed, calculating the optimal weight of each leaf node in the decision trees and finishing the training;
step 4.7, the active side sends the record ID of the current node and the threshold value of the characteristic to the passive side;
step 4.8, the passive side compares the threshold value result of the current node to obtain a search decision and sends the search decision to the active side;
step 4.9, the active side receives the search decision and starts to go to the corresponding child node until reaching a leaf node to obtain a classification label and the optimal weight of the label;
step 4.10, iterating the processes of (4.7) - (4.9), then carrying out weighted summation on optimal weights corresponding to the classification labels obtained by traversing all the decision trees, and finally obtaining label sets of normal users and malicious users;
step 5, after the federated learning layer is trained, the active party and the passive party update their respective local model training parameters and output the prediction result to the data application layer;
and step 6, encapsulating a data calling interface in the data application layer, transmitting the prediction result obtained by the federated learning layer back to each participant, and each participant updating and classifying its local data to obtain the malicious user detection result.
2. The social network cross-platform malicious user detection method based on longitudinal federated learning of claim 1, wherein: the preprocessing operation is to convert the data of each participant into structured data through data cleaning, random sampling, data binning and data normalization.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110275639.0A CN113051557B (en) | 2021-03-15 | 2021-03-15 | Social network cross-platform malicious user detection method based on longitudinal federal learning |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113051557A CN113051557A (en) | 2021-06-29 |
CN113051557B true CN113051557B (en) | 2022-11-11 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||