CN113420323B

CN113420323B - Data sharing method and terminal equipment

Info

Publication number: CN113420323B
Application number: CN202110626699.2A
Authority: CN
Inventors: 杨会峰; 郭少勇; 黄镜宇; 刘玮; 王栋; 李宁博; 王建伯; 国明; 申培培; 陈连栋; 姜丹; 程凯; 王骏; 刁首人; 韩艳美; 林静; 蔡硕
Original assignee: State Grid Blockchain Technology Beijing Co ltd; State Grid Corp of China SGCC; Beijing University of Posts and Telecommunications; Information and Telecommunication Branch of State Grid Hebei Electric Power Co Ltd
Current assignee: State Grid Blockchain Technology Beijing Co ltd; State Grid Corp of China SGCC; Beijing University of Posts and Telecommunications; Information and Telecommunication Branch of State Grid Hebei Electric Power Co Ltd
Priority date: 2021-06-04
Filing date: 2021-06-04
Publication date: 2022-06-03
Anticipated expiration: 2041-06-04
Also published as: CN113420323A

Abstract

The invention is suitable for the technical field of data interaction, and discloses a data sharing method and terminal equipment, wherein the method comprises the following steps: receiving a sharing request of a data request node, wherein the sharing request comprises model information to be requested; inquiring whether an available model related to the model information to be requested exists in the block chain network; if the available model exists in the block chain network, returning the available model to the data request node; and if no available model exists in the block chain network, selecting a target data providing node cooperating with the data request node, starting federal learning, returning a global model obtained after the federal learning is finished to the data request node, and recording the target data providing node and model parameters in the federal learning process in the block chain network. According to the invention, by combining the block chain and the federal learning, privacy protection and safety credibility in the data sharing process can be realized, data privacy can be completely ensured, and no data leakage risk exists.

Description

Data sharing method and terminal equipment

Technical Field

The invention belongs to the technical field of data interaction, and particularly relates to a data sharing method and terminal equipment.

Background

The smart city is the best means for realizing accurate and efficient urban management and management service, but with the construction of the smart city, the urban Internet of things perception system relates to massive concurrent events and multi-element heterogeneous Internet of things data, the problems of data availability, sharability, manageability, credibility and the like are increasingly prominent, and in order to realize value-added service of the Internet of things data for urban development, the data sharing and fusion are core problems which need to be solved urgently.

At present, data sharing is generally realized through a block chain technology, but the method cannot completely ensure data privacy, and the risk of data leakage still exists.

Disclosure of Invention

In view of this, embodiments of the present invention provide a data sharing method and a terminal device, so as to solve the problem that in the prior art, data privacy cannot be completely guaranteed and a data leakage risk still exists.

A first aspect of an embodiment of the present invention provides a data sharing method, including:

receiving a sharing request of a data request node, wherein the sharing request comprises model information to be requested;

inquiring whether an available model related to the model information to be requested exists in the block chain network;

if the available model exists in the block chain network, returning the available model to the data request node;

and if no available model exists in the block chain network, selecting a target data providing node cooperating with the data request node, starting federal learning, returning a global model obtained after the federal learning is finished to the data request node, and recording the target data providing node and model parameters in the federal learning process in the block chain network.

A second aspect of embodiments of the present invention provides a terminal device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and when the processor executes the computer program, the steps of the data sharing method according to the first aspect are implemented.

Compared with the prior art, the embodiment of the invention has the following beneficial effects: the embodiment of the invention receives a sharing request of a data request node, wherein the sharing request comprises model information to be requested; inquiring whether an available model related to the model information to be requested exists in the block chain network; if the available model exists in the block chain network, returning the available model to the data request node; and if no available model exists in the block chain network, selecting a target data providing node cooperating with the data request node, starting federal learning, returning a global model obtained after the federal learning is finished to the data request node, and recording the target data providing node and model parameters in the federal learning process in the block chain network. According to the embodiment of the invention, by combining the block chain and the federal learning, privacy protection and safety credibility in the data sharing process can be realized, data privacy can be completely ensured, and the risk of data leakage does not exist.

Drawings

In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the embodiments or the prior art descriptions will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without inventive exercise.

Fig. 1 is a schematic flow chart illustrating an implementation of a data sharing method according to an embodiment of the present invention;

FIG. 2 is a schematic block diagram of a data sharing apparatus according to an embodiment of the present invention;

fig. 3 is a schematic block diagram of a terminal device according to an embodiment of the present invention.

Detailed Description

In the following description, for purposes of explanation and not limitation, specific details are set forth, such as particular system structures, techniques, etc. in order to provide a thorough understanding of the embodiments of the present application. It will be apparent, however, to one skilled in the art that the present application may be practiced in other embodiments that depart from these specific details. In other instances, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present application with unnecessary detail.

In order to explain the technical means of the present invention, the following description will be given by way of specific examples.

Fig. 1 is a schematic implementation flow diagram of a data sharing method according to an embodiment of the present invention, and for convenience of description, only a part related to the embodiment of the present invention is shown. As shown in fig. 1, the method may include the steps of:

s101: and receiving a sharing request of the data request node, wherein the sharing request comprises the model information to be requested.

The embodiment of the invention provides a three-layer architecture, which comprises a local equipment layer, a block chain network layer and a cloud platform layer.

1) Local device layer: the intelligent city intelligent monitoring system is deployed at various sensing devices in all corners of a city and is responsible for acquiring and collecting internet of things data in the intelligent city. In the federal learning process, the local device trains the local model using its own data.

2) Block chain network layer: the block chain stores data identification and hash, manages data records on the chain, and stores and manages the learning process of the whole life cycle in the federal learning process.

3) Cloud platform layer: and processing a data sharing request of the data request node, and aggregating all local models into a global model in the federal learning process.

The data sharing method provided by the embodiment of the invention can be applied to a cloud platform layer. That is, the cloud platform receives a sharing request of the data requesting node. The data requesting node may be a node in the local device layer. The sharing request may include information about the global model required by the data requesting node, i.e., the model information to be requested.

S102: querying whether an available model associated with the model information to be requested exists in the blockchain network.

Because the federal learning process of the full life cycle is stored and managed in the blockchain network, the cloud platform can inquire whether a global model corresponding to the model information to be requested exists in the blockchain network, namely the available model.

S103: and if the available model exists in the blockchain network, returning the available model to the data request node.

And if the available model exists in the block chain network, directly returning the available model to the data request node, and ending the sharing request.

S104: and if no available model exists in the block chain network, selecting a target data providing node cooperating with the data request node, starting federal learning, returning a global model obtained after the federal learning is finished to the data request node, and recording the target data providing node and model parameters in the federal learning process in the block chain network.

If the available model does not exist in the block chain network, starting a new federal learning task, firstly selecting a target data providing node cooperating with the data request node through a node selection algorithm, and recording the target data providing node in the block chain network; then starting federal learning, and recording all model parameters in the learning process in the block chain network; and finally, returning the global model obtained after the federal learning is finished to the data request node, and finishing the sharing request.

The target data providing node is a node of the local device layer, and the number of the target data providing nodes may be one or multiple.

As can be seen from the above description, in the embodiments of the present invention, a sharing request of a data request node is received, where the sharing request includes model information to be requested; inquiring whether an available model related to the model information to be requested exists in the block chain network; if the available model exists in the block chain network, returning the available model to the data request node; and if no available model exists in the block chain network, selecting a target data providing node cooperating with the data request node, starting federal learning, returning a global model obtained after the federal learning is finished to the data request node, and recording the target data providing node and model parameters in the federal learning process in the block chain network. According to the embodiment of the invention, by combining the block chain and the federal learning, privacy protection and safety credibility in the data sharing process can be realized, data privacy can be completely ensured, and the risk of data leakage does not exist.

In an embodiment of the present invention, the selecting a target data providing node cooperating with the data requesting node includes:

determining a comprehensive trust value of each data providing node by the data request node;

and selecting the data providing node with the comprehensive trust value not less than the preset trust threshold value as a target data providing node.

Optionally, if the number of the data providing nodes of which the comprehensive trust value is not less than the preset trust threshold is greater than the required number, the data providing nodes of the previous required number are selected as the target data providing nodes according to the sequence from large to small of the comprehensive trust value.

In one embodiment of the present invention, determining a comprehensive trust value of a data requesting node for each data providing node comprises:

according to

Determining a comprehensive trust value CT (A, B) of a data requesting node A to a data providing node B;

wherein DT (A, B) provides a direct trust value of node B for the data request node A to the data;

n is the number of direct interactions between the data requesting node A and the data providing node B, e_BiWhen the ith direct interaction between the data request node A and the data providing node B is carried out, the data request node A provides the evaluation value, t, of the node B to the data_nowIs the current time, t_iProviding the completion time of the ith direct interaction of the node B for the data request node A and the data, wherein a is a preset constant and 0<a<1；

RT (B) provides the data request node A with the recommended trust value of the node B for the data;

n (a, C) ═ SN (a, C) -FN (a, C), in (B) is the set of all nodes that interact directly with the data providing node B, R (a, C) is the trust value of DT (C, B) for the data requesting node a, N (a, C) is the number of valid interactions of the data requesting node a with node C, and N (a) is the number of valid interactions in which the data requesting node a participates; SN (A, C) is the number of times that the data request node A and the node C cooperate to participate in data sharing successfully, FN (A, C) is the number of times that the data request node A and the node C cooperate to participate in data sharing fail;

n (A, B) provides the effective interaction times of the node B for the data request node A and the data; h_AIs a preset transaction number threshold.

The above-mentioned node C is a node except the data requesting node a among all nodes directly interacting with the data providing node B.

In the federal learning process, it cannot be guaranteed that all nodes provide good service and reliable resources. In order to ensure the data sharing quality, the embodiment of the invention provides a node selection algorithm based on the credit value, and the data request node selects the data providing node with better performance and more stable service to cooperate according to the credit value.

Based on the social network, the trust relationship between the node A and the node B is mainly established on the basis of direct interaction between the node A and the node B, and the evaluation of the node B by other nodes which have direct interaction with the node B is referred.

When the node A and the node B do not interact effectively, the trust of the node A to the node B only depends on the recommended trust; when the effective interaction times between the node A and the node B exceed a preset transaction time threshold value H set by the node A_AWhen the node A is in use, the node A can make judgment according to the direct trust value of the node A to the node B; trust of node A to node B when the number of valid interactions between node A and node B is in the middle rangeDirect trust and recommendation trust need to be combined.

In an embodiment of the present invention, after the federal learning is finished, the data sharing method further includes:

and determining the evaluation value of the data request node to the target data providing node, and recording the evaluation value of the data request node to the target data providing node in the block chain network.

In one embodiment of the present invention, determining an evaluation value of a data requesting node to a target data providing node includes:

according to

Determining a data requesting node to a target data providing node N_jIs evaluated

Wherein m is the number of the target data providing nodes, and j is more than or equal to 1 and less than or equal to m;

providing a node N for target data_jThe quote value of (a);

for sorting in descending order, the target data provides node N_jThe rank of the quote value of (a) in the quote values of all the target data providing nodes;

providing a node N for target data_jA performance evaluation value of (2);

for sorting in descending order, the target data provides node N_jThe rank of the performance evaluation value of (b) in the performance evaluation values of all target data providing nodes;

size(D_j) Providing a node N for target data_jData set D of_jIs the number of iterations of federated learning,

providing a node N for target data_jThe model quality at the kth iteration; q^kModel quality for the global model at the kth iteration, T^kFor the training duration of the global model at the kth iteration,

providing a node N for target data_jTraining duration at kth iteration; alpha, beta and gamma are all weight coefficients, 0<α<1，0<β<1，0<γ<1, and α + β + γ ═ 1.

Wherein,

the value range of (1) is (0).

The set of target data providing nodes selected by the node selection algorithm is N ═ N₁,…N_j,…,N_mD ═ D, a set of data sets of the respective target data providing nodes is set as D ═ D₁,…D_j,…,D_mThe quotation of each target data providing node is formed into a set

The federate learning occurs in one iteration, and the target data provides a node N_jThe set of model masses at each iteration is

Target data providing node N_jThe training duration at each iteration is comprised of a set of

The set of model qualities of the global model at each iteration is { Q ═ Q¹,..,Q^k,…,Q^lAnd the set of training time lengths of the global model in each iteration is T ═ T { (T)¹,..,T^k,…,T^l}。

After the federal learning is finished, performance evaluation needs to be performed on each target data providing node. In addition, whether the quotation of the node is matched with the service provided by the node is judged, and finally, the target data providing node N of the data requesting node is obtained_jThe evaluation value of (1).

The embodiment of the invention selects the federal learning node based on the credit value, and evaluates the performance of the node in the learning process so as to realize higher-quality federal learning.

It should be noted that the nodes are all nodes in federal learning, and the nodes mentioned in the subsequent optimized Raft algorithm are Raft nodes, that is, nodes in the optimized Raft algorithm.

In an embodiment of the present invention, the data sharing method further includes:

in the block chain network, an optimized Raft algorithm is adopted to keep consensus.

In one embodiment of the present invention, the above-mentioned method for maintaining consensus by using optimized Raft algorithm includes:

when the follower node cannot receive the heartbeat information of the leader node, the follower node sends a communication request to the non-leader node;

if the follower node receives the response message of the non-leader node, the follower node is converted into a candidate node and initiates a master selection request;

and if the follower node does not receive the response message of the non-leader node, stopping selecting the owner.

In the Raft algorithm, nodes are called by Remote Procedure Calls (RPCs). Communication is performed. When network partitioning occurs, isolated nodes continuously increase currentTerm (latest number) and initiate a main selection process because the isolated nodes cannot receive heartbeat messages of the leader; when the network is recovered, the currentTerm of the node is larger than that of other nodes, so that the original leader becomes a follower, the normal work of the original leader is interrupted, and the stability of the algorithm is seriously influenced.

Therefore, the embodiment of the invention is designed to add a PreVote link, and before the follower which does not receive heartbeat information initiates selection of the master role, the follower firstly sends a PreVote RPC request to other nodes to judge whether the communication with other nodes can be carried out. If the election can be started, the election request is initiated by the candidate, if the election can not be started, the election is terminated, and the network is waited to recover, so that the election is optimized.

when the follower node receives the RPC sent by the leader node, judging whether a log index value in the RPC is matched with a log index value of the follower node;

and if the log index value in the RPC is not matched with the log index value of the leader node, sending the log index value of the leader node to the leader node, so that the leader node resends the RPC according to the log index value sent by the follower node.

In the Raft algorithm, after the follower receives the leader's AppendEntries RPC, if the nextIndex in the RPC is checked to be not matched with itself, the request is rejected. When the leader receives the decline reply, it gradually decreases the nextIndex of the follower and sends AppendEntries RPC again until it finds the nextIndex that is consistent with the follower's log. This approach takes a significant amount of time to communicate, reducing the efficiency of the Raft algorithm. Wherein, the AppendEntries RPC is used for log replication or for heartbeat.

Therefore, the embodiment of the invention optimizes the operation, when the nextIndex is not matched, the follower directly sends the index value consistent with the final log to the leader, and the leader resends the Appendentries RPC according to the received nextIndex, so that the log replication optimization is realized, and the communication times when the nextIndex is not matched are effectively reduced. The nextIndex is a log index value and a log number which should be sent to the follower node next time by the leader node.

In one embodiment of the invention, consensus is maintained using an optimized Raft algorithm, comprising:

the candidate node votes for the candidate node and sends a voting message to other nodes, wherein the voting message comprises a client signature of the last agreed log item in the log list of the candidate node;

after receiving the voting message, the other nodes verify the authenticity of the signature of the client, if the signature of the client is real, the other nodes send the message signature to the candidate node, and if the signature of the client is not real, the other nodes refuse voting;

after receiving the message signatures of other nodes beyond 2/3, the candidate node generates a complete signature and sends a message containing the complete signature to the other nodes;

after receiving the message containing the complete signature, other nodes verify the validity of the complete signature, if the complete signature is legal, positive feedback information is sent to the candidate node, and if the complete signature is illegal, negative feedback information is sent to the candidate node;

when the candidate node receives more than 2/3 positive feedback information sent by other nodes, the candidate node is changed to the leader node.

In an embodiment of the present invention, the message signature is (R)_h,s_h). Wherein s is_h＝r_h+x_hα_h*e，e＝Hash(R,X,entry)，X＝∑α_hX_h，R＝∑R_h，α_h＝Hash(l,X_h)，l＝Hash(X₁,X₂,…X_H)，X_h＝x_h*G，R_h＝r_h*G，R_hIs the public key of the random number of the node h, r_hIs a random number, x, of node h_hIs the private key of node h, X_hIs the public key of the node h, G is the origin of the elliptic curve, X is the shared public key, R is the shared public key of the random number, l is the first hash value, alpha_hIs the hash value of node h, e is the second hash value, s_hAnd the entry is the message corresponding to the message signature for the calculated signature of the node h.

The complete signature is(R,s)，s＝∑s_h。

The verifying the validity of the complete signature includes: and verifying the validity of the complete signature by judging whether the equation sG is satisfied or not, wherein if the equation is satisfied, the complete signature is legal, otherwise, the complete signature is illegal.

The Raft algorithm is a non-Byzantine algorithm, is optimized based on a Musig aggregation signature scheme, and simultaneously introduces a digital signature technology to enable a message sent to a leader by a client to contain an instruction to be executed and a corresponding digital signature, so that the Byzantine fault tolerance in the processes of leader election and log replication is realized, and the safety of the algorithm can be improved.

Each Raft node has a private key x_iPublic key X_iA random number r_iAnd a random number public key R_iAnd G is the origin of the elliptic curve. All Raft nodes publish public key X_iAnd a random number public key R_iAnd calculates shared public keys X and R.

The signature of the follower on the message represents the vote for the candidate, which can be selected as the leader when the candidate receives a vote exceeding 2/3. The method specifically comprises the following steps:

in the first step, the candidate node votes for itself and sends RequestVoteRPC to other nodes, and the message contains the client signature of the last agreed log item in the candidate log list.

Secondly, after other nodes receive the message, the authenticity of the signature of the client is verified, and if the authenticity is true, the message is signed (R)_i,s_i) And sending the information to the candidate node, and otherwise rejecting the voting.

And thirdly, the candidate node acquires the complete signature(s) after collecting the partial signatures of other nodes exceeding 2/3, and sends the message attached with the complete signature to other nodes.

And fourthly, after receiving the message, other nodes verify the validity of the complete signature by judging whether the equation sG-R + Xe is true or not, if so, returning a positive feedback, and if not, returning a negative feedback.

And fifthly, the last candidate node becomes a leader after collecting positive feedback of other nodes exceeding 2/3, and the whole election process is completed.

Aiming at the problem that the leader tampers the instruction, the follower can verify the authenticity of the instruction through a client public key, and if the instruction is found to be tampered by the leader, the follower refuses to add the tampered log item and converts the tampered log item into a candidate to start a new round of leader election. Aiming at the problem of tampering the instruction by the follower, the mutual recognition of the rest non-Byzantine followers and the leader is not influenced, and in the later election process, the node cannot become the leader because the node does not possess all the logs with the mutual recognition.

In the Byzantine fault-tolerant log replication process, the signature of the follower on the message represents that the log in the AppendEntries RPC is successfully replicated, the leader receives the partial signature of the follower exceeding 2/3 and then calculates the complete signature, and the follower exceeding 2/3 verifies that the complete signature is correct, and represents that the instruction achieves consensus.

The embodiment of the invention provides an optimized Raft algorithm. The method starts from three aspects of selecting a master, copying the log and safety respectively, and improves the performance of the algorithm.

The above description shows that the embodiment of the invention provides a smart city Internet of things data lightweight credible sharing mechanism, and introduces a block chain and federal learning to realize credible data sharing and privacy protection; a federated learning node selection algorithm and a node evaluation algorithm are designed, and a basis is provided for higher-quality data sharing; and finally, the consensus efficiency of the Raft algorithm is optimized, and the requirements of the smart city on time delay and safety are met.

It should be understood that, the sequence numbers of the steps in the foregoing embodiments do not imply an execution sequence, and the execution sequence of each process should be determined by its function and inherent logic, and should not constitute any limitation to the implementation process of the embodiments of the present invention.

Corresponding to the data sharing method, an embodiment of the present invention further provides a data sharing apparatus, which has the same beneficial effects as the data sharing method. Fig. 2 is a schematic block diagram of a data sharing apparatus according to an embodiment of the present invention, and for convenience of description, only the portions related to the embodiment of the present invention are shown.

In the embodiment of the present invention, the data sharing apparatus 30 may include a receiving module 301, a querying module 302, a first processing module 303, and a second processing module 304.

The receiving module 301 is configured to receive a sharing request of a data request node, where the sharing request includes model information to be requested;

a query module 302, configured to query whether an available model associated with the model information to be requested exists in the blockchain network;

the first processing module 303 is configured to, if an available model exists in the blockchain network, return the available model to the data request node;

and the second processing module 304 is configured to, if no available model exists in the blockchain network, select a target data providing node cooperating with the data request node, start federal learning, return a global model obtained after the federal learning to the data request node, and record the target data providing node and model parameters in the federal learning process in the blockchain network.

Optionally, the second processing module 304 may be further configured to:

according to

n is the direct interaction time of the data requesting node A and the data providing node BNumber e_BiWhen the ith direct interaction between the data request node A and the data providing node B is carried out, the data request node A provides the evaluation value, t, of the node B to the data_nowIs the current time, t_iProviding the completion time of the ith direct interaction of the node B for the data request node A and the data, wherein a is a preset constant and 0<a<1；

RT (A, B) provides a recommended trust value of the node B for the data request node A to the data;

n (a, C) ═ SN (a, C) -FN (a, C), in (B) is the set of all nodes that directly interact with the data providing node B, R (a, C) is the trust value of DT (C, B) for the data requesting node a, N (a, C) is the number of valid interactions of the data requesting node a with node C, and N (a) is the number of valid interactions in which the data requesting node a participates; SN (A, C) is the number of times that the data request node A and the node C cooperate to participate in data sharing successfully, FN (A, C) is the number of times that the data request node A and the node C cooperate to participate in data sharing fail;

n (A, B) provides effective interaction times of the node B for the data request node A and the data; h_AIs a preset transaction number threshold.

Optionally, the second processing module 304 may be further configured to:

Optionally, the second processing module 304 may further be configured to:

according to

Determining a data requesting node to a target data providing node N_jEvaluation value of (2)

providing a node N for target data_jThe quote value of (a);

providing a node N for target data_jA performance evaluation value of (2);

for sorting in descending order, the target data provides node N_jThe rank of the performance evaluation value of (2) among the performance evaluation values of all the target data providing nodes;

size(D_D) Providing a node N for target data_jData set D of_jL is the number of iterations of federal learning,

Optionally, the data sharing apparatus may further include: and a consensus module.

And the consensus module is used for keeping consensus by adopting an optimized Raft algorithm in the block chain network.

Optionally, the consensus module may be further configured to:

and if the log index value in the RPC is not matched with the log index value of the leader node, sending the log index value of the leader node to the leader node so that the leader node resends the RPC according to the log index value sent by the follower node.

Optionally, the consensus module may be further configured to:

after receiving the message containing the complete signature, other nodes verify the validity of the complete signature, if the complete signature is legal, positive feedback information is sent to the candidate node, and if the complete signature is illegal, negative feedback information is sent to the candidate node; wherein,

when the candidate node receives more than 2/3 positive feedback information sent by other nodes, the candidate node transitions to the leader node.

It will be apparent to those skilled in the art that, for convenience and simplicity of description, the foregoing functional units and modules are merely illustrated in terms of division, and in practical applications, the foregoing functional allocation may be performed by different functional units and modules as needed, that is, the internal structure of the data sharing apparatus is divided into different functional units or modules to perform all or part of the above described functions. Each functional unit and module in the embodiments may be integrated in one processing unit, or each unit may exist alone physically, or two or more units are integrated in one unit, and the integrated unit may be implemented in a form of hardware, or in a form of software functional unit. In addition, specific names of the functional units and modules are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the present application. The specific working processes of the units and modules in the above-mentioned apparatus may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.

Fig. 3 is a schematic block diagram of a terminal device according to an embodiment of the present invention. As shown in fig. 3, the terminal device 40 of this embodiment includes: one or more processors 401, a memory 402, and a computer program 403 stored in the memory 402 and executable on the processors 401. The processor 401 implements the steps in the above-mentioned embodiments of the data sharing method, such as the steps S101 to S104 shown in fig. 1, when executing the computer program 403. Alternatively, the processor 401, when executing the computer program 403, implements the functions of the modules/units in the data sharing apparatus embodiment, for example, the functions of the modules 301 to 304 shown in fig. 2.

Illustratively, the computer program 403 may be partitioned into one or more modules/units that are stored in the memory 402 and executed by the processor 401 to accomplish the present application. The one or more modules/units may be a series of computer program instruction segments capable of performing specific functions, which are used for describing the execution process of the computer program 403 in the terminal device 40. For example, the computer program 403 may be divided into a receiving module, a querying module, a first processing module and a second processing module, and each module has the following specific functions:

the receiving module is used for receiving a sharing request of a data request node, wherein the sharing request comprises model information to be requested;

the query module is used for querying whether an available model associated with the model information to be requested exists in the block chain network;

the first processing module is used for returning the available model to the data request node if the available model exists in the block chain network;

and the second processing module is used for selecting a target data providing node cooperating with the data request node if no available model exists in the block chain network, starting federal learning, returning a global model obtained after the federal learning is finished to the data request node, and recording the target data providing node and model parameters in the federal learning process in the block chain network.

Other modules or units can refer to the description of the embodiment shown in fig. 2, and are not described again here.

The terminal device 40 may be a computing device such as a desktop computer, a notebook, a palm computer, and a cloud server. The terminal device 40 includes, but is not limited to, a processor 401 and a memory 402. Those skilled in the art will appreciate that fig. 3 is only one example of a terminal device 40, and does not constitute a limitation to the terminal device 40, and may include more or less components than those shown, or combine some components, or different components, for example, the terminal device 40 may further include an input device, an output device, a network access device, a bus, etc.

The Processor 401 may be a Central Processing Unit (CPU), other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other Programmable logic device, discrete Gate or transistor logic device, discrete hardware component, or the like. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.

The storage 402 may be an internal storage unit of the terminal device 40, such as a hard disk or a memory of the terminal device 40. The memory 402 may also be an external storage device of the terminal device 40, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), and the like, which are provided on the terminal device 40. Further, the memory 402 may also include both an internal storage unit of the terminal device 40 and an external storage device. The memory 402 is used for storing the computer program 403 and other programs and data required by the terminal device 40. The memory 402 may also be used to temporarily store data that has been output or is to be output.

In the above embodiments, the descriptions of the respective embodiments have respective emphasis, and reference may be made to the related descriptions of other embodiments for parts that are not described or illustrated in a certain embodiment.

Those of ordinary skill in the art will appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.

In the embodiments provided in the present application, it should be understood that the disclosed data sharing apparatus and method may be implemented in other ways. For example, the above-described data sharing apparatus embodiments are merely illustrative, and for example, the division of the modules or units is only one logical division, and there may be other divisions when actually implemented, for example, a plurality of units or components may be combined or may be integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.

The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.

In addition, functional units in the embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.

The integrated modules/units, if implemented in the form of software functional units and sold or used as separate products, may be stored in a computer readable storage medium. Based on such understanding, all or part of the flow in the method of the embodiments described above can be realized by a computer program, which can be stored in a computer-readable storage medium and can realize the steps of the embodiments of the methods described above when the computer program is executed by a processor. Wherein the computer program comprises computer program code, which may be in the form of source code, object code, an executable file or some intermediate form, etc. The computer-readable medium may include: any entity or device capable of carrying the computer program code, recording medium, U.S. disk, removable hard disk, magnetic diskette, optical disk, computer Memory, Read-Only Memory (ROM), Random Access Memory (RAM), electrical carrier wave signal, telecommunications signal, and software distribution medium, etc. It should be noted that the computer readable medium may contain other components which may be suitably increased or decreased as required by legislation and patent practice in jurisdictions, for example, in some jurisdictions, computer readable media which may not include electrical carrier signals and telecommunications signals in accordance with legislation and patent practice.

The above-mentioned embodiments are only used for illustrating the technical solutions of the present application, and not for limiting the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; such modifications and substitutions do not substantially depart from the spirit and scope of the embodiments of the present application and are intended to be included within the scope of the present application.

Claims

1. A method for sharing data, comprising:

inquiring whether an available model related to the model information to be requested exists in a block chain network;

if the available model does not exist in the block chain network, selecting a target data providing node cooperating with the data request node, starting federal learning, returning a global model obtained after the federal learning is finished to the data request node, and recording the target data providing node and model parameters in the federal learning process in the block chain network;

the selecting of the target data providing node cooperating with the data requesting node includes:

determining a comprehensive trust value of the data request node on each data providing node;

selecting a data providing node with a comprehensive trust value not less than a preset trust threshold value as a target data providing node;

the determining the comprehensive trust value of the data request node for each data providing node comprises:

according to

wherein DT (A, B) provides the direct trust value of node B for the data request node A to the data;

n is the number of direct interactions between the data requesting node A and the data providing node B, e_BiWhen the ith direct interaction between the data request node A and the data providing node B is carried out, the data request node A provides the evaluation value, t, of the node B to the data_nowIs the current time, t_iProviding the completion time of the ith direct interaction of the node B for the data request node A and the data, wherein a is a preset constant and is more than 0 and less than 1;

n (a, C) ═ SN (a, C) -FN (a, C), in (B) is the set of all nodes that directly interact with the data providing node B, R (a, C) is the trust value of DT (C, B) for the data requesting node a, N (a, C) is the number of valid interactions of the data requesting node a with node C, and N (a) is the number of valid interactions in which the data requesting node a participates; SN (A, C) isThe number of times that the data request node A and the node C cooperate to participate in data sharing is successful, and FN (A, C) is the number of times that the data request node A and the node C cooperate to participate in data sharing is failed;

n (A, B) provides the effective interaction times of the node B for the data request node A and the data; h_AAnd the preset transaction time threshold value is obtained.

2. The data sharing method according to claim 1, wherein after the federal learning is finished, the data sharing method further comprises:

3. The data sharing method according to claim 2, wherein the determining the evaluation value of the data requesting node to the target data providing node includes:

according to

providing a node N for target data_jThe quote value of (a);

for sorting in descending order, the target data provides node N_jThe rank of the quote value of (a) among the quote values of all the targeted data providing nodes;

providing a node N for target data_jPerformance evaluation value of (2);

size(D_j) Providing a node N for target data_jData set D of_jL is the number of iterations of federal learning,

providing a node N for target data_jTraining duration at kth iteration; alpha, beta and gamma are weight coefficients, alpha is more than 0 and less than 1, beta is more than 0 and less than 1, gamma is more than 0 and less than 1, and alpha + beta + gamma is equal to 1.

4. A data sharing method according to any one of claims 1 to 3, further comprising:

5. The data sharing method of claim 4, wherein the employing the optimized Raft algorithm to maintain consensus comprises:

when the follower node cannot receive the heartbeat information of the leader node, the follower node sends a communication request to a non-leader node;

6. The data sharing method of claim 4, wherein the employing the optimized Raft algorithm to maintain consensus comprises:

7. The data sharing method of claim 4, wherein the employing the optimized Raft algorithm to maintain consensus comprises:

after receiving the voting message, the other nodes verify the authenticity of the client signature, if the client signature is true, the other nodes send the message signature to the candidate node, and if the client signature is not true, the other nodes refuse voting;

after receiving message signatures of other nodes exceeding 2/3, the candidate node generates a complete signature and sends messages containing the complete signature to other nodes;

when the candidate node receives more than 2/3 positive feedback information sent by other nodes, the candidate node transitions to a leader node.

8. A terminal device comprising a memory, a processor and a computer program stored in the memory and executable on the processor, characterized in that the processor implements the steps of the data sharing method according to any one of claims 1 to 7 when executing the computer program.