WO2020107350A1 - Node management method and apparatus for blockchain system, and storage device - Google Patents

Node management method and apparatus for blockchain system, and storage device Download PDF

Info

Publication number
WO2020107350A1
WO2020107350A1 PCT/CN2018/118290 CN2018118290W WO2020107350A1 WO 2020107350 A1 WO2020107350 A1 WO 2020107350A1 CN 2018118290 W CN2018118290 W CN 2018118290W WO 2020107350 A1 WO2020107350 A1 WO 2020107350A1
Authority
WO
WIPO (PCT)
Prior art keywords
node
blockchain system
feature representation
data
feature
Prior art date
Application number
PCT/CN2018/118290
Other languages
French (fr)
Chinese (zh)
Inventor
袁振南
朱鹏新
Original Assignee
区链通网络有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 区链通网络有限公司 filed Critical 区链通网络有限公司
Priority to PCT/CN2018/118290 priority Critical patent/WO2020107350A1/en
Priority to CN201880002421.4A priority patent/CN109964452B/en
Publication of WO2020107350A1 publication Critical patent/WO2020107350A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/30Decision processes by autonomous network management units using voting and bidding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L9/00Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols
    • H04L9/50Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols using hash chains, e.g. blockchains or hash trees

Definitions

  • the present application relates to the field of network communication technology, and in particular, to a node management method, device, and storage device of a blockchain system.
  • each node server provides computing power and resources for the entire system, which forms the basis of the entire blockchain system.
  • the inventor of the present application found that due to the different state of the nodes in the blockchain system and the computing power they can provide, there may be various abnormal and malicious nodes; there are also nodes that provide long-term stable services . Therefore, it is necessary to manage the access of new nodes, the removal of anomalies and malicious nodes, as well as the promotion of server node permissions for long-term stable service, and perform role conversion and permission adjustment between different role permission nodes.
  • the technical problem mainly solved by the present application is to provide a node management method, device and storage device of the blockchain system, which can realize automatic management and control of the nodes of the blockchain system.
  • a technical solution adopted by the present application is: to provide a node management method of a blockchain system, the method includes obtaining characteristic data of each node; using the characteristic data to obtain a node characteristic representation of each node; using The node feature representation obtains the overall feature representation of the blockchain system; based on the reinforcement learning algorithm, the node feature representation and the overall feature representation are used to train the control strategy, and the control strategy is multiple; a consensus-based voting is performed on multiple control strategies to determine the The control strategy of the node, and manage the node according to the control strategy.
  • a technical solution adopted by the present application is to provide a node management device of a blockchain system, wherein the device includes a processor, and the processor is used to obtain characteristic data of each node; The data obtains the node feature representation of each node; uses the node feature representation to obtain the overall feature representation of the blockchain system; based on the reinforcement learning algorithm, the node feature representation and the overall feature representation are used to train the control strategy, and the control strategy is multiple; for multiple The control strategy conducts consensus-based voting to determine the control strategy of the node and manage the node according to the control strategy.
  • a node management device of a blockchain system wherein the device includes a data collection module for acquiring characteristic data of each node; the first feature Representation module, used to obtain the node feature representation of each node using feature data; second feature representation module, used to obtain the overall feature representation of the blockchain system using node feature representation; management strategy module, used to utilize nodes based on reinforcement learning algorithm
  • the feature representation and the overall feature representation training result in regulation strategies, and there are multiple regulation strategies; the voting module is used to conduct consensus-based voting on multiple regulation strategies, determine the regulation strategy of the node, and manage the nodes according to the regulation strategy.
  • another technical solution adopted by the present application is to provide a device with a storage function, wherein the device stores a program, and when the program is executed, the above node management of the blockchain system is implemented method.
  • this application provides a node management method of the blockchain system, which uses a reinforcement learning algorithm to train the regulation strategy, so that there is no need to set up a central management node to control the area
  • the access, removal and change of authority in the blockchain system are automatically controlled.
  • FIG. 1 is a schematic flowchart of a first embodiment of a node management method of a blockchain system of this application;
  • FIG. 2 is a schematic structural diagram of a first embodiment of a blockchain system of this application.
  • FIG. 3 is a schematic structural diagram of a first embodiment of a node management device of a blockchain system of this application;
  • FIG. 4 is a schematic structural diagram of a second embodiment of a node management device of a blockchain system of the present application.
  • FIG. 5 is a schematic structural diagram of a first embodiment of a device with a storage function according to the present application.
  • This application provides a node management method for a blockchain system. By using machine learning algorithms to extract node feature data and training regulation strategies, the entire system can implement autonomous node authority regulation.
  • FIG. 1 is a schematic flowchart of a first embodiment of a node management method of a blockchain system of the present application.
  • the node management method includes the following steps:
  • the characteristic data is data that reflects the characteristics of the performance and status of the node.
  • it may be one or more of physical hardware data, network data, operating status data, log data, and task assignment data between nodes.
  • a predetermined algorithm is used to process the acquired feature data to obtain its vector representation, which is a node feature representation.
  • the collected node data and neighbor node data are used to update and train the system feature model to obtain the feature representation of the current system state, which is the overall feature representation.
  • reinforcement learning is that agents (agents) learn in a "trial and error” manner, and reward guidance behaviors obtained through interaction with the environment, and the goal is to make the agents get the most rewards.
  • Reinforcement learning is different from supervised learning in connectionism learning, which is mainly expressed in teacher signals.
  • the reinforcement signal provided by the environment in reinforcement learning is an evaluation of the quality of the action (usually a scalar signal), rather than telling reinforcement Learning system RLS (reinforcement learning) how to produce correct action. Because the external environment provides little information, RLS must learn from its own experience. In this way, RLS acquires knowledge in the action-evaluation environment and improves the action plan to adapt to the environment. Through reinforcement learning, the optimal control strategy can be trained.
  • S105 Conduct consensus-based voting on multiple regulation strategies, determine the regulation strategy of the node, and manage the node according to the regulation strategy.
  • a lower-level role node corresponds to multiple upper-level role nodes
  • different upper-level role nodes will derive multiple control strategies.
  • the results of the node control strategy generated by the control model will be voted based on consensus to determine The control strategy of the node, and then adjust the reputation value of the node according to the control strategy to control the node access, removal or role role adjustment between nodes with different roles, to achieve the management of the node.
  • This embodiment adopts a reinforcement learning algorithm to train the control strategy, so that there is no need to set up a central management node to automatically control and control the access, removal and permission changes of the nodes in the blockchain system.
  • FIG. 2 is a schematic structural diagram of the first embodiment of the blockchain system of the present application.
  • the blockchain system includes node A, node B, and node C as examples, but it is not limited to this architecture.
  • a data collection unit is provided in each node server of the blockchain system, and the data collection unit may be used to collect and/or report characteristic data. Specifically, it is used to collect characteristic data of this node, collect characteristic data of neighbor nodes reported to this node, and collect characteristic data of lower role nodes reported to this node. At the same time, the collected characteristic data can be reported to the corresponding upper role nodes and neighbor nodes.
  • the collected characteristic data reported to the local node will also be reported. That is, cross-level and cross-region feature data will not be reported repeatedly, but will be reported step by step or point by point through the upper role node or neighbor node closest to the node.
  • the blockchain system is a hierarchical blockchain system.
  • the hierarchical blockchain system refers to a blockchain system composed of blockchain nodes with different roles and permissions. The access of the nodes , The rejection and the conversion of node role permissions are jointly determined by the upper role nodes.
  • each node server can be divided into upper role nodes and lower role nodes.
  • the upper role nodes can manage the lower role nodes; there can be multiple upper role nodes, and one upper role node can manage multiple lower role nodes, A lower role node can also be managed by multiple upper role nodes. Role conversion and/or permission adjustment can be performed between different role permission nodes in different tasks or in different time periods.
  • a feature representation unit is set in the upper role node, which is responsible for automatically extracting dynamic data features using a decentralized training algorithm and converting into high-dimensional state feature representation.
  • the feature representations of training nodes, regional features and system features are calculated layer by layer.
  • the collected feature data is processed, and the decentralized graph algorithm is used to train the feature data to obtain a node feature representation.
  • Decentralization means that in a system with many nodes distributed, each node has a highly autonomous feature. The nodes can be freely connected to each other to form a new connection unit. Any node may become a phased center, but it does not have a mandatory central control function. The influence between nodes will form a non-linear causality through the network. That is, an open, flat, and equal system phenomenon or structure is formed. To achieve decentralization (distributed) on the blockchain technology, this depends on the consensus algorithm. The consensus algorithm solves the process of reaching a consensus on a proposal (Proposal) to ensure that the system meets different degrees of consistency.
  • the decentralized deep learning algorithm is used to train the regional feature model on the collected feature data of each node and neighbor nodes to obtain the regional feature representation of the blockchain system.
  • the collected feature data of each node and neighbor nodes and the regional feature representation are used to train and update the system feature model to obtain the feature representation of the current system state, that is, the overall feature representation.
  • a management strategy unit is also provided in the upper role node, which is used to train a control strategy based on reinforcement learning algorithm using node feature representation and overall feature representation training.
  • the reinforcement learning algorithm is mainly composed of agents and the environment, specifically an agent (Agent) to take action (Action) to change their state (State) to get rewards (Reward) and environment (Environment) Cycle process.
  • the environment refers to the object (such as a node server) where the agent is acting, and the agent represents the RL algorithm.
  • the environment first sends a state to the agent, and then takes action based on its knowledge to respond to the state. After that, the environment sends a pair of next states and rewards the agent.
  • the agent will update its knowledge with the rewards returned by the environment to evaluate its final actions. Its strategy depends entirely on the current status (Only present Matters).
  • Reinforcement learning algorithms include Q-learning, sarsa, deep Q Network, policy Gradient, Actor Critic, etc.
  • the regulation strategy model is updated and trained based on the preset target function using the current system state and the current state feature representation of each node.
  • the preset objective function is used to evaluate and measure the current node and its task status.
  • the current system status includes the regional system status and the overall system status.
  • the control strategy may be to eliminate node A1 while increasing the task sharing of nodes A2 and A3.
  • the regulation strategy for the overall environment of task 1 is obtained, for example, the regulation strategy may be to eliminate the node A1 , While increasing the task sharing of nodes A2 and B2.
  • a consensus-based vote will be made on the node control strategy result generated by the control model and the transaction is recorded.
  • the system of task 1 there is also an upper role node B.
  • Lower role nodes A1 and B2 will also report to node B when they report node A; then node B will also be trained to obtain a regulation for nodes A1 and B2
  • Strategies, such as regulation and control strategies can eliminate node A1, reduce the task sharing of node B2, and increase the task sharing of node B1.
  • you will get two different control strategies then you need to vote on the control strategy based on consensus to confirm the final control strategy.
  • the nodes are regulated according to the confirmed regulation strategy.
  • the node changes its state.
  • the upper role node evaluates whether it receives a strategy reward through the throughput of the entire system, the speed of block generation, etc., and then allows the upper node to update its knowledge with the rewards returned by the environment to evaluate its subsequent actions in this cycle. In this way, the optimal control strategy can be trained.
  • the present application also provides a node management device of the blockchain system.
  • FIG. 3 is a schematic structural diagram of a first embodiment of the node management device of the blockchain system of the present application.
  • the node management device 30 of the blockchain system includes a processor 301, which is used to obtain the characteristic data of each node; use the characteristic data to obtain the node characteristic representation of each node; use the node characteristic representation to obtain the block
  • the overall feature representation of the chain system based on the reinforcement learning algorithm, the node feature representation and the overall feature representation are used to train the control strategy, and the control strategy is multiple; a consensus-based voting is performed on multiple control strategies to determine the control strategy of the node, and Manage the nodes according to the control strategy.
  • the processor 301 is specifically configured to obtain a regional and/or overall environment regulation strategy based on the reinforcement learning algorithm and the preset objective function, using node feature representation and overall feature representation training.
  • the processor 301 is specifically configured to train the feature data using a decentralized graph algorithm to obtain a node feature representation.
  • the processor 301 is specifically used to train a regional feature model on the collected feature data of each node and neighbor nodes using a decentralized deep learning algorithm to obtain a regional feature representation of the blockchain system.
  • the processor 301 is specifically configured to use the collected feature data and regional feature representations of each node and neighbor nodes to train a system model to obtain an overall feature representation.
  • the node management device 30 of the blockchain system can be used to execute the node management method of the above-mentioned blockchain system to manage the nodes of the blockchain system, and has corresponding beneficial effects.
  • the device may be an independent device independent of the server, or a certain module in the server, or a certain processing unit.
  • FIG. 4 is a schematic structural diagram of a second embodiment of a node management device of a blockchain system of the present application.
  • the node management device 40 of the blockchain system is a certain module in the server, which specifically includes a data collection module 401, a first feature representation module 402, a second feature representation module 403, a management and control strategy module 404, and a vote Module 405.
  • the data collection module 401 is used to obtain characteristic data of each node.
  • the first feature representation module 402 is used to obtain the node feature representation of each node using the feature data.
  • the second feature representation module 403 is used to obtain the overall feature representation of the blockchain system using the node feature representation.
  • the management and control strategy module 404 is used to train a control strategy based on a reinforcement learning algorithm using node feature representation and overall feature representation training, and there are multiple control strategies.
  • the voting module 405 is used to conduct consensus-based voting on multiple regulation strategies, determine the regulation strategy of the node, and manage the node according to the regulation strategy.
  • the node management device 40 of the blockchain system can be used to execute the node management method of the blockchain system described above, manage the nodes of the blockchain system, and have corresponding beneficial effects.
  • the specific process please refer to the description of the above embodiment, I will not repeat them here.
  • FIG. 5 is a schematic structural diagram of a first embodiment of a device with a storage function in the present application.
  • the storage device 50 stores a program 501, and the program 501 is one or more.
  • the program 501 is executed, the node management method of the blockchain system described above is implemented.
  • the specific working process is the same as in the above method embodiment, so it will not be repeated here.
  • devices with storage function can be portable storage media such as U disk, optical disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk, etc.
  • the medium storing the program code may also be a terminal, server, or the like.
  • this application provides a node management method of the blockchain system.
  • decentralized machine learning algorithms for feature training and reinforcement learning algorithms for regulation strategy training there is no need to set up another central management node to control the block.
  • the access, removal and change of authority of the nodes in the chain system are automatically managed and controlled.
  • the disclosed system, device, and method may be implemented in other ways.
  • the device implementation described above is only schematic.
  • the division of the module or unit is only a division of logical functions.
  • the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
  • the above integrated unit may be implemented in the form of hardware or software functional unit.
  • the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium.
  • the technical solution of the present application essentially or part of the contribution to the existing technology or all or part of the technical solution can be embodied in the form of a software product, the computer software product is stored in a storage medium It includes several instructions to enable a computer device (which may be a personal computer, server, or network device, etc.) or processor to execute all or part of the steps of the methods described in the embodiments of the present application.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Disclosed are a node management method and apparatus for a blockchain system, and a storage device. The method comprises: acquiring characteristic data of each node; acquiring node characteristic representation of each node by using the characteristic data; acquiring overall characteristic representation of a blockchain system by using the node characteristic representation; performing training on the basis of the reinforcement learning algorithm by using the node characteristic representation and the overall characteristic representation to obtain control strategies, there being a plurality of control strategies; and performing consensus-based voting on the plurality of control strategies to determine the control strategy of each node and managing each node according to the control strategy. In such way, the present application can implement the automatic management and control of nodes of the blockchain system.

Description

一种区块链系统的节点管理方法、装置及存储装置Node management method, device and storage device of block chain system 【技术领域】【Technical Field】
本申请涉及网络通信技术领域,特别是涉及一种区块链系统的节点管理方法、装置及存储装置。The present application relates to the field of network communication technology, and in particular, to a node management method, device, and storage device of a blockchain system.
【背景技术】【Background technique】
在区块链系统中各节点服务器为整个系统提供算力、资源,构成了整个区块链系统的基础。本申请的发明人在长期的研究中,发现由于区块链系统中节点的状态和所能提供的算力不尽相同,可能存在各种异常及恶意节点;同时也存在长期稳定提供服务的节点。因此,需要管理新节点的接入、异常及恶意节点的剔除以及对长期稳定服务的服务器节点权限进行提升,在不同角色权限节点之间进行角色转换和权限调整。In the blockchain system, each node server provides computing power and resources for the entire system, which forms the basis of the entire blockchain system. In the long-term research, the inventor of the present application found that due to the different state of the nodes in the blockchain system and the computing power they can provide, there may be various abnormal and malicious nodes; there are also nodes that provide long-term stable services . Therefore, it is necessary to manage the access of new nodes, the removal of anomalies and malicious nodes, as well as the promotion of server node permissions for long-term stable service, and perform role conversion and permission adjustment between different role permission nodes.
【发明内容】[Invention content]
本申请主要解决的技术问题是提供一种区块链系统的节点管理方法、装置及存储装置,能够实现区块链系统节点的自动管控。The technical problem mainly solved by the present application is to provide a node management method, device and storage device of the blockchain system, which can realize automatic management and control of the nodes of the blockchain system.
为解决上述技术问题,本申请采用的一个技术方案是:提供一种区块链系统的节点管理方法,所述方法包括获取各节点的特征数据;利用特征数据获取各节点的节点特征表示;利用节点特征表示获取区块链系统的整体特征表示;基于强化学习算法利用节点特征表示和整体特征表示训练得出调控策略,调控策略为多个;对多个调控策略进行基于共识的投票,确定该节点的调控策略,并按照调控策略对节点进行管理。In order to solve the above technical problems, a technical solution adopted by the present application is: to provide a node management method of a blockchain system, the method includes obtaining characteristic data of each node; using the characteristic data to obtain a node characteristic representation of each node; using The node feature representation obtains the overall feature representation of the blockchain system; based on the reinforcement learning algorithm, the node feature representation and the overall feature representation are used to train the control strategy, and the control strategy is multiple; a consensus-based voting is performed on multiple control strategies to determine the The control strategy of the node, and manage the node according to the control strategy.
为解决上述技术问题,本申请采用的一个技术方案是:提供一种区块链系统的节点管理装置,其中,所述装置包括处理器,所述处理器用于获取各节点的特征数据;利用特征数据获取各节点的节点特征表示;利用节点特征表示获取区块链系统的整体特征表示;基于强化学习算法利用节点特征表示和整体特征表示训练得出调控策略,调控策略为多个;对多个调控策略进行基于共识的投票,确定该节点的调控策略,并按照调控策略对节点进行管理。In order to solve the above technical problems, a technical solution adopted by the present application is to provide a node management device of a blockchain system, wherein the device includes a processor, and the processor is used to obtain characteristic data of each node; The data obtains the node feature representation of each node; uses the node feature representation to obtain the overall feature representation of the blockchain system; based on the reinforcement learning algorithm, the node feature representation and the overall feature representation are used to train the control strategy, and the control strategy is multiple; for multiple The control strategy conducts consensus-based voting to determine the control strategy of the node and manage the node according to the control strategy.
为解决上述技术问题,本申请采用的另一个技术方案是:提供一种区块链系统的节点管理装置,其中,所述装置包括数据收集模块,用于获取各节点的特征数据;第一特征表示模块,用于利用特征数据获取各节点的节点特征表示;第二特征表示模块,用于利用节点特征表示获取区块链系统的整体特征表示;管控策略模块,用于基于强化学习算法利用节点特征表示和整体特征表示训练得出调控策略,调控策略为多个;投票模块,用于对多个调控策略进行基于共识的投票,确定该节点的调控策略,并按照调控策略对节点进行管理。In order to solve the above technical problems, another technical solution adopted by the present application is to provide a node management device of a blockchain system, wherein the device includes a data collection module for acquiring characteristic data of each node; the first feature Representation module, used to obtain the node feature representation of each node using feature data; second feature representation module, used to obtain the overall feature representation of the blockchain system using node feature representation; management strategy module, used to utilize nodes based on reinforcement learning algorithm The feature representation and the overall feature representation training result in regulation strategies, and there are multiple regulation strategies; the voting module is used to conduct consensus-based voting on multiple regulation strategies, determine the regulation strategy of the node, and manage the nodes according to the regulation strategy.
为解决上述技术问题,本申请采用的另一个技术方案是:提供一种具有存储功能的装置,其中,所述装置存储有程序,所述程序被执行时实现上述的区块链系统的节点管理方法。In order to solve the above technical problems, another technical solution adopted by the present application is to provide a device with a storage function, wherein the device stores a program, and when the program is executed, the above node management of the blockchain system is implemented method.
本申请的有益效果是:区别于现有技术的情况,本申请提供一种区块链系统的节点管理方法,该方法通过利用强化学习算法训练调控策略,使得无需另设中心管控节点来对区块链系统中节点的接入、剔除、权限的变更等进行自动管控。The beneficial effects of this application are: different from the situation in the prior art, this application provides a node management method of the blockchain system, which uses a reinforcement learning algorithm to train the regulation strategy, so that there is no need to set up a central management node to control the area The access, removal and change of authority in the blockchain system are automatically controlled.
【附图说明】【Explanation】
图1是本申请区块链系统的节点管理方法第一实施方式的流程示意图;FIG. 1 is a schematic flowchart of a first embodiment of a node management method of a blockchain system of this application;
图2是本申请区块链系统第一实施方式的结构示意图;2 is a schematic structural diagram of a first embodiment of a blockchain system of this application;
图3是本申请本申请区块链系统的节点管理装置第一实施方式的结构示意图;3 is a schematic structural diagram of a first embodiment of a node management device of a blockchain system of this application;
图4是本申请区块链系统的节点管理装置第二实施方式的结构示意图;4 is a schematic structural diagram of a second embodiment of a node management device of a blockchain system of the present application;
图5是本申请具有存储功能的装置第一实施方式的结构示意图。5 is a schematic structural diagram of a first embodiment of a device with a storage function according to the present application.
【具体实施方式】【detailed description】
为使本申请的目的、技术方案及效果更加清楚、明确,以下参照附图并举实施例对本申请进一步详细说明。In order to make the purpose, technical solutions and effects of the present application clearer and clearer, the present application will be described in further detail below with reference to the accompanying drawings and examples.
本申请提供一种区块链系统的节点管理方法,通过利用机器学习算 法提取节点特征数据,并训练调控策略,使得整个系统能够实现自治的节点权限调控。This application provides a node management method for a blockchain system. By using machine learning algorithms to extract node feature data and training regulation strategies, the entire system can implement autonomous node authority regulation.
请参阅图1,图1是本申请区块链系统的节点管理方法第一实施方式的流程示意图。在该实施方式中,节点管理方法包括如下步骤:Please refer to FIG. 1. FIG. 1 is a schematic flowchart of a first embodiment of a node management method of a blockchain system of the present application. In this embodiment, the node management method includes the following steps:
S101:获取各节点的特征数据。S101: Obtain the characteristic data of each node.
其中,特征数据为体现本节点性能、状态等特性的数据。例如可以是物理硬件数据、网络数据、运行状态数据、日志数据、节点间任务分配数据中的一种或多种。Among them, the characteristic data is data that reflects the characteristics of the performance and status of the node. For example, it may be one or more of physical hardware data, network data, operating status data, log data, and task assignment data between nodes.
S102:获取各节点的节点特征表示。S102: Obtain a node feature representation of each node.
其中,利用预定算法对获取的特征数据进行处理得到其向量表示,即为节点特征表示。Among them, a predetermined algorithm is used to process the acquired feature data to obtain its vector representation, which is a node feature representation.
S103:获取区块链系统的整体特征表示。S103: Obtain the overall feature representation of the blockchain system.
其中,利用收集到的节点数据和邻居节点数据对系统特征模型进行更新训练,得到当前系统状态的特征表示,即为整体特征表示。Among them, the collected node data and neighbor node data are used to update and train the system feature model to obtain the feature representation of the current system state, which is the overall feature representation.
S104:基于强化学习算法利用节点特征表示和整体特征表示训练得出调控策略,调控策略为多个。S104: Based on the reinforcement learning algorithm, the node feature representation and the overall feature representation are used to train to obtain a control strategy, and there are multiple control strategies.
其中,强化学习是智能体(Agent)以“试错”的方式进行学习,通过与环境进行交互获得的奖赏指导行为,目标是使智能体获得最大的奖赏。强化学习不同于连接主义学习中的监督学习,主要表现在教师信号上,强化学习中由环境提供的强化信号是对产生动作的好坏作一种评价(通常为标量信号),而不是告诉强化学习系统RLS(reinforcement learning system)如何去产生正确的动作。由于外部环境提供的信息很少,RLS必须靠自身的经历进行学习。通过这种方式,RLS在行动-评价的环境中获得知识,改进行动方案以适应环境。通过强化学习能够训练出最优的调控策略。Among them, reinforcement learning is that agents (agents) learn in a "trial and error" manner, and reward guidance behaviors obtained through interaction with the environment, and the goal is to make the agents get the most rewards. Reinforcement learning is different from supervised learning in connectionism learning, which is mainly expressed in teacher signals. The reinforcement signal provided by the environment in reinforcement learning is an evaluation of the quality of the action (usually a scalar signal), rather than telling reinforcement Learning system RLS (reinforcement learning) how to produce correct action. Because the external environment provides little information, RLS must learn from its own experience. In this way, RLS acquires knowledge in the action-evaluation environment and improves the action plan to adapt to the environment. Through reinforcement learning, the optimal control strategy can be trained.
S105:对多个调控策略进行基于共识的投票,确定该节点的调控策略,并按照调控策略对节点进行管理。S105: Conduct consensus-based voting on multiple regulation strategies, determine the regulation strategy of the node, and manage the node according to the regulation strategy.
其中,如果一个下层角色节点对应多个上层角色节点,不同的上层角色节点会得出多个调控策略,在得出调控策略后会对管控模型产生的 节点调控策略结果进行基于共识的投票,确定该节点的调控策略,然后按照调控策略对节点的信誉值进行调整以控制节点的接入、剔除或不同角色节点间的角色权限调整,实现对节点的管理。Among them, if a lower-level role node corresponds to multiple upper-level role nodes, different upper-level role nodes will derive multiple control strategies. After the control strategy is obtained, the results of the node control strategy generated by the control model will be voted based on consensus to determine The control strategy of the node, and then adjust the reputation value of the node according to the control strategy to control the node access, removal or role role adjustment between nodes with different roles, to achieve the management of the node.
该实施方式通过强化学习算法训练调控策略,使得无需另设中心管控节点来对区块链系统中节点的接入、剔除和权限的变更进行自动管控。This embodiment adopts a reinforcement learning algorithm to train the control strategy, so that there is no need to set up a central management node to automatically control and control the access, removal and permission changes of the nodes in the blockchain system.
请参阅图2,图2是本申请区块链系统第一实施方式的结构示意图。在该实施方式中,以区块链系统包括节点A、节点B、节点C为例进行说明,但不仅限于这一种架构。其中,在该实施方式中,区块链系统各节点服务器中设置有数据收集单元,数据收集单元可用于收集和/或上报特征数据。具体用于收集本节点的特征数据,收集上报至本节点的邻居节点的特征数据,收集上报至本节点的下层角色节点的特征数据。同时可以将收集到的特征数据上报至对应的上层角色节点和邻居节点。具体地,上报时,除了上报本节点的特征数据,还会一并上报所收集的上报至本节点的特征数据。即对于跨层级、跨区域的特征数据不会重复上报,而是经由距离该节点最近的上层角色节点或邻居节点进行逐级或逐点上报。Please refer to FIG. 2, which is a schematic structural diagram of the first embodiment of the blockchain system of the present application. In this embodiment, the blockchain system includes node A, node B, and node C as examples, but it is not limited to this architecture. In this embodiment, a data collection unit is provided in each node server of the blockchain system, and the data collection unit may be used to collect and/or report characteristic data. Specifically, it is used to collect characteristic data of this node, collect characteristic data of neighbor nodes reported to this node, and collect characteristic data of lower role nodes reported to this node. At the same time, the collected characteristic data can be reported to the corresponding upper role nodes and neighbor nodes. Specifically, when reporting, in addition to reporting the characteristic data of the local node, the collected characteristic data reported to the local node will also be reported. That is, cross-level and cross-region feature data will not be reported repeatedly, but will be reported step by step or point by point through the upper role node or neighbor node closest to the node.
其中,在一实施方式中,区块链系统是分层式区块链系统,分层式区块链系统指由不同角色权限的区块链节点共同组成的区块链系统,节点的接入、剔除以及节点角色权限的转换由上层角色节点共同决定。在该系统中,各节点服务器可以分为上层角色节点和下层角色节点,上层角色节点可以对下层角色节点进行管理;上层角色节点可以有多个,一个上层角色节点可以管理多个下层角色节点,一个下层角色节点也可以被多个上层角色节点管理。在不同任务或不同时间段中不同角色权限节点之间可以进行角色的转换和/或权限调整。Among them, in one embodiment, the blockchain system is a hierarchical blockchain system. The hierarchical blockchain system refers to a blockchain system composed of blockchain nodes with different roles and permissions. The access of the nodes , The rejection and the conversion of node role permissions are jointly determined by the upper role nodes. In this system, each node server can be divided into upper role nodes and lower role nodes. The upper role nodes can manage the lower role nodes; there can be multiple upper role nodes, and one upper role node can manage multiple lower role nodes, A lower role node can also be managed by multiple upper role nodes. Role conversion and/or permission adjustment can be performed between different role permission nodes in different tasks or in different time periods.
其中,在上层角色节点中设置有特征表示单元,负责利用去中心化训练算法自动提取数据动态特征,并转换为高维状态特征表示。逐层计算训练节点特征表示、区域特征表示和系统特征表示。Among them, a feature representation unit is set in the upper role node, which is responsible for automatically extracting dynamic data features using a decentralized training algorithm and converting into high-dimensional state feature representation. The feature representations of training nodes, regional features and system features are calculated layer by layer.
具体地,对收集的特征数据进行处理,利用去中心化图算法对特征 数据进行训练,得到节点特征表示。去中心化(decentralization)是指在一个分布有众多节点的系统中,每个节点都具有高度自治的特征。节点之间彼此可以自由连接,形成新的连接单元。任何一个节点都可能成为阶段性的中心,但不具备强制性的中心控制功能。节点与节点之间的影响,会通过网络而形成非线性因果关系。即形成一种开放式、扁平化、平等性的系统现象或结构。在区块链技术上来实现去中心化(分布式),这便依赖于共识算法。共识算法解决的是对某个提案(Proposal),大家达成一致意见的过程,保障系统满足不同程度的一致性。Specifically, the collected feature data is processed, and the decentralized graph algorithm is used to train the feature data to obtain a node feature representation. Decentralization means that in a system with many nodes distributed, each node has a highly autonomous feature. The nodes can be freely connected to each other to form a new connection unit. Any node may become a phased center, but it does not have a mandatory central control function. The influence between nodes will form a non-linear causality through the network. That is, an open, flat, and equal system phenomenon or structure is formed. To achieve decentralization (distributed) on the blockchain technology, this depends on the consensus algorithm. The consensus algorithm solves the process of reaching a consensus on a proposal (Proposal) to ensure that the system meets different degrees of consistency.
获取节点特征表示后,再利用去中心化深度学习算法对收集的各节点及邻居节点的特征数据训练区域特征模型,得出区块链系统的区域特征表示。After obtaining the node feature representation, the decentralized deep learning algorithm is used to train the regional feature model on the collected feature data of each node and neighbor nodes to obtain the regional feature representation of the blockchain system.
进一步地,再利用收集的各节点及邻居节点的特征数据及区域特征表示对系统特征模型进行训练更新,得到当前系统状态的特征表示,即整体特征表示。Further, the collected feature data of each node and neighbor nodes and the regional feature representation are used to train and update the system feature model to obtain the feature representation of the current system state, that is, the overall feature representation.
得到状态特征表示后再利用这些状态特征表示训练调控策略模型得出调控策略。具体地,在上层角色节点中还设置有管理策略单元,用于基于强化学习算法利用节点特征表示和整体特征表示训练得出调控策略。After the state feature representation is obtained, these state feature representations are used to train the control strategy model to obtain the control strategy. Specifically, a management strategy unit is also provided in the upper role node, which is used to train a control strategy based on reinforcement learning algorithm using node feature representation and overall feature representation training.
其中,强化学习算法(RL算法)主要由智能体和环境组成,具体是一个智能体(Agent)采取行动(Action)从而改变自己的状态(State)获得奖励(Reward)与环境(Environment)发生交互的循环过程。其中环境是指智能体正在作用的对象(例如节点服务器),而智能体代表RL算法。环境首先向智能体发送一个状态,然后基于其知识来采取行动来响应该状态。之后,环境发送一对下一个状态并奖励给智能体。智能体将用环境所回报的奖励来更新其知识,以评估其最后的行动。其所得策略完全取决于当前状态(Only present matters)。强化学习算法包括Q-learning、sarsa、deep Q Network、policy Gradient、Actor Critic等等。Among them, the reinforcement learning algorithm (RL algorithm) is mainly composed of agents and the environment, specifically an agent (Agent) to take action (Action) to change their state (State) to get rewards (Reward) and environment (Environment) Cycle process. The environment refers to the object (such as a node server) where the agent is acting, and the agent represents the RL algorithm. The environment first sends a state to the agent, and then takes action based on its knowledge to respond to the state. After that, the environment sends a pair of next states and rewards the agent. The agent will update its knowledge with the rewards returned by the environment to evaluate its final actions. Its strategy depends entirely on the current status (Only present Matters). Reinforcement learning algorithms include Q-learning, sarsa, deep Q Network, policy Gradient, Actor Critic, etc.
在该实施方式中,利用当前系统状态和各节点当前状态特征表示,基于预设的目标函数对调控策略模型进行更新训练。其中,预设的目标 函数用于对当前节点及其任务状态进行评估衡量,当前系统状态包括区域系统状态及整体系统状态。In this embodiment, the regulation strategy model is updated and trained based on the preset target function using the current system state and the current state feature representation of each node. Among them, the preset objective function is used to evaluate and measure the current node and its task status. The current system status includes the regional system status and the overall system status.
例如,在处理任务1时,环境反馈信息下层角色节点A1目前及cpu使用量、当前任务量以及历史任务状态信息等;其对应的上层角色节点A在训练调控策略时,根据节点A1的当前状态、A1所在区域的区域系统状态及整体系统状态,得出针对任务1的区域环境的调控策略,如调控策略可以是剔除节点A1,同时增加节点A2和A3的任务分担量。或者上层角色节点A在训练调控策略时,根据节点A1的当前状态、A1所在区域的区域系统状态及整体系统状态,得出针对任务1的整体环境的调控策略,如调控策略可以是剔除节点A1,同时增加节点A2和B2的任务分担量。For example, when processing task 1, the current feedback information of the lower role node A1 and the CPU usage, current task volume, and historical task status information of the lower role node; the corresponding upper role node A, when training the regulation strategy, is based on the current state of node A1 1. The regional system status and the overall system status of the area where A1 is located, and the control strategy for the regional environment of task 1 is obtained. For example, the control strategy may be to eliminate node A1 while increasing the task sharing of nodes A2 and A3. Or when the upper role node A trains the regulation strategy, according to the current state of the node A1, the regional system state of the area where A1 is located and the overall system state, the regulation strategy for the overall environment of task 1 is obtained, for example, the regulation strategy may be to eliminate the node A1 , While increasing the task sharing of nodes A2 and B2.
其中,在一实施方式中,如果一个下层角色节点对应多个上层角色节点,在得出调控策略后会对管控模型产生的节点调控策略结果进行基于共识的投票,并记录交易。如,在任务1的系统中,还存在上层角色节点B,下层角色节点A1、B2在上报节点A的同时也会上报节点B;那么节点B也会训练得出一个针对节点A1、B2的调控策略,如调控策略可以是剔除节点A1,降低节点B2的任务分担量,同时提高节点B1的任务分担量。此时,对于节点B2,会得到两个不同的调控策略,那么则需要对调控策略进行基于共识的投票,确认最终的调控策略。In one embodiment, if a lower-level role node corresponds to multiple upper-level role nodes, after the control strategy is obtained, a consensus-based vote will be made on the node control strategy result generated by the control model and the transaction is recorded. For example, in the system of task 1, there is also an upper role node B. Lower role nodes A1 and B2 will also report to node B when they report node A; then node B will also be trained to obtain a regulation for nodes A1 and B2 Strategies, such as regulation and control strategies, can eliminate node A1, reduce the task sharing of node B2, and increase the task sharing of node B1. At this time, for node B2, you will get two different control strategies, then you need to vote on the control strategy based on consensus to confirm the final control strategy.
然后,按照确认后的调控策略对节点进行调控。调整后节点改变自己的状态。调整后上层角色节点通过整个系统的吞吐量,出块速度等评估是否得到策略奖励,进而使上层节点用环境所回报的奖励来更新其知识,以评估其后续的行动,以此循环。通过这种方式能够训练出最优的调控策略。Then, the nodes are regulated according to the confirmed regulation strategy. After adjustment, the node changes its state. After the adjustment, the upper role node evaluates whether it receives a strategy reward through the throughput of the entire system, the speed of block generation, etc., and then allows the upper node to update its knowledge with the rewards returned by the environment to evaluate its subsequent actions in this cycle. In this way, the optimal control strategy can be trained.
基于上述方法,本申请还提供一种区块链系统的节点管理装置,请参阅图3,图3是本申请区块链系统的节点管理装置第一实施方式的结构示意图。在该实施方式中,区块链系统的节点管理装置30包括处理器301,处理器301用于获取各节点的特征数据;利用特征数据获取各节点的节点特征表示;利用节点特征表示获取区块链系统的整体特征表 示;基于强化学习算法利用节点特征表示和整体特征表示训练得出调控策略,调控策略为多个;对多个调控策略进行基于共识的投票,确定该节点的调控策略,并按照调控策略对节点进行管理。Based on the above method, the present application also provides a node management device of the blockchain system. Please refer to FIG. 3, which is a schematic structural diagram of a first embodiment of the node management device of the blockchain system of the present application. In this embodiment, the node management device 30 of the blockchain system includes a processor 301, which is used to obtain the characteristic data of each node; use the characteristic data to obtain the node characteristic representation of each node; use the node characteristic representation to obtain the block The overall feature representation of the chain system; based on the reinforcement learning algorithm, the node feature representation and the overall feature representation are used to train the control strategy, and the control strategy is multiple; a consensus-based voting is performed on multiple control strategies to determine the control strategy of the node, and Manage the nodes according to the control strategy.
其中,在一实施方式中,处理器301具体用于基于强化学习算法和预设的目标函数,利用节点特征表示和整体特征表示训练得出区域和/或整体环境的调控策略。Wherein, in an embodiment, the processor 301 is specifically configured to obtain a regional and/or overall environment regulation strategy based on the reinforcement learning algorithm and the preset objective function, using node feature representation and overall feature representation training.
其中,在一实施方式中,处理器301具体用于利用去中心化图算法对特征数据进行训练,得到节点特征表示。Among them, in an embodiment, the processor 301 is specifically configured to train the feature data using a decentralized graph algorithm to obtain a node feature representation.
其中,在一实施方式中,处理器301具体用于利用去中心化深度学习算法对收集的各节点及邻居节点的特征数据训练区域特征模型,得出区块链系统的区域特征表示。In one embodiment, the processor 301 is specifically used to train a regional feature model on the collected feature data of each node and neighbor nodes using a decentralized deep learning algorithm to obtain a regional feature representation of the blockchain system.
其中,在一实施方式中,处理器301具体用于利用收集的各节点及邻居节点的特征数据及区域特征表示,训练系统模型,得出整体特征表示。In one embodiment, the processor 301 is specifically configured to use the collected feature data and regional feature representations of each node and neighbor nodes to train a system model to obtain an overall feature representation.
以上,该区块链系统的节点管理装置30可用于执行上述区块链系统的节点管理方法,对区块链系统的节点进行管理,且具有相应的有益效果,具体过程请参阅上述实施方式的描述,在此不再赘述。其中该装置可以是独立于服务器的独立装置,也可以是服务器中的某一模块,或某一处理单元。As mentioned above, the node management device 30 of the blockchain system can be used to execute the node management method of the above-mentioned blockchain system to manage the nodes of the blockchain system, and has corresponding beneficial effects. For the specific process, please refer to the The description will not be repeated here. The device may be an independent device independent of the server, or a certain module in the server, or a certain processing unit.
请参阅图4,图4是本申请区块链系统的节点管理装置第二实施方式的结构示意图。在该实施方式中,区块链系统的节点管理装置40为服务器中的某一模块,具体包括数据收集模块401、第一特征表示模块402、第二特征表示模块403、管控策略模块404和投票模块405。Please refer to FIG. 4, which is a schematic structural diagram of a second embodiment of a node management device of a blockchain system of the present application. In this embodiment, the node management device 40 of the blockchain system is a certain module in the server, which specifically includes a data collection module 401, a first feature representation module 402, a second feature representation module 403, a management and control strategy module 404, and a vote Module 405.
数据收集模块401用于获取各节点的特征数据。The data collection module 401 is used to obtain characteristic data of each node.
第一特征表示模块402用于利用特征数据获取各节点的节点特征表示。The first feature representation module 402 is used to obtain the node feature representation of each node using the feature data.
第二特征表示模块403用于利用节点特征表示获取区块链系统的整体特征表示。The second feature representation module 403 is used to obtain the overall feature representation of the blockchain system using the node feature representation.
管控策略模块404用于基于强化学习算法利用节点特征表示和整体 特征表示训练得出调控策略,调控策略为多个。The management and control strategy module 404 is used to train a control strategy based on a reinforcement learning algorithm using node feature representation and overall feature representation training, and there are multiple control strategies.
投票模块405,用于对多个调控策略进行基于共识的投票,确定该节点的调控策略,并按照调控策略对节点进行管理。The voting module 405 is used to conduct consensus-based voting on multiple regulation strategies, determine the regulation strategy of the node, and manage the node according to the regulation strategy.
该区块链系统的节点管理装置40可用于执行上述区块链系统的节点管理方法,对区块链系统的节点进行管理,且具有相应的有益效果,具体过程请参阅上述实施方式的描述,在此不再赘述。The node management device 40 of the blockchain system can be used to execute the node management method of the blockchain system described above, manage the nodes of the blockchain system, and have corresponding beneficial effects. For the specific process, please refer to the description of the above embodiment, I will not repeat them here.
本申请还提供一种具有存储功能的装置,请参阅图5,图5是本申请具有存储功能的装置第一实施方式的结构示意图。在该实施方式中,存储装置50存储有程序501,程序501为一个或多个,程序501被执行时实现上述区块链系统的节点管理方法。具体工作过程与上述方法实施例中一致,故在此不再赘述,详细请参阅以上对应方法步骤的说明。其中具有存储功能的装置可以是便携式存储介质如U盘、光盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟等各种可以存储程序代码的介质,也可以是终端、服务器等。The present application also provides a device with a storage function, please refer to FIG. 5, which is a schematic structural diagram of a first embodiment of a device with a storage function in the present application. In this embodiment, the storage device 50 stores a program 501, and the program 501 is one or more. When the program 501 is executed, the node management method of the blockchain system described above is implemented. The specific working process is the same as in the above method embodiment, so it will not be repeated here. For details, please refer to the description of the corresponding method steps above. Among them, devices with storage function can be portable storage media such as U disk, optical disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk, etc. The medium storing the program code may also be a terminal, server, or the like.
以上方案,本申请提供一种区块链系统的节点管理方法,通过利用去中心化的机器学习算法进行特征训练,利用强化学习算法进行调控策略训练,使得无需另设中心管控节点来对区块链系统中节点的接入、剔除和权限的变更进行自动管控。The above solution, this application provides a node management method of the blockchain system. By using decentralized machine learning algorithms for feature training and reinforcement learning algorithms for regulation strategy training, there is no need to set up another central management node to control the block. The access, removal and change of authority of the nodes in the chain system are automatically managed and controlled.
在本申请所提供的几个实施方式中,应该理解到,所揭露的系统,装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施方式仅仅是示意性的,例如,所述模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed system, device, and method may be implemented in other ways. For example, the device implementation described above is only schematic. For example, the division of the module or unit is only a division of logical functions. In actual implementation, there may be other divisions, for example, multiple units or components may be The combination can either be integrated into another system, or some features can be ignored, or not implemented. In addition, the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于 一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施方式方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
另外,在本申请各个实施方式中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit. The above integrated unit may be implemented in the form of hardware or software functional unit.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)或处理器(processor)执行本申请各个实施方式所述方法的全部或部分步骤。If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application essentially or part of the contribution to the existing technology or all or part of the technical solution can be embodied in the form of a software product, the computer software product is stored in a storage medium It includes several instructions to enable a computer device (which may be a personal computer, server, or network device, etc.) or processor to execute all or part of the steps of the methods described in the embodiments of the present application.
以上所述仅为本申请的实施方式,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利保护范围内。The above are only the embodiments of the present application, and therefore do not limit the patent scope of the present application. Any equivalent structure or equivalent process transformation made by the description and drawings of this application, or directly or indirectly used in other related technologies In the field, the same reason is included in the scope of patent protection of this application.

Claims (16)

  1. 一种区块链系统的节点管理方法,其中,所述方法包括:A node management method of a blockchain system, wherein the method includes:
    获取各节点的特征数据;Obtain the characteristic data of each node;
    利用所述特征数据获取各节点的节点特征表示;Use the feature data to obtain a node feature representation of each node;
    利用所述节点特征表示获取区块链系统的整体特征表示;Use the node feature representation to obtain the overall feature representation of the blockchain system;
    基于强化学习算法利用所述节点特征表示和所述整体特征表示训练得出调控策略,所述调控策略为多个;Based on the reinforcement learning algorithm, using the node feature representation and the overall feature representation to train to obtain a control strategy, the control strategy is multiple;
    对所述多个调控策略进行基于共识的投票,确定该节点的调控策略,并按照所述调控策略对节点进行管理。Conduct consensus-based voting on the plurality of control strategies, determine the control strategy of the node, and manage the node according to the control strategy.
  2. 根据权利要求1所述的区块链系统的节点管理方法,其中,所述基于强化学习算法利用节点特征表示和整体特征表示训练得出调控策略包括:The node management method of the blockchain system according to claim 1, wherein the reinforcement learning algorithm based on the node feature representation and the overall feature representation training to obtain the regulation strategy includes:
    基于强化学习算法和预设的目标函数,利用所述节点特征表示和整体特征表示训练得出区域和/或整体环境的调控策略。Based on the reinforcement learning algorithm and the preset objective function, the control strategy of the region and/or the overall environment is obtained by training using the node feature representation and the overall feature representation.
  3. 根据权利要求1所述的区块链系统的节点管理方法,其中,所述按照调控策略对节点进行管理包括:The node management method of the blockchain system according to claim 1, wherein the management of the nodes according to the regulation strategy includes:
    按照所述调控策略控制节点的接入、剔除或不同角色节点间的角色权限调整。According to the regulation strategy, control the node access, removal, or role role adjustment between nodes with different roles.
  4. 根据权利要求1所述的区块链系统的节点管理方法,其中,所述获取各节点的节点特征表示包括:The node management method of the blockchain system according to claim 1, wherein the acquiring the node characteristic representation of each node includes:
    利用去中心化图算法对所述特征数据进行训练,得到所述节点特征表示。Use the decentralized graph algorithm to train the feature data to obtain the node feature representation.
  5. 根据权利要求4所述的区块链系统的节点管理方法,其中,所述获取各节点的节点特征表示之后还包括:The node management method of the blockchain system according to claim 4, wherein after acquiring the node characteristic representation of each node further comprises:
    利用去中心化深度学习算法对收集的各节点及邻居节点的特征数据训练区域特征模型,得出区块链系统的区域特征表示。Using the decentralized deep learning algorithm, the regional feature model is trained on the collected feature data of each node and neighbor nodes, and the regional feature representation of the blockchain system is obtained.
  6. 根据权利要求5所述的区块链系统的节点管理方法,其中,所述获取区块链系统的整体特征表示包括:The node management method of the blockchain system according to claim 5, wherein the acquiring the overall characteristic representation of the blockchain system includes:
    利用收集的各节点及邻居节点的特征数据及所述区域特征表示,训练系统模型,得出所述整体特征表示。Using the collected feature data of each node and neighbor nodes and the regional feature representation, a system model is trained to obtain the overall feature representation.
  7. 根据权利要求1所述的区块链系统的节点管理方法,其中,所述获取各节点的特征数据包括:The node management method of the blockchain system according to claim 1, wherein the acquiring characteristic data of each node includes:
    收集本节点的特征数据,和/或收集上报至本节点的邻居节点的特征数据,和/或收集上报至本节点的下层角色节点的特征数据。Collect characteristic data of this node, and/or collect characteristic data of neighbor nodes reported to this node, and/or collect characteristic data of lower role nodes reported to this node.
  8. 根据权利要求1所述的区块链系统的节点管理方法,其中,所述获取各节点的特征数据还包括:The node management method of the blockchain system according to claim 1, wherein the acquiring characteristic data of each node further comprises:
    将收集到的本节点及上报至本节点的特征数据上报至上层角色节点和/或邻居节点。Report the collected local node and the characteristic data reported to the local node to the upper role node and/or neighbor node.
  9. 根据权利要求1所述的区块链系统的节点管理方法,其中,所述特征数据为物理硬件数据、网络数据、运行状态数据、日志数据或节点间任务分配数据中的一种或多种。The node management method of a blockchain system according to claim 1, wherein the characteristic data is one or more of physical hardware data, network data, operating status data, log data, or task assignment data between nodes.
  10. 一种区块链系统的节点管理装置,其中,所述装置包括处理器,所述处理器用于:A node management device of a blockchain system, wherein the device includes a processor, and the processor is used for:
    获取各节点的特征数据;Obtain the characteristic data of each node;
    利用所述特征数据获取各节点的节点特征表示;Use the feature data to obtain a node feature representation of each node;
    利用所述节点特征表示获取区块链系统的整体特征表示;Use the node feature representation to obtain the overall feature representation of the blockchain system;
    基于强化学习算法利用所述节点特征表示和所述整体特征表示训练得出调控策略,所述调控策略为多个;Based on the reinforcement learning algorithm, using the node feature representation and the overall feature representation to train to obtain a control strategy, the control strategy is multiple;
    对所述多个调控策略进行基于共识的投票,确定该节点的调控策略,并按照所述调控策略对节点进行管理。Conduct consensus-based voting on the plurality of control strategies, determine the control strategy of the node, and manage the node according to the control strategy.
  11. 根据权利要求10所述的区块链系统的节点管理装置,其中,所述处理器具体用于基于强化学习算法和预设的目标函数,利用所述节点特征表示和整体特征表示训练得出区域和/或整体环境的调控策略。The node management device of the blockchain system according to claim 10, wherein the processor is specifically configured to use the node feature representation and the overall feature representation to derive the training area based on the reinforcement learning algorithm and the preset objective function And/or overall environmental control strategies.
  12. 根据权利要求10所述的区块链系统的节点管理装置,其中,所述处理器具体用于利用去中心化图算法对所述特征数据进行训练,得到所述节点特征表示。The node management device of the blockchain system according to claim 10, wherein the processor is specifically configured to train the feature data using a decentralized graph algorithm to obtain the node feature representation.
  13. 根据权利要求12所述的区块链系统的节点管理装置,其中,所 述处理器具体用于利用去中心化深度学习算法对收集的各节点及邻居节点的特征数据训练区域特征模型,得出区块链系统的区域特征表示。The node management device of the blockchain system according to claim 12, wherein the processor is specifically used to train a regional feature model on the collected feature data of each node and neighbor nodes using a decentralized deep learning algorithm to obtain The regional characteristics of the blockchain system.
  14. 根据权利要求13所述的区块链系统的节点管理装置,其中,所述处理器具体用于利用收集的各节点及邻居节点的特征数据及所述区域特征表示,训练系统模型,得出所述整体特征表示。The node management device of the blockchain system according to claim 13, wherein the processor is specifically used to train the system model using the collected feature data of each node and neighbor nodes and the regional feature representation to obtain Describe the overall characteristics.
  15. 一种区块链系统的节点管理装置,其中,所述装置包括:A node management device of a blockchain system, wherein the device includes:
    数据收集模块,用于获取各节点的特征数据;The data collection module is used to obtain the characteristic data of each node;
    第一特征表示模块,用于利用所述特征数据获取各节点的节点特征表示;A first feature representation module, used to obtain the node feature representation of each node using the feature data;
    第二特征表示模块,用于利用所述节点特征表示获取区块链系统的整体特征表示;The second feature representation module is used to obtain the overall feature representation of the blockchain system using the node feature representation;
    管控策略模块,用于基于强化学习算法利用所述节点特征表示和所述整体特征表示训练得出调控策略,所述调控策略为多个;The management and control strategy module is used to train and obtain a regulation strategy based on the reinforcement learning algorithm by using the node feature representation and the overall feature representation training, and there are multiple regulation strategies;
    投票模块,用于对所述多个调控策略进行基于共识的投票,确定该节点的调控策略,并按照所述调控策略对节点进行管理。The voting module is used to conduct consensus-based voting on the multiple control strategies, determine the control strategy of the node, and manage the node according to the control strategy.
  16. 一种具有存储功能的装置,其中,所述装置存储有程序,所述程序被执行时实现权利要求1至9任一项所述的区块链系统的节点管理方法。A device having a storage function, wherein the device stores a program, and when the program is executed, the node management method of the blockchain system according to any one of claims 1 to 9 is realized.
PCT/CN2018/118290 2018-11-29 2018-11-29 Node management method and apparatus for blockchain system, and storage device WO2020107350A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2018/118290 WO2020107350A1 (en) 2018-11-29 2018-11-29 Node management method and apparatus for blockchain system, and storage device
CN201880002421.4A CN109964452B (en) 2018-11-29 2018-11-29 Node management method and device for block chain system and storage device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/118290 WO2020107350A1 (en) 2018-11-29 2018-11-29 Node management method and apparatus for blockchain system, and storage device

Publications (1)

Publication Number Publication Date
WO2020107350A1 true WO2020107350A1 (en) 2020-06-04

Family

ID=67023435

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/118290 WO2020107350A1 (en) 2018-11-29 2018-11-29 Node management method and apparatus for blockchain system, and storage device

Country Status (2)

Country Link
CN (1) CN109964452B (en)
WO (1) WO2020107350A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113570039A (en) * 2021-07-22 2021-10-29 同济大学 Optimized consensus block chain system based on reinforcement learning

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111209336B (en) * 2019-12-30 2020-09-15 广州博士信息技术研究院有限公司 Data distribution method and device based on block chain and server

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180129953A1 (en) * 2016-11-09 2018-05-10 Cognitive Scale, Inc. System for Performing Compliance Operations Using Cognitive Blockchains
CN108712423A (en) * 2018-05-18 2018-10-26 北京三六五八网络科技有限公司 Right management method and device
CN108717460A (en) * 2018-05-25 2018-10-30 济南浪潮高新科技投资发展有限公司 A kind of method and device reached common understanding in block chain
CN108898219A (en) * 2018-06-07 2018-11-27 广东工业大学 A kind of neural network training method based on block chain, device and medium

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108491266B (en) * 2018-03-09 2021-11-16 联想(北京)有限公司 Data processing method and device based on block chain and electronic equipment

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180129953A1 (en) * 2016-11-09 2018-05-10 Cognitive Scale, Inc. System for Performing Compliance Operations Using Cognitive Blockchains
CN108712423A (en) * 2018-05-18 2018-10-26 北京三六五八网络科技有限公司 Right management method and device
CN108717460A (en) * 2018-05-25 2018-10-30 济南浪潮高新科技投资发展有限公司 A kind of method and device reached common understanding in block chain
CN108898219A (en) * 2018-06-07 2018-11-27 广东工业大学 A kind of neural network training method based on block chain, device and medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113570039A (en) * 2021-07-22 2021-10-29 同济大学 Optimized consensus block chain system based on reinforcement learning
CN113570039B (en) * 2021-07-22 2024-02-06 同济大学 Block chain system based on reinforcement learning optimization consensus

Also Published As

Publication number Publication date
CN109964452A (en) 2019-07-02
CN109964452B (en) 2021-08-10

Similar Documents

Publication Publication Date Title
US10481629B2 (en) Cognitive platform and method for energy management for enterprises
US10520905B2 (en) Using forecasting to control target systems
US20230039182A1 (en) Method, apparatus, computer device, storage medium, and program product for processing data
US10193772B1 (en) User behavior analyzer
Li et al. A new methodology to support group decision-making for IoT-based emergency response systems
Gorokhovatskiy et al. Employment of Intelligent Technologies in Multiparametric Control Systems
Lucarelli et al. A deep reinforcement learning approach for automated cryptocurrency trading
US20190179647A1 (en) Auto throttling of input data and data execution using machine learning and artificial intelligence
CN113486584B (en) Method and device for predicting equipment failure, computer equipment and computer readable storage medium
CN108320045A (en) Student performance prediction technique and device
CN111556173B (en) Service chain mapping method based on reinforcement learning
WO2020107350A1 (en) Node management method and apparatus for blockchain system, and storage device
Vu et al. Distributed adaptive model rules for mining big data streams
WO2023093235A1 (en) Communication network architecture generation method and apparatus, electronic device, and medium
CN112884163A (en) Combined service evaluation method and system based on federated machine learning algorithm and cloud feedback
CN115311205A (en) Industrial equipment fault detection method based on pattern neural network federal learning
Verma et al. Resource demand prediction in multi-tenant service clouds
Glavan et al. Cognitive edge computing through artificial intelligence
WO2016203757A1 (en) Control device, information processing device in which same is used, control method, and computer-readable memory medium in which computer program is stored
CN113543160A (en) 5G slice resource allocation method and device, computing equipment and computer storage medium
Bondy et al. A functional reference architecture for aggregators
Martinez-Julia et al. Explained intelligent management decisions in virtual networks and network slices
US20210295215A1 (en) Technologies for decentralized fleet analytics
CN116975102A (en) Sensitive data monitoring method, system, electronic equipment and storage medium
EP3851921A1 (en) Distributed computer control system and method of operation thereof via collective learning

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18941173

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18941173

Country of ref document: EP

Kind code of ref document: A1